NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F053214

Metagenome / Metatranscriptome Family F053214

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F053214
Family Type Metagenome / Metatranscriptome
Number of Sequences 141
Average Sequence Length 48 residues
Representative Sequence MNDLISRLLAHERIRAARHHIERTDEGTLARQAALSAIPAPTGAEGRRAA
Number of Associated Samples 118
Number of Associated Scaffolds 141

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 110
AlphaFold2 3D model prediction Yes
3D model pTM-score0.54

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(26.950 % of family members)
Environment Ontology (ENVO) Unclassified
(43.262 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.645 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 50.00%    β-sheet: 0.00%    Coil/Unstructured: 50.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.54
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 141 Family Scaffolds
PF00682HMGL-like 70.21
PF05960DUF885 7.80
PF07690MFS_1 1.42
PF13561adh_short_C2 0.71
PF07676PD40 0.71
PF04389Peptidase_M28 0.71

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 141 Family Scaffolds
COG4805Uncharacterized conserved protein, DUF885 familyFunction unknown [S] 7.80


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil26.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil21.28%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil12.77%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.51%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.26%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil3.55%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.55%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.84%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand2.13%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.13%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.42%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.42%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.42%
Wetland SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment0.71%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands0.71%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.71%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.71%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.71%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.71%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland0.71%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.71%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.71%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.71%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.71%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004780Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T0Bare1FreshEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009812Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60EnvironmentalOpen in IMG/M
3300009822Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011438Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2EnvironmentalOpen in IMG/M
3300011441Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT513_2EnvironmentalOpen in IMG/M
3300012035Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2EnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300014883Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT760_16_10DEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018064Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300022195Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025322Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026924Grasslands soil microbial communities from Chapel Hill, North Carolina, USA that are Nitrogen fertilized -Nitrogen NN332 (SPAdes)EnvironmentalOpen in IMG/M
3300027573Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 (SPAdes)EnvironmentalOpen in IMG/M
3300027821Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027947Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300033803Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_0_10EnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25384J37096_1010227013300002561Grasslands SoilMHEVISGLLGHDRVKAARQHIERTDDVTLARQATLSAIPAPTGAEAQRAAR
JGI25382J43887_1011902523300002908Grasslands SoilVDHAVSRLLAHDRIRAARHHIERTDEVTLARQAALSAIPAPTG
JGI25382J43887_1033333923300002908Grasslands SoilVIAALLANERIRAARAHIERSDEVTLVRQAALSAIPAPTGAEA
Ga0055437_1006477013300004009Natural And Restored WetlandsMNDLIAKLSAHERIRAARHHIERTDEVTLARQAALSAIPAPTGAEAQRATRTAELFREVGLG
Ga0062378_1021715213300004780Wetland SedimentMPPPESIDALVAHERVRAARQYIERSDEVTLARQAALSAIPAPTGAERARGARLA
Ga0066683_1076015823300005172SoilMHEVISGLLGHDRVKAARQHIERTDDVTLARQAALSAIPAPTGAEA
Ga0066676_1030593513300005186SoilMHTMPGPLDGLLSHDRVRAARAHIERSDEATLARQATLSAIPAP
Ga0066689_1002581433300005447SoilVDQTISRLLGHERIRAARHHVERTDEVTLARQAALCAIPAPTGAEGHRAARVAELFRAV
Ga0070699_10079851313300005518Corn, Switchgrass And Miscanthus RhizosphereMPDAIDTLLANERVRAARIHIERFDEVTLARQATLSAIPAPTGAEAARG
Ga0070735_1045679613300005534Surface SoilMHDLIAKLAAHERVRAARAHIERSDEVTLARQAAISAIPAPTGAEFERGSYTEALFREI
Ga0070695_10022153823300005545Corn, Switchgrass And Miscanthus RhizosphereMNDLIAKLLSHDRVRAARQHIERTDEITLARQAAISAIPAPTGAEAQRATRAAELFREV
Ga0070695_10058763113300005545Corn, Switchgrass And Miscanthus RhizosphereVIDGLLAHDRVRAARVHIERSDEATLARQASLSAIPAPTGEERLRGARVAQMFSDA
Ga0066661_1001843413300005554SoilVEHTLSRLLAHERIRAARRHVERADDVTLARQAALSATPAPTGAEGARGARVAELF
Ga0066704_1032429813300005557SoilVNLDYLLSHPRIQAARAHLERTDEVTLARQVELSGLAAPTGAETRRGARVAELFREIGLRDG
Ga0066704_1062892613300005557SoilMHDLMARLLAHERVRAARHHCERTDEVTLARQAALSAIPAPTGAEAQRA
Ga0066670_1079181213300005560SoilMNDLIARLLAHERIRAARRHVERADDLTLARQAALSATPAPTG
Ga0066670_1099355423300005560SoilVVDQTISRLLAHDRVRAARSHLERTDEVTLARQAALCAIPAP
Ga0066703_1049947413300005568SoilMHDLMARLLAHERVRAARHHCERTDEVTLARQAALSAIPAPTG
Ga0066705_1091073523300005569SoilMNDLIARLLAHDRIRAARRHVERADDVTLARQAELSATPAPTGAEGAR
Ga0066702_1016044823300005575SoilVEHTLSRLLAHERIRAARRHVERTDDVTLARQAALSATPAPTGAEGARGAR
Ga0066691_1024815313300005586SoilVIDSLLAHDRIRAARAHIERSDEVTLARQAELSAIPAPTGSEAARAAKVVELFAGVGL
Ga0066654_1030435913300005587SoilVVDQTISRLLAHDRVRAARSHLERTDEVTLARQAALCAIPAPTGAEGHR
Ga0066706_1139041523300005598SoilMAMHEVISGLLGHDRVKAARQHIERTDDVTLARQATLSAIPAPTGAEAQRAARVA
Ga0079222_1001917813300006755Agricultural SoilMDHAVSRLLAHERIRAARHHIERTDEVTLARQTALSAIPAPTGAEDRR
Ga0066665_1045427113300006796SoilMAMHEVISGLLGHDRVKAARQHIERTDDVTLARQATLSAIPAPTGAEAQRAARVAE
Ga0066665_1145979713300006796SoilVIDSLLAHDRIRAARAHIERSDEVTLARQAELSAIPAPTGSEAARAAKVVELFAGVGLHD
Ga0075433_1122594923300006852Populus RhizosphereMPEPIDSVFAHERVRAARHHIERSDESTLARQAALSAIPAPTGAEGARGAH
Ga0075420_10096521423300006853Populus RhizosphereMPESIDAVVAHERVRAARYHIERSDENTLARQATLSAIPAPTG
Ga0075426_1035638713300006903Populus RhizosphereVIDAVLLNERVHTARAYIERSDEATLGRQAALSAIPAPTGAEAARGARVAEM
Ga0079219_1003116833300006954Agricultural SoilMEQTLARLLAHERVRAARHHIERTDEVTVGRQIALSAIPS
Ga0099793_1051781113300007258Vadose Zone SoilMIDAVLLSERVRTARAHIERSDEATLGRQAALSAIPAPTGAEGARGARVAEM
Ga0066710_10094206523300009012Grasslands SoilMHEVISGLLGHDRVKAARQHIERTDEVTLTRQAALSAIP
Ga0066710_10124865313300009012Grasslands SoilMNELIARLLGHERVRAARHHIERTDEVTLARQAALS
Ga0066710_10224880113300009012Grasslands SoilMNDLISRLLAHERIRAARHHIERTDEVTLARQAALSAIPAP
Ga0066710_10269883913300009012Grasslands SoilMEGLLSHDRLRAARAHIERSDEATLGRQAALSAIPA
Ga0099827_1133494913300009090Vadose Zone SoilVEQTISRLLAHERLRAARHHVERTDELTLGRQAELCAIAAPTGAEARRATRVAELF
Ga0066709_10263098913300009137Grasslands SoilMAYGPHRYAMHEVISGLLGHDRVKAARQHIERTDDVTLARQATLSAIPAPTGAEAQRAARVAE
Ga0105067_106479023300009812Groundwater SandMDARISELFSHPLVRSARAHLESSDDLTLSRQAELSAIPAPTGAEGARATRLAAMLRDTG
Ga0105066_116932823300009822Groundwater SandMPDPIDMLLDEPRVRAARAHIERSDELTLARQAALSAIPAPTGAEAARGARVAELFR
Ga0099796_1022361413300010159Vadose Zone SoilMLESMTALLSHDRVRAARAHIERSDEATLGRQAALSAIPAPTGAEGARGARVADMLSE
Ga0134070_1044270813300010301Grasslands SoilMDALLSHDRVRTARAHIERSDEATLARQAALSAIPAPTGAERARGTRVAEMLARRG*
Ga0134082_1020594123300010303Grasslands SoilLALMNDLISRLLAHDRIRAARHHIERTDELTLARQASLSAIPAPTGAEGQRAARVAELFREI
Ga0134111_1050965523300010329Grasslands SoilMLPSIHALIAHDRIRAARAHIERADETTLARQAALSAIPAPTGAET
Ga0134080_1021195623300010333Grasslands SoilVIDAVLRNERVRTARAHIERSDEATLGRQAALSAIPAPTGAEAARGARVAEMLVAI
Ga0134080_1037277613300010333Grasslands SoilVDHALSRLLAHERIRAARHHIERTDEGTLARQAALSAIPAPTGA
Ga0134063_1016615723300010335Grasslands SoilVDQTISRLLGHERIRAARHHVERTDEVTLARQAALCAIPAPTGAEGHRAARVAELFR
Ga0134071_1015346323300010336Grasslands SoilMHEVISGLLGHDRVKAARQHIERTDDVTLARQAALSAIPAPTGAEAQRAARV
Ga0134128_1117574923300010373Terrestrial SoilVDHAVSRLLAHERIRAARHYIERTDEVTLARQAAL
Ga0134128_1249797413300010373Terrestrial SoilMPSDTIDALVAHERVRAARHHIERSDESTLARQAALSAIPAPTGSEGARGAHVAG
Ga0134126_1038011613300010396Terrestrial SoilMDHAVSRLLAHARIRAARHHIERTDEVTLARQAALSTIPAPTG
Ga0134121_1036073023300010401Terrestrial SoilVDHAVSRLLAHERIRAARHYIERTDEVTLARQAALSTIPAPT
Ga0137392_1026622113300011269Vadose Zone SoilMDHAVSRLLAHERIRAARHHIERTDEGTLARQAALSA
Ga0137393_1065842823300011271Vadose Zone SoilMAMHELISGLLGHDRVKAARQHIERTDEVTLARQAALSAIPAPTGAEAQRAAWVA
Ga0137451_122898713300011438SoilMPESIDRVVAHEQVRAARHHIEHSDERTLARQAALSAIPAPTGAEGA
Ga0137452_118243413300011441SoilMNDLIAKLAAHERIRAARHHIERTDEVTLARQAALSAIPAPTG
Ga0137445_105880013300012035SoilMNDLIAKLAAHERIRAARHHIERTDEVTLARQAALSAIPAPTGAEAQRAARTAELFR
Ga0137388_1063709213300012189Vadose Zone SoilMTESITALLAHDRVRTARAHIERSDEATLGRQAALSAI
Ga0137363_1044104723300012202Vadose Zone SoilVKKCRSFVTMPLPESIDAVVAHDRVRAACHHIERSDESTLVRQVALSAIPAPTGAEGA
Ga0137399_1127805113300012203Vadose Zone SoilMLGPLDAVLSHDRVRAARSHIERSDEATLARQAALSAIPA
Ga0137380_1040354813300012206Vadose Zone SoilLVDQTISKLLAHERIRAARHHLERTEEVTLARQAALSAVP
Ga0137381_1053930423300012207Vadose Zone SoilMNDVIAKLLGHERVRAARHHIERTDQATLARQAALSAIPAPTGAEAQRATH
Ga0137376_1034337913300012208Vadose Zone SoilMDALLSHDRVRTARAHIEHSDEATLARQAALSAIPAPT
Ga0137376_1065585413300012208Vadose Zone SoilVIDAILLNERVRTARAHIERSDEATLGRQAALSAIPAPTGAEGARGARVAEMFGGIGL
Ga0137379_1044068623300012209Vadose Zone SoilMNDLIAELLAHDRVRAARHHIERSDEVTLARQAALSAVPAPTGAEGQRAARVAE
Ga0137370_1052675313300012285Vadose Zone SoilVIDTVLQNERVRAAGAFIERSDEATLGRQAALSAIPAPTGAEGARGARVSAML
Ga0137367_1013773713300012353Vadose Zone SoilMEQTISRLLAHERIRAARHHLERTEEVTLARQAALSAIPAPTGAEGRR
Ga0137369_1029756313300012355Vadose Zone SoilMNDLIARLLAHDRIRAARHHIERTDAATLARQAALSAIPAPTG
Ga0137384_1048780823300012357Vadose Zone SoilLLAHERIRAARHHVERTDEVTLARQAALSAIPAPTGAEGRRA
Ga0137375_1129838113300012360Vadose Zone SoilMNDLIARLLAHDRIRAARHHIERTDAATLARQAALSAIPAPTGAE
Ga0137360_1018338333300012361Vadose Zone SoilMPGTLDALFSHDRVRAARAHIERSDEVTLARQAALSAIP
Ga0137390_1135815023300012363Vadose Zone SoilMPDDIDTLLANERVRAARTHIERFDEVTLTRQATLSAIPAPTGAEA
Ga0137373_1009032433300012532Vadose Zone SoilMNDLIARLLAHDRIRAARHHIERTDAATLARQAALSAI
Ga0137398_1025685123300012683Vadose Zone SoilVIDALLAHARVRAAREHVERSDEATLSRQAALSAIPAPTGAERARGG
Ga0137398_1027662113300012683Vadose Zone SoilMDPLFSHPRIAAARAHLERTDQVTLERQAVLSAVPAPTGAEGLRATRVAELFREVGLRD
Ga0137398_1045982113300012683Vadose Zone SoilVIDAVLLNERVRTARAHIERSDEVTLGRQAALSAIPAPTGAEAAR
Ga0137397_1028662423300012685Vadose Zone SoilMLPSIHALVAHDRIRAARAHIERADETTLARQAALSAIPAPTGAGGARGRRIAHMFRAA*
Ga0137397_1062014613300012685Vadose Zone SoilMPSPDSIAVLVAHDRVRAARHHIERSDEHTLARQAALSAIPAPTGAEGARGA
Ga0137396_1004077033300012918Vadose Zone SoilVIDAVLLNERVRTARAHIERSDEATLGRQAALSAIPAPTGAEGARGARVAEMLGGIGLQD
Ga0137416_1150981023300012927Vadose Zone SoilMPSSESIGALVAHERVRAARLYIERSDESTLARQAALSAIPAPNVAGS*
Ga0137404_1211687913300012929Vadose Zone SoilMDGLLSHDRVRAARAHIERSDEATLARQAALSAIPAP
Ga0137410_1010214113300012944Vadose Zone SoilMIDAVLLNERVRTARAHIERSDEATLGRQAALSAIPAPTGAEGAR
Ga0137410_1185769623300012944Vadose Zone SoilMVKKRRSLRAMLSSESIDALVAHERVRAARQYIERCDERTLGRQAALSAIPAPTGAE
Ga0134110_1051372023300012975Grasslands SoilMNDLIARLLAHQRIRAARRHVERADELTLARQAALSATPAPT
Ga0134075_1000952313300014154Grasslands SoilLVDQTISRLLAHERIRAARHHLERTEEITLARQAALSA
Ga0134079_1068616613300014166Grasslands SoilVIDALLAHDRIRAARAHIERSDEVTLARQAELSAIPAPTG
Ga0180086_117261013300014883SoilMNDLIAKLSAHGRIRAARLHIERTDEVTLARQAAL
Ga0137418_1125900713300015241Vadose Zone SoilMTESITALLAHDRVRAARAHIERSDEATLGRQAALSAIPAPTGA
Ga0137409_1000620513300015245Vadose Zone SoilMDHAVSRLLAHERIRAARHHIERTDEVTLARQAALSAI
Ga0134073_1029342213300015356Grasslands SoilMNDLISRLLAHERIRAARRHVERADDVTLARQAALSATPAPTGAEGARGARVA
Ga0134083_1015793923300017659Grasslands SoilMNDLISRLLAHDRIRAARHHIERTDEVTLARQATLSAIGAPTGAE
Ga0187773_1040989623300018064Tropical PeatlandMLPSIHALVAHDRIRAARAHIERSDEATLARQAALSAIPAPTGAE
Ga0184612_1002111343300018078Groundwater SedimentMPYSIDALLNETRVRGARAHIERSDEATLARQAALSAIP
Ga0066667_1045005413300018433Grasslands SoilVDQTISRLLGHERIRAARHHVERTDEVTLARQAALCAIP
Ga0066662_1186218313300018468Grasslands SoilVEHTLSRLLAHERIRAARRHVERADDVTLARQAALSATPAPTGAEGARGARV
Ga0137408_132596813300019789Vadose Zone SoilVDHAVSRLLAHERIRAARHHIERTDEVTLARQAALSTIPPAPTGAEGRRAARVA
Ga0193713_115565413300019882SoilMNDVIAKLSAHERIRAARHHIERTDEVTLARQAALSAIPAPTGAET
Ga0210378_1004064913300021073Groundwater SedimentMPYSIDALLNETRVRGARAHIERSDEATLARQAALSAIPAPTGAEAKRG
Ga0222625_162543513300022195Groundwater SedimentMPESIDSVVAHERVRAARHHIERSDESTLARQAALSAIPAPTGAEGARGAHLAEL
Ga0137417_110041123300024330Vadose Zone SoilMDHAVSRLLAHERIRAARHHIERTDEVTLVRQAALSAIPAPTGAEGRRDAH
Ga0209641_1023571213300025322SoilVDHTISHLLAHERIRAARHHIERTDEVTLARQAALSAIPAPTGAEAQRGTR
Ga0207684_1119916513300025910Corn, Switchgrass And Miscanthus RhizosphereMDHAVSRLLAHERIRAARHHIERTDEGTLARQAALS
Ga0207684_1154027813300025910Corn, Switchgrass And Miscanthus RhizosphereMPGSIDTVVAHERVRAARQHIERSDENTLARQAALSAIPAPTGAEGA
Ga0209350_114264113300026277Grasslands SoilMAMHEVISGLLGHDRVKAARQHIERTDDVTLARQATLSAIPAPTGAEAQRAAR
Ga0209235_101886533300026296Grasslands SoilVNLDCLLSHPRIQAARAHLERTDEVTLARQVELCGLAAPTGAETRRGARVAELFREI
Ga0209237_125332013300026297Grasslands SoilMDHAVSRLLAHERIRAARQHIERTDEVTLARQAALSAIPAPTGAEGRRAAR
Ga0209237_127710523300026297Grasslands SoilMHEVISGLLGHDRVKAARQHIERTDDVTLARQATLSAIPAP
Ga0209236_122263013300026298Grasslands SoilVDHALSRLLAHDRIRAARHHIERTDEGTLARQAALSAIPAPTGAEGRRAAHVADLFRTIG
Ga0209027_101607833300026300Grasslands SoilVDQTISRLLAHDRVRAARSHLERTDEVTLARQAALCAIPAPTGAEGRRAAHVA
Ga0209238_105610813300026301Grasslands SoilVKTVEHTLSRLLAHERIRAARRHVERADELTLTRQAALSATPAPTG
Ga0209468_111477613300026306SoilMNDLISRLLAHDRIRAARHHIERTDEGTLARQAALSAIPAP
Ga0209468_113240813300026306SoilMHEVISSLLSHDRVKAARQHIERTDEATLARQAALSAIPAPTGAEAERAARVAE
Ga0209469_112274723300026307SoilMNELIARLLGHERVRAARHHIERTDELTLARQASLSAIPAPTGAEGQRAARVAELF
Ga0209239_114709213300026310Grasslands SoilMNDLISRLLAHDRIRAARHHIERTDELTLARQASLSAIPAPTGAEGQ
Ga0209470_108204713300026324SoilVIDALLAHDRIRAACAHIERSDEVTLARQAELSAIPAPTGSEAARAAKVVELFAGVGLHDDAV
Ga0209473_102117513300026330SoilVDQTISRLLAHDCVRAARSHLERSDEVTLARQAALCAIPAPTGAEGRRAAHVAELFRTVG
Ga0209803_101055113300026332SoilVDQTISRLLAHERIRAARHHLERTEEITLARQAALSAV
Ga0209803_132248223300026332SoilMHEVISSLLSHDRVKAARQHIERTDEATLARQAAL
Ga0209808_111048513300026523SoilVKAVEHTLSRLLAHERIRAARRHVERADERTLARQAALSATPAPTGAEAARGARVAE
Ga0209058_106977913300026536SoilMNDLISRLLAHERIRAARHHIERTDEVTLARQAVLSAIPAPT
Ga0209157_129445113300026537SoilMNDLISRLLAHDRIRAARHHIERTDELTLARQASLSAIPAPTGA
Ga0209056_1008746613300026538SoilMNDLISRLLAHERIRAARHHIERTDEGTLARQAALSAIPAPTGAEGRRAA
Ga0209056_1049263923300026538SoilMHEVISGLLGHDRVKAARQHIERTDDVTLARQATLSAIPAPTGAEAQR
Ga0209376_111741313300026540SoilMNDLLSRLLAHERIRAARHHIERTDEVTLARQAALSAIPAPTGAEGRRAAHL
Ga0209161_1021154323300026548SoilMDHAVSRLLAHERIRAARHHIERTDEVTLARQTAL
Ga0208339_10020623300026924SoilMDHAVSRLLAHERIRAARHHIERTDEVTLARQTALSAIPA
Ga0208454_102935723300027573SoilMIDALLTHDRVRAARVHIERSDEATLARQASLSAIP
Ga0209811_1013710723300027821Surface SoilVIDGLLAHDRVRAARLHIERSDEATLARQASLSAIPAPTGAERARGTRV
Ga0209180_1002842813300027846Vadose Zone SoilMIDAVLLNERVRTARAHIERSDEATLARQAALSAIPAP
Ga0209590_1008708733300027882Vadose Zone SoilMDDLIVKLLAHDRIQAARHHIERTDELILARQAAL
Ga0209868_102764513300027947Groundwater SandVEQTISRLLAHERIRAARHHIERTDEVTLARQAALSAIPAPTGAEAQRG
Ga0137415_1084906423300028536Vadose Zone SoilMPSSESIGALVAHERVRAARLYIERSDESTLARQAALSAIPAPTGAEGARGAHVAD
Ga0307469_1003132633300031720Hardwood Forest SoilMDHAVSRLLAHERIRAARQHVERTDEVTLARQTALSAIPAPTGAEGRRASRV
Ga0307469_1112245313300031720Hardwood Forest SoilMIDALLAHDRVRAARAHIERSDEATLARQASLSAIPAPTGAERARGTCVAQMFAD
Ga0307469_1134720313300031720Hardwood Forest SoilVIDALLAHDRIRTARAHIERSDELTLERQAALSAIPAPTGAEA
Ga0307469_1163352923300031720Hardwood Forest SoilMIGLAFVGTMNDLIAKLLSHDRVRAARQHIERTDEITLARQAAISAIPAPTGAEAQRATRAAELFREVG
Ga0307473_1117333723300031820Hardwood Forest SoilMDHAVSRLLAHERIRAARHHIERTDEGTLTRQAALSAIPAPTGA
Ga0214473_1175439623300031949SoilMEDPIPRLLAHDRVRAARRHLERSDEGTLAVQAELSAIPAPTGAEGR
Ga0307471_10434877823300032180Hardwood Forest SoilMIGLAFVGTMNDLIAKLLSHDRVRAARQHIERTDEITLARQAAISAIPAP
Ga0335079_1144803123300032783SoilMLNELVAKLLAHDRMRAARQHIERTDEVTLARQAALSAIPAPTGA
Ga0314862_0174026_3_1793300033803PeatlandMHDVIAKLLAHERIRAARHHLERTDEVTLARQSALSAIPAPTGAEGQRAVATADLFREV
Ga0364928_0098521_2_1543300033813SedimentMPGTIDALLSHDRVRAARAHIERSDEATLVRQAALSSIPAPTGAETARGAR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.