NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103852



Overview

Basic Information
Family ID F103852
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 42 residues
Representative Sequence GAIEVHTFDGLPEHRMVPSPSQPETMRAIDTITDFIRRQTA
Number of Associated Samples 96
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 2.97 %
% of genes from short scaffolds (< 2000 bps) 1.98 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46


Most Common Taxonomy
Group Unclassified (99.010 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere (12.871 % of family members)
Environment Ontology (ENVO) Unclassified (32.673 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (38.614 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 26.09%, β-sheet: 0.00%, Coil/Unstructured: 73.91%

Predicted 3D Structure

pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
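The low-confidence label in the note above follows from the stated cutoff (pTM < 0.7). A minimal sketch of that check, assuming a simple strict threshold; the constant and function names are hypothetical and not part of NMPFamsDB:

```python
PTM_CUTOFF = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F103852
print(is_low_confidence(0.46))
```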



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF09084 | NMT1 | 21.78
PF01594 | AI-2E_transport | 4.95
PF03401 | TctC | 3.96
PF01568 | Molydop_binding | 2.97
PF07883 | Cupin_2 | 1.98
PF00211 | Guanylate_cyc | 1.98
PF01717 | Meth_synt_2 | 1.98
PF05977 | MFS_3 | 1.98
PF12695 | Abhydrolase_5 | 1.98
PF03358 | FMN_red | 0.99
PF01257 | 2Fe-2S_thioredx | 0.99
PF01206 | TusA | 0.99
PF01972 | SDH_sah | 0.99
PF01510 | Amidase_2 | 0.99
PF00296 | Bac_luciferase | 0.99
PF09242 | FCSD-flav_bind | 0.99
PF08299 | Bac_DnaA_C | 0.99
PF07969 | Amidohydro_3 | 0.99
PF04191 | PEMT | 0.99
PF02541 | Ppx-GppA | 0.99
PF01738 | DLH | 0.99
PF05706 | CDKN3 | 0.99
PF02775 | TPP_enzyme_C | 0.99
PF02423 | OCD_Mu_crystall | 0.99
PF10589 | NADH_4Fe-4S | 0.99
PF13432 | TPR_16 | 0.99
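
The frequencies in this table are percentages of the 101 family scaffolds, so each value should correspond to a whole number of scaffolds. A minimal sketch of that conversion, assuming the percentages are plain count/101 ratios rounded to two decimals (the function name `scaffold_count` is hypothetical and not part of NMPFamsDB):

```python
TOTAL_SCAFFOLDS = 101  # "Number of Associated Scaffolds" from the Overview


def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Recover the approximate number of scaffolds behind a percentage."""
    return round(percent * total / 100)


# A few rows from the Pfam table above: (name, % frequency)
rows = [("NMT1", 21.78), ("AI-2E_transport", 4.95), ("TctC", 3.96), ("TusA", 0.99)]
for name, pct in rows:
    print(name, scaffold_count(pct))
```

Under this assumption, 21.78 % corresponds to 22 of the 101 scaffolds and 0.99 % to a single scaffold.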

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG0715 | ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 21.78
COG4521 | ABC-type taurine transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 21.78
COG0628 | Predicted PurR-regulated permease PerM | General function prediction only [R] | 4.95
COG3181 | Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctC | Energy production and conversion [C] | 3.96
COG0248 | Exopolyphosphatase/pppGpp-phosphohydrolase | Signal transduction mechanisms [T] | 1.98
COG0616 | Periplasmic serine protease, ClpP class | Posttranslational modification, protein turnover, chaperones [O] | 1.98
COG0620 | Methionine synthase II (cobalamin-independent) | Amino acid transport and metabolism [E] | 1.98
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 1.98
COG2814 | Predicted arabinose efflux permease AraJ, MFS family | Carbohydrate transport and metabolism [G] | 1.98
COG0425 | Sulfur carrier protein TusA (tRNA thiolation, molybdenum cofactor biosynthesis) | Translation, ribosomal structure and biogenesis [J] | 0.99
COG0593 | Chromosomal replication initiation ATPase DnaA | Replication, recombination and repair [L] | 0.99
COG1905 | NADH:ubiquinone oxidoreductase 24 kD subunit (chain E) | Energy production and conversion [C] | 0.99
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 0.99
COG2423 | Ornithine cyclodeaminase/archaeal alanine dehydrogenase, mu-crystallin family | Amino acid transport and metabolism [E] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.01 %
All Organisms | root | All Organisms | 0.99 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300006865|Ga0073934_10279228 | Not Available | 1079
3300009156|Ga0111538_11204766 | Not Available | 957
3300014311|Ga0075322_1004192 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2543
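
Each scaffold entry above pairs a numeric prefix with a scaffold name, joined by a `|` character; the prefixes match Taxon OIDs listed under Associated Samples. A minimal sketch of splitting such an identifier, assuming that two-part layout holds for every entry (the helper name `split_scaffold_id` is hypothetical):

```python
from typing import Tuple


def split_scaffold_id(scaffold: str) -> Tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    # Split on the first '|' only, in case a name ever contains another one.
    taxon_oid, scaffold_name = scaffold.split("|", 1)
    return taxon_oid, scaffold_name


print(split_scaffold_id("3300006865|Ga0073934_10279228"))
```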




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 12.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.91%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 6.93%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 6.93%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 4.95%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 4.95%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.96%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.97%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 2.97%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 2.97%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.98%
Wetland Sediment | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment | 1.98%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.98%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.98%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.98%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.99%
Freshwater Microbial Mat | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Microbial Mat | 0.99%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.99%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 0.99%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.99%
Termite Nest | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Termite Nest | 0.99%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.99%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.99%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.99%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.99%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.99%
Deep Subsurface Sediment | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment | 0.99%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.99%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.99%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.99%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.99%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere | 0.99%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.99%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.99%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002124 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_3 | Environmental
3300004020 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2 | Environmental
3300004047 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D1 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005331 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG | Host-Associated
3300005355 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG | Host-Associated
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005466 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3L metaG | Environmental
3300005548 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG | Host-Associated
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006169 | Termite nest microbial communities from Madurai, India | Environmental
3300006194 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 | Host-Associated
3300006846 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009087 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015 | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300009545 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG | Host-Associated
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300011441 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT513_2 | Environmental
3300011442 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT138_2 | Environmental
3300012167 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT333_2 | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012510 | Arabidopsis rhizosphere microbial communities from North Carolina - M.Col.9.old.080610 | Host-Associated
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300014267 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailC_D1 | Environmental
3300014306 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushSE_CattailNLB_D1 | Environmental
3300014311 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailB_D1 | Environmental
3300014314 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_TuleA_D2 | Environmental
3300014879 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_10D | Environmental
3300015201 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S014-104B-1 (version 2) | Environmental
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental
3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental
3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental
3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental
3300018081 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018432 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 T | Environmental
3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental
3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300020195 | Freshwater microbial mat bacterial communities from Lake Vanda, McMurdo Dry Valleys, Antarctica - Oligotrophic Lake LV.19.P2.IB | Environmental
3300021051 | Subsurface sediment microbial communities from Mancos shale, Colorado, United States - Mancos A1 | Environmental
3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300025919 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes) | Host-Associated
3300025932 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaG (SPAdes) | Host-Associated
3300025981 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes) | Host-Associated
3300026018 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D1 (SPAdes) | Environmental
3300027513 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes) | Environmental
3300027778 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh (SPAdes) | Environmental
3300027815 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes) | Environmental
3300027840 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare2Fresh (SPAdes) | Environmental
3300027876 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S PM (SPAdes) | Host-Associated
3300027880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes) | Host-Associated
3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300030620 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 | Environmental
3300031229 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38 | Environmental
3300031576 | Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25 | Environmental
3300031744 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2) | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300032003 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D1 | Environmental
3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300033417 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155 | Environmental
3300033434 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day10_CT_b | Environmental
3300033482 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D1_C | Environmental
3300034165 | Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17 | Environmental
3300034178 | Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17 | Environmental
3300034257 | Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17 | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
C687J26631_102270401 | 3300002124 | Soil | RGGTIEAQTFDGLPEHRMVPSPTQPETMRAIETITAFIRRQTG*
Ga0055440_100608661 | 3300004020 | Natural And Restored Wetlands | ASYRKRGGVIEVHSFDGLPEHRMVPSPSQPESMRFIETVSAFIRQKAG*
Ga0055499_100126441 | 3300004047 | Natural And Restored Wetlands | KRGGVIEVHSFDGLPEHRMAPSPSQPESMRFIDTVSAFIRQKTG*
Ga0066678_109040941 | 3300005181 | Soil | AIEVQTFDGLPEHRMVPSPSQPETMRVIDTITAFIRRHTA*
Ga0065707_107507651 | 3300005295 | Switchgrass Rhizosphere | RGGQIEVHTFEGLPEHRMVPSRAQPETMRAMDTIAEFIRRQT*
Ga0070670_1003636362 | 3300005331 | Switchgrass Rhizosphere | KRGGQIEVHTFEGLPEHRMVPSRAQPETMRAMDTIAEFIRRQT*
Ga0070671_1003995322 | 3300005355 | Switchgrass Rhizosphere | YRKRGGQIEVHTFEGLPEHRMVPSRAQPETMRAIDTIAEFIRRQT*
Ga0066682_107875552 | 3300005450 | Soil | GAIEVHTFDGLPEHRMVPSPSQPETMRAIDTITDFIRRQTA*
Ga0070685_106547813 | 3300005466 | Switchgrass Rhizosphere | IASYRKRGGTIEVHTFAGLPEHRMVPALNQPETMRVIDTIIGFIGRQNR*
Ga0070665_1009803102 | 3300005548 | Switchgrass Rhizosphere | EVHTFEGLPEHRVVPSPAQPETMRAMDTITGFIERQTS*
Ga0066698_100386601 | 3300005558 | Soil | IEVQTFDGLPEHRMVPSPSQPETMRVIDTITAFIRRHTA*
Ga0070702_1018510382 | 3300005615 | Corn, Switchgrass And Miscanthus Rhizosphere | GAIEVHTFDGLPEHRMVPSPDKPETLRFVDIVTEFVRRHS*
Ga0066905_1010936811 | 3300005713 | Tropical Forest Soil | RGGAIEVHTFDGLPEHRMVPSPSQPDTMRVIDTTTTFIRRHSK*
Ga0066905_1013973791 | 3300005713 | Tropical Forest Soil | VHTFDGLPEHRMVPSPSQPDTMRVIDTMTAFIRRHSK*
Ga0066905_1014468651 | 3300005713 | Tropical Forest Soil | FDGLPEHRMVPSPAQPETMRAIDTIAAFIRRHTG*
Ga0066903_1038762061 | 3300005764 | Tropical Forest Soil | VQTFEGLPEHRMVPSPAQPETMRAMDTITDFIKRQTA*
Ga0082029_15771472 | 3300006169 | Termite Nest | EVHTFDALPEHRVVPSPSQPETMRLIDTITEFISRKAR*
Ga0075427_100950482 | 3300006194 | Populus Rhizosphere | RFIASYRKRGGVLEDHTFEGLPEHRMVPSPDNPETMRFVDIVTAFIRRQS*
Ga0075430_1000507854 | 3300006846 | Populus Rhizosphere | HTFEGLPEHRMVPSPDNPETMRFVDIVTAFIRRQS*
Ga0075433_104270341 | 3300006852 | Populus Rhizosphere | KRGGQIEVHTFEGLPEHRMVPSRAQPETMRAMDIIAEFIRRQT*
Ga0075420_1003741262 | 3300006853 | Populus Rhizosphere | FDALPEHRVVPSPSQPDTMRLIDTIAEFISRKAR*
Ga0073934_102792282 | 3300006865 | Hot Spring Sediment | RSYRERGGAIEEHTFAGLSEHRIVPSPTMPETMRLIQAVTAFIRRQSGSLLC*
Ga0075424_1001651251 | 3300006904 | Populus Rhizosphere | RGGQIEVHTFEGLPEHRVVPSPAQPETMRAMDIITGFIERQTA*
Ga0075424_1014276981 | 3300006904 | Populus Rhizosphere | GTNEVHTFEGLPEHRMVPSPSEPETMRCIDVMTEFIHRQSR*
Ga0075419_101558782 | 3300006969 | Populus Rhizosphere | GGVLEDHTFEGLPEHRMVPSPDNPETMRFVDIVTAFIRRQS*
Ga0075435_1000437821 | 3300007076 | Populus Rhizosphere | ASYRKRGGAIEADTFDGLPEHRIVPSPEKPETMRFVDAIAAFIRRHGA*
Ga0099791_102352141 | 3300007255 | Vadose Zone Soil | TFEGLPEHRMVPSPDNPETMRFIDIVTAFIRRQTA*
Ga0105107_111447462 | 3300009087 | Freshwater Sediment | GAIEVHAFDGLPDPRMVPSPEQPETMRFIDIVSAFISAKA*
Ga0111539_123846791 | 3300009094 | Populus Rhizosphere | IEVHTFEGLPEHRMVSSRAQPETMRAMDIIAEFIRRQT*
Ga0075418_119772541 | 3300009100 | Populus Rhizosphere | FDGLPEHRMVPSPSQPDTMRAIDTITAFIRRQTG*
Ga0114129_105684553 | 3300009147 | Populus Rhizosphere | IEVHTFDGLPEHRMVPSPSEPETMRVIDTITAFIRRQAG*
Ga0111538_112047661 | 3300009156 | Populus Rhizosphere | RFIASYRKRGGVLEEHTFEGLPEHRMVPSPDNPETMRFIDIVTAFIRRQTR*
Ga0105248_107978992 | 3300009177 | Switchgrass Rhizosphere | SYRKRGGQIEVHTFEGLPEHRVVPSPAQPETMREMDIITGFIERQTA*
Ga0105237_117574601 | 3300009545 | Corn Rhizosphere | GQIEVHTFEGLPEHRVVPSPAQPETMRAMDTITGFIERQTS*
Ga0126374_114732502 | 3300009792 | Tropical Forest Soil | IEVDTFDGLPEHRIVPSPDKPETMRFVDAITAFIRWHA*
Ga0126382_123444882 | 3300010047 | Tropical Forest Soil | ASYRKCGGAIEVHTFDGLPEHRMVPSPDQPDTMRCMETITTFIRRYS*
Ga0126377_123396472 | 3300010362 | Tropical Forest Soil | IASYRKRGGAIEVHTFDGLPEHRMVPSPSQPDTMRVIDTMTAFIRRHNK*
Ga0137452_10352831 | 3300011441 | Soil | KRGGPIEVHTFNGLPEHRMVPSPIEPETMRFIDIVSTFIWRQTK*
Ga0137437_11830462 | 3300011442 | Soil | RGGAIEVHTFDGLPEHRMVPSPSEPETMRFIDTVSAFIGRQNK*
Ga0137319_10976062 | 3300012167 | Soil | EVHTFDGLPEHQMVPSPAQPETMRAMETMTTFIRRQTG*
Ga0137382_111977971 | 3300012200 | Vadose Zone Soil | QTFDGLPEHRMVPSTSQPETMRAIDTITTFIRCQTE*
Ga0137362_111213152 | 3300012205 | Vadose Zone Soil | AIEVHTFDGLPEHRMVPSPSQPETMRAIDTITDFIRRQTA*
Ga0137378_107346741 | 3300012210 | Vadose Zone Soil | SYRKRGGAIEVQTFDGLPEHRMVPSSSQPETMRAIDTITAFIRRQTG*
Ga0137370_100533254 | 3300012285 | Vadose Zone Soil | FEGLPEHRMVPSPSQPETMRAIDTITGFIRRQGA*
Ga0157316_10768251 | 3300012510 | Arabidopsis Rhizosphere | IEVQTFDGLPEHRMVPSPDQPQTIRAIETSTGFIRRQTG*
Ga0137394_100285721 | 3300012922 | Vadose Zone Soil | YRKRGGAIEVQTFNGLPEHRMVPSTSQPETMRAIDTITTFIRRQFE*
Ga0137407_100552281 | 3300012930 | Vadose Zone Soil | TFDGLPEHRMVPSPSQPETMHAIETITAFIRRQTG*
Ga0164300_100642512 | 3300012951 | Soil | GGQLEVHKFEGLPEHRMVPSRAQPETMRAIDTIAEFIRRQT*
Ga0134087_105163182 | 3300012977 | Grasslands Soil | RKRGGAIEVHTFDGLPEHRMVPSPSQPETMRAIDTITDFIRRQTA*
Ga0164304_111456551 | 3300012986 | Soil | HTFEGLPEHRMVPSPSQPETMHLIDTITTFIRRQTA*
Ga0075313_11812931 | 3300014267 | Natural And Restored Wetlands | YQKRGGAIEVHTFTGLPEHGMVPSPSKPETMRAIEIIAAFIRQHGG*
Ga0075346_11294211 | 3300014306 | Natural And Restored Wetlands | IEVHTLDGLPEHRMVPSPSEPETMRFIDIVRGFIAAKG*
Ga0075322_10041924 | 3300014311 | Natural And Restored Wetlands | FIASYRKRGGAIEAHTFEGLPEHRMVPSPDKPETMRFMDLVTAFIRRHSA*
Ga0075316_11747991 | 3300014314 | Natural And Restored Wetlands | KRGGAIEVHTFEGLPEHRMVPSPDKPETMRFIDTVTDFIRRQTT*
Ga0180062_11503612 | 3300014879 | Soil | SYRKRGGAIEVHTFDGLPEHRMVPSPSQPETMRAIGTITAFITAKA*
Ga0173478_103179182 | 3300015201 | Soil | RKRGGQIEVHTFEGLPEHRVVPSPAQPETMRAMDTITGFIERQTS*
Ga0132255_1014797291 | 3300015374 | Arabidopsis Rhizosphere | FIASYRKRGGQIEVHTFEGLPEHRIVPSPAQPETMRAMDTITEFVRRQA*
Ga0134083_102468961 | 3300017659 | Grasslands Soil | RKRGGPIEAQTFDGLPEHRMVPSPSQPETMRAIDTITGFIRRQSA
Ga0184610_11176471 | 3300017997 | Groundwater Sediment | FIASYRKRGGAIEVHTFEGLPEHRMVPSPAQPETMRAMETITTFIRRQTA
Ga0184610_12556231 | 3300017997 | Groundwater Sediment | GAIEVHTFDGLPEHRMVPSPSQPETMRVIDTITAFIRRQS
Ga0184638_10369501 | 3300018052 | Groundwater Sediment | KRGAAIEVHTFDGLPEHRMVPSPAQPETMRVIDTITAFIRRQTA
Ga0184626_101170841 | 3300018053 | Groundwater Sediment | KRGGTIEVQTFDGLPEHRMVPSPSQPETMRAIDTITAFIRRQTG
Ga0184640_104306892 | 3300018074 | Groundwater Sediment | GGAIEAQTFDGLPEHRIVQSPTQPATMRLIETITAFIRRQTPAWSAEQLQV
Ga0184633_105459172 | 3300018077 | Groundwater Sediment | VHTFDGLPEHRMVPSPSQPETMRAIDTISAFIQRQTG
Ga0184625_104398772 | 3300018081 | Groundwater Sediment | RKRGAAIEVHTFDGLPEHRMVPSPAQPETMRVIDTITAFIRRQTA
Ga0190265_115328391 | 3300018422 | Soil | IEVHTFDGLPEHRVVPSPSQPDTMRLIDTITAFISNKA
Ga0190275_100158111 | 3300018432 | Soil | RGGAIEVDTFDGLPEHRMVPSPSEPETMRFIEVVSAFIRQQSK
Ga0190270_113366412 | 3300018469 | Soil | AIEVHTFDGLPEHRMVPSPSEPETMRFIDIVSAFIGRQT
Ga0184646_15886922 | 3300019259 | Groundwater Sediment | ASYRKRGGAIEAQTFDGLPEHRMVPSPSQPETMRAMDTINGFIRRHSG
Ga0193707_11827102 | 3300019881 | Soil | KRGGVLEDHTFEGLPEHRMVPSADQPETMRFIDIVTAFIRRQAA
Ga0163150_101404612 | 3300020195 | Freshwater Microbial Mat | ITSYRKRGGAIEVDTFDGLPEHRMVPSPDQPETMRFIDAVSAFIAAKT
Ga0206224_10019241 | 3300021051 | Deep Subsurface Sediment | ASYRKRGGAIEVHTFDNLPEHRMVPSASQPETMRVIDTITAFVRRQTG
Ga0210378_100182861 | 3300021073 | Groundwater Sediment | RGGAIEVHTFDGLPEHRMVPSPAQPETMRCMETITTFIRRQTA
Ga0126371_115797102 | 3300021560 | Tropical Forest Soil | EVQTFEGLPEHRMVPSPAQPETMRAMDTITDFIKRQTA
Ga0207657_112103042 | 3300025919 | Corn Rhizosphere | RKRGGAVEVHTFDGLPEHRMVPSPDKPETVRFIDIVTEFIRRHS
Ga0207690_118689261 | 3300025932 | Corn Rhizosphere | GQIEVHTFEGLPEHRVVPSPAQPETMRAMDTITGFIERQTS
Ga0207640_115763012 | 3300025981 | Corn Rhizosphere | FIASYRKRGGQIEVHTFEGLPEHRVVPSPAQPETMRAMDTITGFIERQTS
Ga0208418_10094912 | 3300026018 | Natural And Restored Wetlands | VHSFAGLPEHRMVPSPSQPESMRFIETVSVFIRQKTG
Ga0208685_10562392 | 3300027513 | Soil | VHTFDGLPEHRMVPSPDQPETMRFIDIVSAFIQQQTK
Ga0209464_100785413 | 3300027778 | Wetland Sediment | AIEVHSFDGLPEHRMVPSPSEPETMRFIDIVSAFIRRQAR
Ga0209726_103711302 | 3300027815 | Groundwater | HSFDGLPEHRMVPSPSQPETMRFIEIVSAFIQQQAN
Ga0209683_101110891 | 3300027840 | Wetland Sediment | VHTFDGLPEHRMVPSPSQPESMRFIDIVRTFIGQQPG
Ga0209974_104436311 | 3300027876 | Arabidopsis Thaliana Rhizosphere | IEVHSFDGLPEHGMVPSPSEPETMRVIDTIAAFIRRQS
Ga0209481_102866711 | 3300027880 | Populus Rhizosphere | GGVLEDHTFEGLPEHRMVPSPDNPETMRFVDIVTAFIRRQS
Ga0268265_105422651 | 3300028380 | Switchgrass Rhizosphere | GVLEDHTFEGLPEHRMVPSPDMPETMRFIDIVTGFIRRHAV
Ga0268265_114689991 | 3300028380 | Switchgrass Rhizosphere | RGGQIEVHTFEGLPEHRVVPSPAQPETMRAMDTITGFIERQTS
Ga0307305_104271471 | 3300028807 | Soil | ASYRKRGGAIEVHTFDGLPEHRMVPSPSQPETMRAIDTITDFIRRQTA
Ga0302046_108954272 | 3300030620 | Soil | HTFNGLPEHRMTPSPGQPETMRVIDTIMTFIRRQTGWRLGTEVR
Ga0299913_102309051 | 3300031229 | Soil | EVHTFAGLPEHRMVPSPNQPDTMRVIDTITAFIQRQTA
Ga0247727_110176322 | 3300031576 | Biofilm | AETFDGLPENRMVPSPSQPETTRFIETVTAFIRRQTG
Ga0306918_104699232 | 3300031744 | Soil | RKRGGQIEVHTFEGLPEHRMVPSPAQPETMRAMDTITDFIKRQTA
Ga0307473_105813421 | 3300031820 | Hardwood Forest Soil | RGGAIEVETFDGLPEHRMVPSPSQPETMRAIDTITAFVRRQTT
Ga0310897_100872611 | 3300032003 | Soil | GQIEVHTFEGLPEHRMVPSCAQPETMRAMDTIAEFIRRQT
Ga0306924_104512671 | 3300032076 | Soil | FIASYRKRGGRIEVQTFEGLPEHRMVPSPAQPETMRAMDTITDFIKRQTA
Ga0335085_114887612 | 3300032770 | Soil | SYRKRGGANDVHTFEGLPEHRMVPSPDQPETMRCIDTITAFIRRHTV
Ga0214471_111662422 | 3300033417 | Soil | ASYRKRGGAIEVHTFEGLPEHRMVPSPSQPETMRVIETITTFIRRQTG
Ga0316613_102760161 | 3300033434 | Soil | IASYRKRGGAIEVHTFDGLPEHRMVPSPDQPETMRFIDIVSAFISAKA
Ga0316627_1004865102 | 3300033482 | Soil | RKRGGAIEVHTFDGLPEHRMVPSPDQPETMRFIDIVSAFISAKA
Ga0364942_0136308_658_798 | 3300034165 | Sediment | YRKRGGAIEVHSFDGLPEHRMVPSPSQPETMRAIETITEFVRRQTG
Ga0364934_0240413_2_109 | 3300034178 | Sediment | HTFDSLPEHRMVPSPSQPETMRFIDIVSTFITAKT
Ga0370495_0339493_382_495 | 3300034257 | Untreated Peat Soil | VHTFDGLPEHRMVPSPSQPETMRFIDIVSAFVRQRTG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"