NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F081311

Overview

Basic Information
Family ID F081311
Family Type Metagenome / Metatranscriptome
Number of Sequences 114
Average Sequence Length 46 residues
Representative Sequence WLARTGCEGAEAQRVRELLADRMTDDGSAWTDTKIVIRARKSQS
Number of Associated Samples 97
Number of Associated Scaffolds 114

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 1.77 %
% of genes near scaffold ends (potentially truncated) 96.49 %
% of genes from short scaffolds (< 2000 bps) 94.74 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.49

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model logo (rendered with Skylign)

Most Common Taxonomy
Group Unclassified (86.842 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (26.316 % of family members)
Environment Ontology (ENVO) Unclassified (32.456 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (75.439 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 16.67%, β-sheet 2.78%, Coil/Unstructured 80.56%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
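
The cutoff quoted in the note above (pTM < 0.7) and the four pLDDT legend bands can be written as a small check. This is a minimal Python sketch, not NMPFamsDB code: the function names are hypothetical, and the handling of fractional values that fall between two bands (for example 50.5) is an assumption.

```python
# Hypothetical helpers; names and fractional-value handling are assumptions.
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence_model(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when the pTM-score is strictly below the cutoff."""
    return ptm < threshold


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four legend bands."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The pTM-score reported for family F081311 is 0.49.
print(is_low_confidence_model(0.49))
print(plddt_band(65.2))
```

With the reported pTM-score of 0.49 the model falls below the cutoff, which matches the low-confidence label on this page.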



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 114 Family Scaffolds
PF02629 | CoA_binding | 88.60
PF13607 | Succ_CoA_lig | 3.51
PF08241 | Methyltransf_11 | 0.88
PF06305 | LapA_dom | 0.88
PF13620 | CarboxypepD_reg | 0.88
PF12897 | Asp_aminotransf | 0.88
PF00296 | Bac_luciferase | 0.88
PF00082 | Peptidase_S8 | 0.88
PF00155 | Aminotran_1_2 | 0.88
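
The frequencies above are given relative to 114 family scaffolds, so each percentage implies a whole number of scaffolds. The Python sketch below recovers those counts, assuming each value is count / 114 rounded to two decimals. That rounding rule is an assumption and is not stated on the page; the function names are hypothetical.

```python
# Assumption: each listed percentage is count / 114, rounded to two decimals.
FAMILY_SCAFFOLDS = 114


def implied_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Nearest whole number of scaffolds implied by a rounded percentage."""
    return round(percent / 100 * total)


def as_percent(count: int, total: int = FAMILY_SCAFFOLDS) -> float:
    """Percentage of scaffolds, rounded to two decimals like the table."""
    return round(100 * count / total, 2)


rows = [("CoA_binding", 88.60), ("Succ_CoA_lig", 3.51), ("Methyltransf_11", 0.88)]
for name, pct in rows:
    count = implied_count(pct)
    # Print the implied count and the percentage it maps back to.
    print(name, count, as_percent(count))
```

Under that assumption, 88.60 % corresponds to 101 scaffolds, 3.51 % to 4, and 0.88 % to a single scaffold.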

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 114 Family Scaffolds
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 0.88
COG3771 | Lipopolysaccharide assembly protein YciS/LapA, DUF1049 family | Cell wall/membrane/envelope biogenesis [M] | 0.88



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 86.84 %
All Organisms | root | All Organisms | 13.16 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000891|JGI10214J12806_12448910 | Not Available | 546
3300000891|JGI10214J12806_13035631 | Not Available | 536
3300000956|JGI10216J12902_110534027 | Not Available | 524
3300001305|C688J14111_10219362 | Not Available | 593
3300001305|C688J14111_10224135 | Not Available | 587
3300001532|A20PFW1_1221353 | Not Available | 837
3300001537|A2065W1_10017745 | Not Available | 735
3300001686|C688J18823_10079243 | Not Available | 2279
3300001686|C688J18823_10852769 | Not Available | 578
3300001686|C688J18823_10890657 | Not Available | 565
3300002568|C688J35102_118057321 | Not Available | 526
3300002568|C688J35102_118496155 | Not Available | 565
3300002568|C688J35102_119550841 | Not Available | 718
3300002568|C688J35102_120543584 | Not Available | 1163
3300004081|Ga0063454_101815592 | Not Available | 534
3300004114|Ga0062593_101015958 | Not Available | 852
3300004114|Ga0062593_101665387 | Not Available | 695
3300004479|Ga0062595_101063582 | Not Available | 703
3300004480|Ga0062592_102529601 | Not Available | 517
3300005093|Ga0062594_102616547 | Not Available | 557
3300005176|Ga0066679_10235266 | Not Available | 1177
3300005177|Ga0066690_11034599 | Not Available | 514
3300005294|Ga0065705_10423571 | Not Available | 852
3300005331|Ga0070670_101138084 | Not Available | 712
3300005337|Ga0070682_100695071 | Not Available | 815
3300005445|Ga0070708_101678287 | Not Available | 591
3300005468|Ga0070707_101180139 | Not Available | 732
3300005529|Ga0070741_11735467 | Not Available | 507
3300005536|Ga0070697_101709799 | Not Available | 563
3300005556|Ga0066707_10102935 | All Organisms → cellular organisms → Bacteria | 1757
3300005569|Ga0066705_10458884 | Not Available | 799
3300005598|Ga0066706_11190617 | Not Available | 579
3300005842|Ga0068858_102070412 | Not Available | 563
3300006046|Ga0066652_100075107 | All Organisms → cellular organisms → Bacteria | 2649
3300006574|Ga0074056_11627337 | Not Available | 914
3300006755|Ga0079222_10702625 | Not Available | 801
3300006796|Ga0066665_10737460 | All Organisms → cellular organisms → Bacteria | 779
3300006797|Ga0066659_11437413 | Not Available | 576
3300006800|Ga0066660_10223074 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Solirubrobacterales → Solirubrobacteraceae → Solirubrobacter | 1454
3300006871|Ga0075434_100708801 | Not Available | 1024
3300006904|Ga0075424_100435937 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1398
3300007004|Ga0079218_13864963 | Not Available | 509
3300009094|Ga0111539_10827494 | Not Available | 1077
3300010038|Ga0126315_10236180 | Not Available | 1111
3300010038|Ga0126315_10992657 | Not Available | 562
3300010040|Ga0126308_10173439 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia | 1374
3300010041|Ga0126312_10978189 | Not Available | 619
3300010042|Ga0126314_10754228 | Not Available | 715
3300010043|Ga0126380_10568560 | Not Available | 886
3300010303|Ga0134082_10307702 | Not Available | 665
3300010329|Ga0134111_10249403 | Not Available | 728
3300010337|Ga0134062_10185039 | Not Available | 942
3300010403|Ga0134123_10526109 | Not Available | 1119
3300011996|Ga0120156_1016908 | Not Available | 1426
3300012011|Ga0120152_1081375 | Not Available | 956
3300012350|Ga0137372_10860960 | Not Available | 647
3300012355|Ga0137369_10549982 | Not Available | 809
3300012396|Ga0134057_1081783 | Not Available | 516
3300012397|Ga0134056_1310423 | Not Available | 815
3300012469|Ga0150984_117147996 | Not Available | 1243
3300012957|Ga0164303_10922260 | Not Available | 614
3300012961|Ga0164302_11444723 | Not Available | 564
3300012961|Ga0164302_11626287 | Not Available | 538
3300012977|Ga0134087_10071859 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia | 1404
3300012984|Ga0164309_11434551 | All Organisms → cellular organisms → Bacteria | 590
3300012985|Ga0164308_11787752 | Not Available | 572
3300012989|Ga0164305_11560849 | Not Available | 588
3300013294|Ga0120150_1097831 | Not Available | 555
3300013297|Ga0157378_10194305 | Not Available | 1916
3300013765|Ga0120172_1101197 | Not Available | 700
3300013772|Ga0120158_10140410 | Not Available | 1357
3300014157|Ga0134078_10399925 | Not Available | 616
3300014166|Ga0134079_10109903 | Not Available | 1064
3300014326|Ga0157380_13068952 | All Organisms → cellular organisms → Bacteria | 532
3300014827|Ga0120171_1156633 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 516
3300014969|Ga0157376_11985971 | Not Available | 619
3300015077|Ga0173483_10332362 | Not Available | 756
3300015077|Ga0173483_10802338 | Not Available | 544
3300015356|Ga0134073_10220303 | Not Available | 640
3300018433|Ga0066667_12335186 | Not Available | 501
3300019361|Ga0173482_10328782 | Not Available | 684
3300019869|Ga0193705_1029498 | Not Available | 1180
3300020015|Ga0193734_1025232 | Not Available | 1122
3300021344|Ga0193719_10417767 | Not Available | 550
3300022694|Ga0222623_10068781 | Not Available | 1370
3300022756|Ga0222622_10542924 | Not Available | 834
3300025939|Ga0207665_11693167 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 501
3300026530|Ga0209807_1022634 | All Organisms → cellular organisms → Bacteria | 3057
3300026530|Ga0209807_1213326 | Not Available | 667
3300026548|Ga0209161_10418261 | Not Available | 589
3300027787|Ga0209074_10054549 | Not Available | 1233
3300028381|Ga0268264_11678676 | Not Available | 646
3300028704|Ga0307321_1014996 | Not Available | 1333
3300028715|Ga0307313_10039431 | Not Available | 1360
3300028718|Ga0307307_10006286 | All Organisms → cellular organisms → Bacteria | 3070
3300028755|Ga0307316_10055443 | Not Available | 1336
3300028778|Ga0307288_10140684 | Not Available | 903
3300028778|Ga0307288_10208024 | Not Available | 755
3300028791|Ga0307290_10062030 | Not Available | 1354
3300028799|Ga0307284_10222691 | Not Available | 745
3300028799|Ga0307284_10244807 | Not Available | 712
3300028814|Ga0307302_10018061 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 3184
3300028828|Ga0307312_10760335 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium HR36 | 642
3300028876|Ga0307286_10228849 | Not Available | 678
3300028884|Ga0307308_10630656 | Not Available | 514
3300028884|Ga0307308_10657835 | Not Available | 502
3300028885|Ga0307304_10293215 | Not Available | 717
3300028885|Ga0307304_10324501 | Not Available | 684
3300028885|Ga0307304_10503572 | Not Available | 555
3300031366|Ga0307506_10420608 | Not Available | 551
3300031858|Ga0310892_10224062 | Not Available | 1145
3300031911|Ga0307412_10637857 | Not Available | 907
3300031995|Ga0307409_101631667 | Not Available | 673




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 26.32%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 13.16%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 8.77%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 7.89%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 7.02%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 4.39%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 4.39%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.51%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.63%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.63%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.75%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 1.75%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.75%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 1.75%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.88%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.88%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.88%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.88%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.88%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.88%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.88%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.88%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.88%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.88%
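
Each habitat row carries an arrow-separated ecosystem path, so the distribution can be summed at a coarser level by truncating the path. The Python sketch below illustrates this on four rows copied from the table. The function name is hypothetical, and whether the site aggregates levels in this way is an assumption.

```python
from collections import defaultdict

# Four (path, percent) rows copied from the habitat table; the rest are omitted.
ROWS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 26.32),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 13.16),
    ("Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost", 7.02),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 2.63),
]


def aggregate_by_level(rows, depth):
    """Sum percentages over the first `depth` components of each path."""
    totals = defaultdict(float)
    for path, percent in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[" → ".join(parts[:depth])] += percent
    # Round to two decimals to hide floating-point noise.
    return {key: round(value, 2) for key, value in totals.items()}


print(aggregate_by_level(ROWS, 1))
print(aggregate_by_level(ROWS, 3))
```

Feeding in all rows of the table rather than this four-row sample would give the full breakdown at the chosen depth.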




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000891 | Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soil | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300001305 | Grasslands soil microbial communities from Hopland, California, USA | Environmental
3300001532 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A20-PF 12A)- 1 week illumina | Environmental
3300001537 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A20-65 cm-11A)- 1 week illumina | Environmental
3300001686 | Grasslands soil microbial communities from Hopland, California, USA | Environmental
3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental
3300004081 | Grasslands soil microbial communities from Hopland, California, USA - 2 (version 2) | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300005093 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005331 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG | Host-Associated
3300005337 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3L metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005842 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 | Host-Associated
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006574 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAA (Metagenome Metatranscriptome) | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300010038 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106 | Environmental
3300010040 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55 | Environmental
3300010041 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot104A | Environmental
3300010042 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105B | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011996 | Permafrost microbial communities from Nunavut, Canada - A39_65cm_12M | Environmental
3300012011 | Permafrost microbial communities from Nunavut, Canada - A30_65cm_6M | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012396 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012397 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental
3300013294 | Permafrost microbial communities from Nunavut, Canada - A3_65cm_0M | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300013765 | Permafrost microbial communities from Nunavut, Canada - A30_80cm_6M | Environmental
3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300014326 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaG | Host-Associated
3300014827 | Permafrost microbial communities from Nunavut, Canada - A3_80cm_18M | Environmental
3300014969 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG | Host-Associated
3300015077 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2) | Environmental
3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019361 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2) | Environmental
3300019869 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3m2 | Environmental
3300020015 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m1 | Environmental
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental
3300022694 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coex | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300025939 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes) | Environmental
3300026530 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300028704 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_379 | Environmental
3300028715 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203 | Environmental
3300028718 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194 | Environmental
3300028755 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_356 | Environmental
3300028778 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_142 | Environmental
3300028791 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144 | Environmental
3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental
3300028814 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300028876 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140 | Environmental
3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental
3300028885 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185 | Environmental
3300031366 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 25_S | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031858 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2 | Environmental
3300031911 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-1 | Host-Associated
3300031995 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-2 | Host-Associated


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10214J12806_124489102 | 3300000891 | Soil | CEGAEAERVKELLAERMTEDGSAWTDTKIVIRARKSQS*
JGI10214J12806_130356311 | 3300000891 | Soil | WLARTGCEGDAADRVRSLLADRTENGFWADTKILIRARKKRPA*
JGI10216J12902_1105340271 | 3300000956 | Soil | KVHPLEDWLARTGCEGEEAERVKELLADRMTDDGTAWTDVKLVLKARKSQR*
C688J14111_102193622 | 3300001305 | Soil | HALEDWLARTGCEGEEADRVRKLLADRMTADGTAWVDTKLVIRARKSQS*
C688J14111_102241351 | 3300001305 | Soil | THPFDAWLARTGCEGEEAERVRALLAERTTPDGSGWTDVKVLLRARKRA*
A20PFW1_12213531 | 3300001532 | Permafrost | LARTGCEGDDAKRVRKLLADCMTDDGKAWRDTKILLRARRSQK*
A2065W1_100177452 | 3300001537 | Permafrost | ARTGCEGDEAVQVRELLADRMTDDGGAWVDTKILLKARKSQR*
C688J18823_100792431 | 3300001686 | Soil | LARTGCEGADAERVRELLADRMTDDGSAWVDTKIVIRARKSQS*
C688J18823_108527692 | 3300001686 | Soil | GCEGEEADRVRKLLADRMTADGTAWVDTKLVIRARKSQS*
C688J18823_108906572 | 3300001686 | Soil | DWLARTGCXGEEAERVRELLADRMTEDGSAWVDTKIVIRARKSQS*
C688J35102_1180573212 | 3300002568 | Soil | ELHEKVHPLEDWLARTGCEGAEAQQVRELLADRMTPDGGAWADTKIVLKARKSQR*
C688J35102_1184961552 | 3300002568 | Soil | GCEGEEGERVKELLADRMTDDGTAWTDVKLILKARKSQR*
C688J35102_1195508411 | 3300002568 | Soil | GCEGEEAARVKELLADRMTHDGTAWSDVKLVLKARKSQR*
C688J35102_1205435842 | 3300002568 | Soil | VAFDDKLHPFESWLARTGCEGDEAERVRGLLAGRTEGDAWRDTKILLRARKRR*
Ga0063454_1018155922 | 3300004081 | Soil | LFEKVHPLEDWLARTGCEGDEAIRVKELLADRMTDDRTAWSDVKLVLKARKSQR*
Ga0062593_1010159582 | 3300004114 | Soil | LARTGCEGAEAERVKELLAERMTEDGSAWTDTKIVIRARKSQS*
Ga0062593_1016653871 | 3300004114 | Soil | RTGCEGDAADRVRSLLADRTENGFWADTKILIRARKKRPA*
Ga0062595_1010635821 | 3300004479 | Soil | TGCEGAEAERVRELLADRMTADGSAWTDTKIVIRARKSQS*
Ga0062592_1025296012 | 3300004480 | Soil | PLEDWLARTGCEGEEAERVKQLLSDRMTDDGTAWTDLKILLKARKSQR*
Ga0062594_1026165472 | 3300005093 | Soil | PLEDWLARTGCEGDEAERVRELLADRLTTDGTAWVDTKIVIRARKSQS*
Ga0066679_102352663 | 3300005176 | Soil | RTGCEGEEAGRVKELLADRMTEDCTAWTDVKIVLKARKSQR*
Ga0066690_110345992 | 3300005177 | Soil | TGCEGDEAERVKELLADRLSEDRTAWSDVKLVLKARKSER*
Ga0065705_104235711 | 3300005294 | Switchgrass Rhizosphere | EAWLARTGCEGDEAERVKELLADRMTEDGGAWTDVKIVIRARKSQS*
Ga0070670_1011380842 | 3300005331 | Switchgrass Rhizosphere | FEKVHELEAWLARTGCEGDEAELVRELLADRLTTDGTAWVDTKIVIRARKSQS*
Ga0070682_1006950711 | 3300005337 | Corn Rhizosphere | PLEDWLARTGCAGTEAERVRELLADRMNDDGTAWTDTKIVIRARKSQS*
Ga0070708_1016782872 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | HPLEDWLARTGCEGEEAEQVKELLADRMTEDGTAWTDVKIVLRARKSQR*
Ga0070707_1011801392 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | DDWLARTGCEGEEAERVKKLLFDRMTDDGTAWTDVKLILKARKSRR*
Ga0070741_117354671 | 3300005529 | Surface Soil | RTGCEGEEADRVRELLADRMTEDGTAWIDTKLILRARKPQA*
Ga0070697_1017097991 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | WLARTGCEGEEAERVKELLSDRMTAEGTAWTDVKLVLKARKSRR*
Ga0066707_101029351 | 3300005556 | Soil | PLEDWLARTGCEGDEAERVRELLADRMTDDGSAWIDTKIIIRARKARS*
Ga0066705_104588842 | 3300005569 | Soil | THPLEAWLARTGCEGADAEHVRELLADRMTDDGSAWTDTKIILRARRGDA*
Ga0066706_111906171 | 3300005598 | Soil | EAERVKELLADRMTDDGVAWTDVKLVLKARKSQR*
Ga0068858_1020704121 | 3300005842 | Switchgrass Rhizosphere | EDWLARTGCEGTEAERVRELLADRMNDDGTAWTDTKIVIRARKSQS*
Ga0066652_1000751071 | 3300006046 | Soil | GCEGAHAEHVRELLADRMTDDGSAWTDTKIILRARKGDA*
Ga0074056_116273372 | 3300006574 | Soil | LARTGCEGAEAERVCELLADRLTDDGTAWVDTKLVIRARKSQS*
Ga0079222_107026252 | 3300006755 | Agricultural Soil | GEEAERVKELLADRMTDDGAAWKDVKLILKGRKPQS*
Ga0066665_107374603 | 3300006796 | Soil | WLARTGCEGEEARRVEELLADRMTDDGAAWTDVKICLKARRSQR*
Ga0066659_114374132 | 3300006797 | Soil | FEKVHPLEDWLARTGCEGQEAERVKELLSDRMTADGTAWTDVKLVLKAMKSQR*
Ga0066660_102230741 | 3300006800 | Soil | EDKEAERVKELLADRMTEDRTAWTDVKIVLKARRSQR*
Ga0075434_1007088011 | 3300006871 | Populus Rhizosphere | EDARRVKELLAPVLVGDGDAWRDVKIVIKARKSQR*
Ga0075424_1004359373 | 3300006904 | Populus Rhizosphere | EKTHPLEAWLARTGCEGADAERVKELLADRMTEDGRAWTDTKIVIRARRSQS*
Ga0079218_138649631 | 3300007004 | Agricultural Soil | GLDVEETALFEKHHPVEAWLARTGCEGAEAERVRELLADHIVDGRYVDRKILFRARKGA*
Ga0111539_108274942 | 3300009094 | Populus Rhizosphere | EEAERVKDLLADRMTDDGAAWKDVKLILKARKPPS*
Ga0126315_102361801 | 3300010038 | Serpentine Soil | DAERVKELLAPVTTPDGKAWLDVKILLKARKSQG*
Ga0126315_109926572 | 3300010038 | Serpentine Soil | VDDWLARAGCDGDEAERVRELLAERILDGEYVDMKILIRARKR*
Ga0126308_101734393 | 3300010040 | Serpentine Soil | FEREHPLEDWLARTGCEGEEAERVRELLADRMTADGTVWVDTKLVIRARKSQT*
Ga0126312_109781891 | 3300010041 | Serpentine Soil | ARTGCEGDEADRVRELLADRTDGDEYVDTKILLRARKGASS*
Ga0126314_107542281 | 3300010042 | Serpentine Soil | GCEGAEAERVEELLAPVATPDGRAWLDVKVLLRARKSQT*
Ga0126380_105685601 | 3300010043 | Tropical Forest Soil | PFDAWLARTGCEGEEAERVRALLSARTTPDGSAWTDVKVLLRARKGT*
Ga0134082_103077021 | 3300010303 | Grasslands Soil | RTGCDGEDAERVRELLADRMTDDGSAWTDTKIILRARRGDA*
Ga0134111_102494032 | 3300010329 | Grasslands Soil | VHPLEDWLVRTGCEGEEAERVKELLADRMTDDGVAWTDVKLVLKARKSQR*
Ga0134062_101850392 | 3300010337 | Grasslands Soil | LRRTGCEGDEAERVKGLLADRMTDDGAAWTDVKLVLKARKSQR*
Ga0134123_105261092 | 3300010403 | Terrestrial Soil | WLGRTGCEGEEAERVKELLSDRMTDDGTAWTDVKLILKTRKSQS*
Ga0120156_10169083 | 3300011996 | Permafrost | EDWLARTGCEGDDAKRVRKLLADRMTSDGQAWKDTKILLRARKSQK*
Ga0120152_10813752 | 3300012011 | Permafrost | YFEKTHPLEDWLARTGCEGDDAKRVRKLLADRMTADGKAWRDTKILLRARRSQK*
Ga0137372_108609601 | 3300012350 | Vadose Zone Soil | RTGCVGEEAERVKELLAPLLTADDKAWTDVKIVLKARKSQK*
Ga0137369_105499822 | 3300012355 | Vadose Zone Soil | HPLEDWLARTGCEGEEAELVKELLVERMTDDGTAWTDVKLIVKARKSQS*
Ga0134057_10817832 | 3300012396 | Grasslands Soil | FPGVHPLEDWLARTGCEGEEAERVRELLADRMTGDGTAWTDVKIVLKARKSQR*
Ga0134056_13104232 | 3300012397 | Grasslands Soil | LEDWLARTGCEGEEAERVKELLADRMTDDGTAWTDVKLILKARKSRR*
Ga0150984_1171479963 | 3300012469 | Avena Fatua Rhizosphere | FEKEHPLEDWLARTGCEGEEAERVRELLADRMTEDGSAWGDTKIVIRARKSQS*
Ga0164303_109222601 | 3300012957 | Soil | EDWLARTGCEGDEAARVKELLADRMTDDGSAWTDVKLILKARKSQR*
Ga0164302_114447232 | 3300012961 | Soil | ARTGCEGEDAERVRELLADRLTDDGSAWVDTKIVIRARKWQR*
Ga0164302_116262872 | 3300012961 | Soil | VHPLEDWLARTGCEGEEAERVKELLSDRMTDDGTAWTDVKILLKARKSQP*
Ga0134087_100718593 | 3300012977 | Grasslands Soil | DAWLDRTGCEGADAEHVRELLADRMTEDGSAWTDTKIILRARKGDA*
Ga0164309_114345512 | 3300012984 | Soil | VHKLDDWLARTGCEGDEAARVTELLAPQLLDDGSAWRDTKILLKCRKSQS*
Ga0164308_1178775213300012985SoilEWLARTGCVGEEAERVTQLLAPLLVDGGKAWQDTKILLRVRKAGA*
Ga0164305_1156084913300012989SoilRTGCEGAEAERVRELLADRLTDDGTAWVDTKLVIRARKSQS*
Ga0120150_109783113300013294PermafrostRTGCEGDDAKRVRKLLADRMTSDGQAWKDTKILLRARKSQK*
Ga0157378_1019430533300013297Miscanthus RhizosphereFEKVHPLEDWLGRTGCEGEEAERVKELLSDRMTDDDTAWTDVKLILKTRKSQS*
Ga0120172_110119723300013765PermafrostFEKTHPLEDWLARTGCEGDDAKRVRKLLADRMTSDGQAWKDTKILLRARKSQK*
Ga0120158_1014041013300013772PermafrostDEAERVKELLADRMTEDGSAWTDTKIVIRARKSQS*
Ga0134078_1039992523300014157Grasslands SoilTGCEGEEAERVKALLSDRMTDDGTAWTDVKLILKARKSRR*
Ga0134079_1010990323300014166Grasslands SoilARTGCEGEEAERVKELLADRMTDDGVAWTDVKLVLKARKSQR*
Ga0157380_1306895223300014326Switchgrass RhizospherePLEDWLARTGCEGDEAERVKELLSDRMTDDGAAWTDVKLILKTRKSQS*
Ga0120171_115663323300014827PermafrostPLEDWLARTGCEGDDAKRVRKLLADRMTADGKAWRDTKILLRARRSQK*
Ga0157376_1198597113300014969Miscanthus RhizospherePLEDWLARTGCEGTESERVRGLLADRMNDDGTAWTDTKIVIRARKSQS*
Ga0173483_1033236223300015077SoilEIEEVELFDKRHQMNDWLARTGCEGDEADRVRSLLADRTENGFWADTKILIRARKKRPA*
Ga0173483_1080233813300015077SoilLARTGCEGDEAERVRALLASRTEGDAWRDTKILLRARKR*
Ga0134073_1022030323300015356Grasslands SoilCCEKEHPLEDWLARTGCEGDEAERVRELLADRMTADGGAWVDTKIVIRARKSQS*
Ga0066667_1233518623300018433Grasslands SoilPFEKVHPLEDWLARTACEGEEADRVKELLADRMTDDGTAWTDVKIVLKARKSRR
Ga0173482_1032878223300019361SoilHPLEDWLARTGCEGEEAERVRELLADRMTDDGTAWVDTKLVIRARKSQS
Ga0193705_102949813300019869SoilWLARTGCEGEKAERVKELLADRMTEDGSAWTDVKIVIRARKSQS
Ga0193734_102523223300020015SoilTHPLEAWLARTGCEGEEAERVKALLADRMTEDGNAWTDVKIVIRARKSQN
Ga0193719_1041776723300021344SoilRTGCEGEEAERVKELLSDRMTDDGTAWTDVKVILKARKRQS
Ga0222623_1006878113300022694Groundwater SedimentQVECFEKTHPLEDWLARTGCEGAEAQRVRELLADRMTDDGSAWTDTKIVIRARKSQS
Ga0222622_1054292423300022756Groundwater SedimentLEEWFARTGCEGEEAERVKELLSDRMTDDGTAWTDVKVILKARKRQS
Ga0207665_1169316723300025939Corn, Switchgrass And Miscanthus RhizosphereRTGCEGEEAGRVRELLSDRMTNDGTAWTDVKILLKARKSHR
Ga0209807_102263433300026530SoilVPGARGHEGEEAERVKELLADRMTDDGVAWTDVKLVLKARKSQR
Ga0209807_121332613300026530SoilTHPLEAWLARTGCEGADAEHVRELLADRMTDDGSAWTDTKIILRARRGDA
Ga0209161_1041826113300026548SoilEEAERVKELLADRMTDDGVAWTDVKLVLKARKSQR
Ga0209074_1005454923300027787Agricultural SoilQRQLFEKVHPLEDWLARTGCEGEEAERVKELLSDRMTEDGTAWTDVKLVLKARKSQS
Ga0268264_1167867623300028381Switchgrass RhizosphereWLARTGCEGDEAERVKELLADRMTEDGSAWTDTKIVIRARKSES
Ga0307321_101499633300028704SoilWLARTGCEGAEAQRVRELLADRMTDDGSAWTDTKIVIRARKSQS
Ga0307313_1003943113300028715SoilEKVHPLEDWLARTGCEGEEAERVKALLFDRMTDDGTAWTDVKVILKARKRQS
Ga0307307_1000628643300028718SoilARTGCEGEEAERVRELLADRLTDDGTAWVDTKLVIRARKSQS
Ga0307316_1005544313300028755SoilEGEEAERVKELLADRMIEDGSAWTDVKIVIRARKSQS
Ga0307288_1014068423300028778SoilWLARTGCEGEEAERVKELLADRMTGDGTAWTDTKIIIRARKARS
Ga0307288_1020802413300028778SoilEDWLARTGCEGEEAERVRELLADRVTDDGTAWVDTKLVIRARKSQS
Ga0307290_1006203033300028791SoilDFFEKTHPLEDWLARTGCEGDDAKRVRKLLADRMTSDGQAWTDMKILLRARRSQR
Ga0307284_1022269113300028799SoilFEKEHPLEDWLARTGCEGEEAERVRELLADRLTDDGTAWVDTKLVIRARKSQS
Ga0307284_1024480723300028799SoilFARTGCEGEEAERVKELLSDRMTDDGTAWTDVKVILKARKRQS
Ga0307302_1001806113300028814SoilHPLEDWLARTGCEGEEAERVRELLADRMTSDGTAWTDTKILLRARKSPS
Ga0307312_1076033513300028828SoilTGCEGDDAKRVRKLLADRMTDDGQAWTDTKILVRARRSQR
Ga0307286_1022884923300028876SoilHLLEDWLARTGCEGAEAQRVRELLADRMTDDGSAWTDTKIVIRARKSQS
Ga0307308_1063065613300028884SoilGCVGAEAERVRELLADRMTEDGAAWSDLKIVLKTRKSQR
Ga0307308_1065783513300028884SoilPLEDWLARTGCEGEEAERVRELLADRLTDDGTAWVDTKLVIRARKSQS
Ga0307304_1029321523300028885SoilDWLARTGCEGDDAKRVRKLLADRMTSDGEAWTDTKILLRARRSQK
Ga0307304_1032450123300028885SoilQEWLARTGCVGDEAERVRELLADRMTPDGSAWTDAKVVLKARKSQT
Ga0307304_1050357223300028885SoilESFEKTHALEDWLARTGCEGEEAERVRELLADRMTPDRAAWTDKKIVIRARKSPS
Ga0307506_1042060823300031366SoilEKTHPFDAWLARTGCEGVEADRVRELLAPLTSADGDSWTDTKILLRARKAAG
Ga0307473_1074800923300031820Hardwood Forest SoilADAARVRELLADRTTDDGSVWIDTKIIIRARKARS
Ga0310892_1022406223300031858SoilRRLFEKVHPLEDWLARTGCEGEEAERAKELLADRMTDDGAAWKDVKLILKARKPQS
Ga0307412_1063785723300031911RhizosphereGCAGETAERVKALLADRMTPHRAAWTDVKLVLKGRKSQG
Ga0307409_10163166723300031995RhizosphereWLARTGCAGETAERVKALLADRMTPDRAAWTDVKLVLKGRKSQG


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice