NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F078981

Metagenome / Metatranscriptome Family F078981

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F078981
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 99 residues
Representative Sequence MSHLAQNEIQSDQRSGRPESTASHRRRPLRVLFVHRDADAIESCLQELKKAQFTVSADFVLTLAQCTEHLRFQSYDVVVAEYPNPNWKGSQALQLL
Number of Associated Samples 97
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 64.04 %
% of genes near scaffold ends (potentially truncated) 96.55 %
% of genes from short scaffolds (< 2000 bps) 79.31 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score0.57

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (97.414 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(20.690 % of family members)
Environment Ontology (ENVO) Unclassified
(32.759 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(63.793 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 18.55%    β-sheet: 12.90%    Coil/Unstructured: 68.55%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.57
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 116 Family Scaffolds
PF13502AsmA_2 6.03
PF00072Response_reg 5.17
PF13450NAD_binding_8 3.45
PF13545HTH_Crp_2 2.59
PF11999Ice_binding 2.59
PF01243Putative_PNPOx 1.72
PF05973Gp49 1.72
PF13407Peripla_BP_4 0.86
PF13538UvrD_C_2 0.86
PF12019GspH 0.86
PF03683UPF0175 0.86
PF00882Zn_dep_PLPC 0.86
PF07730HisKA_3 0.86
PF00202Aminotran_3 0.86
PF00487FA_desaturase 0.86
PF13520AA_permease_2 0.86
PF02321OEP 0.86
PF01261AP_endonuc_2 0.86
PF09861Lar_N 0.86
PF01850PIN 0.86
PF00005ABC_tran 0.86
PF00486Trans_reg_C 0.86
PF05170AsmA 0.86
PF01569PAP2 0.86
PF00196GerE 0.86
PF05532CsbD 0.86

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 116 Family Scaffolds
COG1538Outer membrane protein TolCCell wall/membrane/envelope biogenesis [M] 1.72
COG3657Putative component of the toxin-antitoxin plasmid stabilization moduleDefense mechanisms [V] 1.72
COG4679Phage-related protein gp49, toxin component of the Tad-Ata toxin-antitoxin systemDefense mechanisms [V] 1.72
COG1398Fatty-acid desaturaseLipid transport and metabolism [I] 0.86
COG2886Predicted antitoxin, contains HTH domainGeneral function prediction only [R] 0.86
COG2982Uncharacterized conserved protein AsmA involved in outer membrane biogenesisCell wall/membrane/envelope biogenesis [M] 0.86
COG3237Uncharacterized conserved protein YjbJ, UPF0337 familyFunction unknown [S] 0.86
COG3239Fatty acid desaturaseLipid transport and metabolism [I] 0.86
COG3850Signal transduction histidine kinase NarQ, nitrate/nitrite-specificSignal transduction mechanisms [T] 0.86
COG3851Signal transduction histidine kinase UhpB, glucose-6-phosphate specificSignal transduction mechanisms [T] 0.86
COG4564Signal transduction histidine kinaseSignal transduction mechanisms [T] 0.86
COG4585Signal transduction histidine kinase ComPSignal transduction mechanisms [T] 0.86


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms97.41 %
UnclassifiedrootN/A2.59 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002671|Ga0005481J37269_104656All Organisms → cellular organisms → Bacteria1205Open in IMG/M
3300002911|JGI25390J43892_10148199All Organisms → cellular organisms → Bacteria → Acidobacteria546Open in IMG/M
3300004091|Ga0062387_101522412All Organisms → cellular organisms → Bacteria → Acidobacteria537Open in IMG/M
3300005167|Ga0066672_10383956All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300005172|Ga0066683_10017753All Organisms → cellular organisms → Bacteria3915Open in IMG/M
3300005172|Ga0066683_10739541All Organisms → cellular organisms → Bacteria → Acidobacteria578Open in IMG/M
3300005177|Ga0066690_10915394All Organisms → cellular organisms → Bacteria → Acidobacteria559Open in IMG/M
3300005181|Ga0066678_10538339All Organisms → cellular organisms → Bacteria → Acidobacteria776Open in IMG/M
3300005186|Ga0066676_10242519All Organisms → cellular organisms → Bacteria1174Open in IMG/M
3300005187|Ga0066675_11262906All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → Pseudomonas stutzeri group → Pseudomonas stutzeri subgroup → Pseudomonas stutzeri546Open in IMG/M
3300005451|Ga0066681_10024039All Organisms → cellular organisms → Bacteria3155Open in IMG/M
3300005541|Ga0070733_10096135All Organisms → cellular organisms → Bacteria → Acidobacteria1887Open in IMG/M
3300005559|Ga0066700_10526281All Organisms → cellular organisms → Bacteria → Acidobacteria824Open in IMG/M
3300005560|Ga0066670_10616688All Organisms → cellular organisms → Bacteria → Acidobacteria660Open in IMG/M
3300005560|Ga0066670_10621379All Organisms → cellular organisms → Bacteria → Proteobacteria657Open in IMG/M
3300005566|Ga0066693_10223305All Organisms → cellular organisms → Bacteria → Acidobacteria740Open in IMG/M
3300005568|Ga0066703_10298599All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium974Open in IMG/M
3300005574|Ga0066694_10107786All Organisms → cellular organisms → Bacteria1306Open in IMG/M
3300005586|Ga0066691_10304492All Organisms → cellular organisms → Bacteria → Acidobacteria940Open in IMG/M
3300005586|Ga0066691_10522362All Organisms → cellular organisms → Bacteria708Open in IMG/M
3300005591|Ga0070761_10071639All Organisms → cellular organisms → Bacteria1967Open in IMG/M
3300005591|Ga0070761_10845124All Organisms → cellular organisms → Bacteria → Acidobacteria577Open in IMG/M
3300006050|Ga0075028_100364161All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Synechococcales → Merismopediaceae → Synechocystis → unclassified Synechocystis → Synechocystis sp. PCC 7509819Open in IMG/M
3300006176|Ga0070765_100121682All Organisms → cellular organisms → Bacteria2290Open in IMG/M
3300006796|Ga0066665_10667654All Organisms → cellular organisms → Bacteria → Acidobacteria825Open in IMG/M
3300006797|Ga0066659_10716647All Organisms → cellular organisms → Bacteria → Acidobacteria819Open in IMG/M
3300006954|Ga0079219_10238673All Organisms → cellular organisms → Bacteria1069Open in IMG/M
3300006954|Ga0079219_10424177All Organisms → cellular organisms → Bacteria → Acidobacteria894Open in IMG/M
3300007076|Ga0075435_100806780All Organisms → cellular organisms → Bacteria → Acidobacteria817Open in IMG/M
3300007788|Ga0099795_10662292All Organisms → cellular organisms → Bacteria → Acidobacteria500Open in IMG/M
3300009089|Ga0099828_11017037All Organisms → cellular organisms → Bacteria → Acidobacteria738Open in IMG/M
3300009137|Ga0066709_101772008All Organisms → cellular organisms → Bacteria → Acidobacteria869Open in IMG/M
3300009137|Ga0066709_101893224All Organisms → cellular organisms → Bacteria → Acidobacteria831Open in IMG/M
3300009634|Ga0116124_1022513All Organisms → cellular organisms → Bacteria → Acidobacteria2017Open in IMG/M
3300010303|Ga0134082_10039695All Organisms → cellular organisms → Bacteria → Proteobacteria1791Open in IMG/M
3300010343|Ga0074044_10448072All Organisms → cellular organisms → Bacteria → Acidobacteria844Open in IMG/M
3300011271|Ga0137393_10268235All Organisms → cellular organisms → Bacteria → Acidobacteria1451Open in IMG/M
3300012202|Ga0137363_10004259All Organisms → cellular organisms → Bacteria → Acidobacteria8759Open in IMG/M
3300012203|Ga0137399_10234180All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1502Open in IMG/M
3300012203|Ga0137399_10546873All Organisms → cellular organisms → Bacteria → Acidobacteria972Open in IMG/M
3300012205|Ga0137362_10021899All Organisms → cellular organisms → Bacteria4943Open in IMG/M
3300012285|Ga0137370_10272804All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1005Open in IMG/M
3300012362|Ga0137361_10382100All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1293Open in IMG/M
3300012924|Ga0137413_11385519All Organisms → cellular organisms → Bacteria → Acidobacteria567Open in IMG/M
3300012925|Ga0137419_10233893All Organisms → cellular organisms → Bacteria → Acidobacteria1377Open in IMG/M
3300012930|Ga0137407_11471434All Organisms → cellular organisms → Bacteria → Acidobacteria648Open in IMG/M
3300012977|Ga0134087_10173419All Organisms → cellular organisms → Bacteria → Acidobacteria952Open in IMG/M
3300015241|Ga0137418_10918413All Organisms → cellular organisms → Bacteria → Acidobacteria642Open in IMG/M
3300015242|Ga0137412_11307460All Organisms → cellular organisms → Bacteria → Acidobacteria505Open in IMG/M
3300015356|Ga0134073_10232556All Organisms → cellular organisms → Bacteria → Acidobacteria628Open in IMG/M
3300017822|Ga0187802_10397548All Organisms → cellular organisms → Bacteria → Acidobacteria545Open in IMG/M
3300017924|Ga0187820_1012608All Organisms → cellular organisms → Bacteria → Acidobacteria2050Open in IMG/M
3300017943|Ga0187819_10122097All Organisms → cellular organisms → Bacteria → Acidobacteria1555Open in IMG/M
3300020579|Ga0210407_11025007Not Available628Open in IMG/M
3300020580|Ga0210403_10120643All Organisms → cellular organisms → Bacteria → Acidobacteria2133Open in IMG/M
3300021046|Ga0215015_10529506All Organisms → cellular organisms → Bacteria → Acidobacteria585Open in IMG/M
3300021086|Ga0179596_10639665All Organisms → cellular organisms → Bacteria → Acidobacteria539Open in IMG/M
3300021171|Ga0210405_10031596All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4240Open in IMG/M
3300021171|Ga0210405_10083595All Organisms → cellular organisms → Bacteria2516Open in IMG/M
3300021402|Ga0210385_11406927All Organisms → cellular organisms → Bacteria → Acidobacteria533Open in IMG/M
3300021405|Ga0210387_10835370All Organisms → cellular organisms → Bacteria → Acidobacteria813Open in IMG/M
3300021420|Ga0210394_10283916All Organisms → cellular organisms → Bacteria → Acidobacteria1451Open in IMG/M
3300021432|Ga0210384_10389712All Organisms → cellular organisms → Bacteria → Acidobacteria1255Open in IMG/M
3300021432|Ga0210384_11625535All Organisms → cellular organisms → Bacteria → Acidobacteria551Open in IMG/M
3300021475|Ga0210392_11122772All Organisms → cellular organisms → Bacteria → Acidobacteria589Open in IMG/M
3300021477|Ga0210398_10085701All Organisms → cellular organisms → Bacteria → Acidobacteria2559Open in IMG/M
3300021479|Ga0210410_10784607All Organisms → cellular organisms → Bacteria → Acidobacteria837Open in IMG/M
3300021559|Ga0210409_10161580All Organisms → cellular organisms → Bacteria → Acidobacteria2050Open in IMG/M
3300022523|Ga0242663_1115255All Organisms → cellular organisms → Bacteria → Acidobacteria549Open in IMG/M
3300024178|Ga0247694_1001513All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia3966Open in IMG/M
3300024279|Ga0247692_1047213All Organisms → cellular organisms → Bacteria → Acidobacteria666Open in IMG/M
3300024290|Ga0247667_1007246All Organisms → cellular organisms → Bacteria → Acidobacteria2309Open in IMG/M
3300025905|Ga0207685_10668788All Organisms → cellular organisms → Bacteria → Acidobacteria563Open in IMG/M
3300025912|Ga0207707_11194257All Organisms → cellular organisms → Bacteria → Acidobacteria616Open in IMG/M
3300025916|Ga0207663_10355596All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1110Open in IMG/M
3300026305|Ga0209688_1112611All Organisms → cellular organisms → Bacteria → Acidobacteria514Open in IMG/M
3300026310|Ga0209239_1180822All Organisms → cellular organisms → Bacteria → Acidobacteria794Open in IMG/M
3300026317|Ga0209154_1184817All Organisms → cellular organisms → Bacteria → Acidobacteria825Open in IMG/M
3300026323|Ga0209472_1015884All Organisms → cellular organisms → Bacteria → Acidobacteria3674Open in IMG/M
3300026482|Ga0257172_1010999All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1491Open in IMG/M
3300026529|Ga0209806_1005407All Organisms → cellular organisms → Bacteria7250Open in IMG/M
3300026529|Ga0209806_1178177All Organisms → cellular organisms → Bacteria → Acidobacteria776Open in IMG/M
3300026551|Ga0209648_10236207All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1365Open in IMG/M
3300027651|Ga0209217_1005618All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4179Open in IMG/M
3300027655|Ga0209388_1033880All Organisms → cellular organisms → Bacteria → Acidobacteria1466Open in IMG/M
3300027671|Ga0209588_1185113All Organisms → cellular organisms → Bacteria → Acidobacteria653Open in IMG/M
3300027674|Ga0209118_1044642All Organisms → cellular organisms → Bacteria → Acidobacteria1324Open in IMG/M
3300027684|Ga0209626_1199273All Organisms → cellular organisms → Bacteria → Acidobacteria531Open in IMG/M
3300027748|Ga0209689_1024081All Organisms → cellular organisms → Bacteria → Acidobacteria3712Open in IMG/M
3300027765|Ga0209073_10182045All Organisms → cellular organisms → Bacteria → Acidobacteria791Open in IMG/M
3300027768|Ga0209772_10001427All Organisms → cellular organisms → Bacteria → Acidobacteria6021Open in IMG/M
3300027787|Ga0209074_10338820All Organisms → cellular organisms → Bacteria → Acidobacteria613Open in IMG/M
3300027853|Ga0209274_10051132All Organisms → cellular organisms → Bacteria → Acidobacteria1962Open in IMG/M
3300027853|Ga0209274_10072168All Organisms → cellular organisms → Bacteria1670Open in IMG/M
3300027853|Ga0209274_10721449All Organisms → cellular organisms → Bacteria → Acidobacteria514Open in IMG/M
3300027884|Ga0209275_10188479All Organisms → cellular organisms → Bacteria → Acidobacteria1110Open in IMG/M
3300027911|Ga0209698_10920855All Organisms → cellular organisms → Bacteria → Acidobacteria655Open in IMG/M
3300028536|Ga0137415_10327607All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1335Open in IMG/M
3300028906|Ga0308309_10004917All Organisms → cellular organisms → Bacteria8050Open in IMG/M
3300029636|Ga0222749_10051000All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatobacter → unclassified Candidatus Sulfotelmatobacter → Candidatus Sulfotelmatobacter sp. SbA71827Open in IMG/M
3300031057|Ga0170834_109541357All Organisms → cellular organisms → Bacteria → Acidobacteria625Open in IMG/M
3300031720|Ga0307469_11100685All Organisms → cellular organisms → Bacteria → Acidobacteria746Open in IMG/M
3300031753|Ga0307477_10640333All Organisms → cellular organisms → Bacteria → Acidobacteria714Open in IMG/M
3300031753|Ga0307477_10701251All Organisms → cellular organisms → Bacteria → Acidobacteria677Open in IMG/M
3300031753|Ga0307477_10799695All Organisms → cellular organisms → Bacteria → Acidobacteria626Open in IMG/M
3300031754|Ga0307475_10099592All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2266Open in IMG/M
3300031754|Ga0307475_10220590All Organisms → cellular organisms → Bacteria → Acidobacteria1517Open in IMG/M
3300031754|Ga0307475_10308866All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1270Open in IMG/M
3300031754|Ga0307475_11426921All Organisms → cellular organisms → Bacteria → Acidobacteria533Open in IMG/M
3300031962|Ga0307479_11662703All Organisms → cellular organisms → Bacteria → Acidobacteria593Open in IMG/M
3300032180|Ga0307471_101702117All Organisms → cellular organisms → Bacteria → Acidobacteria784Open in IMG/M
3300032180|Ga0307471_102119042All Organisms → cellular organisms → Bacteria → Acidobacteria707Open in IMG/M
3300032205|Ga0307472_100005477All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae5858Open in IMG/M
3300032205|Ga0307472_101731800All Organisms → cellular organisms → Bacteria → Acidobacteria618Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil20.69%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil17.24%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.52%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil11.21%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil6.90%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.31%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil4.31%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil3.45%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment2.59%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.59%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.72%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.72%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.72%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland0.86%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.86%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.86%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.86%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil0.86%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.86%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.86%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002671Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF130 (Metagenome Metatranscriptome, Counting Only)EnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009634Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_13_150EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017822Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2EnvironmentalOpen in IMG/M
3300017924Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_5EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021402Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022523Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024178Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35EnvironmentalOpen in IMG/M
3300024182Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10EnvironmentalOpen in IMG/M
3300024279Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK33EnvironmentalOpen in IMG/M
3300024290Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08EnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026305Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027684Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027768Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM1 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027853Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027911Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0005481J37269_10465623300002671Forest SoilMAMPRPSTSEVLSDRRGVRSEGAASHRRRSLRVLFVHRDADAVDCCVQEMEKAQFIVNADVVLNLAQCTESLHSQTFDVVVAEYPSPSWKGSQSLKRLQQT
JGI25390J43892_1014819913300002911Grasslands SoilMSHLAQNEIQSDQRSGRPESTASHRRRPLRVLFVHRDADAIESCLQELKKAQFTVSADFVLTLAQCTEHLRFQSYDVVVAE
Ga0062387_10152241213300004091Bog Forest SoilMARLLQNELRSGMSSFRSENVPSHRRRPLRVLFIHRDADVVDSCLEELKKAQFTVSADFVLTLAQCLKQLRSQTYEVVIAEYPSPSWKG
Ga0066672_1038395613300005167SoilMSQPAQNEIQSDQRSGRPESTASHRPRPLHVLFVHHDRDAVERCLQELKKAQFTVSAGFVLTLAQCTEQLRFQRYDVVVAEYPSPNWKGSQALQLL
Ga0066683_1001775313300005172SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHRDADTVESCLQELKKAQFMVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKGSQGLQSLRQTVEEVPLLFVTTSK
Ga0066683_1073954123300005172SoilMSSLTQNNVQADQIGFRPEGTEGKSPPLRILFIHRDADTVESCLQELKKARFTVSADIALNLAQCTEQLRSESYDVVVAEYPSPSWKGSQGLQSLRQTVEEVPLLFVTTSK
Ga0066690_1091539413300005177SoilMSSLTQNNVQADQIGFRRESTEGKSPPLRILFIHRDANTVESCLQELKKARFTVSADIALNLAQCTEQLRSESYDVVVAEYPSPSWKGSQGLQSLRQTV
Ga0066678_1053833913300005181SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHRDADTVESCLQELKKAQFMVSADIALNLAQCAEQLRSQSYDVVVAEYPSP
Ga0066676_1024251933300005186SoilMSSLTQNKIPSDPRDSRPGSTDSHRSRPLHILFVHRQADTVQCCLQELKKAQFVVSADTALDLAQCTKQLRSQSYDVVVAE
Ga0066675_1126290613300005187SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHRDADTVESCLQELKKAQFMVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKGSQGLQSLRQTVE
Ga0066681_1002403913300005451SoilMSQLAQNEIQSDQRTSRPESTASHRRRPLRVLFVHHDANVMESCLQELKKAQFTVSADSVLTLAQCTEQLRLQTYDVVVAEYPSPN
Ga0070733_1009613523300005541Surface SoilMSNLAQNELQSDQRRGQPEGSASHRCRPLRVLFVHRDADTIDSCLEELKKGQYSVSADFVLNPAQCVERLRSQAYEVVIAEYPKPSWQGSQALQLVHEEGREIPLLFLTSSMRNESIARL
Ga0066700_1052628113300005559SoilMSSLTQNNVQSDQRGFRPESTEGKSPPLRILFIHRDADTVESCLQELKKAQFTVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKGSQGLQSLRQTV
Ga0066670_1061668823300005560SoilMSSLIQSKVPSDPRGSRLGSTDSHRSRPLRILFFHREADTVECCLQELKKAQFVVSADTALDLAQCTEQLRSQSYDVVVAEYPSPSWKGSQGLQFLRQTVEDTPLLFVTTSKG
Ga0066670_1062137913300005560SoilMSHLAQNEIQSDQRSGRPESTASHRRRPLRVLFVHRDADAIESCLQELKKAQFTVSADFVLTLAQCTEHLRFQSYDVVVAEYPNPNWKGSQALQLL
Ga0066693_1022330523300005566SoilMSSLAQNNVQSDKSGFRPESTDSLRFRPLRILFIHRDAVTVESCLQELKKAQFAVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKGAQGLQSLRQTVEEVPLLFVTTSKG
Ga0066703_1029859923300005568SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHRDADTVESCLQELKKAQFMVSADIALNLAQCAEQLRSQSYDVVVAEYPS
Ga0066694_1010778613300005574SoilMSSLTQNKIPSDPRDSRPGSTDSHRSRPLHILFVHREADTVECCLQELKKAQFVVSADTALDLAQCTEQLRSQSYDVVVA
Ga0066691_1030449223300005586SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHRDADTVESCLQELKKAQFMVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKGSQGLQSLRQTV
Ga0066691_1052236223300005586SoilMSSLTQNNVQSDQKGFRPESTEGKSRPLRILFIHRDADTVESCLQELKKAQFAVSADIALNLAQCTEQLRSQSYDVVVAEYPSPSWKGSQGLQSLRQTVE
Ga0070761_1007163943300005591SoilMCASMSNLSQNHLRSDLASFWSEGKASRRRRPLRVLFVHRDAEVVDNCLEELKKAQFTVSADFVLNLAQCGERLRSQSYDVVVAEYP
Ga0070761_1084512413300005591SoilMSNLAQNKLRPDLASFQLGSSTSHRRRPLRILFVHRDAEVVENCLDALKKSQFTVSADIVLNLSQCAEQFHSQSYEVVVAEYPCPSSKGCQALRLLQKKFQEIPLLFVASASGSESIVQI
Ga0075028_10036416113300006050WatershedsMAERATWFAPGVGTMARLAQNEFRSGLSSFRSESTPSHRRRHLRVLLIHRDAEVIDGCLEELKKAQFIVSADFVLTLAQCREQLRSQTYDVVIAEYPSSSWKGPQALELLHQTVQEIPLL
Ga0070765_10012168223300006176SoilMTCVSMSNLTQNDLRSDLASFWSESKASHRRRLLRVLFVHRDAEVVDNCLEELKKAQFMVSADFVLNLAQCGERLRSQSYDVVIAEYPCPSLKGSEALQVLH
Ga0066665_1066765423300006796SoilMSSLTQNNVQSDQRGFRPESTEGKSPPLRILFIHRDANTVESCLQELKKARFTVSADIALNLAQCTEQLRSESYDVVVAEYPSPSWKGSQGLQSLRQT
Ga0066659_1071664713300006797SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHRDADTVESCLQELKKAQFMVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKG
Ga0079219_1023867313300006954Agricultural SoilMASMSHLAQNEIQSDQRSSRPESTASHRRRPLRVLFVHRDADAIESCLQELKNAQFTVGADSVLTLAQCTEQLRLQSYDVVVAEYPSPNW
Ga0079219_1042417723300006954Agricultural SoilMHVVFMDGVGGSRESVSMSSLTQNNVQSDQRGFRPGSTDSRRCPPLHMLFIHRDADTVECCLQELKKAQFTVSADIALNLAQCTEQLRSQSYDVVVAEYPSPSWKGAQGLQFLRQTVEEIPLLFVTTSKGKESIAE
Ga0075435_10080678013300007076Populus RhizosphereMSSVTQSKMQSGPVGFRPGSTDSRRFPPLHMLFIHRDADTVECCLQELKKAQFTVSADIALNLTQCTEQLRSQSYDVVVAEYPSPSWNGAQGLQF
Ga0099795_1066229213300007788Vadose Zone SoilMSNLAQNEFQSARRSVRPGSAASHRRRPLNVLFIHRDADVVESCLEELKKARFTVSADLVLTLAQCTQQLRSQTYDVVVAEYPSPNWKGSQALQLLHQTVQEIPLLFVTTAM
Ga0099828_1101703723300009089Vadose Zone SoilMDGLGVLGMASVSKLLRSEIQSERGNIRPESAGSHRRRPIRVLFIHRDADGVDSCVQELEKAQFTVAADVVLTLAQCGEQLRFQSYDVVVAEYPSPSWKRPQALQLLQQTLQEIPLVFLTTAMGSKPI
Ga0066709_10177200823300009137Grasslands SoilMSSLTQNNVQADQIGFRPEGTEGKSPPLRILFIHRDADTVESCLQELKKARFTVSADIALNLAQCTEQLRSESYDVVVAEYPSPSWKGSQGLQSLRQTVEEVPLLFVTTSKG
Ga0066709_10189322413300009137Grasslands SoilMSSLTQNNVQSDQKGFRPESTDPHRFRPLRILFIHRDADTVESCLQELKKAQFAVSVDIALNLAQCAEQLRSQSYDVVVAEYPSP
Ga0116124_102251323300009634PeatlandMASIAELGQNEIQSDRRNVGLESTASHRRRPLRVLFIHRDAEVVDNCLQELEKARFIVSADVVLTLAQCTEQLRSHSYDVVVAEYPSPSWKRSQALQLLHQTVQEIPLL
Ga0134082_1003969553300010303Grasslands SoilMSHLAQNEIQSDQRSGRPESTASHRRRPLRVLFVHRDADAIESCLQELKKAQFTVSADFVLTLAQCTEHLRFQSYDVVVAEYPNPNWKGSQALQLLHQAVQE
Ga0074044_1044807223300010343Bog Forest SoilMSNLAQSVIQSDLANPRGESKPSHRRRPLRVLFVHRDAEVVDDCLEELKKAQFTVSADFVLNLGQCGERLRSQSYDVVVAEYPCP
Ga0150983_1118842113300011120Forest SoilMARLAQNEIQSDLRSFRPQSAAAHRRRPLSVLFVHRDADAVDNCLEELKKAQFIVNSDLVLTLAQCTQQLRSQSYDVVVA
Ga0137393_1026823513300011271Vadose Zone SoilMALMSHLAQNEIQSDQRSGRPDSTASHRRRPLRVLFVHRDADAVENCLQELKKAQFTVSADFVLTMAQCTKQLRSQPYDVVVAEYPSPNWKGAQALQLLHQTVQEIP
Ga0137363_10004259113300012202Vadose Zone SoilMSSLTQSKFQSDQRGFRPKSTDLYRSCPVHILFIHRDADTVECCLQELKKARFVVSADTALSLAQCTEQLRSQSYDVVVAEYPSPSWKGSQGLQFL
Ga0137399_1023418013300012203Vadose Zone SoilMSNLAQNEFQSTRRSLRPESAASHRRRPLSVLFVHRDADVVDSCVEELKKARFTVSADLVLTLTQCTQQLRSQTYDVVVAEYPSPNWKGSQA
Ga0137399_1054687323300012203Vadose Zone SoilMALMSHLAQNEIQSDQRSGRPDSTASHRRRPLRVLFVHRDADAVENCLQELKKAQFTVSADFVLTLAQCTKQLRSQPYDVVVAEYPSPNWKGAQALQLLHQT
Ga0137362_1002189913300012205Vadose Zone SoilMSSLTQNKIPSDPRGSRPGSTDSHRSRPIHILFVHREADTVECCLQELKKAQFVVSADTALSLAQCTEQLRSQSYDVVVAEYPSPSWKGSQGLQFL
Ga0137370_1027280423300012285Vadose Zone SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHSDADTVESCLQELKKAHFMVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKGSQGLQSLRQTVE
Ga0137361_1038210033300012362Vadose Zone SoilMDGFGVLGMASVSKLLRSEVQSERRNIRPESAASHRRRPIRVLFIHRDADGVDSCVQELEKAQFTVAADVVLTLAQCAEQLRFQSYDVVVAEYPSPSWKRPQ
Ga0137413_1138551913300012924Vadose Zone SoilMSNLAQNEFQSTRRSLRPESAASHRRRPLSVLFVHRDADVVDSCLEELKKARFTVSADLALTLTQCAQQLRSQTYDVVVAEYPSPNWKGSQALQLLHQTVQEI
Ga0137419_1023389323300012925Vadose Zone SoilMSSRTQNKIPSDPRGSRPGSTDSHRSRPIHILFVHRKADTVECCLQELKKAQFVVSADTALSLAQCTEQLRSQSYDVVVAEYPSPSWKGS
Ga0137407_1147143413300012930Vadose Zone SoilMSNLAQNEFQSARRSFRPESAGSHRRCSLSVLFVHRDADAIESCLEELKKARFTVSADFVLNLTQCTERLHSQSYDVVVAEYPSPSWKGPQALQLLRQTV
Ga0134087_1017341923300012977Grasslands SoilMSSLIQSKVPSDPRGSRLGSTDSHRSRPLRILFFHREADTVECCLQELKKAQFVVSADTALDLAQCTEQLRSQSYDVVVAEYPSPSLKGSQGLQFLRQTVEDTPLLFV
Ga0137418_1091841313300015241Vadose Zone SoilMSSLTQSKIRSVPITESTDSQRFRPLHILFIHRDAETVECCLQELKKAQFIVSADIALNLAQCAEQLRSQSYDVVVAEYPSP
Ga0137412_1130746013300015242Vadose Zone SoilMSSLTQSKFQSDQRGFRPGSTDLHRSCPLHILFIHRDADTVECCLQELKKARFVVSADTALNLAQCTEQLRSQSFGVVVAEYLSPSWKGSHGLQFLR
Ga0134073_1023255613300015356Grasslands SoilMASMSHLAQNEIQSDQRSGRPESTASHRRRPLRVLFVHRDADAIESCLQELKKAQFTVSADFVLTLAQCTEQLRFQSYDVVVAEYPNPNWK
Ga0187802_1039754813300017822Freshwater SedimentMAKLAQNETQSDLPVFRPESTPSHRRRSLRVLFVHRDSDAVDSCLQELKKAQFTVGADFVLNLTQCAEQLRSQSYDVIVVEYPSPSCKGSKRLQLLHQTVQEVPLIFLTTGFATESVAELTAHGA
Ga0187820_101260823300017924Freshwater SedimentMAKLAQNETQAHLPVFRPESTSSHRRRSLRVLFVHRDSDAVDSCLQELKKAQFTVGADFVLNLTQCAEQLRSQSYDVIVVEYPSP
Ga0187819_1012209723300017943Freshwater SedimentMAKLAQSEIQSDLPVFRLESTSSHRRRSLRVLFVHRDSDAVDSCLQELKKAQFTVGADFVLNLTQCAEQLRAQSYDVIVVEYPSPSCKGSRLLQLLHQTVQ
Ga0210407_1102500723300020579SoilMSNLTQRGWQSDLASLRLEGGATHRRRSIRVLFVHRDAEVVDNCLQELKKARFVVSADVVLTLKQCKEQLHS
Ga0210403_1012064333300020580SoilMSNLTQIALQSDQARFRTESPAPHRRRPLRMLFIHRDAEVVDNCLEELKKAQFTVSADFVLNLAQCAERLNSQSYD
Ga0215015_1052950613300021046SoilMDGPGVLRMASISKLSQNEVQSDRRTGRTEKAAAHRRRPLRVLFIHRDADGVDCCVQELEKAQFTVGADVVLTLAQCAEQLCFESYDVVVAEYPSPSWKGPQALQFLQQTLQE
Ga0179596_1063966513300021086Vadose Zone SoilMSSLTQSKFQSDQRGFRPKSTDLYRSCPVHILFIHRDADTVECCLQELKKARFVVSADTVLNLAQCTEQLRSQSYDVVVAEYPSPSWKGSQGLQFLRQTVEEVPLLFVTTSKGKESIAELTAHGACD
Ga0210405_1003159613300021171SoilMASISRLGQTEIQSNRTNVRPDGAASHRRRPLHVLFIHREADAVDACLQELEKAQFTVGADVVLTLAQCTKQLRFQSYDVVVAEYPSPSWKGSQ
Ga0210405_1008359553300021171SoilMANLFQKESSPHQGSVCPEGPPSHRRRTLSVLFVHRDANAIDSCVQELEKGQFTVIHDFVLNLGQCAVQLRSQRYDVIVVEYPSPSCKGSQVLQLLHQT
Ga0210385_1140692713300021402SoilMSNLAQNDLRPDLASSWSESKSPHRRRSLRVLFVHRDAEVVDNCLEELKKAQFIVSAGFVLNLAQCGDRLRSQSYDVVVAEYPCPSLKGSRALQVL
Ga0210387_1083537013300021405SoilMASISRLGQTEIQSNRTNVRPDGAASHRRRPLHVLFIHREADAVDACLQELEKAQFTVGADVVLTLAQCTKQLRFQSYDVVVAEYPSPSWKGSQALQLLQQTL
Ga0210394_1028391613300021420SoilMDGFGWLRAWASMSNLAQNEFQSARRSSRPESAASHRRHPLNVLFMHRDADVVESCLEELKKARFTVCADLVLTLAQCAQQLRSQTYDVVVAEYPSPNWK
Ga0210384_1038971223300021432SoilMSNLAQNEFRSVRKDFRPETSGAHRRRPLKVLFIHRDADVVDSCLEELKKAQFTISSDLVLTLAQCAQQLRQQTYDVVVAEYPSPSWKGSQALQLL
Ga0210384_1162553513300021432SoilMSNLALSSWRSDLARFRTTSHRRRPLNVLFVHRDAEVVENCVEALKKAQFVVSADFVLNLPQCAERLNSQSFDVVVAEYPCPSLKASQTLH
Ga0210392_1112277213300021475SoilMSNLAQNDLQCEKANFRREGTPSHRRRSIRVLFVHRDADVVDNCVQELKKARFQVSADLVLSLAQCAQQLRSQTYDMVVAEYPSPSWKGSQALQLLHQTVQETPLLFVTNAIGSESI
Ga0210398_1008570143300021477SoilLAMCVSMSNLTLNDLRSDLASFWSESKASHRRRPLRVLFVHRDAEVVDNCLEELKKAQFVVSADFVLNLAQCGERLRSQSYDVVVAEYPCPSLKGSHALEVLHKQLRET
Ga0210410_1078460723300021479SoilMSTVAQHEFQSARRSFRPESAAPHRRRPLSVLFIHRDADVVDSCVEELKRARFTVSADLVLTLAQCTQQLRRQNYDVVVAEYPSPSWKGSQALQLLHQTVQEIPLLFVTTAMGGESIAELTIHGA
Ga0210409_1016158043300021559SoilMSNLAQNELQSAQRSFRPESVAAHRRRPINVLFVHRDADVVDNCLEELKKAQFIVNSDFVLTLAQCTQQLQSQSYDVVVAEYPSPSWKGPQALQLLHQTVQEIPLLFLT
Ga0242663_111525513300022523SoilMSNLTQIALQSDQARFRTESPASHRRRPLRMLFIHRDAEVVDNCLEELKKAQFTVSADFVLNLAQCAERLHAQSYDVVVAEYPCPSLKGSRALQVLHKEL
Ga0247694_100151313300024178SoilMSNLAQSELQADLASFRPESTASQRRRPLRVLFVHREAEIIENCLEELKKAQFIVSADFVLNLAQCAERLHSQSYDVVVAEYPRPSLKEAQALQVLHQELQDIPL
Ga0247669_103146513300024182SoilMCASMSNLARSELQADLASFRPESTASQRRRRVRVLFVHREAEIIENCLEELKKAQFIVSADFVLNLAQCAERL
Ga0247692_104721313300024279SoilMSNLAQSELQADLASFRPESTASQRRRPLRVLFVHREAEIIENCLEELKKAQFIVSADFVLNLAQCAERLHSQSYDVVVAEYPHPSLKGSQALQVLHQKLQHIPLL
Ga0247667_100724633300024290SoilMSNLAQSELQADLASFRPESTASQRRRPLRVLFVHREAEIIENCLEELKKAQFIVSADFVLNLAQCAERLHSQSYDVVVAEYPHPSLKG
Ga0207685_1066878813300025905Corn, Switchgrass And Miscanthus RhizosphereMPSISQLVRNEILSGRGDIRPDSTTSHRPRPLRVLFIHRDADVVDSCLEELKKAQFTVRADLVLTIAQCTQQLRSQTYDVVVAEYPSPNWKGSQSLQLLRQTVQEMP
Ga0207707_1119425713300025912Corn RhizosphereMSNLAQSELQADLASFRPESTASQRRRPLRVLFVHREAEIIENCLEELKKAQFIVSADFVLNLAQCAERLHSQSYDVVVAEYPHPSLKGSQALQV
Ga0207663_1035559613300025916Corn, Switchgrass And Miscanthus RhizosphereMSHLAQNKIQSDQRSGRPESTASHRRRPLRVLFVHRDANAIESCLQELKKAQFTVSADFVLTLAQCAEQLRFQSYDVVVAEYPS
Ga0209688_111261113300026305SoilMSSLAQNNVQSDKSGFRPESTDSLRFRPLRILFIHRDAVTVESCLQELKKAQFAVSADIALNLAQCAEQLRSQSYDVVVAEYPSPS
Ga0209239_118082223300026310Grasslands SoilMSHLAQNEIQSDQRSGRPESTASHRRRPLRVLFVHRDADAIESCLQELKKAQFTVSADFVLTLAQCTEHLRFQSYDVVVAEYPNPNWKGSQALQLLH
Ga0209154_118481723300026317SoilMHVVFMDGFGGSGESVSMSSLTQNNVQSDQRGFRPESTDSRRFPPLHILFIHRDADTVECCLQELKKAQFTVSADIALNLAQCTEQLRSQSYDVVVAEYPSPSWNGSQGLQSL
Ga0209472_101588463300026323SoilMSQLAQNEIQSDQRTSRPESTASHRRRPLRVLFVHHDANVMESCLQELKKAQFTVSADSVLTLAQCTEQLRLQTYDVVVAEYPSPNWKGSQ
Ga0257172_101099923300026482SoilMSNLAQNEFQSTRRSLRPESAASHRRRPLSVLFVHRDADVVDSCVEELKKARFTVSADLVLTLTQCTQQLRSQTYDVVVAEYPSPNWKGSQALQLLHQ
Ga0209806_100540783300026529SoilMSSLTQNNVRSDQSGFRPESTDSQRFRPLRILFIHRDADTVESCLQELKKAQFMVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWKGSQGLQSLRQTVEEVPLLFVTTSKGKESIA
Ga0209806_117817713300026529SoilMSSLTQNNVQADQIGFRPEGTEGKSPPLRILFIHRDADTVESCLQELKKARFTVSADIALNLAQCTEQLRSESYDVVVAEYPSPSWKGSQGLQSLRQTVEEVPLLFVTTSKGKESIA
Ga0209648_1023620723300026551Grasslands SoilMSNRAQNEFQSTRRSLRPESAASHRRRPLSVLFVHRDADVVDSCVEELKKARFTVSADLVLTLTQCKQQLRSQTYDVVVA
Ga0209217_100561863300027651Forest SoilMDGLGVAAMASISKLSQSENRSDRRSLRPDRSASHRRRSLRVLFIHRDADAVDSCVEELKKAQFTVGVDVVLTLAQCTAQLRLQS
Ga0209388_103388033300027655Vadose Zone SoilMSNLAQNEFQSARRSFRPESAGSHRRCSLSVLFVHRDADAIESCLEELKKARFTVSADFVLNLTQCTERLHSQSYDVVVAEYPSPSWK
Ga0209588_118511313300027671Vadose Zone SoilMSNLAQNEFQSARRSFRPESAGSHRRCSPSVLFVHRDADVIESCLEELKKARFTVSADFVLNLTQCTERLHSQSYDVVVAEYPSPSWKGPQALQLLRQTVQEIPLLFVTTAMGSESIAQL
Ga0209118_104464223300027674Forest SoilMSNLVQSEFQSAQRSFRPETAASHRRRPLSVLFIHRDADAVDSCVEELKKASFTVSADLVLTLAQCRQQLRSQAYDVVVAEYPSPNWKGSQALQLLHQTVQEIPLLFVTTAIGSESIAQLTAD
Ga0209626_119927313300027684Forest SoilMSNLAQNEFQSARRSLRPESAASHRRRPLNVLFMHRDADVVESCLEELKKARFTVCADLVLTLAQCTQQLRSQTYDVVVAEYPSPNW
Ga0209689_102408113300027748SoilMSSLTQNNVQSDQRGFRPESTEGKSPPLRILFIHRDADTVESCLQELKKAQFTVSADIALNLAQCAEQLRSQSYDVVVAEYPSPSWK
Ga0209073_1018204513300027765Agricultural SoilMSSLTQSKIQSDPIGFGPRSTDSHRSRALHILFIHRDADTVDSCVQELRKAQFTVSADIALNLAQCRQLLHSDSYDVVVAEYPSPSW
Ga0209772_1000142713300027768Bog Forest SoilMSNMAQSELQSDPANFRREGRAAHRRRAIRVLFVHRDADVVDNCVQELKKARFQVCAEVVLSLGHCAQQLRCQTYDLVVAEYPSPSWKGSQALQLLHQTVQEIP
Ga0209074_1033882013300027787Agricultural SoilMSSVTQSRMQSDPAGFRPGNTDSRRFPPLHMLFIHRDADTVECCLQELKKAQFTVSADIALNLAQCTEQLRFQSYDVVV
Ga0209274_1005113213300027853SoilMCASMSNLSQNHLRSDLASFWSEGKASRRRRPLRVLFVHRDAEVVDNCLEELKKAQFTVSADFVLNLAQCGERLRSQSYDVVVAEY
Ga0209274_1007216833300027853SoilMSNLTLNDLRSDLASFWSESKASHRRRPLRVLFVHRDAEVVDNCLEELKKAQFVVSADFVLNLAQCGERLRSQSYDVVVAEYPC
Ga0209274_1072144923300027853SoilMSNLAQNKLRPDLASFQLGSSTSHRRRPLRILFVHRDAEVVENCLDALKKSQFTVSADIVLNLSQCAEQFHSQSYEVVVAEYPCPSSKGCQALRLLQKK
Ga0209275_1018847913300027884SoilMSNLTQNDLRSDLASFWSESKASHRRRLLRVLFVHRDAEVVDNCLEELKKAQFMVSADFVLNLAQCGERLRSQSYDVVIAEYPCPSLKGSQ
Ga0209698_1092085523300027911WatershedsMTNLAQNELHSARRNFRSESSGAHRRRPLKVLFIHRDADVVDSCLEELKKAQFTISSDLVLTLGQCAQQLRQQPYDVVVAEYASPSWKGSQALQLLHQTVREVPLIFV
Ga0137415_1032760723300028536Vadose Zone SoilMDGFGVLGMASVSKLLRSEVQSERRNIRPESAASHRRRPIRVLFIHRDADGVDSCVQELEKAQFTVAADVVLTLAQCAEQLRFQSYDVVVAEYPSPSWKRPQALQLLQQTLQEIPSCS
Ga0308309_10004917103300028906SoilMSNLAQNALQSGHAKFRTESAVSHGRCPLRVLFVHRDAEVVDSCLEELKKAQFTVSADLVLNLTQCFEQLQSQSYDVVVAEYPCPGPKRSRALQGLHKELQETPL
Ga0222749_1005100013300029636SoilMSNLALSSWRSDLARFRTTSHRRRPLNVLFVHRDAEVVENCVEALKKAQFVVSADFVLNLPQCAERLNSQSFDVVVAEYPCPSLKASQTLHVLKKKFQEIPLL
Ga0170834_10954135723300031057Forest SoilMNASMPNLARNDLPSNQPSFRPESTASHRRCPLKVLFVHRDAEVVDNCLGELKKAQFVVSADFVLNLAQCAERLHSQSYDVIVAEYPCPRWKGYQAFQGLHQELGQIP
Ga0307469_1110068513300031720Hardwood Forest SoilMSNLVQSDLQPDPISFRTEGRPSHRRRSLHVLFVHRDADAIDCCVEELKKAQFIVSADFVLTLAQCRQQLRSQTYDVVVAEYPSPSWKGAQALQLLHQTVQEIPLLF
Ga0307477_1064033323300031753Hardwood Forest SoilMARLAQNEIQSELRSFRPESTATHRRRPLSVLFVHRDADAVHNCLEELKKAQFIVNSDLVLTLAQCTQQLRSQSYDVVVAEYPSASWKGPQAMQLLHQ
Ga0307477_1070125113300031753Hardwood Forest SoilMANLAQNEFQSVQRSFRPQSAVTHRRCPLSVLFVHRDADVVDDCLEELKKAQFIVNSDFVLTLAQCTQQLRSQSYDVVVAEYPSPSWKGPQALQFLHQTVQEIPLVFLTAVTGNQS
Ga0307477_1079969513300031753Hardwood Forest SoilLSSLQSKSSLLRPDPIGFRPGGTDSHRSCPLHILFIHRDADTVECCVQELKKAQFTVSADIALNLAQCTEQLRSHSYDVV
Ga0307475_1009959213300031754Hardwood Forest SoilMPNVAQNKIQSDPVSSRKQSSPSHRRRSLRVLFVHRDADAIDCCLEELKKAQFIVSADFVLTLAQCRQQLHSQTFDVVVAEYPSPSWKGPQALQLLHQTVQEIP
Ga0307475_1022059013300031754Hardwood Forest SoilMANLAQNEFQSVQRSFRPQSAVAHRRCPLSVLFVHRDADVVDDCLEELKKAQFIVNSDFVLTLAQCTQQLRSQSYDVVVAEYPSPSWKGPQALQFFHQTVQEIPLVFLTAATGNESIAKLTADGP
Ga0307475_1030886623300031754Hardwood Forest SoilMSSLTRTKTQSDPIGFPPASRDSHRSRPLYILFIHRDADTVECCLQELKKAQFTVSADIALNLAQCTEQLRSQSYDVVVAECPSPSWKGSQGLQFLRQTVEGVPL
Ga0307475_1142692113300031754Hardwood Forest SoilMSFSHPYLYGWFWRLRMREFTSNIVHNELQSEPKSVPLEGTASHRRRPLNVLFIHRDADVVDACQEELKKAGFGVSADLVLTIAQCTEQLRSHSYDVVIAEYPSPNWKGSQALQLLRQTVQEIPLLFVTTA
Ga0307479_1166270313300031962Hardwood Forest SoilMDGPGVLRMASISKLSPNEIQTDRRIGRTERAAAHRRRPLRVLFIHRDADGVDCCVQELEKAQFTVGADVVLTLAQCAEQLRFEPYDVVVAEYPSPSWKGPQALQFLQQT
Ga0307471_10170211713300032180Hardwood Forest SoilMARLAQNEIQSRLRSFRPESTATHRRRPLSVLFVHRDADAVDSCLEELKKAQFIVNSDLVLTLAQCTQQLRSQSYDVVVAEYPSASWKGPQALQLLDQTVHE
Ga0307471_10211904213300032180Hardwood Forest SoilMVQVWASMSKLAQNEFQSARRGSRPETAAPHRRRPLSVLFVHREADVIDICLEELRKARFTVSADFVLNLAQCREQLRSQSYDVVVAEYPSPSWKGPQALQLLHQTVQEIPLLFVTTAM
Ga0307472_10000547723300032205Hardwood Forest SoilMSHLAQNEIQSDQRSGRPESTASHRRRPLRVLFVHRDADAIESCLQELKKAQFTVSADFVLTLAQCTEHLRLQSYDVVVAEYPNPNWK
Ga0307472_10173180013300032205Hardwood Forest SoilMSHLAQNKIQSDQRSGRPEITASHRRRPLRALFVHRDANVIDNCLEELKKAQFTVSADFVLTLAQCAEQLRFQSYDVVV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.