NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F059480

Metagenome / Metatranscriptome Family F059480

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F059480
Family Type Metagenome / Metatranscriptome
Number of Sequences 134
Average Sequence Length 109 residues
Representative Sequence EWSVAVGQLLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEIAGELAEVENSLFRSSATERDLLNAARRLHDIAHPSPGRLRR
Number of Associated Samples 119
Number of Associated Scaffolds 134

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.77 %
% of genes near scaffold ends (potentially truncated) 96.27 %
% of genes from short scaffolds (< 2000 bps) 90.30 %
Associated GOLD sequencing projects 110
AlphaFold2 3D model prediction Yes
3D model pTM-score0.73

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (88.060 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(17.164 % of family members)
Environment Ontology (ENVO) Unclassified
(28.358 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(59.701 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 57.60%    β-sheet: 0.00%    Coil/Unstructured: 42.40%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.73
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
a.285.1.1: MtlR-liked3brja13brj0.67195
f.13.1.0: automated matchesd3ddla_3ddl0.67097
f.13.1.0: automated matchesd6eida_6eid0.65931
a.285.1.1: MtlR-liked2hkta_2hkt0.65166
a.160.1.5: AadK C-terminal domain-liked2pbea12pbe0.64329


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 134 Family Scaffolds
PF07726AAA_3 79.10
PF01078Mg_chelatase 7.46
PF01882DUF58 2.24
PF01944SpoIIM 1.49
PF01259SAICAR_synt 0.75
PF00118Cpn60_TCP1 0.75
PF13541ChlI 0.75
PF01128IspD 0.75
PF13242Hydrolase_like 0.75
PF05974DUF892 0.75

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 134 Family Scaffolds
COG1721Uncharacterized conserved protein, DUF58 family, contains vWF domainFunction unknown [S] 2.24
COG1300Stage II sporulation protein SpoIIM, component of the engulfment complexCell cycle control, cell division, chromosome partitioning [D] 1.49
COG0152Phosphoribosylaminoimidazole-succinocarboxamide synthaseNucleotide transport and metabolism [F] 0.75
COG0459Chaperonin GroEL (HSP60 family)Posttranslational modification, protein turnover, chaperones [O] 0.75
COG0746Molybdopterin-guanine dinucleotide biosynthesis protein ACoenzyme transport and metabolism [H] 0.75
COG1207Bifunctional protein GlmU, N-acetylglucosamine-1-phosphate-uridyltransferase/glucosamine-1-phosphate-acetyltransferaseCell wall/membrane/envelope biogenesis [M] 0.75
COG12112-C-methyl-D-erythritol 4-phosphate cytidylyltransferaseLipid transport and metabolism [I] 0.75
COG2068CTP:molybdopterin cytidylyltransferase MocACoenzyme transport and metabolism [H] 0.75
COG3685Ferritin-like metal-binding protein YciEInorganic ion transport and metabolism [P] 0.75


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms88.06 %
UnclassifiedrootN/A11.94 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001154|JGI12636J13339_1008449All Organisms → cellular organisms → Bacteria1620Open in IMG/M
3300001359|A3035W6_1026606All Organisms → cellular organisms → Bacteria756Open in IMG/M
3300001359|A3035W6_1125959All Organisms → cellular organisms → Bacteria1105Open in IMG/M
3300001409|JGI20185J14861_1006570All Organisms → cellular organisms → Bacteria848Open in IMG/M
3300001593|JGI12635J15846_10642622Not Available614Open in IMG/M
3300001593|JGI12635J15846_10748218All Organisms → cellular organisms → Bacteria562Open in IMG/M
3300001661|JGI12053J15887_10053559All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi2253Open in IMG/M
3300001664|P5cmW16_1079393All Organisms → cellular organisms → Bacteria523Open in IMG/M
3300002908|JGI25382J43887_10455207All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria542Open in IMG/M
3300004479|Ga0062595_100198454All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1238Open in IMG/M
3300004479|Ga0062595_100508762All Organisms → cellular organisms → Bacteria907Open in IMG/M
3300005166|Ga0066674_10359950All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium682Open in IMG/M
3300005167|Ga0066672_10187393All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1314Open in IMG/M
3300005172|Ga0066683_10752671All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia571Open in IMG/M
3300005177|Ga0066690_10195683All Organisms → cellular organisms → Bacteria1344Open in IMG/M
3300005178|Ga0066688_10398414All Organisms → cellular organisms → Bacteria893Open in IMG/M
3300005179|Ga0066684_10219765All Organisms → cellular organisms → Bacteria1236Open in IMG/M
3300005186|Ga0066676_10734273All Organisms → cellular organisms → Bacteria672Open in IMG/M
3300005363|Ga0008090_13806257All Organisms → cellular organisms → Bacteria506Open in IMG/M
3300005363|Ga0008090_14319718All Organisms → cellular organisms → Bacteria1836Open in IMG/M
3300005444|Ga0070694_101921674All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces → unclassified Actinomyces → Actinomyces sp. oral taxon 448505Open in IMG/M
3300005447|Ga0066689_10685622All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium642Open in IMG/M
3300005518|Ga0070699_100320265All Organisms → cellular organisms → Bacteria1394Open in IMG/M
3300005542|Ga0070732_10680690All Organisms → cellular organisms → Bacteria626Open in IMG/M
3300005546|Ga0070696_100159503All Organisms → cellular organisms → Bacteria1661Open in IMG/M
3300005549|Ga0070704_100002349All Organisms → cellular organisms → Bacteria10615Open in IMG/M
3300005549|Ga0070704_101528172All Organisms → cellular organisms → Bacteria614Open in IMG/M
3300005558|Ga0066698_10513673All Organisms → cellular organisms → Bacteria814Open in IMG/M
3300005569|Ga0066705_10695445All Organisms → cellular organisms → Bacteria613Open in IMG/M
3300005574|Ga0066694_10101939All Organisms → cellular organisms → Bacteria1343Open in IMG/M
3300006041|Ga0075023_100151297Not Available855Open in IMG/M
3300006055|Ga0097691_1016457All Organisms → cellular organisms → Bacteria3364Open in IMG/M
3300006162|Ga0075030_100679044All Organisms → cellular organisms → Bacteria815Open in IMG/M
3300006172|Ga0075018_10465928Not Available653Open in IMG/M
3300006354|Ga0075021_10664369All Organisms → cellular organisms → Bacteria668Open in IMG/M
3300006755|Ga0079222_11122599All Organisms → cellular organisms → Bacteria693Open in IMG/M
3300006804|Ga0079221_10602507All Organisms → cellular organisms → Bacteria741Open in IMG/M
3300006854|Ga0075425_102956117All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → Nocardioidaceae → Nocardioides → Nocardioides alkalitolerans520Open in IMG/M
3300006864|Ga0066797_1104132All Organisms → cellular organisms → Bacteria994Open in IMG/M
3300006914|Ga0075436_101112470All Organisms → cellular organisms → Bacteria595Open in IMG/M
3300007076|Ga0075435_100271548All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1446Open in IMG/M
3300007265|Ga0099794_10220598All Organisms → cellular organisms → Bacteria974Open in IMG/M
3300009012|Ga0066710_101592648All Organisms → cellular organisms → Bacteria1001Open in IMG/M
3300009012|Ga0066710_103493040All Organisms → cellular organisms → Bacteria595Open in IMG/M
3300009012|Ga0066710_104351045Not Available529Open in IMG/M
3300009012|Ga0066710_104513684All Organisms → cellular organisms → Bacteria520Open in IMG/M
3300009088|Ga0099830_11608836All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300009093|Ga0105240_11449954All Organisms → cellular organisms → Bacteria721Open in IMG/M
3300009137|Ga0066709_101430238All Organisms → cellular organisms → Bacteria1003Open in IMG/M
3300010301|Ga0134070_10238964Not Available677Open in IMG/M
3300010336|Ga0134071_10061258All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1728Open in IMG/M
3300010364|Ga0134066_10214827All Organisms → cellular organisms → Bacteria646Open in IMG/M
3300010401|Ga0134121_11905396All Organisms → cellular organisms → Bacteria624Open in IMG/M
3300011269|Ga0137392_11489074All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300011271|Ga0137393_10170629All Organisms → cellular organisms → Bacteria1826Open in IMG/M
3300011999|Ga0120148_1009676All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia → Kallotenuales → Kallotenuaceae → Kallotenue → Kallotenue papyrolyticum2403Open in IMG/M
3300012096|Ga0137389_10722991All Organisms → cellular organisms → Bacteria855Open in IMG/M
3300012096|Ga0137389_11302099All Organisms → cellular organisms → Bacteria620Open in IMG/M
3300012204|Ga0137374_10236925All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1541Open in IMG/M
3300012209|Ga0137379_10074315All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia3268Open in IMG/M
3300012351|Ga0137386_10841091All Organisms → cellular organisms → Bacteria659Open in IMG/M
3300012356|Ga0137371_10945979All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia655Open in IMG/M
3300012360|Ga0137375_10093437All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia3090Open in IMG/M
3300012363|Ga0137390_11020876All Organisms → cellular organisms → Bacteria778Open in IMG/M
3300012918|Ga0137396_10310014All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1167Open in IMG/M
3300012925|Ga0137419_10143640All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1715Open in IMG/M
3300012927|Ga0137416_10667465All Organisms → cellular organisms → Bacteria911Open in IMG/M
3300012927|Ga0137416_12119089All Organisms → cellular organisms → Bacteria517Open in IMG/M
3300012929|Ga0137404_12062946Not Available532Open in IMG/M
3300012944|Ga0137410_11970018All Organisms → cellular organisms → Bacteria519Open in IMG/M
3300013768|Ga0120155_1030411All Organisms → cellular organisms → Bacteria1712Open in IMG/M
3300014056|Ga0120125_1153906All Organisms → cellular organisms → Bacteria561Open in IMG/M
3300015052|Ga0137411_1137058Not Available696Open in IMG/M
3300015241|Ga0137418_10251650All Organisms → cellular organisms → Bacteria1499Open in IMG/M
3300015356|Ga0134073_10098247All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300015359|Ga0134085_10074437All Organisms → cellular organisms → Bacteria1385Open in IMG/M
3300015373|Ga0132257_101791971All Organisms → cellular organisms → Bacteria788Open in IMG/M
3300017936|Ga0187821_10333473All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300018468|Ga0066662_10540991All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1073Open in IMG/M
3300018468|Ga0066662_11058084All Organisms → cellular organisms → Bacteria807Open in IMG/M
3300018482|Ga0066669_10681697All Organisms → cellular organisms → Bacteria905Open in IMG/M
3300019878|Ga0193715_1008318All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia → Kallotenuales → Kallotenuaceae → Kallotenue → Kallotenue papyrolyticum2222Open in IMG/M
3300020006|Ga0193735_1110394All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300020010|Ga0193749_1021730All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1255Open in IMG/M
3300020022|Ga0193733_1148251All Organisms → cellular organisms → Bacteria636Open in IMG/M
3300021178|Ga0210408_10801270Not Available738Open in IMG/M
3300021420|Ga0210394_10483237All Organisms → cellular organisms → Bacteria1090Open in IMG/M
3300021479|Ga0210410_11655438All Organisms → cellular organisms → Bacteria533Open in IMG/M
3300021560|Ga0126371_13629277All Organisms → cellular organisms → Bacteria521Open in IMG/M
3300024249|Ga0247676_1005896All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1825Open in IMG/M
3300024330|Ga0137417_1253541All Organisms → cellular organisms → Bacteria800Open in IMG/M
3300025509|Ga0208848_1000287All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi9764Open in IMG/M
3300025885|Ga0207653_10258110All Organisms → cellular organisms → Bacteria669Open in IMG/M
3300025913|Ga0207695_10988038All Organisms → cellular organisms → Bacteria721Open in IMG/M
3300026223|Ga0209840_1100321Not Available625Open in IMG/M
3300026277|Ga0209350_1149634All Organisms → cellular organisms → Bacteria526Open in IMG/M
3300026295|Ga0209234_1193319All Organisms → cellular organisms → Bacteria698Open in IMG/M
3300026298|Ga0209236_1251995All Organisms → cellular organisms → Bacteria582Open in IMG/M
3300026310|Ga0209239_1336175All Organisms → cellular organisms → Bacteria526Open in IMG/M
3300026313|Ga0209761_1069252All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia1879Open in IMG/M
3300026320|Ga0209131_1197050All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium941Open in IMG/M
3300026325|Ga0209152_10063547All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1321Open in IMG/M
3300026330|Ga0209473_1096271All Organisms → cellular organisms → Bacteria1238Open in IMG/M
3300026331|Ga0209267_1190420All Organisms → cellular organisms → Bacteria797Open in IMG/M
3300026515|Ga0257158_1129361All Organisms → cellular organisms → Bacteria510Open in IMG/M
3300026523|Ga0209808_1206667All Organisms → cellular organisms → Bacteria646Open in IMG/M
3300026524|Ga0209690_1276558All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300027565|Ga0209219_1006886All Organisms → cellular organisms → Bacteria2514Open in IMG/M
3300027633|Ga0208988_1043106All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1158Open in IMG/M
3300027655|Ga0209388_1096327All Organisms → cellular organisms → Bacteria849Open in IMG/M
3300027660|Ga0209736_1141652Not Available641Open in IMG/M
3300027678|Ga0209011_1091453All Organisms → cellular organisms → Bacteria892Open in IMG/M
3300027681|Ga0208991_1090474All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300027787|Ga0209074_10137829All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium865Open in IMG/M
3300027787|Ga0209074_10444696All Organisms → cellular organisms → Bacteria552Open in IMG/M
3300027842|Ga0209580_10277372All Organisms → cellular organisms → Bacteria834Open in IMG/M
3300027842|Ga0209580_10460296All Organisms → cellular organisms → Bacteria634Open in IMG/M
3300027903|Ga0209488_10280987All Organisms → cellular organisms → Bacteria1245Open in IMG/M
3300027911|Ga0209698_10909157All Organisms → cellular organisms → Bacteria660Open in IMG/M
3300027915|Ga0209069_10550253Not Available657Open in IMG/M
3300028803|Ga0307281_10298361Not Available601Open in IMG/M
3300028828|Ga0307312_10817804All Organisms → cellular organisms → Bacteria617Open in IMG/M
3300031670|Ga0307374_10231651All Organisms → cellular organisms → Bacteria1266Open in IMG/M
3300031672|Ga0307373_10239978All Organisms → cellular organisms → Bacteria1232Open in IMG/M
3300031672|Ga0307373_10275095All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1099Open in IMG/M
3300031720|Ga0307469_12102926All Organisms → cellular organisms → Bacteria549Open in IMG/M
3300031720|Ga0307469_12520938All Organisms → cellular organisms → Bacteria503Open in IMG/M
3300031740|Ga0307468_101990302All Organisms → cellular organisms → Bacteria556Open in IMG/M
3300031754|Ga0307475_10427707All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1064Open in IMG/M
3300032180|Ga0307471_101756919All Organisms → cellular organisms → Bacteria773Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil17.16%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil13.43%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.19%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil7.46%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds4.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.48%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost4.48%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.48%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.73%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.73%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.73%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.99%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.24%
Arctic Peat SoilEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil2.24%
SoilEnvironmental → Terrestrial → Soil → Clay → Unclassified → Soil2.24%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.24%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.49%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil1.49%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.49%
Tropical Rainforest SoilEnvironmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Tropical Rainforest Soil1.49%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.75%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.75%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.75%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil0.75%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.75%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001154Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1EnvironmentalOpen in IMG/M
3300001359Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A30-35cm)- 6 month illuminaEnvironmentalOpen in IMG/M
3300001409Arctic peat soil from Barrow, Alaska - NGEE Surface sample 53-1 deep-072012EnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300001664Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - 5cm_reassembledEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005363Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome F II A100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006055Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-1 deep-072012EnvironmentalOpen in IMG/M
3300006162Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006354Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006864Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 3 DNA2013-193EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011999Permafrost microbial communities from Nunavut, Canada - A28_65cm_6MEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300013768Permafrost microbial communities from Nunavut, Canada - A35_65cm_0MEnvironmentalOpen in IMG/M
3300014056Permafrost microbial communities from Nunavut, Canada - A20_5cm_0MEnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020010Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300024249Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK17EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025509Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-1 shallow-072012 (SPAdes)EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025913Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026223Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 2 DNA2013-190 (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300027565Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027633Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027681Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027738Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027911Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300031670Soil microbial communities from Risofladan, Vaasa, Finland - OX-3EnvironmentalOpen in IMG/M
3300031672Soil microbial communities from Risofladan, Vaasa, Finland - OX-2EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12636J13339_100844933300001154Forest SoilRRFGPLIERPAEMARSDVEWSVAVGQLLRRSGARAVTLGLLANATERVVASRNGLSVQPRERFWNALWVRDPQVAGDLAQVENSLVAASTTESGLLDAAQQLHRIAHPVAPTKTLERR*
A3035W6_102660613300001359PermafrostVGLVLRGRPFGPLIPRPAHAVRADVEWAVAVGEMLRRSSARAVTLGLLASATERAVSAQTGIPLQPRERFWNALWVRVPELAAELAEAENALHAAAVSEPELLKAAQRLHRIAHPASGRVA*
A3035W6_112595913300001359PermafrostLVAVFLGLLLRGRRFGPLVARPAEAARSDVEWAVAVGELLRRSGARTVTLGLLATASERAVAARTGLPLKPRERFWNALWIRAPQLASELAEAENALYGSAGGDAELLKAAQRLHDVAYPISPERRPLRPTVPKEAK*
JGI20185J14861_100657013300001409Arctic Peat SoilGIMWLLVAVFVGLLLRGRGFGALIPRQVEAARADTEWAVAVGELLRRSGARAVTLGVLATASERAVAARTGLPLKPRERFWNALWIRAPELAGALAEAETALHNSVGGDADLLKAAQRLHDVAYPISPERSRLRPTASKEVK*
JGI12635J15846_1064262213300001593Forest SoilGLLAYATERAVATRTGIPLQPRERFWNALWVRAPEIAADLAEVETSLHASGSTEPDLLKTAQRLHRIAHPAPPGKAQGARGAKS*
JGI12635J15846_1074821823300001593Forest SoilAEVARTDVEWAVAVGQLLRRSGARAVTLGLLAVATERAVATQTGIPLQPRERFWNALWVRVPGLAAELAEAENALHDSAANEADLLNAAQRLHRIAHPAAAGPRTRARPSDRVA*
JGI12053J15887_1005355913300001661Forest SoilHAVTLGLLASATERSVSVQTGIPLQPRERFWNALWVRAPQLAAELAEAENVINASAAGEAELFRAAQRLHQIARPVTPDKVAKVAR*
P5cmW16_107939323300001664PermafrostQAARADVEWAVAVGQLLRRSSARALTLGLLASATERAVSAQTGIPLQPRERFWNALWVRVPGLAADLAEAENVLYASAAGEAELLQAAQRLHQIARPVTPDRVARGAR*
JGI25382J43887_1045520723300002908Grasslands SoilRPAEIARSDAEWSVAVGQLLRRSSARGVALGLLAGATERAVAARTGLPLQPRERFWNALWVRAPELAHDLAAVENSLHASSATERDMLDAARRLHAIAHPAPSGKTR*
Ga0062595_10019845413300004479SoilVGQLLRRSGARAVTLGLLAVATERAVAAQTGIPLQPRERFWNALWVRVPGLASDLAEAENALLASATSEGDLLKAAQRLHRIAHPEANGKRARTSSGV*
Ga0062595_10050876213300004479SoilGLVLRGRNFGPLIPAPAETGRSDAEWSVAVGRLLRRSSARAITLGLLASATERAVASRTGLPLQPRERFWNALWVRAPELAAELAEAETTIHSSSASEADLLLAARRLHRIAHPTPGSPPQARPKGAF*
Ga0066674_1035995023300005166SoilVGRVPETVRSDVEWSVAVGQLLRRSSARRVTLGLLAGATERAVALHTGLPVQPRERFWNALWVRAPEVARELAEVENSLDTSSASEHDVLTSARRLHEIAHPAATRRK*
Ga0066672_1018739313300005167SoilAEWSVAVGQLLRRSSARGVSLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIETSLYTSSSTERDLLTAASRLHAIAHPRPKKLAGPGR*
Ga0066683_1075267113300005172SoilMWLLVAVFFGLFLRGRRFGPLVERPAELARSDVEWSVAVGQLLRRSSASAVTLGLLAGATERAVALRTGLPLQPRERFWNALWVRAPEIAAELAEVENSLFRSSATERDLLNAARRLHEIAHPAPSPRLRGIPTPSARGPR*
Ga0066690_1019568323300005177SoilLRRSSARGVTLGLLAAATERAVAARTGLPLQPRERFWNALWVRAPAVAEELAQVENSIAASPAGERDLLDAARKLHEIAHPAPSPRLRRGSSP*
Ga0066688_1039841423300005178SoilWGAGLLWLLVAVFFGLLLRGRRFGPLIERRAEVARSDVEWSVAVGQLLQRSSARAVTLGVLARAAERAVASRTGLPLQPRERFWNALWVRAPVIASELAEVENSLVAPSPSERDLLNAARRLHEIAHPSPRRLRK*
Ga0066684_1021976513300005179SoilVGRPAEVARSDAEWSVAVGQLLRRSSARAVTLGMLATATERAVASQTGLPLQPRERFWNALWVRAPEVAAKLAEVESSLYAASATEGQLLDAARRLHEIAHPAGMRAR*
Ga0066676_1073427323300005186SoilAEWSVAVGQLLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIETSLYTSSSTERDLLTAASRLHAIAHPRPKNLAGPGR*
Ga0008090_1380625713300005363Tropical Rainforest SoilLRRSGARAVTLGVLANATERAVATRTGLPLQPRERFWNALWVRAPEIAAALANVESSLFSASSSERSLLDAARRLHEIAHPRPRPR*
Ga0008090_1431971813300005363Tropical Rainforest SoilEVARSDVEWSKAVGQLLRRSSARAVTLGLLATATERAVAARTGLPMQPRERFWQALWVRAPDVAGELAQVESSLTSSSASERDLMKAARRLHEIAHPAPRR*
Ga0070694_10192167413300005444Corn, Switchgrass And Miscanthus RhizosphereLRGRRFGPLVGRPAELARSDAEWSVAVGELLRRSSARTVTLGLLATATERAVAAQTGLPLQPRERFWNALWVRAPEVAGELAEIETSLHAASSSEGQLLNAARRLHGIARPSTVKTR*
Ga0066689_1068562213300005447SoilVGQLLRRSSARRVTLGLLAGATERAVALHTGLPVQPRERFWNALWVRAPEVARELAEVENSLHTSSASEHDVLNAARRLHEIAHPAPIRRK*
Ga0070699_10032026513300005518Corn, Switchgrass And Miscanthus RhizosphereLLWLLIAVFTGLILRGRSFGPLIPRPAEVARVDAEWAVAVGQLLRRSSARAVTLGLLASATERSLAARTGIPLQPRERFWNALWVRMPEIAADLAEAENALAESAATEHELLKTAQRLHGIARPVPIERRPTN*
Ga0070732_1068069023300005542Surface SoilRRAGARTVTLGLLATATERAVALRTGLPVQPREQFWNALWVRAPELAAELAAAENDLHQSGATEAGLLNAARRLHRIAQPIAEERRRQGAVRRPA*
Ga0070696_10015950333300005546Corn, Switchgrass And Miscanthus RhizosphereVGQLLRRSSARAVTIGLLAHATEREVAAQNGIPMQPRERFWNALWVRAPEVARELAEVEDSLRAAPSGEGELLHAARRLHRIAHPVIRR*
Ga0070704_10000234913300005549Corn, Switchgrass And Miscanthus RhizosphereRRFGPLIERPAEIARSDAEWSVAVGQMLRRSSARAVTLGLLASAAERAVASRTGLALQPRERFWNALWVRAPEVAGELAEVETSLYAASGNEGDLLKAARRLHQIAYPVAHQPHPGAPRRTSP*
Ga0070704_10152817213300005549Corn, Switchgrass And Miscanthus RhizosphereRRFGPLVERPAEIARSDAEWSVAVGQLLRRSSARAVTIGLLAHATEREVAAQNGIPMQPRERFWNALWVRAPEVAKELAEVEDSLRATPAGERELLHAARRLHRIAHPVIRR*
Ga0066698_1051367323300005558SoilEWSVAVGQLLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEIAGELAEVENSLFRSSATERDLLNAARRLHDIAHPSPGRLRR*
Ga0066705_1069544513300005569SoilPIIERQAEVARSDAEWSVAVGQLLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIETSLYTSSSTERDLLTAASRLHAIAHPRPKKLAGPGR*
Ga0066694_1010193923300005574SoilGQLLRRSSARTVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVSGRAPSGRSLRDGPSSPQARKT*
Ga0066702_1008127113300005575SoilSSARGVTLGLLAAATERAVAARTGLPLHPRERFWNALWVRAPEVAAELAQVESSIHASSSTERGLLNAARRLHAIAHPAPKRLPGSVR*
Ga0075023_10015129723300006041WatershedsRSSARAVTLGLLATATERSVAAQTGIPLHPRERFWNALWVRVPGLAADLAEAEYALHDSAANEADLLKAAQRLHRIAHPAPAGKRGRTEPSDRVA*
Ga0097691_101645743300006055Arctic Peat SoilVTLGFLAAATERAVAARTGIQLQPRERFWNALWVRVPGLAADLAEAENALYASAASEPELLAAAQRLHHIAHPVAEERRRRVPVSGRVA*
Ga0075030_10067904413300006162WatershedsGPLIPRPAEPARSDVEWSVAVGRLLRRSGARSLTLGLLTAATERSVADRTGLPLQPRDRFWNALWVRAPVLAAELAEAEVALNGSAESDRDLVHAARRLHRIAYPTIGSVAQAPQGSLDERRP*
Ga0075018_1046592813300006172WatershedsLGLLASATERAVSVQTGIPLQPRERFWNALWVRAPELAAELAEAENVASGSAAGEAELLKAAQRLHHLARPATQDKVARAAR*
Ga0075021_1066436923300006354WatershedsSDVEWSVAVGQMLRRSSARAMTLGLLATATERAVASRAGLPMQPRERFWNALWVHAPELASELAAAEETISVSGSSDRELLGIAQRLHHVAYPVTERRAGPRARRRA*
Ga0079222_1112259913300006755Agricultural SoilGLWLRGRRFGPQVERPPEVARSDAEWSVAVGQLLRRSSARSVTLGLLAAATERAVATHTGLPVQPRDRFWNALWVRDPETARRLAEVEGSLHRADVSERDVLEAARKLHDIAHPQPAVTRR*
Ga0079221_1060250723300006804Agricultural SoilGRRFGPVVPRPAEEARSDVEWSVAVGQLLRRSGAGRMTMGMLAIATERAVAARTGLPLQPRERFWNALWVRAPEVASDLAQAETEMTAAAGNERDLLNAARRLHDIAYPPSPRAGKT*
Ga0075425_10295611713300006854Populus RhizosphereHHGLTIGAFAPQAWLATSWGAAIMWLLVAVFFGLLLRGRRFGPLVGRAPETVRSDVEWSVAVGQLLRRSSAGRVTLGLLASATERAVALHTGLPVQPRERFWNALWVRAPEVARELAEVENSLHTSSASEHDVLTAARRLHEIAHPAPTRRK*
Ga0066797_110413223300006864SoilSFGPLIARPSELARTDVEWAVAVGQLLRRSGARAVTLGLLAVATERAVAAQTGIPLQPRERFWNALWVRVPGLAADLAEAENALHASAASEGDLLKAAQRLHRIAHPAPPGERARTLTSDRVA*
Ga0075436_10111247013300006914Populus RhizosphereFFAFFLRGRRFGPLVPRPAEIPRSDAEWAVAVGELLRRSGARNVTLGLLATATERAVAARTGLPLQPRERFWNALWVRAPEVAKELAAVENSLYAATAGEHDVLNAARRLHRIAHPTPGQQP*
Ga0075435_10027154833300007076Populus RhizosphereRPAEVARSDAEWSVAVGQLLRRSSARGVALGLLAAATERAVAARTGLPLQPRERFWNALWVRAPELASDLAEVENSLHASSATERDMLDAARRLHAIAHPAPSGKTR*
Ga0099794_1022059823300007265Vadose Zone SoilLWLLIAVFAGLILRGRSFGPLIPRPPEVARVDAEWAVAVGQLLRRSSARAVTLGLLASATERSLAARTGIPLQPRERFWNALWVRVPEIAADLAEAEHALAASASNEHELLKGAQRLHRIAHPVPEEKAPPVRALRAGPNSPHGGEARPRTP*
Ga0066710_10159264813300009012Grasslands SoilAEVARSDAEWSVAVGQLLRRSSARAVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVTGKPR
Ga0066710_10349304013300009012Grasslands SoilVSVSTLPMRTSARAVTQGILATANERAVESQTGLPLQPRERFWNALWVRAPEVAAKLAEVESSLYAASATEGQLLDAARRLHEIAHPAGMRAR
Ga0066710_10435104523300009012Grasslands SoilLLRRSSARAVTLGLLAHATEREVAAQNGLPMQPRERFWNALWVRAPEVAKELAEVEDSLHAAAPTGERDLLDAARRLHRIAHPVIRR
Ga0066710_10451368413300009012Grasslands SoilRTVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVSGRAPSGRSLRDGSTSPQVGKT
Ga0099830_1160883613300009088Vadose Zone SoilLLRGRRFGPLIERPAEMARSDVEWSVAVGQLLRRSSARAVTLGLLANATERAVASRNGLSVQPRERFWNALWVRDPQVAGDLAQIENSLPAASATEGGLLGAAQQLHRIAHPVTQKKTLEHR*
Ga0105240_1144995423300009093Corn RhizosphereFGLLLRGRRFGPLVERAAEVARSDAEWSIAVGQMLRRSSARAVTLGLLAHATERAVAAQHGLPMQPRASFWNALWVRAPDVARELAEVEDSLPAASVGERQLLGAARRLHGIAHPSPTSRTLRGAPPSPASLEGTRR*
Ga0066709_10143023813300009137Grasslands SoilRRSSARAVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVTGKPR*
Ga0134070_1023896413300010301Grasslands SoilSARAVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVTGKPR*
Ga0134071_1006125833300010336Grasslands SoilGQLLRRSSARRVTLGLLAGATERAVALHTGLPVQPRERFWNALWVRAPEVARELAEVENSLHTSSASEHDVLNAARRLHEIAHPAPIRRK*
Ga0134066_1021482723300010364Grasslands SoilRRFGPLIERPAEAARSDAEWSVAVGQLLRRSSARAVTLGLLAHATEREVAAQNGLPMQPRERFWNALWVRAPEVARELAEVEDSLHAAAPTGERDLLDAARRLHRIAHPVIRR*
Ga0134121_1190539613300010401Terrestrial SoilLLLRGRRFGPLIERPADAARSDAEWSVAVGQLLRRSSARAVTLGLLAHATEREVAAQNGLPMQPRERFWNALWVRAPEVARELAEVEDSLRAAPSGEGELLHAARRLHRIAHPVIRR*
Ga0137392_1148907413300011269Vadose Zone SoilEIARTDVEWAVAVGQLLRRSSARAVALGLLAVATERAVSAQTGIPLQPRERFWNALWVRVPGLAADLAEAENSLHDSAASEADLLKAAQRLHRIAHPVAPGKGGRSLPSDRVA*
Ga0137393_1017062933300011271Vadose Zone SoilAVTLGLLALATERAVATQTGIPLQPRERFWNALWVRVPGLAADLAEAENALYASAASEPELLKAAQRLHRIAHPAAAGRVA*
Ga0120148_100967633300011999PermafrostVLRGRPFGPLIPRPAHAVRADVEWAVAVGEMLRRSSARAVTLGLLASATERAVSAQTGIPLQPRERFWNALWVRVPELAAELAEAENALHAAAVSEPELLKAAQRLHRIAHPASGRVA*
Ga0137389_1072299123300012096Vadose Zone SoilPLIERPAEMARSDVEWSVAVGQLLRRSSARAVTLGLLANATERAVASRNGLSVQPRERFWNALWVRDPQVAGDLAQIENSLPAASATEGGLLGAAQQLHRIAHPVTQKKTLEHR*
Ga0137389_1130209923300012096Vadose Zone SoilWAVAVGQLLRRSSARAVTLGLLAVATERAVSAQTGIPLQPRERFWNALWVRVPGLAADLAEAENALYASAASEPELLKAAQRLHRIAHPVAAGRVA*
Ga0137374_1023692533300012204Vadose Zone SoilLVAVFFGLLLRGRRFGPLMERPAEVARSDAEWSAAVGQLLRRSSARAVTLGLLANATERAVASYNGLPLQPRERFWNALWVRAPEVASELAQVEDTLQSAAATERDLLEAAGRLHRIAHPVEERTRGAMR*
Ga0137379_1007431513300012209Vadose Zone SoilVGQLLRRSSARGVALGLLAGATERAVAARTGLPLQPRERFWNALWVRAPELAHDLAAVENSLHVSSATERDMLDAARRLHAIAHPAPSGKTR*
Ga0137386_1084109113300012351Vadose Zone SoilEWSVAVGQLLRRSSARAVTLGLLAGATERAVATRTGLPLQPRERFWNALWVRAPEVAGELAEVETSLYAAASNESDLLKAARRLHAIAHPPQKRPGQTR*
Ga0137371_1094597913300012356Vadose Zone SoilLRGRRFGPLVERPAERARSDVEWSVAVGQLLRRSSASAVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEIAAELAEIENSLFRSSATERDLLNAARRLHEIAHPAPSPRLRGIPTPSARGPR*
Ga0137375_1009343743300012360Vadose Zone SoilEVARSDAEWSAAVGQLLRRSSARAVTLGLLANATERAVASYNGLPLQPRERFWNALWVRAPEVASELAQVEDTLQSAAATERDLLEAAGRLHRIAHPVEERTRGAMR*
Ga0137390_1102087613300012363Vadose Zone SoilLVMRGRRFGPVVPRPAEVARSDVEWSVAVGQLLRRSSARAVTLGMLANATERAVAARTGLPLQPRERFWNALWVRAPEIADELARVENTLVGSSATERDLLNAARRLHEIAHPGPRK*
Ga0137396_1031001423300012918Vadose Zone SoilERPAEVARSDAEWSVAVGQLLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIEYSLYSSSSTERDLLNAASRLHAIAHPQPKKLPEPRR*
Ga0137419_1014364033300012925Vadose Zone SoilARVDAEWAVAVGQLLRRSSARAVTLGVLATATERSLAARTGIPLQPRERFWNALWVRVPEIAAELAEAENALMASASTEAELLRSAQRLHRIAHPAPQEPRLDG*
Ga0137416_1066746523300012927Vadose Zone SoilVFIGLLLRGRSFGPLIPRLPEVARVDAEWAVAVGQLLRRSSARAVTLGVLAAATERSLAARTGIPLQPRERFWNALWVRVPEIAAELAEAENALMASASTEAELLKSAQRLHRIAHPVPQERRPAS*
Ga0137416_1211908913300012927Vadose Zone SoilAEWSVAVGQLLRRSSARGVTLGLLASATERAVAARTGLPLQPRERFWNALWVRAPEVAAELSQIESSLYSSSSSERELLHAARKLHEIAHPKPKRLTGPTR*
Ga0137404_1206294623300012929Vadose Zone SoilGLLATATERSLAARTGIPLQPRERFWNALWVRLPETAADLAEAENALMASASTEAELLRSAQRLHRIAHPVPQEPRLDG*
Ga0137410_1197001813300012944Vadose Zone SoilPAEVARSDAEWSVAVGQLLQRSSARGVTLGLLAGATERAVAARTGIPLQPRERFWNALWVRAPETAAELAQIENSLYGSSSTERDLLNAARKLHAIAHPTPGGRR*
Ga0120155_103041133300013768PermafrostMRADVEWAVAVGALLRRSSARAVTLGLLASATERTVAARMGIPLQPRERFWNALWVRVPELAAELAEAENALHASAMSEPELLKAAQRLHRIAHPASGRVA*
Ga0120125_115390613300014056PermafrostPLQAARADVEWAVAVGQLLRRSSARALTLGLLASATERAVSAQTGIPLQPRERFWNALWVRVPGLAADLAEAENVLYASAAGEAELLQAAQRLHQIARPVTPDRVARGAR*
Ga0137411_113705813300015052Vadose Zone SoilRSSARAVTLGAVTLGLLATATERSLAARTGIPLQPRERFWNALWVRVPEIAADLAEAENALMASAANEHELLKSAQRLHRIAHPVPEDRRRTT*
Ga0137418_1025165013300015241Vadose Zone SoilRGRSFGPLIPRPLESPRVDAEWAVAVGQLLRRSSARAVTLGLLATATERSLAARTGIPLQPRERFWNALWVRVPEIAADLAEAENALMASASTEAELLKSAQRLHRIAHPVPQEKAPPVPALRAGPTSPHGGEARRPAS*
Ga0134073_1009824713300015356Grasslands SoilAIMWLLVAVFFGLLLRGRRFGPLVGRAPETVRSDVEWSVAVGQLLRRSSARRVTLGLLAGATERAVALHTGLPVQPRERFWNALWVRAPEVARELAEVENSLHTSSASEHDVLTAARRLHEIAHPAATRRK*
Ga0134085_1007443713300015359Grasslands SoilEWSVAVGQLLRRSSARAVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVTGKPR*
Ga0132257_10179197123300015373Arabidopsis RhizosphereVEVARSDAEWSIAVGQMLRRSSARAVTLGLLAHATERAVAAQNGLPMQPRASFWNALWVRAPDVARELAEVEDSLPAASVGERQLLGAARRLHRIAHPAPISRTLRGAPPSPGSLEETRR
Ga0187821_1033347323300017936Freshwater SedimentERPSETARSDVEWAVAVGRMLRRSNARSVTLGLLANATERSVATYAGLAAQPRDRFWQALWVRAPRVAAELAEIEDSLGASSATEAEVMRVAQRLHRIAHRGLD
Ga0066662_1054099123300018468Grasslands SoilAVGQLLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIETALYSSASTERDLLNAAGRLHAIAHPRPKKLAAPRR
Ga0066662_1105808423300018468Grasslands SoilVAVGQLLRRSSARGVTLGLLAAATERAVAARTGLPLHPRERFWNALWVRAPEVAAELAQVESSIHASSSTERGLLNAARRLHAIAHPAPKRLPGSVR
Ga0066669_1068169713300018482Grasslands SoilEWSVAVGQLLRRSSARAVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVSGRAPSGRSLRDGPTSPQVGKT
Ga0193715_100831833300019878SoilGQLLRRSGARAVTLSLLASATERSLAARTGIPLQPRERFWNALWVRAPEIAADLAEAENALVASASTEHDLLKSAQRLHRIAHPVPDERRPTT
Ga0193735_111039423300020006SoilLLIAMFAGLILRGRSFGPLIPRPAEVARVDAEWAVAVGQLLRRSGARAVTLGLLASATERSLAARTGIPLQPRERFWNALWVRAPEIAADLAEAENALVASASTEHDLLKSAQRLHRIAHPVPDERRPTT
Ga0193749_102173023300020010SoilARALTLGLLATATERAVSVQTGIPLQPRERFWNALWVRVPGLAAELADAENVLNASAASEAELLKAAQRLHQIARPAPPDSSARVAR
Ga0193733_114825123300020022SoilLVAVFVGLLLRGRRFGPLIARPAEAARSDVEWAVAVGELLRRSGARTVTLGLLATASERAVAARTGLPLKPRERFWNALWIRAPQLASELAEAENALYGSAGGEAELLKAAQRLHDVAYPISPERRPLRPTVPKEAR
Ga0210408_1080127023300021178SoilRSGARAVTLGLLAEATERAVATQTGIPLQPRERFWNALWVRVPGLAADLAEAENALHDSAASEADLLKAALRLHRIAHPPAAGPRSRAQSSDRVA
Ga0210394_1048323713300021420SoilEAARTDVEWAVAVGRLLRRSSARAVTLGLLATATERAVSSQTGIPLQPRERFWNALWVRVPGLAADLAEAENALHDSAATEADLLKAAQRLHRIAHPAAAGPRGARQPSDRVA
Ga0210410_1165543813300021479SoilVFVGLVLRGRSFGPHIQRPAEAARTDVEWAVAVGQLLRRSGARAVTLGLLAEATERAVATQTGIPLQPRERFWNALWVRVPGLAADLAEAENTLHDSAASEADLLKAAQRLHRIAHPVAPGAGSRAQSSDRVA
Ga0126371_1362927723300021560Tropical Forest SoilWLLVAVFAGLVLRGRRFGPLVARPPESARSDAEWSVAVGQLLRRSSARNVTLGLLAGATERAVASRTGLPLQPRERFWNALWVRAPEIAAELARVETSLAASSSSERDMLEAARRLHEIAHPQPRRS
Ga0247676_100589633300024249SoilWLLVAVFAGLVLRGRSFGPIITRQRELARTDVEWAVAVGQLLRRSGARAVTLGLLAVATERAVAAQTGIPLQPRERFWNALWVRVPGLASDLAEAENALLASATSEGDLLKAAQRLHRIAHPEGNGKRPRTASGV
Ga0137417_125354123300024330Vadose Zone SoilGQLLRRSSARAVTLGVLAAATERSLAARTGIPLQPRERFWNALWVRVPEIAAELAEAENALMASASTEAELLRSAQRLHRIAHPAPQEPRLDG
Ga0208848_100028713300025509Arctic Peat SoilRRSSARAVTLGLLASATERAVAARTGIPLQPRERFWNALWVRVPGLAADLAEAENALYASAASEPELLAAAQRLHHIAHPVAEERRRPVPVSGRVA
Ga0207653_1025811013300025885Corn, Switchgrass And Miscanthus RhizosphereVGQLLRRSRARAVTLGMLANATERAVAARTGLPLQPRERFWNALWVRAPEIAGDLANVENTLVASSSSERDLLNAARRLHEIAHPGPRR
Ga0207695_1098803823300025913Corn RhizosphereFGLLLRGRRFGPLVERAAEVARSDAEWSIAVGQMLRRSSARAVTLGLLAHATERAVAAQHGLPMQPRASFWNALWVRAPDVARELAEVEDSLPAASVGERQLLGAARRLHGIAHPSPTSRTLRGAPPSPASLEGTRR
Ga0209840_110032123300026223SoilTLGLLATATERAVAARTGIPLQPRERFWNALWVRVPGLAADLAEAENALYASAASEPELLAAAQRLHHIAHPVAEERRRRVPVSGRVA
Ga0209350_114963423300026277Grasslands SoilRGRRFGPLVGRPAEVARSDAEWSVAVGQLLRRSSARAVTLGLLATATERAVASRTGLPLQPRERFWNALWVRAPEVAGELAEVENSLFAASASEGQLLQAARRLHDIAHPVTGKPR
Ga0209234_119331913300026295Grasslands SoilVGQLLRRSSARAVTLGMLATATERAVASQTGLPLQPRERFWNALWVRAPEVAAKLAEVESSLYAASATEGQLLDAARRLHEIAHPAGMRAR
Ga0209236_125199523300026298Grasslands SoilRPAEVARSDAEWSVAVGQLLRRSSARGVTLGLLASATERAVAARTGLPLQPRERFWNALWVRAPEVAAELSQIESSLYSSSSSERDLLHAARKLHEIAHPKPKRLTGPTR
Ga0209239_133617513300026310Grasslands SoilRRSSARGVTLGLLAAATERAVAARTGLPLHPRERFWNALWVRAPEVAAELAQVESSIHASSSTERGLLNAARRLHAIAHPAPKRLPGSVR
Ga0209761_106925213300026313Grasslands SoilLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIETSLYSSASTERDLLNAAGRLHAIAHPRPKKLAAPRR
Ga0209131_119705013300026320Grasslands SoilSARAVTLGVLATATERSLAARTGIPLQPRERFWNALWVRVPEIAAELAEAENALMASASTEAELLKSAQRLHRIAHPVPQVPRLDG
Ga0209687_123826713300026322SoilRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIETSLYTSSSTERDLLTAASRLHAIAHPRPKNLAGPGR
Ga0209152_1006354713300026325SoilPIIERQAEVARSDAEWSVAVGQLLRRSSARGVTLGLLAGATERAVAARTGLPLQPRERFWNALWVRAPEVAAELAQIETSLYTSSSTERDLLTAASRLHAIAHPRPKNLAGPGR
Ga0209473_109627123300026330SoilVGRPAEVARSDAEWSVAVGQLLRRSSARAVTLGMLATATERAVASQTGLPLQPRERFWNALWVRAPEVAAKLAEVESSLYAASATEGQLLDAARRLHEIAHPAGMRAR
Ga0209267_119042013300026331SoilRGRRFGPLIERRAEVARSDVEWSVAVGQLLQRSSARAVTLGVLARAAERAVASRTGLPLQPRERFWNALWVRAPVIASELAEVENSLVAPSPSERDLLNAARRLHEIAHPSPRRLRK
Ga0257158_112936113300026515SoilSDFAPQAWLLTPWGAGLLWLLVAVFVGLALRGRTFGPLIPRPTEAARTDVEWAVAVGQLLRRSSARSVTLGLLASATERAVATRTGIPLQPRGRFWNALWVRAPEIAAELAEVENSLHASGSTEPDLLKMAQRLHRIAHPAPRSKA
Ga0209808_120666713300026523SoilVRSDAEWSVAVGQLLRRSSARAVTLGMLATATERAVASQTGLPLQPRERFWNALWVRAPEVAAKLAEVESSLYAASATEGQLLDAARRLHEIAHPAGMRAR
Ga0209690_127655813300026524SoilRPAEIARSDAEWSVAVGQLLRRSSARGVALGLLAGATERAVAARTGLPLQPRERFWNALWVRAPELAHDLAAVENSLHVSSATERDMLDAARRLHAIAHPAPSGKTR
Ga0209219_100688653300027565Forest SoilFGPVIRRPAEAARTDVEWAVAVGQLLRRSSARAVTLGLLAVATERSVASQTGIPLQPRERFWNALWVRVPGLAADLAEAENALHDSAANEADLLKAAQRLHRIAHPAPAGMRGQKHPSDRVA
Ga0208988_104310613300027633Forest SoilIESAVAVGQLLRRSSARAVTLGLLATATERSLAARTGIPLQPRERFWNALWVRVPEIAADLAEAENALMASASTEAELLKSAQRLHRIAHPAPQERHIPS
Ga0209388_109632723300027655Vadose Zone SoilARAVTLGLLASATERSLAARTGIPLQPRERFWNALWVRVPELAAELAEAENALHAAAVSEPELLKAAQRLHSIAHPASGRVA
Ga0209736_114165213300027660Forest SoilVTLGLLASATERSVAVRTGIPMQPRERFWNALWVRVPAVAADLAEAENALYASAASEADLLKAAQRLHRIAHPVAAGSRDRAPVSDRVA
Ga0209011_109145323300027678Forest SoilEAARADVEWAVAVGQLLRRSGARAVTLGLLATATERSVAVRTGIPLQPRERFWNALWVRVPSVAAELAEAENALYASAASEADLMKAAQRLHRIAHPVAVGPRDRAPVSDRVA
Ga0208991_109047413300027681Forest SoilLIRRPAEAARADVEWAVAVGQLLRRSSAHAVTLGLLASATERSVSVQTGIPLQPRERFWNALWVRAPQLAAELAEAENVINASAAGEAELFRAAQRLHQIARPVTPDKVAKVAR
Ga0208989_1000940413300027738Forest SoilAPQAWLMTPWGAGLLWLLVAVFAGLLLRGRSFGPLIRRPAEAARADVEWAVAVGQLLRRSSARALTLGLLASATERSVSVQTGIPLQPRERFWNALWVRAPQLAAELAEAENVINASAAGEAELFRAAQRLHQIARPVTPDKVAKVAR
Ga0209074_1013782923300027787Agricultural SoilSVAVGQLLRRSGARAVTLGVLATATERAVAARTGLPLQPRERFWNALWVRAPEVASELAQAESMLPASSADERALLDAARRLHDIAHPQTRRSSH
Ga0209074_1044469613300027787Agricultural SoilDAEWSVAVGQLLRRSSARSVTLGLLAAATERAVATHTGLPVQPRDRFWNALWVRDPETARRLAEVEGSLHRADVSERDVLEAARKLHDIAHPQPAVTRR
Ga0209580_1027737223300027842Surface SoilNFGPLIGRRADAERSDAEWSTAVGRLLRRSGARAVTLGLLALATERAVASRTGLPQQPRERFWNALWVRAPELAADLAEAENTLQSSSATETDLLGAAQRLHRLAHPVSGERLRQRVKGE
Ga0209580_1046029613300027842Surface SoilVGELLRRAGARTVTLGLLATATERAVALRTGLPVQPREQFWNALWVRAPELAAELAAAENDLHQSGATEAGLLNAARRLHRIAQPIAEERRRQGAVRRPA
Ga0209488_1028098713300027903Vadose Zone SoilAVAVGQLLRRSSARSVTLGLLASATERAVATRTGIPLQPRERFWNALWVRAPEIAADLADVENSLHASGSTEPDLLKMAQRLHRIAHPAPRGKG
Ga0209698_1090915713300027911WatershedsAMFFGLLLRGRQFGPLIPRPAEPARSDVEWSVAVGRLLRRSGARSLTLGLLTAATERSVADRTGLPLQPRDRFWNALWVRAPVLAAELAEAEVALNGSAESDRDLVHAARRLHRIAYPTIGSVAQAPQGSLDERRP
Ga0209069_1055025313300027915WatershedsVTLGLLATATERSVATQTGIPLQPRERFWNALWVRVPGLAADLAEAEYALHDSAANEADLLKAAQRLHRIAHPAPAGKRGRTEPSDRVA
Ga0307281_1029836113300028803SoilARAVTLGLLATATERSLAARTGIPLQPRERFWNALWVRVPEIAADLAEAENALITSASTEQDLLKSAQRLHRIAHPAPIERRHSA
Ga0307312_1081780413300028828SoilLLWLLIAVFAGLILRGRSFGPLIPRPAEVARVDAEWAVAVGQLLRRSGARAVTLGLLASATERSLAARTGIPLQPRERFWNALWVRAPEIAADLAEAENALVASASTEHDLLKSAQRLHRIAHPVPDERRPTT
Ga0308309_1035444313300028906SoilVLSDFAPQAWVTTPWGIGLLWLLVAVFVGLVLRGRSFGPIIRRPAEAARTDVEWAVAVGRLLRRSSARAVTLGLLATATERAVSSQTGIPLQPRERFWNALWVRVPVLAADLAEAENALHDSAATEADLLKAAQRLHRIAHPAAAGTRGARQPSDRVA
Ga0307374_1023165123300031670SoilQLLRRSSARAVTLGLLASATERAVASRTGLPLQPRERFWNALWVRAPELAAELAEAENTLHTSSATEADLLSAARRLHQIAHPVPGVPTGARSKGAV
Ga0307373_1023997813300031672SoilQLLRRSSARAVTLGLLASATERAVASRTGLPLQPRERFWNALWVRAPELAAELAEAENTLHTSSATEADLLSAARRLHQIAHPVSGVPTGARSKGAV
Ga0307373_1027509513300031672SoilDAGRSDSEWSVAVGQLLRRSSARAVTLGLLASATERAVASRTGLPLQPRERFWNALWVRAPELAAELAEAENTLHASSATEADLLGAARRLHRIAHPVPGLPTGARSTGAV
Ga0307469_1210292613300031720Hardwood Forest SoilFGPLIPRPAEVARVDAEWAVAVGQLLRRSSARAVTLGLLASATERSLAARTGIPLQPRERFWNALWVRMPEIAADLADTENALGASAATEHELLKTAQRLHRIARPVPIERRPTN
Ga0307469_1252093813300031720Hardwood Forest SoilLLERPIEGARSDAEWSVAVGQLLRRSSARAVTLGLLAHATERAVAAQNGLPMQPRERFWNALWVRAPEVARELAAVEESLHAASGGDRDLLQAARRLHRIAHPVIRR
Ga0307468_10199030223300031740Hardwood Forest SoilWAVAVGQLLRRSSARAMTLGLLAHATERAVASYNGLPLQPRERFWNALWVRAPEVARELAQIEDSLQAASASERDVLNAARKLHAIAHPVAERARTKAR
Ga0307475_1042770723300031754Hardwood Forest SoilRGRRFGPLVGRPAEVARSDVEWSVAVGQLLRRSGARAVTLGVLANATERAVAIRTGLPLQPRERFWNALWVRAPEVAGELAHAESTLYAASADEHALLEAARRLHDIAHPGTRRT
Ga0307471_10175691923300032180Hardwood Forest SoilAFFLRGRRFGPLVPRPPELVRSDAEWSVAVGELLRRSGARDVTLGLLATATERAVAARTGLPLQPRERFWNALWVRAPAIAADLAAVENSLYSASASERDVLNAARRLHRIAHPAPGQP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.