NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F055862


Overview

Basic Information
Family ID F055862
Family Type Metagenome / Metatranscriptome
Number of Sequences 138
Average Sequence Length 118 residues
Representative Sequence MTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSSPGDEPSELYSGLRAVGFTPRLPSAQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHTTASVR
Number of Associated Samples 102
Number of Associated Scaffolds 138

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 62.96 %
% of genes near scaffold ends (potentially truncated) 29.71 %
% of genes from short scaffolds (< 2000 bps) 88.41 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.72

Most Common Taxonomy
Group Bacteria (50.725 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (20.290 % of family members)
Environment Ontology (ENVO) Unclassified (32.609 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (47.101 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 20.69%, β-sheet: 21.38%, Coil/Unstructured: 57.93%

Predicted 3D Structure

pTM-score: 0.72

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.104.1.3: LplA-like | d3a7ra1 | 3a7r | 0.66414
d.161.1.1: ADC synthase | d2g5fa1 | 2g5f | 0.62736
d.282.1.1: SSo0622-like | d1tlja_ | 1tlj | 0.62313
d.104.1.0: automated matches | d5idha1 | 5idh | 0.62207
d.199.1.1: DNA-binding C-terminal domain of the transcription factor MotA | d1kafa_ | 1kaf | 0.62195



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 138 Family Scaffolds
PF01176 | eIF-1a | 51.45
PF13451 | zf-trcl | 5.80
PF00528 | BPD_transp_1 | 4.35
PF01381 | HTH_3 | 2.90
PF00296 | Bac_luciferase | 1.45
PF13462 | Thioredoxin_4 | 1.45
PF01458 | SUFBD | 1.45
PF12911 | OppC_N | 0.72
PF00501 | AMP-binding | 0.72
PF00496 | SBP_bac_5 | 0.72
PF01081 | Aldolase | 0.72
PF16822 | ALGX | 0.72
PF04191 | PEMT | 0.72
PF13439 | Glyco_transf_4 | 0.72
PF13419 | HAD_2 | 0.72
PF09947 | DUF2180 | 0.72

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 138 Family Scaffolds
COG0361 | Translation initiation factor IF-1 | Translation, ribosomal structure and biogenesis [J] | 51.45
COG0719 | Fe-S cluster assembly scaffold protein SufB | Posttranslational modification, protein turnover, chaperones [O] | 1.45
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 1.45
COG0800 | 2-keto-3-deoxy-6-phosphogluconate aldolase | Carbohydrate transport and metabolism [G] | 0.72



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 50.72 %
Unclassified | root | N/A | 49.28 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300004463|Ga0063356_103877130 | Not Available | 644
3300004463|Ga0063356_105190794 | Not Available | 559
3300005167|Ga0066672_10283324 | All Organisms → cellular organisms → Bacteria | 1074
3300005167|Ga0066672_10895551 | All Organisms → cellular organisms → Bacteria | 550
3300005176|Ga0066679_10734108 | All Organisms → cellular organisms → Bacteria | 637
3300005177|Ga0066690_10412018 | All Organisms → cellular organisms → Bacteria | 917
3300005178|Ga0066688_10914682 | All Organisms → cellular organisms → Bacteria | 540
3300005187|Ga0066675_10214908 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1358
3300005187|Ga0066675_11323937 | All Organisms → cellular organisms → Bacteria | 530
3300005435|Ga0070714_100811729 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 906
3300005445|Ga0070708_100021045 | All Organisms → cellular organisms → Bacteria | 5508
3300005445|Ga0070708_100876145 | Not Available | 843
3300005447|Ga0066689_10899646 | Not Available | 547
3300005467|Ga0070706_100030566 | All Organisms → cellular organisms → Bacteria | 4964
3300005467|Ga0070706_100063406 | All Organisms → cellular organisms → Bacteria | 3415
3300005467|Ga0070706_100715102 | Not Available | 929
3300005467|Ga0070706_101009041 | Not Available | 767
3300005468|Ga0070707_100961016 | Not Available | 819
3300005468|Ga0070707_101770204 | Not Available | 585
3300005471|Ga0070698_100201080 | All Organisms → cellular organisms → Bacteria | 1928
3300005471|Ga0070698_101260517 | All Organisms → cellular organisms → Bacteria | 689
3300005534|Ga0070735_10700527 | All Organisms → cellular organisms → Bacteria | 598
3300005537|Ga0070730_10631608 | Not Available | 682
3300005537|Ga0070730_10806727 | Not Available | 591
3300005542|Ga0070732_10005810 | All Organisms → cellular organisms → Bacteria | 6707
3300005542|Ga0070732_10330283 | Not Available | 917
3300005542|Ga0070732_10447590 | Not Available | 781
3300005561|Ga0066699_10191256 | All Organisms → cellular organisms → Bacteria | 1420
3300005561|Ga0066699_10723336 | Not Available | 708
3300005575|Ga0066702_10065972 | All Organisms → cellular organisms → Bacteria | 1988
3300005575|Ga0066702_10193003 | Not Available | 1230
3300005576|Ga0066708_10175996 | All Organisms → cellular organisms → Bacteria | 1330
3300005576|Ga0066708_10307583 | All Organisms → cellular organisms → Bacteria | 1015
3300005764|Ga0066903_106214932 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 624
3300006028|Ga0070717_10249440 | Not Available | 1568
3300006028|Ga0070717_11935407 | Not Available | 532
3300006032|Ga0066696_10292481 | All Organisms → cellular organisms → Bacteria | 1057
3300006177|Ga0075362_10586313 | All Organisms → cellular organisms → Bacteria | 575
3300006237|Ga0097621_101841876 | All Organisms → cellular organisms → Bacteria | 577
3300006800|Ga0066660_10113300 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1946
3300006800|Ga0066660_10213104 | Not Available | 1483
3300006800|Ga0066660_10419518 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1110
3300009012|Ga0066710_100518790 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1798
3300009012|Ga0066710_101421971 | Not Available | 1074
3300009038|Ga0099829_11343382 | All Organisms → cellular organisms → Bacteria | 591
3300009088|Ga0099830_11043892 | Not Available | 677
3300009089|Ga0099828_10327355 | Not Available | 1380
3300009090|Ga0099827_10075695 | Not Available | 2613
3300009400|Ga0116854_1125669 | All Organisms → cellular organisms → Bacteria | 681
3300010100|Ga0127440_1047160 | Not Available | 631
3300010125|Ga0127443_1128687 | Not Available | 753
3300010142|Ga0127483_1022702 | Not Available | 660
3300010154|Ga0127503_11327548 | All Organisms → cellular organisms → Bacteria | 537
3300010166|Ga0126306_10069755 | All Organisms → cellular organisms → Bacteria | 2481
3300010857|Ga0126354_1087803 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1133
3300011120|Ga0150983_13231167 | Not Available | 537
3300011270|Ga0137391_10284115 | All Organisms → cellular organisms → Bacteria | 1429
3300011271|Ga0137393_10174432 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1806
3300011271|Ga0137393_10481045 | All Organisms → cellular organisms → Bacteria | 1065
3300012096|Ga0137389_10956438 | Not Available | 734
3300012199|Ga0137383_10862198 | All Organisms → cellular organisms → Bacteria | 661
3300012201|Ga0137365_10407423 | All Organisms → cellular organisms → Bacteria | 1003
3300012203|Ga0137399_10244147 | All Organisms → cellular organisms → Bacteria | 1471
3300012203|Ga0137399_11286324 | Not Available | 615
3300012204|Ga0137374_10001823 | All Organisms → cellular organisms → Bacteria | 24651
3300012208|Ga0137376_11250011 | Not Available | 632
3300012209|Ga0137379_11310539 | Not Available | 629
3300012212|Ga0150985_105034700 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium | 790
3300012212|Ga0150985_122208171 | All Organisms → cellular organisms → Bacteria | 1610
3300012350|Ga0137372_10378770 | Not Available | 1077
3300012356|Ga0137371_10965472 | Not Available | 647
3300012363|Ga0137390_11030806 | Not Available | 774
3300012363|Ga0137390_11174812 | Not Available | 715
3300012363|Ga0137390_11954903 | All Organisms → cellular organisms → Bacteria | 513
3300012373|Ga0134042_1152774 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 949
3300012376|Ga0134032_1211172 | All Organisms → cellular organisms → Bacteria | 515
3300012380|Ga0134047_1088903 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1174
3300012382|Ga0134038_1050877 | Not Available | 520
3300012384|Ga0134036_1052760 | Not Available | 571
3300012389|Ga0134040_1293338 | Not Available | 612
3300012390|Ga0134054_1030118 | Not Available | 604
3300012391|Ga0134035_1035384 | Not Available | 568
3300012403|Ga0134049_1179132 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 990
3300012409|Ga0134045_1131757 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1025
3300012410|Ga0134060_1078124 | All Organisms → cellular organisms → Bacteria | 501
3300012918|Ga0137396_10307068 | All Organisms → cellular organisms → Bacteria | 1173
3300012951|Ga0164300_11150932 | Not Available | 511
3300013308|Ga0157375_11680910 | Not Available | 751
3300013772|Ga0120158_10223554 | Not Available | 963
3300014969|Ga0157376_11779219 | Not Available | 652
3300015357|Ga0134072_10345330 | All Organisms → cellular organisms → Bacteria | 570
3300015371|Ga0132258_12865722 | Not Available | 1199
3300018468|Ga0066662_10244079 | Not Available | 1462
3300018468|Ga0066662_10623054 | Not Available | 1014
3300018482|Ga0066669_10544407 | Not Available | 1010
3300018482|Ga0066669_11419284 | Not Available | 629
3300021151|Ga0179584_1144090 | Not Available | 743
3300022527|Ga0242664_1084451 | Not Available | 631
3300025910|Ga0207684_10013959 | All Organisms → cellular organisms → Bacteria | 6940
3300025910|Ga0207684_10056009 | Not Available | 3344
3300025910|Ga0207684_11079352 | Not Available | 669
3300025922|Ga0207646_10207249 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1771
3300025929|Ga0207664_11233254 | Not Available | 666
3300025929|Ga0207664_11242371 | Not Available | 664
3300026308|Ga0209265_1145569 | Not Available | 606
3300026319|Ga0209647_1052113 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2195
3300026527|Ga0209059_1023720 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2787
3300026542|Ga0209805_1406500 | Not Available | 525
3300026552|Ga0209577_10439254 | Not Available | 916
3300027842|Ga0209580_10010492 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 3985
3300027842|Ga0209580_10224399 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 933
3300027842|Ga0209580_10276085 | Not Available | 836
3300027862|Ga0209701_10353135 | All Organisms → cellular organisms → Bacteria | 831
3300027874|Ga0209465_10375228 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 712
3300027875|Ga0209283_10128601 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1672
3300027882|Ga0209590_10291617 | All Organisms → cellular organisms → Bacteria | 1046
3300027903|Ga0209488_11236376 | Not Available | 501
3300027986|Ga0209168_10332086 | Not Available | 744
3300027986|Ga0209168_10467569 | Not Available | 610
3300028536|Ga0137415_10727617 | Not Available | 804
3300030635|Ga0247627_10168266 | All Organisms → cellular organisms → Bacteria | 648
3300030829|Ga0308203_1066398 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium | 572
3300030998|Ga0073996_12125547 | All Organisms → cellular organisms → Bacteria | 1107
3300030998|Ga0073996_12135252 | Not Available | 624
3300030998|Ga0073996_12267099 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1242
3300031047|Ga0073995_11865833 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1115
3300031047|Ga0073995_12098863 | All Organisms → cellular organisms → Bacteria | 906
3300031093|Ga0308197_10260005 | Not Available | 621
3300031231|Ga0170824_110513239 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium | 869
3300031421|Ga0308194_10310861 | All Organisms → cellular organisms → Bacteria | 549
3300031469|Ga0170819_15178287 | All Organisms → cellular organisms → Bacteria | 589
3300031996|Ga0308176_10878245 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 941
3300032756|Ga0315742_11761521 | Not Available | 678
3300034384|Ga0372946_0026580 | All Organisms → cellular organisms → Bacteria | 2557
3300034681|Ga0370546_032112 | Not Available | 751
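
Each scaffold identifier listed above appears to join two parts with a `|`: a numeric prefix that matches the Taxon OID column of the Associated Samples table, and a scaffold name. The following is a minimal Python sketch, under that assumption, of splitting identifiers and counting scaffolds per sample. The function name `split_scaffold_id` and the three-item example list are hypothetical and are not part of NMPFamsDB or IMG/M.

```python
from collections import Counter
from typing import Tuple


def split_scaffold_id(identifier: str) -> Tuple[str, str]:
    """Split 'TaxonOID|ScaffoldName' into its two parts.

    Assumes the layout seen in the scaffold list: one '|' separating
    a numeric Taxon OID from the scaffold name.
    """
    taxon_oid, sep, scaffold = identifier.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return taxon_oid, scaffold


# Three identifiers copied from the scaffold list, for illustration only.
example_ids = [
    "3300004463|Ga0063356_103877130",
    "3300004463|Ga0063356_105190794",
    "3300005167|Ga0066672_10283324",
]

# Count how many family scaffolds come from each sample (Taxon OID).
scaffolds_per_sample = Counter(split_scaffold_id(i)[0] for i in example_ids)
```

Applied to the full list, the resulting counter could be joined against the Associated Samples table by Taxon OID to see which samples contribute the most family members.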




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 20.29%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 15.94%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 12.32%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 10.87%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 8.70%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.80%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 5.07%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.35%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 2.17%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 2.17%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.45%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 1.45%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.45%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.45%
Soil | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil | 0.72%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 0.72%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 0.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.72%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.72%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.72%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 0.72%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.72%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.72%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005534 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006177 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-2 | Host-Associated
3300006237 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2) | Host-Associated
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009400 | Soil microbial community of the Robinson Ridge, Antarctica. Combined Assembly of Gp0139162, Gp0138857, Gp0138858 | Environmental
3300010100 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010125 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010142 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010154 | Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome) | Environmental
3300010166 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27 | Environmental
3300010857 | Boreal forest soil eukaryotic communities from Alaska, USA - W1-3 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012373 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012376 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012380 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012382 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012384 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012389 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012390 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012391 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012403 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012409 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012410 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental
3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated
3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental
3300014969 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG | Host-Associated
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300021151 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300022527 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-O (Metagenome Metatranscriptome) (v2) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025929 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes) | Environmental
3300026308 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes) | Environmental
3300026319 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes) | Environmental
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027874 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027986 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300030635 | Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Bnb4 (Eukaryote Community Metatranscriptome) | Environmental
3300030829 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_357 (Metagenome Metatranscriptome) | Environmental
3300030998 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-3A (Eukaryote Community Metatranscriptome) | Environmental
3300031047 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1B (Eukaryote Community Metatranscriptome) | Environmental
3300031093 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental
3300031469Fir Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031996Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2EnvironmentalOpen in IMG/M
3300032756Forest Soil Metatranscriptomics Site 2 Humus Litter Mineral Combined AssemblyEnvironmentalOpen in IMG/M
3300034384Forest soil microbial communities from Eldorado National Forest, California, USA - SNFC_MG_KNG_2.2EnvironmentalOpen in IMG/M
3300034681Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_121 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0063356_10387713013300004463Arabidopsis Thaliana RhizosphereMNQAAEPAADDGSPAIDHKVAQSPLLPGVSVSLTARQRGNYDIVVYSPAGDEPAELYAGLRAVGFVSGVAADARGGSRRQAFGREGSALLGGWTGPERERFVGEARRTLRRFGFAFVPEVPYTERTRSS*
Ga0063356_10519079413300004463Arabidopsis Thaliana RhizosphereMTQTIDHKVSVSALLPGVSVSLNARMPASFDVVVFSPTGDQPVGLVSGLRAVGFMPRGPAEQWATGERQKFGRDGSALVGGWTGPEREQFIADARRTLRRFGFVLVPEIPVATRGK*
Ga0066672_1028332433300005167SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYLQSTPR*
Ga0066672_1089555123300005167SoilMTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSAPGDEPSELYSGLRAVGFNPGPPDTHSAGVVRQSFGRDGSALLGGWTGPERERFIAEARRTL
Ga0066679_1073410813300005176SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPARARLLADARRTLRRFGFAFVPEVPYLQSTPR*
Ga0066690_1041201823300005177SoilMSQRAHPEEVATSMSETIDHKVSVSPLLPGVSVSLNAQSHGGFQVVVFSSADTEPSELYSGLRAVGFTSRAPVPQKPGVVRQSFARDGSALLGGWTGPERERFLAEARRTLRRFGFTFVPEVPFRQNSPS*
Ga0066688_1091468223300005178SoilMSQRAHPEEVATSMSETIDHKVSVSPLLPGVSVSLNAQSHGGFQVIVFSSADTEPSELYSGLRAVGFTSRAPVPQKPGVVRQSFARDGSALLGGWTGPERERFLAEARRTLRRFGFTFVPEVPYRQNTPR*
Ga0066675_1021490823300005187SoilMSETIDHKVSVSPVLPGVSVSLNAQPYGGFQIVVFSPAANEPSELYSGLRAVGFSARATAAQSTATVRQSFGRDGSGLLGGWTGPERERFLGEARRTLRRFGFAFVPEVPHVQRA*
Ga0066675_1132393713300005187SoilMSQPASQKEVGSSMSKTIDHKVSVSPLLPGVSVSLNAQPYGGFQVVVFSSAGNEPSELYAGLRAVGFASRDPGPQPTGGVRQSFGRAGSALLGGWTAPERERFIAEARRTLRRFGFAFVPEVPSQQKAPR*
Ga0070714_10081172923300005435Agricultural SoilMTQSTSDGKAPSISENIGHKVSWSSLLPGVAVSLNAQSHGGFQVVVFSPPGNEPSEVYAGLRAVGFAAREQSAQVVGTVRKSFGRDGSGLLGGWTGPERERFLADARRTLRRFGFAFVPEVAYARGSSG*
Ga0070708_10002104553300005445Corn, Switchgrass And Miscanthus RhizosphereMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRLPSAQKAGVVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFSFVPEVPHSKGAH*
Ga0070708_10087614533300005445Corn, Switchgrass And Miscanthus RhizosphereTHTINHQVSVSPLLPGVSVSLNARPYASYDVVVFSSPGDEPSELYSGLRAVGFNPGLPSAQQAGMVRQSFGRDGSALLGGWTGPERERYLAEARRTLRRFGFSFVPEVPHTKGSMR*
Ga0066689_1089964613300005447SoilESLRPDARTRELPLPMSQRPILKEIAASMSETIDHKVSVSPLLPGVSVSLNAQPYGGFQVVVFSSAGDEPSELYSGLCAVGVNTRGPVPQQSGGVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFAFVPEVPFRQTSSG*
Ga0070706_10003056653300005467Corn, Switchgrass And Miscanthus RhizosphereMTHTINHQVSVSPLLPGVSVSLNARPYASYDVVVFSSPGDEPSELYSGLRAVGFNPGLPSAQQAGMVRQSFGRDGSALLGGWTGPERERYLAEARRTLRRFGFSFVPEVPHTKGSMR*
Ga0070706_10006340653300005467Corn, Switchgrass And Miscanthus RhizosphereMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRSPSAQKAGVVRQSFGRDGSALLGGWTGPERERFLAEARRTLRR
Ga0070706_10071510213300005467Corn, Switchgrass And Miscanthus RhizosphereMTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSSPGDEPSELYSGLRAVGFNPGQPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHV
Ga0070706_10100904113300005467Corn, Switchgrass And Miscanthus RhizosphereMSQRAHLEETVTGMSETIAHKVAVSALLPGVSVSLNAQPYGGFQVVVFTSAGSEPVELYSRLRAIGFKSRDAVAQRAGGVRQGFGRDGSALLGGWTGPERERFLGEARRTLRGFGFTFVPEVPYLQSSPR*
Ga0070707_10096101613300005468Corn, Switchgrass And Miscanthus RhizosphereLSLPLNQPKESAATMTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSSPGDEPSELYSGLRAVGFNPGQPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHVKGSVR*
Ga0070707_10177020423300005468Corn, Switchgrass And Miscanthus RhizosphereMSQRAHLEETVTGMSETIAHKVAVSALLPGVSVSLNAQPYGGFQVVVFTAAGSEPAELYSRLRAIGFKSRDAVAQRAGGVRQGFGRDGSALLGGWTGPERERFLGEARRTLRGFGFTFVPEV
Ga0070698_10020108043300005471Corn, Switchgrass And Miscanthus RhizosphereMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRSPSAQKAGMVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFSFVPEVPHSKGAH*
Ga0070698_10126051723300005471Corn, Switchgrass And Miscanthus RhizosphereMSQTIAHKVSVSALLPGVSVSLNALPYGGFQVVVFSSTGNEPSELYSGLRAVGFTSRDPVAQPGGNVRQSFGRDGSALLGGWTGPERERFMAEARRTLRRFGFAFVPEVPYLAGTPR*
Ga0070735_1070052723300005534Surface SoilMSQAIGHKVSVSQLLPGVSVSFNAQSFGGFEVVVYSPDGEEPLELYSGLRAVGFASRHPATQSDGVRQPFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVRYQQKPAV*
Ga0070730_1063160823300005537Surface SoilMTQTINHQVSISSILPGVSVSLNARPYASFDVVVYSSPGDEPSELYSGLRAVGFTPGIPSAQPTGTVRQSFGRDGSALLGGWTGPERERYIAEARRTLRRFGFAFVPEVPHVKGSIR*
Ga0070730_1080672723300005537Surface SoilMTQTINHQVSVSQLLPGVSVSLNARAYASFDIVVFSSPGDEPAELYSGLRAVGFNPGTPSAQTEGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHSK
Ga0070732_1000581093300005542Surface SoilMSQPVGPLKKTGGTVPATIGHKVSVSSLLPGVSVSLNAQPYGGFQVVVFASAENEPSELYSGLRAVGFTSRAPLVQSAGTVRQSFGRDGSALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYQQNSSH*
Ga0070732_1033028313300005542Surface SoilPPGGPVKETGGSVSATIGHKVSVSPLLPGVSVSLNAQSYGGFQVVVFASAGNEPTELYSGLRAVGFTSRDPVMLSAGTVRQSFGRDGSGLLGGWTGPERERFIAEARRTLRRFGFAFVPEVPYQQTSSH*
Ga0070732_1044759023300005542Surface SoilMSETIGHKVSVSPVLPGVSVSLNAQPYGGLQVVVFASDDNEPSELYSGLRAVGFIPRAPVTRSAGTVRQSFGRDGSALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYQKTSSH*
Ga0066699_1019125653300005561SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLR
Ga0066699_1072333623300005561SoilMSQRAHPEEVATSMSETIDHKVSVSPLLPGVSVSLNAQSHGGFQVIVFSSADTEPSELYSGLRAVGFTSRAPVPQKPGVVRQSFARDGSALLGGWTGPERERFLAEARRALRR
Ga0066702_1006597243300005575SoilMTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSAPGDEPSELYSGLRAVGFNPGPPDTHSAGVVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHTKGAAR*
Ga0066702_1019300323300005575SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTSR*
Ga0066708_1017599633300005576SoilMLSQPIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTSR*
Ga0066708_1030758333300005576SoilGGVESTMTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSAPGDEPSELYSGLRAVGFNPGPPDTHSAGVVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHTKGAAR*
Ga0066903_10621493223300005764Tropical Forest SoilMSTSKPQVSVSQLLPGVSVSLHAQPYGGFHVVVFSPVDSEPSELYSGLRAVGFTSGGPIAQPAGVVRHSFRRDGSALLGGWTVPERERFIAEARRTLRRFGFARVPEVPHSQATPRRH*
Ga0070717_1024944033300006028Corn, Switchgrass And Miscanthus RhizosphereMSQAPDHKVSVSSLLPGVSVSFNAQSFGGFEVVVYSAASDEPSELYSGLRAVGFTARHPAAQADLVRQTFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVRYQEKLPSGGG*
Ga0070717_1193540713300006028Corn, Switchgrass And Miscanthus RhizosphereVSPVLPGVSVSVNAQSFGGYEVVVYSAAGSEPSELYSGLRAVGFTSRRPAAQADVVRQTFERDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVRYQQDR*
Ga0066696_1029248123300006032SoilMSQRAHPEEVAMSMSETIDHKVSVSPLLPGVSVSLNAQSHGGFQVTVFSSAGTEPSELYSGLRAVGFTSRGPAAQRPGVVRESFARDGSALLGGWTGPERERFLAEARRTLRRFGFTFVPEVPFRQNSPS*
Ga0075362_1058631313300006177Populus EndosphereMSQTINHKVSVSQLLPGVSVSLNARPYASYDVVVFSPAGDEPSELYAGLLAVGFTPRELAVDSAGSVRKCFGRDGSALLGAWTGPERERFTAEARRTLRRFGFTFVPEVPYVQAARA*
Ga0097621_10184187613300006237Miscanthus RhizosphereMTQLEKEGPTMSQTINHKVSISQLLPGVSVSLNARPPAMYDVVVFSAAGNEPSELYTGLRSIGFTARGPAVQEAGGVCQHFGREGSALLGGWTGPQREQYIGDARRTLRRFGFTLVPEIPYASTASQ*
Ga0066660_1011330013300006800SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYLQSSAH*
Ga0066660_1021310423300006800SoilMTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSAPGDEPSELYSGLRAVGFNPGPPDTHSAGVVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHTKGAGR*
Ga0066660_1041951813300006800SoilRQGAGRNESLRPHARTRELPLPMSQRAHPEEVATSMSETIDHKVSVSPLLPGVSVSLNAQSHGGFQVVVFSSADTEPSELYSGLRAVGFTSGAPVPQKPGVVRQTFARDGSALLGGWTGPERERFLAEARRALRRFGFTFVPEVLYKQNTPR*
Ga0066710_10051879023300009012Grasslands SoilMSQTINHKVSISQLLPGVSVSLNARPPASYDVVVFSAEGNEPSELYSGLRAIGFNARGPAVQQAGGVRQHFGREGAALLGGWTGPQREQFIGDARRTLRRFGFTLVPEIPYASTSR
Ga0066710_10142197133300009012Grasslands SoilMSQTIDHKVSVSTLLPGVSVSLNAQPYGGFQIVVFSSADNEPSELYSGLRAVGFAYRNPASHGTGTVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFTFVPEVPFPQTAQ
Ga0099829_1134338213300009038Vadose Zone SoilMSETIDHKVSVSPLLPGVSVSLNAQPYGGFEVKVFSSAGIEPSELYAGLRAVGFSSRAPLPQRAGMLHQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFAFVPEIPYSESNLR*
Ga0099830_1104389213300009088Vadose Zone SoilMTHTINHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPSELYSGLRAVGFTPRPLSSQAAGMVRQSFGRDGSALLGGWSGPERERFLAEARRTLRRFGFSFVPEVPHAKGSVR*
Ga0099828_1032735523300009089Vadose Zone SoilMTQTINHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPSELYSGLRAVGFTPRPLSAQAAGVVRQSFGRDGSALLGGWSGPERERFLAEARRTLRRFGFSFVPEVPHTKGSVR*
Ga0099827_1007569533300009090Vadose Zone SoilMKERKAIMTLTIDHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRTSSAQPAGMVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHAKPSAG*
Ga0116854_112566923300009400SoilGIAVSLTAKPPYSTFDVVVFAQAGSDPQELYSGLRAVGFKARAQPVQESDGVRQSFGRDGSGMFASWSTPERERFIAEARRTLRRFGFAFVPEIHLAGDNLP*
Ga0127440_104716023300010100Grasslands SoilMTLKESASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTSR
Ga0127443_112868733300010125Grasslands SoilMIQKETASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTSR*
Ga0127483_102270213300010142Grasslands SoilMSLTINHKVSISQLLPGVSVSLNARPPASYDVVVFSAEGNEPSELYSGLRAIGFNARGPAVQQAGGVRQHFGREGAALLGGWTGPQREQFIGDARRTLRRFGFTLVPEIPYASASQ*
Ga0127503_1132754813300010154SoilVSQTINHKVSISQLLPGVSVSLNARPPATYDVVVFSSAGDEPSELYSGLRSIGFTARGSAVQEAGGVRQHFGREGSALHGGWTGPQREQYIGDARRTLRRFGFTLVPEIPYASTASR*
Ga0126306_1006975513300010166Serpentine SoilMIQHTKGAASMAQAIDHKVSVSTLLPGVSVSLNARAPASYDVVVFSPAGDQPSELFSGLRAVGFTSRGPSEQGASGERQRFGREGSGLLGGWTGPEREQFIAEARRTLRRFGFLFVPEIAHGAGASKETGARR*
Ga0126354_108780333300010857Boreal Forest SoilMTQQKEKGSSMSQTINHQVSVSQLLPGVSVSLNARPYASYDIVVFSQAGEEPSELYSGLRAVGFTPRTAAVEPAGTIRQSFGRDGSALLGGWSGPERERFMGEARRTLRRFGFSFVPEVPHSAATSR*
Ga0150983_1323116723300011120Forest SoilSVSQLLPGVSVSLNARPYASFDVVVFSSPGDEPSELYSGLRAVGFTPGEPSAAHPVGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFAFVPEVPHVKGSIR*
Ga0137391_1028411533300011270Vadose Zone SoilMTIEHKVSISQLLPGVSVSLNALPYASFDVVVFSAPGDEPAELYSGLRAVGFTPRAVSAQPAGMVRQSFRREGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHTKASAG*
Ga0137393_1017443213300011271Vadose Zone SoilMTQTINHQVSVSQLLPGVSVSLNARAYASFDVVVFSSPGDEPSELYSGLRAVGFTPRLPSAQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHTTASVR*
Ga0137393_1048104523300011271Vadose Zone SoilMTIEHKVSISQLLPGVSVSLNALPYASFDVVVFSAPGDEPAELYSGLRAVGFTPRAASAQPAGMVRQSFRREGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHTKASAG*
Ga0137389_1095643823300012096Vadose Zone SoilMKESKAIMTLTIDHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRTSSAQPAGMVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHT*
Ga0137383_1086219823300012199Vadose Zone SoilMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSAPGDEPAELYSGLRAVGFTPRTSSAQSAGMVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFSFVPEVPHTKASAG*
Ga0137365_1040742323300012201Vadose Zone SoilMSETIGHKVSVSPVLPGVSVSLNAQPYGGFQIVVFSPAADEPSELYSGLRAVGFTARGMVAQAAGTVRQSFGRDGSGLLGGWTGPERERFLGEARRTLRRFGFAFVPEVPYLQPTPR*
Ga0137399_1024414733300012203Vadose Zone SoilMTIEHKVSISQLLPGVSVSLNALPYASFDVVVFSAPGDEPAELYSGLRAVGFTPRAVSAQPAGMVRQSFRREGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHTRPSAG*
Ga0137399_1128632423300012203Vadose Zone SoilMTQTISHQVSVSQLLPGVSVSLNARPYASFDVVVFSSPDEAPAELFSGLRAVGFTPRLPTTESAGIVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHVKGSAR*
Ga0137374_1000182363300012204Vadose Zone SoilMSQTINHKVSISQLLPGVSVSLNARPPASYDVVVFSASGNEPSELYSGLRAIGFNARGPAVQQAGGVRQHFGREGAALLGGWTGPQREQFIGDARRTLRRFGFTLVPEIPYASTASQ*
Ga0137376_1125001123300012208Vadose Zone SoilMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSAPGDEPAELYSGLRAVGFTPRTSSAQSAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHTKGAGR*
Ga0137379_1131053923300012209Vadose Zone SoilMSTSKTQVSLSELLPGVSVSLHAQPYGGFHVVVFSPVESEPSELYSGLRAVGFRSGGPIPQPAGVVRHSFRRDGSALLGGWTGPERERFSAEARRTLRRFGFARVPEVPHPQAAARQH*
Ga0150985_10503470023300012212Avena Fatua RhizosphereMSLIINHKVAVSELLPGVAVSLNARPPARYDIVVFSQTDGEPTELYSGLRAVGFTPGGPAETRADGGVRQVFGREGSALLGGWSGPERERFIGDARRTLRRFGFSLVPEIPVTPRA*
Ga0150985_12220817133300012212Avena Fatua RhizosphereMTQREKERPPMSQTINHKVAISSLLPGVSVSLNARPPSTYDVVVFSAAENEPLELYSGLRSIGFTARGPAVQEAGGMRQNFGRQGSAMHGGWTGPQREQYIGDARRTLRRFGFTLVPEIPYASSTSN*
Ga0137372_1037877023300012350Vadose Zone SoilMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRLPSAQKAGVVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFSFVPEVPHTKGAH*
Ga0137371_1096547223300012356Vadose Zone SoilMSTSKTQVSLSELLPGVSVSLHAQPYGGFHVVVFSPVESEPSELYSGLRAVGFRSGGPIPQPAGVVRHSFRRDGSALLGGWTGPERERFIAEARRTLRRFGFARVPEVPHPQAAARQH*
Ga0137390_1103080633300012363Vadose Zone SoilMTQIINHDVSVSQLLPGVSVSLNARPYASFDVVVFSSPGDEPSELYSGLRAVGFNPGKPSAQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHVKGSV
Ga0137390_1117481213300012363Vadose Zone SoilTTKMPNTIGHKVSVSPLLPGVSVSLNAQPYGGFHVVVFSSAGNEPSELYSGLRALGFVSRDTVTQSAGDVRQSFGRDGSTLLGGWTGPERERFLGDARRTLRRFGFTFVPEVAYLAGTRS
Ga0137390_1195490313300012363Vadose Zone SoilMKESKAIMTLTIDHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRTSSAQPAGMVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVP
Ga0134042_115277423300012373Grasslands SoilVAGRAQPYDLRAAALVTAIVDPAREGRPIMSLTINHKVSISQLLPGVSVSLNARPPASYDVVVFSAAGNEPSELYSGLRAIGFNARGPAVQQAGGVRQHFGREGAALLGGWTGPQREQFIGDARRTLRRFGFTLVPEIPYASASQ*
Ga0134032_121117213300012376Grasslands SoilMTLKESASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANLVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPEVPLHKKGYGGSRIM*
Ga0134047_108890323300012380Grasslands SoilVAGRAQPYDLRAAALVTAIVDPAREGRPIMSLTINHKVSISQLLPGVSVSLNARPPASYDVVVFSAEGNEPSELYSGLRAIGFNARGPAVQQAGGVRQHFGREGAALLGGWTGPQREQFIGDARRTLRRFGFTLVPEIPYASASQ*
Ga0134038_105087713300012382Grasslands SoilMIQKETASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPEVPFLKTTSR*
Ga0134036_105276023300012384Grasslands SoilMTLKESASMLSQTIGHKVSVSQLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPEV
Ga0134040_129333813300012389Grasslands SoilMIQKETASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTS
Ga0134054_103011813300012390Grasslands SoilMTLKESASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFV
Ga0134035_103538423300012391Grasslands SoilMIQKESASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTSR*
Ga0134049_117913223300012403Grasslands SoilMSLTINHKVSISQLLPGVSVSLNARPPASYDVVVFSAEGNEPSELYSGLRAIGFNARGPAVQQAGGVRQHFGREGAALLGGWTGPQREQFIGDARRTLRRFGFTLVPEIPYASTSR*
Ga0134045_113175723300012409Grasslands SoilMSLTINHKVSISQLLPGVSVSLNARPPASYDVVVFSAEGNEPSELYSGLRAIGFNARGPAVQQAGGVRQHFGREGAALLGGWTGPQREQFIGDARRTLRRFGFTLVPEIPYASTSQ*
Ga0134060_107812423300012410Grasslands SoilMIQKETASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKT
Ga0137396_1030706823300012918Vadose Zone SoilMIIDHKVSISQLLPGVSVSLNARPYPGFDVVVFSSPGDEPAELYSGLRAVGFTPRLPSAEPTGMVRQSFSRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHTRPSAG*
Ga0164300_1115093223300012951SoilMTQLEKEGPTMSQTVNHKVSISQLLPGVSVSLNARPPAMYDVVVFSAAGNEPSELYTGLRSIGFTARGPAVQEAGGVCQHFGREGSALLGGWTGPQREQYIGDARRTLRRFGFTLVPEIPYASTASQ*
Ga0157375_1168091013300013308Miscanthus RhizosphereMSQTINHKVSISQLLPGVSVSLNARPPAMYDVVVFSAAGNEPSELYTGLRSIGFTARGPAVQEAGGVCQHFGREGSALLGGWTGPQREQYIGDARRTLRRFGFTLVPEIPYASTASQ*
Ga0120158_1022355423300013772PermafrostMTQTINHQVSVSQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFNPGLPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHVKGTPR*
Ga0157376_1177921913300014969Miscanthus RhizosphereMTQLEKEGPTMSQTINHKVSISQLLPGVSVSLNARPPAMYDVVVFSAAGNEPSELYTGLRSIGFTARGPAVQEAGGVCQHFGREGSALLGGWTGPQREQYIGDARRTLRRFGFTLVELL
Ga0137409_1063865033300015245Vadose Zone SoilVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRSPSVQKPRTVRQSFGRDGSALLGGWSGPERERFLADARRTLRR
Ga0134072_1034533023300015357Grasslands SoilMTLKESASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYLQSTPR*
Ga0132258_1286572223300015371Arabidopsis RhizosphereMSQAPDHKVSVSTLLPGVSVSFNAQSFGGFEVVVYSAVGSEPSELYSGLRALGFTSRQPAAQADVLRQTFGRDGSALLGGWTGPERERFLADARRTLRRFGFAFVPEVRYQENLPPGPAART*
Ga0066662_1024407933300018468Grasslands SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYLQSTPR
Ga0066662_1062305413300018468Grasslands SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDIVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTSR
Ga0066669_1054440723300018482Grasslands SoilMSETISHKVSVSVVLPGVSVSLNAQPYGGFQIVVFSPVANEPSELYSGLLAVGFTAREMAAQAAGTVRQSFGRDGSGLLGGWTGPGRERFLGEARRTLRRFGFAFVPEVPYVQHTLR
Ga0066669_1141928413300018482Grasslands SoilQSINHKVSISQMLPGVSVSLNARPPATYDVVIVSAAVDEPSELYTGLRAIGFSARGPAVQQAGGVRQNFGREGSAMHGGWTGPERERFIGDARRTLRRFGFVLVPEIPYMQAAPQ
Ga0179584_114409023300021151Vadose Zone SoilMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSTPGDEPAELYSGLRAVGFTPRTSSAQSAGMVRQSFGRDGSALLGGWSGPERERFLAEARRTLRRFGFSFVPEVPHTKTSAG
Ga0242664_108445113300022527SoilMSEAIGHKVSVSPALPGVSVSFNAQSHGGFEVLVYSADGSEPSELYAGLQAIGFTSRGQGVQPGSVRQSFGRHGSGLVGGWTGPERERYLADARRTLRRFGFAFVPEVPYRPSLAV
Ga0207684_1001395963300025910Corn, Switchgrass And Miscanthus RhizosphereMTHTINHQVSVSPLLPGVSVSLNARPYASYDVVVFSSPGDEPSELYSGLRAVGFNPGLPSAQQAGMVRQSFGRDGSALLGGWTGPERERYLAEARRTLRRFGFSFVPEVPHTKGSMR
Ga0207684_1005600913300025910Corn, Switchgrass And Miscanthus RhizosphereMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRSPSAQKAGVVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFSFVPEVPHSKGAH
Ga0207684_1107935213300025910Corn, Switchgrass And Miscanthus RhizosphereMSQRAHLEETVTGMSETIAHKVAVSALLPGVSVSLNAQPYGGFQVVVFTSAGSEPVELYSRLRAIGFKSRDAVAQRAGGVRQGFGRDGSALLGGWTGPERERFLGEARRTLRGFGFTFVPEVPYLQSSPR
Ga0207663_1024519813300025916Corn, Switchgrass And Miscanthus RhizosphereIKDGFEVMVYSSDGNEPSELYAGLRAVGFTSRGPAAQPGVVRQSFGRYGSALLGGWTSPERERFLADARRTLRRFGFAFIPEVPYQAKAAG
Ga0207646_1020724943300025922Corn, Switchgrass And Miscanthus RhizosphereMTIDHKVSISQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRLPSAQKAGVVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFSFVPEVPHSKGAH
Ga0207664_1123325413300025929Agricultural SoilMTQSTSDGKAPSISENIGHKVSWSSLLPGVAVSLNAQSHGGFQVVVFSPPGNEPSEVYAGLRAVGFAAREQSAQVVGTVRKSFGRDGSGLLGGWTGPERERFLADARRTLRRFGFAFV
Ga0207664_1124237113300025929Agricultural SoilMSQAPDHKVSVSSLLPGVSVSFNAQSFGGFEVVVYSAASDEPSELYSGLRAVGFTARHPAAQADLVRQTFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVAYAHSS
Ga0209265_114556923300026308SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKT
Ga0209647_105211353300026319Grasslands SoilMTIEHKVSISQLLPGVSVSLNALPYASFDVVVFSAPGDEPAELYSGLRAVGFTPRAVSAQPAGMVRQSFRREGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHAKASAG
Ga0209059_102372033300026527SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVVVFSPTGEEPSELYAGLRAVGFTPSGPGTQPADVVRQSFGRDGSALLGGWTGPERERFIADARRTLRRFGFAFVPEVPFLKTTSR
Ga0209805_140650013300026542SoilSVSPVLPGVSVSLNAQPYGGIQIVVFSPAANEPSELYTGLRAVGFTARDMVAQAAGTVRQSFGRDGSGLLGGWTGPERERFLGEARRTLRRFGFAFVPEVPYVQSTPQ
Ga0209577_1043925443300026552SoilMTLKESASMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFGFAFVPE
Ga0209580_1001049253300027842Surface SoilMSQPVGPLKKTGGTVPATIGHKVSVSSLLPGVSVSLNAQPYGGFQVVVFASAENEPSELYSGLRAVGFTSRAPLVQSAGTVRQSFGRDGSALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYQQNSSH
Ga0209580_1017745123300027842Surface SoilSLNAQPYGGFQVVVFASDDNEPSELYSGLRAVGFIPRAPVTRSAGTVRQSFGRDGSALLGGWTGPERERFLADARRTLRRFGFAFVPEVPYQKTSSH
Ga0209580_1022439933300027842Surface SoilMSETIGHKVSVSPVLPGVSVSLNAQPYGGLQVVVFASDDNEPSELYSGLRAVGFTSRAPVTRSARIVRQSFGRDGSALLGGRTGPERERFLAEARRTLRRFGFGTAVPEN
Ga0209580_1027608523300027842Surface SoilGVSVSLNAQSYSGFQVVVFASAGNEPTELYSGLRAVGFTSRDPVMLSAGTVRQSFGRDGSGLLGGWTGPERERFIAEARRTLRRFGFAFVPEVPYQQTSSH
Ga0209701_1035313523300027862Vadose Zone SoilMTHTINHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPSELYSGLRAVGFTPRPLSSQAAGMVRQSFGRDGSALLGGWSGPERERFLAEARRTLRRFGFSFVPEVPHTKGSVR
Ga0209465_1037522813300027874Tropical Forest SoilMSTSKPQVSVSQLLPGVSVSLHAQPYGGFHVVVFSPVDSEPSELYSGLRAVGFTSGGPIAQPAGVVRHSFRRDGSALLGGWTVPERERFIAEARRTLRRFGFARVPE
Ga0209283_1012860143300027875Vadose Zone SoilMTQTINHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPSELYSGLRAVGFTPRPLSAQAAGVVRQSFGRDGSALLGGWSGPERERFLAEARRTLRRFGFSFVPEVPHTKGSVR
Ga0209590_1029161713300027882Vadose Zone SoilSKAIMTLTIDHQVSVSPLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRTSSAQPAGMVRQSFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHAKPSA
Ga0209488_1123637623300027903Vadose Zone SoilMLSQTIGHKVSVSQLLPGVAVSLNARPYASYDVTVYSPAGEEPSELYAGLRAVGFTPRGPVPPAANVVRQSFGRDGGALLGGWTGPERERFLADARRTLRRFG
Ga0209168_1033208623300027986Surface SoilMTQPTSDGKRGTISESIGHKVSWSSLLPGVAVSLNAQSYGGFQVVVFSSAASEPAEVYAGLRAVGFAARDQSTLVEGGIRQSFGRDGSASLGGWTGPERERFLADARRTL
Ga0209168_1046756923300027986Surface SoilMSQAIGHKVSVSQLLPGVSVSFNAQSFGGFEVVVYSPDGEEPLELYSGLRAVGFASRHPATQSDGVRQPFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVRYQQKPAV
Ga0137415_1072761723300028536Vadose Zone SoilMIIDHKVSISQLLPGVSVSLNARPYPGFDVVVFSSPGDEPAELYSGLRAVGFTPRLPSAEPTGMVRQSFSRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPHTKASAG
Ga0247627_1016826623300030635SoilMTQHTEGAPSVTTTVNHKVSLSTLLPGVSVSLNARQPAGYDVVVISPAGNQPSELISGLRAVGFRPRGVVEQSAAGERQLFGREGSALLGGWTGPEREQFIADARRALRRFGFSRIPEIPYAPGASK
Ga0308203_106639813300030829SoilMTQTINHSVSISQLLPGVSVSLNARPYSSFDVVVFSSPGDEPSELYSGLRAVGFNPGQPSNQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHVKGSIR
Ga0073996_1212554733300030998SoilMSQSTISHQVSVSQLLPGVSVSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFNPGTPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFAFVPEVPHVKGSVR
Ga0073996_1213525213300030998SoilMTQTISHQVSVSQLLPGVSVSLNARPYASFDVVVFSSPDEAPAELFSGLRAVGFTPRLPTTESAGIVRQSFGRDGSALLGGWTGPERERFLAEARRT
Ga0073996_1226709923300030998SoilMTQQKEKGSSMSQTINHQVSVSQLLPGVSVSLNARPYASYDIVVFSQAGEEPSELYSGLRAVGFTPRTAAVEPAGTIRQSFGRDGSALLGGWSGPERERFMGEARRTLRRFGFSFVPEVPHSAATSR
Ga0073995_1186583333300031047SoilMSQTINHQVSISQLLPGVSVSLNARPYASYDIVVFSPAGEEPSELYSGLRAVGFTPRTAAVEPSGTIRQSFGRDGSALLGGWSGPERERFMGEARRTLRRFGFSFVPEVPHSAATSR
Ga0073995_1209886313300031047SoilTLLPGVSVSLNARPYASYDVVVFSNEGDEPSELYSGLRAVGFKPRPAAAPAEGGVRQSFGRDGSALLGGWTGGERERFIGDARRTLRRFGFAFVPEVPHTDSQSPKAR
Ga0308197_1026000513300031093SoilMTQTINHSVSVSQLLPGVSVSLNARPYSSFDVVVFSSPGDEPSELYSGLRAVGFNPGQPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFAFVPEVPHVKGSVR
Ga0170824_11051323913300031231Forest SoilMSQTINHKVSISQLLPGVSVSLNARPPATYDVVVFSAAGNEPSELYSGLRSIGFTARGPAVLQADGVRQHFGREGSAAFGGWTAPQREQFIGDARRTLRRFGFTLVPEIPHVSTASQ
Ga0308194_1031086123300031421SoilMTQTISHSVSVSQLLPGVSVSLNARPYSSFDVVVFSSPGGEPSELYSGLRAVGFNPGQPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHVKGSIR
Ga0170819_1517828713300031469Forest SoilMMQKASEENTASLQERIGHKVSVSTLLPGVAVSLNALSYGGFQVVVFSSAGNDPSEVYAGLRAVGFASRDESVAPAGGVRQSFGRDGSALLGGWTGPERERFLAEARKKKIDKKEIKIHK
Ga0308176_1087824513300031996SoilMTQSSSEEKMGSMSESVGHKVSVSSLLPGVAVSLNAQSHGGFQVVVFSSAGNEPSEVYAGLRAVGFASRDHAISADGSVRQTFGRDGSALLGGWTGPERERFLAEARRTLRRFGFAFVPEVPYAVHSSR
Ga0315742_1176152113300032756Forest SoilPTMPPTINHEVSVSQLLPGVSVSLNARPYASFDIVVFSAPGGEPSELYSGLRAVGFNPGQPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFAFVPEVPHVKGSVR
Ga0372946_0026580_404_7873300034384SoilMSEQKDSAVTMSQTIGHKVSVSALLPGVSVSLNERPYPSYDIVVFSPAGEVPADLYSGLRAVGFNPGASVADASGAVRQTFGRTGSAMLGGWTGPERERFIGEARRTLRRFGFTLVPEIPHPASKAR
Ga0370546_032112_399_7493300034681SoilMTQTINHSVSVSQLLPGVSVSLNARPYSSFDVVVFSSPGDEPSELYSGLRAVGFNPGQPSTQPAGMVRQSFGRDGSALLGGWTGPERERFIAEARRTLRRFGFSFVPEVPHVKGSIR
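
The rows above are flattened: the Protein ID, Sample Taxon ID, Habitat, and Sequence columns run together with no separators. The following is a minimal Python sketch for splitting such a row back into its four fields. It rests on assumptions inferred from the rows shown here, not on anything the page states: every Sample Taxon ID is exactly ten digits, every habitat label ends in a lowercase letter, and every sequence consists only of uppercase letters with an optional trailing `*` (which may mark a stop codon, though the page does not say so). The names `FamilyRow`, `ROW_RE`, and `split_row` are hypothetical and exist only for this illustration.

```python
import re
from typing import NamedTuple


class FamilyRow(NamedTuple):
    """One row of the Family Sequences table (hypothetical helper type)."""
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str
    has_trailing_star: bool


# Assumptions (inferred from the rows, not documented by the page):
#   - the protein ID starts with "Ga" and contains only digits and underscores
#   - the sample taxon ID is the last 10 digits before the habitat text
#   - the habitat contains no digits and ends with a lowercase letter
#   - the sequence is uppercase letters, optionally followed by "*"
ROW_RE = re.compile(
    r"^(?P<protein_id>Ga\d+_[\d_]+)"
    r"(?P<taxon_id>\d{10})"
    r"(?P<habitat>\D*[a-z])"
    r"(?P<sequence>[A-Z]+)"
    r"(?P<star>\*?)$"
)


def split_row(row: str) -> FamilyRow:
    """Split one fused table row into its four columns."""
    match = ROW_RE.match(row.strip())
    if match is None:
        raise ValueError(f"row does not fit the assumed layout: {row[:40]!r}")
    return FamilyRow(
        protein_id=match["protein_id"],
        taxon_id=match["taxon_id"],
        habitat=match["habitat"],
        sequence=match["sequence"],
        has_trailing_star=bool(match["star"]),
    )


# A row copied from the table above.
EXAMPLE_ROW = (
    "Ga0137409_1063865033300015245Vadose Zone Soil"
    "VSLNARPYASFDVVVFSSPGDEPAELYSGLRAVGFTPRSPSVQKPRTVRQSFGRDGSALLGGWSGPERERFLADARRTLRR"
)

if __name__ == "__main__":
    parsed = split_row(EXAMPLE_ROW)
    print(parsed.protein_id, parsed.taxon_id, parsed.habitat, sep=" | ")
```

A row that does not fit the assumed layout raises `ValueError` instead of being split silently at the wrong position, so a habitat label ending in an uppercase letter or a digit would be flagged rather than misparsed.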




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice