NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F095127

Metagenome / Metatranscriptome Family F095127

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F095127
Family Type Metagenome / Metatranscriptome
Number of Sequences 105
Average Sequence Length 138 residues
Representative Sequence MRHHRAAARLLTLLPLVALLAGCESSGESRTVQMSERPRWTAGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPA
Number of Associated Samples 98
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 50.48 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 87.62 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score0.58

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (75.238 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(17.143 % of family members)
Environment Ontology (ENVO) Unclassified
(40.952 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(40.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 13.13%    β-sheet: 36.87%    Coil/Unstructured: 50.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.58
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 105 Family Scaffolds
PF00156Pribosyltran 33.33
PF01894UPF0047 20.00
PF00215OMPdecase 4.76
PF00988CPSase_sm_chain 0.95
PF02142MGS 0.95
PF02786CPSase_L_D2 0.95
PF02770Acyl-CoA_dh_M 0.95
PF02738MoCoBD_1 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 105 Family Scaffolds
COG0432Thiamin phosphate synthase YjbQ, UPF0047 familyCoenzyme transport and metabolism [H] 20.00
COG0505Carbamoylphosphate synthase small subunitAmino acid transport and metabolism [E] 1.90
COG1960Acyl-CoA dehydrogenase related to the alkylation response protein AidBLipid transport and metabolism [I] 0.95


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms75.24 %
UnclassifiedrootN/A24.76 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000891|JGI10214J12806_11327594All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1041Open in IMG/M
3300002907|JGI25613J43889_10067667All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium972Open in IMG/M
3300005293|Ga0065715_10499729All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium781Open in IMG/M
3300005336|Ga0070680_100443523All Organisms → cellular organisms → Bacteria1108Open in IMG/M
3300005355|Ga0070671_101259230All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria652Open in IMG/M
3300005367|Ga0070667_100846176All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria850Open in IMG/M
3300005444|Ga0070694_101834174Not Available517Open in IMG/M
3300005518|Ga0070699_100641215Not Available969Open in IMG/M
3300005518|Ga0070699_101275835All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria674Open in IMG/M
3300005530|Ga0070679_101861185Not Available550Open in IMG/M
3300005546|Ga0070696_101654011All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria551Open in IMG/M
3300005549|Ga0070704_101566117Not Available607Open in IMG/M
3300006050|Ga0075028_100926963Not Available538Open in IMG/M
3300006806|Ga0079220_10295210Not Available998Open in IMG/M
3300006806|Ga0079220_10307200Not Available984Open in IMG/M
3300006854|Ga0075425_100340355All Organisms → cellular organisms → Bacteria1727Open in IMG/M
3300007076|Ga0075435_101384093All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria616Open in IMG/M
3300007265|Ga0099794_10114875All Organisms → cellular organisms → Bacteria1351Open in IMG/M
3300009088|Ga0099830_10075638All Organisms → cellular organisms → Bacteria2454Open in IMG/M
3300009089|Ga0099828_11652132All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria564Open in IMG/M
3300009176|Ga0105242_12909547Not Available529Open in IMG/M
3300009822|Ga0105066_1083695All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria693Open in IMG/M
3300010046|Ga0126384_12164305Not Available535Open in IMG/M
3300010360|Ga0126372_10070920All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium2479Open in IMG/M
3300010362|Ga0126377_12771677Not Available565Open in IMG/M
3300010397|Ga0134124_10910397All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium886Open in IMG/M
3300011270|Ga0137391_11452437Not Available531Open in IMG/M
3300011270|Ga0137391_11452439Not Available531Open in IMG/M
3300011395|Ga0137315_1066234Not Available522Open in IMG/M
3300011414|Ga0137442_1085671All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium676Open in IMG/M
3300011436|Ga0137458_1279195Not Available509Open in IMG/M
3300012199|Ga0137383_10622452All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria789Open in IMG/M
3300012203|Ga0137399_10157384All Organisms → cellular organisms → Bacteria1820Open in IMG/M
3300012210|Ga0137378_11154032All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria690Open in IMG/M
3300012226|Ga0137447_1014294All Organisms → cellular organisms → Bacteria1137Open in IMG/M
3300012361|Ga0137360_11103948All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria685Open in IMG/M
3300012925|Ga0137419_10661695All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium844Open in IMG/M
3300012929|Ga0137404_10370833All Organisms → cellular organisms → Bacteria1257Open in IMG/M
3300012929|Ga0137404_12292973Not Available505Open in IMG/M
3300012931|Ga0153915_10868273Not Available1048Open in IMG/M
3300012931|Ga0153915_13014042All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria548Open in IMG/M
3300012958|Ga0164299_10273756Not Available1022Open in IMG/M
3300012960|Ga0164301_10171869All Organisms → cellular organisms → Bacteria1350Open in IMG/M
3300012961|Ga0164302_10146722All Organisms → cellular organisms → Bacteria1383Open in IMG/M
3300012986|Ga0164304_10013415All Organisms → cellular organisms → Bacteria3732Open in IMG/M
3300014325|Ga0163163_10179518All Organisms → cellular organisms → Bacteria2164Open in IMG/M
3300014882|Ga0180069_1067207All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium823Open in IMG/M
3300015170|Ga0120098_1005999All Organisms → cellular organisms → Bacteria1204Open in IMG/M
3300015254|Ga0180089_1090562All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium634Open in IMG/M
3300015373|Ga0132257_102128534All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria725Open in IMG/M
3300017936|Ga0187821_10001693All Organisms → cellular organisms → Bacteria6837Open in IMG/M
3300018027|Ga0184605_10382435All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium632Open in IMG/M
3300018028|Ga0184608_10262952All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium759Open in IMG/M
3300018053|Ga0184626_10147037All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1002Open in IMG/M
3300018061|Ga0184619_10077427All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1473Open in IMG/M
3300018063|Ga0184637_10234675All Organisms → cellular organisms → Bacteria1120Open in IMG/M
3300019880|Ga0193712_1099201All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium629Open in IMG/M
3300019883|Ga0193725_1083644All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium773Open in IMG/M
3300019998|Ga0193710_1031960Not Available531Open in IMG/M
3300019999|Ga0193718_1101605All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria588Open in IMG/M
3300020022|Ga0193733_1159761Not Available604Open in IMG/M
3300020070|Ga0206356_10685387Not Available1001Open in IMG/M
3300021086|Ga0179596_10048797All Organisms → cellular organisms → Bacteria1745Open in IMG/M
3300021418|Ga0193695_1110937Not Available581Open in IMG/M
3300021559|Ga0210409_11333196All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria593Open in IMG/M
3300022534|Ga0224452_1096033All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium904Open in IMG/M
3300025160|Ga0209109_10219012All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium933Open in IMG/M
3300025904|Ga0207647_10128973All Organisms → cellular organisms → Bacteria1487Open in IMG/M
3300025906|Ga0207699_10340797Not Available1056Open in IMG/M
3300025916|Ga0207663_10186313All Organisms → cellular organisms → Bacteria1486Open in IMG/M
3300025917|Ga0207660_10435853All Organisms → cellular organisms → Bacteria1058Open in IMG/M
3300025931|Ga0207644_11208378All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria635Open in IMG/M
3300025961|Ga0207712_10303472All Organisms → cellular organisms → Bacteria1311Open in IMG/M
3300025986|Ga0207658_10061268All Organisms → cellular organisms → Bacteria2811Open in IMG/M
3300026023|Ga0207677_10304819All Organisms → cellular organisms → Bacteria1317Open in IMG/M
3300026285|Ga0209438_1005840All Organisms → cellular organisms → Bacteria4179Open in IMG/M
3300026371|Ga0257179_1008261All Organisms → cellular organisms → Bacteria1042Open in IMG/M
3300026475|Ga0257147_1001833All Organisms → cellular organisms → Bacteria2277Open in IMG/M
3300026480|Ga0257177_1002077All Organisms → cellular organisms → Bacteria2103Open in IMG/M
3300026494|Ga0257159_1072073All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria596Open in IMG/M
3300026551|Ga0209648_10734047Not Available539Open in IMG/M
3300027765|Ga0209073_10103267All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1008Open in IMG/M
3300027846|Ga0209180_10054387All Organisms → cellular organisms → Bacteria2216Open in IMG/M
3300027862|Ga0209701_10247866All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1042Open in IMG/M
3300027862|Ga0209701_10392440All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria776Open in IMG/M
3300027882|Ga0209590_10160504All Organisms → cellular organisms → Bacteria1401Open in IMG/M
3300027886|Ga0209486_10933421Not Available578Open in IMG/M
3300027903|Ga0209488_10110330All Organisms → cellular organisms → Bacteria2066Open in IMG/M
3300027952|Ga0209889_1015988All Organisms → cellular organisms → Bacteria1756Open in IMG/M
3300028809|Ga0247824_10695825All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium619Open in IMG/M
3300028884|Ga0307308_10188228All Organisms → cellular organisms → Bacteria989Open in IMG/M
3300028885|Ga0307304_10397520Not Available622Open in IMG/M
(restricted) 3300031150|Ga0255311_1122310All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria571Open in IMG/M
(restricted) 3300031248|Ga0255312_1024453All Organisms → cellular organisms → Bacteria1441Open in IMG/M
3300031820|Ga0307473_10021925All Organisms → cellular organisms → Bacteria2623Open in IMG/M
3300031943|Ga0310885_10594347All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria613Open in IMG/M
3300032174|Ga0307470_11559842All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria551Open in IMG/M
3300032180|Ga0307471_100158575All Organisms → cellular organisms → Bacteria2191Open in IMG/M
3300033432|Ga0326729_1010284All Organisms → cellular organisms → Bacteria1644Open in IMG/M
3300033501|Ga0326732_1048594All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria701Open in IMG/M
3300033502|Ga0326731_1047738Not Available1006Open in IMG/M
3300033513|Ga0316628_102187523All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria733Open in IMG/M
3300033551|Ga0247830_11062954All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium646Open in IMG/M
3300034090|Ga0326723_0041959All Organisms → cellular organisms → Bacteria1924Open in IMG/M
3300034090|Ga0326723_0201163All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria882Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil17.14%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil11.43%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.67%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil5.71%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment4.76%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.76%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil4.76%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil3.81%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere3.81%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.86%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.86%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.86%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil2.86%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere2.86%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands1.90%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.90%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.90%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.90%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.95%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.95%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.95%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.95%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere0.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.95%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.95%
FossillEnvironmental → Terrestrial → Soil → Fossil → Unclassified → Fossill0.95%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.95%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere0.95%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.95%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.95%
Switchgrass RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere0.95%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.95%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.95%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300005293Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005367Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaGHost-AssociatedOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005530Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009822Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011395Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT200_2EnvironmentalOpen in IMG/M
3300011414Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT266_2EnvironmentalOpen in IMG/M
3300011436Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT642_2EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012226Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2EnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300014325Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaGHost-AssociatedOpen in IMG/M
3300014882Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231B'_16_10DEnvironmentalOpen in IMG/M
3300015170Fossil microbial communities from human bone sample from Teposcolula Yucundaa, Mexico - TP48EnvironmentalOpen in IMG/M
3300015254Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10DEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300019880Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a1EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m1EnvironmentalOpen in IMG/M
3300019999Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a1EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300020070Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025904Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025986Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026475Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-AEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027886Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027952Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300028809Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033501Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF12FN SIP fractionEnvironmentalOpen in IMG/M
3300033502Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fractionEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI10214J12806_1132759433300000891SoilMRPDRAAARWLTLLPLVALLAGCGSTGASRTVQIAERPQWTAGDSWTYRGKGRDGAYTITRRVLGERVFEGVLAYEIEAGDSHYWYTRQLGYLARVTGGQTVRRAMPPEDWQWPLQVGRS
JGI25613J43889_1006766723300002907Grasslands SoilMRHSRAAARLLALLPLVALLAGCESAGESRTVQLTERPRWTAGESWTYRGKGKDGAYTITRKVLREGIFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGRTWSATVTWVDGPAQDRSFVLTG
Ga0065715_1049972923300005293Miscanthus RhizosphereMRHSRAAARLLAPLALVALLAGCESAGESRTVQLTERPRWTAGESWTYRGKGKDGAYTITRRVLREGIFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGRTWSATVTWV
Ga0070680_10044352313300005336Corn RhizosphereMRPDRAAARWLTLLPLVALLAGCGSTGASRTVQIAERPQWTAGDSWTYRGKGRDGAYTITRRVLGERVFEGVLAYEIEAGDSHYWYTRQLGYLARVTGGQTVRRAMPPEDWQWPLQVGRSWSATVTWVNGPAQDQRFVLTGVWVV
Ga0070671_10125923023300005355Switchgrass RhizosphereVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDTQQQTY
Ga0070667_10084617623300005367Switchgrass RhizosphereVVRLRRPFFPPVVLLAVLVVSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDTQQQTYTLTSVWT
Ga0070694_10183417413300005444Corn, Switchgrass And Miscanthus RhizosphereMRPDRAAARWLTLLPLVALLAGCGSTGASRTVQIAERPQWTAGDSWTYRGKGRDGAYTITRRVLGERVFEGVLAYEIEAGDSHYWYTRQLGYLARVTGGQTVRRAMPPEDWQWPLQVG
Ga0070699_10064121513300005518Corn, Switchgrass And Miscanthus RhizosphereVPGALAHIRYAFLVRLRRPLHAAAGLLATVALSLGLAGCESGGISRTVHIAERPTWTAGNSWTYRGRGPNGAYTITRKVLKEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQW
Ga0070699_10127583513300005518Corn, Switchgrass And Miscanthus RhizosphereVVRLRRPFFPPVVLLAVLVVSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTD
Ga0070679_10186118513300005530Corn RhizosphereMRPDRAAARWLTLLPLVALLAGCGSTGASRTVQIAERPQWTAGDSWTYRGKGRDGAYTITRRVLGERVFEGVLAYEIEAGDSHYWYTRQLGYLARVTGGQTVRRAMPPEDWQWPLQVGKQWSATVTWTDKGDQERTFVLTGVWLVEAY
Ga0070696_10165401113300005546Corn, Switchgrass And Miscanthus RhizosphereVTGALPHLRYALKVRLRRPIHTAAGLLAVLAVAFSLAGCESSGVSRTVHIGERPTWVVGDSWTYRGRGPKGPYTVTRKVVSEGLFEGRDCYQIEVGDSRYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPIQTGKQWSATVTWTDS
Ga0070704_10156611723300005549Corn, Switchgrass And Miscanthus RhizosphereMRHSRAAARLLAPLALVALLAGCESAGESRTVQLTERPRWTAGESWTYRGKGKDGAYTITRRVLREGIFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGRTWSATVTWVDG
Ga0075028_10092696313300006050WatershedsALPQPAPRRPPRLAPSQVSGALPHLRYALKVRLRRPIHAVAGLLAVLAVAFSLAGCESSGVSRTVHIGERPTWVVGDSWTYRGRGPKGPYSVTRKVLSEGLFEGRDCYQIEAGDSRYWYTKQLGYLARTNGDKTVRLAAPPEDWQWPIQIGKQWSATVTWTDSGEQTRTFVLTGVWLV
Ga0079220_1029521023300006806Agricultural SoilLGYAFVVTRPSRVLAAAGLLTALAFSIALAGCESSGVSRIVQMAERPVWAVGDSWTYRGRGPAGAYTVTRKVLREGIFAARDCYQIEAGDARYWYTKQLGYLARTSGEKTVRLASPPEDWQWPLQVGKQWSA
Ga0079220_1030720013300006806Agricultural SoilMTTLALSIALAGCESSGVSRTVPIAERPTWSAGDSWTYRGRGPSGTYNVTRKVLREGVFEGREAYEIEAGDTHYWYTKQLGYLARTKGDQTTRLARPPEDWQWPIQV
Ga0075425_10034035513300006854Populus RhizosphereVVRLRRPFFPPVVLLAVLVVSMGFAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWTDRTDTQQQTY
Ga0075435_10138409323300007076Populus RhizosphereVAGLLAVLAVAFSLAGCESSGVSRTVHIGERPTWVVGDSWTYRGRGPKGPYTVTRKVLSEGLFEGRDCYQIEAGDSRYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPIQTGKQWSATVTWTDSGEQTRTFVLTGVW
Ga0099794_1011487513300007265Vadose Zone SoilVLAGCESGGISRTVHVAERPIWTAGNSWTYRGRGPNGAYTVTRKVLKEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSATVTWTDQGEQ
Ga0099830_1007563853300009088Vadose Zone SoilMRHHRAAARLLTLLPLVALLAGCESSGESRTVQMSERPRWTAGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWP
Ga0099828_1165213223300009089Vadose Zone SoilMILQKPMRSDRAAAVLLTILPLAILSAGCAGTGESRSVQIAERPTWTAGDSWTYRGRGPNGAYTIIRKVLREGVFDGREAYEIQAGDARYWYTKGLGYLARTSGERNVRLAMPPEDWQWPLQVGKSWSATVTWVDTGEQ
Ga0105242_1290954713300009176Miscanthus RhizosphereLRYAFLVRLRRPLHAAAGLLATVALSIVLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTVTRKVLSEGIFEGRDCYQIEGGDARYWYTKQLGYLARTRGDKTVRLAAPPEDWQWPLQVGKQWSATVTWTDKGDQERTF
Ga0105066_108369513300009822Groundwater SandMIFPTPMRPDRAAAVLLTILPLAILSAGCAGTGESRSVQLAERPTWIAGDSWTYRGRGANGAYTITRKVLREGVFDGREAYEIQAGDARYWYTKRLGYLARTSGERTVRLAVPPEDWQWPLQVGKSWSATVTWVDSGEQERRFVLTSV
Ga0126384_1216430513300010046Tropical Forest SoilMAHRVIVLLLAVLPLAALSAGCTTTGESRTVHLAERPAWVAGDSWTYRGQGREGAYTITRKVLREGVFEGRPAYEVEAGNVNYWYTKQLGYMARVRGDKTERLASPPEDWQWPLQVGKSWSAT
Ga0126372_1007092013300010360Tropical Forest SoilMAHRVIVLLLAVLPLATLSAGCTTTGESRTVQLAERPAWVAGDSWTYRGQGREGAYTITRKVVREGIFEGRPAYEVEAGNVHYWYTKMLGYLARVRGDKTERLAAPPEDWQWPLQVGKSWSATVDWTDR
Ga0126377_1277167713300010362Tropical Forest SoilMLALSVSVLLAGCESSGVSRTVRMAERPIWSAGDSWTYRGRGPSGTYNVTRKVLREGVFEGWEAYEIEAGDSHYWYTKQLGYLARTKGDQTTRVARPPEDWQWPIQIGKQWSAIVTWTDRTDTEQKIYTLTSV
Ga0134124_1091039723300010397Terrestrial SoilMLEGRLMRHSRAAARLLAPLALVALLAGCESAGESRTVQLTERPRWTAGESWTYRGKGKDGAYTITRKVLREGIFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGRTWSATV
Ga0137391_1145243713300011270Vadose Zone SoilMRHHRAAARLLTLLPLVALLAGCESSGESRTVQMSERPRWTAGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPA
Ga0137391_1145243913300011270Vadose Zone SoilMLGDKLMRHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPSWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPA
Ga0137315_106623413300011395SoilMRHHRAAARLLTLLPLVALLAGCESSGESRTVQMSERPRWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDCQWPPQDGRSWLATVTGVGGPARRPRCAPPAPRPAAAALLH
Ga0137442_108567123300011414SoilMLEDSLMRPHRAAARLLTLLPLVALLAGCESTGESRTVQVSERPRWTAGESWTYRGKGRDGAYTITRRVLREGVFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAIPPEDWQWPLQVGRSWSATVTWVDGPAQDQRF
Ga0137458_127919513300011436SoilMRHHRAAARLLTLLPLVALLAGCESSGESRTVQMSERPRWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEIEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGKS
Ga0137383_1062245223300012199Vadose Zone SoilLAHIRYAFLVRLRRPLHAAAGLLATVALSLGLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTITRKVLKEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSATVTWTD
Ga0137399_1015738443300012203Vadose Zone SoilLRYAFLVRLRRPLHAAAGLLATVALSIVLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTITRKVLSEGIFEGRDCYQVEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTWTDKG
Ga0137378_1115403213300012210Vadose Zone SoilVRLRRPLHAAAGLLATVALSLGLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTITRKVLKEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQW
Ga0137447_101429433300012226SoilMLGDKLMRHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPSWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEIEAGDSRYWYTKRLGYLARVTGDRTVRRATPPEDWQWPLQVGRSWSATVTWVDGPAQDQRFVL
Ga0137360_1110394823300012361Vadose Zone SoilLAHIRYAFLVRLRRPLHAAAGLLATVALSIVLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTITRKVLSEGIFEGRDCYQVEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTWTDKGEQERTFV
Ga0137419_1066169513300012925Vadose Zone SoilMLEGRLMPHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVEGYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPEDWQWPLQVGRSWSATATWVDGPAQDRSFVLTGVWTVE
Ga0137404_1037083313300012929Vadose Zone SoilMLEGRLMPHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVEGYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPEDWQWPLQVGRSWSATVTWVDGAAQDQRF
Ga0137404_1229297313300012929Vadose Zone SoilLRYAFLVRLRRPLHTAAGLLATVALSIVLGGCESGGISRTVHIAERPIWTAGNSWTYRGKGPNGAYTVTRKVLSEGIFEGRDCYQIEGGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTWTDKGEQERTFVLPGCGWWRPTR
Ga0153915_1086827323300012931Freshwater WetlandsMAGALSHLGYAFLVRLPRAVHAAAGLLAALALSIALAGCESSGVSRTVQLAERPIWAAGDSWTYRGRGPAGAYTVTRKVLREGIFGGRDGYQIEAGDARYWYTKQLGYLARTQGDKTVRLATPPEDWQWPLQVGKQWSATVTWTDS
Ga0153915_1301404223300012931Freshwater WetlandsLGYAFLVRLPRPVHAAAGLLAALALSIALAGCESSSVSRTVQMAERPIWAAGDSWTYRGRGPAGAYTVTRKVLREGIFGGRDGYQIEAGDARYWYTKQLGYLARTQGDKTVRLATPPEDWQWPLQVGKQWSATVTWTDS
Ga0164299_1027375613300012958SoilLRYAFVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDGGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWTDRTDTQQQTYTLTSVWTVE
Ga0164301_1017186933300012960SoilLRYAFVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDGGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWTDRTDTQQQTYTLTSVWPV
Ga0164302_1014672233300012961SoilLRYAFVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDGGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWTDRTDTQQQTYTLTSVWTVEVYEEVKTP
Ga0164304_1001341513300012986SoilLRYAFVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDGGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWTDRT
Ga0163163_1017951843300014325Switchgrass RhizosphereLRYAFVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWTDRTDTQQQTYTLTSVWTVEVYEEVKT
Ga0180069_106720723300014882SoilMRRHRAAARLLTLLPLVALLAGCESSGESRTVQMSERPRWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPL
Ga0120098_100599933300015170FossillMRHHRAATRVLTLLSLVGLLAGCGSAGQSRTVQIAERPRWTTGDSWTYRGKGRDGSYTITRRVLREGVFEGTEAYEVEAGDSRYWYTKGLGYLARITGGRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGAAQDQRFVLTA
Ga0180089_109056213300015254SoilMRHHRAAARLLTLLPLVALLAGCESSGESRTVQMSERPRWTVGDSWTYRGKGRDGAYTITRRVLREGVFDGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPAQ
Ga0132257_10212853423300015373Arabidopsis RhizosphereLRYAFVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDTQQQTYTLTSVWTVEAYE
Ga0187821_1000169313300017936Freshwater SedimentLGYAFVVTRPSRVLAAAGLLTALAFSIALAGCESSGVSRIVQMAERPVWAVGDSWTYRGRGPAGAYTVTRKVLREGIFADRDCYQIEAGDSRYWYTKQLGYLARTSGEKTVRLAAPPEDWQWP
Ga0184605_1038243513300018027Groundwater SedimentMSLGLGADNVKCGMLEDRLMPHHRAAVRLLILLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVEAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPAEDWQWPLQVGRSWSATVTWV
Ga0184608_1026295223300018028Groundwater SedimentMRHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVDAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPAEDWQWPLQVGRSWSATVTWVDGAAQDQ
Ga0184626_1014703713300018053Groundwater SedimentMRHHRAAARLLTLLPLVALLAGCGSTGESRSVQLAERPQWTAGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQW
Ga0184619_1007742743300018061Groundwater SedimentMSLGLGADNVKSGMLEDRLMPHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVEAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPEDWQWPLQVGRSWSATV
Ga0184637_1023467533300018063Groundwater SedimentMILQTPMRPDRAACVLLAILALAILSGGCAGTGESRSVQLAERPAWTAGDSWTYRGRGANGVYTITRKVLREGVFDGREAYEVQAGDARYWYTKRLGYLARTSGERTVRLAMPPEDWQWPLQVGKSWSATVTWVD
Ga0193712_109920123300019880SoilMSLGLGADNVKCGMLEDRLMPHHRAAVRLLTLLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVEAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPED
Ga0193725_108364413300019883SoilMRHPRAAARLLALLPLVALLAGCESTGESRTVQLSERPRWTAGESWTYRGKGKDGAYTITRKVLREGIFEGYDAYEVQAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDW
Ga0193710_103196013300019998SoilMLEGKLMRHPRAAARLLALLPLVALLAGCESTGESRTVQLSERPRWTVGESWTYRGKGKDGAYTITRKVLREGIFEGYDAYEVQAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGP
Ga0193718_110160513300019999SoilVAGLLATVALSIVLAGCESGGISRTVHIAERPIWSAGNSWTYRGKGPNGTYTVTRKVLSAGIFEGRDCYQIEAGDARYWYTKELGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSATVTWTDKGEQERTFV
Ga0193733_115976113300020022SoilMSLGLGADNVKCGMLEDRLMPHHRAAVRLLILLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGVFEGVEAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPEDWQWPLQVGRSWSATVTWVDGAAQDQRFLLTGVWTVETYEEVK
Ga0206356_1068538723300020070Corn, Switchgrass And Miscanthus RhizosphereVVRLRRPFFPPVVLLAVLVVSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVG
Ga0179596_1004879743300021086Vadose Zone SoilVLWRTLRYAFLVRLRRPLHAAAGLLATVALSLVLAGCESGGISRTVHVAERPIWTAGNSWTYRGRGPNGAYTVTRKVLKEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSATVTWTD
Ga0193695_111093723300021418SoilMSLGLGADNVKCGMLEDRLMPHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVEAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPEDWQWPLQVGRSWSATVTWVDGAAQDQRFLLT
Ga0210409_1133319623300021559SoilVSGALPHLRYALKVRLRRPIHAAAGLLAVLAVAFSLAGCESSGVSRTVHIGERPTWVVGDSWTYRGRGPKGPYSVTRKVLSEGLFEGRDCYQIEAGDSRYWYTKQLGYLARTNGDKTVRLAAPPEDWQWPIQIGKQWSATVTWTDSGEQTRTFVLTGVWLV
Ga0224452_109603323300022534Groundwater SedimentMRHHRAATRLLTLLPLVALLAGCGSTGESRTVQIAERPSWTVGDSWTYRGKGRDGAYTITRRVLREGIFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPAQDQR
Ga0209109_1021901223300025160SoilMRHHRVAALLLTILPLAALLAGCESAGTSRIVHVAERPRWTAGDSWTYRGKGREGAYTITRQVLREGVFEGQDAYEVQAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGK
Ga0207647_1012897333300025904Corn RhizosphereVVRLRRPFFPPVVLLAVLVVSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDTQQQTYTLTSVWTVEAYEEVKTPAGT
Ga0207699_1034079713300025906Corn, Switchgrass And Miscanthus RhizosphereVTGALPHLRYALKVRLRRPIHTAAGLLAVLAVAFSLAGCESSGVSRTVHIGERPTWVVGDSWTYRGRGPKGPYTVTRKVVSEGLFEGRDCYQIEVGDSRYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTWTDKGEQERTFVLT
Ga0207663_1018631313300025916Corn, Switchgrass And Miscanthus RhizosphereVTGALPHLRYALKVRLRRPIHTAAGLLAVLAVAFSLAGCESSGVSRTVHIGERPTWVVGDSWTYRGRGPKGPYTVTRKVVSEGLFEGRDCYQIEVGDSRYWYTKQLGYLARTSGDKTVRLAAPPEDWQW
Ga0207660_1043585333300025917Corn RhizosphereMRPDRAAARWLTLLPLVALLAGCGSTGASRTVQIAERPQWTAGDSWTYRGKGRDGAYTITRRVLGERVFEGVLAYEIEAGDSHYWYTRQLGYLARVTGGQTVRRAMPPEDWQWPLQVGRSWSATVTWV
Ga0207644_1120837823300025931Switchgrass RhizosphereVVRLRRPFFPPVVLLAVLVVSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRT
Ga0207712_1030347233300025961Switchgrass RhizosphereVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDTQ
Ga0207658_1006126813300025986Switchgrass RhizosphereVVRLRRPFFPPVVLLAVLVVSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDTQQQTYTLTSVWTVEAYEE
Ga0207677_1030481933300026023Miscanthus RhizosphereVVRLRRPFFPPVVLLAVLVVSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDTQQQTYTLTSV
Ga0209438_100584013300026285Grasslands SoilLRYAFLVRLRRPLHAAAGLLATVALSIVLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTITRKVLSEGIFEGRDCYQVEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTWTDKGEQER
Ga0257179_100826113300026371SoilMRHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPSWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPL
Ga0257147_100183313300026475SoilLRYAFLVRLRRPLHAAAGLLATVALSIVLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTITRKVLSEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWP
Ga0257177_100207743300026480SoilMRHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPSWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPAQDQRFVLTG
Ga0257159_107207313300026494SoilLRYAFLVRLRRPLHAAAGLLATVALSIVLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTITRKVLSEGIFEGRDCYQVEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTWTDKGEQERTFVLT
Ga0209648_1073404713300026551Grasslands SoilMRYAFLVRLRRPLHAAAGLLATVALSLGLAGCESGGISRTVHIAERPIWTAGNSWTYRGRGPNGAYTVTRKVLKEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSATVTWTDQGEQERTFVLTGVWLVESYEEVKTPA
Ga0209073_1010326723300027765Agricultural SoilLGYAFVVTRPSRVLAAAGLLTALAFAIALAGCESSGVSRIVQMAERPVWNVGDSWTYRGRGPAGAYTVTRKVLREGIFAARDCYQIEAGDARYWYTKQLGYLARTSGEKTVRLASPPEDWQWPLQVGKQWS
Ga0209180_1005438713300027846Vadose Zone SoilMRHHRAAARLLTLLPLVALLAGCGSTGESRTVQIAERPSWTVGDSWTYRGKGRDGAYTITRRVLREGVFEGHDAYEVEAGDSRYWYTKRLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPAQ
Ga0209701_1024786613300027862Vadose Zone SoilMILQKPMRPDRAAAVLLTILLLAILSAGCAGTGESRSVQIAERPTWTAGDSWTYRGRGPNGAYTIIRKVLREGVFDGREAYEIQAGDARYWYTKGLGYLARTSGGRTVRLAMPPEDWQWPLQV
Ga0209701_1039244013300027862Vadose Zone SoilVPGALAHIRYAFLVRLRRPLHAAAGLLATVALSLVLAGCESGGISRTVHVAERPIWTAGNSWTYRGRGPNGAYTVTRKVLKEGIFEGRDCYQVEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTW
Ga0209590_1016050413300027882Vadose Zone SoilMILPRPMRLDRVAAVLLTILPLAVLSAGCAGTGESRSVQLAERPTWTAGDTWTYRGRGSNGAYTITRKVLREAVFDGRDAYEIQAGDARYWYTKGLGYLARTNGERTVRRAMPPEDWQWPLQVGKS
Ga0209486_1093342123300027886Agricultural SoilMRLDRAAARLLTLLPLVALLAGCESAGPSRPAQLAERPHWTAGESWTYRGKGRDGAYTITRRVVREGIFEGYEAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPAQDQR
Ga0209488_1011033013300027903Vadose Zone SoilVPGALAHIRYAFLVRLRRPLHAAAGLLATVALSLVLAGCESGGISRTVHVAERPIWTAGNSWTYRGRGPNGAYTVTRKVLKEGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSATVTWTDQGEQERTFVLTG
Ga0209889_101598813300027952Groundwater SandMILQTPMRPDRAACVLLAILALAILSGGCAGTGESRSVQLAERPAWTAGDSWTYRGRGANGVYTITRKVLREGVFDGRDAYEIQAGDARYWYTKRLGYLARTSGERTVRLA
Ga0247824_1069582523300028809SoilMRHSRAAARLLAPLALVALLAGCESAGESRTVQLTERPRWTAGESWTYRGKGKDGAYTITRKVLREGIFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWP
Ga0307308_1018822813300028884SoilMSLGLGADNVKCGMLEDRLMPHHRAAVRLLILLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGVFEGVEAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPEDWQWPLQ
Ga0307304_1039752013300028885SoilLLTLLPLVALLAGCGSTGESRTVQIAERPRWTAGESWTYRGKGRDGAYTITRRVLREGIFEGVEAYEVEAGDSRYWYTKQLGYLARVTGGRTVRLAMPPEDWQWPLQVGRSWSATVTWVDGAAQDQR
(restricted) Ga0255311_112231023300031150Sandy SoilMILPKPMRPNRAAAVLLTILPLAILSAGCAGTGESRSVQLAERPTWTAGDSWTYRGHGPNGAYTIIRKVLREGVFEGREAYEIQVGDARYWYTKGLGYLARTSGGRTVRLAMPPEDWQWPLQVGK
(restricted) Ga0255312_102445313300031248Sandy SoilLGYAFLVKRPRRVLAAAGLLAALAFSIALAGCESSGVSRIVQIAERPVWNVGDSWTYRGRGPAGAYTVTRKVLREGIFAARDCYQIEAGDARYWYTKQLGYLARTNGEKTVRIAAPPEDWQWPLQVGKQWSATVTWTDSGE
Ga0307473_1002192553300031820Hardwood Forest SoilMILPTPMRLDRAAAVLLTLLPLALVSAGCAGTGESRSVQLAERPTWTAGDSWTYRGRGPNGVYTITRKVLREGVFDGRQAYEVQAGDARYWYTKGLGYLARTSGERNVRLAMPPEDWQWPLQVGKSW
Ga0310885_1059434713300031943SoilVVRLRRPFFPPVVLLAVLVLSMGLAGCESSGVSRTVQIGERPTWNAGDSWTYRGRGPSGTFNVTRKVLREGVFEGREAYEVDAGGTHYWYTKQLGYLARTKGDETTRLARPPEDWQWPIQVGKQWSAIVAWIDRTDT
Ga0307470_1155984223300032174Hardwood Forest SoilVSGALPHLRYALTVRLRRPIHAAAGLLAVLAVAFSLAGCESSGVSRTVHIGERPTWVVGDSWTYRGRGPKGPYTVTRKVVSEGLFEGRDCYQIEVGDSRYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPIQTGKQWSATVTWTDSG
Ga0307471_10015857543300032180Hardwood Forest SoilVSSGLAHLRYAFLVRLLRRPLHAAAGLLATVALSIVLAGCESGGISRTVHIAERPIWAAGNSWTYRGQGPNGAYTVTRKVLREGIFEGRDCYQIEAGDARYWYTKQLGYLARTSGDKTVRLAAPPEDWQWPLQVGKQWSSTVTWTDKGEQER
Ga0326729_101028433300033432Peat SoilVSGASSHLGYAFLVKLPRRVHATAGLLAALAFSIALAGCESSGVSRIVQMAERPVWSVGDSWTYRGRGPAGAYTVTRKVLREGIFAARDCYQIEAGDARYWYTKALGYLARTSGEKTVRLAAPPEDWQWPLQVGKQWSATVTWTDSGEQERTYV
Ga0326732_104859423300033501Peat SoilVRLPRPVHAAAGLLAALALSIALAGCESSSVSRTVQMAERPIWAAGDSWTYRGRGPAGAYTVTRKVLREGIFGGRDGYQIEAGDARYWYTKQLGYLARTQGDKTVRLATPPEDWQWPLQVGKQWSATVTWTDSGEQERTYVVT
Ga0326731_104773823300033502Peat SoilVSGALSHLGYAFLVKLPRRVHAAAWLLAALTFSIAHAGCESSGVSRTVQMVERPVWSVGDSWTYRGRGPAGAYTVTRKVLREGIFAARDCYQIEAGDARYWYTKALGYLARTSGEKTVRLAAPPEDWQWPLQ
Ga0316628_10218752313300033513SoilMRHHRAVAWLLTLLPLVALLAGCESTGKSHTVQLSERPRWTAGESWTYRGKGRDGAYTITRRVLREGVFEGYDAYEVEAGDSRYWYTKQLGYLARVTGGRTVRRAMPPEDWQWPLQVGRSWSATVTWVDGPAQDQRFLLTGVWTV
Ga0247830_1106295413300033551SoilMRHSRAAARLLAPLALVALLAGCESAGESRTVQLTERPRWTAGESWTYRGKGKDGAYTITRKVLREGIFEGYDAYEVEAGDSRYWYTKQLGYLARVTGDRTVRRAMPPEDWQWPLQVGRSWSATVTWM
Ga0326723_0041959_1494_19223300034090Peat SoilLGYAFLVKLPRRVHATAGLLAALAFSIALAGCESSGVSRTVQIAERPEWTTGDSWTYRGRGPAGAYTVTRKVLREGIFAARDCYQIEAGDARYWYTKQLGYLARTSGEKTVRIAAPPEDWQWPLQVGKQWSATVTWTDSGEQE
Ga0326723_0201163_3_4343300034090Peat SoilLVRLPRPVHAAAGLLAALALSIALAGCESSSVSRTVQMAERPIWAAGDSWTYRGRGPAGAYTVTRKVLREGIFGGRDGYQIEAGDARYWYTKQLGYLARTQGDKTVRLATPPEDWQWPLQVGKQWSATVTWTDSGEQERTYVVT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.