NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F055515

Metagenome / Metatranscriptome Family F055515

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F055515
Family Type Metagenome / Metatranscriptome
Number of Sequences 138
Average Sequence Length 98 residues
Representative Sequence FATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGAPNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Number of Associated Samples 130
Number of Associated Scaffolds 138

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.72 %
% of genes near scaffold ends (potentially truncated) 99.28 %
% of genes from short scaffolds (< 2000 bps) 88.41 %
Associated GOLD sequencing projects 121
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (79.710 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere
(10.870 % of family members)
Environment Ontology (ENVO) Unclassified
(32.609 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(39.855 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 57.58%    Coil/Unstructured: 42.42%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 138 Family Scaffolds
PF08546ApbA_C 84.06
PF00496SBP_bac_5 6.52
PF01425Amidase 2.17
PF12911OppC_N 0.72
PF10103Zincin_2 0.72

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 138 Family Scaffolds
COG1893Ketopantoate reductaseCoenzyme transport and metabolism [H] 84.06
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 2.17


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A79.71 %
All OrganismsrootAll Organisms20.29 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001991|JGI24743J22301_10126444Not Available563Open in IMG/M
3300002121|C687J26615_10165720Not Available560Open in IMG/M
3300003503|JGI26141J51220_1004122Not Available899Open in IMG/M
3300004052|Ga0055490_10108359Not Available789Open in IMG/M
3300004145|Ga0055489_10231036All Organisms → cellular organisms → Bacteria582Open in IMG/M
3300005093|Ga0062594_103132789Not Available518Open in IMG/M
3300005328|Ga0070676_10811376Not Available691Open in IMG/M
3300005332|Ga0066388_103959131Not Available755Open in IMG/M
3300005338|Ga0068868_100123306All Organisms → cellular organisms → Bacteria2115Open in IMG/M
3300005341|Ga0070691_10389585All Organisms → cellular organisms → Bacteria783Open in IMG/M
3300005444|Ga0070694_100015296All Organisms → cellular organisms → Bacteria → Proteobacteria4815Open in IMG/M
3300005444|Ga0070694_100690069Not Available829Open in IMG/M
3300005444|Ga0070694_100811853Not Available768Open in IMG/M
3300005445|Ga0070708_100294011Not Available1529Open in IMG/M
3300005467|Ga0070706_101634211Not Available587Open in IMG/M
3300005468|Ga0070707_100157492All Organisms → cellular organisms → Bacteria2212Open in IMG/M
3300005468|Ga0070707_102013095All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300005518|Ga0070699_100066715All Organisms → cellular organisms → Bacteria3124Open in IMG/M
3300005536|Ga0070697_100566852Not Available996Open in IMG/M
3300005536|Ga0070697_101113963All Organisms → cellular organisms → Bacteria703Open in IMG/M
3300005543|Ga0070672_100994363All Organisms → cellular organisms → Bacteria743Open in IMG/M
3300005546|Ga0070696_100194932Not Available1509Open in IMG/M
3300005547|Ga0070693_100101984All Organisms → cellular organisms → Bacteria1750Open in IMG/M
3300005555|Ga0066692_10544425Not Available735Open in IMG/M
3300005880|Ga0075298_1002936Not Available1149Open in IMG/M
3300005921|Ga0070766_10756028Not Available660Open in IMG/M
3300006845|Ga0075421_100838250Not Available1055Open in IMG/M
3300006852|Ga0075433_10071926All Organisms → cellular organisms → Bacteria3040Open in IMG/M
3300006881|Ga0068865_100990906All Organisms → cellular organisms → Bacteria735Open in IMG/M
3300006903|Ga0075426_10892081Not Available670Open in IMG/M
3300009088|Ga0099830_11653366Not Available534Open in IMG/M
3300009162|Ga0075423_12799427Not Available534Open in IMG/M
3300009166|Ga0105100_10494561Not Available745Open in IMG/M
3300009171|Ga0105101_10455287Not Available626Open in IMG/M
3300009821|Ga0105064_1083869Not Available640Open in IMG/M
3300010396|Ga0134126_10458005Not Available1475Open in IMG/M
3300010397|Ga0134124_11236962Not Available768Open in IMG/M
3300011410|Ga0137440_1097822Not Available595Open in IMG/M
3300011444|Ga0137463_1265958Not Available637Open in IMG/M
3300012203|Ga0137399_10166091All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1773Open in IMG/M
3300012205|Ga0137362_11595169Not Available539Open in IMG/M
3300012205|Ga0137362_11598951Not Available539Open in IMG/M
3300012362|Ga0137361_11029574Not Available743Open in IMG/M
3300012363|Ga0137390_11238466Not Available693Open in IMG/M
3300012917|Ga0137395_10657143Not Available758Open in IMG/M
3300012923|Ga0137359_10150567All Organisms → cellular organisms → Bacteria2074Open in IMG/M
3300012927|Ga0137416_10273441Not Available1389Open in IMG/M
3300012927|Ga0137416_10685022Not Available899Open in IMG/M
3300012989|Ga0164305_10340043Not Available1126Open in IMG/M
3300013102|Ga0157371_10476772All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300013307|Ga0157372_11941603Not Available676Open in IMG/M
3300014262|Ga0075301_1167993Not Available521Open in IMG/M
3300014968|Ga0157379_11428141Not Available671Open in IMG/M
3300015077|Ga0173483_10569818Not Available617Open in IMG/M
3300015200|Ga0173480_11152467All Organisms → cellular organisms → Bacteria520Open in IMG/M
3300015259|Ga0180085_1077888Not Available965Open in IMG/M
3300015374|Ga0132255_102934220Not Available729Open in IMG/M
3300017936|Ga0187821_10056837Not Available1408Open in IMG/M
3300017997|Ga0184610_1009317Not Available2428Open in IMG/M
3300018028|Ga0184608_10117874Not Available1123Open in IMG/M
3300018059|Ga0184615_10043483All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → unclassified Acidimicrobiia → Acidimicrobiia bacterium2497Open in IMG/M
3300018071|Ga0184618_10046970Not Available1555Open in IMG/M
3300018074|Ga0184640_10437649Not Available583Open in IMG/M
3300018079|Ga0184627_10018783All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3370Open in IMG/M
3300018084|Ga0184629_10224486Not Available977Open in IMG/M
3300018429|Ga0190272_12281483Not Available583Open in IMG/M
3300019879|Ga0193723_1032172Not Available1577Open in IMG/M
3300020004|Ga0193755_1169779Not Available648Open in IMG/M
3300020022|Ga0193733_1045383Not Available1241Open in IMG/M
3300021073|Ga0210378_10008197All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi4563Open in IMG/M
3300021081|Ga0210379_10310083Not Available691Open in IMG/M
3300021088|Ga0210404_10314790Not Available863Open in IMG/M
3300021170|Ga0210400_10129602All Organisms → cellular organisms → Bacteria2018Open in IMG/M
3300021404|Ga0210389_10851919Not Available711Open in IMG/M
3300021420|Ga0210394_11783330Not Available513Open in IMG/M
3300022195|Ga0222625_1094065Not Available662Open in IMG/M
3300022694|Ga0222623_10323781Not Available590Open in IMG/M
3300025165|Ga0209108_10166541Not Available1154Open in IMG/M
3300025324|Ga0209640_10823874Not Available729Open in IMG/M
3300025580|Ga0210138_1004568Not Available2534Open in IMG/M
3300025899|Ga0207642_10305427Not Available924Open in IMG/M
3300025907|Ga0207645_11240755Not Available500Open in IMG/M
3300025910|Ga0207684_10852950Not Available768Open in IMG/M
3300025922|Ga0207646_11642917Not Available553Open in IMG/M
3300025934|Ga0207686_10889247Not Available718Open in IMG/M
3300025935|Ga0207709_10397894Not Available1052Open in IMG/M
3300025938|Ga0207704_10394651Not Available1090Open in IMG/M
3300025938|Ga0207704_10819370Not Available778Open in IMG/M
3300025940|Ga0207691_10618069Not Available917Open in IMG/M
3300025959|Ga0210116_1096435Not Available582Open in IMG/M
3300025961|Ga0207712_10095072All Organisms → cellular organisms → Bacteria2203Open in IMG/M
3300026001|Ga0208000_100821Not Available1413Open in IMG/M
3300026023|Ga0207677_10279560Not Available1369Open in IMG/M
3300026089|Ga0207648_10118498All Organisms → cellular organisms → Bacteria2327Open in IMG/M
3300026285|Ga0209438_1224113Not Available508Open in IMG/M
3300026333|Ga0209158_1261495Not Available593Open in IMG/M
3300026345|Ga0257148_1018955Not Available557Open in IMG/M
3300026469|Ga0257169_1050985Not Available651Open in IMG/M
3300026507|Ga0257165_1029477Not Available953Open in IMG/M
3300026514|Ga0257168_1077802Not Available734Open in IMG/M
3300026535|Ga0256867_10059055Not Available1533Open in IMG/M
3300026557|Ga0179587_11044282Not Available538Open in IMG/M
3300027326|Ga0209731_1069393Not Available546Open in IMG/M
3300027614|Ga0209970_1007099Not Available1837Open in IMG/M
3300027617|Ga0210002_1008272Not Available1583Open in IMG/M
3300027695|Ga0209966_1140113Not Available561Open in IMG/M
3300027765|Ga0209073_10218052Not Available731Open in IMG/M
3300027787|Ga0209074_10410009Not Available570Open in IMG/M
3300027815|Ga0209726_10167320Not Available1201Open in IMG/M
3300027875|Ga0209283_10143644Not Available1581Open in IMG/M
3300027889|Ga0209380_10465472Not Available739Open in IMG/M
3300027954|Ga0209859_1067933Not Available568Open in IMG/M
3300028587|Ga0247828_10843458Not Available585Open in IMG/M
3300028722|Ga0307319_10308624Not Available524Open in IMG/M
3300028784|Ga0307282_10668554Not Available503Open in IMG/M
3300028792|Ga0307504_10117601Not Available868Open in IMG/M
3300028809|Ga0247824_10924192Not Available547Open in IMG/M
3300028814|Ga0307302_10223955Not Available920Open in IMG/M
3300028828|Ga0307312_10719051Not Available661Open in IMG/M
3300028828|Ga0307312_11187592Not Available504Open in IMG/M
3300028906|Ga0308309_10151870All Organisms → cellular organisms → Bacteria → Proteobacteria1864Open in IMG/M
3300029636|Ga0222749_10023016All Organisms → cellular organisms → Bacteria2593Open in IMG/M
3300030336|Ga0247826_10093471All Organisms → cellular organisms → Bacteria → Proteobacteria1848Open in IMG/M
(restricted) 3300031150|Ga0255311_1061118Not Available798Open in IMG/M
(restricted) 3300031197|Ga0255310_10184314All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300031228|Ga0299914_10855566Not Available754Open in IMG/M
3300031720|Ga0307469_11814711Not Available589Open in IMG/M
3300031944|Ga0310884_10726458Not Available603Open in IMG/M
3300031962|Ga0307479_10521507Not Available1171Open in IMG/M
3300032075|Ga0310890_10240015All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1269Open in IMG/M
3300032180|Ga0307471_103154126Not Available584Open in IMG/M
3300033417|Ga0214471_10074016All Organisms → cellular organisms → Bacteria2791Open in IMG/M
3300033486|Ga0316624_11657250Not Available590Open in IMG/M
3300033513|Ga0316628_101377467Not Available939Open in IMG/M
3300033550|Ga0247829_10439003Not Available1077Open in IMG/M
3300033814|Ga0364930_0208954Not Available663Open in IMG/M
3300034178|Ga0364934_0256286Not Available663Open in IMG/M
3300034819|Ga0373958_0022948Not Available1170Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere10.87%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil10.14%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil8.70%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil6.52%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment5.07%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere5.07%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil4.35%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands2.90%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere2.90%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.90%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere2.90%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment2.17%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil2.17%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.17%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil2.17%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment1.45%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.45%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.45%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.45%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.45%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.45%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil1.45%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil1.45%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.45%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.45%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.45%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.45%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.45%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.72%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.72%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.72%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.72%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.72%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.72%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.72%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.72%
Corn, Switchgrass And Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere0.72%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.72%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.72%
Rhizosphere SoilHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil0.72%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001991Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2Host-AssociatedOpen in IMG/M
3300002121Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1EnvironmentalOpen in IMG/M
3300003503Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AMHost-AssociatedOpen in IMG/M
3300004052Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004145Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005543Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaGHost-AssociatedOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005547Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005880Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009166Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm May2015EnvironmentalOpen in IMG/M
3300009171Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015EnvironmentalOpen in IMG/M
3300009821Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300011410Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT222_2EnvironmentalOpen in IMG/M
3300011444Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2EnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013102Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300014262Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D1EnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015077Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2)EnvironmentalOpen in IMG/M
3300015200Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 (version 2)EnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300022195Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025580Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025940Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025959Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026001Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104 (SPAdes)EnvironmentalOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026345Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-AEnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027326Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027614Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027617Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027695Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Rhizosphere soil Co-N PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300027954Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300028722Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_368EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028809Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030336Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031228Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT153D57EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031944Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M
3300033814Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M
3300034819Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_1Host-AssociatedOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI24743J22301_1012644423300001991Corn, Switchgrass And Miscanthus RhizosphereAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR*
C687J26615_1016572013300002121SoilGKWSYPVPPTASRGVRTEYGAPNTLAVELRERSIVAYVNGRPVATAELPAEASGTLGLYVDQRGMEVLFSNLRVSELPPIR*
JGI26141J51220_100412223300003503Arabidopsis Thaliana RhizosphereTYRVASWNGKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL*
Ga0055490_1010835923300004052Natural And Restored WetlandsAFGLMFAGRGGRAADRTFATFGLTANGTYRMASWSGKWTYPVPPTATRSVKTEYGALNTLAVELRDRSVVAYVNGRPVATAELGAEAAGTLGFYVDQRGMEVVFSQLRVIELAPMR*
Ga0055489_1023103613300004145Natural And Restored WetlandsVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRARSIVAYVNGRPVATAELSTEATGTLGLYVDQRGMEVLFNNLRVSELPPIR*
Ga0062594_10313278913300005093SoilRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0070676_1081137613300005328Miscanthus RhizosphereKGTREGAFGLMFASRGAAADRTFATLGLTANGTYRVASWNGKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL
Ga0066388_10395913123300005332Tropical Forest SoilLMFGSRGSADGRTFSTLGLTANGTYRVASWNGKWLYPVPPTASRSVKTDYGAQNTLAVEVRDRSVVAFVNGRPVATAELGVEAAGTLGLYVDQRGMEVLFTNVRVTELSPMR*
Ga0068868_10012330613300005338Miscanthus RhizosphereEGAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR*
Ga0070691_1038958523300005341Corn, Switchgrass And Miscanthus RhizosphereRVASWNGKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL*
Ga0070694_10001529673300005444Corn, Switchgrass And Miscanthus RhizosphereWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL*
Ga0070694_10069006923300005444Corn, Switchgrass And Miscanthus RhizosphereLTANGTYRVASWAGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0070694_10081185313300005444Corn, Switchgrass And Miscanthus RhizosphereAFGLMFASRGGADNRTFATFGLTANGTYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR*
Ga0070708_10029401123300005445Corn, Switchgrass And Miscanthus RhizosphereVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0070706_10163421113300005467Corn, Switchgrass And Miscanthus RhizosphereVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0070707_10015749233300005468Corn, Switchgrass And Miscanthus RhizosphereATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTETSGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0070707_10201309513300005468Corn, Switchgrass And Miscanthus RhizosphereGTREGAFGLMFASRGGGDHRTFATFGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR*
Ga0070699_10006671513300005518Corn, Switchgrass And Miscanthus RhizosphereRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTETSGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0070697_10056685213300005536Corn, Switchgrass And Miscanthus RhizosphereVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTETSGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0070697_10111396313300005536Corn, Switchgrass And Miscanthus RhizosphereRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR*
Ga0070672_10099436313300005543Miscanthus RhizospherePVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR*
Ga0070696_10019493213300005546Corn, Switchgrass And Miscanthus RhizosphereGEGRTFATLGLTANGTYRVASWAGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSHLRVVELLPLR*
Ga0070693_10010198413300005547Corn, Switchgrass And Miscanthus RhizosphereWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR*
Ga0066692_1054442513300005555SoilRKGTREGAFGLMFASRGAGEGRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGAPNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0075298_100293623300005880Rice Paddy SoilLRKGSREGAFGLMFAGRGGPGGRVFATLGLTANGTYRVASWTGKWAYPVPPTATRTVKTEYGALNTLAVEVRERSVVAYVNGRPVATAELGTEAAGTVGLYVDQRGMEVVFSNLRVVELLPLR*
Ga0070766_1075602813300005921SoilFGLMFGSRGAADGRTFATFGLTANGTYRVAHWTGKWSYPVPPTASRSVKSDYGAVNHLAVEVRDKSIVAYVNGRPVATAELPADGAGTVGLFVDQRGMEVVFSNLRVVELVPMR*
Ga0075421_10083825013300006845Populus RhizosphereVRIEITARLRKGTREGAFGLMFGSRGGADNRTFATLGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTMAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSALPPIR*
Ga0075433_1007192633300006852Populus RhizosphereREGAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR*
Ga0068865_10099090623300006881Miscanthus RhizosphereYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR*
Ga0075426_1089208113300006903Populus RhizosphereRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELSPLR*
Ga0099830_1165336613300009088Vadose Zone SoilTYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0075423_1279942713300009162Populus RhizosphereREGAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFAQLRVVELAPLR*
Ga0105100_1049456113300009166Freshwater SedimentANGTYRVASWTGKWAYPVPPTATRSVKTEYGTLNTLAVELREKSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSALPPIR*
Ga0105101_1045528723300009171Freshwater SedimentFATFGLTANGTYRVASWTGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFSNLRVSALPPIR*
Ga0105064_108386913300009821Groundwater SandDNRTFATFGLTGNGTYRVASWSGKWSYPVPPTATRSIKTEYGALNTLAVELRDKSIVAYVNGRPVATAELAAEASGTLGFYVDQRGMEVVFSHLRVSELAPMR*
Ga0134126_1045800523300010396Terrestrial SoilKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAEPASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR*
Ga0134124_1123696213300010397Terrestrial SoilGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0137440_109782223300011410SoilATFGLTANGTYRVASWSGGKWSYPVPPTATRSVKTEYGAPNTVAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSDLPPIR*
Ga0137463_126595823300011444SoilPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR*
Ga0137399_1016609113300012203Vadose Zone SoilATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0137362_1159516923300012205Vadose Zone SoilRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVIELLPLR*
Ga0137362_1159895123300012205Vadose Zone SoilRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPSEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0137361_1102957423300012362Vadose Zone SoilLMFASRGAGEGRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPSEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0137390_1123846623300012363Vadose Zone SoilGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAVEASGTLGLYVDQRGMEVLFSNLRVSELPPIR*
Ga0137395_1065714313300012917Vadose Zone SoilYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0137359_1015056733300012923Vadose Zone SoilLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPSEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0137416_1027344113300012927Vadose Zone SoilASRGAGEGRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR*
Ga0137416_1068502223300012927Vadose Zone SoilASWSGGKWNYPVPPTATRSAKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR*
Ga0164305_1034004323300012989SoilAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFAQLRVVELAPLR*
Ga0157371_1047677223300013102Corn RhizosphereLMFASRGAAADRTFATLGLTANGTYRVASWNGKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL*
Ga0157372_1194160323300013307Corn RhizosphereLGLTANGTYRVASWNGKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL*
Ga0075301_116799323300014262Natural And Restored WetlandsGTYRVASWSGGKWTYPVPPTATRSVKTEYGAPNTLAVELRDRSVVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSALPPIR*
Ga0157379_1142814113300014968Switchgrass RhizosphereNRTFATFGLTANGTYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR*
Ga0173483_1056981823300015077SoilLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR*
Ga0173480_1115246713300015200SoilTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL*
Ga0180085_107788823300015259SoilGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQQGMEVLFTNLRVSELPPIR*
Ga0132255_10293422023300015374Arabidopsis RhizosphereLGLTANGTYRVASWTGKWAYPVPPTATRTVKTEYGALNTLAVELREKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELPPLR*
Ga0187821_1005683723300017936Freshwater SedimentAFGLMFASRGSAEARTFATLGLTANGTYRVASWAGKWAYPVPPTATRSVKTEYGALNTLAVELREKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVIELLPLR
Ga0184610_100931733300017997Groundwater SedimentAFATFGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTVAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0184608_1011787413300018028Groundwater SedimentIEVTARLRKGTREGAFGLMFASRGGADNRTFATFGLTANGTYRVASWIGGKWDYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNFRVSELPPIR
Ga0184615_1004348313300018059Groundwater SedimentRVASWSGGKWSYPVPPTVSRSVKTEYGAPNAMAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSALPPIR
Ga0184618_1004697013300018071Groundwater SedimentGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0184640_1043764913300018074Groundwater SedimentNGTYRIASWNGKWSYPVPWTPSRSVKTEYGVPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0184627_1001878323300018079Groundwater SedimentMFASRGGADNRTFATFGLTANGTYRVASWSGGKWDYPVPPTATRSVKTEYGAQNTVAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0184629_1022448623300018084Groundwater SedimentGLMFASRGGADNRAFATFGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTVAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0190272_1228148323300018429SoilVRIEVTARLRKGTREGAFGLMFASRGGGDNRTFATFGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTVAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0193723_103217233300019879SoilLTANGTYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFSNLRVSELPPVR
Ga0193755_116977923300020004SoilSWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0193733_104538313300020022SoilGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0210378_1000819753300021073Groundwater SedimentYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0210379_1031008323300021081Groundwater SedimentATFGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTVAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0210404_1031479023300021088SoilRGGAENRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVIELLPLR
Ga0210400_1012960213300021170SoilASRGAAENRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVIELLPLR
Ga0210389_1085191923300021404SoilPITATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVIELLPLR
Ga0210394_1178333013300021420SoilTANGTYRVAHWTGKWSYPVPPTASRSVKSDYGAVNHLAVEVRDKSIVAYVNGRPVATAELPADGAGTVGLFVDQRGMEVVFSNLRVVELVPMR
Ga0222625_109406513300022195Groundwater SedimentSRGGGDNRTFATFGLTANGTYRVASWSGGKWNYTVPPTATRSVKTEYGAPNTVAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0222623_1032378123300022694Groundwater SedimentPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0209108_1016654113300025165SoilGGKWSYPVPPTASRGVKTEYGAPNTLAVELRERSIVAYVNGRPVATAELPAEASGTLGLYVDQRGMEVLFSNLRVSELPPIR
Ga0209640_1082387413300025324SoilFGLMFAGRGPVDNRTFATFGLTGNGTYRVASWSGKWSYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELAAEAAGTLGFYVDQRGMEVVFSNLRVSELAPMR
Ga0210138_100456813300025580Natural And Restored WetlandsASRGGGDNRTFATVGLTANGTYRVASWSGGKWTYPVPPTATRSVKTEYGAPNTLAVELRDRSVVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSALPPIR
Ga0207642_1030542713300025899Miscanthus RhizosphereNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR
Ga0207645_1124075513300025907Miscanthus RhizosphereKGTREGAFGLMFASRGAAADRTFATLGLTANGTYRVASWNGKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSQLRVVELLPL
Ga0207684_1085295013300025910Corn, Switchgrass And Miscanthus RhizosphereIEVSARQRKGTREGAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFAQLRVVELAPLR
Ga0207646_1164291713300025922Corn, Switchgrass And Miscanthus RhizosphereATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTETSGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0207686_1088924713300025934Miscanthus RhizosphereATFGLTANGTYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR
Ga0207709_1039789423300025935Miscanthus RhizosphereRKGTREGAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
Ga0207704_1039465123300025938Miscanthus RhizosphereRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
Ga0207704_1081937013300025938Miscanthus RhizosphereTARLRKGTREGAFGLMFASRGGADNRTFATFGLTANGTYRVASWIGGKWNYPAPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR
Ga0207691_1061806913300025940Miscanthus RhizosphereRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
Ga0210116_109643513300025959Natural And Restored WetlandsVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRARSIVAYVNGRPVATAELSTEATGTLGLYVDQRGMEVLFNNLRVSELPPIR
Ga0207712_1009507233300025961Switchgrass RhizosphereGTYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR
Ga0208000_10082113300026001Rice Paddy SoilLTANGTYRVASWTGKWAYPVPPTATRTVKTEYGALNTLAVEARERSVVAYVNGRPVATAELGTEAAGTVGLYVDQRGMEVVFSNLRVVELLPLR
Ga0207677_1027956013300026023Miscanthus RhizosphereAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
Ga0207648_1011849813300026089Miscanthus RhizosphereASRGGADNRTFATFGLTANGTYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR
Ga0209438_122411313300026285Grasslands SoilKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0209158_126149513300026333SoilFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGAPNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0257148_101895513300026345SoilFGLMFASRGAGEGRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0257169_105098523300026469SoilTYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0257165_102947713300026507SoilYRVASWSGGKWNYPVPPTATRSVKTEFGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0257168_107780213300026514SoilGLMFASRGAAEGRTFATLGLTANGTYRVASWTGKWTYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0256867_1005905523300026535SoilGRGGPDNRTFATFALTANGTYRVASWSSGKWSYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELSAQASGTLGFYVDQRGMEVLFTNLRVSALPPIR
Ga0179587_1104428223300026557Vadose Zone SoilGAFGLMFASRGAAENRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0209731_106939323300027326Forest SoilARQRKGTREGAFGLMFASRGAAENRTFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0209970_100709943300027614Arabidopsis Thaliana RhizosphereKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL
Ga0210002_100827233300027617Arabidopsis Thaliana RhizosphereANGTYRVASWNGKWVYPVPPTATRSVKTDYGALNTLAVELRDRSIVAYVNGRPVATAELGAEAAGALGLYVDQRGMEVVFSHLRVVELAPL
Ga0209966_114011313300027695Arabidopsis Thaliana RhizosphereESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
Ga0209073_1021805213300027765Agricultural SoilGSAESRTFATLGLTANGTYRVAAWNGKWAYPVPPTATRSVKTEYGAVNHLAVETRERSVVAYVNGRPVATAELPGEAAGTLGFYVDQRGMEVLFSQLRVVQLAPLR
Ga0209074_1041000913300027787Agricultural SoilLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELSPLR
Ga0209726_1016732013300027815GroundwaterASRGGADNRTFATFGLTANGTYRVASWSGGKWSYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0209283_1014364433300027875Vadose Zone SoilASRGGADNRAFATFGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTVAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0209380_1046547223300027889SoilKGTREGAFGLMFGSRGAADGRTFATFGLTANGTYRVAHWTGKWSYPVPPTASRSVKSDYGAVNHLAVEVRDKSIVAYVNGRPVATAELPADGAGTVGLFVDQRGMEVVFSNLRVVELVPM
Ga0209859_106793313300027954Groundwater SandTLGLTANGTYRVASWSGKWSYPVPPTATRSIKTEYGALNTLAVELRDKSIVAYVNGRPVATAELAAEASGTLGFYVDQRGMEVVFSNLRVSELAPMR
Ga0247828_1084345813300028587SoilVSARQRKGTREGAFGLMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFAQLRVVELAPLR
Ga0307319_1030862423300028722SoilLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0307282_1066855413300028784SoilTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0307504_1011760113300028792SoilKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0247824_1092419223300028809SoilTRLFSRSSTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
Ga0307302_1022395523300028814SoilWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0307312_1071905123300028828SoilSWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0307312_1118759213300028828SoilRGGADNRTFATLGLTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELATEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0308309_1015187013300028906SoilHLPVQCATREGAFGLMFGSRGAADGRTFATFGLTANGTYRVAHWTGKWSYPVPPTASRSVKSDYGAVNHLAVEVRDKSIVAYVNGRPVATAELPADGAGTVGLFVDQRGMEVVFSNLRVVELVPMR
Ga0222749_1002301613300029636SoilPTATRSVRTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVIELLPLR
Ga0247826_1009347113300030336SoilAHFPLHAATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
(restricted) Ga0255311_106111813300031150Sandy SoilLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVIELLPLR
(restricted) Ga0255310_1018431413300031197Sandy SoilFATLGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELREKSIVAYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSNLRVVELAPLR
Ga0299914_1085556613300031228SoilDNRTFATFALTANGTYRVASWSSGKWSYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELSAQASGTLGFYVDQRGMEVLFTNLRVSALPPIR
Ga0307469_1181471123300031720Hardwood Forest SoilRVAAWSGGKWNYPVPPTASRSVRTEYGAPNTLGVELRDRSIVAYVNGRPVATAELAEQASGTLGLYVDQRGMEVLFTNLRVSDLAPLR
Ga0310884_1072645823300031944SoilDARTFATLGLTANGTYRVASWTGKWTYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLGLYVDQRGMEVVFSQLRVVELLPLR
Ga0307479_1052150723300031962Hardwood Forest SoilGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLGLYVDQRGMEVVFSQLRVVELLPLR
Ga0310890_1024001513300032075SoilMFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR
Ga0307471_10315412613300032180Hardwood Forest SoilGLTANGTYRVASWTGKWAYPVPPTATRSVKTEYGALNTLAVELRDKSIVSYVNGRPVATAELPTEASGTLALYVDQRGMEVVFSQLRVVELLPLR
Ga0214471_1007401613300033417SoilASWSGGKWSYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEASGTLGLYVDQRGMEVLFTNLRVSALPPIR
Ga0316624_1165725013300033486SoilGPDARTFATLGLTANGTYRVASWTGKWTYPVPPTATRSVKTEYGALNTLAVELREKSIVAYVNGRPVATAELPTEASGTLGLYVDQRGMEVVFSNLRVVELAPLR
Ga0316628_10137746723300033513SoilGLMFASRGGPDARTFATLGLTANGTYRVASWTGKWTYPVPPTATRSVKTEYGALNTLAVELRDKSIVAYVNGRPVATAELPTEASGTLGLYVDQRGMEVVFSQLRVVELLPLR
Ga0247829_1043900323300033550SoilNGTYRVASWIGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELAAEAGGTLGLYVDQRGMEVLFTNLRVSELPPSR
Ga0364930_0208954_2_2563300033814SedimentSWSGKWSYPVPPTATRSVKTEYGALNALAVELRDKSIVAYVNGRPVATAELAAEASGTLGFYVDQRGMEVVFSHLRVSELAPMR
Ga0364934_0256286_3_3323300034178SedimentSRGGGDNRTFATFALTANGTYRVASWSGGKWNYPVPPTATRSVKTEYGAPNTLAVELRDRSIVAYVNGRPVATAELSAEASGTLGLYVDQRGMEVLFTNLRVSELPPIR
Ga0373958_0022948_837_11693300034819Rhizosphere SoilFASRGSAESRTFATLGLTANGTYRVAAWSGKWAYPVPPTATRSVKTEYGAVNHLAVELREKSLVAYVNGRPVATAELASDAAGTLGLYVDQRGMEVLFTQLRVVELAPLR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.