NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F072612

Overview

Basic Information
Family ID F072612
Family Type Metagenome / Metatranscriptome
Number of Sequences 121
Average Sequence Length 241 residues
Representative Sequence DGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Number of Associated Samples 109
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 22.22 %
% of genes near scaffold ends (potentially truncated) 38.84 %
% of genes from short scaffolds (< 2000 bp) 41.32 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction No

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (80.992 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (41.322 % of family members)
Environment Ontology (ENVO) Unclassified (33.058 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (53.719 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 38.75%, β-sheet 10.00%, Coil/Unstructured 51.25%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 121 Family Scaffolds
PF00106	adh_short	11.57
PF04149	DUF397	7.44
PF13577	SnoaL_4	4.13
PF12867	DinB_2	4.13
PF13561	adh_short_C2	2.48
PF00805	Pentapeptide	1.65
PF13302	Acetyltransf_3	1.65
PF01370	Epimerase	1.65
PF07730	HisKA_3	0.83
PF14312	FG-GAP_2	0.83
PF02583	Trns_repr_metal	0.83
PF13560	HTH_31	0.83
PF13751	DDE_Tnp_1_6	0.83
PF00005	ABC_tran	0.83
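
Because the frequencies are percentages of the 121 family scaffolds, each one corresponds to a whole number of scaffolds. The following is a minimal sketch of that conversion, assuming the page rounds each percentage to two decimals; the function name `pct_to_count` is hypothetical and not part of NMPFamsDB.

```python
FAMILY_SCAFFOLDS = 121  # "Number of Associated Scaffolds" reported for F072612


def pct_to_count(pct: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole count."""
    return round(pct / 100 * total)


# A few rows copied from the Pfam table for this family
pfam_pct = {"PF00106": 11.57, "PF04149": 7.44, "PF13577": 4.13, "PF00005": 0.83}
pfam_counts = {pfam: pct_to_count(pct) for pfam, pct in pfam_pct.items()}
print(pfam_counts)
```

Read this way, the smallest reported frequency (0.83) is a single scaffold, so the rarer neighbouring domains rest on very few observations.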

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 121 Family Scaffolds
COG1357	Uncharacterized conserved protein YjbI, contains pentapeptide repeats	Function unknown [S]	1.65
COG1937	DNA-binding transcriptional regulator, FrmR family	Transcription [K]	0.83
COG3850	Signal transduction histidine kinase NarQ, nitrate/nitrite-specific	Signal transduction mechanisms [T]	0.83
COG3851	Signal transduction histidine kinase UhpB, glucose-6-phosphate specific	Signal transduction mechanisms [T]	0.83
COG4564	Signal transduction histidine kinase	Signal transduction mechanisms [T]	0.83
COG4585	Signal transduction histidine kinase ComP	Signal transduction mechanisms [T]	0.83



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	80.99 %
All Organisms	root	All Organisms	19.01 %

Associated Scaffolds


Scaffold	Taxonomy	Length
3300003505|JGIcombinedJ51221_10104916	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1126
3300005187|Ga0066675_10174323	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1494
3300005439|Ga0070711_100409173	Not Available	1103
3300005536|Ga0070697_100064075	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria	3002
3300005610|Ga0070763_10389722	Not Available	781
3300006046|Ga0066652_100596606	Not Available	1041
3300006175|Ga0070712_100164172	Not Available	1717
3300006176|Ga0070765_100270794	All Organisms → cellular organisms → Bacteria	1563
3300006804|Ga0079221_10192552	Not Available	1109
3300006893|Ga0073928_10021789	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	6546
3300009012|Ga0066710_102499354	Not Available	747
3300010048|Ga0126373_10386763	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1420
3300010358|Ga0126370_10193587	Not Available	1529
3300010366|Ga0126379_10255508	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1729
3300010366|Ga0126379_10491699	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1297
3300010373|Ga0134128_10227484	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. G1	2095
3300010376|Ga0126381_100273848	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptosporangiales → Streptosporangiaceae	2300
3300010880|Ga0126350_11139239	Not Available	2740
3300010937|Ga0137776_1621232	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	6388
3300010937|Ga0137776_1655395	Not Available	1846
3300012210|Ga0137378_10042159	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptosporangiales → Thermomonosporaceae → Actinomadura → Actinomadura nitritigenes	4087
3300012356|Ga0137371_10176293	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1676
3300012363|Ga0137390_10156960	Not Available	2262
3300012683|Ga0137398_10245816	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1191
3300016294|Ga0182041_10264280	Not Available	1408
3300016319|Ga0182033_10477589	Not Available	1067
3300016341|Ga0182035_10468952	Not Available	1071
3300016404|Ga0182037_10347016	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Devosiaceae	1208
3300018433|Ga0066667_10443736	Not Available	1058
3300020199|Ga0179592_10135555	Not Available	1129
3300020582|Ga0210395_10010185	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria	6994
3300021171|Ga0210405_10354462	Not Available	1157
3300021180|Ga0210396_10544860	Not Available	1012
3300021407|Ga0210383_10274394	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1448
3300021433|Ga0210391_10497870	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	957
3300021477|Ga0210398_10100756	All Organisms → cellular organisms → Bacteria	2345
3300022722|Ga0242657_1072889	Not Available	800
3300025915|Ga0207693_10010429	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria	7540
3300025915|Ga0207693_10160341	Not Available	1769
3300025928|Ga0207700_10981394	Not Available	756
3300025929|Ga0207664_10136741	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	2068
3300026550|Ga0209474_10263380	Not Available	1051
3300027867|Ga0209167_10114340	Not Available	1392
3300028906|Ga0308309_10199248	Not Available	1649
3300031546|Ga0318538_10095297	Not Available	1529
3300031564|Ga0318573_10106448	Not Available	1444
3300031573|Ga0310915_10488459	Not Available	875
3300031668|Ga0318542_10081268	Not Available	1536
3300031708|Ga0310686_108039901	Not Available	1515
3300031708|Ga0310686_108949501	Not Available	852
3300031715|Ga0307476_10308831	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Pseudonocardiales → Pseudonocardiaceae → Amycolatopsis → Amycolatopsis lexingtonensis	1161
3300031724|Ga0318500_10177799	Not Available	1012
3300031764|Ga0318535_10070889	Not Available	1491
3300031769|Ga0318526_10225422	Not Available	766
3300031779|Ga0318566_10006173	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria	4666
3300031796|Ga0318576_10047714	Not Available	1845
3300031798|Ga0318523_10097150	Not Available	1442
3300031799|Ga0318565_10060672	Not Available	1771
3300031832|Ga0318499_10052163	Not Available	1532
3300032010|Ga0318569_10262465	Not Available	803
3300032060|Ga0318505_10250217	Not Available	834
3300032064|Ga0318510_10206449	Not Available	795
3300032205|Ga0307472_100813589	Not Available	855
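
Each scaffold identifier joins a numeric prefix and a scaffold name with `|`. On this page the prefix matches a Taxon OID in the Associated Samples table (3300003505, for example, appears in both), so splitting at the `|` appears to be enough to look up the source sample. A minimal sketch, in which `ScaffoldRef` and `parse_scaffold_id` are hypothetical names introduced only for illustration:

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # numeric prefix, e.g. "3300003505"
    scaffold: str   # scaffold name within that sample


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split 'taxonOID|scaffoldName' at the first '|'."""
    taxon_oid, sep, scaffold = scaffold_id.partition("|")
    # Assumption: the prefix is all digits and both parts are non-empty
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold)


ref = parse_scaffold_id("3300003505|JGIcombinedJ51221_10104916")
print(ref.taxon_oid, ref.scaffold)
```

Keeping the prefix as a string avoids any accidental numeric reformatting when it is used as a lookup key.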




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	41.32%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	8.26%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	7.44%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	7.44%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	7.44%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	5.79%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	5.79%
Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil	3.31%
Sediment	Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Sediment → Sediment	1.65%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	1.65%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	1.65%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	1.65%
Agricultural Soil	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil	1.65%
Iron-Sulfur Acid Spring	Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring	0.83%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.83%
Surface Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil	0.83%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	0.83%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	0.83%
Boreal Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil	0.83%

Associated Samples

Taxon OID	Sample Name	Habitat Type
3300000364	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300003505	Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)	Environmental
3300005180	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134	Environmental
3300005186	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125	Environmental
3300005187	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124	Environmental
3300005435	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG	Environmental
3300005439	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005549	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG	Environmental
3300005610	Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3	Environmental
3300005921	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6	Environmental
3300006032	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145	Environmental
3300006046	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101	Environmental
3300006175	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG	Environmental
3300006176	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5	Environmental
3300006755	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter	Environmental
3300006804	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200	Environmental
3300006806	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100	Environmental
3300006893	Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG	Environmental
3300006903	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5	Host-Associated
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300010048	Tropical forest soil microbial communities from Panama - MetaG Plot_11	Environmental
3300010358	Tropical forest soil microbial communities from Panama - MetaG Plot_3	Environmental
3300010361	Tropical forest soil microbial communities from Panama - MetaG Plot_23	Environmental
3300010362	Tropical forest soil microbial communities from Panama - MetaG Plot_22	Environmental
3300010366	Tropical forest soil microbial communities from Panama - MetaG Plot_24	Environmental
3300010373	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4	Environmental
3300010376	Tropical forest soil microbial communities from Panama - MetaG Plot_28	Environmental
3300010398	Tropical forest soil microbial communities from Panama - MetaG Plot_35	Environmental
3300010880	Boreal forest soil eukaryotic communities from Alaska, USA - C5-1 Metatranscriptome (Eukaryote Community Metatranscriptome)	Environmental
3300010937	Fumarole sediment microbial communities, Furnas, Sao Miguel, Azores. Combined Assembly of Gp0156138, Gp0156139	Environmental
3300011269	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG	Environmental
3300011270	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG	Environmental
3300012210	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG	Environmental
3300012211	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG	Environmental
3300012285	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG	Environmental
3300012356	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG	Environmental
3300012363	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG	Environmental
3300012683	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG	Environmental
3300016294	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178	Environmental
3300016319	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H	Environmental
3300016341	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170	Environmental
3300016404	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082	Environmental
3300016422	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111	Environmental
3300016445	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108	Environmental
3300018433	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116	Environmental
3300020199	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)	Environmental
3300020579	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M	Environmental
3300020582	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O	Environmental
3300021088	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M	Environmental
3300021171	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M	Environmental
3300021180	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O	Environmental
3300021401	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O	Environmental
3300021402	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O	Environmental
3300021403	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O	Environmental
3300021404	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O	Environmental
3300021405	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O	Environmental
3300021407	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O	Environmental
3300021433	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O	Environmental
3300021477	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O	Environmental
3300021478	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M	Environmental
3300021559	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M	Environmental
3300022722	Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)	Environmental
3300025915	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)	Environmental
3300025922	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)	Environmental
3300025928	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)	Environmental
3300025929	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)	Environmental
3300026550	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)	Environmental
3300027725	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)	Environmental
3300027867	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes)	Environmental
3300028906	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)	Environmental
3300029701	Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O (Metagenome Metatranscriptome)	Environmental
3300031545	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26	Environmental
3300031546	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f23	Environmental
3300031564	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f21	Environmental
3300031573	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111	Environmental
3300031640	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23	Environmental
3300031668	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f23	Environmental
3300031708	FICUS49499 Metagenome Czech Republic combined assembly	Environmental
3300031715	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05	Environmental
3300031724	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20	Environmental
3300031764	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f27	Environmental
3300031769	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f24	Environmental
3300031770	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17	Environmental
3300031779	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f22	Environmental
3300031796	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f24	Environmental
3300031798	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f19	Environmental
3300031799	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21	Environmental
3300031821	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20	Environmental
3300031831	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f20	Environmental
3300031832	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f25	Environmental
3300031845	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f18	Environmental
3300031846	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19	Environmental
3300031896	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19	Environmental
3300031910	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)	Environmental
3300031912	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)	Environmental
3300031941	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080	Environmental
3300032001	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)	Environmental
3300032009	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f19	Environmental
3300032010	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f22	Environmental
3300032039	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f21	Environmental
3300032043	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f24	Environmental
3300032044	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f20	Environmental
3300032060	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18	Environmental
3300032064	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f17	Environmental
3300032067	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f22	Environmental
3300032068	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f21	Environmental
3300032205	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05	Environmental
3300032261	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)	Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10022113213300000364SoilPPHPVLKTTQLAKAGCEFLISIEPSRNMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGLADNAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAVAKGGFADVISSASDPRIPGIHRVAQAF*
JGIcombinedJ51221_1010491613300003505Forest SoilYVDAANVFDGYVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALVAKWLTMMNGSGIPYRVVLYSESNDKAFKTAAEWXAYWSYYAPVVKDAGVTLAYDCGMGFKALPRAAAYFPSNPSPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIGSASDPRIPGIHRVTQAF*
Ga0066685_1019003023300005180SoilPPHPPLKATQLAKTGCEFLISIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCAYIVSRATAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0066676_1037120123300005186SoilPPLKATQLAKTGCEFLISIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCAYIVSRATAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0066675_1017432323300005187SoilRRVSLPAKRSAATTPLVGATVDLASYKSKNYLDAANTFDGLVGLPMATTLQKIYMGHGELPPHPPLKATQLAKTGCEFLISIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCAYIVSRATAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0070714_10133692513300005435Agricultural SoilQALLAKWLTMMNGSGIPYRVALYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAIAKGGRADVIDSASDPRIPGIHRVAQAF*
Ga0070711_10040917313300005439Corn, Switchgrass And Miscanthus RhizosphereMQAPVTTRREREPLQGPSRREFLGGAAGIAAITALPASATIPARRTTLPSRRSTATTPAVGATVDLASYGGKNYLDAAHTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWN
Ga0070697_10006407523300005536Corn, Switchgrass And Miscanthus RhizosphereLRGLSRREFLGGAAGVAAVAALPASATIPSRHPSLPARRGAAITPVVGATVDLASYSVKNYLDAANIYDGFVGLPLATTIQKVYMGHGEFPPHPVLKMTQLAKAGCEFLVSIEPSRNMVASEQALLAKWLTMMNGSGIPYRVALYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIISLASAGKLPLGAIFFNAIAKGGRADVIDSASDPRIPGIHRVAQAF*
Ga0070704_10126458513300005549Corn, Switchgrass And Miscanthus RhizosphereALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGAVLAYNCGCGFKALPRAAAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGLADTAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAIAKGGRADVIDSASDPRIPGIHRVAQAF*
Ga0070763_1038972213300005610SoilPSRRSAATTPVVGATVDLASYGGKNYLDAANTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASD
Ga0070766_1051810713300005921SoilGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVVQAF*
Ga0066696_1018510123300006032SoilLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCAYIVSRATAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0066652_10059660623300006046SoilIPARRVSLPAKRSAATTPLVGATVDLASYKSKNYLDAANTFDGLVGLPMATTLQKIYMGHGELPPHPPLKATQLAKTGCEFLISIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIISRASAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0070712_10016417223300006175Corn, Switchgrass And Miscanthus RhizosphereVDLASYGGKNYLDAAHTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF*
Ga0070765_10027079423300006176SoilMTTRREREPLQGPSRREFLGGAAGIAAITALPASATIPARRTGLPSRRSAATTPVVGATVDLASYGGKNYLDAANTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVVQAF*
Ga0079222_1107541313300006755Agricultural SoilKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLAMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGLADNAGIPAGVAEWDWSAGNLIFTPMTLPWWNAYCEYIISRASAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0079222_1207745313300006755Agricultural SoilLVSIEPSRSMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKAAGVTLAYNCGCGFKSLPRAEAYFPSNPTPDELRMDFYATSFRGGSRLDTLIGLADNAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVTLATAGKLGLGAIFFNAVAKGGFADVISSAS
Ga0079221_1019255213300006804Agricultural SoilLQGLSRRGFLGGAAGVAALAALPASVTIPSRHASLPARHPGVPARRSAATTPAFGSTVDLASYTSKNYVDAANTFDGFVGLPMATTLQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRAMVASEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGQADNAGIPAGIAEWDWSAGNPIFTPMTLPWWN
Ga0079221_1027401113300006804Agricultural SoilYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPEELWMDFYATSFRGGSRIDTLIGLADSAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGRLGLGAIFFNAVAKGGFADVISSASDPRIPGIHRVAQAF*
Ga0079220_1086871513300006806Agricultural SoilSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPEELWMDFYATSFRGGSRIDTLIGLADSAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGRLGLGAIFFNAVAKGGFADVISSASDPRIPGIHRVAQAF*
Ga0079220_1099742913300006806Agricultural SoilGATVDLASYKSKNYLDAANTFDGLVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGQADNAGIPAGIAEWDWSAGNPIFTPMTLPWWNAYCEYIVTLATA
Ga0073928_1002178983300006893Iron-Sulfur Acid SpringSLTVPSGRTSRPARRSAATTPVVGATVDLASYHTKNYVDAANTFDGFVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIASASDPRIPGIHRVTQAF*
Ga0075426_1146593113300006903Populus RhizosphereQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKAAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGLADNAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAIAKGGRADVIDSASD
Ga0066710_10249935413300009012Grasslands SoilGATVDLASYKSKNYLDAANTFDGLVGLPMATTLQKIYMGHGELPPHPPLKATQLAKTGCEFLISIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRLDTLIGQADNAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVTLATAGKLGLGAIFFNAVAKGGFADVISSASDP
Ga0126373_1038676323300010048Tropical Forest SoilVQGLSRREFLGGAAGIVAVTALPASAATPARHASHPPRRSAATTPLVGATVDLASYGGKNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRSMVASEQAVLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGVILAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF*
Ga0126370_1019358723300010358Tropical Forest SoilVQGLSRREFLGGAAGIVAVTALPASATTPARHASHPARRSVATTPLVGATVDLASYGSRNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRSMVASEQALLAKWLIMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVIKNAGVVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKG
Ga0126378_1057447013300010361Tropical Forest SoilPPLKMTQLAKTGCEFLVSIEPSRSMVASEQAVLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF*
Ga0126377_1290080213300010362Tropical Forest SoilGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVTLATAGKL
Ga0126379_1025550833300010366Tropical Forest SoilVQGLSRREFLGGAAGIVAVTALPASAATPARHASHPPRRSAATTPLVGATVDLASYGGKNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRSMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF*
Ga0126379_1049169913300010366Tropical Forest SoilLASYGGKNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPYPLLKMTQLAKTGCEFLVSIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAQEWFAYWSFYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF*
Ga0126379_1320565413300010366Tropical Forest SoilFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVASEQALVAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKEAGVTLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKL
Ga0134128_1022748413300010373Terrestrial SoilGLPLATTIQKVYMGHGEFPPHPVLKMTQLAKAGCEFLVSIEPSRNMVATEQALLAKWLTMMNGSGIPYRVALYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAIAKGGRADVIASASDPRIPGIHRVAQAF*
Ga0126381_10027384823300010376Tropical Forest SoilVQGLSRREFLGGAAGIVAVTALPASAATPARHASHPPRRSAATTPLVGATVDLASYGGKNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLLWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVGQAF*
Ga0126383_1010990753300010398Tropical Forest SoilSMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGVVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF*
Ga0126350_1113923923300010880Boreal Forest SoilVDLASYGGKNYLDAANTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLAAAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF*
Ga0137776_162123253300010937SedimentVDLASYGSKNYVDAAHTFDGYVGLPLATTIQKVYMGHGEFPPHPPLKMTQLANAGCEFLVSIEPSRNMVASEQALLAKWLAMMNNSGIPYRVVLYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVVCAHDCGCGFKALPRAAAYFPSNPGPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAIAKGGRADVIASASDPRIPGIHRVTQAF*
Ga0137776_165539523300010937SedimentVFGATVDLASYGSKTYVDAANTFDGYAGLPLATTIQKVYMGHGEFPPHPVLKMTQLVKAGCEFLVSIEPSRNMVASEQTLLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYDCGCGFKALPRAAAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFWNALGKGGRADVIASAADPRIPGIHRVAQAF*
Ga0137392_1110089713300011269Vadose Zone SoilLLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGNLIFTPMTLPWWNAYCDYLVTLAAAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVAQAF*
Ga0137391_1044661013300011270Vadose Zone SoilGEFPPHPPLKMTQLAKAGCEFLVSIEPSRTMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLAAAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVAQAF*
Ga0137378_1004215923300012210Vadose Zone SoilVDLASYGDKNYLDAANIFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRTMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLAAAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVAQAF*
Ga0137377_1002581463300012211Vadose Zone SoilMMTAAIEPSRNMVPSEPTLLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPAPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCGYIVSRASAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0137370_1010369123300012285Vadose Zone SoilLKATQLAKTGCEFLISIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIISRASAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0137371_1017629313300012356Vadose Zone SoilKRSAATTPLVGATVDLASYRSKNYLDAANTFDGLVGLPMATTLQKIYMGHGEFPPHPPLKATQLAKTGCEFLVSIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCAYIVSRATAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF*
Ga0137390_1015696013300012363Vadose Zone SoilVDLASYGDKNYLDAANIFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRTMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLAAAGKLSLGSIFFNAVAKGGRADVIDSASDPRI
Ga0137398_1024581613300012683Vadose Zone SoilAATTPAVGATVDLASYGGKNYLDAANIFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGNLIFTPMTLPWWNAYCDYLVTLAAAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF*
Ga0182041_1026428013300016294SoilMRTQREREPVQGLSRREFLGGGAGIVAITALPASAAIPARHASHPARRRAATAPLVGATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLP
Ga0182033_1047758913300016319SoilAVAALPASVTTPARLASHPARRRAATTPLVGATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0182035_1046895213300016341SoilRTRREREPVQGLSRREFLGGAAGVAAVAALPASVTIPARHTSHPARRSAATTPLVGATVDLASYGGKNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRSMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0182037_1034701623300016404SoilPSRHARLPARHAAATTPVFGSTVDLASYSSKNYLDAANTFDGFAGLPMATTIQKVYMGHGELPPHPPLKMTQLAKAGCEFLVSIEPSKNMVASEQALLAKWLTMMNNSGIRYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0182039_1187382213300016422SoilVASEQALLAKWLTMMNNSGIRYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRIDTLIGQANAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIGSLGSAGKLQLGTIFWNAIGKGGRADVIDSASDPRIPGI
Ga0182038_1055302623300016445SoilKMTQLAKTGCEFLVSIEPSRSMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSYYARVVKNAGVTLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIGSLGSAGKLQLGTIFWNAIGKGGRADVIDSASAPRIPGVHRVAQAF
Ga0066667_1044373613300018433Grasslands SoilEREPVQGLSRREFLGGAAGIASVTALPASATIPARLASIPANRSAATTPLVGATVHLAGHMSKNYLDAANTFDGLVGLPMATTLQKIYMGHGELPPHPPLKATQLAKTGCEFLISIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDILIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCAYIVSRATAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF
Ga0179592_1013555513300020199Vadose Zone SoilAITALPASAAIPARRTSLPSRRGAATTPAVGATVDLASYGGKNYLDAANIFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLARAGCEFLVSIEPSRSMAPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGNLIFTPMTLPWWNAYCDYLVTLAAAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF
Ga0210407_1084598213300020579SoilKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVVQAF
Ga0210395_1001018523300020582SoilVTIRKGKPLQGPSRREFFGGAAGIVAAAALPASLTVPSGRTSRPARRSAATTPVVGATVDLASYHTKNYVDAANVFDGYVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALVAKWLTMMNGSGIPYRVVLYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVTLAYDCGMGFKALPRAAAYFPSNPSPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIGSASDPRIPGIHRVTQAF
Ga0210404_1088344913300021088SoilNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPTGIAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIASASDPRIPGIHR
Ga0210405_1035446223300021171SoilMMTRREREPLQGPSRREFLGGAAGIAAITALPASATIPARRTGLPARRSAATTPVVGATVDLASYGGKNYLDAANTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVV
Ga0210405_1062590113300021171SoilQLAKAGCEFLVSIEPSKTMTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPTGIAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIASASDPRIPGIHRVIQAF
Ga0210396_1054486013300021180SoilVTIRKGKPLQGPSRREFIGGAAGIAAVTALPASLIVPSGRASRPARRSAATTPVVGATVDLASYHTKNYVDAANTFDGFVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIASASDPRIPGIHRVIQAF
Ga0210393_1066629513300021401SoilFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVVQAF
Ga0210385_1083650223300021402SoilTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYDCGMGFKALPRAAAYFPSNPSPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIGSASDPRIPGIHRVTQAF
Ga0210397_1020880723300021403SoilLSMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVVQAF
Ga0210397_1126046813300021403SoilHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPTGIAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIF
Ga0210389_1066842913300021404SoilDGFVGLPMATAIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIASASDPRIPGIHRVTQAF
Ga0210387_1092213913300021405SoilCEFLVSIEPSRSMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVVQAF
Ga0210383_1027439423300021407SoilVTIRKGKPLQGPSRREFIGGAAGIAAVTALPASLTVPSGRTSRPARRSAATTPVVGATVDLASYHTKNYVDAANTFDGFVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIASASDPRIPGIHRVIQAF
Ga0210391_1049787023300021433SoilYHTKNYVDAANVFDGYVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALVAKWLTMMNGSGIPYRVVLYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVTLAYDCGMGFKALPRAAAYFPSNPSPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIGSASDPRIPGIHRVTQAFYPPPGASQA
Ga0210398_1010075633300021477SoilALPASLTVPSGRTSRPARRSAATTPVVGATVDLASYHTKNYVDAANVFDGYVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALVAKWLTMMNGSGIPYRVVLYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVTLAYDCGMGFKALPRAAAYFPSNPSPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIGSASDPRIPGIHRVTQAF
Ga0210402_1017439413300021478SoilVDLASYAVKNYLDAANIYDGFVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRAMVPSEQALLAKWLTMMNNSGIPYRVALYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPPNPGPDELWMDFYATSFRGGSRIDTLIGLADAVGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATADKLTL
Ga0210402_1064339513300021478SoilVTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF
Ga0210409_1014533023300021559SoilMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF
Ga0242657_107288913300022722SoilNYLYAANTFDGFVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCDFLVSIEPSRTMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPTGIAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF
Ga0207693_1001042913300025915Corn, Switchgrass And Miscanthus RhizospherePVVGATVDLASYAVKNYLDAANIYDGFVGLPLATTIQKVYMGHGEFPPHPVLKMTQLAKAGCEFLVSIEPSRNMVASEQALLAKWLTMMNGSGIPYRVALYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAIAKGGRADVIASASDPRIPGIHRVAQAF
Ga0207693_1016034123300025915Corn, Switchgrass And Miscanthus RhizosphereVDLASYGGKNYLDAAHTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVTQAF
Ga0207646_1119649813300025922Corn, Switchgrass And Miscanthus RhizospherePHPPLKMTQLAKAGCEFLVSIEPSRNMVASEQAVLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGQADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASPGKLPLGAIFFNAVAKGGFADVISSASDPRIPGIHRVAQAF
Ga0207700_1073452423300025928Corn, Switchgrass And Miscanthus RhizosphereLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGQADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASPGKLPLGAIFFNAVAKGGFADVISSASDPRIPGIHRVAQAF
Ga0207700_1098139413300025928Corn, Switchgrass And Miscanthus RhizosphereAALPASATIPSRHASLPAGRGAAITPVVGATVDLASYAVKNYLDAANIYDGFVGLPLATTIQKVYMGHGEFPPHPVLKMTQLAKAGCEFLVSIEPSRNMVASEQALLAKWLTMMNGSGIPYRVALYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGK
Ga0207664_1013674113300025929Agricultural SoilAANTFDGLVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVASEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGLADNAGIPAGVAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAVAKGGFADVISSASDPRIPGIHRVAQAF
Ga0209474_1026338013300026550SoilANTFDGLVGLPMATTLQKIYMGHGELPPHPPLKATQLAKTGCEFLISIEPSRNMVVSEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCAYIVSRATAGKLPLGSIFFNAVAKGGFADVISSASDPRIPGVHRVAQAF
Ga0209178_109874313300027725Agricultural SoilYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPEELWMDFYATSFRGGSRIDTLIGLADSAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGRLGLGAIFFNAVAKGGFADVISSASDPRIPGIHRVAQAF
Ga0209167_1011434023300027867Surface SoilVTIRKGKPLQGPSRREFIGGAAGIAAVTALPASLTVPSGRPGRPTRRSAATTPVVGATVDLASYHTKNYVDAANTFDGFVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQATLSKWLTMMNNSGINYRVVLYSECNDKAFKTSSEWFAYWSYYAPVVKDAGVVCAYDCGMGFKALPRASAYFPSNPTPDELWMDFYATSFRGGSRLEEVMGLAEDAGIPAGIAEWNWSAGDLIFTPMTMPWWNAYCEYIANLATAGKLTLGAIFFNAVAKGGRADVIASTQDPRIPGVHRVVSAF
Ga0308309_1019924823300028906SoilMTTRREREPLQGPSRREFLGGAAGIAAITALPASATIPARRTGLPSRRSAATTPVVGATVDLASYGGKNYLDAANTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYNCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVVQAF
Ga0222748_102712123300029701SoilPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPTGIAEWDWSAGNLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIASASDPRIPGIHRVTQAF
Ga0318541_1021008823300031545SoilTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318538_1009529713300031546SoilDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318573_1010644823300031564SoilRAATAPLVGATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0310915_1048845913300031573SoilNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALLGKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIGSASDPRIPGVHRVAQAF
Ga0318555_1047906413300031640SoilTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318542_1008126823300031668SoilNYLDTANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0310686_10803990113300031708SoilVTIRKGKPLQGPSRREFFGGAAGIVAVAALPASLTVPSGRTSHPARRSAATTPVVGATVDLASYHTKNYVDAANVFDGYVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALVAKWLTMMNGSGIPYRVVLYSESNDKAFKTAAEWFAYWSYYAPVVKDAGVTLAYDCGMGFKALPRAAAYFPSNPSPDELWMYFYATSFRGGSRLDTLIGLADAAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCDYIVTLATAGKLPLGAIFFNAVAKGGTADVIGS
Ga0310686_10894950113300031708SoilARAIGAAVTIRKGKPLQGPSRREFIGGTAGIAAVTALPASLTVPSGRTSRPARRSAATTPVVGATVDLASYHTKNYVDAANTFDGFVGLPLATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSKTMTSAEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGATLAYDCGMGFKALPRAAAYFPSNPSPDEVWMDFYATSFRGGSRLDTVIGLADAAGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCDYIVTLATA
Ga0307476_1030883113300031715Hardwood Forest SoilAVAALPASATIPSRHASLRARRSAATTPAFGATVDLASYSSKNYLDAANTFDGFVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRNMVPSEQALLAKWLTMMNSSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVVCAYDCGCGFKALPRAAAYFPSNPAPDELWMDFYATSFRGGSRLDTLIGLADAAGIPAGVAEWDWSAGDLIFTPMTLPWWNAYCDYLVTLATAGKLSLGSIFFNAVAKGGTADVIASASDPRIPGIHRVIQAF
Ga0318500_1017779913300031724SoilGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVAQAF
Ga0318535_1007088923300031764SoilIPARHASHPARRRAATAPLVGATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318526_1022542213300031769SoilGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318521_1078475013300031770SoilSKNMVASEQALLAKWLTMMNNSGIRYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRIDTLIGQANAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIASLATAGKLELGTIFWNAIGKGGRADVIDSASDPRIPGIHRVAQA
Ga0318566_1000617313300031779SoilAVTALPASAAIPARHASHPARRRAATAPLVGATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318576_1004771413300031796SoilVTALPASAAIPARHASHPARRRAATAPLVGATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318523_1009715023300031798SoilPTSAATAPLVGATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318565_1006067233300031799SoilTVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318567_1004478113300031821SoilPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318564_1010910413300031831SoilSRSMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318499_1005216323300031832SoilATVDLASYGGKNYLDAANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318511_1000491953300031845SoilNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVISSASDPRIPGVHRVAQAF
Ga0318512_1015981013300031846SoilGLPLATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318551_1058934713300031896SoilGFAGLPMATTIQKVYMGHGELPPHPPLKMTQLAKAGCEFLVSIEPSKNMVASEQALLAKWLTMMNNSGIRYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRIDTLIGQANAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIASLATAGKLELGTIFWNAIG
Ga0306923_1238119213300031910SoilSKNMVASEQALLAKWLTMMNNSGIRYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRIDTLIGQANAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIGSLGSAGKLQLGTIFWNAIGKGGRADV
Ga0306921_1209578713300031912SoilHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGG
Ga0310912_1019388023300031941SoilFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSYYAPVVKNAGVTLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0306922_1055129123300032001SoilAKTGCEFLVSIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318563_1001031753300032009SoilLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318569_1026246513300032010SoilTVDLASYSSKNYLDAANTFDGFAGLPMATTIQKVYMGHGELPPHPPLKMTQLAKAGCEFLVSIEPSKNMVASEQALLAKWLTMMNNSGIRYRVVLYSEANDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRIDTLIGQANAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIGSLGSAGKLQLGTIFWNAIGKGGRADVIDSASDPRIPGIHRVAQAF
Ga0318559_1034142413300032039SoilTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALLAKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSYYAPVVKNAGVTLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318556_1040772013300032043SoilFDGFAGLPMATTIQKVYMGHGELPPHPPLKMTQLAKAGCEFLVSIEPSKNMVASEQALLAKWLTMMNNSGIRYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGVTLAYNCGCGFKALPRAEAYFPSNPTPDELWMDFYATSFRGGSRIDTLIGQANAAGIPAGIAEWNWSAGDLIFTPMTLPWWNAYCEYIGSLGSAGKLQLGTIFWNAIGKGGRADVIDSASDPRIPGIHR
Ga0318558_1056146413300032044SoilWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318505_1025021713300032060SoilGATVDLASYGGKNYLDAANTFDGFVGLPMATTLQKIYMGHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318510_1020644913300032064SoilYLDTANTFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGIHRVAQAF
Ga0318524_1008552523300032067SoilASEQALLGKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSYYAPVVKNAGVTLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQAF
Ga0318553_1005600013300032068SoilFDGLVGLPMATTLQKVYMSHGEFPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALIAKWLTMMNSSGIPYRVVLYSEANDKAFKTAAEWFAYWSFYAPVVKDAGAVLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIDSASDPRIPGVHRVAQA
Ga0307472_10081358913300032205Hardwood Forest SoilASYSSKNYLDAANTFDGLVGLPMATTIQKVYMGHGEFPPHPPLKMTQLAKAGCEFLVSIEPSRSMVPSEQALLAKWLTMMNNSGIPYRVVLYSESNDKAFKTAPEWFAYWSYYAPVVKDAGAVLAYNCGCGFKALPRAAAYFPSNPTPDELWMDFYATSFRGGSRLDTLIGLADTAGIPAGIAEWDWSAGNLIFTPMTLPWWNAYCEYIVSLASAGKLPLGAIFFNAIAKGGRADVIGSASDPRIPGIHRVAQAF
Ga0306920_10253094713300032261SoilPPHPPLKMTQLAKTGCEFLVSIEPSRNMVASEQALLGKWLTMMNNSGIPYRVVLYSEANDKAFKTAAEWFAYWSYYAPVVKNAGVTLAYNCGCGFKALPRAEAYFPSNPSPDELWMDFYATSFRGGSRIDTLIGQADAIGIPAGIAEWDWSAGDLIFTPMTLPWWNAYCEYIVSLASAGKLSLGSIFFNAVAKGGRADVIGSASDPRIPGVHRVAQAF




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"