NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F092344

Metagenome / Metatranscriptome Family F092344

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F092344
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 200 residues
Representative Sequence DEVNPLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARARLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYL
Number of Associated Samples 98
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 1.87 %
% of genes from short scaffolds (< 2000 bps) 1.87 %
Associated GOLD sequencing projects 95
AlphaFold2 3D model prediction Yes
3D model pTM-score0.73

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.131 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(22.430 % of family members)
Environment Ontology (ENVO) Unclassified
(18.692 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(41.121 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 52.80%    β-sheet: 6.07%    Coil/Unstructured: 41.12%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.73
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
a.29.13.1: Bacillus cereus metalloprotein-liked3d19a13d190.70442
a.29.13.1: Bacillus cereus metalloprotein-liked3d19a23d190.70119
a.29.13.1: Bacillus cereus metalloprotein-liked3dbya23dby0.6983
a.29.5.0: automated matchesd2e0aa12e0a0.68788
f.70.1.1: Extracellular adhesion domain of SabAd4o5ja_4o5j0.68617


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 107 Family Scaffolds
PF12759HTH_Tnp_IS1 0.93
PF13808DDE_Tnp_1_assoc 0.93



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A98.13 %
All OrganismsrootAll Organisms1.87 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300018466|Ga0190268_10121361All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis1267Open in IMG/M
3300026376|Ga0257167_1009876All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis1256Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil22.43%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil14.02%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil8.41%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil5.61%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.61%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil4.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil4.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil3.74%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand3.74%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.80%
AgaveHost-Associated → Plants → Phyllosphere → Phylloplane/Leaf Surface → Unclassified → Agave2.80%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.87%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.87%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment0.93%
SedimentEnvironmental → Aquatic → Marine → Oceanic → Sediment → Sediment0.93%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.93%
TerrestrialEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.93%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.93%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.93%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.93%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.93%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.93%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.93%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.93%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.93%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.93%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.93%
AgaveHost-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave0.93%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300004016Agave microbial communities from Guanajuato, Mexico - As.Ma.rzHost-AssociatedOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005841Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2Host-AssociatedOpen in IMG/M
3300006194Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1Host-AssociatedOpen in IMG/M
3300006421Deep-sea sediment bacterial and archaeal communities from Fram Strait - Hausgarten IEnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006876Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200EnvironmentalOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011439Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT820_2EnvironmentalOpen in IMG/M
3300012022Terrestrial microbial communites from a soil warming plot in Okalahoma, USA - C6EnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012896Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2EnvironmentalOpen in IMG/M
3300012901Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S119-311C-1EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300014488Bulk soil microbial communities from Mexico - San Felipe (SF) metaGEnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300017792Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaGHost-AssociatedOpen in IMG/M
3300017965Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 220 TEnvironmentalOpen in IMG/M
3300018465Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 ISEnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026029Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026359Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-AEnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026475Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-AEnvironmentalOpen in IMG/M
3300027379Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027381Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027383Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027561Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027650Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeqEnvironmentalOpen in IMG/M
3300027691Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027750Agave microbial communities from Guanajuato, Mexico - As.Ma.rz (SPAdes)Host-AssociatedOpen in IMG/M
3300027761Agave microbial communities from Guanajuato, Mexico - As.Sf.rz (SPAdes)Host-AssociatedOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027886Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300028705Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_115EnvironmentalOpen in IMG/M
3300028708Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_152EnvironmentalOpen in IMG/M
3300028799Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028889Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2EnvironmentalOpen in IMG/M
3300030514Agave microbial communities from Guanajuato, Mexico - Mg.Ma.rz (v2)Host-AssociatedOpen in IMG/M
3300030606Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT145D125EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031093Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031228Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT153D57EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031424Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_150 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031679Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23EnvironmentalOpen in IMG/M
3300031680Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031769Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f24EnvironmentalOpen in IMG/M
3300031781Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20EnvironmentalOpen in IMG/M
3300031845Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f18EnvironmentalOpen in IMG/M
3300031847Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D4EnvironmentalOpen in IMG/M
3300031854Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031897Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f16EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300032004Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-3Host-AssociatedOpen in IMG/M
3300032055Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f23EnvironmentalOpen in IMG/M
3300032064Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f17EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300034663Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034667Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
INPgaii200_104701612228664022SoilLRGLAQRCGRALTQLQKRGVQGLSGALEQVQIILRSVKEHHLFTSGQADKRQVLTRILTEVGALMVQTRPLIERLGTRSDRVIQGARARITAMHEVIKPLMRQIVHWLSTGQVAPNKIVHVGIPQARAIVRNKSGKKTEFGLAYLIGRLGGGYLFGRRIAANTDEKQMPLQALAEYRAIFGQTATPELVVY
JGI11643J11755_1153276913300000787SoilADTTAQELPIGYPNEPGILRGLAQRCGRALIQMNKRGMQGLESALPQVQTILRSVTEHHLFTPGKAAKREVLTRILKEVGALIVHTRPLVKRLETSADRVMQSARSRLLAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKTGKKTEFGLAYLISRLGGGYLFGERIAANADERQMPLK
JGI1027J12803_10134736313300000955SoilADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARSRLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLLGERIVANADERQMPLKALAGYRAIFGQEATPELVVYDRGGDS
Ga0058689_1017651913300004016AgaveAQRCGRALTQLKKRGIQGLDGALDQVQTILRSVKEHHLFTTGQADKRQVLTRILREVGELMVQTRALVERLVTPPDHVIQSARSRLMAMREVIKPLMGQIVHWVTTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLFGERITANADERQMPLKAL
Ga0066395_1050494513300004633Tropical Forest SoilAYAALGQAGIDAVNHLVSKEAHRYGFIDEGVLSADPTAQELPIGYPNEPGILRGLAQRCGRALTQLAKRGLRGLDRVQEQVQTILRSVKEHHLFTTGQADKREVLTRILKEVGALIVQTRPLVARLETSADRVIQSARARLMAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAIVRNKSGKQTEFGLAYLISRVGGGYLFGERIAANADERQMPLKALWGYRAIF
Ga0062594_10280556113300005093SoilEVNHLVLKEAHRYGFIDEGVLSADTTAQELPLGYPNEPGILRGLAQRCGRALTQLAKRGMCGLDRVQEQVQTILRSVNEHHLFTAGKADKREVLTRILKEVGALIVQTRPLVERLATSADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAMVRNKAGKKTEF
Ga0066388_10464556013300005332Tropical Forest SoilLGKAGIDEVNHLVIKAAQRYGLIDEGVVSADTTAQELPMGYPNDPGILRGRAQRCGRALMQLIKRGLGGLDRAQEQVQTILHSVQEHHLFTTGKADKREVLTRIVQEVGAWIVQTRPLVERLETSADRVIQSARARLMAMPEGIKPLGGQIVQWLSTGKVAADKIVHVGIPQARAIVRNKSGKKTECGLAYRISRVGGGYLFGERIAAKADARQMPRKALWGYRAIFGQDATP
Ga0066682_1070157213300005450SoilAQRGGRALTQMQQRGIAGLASALAQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAGIQRARSRLMAMHEVIKPLMGQMVHGIATGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYVLGERIVANADERQMPLKTLAGYRAIFGQEATPELVVYDRGGDSTPTRQQLALAGVK
Ga0070697_10173161413300005536Corn, Switchgrass And Miscanthus RhizosphereLPLGYPNEPGILRGLAQRCGRALTQLAKRGMCGLDRVQEQVQTILRSVNEHHLFTAGKADKREVLTRILKEVGALIVQTRPLVERLATSADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAMVRNKAGKKTEFGLAYLISRLGGGYLFGERLAANADERQMPLKALWG
Ga0068863_10042239823300005841Switchgrass RhizosphereMSILLQHNSGRSSLLFVNYFFAKTTLQQPWDQVQTILRSVKEPHLFTSGKADKRQVLIRILRAVGELMVQTRPLLQRLGTSADAGIQRARSRLMAMHEVIKPLMGQIVPWMATGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLLGERIVANADERQMPLKALAGYRAIFGQEATPELVV
Ga0075427_1003451813300006194Populus RhizosphereVVLMLIKTFDSRQMEAYVAENVVARVFIGRHGDAQAQIRDHSNIARAYAALGKAGLDELNHLVIKEAHRFGFVDERVLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMNKRGIQGLESALNQVQTILRSVKEHHLFTSGKAAKREVLTRILTEVGALIVHTRPLVKRLETSADRVIQSARLRLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRVGGGYLFGERIAANADERQMPLKALWGYRAIFGQ
Ga0075427_1011849013300006194Populus RhizosphereLGKAGIDEVNHLVIKAAHRYGFIDEGVLSANTTAQELPIGYPNEPGILRGLAQRCGRALTQLVKRGLCGLDRAQEQVQTILRSVKEHHLFAAGKAHKREVLTRILKEVGTLIVQTRPLVERLATSADRVIQRARARLLAMHEVIKPLLGQIVQWISTGKVAANKIVH
Ga0082247_1094615413300006421SedimentLPIGYPNEPGILRGLAQRVGRTLTQLKKRGIQGLESALDQTQTILRSVKEHHLFTKGREEKREVLTRILTEVGQLMVQTRPLIESLETSSDQAIQSARSRLVAMHEVIKPLMSQIVHWITTGQVAANKIIHVGIRQARAIVRNKAGKKTEFGLGYLISRLGGGYVF
Ga0075428_10190986713300006844Populus RhizosphereAQAQIRDHSNIARAYAALGKAGVDEVNHLVIKEAQRYGLIDEGVLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLAKRGICGLDRGQEQVQTILRSVKEHHLFAAGKAHKREVLTRILKEVGTLIVQTRPLVERLATSADRVIQRARARLLAMHEVIKPLLGQIVQWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFG
Ga0079217_1058485813300006876Agricultural SoilSADTTAQELPMGYPNEAGILRGLAQRCGRALTQLTKRGVQGLESALDQVQTILRSVKEHHLFTTGKADKHQVLTRILREVGGLMVQTRALVERLATPSNRVIQSARLRLMAMHEVIKPLMGQIVHWLTTGKVAANKIVHVGIPQARAIVRHKAGKKTEFGLAYLISRLGGGYVFGERMTANADERQMPLKALAGYRAIFGPKATPELVVYDRGGDSTPTRQRLALEGVRDVGIQPKGKR
Ga0079217_1113069713300006876Agricultural SoilRDVQCQIRDHSNIARAYAALGQAGIDEVTHLVIKEAHGFGFVDEGVMSSDTTAQELPIGYPNEPGILRGLAQRCGRALRRLKQRGVQGLEATQEQVETVLRSVKEHHLFTKSREEKCQVLTRILTEVGELMVQTRSLTERLSTSLDRVIQNARSRLMAMYEVIKPLMGQIVHWLTTGNVAPNKIVHVGIPQARAIV
Ga0079218_1208526013300007004Agricultural SoilHRFGFVDAGVVSSDTTAQELPIGYPNEPGILRGLAQRCGRALRRLKQRGVQGLEAALEQAETIVRSVKEHHLFAKSKADKRQVLTRILTEVGELMVQTRPLVERLATSWERVIQSGRSRLMAMHEVIKPLMGQIVHWLTTGHVAANKIVHVGIPQARAIVRHKAGKKTEFGLAYLISRLGGGYVFGERMTANADERQMPLKALAGYRAIFGPKATPE
Ga0099828_1107858113300009089Vadose Zone SoilDGGRGLPWDVSLYVPLVVLRLIKALDSRDMDAYLAENVVARVFIGRQHHATAQIRDHSNIARAYAALGKAGIEDVTHLLIREAHGVGFVDAGSLSADTTAQELPIGSPNEPGILRGLAQRCGRAFERLKQRGMAGLEGALDQVQTILRSVKEHHLFTRDKADKRQVLIRILREVGALMVQTRPLVEHLANRSDRVIQSARSRLMAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIP
Ga0111539_1202913413300009094Populus RhizosphereGPLVVLMLVKNLNARDMEAYLAENVVARVFLGRQDDPMPQIRDHSNIARAYAALGKAGVDEVNHLVIKEAHRFGFVDDGSLSADTTAQELPMGYPNEPGILRGLAQRCGRALTQLQKRGVQGLSGALEQVQTILRSVKEHHLFTSGQADKRQVLTRILTEAGALMVQTRTLIERLSTRSDRVMQSARSRLTAMHEVVKPLMRQIVHWLSTGQVAPNKMVHV
Ga0066709_10114679913300009137Grasslands SoilILRGLAQRCGRALTQMQQRGLAGLASALDQVQTILRSVKEHHLFTSGNADKRQVLSRILREVGELMVQTRPLLQRLGTSADAGIQRARSRLMAMHEVIKPLMGQIVHWIATGKVAANTIVHVGIPQARAIVRNKAGKKTEFGLASLISRLGGGYVLGERIVANADERQMPLKALAG*
Ga0075423_1101867023300009162Populus RhizosphereMAGLEGALAQVQTISRSVKAHHLCTRDKADKRQGLMRILREVGALMVPTRPLVEPLANRSDRVRQSARSRLRALHEVIKPLMGQMVQWMSTGKVAANQSVHGGMPQARAIVRNKAGKKTELGLASLMGRLGGGSLFGTRIAANADERQMPLQALAE*
Ga0105076_106485213300009816Groundwater SandPLVVLMLIKSFDSRQREAYLAENVVARVFLGRHHDAHAQIRDHSNIARAYAALGKEGIEEVNTLVVKEAHRFGFVDEGILSADTTAQELPIGYPNEPGILRVLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRDKADKRQVLVRILREVGELMVQTRPLIKRLESSADQVMQSARLRLMAMQEVIKPLMGQMAHWISTGKVAANKIVHVGIPQ
Ga0126384_1036910423300010046Tropical Forest SoilGKAGIDEVNHLVIKEAHRFGFVDEGSLSADTTAQELSMGYPNEPGILRGLAQRCGRALTQMQQRGIAGLDSALDQVQTILRSVKAHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLETSADSVIQRARARLMAMHEGIKPLMGQIVHWITTGKVAANKIVHVGIPQARAIVRHKAGKKTGFGLAYLISRLGGG*
Ga0126382_1228697013300010047Tropical Forest SoilRAYAALGKAGADEVNHLVAKEAHRFGFVDAGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLSGALEQVQTILRSVKEHHLFTAGQADKRQVLTRILTEVGALMVQTRPLIERLGTRSDRVIQSARARLTAMHEVIKPLMGQIVHWLSTGKVAPNKIVHVG
Ga0126372_1039629433300010360Tropical Forest SoilAQMRDHSNIARAYAALGKAGIDEVNHLVIKEAHRFGFVDEGSLSADTTAQELSIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLDSALDQVQTILRSVKAHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLETSADSVIQRARARLMAMHEGIKPLMGQIVHWITTGKVAANKIVHVGIPQARAIVRHKAGKKTGFGLAYLISRLGGG*
Ga0126377_1079264713300010362Tropical Forest SoilAQRYGFIDEGVLSVDTTAQELPIGYPNEPGILRGLAQRCGWALTQLAKRGMCGLDRGQEQGQTILRSVKEHPLFTAGKADQREVLTRILKEVGALIVHTRPLVERLETHADRVIQRVRARRLAMHEVLKPLRRQIVQGITTGKVAANQIVHVGIPQARAIVRNKAGKKTEFGLA*
Ga0126377_1225689513300010362Tropical Forest SoilKEAHRYGFIDEGVLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLAKRGIGGLARVQEQVQTIVCSVKEHHLFTAGKADKREVLTRILKEVGALIVQTRPLVERLETSADRVIQSARARLMAMYEVIKPLRGQIVQWISTGKVAANKIVHVGIPQARAMVRNKSGKKTEFGLAYLISRVGGGYLCGERIAANADERQMTLKAL
Ga0137392_1105434413300011269Vadose Zone SoilYVPLVVLMLIKSFDSRQMEAYLAENVVARVFLGRHHDAHAQIRDHSNIARAYAALGKEGIEEVNTLVVKEAHRFGFVDEGILSADTTAQELPIGYPNEPGILRGLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRDKADKRQVLVRILREVGELMVQTRPLIKRLESSADQVMQSARLRLMAMQEVIKPLMGQIAHWISTGKVAANKIVHV
Ga0137432_112292813300011439SoilQIDSRQMETYVAENVVARLFIGRHRNVQAQIRDHSNIARAYAALGKVGIEEVTRLVIKEAHHFGFVDEGVLSADTTAQELPIGYPNEPGILRGLAQRCGRALMQLHKRGVQGLDRALEQVQTIVRSVKEHHLFSKGREEKREVLTRILTEVGALMVHTRPLIEELATRADRVLQSARSRLAAMHGVIKPLMRHIVQWLTTGKVAANKIIHVGIPQARAIVRNKAGKKTEFGLGYLISRLGGGYVFGTLIAANADERQMPLKALAGYREIFGQEA
Ga0120191_1015413713300012022TerrestrialADTTAQELPIGYPNEPGILRGLAQRCGRALTQLAKRGICGLDRGQEQVQTILRSVKEHHLFAAGKAHKREVLTRILKEVGTLIVQTRPLVERLATSADRVIQRARARLLAMHEVIKPLLGQIVQWISTGKGAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLFGE
Ga0137381_1085730213300012207Vadose Zone SoilGIEEVTRLVIKEAHRFGFVDEGVLSADTTAQELPIGYPNEPGILRGLAQRCGRALMQLHKRGVQGLDRALEQVQTILRSGKEYHLCSKGREEKREILTRILTAVGALMVHTRPLIEGLETRADRVVQSARSRLAAMHGVIKPLMRQIVQWLTTGKVAANKIIHVGIPQARAIVRNKAGKKTEFGLGYLISRLGGGYVFGTLIAANADERQMPLKALAGYREIFGQEATPELVVYDRGGGATRTCQRRALEGVKQVGILPKG
Ga0137378_1091410913300012210Vadose Zone SoilIGRYRDVKAQIRDHSNIARAYAALGKAGIEEVTRLVIKEAHRFGFVDEGVLSADTTAPELPMGYPNEPGILRGLAQRCGRALMQLHKRGVQGLDRALEQVQTILRSGKEYHLCSKGREEKREILTRILTAVGALMVHTRPLIEGLETRADRVVQSARSRLAAMHGVIKPLMRHMVQWLTTGKVAANKILHVGIPQARAIVRNKAGKKTEFGLGYLSSRLGGGYVFGTLIAANADERQMPLKALAGSREIFGQEATPELVVYDR
Ga0150985_10053565713300012212Avena Fatua RhizosphereGLDELNHLVIKEAHRFGFVDERVLSADTTAQELPIGYPNEPGILRGLAQRCGRALIQMNKRGMQGLESALHQVQTILRSVKEHHLFTPGKAAKREVLTRILKEVGALIVHTRPLVKRLETSADRVMQSARSRLLAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKV
Ga0137358_1085564013300012582Vadose Zone SoilFDSRQMEASLAENVVARVFLGRHHDAHAQIRDHSNIARAYAALGKEGIKEVNTLVVKEAHRFGFVDEGILSADTTAQEFPMGYPNEPGILRGLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRDKADKRQVLVRILREVGELMVQTRPLIKRLESSADQVMQSARLRLMAMQEVIKPLMGQIAHWISTG
Ga0137397_1089883613300012685Vadose Zone SoilYVPLVVLMLIKHIDSRQMEAYVAENVVARVFIGRYRNVQAQIRDHSNIARAYAALGKAGIEEVTRLVIKEAHRFGFVDEGVLSADTTAQELPIGYPNEPGILRGLAQRCGRALMQLHKRGVQGLDRALEQVQTIVRSVKEHHLFSKGREEKRAILTRILTEVGALMVHTRPLIEGLETRADRVVQSARSRLAAMHGVIKPLMRQIVQWLTTGKVAANKI
Ga0157303_1021373613300012896SoilAQIRDHSNIARAYTALGKEGIDAVNRLVLKEAHRFGFVDEGSLSADTTAQELPMGYPNEPGILRGLAQRCGRALTQLHKRGVQGLDSALAQVQTILRSVKEHHLFATCTTDKRQVLTRILREVGELMVQTRPLVERLATPLDRVIRSARLRLMAMHEVIQPLMGQIVHWVTTGKVAANKIIHVGIPQ
Ga0157303_1021909813300012896SoilIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLSGALEQVQIILRSVKEHHLFTLGQADKRQVLTRILTEVGALMVQTRPLIERLGTRSDRVIQGARSRITAMHEVIKPLMRQIVHWLSTGQVAPNKIVHVGIPQARAIVRNKSGKKTEFGLAYLIGRLGGGYLFGRRIAANTDEKQMPLQALAEYRA
Ga0157288_1029935813300012901SoilIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLSGALEQVQIILRSVKEHHLFTLGQADKRQVLTRILTEVGALMVQTRPLIERLGTRSDRVIQGARSRITAMHEVIKPLMRQIVHWLSTGQVAPNKIVHVGIPQARAIVRNKSGKKTEFGLAYLIGRLGGGYLFGRRIAANTDEKQMPLQALA*
Ga0164298_1133504413300012955SoilVFIGRQHDPQAQMRDHSNIARAYAALGKAGIDEVNHLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAGIQRARSRLMAMHEVIKPLMGQIVHWIATG
Ga0164303_1146803213300012957SoilTTAQDLPRVYPNEPGILRRLAQRCGRALTQLAKRGMCGLDRVQEQVQTILRSVNEHHLFTAGKADKREVLTRILKEVGALIVLTRPLVERLATSADRVIQSACGRLLAMHEVSKALMGQIVQWISTGKVAANKIVHVGIPQARAMVRNKAGKKTEFGLAYLISRLGGGYL
Ga0164301_1103433013300012960SoilKEAHRYGFIDEGVLSADTTAQELPLGYPNEPGILRGLAQRCGRALTQLAKRGMCGLDRVQEQVQTILRSVNEHHLFTAGKADKREVLTRILKEVGALIVQTRPLVERLATSADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANKIVHVGITQARAMVRNKAGKKTEFGLAYLISRLGGGYLFGERLAANADERQMRLKALWGYRAIFGQ
Ga0182001_1031529013300014488SoilRFGFVEAGVLSSDTTAQELAIGYPNEPGILRGLAQRCGRALLRLKTRGVQGLDSALEQSETILRSVKEHHLFTKGKTETCQVLTRILTAVGELMVQTRALVERLATPSDRVIQSARSRLMAMHEVIKPLMGQIVHWLSTGKVAANKIVHVGIPQARAIVRHKAGKKTEFGLAYLISRLGGGYVFGERITANADERQMPLKALAGYGAIFGPKA
Ga0137411_128556013300015052Vadose Zone SoilPNEPAGILRGLAQRCGRALMQLHKRGVQGLDRALEQVQTIVRSVKEHHLFSKGREEKRAILTRILTEVGALMVHTRPLIEGLETRADRVVQSARSRLAAMHGVIKPLMRQIVQWLTTGKVAANKIIHVGIPQARAIVRNKAGKKTEFGLGYLISRLGGG*
Ga0137420_143559313300015054Vadose Zone SoilGILRGLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRDKADKRQVLVRILREVGALMVQTRPLIKRLESSADQVMQSARLRLMAMQEVIKPLMGQIAHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLINRLGGGYLFGTRIEANAMRTRSRCH
Ga0137403_1091654413300015264Vadose Zone SoilKAGIEAVNQLIVKEAHGLGFVDAGSLSADTTAQELPMGYPNEPGILRGLAQRCGRALARLKERGIPGLGDALDQVQTILRSVKEHHLFTRDKADKRQVLIRILREVGELMVQTRPLVKRLEASSDQVIQSARSRLLAMQEVIKPLMGQIVHWISTGKVAANKMVHVGILQARAIVRNKTGKKTEFGLAYLINRLGGGYVFGTRIEANADEKQMPLRALAESRAIFGQEATPE*
Ga0132256_10269720113300015372Arabidopsis RhizosphereLLTLPGYMKRRMRKLSRRSRGGPNENIDGLDRAQIRDHSNIARAYAALGKAGVDEVHHLVIKEAHRFGFVDDGSLSADTTAQELPMGYPNEPGILRGLAQRCGRALTQLQKRGVQGLSGALEQVQTILRSVKEHHLFTAGQANKRQVLTRILTEVGALMVQTRPLIERLGTRSDRVMQGARSRI
Ga0163161_1126586513300017792Switchgrass RhizosphereVVLMLIKVFDSRQMEDYLAENAVARVFIGRQHVAKAQIRDHSNIARAYAALGKAGVDEVNHLVIKEAHRFGFVDDGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLSGALEQVQTILRSVKEHHLFTSGQADKRQVLTRILTEVGALMVQTRPLIERLGTRSDRVIQGARSRITAMHEVIKPLMRQIVHWLSTGQVAPN
Ga0190266_1079399813300017965SoilMQQRGIAGDATALDEVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSSDAVMQRARARLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYVLGERIVANADERQMPLKALAGYRAIFGQEATPELVVYDRGGDSTLTRQQLALAGVK
Ga0190269_1160157213300018465SoilMRLKKRGAQGLDAALAQTETILRSVKEHHLFTKGREEKRQVLTRILTEVGELMVQTRPLVERLGTRWDRVIQSACSRLMAMHEVIKPLMGQIVHGLTTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLINRLGGGDVFGERIEANADERQMPLKALAGYRKLFGQAATPELVVYD
Ga0190268_1012136123300018466SoilVVARVFIGRHPNAKAQIRDHSNIARAYAALGKAGIDAVHCLVIQEAHGFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLDGALDQVQTILRSVKEHHIFTPGKADKRQVLTRIMREVGELMVQTRALVERLVTQSERVIQNARSRLVTMHEVIKPLMGQIVHWLTTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGSLFGERITANADERQMPLKALAEYRAIFGQEATPELVVYDRGG
Ga0190268_1012956323300018466SoilMLIKHFDSRQMEAYLAENVVARVFIGRQRDVQAQIRDHSNIARAYAALGQAGIDEVTHLVIKQAHAFGFVDEGVISSDTTAQELAMGYPNEPGILRGLAQRCGRALMRLKTRGIQGLDAALEQTETILRSVKAHHLFTKGKTEKREVLTRILTEVGGLISQTRVIATAVQGCSDRVIQSARARLVAMHDVIKPLMGQIVHWLTTGKWRPTKSCMWVSRRHEPSCATKLARRPNLAWPICSAAWAVGMCLAN
Ga0190268_1108323513300018466SoilHGKNVDGGPGLPWDVSLYVPLVVLMLIKTFDSRQMEAYLAENVVARVFMGRHRDVQAQIRDHSNIARAYAALGKAGIDEVTHLVLKEAHRFGFVDEGVMSSDTTAQELPMGYPNEPGILRGLAQRCGRALRRLKKGGVQGLEAALEQVETVLRSVKEHHLFTKGREAKHQVLIRILTEVGELMVQTRPLIERLGTSLDRVIQNARSRLMAMHEVIQ
Ga0207642_1097309713300025899Miscanthus RhizosphereKEAHRFGFVDEGSLSADTTAQELPMGYPNEPGMLRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEPHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAGIQRARSRLMAMHEVIKPLMGQIVPWMATGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLIS
Ga0207712_1164941813300025961Switchgrass RhizosphereRAQGQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSSDAVMQRARARLMAMHEVIKPLMGQIGHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLLGERIVANADERQMPLKALAGYRAIFGQEATPELVVYDRGGDSTPTRQQLALAGVKEEGMQ
Ga0208002_102583813300026029Natural And Restored WetlandsVSADTTAQELPMGYPNEPGILRGLAQRCGRALTQVKKRGVQGLDGVLEQVQTIVRSVKEHHLFTEGKADKREVLTRILREVGALIVHTRPLVEHLATRSERVMQSARSRLLAMHEVIKPLRGQIVQWLVTGTVAPNKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLFGERMAANADERPMPL
Ga0257163_105450713300026359SoilPIGYPNEPGILRGLAQRVGRTLTQLKTRGIQGLDYALDQVQTVLRSVKEHHLFTKGREEKREILTRILTEVGELVAQTRPLVEALETSSDQVMQSARSRLVAMHEVIKPLMRQIVHWLTTGKVAANKIIHVGIPQARAIVRNKTGKKTEFGLGYLISRLGGGYVFGTLIAANADERQMPLKALSGYREIFGQEATPELFVYDRGGDSTLTCQK
Ga0257173_106706413300026360SoilVQAQIRDHSNIARAYAALGKAGIEEVTRLVIKEAHRFGFVDEGVLSADTTAQELPMGYPNEPGILRGLAQRVGRTLTQLKTRGIQGLDYALDQVQTVLRSVKEHHLFTKGREEKREVLTRILTEVGELVAQTRPLVEALETSSDQVMQSARSRLVAMHEVIKPLMRQMVHWLTTG
Ga0257167_100987633300026376SoilMGYPNEPGILRGLAQRVGRTLTQLKTRGIQGLDYALDQVQTVLRSVKEHHLFTKGREEKREVLTRILTEVGELVAQTRPLVEALETSSDQVMQSARSRLVAMHEVIKPLMRQMVHWLTTGKVAANKIIHVGIPQARAIVRNKAGKKTEFGLGYLISRLGGGYVFGTLIAANADERQMPLKALSGYREIFGQEATPELFVYDRGGDSTLTCQTLALEGVK
Ga0257147_106357513300026475SoilGILRGLAQRCGRALMQLHKRGVQGLDRALEQVQTIVRSVKEHHLFSKGREEKRAILTRILTEVGALMVHTRPLIEGLETRADRVVQSARSRLAAMHGVIKPLMRQIVQWLTTGKVAANKIIHVGIPQARAIVRNKAGKKTEFGLGYLISRLGGGYVFGTLIAANADERQMPLKALAGYREIFGQEA
Ga0209842_109333913300027379Groundwater SandYPNEPGILRGLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRGKADKRQVLVRLLREVGALMVQTRPLIKRLESSADQVLQSARLRLMAMQEVIKPLMGQIAHGISTGKVAANKIGHVGIPQARASVRNKAGKKTEFGLAYLINRLGGGYLFGTRIEANAD
Ga0208983_109363913300027381Forest SoilVVLMLVKAYDSRQMEAYVAENVVARLFIGRYATVQAQIRDHSNIARAYAALGKAGIEAVNQLIVKEAHGLGFVDAGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALARLKERGIPGLGDALDQVQTIVRSVKEHHLFTRDKADKRQVLIRILREVGELMVQTRPLVKRLESSSDQVIQSARSR
Ga0209213_109665113300027383Forest SoilAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRDKADKRQVLVRILREVGALMVQTRPLIKRLESSADQVMQSARLRLMAMQEVIKPLMGQIAHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLINRLGGGYLFGTRIEANADEKQMPLRALAEYRAIFGQEA
Ga0209887_112635213300027561Groundwater SandHLVIKEAHRVGFVDERVLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMKQRGIAGLDNALAQVQTIVRSVKEHHLFTSDKAAKREVLTRILKEVGALIVQTRPLVERLETSADRVIQSARSRLLAMHEVIKPLMRQIVHWISTGKVAANKIVHVGIPQARAIV
Ga0256866_114039613300027650SoilLVVKEAHGFGFVDEGVVSADTTAQERPIGYPNEPGMLRGLAQRCGRVLTQLTKRGVQGLDDALGQVQTILGSVKEHHLFTEGKGHKRQVLTRILREVGELMVQTRPLVERLARRSERVIQSARSRRLAMHEVIKPRMGQIVHWLSTGKVAAHQIVHVGIPPARAIVRNTAGKQTELGLAYLISRLGGGYLFGERVAAHADERQMPLKTLAGYRCILG
Ga0256866_121614113300027650SoilTAQELPIGYPNEPGILRGLAQRCGRALRQLQQRGVQGLDGALEQVQTIVRSVKEHHLFTAGKAEKRQVLTRILREVGALMVQTRPLIENLSTSSQRVIQSARARLKAMHEVIKPLMGQIVHWVSTGQVAAHKIVHVGIPQARAIVRNKSGKKTEFGLAYLISRLGGGYRFS
Ga0209485_117831913300027691Agricultural SoilRDVQAQIRDHSNIARAYAALGKSGIDEVNRLIVKEAHRFGFVDEGVVSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLQKRGVDGLQGALEQVQTILRSVKEHHLFTKGKSDKREVLTRIVKEVGELIGQTRPLVEALGTSTDRVIQSARSRLMAMHAVIKPLIIQIVQWMTTGVVATNKIVHVGIPQARAIVRNKTGKKVEFGLAYLLSRLG
Ga0209485_119366013300027691Agricultural SoilFIGRHHNIKAQIRDHSNIARAHTALGKAGIEEITHLVIKQAHRFGFVDEGVISSDTTAQELPIGYPNEPGILRGLAQRCGRALIQLQKPGVQGLEGTVEQVEIVLRSVKAHHLFTQGKTEKREVLTRILREVGTLIGQTRVIETAVAGRSDRVIQSARSRLLVMHAVIKVLMGQIVHWVSTGKVAANKIVHVGIPQARAIVRHKAGKQT
Ga0209461_1018812213300027750AgavePGILRGLAQRCGRALKRLKERGIQGLDGSLEQVERILRSVKEHHLFTKDKTEKREVLTRILREVGTLIGQSRVIETAVRGNSDRVIQRARSRLVSMHEVIKVLMSQIVHWVSTGKVAAHKIVHVGIPQARAIVRNKAGKKTEFGLAYLINRLGGGYVFGERIEAHADERKMPL
Ga0209462_1019128313300027761AgaveRLVKEHHLFTSGKADKRQVLTRIMREVGELMVQTRPLVERLAPRSERVIQRARSRLMAMHEVIKPLMGQIVHWLTTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYVFGERIAANADERQMPLKALAGYRAIFGPKAAPELVVYDRGGDSTPTRPRLTVE
(restricted) Ga0233416_1026464613300027799SedimentHGQAQMRDHSNIARADAALGKGGIESVHHVILKEAQGLGCVDAGRLSADTTAPELPMGYPNEPGILRGLAQRCGRAFPRLKQQGVRGLEGAVDQVQTILRSVKEHHLFTRAKADKRQVLIRILREVGELMVQTRPLVERLESSVDQVMQSARSRLMAMQDVIKPLMGHMVHWISTGKVAAHKIIHVGIPQARAIVR
Ga0209486_1117694313300027886Agricultural SoilIGYANEPGILRGLAQRCGRALRRLKQRGVQGLEAALEQAETIVRSVKEHHLFAKSKADKRQVLTRILTEVGELMVQTRPLVERLATSWERVIQSARSRLMAMHEVIKPLMGQIVHWLTTGHVAANKIVHVGIPQARAIVRHKAGKKTEFGLAYLISRLGGGYVFGERMTANADE
Ga0209382_1174856713300027909Populus RhizosphereQMEAYVAENVVARVFIGRHGEAQAQIRDHSNIARAYAALGKAGVDEVNHLVIKEAQRYGLIDEGVLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLAKRGICGLDRGQEQVQTILRSVKEHHLFAAGKAHKREVLTRILKEVGTLIVQTRPLVERLATSADRVIQRARARLLAMHEVIKPLLGQIVQWISTGKVAANK
Ga0209853_116820513300027961Groundwater SandHSNIARAYAALGKAGIDEVNHLVIKEAHRVGFVDERVLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMKQRGIAGLDNALAQVQTIVRSVKEHHLFISDKAAKREVLTRILKEVGALIVQTRPLVERLETSADRVIQSARSRLLAMHEVIKPLMRQIVHWISTGKVAANK
Ga0307276_1012246913300028705SoilAVNQLIVKEAHGLGFVDAGSLSADTTAQELPMGSPNEPGILRGLAQRCGRALARLKERGIPGLGDAWDQVQTILRSGKEHHLFTRDKADKRQVLIRILREVGALMVQTRPLVKRLEASADQVMQSARSRLLAMQEVIKPLMGQSVHGISTGKVAANKMVHVGILQARAIVRNKTGKKTEFGLAYLINRLGGG
Ga0307295_1023240313300028708SoilLRGLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRDKADKRQVLVRILREVGELMVQTRPLIKRLESSADQVMQSARLRLMAMQEVIKPLMGQIAHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLINRLGGGYLFGTRIEANADEKQMPLRALAE
Ga0307284_1043422613300028799SoilPIGSPNEPGILRGLAQRCGRALERLKQRGVAGLEGVLDQGQTILHSVKEHHLFTRDKADKRQVLMRILREVGALMVQTRPLVEHLENHSDRVMQSARSRLMAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLIGRLGGGSLFGTRIAANADERQM
Ga0307302_1038761213300028814SoilLFIGRYRDVKAQIRDHSNIARAYAALGKAGIEEVTRLVIKEAHRFGFVDEGVLSADTTAQELPMGYPNEPGILRGLAQRVGRTLTQLKTRGIQGLDYALDQVQTVLRSVKEHHLFTKGREEKREVLTRILTEVGELVAQTRPLVEALETSSDQVMQSARSRLVAMHEVIKPLMRQIVHWLTTGKVAAH
Ga0247827_1089159913300028889SoilMGYPNEPGILRGLAQRCGRALTQLAKRGIGGLDRSKEQVQTILRSVKEHHLFAAGKADKREVLTRILKEVGALIVQTRPLVERLATRADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAIVRNKTGKKTEFGLAYLISRLGGGYLFGERIAANADERQMPLKALWGYRAIFGQEATPTLV
Ga0268253_1033827713300030514AgaveLRRLKQRGVQGLDGALEQVETVLRSVKEHHLFTQGREEKRQVLTRILKEVGELMVQTRPLVERLGTRLDRVLQSACSRLVAMHEVIKPLMGQIVHWLTTGKVAANKIVHVGIPQARAIVRNKAGKKTAFGLAYLISRLGGGDVFGERIEANADERQMPLKALAGYRAIFGPEATPE
Ga0299906_1125067713300030606SoilAHGLGFVDAGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTRLQQRGVQGLAGALDQVQTILRSVKEYHLFTRDKVDKRQVLTRILREVGALMGQTRPLVERRGPSADRGMQRARSRLMAMHEVIKPLMGQMVHWVSTGKVAANKIVHVGIPQARAIVRNKVGKKTEFGLAYLL
Ga0268386_1079986313300030619SoilLIGYPNEPGILRGLAQRVGRALTQLQKRGVRGLEGAVDQVQMILRSVKEHHLFTTGKADKREVLVRILREVGELMVQTRPLLDALVSSADRVTQRARGQLWAMHEVIKPLTGQMVQWITTGKVAKDKIVHVGIPQARAIVRDKAGKKVEFGLAYLISRLGGGYLFGTLIAANADERQMPLKALEGYRSIFGPQATPELVV
Ga0302046_1083861413300030620SoilMRDHATIARAYAARGKEGLNDVHTLVVQEAHGVGFVDEGVGSAAPTAQELPLGSPNEPGILRGLAQRCGRALRPLRQRGGQGLDGALEQVQTIVRSVQEHQRFPAGKGQQRQGLTRILREGGALMGQTRPLVKALSAWSQRGIQSARARLLALHAVSKPLLGPMVHWLSTGQVAAHKIVPVGIPQARAIVRNQAGKQTECGLASLSRRRGGGYLCGERLAAKADERQMP
Ga0308197_1022617313300031093SoilEGGPGLPWDVSLYVPLVVLMLIKSFDSRQMEAYLAENVVARVFLGRHHDAHAQIRDHSNIARAYAALGKEGIEEVNTLVVKEAHRFGFVDEGILSADTTAHEVPIGSPNAPGILRGLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKDHHLLTRDKADKRQVLVRILREVGELLGQTRPLSKRLESSADQVMQSARLRLMAMQEVIKPLMGQ
Ga0308197_1034764713300031093SoilALEQVQTILRSVKEHHLFSKGREEKREILTRILTEVGALMVHTRPLIEGLETRADRVVQSARSRLAAMHGVIKPLMRQIVQWLTTGKVAANKILHVGIPQARAIVRNNAGKKTEFGLGSLISRLGGGYVFGTLIAANADERQMPLKALAGYREIFGQEATPELVVYDRGGDSTRTCQRLALEGVKQV
Ga0299914_1143608513300031228SoilYPNEPGILRGLAQRCGRALMRLKKRGVQGLDAALEQTETILRSVKEHHLFTKGREEKRQVLTRILTEVGELMVQTRPLVERLGTSLDRVIHNARSRLMAMHEVIKPLMGQIVHWLTTGQVAANKIVHVGIPQARAIVRHKAGKKTEFGLAYLISRLGGGYVFGERIAANADERQMPLKAL
Ga0308194_1034645913300031421SoilHLILQEAHGVGFVDAGSRSADTTAQELPIGSPNEPGSLRGLAQRCGRALERLKQRGVAGLEGVLDQGQTILHSVKEHHLFTRDKADKRQVLIRILREVGALMVQTRPLVEHLENHSDRVMQSARSRLMAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAIVRNKAGKKT
Ga0308179_101487713300031424SoilYVPLVVLMLIKSFDSRQMEAYLAENVVARVFLGRHHDAHAQIRDHSNIARAYAALGKEGIEEVNTLVVKEAHRFGFVDEGILSADTTAHEVPIGSPNEPGILRGLAQRVGRALVQLKKRGVLGLEAALDQVQTVLRSVKEHHLFTRDKADKRQVLVRILREVGELMGQTRPLSKRLESSADPVMQSARLRLMAMQEVIKPLMGQIAHWISTGKVAANKIVHVGIPQARAIVRNKTGKKTEFGLAYLINRLGGGYLFGTRIEANADEK
Ga0310915_1112840613300031573SoilLAKRGMCGLDRGQEQVQTILRSVKEYHLFTAGKAAKREVLTRILKEVGALIVQTRPLVERLETHADRVIQSVRARLLAMHEVLKPLMRQIVQWITTGKVAANKIVHVGIPQARAIVRNKVGKKTEFGLAYLISRLGGGYLLGERIAANADEREMPLKALWGYRAIFGQNATPELVVYDRG
Ga0318561_1050376513300031679SoilRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARSRLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNQAGKKTEFGLAYLISRLGGGYLLGERIVANADERQMPLKALAGYRAIFGQEATPELVVYDRGATQPRPASSSRWQGSRMWACSPRGIGRGRLPRQS
Ga0318574_1054100013300031680SoilVSLYVPLVVLMLIKAFDSRQMEAYLAENVVARVFIGRQHDPQAQMRDHSNIARAYAALGKAGIDEVNHLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARSRLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGI
Ga0307469_1137281413300031720Hardwood Forest SoilLIKTFDSRQTEAYVAENVVARVFIGRHCDAQAQMRDHSNIARAYAALGKAGSDEVNHLVHKEAHRYGFIDEGVLSADTTAQELPLGYPNEPGILRGLAQRCGRALTQLAKRGMCGLDRVQEQVQTILRSVKEHHLFTAGKADKREVLTRILTEVGALIVQTRPLVERLATSADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANNIVHVGIPQARAIVR
Ga0307468_10128272513300031740Hardwood Forest SoilAFGKAGSDEVNHLVHKEAHRYGFIDEGVLSADTTAQELPLGYPNEPGILRGLAQRCGRALTQLAKRGLCGLDRVQEQVQTILRSVKEHHLFTAGKADKREVLTRILTEVGALIVQTRPLVERLATSADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANNIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLFGERLAANADERQMPLKALW
Ga0318526_1033307813300031769SoilAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARSRLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYLLGERIVANADERQMPLKALAGYRAIFGQEATPELVVYDRGGDSTPTRQQLALAGVK
Ga0318547_1092761513300031781SoilQAQMRDHSNIARAYAALGKAGIDEVNPLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVMQRARSRLMAMHEVIKPLMGQIVHWISTGKVAANK
Ga0318511_1050498013300031845SoilDEVNPLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARARLMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYL
Ga0310907_1033118013300031847SoilRHCDAQAQIRDHSNIARAYAALGKAGIDEVNHLVIKEAHRYGLIDEGVLSADTTAQELPMGYPNEPGILRGLAQRCGRALTQLAKRGIGGLDRSQEQVQTILRSVKEHHLFAAGKADKREVLTRILKEVGALIVQTRPLVERLATRADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAIVRNKTGKKTEFGLAYLISRLGGGYLFGERIAANADERQMPLKPEFDS
Ga0310904_1009660223300031854SoilLKHYLSLYVPLVVLMLIKVFDSRQMEAYVAENVVARVFIGRHCDAQAQIRDHSNIARAYAALGKAGIDEVNHLVIKEAHRYGLIDEGVLSADTTAQELPMGYPNEPGILRGLAQRCGRALTQLAKRGMCGLDRVQEQVQTILRSVNEHHLFTAGKADKREVLTRILKEVGALIVQTRPLVERLATSADRVIQSARARLLAMHEVIKPLMGQIVQWISTGKVAANKIVHVGIPQARAMVRNKAGKKTEFGLAYLISRLGGGYLF
Ga0306919_1130298013300031879SoilHRYGFIDEGVLSADTTAQERPIGYPNEPGILRGVAQRCGRALTQLAKRGMCGLDRGQEQVQTILRSVKEYHLFTAGKAAKREVLTRILKEVGALIVQTRPLVERLETHADRVIQSVRARLLAMHEVLKPLMRQIVQWITTGKVAANKIVHVGIPQARAIVRNKVGKKTEFGLAYLISRLGGGY
Ga0318520_1105525513300031897SoilSNIARAYAALGKAGIDEVNPLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVMQRARSRLMAMHEVIKPLMGQIVHWISTGKVAA
Ga0310913_1082066413300031945SoilGLPWDVSLYVPLVVLMLIKAFDSRQMEAYLAENVVARVFIGRQHDPQAQMRDHSNIARAYAALGKAGIDEVNPLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARSRLMAMHEVIKPLMGQIGHWI
Ga0310909_1126134513300031947SoilIGQGGIDEVNHLVIQEAHRYGFIDEGVLSADTTAQKRPIGYPNEPGILRGLAQRCGRALTQLAKRGMCGLDRGQEQVQTILRSVKEYHLFTAGKAAKREVLTRILKEVGALIVQTRPLVERLATSADRVIQSVRARLLAMHAVIKPLMRQIVQWISTGKVAAHKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGG
Ga0307414_1178184813300032004RhizosphereLARALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSSDAVMQRARARRMAMHEVIKPLMGQIVHWISTGKVAANKIVHVGIPQARAIVRNKVGKKTEFGLAYLISRLGGGYVLGERIVANADERQMPLKALAGYRAIFGQEATPELVVYDRGGDSTLPRQQLALAGVKDVG
Ga0318575_1048410113300032055SoilSLYVPLVVLMLIKAFDSRQMEAYLAENVVARVFIGRQHDPQAQMRDHSNIARAYAALGKAGIDEVNPLVIKEAHRFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQMQQRGIAGLASALDQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVMQRARSRLMAMHEVIKPLMGQIVH
Ga0318510_1024053813300032064SoilLVIKEAHGFGFVDAGVLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLVKRGIWGLDRAQGQVQTILRSVKEHHLFTSGKADKRQVLIRILREVGELMVQTRPLLQRLGTSADAVIQRARARLMAMHEVIKPLMGQIGHWIATGKVAANKIVHVGIPQARAIVRNKAGKKTEFGLAYLISRLGGGYGFGERIVANADERQMPLKALAGYRAIFGQDATPELVVYDRGGDSTATRQRLALAGVK
Ga0307470_1177505213300032174Hardwood Forest SoilAQELPIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLSGALEQVQTILRSVKEHHLFTSGQADKRQVLTRILTEVGALMVQTRPLIERLGTRSDRVIQGARSRITAMHEVIKPLMRQIVHWLSTGQVAPNKIVHVGIPQARAIVRNKSGKKTEFGLAYLIGRLGGGYLFGRRI
Ga0314784_103361_1_5883300034663SoilMLIKGFDSRQMEAYLAENVVARVFIGRHHNAKAQIRDHSNIARGYAALGKAGIDAVNCLVIQEAHGFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLDGALDQVQTILRSVKEHPIFTPGKADKRQVLTRIMREVGELMVQTRALVERLVTQSERMIQNARSRLVTRHEVIKPLMG
Ga0314792_238384_4_5253300034667SoilVVARVFIGRHHNAKAQIRDHSNIARAYAALGKAGIDAVNCLVIQEAHGFGFVDEGSLSADTTAQELPIGYPNEPGILRGLAQRCGRALTQLQKRGVQGLDGALDQVQTILRSVKEHPIFTPGKADKRQVLTRIMREVGELMVQTRALVERLATQSERVIQNARSRLVTRHEVIK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.