NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F091651

Metagenome / Metatranscriptome Family F091651

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F091651
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 175 residues
Representative Sequence MTDTLHPPRINDPRLPAYNARRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVGIGMVVVGGLLGRWYGTQPTWTQAAKITVGTWLVFGVGIYTILGAGIFGQHLLAGPVWHAASLLIVFGVYGLSLHAA
Number of Associated Samples 81
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score0.42

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(34.579 % of family members)
Environment Ontology (ENVO) Unclassified
(37.383 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(51.402 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 69.27%    β-sheet: 0.00%    Coil/Unstructured: 30.73%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.42
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 107 Family Scaffolds
PF00270DEAD 18.69
PF00271Helicase_C 6.54
PF00701DHDPS 3.74
PF12804NTP_transf_3 0.93
PF12831FAD_oxidored 0.93
PF01070FMN_dh 0.93
PF04851ResIII 0.93
PF00082Peptidase_S8 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 107 Family Scaffolds
COG03294-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyaseCell wall/membrane/envelope biogenesis [M] 7.48
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 0.93
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 0.93


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil34.58%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil16.82%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.41%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.67%
Exposed RockEnvironmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock3.74%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.87%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.87%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost1.87%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.87%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.87%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.87%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.93%
WatershedsEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds0.93%
SoilEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil0.93%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.93%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil0.93%
Prmafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.93%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.93%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.93%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.93%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.93%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.93%
Plant RootsHost-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots0.93%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.93%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.93%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere0.93%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil0.93%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005578Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005842Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2Host-AssociatedOpen in IMG/M
3300005994Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009029Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 1 DNA2013-189EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010154Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010860Boreal forest soil eukaryotic communities from Alaska, USA - C5-2 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012897Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S074-202C-1EnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300013765Permafrost microbial communities from Nunavut, Canada - A30_80cm_6MEnvironmentalOpen in IMG/M
3300014058Permafrost microbial communities from Nunavut, Canada - A3_65cm_0.25MEnvironmentalOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019279Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021358Rhizosphere microbial communities from Vellozia epidendroides in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R3Host-AssociatedOpen in IMG/M
3300021362Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R09EnvironmentalOpen in IMG/M
3300021374Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R08EnvironmentalOpen in IMG/M
3300021388Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R8Host-AssociatedOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300021861Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - ABR_2016 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021953Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R07EnvironmentalOpen in IMG/M
3300022531Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300030997Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-3B (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0063356_10403587313300004463Arabidopsis Thaliana RhizosphereMTDTIELPPPTAQATLDDHNSRRFGFAAGVGAAAIMLLAIALLRALSGVLSLPEIVAEGILARMPGALFSSVLDALQHAAKPLFYVSVGIGMLLVGGLLGRWYGERPGWKRAIRIVIGLWLVFGLVVY
Ga0066672_1087237113300005167SoilMTDTLQAPPASTPQLPPYNAQRIGFTAGVVAGLIMLLAIVVLRALSGVTSLPEVVAEGLLGIIPGALFSAVLDSLQHAAKPLFYLSVTIGMVVVGGLLGRWYSNRPTAQQAAKIVLGVWAAFGMVVYTLLGAGIFGQHLQAG
Ga0066677_1038606213300005171SoilMTDTLQAPPASTPQLPPYNAQRIGFTAGVVAALIMLLAIVVLRALSGVTSLPEVVAEGLLGIIPGALFSAVLDSLQHAAKPLFYLSVAIGMVVVGGLLGRWYSNGPTAQQAAKIVLGVWAAFGMVVYTLLGAGIFGQHLQAGPVWHGLSLLIVFGVYGLTLFETYALLAHRALPTLPDVTRRALLRNAVVALVATVGVGTA
Ga0066679_1037789313300005176SoilMTETLDKPVEADDTTPLPEYNARRIGFAAGVVAAAAMLVAIVLLRILSGVESLPEVVAEGVLVILPGALFSAVLDSLQHAAKPLFYVGVGIAMLIVGGLIGRWYATRPTWQRAVQVVLGVWLVFGIGVYTVLGAGIFGQYLQAGPIWHGGSLLVVFSVFGLSLWHAFQALVHRAQPALADTSRRVFLRNAAVALVATLGAGSVWRLMSAG
Ga0066679_1075195613300005176SoilMADTLQTPDLDAGRLPIYNTQRIGFAAGVLAAMVMLLAIIVLRLLSGVTSLPEVVAEGLLVVMPGALFSAVLDSLQHAAKPLFYLAVGIGMLIVGGLLGRWYANQPTWSQAVKIVLGLWLIFGLGVYLILGAGLFGQHLQAGVVAHGGSLLIVFGVFGLALYHAYAALAHRAAPAIPDI
Ga0066684_1023265723300005179SoilMTETLDKPVEVDETAPLPEYNARRIGFAAGVVAAAVMLVAIVVLRALSGVESLPEAVAEGILVNLPGALFSAVLDALQHAAKPLFYGGVGIAMLIVGGLLGQWYAARPTWQRAAQIVLGVWLIFGVGVYTVLGAGIFGQYLQAGAMWHGGSLLI
Ga0070680_10181516813300005336Corn RhizosphereSGVVSLPEIVGEGILILMPGALFSAVLDNLQHAAKPLFYLAVAIGMLIVGGFLGRWYASDPGWKQATRICLGAWLVFGLGVYTLLGAGLFGQQLQAGPIWHGVSLLIVFGVFAVALYESYALLSMRVVPAAPDLTRRTLLRSSVLAVVATLTAGATWRVLSGTSSGVALPLASGSG
Ga0066681_1080353113300005451SoilPGGGGMTETLDKPVEVDETAPLSECNARRIGFAAGVVAAAVMLVAIVVLRALSGVESLPEVVAEGILVNLPGALFSAVLDALQHAAKPLFYGGVGIAMLIVGGLLGQWYAARPTWQRAAQIVLGVWLIFGVGAYTVLGAGIFGQYLQAGAMWHGGSLLIVFGVFGLSLWHALKALAHRAQPALADTSRR
Ga0066687_1056986413300005454SoilMAETIQAPPIPEPGLPEYNARRIGFTAGVAAAAVMLIAIAILRALSGVLSLPEIVAEGLLVNMPGALFSAVLDALQHSAKPLFYVAVGVGMLIVGGLLGRWYANDPSGRQAIKIVVGVWLVFGLGVYTVLGAGIFGQHLQAGPIWHGLS
Ga0070706_10013111113300005467Corn, Switchgrass And Miscanthus RhizosphereMTETLEAPTVDGSRLADYNARRIGFAAGVVAAAVMCVAIVVLRLLSGVLSLPEIVAEGLLVNMPGALFSAVLDALQHAAKPLFYLAIVIGMLLIGGVLGRWFATEPTWQRAAKIVVGAWLVFGLGVYTVLGAGLFGQHLQAGPIWHGGSLLIVFGVYGLALWHVHAPLAHRAEPALPNVSRRDFLRNTAVALVATIGAGTAWRVLAVGDSGATTAPLL
Ga0070706_10090665623300005467Corn, Switchgrass And Miscanthus RhizosphereMANTLQAPPTDESRLARYNAQRIGFTSGVVAATVMLVAIVLLRVLSGVQSLPEVVAEGLLVMMPGALFSAVLDALQHSAKPLFYLAVAIGMLVVGGFLGRWYATSPGWQQAGRIVGGVWLVLGVGVYTVLGAGIFGQHLQAGPIWHGLSLLIVVGVFGVGLFEAYAAMAERAAPGSP
Ga0070707_10088301923300005468Corn, Switchgrass And Miscanthus RhizosphereMTETLEAPTVDGSRLADYNARRIGFAAGVVAAAVMCVAIVVLRLLSGVLSLPEIVAEGLLVNMPGALFSAVLDALQHAAKPLFYLAIVIGMLLIGGVLGRWFATEPTWQRAAKIVVGAWLVFGLGVYTVLGAGLFGQHLQAGPI
Ga0070707_10149071113300005468Corn, Switchgrass And Miscanthus RhizosphereMTETLEAHAVDEPRIPEYNARRIGFTAGVVAAGVMLVAIAILRTLSGVMSLPEVVAEGLLINMPGALFSTVLDALQHAAKPLFYVAVGIAMVIVGGFLGRWYAGDATWERAAKLVIGAWLVFGLVVYTLLGAGLFGQHLSAGAVWHGATLLIIFGVYGVALFHAYSALVHRAVPTVPDISRRVFLRNAVVATVATVGA
Ga0070698_10087921213300005471Corn, Switchgrass And Miscanthus RhizosphereMTDTLHAPPTSELRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQHAAKPLFYVSVGIGIIVVGGFLGRWYGTQPSWTQAAKIAIGTWVVFGVGIYTILGAGIFGQHLLAGAAWHAPSLLIVFGVYGVGLHEIYAVLAHRAVPTLPDVTRRALLRNAVVAVVATVGAGTAWRLITGGDFGSDSAPQAGGSPVA
Ga0066695_1043565913300005553SoilMTETIEAAAEAPSSLAPYNSQRFGFAAGVAAATVMVLVIVLLRILSGVVSLPEVVAEGLLARMPGALFSAVLDSLQHAAKPLFYLAVVIGMILVGGLLGRWYGDQPGWRQAGRIVLGVWLVFGLVIYTLLGAG
Ga0066699_1017277113300005561SoilMTDTLQAPDVGAPRLAAYNARRIGFAAGVAAAAIMLLAIVLLRLLSGVMSLPEVVAEGLLMLLPGVLFSAVLDSLQHAAKPLFYLAVGIGILIVGGLFGRWYADRPGWGQVLRLVLGVWLVFGIGVYT
Ga0066699_1023502733300005561SoilMADTLQTPDLEARRLPIYNTQRIGFAAGVLAAMVMLLAIVVLRLLSGVTSLPEVVAEGLLVVMPGALFSAVLDSLQHAAKPLFYLAVGIGMLIVGGLLGRWYANQPTWSQAVKIVLGLWLIFGLGVYLILGAGLF
Ga0066699_1027156713300005561SoilMAETIEAPPITEQRLAVYDSRRLGFTAGVVAAIGMLIAIVILRLISGVVSLPEIVAEGLLVAMPGALFSAVLDTLQHAAKPLFYLAVAIGVVVVGGLLGRWYGGNPTLQQAVKIVLSVWLVFGLGVYTVFGAGVFGQRLIAGPIWHGFTLLLVVAV
Ga0066705_1064108513300005569SoilMADTLQAPDLEARRLPIYNTQRIGFAAGVLAAMVMLLAIVVLRLLSGVTSLPEVVAEGLLVVMPGALFSAVLDSLQHAAKPLFYLAVGIGMLIVGGLLGRWYANQPTWSQAVKIVLGLWLIFGLGVYLILGAGLFGQHLQAGVVAHGGSLLIVFGVFGLALYHAYAALAHRAAPAIPDISRRVLLRNAAVGLVATVGAGSLW
Ga0066702_1074839313300005575SoilMADTLQTPDLEAGRLPIYNTQRIGFAAGVLAAMVMLLAIVVLRLLSGVTSLPEVVAEGLLVVMPGALFSAVLDSLQHAAKPLFYLAVGIGMLIVGGLLGRWYANQPTWSQAVKIVLGLWLIFGLGVYLILGAGLFGQHLQAGVVWHGGSLLIVFGVFGLALYHAYAALAHRAAPAIPDITRRVL
Ga0068854_10176248713300005578Corn RhizosphereAAGVIAAAVMLVAIVALRVLSGVVSLPEIVGEGVLILMPGAVFSTVLDNLQHAAKPLFYLAVAIGMLIVGGFLGRWYASDPGWKQATRIGIGAWLAFGLGVYTLLGAGLFGQQLQAGPIWHGLSLLIVFGVFAVALFESYAQLARRFVVARRPDLTRRTLLRNSVVALVATLGTGATWRVLSGGTAGGL
Ga0066903_10006688513300005764Tropical Forest SoilMADTLQAPDVGARRLPVYNAQRIGFAAGVLAAAVMLLAIVVLRLLSGVTSLPEVVAEGLLVVMPGALFSAVLDSLQHAAKPLFYLAVGIGMLIVGGLLGRWYANQPTWSQAVRIVLGLWLAFGLGVYLILGAGLFGQHLQAGVVWHAGSLLIVFGVFGIS
Ga0066903_10683427213300005764Tropical Forest SoilMTDTLAAPAGPEKRPVPPTSAPLADYNARRIGFTAGVIAAACMLVAIVLLRVLSGVISLPEIVAEGILTMLPGALFSAVLDSLQHAAKPLFYLAVGIGILIVGGLLGRWYSSSPGWKQAARIVVGVWIVFGVVVYTVLGAGIFGQNLLAGPIWHAVTLLIVCGVFGIALYEAYAFLERRTMERAGET
Ga0068858_10237489213300005842Switchgrass RhizosphereMTDTLDAPAVVGHVAALPRYQARRIGFAAGVIAAAVMLVAIVVLRILSGVLSLPEVVAEGLLMMMPGALFSAVLDSLQHAAKPLFYLAVGIGALVVGGFLGRMYASAPTWRQIVKIVLGVWLVVGLGVYTVLGAGIFGQQLGAGPIWHGLSLLVVVG
Ga0066789_1024789813300005994SoilMADTLKAPALGPAPALPAYNSRRIGFASGVVAAVLMLVAIVVLRVLSGVLSLPEVVAEGLLMNMPGAVFSAVLDSLQHAAKPLFYVAVGIGMLVVGGFLGRLYSSAPTWTQIAKIVIGAWLVIGVGVYTVLGAGIFGQHLQAGLVWHGVSLLVVVGVFGLAL
Ga0070717_1100139313300006028Corn, Switchgrass And Miscanthus RhizosphereMTGTLETPSSNVDEPDLAAYNAQRVGFTAGVIAAAAMLVAIVVLRLLSGVISLPEIVAEGILVNLPGAVFSAVLDSLQHSAKPLFYLAIAISIVVVGGLLGRFFAARPTWQRAVQIVLGAWIVFGVVLYTVLGGGIFGQHLQAGPIWHGGSLLVVFGVYGLTLWYAYAALAHRADPATASPTRREFLRTVAVA
Ga0075028_10044297713300006050WatershedsMTDTLQAPPSVAPRLPAYNARRIGFTAGVAAAMVMLGAIVVLRVLSGVQSLPEIVAEGLLGIMPGALFSAVLDSLQHAAKPLFYVSVGIGMLVVGGFLGRWYSSQPTARQAIKIALGAWLVFGVGIYTILGAGIFGQYLSAGPVWHGLSLLVVFGVFGLALYEAYGVLERRVMP
Ga0066660_1160654313300006800SoilMTETLDKPVEADDTTPLPEYNARRIGFAAGVVAAAAMLVAIVLLRILSGVESLPEVVAEGVLVILPGALFSAVLDSLQHAAKPLFYVGVGIAMLIVGGLIGRWYATRPTWQRAVQVVLGVWLVFGIGVYTVLGAGIFGQYLQAGPIWHGGSLLV
Ga0075436_10153775113300006914Populus RhizosphereLDAPRVAAYNARRIGFAAGVIAAAIMLVAIVLLRLISGVMSLPEVVAEGVLMLLPGVLFSAVLDSLQHAAKPLFYLAVGIGILIVGGLLGRWYADRPGWGQVLKLVLGVWLVFGLGVYTILGAGLFGQHLQAGALWHALSLLIVFGVFGIALYHAYAGLVHRAYPSEP
Ga0099794_1059575113300007265Vadose Zone SoilMADTLRTAHTSEVHLPAYNAQRIGFAAGVVAAVVMLLAIVIVRVLSGVTSLPEIVGEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVAIGMIVVGGFLGRWYGSQPTWTQAAKIALGAWVIFGLGIYTILGAGIFGQHLLAGPVWHGSSLLIVFGVYGLTLHETYGLLARRALPALATPD
Ga0066710_10010475843300009012Grasslands SoilMTDTLQAPDVGAPRLAAYNARRIGFAAGVAAAAIMLLAIVLVRLLSGVMSLPEVVAEGLLMLLPGVLFSAVLDSLQHAAKPLFYLAVGIGILIVGGLFGRWYADRPGWGQVLRLVLGVWLVFGIGVYTILGAGLFGQHLQAGVLWHAVSLLIVIGVFGIALYHAYAALVHRALPSEPDHTRRLLLRNAAVGIVA
Ga0066710_10334605213300009012Grasslands SoilNESRLPDYNAHRLGFAAGTVAAAAMLLAIAILRLVSGVLSLPEIVAEGLLVNMPGALFSAVLDALQHAAKPLFYLAVVIGMLIVGGILGRFYSSRPGWQQAAKLVGGVWLVFGLGVYTILGAGLFGQQLQAGPIWHGVTLLLVFGVFGLALFHVYAALAHRAEPAAPVTSRRIFLRNAAVAMLATLGAGTVWRVLMSGESGGS
Ga0066793_1060688013300009029Prmafrost SoilMADTLKAPALGPAPALPAYNSRRIGFASGVVAAVLMLVAIVVLRVLSGVLSLPEVVAEGLLMNMPGAVFSAVLDSLQHAAKPLFYVAVGIGMLVVGGFLGRRYSSAPTWTQIAKIVSGAWLVIGVGIYTVLGAGIFGQHLQAGLVWHGV
Ga0099829_1057122423300009038Vadose Zone SoilMTDTLHTALTREPRLPTYNARRIGFAAGVVAAVVMLLAIVVLRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQQAAKPLFYLSVAIGMVLVGGLLGRWYGTQPTWTQAAKIAIGTWVVFGLGI
Ga0099829_1129317213300009038Vadose Zone SoilTQEPTLVDEPRLSQYNVHRIGFAAGVIAAAAMLVAIAVLRLLSGATSLPEVVAQGLLTNMPGALFSAVLDALQHSAKPLFYVAVGVGMLLVGGFLGQWYAARPTWQQAVKIILGVWLVFGLGVYTLLGAGIFGQYLEAGPVWHGVSLLIVLGVYGVALWDAYAMLAHRAMPALPEMSRRAFLRDAAVAMVATVGVGASWR
Ga0099829_1167625013300009038Vadose Zone SoilAALVMLLAIVVLRLLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYLSVGIGMVVVGGFLGRWYGTQPTWTRAAKIAVGTWLVFGVGIYTILGAGIFGQHLLAGAVWHAASLLIVFAVYGLALFETYAMLAHRAVPTLPDITRRTLLRNTVVAVVATIGAGTAWR
Ga0099830_1014701523300009088Vadose Zone SoilMTDTLHTPLTREPRLPTYNARRIGFAAGVVAAVVMLLAIVVLRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQQAAKPLFYLSVAIGMVLVGGLLGRWYGTQPTWTQAAKIAIGTWVVFGLGIYTILGAGIFGQHLLAGPVWHASSLLIVFGVYGLALYETYALLARRAMP
Ga0099830_1093976913300009088Vadose Zone SoilMANTLQAPPTDEPRLARYNAQRIGFTSGVVAAAVMLVAIVLLRVLSGVLSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVGIGIIVVGGFLGRWYGTQPTWAQAAKIAIGSWVVFGVGIYTILGAGIFGQHLLAGPMWHAASLLIVFGVYGLGL
Ga0099830_1113935213300009088Vadose Zone SoilMSETQEPTLVDEPRLSQYNVRRIGFAAGVIAAAAMLVAIAVLRLLSGATSLPEVVAQGLLTNMPGALFSAVLDALQHSAKPLFYVAVGVGMLLVGGFLGQWYAARPTWQQAVKIILGVWLVFGLGVYTLLGAGIFGQYLEAGPVWHGLSLLIVLGVYGVAL
Ga0099830_1151388513300009088Vadose Zone SoilMTDTLHAPPHNIEPRLAHYNSRRIGFTSGVVAAAAMLLAIVVLRLISGVTSLPEVVAEGLLVMMPGALFSAVLDSLQHAAKPLFYVAVGIGMLIVGGFLGRWYGSQPTWRQAAKIVFGTWAVFGVGVYTIL
Ga0099830_1163913713300009088Vadose Zone SoilNSVDNGVLQRETEMTDTLHTPRTSELRLPAYNAQRIGFAAGVVAAVVMLVAIAILRVLSGVTSLPEIVAEGLLSVMPGALFSAVLDSLQHAAKPLFYVSVGIGMIVVGGFLGRWYGSSQPSWRQAAKIVIGTWLVFGVGIYTVLGAGIFGQHLLAGAVWHAASLLIVFGVYGVSLYSA
Ga0099828_1024032723300009089Vadose Zone SoilMTDTLHTPLTREPRLPTYNARRIGFAAGVVAAVVMLLAIVVLRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQQAAKPLFYLSVAIGMVLVGGLLGRWYGTQATWTQAVKIAIGTWVVFGLGIYTILGAGIFGQHLLAGPVWHASSLLIVFGVYGLALYETYALLARRAMPTLVTPD
Ga0099828_1155185913300009089Vadose Zone SoilMTDTLQAPPTSEPRLPAYNAQRIGFTAGVIAALAMLLAIVVLRVLSGVLSLPEVIAEGLLGIMPGALFSAVLDSLQHAAKPLFYLSVGIGMLVVGGFLGRSYGSQPSAQQAAKIALGAWLVFGLVVYTILGAGIFGQHL
Ga0099827_1063790223300009090Vadose Zone SoilMANTLQAPPTDELRLARYNALRIGFTSGVVAAAVMLVAIVLLRVLSGVLSLPEVVAEGLLVLMPGALFSAVLDALQHAAKPLFYLAVVIGMLIVGGFIGRWYANQPTWQQAARIVVGAWLILGLGAYTVLGAGIFGQHLQAGPIW
Ga0099827_1129371013300009090Vadose Zone SoilMTDTLHTPRTSELRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVGIGIIVVGGFLGRWYGTQPTWAQAAKIAIGSWVVFGVGIYTILGAGISGQHLLAGPIWHAASLLIVFGVYGLGLYETYGLLARRAMPRLITPDAT
Ga0066709_10341998113300009137Grasslands SoilMLVAIVVLRVLSGVTSLPEVVAEGLLINMPGALFSAVLDSLQHAAKPLFYLAVGIGILIVGGLLGRLYATEPTWRQIAKIVVGVWLVIGLGVYTLLGAGIFGQHLQAGPIWHGGSLLVVVGVFGVALFETYTLLARRALQAVGSPPDESRRTLLRNAVAAVVATLATGAAWRLMSGTDLSSPGGQAIAPGA
Ga0127503_1004381813300010154SoilMADTLKAPAVADTPLLPRYNSRRIGFAAGVVAAALMLAAIIVLRVLSGVPSLPEIIADGVLAILPGAIFSAVLDSMQHAAKPLFYLAVGIGALIVGGFLGRVYASQPTWSQIAKIVVGVWLVIGVVVYTVLGAGIFGQQLQAGPIWHGA*
Ga0126377_1002764863300010362Tropical Forest SoilMTDILEAPTTDEPRLPEYNARRLGFTAGVIAAAVMLVAIVILRLLSGVESLPEVVAEGILINLPGALFSAVLDSLQHSAKPLFFLAIAIGILIVGGLLGRWYSVAPSVQRAAKIVLGVWAVFGLIVYTLLGAGIFGQHLQAGPVWHALSLLVVFGVYGVALWHAYAFLAHRAVPELPNTSRRVFLRNAAVVMVATIGAGSLWRLAM
Ga0126351_110742513300010860Boreal Forest SoilMADTQKAPALAQPPSLPPYNASRIGFAAGVVAAALMLAAIVVLRVISGVISLPEVIAEGLLMNMPGALFSAVLDSLQHAAKPLFYLAVGIGALVVGGFLGRLYASEPTWTQIAKIVVAVWLVTGLGVYTLLGAGIFGQHLQAGPIWHGASLLVVVGVFGVALFETYAMLARRALQAAGAPPDE
Ga0137392_1010328413300011269Vadose Zone SoilMTDTLHTALTREPRLPTYNARRIGFAAGVVAAVVMLLAIVVLRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQQAAKPLFYLSVAIGMVLVGGLLGRWYGTQPTWTQAAKIAIGTWVVFGLGIYTILGAGIFGQHLLAGPVWHASSLLIVFGVYGLSLYETYALLARRAMPTLVTPDVTRRTLLRNAAIAL
Ga0137391_1047115423300011270Vadose Zone SoilMTDTLHTARTREPRLPTYNARRIGFAAGVVAAVVMLLAIVVLRALSGVTSLPEVIAEGLLGVMPGALFSAVLDSLQHAAKPLFYLSVGIGMVLVGGLLGRWYGSQPTWSQAARIALGAWVVFGLGIYTILGAGVFGQHLQAGPVWHGLSLLIVFGVFGLALYETYARLAQR
Ga0137391_1141201313300011270Vadose Zone SoilNASRIGFAAGVVAAALMLVAIVVLRVLSGVTSLPEVVAEGVLINMPGALFSAVPDSLKHAAKPLFYLAIVIGMVLVGGGLGRWFASEPTWQRAAKIVVGAWLVFGLGVYTVLGAGLFGQRLQAGPIWHGLSLLIVFGVYGLALWHAYAALAHRAQPAIADVSRRVFLQNAAVALVATVGA
Ga0137393_1047810823300011271Vadose Zone SoilMSETQEPTLVDEPRLSQYNVHRIGFAAGVIAAAAMLVAIAVLRLLSGATSLPEVVAQGLLTNMPGALFSAVLDALQHTAKPLFYVAVGVGMLLVGGSLGQWYASRPTWQQAVKIILGVWLVFGLGVYTLLGAGIFGQYLEAGPVWHGLSLLIVLGVYGVALGDAYPMLA
Ga0137388_1037257523300012189Vadose Zone SoilMTDTLHTPLTREPRLPTYNARRIGFAAGVVAAVVMLLAIVVLRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQQAAKPLFYLSVAIGMVLVGGLLGRWYGTQPTWTQAAKIAIGTWVVFGLGIYTILGAGIFGQHLLAGPVWHASSLLIVFGVYGLSLYETYALLARRAMPTLVTPDVTRRTLLRNAAIALVATVGTGAAWRLISGVDVGSIGSPVATGG
Ga0137388_1088770913300012189Vadose Zone SoilMTDTLHTPRTNELRLPAYNAQRIGFAAGVVAAVVMLVAIAILRVLSGVTSLPEIVAEGLLSVMPGALFSAVLDSLQHAAKPLFYVSVGIGMIVVGGFLGRWYGSSQPSWRQAAKIVIGTWLVFGAGNYTVLGAGIF
Ga0137388_1105575423300012189Vadose Zone SoilMSETQEPTLVDEPRLSQYNVHRIGFAAGVIAAAAMLVAIAVLRLLSGATSLPEVVAQGLLTNMPGALFSAVLDALQHSAKPLFYVAVGVGMLLVGGFLGQWYAARPTWQQAVKIILGVWLVFGLGVYTLLGAGIFGQHLLAGPVWHGASLLIVFGVYGLTLYETYAMLAHRAAPTL
Ga0137388_1168295913300012189Vadose Zone SoilMANTLQAPPTDEPRLARYNAQRIGFTSGVVAAAIMLVAIVLLRVLSGVLSLPEIVAEGLLVLMPGALFSAVLDVLQHAAKPLFYLAVAIGMLIVGGLIGRWYANEPTWKQAARIVVGAWLILGLGAYTVLGAGIFGQRLQAGPIWHGLSLLIVVGVFGVALFETYAWLA
Ga0137399_1030602613300012203Vadose Zone SoilMTDTLHTPRTNELRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVGIGMIVVGGFLGRWYGSSQPSWRQAAKIVIGTWLVFGVGIYTVLGAGIFGQHLLAGAVWHAASLLIVLGVYGVSLYSAY
Ga0137374_1023369213300012204Vadose Zone SoilMHHLTEYNSRRLGFAAGIGAAALMLLALALLRALSGVSSLPEIVAEGILARMPGALFSTVLDALQHAAKPLFYVAVGIGMLLVGGLLGRWYGEQPGWRNAARLVIGTWLVFGLVVYTLLGAGIFGQALQAGPVWHALSLLMVFTVFGVGLVGLHDLLTQRAAPSAS
Ga0137381_1086058813300012207Vadose Zone SoilMTDTLHPPRINDPRLPAYNARRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVGIGMVVVGGLLGRWYGTQPTWTQAAKITVGTWLVFGVGIYTILGAGIFGQHLLAGPVWHAASLLIVFGVYGLSLHAA
Ga0137379_1050238913300012209Vadose Zone SoilMTDTLHPPRTSESRLPAYNAQRIGFAAGVVAAVVMLLAIVLLRALSGVTSLPEIVGEGLLSVMPGALFSAVLDSLQHAAKPLFYFSVAIGMVVVGGFLGRWFGAQPTWTQAAKIAIGTWVVFGVGIYTI
Ga0150985_10393598363300012212Avena Fatua RhizosphereMTDTLEPPVAQRAELASYNAQRIGFSAGVIAAAVMLVVIVVLRALSGVISLPEIVAEGLLTLMPGALFSVVLDSLQHAAKPLFYLAVAIGMLVVGGFLGRWYATAPGWRQAVKLVLGVWIIFGLGVYTLLGAGIFGAQLVAGPIWHA
Ga0137386_1032987323300012351Vadose Zone SoilMTDTLHPPRINDPRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVGIGMVVVGGLLGRWYGTQPTCTQAAKITVGTWLVFGVGIYTILGAGIFGQHLLAGPVWHAASLLIVLGVYGLSLHAAYGMLAHRAVPTLPDITRRTLLRNAAIALVAAVGAGTAWRLLTGVGGGSVGLPVAEGSAATAAE
Ga0137367_1010705813300012353Vadose Zone SoilMHHLTEYNSRRLGFAAGIGAAALMLLALALLRALSGVSSLPEIVAEGILARMPGALFSTVLDALQHAAKPLFYVAVGIGMLLVGGLLGRWYGEQPGWRNAARLVIGTWLVFGLVVYTLLGAGIFGRALQAGPVWHALS
Ga0137369_1036918923300012355Vadose Zone SoilMTETIEAAAEAPSSLAPYNSQRFGFAAGVAAATVMVLVIVLLRVLSGVVSLPEVVAEGLLARMPGALFSAVLDSLQHAAKPLFYLAIVIGMILVGGLLGRWYGDQPGWRQAGRIVLGVWLVFGLLIYT
Ga0137384_1030838713300012357Vadose Zone SoilLADTLKAPAQVDRAPSLARYNASRIGFAAGVVAAALMLVAIVVLRVLSGVTSLPEVVAEGLLINMPGALFSAVLDSLQHAAKPLFYLAVGIGILIVGGLLGRLYATEPTWRQIAKIVVGVWLVIGLGVYTLLGAGIFGQHLQAGPIWHGGSLLVVVGVFGVALFETYTLLAR
Ga0137375_1130314413300012360Vadose Zone SoilMADTLEAPIAGRPRLASYNAERIGFSAGVIASLVMLAAIAILRALSGVTSLPEVVAEGLLVLMPGALFSAILDSLQHAAKPLFYLGVGIGMLVVGGFVGRFYATVPGWKQAAKIVVGLWLVFGLGAYTILGAGL
Ga0137361_1090166113300012362Vadose Zone SoilMTDTLHPPRASEPLLPAYNAQRIGFAAGVVAAVLMLLAIVVLRLLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYLSVGIGMVVVGGFLGRWYGTQPTWTRAAKIAVGTWLVFGVGIYTILGAGIFGQHLLAGAVWHAASLLIVFAVYG
Ga0137390_1035853723300012363Vadose Zone SoilMTDTLHTSRTSAPRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQHAAKPLFYLSVGIGMVLVGGLLGRWYSTQPTWIQAAKIAVGAWLVFGLGIYTILGAGIFGQHLLAGPIWHALSLLIVFGVFALTLYETYGLLAQRALPALVAPDATRRTLLRNAVVALVATIGAGTAWRLILGVDTGSVGLPVAAGGAATVAEPNAAPYDVKGIASEVTPTAGFYTVSKNFIDPAVAVGGWRLKIDGLVEQPLELSYE
Ga0137390_1112822313300012363Vadose Zone SoilMTDTLHAPPTGESRLPAYNAQRLGFTAGVVAAVVMLLAIVVLRALSGVTSLPEVIAEGLLGVMPGALFSAVLDSLQHAAKPLFYLSVGIGMVLVGGLLGRWYGSQPTWSQAARIALGAWVVFGLGIYTILGAGVFGQHLQAGPVWHGLSLLIVFGVFGLALYETYARLAQRAMPTLPDVTRRTLLRNAVVALVATVGAGTAWRLISGVDSGSIGLPVAAGGAATAAEPNAPPYDVKG
Ga0137390_1140642113300012363Vadose Zone SoilFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLGVMPGALFSAVLDSLQHAAKPLFYVSVGIGIIVVGGFLGRWYGTQPTWAQAAKIAIGSWVVFGVGIYTILGAGIFGQHLLAGPMWHAASLLIVFGV*
Ga0150984_11996828513300012469Avena Fatua RhizosphereMTETIDSPPTVHPSLEEHNSRRFGFAAGVGAAAIMLVAIAILRALSNVLSLPEVIAEGLLARMPGALFSSVLDALQHTAKPLFYVSVGIGMLLVGGLLGRWYGEAPGWKRAWRIVIGCWVVFGLVVYTLLGAGIFGQAL
Ga0157285_1036667613300012897SoilLAAAVMLGAIVVLRVLSGVVSLPEIVGEGILILMPGALFSAVLDNLQHAAKPLFYLAVAIGMLIVGGFLGRWYASNPGWKQATRIGVGAWLVFGLGVYALLGAGLFGQHLQAGPIWHGLSLLVVCGVFATALFESYALLSRRVSIAPSPDLSRRTLLRNAVVALAATIT
Ga0137395_1080826623300012917Vadose Zone SoilMTDTLHTPRTSELRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEIVGEGLLSVMPGALFSAVLDSLQHAAKPLFYFSVAIGMVVVGGFLGRWYGSQPSARQAAKISIGAWLVLGVGVYTVLGAGIFGQHLIAGPFWH
Ga0164306_1149348913300012988SoilAGVIGAAVMLLAIVVLRLLSGVTSLPEIVAEGVLINLPGAVFSAVLDALQHSAKPLFYLAIAIGILIVGGLLGRWFAAEPTWQRAAKIVVGVWLIFGLGVYLVLGAGIFGQHLQAGPIWHGLSLLLVFGVYGLALWHAYGFLAHRALPVLPDTRRRDFLRNTAVLVVATVGAGSIWRLAMGGDISTETAPVP
Ga0120172_107390113300013765PermafrostMADTLKAPASAQVPTLPRYNSRRIGFTAGVIAAALMLAAIVVLRVLSGVLSLPEVVAEGLLMNMPGALFSAVLDSLQHAAKPLFYVSVGIGVLVVGGFLGRFYASQPTWAQIAKIVVGVWLVTGLGVYTVLGAGFFGQHLQAGPIWHGVSLLVVFGVF
Ga0120149_107117113300014058PermafrostMADTLKAPASAQAPTLPSYNSRRIGFTAGVIAAALMLAAIVVLRVLSGVLSLPEVVAEGLLMSMPGALFSAVLDSLQHAAKPLFYVSVGIGVLVVGGFLGRFYASQPTWAQIAKIVVGVWLVTGLGVYTVLGAGFFGQHLE
Ga0132255_10139112023300015374Arabidopsis RhizosphereMTDTITPTTTTAQEPLEDYNSRRFGFAAGVGAAAIMLLAIALLRALSGVLSLPEVVAEGILARMPGALFSTVLDAMQHAAKPLFYVSVGVGMLLVGGLLGRWYGEEPGWSRALRIVIGCWLMFGLVLYTLLGAGIFGSALQAGPI
Ga0066662_1124535813300018468Grasslands SoilMTDTLQAPDVGAPRLAAYNARRIGFAAGVVAAAIMLLAIVVLRLLSGVMSLPEVVAEGLLMLLPGVLFSAVLDSLQHAAKPLFYLAVGIGILIVGGLFGRWYADRPGWGQVLRLVLGVWLVFGIGVYTILGAGLFGQHLQAGVLWHAVSLLIVIGVFGIALYHAYAALVHRALPSEPDHTRRLLLRNAAVGIVATVGAGSLWRLLSGEGASTFAA
Ga0066662_1192287013300018468Grasslands SoilMTDTVHTPPTSAPQLATYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQHAAKPLFYVSVALGMIVVGGFLGRWYGSQPTWTQAAKIALGAWVLFGLGIYTILGAGIFGQHLLAGPVWHGLSL
Ga0184642_111860613300019279Groundwater SedimentLADTLKAPAVDQAPSLAPYNASRIGFAAGVVAAALMLVATVLLRVLSGVISLPEVIGEGLLMNMPGALFSAVLDSLQHAAKPLFYLAVGIGALVVGGFLGRLYASEPTPRQIAKIVIAVWLVTGLGVYTLLGAGIFGQHLQAGPLWHAGSLLVVLGVFGMALFETYTLLARRALQVAGTPRDESRRTLL
Ga0213873_1003032923300021358RhizosphereMTDTLEAATDQSPRLPDYNARRLGFTAGVIAAAVMLLAIAVLRLLSGVQSLPEVVAEGVLVNLPGALFSAVLDSLQHAAKQLFYLAIAIGMLIVGGLLGRWFAAAPTWQRACKMVIGLWVVFGVVIYTLLGAGIFGSALSAGPVWHGLSLLLIFGVYGLALWHTYAWLAHRAMPQLTDTSRRIFLRNAAVAMVATVGLGSLWRLTIGGTVAGGT
Ga0213882_1007014813300021362Exposed RockMTDTLEAATDQSPRLPDYNARRLGFTAGVIAAAVMLLAIAVLRLLSGVQSLPEVVAEGVLVNLPGALFSAVLDSLQHAAKQLFYLAIAIGMLIVGGLLGRWFAAAPTWQRACKMVIGLWVVFGVVIYTLLGAGIFGSALSAGPVWHGLSLLLIFGVYGLALWHTYAWLAHRAMPQLPDTSRRIFLRNAAVAMVATVGLGSLWRLTIGGTVAGGTQVPTRVAAGGSTPALPPNPPPFDLKGISPEITDVNDFYTVSKNFI
Ga0213882_1033701023300021362Exposed RockMAHTLETGSPTTPRLPDYNSRRFGFSAGVIAATVMLVAIVVLRVISGVQSLPEIVAEGVLTVIPGALFSAVLDSLQHAAKPLFYVAVAIGMLLVGGFLGRWYGGDPTWKHAARIVLSAWLVFGVGVYTLLGAGIFGQHLTGGP
Ga0213881_1056548113300021374Exposed RockMTGTLEAPLVDEPQLPQYNAQRIGFAAGVIAAAVMLVGIVVLRLLSNVESLPEIVAEGILVNLPGALFSAVLDALQHSAKPLFYVGVGIAILVVGGILGRVFASRPTWQRATQIVLGAWLVFGLGVYTFLGGGIFGQHLQAGPVW
Ga0213875_1002262933300021388Plant RootsMTDILEAAPVEAPRLPDYNARRIGFTAGVIAAAVMLLAIAILRLLSGVESLPEVVAEGILVNLPGAVFSAVLDSLQHAAKELFYLAVAIAMLIIGGGLGRWYAGAPSWQRAAKMVVGLWAVFGVVVYTVLGAGIFGQQLEAGSVWHGLTLLLVFGVYGLALWHTYALLAHRAMPALPDTSRRVFLRNAAVAMVAIVGAGSIWRLMGRGGGAATATSTAPVAS
Ga0126371_1077130813300021560Tropical Forest SoilMTATIEAPPTLAPFNSQRFGFAAGVVAATVMVLVIAVLRGLSSVSSLPEVIAEGLLARMPGALFSAVLDALQHAAKPLFYLAIIIGMLVVGGLLGRWYGGQPGWLRAARIVLGAWLLFGLVVYTLLGAGIFGQS
Ga0213853_1006449513300021861WatershedsMADTLKAPAPVQAPTLPAYNARRIGFAAGVIAAALMLVAIIVLRVLSGVLSLPEVVAEGLLMMMPGALFSAVLDSLQHAAKPLFYVAVGIGALVVGGFLGRLYARSPTWMQLAKIVVGVWLVTGVGVYTVLGAGIFGQHLQAGPIWHGLSLFVVVGVFGLALAESYAMLAGRARQADGLPDERRRTLLRSAVVGVVATLATGAVW
Ga0213880_1009277113300021953Exposed RockMTGTLEAPLVDEPQLPQYNAQRIGFAAGVIAAAVMLVGIVVLRLLSNVESLPEIVAEGILVNLPGALFSAVLDALQHSAKPLFYVGVGIAILVVGGILGRVFASRPTWQRATQIVLGAWLVFGLGVYTFLGGGIFGQHLQAGPVWHGGSLLIVFGLFGVALWHAYFALAHRAVPSLPDVSRREF
Ga0242660_124900413300022531SoilMTDTLHAPPQNTAVRLADYNSRRIGFTSGVIAAALMLLAIVVLRLISGVTSLPEVVAEGLLVMMPGALFSAVLDSLQHAAKPLFYLAVGIGMLIVGGFLGRLYSSQPTWRQAAKIAIGTWLVFGLGVYTVLGAGIFGQHLLAGPAWHALSLLVVL
Ga0242660_125424613300022531SoilMTDTLHAPPQNIAPRLAAYNLRRIGFTSGVVAAAAMLLAIVVLRLISGVTSLPEVVAEGLLVMMPGALFSAVLDSLQHAAKPLFYVAVGIGMLIVGGFLGSWYGSQPTWRQAAKIALGAWVVFGVGVYTVLGAGIFGQHLQAGPVWHALSLLVVF
Ga0242654_1031595613300022726SoilGMADTVNAPAPAETPVLPAYNARRIGFTAGVIAAALMLVAIIVLRVLSGVPSLPEVVADGVLAILPGAIFSAVLDSLQHAAKPLFYLAVGIGALIVGGFLGRFYARQPTWSQVARIVVGVWLVTGVGVYTVLGAGIFGQHLQAGPIWHGVSLLLVVGVFGLALYETYALLERRALHIAGTPDTGRRTLLRSA
Ga0207684_1009825623300025910Corn, Switchgrass And Miscanthus RhizosphereMTETLEAPTVDGSRLADYNARRIGFAAGVVAAAVMCVAIVVLRLLSGVLSLPEIVAEGLLVNMPGALFSAVLDALQHAAKPLFYLAIVIGMLLIGGVLGRWFATEPTWQRAAKIVVGAWLVFGLGVYTVLGAGLFGQHLQAGPIWHGGSLLIVFGVYGLALWHVHAPLAHRAEPALPNVSRRDFLRNTAVALVATIGAGTAWRVPAVS
Ga0207684_1012953323300025910Corn, Switchgrass And Miscanthus RhizosphereMTDTLHAPPTSELRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQHAAKPLFYVSVGIGIIVVGGFLGRWYGTQPSWTQAAKIAIGTWVVFGVGIYTILGAGIFGQHLLAGAAWHAPSLLIVFGVYGVGLHEIYAVLAHRAVPTLPDVTRRALLRNAVVAVVATVGAGTAWRLITGGDFGSDSAPQAGGSPVAAAAPNAAPF
Ga0207646_1007907033300025922Corn, Switchgrass And Miscanthus RhizosphereMTDTLHAPPTSELRLPAYNAQRIGFAAGVVAAVVMLLAIVILRVLSGVTSLPEVVAEGLLSVMPGALFSAVLDSLQHAAKPLFYVSVGIGIIVVGGFLGRWYGTQPSWTQAAKIAIGTWVVFGVGIYTILGAGIFGQHLLAGAAWHAPSLLIVFGVY
Ga0207640_1135749113300025981Corn RhizosphereLSEVRQSRVDPRLPAYNARRLGFAAGVIAAAVMLVAIVALRVLSGVVSLPEIVGEGVLILMPGAVFSTVLDNLQHAAKPLFYLAVAIGMLIVGGFLGRWYASDPGWKQATRIGIGAWLAFGLGVYTLLGAGLFGQQLQAGPIWHGLSLLIVFGVFAVALFESYAQLARRFVVARRPDLTRRTLLRNSVVALVATLGTGATWRVLSGGTAGGL
Ga0209471_108878513300026318SoilMTDTLQAPDVGAPRLAAYNARRIGFAAGVAAAAIMLLAIVLLRLLSGVMSLPEVVAEGLLMLLPGVLFSAVLDSLQHAAKPLFYLAVGIGILIVGGLFGRWYADRPGWGQVLRLVLGVWLVFGIGVYTILGAGLFGQHLQAGVLWHAVSLLIVIGVFG
Ga0209802_111085623300026328SoilMTDTLQAPDVGAPRLAAYNARRIGFAAGVAAAAIMLLAIVVLRLLSGVMSLPEVVAEGLLMLLPGVLFSAVLDSLQHAAKPLFYLAVGIGILIVGGLFGRWYADRPGWGQVLRLVLGVWLVFGIGVYTILGAGLFGQHLQAGVLWHAVSLLIVIGVFGIALYHAYAALVHRALPIEPDHTRRLLLRNAAVGIVATVGAGSLWRLLSGEGASTFAAPPVASGTGALSAATPNPP
Ga0209158_119227913300026333SoilMTDTLQAPDVGAPRLAAYNARRIGSAAGVAAAAIMLLAIVLLRLLSGEMSLPEVVAEGLLMLLPGVLFSAVLDSLQHAAKPLFYLAVGIGILIVGGLFGRWYADRPGWGQVLRLVLGVWLVFGIGVYTILGAGLFGQHLQAGVLWHAVSLLIVIGVFGIALYHAYAALVHRALPSEPDHTRRLLLRNAAVGIVATVGAGSLWRLLSGEGASTFAAP
Ga0209808_115881313300026523SoilMADTLQAPDLEARRLPIYNSQRIGFAAGVLAAMVMLLAIVVLRLLSGVTSLPEVVAEGLLVIMPGALFSAVLDSLQHAAKPLFYVAVGIGMLIVGGLLGRWYANQPTWSQAVKIVLGLWLIFGLGVYLILGAGLFGQHLQAGVVWHGGSLLIVFGVFGLALYHAYAALAHRAAPAIPDITRRVLLRNAAVGLVATVGAGSLWRIVAGGGSETTAPVAATAP
Ga0209577_1047976313300026552SoilMADTLQTPDLEARRLPIYNTQRIGFAAGVLAAMVMLLAIVVLRLLSGVTSLPEVVAEGLLVVMPGALFSAVLDSLQHAAKPLFYLAVGIGMLIVGGLLGRWYANQPTWSQAVKIVLGLWLIFGLGVYLILGAGLFGQHLQAGVVAHGGSLLIVFGVFGLALYHAYAALAHRAAPAIPDISRRVLLRNAAVGLVATVGAGSLWRIVAGGG
Ga0179593_125014713300026555Vadose Zone SoilMADTLKRPASAETPLLPRYNSRRIGFAAGVIAATLMLAAILVLRVLSGVPSMPEVVADGLLAIMPGALFSAVLDSLQHAAKPLFYLGVGIGALIVGGLLGRFYSSNPTWTQIAKIVVGVWLVT
Ga0209701_1042324813300027862Vadose Zone SoilMTDTLHTPRTSELRLPAYNAQRIGFAAGVVAAVVMLVAIAILRVLSGVTSLPEIVAEGLLSVMPGALFSAVLDSLQHAAKPLFYVSVGIGIIVVGGFLGRWYGTQPTWAQAAKIAIGSWVVFGVGIYTILGAGIFGQHLLAGPLWHGLSLLIVFGVFGLALFETYAALAHRATPTLPDVSRRALLRNAVVALVATVGAGTAWRLISGVDSGSSALP
Ga0073997_1186946313300030997SoilAVVVLRVLSGVTSLPEIVAEGLLVMMPGALFSAVLDSLQHAAKPLFYAAVGVGMLIVGGFLGRWYGGQPTWRQAAKIVGGTWLVFGVGIYTLLGAGIFGQRLAAGPIWHGLSLLVVFGVFGIALSEVYGLLERRALPAAESVDGTRRTLLRNAVVAMVATVGLGSAWRLMSGVDAGFGGLRTPVAGGEGPAIALAPNAPPFDLPGLAPEVTATGDFYTVSKN
Ga0170824_10186354613300031231Forest SoilAPPLPEQPRLAAYNARRIGFTAGVAAAVVMLAAIVVLRLLSGVMSLPEVVAEGLLVLMPGALFSAVLDSLQHAAKPLFYLAVAIGMLVVGGFLGRWYSNQPGWRQAAKIVAGLWLVFGIGVYTILGAGIFGQHLPAGPVWHALSLLVVFGVFGLGFLWLRRLAEFEVPERFLASRATAPTGD
Ga0170824_10835107713300031231Forest SoilMADTLKAPALPATPRLPGYNSRRLGFAAGVVAAALMLTAIVVLRVLSGVPSLPEIVADGVLAIMPGALFSAVLDSLQHAAKPLFYVAVGIGALIVGGFLGRVYSSQPTWPQVAKIVVGVWLVIGVGVYFVLGAGIFGKELQAGPIWHGVSLLVVVGVFGLALFE
Ga0310910_1072020013300031946SoilMTGTLETPLVDEPHLPTYNAQRIGFTAGVIAAAVMLAAIIVLRLLSGVESLPEIVAEGILVNLPGALFSAVLDALQHSAKPLFYLAVGIGILVVGGLLGRWFGARPTWQRATQIVLGAWLVFGLGVYTLLGGGLFGQHLQAGPIWHGGSLLIVFGVYGITLWHAYTALAHRAEPAMP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.