NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F073342

Metagenome Family F073342

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F073342
Family Type Metagenome
Number of Sequences 120
Average Sequence Length 117 residues
Representative Sequence WMGVPRVDFQPGARGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV
Number of Associated Samples 92
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.83 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.333 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(46.667 % of family members)
Environment Ontology (ENVO) Unclassified
(66.667 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(73.333 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 22.30%    β-sheet: 29.73%    Coil/Unstructured: 47.97%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 120 Family Scaffolds
PF01925TauE 45.00
PF00390malic 4.17
PF10604Polyketide_cyc2 4.17
PF03949Malic_M 3.33
PF02776TPP_enzyme_N 0.83
PF00361Proton_antipo_M 0.83
PF00296Bac_luciferase 0.83
PF04909Amidohydro_2 0.83
PF10851DUF2652 0.83
PF04679DNA_ligase_A_C 0.83
PF03098An_peroxidase 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 120 Family Scaffolds
COG0730Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 familyInorganic ion transport and metabolism [P] 45.00
COG0281Malic enzymeEnergy production and conversion [C] 7.50
COG0686Alanine dehydrogenase (includes sporulation protein SpoVN)Amino acid transport and metabolism [E] 3.33
COG1793ATP-dependent DNA ligaseReplication, recombination and repair [L] 0.83
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A98.33 %
All OrganismsrootAll Organisms1.67 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300010401|Ga0134121_10006983All Organisms → cellular organisms → Bacteria9087Open in IMG/M
3300011998|Ga0120114_1000052All Organisms → cellular organisms → Bacteria41193Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil46.67%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.67%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil9.17%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere9.17%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.33%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil5.83%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.50%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.67%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.83%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.83%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost0.83%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.83%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011998Permafrost microbial communities from Nunavut, Canada - A30_35cm_6MEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020015Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m1EnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028708Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_152EnvironmentalOpen in IMG/M
3300028718Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25386J43895_1004408733300002912Grasslands SoilIWMGVPRVDFFAGARGSLVGAEYHCIHGENQKAVFKVLDCSAPQEITMQMDFPMVGNVWRTDRVEAEGPSTTRVDTAVSWKTSGIKAPFLDFMVGRLLRKYGAEYDKRVAEMLQESARASV*
Ga0066672_1001832113300005167SoilRGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV*
Ga0066672_1055528913300005167SoilRGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPARTRVDTAISWNIKGIRAPVLSFMATRMLRRYGALYDKRVAEMIQASARASV*
Ga0066683_1049769913300005172SoilQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKVDVTRA*
Ga0066673_1019960623300005175SoilDPKLRQIWMGVPRVDFKAGARGSLIGAEYHCIHGPNQKTVFKVLGCTAPTDITMQIGFPFVGTVWRTDRIEAEGSATTRVDTAISWQTSGIKGRVLDFMASRMLRRYGDLYDKRVAEMIQENARTSV*
Ga0066679_1003298233300005176SoilTPEEVFRHFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV*
Ga0066690_1042798923300005177SoilWMGVPRVDFQPGARGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV
Ga0066688_1011174813300005178SoilFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV*
Ga0066684_1004830133300005179SoilYFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQDSARARV*
Ga0066685_1056551913300005180SoilKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIRAPVLNFMATRMLRRYGALYDKRVAEMLEARKADVTRTRV*
Ga0066678_1017215013300005181SoilRGSLVGAEYHCIHGPNQTTVFKILGCNAPSEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWQASGIASPVLNFMASRMLRRYGALYDKRVAEMIQESARERV*
Ga0066676_1024434223300005186SoilGAEYHCIHGENQKTVFKVLDSSAPNEITMQMDFPLAGTVWRTDRIEAEGPATTRVDTAIAWQAHGIKAPLVDFMVRRMLLKYGAVYNKRVAEMLAVPDQETARASV*
Ga0066676_1047003213300005186SoilTVDAATAKIRSTARFPGTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWTSRGIKAPVLNFMASRMLRRYGALYDKRVTEMLEARKADVTPV*
Ga0066675_1036098033300005187SoilWMGVPRVDFRPGARGSLVGAEYHCVHGPNQITVFKVLGCNAPTDITMQIGFPFVGTVRRTDRIEAEGPSTTRVDTAIAWKTSGIKGAVLDFVASRMLRRYGALYDQRVSEMIEASKADVTRA*
Ga0066675_1084639123300005187SoilYFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENRKTVFKVLGCNAPKEITMQIEFPFVGIVWRTDRAEAEGPSTTRVDTAISWTSRGIKAPVLNFMASRMLRRYGALYDKRVTEMLEARKADVTPV*
Ga0070676_1090195123300005328Miscanthus RhizosphereGARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWQTSGIKGRVLDFMASRMLRRYGGLYDKRVAEMIQENARASV*
Ga0070708_10117930823300005445Corn, Switchgrass And Miscanthus RhizosphereTDPKLRQIWMGLPRVDFFAGARGSLVGAEYHCIHGENQKAVFKVLACEVPSELTMQINFPMVGSIWQTDRVAQEGPATTRVDRAIAWQSRGLRAPLVDFMVARLLRKYGEQYDRRVTELLSA*
Ga0070708_10199384613300005445Corn, Switchgrass And Miscanthus RhizosphereAEYHCIHGENQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAAAEGPSTTRVDTAISWKTTGIKAPVLDFMATRMLRRFGALYDKRVAEMLQESARASV*
Ga0066686_1029888713300005446SoilENQKAVFKVLDCSAPQEITMQMDFPMVGNVWRTDRVEAEGPSTTRVDTAVSWKTSGIKAPFLDFMVGRLLRKYGAEYDKRVAEMLQESARASV*
Ga0066686_1047613833300005446SoilQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWTSRGIKAPVLNFMASRMLRRYGALYDKRVTEMLEARKADVTPV*
Ga0066689_1017412023300005447SoilGARGSLVGAEYHCIHGENQKTVFRILACDAPRELTMQMDFPFVGTVWRTDRVAPEGAATTRVDTAITWHARGIAAPLADFMASRMLRKYGAIYDRRVSEMLSA*
Ga0066689_1039602313300005447SoilDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLREYGALYDKRVAEMLEASKVDVTRA*
Ga0066681_1033799523300005451SoilEVFRYFTDPKLRQIWMGVPRVDFKAGARGSLIGAEYHCIHGSNQKTVFKVLGCTAPTDITMQIGFPFVGTVWRTDRIEAEGSATTRVDTAISWQTSGIKGRVLDFMASRMLRRYGDLYDKRVAEMIQENARTSV*
Ga0070707_10118657313300005468Corn, Switchgrass And Miscanthus RhizosphereEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPATTRVDTAISWNIKGIRAPVLNFMASRMLRRYGALYDKRVAEMIQESARTSA*
Ga0070707_10173492113300005468Corn, Switchgrass And Miscanthus RhizosphereVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNITGIKAPVLNFMATRMLRRYGALYDKRVAEMIQESARTRV*
Ga0070698_10076600623300005471Corn, Switchgrass And Miscanthus RhizosphereTAKIRSTARFPGTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNITGIKAPVLNFMATRMLRRYGALYDKRVAEMIQESARTRV*
Ga0070699_10065567513300005518Corn, Switchgrass And Miscanthus RhizosphereDPKLRQIWMGLPRVDFFAGARGSLVGAEYHCIHGENQKAVFKVLACEVPSELTMQIDFPMVGSIWQTDRVAQEGPATTRVDRAIAWQSRGLRAPLVDFMVARLLRKYGEQYDRRVTELLSA*
Ga0070697_10173496613300005536Corn, Switchgrass And Miscanthus RhizosphereYHCIHGENQKTVFKVLGCNAPREITMQVEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKSSGIKAPVLNFMATRMLRKYGALYDRRVAEMIQESARASV*
Ga0066697_1030339913300005540SoilQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIRAPVLNFMATRMLRRYGALYDKRVAEMLEARKADVTRTRV*
Ga0070696_10102287223300005546Corn, Switchgrass And Miscanthus RhizosphereGTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPATTRVDTAISWNIKGFRAPVLNFMASRMLRRYGALYDKRVAEMIQESARTSA*
Ga0070696_10156616923300005546Corn, Switchgrass And Miscanthus RhizosphereSRFPGTPDEVFRYFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWQTSGIKGRVLDFMASRMLRRYGGLYDKRVAEMIQENARASV*
Ga0066661_1031049123300005554SoilAKIRSTARFPGTPEQVFKHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCKAPTEITMQIEFPFVGPVWRTDRAAAEGPSTTRVDTAISWKTTGIKAPVLDFMATRMLRRYGALYDKRVAEMLQESARASV*
Ga0066692_1058668513300005555SoilPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKADVTRA*
Ga0066707_1012722033300005556SoilDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKVDVTRA*
Ga0066707_1026885613300005556SoilEYHCIHGENQKAVFKVLDCSAPQEITMQMDFPMVGNVWRTDRVEAEGPSTTRVDTAVSWKTSGIKAPFLDFMVGRLLRKYGAEYDKRVAEMLQESARASV*
Ga0066700_1098767623300005559SoilGAEYHCIHGENQKTVFKVLDSSAPNEITMQMDFPLAGTVWRTDRIEAEGPATTRVDTAIVWQAHGIKAPLVDFMVRRMLLKYGAVYNKRVAEMLAAPDQETARASV*
Ga0066699_1018807733300005561SoilDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIKAPVLNFMATRMLRRYGALYDKRVAEMLQESARASV*
Ga0066699_1063438323300005561SoilHFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPGQKTVFRVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV*
Ga0066693_1000402413300005566SoilYFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQDSARARV*
Ga0066703_1044836313300005568SoilRGSLVGAEYHCIHGENQKTVFRVLACEAPSEITMQMDFPFVGTVWRTDRVAPEGPASTRVDTAITWHARGLMAPLADLMASRMLRKYGALYDERVSEMLRA*
Ga0066705_1025842133300005569SoilTVFKVLGCNAPTDITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAIAWKTSGIKGAVLGFVAARMLRRYGALYDQRVSEMIEASKADVTRV*
Ga0066705_1048738913300005569SoilPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQESARARV*
Ga0066694_1008350033300005574SoilFHYFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGIVWRTDRAEAEGPSTTRVDTAISWTSRGIKAPVLNFMASRMLRRYGALYDKRVTEMLEARKADVTPV*
Ga0066708_1005943213300005576SoilTVFKVLGCNAPTDITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAIAWKTSGIKGAVLGFVAARMLRRYGVLYDQRVSEMIEASKADVTRV*
Ga0066706_1011183743300005598SoilQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKADVTRA*
Ga0066696_1004274613300006032SoilYPGTPEQVFRYFTDASLRKLWMGVPRVDFVPGARGSLVGAEYHCIHGENQKTVFKVLDSSAPNEITMQMDFPLAGTVWRTDRIEAEGPATTRVDTAIAWQAHGIKAPLVDFMVRRMLLKYGAVYNKRVAEMLAAPDQETARASV*
Ga0066658_1082319423300006794SoilPRVDFQPGGRGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGALYDKRVAEMIQENARASV*
Ga0066660_1111804713300006800SoilDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV*
Ga0066660_1168732313300006800SoilFMPGARGSLVGAEYHCIHGENQKTVFRVLACEAPSEITMQMDFPFVGTVWRTDRVAPEGPASTRVDTAITWHARGLMAPLADLMASRMLRKYGALYDERVSEMLRA*
Ga0075425_10283103713300006854Populus RhizosphereFHPGARGSLVGAEYHCIHGENQKSVFKVLGCKAPTELTMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAIAWNTSGIKAPVLNFMVSRMLRKYGALYDKRVAEMIQESARASV*
Ga0066710_10163095513300009012Grasslands SoilNQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKVDVTRA
Ga0066710_10227587413300009012Grasslands SoilNQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIRAPVLNFMATRMLRRYGALYDKRVAEMLEARKADVTRTRV
Ga0066710_10457540413300009012Grasslands SoilDPVTAKIRGTTRFPGTPDEVFRYFTDPKLRQIWMGVPRVDFQAGARGSLVGAEYHCIHGPNQKTVFKVLGCTAPTEITMQIGFPFVGTVRRTDRIEAEGPSTTRVDTAISWKTSGIKGPLLDFMATRMLRRYGALYDKRVSEMLEARKEDVTRV
Ga0066709_10115238833300009137Grasslands SoilVDPATAKIRGTTRFPGTPDEVFRYFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAIWWDAHGIKAPVLNFMASRMLRRYGALYDKRVSEMLEARKADVTRV*
Ga0066709_10131958823300009137Grasslands SoilIRSTARFPGTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQVEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIKAPVLNFMASRMLRRYGALYDKRVAEMLQESARASV*
Ga0066709_10156066133300009137Grasslands SoilSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGIVWRTDRAEAEGPSTTRVDTAISWTSRGIKAPVLNFMASRMLRRYGALYDKRVTEMLEARKADVTPV*
Ga0066709_10200567223300009137Grasslands SoilSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIRAPVLNFMATRMLRRYGALYDKRVAEMLEARKADVTRTRV*
Ga0134084_1020071623300010322Grasslands SoilGAEYHCIHGPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQDSARARV*
Ga0134086_1005544513300010323Grasslands SoilAVFKVLGCSAPREITMQIDFPMVGSIQRTDRIEAEGPSTTRVDTAIAWKASGIKAPLVDVFVARMLRKYGAEYDKRVAEMLQENARVSV*
Ga0134111_1042236523300010329Grasslands SoilFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQDSARARV*
Ga0134121_1000698313300010401Terrestrial SoilKMRSTSRFPGTPDEVFRYFTDAKLRQMWRGVARVDFQRGARGWRVGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWQTSGIKGRVLDFMASRMLRRYGGLYDKRVAEMIQENARASV*
Ga0120114_1000052423300011998PermafrostMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIDAQGPSTSRVDTAISWNTSGIAAPVLDFIASRMLRRYGALYDKRVAEMLEADKADVTRTQV*
Ga0137389_1068475913300012096Vadose Zone SoilTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPRELTMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSSGIKAPVLNFMATRMLRKYGALYDRRVAEMIQESARARV*
Ga0137399_1064136523300012203Vadose Zone SoilLTDPKLRQIWMGVPRVDFHSGARGSLVGAEYHCIHGANRTTVFKVVGCKAPTEITMQIDFPFVGKVWRTDRAEAEGPSTTRVDTAISWNTAGIRAPVLDFLARGMLRRYGALYDKRVTEMLQEKARASV*
Ga0137399_1173299613300012203Vadose Zone SoilRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVVGCTAPTEITMQIGFPFVGTVWRTDRIQAEGPSTTRVDTAISWHSRGIAAPVLDFMASRMLRRYGALYDKRVAEMIQEKARASV*
Ga0137381_1165939513300012207Vadose Zone SoilIWMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIAFPFVGTVWRTDRIEAEGPSTTRVDTAISWQTSGIKGAVLDFMASRMLRRYGALYDKRVAEMIQENARASV*
Ga0137379_1000214613300012209Vadose Zone SoilMGVPRVDFHPGARGSLVGAEYHCIHGPNQKTVFKVVGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDAAISWKTSGIKGPLLDFMATRMLRRYGALYDKRVSEMLEASKA
Ga0137377_1131153523300012211Vadose Zone SoilFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKGPLLDFMATRMLRRYGALYDKRVSEMLEASKAGVTRA*
Ga0137370_1021317823300012285Vadose Zone SoilMGVPRVDFQAGARGSLVGAEYHCIHGPNQKTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIKAPILDFIATRMLRRYGALYDKRVAEMLEARKADVIGTEV*
Ga0137386_1042881213300012351Vadose Zone SoilMGVPRVDFHPGARGSLVGAEYHCTHGPNQKTVFKVVGCKAPTEITMQIEFPFVGTVWRTDHAEAEGPSTTRVDTAISWKTSGIKGPLLDFMATRMLRRYGALYDKRVSEMLEASKADVTHA*
Ga0137384_1080077023300012357Vadose Zone SoilGVPRVDFHPGARGSLVGAEYHCIHGPNQKTVFKVVGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKGPLLDFMATRMLRRYGALYDKRVSEMLEASKADVTHA
Ga0134110_1048483413300012975Grasslands SoilATAKIRSTARFPGTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKVDVTRA*
Ga0134081_1027920713300014150Grasslands SoilPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGASTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQDSARARV*
Ga0137418_1091519323300015241Vadose Zone SoilRGSLVRAEYHCIHGPNQKTVFKVLGCTAPTEITMQIGFPFVGTVRRTDRIEAEGPSTTRVDTAISWKTSGIKGPLLDFMATRMLRRYGALYDKRVSEMLEARKADVTRV*
Ga0134089_1051549413300015358Grasslands SoilMGVPRVDFFAGARGSLVGAEYHCIRGENQKAVFKVLDCSAPQEITMQMDFPMVGNVWRTDRVEAEGPSTTRVDTAVSWRTSGVKAPFLDFMVGRLLRKYGAEYDKRVAEMLQESA
Ga0134085_1022871913300015359Grasslands SoilQPGARGSLVGAEYHCIHGPNQTTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWNAHGVKAPVLNFMASRMLRKYGAIYDRRVSEMLSA*
Ga0134112_1021900423300017656Grasslands SoilGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIGFPFVGTVWRTDRIQAEGPSTTRVDTAISWKTSGIKAPMLNFMATRMLRKYGALYDQRVREMLEGGKAGVARV
Ga0134083_1009910213300017659Grasslands SoilPGARGSLVGAEYHCIHGENQKTVFRILACDAPRLLTMQMDFPFVGTVWRTDRVAPEGEATTRVDTAITWHARGIAAPLADFMASRMLRKYGAIYDRRVSEMLSA
Ga0134083_1058934223300017659Grasslands SoilRSTARFPGTPEQVFKHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAAAEGPSTTRVDTAISWKTTGIKAPVLDFMATRMLRRFGALYDKRVAEMLQESARASV
Ga0184605_1019024113300018027Groundwater SedimentGARGSLVGAEYHCIHGPNQTTVFKVVGCTAPTEITMQIEFPFVGTVWRTDRVEAEGPATTRVDTAISWKTSGIKAPVLDFMASRMLRRYGALYDKRVSEMLEARQADVTRV
Ga0184605_1041346313300018027Groundwater SedimentTPTVDPSTAKIRGTTRFPGTPDEVFRYFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEITMQIGFPFVGTVWRTDRIESEGPSTTRVDTAISWNTHGIAGPVLDFMASRMLRRYGALYDKRVAEMIQENARASV
Ga0066655_1010741113300018431Grasslands SoilDAKLRQLWMGVPRVDFMPGARGSLVGAEYHCIHGENQKTVFRILACDAPRELTVQMDFPFVGTVWRTDRVAPEGVATTRVDTAITWHARGIAAPLADFMASRMLRKYGAIYDRRVSEMLS
Ga0066655_1025321113300018431Grasslands SoilDASLRKLWMGVPRVDFVPGARGSLVGAEYHCIHGENQQTVFKVLDSSAPNEITMQMDFPLAGTVWRTDRIEAEGPATTRVDTAIAWQAHGIKAPLVDFMVRRMLLRYGAVYNKRVAEMLAAPDQETARASV
Ga0066667_1032675413300018433Grasslands SoilRFPGTPEDVFRYLTDPGLRKIWMGVPRVDFFAGARGSLVGAEYHCIHGENQKAVFKVLDCSAPQEITMQMDFPMVGNVWRTDRVEAEGPSTTRVDTAVSWKTSGIKAPFLDFMVGRLLRKYGAEYDKRVAEMLQESARASV
Ga0066667_1090561813300018433Grasslands SoilPEEVFRHFTDPKLRQIWMGVPRVDFHSGARGSLVGAEYHCIHGPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQDSARARV
Ga0066669_1073973413300018482Grasslands SoilNQKTVFKVLGCTAPTEITMQIGFPFVGTVLRTDRIEAEGPSTTRVDTAISWKTSGIKAPILDFMATRMLRRYGALYDKRVAEMLQGRKADVIGAEV
Ga0066669_1211313313300018482Grasslands SoilFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWTSRGIKAPVLNFMASRMLRRYGALYDKRVTEMLEARKADVPPV
Ga0193734_104783423300020015SoilVDPVTAKMRGTTRFPGTPDEVFRYFTDPKLRQIWMGVPRVDFQAGARGSLVGAEYHCIHGPNQKTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIKAPILDFMASRMLRRYGDLYDKRVAEMIQENARTSV
Ga0193695_103941123300021418SoilCGPNQKTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIKAPILDFMATRMLRRYGDLYDKRVAEMIQENARTSV
Ga0222622_1018436923300022756Groundwater SedimentMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIAAPVLDFMASRMLKKYGALYDKRVAEMIQENARASV
Ga0207653_1042065013300025885Corn, Switchgrass And Miscanthus RhizosphereFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWQTSGIKGRVLDFMASRMLRRYGGLYDKRVAEMIQENARASV
Ga0207646_1121474313300025922Corn, Switchgrass And Miscanthus RhizosphereVDPATAKIRSTARFPGTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNITGIKAPVLNFMATRMLRRYGALYDKRVAEMIQESARTRV
Ga0209055_106728733300026309SoilPEEVFRHFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPGQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV
Ga0209266_122410513300026327SoilWMGVPRVDFFAGARGSLVGAEYHCIHGENQKAVFKVLDCSAPQEITMQMDFPMVGNVWRTDRVEAEGPSTTRVDTAVSWKTSGIKAPFLDFMVGRLLRKYGAEYDKRVAEMLQESARASV
Ga0209802_111410713300026328SoilSRTGQKSVFKVLGCNAPTEITMQIDFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRKYGALYDKRVAEMIQENARASV
Ga0209473_119490613300026330SoilRFKVLGCKAPREITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQDSARARV
Ga0209158_132814423300026333SoilTAKIRSTARFPGTPEEVFRHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIRAPVLNFMATRMLRRYGALYDKRVAEMLQESARASV
Ga0209377_106922533300026334SoilHCIHGENQKAVFKVLDCSAPQEITMQMDFPMVGNVWRTDRVEAEGPSTTRVDTAVSWKTSGIKAPFLDFMVGRLLRKYGAEYDKRVAEMLQESARASV
Ga0209057_110608013300026342SoilENQKTVFKVLGCNAPKEITMQIEFPFVGIVWRTDRAEAEGPSTTRVDTAISWTSRGIKAPVLNFMASRMLRRYGALYDKRVTEMLEARKADVTPV
Ga0209057_114454823300026342SoilPDLRRIWMGVARLDYFAGARGSLVGAEYHCIHGENQKTVFKVLGCSAPREITMQMGFPFVGTVWRTDRIAAEGPHTTRVDSALSWRTSGIRAPFLDFVATRLLRRYGALYNKRVTELIAAPGATTRASV
Ga0209690_120784713300026524SoilTDAQLRQLWMGVPRVDFMPGARGSLVGAEYHCIHGENQKTVFRILACDAPRELTMQMDFPFVGTVWRTDRVAPEGAATTRVDTAITWHARGIAAPLADFMASRMLRKYGAIYDRRVSEMLSA
Ga0209807_101198653300026530SoilPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQESARARV
Ga0209160_120650023300026532SoilTVFRVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTSGIKAPVLDFMAARMLRRYGAQYDKRVAEMIQENARASV
Ga0209157_111786813300026537SoilFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKVDVTRA
Ga0209056_1036181613300026538SoilGENQKTVFKVLGCNAPKEITMQVEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIKAPVLNFMASRMLRRYGALYDKRVAEMLQESARASV
Ga0209376_134928823300026540SoilCNAPKEITMQIEFPFVGIVWRTDRAEAEGPSTTRVDTAISWNSRGIRAPVLNFMATRMLRRYGALYDKRVAEMLEARKADVTRTRV
Ga0209161_1024770913300026548SoilEYHCIHGPNQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWKTTGIKAPVLDFMAARMLRRYGAQYDKRVAEMLQDSARARV
Ga0209161_1038924923300026548SoilYHCIHGPNQTTVFKVLGCNAPTDITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAIAWKTSGIKGAVLGFVAARMLRRYGALYDQRVSEMIEASKADVTRV
Ga0209689_1000815253300027748SoilPGARGSLVGAEYHCIHGENQKTVFKVLDSSAPNEITMQMDFPLAGTVWRTDRIEAEGPATTRVDTAIAWQAHGIKASLVDFMVRRMLLKYGAVYNKRVAEMLAVPDQETARASV
Ga0209689_123861423300027748SoilLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPSTTRVDTAISWNSRGIRAPVLNFMATRMLRRYGALYDKRVAEMLEARKADVTRTRV
Ga0209689_124060313300027748SoilLGCNAPKEITMQIEFPFVGTVWRTDRAEAEGPWTTRVDSAISWTSRGIKAPVLNFMASRMLRKYGALYDKRVAEMLEASKVDVTRA
Ga0137415_1138782213300028536Vadose Zone SoilGTPEQVFKHFTDPKLRQIWMGVPRVDFHPGARGSLVGAEYHCIHGENQKTVFKVLGCKAPTEITMQIEFPFVGTVWRTDRAAAEGPSTTRVDTAISWKTTGIKAPVLDFVATRMLRRYGALYDKRVAEMLQESARASV
Ga0307295_1024132513300028708SoilVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVVGCTAPTEITMQIEFPFVGTVWRTDRVEAEGPATTRVDTAISWKTSGIKAPVLDFMASRMLRRYGALYDKRVSEMLEARQADVTRV
Ga0307307_1015448923300028718SoilQKTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIKAPILDFMATRMLRRYGALYDKRVAEMLEAGKTDVIGTEV
Ga0307302_1015564333300028814SoilIHGPNQTTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGITAPVLDFMASRMLKKYGALYDKRVAEMIQENARASV
Ga0307277_1000267113300028881SoilGPNQTTVFKVLGCNAPTEITMQIGFPFVGTVWRTDRIQAEGPSTTRVDTAISWHTSGIAAPVLDFMASRMLRKYGALYDKRVAEMIQENARASV
Ga0307308_1056124213300028884SoilFPGTPDEVFRYFTDPKLGQIWMGVPRVDFQAGARGSLVGAEYHCIHGPNQKTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIKAPILDFMATRMLRRYGDLYDKRVAEMIQENARTSV
Ga0307469_1145799623300031720Hardwood Forest SoilFFTDPNLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIKGPLLDLFASRMLRRYGALYDKRVSEMLEARKADVTRA
Ga0307471_10164861113300032180Hardwood Forest SoilARGSLVGAEYHCIHGPNQTTVFKVLGCNAPTEMTMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWKTSGIKGPLLDLFASRMLRRYGALYDKRVSEMLEARKADVTRA
Ga0307471_10360459213300032180Hardwood Forest SoilEEVFRFFTDPKLRQIWMGVPRVDFQPGARGSLVGAEYHCIHGPNQTTVFKVLGCTAPTEITMQIGFPFVGTVWRTDRIEAEGPSTTRVDTAISWHTSGITAPVLDFMASRMLRRYGALYDKRVAEMLQENARASV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.