NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F094020

Metagenome Family F094020

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F094020
Family Type Metagenome
Number of Sequences 106
Average Sequence Length 124 residues
Representative Sequence MLVNSGVARFQNCAHETVAQILKSLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQKTREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAG
Number of Associated Samples 96
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 81.82 %
% of genes near scaffold ends (potentially truncated) 10.38 %
% of genes from short scaffolds (< 2000 bps) 8.49 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score0.28

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(38.679 % of family members)
Environment Ontology (ENVO) Unclassified
(42.453 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(44.340 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 69.03%    β-sheet: 0.00%    Coil/Unstructured: 30.97%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.28
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 106 Family Scaffolds
PF05598DUF772 1.89
PF07386DUF1499 0.94
PF07369DUF1488 0.94
PF00034Cytochrom_C 0.94
PF03404Mo-co_dimer 0.94
PF13185GAF_2 0.94
PF09913DUF2142 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 106 Family Scaffolds
COG4446Uncharacterized conserved protein, DUF1499 familyFunction unknown [S] 0.94


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005344|Ga0070661_100520575Not Available954Open in IMG/M
3300005456|Ga0070678_100875937Not Available819Open in IMG/M
3300012924|Ga0137413_10263111Not Available1190Open in IMG/M
3300012939|Ga0162650_100015732Not Available1040Open in IMG/M
3300018051|Ga0184620_10083840Not Available959Open in IMG/M
3300018054|Ga0184621_10017841Not Available2144Open in IMG/M
3300018054|Ga0184621_10157184Not Available818Open in IMG/M
3300018481|Ga0190271_10429415Not Available1421Open in IMG/M
3300026041|Ga0207639_10166042Not Available1865Open in IMG/M
3300026121|Ga0207683_10223571Not Available1716Open in IMG/M
3300028793|Ga0307299_10028656Not Available2018Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil38.68%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment6.60%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.72%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil4.72%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere3.77%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere3.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil2.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere2.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere2.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere2.83%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.89%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.89%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.89%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere1.89%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere1.89%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.89%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.94%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.94%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.94%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.94%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.94%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.94%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.94%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.94%
SoilEnvironmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil0.94%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.94%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.94%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.94%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.94%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300005161Soil and rhizosphere microbial communities from Laval, Canada - mgLPAEnvironmentalOpen in IMG/M
3300005162Soil and rhizosphere microbial communities from Laval, Canada - mgLABEnvironmentalOpen in IMG/M
3300005165Soil and rhizosphere microbial communities from Laval, Canada - mgHMCEnvironmentalOpen in IMG/M
3300005168Soil and rhizosphere microbial communities from Laval, Canada - mgLPCEnvironmentalOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005333Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-3 metaGHost-AssociatedOpen in IMG/M
3300005344Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaGHost-AssociatedOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005354Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaGHost-AssociatedOpen in IMG/M
3300005364Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaGHost-AssociatedOpen in IMG/M
3300005367Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaGHost-AssociatedOpen in IMG/M
3300005456Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaGHost-AssociatedOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005563Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2Host-AssociatedOpen in IMG/M
3300005718Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2Host-AssociatedOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300006237Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)Host-AssociatedOpen in IMG/M
3300006358Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2Host-AssociatedOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009101Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaGHost-AssociatedOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010040Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012939Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t1i015EnvironmentalOpen in IMG/M
3300012943Backyard soil microbial communities from Emeryville, California, USA - Original compost - Back yard soil (BY)EnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015258Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_1DaEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300018082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2EnvironmentalOpen in IMG/M
3300018083Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1EnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300018476Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 TEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300019867Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m1EnvironmentalOpen in IMG/M
3300019873Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s1EnvironmentalOpen in IMG/M
3300019875Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s2EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2EnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300023266Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S220-509R-4EnvironmentalOpen in IMG/M
3300025903Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025937Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026041Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026121Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026374Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-AEnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027560Soil and rhizosphere microbial communities from Laval, Canada - mgLPC (SPAdes)EnvironmentalOpen in IMG/M
3300028707Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_148EnvironmentalOpen in IMG/M
3300028708Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_152EnvironmentalOpen in IMG/M
3300028712Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_139EnvironmentalOpen in IMG/M
3300028715Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203EnvironmentalOpen in IMG/M
3300028717Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158EnvironmentalOpen in IMG/M
3300028720Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_357EnvironmentalOpen in IMG/M
3300028722Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_368EnvironmentalOpen in IMG/M
3300028755Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_356EnvironmentalOpen in IMG/M
3300028778Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_142EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300028787Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381EnvironmentalOpen in IMG/M
3300028791Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144EnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028796Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141EnvironmentalOpen in IMG/M
3300028799Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123EnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300028810Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_151EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028876Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140EnvironmentalOpen in IMG/M
3300028880Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_181EnvironmentalOpen in IMG/M
3300031184Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 13_SEnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300032122Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D4EnvironmentalOpen in IMG/M
3300034690Sediment microbial communities from East River floodplain, Colorado, United States - 60_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0062589_10118973123300004156SoilMLVKSGVACFQHCAYEKVARILESLRTFQRAERISVRPQFISLIEAMRQRVDFLLSQRKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGITKTRKTVPTLIVAA
Ga0066807_103981613300005161SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLPHRILYLYGTTKNRKTLLTLIVPAAFAGLTLIILFLKMGMSDQADSEHAGSRNVTTSGR
Ga0066814_1006173723300005162SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQA
Ga0066869_1013273713300005165SoilMLVNPGVAGFQNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQAGSE
Ga0066809_1009756013300005168SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAA
Ga0070690_10078167913300005330Switchgrass RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSTQAGNEHASTVKG
Ga0070677_1044814513300005333Miscanthus RhizosphereMLVNSGVACFQNCAYETAAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQAG
Ga0070661_10052057513300005344Corn RhizosphereMLVNSGVACFQNCAYETAAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQRQLRLLTTRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQ
Ga0070692_1101967013300005345Corn, Switchgrass And Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQADSEHAGSRNVTTSGRTQAPDWTIFDDAF
Ga0070675_10012459213300005354Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVY
Ga0070673_10056996013300005364Switchgrass RhizosphereIRGEALNIPASPQRRGCIDSLDSAMLVNSGVARFQDCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALFRQSHLRLLATRARDGVHAILLQSEPLQHRILPPVWHHKDS*
Ga0070667_10067459323300005367Switchgrass RhizosphereVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKLLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALI
Ga0070678_10087593713300005456Miscanthus RhizosphereMLVNSGVACFQNCAYETAAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATR
Ga0068867_10094691013300005459Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQAGSEHAGSRNVT
Ga0068855_10145736623300005563Corn RhizosphereMLVNSGGARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAIL
Ga0068866_1018212023300005718Miscanthus RhizosphereMLVNPGVAGFQNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQADSEHARSRNVTTSGRTQAPDWTIFDDA
Ga0068866_1048827513300005718Miscanthus RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVL
Ga0068861_10050534923300005719Switchgrass RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQAD
Ga0097621_10123798113300006237Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFFKM
Ga0068871_10124027713300006358Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFFKMGMSDQAGSEHAGSRNVT
Ga0105245_1070108713300009098Miscanthus RhizosphereMLVNPGVAGFQNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKLLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILHLYDITKTRKTVPTLIVAAFAGLALIIV
Ga0105247_1071748713300009101Switchgrass RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALFRQSHLRLLATRARDGVYAILLQSEPLQHKILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSTQAGNEHAGTVKGATPSVIQVP
Ga0105243_1301291013300009148Miscanthus RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATR
Ga0111538_1413316623300009156Populus RhizosphereMTVNPGVARFQQHAHETLAQMQDTLRTFHRVVGPSCRTHLVSLTDFLLSQGKPMWRALRQRIDEALSRQSHLRLLATRARSGVYAIPLQSEPFLHSIRHLYDTIKNRKTLL
Ga0105242_1305749613300009176Miscanthus RhizosphereMLVKSGVACFQHCAYEKVARILESLRTFQRAVRISVRPQFVSLIEAMRQRVDFLLSQRKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQH
Ga0105249_1141815213300009553Switchgrass RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFFKMGMSDQAGSEHAASRNVTTSGRTQGTDWTIFDDAFVK
Ga0126308_1092743713300010040Serpentine SoilMLVNSGVARFQDYAHETVSQILESLRTFQRVVGISRRPQFISLIEAARQRVDFLLSQGKPLWRAVRQGTGEALSGQSHLRLLATRARDGVYAILLQSEPFLNSIRQLYGPSSTRKIVPTM
Ga0134127_1022035213300010399Terrestrial SoilMLVNSGVACFQNCAYETVAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDYLLSQRKPLWRAFIQETREALSRQSHLRLLAT
Ga0137397_1100722213300012685Vadose Zone SoilMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRTDEALSRQSHLRLPATRARDGVYAILLQSEPFLYRIRDLYGTTKNRKTLLTLIVAAFAALALIILFLKMGMSDQAGNEHAGSVKGTTSSLTQVPDWTIFDDAF
Ga0137394_1042172123300012922Vadose Zone SoilVAFQNYAHEAVAQILESLRTFQRAVGISGRPQLISLIEAMRQRVDFLLSQGKSLWHALRQRTDEALSRQSHLRLPATRARDGVYAILLQSEPFLYRIRDLYGTTKNRKTLLTLIVAAFAGLALIILMLLKMGMSINRATNTPVR*
Ga0137413_1026311123300012924Vadose Zone SoilMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRTDEALSRQSHLRLLATRARDGVYAILLQSEPFLYRIRDLYGTTKNCI
Ga0162650_10001573213300012939SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQAGGEHAGSRNVTTSGRTQAPDWTIFDDAFAKQP
Ga0164241_1115296023300012943SoilMLVNSGGARFQNYAHETVAQILASLRTFQRVASISRRPQFVSLIEAMRQRVDFLVSQGKPWCAFIEEAREALSRQSHLRMLATRARDGVYAILLQS*
Ga0164303_1004103843300012957SoilMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRA
Ga0164303_1066737923300012957SoilMLVKSGVACFQHCAYEKVARILESLRTFQRAERISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARD
Ga0164299_1159322823300012958SoilMLVKSGVACFQHCAYEKVARILESLRTFQRAERISVRPQFISLIEAMRQRVDFLLSQRKPLWRAFIQETREALSRQSHLR
Ga0164302_1000996213300012961SoilMLVKSGVACFQHCAYEKVARILESLRTFQRAERISVRPQFISLIEAMRQRVDFLLSQLKPLWRAFIQETREALSRQSHLRLLATR
Ga0157374_1221346713300013296Miscanthus RhizosphereMLVNPGVAGFQNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQADSEHARSRNVTTSGRTQAPDWTIFDD
Ga0157379_1223032613300014968Switchgrass RhizosphereMTVNQGVARFQHYAHAMLAQMQDTMRTFQRVVGPSCRTRLTFLIEAMRQRVDFLLSQSKPLWRKVCQRTDELLSRQRHLRLLATQARACVYAVLLQSKPLLDGIRHLCGTPSPI
Ga0157376_1108212313300014969Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGVLQRTGEA
Ga0137418_1064879513300015241Vadose Zone SoilLKAFATGDSLESAMLVNPGVARFQHYAHETVAQIQEFLRTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRTDEALSRQSHLRLLATRARDGVYAILLQSEPFLHRIRHLYGTTKNRKTVPTLIVAAFAGLALIILFLKMGMSDQAGNEHAGSVKGTTSSLTQV
Ga0180093_112126113300015258SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIIRPSSVDRNTPSPIFS*
Ga0132257_10223484823300015373Arabidopsis RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRTVGISVRPHFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVY
Ga0132257_10224807823300015373Arabidopsis RhizosphereMLVKSGVACFQHCAYEKVARILESLRTFQRAERISVRPQFISLIEAMRQRVDFLLSQRKPLWRAFIQETREALSRQSHLRLLATRARDGVY
Ga0184620_1008384013300018051Groundwater SedimentMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQAGN
Ga0184621_1001784113300018054Groundwater SedimentMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAVLLQSEPFLHRIRHLYGTSSTRKTVPTLIVAAFAGLALIIVLAKMGMSNQAGNEHAGTVKGATPS
Ga0184621_1015718423300018054Groundwater SedimentMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRTDEALSRQSHLRLPA
Ga0184617_123603513300018066Groundwater SedimentMLVNPGVARFQHYAHETVAQIQDSLRTSQRVVGLSRRPQFISLIEAMRQRVDFLLSQGKPLWHALRQRTDEAPSRQSHLRLLATRARDGVYAVRLQSEPFLH
Ga0184635_1004910233300018072Groundwater SedimentMLVNSGVACFQNCAYETVAQILESIRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQAGNEHAGTVKGATPKCDPSPRLDNFRSRIR
Ga0184639_1007330413300018082Groundwater SedimentMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADLLLLQGKPLWRGSLQRTGEALSRQSHLKLLATRARDGVYAI
Ga0184628_1008923743300018083Groundwater SedimentMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEAL
Ga0190270_1136400513300018469SoilMLVNSGVACFQNCAYETVAQILESIRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQRKPLWRAFIQETREALSRQSHLKEPLINLRIMAHFAF
Ga0190270_1142877223300018469SoilMLVNSGVARFQNCAHETLAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILPPVWH
Ga0190270_1176866713300018469SoilMLVNSGVASFQNYANGTVAQIQESLRTFHRVLGFSHQTQLISPIEAMRLRVDFLLSLGKPQWRALRQRTGEALSHQMHLRLLATRARDGAHAILLQSEPFLHRIRSLYVTTKIRKTL
Ga0190270_1253385113300018469SoilMLRQLRERDDSKSGRSAFSTFQHHAHETVVQIQESLRAFQRVVGLSGRHQFVSFIEAMRQRVDFLLSQGKPLLRALRERLDEALSRQSHLRLLATRALDGVYAILLQSE
Ga0190274_1262908413300018476SoilMLVNSGVARFQNCAHEMVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQRKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILPPVWHHKDS
Ga0190274_1296481513300018476SoilMIVNPGVARFQHHAHETVVQIQESLRAFQRVVGLSGRHQFVSFIEAMRQRVDFLLSQGKSLWRALRQRTDEALSRQTHLRLLATRALDGVYAILLQSEPFLHRIRHLYGTPSSSKAVPTLIVAAFVGLALTILMLLKMELGLIFPPS
Ga0190271_1042941533300018481SoilMLVNSGVACFQNCAYETVAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRIL
Ga0193704_103422513300019867SoilMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQAGNEHAGTVKGATPSVIQ
Ga0193700_102669513300019873SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSPAHALRQRTDEALSRQSHLRLLATRARDGVYAILLQSGIRDLYGTTKNRKTLLTLIVAAFAGLALIILLLKMGMSDQ
Ga0193701_108111413300019875SoilMLVNPGVARFQHYAHETVAQIQDSLRTSQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSPAHALRQRTDEALSRQSHLRLPATRARDGVYAILLQSEPFLYRIRDLYGTTKNRKTLLTLIVAAFAGLALIILFLKMGMSDQAGNEHAGSVKGTTLSLTQV
Ga0193731_103971523300020001SoilMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRIDEALSRQSHLRLLATRARDGVYAIL
Ga0210380_1021759523300021082Groundwater SedimentMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQAGSEHAGSRNVTTSG
Ga0193719_1023582313300021344SoilMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRTDEALSRQSHLRLLATRARDGVYAILLRSEPFLYRIRDLYGTTKNRKTLLTLIVAAFAGLALIILFL
Ga0222622_1134623113300022756Groundwater SedimentASPQRRGCNDSLDSAMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQAGNEHAGTVKGATPSVIQ
Ga0247789_113318613300023266SoilMLVNSGVACFQNCAYETAAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRTRDGVCAIRLQSEPFLHRIRHLYATLSTRKTVPTMIVAAFA
Ga0207680_1070530613300025903Switchgrass RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGITKARKTVPTLIVAAFAGLALIIVLAKMGIRNQAG
Ga0207652_1175151313300025921Corn RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQSEPLQHRILHLYGITKTRKTVPTLIVAAFAGLALMGT
Ga0207709_1063354423300025935Miscanthus RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQQRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSTQAG
Ga0207669_1068800213300025937Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYLYGTTKNRKTLLTLIVPAAFAGLALIILFLKMGMSDQADSEHAGSRNVTTSGRTQ
Ga0207712_1194023313300025961Switchgrass RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAIL
Ga0207639_1016604213300026041Corn RhizosphereMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALFRQSHLRLLATRARDGV
Ga0207708_1000550213300026075Corn, Switchgrass And Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHRILYRYGTTKNRKTLLT
Ga0207648_1068751423300026089Miscanthus RhizosphereMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQREPLLHR
Ga0207683_1022357133300026121Miscanthus RhizosphereMLVNSGVACFQNCAYETAAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHKILHLYGTTKTRKTVPTL
Ga0257146_103738113300026374SoilMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRTDEALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVP
Ga0179593_100483963300026555Vadose Zone SoilMLVNPGVARLQHYAHETVAQIQESLRAFQRVVGLSRRPQFITLIEAMRQRVDFLLSQGKPLWHALRQRTDEALSRQSHLRLLAPRARDGVYAVLLQSETVSA
Ga0207981_108646423300027560SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQS
Ga0307291_108326413300028707SoilMLVNPGVARFQHYAHETVAQIQESLRTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSLWHALRQRIDEALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTL
Ga0307295_1021224613300028708SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALCQRADEAVSRQS
Ga0307285_1019673013300028712SoilMLVNSGVACFQNCAYETVAQILESLRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALS
Ga0307313_1019225023300028715SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALRQRADEAVSRQSHLRLLATLARDDVYAILLQSEPFLHRIRHLYGTTK
Ga0307298_1008588423300028717SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSPAHALRQRTDEALSRQSHLRLPATRARDGVYAILLQSEPFLYRI
Ga0307317_1022840213300028720SoilMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQAEPLLRTSRQVFGEFLSRQPHLKLLAARMRHRLDTVLLQSEPLLNKLRYLYGAPSSRKTVV
Ga0307319_1031194613300028722SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRIVGLSRRPQFISLIDAMRHRVDFLLSQAEPLLRTSRQVFGEFLSRQSHLKLLAARMRHRLDTVLLQSEPLLNKLRYLYGAPSSRKTVVMLIAAAIAGLTLIVLLVEMGMSNQM
Ga0307316_1037447813300028755SoilMLVNSGVACFQNCAYETVAQILESIRTFQRAVRISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAIL
Ga0307288_1011554223300028778SoilMLVNPGVARFQHYAHETVAQIQDSLRTSQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALCQRADEAVSRQSHLRLLATLARDDVYAILLQSEPFLHRIRHLYGTTKNRRTLLTLIVAAFAGLALIILLLKIGMSDQAGNEHAGSVKGTTSSLTQVPDWTIFDDAFA
Ga0307282_1042704113300028784SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALRQRADEAVSRQSHLRLLATLARDDVYAILLQSEPFLHRIRHLYGTTKNRKTLLTLIVAAFAGLALIILLLKMGMSDQAGNEHAGSVKGTTSSLTQ
Ga0307282_1048153413300028784SoilMLVNPGVARFQHYAHETVAQIQESLHTFQRVVGLSRRPQFISLIDAMRYRVDFLLSQGKSPAHALRQRTDEALSRQSHLRLPATRARDGVYAILLQSEPFLYRIRDLYGTTKNRKTLLTLIVAAFAGLALIILFLKMGMS
Ga0307323_1007539813300028787SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALRQRADEAVSRQSHLRLLATLARDDVYAILLQSEPFLHRIRHLYGTTKNRKTLLTLIVAAFAGLALIILLLKMGMSDQAGNEHAGSVKEPLR
Ga0307290_1026323413300028791SoilMLVNSGVARFQNCAHETVAQILKSLRTFQRAVGISVRPQFLSLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLL
Ga0307299_1002865643300028793SoilMLVNSGVARFQNCAHETVAQILESLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQAGNEHAGTVKGATPSVIQVPDWT
Ga0307287_1036498313300028796SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATQARDGVYAILLQREPLLHEILYLYGTTKNRKTLLTLIVPAAFAGLALIILF
Ga0307284_1017362413300028799SoilMLVNPGVARFQHYAHETVAQIQDSLRTSQRVVGLSRRPQFISLIDAMRYRVDFLLSQAEPLLRTSRQVFGEFLSRQSHLKLLAARMRHRLDTV
Ga0307305_1014088413300028807SoilMLVNPGVARFQHYAHETVAQIQDSLRTSQRVVGLSRRPQFISLIDAMRHRVDFLLSQAEPLLRTSRQVFGEFLSRQSHLKLLAARMRHRLDTVLLQSEPLLNKLRYLYGAPSSRK
Ga0307305_1027019713300028807SoilMLVNSGVARFQNCAHETVAQILKSLRTFQRAVGISVRPQFISLIEAMRQRVDFLLSQGKPLWRAFIQKTREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILHLYGTTKTRKTVPTLIVAAFAG
Ga0307294_1028436513300028810SoilMLVNPGVARFQHYAHETVAQIQESLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALRQRADEAVSRQSHLRLLAT
Ga0307296_1061165613300028819SoilMLVNPGVARLQHYAHETVAQIQESLRAFQRVVDLSRRPQFISLIEAMRQRVDFLLSQGKPLWHALRQRTDEALSRQRHLRLLATRARDGVYAVLLQSEPFLHRIRHLYGTSSTRKTVPTLIVAALAGLALIIVLAKMGISDQAGDERAGTVKGATSSQAPAPDW
Ga0307312_1001854113300028828SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALRQRADEAVSRQSHLRLLATLARDDVYAILLQSEPFLHRIRHLYGTTKNRKTLLTLIVAAFAGLALIILLLKMGMSDQAGNEHAGSVKGTTSSLTQVPDWTIFDD
Ga0307286_1022100213300028876SoilMLVNPGVARFQHYAHETVAQIQDSLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALRQRADEAVSRQSHLRLLATLARDDVYAILLQSEPFLHRIRHLYGTTKNRKTLLTLIVAAFAGLALIILLLKMGMSDQAGNEHAGSVKGTTSSLTQVPDWTIFDDAFANQP
Ga0307300_1028690913300028880SoilMLVNPGVARFQHYAHETVAQIQESLRTFQRVVGLSRRPQFISLIDAMRHRVDFLLSQGKLLWHALCQRADEAVSRQSHLRLLATLARDDVYAILLQSEPFLHRIRHLYGTTKNRKTLLTLIVAAFAGLALIILLLKMGMSDQAGNEHAGLVKGTTSSLTQVP
Ga0307499_1023032113300031184SoilMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLATRARDGVYAILLQRGPLLH
Ga0310900_1016196523300031908SoilMLVNSGVARFQNCAHETVAQILESLRTFQRTVGISVRPHFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRILPPVWHHKDS
Ga0310885_1055289513300031943SoilMLVNSGVARFQNCAHETVAQILESLRTFQRTVGISVRPHFISLIEAMRQRVDFLLSQGKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQHRI
Ga0310895_1044469213300032122SoilMLVKSGVACFQHCAYEKVARILESLRTFQRAERISVRPQFISLIEAMRQRVDFLLSQLKPLWRAFIQETREALSRQSHLRLLATRARDGVYAILLQSEPLQQRILHLYGTTKTRKTVPTLIVAAFAGLALIIVLAKMGMSNQEGNEHAGTVKGATPSVIQVPDWTIFDH
Ga0364923_0024841_1119_13583300034690SedimentMLVNPGVAGFKNYAHETVAQILESLRAFQRALGISISTIEAMLRQRADFLLLQGKPLWRGFLQRTGEALSRQSHLKLLAT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.