NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F093918


Overview

Basic Information
Family ID F093918
Family Type Metagenome / Metatranscriptome
Number of Sequences 106
Average Sequence Length 171 residues
Representative Sequence MSAKNRPYEQFGPFILFKKLEADALGDLWRAGRVDGGQLGPVMAVRRLSGGSRAALTESAAEASQLVPLLSGTTFARDQTIDVVNGVPFVAHEYAGGRSLRHIVDRARGGAGITPNPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRV
Number of Associated Samples 98
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.94 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.57


Most Common Taxonomy
Group Unclassified (99.057 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(26.415 % of family members)
Environment Ontology (ENVO) Unclassified
(37.736 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.943 % of family members)
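
The percentages in this overview are fractions of the family's 106 sequences, so each one should correspond to a whole number of members. A minimal Python sketch of that conversion follows. The function name `members_from_percent` is hypothetical, and the sketch assumes that every "% of family members" value is computed over all 106 sequences.

```python
FAMILY_SIZE = 106  # "Number of Sequences" reported for family F093918


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value into a member count.

    Assumption: the percentage is taken over `family_size` sequences.
    """
    return round(percent / 100 * family_size)


# Percentages copied from the overview above.
reported = {
    "Most common taxonomy group (Unclassified)": 99.057,
    "GOLD ecosystem (Grasslands soil)": 26.415,
    "ENVO (Unclassified)": 37.736,
    "EMPO (Soil, non-saline)": 50.943,
}

for label, pct in reported.items():
    print(f"{label}: {pct} % is about {members_from_percent(pct)} of {FAMILY_SIZE}")
```

Under that assumption, 99.057 % corresponds to 105 members, 26.415 % to 28, 37.736 % to 40, and 50.943 % to 54.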




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 22.05%, β-sheet: 27.69%, Coil/Unstructured: 50.26%

Predicted 3D Structure

pTM-score: 0.57

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
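
The low-quality flag in the note above is a simple threshold on the pTM-score. As a rough Python sketch of that rule: the function name `model_confidence` and the "high" label for models at or above the cutoff are assumptions made for illustration, not the database's own terminology.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the note above (pTM < 0.7 means low confidence)


def model_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> str:
    """Label a predicted 3D model by its pTM-score.

    Assumption: anything at or above the cutoff is called "high" here.
    """
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return "low" if ptm_score < cutoff else "high"


# Family F093918 reports a pTM-score of 0.57.
print(model_confidence(0.57))  # prints "low"
```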



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 106 Family Scaffolds
PF02618 | YceG | 33.96
PF03652 | RuvX | 1.89

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 106 Family Scaffolds
COG1559 | Endolytic transglycosylase MltG, terminates peptidoglycan polymerization | Cell wall/membrane/envelope biogenesis [M] | 33.96
COG0816 | YqgF/RuvX protein, pre-16S rRNA maturation RNase/Holliday junction resolvase/anti-termination factor | Translation, ribosomal structure and biogenesis [J] | 1.89



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.06 %
All Organisms | root | All Organisms | 0.94 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300005178|Ga0066688_10009916 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4635




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 26.42%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.38%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.49%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 7.55%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 6.60%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.72%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 3.77%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.83%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.83%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.89%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.89%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.89%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.89%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.89%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.94%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.94%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.94%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 0.94%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 0.94%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.94%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.94%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.94%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.94%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere | 0.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.94%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.94%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.94%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated
3300005343 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG | Environmental
3300005364 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG | Host-Associated
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005530 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005577 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 | Host-Associated
3300005587 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 | Environmental
3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated
3300005834 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C1-2 | Host-Associated
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006358 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 | Host-Associated
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300010044 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot60 | Environmental
3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011436 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT642_2 | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300015164 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-4b, rock/ice/stream interface) | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021363 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3c2 | Environmental
3300022722 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2) | Environmental
3300023268 | Plant litter microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-L106-311C-6 | Environmental
3300024178 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35 | Environmental
3300024224 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK14 | Environmental
3300025321 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C1-2 (SPAdes) | Host-Associated
3300025909 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C1-3 metaG (SPAdes) | Host-Associated
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025921 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025928 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes) | Environmental
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated
3300025960 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes) | Host-Associated
3300026041 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes) | Host-Associated
3300026121 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG (SPAdes) | Host-Associated
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026319 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes) | Environmental
3300026322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes) | Environmental
3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026496 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-A | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300027548 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300028072 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK16 | Environmental
3300031047 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1B (Eukaryote Community Metatranscriptome) | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032074 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R1 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300033812 | Sediment microbial communities from East River floodplain, Colorado, United States - 65_j17 | Environmental
3300034164 | Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0066674_104034261 | 3300005166 | Soil | MASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEAHNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYAGSRLSHGALIPQFVWISDDGEIRVAG
Ga0066690_106349661 | 3300005177 | Soil | MSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDNGQLGSIMAVRRLLGGNRAALTASAGEAHNLVPLLAGTTFAKDQSIDVANGIPFVAYEYAGGRSLRHIVDRARGGNGVSPNPIPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQ
Ga0066688_100099166 | 3300005178 | Soil | MPGKTRPYEQFGPFILFKKLESDALGDLWRAARIDDRHLGPLVALRRLTGGNRETLTQSADDAREIAPLLSGTSFAKDQIIDVVNGVPSIAHAYSGGRSLRHIIDRARGGSGITPNPIPIDQAILIAEKVALSLATTADLRHGGVRLSHGALIPHFIWVSDDGEIRVAGQQLGKGLIASLKDSKIAGSIGRYFSPDYQN
Ga0068869_1004008791 | 3300005334 | Miscanthus Rhizosphere | MASKNRPYEQFGPFILFKRLESHALGDLWRAGRIDGTQLGRTVTVHRLTGGKRDAFVAAASEARALAPLLTGTTFAKEQVIDVIDGVPFIAHEYGGGRSLRHIVDRARGANGAAPNPVPLEQAIVIAEKVALSLTTTAD
Ga0070687_1006390461 | 3300005343 | Switchgrass Rhizosphere | MASKNRPYEQFGSFILFKKLESDALGDLWRAGRIDGTQLGRIVALRRLTGGRRDLFVSAAGEARALAPLLTGTTFAKDQVIDVMNGTPFIAHEYGGGRSLRHIVDRARGANGAAPNPIPIDQAIVIAEKVSLSMATTADLRYLGNRLTHGALVPQFIWIADDGEIRVAGQQLG
Ga0070673_1021027011 | 3300005364 | Switchgrass Rhizosphere | FVVLSYAAPFSNHVWGSVSMAAKNRPYEQFGPFILFKRLESDSLGDLWRAGTLDGAQLGKTVAVHRLTGGRREAFVAAAAEARALAPLLTGTTFAKDQTIDVVNGTPFVAHEYGGGRSLRHIVDRARGANGASPNPIPLEQAIVIAEKVALSLATTADLRYLGNRLSHGALIPQFIWI
Ga0070705_1012053441 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MASKNRPYEQFGSFILFKKLESDALGDLWRAGRIDGTQLGRIVALRRLTGGRRDLFVSAAGEARALAPLLTGTTFAKDQVIDVMNGTPFIAHEYGGGRSLRHIVDRARGANGAAPNPIPIDQAIVIAEKVSLSMATTADLRYLGNRLTHGALVPQFIWIADDGEIRVA
Ga0066689_106405071 | 3300005447 | Soil | MASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEAHNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYAGSRLSHGALIPQFVWISDDGEIRVAGQHLGKGMVASLKDAKVAAEIARYFSPECQSSGDAAKASD
Ga0066681_103400862 | 3300005451 | Soil | MASKNRPYEQFGSFILFKRLEADALGDLWRAGRIEGTQLGHTVALRRRTGGKRDAFAAAASEARALAPLLTGTTFAKEQTIDVLNGTPFIAYEYGGGRSLRYIVDRARGANGTPANPIPLDQAIVIAEKVSLSLATTADLRYQGNRLTHGALIPQLIWIADDGEIRVAGQQLGKGLIASLKDSKVAANVGRYFSPEYQHSGEPTQ
Ga0066687_105462321 | 3300005454 | Soil | MASKNRPYEQFGSFILFKRLESDALGDLWRAGRIDGAQLGRTVALRRLLGGKREAFAAAAGEARTLAPLLTGTTFAKEQTIDVLGGVPFIAHEYGGGRSLRHVVDRARGISGTVPNPVPLDQAIVIAEKVALSLATTADLRYMGNRLVHGALIPQFIWIADDGEIRVAGQQLGKAMIASLAEPKFHAEFGRYFA
Ga0070706_1020135821 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | ILFKKLESDALGDLWRAARIDDRHLGPLVAVHRLSGGNREALVAAAADARATVPLLSGTSFVKDQMIDVANGVPYVAHEYGGGRSLRHIVDRAHGAAGITPNPIPIDQAVLIAEKVALSLATTADLRHGGQRLSHGALIPQFIWISDDGEIRVAGQQLGKGLIASLSDAKVAAS
Ga0070706_1021545341 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | IEGNKLGPLVALRRLSGGNREALLEAATDAKAIVPVLTGTSFARNQSIDVVDGVPYIAHDYAGGRSLRHIVDRARGGTNSVANPIPLDQAIIVAEKVALSLATTGDLRYGDKRLTHGGLIPQFIWISDDGEIRVAGQQLGRGVAASLKDTRVAGEIARYFAPEYQAN
Ga0070741_116950121 | 3300005529 | Surface Soil | PGKTRPYEQFGPFILFKKLESDALGDLWRAARIDDRHLGPLVALRRLTGGNREALAQSAADAREIVPLLDGTSFVKEQMIDVINGVPFVAHEYGGGRSLRHIVDRAHGGNGITPNPIPIDQAVLIAEKVALSLATTADLRHGGVRLNHGALIPQFIWISDEGEIRVAGQQL
Ga0070679_1019006831 | 3300005530 | Corn Rhizosphere | MSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDGGQLGPIMAVRRLSGGNRAALVASAGEASQLVPLLSGTTFAKDQTIDVANGIPFVAHEYSGGRSLRHIVDRARGGNGVTANPIPIDQAIVIAEKIALSLSTTADLKYLGQR
Ga0066661_103096391 | 3300005554 | Soil | MASKNRPYEQFGSFILFKRLESDALGDLWRAGRIDGTQLGRTVALRRLLGGKREAFVAAAGEARALAPLLTGTTFAKDQTIDVLQDMPFIAHEYGGGRSLRHIVDRARGTSGSAPNPIPLDQAIIIAEKVALSMATTADLRYLGNRLAHGALIPQFIWIADDGEIRVAGQQL
Ga0066661_104847711 | 3300005554 | Soil | MASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEARNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYAGSRLSHGALIPQFVWISDDGEIRVAGQHLGKGMVASLKDAKVAAEIARYFSPECQSSGDAAKASDVYSM
Ga0066661_108016721 | 3300005554 | Soil | TPLSNQPGGLSMSAKNRPYEQFGTFILFKRMGGDALGDLWRAGRIDGTQVGATVAVHRLTGGNREAFVRAANEANALAPLLTGTTFAKEQVIDVVNGVPFIAHEYGGGRSLRHIVDRARGASAAAPNPIPLDQAMVIAEKVALSLATTADLRYLGNRLTHGALVPQFVWIAEDGEIRVAGQQL
Ga0066704_108769101 | 3300005557 | Soil | MPAKPYEQFGPFILFKKLESDALGDLWRAGRIEGDHLAGIVALRRLSGGNRDALVQGAGEARNVVSLLSGTSFVKNQVIDVIDGVPVIEHEYGGGRSLRHIVDRARGGAGLSPNPIPIEQAIVIAEKVALSLATTTELRYAGERLSHGALIPQFVWINN
Ga0066700_110928941 | 3300005559 | Soil | RPYEQFGPFILFKKLESDALGDLWRSARLEDRHLGSLVALRRLTGGNREALVESAADARAIVPLLTGTSFVKNQIIDVFHGVPFIAHEYGGGRSLRHIVDRSRGGNGITPNPIPIDQAILIAEKIALSLATTADLRHGGERLAHGALIPQFIWISDDGEIRVAGQQLGRGLIAS
Ga0066699_109620602 | 3300005561 | Soil | MASKNRPYEQFGSFILFKRLESDALGDLWRAGRIDGAQLGRTVALRRLLGGKREAFAAAAGEARTLAPLLTGTTFAKEQIIDVLGGVPFIAHEYGGGRSLRHVVDRARGISGTVPNPVPLDQAIVIAEKVALSLATTADLRYMGNRLVHGAL
Ga0066694_105830581 | 3300005574 | Soil | GSFILFKRLETDALGDLWRAGKVDGGQLAGTVALRRLSGGSREAIAQSAAGAREIVPMLHGTSFVRNQSIDVVDGVPYVAHELAGGRSLRYIVDRARGGTNHMPNPIPLDQAIVVAEKVALSLATTGDLKYGDKRLLHGALLPQFIWISDDGEIRVAGQQLGRGIIASLRDPR
Ga0066702_109554291 | 3300005575 | Soil | MSAKNRPYEQFGPFILFKKLETDALGDLWRAARVDNGQLGATMAVRRLLGGNRAALTASAGEAHNLVPLLAGTTFAKDQTIDVTNGIPFVAHDYAGGRSLRHIVDRARGGNGVTANPVPLDQAIVIAEKIALSLATTADLRYLGNRL
Ga0068857_1004999712 | 3300005577 | Corn Rhizosphere | MAAKNRPYEQFGSFILFKRLEGDSLGDVWRAGKIDGEQLGKTVAVHRLTGGKRDAFVACANEARALAPLLTGTTFAKEQVIDVVNGTPFIAHEYGGGRSLRHIVDRARGANGAAPNPIPLEQAVVIAEKVALSLATTADLRYMGNRLAHGGLVPQFIWIADDGEIRVAGQQLGKAMIASLAEPKFHLELARYFAPEYQTLGEPTKSSEVYSM
Ga0066654_103245681 | 3300005587 | Soil | MSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDNGQLGPAMAVRRLLGGNRAALTEAAGEAHNLVPLLAGTTFAKDQTIDVANGIPFVAHEYAGGRSLRHIVDRARGGNGVTANPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVGGQQLGKGLIASMSDTKVAADVGRYFAAEYRASGEISKTT
Ga0068859_1019982241 | 3300005617 | Switchgrass Rhizosphere | MAASKKPYEQFGPFILFKKLEQDALGDLWRAARVDGGQLSELLAVRRLSGGNREALVAAASAARDIVPLLNGTSFVKNQVIDVIGGVPFVAHEYANGRSLRHIVDRARGGGGAGSVPNPIPIDQAIVIAEKVALSLATTADLRYGGNRLSHGALIPQFIWISDDGEIR
Ga0068851_110797621 | 3300005834 | Corn Rhizosphere | GGPSSMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDNGQLGPTMAVRRLLGGNRAALTASAGEAHNLVPLLAGTTFAKDQTIDVANGIPFVAHEYAGGRSLRHIVDRARGGNGVTANPIPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGE
Ga0070717_115677481 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | GVRNMAGKGRPYEQFGSFILFKKFEADALGDLWRAGRSDGNAIGATVALRRLSGGNRQALLAAAAEAKAIVPMLTGTSFARNQIIDVIDGIPYISHDYSGGRSLRHIIDRARGGTNAMPNPMPLDQAIVIAEKVALSLATTNDLRYGEKRLTHGALIPQFIWISDDGEIRVAGQQLGRGMVASLNPEIARYFAPEYRT
Ga0066696_103135491 | 3300006032 | Soil | MASKSRPYEQFGSFILFKRMETDSLGDLWRAGRVDGGQLAGTVALRRLSGGAREAIAQSAAEAREIVPMLSGTSFARNQSVGTIDGIPYLAHEYAGGRSLRHIVDRARGGTNQMPNPIPLDQAIVVAEKVALSLAT
Ga0066696_109107071 | 3300006032 | Soil | MASKNRPYEQFGSFILFKRLESDALGDLWRAGRIDGAQLGRTVALRRLLGGKREAFAAAAGEARTLAPLLTGTTFAKEQTIAVLGGVPFIAHEYGGGRSLRHVVDRARGISGTVPNPVPLDQAIVIAEKVALSLATTADLRYMGNRLVHGALIPQFIWI
Ga0066656_108647121 | 3300006034 | Soil | GGPPSMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDNGQLGHTMAVRRLSGGNRAALTASAGEAHNLVPLLAGTTFAKDQTIDVANGIPFVAHEYSGGRSLRHIVDRARGGNGVSPNPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVGGQQLGKGLIASMSDTRVAA
Ga0066656_111044921 | 3300006034 | Soil | KNRPYEQFGSFILFKKLESDALGDLWRAGRIDGAQLGRIVAVRRLIGGKRDLFVSAAGEARALAPLLTGTTFAKDQVIDILNGTPFIAHEYGGGRSLRHIVDRTRGANGAAPNPIPLDQAIVIAEKVSLSLATTADLRYLGNRLTHGALVPQFIWIADDGEIRVAGQQL
Ga0068871_1020286791 | 3300006358 | Miscanthus Rhizosphere | MSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDNGQLGPTMAVRRLLGGNRAALTASAGEAHSLVPLLAGTTFAKDQTIDVANGIPFIAHEYAGGRSLRHIVDRARGGNGVTPNPIPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVGGQQIGKGLI
Ga0066653_104742841 | 3300006791 | Soil | MASKSRPYEQFGSFILFKRLETDALGDLWRAGKIDGGQLAGTVALRRLSGGSREAIVQSVAGAREIVPMLHGTSFVRSQSIDTVDGVPYIAYEFAGGRSLRYIVDRARGGTNHMPNPIPIDQAIVVAEKVALSLATTGDLKYGDKRLLHGALLPQFIWISDDGEIRVAGQQLGRGIIASLRDARVA
Ga0066665_104688282 | 3300006796 | Soil | MASKNRPYEQFGSFILFKRLETDALGDLWRAGKIDGGQLAGTVALRRLSGGSREAIVQSTAGAREIVPMLHGTSFVRNQSIDVVDGVPYIAYEFAGGRSLRYIVDRARGGTNHMPNPIPLDQAIVVAEKVALSLATTGDLKYGDKRLLHGALLPQFIRISDDGEIRVAGQQLGLGIIASLRDARVAGEISRYVAPDIRATGEPTKSSEVFSMGAIL
Ga0066659_101541411 | 3300006797 | Soil | MASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEAHNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYAGSRLSHGAL
Ga0066659_113912621 | 3300006797 | Soil | MAGKTRPYEQFASYILFKKLEADALGDLWRAARIDDRHLGPLVALRRLTGGNREAMIQSATDARAIAPMLNGTSFVKEQMIDVVSGVPVITHEYSGGRSLRHVIDRARGGAGVVANPVPLDQAILVAEKVALSLATLADLRYSGNRLMHGALIPQFIWIADDGEIRVAGQQ
Ga0066660_115515911 | 3300006800 | Soil | RVILPRTAQKPGGPSSMSAKNRPYEQFGSFILFKKLETDTLGDLWRAGRVDNGQLGPTMAVRRLLGGNRAALTASAGEAHNLVPLLAGTTFAKDQTIDVANGIPFVAYEYAGGRSLRHIVDRARGGNGVSPNPIPLDQAIVIAEKIALSLATTADLRYLGSRLAHGALIPQFVW
Ga0079221_111989282 | 3300006804 | Agricultural Soil | MPGKTRPYEQFGPFILFKKLESDALGDLFRAARIDDRHLGPLVALRRLTGGNREALLRSAEDAKPIVPLLTGTSFAKDQQVDIINGTPFVAHEYSGGRSLRHIIDRARGGNGITPNPIPLDQAILIAEKIALSLATTADLRHA
Ga0075425_1023639201 | 3300006854 | Populus Rhizosphere | GSYILFKKLESDALGDLWRAARIDDRHLGPLLAVRRLSGGNREALVQAASDAKAIVPMLTGSSFVKDQIVDVLNGIPYVAHEYAGGRSLRHIVDRAHGGTGITPNPIPIDQAILIAEKVALSLATTADQRYGGTRLSHGALIPQFIWISDDGEVRVAGQQLAKGIIASLKDPKVATAIGRYFSPELRTGSEPSQAS
Ga0075426_112500001 | 3300006903 | Populus Rhizosphere | MPGKTRPYEQFGPFILFKKLESDALGDLFRAARIDDRHLGPLVALRRLTGGNREALLRSAEDAKPIVPLLTGTSFAKDQQVDIINGTPFVAHEYSGGRSLRHIIDRARGGNGITPNPIPLDQAILIAEKIALSLATTADLRHAGTRLSHGGLIPQF
Ga0075424_1012506311 | 3300006904 | Populus Rhizosphere | MPSKNRPYEQFGSFILFKKLESDALGDLWRAGRIEGGQLGRTVALRRLMGGKRDAFVTAAGEARALAPLLTGTTFGKDQVIDVMGGVPFIAHEYGGGRSLRYIVDRARGANGAMPNPIPLDQAIVIAEKVALSLATTADLRYLGNRLAHGGLV
Ga0079219_111777001 | 3300006954 | Agricultural Soil | MPSKNRPYEQFGSFILFKRLESDALGDLWRAARIDGTQLGHTVALRRLLGGRRDAFVAAAGDARTLAPLLTGTTFAKEQTIDVLGGVPFIAHEYGGGRSLRHIVDRARGANGSTPNPIPLDQAIVIAERVALSLATTADLRYMGNRLAHGALIPQFIWIADDGEIRVAGQQ
Ga0099793_106072341 | 3300007258 | Vadose Zone Soil | TFILFKKLDTDSLGELWRAARLDGSTLSPMLALRRMSGGNRLAMTQSASEASQLVPLLSGTTFAKDQTIDVIDGVPFVAHEYSGGRSLRHIVDRARGDSGTTPNPVPLDQAIVIAEKIALSLATTADLRYLGNRLSHGALIPQFVWITDEGEIRVAGQQLGKGLIASMSDSKVAAELGRYFA
Ga0066710_1009428551 | 3300009012 | Grasslands Soil | MPAKTRPYEQFGPFILFKKLESDALGDFWRAGRIEDRHLGSLVAVRRLTGGNREALLESANDARAIVPLLAGTSFVKDQTIDVLNGVPYVAHEYGGGRSLRHIVDRARGGNGITPNPIPIDQAILIAEKIALSLATTADL
Ga0066709_1021539862 | 3300009137 | Grasslands Soil | MPGKTRPYEQFGPFILFKKLESDALGDLWRAARIDDRHLGPIVTLRRLTGGDRETLTQSADDAREIAPLLSGTSFAKDQIIDVVNGVPSIAHAYSGGRSLRHIIDRARGGSGITPNPIPIDQAILIAEKVALSLATTADLRHGGARLSHGALIPHFIWVSDDGEIRV
Ga0105243_120930561 | 3300009148 | Miscanthus Rhizosphere | MSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDSGQLGPIMAVRRLSGGNRAALLQSAAEASQLVPLLSGTTFAKDQTIDVVNGVPFVAHEYSGGRSLRHIVDRARGGNGITANPIPIDQAIVIAEKIALSLSTTADLKFLGNRLAHGALIPQFVWITDEGEIRVAGQQLG
Ga0126310_106041222 | 3300010044 | Serpentine Soil | MAASKKPYEQFGPFILFKKLEQDALGDLWRAARIDGGQLGDFVALRRLSGGNREALVTSANAAREIVPLLNGTSFVKNQVIDVVGSVPFIAHEYANGRSLRHIVDRARGGTGVLANPVPIDQAIVIAEKVALSLATTADLRYGGNRLSHGALIPQFVWISD
Ga0134064_102978332 | 3300010325 | Grasslands Soil | MASKSRPYEQFGSFILFKRLETDALGDLWRAGKVDGGQLAGTVALRRLSGGNREAIAQSAAGAREIVPMLHGTSFVRSQSIDVVDGVPYVAHELAGGRSLRYIVDRARGGTNHMPNPIPLDQAIVVAEKVALSLATTGDLKYGDKRLL
Ga0134121_127859451 | 3300010401 | Terrestrial Soil | MASKSRPYEQFGSFILFKRLETDALGDLWRAGKIDGGQLAGTVALRRLSGGNREAIAQSAAGAREIVPALSGASFVRAQAIDVVDGVPCISYEYAGGRSLRHIVDRARGGTNQMPHPIPLDQAIVVAEKVALSLATTADLKYGDKRLLHGAL
Ga0134123_109809702 | 3300010403 | Terrestrial Soil | MAASKKPYEQFGPFILFKKLEQDALGDLWRAARVDGGQLSELLAVRRLSGGNREALVASASSAREIVPLLNGTSFVKNQVIDVIGGVPFVAHEYANGRSLRHIVDRARGGAAVMPNPIPIDQAIVIAEKVALSLATTADLRYGGNRLSHGALIPQFIWISDDGEIRVAGQQLASGVIESLVDPKVASDVGRYF
Ga0137392_109615611 | 3300011269 | Vadose Zone Soil | MASRTRPYEQFGPFILFKKLEVDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEARNVAPLLTGTSFVKNQVIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKVSLSLETLGNIKRENVRLVHG
Ga0137393_117003791 | 3300011271 | Vadose Zone Soil | ILFKKLDTDSLGELWRAASLDGTTLSPMLALRRMSGGNRTAMTQSASEASQLVPLLSGTTFAKDQTIGVIDGIPFVAHEYSGGRSLRHIVDRARGDAGTTPNPVPLDQSIVIAEKIALSLATTADLRYLGNRLSHGALIPQFVWVTDEGEIRVAGQQLGKGIVASMADAKVA
Ga0137458_12761011 | 3300011436 | Soil | MAAKNRPYEQFGSFILFKRLEGDSLGDLWRAGRIDGAQLGKTVAVHRLTGGKREAFVACANDARALAPLLTGTTFAKEQIIDVVNGTPFIAHEYGGGRSLRHIVDRARGANGAAPNPIPLEQAIVIAEKVALSLATTADL
Ga0137364_107317041 | 3300012198 | Vadose Zone Soil | MASKNRPYEQFGSFILFKKLESDALGDLWRAGQIDGTQLGRTVAVRRLAGGKRDAFVAAAGEARALAPLLTGTTFAKEQVIDVISGTPFIAHEYGGGRSLRHIVDRARGANGAAPNPIPLDQAIVIAEKVSLSLATTADLRYLGNRLTHGALVPQFIWIADDGEIR
Ga0137372_105131412 | 3300012350 | Vadose Zone Soil | VPSKTRPYEQFGNYILFRKLEADPLSELWRAARIDNRHLGPLIALRRLIGGNREALIAAATDTRDMVPLLTGTSFVKEQAIDVISGIPFIAHAYAGGRSLRHIVDRARGGAGVIPNPIPIDQAIHIAEKIALSLTTLNELRYGGNRLSHGG
Ga0137369_107730571 | 3300012355 | Vadose Zone Soil | MASKSRPYEQFGSFILFKRLETDALGDLWRAGKIDGGQLAGTVALRRLTGGSREAIAQSAAGAREIVPMLHGTSFVRNQSIDVVDGVPYIAYEFAGGRSLRYIVDRARGGTNHLPNPIPIDQAIVVAEKVALSLATTGDLKY
Ga0137368_103572142 | 3300012358 | Vadose Zone Soil | MASKSRPYEQFGSFILFKRLETDALGDLWRAGKVDGGQLAGTVALRRLSGGSREAIAQSAAGAREIVPMLHGTSFVRNQSIEVVDGVPYIAYEFAGGRSLRYIVDRARGGTNHMPNPIPLDQAIVVAEKVALSLATTGDLKYGDK
Ga0137375_1139247213300012360Vadose Zone SoilMASKSRPYEQFGSFILFKRLETDALGDLWRAGKVDGGQLAGTVALRRLTGGSREAIAQSAAGAREIVPMLHGTSFVRNQSIEVVDGVPYIAYEFAGGRSLRYIVDRARGGTNHMPNPIPLDQAIVVAEKVALSLATTGDLKYGDK
Ga0137361_1044169913300012362Vadose Zone SoilMAAKNRPFEQFGSFILFKKLESDGLGDLWRAGRIDNGQLGSVLAVRRLSGGNREGFALSAATARQLVPQLTGTSFAKHQEIDAVNGVPYIAHEYGGGRSLRHIIDRARGGNGVLPNPLPIDQAIVIAEKVALSLSTTADLKMNDKRLAH
Ga0137410_1206306613300012944Vadose Zone SoilEQFGSFILFKKLETDALGDLWRAGRVDNGQLGPTMAVRRLLGGNRAALTASAGEAHNLVPLLAGTTFAKDQTIDIANGIPFVAHEYAGGRSLRHIIDRARGGNGVTPNPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVDGQQLGKGLV
Ga0126369_1233277413300012971Tropical Forest SoilMASKNRPYEQFGSFILFKKLESDALGDLWRAGRIEGTQLGRVVAVRRLTGGKRDAFLAAAGEAHALAPLLTGTTFAKDQTIDVLGGVPFIAHEYGGGRSLRHIVDRARGANGTAPNPIPLDQAIVIAEKVALSLATTADLRYMGNRLAHGALIPQFVWIADDGEIRVAGQQLGKAMIASLHDPKFHAEFGRYFA
Ga0134087_1073541613300012977Grasslands SoilMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDNGQLGPTMAVRRLLGGNRAALTAAAGEAHNLVPLLAGTTFAKDQTIDVANGIPFVAHEYAGGRSLRHIVDRARGGNGVTANPVPLDQAIVIAEKIALSLATTADLKYLGNRLAHGALIPQFVWITDEGEIRVGGQ
Ga0134075_1032660723300014154Grasslands SoilMASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEARNVAPLLTGTSFVKNQGIDVVESVPVIWHDYGGGRSLRHIVDRARGGSAVSPHPIPIDQAIVIAEKIALSLATTVELRYAGSRLSHGALIPQFVWISDDGEIRVAGQQLGKGM
Ga0167652_105400613300015164Glacier Forefield SoilMAPKAKPYEQFGPYILFKRLEADALGDLWRAVRIENNQLGSLVALRRLTGGDRAALLQSASEARAIVPLLKGTTFVKGQVIDSVDATPFLAFDYPGGRSLRHIVDRARGGAGVSPNPIPLDQAVLIAERVALSLATTADMRYGNDRLTHGAVLPQFIWITDDGEVRLAGQQLGK
Ga0184629_1066832513300018084Groundwater SedimentRRFSNPIGGPAMAAKNRPYEQFGSFILFKRLEGDSLGDLWRAGRIDGAQLGKTVAVHRLTGGKREAFVACANDARALAPLLTGTTFAKEQIIDVVNGTPFIAHEYGGGRSLRHIVDRARGANGAAPNPIPLEQAIVIAEKVALSLATTADLRYMGNRLAHGGLVPQFIWIADDG
Ga0066655_1059810813300018431Grasslands SoilMASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEARNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYAGSRL
Ga0066667_1224578713300018433Grasslands SoilPYEQFGSFILFKKLETDALGDLWRAGRVDNGQLGPTMAVRRLLGGNRAALTAAAGEAHNLVPLLAGTTFAKDQTIDIANGIPFVAHEYAGGRSLRHIVDRARGGNGVTANPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVGGQQLGKG
Ga0066662_1122982213300018468Grasslands SoilMPSRTRPYEQFGPFILFKKLETDALGDLWRAGRIDGDKLAGLVALRRLTGGNREALSQSAAEARSVAPLLIGTSFVKNQVIDVVESVPVISHDYGGGRSLRHIVDRARGGAGVSPNPIPIDQAIVIAEKIALSLATTAELRYSGSRLWHGALIPQF
Ga0066669_1066504513300018482Grasslands SoilMAAKNKPYEQFGPYILFKKLESDPLSELWRAARIENGQVGTLVAVRRLTGGDRAALSASANGAHDLVPHFTGVTFAKHQVIGVQNDVPFIAHDYAGGRSLRHIVNRARGAQGVTPNPVPLDQAIGIAEKISLSLATLADLRNSAGTRLAHGALIPQFIWVDDGGEIRRS
Ga0210404_1055018713300021088SoilMAAKNRPYEQFGTFILFKKLETDSLGDLWRAGRIDGTQLGPTMAVRRLSGGNRAAITQSATEASQLVPLLSGTTFAKDQTIDVVNGTPFVAHEYSGGRSLRHIVDRARGDGGVTPNPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVAGQQLGKGLIASM
Ga0193699_1004913823300021363SoilMAAKNRPYEQFGTFILFKKLDTDSLGDLWRAAHVEGSALGPMLALRRMSGGNRAAMTQAASEANQLVPLLSGTTFAKEQMIDIANGIPFVAHEYAGGRSLRHIVDRARGDGGTTPNPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVSGQQLGKGLIASFSDAKVASELG
Ga0242657_121869813300022722SoilLGDLWRAGRVDGGQLGPIMAVRRLIGGNRAALVQSAADANQLVPLLSGTTFAKDQTIDVANGVPFVAHEYLGGRSLRHIVDRARGGAGMTPNPIPLDQAIVVAEKIALSLATTADLRYLGNRLAHGALIPQFIWITDEGEIRVAGQQLGKGLIASLQDAKVAGELGRYFAAEYRASG
Ga0247765_100978733300023268Plant LitterMAAKSRPYEQFGPYILFKKLEADALGDLWRAARIDGTQLGPLVAVRRLTGGDREALTAAAMTARDLVPQMSGTSFARNQVIDVMNGVPFIAHDYAGGRSLRHIVDRARGGPGITPNPIPIEQAIVIAERVALSLDTLGNMRSGQTKLTHGALIPQFVWITDDGEIRVAGQMLGPGLIASMKKDAKVGAELARYFPPEYRESGESQKTSVVYGAGA
Ga0247694_101245123300024178SoilMSAKNRPYEQFGPFILFKKFESDALGDLWRAGRVDGGQLGPIVAVRRLSGGNRAALVQSAAEANQLVPLLSGTTFAKDQTIDVANGIPYVAHEYSGGRSLRHIVDRARGGAGVTANPVPIDQAIVIAEKIALSLSTTADLRYLGNRLAHGALIPQFVWITDEGEIRVAGQQLGKGLIA
Ga0247673_106180013300024224SoilFILFKKFESDALGDLWRAGRVDGGQLGPIMAVRRLSGGNRAALVQSAAEANQLVPLLSGTTFAKDQTIDVANGIPYVAHEYSGGRSLRHIVDRARGGAGVTANPVPIDQAIVIAEKIALSLSTTADLRYLGNRLAHGALIPQFVWITDEGEIRVAGQQLGKGLIASMSDSKVAAELGRYF
Ga0207656_1006514823300025321Corn RhizosphereMAAKNKPYEQFGPYILFKKLESDALSELWRAARIENGQLGPLVALRRFTGGNRDAMVAAAEHAKSIVGGLSGTSFAKSQLVDVINGTPFVAHEYSGGRSLRHIVDRARGGNGITPNPIPLDQAVAIAEKVALSLATTADLRFGGDRLAHGALIPQFVWITDDGEIRVAGQQLGSGVVASLKDARVTSD
Ga0207705_1010116613300025909Corn RhizosphereMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDAGQLGPIMAVRRLSGGNRAAMLAAAATASQLVPLLSGTTFAKDQTIDVVNGIPFVAHEYSGGRSLRHIVDRARGGNGITANPIPIDQAMVIAEKIALSLSTTADLKYLGN
Ga0207684_1127731913300025910Corn, Switchgrass And Miscanthus RhizosphereMASRTRPYEQFGPFILFKKLEVDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEARNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYAGSRLSHGALIPQFVWISDDGEIRVAGQHLGKGMVASLRDAKVAAEIAR
Ga0207684_1131869913300025910Corn, Switchgrass And Miscanthus RhizosphereKGRPYEQFGSFLLFKKLESDALGDLWRAGRIEGNQLGPLVALRRLSGGNREALLEAATDAKAIVPMLTGTSFARNQSIDVIDGIPYIAHDYAGGRSLRHIVDRARGGTNSVANPIPLDQAIIVAEKVALSLATTGDLRYGDKRLTHGGLIPQFIWISDDGEIRVAGQQLGRGVAASLKDTRVAGEIARYFAPEYQAN
Ga0207652_1110137313300025921Corn RhizosphereMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDGGQLGPIMAVRRLSGGNRAALVASAGEASQLVPLLSGTTFAKDQTIDVANGIPFVAHEYSGGRSLRHIVDRARGGNGVTANPIPIDQAIVIAEKIALSLSTTADLKYLGQRLAHGALIPQFVWITDEGEIRVAGQQLGKGLIGSFSDSKVA
Ga0207646_1089180823300025922Corn, Switchgrass And Miscanthus RhizosphereMASRTKPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLAGGNREALAQNAAEARNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYAGSRLSHGALIPQFVWISDDGEIRVAGQHLGKGMVASLKDAKVAA
Ga0207646_1124473613300025922Corn, Switchgrass And Miscanthus RhizosphereMASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNRGALAQNAVEARNVVPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSSVSPNPIPIDQAIVIAEKIALSLATTVELRYAGS
Ga0207700_1102638013300025928Corn, Switchgrass And Miscanthus RhizosphereMPGKTRPYEQFGPFILFKKLESDALGDLFRAARIDDRHLGPLVALRRLTGGNREALVRSATDAKAIVPLLTGTSFAKDQQIDVINGTPFVAHEYGSGRSLRHVIDRARGGNGITPNPIPLDQAILIAEKIALSLATTAD
Ga0207711_1202759113300025941Switchgrass RhizosphereMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDGGQLGPIMAVRRLSGGNRATLLQSAATASQLVPLLSGTTFAKDQTIDVANGIPFVAHEYSGGRSLRHIVDRARGGNGVTANPIPIDQAIVIAEKVALSLSTTAD
Ga0207651_1184559713300025960Switchgrass RhizosphereNNNSLRGILRATFPNKFLGVRSMASKNRPYEQFGPFILFKRLESDSLGDLWRAGTLDGAQLGKTVAVHRLTGGRREAFVAAAAEARALAPLLTGTTFAKDQTIDVVNGTPFVAHEYGGGRSLRHIVDRARGANGASPNPIPLEQAIVIAEKVALSLATTADLRYLGNRLSHGALIPQFIW
Ga0207639_1100827713300026041Corn RhizosphereMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDAGQLGPIMAVRRLSGGNRAAMLAAAATASQLVPLLSGTTFAKDQTIDVVNGIPFVAHEYSGGRSLRHIVDRARGGNGITANPIPIDQAMVIAEKIALSLSTTADLKYLGNRLSHGALIPQFVWI
Ga0207683_1201373913300026121Miscanthus RhizosphereRPYEQFGPFILFKRLESHALGDLWRAGRIDGTQLGRTVTVHRLTGGKRDAFVAAASEARALAPLLTGTTFAKEQVIDVIDGVPFIAHEYGGGRSLRHIVDRARGANGAAPNPVPLEQAIVIAEKVALSLATTADLRYLGNRLAHGALVPQFIWIADDGEIRVAGQQLGKAMLASL
Ga0209237_120248013300026297Grasslands SoilGMCRCTRCNGTGTKKRRRLHRNKCTGGGILLSIQKDFRSESMPGKTRPYEQFGPFILFKKLESDALGDLWRAARIDDRHLGPLVALRRLTGGNREALTQSADDARAIAPLLSGTSFAKDQIIDVVNGVPYIAHAYSGGRSLRHIIDRARGGSGITPNPIPIDQAILIAEKVALSLATTADLRHGGARLSHGALIPHFIWVSDDGEIRVAGQ
Ga0209647_114896323300026319Grasslands SoilMAAKNRPFEQFGSFILFKKLESDGLGDLWRAGRIDNGQLGSVLAVRRLSGGNREGFALSAATARQLVPQLTGTSFAKHQEIDAVNGIPYISHEYSGGRSLRHILDRARGGTGVAPNPLPIDQAIVIAEKVALSLSTTSDLKMNDKRLAHGALVPQFIWITDDGEIRVAGQQLGGGFLASMNDATFAGQFGRYFSPEYRTSGEPTKGSEVFSM
Ga0209687_116811113300026322SoilMASKNRPYEQFGSFILFKRLESDALGDLWRAGRIDGAQLGRTVALRRLLGGKREAFAAAAGEARTLAPLLTGTTFAKEQTIDVLGGVPFIAHEYGGGRSLRHVVDRARGISGTVPNPVPLDQAIVIAEKVALSLATTADLRYMGNRLVHGALIPQFIWIADDGEIRVAGQQLGKAMIASLAEPKFHAEFGRYFAPE
Ga0209152_1038250113300026325SoilMPAKPYQQFGPFILFKKLESDSLGDLWRAGRIDGDHLAGVVALRRLSGGNRDALVQSANEARNVAPILSGTSFVKDQVIDVVDGVPVIAHEYGGGRSLRHVVDRARGGSGVSPNPIPIDQAIVIAEKVALSLATTTELRYAGNRLSH
Ga0209803_113321013300026332SoilMASRTRPYEQFGPFILFKKLETDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEAHNVAPLLTGTSFVKNQAIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYA
Ga0257157_105548513300026496SoilVPSKTRPYDQFGNYILFKKLEADPLSELWRAARIDDRHLGPLVALRRLIGGDRDALVRAATDARDMIPLLSGSSFVKQQLIDVINGTPYIAHEYSGGRSLRHIVDRAHGGAGVTPNPVPIDQAIHIAEKIALSLTTLNDLRYGGNRLSHGALI
Ga0209056_1043670313300026538SoilMPGKTRPYEQFGPFILFKKLESDALGDLWRAARIDDRHLGPLVALRRLTGGNRETLTQSADDAREIAPLLSGTSFAKDQIIDVVNGVPSIAHAYSGGRSLRHIIDRARGGSGITPNPIPIDQAILIAEKVALSLATTADLRHGGVRLSHGALIPHFIWVSDDGEI
Ga0209523_111382413300027548Forest SoilMSAKNRPYEQFGPFILFKKLEADALGDLWRAGRVDGGQLGPVMAVRRLSGGSRAALTESAAEASQLVPLLSGTTFARDQTIDVVNGVPFVAHEYAGGRSLRHIVDRARGGAGITPNPVPLDQAIVIAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRV
Ga0209283_1035444623300027875Vadose Zone SoilMASRTRPYEQFGPFILFKKLEVDALSDLWRAGRIDGDKLAGLVALRRLTGGNREALAQNAAEARNVAPLLTGTSFVKNQVIDVVESVPVIWHDYGGGRSLRHIVDRARGGSGVSPNPIPIDQAIVIAEKIALSLATTVELRYSGSRLSHGALIPQFVWISDDGEIRVAGQHLGKGMVA
Ga0247675_104635513300028072SoilMSAKNRPYEQFGPFILFKKFESDALGDLWRAGRVDGGQLGPIMAVRRLSGGNRAALVQSAAEANQLVPLLSGTTFAKDQTIDVANGIPYVAHEYSGGRSLRHIVDRARGGAGVTANPVPIDQAIVIAEKIALSLSTTADLRYLGNRLAHGALIPQFVWITDEGEIRVA
Ga0073995_1092720513300031047SoilALGDLWRAGRVDNGQLGHTMAVRRLLGGNRAALTASAGEAHNLVPLLAGTTFAKDQTIDIANGIPFVAHEYAGGRSLRHIVDRARGGNGVSPNPIPLDQAIVIAEKIALSLSTTADLRYLGNRLAHGALIPQFVWITDEGEIRVGGQALGKGLIASMSDTKVASELGRYFAAEYRASAEISKTTDVYS
Ga0307468_10104844313300031740Hardwood Forest SoilMASKNRPYEQFGSFILFKKLESDALGDLWRAGKIDGSQLGHTVAVRRLIGGKRDVFVSAASEARALAPLLTGTTFAKEQVIDAIGGIPFIAHEYGGGRSLRYIVDRARGAGGTAPNPIPIDQAIVIAEKVALSMATTADLRYMGNRLTHGALIP
Ga0307473_1104048313300031820Hardwood Forest SoilMPAKPYEQFGPFILFKKLESDALGDLWRAGRIEGDHLAGVVALRRLSGGNRDALVQSASEARNVAPLLSGTSFVKNQVIDVVDGIPVVAHEYGGGRSLRHIVDRARGGAGVSPNPIPIEQAIVIGEKVALSLATTTELRYAGKRLSHGALIPQFVWINNDGEI
Ga0307479_1106053623300031962Hardwood Forest SoilMPGKTRPYEQFGPFILFKKLESDALGDLWRAARIDDRHLGPLVAVRRLSGGNREALVQAAADAREIVPMLSGSSFVKDQMIDVANGVPYVAHEYGGGRSLRHVVDRAHGAAGITPNPIPIDQAIHIAERVALSLATTADLRHGGARLSHGALIPQFIWISDDGEIRVAGQQLGKGL
Ga0308173_1199669413300032074SoilPRTAQKSGGPSSMSAKNRPYEQFGSFILFKKLETDALGDLWRAGRVDNAQLGPTMAVRRLLGGNRAALTAAAGEAHNLVPLLAGTTFAKDQTIDVANGIPFVAHEYAGGRSLRHIVDRARGGNGVTANPVPLDQAIVIAEKVALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVG
Ga0307471_10001839363300032180Hardwood Forest SoilMSAKNRPYEQFGPFILFKKLETDALGDLWRAGRVDGGQLGPIMAVRRMSGGNRAALLQSAADANQLVPLLSGTTFAKDQTIDVANGIPFVAHEYSGGRSLRHIVDRARGGAGITPNPVPLDQAIVVAEKIALSLATTADLRYLGNRLAHGALIPQFVWITDEGEIRVAGQQLGKGLVASLHDAKVGGEL
Ga0307472_10086986813300032205Hardwood Forest SoilDLWRAGRVDGGQLGPIMAVRRLSGGNRAALVASAAEASQLVPLLSGTTFAKDQTIDVVGGIPLVAHEYSGGRSLRHIVDRARGGNGVTANPIPIDQAIVIAEKIALSLSTTADLKYLGNRLAHGALIPQFVWITDEGEIRVAGQQLGKGLIGSFSDSKVASELGRYFAAEYRAGAEISKTTDVYSSPATSRPMQ
Ga0364926_127454_2_5353300033812SedimentMAAKNRPYEQFGSFILFKRLESDALRDLWRAGRIDGTQLGKTVAIHRLAGGNRGAFVACANEAGALAPLLTGTTFAKEQVIGVANGTPFIAHEYGGGRSLRHIVDRARGMNGSAPNPIPLDQALVIAEKVALSLATTADLRYMGNRLAHGGLVPQFIWIADDGEIRVAGQQLGKAMIA
Ga0364940_0051656_570_11063300034164SedimentMAARNRPYEQFGPYILFKKLESDALTELWRAARIDGNALGAVVALRRFHAGNRQELVAAARAAREAASLLTGSSFVKAQTIDVIDNVPFVAHELAGGRSLRHIVERARGGAGITPNPIPIDQSIVIAEKVALSLATTAELRYAGDRMVHGALIPQFVWIGDDGEIRVAGQLLGPAVVAS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice