NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F022245



Overview

Basic Information
Family ID: F022245
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 215
Average Sequence Length: 85 residues
Representative Sequence: ANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQSYDSAIVPTKLTIVTATSVTYCVDSTVGGKVWNKNGPGADIVSGACT
Number of Associated Samples: 166
Number of Associated Scaffolds: 215

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 3.26 %
% of genes from short scaffolds (< 2000 bps): 0.93 %
Associated GOLD sequencing projects: 159
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group: Unclassified (96.744 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (22.791 % of family members)
Environment Ontology (ENVO): Unclassified (29.767 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (56.744 % of family members)
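
The member percentages above can be converted back into whole-member counts using the 215 sequences listed under Basic Information. The following is a minimal sketch in Python; the function name `members_from_percent` and the dictionary labels are assumptions made for this illustration and are not part of NMPFamsDB.

```python
FAMILY_SIZE = 215  # "Number of Sequences" reported for family F022245


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' figure into a whole-member count."""
    return round(percent / 100 * family_size)


# Percentages copied from the Overview section; labels are illustrative only.
shares = {
    "Taxonomy: Unclassified": 96.744,
    "GOLD: Soil": 22.791,
    "ENVO: Unclassified": 29.767,
    "EMPO: Soil (non-saline)": 56.744,
}

counts = {label: members_from_percent(pct) for label, pct in shares.items()}

for label, count in counts.items():
    print(f"{label}: {count} of {FAMILY_SIZE} members")
```

Rounding is used because the page reports percentages to three decimals, so the product is only approximately an integer.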





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary structure distribution: α-helix 21.82%, β-sheet 19.09%, coil/unstructured 59.09%

Predicted 3D Structure

pTM-score: 0.51

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
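
The low-confidence rule stated above (pTM < 0.7) can be expressed as a one-line check. This is only a sketch: the constant and function names are hypothetical, and the cutoff is simply the value quoted on this page.

```python
LOW_CONFIDENCE_PTM_CUTOFF = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# Family F022245 reports a pTM-score of 0.51.
family_ptm = 0.51
print(f"pTM {family_ptm}: low confidence = {is_low_confidence(family_ptm)}")
```

The comparison is strict, matching the "pTM < 0.7" wording, so a score of exactly 0.7 would not be flagged.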



Gene Neighborhood

Neighboring Pfam domains

| Pfam ID | Name | % Frequency in 215 Family Scaffolds |
|---|---|---|
| PF07963 | N_methyl | 20.93 |
| PF13633 | Obsolete Pfam Family | 16.28 |
| PF13544 | Obsolete Pfam Family | 7.91 |
| PF05157 | T2SSE_N | 3.26 |
| PF00437 | T2SSE | 0.93 |
| PF01202 | SKI | 0.47 |
| PF02589 | LUD_dom | 0.47 |
| PF12802 | MarR_2 | 0.47 |
| PF10604 | Polyketide_cyc2 | 0.47 |
| PF13649 | Methyltransf_25 | 0.47 |
| PF01636 | APH | 0.47 |




Phylogeny

NCBI Taxonomy

| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| Unclassified | root | N/A | 96.74 % |
| All Organisms | root | All Organisms | 3.26 % |

Associated Scaffolds


| Scaffold | Taxonomy | Length |
|---|---|---|
| 3300005558\|Ga0066698_10098620 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1925 |
| 3300005764\|Ga0066903_100301719 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2535 |
| 3300006046\|Ga0066652_100010816 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 5827 |
| 3300006175\|Ga0070712_100061465 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2654 |
| 3300009012\|Ga0066710_100151476 | All Organisms → cellular organisms → Bacteria | 3226 |
| 3300012014\|Ga0120159_1083886 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 934 |
| 3300012200\|Ga0137382_10017680 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 4011 |
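
Each scaffold identifier listed here appears to join a numeric taxon OID and a scaffold name with a `|` character; for example, 3300005558 also appears as a Taxon OID under Associated Samples. The sketch below splits such a key in Python. The `ScaffoldKey` type and `parse_scaffold_key` function are hypothetical helpers written for this illustration, and the two-part layout is an assumption inferred from the rows on this page rather than a documented format.

```python
from typing import NamedTuple


class ScaffoldKey(NamedTuple):
    """Two-part scaffold identifier as it appears on this page (assumed layout)."""
    taxon_oid: str
    scaffold_id: str


def parse_scaffold_key(key: str) -> ScaffoldKey:
    """Split 'taxon_oid|scaffold_id' into its parts, validating the basic shape."""
    taxon_oid, sep, scaffold_id = key.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold key: {key!r}")
    return ScaffoldKey(taxon_oid, scaffold_id)


example = parse_scaffold_key("3300005558|Ga0066698_10098620")
print(example.taxon_oid, example.scaffold_id)
```

Keeping the taxon OID as a string avoids any assumption about its numeric range and makes it easy to match against the Taxon OID column of the samples table.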




Environmental Properties

Associated Habitat Types

| Habitat | Taxonomy | Distribution |
|---|---|---|
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 22.79% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 14.42% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 13.95% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 6.05% |
| Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 5.12% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.65% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.79% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.33% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.33% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.33% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 2.33% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.86% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.86% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.86% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.86% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.40% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.40% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 1.40% |
| Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.93% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.93% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.93% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.93% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.93% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.47% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.47% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.47% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.47% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.47% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.47% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.47% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.47% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.47% |
| Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.47% |



Associated Samples

| Taxon OID | Sample Name | Habitat Type |
|---|---|---|
| 3300000886 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-65cm-3A)- 1 week illumina | Environmental |
| 3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental |
| 3300001535 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A3-PF-15A)- 1 week illumina | Environmental |
| 3300001538 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-PF 4A)- 1 week illumina | Environmental |
| 3300001686 | Grasslands soil microbial communities from Hopland, California, USA | Environmental |
| 3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental |
| 3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental |
| 3300004153 | Grasslands soil microbial communities from Hopland, California, USA (version 2) | Environmental |
| 3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental |
| 3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental |
| 3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental |
| 3300005093 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks | Environmental |
| 3300005161 | Soil and rhizosphere microbial communities from Laval, Canada - mgLPA | Environmental |
| 3300005165 | Soil and rhizosphere microbial communities from Laval, Canada - mgHMC | Environmental |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental |
| 3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental |
| 3300005184 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 | Environmental |
| 3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental |
| 3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental |
| 3300005365 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3H metaG | Environmental |
| 3300005366 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaG | Host-Associated |
| 3300005466 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3L metaG | Environmental |
| 3300005543 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG | Host-Associated |
| 3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental |
| 3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental |
| 3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental |
| 3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental |
| 3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental |
| 3300005564 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG | Host-Associated |
| 3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental |
| 3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental |
| 3300005578 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 | Host-Associated |
| 3300005587 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 | Environmental |
| 3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental |
| 3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental |
| 3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental |
| 3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental |
| 3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental |
| 3300006605 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAB (Metagenome Metatranscriptome) | Environmental |
| 3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental |
| 3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental |
| 3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated |
| 3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental |
| 3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated |
| 3300009801 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_20_30 | Environmental |
| 3300010096 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental |
| 3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental |
| 3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental |
| 3300010322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG | Environmental |
| 3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental |
| 3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental |
| 3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental |
| 3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental |
| 3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental |
| 3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental |
| 3300011992 | Permafrost microbial communities from Nunavut, Canada - A23_65cm_12M | Environmental |
| 3300011998 | Permafrost microbial communities from Nunavut, Canada - A30_35cm_6M | Environmental |
| 3300012010 | Permafrost microbial communities from Nunavut, Canada - A7_35cm_12M | Environmental |
| 3300012014 | Permafrost microbial communities from Nunavut, Canada - A10_80cm_6M | Environmental |
| 3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental |
| 3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental |
| 3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental |
| 3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental |
| 3300012212 | Combined assembly of Hopland grassland soil | Host-Associated |
| 3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental |
| 3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental |
| 3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental |
| 3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental |
| 3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental |
| 3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental |
| 3300012399 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental |
| 3300012405 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental |
| 3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated |
| 3300012907 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S044-104R-1 | Environmental |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental |
| 3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental |
| 3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental |
| 3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental |
| 3300012955 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MG | Environmental |
| 3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental |
| 3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental |
| 3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental |
| 3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental |
| 3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental |
| 3300013102 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaG | Host-Associated |
| 3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated |
| 3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated |
| 3300013501 | Permafrost microbial communities from Nunavut, Canada - A35_65cm_0.25M | Environmental |
| 3300013770 | Permafrost microbial communities from Nunavut, Canada - A15_5cm_18M | Environmental |
| 3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental |
| 3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental |
| 3300014969 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG | Host-Associated |
| 3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated |
| 3300015372 | Soil combined assembly | Host-Associated |
| 3300015374 | Col-0 rhizosphere combined assembly | Host-Associated |
| 3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental |
| 3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental |
| 3300018067 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_coex | Environmental |
| 3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental |
| 3300019868 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s1 | Environmental |
| 3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1 | Environmental |
| 3300019885 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2 | Environmental |
| 3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental |
| 3300020010 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1s2 | Environmental |
| 3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental |
| 3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental |
| 3300025903 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes) | Host-Associated |
| 3300025926 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG (SPAdes) | Host-Associated |
| 3300025928 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes) | Environmental |
| 3300025931 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes) | Host-Associated |
| 3300025932 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaG (SPAdes) | Host-Associated |
| 3300025940 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG (SPAdes) | Host-Associated |
| 3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated |
| 3300025945 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes) | Host-Associated |
| 3300026095 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes) | Host-Associated |
| 3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental |
| 3300026528 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes) | Environmental |
| 3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental |
| 3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental |
| 3300028705 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_115 | Environmental |
| 3300028711 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150 | Environmental |
| 3300028713 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_184 | Environmental |
| 3300028714 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_196 | Environmental |
| 3300028721 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_355 | Environmental |
| 3300028722 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_368 | Environmental |
| 3300028744 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_367 | Environmental |
| 3300028755 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_356 | Environmental |
| 3300028768 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_119 | Environmental |
| 3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental |
| 3300028787 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381 | Environmental |
| 3300028793 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159 | Environmental |
| 3300028796 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141 | Environmental |
| 3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental |
| 3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental |
| 3300028811 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149 | Environmental |
| 3300028824 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197 | Environmental |
| 3300028875 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_143 | Environmental |
| 3300028876 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140 | Environmental |
| 3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental |
| 3300028880 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_181 | Environmental |
| 3300028881 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116 | Environmental |
| 3300030511 | Bulk soil microbial communities from Mexico - Amatitan (Am) metaG (v2) | Environmental |
| 3300030986 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_143 (Metagenome Metatranscriptome) | Environmental |
| 3300030990 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome) | Environmental |
| 3300031096 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_194 (Metagenome Metatranscriptome) | Environmental |
| 3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental |
| 3300031199 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_S | Environmental |
| 3300031226 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_S | Environmental |
| 3300031366 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 25_S | Environmental |
| 3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental |
| 3300034447 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_119 (Metagenome Metatranscriptome) | Environmental |

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
AL3A1W_15277921 | 3300000886 | Permafrost | RAAIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKTWNKAGPGADIVSGACP*
JGI10216J12902_1009950502 | 3300000956 | Soil | RAAIPAIEAYNADNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLDFPNAPTSVTYCIQSTVGGKIWRKNGPGQAIENLAC*
A3PFW1_100213861 | 3300001535 | Permafrost | NNSAAKANVRAAIPAVEAFNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIKTATSVTYCVDSTVGAKAWNKAGPGADIVSGLCP*
A10PFW1_115031441 | 3300001538 | Permafrost | NSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKDGPGADIVSGACT*
C688J18823_100478894 | 3300001686 | Soil | RANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSITYCVDSTVGGKTWNKNGPGADIVTDPCT*
C688J18823_104369391 | 3300001686 | Soil | ANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTDPCT*
JGIcombinedJ26739_1009472862 | 3300002245 | Forest Soil | FRDRANNSAAKANVRAAIPAVEAYNADNLSTGNSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWNKSGPGADIVSGACP*
C688J35102_1206876543 | 3300002568 | Soil | DRANNSAAKANVRAAIPAIEAYNADNLGTGNSSGYAGMTVSYLRDNYDSELVTTKLAFPNAPTSVTYCVQSTVGGKTWSKNGPGAAIVNAVC*
Ga0063455_1013081332 | 3300004153 | Soil | NSAAKANVRAAIPAIEAYNADNLGTGNSAGYAGMTVSYLRDNYDSELVTTKLDFPVAPTSITYCVQSTVGGKTWSKNGPGAAIVNAVC*
Ga0063455_1016117221 | 3300004153 | Soil | YLSFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSSGYAGMTVSVLRDTYDSELVTTKLAFPNAPTSITYCVESTVGGKTWRKNGPGQSIENLAC*
Ga0062590_1029866461 | 3300004157 | Soil | SYLSFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSAGYAGMTVSFLRDNYDSELVTTKLDFPNAPTSVTYCVQSTVGGKTWSKNGPGAAIVNAVC*
Ga0062595_1023661011 | 3300004479 | Soil | ANNSAAKANVRAAIPAVEAYNADNTGTGASAGYAGMTVSLLQVYDSAIVPTKLLIKSADAVTYCVQSTVGGKVFNKAGPGADIVTGACP*
Ga0062591_1006984501 | 3300004643 | Soil | IPAIEAYNADNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLAFPSAPTSVTYCVQSTVGGKTWSKNGPGAAIANAVCP*
Ga0062594_1027377972 | 3300005093 | Soil | LSFRDRANNSAAKANVRAAIPAIEAYNADNVGTGNSAGYAGMTVSYLRDNYDSELVTTKLAFPSAPTSITYCVQSTVGGKTWSKNGPGAAIANAVCP*
Ga0062594_1029616262 | 3300005093 | Soil | NNSAAKANVRAAIPAIEAFNADNTGTGNSAGYAGMTVALLRDTYDSELVTTKLSFPNAPTSVTYCVQSTVGGKTWRKNGPGAALENLAC*
Ga0066807_10409271 | 3300005161 | Soil | LAIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKVWNKAGPGADIVTNACP*
Ga0066869_100677571 | 3300005165 | Soil | RAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT*
Ga0066683_106687051 | 3300005172 | Soil | SFRDRANNSAAKANVRAAIPAIEAFNADNTGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSVTYCVQSTVGGKTWRKNGPGQSIENLAC*
Ga0066680_105589672 | 3300005174 | Soil | NQSAAKANVRAAVPAVEAYNADNTGTGNSAGYAGMTVSGLQLYDSAIVPTKLTIQSATSLTYCVQSTVGPATWKKAGPGADIVTGACP*
Ga0066680_107327681 | 3300005174 | Soil | VRAAVPAVEAYNADNTGTGNSAGYAGLTISALQTYDSALVPTKLTIQSADSVTYCIQSTVGSATWKKAGPGADIVTGACP*
Ga0066673_107418222 | 3300005175 | Soil | ANNSAAKANVRAAIPAIEAYNADNTGTGNSSGYNGMTISYLRDNYDSELVTTKLSLPSAPTSVTYCVESTVGGKTWRKNGPGAAIDNNTCP*
Ga0066671_100558913 | 3300005184 | Soil | VEAYNADNTGTGNSAGYAGMTVSALQTYDSAIVPSKLLIKSADSLTYCVQSTVGPAVWNKAGPGADIVTGACP*
Ga0066675_111750992 | 3300005187 | Soil | IPAIEAYNADNTGTGNSSGYNGMTISYLRDNYDSELVTTKLSLPSAPTSVTYCVESTVGGKTWRKNGPGAAIDNNTCP*
Ga0070690_1013245892 | 3300005330 | Switchgrass Rhizosphere | SYLSFRDRANNSAAKANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT*
Ga0070688_1014179241 | 3300005365 | Switchgrass Rhizosphere | SFRDRANNSAAKANVRAAIPAVEAYNADNVGTGNSAGYAGMTVSYLRDNYDSELVTTKLDFPNAPTSVTYCVQSTVGGKTWSKNGPGAAIVNAVC*
Ga0070659_1020480702 | 3300005366 | Corn Rhizosphere | RANNSAAKANVRASIPAIEAYNADNVGTGNSAGYAGMTVSYLRDNHDSELVTTKLSLPSAPTSVTYCIQSTVGGKTWSKNGPGAAIVNAVC*
Ga0070685_106959122 | 3300005466 | Switchgrass Rhizosphere | IPAIEAYNADNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLDFTNAPTSVTYCVQSTVGGKTWSKNGPGAAIVNAVC*
Ga0070672_1008096121 | 3300005543 | Miscanthus Rhizosphere | LLAIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKNGPGADIVSGQCT*
Ga0070695_1015393271 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | RANNSAAKANVRAAIPAIEAFNADNTGTGNSAGYAGMTVALLRDTYDSELVTTKLSFPNAPTSVTYCVQSTVGGKTWRKNGPGAALENLAC*
Ga0070696_1012088962 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | LSFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSAGYAGMTVAGLRDTYDSELVTTKLSFPNAPTSVTYCVESTVGGKTWRKNGPGAALENLAC*
Ga0066692_101188314 | 3300005555 | Soil | AVEAYNADNTGTGNSAGYAGMTVSGLQTYDSAIVPTKLTIQSADSVTYCVQSTVGGATWKKAGPGADIVTGACP*
Ga0066707_100355816 | 3300005556 | Soil | IPAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGAKAWNKAGPGADIVSGLCP*
Ga0066707_102453871 | 3300005556 | Soil | VPAVEAYNADNTGTGNSAGYAGMTVSGLQNYDSAIVPTKLTIQSADSLTYCIQSTVGPATWKKAGPGADIVTGACP*
Ga0066707_108853152 | 3300005556 | Soil | ANNSAAKANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIKSADSITYCVESTVGGKVWNKAGPGADIVTGACP*
Ga0066698_100986201 | 3300005558 | Soil | RANNSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKDGPGADIVSGACT*
Ga0066700_101400911 | 3300005559 | Soil | EAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWNKAGPGADIISGACP*
Ga0066700_108184261 | 3300005559 | Soil | PAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGAKAWNKAGPGADIVSGLCP*
Ga0066699_106634511 | 3300005561 | Soil | ANNSAAKANVRAAIPAVEAYNADNVGTGNSAGYAGMTVSLLQAYDSAIVPTKLSIVSATSVTYCVQSTVGGKTWNKNGPGADIVTGVC*
Ga0070664_1002689314 | 3300005564 | Corn Rhizosphere | ANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT*
Ga0066703_104076621 | 3300005568 | Soil | DRANNSAAKANVRAAIPAVEAYNADNTGTGTSAGYAGMTVSALQTYDSAIVPTKLFIKSADATTYCIQSTVGNAVWNKQGPGADIVTGACP*
Ga0066702_105615532 | 3300005575 | Soil | IPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLFIKSADSVTYCVESTVGGKVWNKAGPGADIVSGSCP*
Ga0068854_1014117681 | 3300005578 | Corn Rhizosphere | LLAIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT*
Ga0066654_101260681 | 3300005587 | Soil | ANVRAAIPAVEAYNADNTGTGTSAGYAGMSISGLQTYDSAIVPTKLFIKTANATTYCIQSTVGNAVWNKQGPGADIVTGACP*
Ga0066654_106918001 | 3300005587 | Soil | AIPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLFIKSADSVTYCVESTVGGKVWNKAGPGADIVTGACP*
Ga0066706_115143601 | 3300005598 | Soil | LSFRDRANNSAAKANVRAAIPAIEAYNADNVGTGNSAGYAGMTVSYLRDNYDSELVTTKLAFPNPPTSGTYCVQSTVGGKTWSKNGPGAAIQNATC*
Ga0066903_1003017191 | 3300005764 | Tropical Forest Soil | GFTLIELFVVAIILGILLVITVASYLSYRDHEDSSTARANVLAAIPAIKAYKAANSSNGYAGMTVSYLRDHYDSGLVTTKLSFPKSPTSRTYCVESTVGGNTWRENGPGQPVEHEAC*
Ga0066651_108268241 | 3300006031 | Soil | NQSAAKANVRAAVPAVEAYNADNTGTGNSAGYAGMTVSALQNYDSAIVPTKLTIQSADSLTYCVQSTVGPATWKKAGPGADIVTGACP*
Ga0066656_108918832 | 3300006034 | Soil | IPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLFIKSADSVTYCVQSTVGGKVWNKAGPGADIVTGACP*
Ga0066656_111018462 | 3300006034 | Soil | IPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLTIKSADSVTYCVESTVGGKVWNKAGPGADIVTGSCP*
Ga0066652_1000108168 | 3300006046 | Soil | AKANVRAAVPAVEAYNADNTGTGNSAGYAGMSVSALQNYDSAIVPTKLTIQSADSLTYCIQSTVGPATWKKAGPGADIVTGACP*
Ga0066652_1002019691 | 3300006046 | Soil | AAIPAMEAYNADNTGTGNSAGYAGMTVSYLRDTYDSELVTTKLAFPNAPTSVTYCVQSTVGGKVWRKNGPGQAIENLPC*
Ga0066652_1014708741 | 3300006046 | Soil | ANQSAAKANVRAAVPAVEAYNADNTGTGNSAGYAGMSVSGLQNYDSAIVPTKLTIQSATSLTYCIQSTVGPATWKKSGPGADIVTGSCP*
Ga0066652_1017952342 | 3300006046 | Soil | PSYLSFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSSGYAGMTVSYLRDNYDSELVTTKLSFPNVPTSVTYCVESTVGGKTWRKNGPGASIENLAC*
Ga0066652_1019820472 | 3300006046 | Soil | PSYLSFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSSGYAGMTVSYLRDNYDSELVTTKLAFPNVPTSVTYCVQSTVGGKTWRKNGPGQAIENLAC*
Ga0070712_1000614651 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | RANNSAAKANVRAAIPAVEAYNADNVGTGNSAGYAGMTVSYLRDNYDSELVTTKLSLPSAPTSVTYCIQSTVGGKTWSKNGPGAAIANATCP*
Ga0074057_118892261 | 3300006605 | Soil | SAAKANVRAAIPAVEAYNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTVFRATSITYCVDSTVGGKVWKKDGPGADITVGQCT*
Ga0079222_110460682 | 3300006755 | Agricultural Soil | RANNSAAKANVRAAIPAIEAYNADNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLAFPSAPTSVTYCVQSTVGGKTWSKNGPGAAIANAVCP*
Ga0066653_105645121 | 3300006791 | Soil | NSAAKANVRAAIPAMEAYNGDNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLNFPNAPTSVTYCVESTVGGKTWRKNGPGQAIENLPC*
Ga0079221_106257432 | 3300006804 | Agricultural Soil | ANVRAAVPAMEAYNADNTGTGSSAGYAGATVAALSTYDSAIVPTKLTIQSANSVTYCMQSTVGAATWKKAGPGADIVTGACP*
Ga0075429_1012181192 | 3300006880 | Populus Rhizosphere | AIPAIEAYNADNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLDFPSAPTSVTYCVQSTVGGKTWSKNGPGAAIANAVCP*
Ga0079219_114527612 | 3300006954 | Agricultural Soil | SFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSAGYAGMTVSFLRDNYDSELVTTKLDFPSPPTSVTYCVQSTVGGKTWSKNGPGAAIANAVCP*
Ga0066710_1001514761 | 3300009012 | Grasslands Soil | IPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLFIKSADSVTYCVESTVGGKVWNKAGPGADIVTGSCP
Ga0066710_1045181891 | 3300009012 | Grasslands Soil | NNSAAKANVRAAIPAIEAFNADNIGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSVTYCVQSTVGGKTWRKNGPGQSIENLAC
Ga0066710_1048190112 | 3300009012 | Grasslands Soil | AVEAYNADNTGTGNSAGYAGMTVSGLQAYDSAIVPTKLTIQSATSLTYCIQSTVGPATWKKSGPGADIVTGSCP
Ga0099827_105358343 | 3300009090 | Vadose Zone Soil | ADNTGTGNSAGYAGLTISALQTYDSALVPTKLTIQSADSVTYCIQSTVGAATWKKAGPGADIVTGACP*
Ga0066709_1005063821 | 3300009137 | Grasslands Soil | AIPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLTIKSADSVTYCVESTVGGKVWNKAGPGADIVTGSCP*
Ga0066709_1010784463 | 3300009137 | Grasslands Soil | SYLSFKDRANSAAAKANVRAGIPAVEAFNADNTGTGASAGYAGMTVSLLQTYDSAIVPTKLHIGTGASLPTATTYCVSSTIGGNTWWKGGPGADIVQDTTATPLPSGC*
Ga0066709_1029302532 | 3300009137 | Grasslands Soil | RAAIPAVEAYNADNTGTANSAGYAGMTVSLLQSYDSAIVPTKLNIVAATSVTYCVESTVGGKVWNKNGPGADIVSGAC*
Ga0066709_1041356451 | 3300009137 | Grasslands Soil | AIPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLFIKSADSVTYCVQSTVGGKVWNKAGPGADIVTGACP*
Ga0114129_121003731 | 3300009147 | Populus Rhizosphere | AAKANVRAAIPAVEAYNADNNGTGNSAGYAGMTVSLLQAYDSAIVPTKLTIVSANSVTYCVQSTVGGKTWNKNGPGADIVTGVC*
Ga0105056_10510842 | 3300009801 | Groundwater Sand | PAIEAYNADNVGTGNSAGYAGMTVSLLRDTYDSELVTTKLSFPNAPTSVTYCVQSTVGGKTWRKNGPGQSIENLAC*
Ga0127473_10731481 | 3300010096 | Grasslands Soil | AIEAYNADNTGTGASGGYAGATVSALQAYDSAIVPTKLTIQSASSTSYCIQSTVGSATWKKAGPGADIVSGACP*
Ga0134070_102159662 | 3300010301 | Grasslands Soil | ANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQSYDSAIVPTKLTIVTATSVTYCVDSTVGGKVWNKNGPGADIVSGACT*
Ga0134082_105664632 | 3300010303 | Grasslands Soil | VEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLTIASADSVTYCVSSTVGGRTWNKKGPGADIVTGVCT*
Ga0134084_102994402 | 3300010322 | Grasslands Soil | VEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGAKAWNKAGPGADIVSGLCP*
Ga0134064_102959381 | 3300010325 | Grasslands Soil | NSAAKANVRAAIPAIEAYNADNIGTGNSAGYAGMTVSFLRDTYDSELVTTKLAFPNAPTSVTYCIQSTVGGKIWRKNGPGQAIENLAC*
Ga0134071_102364471 | 3300010336 | Grasslands Soil | NSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLNIKTATSITYCVDSTVGGKVWNKNGPGADIVSGAC*
Ga0134071_104226771 | 3300010336 | Grasslands Soil | NNSAAKANVRAAILAIEAYNADNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLSFPNAPTSVTYCIQSTVGGKTWRKNGPGQAIENLAC*
Ga0126379_114298862 | 3300010366 | Tropical Forest Soil | SAAKANVRAAIPAIEAYNADNTGTGNSAGYAGMTVSFLRDNYDSELVTTKLAFPSAPTSVTYCVQSTVGGKTWSKNGPGAAIANAVCP*
Ga0126379_128649972 | 3300010366 | Tropical Forest Soil | VVAIILGILLVITVASYLSYRDHEDSSTARANVLAAIPAIKAYKAANSSNGYAGMTVSYLRDHYDSGLVTTKLSFPKSPTSRTYCVESTVGGNTWRENGPGQPVEHEAC*
Ga0134124_115923651 | 3300010397 | Terrestrial Soil | EAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT*
Ga0126383_100430961 | 3300010398 | Tropical Forest Soil | EAYNADNTGTGNSAGYAGMTVSFLRDNYDSELVTTKLAFPNAPTSVTYCVQSTVGGKTWRKNGPGAAIENLPC*
Ga0126383_101011491 | 3300010398 | Tropical Forest Soil | KANVRAAIPAIEAYNADNTGTGNSAGYAGMTVSYLRDNYDSELVTTKLAFPNPPTSVTYCVESTVGGKTWRKNGPGAAIENLPC*
Ga0137391_107397411 | 3300011270 | Vadose Zone Soil | AIPAVEAYNADNLGTGNSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGAKAWNKAGPGADIVSGLCP*
Ga0120146_10032591 | 3300011992 | Permafrost | IPAVEAFNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIVTATSVTYCVDSTVGGKVWNKGGPGADIVSGACP*
Ga0120114_11006032 | 3300011998 | Permafrost | PSYLSFRDRANNSAAKANVRAAIPAVEAFNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIVTATSVTYCVDSTVGGKVWNKGGPGADIVSGACP*
Ga0120118_11497422 | 3300012010 | Permafrost | SFRDRANNSAAKANVRAAIPAVEAFNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIVTATSVTYCVDSTVGGKVWNKGGPGADIVSGACP*
Ga0120159_10838863 | 3300012014 | Permafrost | IAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNLGTGNSAGYAGMTVSLLQAYDSAIVPTKLNITTANSVTYCVDSTVGGKVWNKAGPGADIVSGACP*
Ga0120159_12186181 | 3300012014 | Permafrost | YLSFRDRANNSAAKANVRAAIPAVEAYNADNTGTVNSAGYAGMTVSLLQTYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKDGPGADIVSGACT*
Ga0137382_100176801 | 3300012200 | Vadose Zone Soil | IPAIEAFNADNIGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSVTYCVQSTVGGKTWRKNGPGQSIENLAC*
Ga0137382_101990924 | 3300012200 | Vadose Zone Soil | VPSYLSFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSSGYAGMTVSYLRDNYDSELVTTKLAFPNIPTSVTYCVQSTVGGKTWRKNGPGQAIENLAC*
Ga0137382_112722542 | 3300012200 | Vadose Zone Soil | AKANVRAAIPAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWNKSGPGADIVSGACP*
Ga0137399_110513141 | 3300012203 | Vadose Zone Soil | AFNADNLGTGNSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTGSCP*
Ga0137374_108871381 | 3300012204 | Vadose Zone Soil | NNSAAKANVRAAIPAIEAFNADNLGTGNSAGYAGMTVSLLRDTYDSELVTTKLSFPNAPTSVTYCVQSTVGGKTWRKNGPGQSIENLVC*
Ga0137376_100366595 | 3300012208 | Vadose Zone Soil | YNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGAKAWNKAGPGADIVSGLCP*
Ga0137376_106378561 | 3300012208 | Vadose Zone Soil | AYNADNLGTGNSAGYAGMTVSLLQAYDSAIVPTKLNIKTATSVTYCVDSTVGGKTWNKNGPGADIVTNACT*
Ga0137376_110607392 | 3300012208 | Vadose Zone Soil | EAYNADNTGTGASGGYTGMTVSALQAYDSAIVPAKLTIVTADSTTYCVQSSVGGGTYSKSGPGAEIVSGACP*
Ga0137376_113698202 | 3300012208 | Vadose Zone Soil | AAKANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQNYDSAIVPQKLVIFRATSVTYCVDSTVGGKVYNKDGPGADIVAGACT*
Ga0137376_118029121 | 3300012208 | Vadose Zone Soil | SAAKANVRAAIPAVEAYNADNTGTGASAGYAGMTVSLLQTYDSAIVPTKLFIKSADATTYCVQSTVGGKVFNKAGPGADIVTGACP*
Ga0137379_118148601 | 3300012209 | Vadose Zone Soil | YNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLTITSADSVTYCVSSTVGGKTWNKNGPGADIVSGVCT*
Ga0137378_118236521 | 3300012210 | Vadose Zone Soil | YLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTNACT*
Ga0137377_111276522 | 3300012211 | Vadose Zone Soil | YLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKDGPGADIVSGACT*
Ga0137377_116699871 | 3300012211 | Vadose Zone Soil | NNSAAKANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQSYDSAIVPTKLNIVAATSVTYCVESTVGGKVWNKNGPGADIVSGAC*
Ga0150985_1170419271 | 3300012212 | Avena Fatua Rhizosphere | SYLSFRDRANNSAAKANVRAAIPAIEAYNADNVGTGNSSGYAGMTVSYLRDNYDSELVTTKLDFPNAPTSATYCIQSTVGGKVWRKNGPGQAIENLSC*
Ga0150985_1190402034 | 3300012212 | Avena Fatua Rhizosphere | RAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTDPCT*
Ga0137387_107765971 | 3300012349 | Vadose Zone Soil | NADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIKSADSVTYCVESTVGGKVWNKAGPGADIVTGACP*
Ga0137372_106095362 | 3300012350 | Vadose Zone Soil | IILGILLGTAATSYLSCRDRANYSAAKANVRAAITAIEAFNADNTGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNPPTSVTYCVESTVGGKTWRKNGPGQSIENLAC*
Ga0137369_103378551 | 3300012355 | Vadose Zone Soil | DRANNSAAKANVRAAIPAIEAYNADNLGTGNSAGYAGMTMTGLRDTYDSELSPSKLSFPNAPTSVTYCVQSTVGGKIWRKNGPGANIENLAC*
Ga0137371_101663674 | 3300012356 | Vadose Zone Soil | AKANVRAAIPAVEAYNADNLGSGNSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVFNKAGPGADIVTGLCP*
Ga0137371_102586291 | 3300012356 | Vadose Zone Soil | DNTGTANSAGYAGMTVSLLQSYDSAIVPTKLNIVAATSLTYCVESTVGGKVWNKNGPGADIVSGAC*
Ga0137371_108524912 | 3300012356 | Vadose Zone Soil | AAKANVRAAIPAIEAFNADNTGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSVTYCVESTVGGKTWRKNGPGQSIENLAC*
Ga0137384_112258501 | 3300012357 | Vadose Zone Soil | YNADNLGTANSAGYAGMTVSLLQSYDSAIVPTKLNIASATSVTYCVDSTVGGKVWNKAGPGADIVSGACP*
Ga0137385_112010962 | 3300012359 | Vadose Zone Soil | AAIPAVEAYNADNTGTGASAGYAGMTVALLQAYDSAIVPSKLFIKTASSTTYCVQSTVAQATWNKAGPGADIVTGACP*
Ga0134061_11941462 | 3300012399 | Grasslands Soil | AKANVRAAIPAVEAYNADNTGTGTSAGYAGMSISGLQTYDSAIVPTKLFIKTANATTYCIQSTVGNAVWNKQGPGADIVTGACP*
Ga0134041_14044061 | 3300012405 | Grasslands Soil | ADNTGTGASGGYAGATVSALQAYDSAIVPTKLTIQSASSTSYCIQSTVGSATWKKAGPGADIVSGACP*
Ga0150984_1124074112 | 3300012469 | Avena Fatua Rhizosphere | AIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTDPCT*
Ga0157283_100155164 | 3300012907 | Soil | ANNSAAKANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQTYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKDGPGADIVSGACT*
Ga0137359_117966192 | 3300012923 | Vadose Zone Soil | AIPAVEAYNADNTGTGASAGYAGMTVSLLQVYDSAIVPTKLLIKSADATTYCVQSTVGGKVFNKAGPGADIVTGACP*
Ga0137419_118254381 | 3300012925 | Vadose Zone Soil | GTNVCAAIPAVEAYNADKHETANSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGGKVWNKAGPGADITVGACP*
Ga0137416_122598101 | 3300012927 | Vadose Zone Soil | RDRANNSAAKANVRAAIPAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWNKSGPGADIVSGACP*
Ga0137404_112170701 | 3300012929 | Vadose Zone Soil | AIPAIEAFNADNIGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSVTYCVESTVGGKTWRKNGPGQSIENLAC*
Ga0137407_120189961 | 3300012930 | Vadose Zone Soil | NVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLNIKTATSVTYCVDSTVGGKVWNKAGPGADIVSGACP*
Ga0164298_110668471 | 3300012955 | Soil | AFNADNTSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKDGPGADIVSGACS*
Ga0164299_108937821 | 3300012958 | Soil | NADNVGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKNGPGADIISGACT*
Ga0164299_109290922 | 3300012958 | Soil | AAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLPVFRATSITYCVDSTVGGKVWKKDGPGADITVGQCT*
Ga0134087_101173463 | 3300012977 | Grasslands Soil | AKADVRAAVPAVEAYNADNTGTGASGGYTGMTVSALQAYDSAIVPAKLTIVTADSTTYCVQSSVGGGTYSKSGPGAEIVSGACP*
Ga0134087_107694782 | 3300012977 | Grasslands Soil | SFRDRANNSAAKANIRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQSYDSAIVPTKLTIATATSITYCVDSTVGGKVWNKNGPGADIVSGAC*
Ga0164304_100378371 | 3300012986 | Soil | PSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKDGPGADIVAGQCT*
Ga0164304_112739892 | 3300012986 | Soil | AAIPAVEAYNADNTGTANSAGYAGMTVSLLQTYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKNGPGADIISGACT*
Ga0164304_114055182 | 3300012986 | Soil | VRAAIPAVEAYNADNTGTANSAGYAGMTVSLLKAYDSAIVPTKLTVFRATSITYCVDSTVGGKVWKKDGPGADITVGQCT*
Ga0164304_115410741 | 3300012986 | Soil | LSFRDRANNSAAKANVRAAIPAVEAFNADNLGTGNSAGYAGMTVSLLQAYDSAIVPTKLTIKSATSVTYCVDSTVGGKVWNKDGPGADIVSGACS*
Ga0164307_109913111 | 3300012987 | Soil | VPSYLSFRDRANNSAAKANVRAAIPAIEAYNADNTGTGNSAGYAGMTVSFLRDNYDSELVTTKLAFPSAPTSVTYCVQSTVGGKTWSKNGPGAAIANAVCP*
Ga0164305_122396941 | 3300012989 | Soil | AIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT*
Ga0157371_108678841 | 3300013102 | Corn Rhizosphere | AVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT*
Ga0157371_112903661 | 3300013102 | Corn Rhizosphere | PAIEAYNADNVGTGNSAGYAGMTVSYLRDNYDSELVTTKLSLPSAPTSVTYCIQSTVGGKTWSKNGPGAAIANASCP*
Ga0157374_126133331 | 3300013296 | Miscanthus Rhizosphere | IPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT*
Ga0157374_129328191 | 3300013296 | Miscanthus Rhizosphere | IPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKNGPGADIVSGQCT*
Ga0157375_123586731 | 3300013308 | Miscanthus Rhizosphere | VEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKNGPGADIVSGQCT*
Ga0157375_126402631 | 3300013308 | Miscanthus Rhizosphere | LSFRDRANNSAAKANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT*
Ga0120154_10184781 | 3300013501 | Permafrost | ANVRAEIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFTATSITYCVDSTVGGKVWNKNGPGADIVSGACT*
Ga0120123_11747821 | 3300013770 | Permafrost | FRDRVNNSAAKANVRAAIPAVEAFNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLFIKSADATTYCVQSTVGGKVFNKAGPGADIVTGACP*
Ga0120158_100438617 | 3300013772 | Permafrost | ADNLGTGNSAGYAGMTVSLLQAYDSAIVPTKLNITTANSVTYCVDSTVGGKVWNKAGPGADIVSGACP*
Ga0134079_101723262 | 3300014166 | Grasslands Soil | RDRANNSAAKANVRAAIPAMEAYNADNTGTGNSAGYAGMTVSYLRDTYDSELVTTKLAFPNAPTSVTYCVQSTVGGKVWRKNGPGQAIENLPC*
Ga0134079_107098322 | 3300014166 | Grasslands Soil | SAAKANVRAAVPAVEAYNADNTGTGNSAGYAGMTVSGLQAYDSAIVPAKLTIQSATSLTYCVQSTVGPATWKKAGPGADIVTGACP*
Ga0157376_109056753 | 3300014969 | Miscanthus Rhizosphere | ANNSAAKANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT*
Ga0132258_111340251 | 3300015371 | Arabidopsis Rhizosphere | RANNSAAKANVRAAIPAIEAYNADNTGTGNSAGYAGMTVSFLRDNYDSELVTTKLAFPSAPTSLTYCVQSTVGGKTWSKNGPGAAIANATCP*
Ga0132258_116532614 | 3300015371 | Arabidopsis Rhizosphere | SAAKANVRAAIPAIEAYNADNVGTGNSAGYAGMTVSYLRDNYDSELVTTKLSFPNAPTSVTYCVQSTVGGKTWSKNGPGAAIVNAVC*
Ga0132256_1036407371 | 3300015372 | Arabidopsis Rhizosphere | NNSAAKANVRAAIPAIEAYNADNTGTANSAGYNGMTVSYLRDNYDSELVTTKLAFPNAPTSVTYCIESTVGGKTWRKNGPGAAIENLPC*
Ga0132255_1000505001 | 3300015374 | Arabidopsis Rhizosphere | LSFRDRANNSAAKANVRAAIPAIEAYNADNTGTANSAGYNGMTVSYLRDNYDSELVTTKLSFPNPPTSVTYCIQSTVGGKTWRKNGPGAAIENLPC*
Ga0132255_1052331431 | 3300015374 | Arabidopsis Rhizosphere | FRDRANNSAAKANVRAAIPAIEAYNADNTGTANSSGYNGMTVSYLRDNYDSELVTTKLAFPNAPTSVTYCIESTVGGKTWRKNGPGAAIENLPC*
Ga0184605_104368532 | 3300018027 | Groundwater Sediment | DRANNSAAKANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKNGPGADIISGACT
Ga0184605_104981552 | 3300018027 | Groundwater Sediment | LAIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKDGPGADIVSGACS
Ga0184619_101192851 | 3300018061 | Groundwater Sediment | YNADNTGTANSAGYAGMTVSLLQTYDSAIVPTKLVIFRATSVTYCVDSTVGGKVWNKDGPGADIVAGACT
Ga0184611_11908151 | 3300018067 | Groundwater Sediment | NADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKNGPGADIVTGPC
Ga0184618_102240842 | 3300018071 | Groundwater Sediment | VRAAIPAVEAYNADNTGTGASAGYAGMTVSLLQTYDSAIVPTKLLIKSADATTYCVESTVGGKVFNKAGPGADIVTGACP
Ga0066667_105231972 | 3300018433 | Grasslands Soil | AIPAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGAKAWNKAGPGADIVSGLCP
Ga0066667_114544892 | 3300018433 | Grasslands Soil | AIPAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWNKSGPGADIISGACP
Ga0066662_121908882 | 3300018468 | Grasslands Soil | AYNADNTGTGASAGYAGMTVTALSAYDSAIVPTKLFIKTASATTYCIQSTVAQATWNKNGPGADIVTGACP
Ga0193720_10625921 | 3300019868 | Soil | YNADNVGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWDKNGPGADIISGACT
Ga0193722_11424801 | 3300019877 | Soil | PAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWDKNGPGADIISGACT
Ga0193747_11181321 | 3300019885 | Soil | AIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSITYCVDSTVGGKTWQKAGPGADIVTNACP
Ga0193755_10358394 | 3300020004 | Soil | NADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTNACT
Ga0193749_10833122 | 3300020010 | Soil | VEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLNIKTATSITYCVDSTVGGKVWNKAGPGADIVSGACP
Ga0210378_102454402 | 3300021073 | Groundwater Sediment | ANNSAAKANVRAAIPAVEAFNADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTNACT
Ga0207692_106184822 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | IPAIEAYNADNVGTGNSAGYAGMTVSYLRDNYDSELVTTKLAFPSAPTSITYCVQSTVGGKTWSKNGPGAAISNATCP
Ga0207680_110742382 | 3300025903 | Switchgrass Rhizosphere | SYLSFRDRANNSAAKANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT
Ga0207659_111030141 | 3300025926 | Miscanthus Rhizosphere | EAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT
Ga0207700_113829232 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | YLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT
Ga0207644_115185531 | 3300025931 | Switchgrass Rhizosphere | RDRANNSAAKANVRAAIPAVEAYNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT
Ga0207690_103760911 | 3300025932 | Corn Rhizosphere | AIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKNGPGADIVSGQCT
Ga0207691_113930361 | 3300025940 | Miscanthus Rhizosphere | RAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT
Ga0207711_114224622 | 3300025941 | Switchgrass Rhizosphere | EAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT
Ga0207679_102381134 | 3300025945 | Corn Rhizosphere | KANVRAAIPAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT
Ga0207676_115252001 | 3300026095 | Switchgrass Rhizosphere | LLAIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT
Ga0207676_121147811 | 3300026095 | Switchgrass Rhizosphere | PAVEAFNADNNSTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFRANSVTYCVDSTVGGKVWDKDGPGADIVSGPCT
Ga0209801_11470202 | 3300026326 | Soil | NQSAAKANVRAAVPAVEAYNADNTGTGNSAGYAGMTVSGLQNYDSAIVPTKLTIQSADSLTYCIQSTVGPATWKKAGPGADIVTGACP
Ga0209378_10892871 | 3300026528 | Soil | AAIPAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWNKAGPGADIISGACP
Ga0209701_106737671 | 3300027862 | Vadose Zone Soil | NNSAAKANVRAAIPAVEAYNADNANSGNSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWNKSGPGADIVSGACP
Ga0209590_102035451 | 3300027882 | Vadose Zone Soil | AVEAYNADNTSTGASAGYTGMTVALLQAYDSAIVPAKLTIVSASATTYCVQSTVAQATWNKNGPGADITTGACP
Ga0307276_100516422 | 3300028705 | Soil | AAIPAIEAFNADNTGTGNSAGYYGMTVSLLRDSYDSELVPTKLAFPNAPTSITYCVQSTVGGKTWRKNGPGQSIENLAC
Ga0307293_102218701 | 3300028711 | Soil | YLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTVSLLQAYDSAIVPTKLTIFTATSVTYCVDSTVGGKVWNKNGPGADIVSGACT
Ga0307303_100975372 | 3300028713 | Soil | VPSYLSFRDRANNSAAKANVRAAIPAIEAYNADNLGTANSSGYAGMTVSLLRDTYDSELVTTKLSFPTAPTSVTYCVQSTVGGKTWRKNGPGQSIENLVC
Ga0307309_101036881 | 3300028714 | Soil | VEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWDKDGPGADIVSGACT
Ga0307315_101683872 | 3300028721 | Soil | PSYLSFRDRANNSAAKANVRAAIPAVEAFNADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTDPCT
Ga0307319_100697533 | 3300028722 | Soil | RDRANNSAAKANVRAAIPAIEAFNADNTGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSVTYCVESTVGGKTWRKNGPGQSIENLAC
Ga0307318_101777921 | 3300028744 | Soil | AIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWSKSGPGADIVSGACP
Ga0307316_100445951 | 3300028755 | Soil | SYLSFRDSANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTAACT
Ga0307280_101327381 | 3300028768 | Soil | KANVRAAIPAIEAFNADNTGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSVTYCVESTVGGKTWRKNGPGQSIENLAC
Ga0307282_100311601 | 3300028784 | Soil | ILLAIAIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTVFRATSVTYCVDSTVGGKVWKKDGPGADITVGQCT
Ga0307323_103655991 | 3300028787 | Soil | AIPAIEAFNADNLGTGNSAGYAGMTVSLLRDTYDSELVTTKLSFPNAPTSVTYCVQSTVGGKTWRKNGPGQSIENLTC
Ga0307299_104208651 | 3300028793 | Soil | SAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTNACT
Ga0307287_104162692 | 3300028796 | Soil | AKANVRAAIPAIEAFNADNIGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSITYCVQSTVGGKTWRKNGPGQSIENLAC
Ga0307284_100018207 | 3300028799 | Soil | FRDRANNSAAKANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWDKDGPGADIVSGACT
Ga0307284_104634911 | 3300028799 | Soil | RAAIPAVEAYNADNTGTGASAGYAGMTVSLLQTYDSAIVPTKLLIKTADATTYCVESTVGGKVFNKAGPGADIVTGACP
Ga0307305_102929011 | 3300028807 | Soil | AAIPAVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSITYCVDSTVGGKVWNKNGPGADIISGACT
Ga0307292_102237402 | 3300028811 | Soil | AAKANVRAAIPAIEAFNADNTGTGNSAGYAGMTVSLLRDTYDSELVTTKLAFPNAPTSITYCVQSTVGGKTWRKNGPGQSIENLAC
Ga0307292_102269182 | 3300028811 | Soil | KANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQTYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKDGPGADIVSGACA
Ga0307310_100821301 | 3300028824 | Soil | RANNSAAKANVRAAIPAVEAFNADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTDPCT
Ga0307289_104263021 | 3300028875 | Soil | AYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWDKDGPGADIVSGACT
Ga0307286_101941381 | 3300028876 | Soil | NVRAAIPAVEAFNADNVGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTDPCT
Ga0307278_102548071 | 3300028878 | Soil | NVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQTYDSAIVPTKLVIFRATSVTYCVDSTVGGKVWNKDGPGADIVAGACT
Ga0307300_100651051 | 3300028880 | Soil | ANNSAAKANVRAAIPAVEAYNADNTGTANSAGYAGMTVSLLQTYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKDGPGADIVSGACT
Ga0307277_100779433 | 3300028881 | Soil | FRDRANNSAAKANVRAAIPAVEAYNADNLSTGNSAGYAGMTVSLLQAYDSAIVPTKLNIQTATSVTYCVDSTVGGKVWSKSGPGADIVSGACP
Ga0268241_101237541 | 3300030511 | Soil | AVPAMEAYNADNTGTGASGGYAGATVTALQQYDSAIVPTKLTIQSATSVTYCIQSTVGAATWKKAGPGADIVTGACP
Ga0308154_1056471 | 3300030986 | Soil | NSAAKANVRAAIPAVEAYNADNLGTANSAGYAGMTVSLLQAYDSAIVPTKLHIQTATSVTYCVDSTVGGKVWSKSGPGADIVSGACP
Ga0308178_10336762 | 3300030990 | Soil | SFRDRANNSAAKANVRAAIPAVEAYNADNLGTADSAGYAGMTISLLQNYDSAIVPQKLVIFRATSITYCVDSTVGGKVWNKAGPGADIVSGACT
Ga0308193_10669451 | 3300031096 | Soil | YLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTVFRATSVTYCVDSTVGGKVWKKDGPGADITVGQCT
Ga0308187_102879452 | 3300031114 | Soil | AVEAYNADNTGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKNGPGADIISGACT
Ga0307495_101696951 | 3300031199 | Soil | RAAIPAVEAYNADNVGTANSAGYAGMTVSLLQAYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWNKNGPGADIVAGACT
Ga0307497_100730321 | 3300031226 | Soil | AFNADNVGTANSAGYAGMTVSLLQAYDSAIVPTKLTVFRATSITYCVDSTVGGKVWKKDGPGADITVGQCT
Ga0307497_103547722 | 3300031226 | Soil | AIPSYLSFRDRANNSAAKANVRAAIPAVEAFNADNTGTGNSAGYAGMTISLLQAYDSAIVSTKLHIQTATSVTYCVDSTVGGKTWNKNGPGADIVTSPCT
Ga0307506_103125882 | 3300031366 | Soil | VEAYNADNTGTANSAGYAGMTVSLLQTYDSAIVPTKLTIFRATSVTYCVDSTVGGKVWKKDGPGADITVGQCT
Ga0308194_102968202 | 3300031421 | Soil | YNADNTGTGNSAGYAGMTVSLLQAYDSAIVPTKLTISSADSVTYCVSSTVGGKTWNKNGPGADIVTGIC
Ga0307472_1017905112 | 3300032205 | Hardwood Forest Soil | IEAYNADNTGTANSAGYAGMTVSYLRDNYDSELVTTKLAFPNAPTSVTYCVQSTVGGKTWRKNGPGAAIENLPC
Ga0370544_12745_2_259 | 3300034447 | Soil | AAKANVRAAVPAVEAYNADNTGTGASAGYAGMTVSLLQAYDSAIVPTKLTVFRATSVTYCVDSTVGGKVWDKNGPGADIISGACT

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice