NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F062622

Metagenome / Metatranscriptome Family F062622

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F062622
Family Type Metagenome / Metatranscriptome
Number of Sequences 130
Average Sequence Length 109 residues
Representative Sequence MKVRFAATLIVTLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDAYVQRSRGRFVGA
Number of Associated Samples 113
Number of Associated Scaffolds 130

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 66.67 %
% of genes near scaffold ends (potentially truncated) 16.15 %
% of genes from short scaffolds (< 2000 bps) 16.15 %
Associated GOLD sequencing projects 107
AlphaFold2 3D model prediction Yes
3D model pTM-score0.28

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (83.846 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(25.385 % of family members)
Environment Ontology (ENVO) Unclassified
(25.385 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(58.462 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 27.91%    β-sheet: 22.48%    Coil/Unstructured: 49.61%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.28
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 130 Family Scaffolds
PF003892-Hacid_dh 40.77
PF028262-Hacid_dh_C 36.92
PF01451LMWPc 0.77
PF00413Peptidase_M10 0.77

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 130 Family Scaffolds
COG5549Predicted Zn-dependent proteasePosttranslational modification, protein turnover, chaperones [O] 0.77


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A83.85 %
All OrganismsrootAll Organisms16.15 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000956|JGI10216J12902_104369987All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria511Open in IMG/M
3300002568|C688J35102_118144716All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria533Open in IMG/M
3300004153|Ga0063455_101642152All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria508Open in IMG/M
3300004480|Ga0062592_102702118All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria502Open in IMG/M
3300005093|Ga0062594_102738799All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium547Open in IMG/M
3300005347|Ga0070668_101791012All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium565Open in IMG/M
3300005574|Ga0066694_10441451All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria609Open in IMG/M
3300005587|Ga0066654_10643836All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria590Open in IMG/M
3300005713|Ga0066905_101765206All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300006058|Ga0075432_10538141All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria526Open in IMG/M
3300009038|Ga0099829_11300273All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium601Open in IMG/M
3300012951|Ga0164300_11037661All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium531Open in IMG/M
3300013307|Ga0157372_13436948All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium503Open in IMG/M
3300015357|Ga0134072_10389669All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria546Open in IMG/M
3300026326|Ga0209801_1356151All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium515Open in IMG/M
3300028716|Ga0307311_10225498All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium553Open in IMG/M
3300028718|Ga0307307_10209728All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria618Open in IMG/M
3300028719|Ga0307301_10206568All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium638Open in IMG/M
3300028744|Ga0307318_10296070All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium567Open in IMG/M
3300028793|Ga0307299_10366818All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium540Open in IMG/M
3300028824|Ga0307310_10735033All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium507Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil25.38%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil13.08%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil6.15%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.85%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment3.08%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil3.08%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere3.08%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.31%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.31%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand2.31%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.54%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.54%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost1.54%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.54%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.54%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.54%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.77%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.77%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.77%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.77%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.77%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.77%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.77%
SoilEnvironmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil0.77%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.77%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.77%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.77%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.77%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.77%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.77%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300001334Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A21-65cm)- 6 month illuminaEnvironmentalOpen in IMG/M
3300001686Grasslands soil microbial communities from Hopland, California, USAEnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300004081Grasslands soil microbial communities from Hopland, California, USA - 2 (version 2)EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300006058Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1Host-AssociatedOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009801Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_20_30EnvironmentalOpen in IMG/M
3300009818Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40EnvironmentalOpen in IMG/M
3300009837Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30EnvironmentalOpen in IMG/M
3300010045Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012941Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t4i015EnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012984Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300014056Permafrost microbial communities from Nunavut, Canada - A20_5cm_0MEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015077Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2)EnvironmentalOpen in IMG/M
3300015357Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019255Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019868Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s1EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028713Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_184EnvironmentalOpen in IMG/M
3300028716Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198EnvironmentalOpen in IMG/M
3300028718Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194EnvironmentalOpen in IMG/M
3300028719Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_182EnvironmentalOpen in IMG/M
3300028722Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_368EnvironmentalOpen in IMG/M
3300028744Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_367EnvironmentalOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300028778Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_142EnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028802Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 17_SEnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028876Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300028880Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_181EnvironmentalOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300032074Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R1EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10216J12902_10436998713300000956SoilMPKRFLMRTTLAAFLLALVIVPSANAYHLEGGRWPTSTIRYYNEVPAYTWAVDTAAYAWNSSGTRVQFLKSSRKDADVLLGIRWFKIAGEARMQR
A2165W6_104928323300001334PermafrostMPKRQAMKVRLAATLAVTLVAIALPSSASAFRLEGGRWPTATITYYNEIPAYAWAVDSAAYAWNTSGARVRFLKSSRRDAKVLVGIRWFRAAGDANIQRTNGRFVAAKVGIQNGQD
C688J18823_1080633513300001686SoilMPKRMLMRATLAALLLALVLAPAAGAYRFEGGRWPTTTIRYYNEVPAYTWAVDTAAYAWNTSGAHVQFLKSSRRDADVLLGIRWFKVAGEARIQRVNRRIVGAKIGIRIGQDRYVMALV
C688J35102_11814471623300002568SoilMPKRMLMRATLAALLLALVLVPAAGAYRFEGGRWPTTTIRYYNEVPAYTWAVDTAAYAWNTSGARVQFLKSSRRDADVLLGIRWFKVAGEARIQRVNRRIVGAKIGIRSGQD
Ga0063454_10036016613300004081SoilMPKRFLMRTILVACLLALVLVPSAGAYRLEGGRWPTTTVRYYNEVPAYTWAVDSAAFAWNTSGAHVRFVKSSRQNADVLVGIRWFKIAGEARVHRLAGRIVRAEVGIQSGND
Ga0063455_10148732323300004153SoilMPKPILMRATLAAFLLALVLVPSAGAYRLEGGRWPTRTIRYYNEVPAYTWAVDSAAFAWNSSGAHVQFLKSSRKDADVLVGIRWFKIAGEARL
Ga0063455_10164215223300004153SoilMPKRILMRATLAALLLALVLVPDAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAYAWNTSGARVQFLKSSRRDADVLLGIRWFKTAGEARIQRVNRRIVGAKVGIRSGQ
Ga0062592_10270211813300004480SoilMPKRILMRATLAAFLLALFLVPSAGAYHLEGGRWPANTVRYYNEVPAYAWAVDTAAYAWNSSGARVQFVKSSRKDADVLLGIRWFKIAGEARIRRLNGRIVGARVGIRN
Ga0062591_10019874133300004643SoilMPKPEPMRLGLAALLGATLVAVVAAPNADSFRFEGGRWPTTTITYYNEVPAYTWAVDTAAYAWNTSGARVRFLKTSRRDAKVLVGIRWFKAAGDANVQRANGRFTSAKVGIQSGQ
Ga0062594_10273879923300005093SoilMKVRFAAPLIVTLVAIALPSPAGAYHLEGGRWPTATITYYNEVPAYTWAIDTAAYAWNTSGARVRFVKSSRRDARVLVGIRWYKVAGDAHVQRIDGRFVGAKVGIQS
Ga0062594_10279267713300005093SoilMPTTEPMRATLAALLLAVLFVPTAGAYRLEGGRWPTSTIRYYNEVPAYNWAVDTAAYSWNTSGARVQFLKTSRKNADVLIGIRWFKIAG
Ga0066683_1042013613300005172SoilMKERLALTLAVALVAIAVPSAAGAYRFEGGRWPTATITYYNEVPAYAWAVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAGDANVQRTTG
Ga0066678_1033375813300005181SoilMPKRQAMKVRFAATLIVMLVAIALPSAAGAFRLEGGRWPTSTITYYNEIPAYAWAVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAG
Ga0066678_1054906723300005181SoilMKLRLAAILGAAFVAASCTSGAGAYRLEGGRWSTATIPYYNEVPAYAWAVDTAAYAWNTSGARVQFVRSSRRDARVLIGIRWFENAGDANVQRVKGRFVGARVGIRKGQDRYTMALVVTHELGHVL
Ga0070680_10195666713300005336Corn RhizosphereMPKPFLMRATLAALLLALVLVPSADAYSFLGGRWQTTTITYYNEVPAYTWAVDSAAYAWNSSGAHVRLLKSSRSNAKILLGVQWFRPGGETRPVIRGGRIYGAKIGIRNGLDRYTMALVVAHELG
Ga0070689_10105432213300005340Switchgrass RhizosphereMKVRFAAPLIVTLVAIALPSPAGAYHLEGGRWPTATITYYNEVPAYTWAIDTAAYAWNTSGARVRFVKSSRRDARVLVGIRWYKVAGDAHVQRIDG
Ga0070692_1050935113300005345Corn, Switchgrass And Miscanthus RhizosphereMPKRILMRATLAALLLAVVFVPTAGAYRLEGGRWPTTTIRYFNEVPAYTWAVDTAAFAWNSSGAHVQLLKSSRRDANVLVGIRWFKSAGEARVQRLNGRIVGAKVGIQSGQDRYTMAL
Ga0070668_10179101213300005347Switchgrass RhizosphereMPKRTHVNVRLAAILGAAIVAVTAVPSAGAYRLEGGRWPTRTITYYNEVPAYSWAVDTAAYAWNTSGARVQFVKMPRRDAKVLVGVRWFKVAGDANVQRLKDGRFIGAQVGIRTGQDRYT
Ga0070671_10083114613300005355Switchgrass RhizosphereMKVRFAAPLIVTLVAIALPSPAGAYHLEGGRWPTATITYYNEVPAYTWAIDAAAYAWNTSGARVRFVKSSRRDARVLVGIRWYKVAGDAHVQRIDGR
Ga0070705_10093333523300005440Corn, Switchgrass And Miscanthus RhizosphereMRLSSAAALIATLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDA
Ga0073909_1043219513300005526Surface SoilMPKPFLMRTTFAALLLALVLVPSANAYHFEGGRWSTTTIRYYNEVPAYTWAVDTAAFAWNSSGAHVQFLKSSRQNADVLLGIRWFKIAGEAR
Ga0070695_10104332913300005545Corn, Switchgrass And Miscanthus RhizosphereMRLSSAATLIATLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDAYVQRSRGRFVGAKVGI
Ga0066695_1026279213300005553SoilMKVRLAATLIVTLVAIALPSPAGAYKLEGGRWPTATISYFNEVPAYTWAVDSAAYAWNTSGARVQFVKSSRRDAKVLVGIRWFKAAGDAYVQRSKGRFVSAKVGIQSGQDRYTMALVVA
Ga0066698_1048320923300005558SoilMKAMLVAFLLALTLAPAADAYRFEGGRWPTTTITYYNEVSAYTWAVDTAAYAWNSSGAHVQFLKSPSARNADVLVGVRWFKIAG
Ga0066700_1069759123300005559SoilMKERLALTLAVALVAIAVPSAAGAYRFEGGRWPTATITYYNEVPAYAWAVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAGDASVQRTTGGFVSAKVGIQNGQDRYTMA
Ga0066703_1061609013300005568SoilMPKRKAMKVRLALTLALALVAIGIPCSAGAFRLEGGRWPTGTITYYNEIPAYTWAVDSAAYAWNTSGARVQFVKTSRRNANVLIGIRWFKAAGDASVQRGNGGFVSAKVGIQNGQERYTM
Ga0066694_1044145113300005574SoilMPKRILMRATLAAFLLALVLVPSAGAYGLEGGRWATPTITYYNEVPAYTWAVDTAAYAWNTSGAHVRFLKSSRRDAKVLLGVQWFKPGGETRPEMRGGRIYGAKVGIRNGLDRYTMALVVAHELGHVLGLG
Ga0066654_1064383613300005587SoilMPTPERMRVGLAAFLGAALFAAVAVPSADSFRLEGGRWPTTVITYYNEVPAYSWTVDSAAYAWNTSGARVRFLKTSRRNAKVLVGIRWFKAAGDANVHRTNGRFTSALVGIQSGQDRYTMALVVAHEFGH
Ga0066905_10176520623300005713Tropical Forest SoilMPKRILMRTTLAALLLTLVLVPSANAYTFLGGRWQNTTITYYNEVPAYTWAVDTAAYAWNSSGAHVRFLKSSRRNAQVLLGVQWFRPGGETRPNVRNGRIYGAKIGIRNGL
Ga0075432_1053814123300006058Populus RhizosphereMPKSSLMRATLAAILLALVLVPSAGAYRLEGSRWATTTITYYNEVPAYTWAVDSAAYAWNSSGARVQFLKSSRKNAKVLLGIQWFTPAGEAIVDRRHGRIYGAKVGIRSG
Ga0066653_1046035513300006791SoilMPKRILMRATLAAFLLALVLVPSAGAYGLEGGRWATPTITYYNEVPAYTWAVDTAAYAWNTSGAHVRFLKSSRRDAKVLLGVQWFKPGGETRPDMRGGRIYGAKVGIRNGLDRYTMALVVAHELGHVLG
Ga0066653_1059547923300006791SoilMPKPILMRATLAALLLALVLVPTASAYRLEGGRWPTSTIRYYNEVPAYTWSVDTAAYAWNTSGARVQFLKSSKQDADVLIGIRWFKVAGEARLHRLAGRIVRAEVRIQSGNDRYVMALVA
Ga0066665_1037076313300006796SoilMKVRLALTLALALVAIAIPCSAGAFRLEGGRWPTGSITYYNEIPAYAWAVDSAAYAWNTSGARVQFVKTSRRNANVLIGIRWFKAAGDASVQRGTGGFVSAKVGIQNGQ
Ga0066665_1133178313300006796SoilMPKTSAMKALLVAILLAVTFAPAAAAYHLEGGRWPTTTITYYNEVPAYAWSVDTAAFAWNSSGARVQFLKTSHARNADVLVGVRWFKIAGEAHIE
Ga0075434_10051547223300006871Populus RhizosphereMPKSSLMRATLAAILLALVLVPSAGAYRLEGSRWATTTITYYNEVPAYTWAVDSAAYAWNSSGARVQFLKSSRKNAKVLLGIQWFTPAGEAIVDRRHGRIYGAKVGIRSGLDR*
Ga0068865_10219731923300006881Miscanthus RhizosphereMPKRTHVNVRLAAILGAAIVAVTAVPSAGAYRLEGGRWPTRTITYYNEVPAYSWAVDTAAYAWNTSGARVQFVKVPRRDAKVLVGVRWFKVAGDANVQRLKD
Ga0075424_10178262713300006904Populus RhizosphereMPTTEPMRATLAALLLAVVFVPTAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDSAAFAWNSSGAHVQFLKSSRKNADVLVGIRWFKIA
Ga0079219_1110894923300006954Agricultural SoilMPKSLLMRATLAAILLALTLVPSAGAYTFLGGRWHATTIPYYNEVPAYRWSVDTAAYAWNTSGARVRFVKSSRRDAKVLLGVRWFRLGGEARPEIRGGRIYSAKVGIRNGLD
Ga0099793_1021088413300007258Vadose Zone SoilMPKRRTMKVRLALTLALALVAIAIPCSAGAFRLEGGRWPTGTITYYNEIPAYTWAVDSAAYAWNTSGARVQFVKTSRRNANVLIGIRWFKAAGDASVQRGTGGFVSAKVGIQNGQ
Ga0099829_1130027313300009038Vadose Zone SoilMKVRPAAIVAVTLVAIAIPSSAGAFRLEGGRWPTSTITYYNEIPAYAWAVDSAAYAWNTSGAHVQFLKSSRRNAKVLVGIRWFKAAGDASIQRTNGRFVGAKVGIQNGQDRYTM
Ga0066709_10054831813300009137Grasslands SoilMKIRSAATLIVTLVAIVLPSSAGAYQLEGGRWPTATITYYNEVPAYAWAVDTAAYTWNTSGARVQFRKSSRRDAKVLLGIRWFKALGEANVQRNNKGRIVNAKVGIQSGQDRYTM
Ga0105056_100469533300009801Groundwater SandMPKRLTMKVRIAAIIGAALVAITVVPGSKAFQLEGGRWPTSTITYYNEVPAYSWAVDNAAYAWNTSGARVRFLKSSRRDAKVLVGIRWFKAAG
Ga0105072_103179923300009818Groundwater SandMKVKVAATTLGAALLAITFVPGSSAYRIEGGRWPTATITYYNEVPAYSWAVYTAAYAWNTSGARVRFVKSSRRNAKVLVGIRWFKEAGDANVQRVNG
Ga0105058_110749323300009837Groundwater SandMKVKAAATLGAALLAITFVPGSSAYRIEGGRWPTATITYYNEVPAYSWALDTAAYAWNTSGARVRFVKSSRRNAKVLVGIRWFKVAGDANVQRVNGRFLGAQ
Ga0126311_1135736613300010045Serpentine SoilMPTRNLMRATLVALLIAVVFVPTAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAFAWNSSGARLQFLKSSRSDADVLLGIRWFKHAGEARVQRRNGRIVGAKVGIQSGQDRYT
Ga0134070_1026748713300010301Grasslands SoilMKVRLALTLAVTLVAIAIPSAAGAFRLEGGRWPTATITYYNEIPAYAWAVDTAAYAWNTSGARVRFLKTSRRAAKVLIGIRWFKVAGDANVQRVNGRFTGAQVG
Ga0134071_1019251923300010336Grasslands SoilMPKRQTMKLRFAATLIVTLVAIALPSSAGAYQLEGGRWPTVTITYYTEVPAYAWAVDTAAYTWNTSGARVQFRKSSRRDAKVLLGIRWFKALGEANVQ
Ga0134128_1226026623300010373Terrestrial SoilMRATLAALLLAVLFVPTAGAYRLEGGRWPTSTIRYYNEVPAYNWAVDTAAYAWNTSGARVQFVKSSRRDAKVLLGIRWFKVAGDANVQRTNGRFLGAQVGIR
Ga0105239_1175933913300010375Corn RhizosphereMRATLAVLLLAVVFVPTAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDSAAFAWNSSGAHVQFLKSSRKDADVLVGIRWFKIAGEARIQRAAGRMYGAQIGIRSGQDRYVMALVTAHEQGEAAE
Ga0137392_1096789213300011269Vadose Zone SoilMKERLALTLAVALVAIAVPSAAGAYRFEGGRWRSATITYYNEVPAYAWAVDSAAYAWNTSGARVQFLKSSRGDATVLVGIRWFKAAGDASVQRSTSGFVSAKVG
Ga0137393_1092246513300011271Vadose Zone SoilMKVRLAATLAVTLVAIAVPSSAGAFRLEGGRWPTATVTYYNEIPAYAWAVDSAAYAWNTSGAHVQFLKSSRRNAKVLVGIRWFKAAGDAN
Ga0137364_1009747853300012198Vadose Zone SoilMPKRILMRATLAAFLLALFLVPSAGAYRLEGGRWPASTIRYYNEVPAYTWAVDTAAYAWNSSGARVQFVKSSRKDADVLLGIRWFNVAGEARIRRLNGR
Ga0137364_1036479123300012198Vadose Zone SoilMKVRLALTLAPALALVAIAIPCSAGAFRLEGGRWPTGTITYYNEIPAYTWAVDSAAYAWNTSGARVQFVKTSRRNANVLIGIRWFKAAGDASVQRGNGGFVSAKVGIQNGQERYTMALVVAH
Ga0137364_1110648823300012198Vadose Zone SoilMKVRSAAILIVTLVAIALPSPAGAYQLEGGRWPTPTITYYNEVPAYAWAVDTAAYAWNTSGARVQFRKSSRRDAKVLLGIRWFKALGETSVQLNNKG
Ga0137383_1062821623300012199Vadose Zone SoilMKERLAVTLAVALVAIAVPSAAGAYRFEGGRWPTATITYYNEVPAYAWAVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAGDASVQRTTGGFVSAKVGIQNGQDRYTMALVVAHEL
Ga0137365_1052197823300012201Vadose Zone SoilMKVRLALILALTLVAIAVPSSAGAFRLEGGRWPTATITYYNEIPAYAWAVDSAAYAWNTSGARVQFVKTSRRDAKVLIGIRWFKAAGDASVQRSKGGFVS
Ga0137363_1040252323300012202Vadose Zone SoilMKERLAVTLAVALVAIAVPSAAGAYRFEGGRWPTATITYYNEVPAYAWAVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWFRAAGDASVQRTTGGFVSAKVGIQNG
Ga0137380_1037241123300012206Vadose Zone SoilMPKRIDVKATLAALLLALTLVPAANAFRLEGAHWPTTTITYYNEVPAYSWAVDSAAYAWNTSGARVRFLKSSRRDAKVLIGIRWFKVAGDANVRHL
Ga0137380_1087575123300012206Vadose Zone SoilMQLRLAALLGAALVAATATPAADSFRLEGGRWPTTTITYYNEVPAYSWAVDTAAYAWNTSGARVKFVKTSRRDAKVLIGIRWFKVAGDANVQRVNGRFMGAKVGIRSGEDRYTMALVVTHELGHVL
Ga0137381_1041482413300012207Vadose Zone SoilMPKRQTMKLRFAATLIVTLVAIALPSSAGAYQLEGGRWPTATITYYNEVPAYAWAVDTAAYTWNTSGARVQFRKSSRRDAKVLLGIRWFKALGEANVQRNNKGRIVSAKVGIQSGQDRYTMAL
Ga0137376_1020430743300012208Vadose Zone SoilMKIRFAATLIVTLVAIVLPSSAGAYQLEGGRWPTATITYYNEVPAYAWAVDTAAYAWNTSGARVQFRKSSRRDAKVLLGIRWFKALG
Ga0137378_1084341413300012210Vadose Zone SoilMKVRLALILALTLVAIAVPSSAGAFRLEGGRWPTATITYYNEIPAYAWAVDSAAYAWNTSGARVQFVKTSRRDAKVLIGIRWFKAAGDASVQRSNRGFVSAKVGIQNGQDR
Ga0137378_1131004713300012210Vadose Zone SoilMKATLAAFLLALTLVPTAGAYRFEGGRWPTTTITYYNEVPAYTWAVDTAAYAWNSSGAHVQFLKSPSARNADVLVGVRWFKIAGEARIQRFSGRIIGAQIGIQNGQDRYTMALV
Ga0137370_1058064123300012285Vadose Zone SoilMPKPMLMRATLAAFLLALVLVPSAGAYRLEGGRWPTATITYYNEVPAYSWSVDTAAYAWNTSGARVQFVKSSRRDARVLVGIRWFKVAGDAHVQRVNRRFVGAQVG
Ga0137373_1019384133300012532Vadose Zone SoilMPKRILMRATLAALLLVLVFVPTAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAFAWNSSGARVQFLKSSRQNADVLVGIRWFKIAGEARLHRLNGRIVRAEVGIQSGHDRYVMTLVTAHEF
Ga0137397_1073088213300012685Vadose Zone SoilMPKRTAMKMRSAAILTVTLVALALPSAAGAYQLEGGRWPTPTITYFNEVPAYAWAVDSAAYAWNTSGARVQFRKSSRREAKVLVGI
Ga0137404_1017551513300012929Vadose Zone SoilVTLAATLVAIAIPSSAGAFRLEGGRWPTSTITYYNEVPAYAWAVDSAAYAWNTSGARVRFLKSSRRDAKVLVGIRWFRAAGDANVQRVNGRFVGAKVGIQNGQDRYTMALVVAHELGHVL
Ga0162652_10008124313300012941SoilIVTLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDAYVQRQGLESGVGP*
Ga0164300_1103766123300012951SoilMKVRFAAPLIVTLVAIALPSPAGAYHLEGGRWPTATITYYNEVPAYTWAVDTAAYAWNTSGARVQFVKSSRRDARVLVGIRWYKLAGDANVQR
Ga0164298_1042817513300012955SoilMKLRSAAILIVTLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDAYVQRSRGRFLAAKVGIQSGQDRYTM
Ga0134076_1035900213300012976Grasslands SoilMPKRQTMKLRFAATLIVTLVAIALPSSAGAYQLEGGRWPTVTITYYNEVPAYAWAVDSAAYAWNTSGARVQLVKSSRRDAKVLVGIRWFKAAGDANVHRTN
Ga0134087_1020186413300012977Grasslands SoilMPKRFLMRTILVACLLALVLVPSAGAYRLEGGRWPTTTVRYYNEVPAYTWAVDSAAFAWNTSGAHVRFVKSSRQNADVLVGIRWFKI
Ga0164309_1128018413300012984SoilMRTTLVVSLLALVLVPSAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDSAAFAWNSSGAHVRFLKSSRKDADVLIGIRWFKVAGEARIHRLAGRIVAAEVGIQSGNDRYVMALVTTH
Ga0164307_1083885623300012987SoilMKLRSAAILIVTLVAIALPSSAGAYELEGGRWPTATITYFNEVPAYSWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWF
Ga0164305_1076584723300012989SoilMPKPSLMRTTLVVSLLALVLVPSAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDSAAFAWNSSGAHVQFLKSSRKDADVLVGIRWFKIAGEARVQRVNGRIVSAKVGIR
Ga0157374_1119272523300013296Miscanthus RhizosphereMRATLAVLLLAVVFVPTAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDSAAFAWNSSGAHVQFLKSSRTNADVLVGIRWFKIAGEARIQRAAGRMYGAQIGIRSGQDRYVMALVTAH
Ga0157372_1343694813300013307Corn RhizosphereMKVRFAAPLIVTLVAIALPSPAGAYQFEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQVVKSSRRNAKVLVGIRWVKAAGDAYVQRSRGR
Ga0120125_109884913300014056PermafrostMKVRLAATLAVTLVAIALPSSASAFRLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVHFLKGSRRDAKVLIGIRWFKAAGDANVQRNKGRFVSAKVGIQSGQDRYTMALVVAHELG
Ga0134078_1041824113300014157Grasslands SoilMPKPILMRAMLAAFLLALVLVPSAGAYRLEGGRWPTHTIRYYNEVPAYTWAVDTAAYAWNSSGVRVQFLKSSKKNADVLVGIRWFKIAGEARLHRLAGRIVR
Ga0173483_1022952613300015077SoilMPKRILMRATLAAFLLALFLVPSAGAYHLEGGRWPANTVRYYNEVPAYTWAVDTAAYAWNSSGARVQFVKSSRKDADVLLGIRWFKIAGEARIRRLNGRIVGARVGIRNGQDRYV
Ga0134072_1038966923300015357Grasslands SoilMPKRFLMRTILVACLLALVLVPSAGAYRLEGGRWPTTTVRYYNEVPAYTWAVDSAAFAWNTSGAHVRFVKSSRQNADVLVGIRWFKIAGEARVHRLAGRIVRAEVGIQSGNDRYVMALIT
Ga0134089_1024201513300015358Grasslands SoilMPKRLTMKLRFAATLIVTLVAIALPSSAGAYQLEGGRWPTATITYYNEVPAYAWAVDTAAYTWNTSGARVQFRKSSRRDAKVLLGIRWFKALGEAN
Ga0132257_10047557513300015373Arabidopsis RhizosphereMPKSLLMRATLAAILLALTLVPSAGAYTFLGGRWHPTTIPYYNEVPAYRWSVDTAAYAWNTSGARVRFVKSSRRDAKVLLGVRWFRLGGEARPDIRGGRIYSAKVGIRNGLDRYTMALVV
Ga0134083_1007244723300017659Grasslands SoilMPKRQTMKLRFAATLIVTLVAIALPSSAGAYQLEGGRWPTATITYYNEVPAYAWAVDTAAYTWNTSGARVQFRKSSRRDAKVLLGIRWFKALGEANVQRN
Ga0184623_1023822023300018056Groundwater SedimentMKVRIAATIGAALVAITVVPGSKAFQLEGGRWPTSTITYYNEVPAYSWAVDNAAYAWNTSGARVRFLKSSRRDAKVLVGIRWFKAAGDASVQRVNGRFLSAKVGIRNGQDRY
Ga0184619_1006980933300018061Groundwater SedimentVTLAVTLGAIAIPSSAGAFRLEGGRWPTSTITYYNEIPAYAWAVDSAAYAWNTSGARVRFLKSSRRDAKVLIGIRWFKVAGDANIQ
Ga0184618_1035036423300018071Groundwater SedimentMRATLVALLLAVVFVPAAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAFAWNSSGARVQFLRSPRKTADVLVGIRWFKVAGEAHLHRLNGRIFRAEVGIRSG
Ga0184625_1046463123300018081Groundwater SedimentMPKRFLMRATLAALLLAVAFVPTAGAYRLEGGRWPTPTIRYYNEIPAYRWAVDSAAFAWNSSGARVQFLKSSRRDADVLVGIRWFKIAGE
Ga0066669_1084746623300018482Grasslands SoilMKERLAVTLAVALVAIAVPSAAGAYRFEGGRWPTATITYYNEVPAYAWAVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAGDANVQRTTGGFVSAKVGILIGQDRYTMALVVAH
Ga0066669_1198705123300018482Grasslands SoilMPKRILMRATLAAFLLALFLVPSAGAYHLEGGHWPANTIRYYNEVPAYTWAVDTAAYAWNSSGARVQLVKSSRKDADVLLGIRWFKVAG
Ga0184643_100322623300019255Groundwater SedimentMPKRQAMKVRFAATLILMLVAIGLPSPAGAYQLEGGRWPKATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRDAKVLVGIRWFKAAGDANVQRSNGRFVSAKVGIQSGQDRYTMALVVAHELG
Ga0193720_103365023300019868SoilVGGENLQGLSLPCRNEQAMRLRSAATLIVTLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRDAKVLVGIRWFKAAGDAYVQRSRGRFVGAKVGIQSGQDRYTMALVVTHE
Ga0210381_1006656623300021078Groundwater SedimentVSHAETDLMRATLVALLLAVVFVPAAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAFAWNSSGARVQFLRSPRKTADVLVGIRWFKVAGEAHLHRLNGRIFRAEVGIRSGEDRYTMALVTAHEFGHVL
Ga0207687_1086233423300025927Miscanthus RhizosphereMKVRFAAPLIVTLVAIALPSPAGAYHLEGGRWPTATITYYNEVPAYTWAIDTAAYAWNTSGARVRFVKSSRRDARVLVGIRWYKVAGDAHVQRIDGRFVGAKVGIQSGQDRYTMALV
Ga0207706_1124882723300025933Corn RhizosphereMPKRILMRATLAALLLAVVFVPTAGAYRLEGGRWPTTTIRYFNEVPAYTWAVDTAAFAWNSSGAHVQLLKSSRRDANVLVGIRWFKSAGEARVQRLNGRIVGAKVGIQSGQDRYTMALVTAH
Ga0207704_1181836013300025938Miscanthus RhizosphereMPKRTHVNVRLAAILGAAIVAVTAVPSAGAYRLEGGRWPTRTITYYNEVPAYSWAVDTAAYAWNTSGARVQFVKMPRRDAKVLVGVRWFKVAGDANVQRLKDGRFIGAQV
Ga0209801_135615123300026326SoilMPKRQAMKVRFAATLIVMLVAIALPSAAGAFRLEGGRWPTSTITYYNEIPAYAWAVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAGD
Ga0209058_115592613300026536SoilMPKRQTMKLRFAATLIVTLVAIALPSSAGAYQLEGGRWPTATITYYNEVPAYAWAVDTAAYTWNTSGARVQFRKSSRRDAKVLLGIRWFKALGEANVQRNNKGRIVSAKVGIQSG
Ga0209056_1035462113300026538SoilMKVRLALTLALALVAIAIPCSAGAFRLEGGRWPTGSITYYNEIPAYAWAVDSAAYAWNTSGARVQFVKTSRRNANVLIGIRWFKAAGDASVQRGTGGFVSAKVGIQNGQERYTMALVVAHELGHVLGL
Ga0209073_1034151713300027765Agricultural SoilMPKPFLMRATLAALLLALVLVPSADAYSFLGGRWQTTTITYYNEVPAYTWAVDSAAYAWNSSGAHVRLLKSSRSNAKILLGVQWFRPGGETRPVIRGGRIYGAKI
Ga0209382_1062758813300027909Populus RhizosphereMPKRILMRATLAAFLLALFLVPSAGAYHLEGGRWPANTVRYYNEVPAYTWAVDTAAYAWNSSGARVQFVKSSRKDADVLLGIRWFKVAGEARIRRLNGRIVGARVGIRNGQDRYVMALVTAHE
Ga0307303_1005087223300028713SoilMKLRSAAILIVTLVAIALPSSAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRDAKVLVGIRWFKAAGDAYVQRSRGRFIGAKVGIQSGQDRYTMALVVTHELGHVLGL
Ga0307311_1018921223300028716SoilMPKRILMRAMLVTLLLAVVFVPAAGAYRLEGGRWPTRTIKYYNEVPAYTWAVDTAAFAWNSSGARVQFLKTSRQNADVLVGIRWFTGKAGEARVHRLNGRIIRAEVGIQSGQDRYTMAL
Ga0307311_1022549813300028716SoilMTVRLAATLGAAFVVLTLVPAAHAYRLEGGRWPTATITYYNEVPAYAWAVDTAAYAWNTSGARVQFLKTSRRNAKVLVGIRWFKVAGDANVQRVNGRFLGARVGIQS
Ga0307307_1020972823300028718SoilMPKPILMRATLVALLLAVIVVPAAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAFAWNTSGARVQFLKSSRQNADVLVGIRWFKIAGEARLHRSTSGRIVRAEVGIQSGNDRYVMALVTAHEFGHVLGL
Ga0307301_1020656813300028719SoilMPKRQAMKLRFAATLIVTLVAIALPSPAGAYQLEGGRWPTPRITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRDAKVLVGIRWFKVAGDANVQRSKGRFLSAKVGIQSGQDRYTMALVVAHELGHVLGLD
Ga0307319_1000089113300028722SoilMPKRVAMKVRLALTLAVMLVAIVIPSSAGAFRLEGGRWPTATITYYNEIPAYAWAVDSAAYAWNTSGARVQFLKSSRRNAKVLVGIRWYKEAGDAHVQRVNGRFVSAKVGIQNGQDRYTMALVVAHELVHV
Ga0307319_1013736313300028722SoilMNARLAALLGTLVLLLVLGLAPSAGAYRLEGGRWPQAVITYYNEVPAYSWAVDTAAYAWNTSGARVQFLKSSRRDAKVLLGIRWFRMAGDANVQQVNGR
Ga0307318_1029607023300028744SoilMKLRSAATLIVTLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDAYVQRSKGRFVGAKVGIQSGQDRYTMALVVTHELGHVL
Ga0307320_1036807313300028771SoilMKVKVAATIGAALVAITVVPGSSAFELEGGRWPTSTIAYYNEVPAYSWAVDTAAYAWNTSGARVRFVKSSRRDAKVLVGIRW
Ga0307288_1009415223300028778SoilVSHAETDLMRATLVALLLAVVFVPAAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAFAWNSSGARVQFLRSPRRSADVLVGIRWFTGKAGEARIHRLSGR
Ga0307288_1032097513300028778SoilMRLRSAAILIVTLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDAYVQRSRGRFLGAKVGIQS
Ga0307299_1036681813300028793SoilMKVRFAATLIVTLVAIALPSPAGAYQLEGGRWPTATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRNAKVLVGIRWFKAAGDAYVQRSRGRFVGA
Ga0307503_1008478513300028802SoilMKVRFAAPLIVTLVAIALPPQAGAYHLEGGRWPSATITYYNEVPAYTWAIDTAAYAWNTSGARVRFVKSSRRDARVLVGIRWYKVAGDAHVQRINGRFVGAKVGIQS
Ga0307305_1011370223300028807SoilMPKRTAMKVRLAVTLAVTLVAIAIPSSAGAFRLEGGRWPTSTITYYTEIPAYAWAVDSAAFAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAGDANVQRVNGRFVAAKVGIQNGQD
Ga0307305_1019000813300028807SoilMPKRILMRATLVALLLAVVFVPAAGAYRVEGGRWPTKKIRYYNEVPAYTWAVDTAAFAWNSSGAHVQFLKSPRKSANVLVGIRWFTGKAGEARI
Ga0307305_1040034913300028807SoilMPKRQAMKVRFEATLIVMLVAIALPSPAGAYQLEGGRWPKATITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRDAKVLVGIRWFKAAGDANVQRVNGRFIAAK
Ga0307302_1026961523300028814SoilMPKRILMRATLAALLLAVVFVPTTGAYRLEGGRWPTPTIRYYNEVPAYRWAVDSAAFAWNSSGARVQFLKSSRRDADVLIGIRWFKIAGEARIQRVQGRIVGAHVGIRSGQD
Ga0307296_1054968113300028819SoilMPKRILMRATLVALLLTVVFVPAAGAYRLEGGRWPTKTIRYYNEVPAYTWAVDTAAFAWNSSGARVQFLRSPRRSADVLVGIRWFTGKAGEARIHR
Ga0307310_1041788823300028824SoilMPKRQAMKLRFAATLIVTLVAIALPSPAGAYQLEGGRWPTPRITYFNEVPAYAWAVDSAAYAWNTSGARVQFVKSSRRDAKVLVGIRWFKVAGDANVQRSKGRFLSAKVGIQSGQDRYTMALVVAHELGHVL
Ga0307310_1073503313300028824SoilMPKRTAMKVRLAVTLAVTLVAIAIPSSAGAFRLEGGRWPTSTITYYTEIPAYAWAVDSAAFAWNTSGARVQFLKSSRRDAKVLVGIRWFKAAG
Ga0307312_1043443923300028828SoilMPKRILMRATLVALLLAVVFVPAAGAYRLEGGRWPTKTIRYYNEVPAYTWAVDTAAFAWNSSGARVQFLRSPRRSADVLVGIRWFTGKAGEARIHRLSGRIVRAEVGIQSGQDRYTMALVTTHEFGHVLWLDHE
Ga0307312_1079730223300028828SoilMKVRFAAPLIVTLVAIALPSLAGAYHLEGGRWPTATITYYNEVPAYAWAVDTAAYAWNTSGARVQFVKSSRRDARVLVGIRWYKIAGDAHVQRVNGRFVGAKV
Ga0307286_1024540613300028876SoilMPKRILMRATLVALLLTVVFVPAAGAYRLEGGRWPTKTIRYYNEVPAYTWAVDTAAFAWNSSGARVQFLRSPRKTADVLVGIRWFKVAGEAHLHRLNGRIFRAEV
Ga0307278_1010496513300028878SoilMPKRVAMKVRLTLTLAVMLVAIAIPSSAGAFRLEGGRWPTATITYYNEIPAYAWSVDSAAYAWNTSGARVQFLKSSRRDAKVLVGIRWYKEAGDAHIHRMNGRFV
Ga0307300_1007208813300028880SoilMNARLAALLGTLVLLLVLGLAPSAGAYRLEGGRWPQAVITYYNEVPAYSWAVDTAAYAWNTSGARVQFLKSSRRDAKVLLGIRWFRMAGDANVQQVNGRFVSAQVGIKSGQDRYTMALVVAHELGHVLG
Ga0308175_10113933213300031938SoilMPTPTLMRTTLAAILIALVFVPTAGSYSLLGGRWQSTTITYYNEVPAYTWAVDTAAYAWNTSGAHVRFVKSSRRNAKVLLGIQWFRPGGEARPDIRNGRIYGAKV
Ga0308173_1037169913300032074SoilMPKRFLMRTILATCLLALVLAPSAGAYRLEGGRWPTTTVRYYNEVTAYTWAVDSAAFAWNTSGAHVRFLKSSRRNADVLVGIRWFKIAGEARVHRFAGRIVRAEVGIQSGN
Ga0307470_1109244223300032174Hardwood Forest SoilMPTTEPMRATLAALLLAVVFVPTAGAYRLEGGRWPTTTIRYYNEVPAYTWAVDTAAFAWNSSGAHVQFLKSSRKNADVLVGIRWFKIAGEARIQRAAGRLYGAQIGIRSG
Ga0310810_1038078333300033412SoilMPKPFLMRATLAALLLALVLVPSADAYSFLGGRWQTTTITYYNEVPAYTWAVDSAAYAWNSSGAHVRLLKSSRSNAKILLGVQWFRPGGETRPVIRGGRIYGAKIGIRNGLDR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.