NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F100826

Overview

Basic Information
Family ID F100826
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 141 residues
Representative Sequence MDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIERFQGYTYQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQEITAYIEAVRRSQSDNDG
Number of Associated Samples 77
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 69.61 %
% of genes near scaffold ends (potentially truncated) 43.14 %
% of genes from short scaffolds (< 2000 bps) 86.27 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70


Most Common Taxonomy
Group Unclassified (58.824 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (42.157 % of family members)
Environment Ontology (ENVO) Unclassified (45.098 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (56.863 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 31.58%, β-sheet: 20.47%, Coil/Unstructured: 47.95%

Predicted 3D Structure

pTM-score: 0.70

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF01202 | SKI | 12.75
PF01656 | CbiA | 0.98
PF09084 | NMT1 | 0.98
PF01070 | FMN_dh | 0.98
PF02371 | Transposase_20 | 0.98
PF12680 | SnoaL_2 | 0.98
PF07040 | DUF1326 | 0.98
PF00589 | Phage_integrase | 0.98
PF00903 | Glyoxalase | 0.98
PF13495 | Phage_int_SAM_4 | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG0069 | Glutamate synthase domain 2 | Amino acid transport and metabolism [E] | 0.98
COG0715 | ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 0.98
COG1304 | FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomerase | Energy production and conversion [C] | 0.98
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.98
COG4521 | ABC-type taurine transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 0.98
COG5588 | Uncharacterized conserved protein, DUF1326 domain | Function unknown [S] | 0.98
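
The frequencies in the two neighborhood tables above are percentages of the 102 family scaffolds, so each value maps back to a whole number of scaffolds. As a minimal sketch (the function name pct_to_count and the constant FAMILY_SCAFFOLDS are illustrative and not part of NMPFamsDB), the conversion can be checked in Python:

```python
# Number of Associated Scaffolds reported for family F100826.
FAMILY_SCAFFOLDS = 102


def pct_to_count(pct, total=FAMILY_SCAFFOLDS):
    """Convert a reported percentage back to a whole number of scaffolds."""
    return round(pct * total / 100)


# Two frequencies copied from the Pfam table above.
for name, pct in [("SKI", 12.75), ("CbiA", 0.98)]:
    print(name, pct_to_count(pct))
```

Read this way, 12.75 % corresponds to 13 scaffolds and 0.98 % to a single scaffold.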



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 58.82 %
All Organisms | root | All Organisms | 41.18 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000364|INPhiseqgaiiFebDRAFT_100606187 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1445
3300000364|INPhiseqgaiiFebDRAFT_105129916 | Not Available | 571
3300000858|JGI10213J12805_10407619 | Not Available | 592
3300004463|Ga0063356_104667264 | Not Available | 589
3300005178|Ga0066688_10184041 | Not Available | 1321
3300005183|Ga0068993_10131520 | Not Available | 829
3300005294|Ga0065705_10535999 | Not Available | 743
3300005294|Ga0065705_10940067 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → Pseudomonas putida group → Pseudomonas mosselii | 564
3300005332|Ga0066388_100303506 | All Organisms → cellular organisms → Bacteria | 2242
3300005445|Ga0070708_100078205 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2990
3300005445|Ga0070708_100594109 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1044
3300005447|Ga0066689_10840709 | Not Available | 569
3300005468|Ga0070707_100099613 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 2816
3300005536|Ga0070697_100108007 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella gemina | 2316
3300005555|Ga0066692_10496239 | Not Available | 777
3300005713|Ga0066905_100453264 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1055
3300005881|Ga0075294_1031878 | Not Available | 544
3300006794|Ga0066658_10543601 | Not Available | 631
3300006847|Ga0075431_100314951 | All Organisms → cellular organisms → Bacteria | 1579
3300006852|Ga0075433_11197661 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 659
3300006969|Ga0075419_10667029 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 735
3300007004|Ga0079218_12932518 | Not Available | 573
3300009012|Ga0066710_102029917 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 850
3300009012|Ga0066710_102718791 | Not Available | 705
3300009038|Ga0099829_10653412 | Not Available | 874
3300009038|Ga0099829_11737388 | Not Available | 512
3300009089|Ga0099828_10231430 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1654
3300009089|Ga0099828_10470980 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1133
3300009089|Ga0099828_10991136 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 749
3300009090|Ga0099827_10018819 | All Organisms → cellular organisms → Bacteria | 4741
3300009090|Ga0099827_10076288 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 2603
3300009090|Ga0099827_10377785 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1209
3300009090|Ga0099827_10571617 | Not Available | 974
3300009090|Ga0099827_10719461 | Not Available | 863
3300009090|Ga0099827_11099382 | Not Available | 690
3300009090|Ga0099827_11892011 | Not Available | 520
3300009094|Ga0111539_10970641 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 988
3300009147|Ga0114129_10439018 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1714
3300009162|Ga0075423_12182972 | Not Available | 601
3300009444|Ga0114945_10533504 | Not Available | 708
3300009691|Ga0114944_1018758 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2371
3300009691|Ga0114944_1181532 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 835
3300010046|Ga0126384_10155395 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_4 | 1766
3300010047|Ga0126382_10730694 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 834
3300010359|Ga0126376_10428586 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1201
3300010360|Ga0126372_12967422 | Not Available | 526
3300010362|Ga0126377_10035428 | All Organisms → cellular organisms → Bacteria | 4257
3300010366|Ga0126379_11521410 | Not Available | 775
3300011269|Ga0137392_11525421 | Not Available | 527
3300012189|Ga0137388_11940596 | Not Available | 518
3300012201|Ga0137365_10009310 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7794
3300012201|Ga0137365_10487777 | Not Available | 906
3300012204|Ga0137374_10435880 | Not Available | 1032
3300012205|Ga0137362_11752038 | Not Available | 508
3300012206|Ga0137380_10875898 | Not Available | 772
3300012206|Ga0137380_11050102 | Not Available | 695
3300012207|Ga0137381_10628276 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 935
3300012207|Ga0137381_10818536 | Not Available | 807
3300012211|Ga0137377_10691503 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 955
3300012355|Ga0137369_10773153 | Not Available | 657
3300012358|Ga0137368_10123869 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1954
3300012359|Ga0137385_10499873 | Not Available | 1029
3300012359|Ga0137385_11088953 | Not Available | 658
3300012359|Ga0137385_11318521 | Not Available | 585
3300012360|Ga0137375_10579794 | All Organisms → cellular organisms → Bacteria | 938
3300012360|Ga0137375_10757416 | Not Available | 786
3300012361|Ga0137360_11580505 | Not Available | 561
3300012362|Ga0137361_10739442 | Not Available | 897
3300012363|Ga0137390_11235455 | Not Available | 694
3300012532|Ga0137373_10638165 | Not Available | 800
3300012532|Ga0137373_10638820 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 799
3300012685|Ga0137397_10814017 | Not Available | 693
3300012927|Ga0137416_11946404 | Not Available | 539
3300015358|Ga0134089_10527852 | Not Available | 520
3300016319|Ga0182033_10788723 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 836
3300017659|Ga0134083_10344631 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 640
3300018071|Ga0184618_10314714 | Not Available | 667
3300019233|Ga0184645_1003873 | All Organisms → cellular organisms → Bacteria | 1082
3300019458|Ga0187892_10280829 | Not Available | 839
3300022195|Ga0222625_1003869 | Not Available | 672
3300022563|Ga0212128_10048613 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2734
3300022563|Ga0212128_10182247 | Not Available | 1344
3300022563|Ga0212128_10458436 | Not Available | 784
3300025580|Ga0210138_1030298 | Not Available | 1186
3300025922|Ga0207646_10207582 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1769
3300026002|Ga0208907_110113 | Not Available | 500
3300027862|Ga0209701_10031245 | All Organisms → cellular organisms → Bacteria | 3459
3300027875|Ga0209283_10229202 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1233
3300027875|Ga0209283_10554388 | Not Available | 734
3300027882|Ga0209590_10013662 | All Organisms → cellular organisms → Bacteria | 3905
3300027882|Ga0209590_10053467 | All Organisms → cellular organisms → Bacteria | 2261
3300027886|Ga0209486_10951398 | Not Available | 573
3300027907|Ga0207428_11074528 | Not Available | 563
3300027910|Ga0209583_10413435 | Not Available | 645
3300028536|Ga0137415_11322799 | Not Available | 539
3300030006|Ga0299907_10138804 | All Organisms → cellular organisms → Bacteria | 2007
3300031093|Ga0308197_10440509 | Not Available | 520
3300031094|Ga0308199_1113111 | Not Available | 611
3300031114|Ga0308187_10467563 | Not Available | 512
3300031228|Ga0299914_10486154 | Not Available | 1068
3300031421|Ga0308194_10149459 | Not Available | 719
3300034644|Ga0370548_071445 | Not Available | 657
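
Each scaffold identifier in the table above consists of a taxon OID, a vertical bar, and the scaffold name; the taxon OID prefix is the same value listed in the Taxon OID column of the Associated Samples table. The following is a minimal Python sketch, using four rows copied from the table and hypothetical helper names (parse_scaffold_id, SHORT_SCAFFOLD_BP), that splits the identifier, counts scaffolds per sample, and flags scaffolds below the 2000 bp cutoff mentioned under Quality Assessment:

```python
from collections import Counter
from typing import Tuple

# Cutoff taken from the "short scaffolds (< 2000 bps)" line in Quality Assessment.
SHORT_SCAFFOLD_BP = 2000


def parse_scaffold_id(scaffold_id: str) -> Tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, _, name = scaffold_id.partition("|")
    return taxon_oid, name


# (scaffold identifier, length) pairs copied from four rows of the table.
rows = [
    ("3300000364|INPhiseqgaiiFebDRAFT_100606187", 1445),
    ("3300000364|INPhiseqgaiiFebDRAFT_105129916", 571),
    ("3300005445|Ga0070708_100078205", 2990),
    ("3300012201|Ga0137365_10009310", 7794),
]

# Count how many of the listed scaffolds come from each sample (taxon OID).
per_sample = Counter(parse_scaffold_id(sid)[0] for sid, _ in rows)

# Keep the scaffolds shorter than the cutoff.
short = [sid for sid, length in rows if length < SHORT_SCAFFOLD_BP]

print(per_sample)
print(short)
```

The same grouping applied to all 102 rows would link every scaffold back to its sample record; only the four example rows are used here.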




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 42.16%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 6.86%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 6.86%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 5.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 5.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.88%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.90%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.96%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.96%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.96%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.96%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.96%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.96%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 1.96%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.96%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.98%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.98%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.98%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.98%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000858 | Soil microbial communities from Great Prairies - Wisconsin Native Prairie soil | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005183 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005881 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_202 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300019233 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60 (Metagenome Metatranscriptome) | Environmental
3300019458 | Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaG | Environmental
3300022195 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome) | Environmental
3300022563 | OV2_combined assembly | Environmental
3300025580 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026002 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_202 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027886 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes) | Environmental
3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300031093 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome) | Environmental
3300031094 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome) | Environmental
3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental
3300031228 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT153D57 | Environmental
3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental
3300034644 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_123 (Metagenome Metatranscriptome) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_1006061873 | 3300000364 | Soil | MGSSPMDDLSNSLGVSEDDIVVDVRYCLLYILRKENLDLREAWVQRAIMERFQGYTHQNAIRLHRIAVAHDDCIVFDLQLMNPDVAIQEIVDTISDELRDLLRWPHAPEVAHPWREIRILTIGAPESAEETIMAYIEAVRRSMSDHEG*
INPhiseqgaiiFebDRAFT_1051299161 | 3300000364 | Soil | MERRKMHDLYNILGVSEDDIVVDVRYCLLYVLRKDIXDLREDWIHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREVQIVTIGAPEFAEQEIMAYIEAVRRSTLDHNG*
JGI10213J12805_104076191 | 3300000858 | Soil | LGVSEDDVDIYIRYGITYILREGILDLSEAWVHRPIIERFRGYTHQNAVRLQRIAVDRDDCIVFDLQLVNPDVAIQEIVDEIHAVLCELLPWPQALETDNPWREIRIFTIGDAEAAEKDLAAYLEAVRRSKPKDKN*
Ga0063356_1046672641 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MHDLYRILGVSEDDVDADIRYRITYILREGMLDLSEAWIHRPIIERFRGYTHQNALRLQRIAIDRDDCIVFDLQLVNPDVAIQEIVDAIDADLGDLLPWPQALERENPWREIGIFTIGDPEAAERDLA
Ga0066688_101840412 | 3300005178 | Soil | MDDLYNILGVSEDDIVVDVRYCILYILHKDILDLREEWVSRPLMERFQGYTHQNALRLNRMAIAHDDCIVVDLHLVDPDVAIQQIVDDIADVLCACLPWPPTLGVESPWREVRILTIGDPASAEQEITAYIEAVRRSTSDKDE*
Ga0068993_101315201 | 3300005183 | Natural And Restored Wetlands | MDDLYSILGVSEDDVVINVRYGVIFILREGLLDLSEAWVHRPLREWCSGYTCQNAIRLHCLTIAHDDCLVFDLQLLAPDVAIQEVVDDISAVLGALLPWPPALEVEHPWREVQIYTIGDPESAEQELTAYIEAIRRSKPEDGA*
Ga0065705_105359991 | 3300005294 | Switchgrass Rhizosphere | MHELYSILGVSEDDIDIDIRYCITYVLREGILDLSEAWVHRPIIERFRGYTHQNAVRLHRIAVDQDDCIVFHLQLVNPDIAIQEIVDEIHGALCELLPWPQALEIENLWYEVRIFTVGDPEAVEKDLAAYLEAVRRSKSKGED*
Ga0065705_109400671 | 3300005294 | Switchgrass Rhizosphere | MHDLYSILGVSEDDVDVYIRYGITYILREGILDLSEAWVHRPIIERFRGYTHQNAVRLQRIAVDRDDCIVFDLQLVNPDVAIQEIVDEIHAVLCELLPWPQALETDNPWREIQIFTIGDPEAAEKDLAAYLEAVRRSK
Ga0066388_1003035063 | 3300005332 | Tropical Forest Soil | MERRQMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREIQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0070708_1000782052 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIERFQGYTYQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQEITAYIEAVRRSQSDNDG*
Ga0070708_1005941091 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | LREDILDLREAWVQRAIMERFQGYRHQNALRLHRVAIAPDDCIVFDLQLVDPDVAIQQIVDDISDVLSALLPWPHAPGMAHPWREVRIMTIGAPESSEEEITAYIEAVRRSTSDNDG*
Ga0066689_108407091 | 3300005447 | Soil | MHDLYSILSVSADDIVIDIHYCIIYVLREGMLDLSEAWVHRPIIERFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPTLEVEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNGG*
Ga0070707_1000996134 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MGSSTMDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIERFQGYTYQNALRLRRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQEITAYIEAVRRSQSDNDG*
Ga0070697_1001080071 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | GVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIEQFQGYTYQNALRLRRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQEITAYIEAVRRSQSDNDG*
Ga0066692_104962391 | 3300005555 | Soil | NKGMEPTASSVRSSLAPASGSSSGLALAFKFRCRYEQRMERRKMHDLYSILSVSADDIVIDIHYCIIYVLREGMLDLSEAWVHRPILERFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPTLEVEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNGG*
Ga0066905_1004532642 | 3300005713 | Tropical Forest Soil | MERRQMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARTAEHPWREIQIVTIGAPESAEQEILAYIEAVRRSTL
Ga0075294_10318781 | 3300005881 | Rice Paddy Soil | SADDIVIDIHYCIIYVLREGMLDLSEAWVHRPIIERFRGYTHQNAIRLNRIAVAYDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPTLEGEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNGG*
Ga0066658_105436011 | 3300006794 | Soil | MDDLYNILGVSEDDIVVDVRYCILYILHKDILDLREEWVSRPLMERFQGYTHQNALRLNRMAIAHDDCIVVDLHLVDPDVAIQQIVDDIADVLCACLPWPPTLGVESPWREVRILTIGDPAS
Ga0075431_1003149513 | 3300006847 | Populus Rhizosphere | MHDLYRILGVSEDDVDADIRYRITYILREGMLNLSEAWVHRPIIERFRGYTHQNALRLQRIVIDRDDCIVFDLQLVNPDVAIQEIVDAIDAALGDLLPWPQALERENPWREIGIFTIGDPETAERDLAAYLEAVRRSKPEDKNDAS*
Ga0075433_111976612 | 3300006852 | Populus Rhizosphere | IVVDVRYCLLYVLRKDILDLREDWLHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0075419_106670291 | 3300006969 | Populus Rhizosphere | MERRQMDDLYNILGVSEDDIVVDVRYCLLYVLRKDILDLREDWLHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0079218_129325181 | 3300007004 | Agricultural Soil | MQELYSILGVSEDDIDVDIRYCITYVLHEDILDLSEAWVHRPIIERFRGYTHQNAVRLHRIAVDQDDCIVFHLQLVNPDVAIQEIVDEIYGALCELLPWPHALEMEHRWYEIRIFTVGDPETVEKDLTAYLEAVRRSKLKDDD*
Ga0066710_1020299171 | 3300009012 | Grasslands Soil | MDDLYNILSVSEDDIVIDVRYCLLSILREDILDLREDWLQRAIMERFQGYTHQNALRLHRIATAHDDCLVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQEITAYIEAVRRSQSDNDG
Ga0066710_1027187911 | 3300009012 | Grasslands Soil | MDDLYNILGVSEDDIVVDVRYCILYILHKDILDLREEWVSRPLMERFQGYTHQNALRLNRMAIAHDDCIVVDLHLVDPDVAIQQIVDDISDVLCACLPWPHTLEGEHPWREVRMMTLGDPASAKQEITAYIEAVRRNKSDNDG
Ga0099829_106534121 | 3300009038 | Vadose Zone Soil | MGRKKVDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIMERFQGYTHQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDMLSELLPWPHAPGVVSPWREVRIMTIGAPESAEQEITAYIEAVRRSQSDNDG*
Ga0099829_117373881 | 3300009038 | Vadose Zone Soil | MDDLSNILGVSEDDIVLDVRYCLLYILRQDILDLREAWVQRVIMERFQGYTHQNALRLHRIAIAHDDCIVFDLQLIDPDVAIQQIVDDIADELCACLLWPHALGGEHPWREVRILTIGDPASAEQEITAYIEAVRRSKSDKDA*
Ga0099828_102314301 | 3300009089 | Vadose Zone Soil | MGRKKVDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIERFQGYTYQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDMLSELLPWPHAPGVVSPWREVRIMTIGAPESAEQEITAYIEAVRRSQSDNDG*
Ga0099828_104709802 | 3300009089 | Vadose Zone Soil | MDDLSNILGVSEDDIVLDVRYCLLYILRQDILDLREAWVQRVIMERFQGYTHQNALRLHRIAIAHDDCIVFDLQLVDPDVAIQAIADHISDVLRELLPWPHTLKMASPWREVRILTIGAPESA
Ga0099828_109911361 | 3300009089 | Vadose Zone Soil | LYVLREGLLDLREDWVSRPIMERFQGYTHQNTIRLNRIAIAHDDCIVFDLHIVDPNIAIQQIVDDISNVLCACLPWPHALGVEHPWREVRIVTIGDPASAEQEITAYIEAVRRSTSDNDG
Ga0099827_100188195 | 3300009090 | Vadose Zone Soil | MGSSTMDDLSSILGVSEDDIVLDVRYCLLYILRKDILDLREAWVQRTIMERFQGYTHQNALRLNRIAIAHDNCLVFDLQLVDPDVAIQEIADDISDVLNELLPWPHALGVASPWREVRMLTIGASESAEEEITAYIEAVRRSTSDNDG*
Ga0099827_100762883 | 3300009090 | Vadose Zone Soil | MMDDLYNILGVSEDDIVVDVRYCLLYVLHEDILDLREDWVQRSIMERFQGYTHQNAVRLHRIAVAHDDCIVCDLQLVDPDAAIQEIADDISDELRALLPWPHAPGMAHPWREIRIMTIGAPESAEEEITAYIEAVRRSTSGNDG*
Ga0099827_103777852 | 3300009090 | Vadose Zone Soil | MEKSTMDNLYNILGVSEDDIVVDVRYCLLYVLREGLLDLREDWVSRPIMERFQGYTHQNALRLHHIAVAHDDCLVFDLHLVDPDVAIQQIVDDISDELSALLPWPHALEVEHPWREVRIMTIGDLASAEQEITAYIEAVRRSKSDNDE*
Ga0099827_105716172 | 3300009090 | Vadose Zone Soil | MDDLSNILGVSEDDIVLDVRYCLLYILRQDILDLREAWVQRVIMERFQGYTHQNALRLHRIAIAHDDCIVFDLQLVDPDVAIQAIADHISDVLRELLPWPHTLKMASPWREVRILTIGAPESAEEEITAYIGAVRRSQSDNDG*
Ga0099827_107194612 | 3300009090 | Vadose Zone Soil | RMERRKMNDLYSILSVSADDIVIDVRYCIIYILREGMLDLSEAWVHRPILERFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDISDLLSALLLWPHVLEGEHPWREVRIFTIGAPESAEKELTAYIEAVRRSKPDNGG*
Ga0099827_1109938213300009090Vadose Zone SoilMDDLYNILGVSEDDMVVDLRYCLLYVLRKDILDLREDWVSRLIMARFQGYTHQNALRLHRIAVAHDDCIVFDLHLVDPNVAIQQIVDDISDELSDLLPWPHALGVEHPWREVRIMTIGDPASAEQEMTAYIEAVRRSTSDKQG*
Ga0099827_1189201123300009090Vadose Zone SoilGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIERFQGYTYQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQEITAYIEAVRRSQSDNDG*
Ga0111539_1097064123300009094Populus RhizosphereMERRKMHDLYNILGVSEDDIVVDVRYCLLYVLRKDILDLREDWIHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0114129_1043901823300009147Populus RhizosphereMERRKMHDLYNILGVSEDDIVVDVRYCLLYVLRKDILDLREDWLHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0075423_1218297213300009162Populus RhizosphereMERRKMHDLYNILGVSEDDIVVDVRYCLLYVLRKDILDLREDWIHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAAHPWREVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0114945_1053350413300009444Thermal SpringsMERRKMNDLYSILSVSADDIVIDVRYCIIYVLREGILDLSEAWVHRPIIERFSGYTHQNAIRLHRIAVAHDDCLVFDLQLVDPDVAIQEIVDDISDLLRERLPWPHALEGEPPWREVRICTIGAPESAEQDLTAYIEAVRRSKPDNDG
Ga0114944_101875833300009691Thermal SpringsMERSKMNDLYNTLGVSEDDIVVDVRYCILYVLREDILDLREGWVQRSIMERFRGYRHQNAIRLHRIAVAHDDCIVFDLQLVDPDVAIQQIVDDISDVLSALLPWPHASGTAHPWCEVRILTIGAPESAEEEITAYIEAVRRSTSDNDG*
Ga0114944_118153213300009691Thermal SpringsGYCLIYVLREDILDLREAWVQRAIMERFQGYTHQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDELSALLAWPHVLGMASPWREVRMLTIGAPESAEEEITAYIEAVRRSTLDNDK*
Ga0126384_1015539523300010046Tropical Forest SoilMERRQMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVNDIADVLRELLPWPHARAAEHPWREIQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0126382_1073069413300010047Tropical Forest SoilMERRQMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARTAEHPWREIQIVTI
Ga0126376_1042858613300010359Tropical Forest SoilMERRQMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRALLPWPHARAAEHPWREVQIVTI
Ga0126372_1296742213300010360Tropical Forest SoilMERRQMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRALLPWPHARAAEHPWREVQIVTIGAPESA
Ga0126377_1003542843300010362Tropical Forest SoilMERRQMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRMAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0126379_1152141023300010366Tropical Forest SoilMERNKMHDLYTILGVSEDDIVVDVRYCLLYALRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRMAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREIQIVTIGAPESAEQEILAYIEAVRRSTLDHDG*
Ga0137392_1152542113300011269Vadose Zone SoilMDDLSNILGVSEDDIVLDVRYCLLYILRQDILDLREAWVQRVIMERFQGYTHQNALRLHRIAIAHDDCIVFDLQLVDPDVAIQAIADHISDVLRELLPWPHTLKMASPWREVRILTIGAPESAEEEITAYIGAVRRSQSDN
Ga0137388_1194059613300012189Vadose Zone SoilMMDDLYNILGVSEDDIVVDVRYCLLYVLHEDILDLREDWVQRSIMERFQGYTHQNAVRLHRIAVAHDDCIVCDLQLVDPDAAIQEIADDISDELRALLPWPHAPGMAHPWREIRIMTIGAPESAEEEITAYIEAVRRSTSGSDG*
Ga0137365_1000931043300012201Vadose Zone SoilMERSTMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLHEDWVSRLIMERFQDYTHQNALRLNRIAIAHDDCIVFDLHLVDPDVAIQQIVDDISDELSDLLPWPHALGVEHPWREVRMMTIGDPASAEQEMTAYIEAVRRSMSDKQG*
Ga0137365_1048777723300012201Vadose Zone SoilMDDLSNILGVSEDDIVLDVRYCLLYLLRQDILDLREAWVQRVIMERFQGYTHQNALRLHRIAIAHDDCIVFGLQLVDPDVAIQAIADHISDVLRELLPWPHTLKMASPWREVRILTIGAPGSAEEEITAYIEAVRCSQSDNDG*
Ga0137374_1043588023300012204Vadose Zone SoilMARRKMPDLYNIWGVSEDDMVVDVRYGLLYALRKDSLDLREDWVHRAIMERCRGYTHQKAIRLHRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRAILPWPHARAAEHPWREVQIVTIGAPGAAEQEIMAYIEAVRRSRLDKDGEWYE
Ga0137362_1175203813300012205Vadose Zone SoilMHDLYSILSVSADDIVIDIHYCIICVLREGMLDLSEAWVHRPILERFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPTLEVEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNGG*
Ga0137380_1087589813300012206Vadose Zone SoilMGSSTMDDLSNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIMERFQGYTHQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDMLSALLPWPHAPGVVSPWREVRIMTIGAPESAEQEITAYIEAVRRSQSDNDG*
Ga0137380_1105010223300012206Vadose Zone SoilMERRKMHDLYSILSVSADDIVIDIHYCIIYVLREGMLDLSEAWVHRPIIERFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPTLEVEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNGGELPLSQVASEDT*
Ga0137381_1062827613300012207Vadose Zone SoilMERRRMHDLYSILSVSADDIVIDIHYCIIYVLREGMLDLSEAWVHRPIIERFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPTLEGEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNGGELPLSQVASEDT*
Ga0137381_1081853623300012207Vadose Zone SoilMGSSTMDDLSNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRVIMERFQGYTHQNALRLHRIAVAHDDCLVFDLHLVDPDVAIQQTVDDISDVLRECLPWPHALGVEHPWREVRIMTIGDPASAEQEMTAYIEAVRRSMSDKQG*
Ga0137377_1069150323300012211Vadose Zone SoilTMNDLYTLLGVSEDDMVVDLRYCLLYVLRKDSLDLREDWVSRPIMERFQGYTHQNALRLNRIAVAHDDCLVFDLQLIDPNVAIQQIVDDISDVLCECLPWPHALEVKHPWREVQILTIGDPASAEQEITAYIEAVRRSTSDNDG*
Ga0137369_1077315313300012355Vadose Zone SoilMERRKMNDLHSILSVSADDIIIDVRYCIIYVLREGILDLSEMWVHRPIIERFRGYTHQNAIRLNRLAVAHDDCLVFDLHLVDPDVAIQQIVDDISDVLRKCLPWPHALGVESPWREVQIVTIGDPASAEQEITAYIEAVRRCKSENEG
Ga0137368_1012386943300012358Vadose Zone SoilYGILYVLRKDSLDLREAWVSRLIMERFQSYTHQNALRLHRIAVAHDDCIVFDLHLVDPDVAIQQIVDDISDELSALLPWPHALEVEHPWREVRIMTIGDPASAEQEITSYIEAVRRSTSDNDG*
Ga0137385_1049987323300012359Vadose Zone SoilMGSSTMDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIMERFQGYTHQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPEAAEQEITAYIEAVRRSQSDNDG*
Ga0137385_1108895313300012359Vadose Zone SoilIYFCATPYNILGVSEDDIVLDVRYCLLYVLRKDILDLHEDWVSRLIMERFQDYTHQNALRLNRIAIAHDDCIVFDLHLVDPDVAIQQIVDDISDELSDLLPWPHALGVEHPWREVRIMTIGDPASAEQEMTAYIEAVRRSMSDKQG*
Ga0137385_1131852113300012359Vadose Zone SoilLLLLAATEYAREWVKFFGRCKQRMERSTMDDLYNILGVSEEDIVVDLRYGILYVLRKDSLDLREAWVSRLIMERFQSYTHQNALRLHRIAVAHDDCLVFDLHLVDPDVAIQQIVDDISDVLCACLPWPHTLEGEHPWREVRMMTLGDPASAKQEITAYIEAVRRNKSDNDG*
Ga0137375_1057979413300012360Vadose Zone SoilARIGRSTMDDLSNILGVPEDDMVVDVRYGLLYVLREGILDLREDWVSRPIMERFQGYTHQNAIRLNRLAVAHDDCLVFDLHLVDPDVAIQQIVDDISDVLRKCLPWPHALGVESPWREVQIVTIGDPASAEQEITAYIEAVRRCKSENEG*
Ga0137375_1075741623300012360Vadose Zone SoilMARRKMPDLYNIWGVSEDDMVVDVRYGLLYALRKDSLDLREDWVHRAIMERCRGYTHQKAIRLHRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRAILPWPHARAAEHPWREVQIVTIGAPGAAEQEIMAYIEAVRRARLDKDGEWYEPRCP
Ga0137360_1158050513300012361Vadose Zone SoilVIDVRYCIIYVLREDILDLSEAWVHRPIIERFRGYTHQNAIRLNRIAVAHDDCLVFDLRLVDPDVAIQEIVDDISDLLSALLLWPHVLEGEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNDG*
Ga0137361_1073944213300012362Vadose Zone SoilRSTMDDLSTILGVSEDDMVVDVRYCLLYVLRKDILDLREDGVARLIMERFQDYTHQNALRLHHIAVAHDDCLVFDLQLIDPNIAIQQIVDDISDELCACLPWPHALEVKHPWREVRIMTIGDPAPAEQEITAYIEAVRRSMSDKQG*
Ga0137390_1123545513300012363Vadose Zone SoilMEKSTMDNLYNILGVSEDDIVVDVRYCLLYVLREGLLDLREDWVSRPIMERFQGYTHQNTIRLNRIAIAHDDCIVFDLHIVDPNIAIQQIVDDISNVLCACLPWPHALGVEHPWREVRIVTIGDPASAEQEITAYIEAVRRSKSDKDA*
Ga0137373_1063816513300012532Vadose Zone SoilMDDLSNILGVPEDDMVVDVRYGLLYVLREGILDLREDWVSRPIMERFQGYTRQNAIRLNRLAVAHDDCLVFDLHLVDPDVAIQQIVDDISDVLRKCLPWPHALGVESPWREVQIVTIGDPASAEQEITAYIEAVRRCKSENEG*
Ga0137373_1063882013300012532Vadose Zone SoilVSADDIIIDVRYCIIYVLREGILDLSEMWVHRPIIERFRGYTHQNAIRLNRIAVAHDDCLVFDLQLVDPDVAIQEIVDDISDLLSELLPWPHALEVEHPWREVRIFTIGAPESAEKDLTAYIEAVRRSKPDKDG*
Ga0137397_1081401723300012685Vadose Zone SoilMDDLYHILGVSEDDMVVDLRYGILYVLRKDILDLREAWVSRLIMERFQGYTHQNAIRLHRIAVAHDDCLVFDLHLLATDVAIQQIVDDISDELSALLPWPHTLEVEPPWREIRIVTLGAPESAEQEITAYIEAVRRSTSDNDG*
Ga0137416_1194640413300012927Vadose Zone SoilMERRKMHDLYSILSVSADDIVIDIRYCIIYVLREGMLDLSEAWVHRPILEKFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPNVAIQEIVDDIRDLLRELLPWPPTLEVEHPWREVRICTIGAPEAAE
Ga0134089_1052785213300015358Grasslands SoilMERSTMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLHEDWVSRLIMERFQDYTHQNALRLNHIAIAHDDCIVFDLHLVDPDVAIQQIVDDISDELSDLLPWPHALGVEHPWREVRMMTIGDPASAEQ
Ga0182033_1078872313300016319SoilEDDIVVDVRYCLLYVLRKDILDLREDWVHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIAEVLRELLSWPHARAAEHPWRAVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG
Ga0134083_1034463113300017659Grasslands SoilMDDLYNILGVSEDDIVLDVRYCLLYVLRKDILDLHEDWVSRLIMERFQDYTHQNALRLNRMAIAHDDCIVVDLHLVDPDVAIQQIVDDIADVLCACLPWPPTLGVESPWREVRILTIGDPASAEQEITAYIEA
Ga0184618_1031471423300018071Groundwater SedimentMDDLYNILGVSEDDIVVDVRYCLLYVLRKDILDLREDWVARLIMARFQGYTHQNALRLHRIAVAHDDCLVFDLHLLAPDVAIQQIVDDISDELSALLPWPHALEVAHPWREVRIMTIGAPESAEEEITAYIEAVRRSKSENES
Ga0184645_100387323300019233Groundwater SedimentMNDLYSILSVSADDIVIDVRYGIIYVLREGMLDLSEAWVHRPIIERFRGYTHQNATRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPAFDVEHPWREVRIFTLGAPESAEKDLTAYIEAVRRSKPDNDG
Ga0187892_1028082913300019458Bio-OozeMNDLYSILSVSADDIVIDIRYGIIYVLREGMLDLHEAWVHRPIIERFRGYTHQNAIRLHRIAVAQDDCLVFDLQLVDPDVAIQEIVDDISALLRELLPWPHALAGEPPWREVRICTIGAPESAERELTAYIEAVRRSKPDNEG
Ga0222625_100386923300022195Groundwater SedimentMVVDLRYGILYVLHEGLLDLREAWVSRLIMERFQGYTHQNALRLHHIAVAHDDCIMFDLHLVDPDVAIQQIVDDISDVLRECLPWPHVLEVASPWREVQIMTIGDPASAEQEITAYIEAVRRSKSDNDG
Ga0212128_1004861333300022563Thermal SpringsMERSKMNDLYNTLGVSEDDIVVDVRYCILYVLREDILDLREGWVQRSIMKRFRGYRHQNAIRLHRIAVAHDDCIVFDLQLVDPDVAIQQIVDDISDVLSALLPWPHASGTAHPWCEVRILTIGAPESAEEEITAYIEAVRRSTSDNDG
Ga0212128_1018224713300022563Thermal SpringsVDDLYSILGVSEDDIVINLRYGVIYVLRAGLLDLSEAWVHRPLLERFSRYTCQNAIRLHRLTVAHDDCLVFDLQLLDPDIAIQEVVDDISAVLRELLPWPHALEVEHPWREIQVCTIGHPESVEQELTAYIEAVRRSKPEDES
Ga0212128_1045843623300022563Thermal SpringsIDIRYCIIYVLREGMLDLSEAWVHRPIIERFRGYTHQNAIRLHRIAVAHDDCLVFALQLVDPNVAIQEIVDHISALLSEVLPWPHAREVEHPWREIRIFTLGAPESAEQEITAYIEAVRRSKPENDEE
Ga0210138_103029813300025580Natural And Restored WetlandsMDDLYSILGVSEDDVVINVRYGVIFILREGLLDLSEAWVHRPLREWCSGYTCQNAIRLHCLTIAHDDCLVFDLQLLAPDVAIQEVVDDISAVLGALLPWPPALEVEHPWREVQIYTIGDPESAEQELTAYIEAIRRSKPEDGA
Ga0207646_1020758233300025922Corn, Switchgrass And Miscanthus RhizosphereMDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIERFQGYTYQNALRLRRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQEITAYIEAVRRSQSDNDG
Ga0208907_11011313300026002Rice Paddy SoilHYCIIYVLREGMLDLSEAWVHRPIIERFRGYTHQNAIRLNRIAVAYDDCLVFALQLVDPDVAIQEIVDDISDLLRELLPWPPTLEGEHPWREVRICTIGAPEAAEKEIRAYIEAVRRSKPDNGG
Ga0209701_1003124523300027862Vadose Zone SoilMGRKKVDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIMERFQGYTHQNALRLHRIAVAHDDCIVCDLQLVDPDVAIQEIADDISDVLSALLPWPHALEVASPWREVRIMTIGAPESAEQEITAYIEAVRRSQSDNDG
Ga0209283_1022920213300027875Vadose Zone SoilMDDLSNILGVSEDDIVLDVRYCLLYILRQDILDLREAWVQRVIMERFQGYTHQNALRLHRIAIAHDDCIVFDLQLVDPDVAIQAIADHISDVLRELLPWPHTLKMASPWREVRILTIGAPESAE
Ga0209283_1055438823300027875Vadose Zone SoilMGRKKVDDLYNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIMERFQGYTHQNALRLHRIAVAHDDCIVFDLQLVDPDVAIQEIADDISDMLSELLPWPHAPGVVSPWREVRIMTIGAPESAEQEITAYIEAVRRSQSDNDG
Ga0209590_1001366243300027882Vadose Zone SoilMDDLSSILGVSEDDIVLDVRYCLLYILRKDILDLREAWVQRTIMERFQGYTHQNALRLNRIAIAHDNCLVFDLQLVDPDVAIQEIADDISDVLNELLPWPHALGVASPWREVRMLTIGASESAEEEITAYIEAVRRSTSDNDG
Ga0209590_1005346723300027882Vadose Zone SoilMMDDLYNILGVSEDDIVVDVRYCLLYVLHEDILDLREDWVQRSIMERFQGYTHQNAVRLHRIAVAHDDCIVCDLQLVDPDAAIQEIADDISDELRALLPWPHAPGMAHPWREIRIMTIGAPESAEEEITAYIEAVRRSTSGNDG
Ga0209486_1095139813300027886Agricultural SoilMQELYSILGVSEDDIDVDIRYCITYVLHEDILDLSEAWVHRPIIERFRGYTHQNAVRLHRIAVDQDDCIVFHLQLVNPDVAIQEIVDEIYGALCELLPWPHALEMEHRWYEIRIFTVGDPETVEKDLTAYLEAVRRSKLKDDD
Ga0207428_1107452813300027907Populus RhizosphereMERRKMHDLYNILGVSEDDIVVDVRYCLLYVLRKDILDLREDWIHRAIMERFRGYTHQNAIRLQRIAVAHDDCIVFDLQLVDPDVAIQQIVDDIADVLRELLPWPHARAAEHPWREVQIVTIGAPESAEQEILAYIEAVRRSTLDHDG
Ga0209583_1041343513300027910WatershedsMDDLSNILGVSEDDIVVDVRYCLLYILREDILDLREAWVQRAIIERFQGYTYQNALRLHRIAVAHDDCIVCDLQLVDPDAAIQEIADDISDVLSALLPWPHALEVASPWREVRILTIGAPESAEQE
Ga0137415_1132279913300028536Vadose Zone SoilMHDLYSILSVSADDIVIDIRYCIIYVLREGMLDLSEAWVHRPILEKFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPNVAIQEIVDDIRDLLRELLPWPPTLEVEHPWREVRICTIGAPEAAE
Ga0299907_1013880443300030006SoilMHDLYSILGVSEDDIDADIRYGITYILREGILDLSEAWVHRPIIERFRGYTHQNAIRLQRIAIDRDDCIVFYLQLVNPDVAIQEIVDEIHAALCELLPWPQALELENPWQEIQIFTVGDPEAAEKDLAAYLEAVRRSKPKDKN
Ga0308197_1044050913300031093SoilLYSILSVSADDIVIDIHYCIIYVLREGMLDLSEAWVHRPILERFRGYTHQNAIRLNRIAVAHDDCLVFALQLVDPDVAIQEIVDDIFALLRERLPWPHALEGEHPWREVRIFTIGAPETAEKDLTAYIEAVRRSKPDNEG
Ga0308199_111311123300031094SoilMDDLSNILGVSEDDIVVDVRYCLLYVLRKDTLDLREDWVSRPIMQRFQGYTHQNALRLHHITVAHDDCLVFDLHLVDLDVAIQQIVDDISDELRQLLPWPHALEVKHPWLEPQGVTDVPLGDPFFGP
Ga0308187_1046756313300031114SoilMDDLSNILGVSEDDMVVDVRYGLLYVLREGILDPREDWVSRPIMERFQGYTHQNALRLNHIAVAHDDCIVVDLHLVDPDMAIQQIVDDISDELCELLPWPHALEVKHPWREVRIMTIGDPASAEQEITAYIEAVRRSTSDKDA
Ga0299914_1048615413300031228SoilMHDLYSILGVSEDDIDADIRYGITYILREGILDLSEAWVHRPIIERFRGYTHQNAIRLQRIAIDRDDCIVFYLQLVNPDVAIQEIVDEIHAALCELLPWPQALELENPWQEIQIFTVGDPEAAEKDLAAYLEAVRRSKSKDKN
Ga0308194_1014945913300031421SoilMDDLSNILGVSEDDIVVDVRYCLLYVLRKDTLDLREDWVSRPIMQRFQGYTHQNALRLHHITVAHDDCLVFDLHLVDRDVAIQQIVDDISDELRQLLPWPHALEVKHPWLEPQGVTDVPLGDPFFGP
Ga0370548_071445_68_4663300034644SoilMERSTMDDLSNILGVSEDDIVVDVRYCLLYVLRKDTLDLREDWVSRPIMQRFQGYTHQNALRLHHITVAHDDCFVFDLHLVDRDVAIQQIVDDISDELRQLLPWPHALEVKHPWLEPQGVTDVPLGDPFFGP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice