NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F089588


Overview

Basic Information
Family ID F089588
Family Type Metagenome / Metatranscriptome
Number of Sequences 109
Average Sequence Length 126 residues
Representative Sequence MDVSDAVETGWTTRTVVSAFVVIALASFCSPTLQTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRERAPPV
Number of Associated Samples 90
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 56.88 %
% of genes near scaffold ends (potentially truncated) 35.78 %
% of genes from short scaffolds (< 2000 bps) 69.72 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.21


Most Common Taxonomy
Group Bacteria (99.083 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (28.440 % of family members)
Environment Ontology (ENVO) Unclassified (44.954 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (34.862 % of family members)





Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 19.87%, β-sheet: 0.00%, Coil/Unstructured: 80.13%

Predicted 3D Structure

pTM-score: 0.21

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
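
The cutoff quoted in the note above can be written as a one-line check. This is a minimal sketch rather than NMPFamsDB code: the names `PTM_CUTOFF` and `is_low_confidence` are hypothetical, and only the 0.7 threshold and the 0.21 score are taken from this page.

```python
# Threshold quoted on this page: models with pTM < 0.7 are labelled low confidence.
PTM_CUTOFF = 0.7  # hypothetical constant name


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F089588.
f089588_is_low = is_low_confidence(0.21)
print(f089588_is_low)
```

The comparison is strict, matching the "pTM < 0.7" wording, so a score of exactly 0.7 would not be flagged.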



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF07549 | Sec_GG | 25.69
PF01699 | Na_Ca_ex | 11.01
PF01370 | Epimerase | 3.67
PF01904 | DUF72 | 1.83
PF00583 | Acetyltransf_1 | 1.83
PF02355 | SecD_SecF | 1.83
PF12847 | Methyltransf_18 | 0.92
PF05988 | DUF899 | 0.92
PF13847 | Methyltransf_31 | 0.92
PF01168 | Ala_racemase_N | 0.92
PF02254 | TrkA_N | 0.92
PF12680 | SnoaL_2 | 0.92
PF12681 | Glyoxalase_2 | 0.92
PF00999 | Na_H_Exchanger | 0.92
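
The frequencies in this table are percentages of the 109 family scaffolds, so each can be converted back to an approximate scaffold count. The sketch below assumes the percentages are simple rounded fractions of 109, which the column header suggests but the page does not state outright; the function and variable names are hypothetical.

```python
# Number of family scaffolds, as given in the table header.
FAMILY_SCAFFOLDS = 109


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage of scaffolds back to a whole-number count."""
    return round(percent / 100 * total)


# A few rows copied from the Pfam table above.
pfam_frequency = {
    "Sec_GG": 25.69,
    "Na_Ca_ex": 11.01,
    "Epimerase": 3.67,
    "DUF72": 1.83,
    "TrkA_N": 0.92,
}

counts = {name: scaffold_count(pct) for name, pct in pfam_frequency.items()}
for name, count in counts.items():
    print(f"{name}: ~{count} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under that assumption, a frequency of 0.92 corresponds to a single scaffold, so the rows at the bottom of the table each rest on one observation.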

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG0341 | Preprotein translocase subunit SecF | Intracellular trafficking, secretion, and vesicular transport [U] | 27.52
COG0342 | Preprotein translocase subunit SecD | Intracellular trafficking, secretion, and vesicular transport [U] | 27.52
COG0387 | Cation (Ca2+/Na+/K+)/H+ antiporter ChaA | Inorganic ion transport and metabolism [P] | 11.01
COG0530 | Ca2+/Na+ antiporter | Inorganic ion transport and metabolism [P] | 11.01
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 1.83
COG0025 | NhaP-type Na+/H+ or K+/H+ antiporter | Inorganic ion transport and metabolism [P] | 0.92
COG0475 | Kef-type K+ transport system, membrane component KefB | Inorganic ion transport and metabolism [P] | 0.92
COG3004 | Na+/H+ antiporter NhaA | Energy production and conversion [C] | 0.92
COG3263 | NhaP-type Na+/H+ and K+/H+ antiporter with C-terminal TrkAC and CorC domains | Energy production and conversion [C] | 0.92
COG4312 | Predicted dithiol-disulfide oxidoreductase, DUF899 family | General function prediction only [R] | 0.92
COG4651 | Predicted Kef-type K+ transport protein, K+/H+ antiporter domain | Inorganic ion transport and metabolism [P] | 0.92



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.08 %
Unclassified | root | N/A | 0.92 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2088090014|GPIPI_16779604 | All Organisms → cellular organisms → Bacteria | 1408
2088090014|GPIPI_16780301 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter | 2675
2124908045|KansclcFeb2_ConsensusfromContig64328 | All Organisms → cellular organisms → Bacteria | 992
2162886007|SwRhRL2b_contig_491820 | All Organisms → cellular organisms → Bacteria | 945
2170459003|FZ032L002FIPW7 | All Organisms → cellular organisms → Bacteria | 500
2170459003|FZ032L002JEAEU | All Organisms → cellular organisms → Bacteria | 508
2170459009|GA8DASG01D7X6O | All Organisms → cellular organisms → Bacteria | 507
2170459009|GA8DASG02FIJVP | All Organisms → cellular organisms → Bacteria | 501
2170459019|G14TP7Y01DUDY7 | All Organisms → cellular organisms → Bacteria | 713
3300000559|F14TC_102883320 | All Organisms → cellular organisms → Bacteria | 657
3300000955|JGI1027J12803_107401564 | All Organisms → cellular organisms → Bacteria | 664
3300002899|JGIcombinedJ43975_10112043 | All Organisms → cellular organisms → Bacteria | 507
3300002914|JGI25617J43924_10054895 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1452
3300003911|JGI25405J52794_10001438 | All Organisms → cellular organisms → Bacteria | 3913
3300003911|JGI25405J52794_10081913 | All Organisms → cellular organisms → Bacteria | 711
3300004114|Ga0062593_100063339 | All Organisms → cellular organisms → Bacteria | 2395
3300004643|Ga0062591_101352593 | All Organisms → cellular organisms → Bacteria | 703
3300005294|Ga0065705_10011770 | All Organisms → cellular organisms → Bacteria | 2980
3300005294|Ga0065705_10893708 | All Organisms → cellular organisms → Bacteria | 578
3300005332|Ga0066388_100248384 | All Organisms → cellular organisms → Bacteria | 2417
3300005446|Ga0066686_10431664 | All Organisms → cellular organisms → Bacteria | 900
3300005554|Ga0066661_10338121 | All Organisms → cellular organisms → Bacteria | 923
3300005937|Ga0081455_10001083 | All Organisms → cellular organisms → Bacteria | 34289
3300005937|Ga0081455_10102527 | All Organisms → cellular organisms → Bacteria | 2294
3300006871|Ga0075434_100118873 | All Organisms → cellular organisms → Bacteria | 2657
3300009012|Ga0066710_100034164 | All Organisms → cellular organisms → Bacteria | 5996
3300009137|Ga0066709_100207155 | All Organisms → cellular organisms → Bacteria | 2582
3300009143|Ga0099792_11197618 | All Organisms → cellular organisms → Bacteria | 515
3300009162|Ga0075423_11827457 | All Organisms → cellular organisms → Bacteria | 656
3300009553|Ga0105249_11696032 | All Organisms → cellular organisms → Bacteria | 704
3300010043|Ga0126380_10178760 | All Organisms → cellular organisms → Bacteria | 1396
3300010373|Ga0134128_11341209 | All Organisms → cellular organisms → Bacteria | 788
3300010375|Ga0105239_11043495 | All Organisms → cellular organisms → Bacteria | 940
3300010868|Ga0124844_1099025 | All Organisms → cellular organisms → Bacteria | 1106
3300010868|Ga0124844_1209331 | All Organisms → cellular organisms → Bacteria | 710
3300012198|Ga0137364_10820308 | All Organisms → cellular organisms → Bacteria | 703
3300012199|Ga0137383_10175086 | All Organisms → cellular organisms → Bacteria | 1572
3300012200|Ga0137382_10164669 | All Organisms → cellular organisms → Bacteria | 1510
3300012203|Ga0137399_10085816 | All Organisms → cellular organisms → Bacteria | 2401
3300012205|Ga0137362_10123859 | All Organisms → cellular organisms → Bacteria | 2194
3300012210|Ga0137378_11349441 | All Organisms → cellular organisms → Bacteria | 628
3300012285|Ga0137370_10196602 | All Organisms → cellular organisms → Bacteria | 1182
3300012356|Ga0137371_10019066 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae | 5277
3300012362|Ga0137361_11557629 | All Organisms → cellular organisms → Bacteria | 582
3300012582|Ga0137358_10819809 | All Organisms → cellular organisms → Bacteria | 616
3300012907|Ga0157283_10254686 | All Organisms → cellular organisms → Bacteria | 585
3300012908|Ga0157286_10156137 | All Organisms → cellular organisms → Bacteria | 732
3300012922|Ga0137394_10237632 | All Organisms → cellular organisms → Bacteria | 1557
3300012922|Ga0137394_11612872 | All Organisms → cellular organisms → Bacteria | 505
3300012923|Ga0137359_10037245 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 4199
3300012924|Ga0137413_10998358 | All Organisms → cellular organisms → Bacteria | 656
3300012925|Ga0137419_10796937 | All Organisms → cellular organisms → Bacteria | 772
3300012927|Ga0137416_10545860 | All Organisms → cellular organisms → Bacteria | 1003
3300012929|Ga0137404_10034055 | All Organisms → cellular organisms → Bacteria | 3793
3300012948|Ga0126375_11191286 | All Organisms → cellular organisms → Bacteria | 633
3300013296|Ga0157374_10094437 | All Organisms → cellular organisms → Bacteria | 2858
3300013297|Ga0157378_10798349 | All Organisms → cellular organisms → Bacteria | 969
3300013308|Ga0157375_11089268 | All Organisms → cellular organisms → Bacteria | 935
3300015077|Ga0173483_10813331 | All Organisms → cellular organisms → Bacteria | 541
3300015371|Ga0132258_10050567 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 9503
3300015372|Ga0132256_100029263 | All Organisms → cellular organisms → Bacteria | 4971
3300015372|Ga0132256_101524562 | All Organisms → cellular organisms → Bacteria | 779
3300015372|Ga0132256_103503191 | Not Available | 528
3300015373|Ga0132257_103038606 | All Organisms → cellular organisms → Bacteria | 611
3300015374|Ga0132255_101073907 | All Organisms → cellular organisms → Bacteria | 1209
3300018000|Ga0184604_10014256 | All Organisms → cellular organisms → Bacteria | 1760
3300018027|Ga0184605_10287871 | All Organisms → cellular organisms → Bacteria | 745
3300018051|Ga0184620_10265023 | All Organisms → cellular organisms → Bacteria | 577
3300018071|Ga0184618_10336640 | All Organisms → cellular organisms → Bacteria | 644
3300018076|Ga0184609_10105308 | All Organisms → cellular organisms → Bacteria | 1271
3300018081|Ga0184625_10166410 | All Organisms → cellular organisms → Bacteria | 1153
3300018433|Ga0066667_10338023 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1191
3300018433|Ga0066667_10468999 | All Organisms → cellular organisms → Bacteria | 1031
3300019361|Ga0173482_10414630 | All Organisms → cellular organisms → Bacteria | 630
3300019875|Ga0193701_1001212 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 4393
3300019877|Ga0193722_1001866 | All Organisms → cellular organisms → Bacteria | 5023
3300019878|Ga0193715_1000584 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 8683
3300019881|Ga0193707_1006726 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 3984
3300019881|Ga0193707_1102000 | All Organisms → cellular organisms → Bacteria | 856
3300019882|Ga0193713_1019034 | All Organisms → cellular organisms → Bacteria | 2043
3300019883|Ga0193725_1013351 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 2304
3300019885|Ga0193747_1014478 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 1920
3300019886|Ga0193727_1001349 | All Organisms → cellular organisms → Bacteria | 10123
3300019887|Ga0193729_1145130 | All Organisms → cellular organisms → Bacteria | 862
3300020001|Ga0193731_1122214 | All Organisms → cellular organisms → Bacteria | 660
3300020004|Ga0193755_1041841 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 1505
3300020006|Ga0193735_1015387 | All Organisms → cellular organisms → Bacteria | 2396
3300020006|Ga0193735_1181280 | All Organisms → cellular organisms → Bacteria | 519
3300020062|Ga0193724_1000313 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 11672
3300021078|Ga0210381_10029398 | All Organisms → cellular organisms → Bacteria | 1533
3300021344|Ga0193719_10000363 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 16658
3300021344|Ga0193719_10082756 | All Organisms → cellular organisms → Bacteria | 1395
3300021411|Ga0193709_1000164 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus | 27777
3300021411|Ga0193709_1097738 | All Organisms → cellular organisms → Bacteria | 634
3300022694|Ga0222623_10292865 | All Organisms → cellular organisms → Bacteria | 625
3300022694|Ga0222623_10385485 | All Organisms → cellular organisms → Bacteria | 533
3300022756|Ga0222622_10942082 | All Organisms → cellular organisms → Bacteria | 634
3300026538|Ga0209056_10002299 | All Organisms → cellular organisms → Bacteria | 20713
3300026548|Ga0209161_10052974 | All Organisms → cellular organisms → Bacteria | 2641
3300028784|Ga0307282_10252898 | All Organisms → cellular organisms → Bacteria | 847
3300028807|Ga0307305_10000349 | All Organisms → cellular organisms → Bacteria | 16125
3300028828|Ga0307312_10000881 | All Organisms → cellular organisms → Bacteria | 14905
3300028828|Ga0307312_10187619 | All Organisms → cellular organisms → Bacteria | 1325
3300028828|Ga0307312_10420731 | All Organisms → cellular organisms → Bacteria | 878
3300028878|Ga0307278_10191754 | All Organisms → cellular organisms → Bacteria | 912
3300030916|Ga0075386_12179130 | All Organisms → cellular organisms → Bacteria | 1247
3300031720|Ga0307469_10366268 | All Organisms → cellular organisms → Bacteria | 1217
3300032421|Ga0310812_10042454 | All Organisms → cellular organisms → Bacteria | 1721
3300032421|Ga0310812_10514785 | All Organisms → cellular organisms → Bacteria | 538
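
Each scaffold identifier in this list appears to follow the pattern `<Taxon OID>|<scaffold name>`, where the part before the pipe matches a Taxon OID in the Associated Samples table. That reading is inferred from the listing, not documented on the page. Under that assumption, a minimal sketch for splitting an identifier could look like this; `ScaffoldId` and `parse_scaffold_id` are hypothetical names.

```python
from typing import NamedTuple


class ScaffoldId(NamedTuple):
    """The two parts of a scaffold identifier as it appears in the list above."""
    taxon_oid: str
    scaffold_name: str


def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Split 'taxon_oid|scaffold_name' at the first pipe character."""
    taxon_oid, sep, scaffold_name = raw.partition("|")
    if not sep or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, scaffold_name)


# Identifier copied from the scaffold list above.
example = parse_scaffold_id("3300015371|Ga0132258_10050567")
print(example.taxon_oid, example.scaffold_name)
```

Grouping scaffolds by `taxon_oid` is then a way to see which samples contribute more than one scaffold to the family.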




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 28.44%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 16.51%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 7.34%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 5.50%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.59%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 3.67%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 3.67%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.75%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.75%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 2.75%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 2.75%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 2.75%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.83%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.83%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.83%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.92%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.92%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.92%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.92%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.92%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.92%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.92%
Switchgrass, Maize And Mischanthus Litter | Engineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter | 0.92%



Associated Samples

Taxon OID | Sample Name | Habitat Type
2088090014 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
2124908045 | Soil microbial communities from Great Prairies - Kansas assembly 1 01_01_2011 | Environmental
2162886007 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 | Host-Associated
2170459003 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cm | Environmental
2170459009 | Grass soil microbial communities from Rothamsted Park, UK - July 2009 indirect DNA Tissue lysis 0-10cm | Environmental
2170459019 | Litter degradation MG4 | Engineered
3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300002899 | Soil microbial communities from Manhattan, Kansas, USA - Combined assembly of Kansas soil 100-500um Nextera (ASSEMBLY_DATE=20140607) | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300003911 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010375 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG | Host-Associated
3300010868 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (PacBio error correction) | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012907 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S044-104R-1 | Environmental
3300012908 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S089-202R-1 | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated
3300015077 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2) | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental
3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental
3300018081 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019361 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2) | Environmental
3300019875 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s2 | Environmental
3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m1 | Environmental
3300019878 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2 | Environmental
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2 | Environmental
3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2 | Environmental
3300019885 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2 | Environmental
3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2 | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c2 | Environmental
3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2 | Environmental
3300020006 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2 | Environmental
3300020062 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a1 | Environmental
3300021078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redo | Environmental
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2 | Environmental
3300021411 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3c2 | Environmental
3300022694 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coex | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental
3300030916 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome) | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300032421 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3 | Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_02049340 2088090014 Soil ADTNRPISAAVSGFVVIALASLCSPGLQTLHSSTADAQHHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSRRINFPACSLCPPRIHSGIARGSLTLRERAPPV
GPIPI_01480330 2088090014 Soil MDEVKPFASQKIKPYGTDVSDSVDTNRTISAAVSGFVVIAFASFCSPGLQTLHSSTADAQHHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSRRINFPACSLCPPRIHSGIARGSLTLRERAPPV
KansclcFeb2_04321180 2124908045 Soil MDVNEVIEAKWTTRAVVSAFGVIALASFCSPSLQAVHGFTTALQHHQEQFNHKGKIDFCVLAKTPTKVSTRKGSAQQPVQLFYSAGDVSLCRITFSARSLCATGIHSDITRGSLTLRERAPPV
SwRhRL2b_0533.00003580 2162886007 Switchgrass Rhizosphere MDVNEVIEANWTIRAVVSAFGVIALASFCPPSLQAVHGFTTALQHHQEQFNHKGKIDFCVLAKTPTKVSTRKGSAQQPVQLFYSAGDVSLRRITFSARSLCATGIHSDITRGSL
E4A_01748090 2170459003 Grass Soil MNVSEAIEANWTIRAAASAFVVIALASFCSPSLQALHSSTTALQHHQEQFNHKGKINFCVLAKTPTKVCIRKGSAQQPVQLFYSVDNVSLRCINFSPRSLCATGIHSDIARGSLTLRERAPPV
E4A_11271460 2170459003 Grass Soil MNVSEAIEANWTIRAAASAFVVIALASFCSPSLQALHSSTTALQHRQEQFNHKGKINFCVLAKTPTKVCIRKGSAQQPVQLFYCVGNVSLRRINFSPRSLCATGIHSDIARGSLTLRERARRFKAASDLSALLSRRLHRL
F47_09714860 2170459009 Grass Soil MNVSERIEANWTIRAAASAFGVIALASFCSPSLQALLSSTTALQHHQERFNHKGKINFCVLAKTPTKVCIRKGSAQQPVQLFYSVGNVSLRRINFSPRSLCATGIHSDIARGSLTLRERAPPV
F47_02490110 2170459009 Grass Soil GMNVSERIEANWTIRAAASAFGVIALASFCSPSLQALLSSTTALQHHQERFNHKGKINFCVLAKTPTKVCIRKGSAQQPVQLFYSVGNVSLRRINFSPRSLCATGIHSDIARGSLTLRERAPPV
4MG_04831900 2170459019 Switchgrass, Maize And Mischanthus Litter MDVNEVIEANWTIRAVVSAFGVIALASFCSPSLQAVHGFTTALQHHQEQFNQKGKIDFCVLAKTLTKVSIRKGSAQQPVQLFYSAGDVSLRRIKFSARSLCATGIHSDITRGSLTLRERAPPV
F14TC_1028833201 3300000559 Soil SAFVAIALASFCSPTLQTLHSSKTPVQHHQEQFDHKGKIEFCVLAKTPAKVSIRKGSAQQSVQLFYSASDVSLQGLSLSARSICATAIHSEIARGSVTFRERAPPV*
JGI1027J12803_1074015642 3300000955 Soil MDVNDAVDINWTIREVVSAVVVIALASFCSPGLQPLHGSTPPFQHHQEQFDHKGKNNLCVLAKTPTKVGIRKSNAQQPVQFVRSAGDVSLHRLNFSPRSLFATGIHSDIARGSLTLRERA
JGIcombinedJ43975_101120431 3300002899 Soil VETGWTARRMMSAFVAIALASFCSPTLQTLHSSKTPVQHHQEQFDHKGKIEFCVLAKTPAKVSIRKGSAQQSVQLFYSASDVSLQGLSLSARSICATAIHSEIARGSVAYRERAPPV*
JGI25617J43924_100548951 3300002914 Grasslands Soil MDVSDADETGWTTRTVVSAFVVIALASFCSPTLKTLHSSATPVQHHQEQLDHKGKIEFSALTKTPTKVCTRKGSPQQLVQLFYSASHVSLRRPNLSARSICATAIHSEIARGNVTFRERAPPV*
JGI25405J52794_100014384 3300003911 Tabebuia Heterophylla Rhizosphere MKWTNKAAVSAFVLIALTSFCAPSLQTFYNSTTGAQRHRQQFNDKGKVNFCVLAKSPTKVCIRKSSAQQLVQLFYSIREVSLRRINFPARSVCLVKIHSGITRGSRTLRERGPPV*
JGI25405J52794_100819132 3300003911 Tabebuia Heterophylla Rhizosphere VSAFVLIALASFCVPSLQTFYSSTTGVQRHRQQFNDKGKVNFCVLAKSPTKLGIRKSSAQQLVQLFCSVGEVSLRRINFPAHSVCLVNIHSGITRGSLTLRERAPPV*
Ga0062593_1000633392 3300004114 Soil MDVSDAVETAWTTRTVVSAFVAIALASFCSPTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTSTKVCTRKDSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRKRAPPV
Ga0062591_1013525931 3300004643 Soil MDVSDAVETAWTTRTVVSAFVAIALASFCSPTLHSSTTPVQHHQEQFDQKGKIEFCVLAKTSTKVCTRKDSAQQSVQLFYSPSDVSPHRLNLSARSICATAIHSEIARGSVTF
Ga0065705_100117702 3300005294 Switchgrass Rhizosphere MDVNEVIEANWTXRAVVSAFGVIALASFCSPSLQAVHGFTXALQHHQEQFNHKGQIDFCVLAKTPTKVSIRKGSAQQPVQLFNSAGDVSLRRIKFSARSLCATRIHSDIARGSLTLRERAPPV*
Ga0065705_108937082 3300005294 Switchgrass Rhizosphere VSEGIEANWTIRAAVSAFGVIALASFCSPSLQALHSSTTALQHHQKQFNHKGKINFCVLAKTPTKVCIRKGSAQQPVQLFYFAGDVSLRRINFSPRSLCATGI
Ga0066388_1002483842 3300005332 Tropical Forest Soil MKSFASKKVKRYGTDVTDAVETNKTIRAAVSAFVVIALASFCSPSLQILNSSATDGQPHREQFNDKGKINFCVLAKTPAKVCIRKGCSQQPVQLFYSADNVSLRRTNFSPRSLCTTEIPSAITRGSLTLRERAPPV*
Ga0066686_104316642 3300005446 Soil MDVMEAVETDWTRRTVVNAFVVIALASLCSPTLKTLHSSATPVQDHQEQFDHKGKIEFCVLTKTPIKVCTRKGSAQRLVQLVCFAGDVSLHRLNFSPRSLCATEIHSDITRGSLTLRERAPPV*
Ga0066661_103381211 3300005554 Soil MDVSEPVKTGWTTRTVMSAFVVIALASFCSPTLKTLHSSTTPVQHHQEQFDHKGKIEFCVLTKTPTKVCTRKGSAQQLVQLFYSASDASLRRLDLSARSICATAIHSEIARGNVTFRERAPPV*
Ga0081455_1000108321 3300005937 Tabebuia Heterophylla Rhizosphere VSNGVEMKWTNKAAVSAFVLIALTSFCAPSLQTFYNSTTGAQRHRQQFNDKGKVNFCVLAKSPTKVCIRKSSAQQLVQLFYSIREVSLRRINFPARSVCLVKIHSGITRGSRTLRERGPPV*
Ga0081455_101025272 3300005937 Tabebuia Heterophylla Rhizosphere VSNAVEMKWTIKAGVSAFVLIALASFCVPSLQTFYSSTTGVQRHRQQFNDKGKVNFCVLAKSPTKLGIRKSSAQQLVQLFCSVGEVSLRRINFPAHSVCLVNIHSGITRGSLTLRERAPPV*
Ga0075434_1001188732 3300006871 Populus Rhizosphere MDVSDAVDINRTIREVVSAVVVIALASFCSPGLQPLHGSTPPFQHHQEQFDHKGKNNLCVLAKTPTKVGIRKSNAQQPVQFVRSAGDVSLHRLNFSPRSLFATGIHSDIARGSLTLRERAPPV*
Ga0066710_1000341643 3300009012 Grasslands Soil MDVSEPVKTGWTTRTVMSAFVVIALASFCSPTLKTLHSSTTPVQHHQEQFDHKGKIEFCVLTKTPTKVCTRKGSAQQLVQLFYSASDASLRRLDLSARSICATAIHSEIARGNVTFRERAPPV
Ga0066709_1002071552 3300009137 Grasslands Soil MDVSEPVKTGWTTRTVMSAFVVIALASFCSPTLKTLHSSTTPVQHHQEQFDHKGKIEFCVLTKTPTKVCTRKGSAQQLVQLFYSASDASLGRLDLSARSICATAIHSEIARGNVTFRERAPPV*
Ga0099792_111976181 3300009143 Vadose Zone Soil LPHNTTDRISIQLDKRFASENIKPYGMNVSEGIEANWTIRAVVSAFGLIALASFCSPSLQPLHSSTTALHHHQEQFNHKGKINFCVLAKTPTKVCIRKGSSQQPVQLFYSAGNVSLRRINFSARSLCATRIHSDIARGSLTLR
Ga0075423_118274571 3300009162 Populus Rhizosphere MDVSVDINRTIREVVSAVVVIALASFCSPGLQPLHGSTPPFQHHQQQFDHKGKNNLCVLAKTPTKVGIRKSNAQQPVHFVRSAGDVSLHRLNFSPRSLFATGIHSDIARGSLTLRERAPPV*
Ga0105249_116960322 3300009553 Switchgrass Rhizosphere MDEVKPFASQKIKPYGTDVSDSVDTNRTISAAVSGFVVIALASFCSPGLQTLHSSTADAQHHREQFNHKGKINLWVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSRRINFPACSLCPPRIHSGIARGSLTL
Ga0126380_101787602 3300010043 Tropical Forest Soil MQSADGRMKSFASKKVKRYGTDVTDAVETNKTIRAAVSAFVVIALASFCSPSLQILNSSATGRQPHREQFNDKGKINFCVLAKTPAKVCIRKGCSQQPVQLFYSADNVSLRRTNFSPRSLCTTEIPSAITRGSLTLRERAPPV*
Ga0134128_113412092 3300010373 Terrestrial Soil TVSGHERDHIHGVGVMQSADARGKLFASQKIKPYGTDVREAVDTNRTINAAVSGFVVIALASLCSPGLQTLHSSTADAQHHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSRRINFPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0105239_110434951 3300010375 Corn Rhizosphere PASSTEKNCCGAETYEHTKFAANESAKISVAFSITMSGHERDHIHGVGVMQSADARGKLFASQKIKPYGTDVSDSVDTNRTISAAVSGFVVIALASFCSPGLQTLHSSTADAQHHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSHRVNFPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0124844_10990252 3300010868 Tropical Forest Soil MQSADGRMKSFASKKVKRYGTDVTDAVETNKTIRAAVSAFVVIALASFCSPSLQILNSSATDGQPHREQFNDKGKINFCVLAKTPAKVCIRKGCSQQPVQLFYSADNVSLRRTNFSPRSLCTTEIPSAITRGSLTLRERAPPV*
Ga0124844_12093312 3300010868 Tropical Forest Soil MQSADGRMKSFASKKVKRYGTDVTDAVETNKTIRAAVSAFVVIALASFCSPSLQILNSSATDGQPHREQFNDKGKINFCVLAKTPAKVCIRKGCSQQPVQLFYSADNVSLRRTNFSPRSLCTTEIPSAITRGSLTLRERAPP
Ga0137364_108203082 3300012198 Vadose Zone Soil MDVSDAVETGWTTRPVVSAFVVIALASFCSPTLQTLHSSRTPVQHHQEQFDHNGKIEFCVLAKTPTKVSNRKGSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSV
Ga0137383_101750862 3300012199 Vadose Zone Soil MDVNEVIEANWTTRAVVSAFGVIALASFCSPSLQAVHGFTTALQHQQEQFNHNGKIDFCVLAKTPAKVSIRKGNAQQPVQLFYSAGDVSLRRIKYSARSLCATRIHSDIARGSLTLRERAPPV*
Ga0137382_101646692 3300012200 Vadose Zone Soil MGEVKPFASQKIKPYGTDVSDAVDTKWIGRAAVNAFVVIALASFCSPGLQTLSSSTAGAHHRAQFNHKGKINLCVLAKTPTKVCIRKGGAQKPVQVFYSAGDISSRRINFPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0137399_100858161 3300012203 Vadose Zone Soil SWIKRFASENIKPYGIDVSEAIEANRTIRAVVSAFGLIALASFCSPSLQALHSSTTALHHHQEQFNHKGKINFCVLAKTSTKVCIRKGSAQQPVQLFYSAGNVSLRRINFSARSLCATRIHSDIARGSLTLRERAPPV*
Ga0137362_101238592 3300012205 Vadose Zone Soil MDVSDAVETGWTTRTVVSAFVVIALASFCSPTLQILHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRERAPPV*
Ga0137378_113494411 3300012210 Vadose Zone Soil VSAFGVIALASFCSPSLQAVHGFTTALQHQQEQFNHNGKIDFCVLAKTPAKVSIRKGNAQQPVQLFYSAGDVSLRRIKYSARSLCATRIHSDIARGSLTLRERAPPV*
Ga0137370_101966022 3300012285 Vadose Zone Soil MDVSDAVETGWTTRTVVSAFVAIALASFCSPTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTSTKVCTRKDSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRKRAPPV
Ga0137371_100190662 3300012356 Vadose Zone Soil MDVNEVIEANWTTRAVVSAFGVIALASFCSPSLQAVHGFTTALQHQQEQFNQNGKIDFCVLAKTPAKISIRKGNAQQPVQLFYSAGDVSLRRIKYSARSLCATRIHSDIARGSLTLRERAPPV*
Ga0137361_115576292 3300012362 Vadose Zone Soil MDVSEAVETGWTTRTVVSAFVVIALASFCSPTLQILHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDISLHRLSLSARSICATAIHSEIARGSVTFRERAPPV*
Ga0137358_108198091 3300012582 Vadose Zone Soil MDVSDGVETGWTTRTLVSAFVGIALASFCSPTLQTEHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTNVCTRKGSAQQSVQFFYSASDVSLHRPNLSAHSFCATAIHSEIARGSVTFRERA
Ga0157283_102546861 3300012907 Soil MDEVTPFASQKIKPYGTDVSDSADTNRPISAAVSGFVVIALASLCSPGLQTLHSSTADAQHHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSHRVNFPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0157286_101561371 3300012908 Soil MDVNEVIEANWTIRAVVSAFGVIALASFCSPSLQAVHGFTNALQHHQEQFNHKGKINFCVLAKTPTKVSIRKGSAQQPVQLFYSAGDVSLRRIKFSARSLCATRIHSDIARGSLTLRERAPPV*
Ga0137394_102376322 3300012922 Vadose Zone Soil MNVSEGIEANWTIRAVVSAFGLIALASFCSPSLQALHSSTTALHHHQEQFNHKGKINFCVLAKTPTKVCIRKSSAQQPVQLFYSAGNVSLRGINFSARSLCATRIHSDITRGSLTLRERAPPV*
Ga0137394_116128721 3300012922 Vadose Zone Soil FASENIKPYGMNVSEGIEANWTIRAVVSAFGLIALASFCSPSLQTLHGSITALQHHQEQFNHKGKINFCVLAKTPTKVCIRKSSAQQPVQLFYSTGNVSLRGINFSARSLCATRIHSDITRGSLTLHERAPPV*
Ga0137359_100372453 3300012923 Vadose Zone Soil MDVSDTVETGCTTRTVVSAFVVIALACFCSPTLQTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCPRKGSAQQSVQLFYSASDVSLHRLNLSARSICSAAIHSEIARGSVTFRERAPPV*
Ga0137413_109983581 3300012924 Vadose Zone Soil LPHNTTDRISIQLDKRFASENIKPYGMNVSEGIEANWTIRAVVSAFGLIALASFCSPSLQALHSSTTALHHHQEQFNHKGKINFCVLAKTPTKVCIRKSSAQQPVQLFYSAGNVSLRGINFSARSLCAAGIPSAITRGSLTLRERAPPV*
Ga0137419_107969372 3300012925 Vadose Zone Soil VSEAIEANRTIRAVVSAFGLIALASFCSPSLQTLHGSITALQHHQEQFNHKGKINFCVLAKTPTKVCIRKSSAQQPVQLFYSAGNVSLRRINFSARSLCATGIPSAITRGSLTLRERAPPV*
Ga0137416_105458602 3300012927 Vadose Zone Soil MNVSEGIEANRTIRAAVSAFGVIALASFCSPSLQTLHGSITALQHHQEQFNHKGKINFCVLAKTPTKVCIRKSSAQQPVQLFYSAGNVSLRRINFSARSLCATRIHSDIARGSLTLRERAPPV*
Ga0137404_100340553 3300012929 Vadose Zone Soil LESCNQRIDEAKPFASQKIKPYETDVSDSVDTNRTISAAVSGFVAIALASFCSPGLQTLSNSTAGAHHRAQFNDKGKINLCVLAKTPTKICVRKGAQQPVQLSYSAGDVSLRHINFSACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0126375_111912862 3300012948 Tropical Forest Soil MQSADAQMKSFASKKVKRYGTDVTDAVETNKTIRAAVSAFVVIALASFCSPSLQILNSSATDGQPHREQFNDKGKINFCVLAKTPAKVCIRKGCSQQPVQLFYSADNVSLRRTNFSPRSLCTTEIPSAITRGSLTLRER
Ga0157374_100944372 3300013296 Miscanthus Rhizosphere MDEVKPFASQKIKPYGTDVSDSVDTNRTISAAVSGFVVIALASFCSPGLQTLHSSTADAQHHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSRRINFPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0157378_107983492 3300013297 Miscanthus Rhizosphere MDEVKPFASQKIKPYGTNVSDAVDTNWTTRAAVSAFVVIALASFCSLGLQTLHSPTAGAQHHRGQFNDKGKINLCVLAKTPTKVCIRKGAAQQTVKLFYYAGDISLRRINSPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0157375_110892682 3300013308 Miscanthus Rhizosphere MGEVKPFASQKIESYGTDVSDAVDTNRTVRAAVSAFVVIALASFCSPGLQTLHSSTSNAQHHREQLNYKGKINLCVFAKTPTKVCIRKGGPQQPVQVFYSAGDTSSRRINFSACSLCPPRIRSGIARGSLTLRERAPPV*
Ga0173483_108133311 3300015077 Soil KPYGTDVSDAAETNWTIRAGVSAFVAIALASFCSPSLQTLNGSITGLQHHQEQFNHKGKIDFCVLAKTPTKVSIRKGSAQQPVQLFYSAGDVSLRRIKFSARSLCATGIHSDITRGSLTLRERAPPV*
Ga0132258_100505675 3300015371 Arabidopsis Rhizosphere MDEVKPFASQKIKPYGTNVRDAVDTNWTTRAAVSAFVVIALASFCSLGLQTLHSPTAGAQHHRGQFNGKGKINLCVLAKTPTKVCIRKGAAQQTVKLFYYAGDISLRRINSPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0132256_1000292632 3300015372 Arabidopsis Rhizosphere MGEVKPFASQKIESYGTDVSDAVDTNRTVRGAVSAFVVIALASFCSPGLQTLHSSTSNAQHHREQLNYKGKINLCVFAKTPTKVCIRKGGPQQPVQVFYSAGDTSSRRINFSACSLCPPRIRSGIARGSLTLRERAPPV*
Ga0132256_1015245622 3300015372 Arabidopsis Rhizosphere MDEVKPFASQKIKPYGTNVSDAVDTNWTTRAAVSAFVVIALASFCSLGLQTLHSPTAGAQHHRGQFNGKGKINLCVLAKTPTKVCIRKGAAQQTVKLFYYAGDISLRRINSPACSLCPPRIHSGIARGSLTLRERAPPV*
Ga0132256_1035031911 3300015372 Arabidopsis Rhizosphere VSGGVDRKWNIKAAVSAFVLIALGSFCSPSLQTFYGSTAGAQRNRQEFNDNGKVNLCVLAKTSTKVCVRKSNAQQLVQLFSTVGEISSRSVNFPAHSLCPLQSYSGITGGSLTL
Ga0132257_1030386062 3300015373 Arabidopsis Rhizosphere MDEVMPFASQKIKPYGTNVSDAVDTNWTTRAAVSAFVVIALASFCSLGLQTLHSPTAGAQHHRGQFNGKGKINLCVLAKTPTKVCIRKGAAQQTVKLLYYAGDISLRRINSPACSLCPPRIHSGIARGSLT
Ga0132255_1010739072 3300015374 Arabidopsis Rhizosphere MDEVMPFASQKIKPYGTNVRDAVDTNWTTRAAVSAFVVIALASFCSLGLQTLHSPTAGAQHHRGQFNGKGKINLCVLAKTPTKVCIRKGAAQQTVKLFYYAGDISLRRINSPACSLCPPRIHSGIARGSLTLRER
Ga0184604_100142563 3300018000 Groundwater Sediment MGEVKPFASQKIKPYGTDVSDAVDTKWIGGAAVNAFVVIALASFCSPGLQTLSSSTAGAHHPAQFNHKGKINLCVLAKTPTKVCIRKGGAQKPVQVFYSAGDISSRRINFPACSHCSPRIHSGIARGSLTLRERAPPV
Ga0184605_102878712 3300018027 Groundwater Sediment MDVSDAVETGWTTRTVVSAFVVIALASFCSPTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVFTRKGSAQQSVQLFYSASDVSLRRPTLSALSICATEIHSEIARGNVTFRERAPPV
Ga0184620_102650231 3300018051 Groundwater Sediment FVVIALASFCSPTLQTLHSSTTPVQHHQEQFDHNGKIEFCVLAKTPAKVSIRKGSAQQSVQLFYSASDVSLRRPTLSALSICATAIHSEIARGNVTFRERAPPV
Ga0184618_103366401 3300018071 Groundwater Sediment MDVSDAVETGWTTRTVVSAFVVIALASFCSPTLQTLHSSTTPVQHHQEQFDHKGKIEFCVLTKTSTKVCTRKGSPQQLVQVFYSASHVSLRRPNLSARSICATAIHSEIARGSVTFRKRAPPV
Ga0184609_101053082 3300018076 Groundwater Sediment MALALFSPGLQTLHSSTADAQHHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSRRINFSACSLCPPRIYSGIARGSLTLRERAPPV
Ga0184625_101664101 3300018081 Groundwater Sediment KWIGRAAVNAFVVIALASFCSPGLQTLSSSTAGAHHRAQFNHKGKINLCVLAKTPTKVCIRKGGAQKPVQVFYSARDISLRRINFRACSLCPPRIHSGIARGSLTLRERAPPV
Ga0066667_103380232 3300018433 Grasslands Soil VSDAVEIGWTTRTMVSAFVAIALASFCSPALQTLQSSTTRVQHRQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDVSLHRLNLPTRSICATAIHSEIARGNVTFRER
Ga0066667_104689992 3300018433 Grasslands Soil MEAVETDWTRRTVVNAFVVIALASLCSPTLKTLHSSATPVQDHQEQFDHKGKIEFCVLTKTPIKVCTRKGSAQRLVQLVCFAGDVSLHRLNFSPRSLCATEIHSDITRGSLTLRERAPPV
Ga0173482_104146301 3300019361 Soil MDVNEVIEANWTIRAVVSAFGVIALASFCSPSLQAVHGFTTALQHHQEQFNQKGKIDFCVLAKTPTKVSIRKGSAQQPVQLFYSAGDVSLRRIKFSARSLCATRIHSDIARGSLTLRERAPPV
Ga0193701_10012122 3300019875 Soil MDVRDAFETGSTTRTVVSAFVVIALASLSSPTLKTVHRSARSVQHHQEQFDHKGKMELCVLAKTSTKVCSRKGSAQQSFQVFYSVSDVSLRRLNLSARSICATAIHSEIAGGNSTFRERAPPV
Ga0193722_10018664 3300019877 Soil MDVSDAVETGRTTRTVVSAFLVIALASFCSPALQTIHSSTSPTQHHQKQFDQKGEIEFCVLAKTPAKVSTRKSSAQQLIQLFYSASDVSLQRLSLSARSICATAIHSEIARGSVPFRERAPPA
Ga0193715_10005843 3300019878 Soil MDVSDADETGWTTRTVVSAFVVIALASFCSPTLKTLHSSATPVQHHQEQLDHKGKIEFSALTKTPTKVCTRKGSPQQLVQLFYSASHVSLRRPNLSARSICATAIHSEIARGNVTFRERAPPV
Ga0193707_10067262 3300019881 Soil MNVSEAIEANRTIRAVVSAFGLIALASFCSPSLQTLHGSITALQHHQEQFNHKGKINFCVLAKTPTKLCIRKSSAQQPVQLFHSAGNVSLRRINFSARSLCATGIPSAITRGSLTLRERAPPV
Ga0193707_11020002 3300019881 Soil SAIISAALVSCNQRINEVKPFAPQKIKPYGADVSDAVDSNWTIRAAMSVFVVIALASFCSPGLQTLHSSTTGAQHHREQFSYKGKINPCVLAKTPPKVCIRKGGAQQPVQLFHSAGDISLRRISFPACSLCPHRIHSGIARGSLTLRERAPPV
Ga0193713_10190342 3300019882 Soil MDVSDAVETGRTTRTVVSAFLVIALASFCSPALQTIHSSTSPAQHHQKQFDQKGEIEFCVLAKTPAKVSTRKSSAQQLIQLFYSASDVSLQRLSLSARSICATAIHSEIARGSVPFRERAPPA
Ga0193725_10133513 3300019883 Soil VSEAIEANRTISAVVSAFGLIALASFCSPSLQTLHGSTTALQHHQEQFNHKGKINFCVLAKTPTKVCIRKGSAQQPGQLFYSAGDVSLRRINFSARSLCATGIPSAITRGSLTLRERAPP
Ga0193747_10144783 3300019885 Soil MDEVKPFASQKIKPYGTDVSDSVDTNRTISATVSGFVVIALASFCSPGLQTLHRSTTDAQDHREQFNHKGKINLCVLAKTPTKVCIRKGGAQQPVQIFYSAGDISSRRINFPACSLCPPRIHSGIARGSLTLRERAPPV
Ga0193727_100134911 3300019886 Soil MDVSDALKIGWTTRTVMSAFVAIALASFCSPTLKTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSACDVSLHGLNLSSRSICATAIHSEIARGSVTFRERAPPL
Ga0193729_11451302 3300019887 Soil MDVSDAVETAWTTRTVVSAFVGVALASFCSLTLHCSTSSVQRQQEQFDHKGEIEFCVLAKAPAKVCTRKVSAQHSVQLFYSASDGSLHRVNLPARLICAIAIHSEIARGSVTFRERAPPV
Ga0193731_11222141 3300020001 Soil IRAGVTAFGLIALASFCSPSLQALHSSPTALQHHQKQFNHKGKINFCVLAKTPAKVCIRKGSAQQPVQLFYSAGDVSLRRINLSARSLCATRIRSDIARGSLTLRERAPPV
Ga0193755_10418412 3300020004 Soil MDVSEAIEANRTIRAVVSAFGLIALASFCSPSLQALHSSTTVLQHHQKQFNHKEKINFCVLAKTPAKVCIRKGSAQQPVQLFYSAGNVSLRRINFSARSLCATRIHSDIARGSLTLRERAPPV
Ga0193735_10153873 3300020006 Soil MDVSDAVETGRTTRTVVSAFLVIALASFCSPALQTIHSPTSPTQHHQKQFDQKGEIEFCVLAKTPAKVSTRKSSAQQLIQLFYSASDVSLQRLSLSARSICATAIHSEIARGSVPFRERAPPA
Ga0193735_11812801 3300020006 Soil FVVIALASLSSPTLKTVHRSARSVQHHQEQFDHKGKMELCVLAKTSTKVCSRKGSAQQSFQVFYSVSDVSVRRLNLSARSICATAIHSEIAGGNSTFRERAPPV
Ga0193724_100031315 3300020062 Soil MDVSEAIEANRTIRAVVSAFGLIALASFCSPSLQTLHGSITALQHHQEQFNHKGKINFCVLAKTPTKVCIRKGSAQQPGQLFYSAGDVSLRRINFSARSLCATGIPSAITRGSLTLRERAPPV
Ga0210381_100293982 3300021078 Groundwater Sediment MDVSDAVETGWITRTVVSAFVVIALASFCSPTLQTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSV
Ga0193719_100003632 3300021344 Soil MDVSDAVETGWTTRTVVSAFVVIALASFCSPTLQTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRERAPPV
Ga0193719_100827561 3300021344 Soil MDEVKRFASQKIKPYGTDVSDAVETNWTIRAVVSAFVVIALASFCSPSLQTLHSSVSALQHHQEQFNHKGEINFWVLAKTPAKLCIRKGSAQQPVQLFYSAGDVSLRRINFSARSLYATRIHSDIGRGSLTLRERAPPV
Ga0193709_100016412 3300021411 Soil MDTIKSFASQKIKPYGTDVSDGVETNWTIRAAVSAFVVIALASFCLPGLPTLHSSTAGAQQHREQFNHKKKINLCVLAKTPAKVCIRKGGAQQPVKLFYYAGDISLRRINFPACSLCPPGIHSGIARGSLMLRERAPPV
Ga0193709_10977382 3300021411 Soil MDEVKRFASQKIKPYGTDVSDAVETNWTIRAVVSAFVVIALASFCSPSLQTLHSSVSALQHHQEQFNHKGEINFWVLAKTPAKLCIRKGSAQQPVQLFYSAGDVSLRRINFSARSLYATRIHSDIGRGSLTLRERAPP
Ga0222623_102928651 3300022694 Groundwater Sediment VSAFVVIALASFCSPTLQTLHSSTTPVQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRERAPPV
Ga0222623_103854852 3300022694 Groundwater Sediment MDEVKRFASQKIKPYGTDVSDAVETNWTIRAVVSAFVVIALASFCSPSLQTLHSSVSALQHHQEQFNHKGEINFWVLAKTPAKLCIRKGSAQQPVQLFYSAGDVSLRRINFSARSLYATRIHSDIGRGS
Ga0222622_109420822 3300022756 Groundwater Sediment DTVEMNRTVRAAVSAFVVIALASFCSPDLQTLHSSASNAQHHREQFNYKGKINLCVLAKTPTKVCVRKGGPQQPVQIFYSAGDTSSRRINFSACSLCPPRIHSGIARGSLTLRERAPPV
Ga0209056_1000229916 3300026538 Soil MDVMEAVETDWTRRTVVNAFVVIALASLCSPTLKTLHSSATPVQDHQEQFDHKGKIEFCVLTKTPIKVCTRKGSAQRLVQLVCFAGDVSLHRLNFSPRSLCATEIHSDITRGSLTLRERAPPV
Ga0209161_100529743 3300026548 Soil MEAVETDWTRRTVVNAFVVIALASLCSPTLKTLHSSATPVQDHQEQFDHKGKIEFCVLTKTPIKVCTRKGSAQRLVQLVCFAGDVSLHRLNFSPRSLCATEIHSD
Ga0307282_102528981 3300028784 Soil VVSAFVVIALASFCSPTLHSSTTPVQHHQEQFDHKGKIEFCVLTKTSTKVCTRKDSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRKRAPPV
Ga0307305_100003499 3300028807 Soil VRDAFETGSTTRTVVSAFVVIALASLSSPTLKTVHRSARSVQHHQEQFDHKGKMELCVLAKTSTKVCSRKGSAQQSFQVFYSVSDVSVRRLNLSARSICATAIHSEIAGGNSTFRERAPP
Ga0307312_1000088111 3300028828 Soil VRDAFETGSTTRTVVSAFVVIALASLSSPTLKTVHRSARSVQHHQEQFDHKGKMELCVLAKTSTKVCSRKGSAQQSFQVFYSVSDVSLRRLNLSARSICATAIHSEIAGGNSTFRERAPP
Ga0307312_101876192 3300028828 Soil MDVSDAVETGWTTRTGVSAFVVIALASFCSPTLQTLDSSTTPIQHHQEQFDHKGKIEFCVLAKTPTKVCTRKGSAQQSVQLFYSASDVSLHRLNLSARSICATAIHSEIARGSVTFRERAPPV
Ga0307312_104207312 3300028828 Soil MDVSNADETGWTTRTVVSAFVVIALASFCSPTPKTLHSSATSVQHHQEQLDHKGKIELCVLAKAPTKVCTRKGGAQQLVQLFYSASDISLHRLGLSARSICATAIYSEIARGSVTFRERAPPV
Ga0307278_101917542 3300028878 Soil MDVSDAVETGWTTRTVVSAFVVTALACFCSPTLQSLHSSTTPVRHHQEQFDHKGKIEFCVLAKDPTKVCTRKGSAQQLVQLFYSASDISLHRLSLSARSICATAIHSEIARGSVTFRERAPPV
Ga0075386_121791302 3300030916 Soil MNVSEAIEANWTIRAAASAFVVIALASFCSPSLQALHSSTTALQHHQEQFNHKGKINFCVLAKTPTKVCIRKGSAQQLVQLFYSVGNVSLRRINFSPRSLCATGIHSDIARGSLTLRERAPPV
Ga0307469_103662682 3300031720 Hardwood Forest Soil SEGIEANWTIRAAVSAFGVIALASFCSPSLQALHSSTTALQHHQKQFNHEGKINFCVLAKTPAKLCIRKGSVQQTVQLFNSAGDVSLRRINLSARSLCATGIHSDIARRSLTLRERAPPV
Ga0310812_100424542 3300032421 Soil MDVNEVIEANWTIRAVVSAFGVIALASFCSPSLQAVHGFTTALQHHQEQFNHKGKIDFCVLAKTPTKVSIRKGSAQQLVQLFNSAGDVSLRRIKFSARSLCATRIHSDIARGSLTLRERAPPV
Ga0310812_105147851 3300032421 Soil SAALESCNQRMDEIKPFASQKIKPYGTDVSDAVETNWTIRAAVSAFVVIALASFCSPSLQTLNSSTTGAQHHREQFTHKVKINFCVLAKTPTKICIRKGSAQQPVQLFYSAGDVSLRRINFPACSLCPPRIHSGIARGSLTLRERAPPA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice