NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F102772


Overview

Basic Information
Family ID F102772
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 114 residues
Representative Sequence MLIHARRLAFPVLTVPFVAGVCLFGGCATWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE
Number of Associated Samples 79
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 60.40 %
% of genes near scaffold ends (potentially truncated) 35.64 %
% of genes from short scaffolds (< 2000 bps) 84.16 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.63
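The percentages above are fractions of the family's 101 sequences, so they can be turned back into approximate gene counts. The following is a minimal sketch of that arithmetic; the function name `percent_to_count` and the dictionary labels are illustrative and not part of NMPFamsDB, and it assumes each percentage was computed over all 101 family members.

```python
def percent_to_count(percent: float, total: int) -> int:
    """Convert a percentage of `total` items into the nearest whole count."""
    return round(percent / 100 * total)


# "Number of Sequences" from the Basic Information block.
TOTAL_SEQUENCES = 101

# Percentages copied from the Quality Assessment block (labels shortened).
quality_percentages = {
    "valid RBS motifs": 60.40,
    "near scaffold ends": 35.64,
    "short scaffolds (< 2000 bps)": 84.16,
}

# Assumption: every percentage uses all 101 family members as denominator.
counts = {
    label: percent_to_count(percent, TOTAL_SEQUENCES)
    for label, percent in quality_percentages.items()
}

for label, count in counts.items():
    print(f"{label}: {count} of {TOTAL_SEQUENCES} genes")
```

Under that assumption, the three metrics correspond to 61, 36, and 85 genes respectively.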


Most Common Taxonomy
Group Bacteria (99.010 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil (15.842 % of family members)
Environment Ontology (ENVO) Unclassified (27.723 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (46.535 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 34.48%, β-sheet: 0.00%, Coil/Unstructured: 65.52%

Predicted 3D Structure

pTM-score: 0.63

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF17171 | GST_C_6 | 2.97
PF00497 | SBP_bac_3 | 1.98
PF00351 | Biopterin_H | 1.98
PF00561 | Abhydrolase_1 | 1.98
PF00536 | SAM_1 | 0.99
PF03466 | LysR_substrate | 0.99
PF12607 | DUF3772 | 0.99
PF02308 | MgtC | 0.99
PF12697 | Abhydrolase_6 | 0.99
PF07883 | Cupin_2 | 0.99
PF13367 | PrsW-protease | 0.99
PF05099 | TerB | 0.99
PF13414 | TPR_11 | 0.99
PF12804 | NTP_transf_3 | 0.99
PF01921 | tRNA-synt_1f | 0.99
PF03472 | Autoind_bind | 0.99
PF00263 | Secretin | 0.99
PF01494 | FAD_binding_3 | 0.99
PF00596 | Aldolase_II | 0.99
PF13545 | HTH_Crp_2 | 0.99
PF00440 | TetR_N | 0.99
PF12680 | SnoaL_2 | 0.99
PF01979 | Amidohydro_1 | 0.99
PF13561 | adh_short_C2 | 0.99
PF17172 | GST_N_4 | 0.99
PF03401 | TctC | 0.99
PF00072 | Response_reg | 0.99
PF13356 | Arm-DNA-bind_3 | 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG0654 | 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases | Energy production and conversion [C] | 1.98
COG2197 | DNA-binding response regulator, NarL/FixJ family, contains REC and HTH domains | Transcription [K] | 1.98
COG3186 | Phenylalanine-4-hydroxylase | Amino acid transport and metabolism [E] | 1.98
COG0578 | Glycerol-3-phosphate dehydrogenase | Energy production and conversion [C] | 0.99
COG0644 | Dehydrogenase (flavoprotein) | Energy production and conversion [C] | 0.99
COG0665 | Glycine/D-amino acid oxidase (deaminating) | Amino acid transport and metabolism [E] | 0.99
COG1285 | Magnesium uptake protein YhiD/SapB, involved in acid resistance | Inorganic ion transport and metabolism [P] | 0.99
COG1384 | Lysyl-tRNA synthetase, class I | Translation, ribosomal structure and biogenesis [J] | 0.99
COG3174 | Membrane component of predicted Mg2+ transport system, contains DUF4010 domain | Inorganic ion transport and metabolism [P] | 0.99
COG3181 | Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctC | Energy production and conversion [C] | 0.99
COG3793 | Tellurite resistance protein TerB | Inorganic ion transport and metabolism [P] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.01 %
Unclassified | root | N/A | 0.99 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000956|JGI10216J12902_102834222 | All Organisms → cellular organisms → Bacteria | 608
3300000956|JGI10216J12902_103140157 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 970
3300000956|JGI10216J12902_104638402 | All Organisms → cellular organisms → Bacteria | 680
3300000956|JGI10216J12902_121069704 | All Organisms → cellular organisms → Bacteria | 540
3300004156|Ga0062589_101803004 | All Organisms → cellular organisms → Bacteria | 614
3300004463|Ga0063356_105682011 | All Organisms → cellular organisms → Bacteria | 535
3300004480|Ga0062592_101817470 | All Organisms → cellular organisms → Bacteria | 596
3300004643|Ga0062591_101047932 | All Organisms → cellular organisms → Bacteria | 780
3300005468|Ga0070707_100109639 | All Organisms → cellular organisms → Bacteria | 2677
3300005518|Ga0070699_100071410 | All Organisms → cellular organisms → Bacteria | 3019
3300005518|Ga0070699_100386405 | All Organisms → cellular organisms → Bacteria | 1264
3300005529|Ga0070741_10105264 | All Organisms → cellular organisms → Bacteria | 2976
3300005536|Ga0070697_101128373 | All Organisms → cellular organisms → Bacteria | 698
3300005548|Ga0070665_101522334 | All Organisms → cellular organisms → Bacteria | 677
3300005577|Ga0068857_102067459 | All Organisms → cellular organisms → Bacteria | 559
3300005607|Ga0070740_10185663 | All Organisms → cellular organisms → Bacteria | 879
3300005718|Ga0068866_10149358 | All Organisms → cellular organisms → Bacteria | 1351
3300005718|Ga0068866_10937664 | All Organisms → cellular organisms → Bacteria | 611
3300005764|Ga0066903_100987630 | All Organisms → cellular organisms → Bacteria | 1537
3300006038|Ga0075365_10234504 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1289
3300006038|Ga0075365_10770093 | All Organisms → cellular organisms → Bacteria | 679
3300006051|Ga0075364_10900884 | All Organisms → cellular organisms → Bacteria | 602
3300006844|Ga0075428_100288432 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1766
3300006847|Ga0075431_102212828 | All Organisms → cellular organisms → Bacteria | 504
3300006880|Ga0075429_100318748 | All Organisms → cellular organisms → Bacteria | 1361
3300008886|Ga0115930_1000085 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 196644
3300009078|Ga0105106_10598326 | All Organisms → cellular organisms → Bacteria | 790
3300009094|Ga0111539_10785946 | All Organisms → cellular organisms → Bacteria | 1108
3300009100|Ga0075418_11287771 | All Organisms → cellular organisms → Bacteria | 792
3300009147|Ga0114129_12434648 | All Organisms → cellular organisms → Bacteria | 627
3300009153|Ga0105094_10176660 | All Organisms → cellular organisms → Bacteria | 1221
3300009162|Ga0075423_12175340 | All Organisms → cellular organisms → Bacteria | 602
3300009166|Ga0105100_10204375 | All Organisms → cellular organisms → Bacteria | 1177
3300009176|Ga0105242_13185753 | All Organisms → cellular organisms → Bacteria | 509
3300009789|Ga0126307_10017519 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 5348
3300009789|Ga0126307_10572282 | All Organisms → cellular organisms → Bacteria | 912
3300009840|Ga0126313_10574887 | All Organisms → cellular organisms → Bacteria | 906
3300009840|Ga0126313_11841864 | All Organisms → cellular organisms → Bacteria | 507
3300009870|Ga0131092_10084446 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3870
3300010005|Ga0120997_100802 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 21025
3300010036|Ga0126305_11219471 | All Organisms → cellular organisms → Bacteria | 520
3300010037|Ga0126304_10606799 | All Organisms → cellular organisms → Bacteria | 737
3300010038|Ga0126315_10158638 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Reyranellaceae → Reyranella → Reyranella soli | 1343
3300010038|Ga0126315_10271980 | All Organisms → cellular organisms → Bacteria | 1039
3300010040|Ga0126308_10702243 | All Organisms → cellular organisms → Bacteria | 695
3300010041|Ga0126312_10409496 | All Organisms → cellular organisms → Bacteria | 965
3300010041|Ga0126312_10941824 | All Organisms → cellular organisms → Bacteria | 630
3300010041|Ga0126312_11255504 | All Organisms → cellular organisms → Bacteria | 547
3300010042|Ga0126314_10711831 | All Organisms → cellular organisms → Bacteria | 736
3300010044|Ga0126310_10068461 | All Organisms → cellular organisms → Bacteria | 2045
3300010045|Ga0126311_11186063 | All Organisms → cellular organisms → Bacteria | 631
3300010052|Ga0133944_1008907 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium | 8914
3300010052|Ga0133944_1011856 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Rhodopseudomonas → Rhodopseudomonas palustris | 6666
3300010166|Ga0126306_11689478 | All Organisms → cellular organisms → Bacteria | 528
3300012201|Ga0137365_10254020 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1307
3300012212|Ga0150985_102316749 | All Organisms → cellular organisms → Bacteria | 617
3300012469|Ga0150984_121599725 | All Organisms → cellular organisms → Bacteria | 695
3300013297|Ga0157378_11505161 | All Organisms → cellular organisms → Bacteria | 718
3300014264|Ga0075308_1055817 | All Organisms → cellular organisms → Bacteria | 779
3300014270|Ga0075325_1228067 | All Organisms → cellular organisms → Bacteria | 509
3300014271|Ga0075326_1039014 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Methyloceanibacter | 1235
3300014272|Ga0075327_1007323 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3403
3300014272|Ga0075327_1031742 | All Organisms → cellular organisms → Bacteria | 1575
3300015371|Ga0132258_10338189 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3718
3300015371|Ga0132258_10572665 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Reyranellaceae → Reyranella → Reyranella soli | 2832
3300015371|Ga0132258_11233021 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1891
3300015371|Ga0132258_13006775 | All Organisms → cellular organisms → Bacteria | 1168
3300015372|Ga0132256_100837430 | All Organisms → cellular organisms → Bacteria | 1036
3300015372|Ga0132256_101786114 | All Organisms → cellular organisms → Bacteria | 723
3300015374|Ga0132255_102223252 | All Organisms → cellular organisms → Bacteria | 836
3300015374|Ga0132255_103732856 | All Organisms → cellular organisms → Bacteria | 647
3300018000|Ga0184604_10032302 | All Organisms → cellular organisms → Bacteria | 1340
3300018066|Ga0184617_1047192 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1083
3300018422|Ga0190265_11941783 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 695
3300018429|Ga0190272_10179500 | All Organisms → cellular organisms → Bacteria | 1502
3300018432|Ga0190275_12378221 | All Organisms → cellular organisms → Bacteria | 608
3300018432|Ga0190275_13380486 | All Organisms → cellular organisms → Bacteria | 516
3300018466|Ga0190268_10515337 | All Organisms → cellular organisms → Bacteria | 817
3300018466|Ga0190268_10578718 | All Organisms → cellular organisms → Bacteria | 789
3300018469|Ga0190270_11906079 | All Organisms → cellular organisms → Bacteria | 651
3300018481|Ga0190271_10462281 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae | 1375
3300018920|Ga0190273_10696253 | All Organisms → cellular organisms → Bacteria | 787
3300018920|Ga0190273_11901308 | All Organisms → cellular organisms → Bacteria | 547
3300019377|Ga0190264_11774763 | All Organisms → cellular organisms → Bacteria | 553
3300019884|Ga0193741_1075413 | All Organisms → cellular organisms → Bacteria | 867
3300020202|Ga0196964_10512696 | All Organisms → cellular organisms → Bacteria | 586
3300020215|Ga0196963_10010250 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 4271
3300025899|Ga0207642_10050350 | All Organisms → cellular organisms → Bacteria | 1879
3300025961|Ga0207712_11755223 | All Organisms → cellular organisms → Bacteria | 556
3300026111|Ga0208291_1026327 | All Organisms → cellular organisms → Bacteria | 1067
3300027907|Ga0207428_11036644 | All Organisms → cellular organisms → Bacteria | 575
3300027909|Ga0209382_10857483 | All Organisms → cellular organisms → Bacteria | 961
3300028590|Ga0247823_10347474 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1120
3300030619|Ga0268386_10136212 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1879
3300031731|Ga0307405_12090584 | All Organisms → cellular organisms → Bacteria | 508
3300031824|Ga0307413_10127842 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium | 1734
3300031824|Ga0307413_11793864 | All Organisms → cellular organisms → Bacteria | 549
3300031911|Ga0307412_10073483 | Not Available | 2340
3300032002|Ga0307416_103263479 | All Organisms → cellular organisms → Bacteria | 543
3300032080|Ga0326721_10022463 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2461
3300034155|Ga0370498_100670 | All Organisms → cellular organisms → Bacteria | 673
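
Each scaffold identifier above has the form `<taxon OID>|<scaffold name>`, and the numeric prefix matches a Taxon OID in the Associated Samples table. As a rough sketch (the function name `split_scaffold_id` is hypothetical, and only three identifiers from the table are used as sample input), the identifiers can be split and tallied per sample like this:

```python
from collections import Counter


def split_scaffold_id(identifier: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# Three identifiers copied from the Associated Scaffolds table.
example_ids = [
    "3300000956|JGI10216J12902_102834222",
    "3300000956|JGI10216J12902_103140157",
    "3300004156|Ga0062589_101803004",
]

# Count how many family scaffolds come from each sample (taxon OID).
scaffolds_per_sample = Counter(
    split_scaffold_id(identifier)[0] for identifier in example_ids
)

for taxon_oid, n_scaffolds in scaffolds_per_sample.items():
    print(f"{taxon_oid}: {n_scaffolds} scaffold(s)")
```

Run over the full table, the same tally would show how the 101 scaffolds are spread across the 79 associated samples.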




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 15.84%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 13.86%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 8.91%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 5.94%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 5.94%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 4.95%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.96%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.96%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.97%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.97%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 2.97%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 2.97%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.98%
Liquid | Environmental → Aquatic → Unclassified → Unclassified → Unclassified → Liquid | 1.98%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.98%
Soil | Environmental → Terrestrial → Soil → Sand → Desert → Soil | 1.98%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 1.98%
Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment | 0.99%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.99%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.99%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.99%
Micrasterias Crux-Melitensis (Mzch 98) Associated | Host-Associated → Microbial → Bacteria → Unclassified → Unclassified → Micrasterias Crux-Melitensis (Mzch 98) Associated | 0.99%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.99%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.99%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.99%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.99%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.99%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.99%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.99%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.99%
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 0.99%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005548 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG | Host-Associated
3300005577 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 | Host-Associated
3300005607 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 | Environmental
3300005718 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006038 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-5 | Host-Associated
3300006051 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-4 | Host-Associated
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated
3300008886 | Microbial communities associated with unicellular green alga Micrasterias crux-melitensis, Germany - (MZCH: 98) | Host-Associated
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009153 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 10-12cm March2015 | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009166 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm May2015 | Environmental
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300009789 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot28 | Environmental
3300009840 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105A | Environmental
3300009870 | Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Linkou plant | Engineered
3300010005 | Microbial communities associated with xenic strain Fischerella muscicola UTEX 1829 | Environmental
3300010036 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot26 | Environmental
3300010037 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot25 | Environmental
3300010038 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106 | Environmental
3300010040 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55 | Environmental
3300010041 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot104A | Environmental
3300010042 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105B | Environmental
3300010044 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot60 | Environmental
3300010045 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61 | Environmental
3300010052 | Microbial community associated with the xenic strain of Eucapsis sp. UTEX 1529 | Environmental
3300010166 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27 | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300014264 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Joice_ThreeSqA_D2_rd | Environmental
3300014270 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailA_D1 | Environmental
3300014271 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailA_D2 | Environmental
3300014272 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailB_D1 | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018066 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300018432 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 T | Environmental
3300018466 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T | Environmental
3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental
3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental
3300018920 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 IS | Environmental
3300019377 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 T | Environmental
3300019884 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2s2 | Environmental
3300020202 | Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10 | Environmental
3300020215 | Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_5 | Environmental
3300025899 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes) | Host-Associated
3300025961 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes) | Host-Associated
3300026111 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailB_D1 (SPAdes) | Environmental
3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300028590 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day30 | Environmental
3300030619 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq) | Environmental
3300031731 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-1 | Host-Associated
3300031824 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-2 | Host-Associated
3300031911 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-1 | Host-Associated
3300032002 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3 | Host-Associated
3300032080 | Soil microbial communities from Southern Great Plains, Lamont, Oklahoma, United States - SGP_1_2016 | Environmental
3300034155 | Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10216J12902_1028342222 | 3300000956 | Soil | ICVFSECDATAQSADDLPPPAEPWLAEPGDWAADFNEAQVACYNGSMSACDSIWISDRVLLDSFLSEYGRSCGGRVDVREIRRMNLTCAEAFPDHD*
JGI10216J12902_1031401572 | 3300000956 | Soil | MLSPAGRRASGLLALPFAAGICVFSKCEAMAQSADDLPAAAEPWLAEPGDWAADFNEAQLACYQGSMSACDSIWLSDRVLLDTFLAEYGRSCGGRVDSREITRADLTCVEAFPGHD*
JGI10216J12902_1046384022 | 3300000956 | Soil | MLSPAGRRASVMLALPFAAGICVFSERGATAQTADELPPAAEPVLAEPGDWAADFNEAQVGCYEGSMSACDSIWLDDRVLLDSLLGQYGRTCGGRADHREISLASLTCREAFPGHE*
JGI10216J12902_1210697041 | 3300000956 | Soil | MLHHARRRASRMLLVPLVGGVCLFGECDTRAQSAGDLPNAQEPWLAEPGDWAADFNEAQVACYQGSMSACDSIWLNQRVLLDSFLGKYGRTCGGRVDIREIDRSDLTCTGAFPGHE*
Ga0062589_1018030041 | 3300004156 | Soil | LSGEPRWLKLPLRWKDGEMLIHARRLTFRVLTVPFVAGVCLFGGCTTWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE*
Ga0063356_1056820112 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MLSPACRRASGLPVVLFVAGIHLFSECDARAQPTDALPAASEPWLAEEGDWAADFNQAQLACYQGSMSACDSIWLSERVLLDSFLDHYGRTCGGRVDMDEISRANLTCVEAFPGHE*
Ga0062592_1018174702 | 3300004480 | Soil | PRQALEGGQMLVQARRRASRMLAVPMIAGVGLFAGCHAKAQSGPDLPAAEQPWLAEQGEWAADFNEAQMACYEGSMAACDAIWLNNRVLLDSWLHQYGRTCGGRVDLRAIRRANVDCTEAFPGHE*
Ga0062591_1010479321 | 3300004643 | Soil | MLNHSGRRASRMLVVPFVASVCLVWECDTRAQSALPPAQEPSLAGPGDRNAVFNQAQIACYQGSMRACDSIGLDDRILMDSFLGQYGRTCGGRANLRAMRRQSLNCTEAFPGHE*
Ga0070707_1001096393 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MLVVPFVAGGYLFGECDTRAQSFGDLPYAEEPWLAEPGDWSADFNEAQTACYQGSMRACDSIWLDDRVLIDTFLGQYGRTCGGRVDIGEIRRANVTCVKAFPGHE*
Ga0070699_1000714104 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MGTLLNDAHRRASRMLVVPFVAGGYLFGECDTRAQSFGDLPYAEEPWLAEPGDWSADFNEAQTACYQGSMRACDSIWLDDRVLIDTFLSQYGRTCGGRVDISEIRRANVTCVKAFPGHE*
Ga0070699_1003864053 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MGTLLNDARRRASRMLVVPFVAGSCLFGEIETRAQSFGDLPSAQEPWLAEPGDPYADFNEAQTACYQASMRACDSIWLDERVLIDTFLDQYGRTCGGRVDISEIRRADVTCVEAFPGHE*
Ga0070741_101052641 | 3300005529 | Surface Soil | MSPKSHYTRTAAIAFVAGVLLHWGCHAEAQKPVGDLPEAQQPWLAEEGDWAADFNQAQMACYEGSMKACDSIWSNERVLMDTFLYKYGRTCGGRVDLHEIVQANLTCIEAFPGHE*
Ga0070697_1011283731 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | ARVGLFIISPVDSRLATAGRMGTLLNDAHRRASRMLVVPFVAGGYLFGECDTRAQSFGDLPNAEEPWLAEPGDWSADFNEAQTACYQGSMRACDSIWLDDRVLIDTFLGQYGRTCGGRVDISEIRRANVTCVEAFPGHE*
Ga0070665_1015223341 | 3300005548 | Switchgrass Rhizosphere | MLIHTRRLAFPALAIPFVAGVCLFGGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRA
Ga0068857_1020674591 | 3300005577 | Corn Rhizosphere | SPPCVPRVGYTVRCWGLPVRGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE*
Ga0070740_101856632 | 3300005607 | Surface Soil | MSPESHCAGAVVIAFVAGVLLPWDCRAEDQKPGGDLPQAQEPWLAEEGDWAADFNQAQIGCYQGSMKACDSIWSNERVLMDSFLYKYGRTCGGRVDLHEIVQANLTCVEAFPGHE*
Ga0068866_101493582 | 3300005718 | Miscanthus Rhizosphere | MGQRASLAQAATALEAWGKMLIHARRLAFPALAMPFVAGVCLFGGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE*
Ga0068866_109376642 | 3300005718 | Miscanthus Rhizosphere | NHAHRLAAGLLAVSFAAGANLSWNCDARAQSGDELPAPEEPWLAQPGDWAADFNQAPIACYQGSMSACDAIWLSNRVLLDSFLDQYGRSCGGRVDPRALRSANLSCTEAFPGHE*
Ga0066903_1009876302 | 3300005764 | Tropical Forest Soil | MLNHACVRASRLLGVSFVACACLFWQCETKAQFGLPSAVEPWLAESGDRYADFNLAQIACYRGSMSACDAIWLSDRVLLDTLLSRYGRTCGGRVDYRAISRAGLACTEAFPGYE*
Ga0075365_102345041 | 3300006038 | Populus Endosphere | MLIQARRRASRMLAVPMIAGFGLFAGCSTKAQSAGDLPAAEQPWLAEQGEWAADFNEAQIACYEGSMNACDAIWLNDRVLLDSWLGQYGRSCGGRADLRAIRRANLSC
Ga0075365_107700931 | 3300006038 | Populus Endosphere | MLIHARRLAFPALAIPFVAGVCLFGGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE*
Ga0075364_109008841 | 3300006051 | Populus Endosphere | MLIQARRRASRMLAVPMVAGVCLLAGCHAKAQTAGNLPPAEEPWLAEQGEWAADFNQAQIACYEGSMNACDAIWLNNRVLLDSWLHQYGRTCGGRVDLRAIRRANVDCTEAFPGHE*
Ga0075428_1002884322 | 3300006844 | Populus Rhizosphere | MLNQARRRAARMLVIALAAGVYLFSQSDTRAQSAGDLPNAAEPWLAEPGDWAADFNEAQIACYQGSMRACDLIWLNQRVLLDSPLGQYGRTCGGRADLRAIRRASLSCTEAFPGNE*
Ga0075431_1022128281 | 3300006847 | Populus Rhizosphere | ARMPAGLGTFVVSPVGSTLGTAGRMGTMLNPARRRASRMLLGPFVAGVCLFWQCDARAQSLPPAEQPWLAEPGDWAADFNQAQIACYQGSMRACDLIWLNQRVLLDSWLGQYGRTCGGRVDLRTIRRANLNCTEAFPGNE*
Ga0075429_1003187483 | 3300006880 | Populus Rhizosphere | MATMLNQARRRAARMLVIALAAGVYLFSQSDTRAQSAGDLPNAAEPWLAEPGDWAADFNEAQIACYQGSMRACDLIWLNQRVLLDSPLGQYGRTCGGRANLRAIRRASLSCTEAFPGNE*
Ga0115930_100008560 | 3300008886 | Micrasterias Crux-Melitensis (Mzch 98) Associated | MSNDASRLRAVSFAASVYLFFEAGLAAQTVDELPDPQEPMLAEPGDWAADFNEAQLACYQGSMRACDSIWLSERVLLDSFLDHYGRTCGGRVDLREIRRANLNCTEAFPGHD*
Ga0105106_105983262 | 3300009078 | Freshwater Sediment | MLSHARRRASRMLVVPFVASVCLFWECDARAQSAGNLPAAEEPWLAEAGDRYADFNQAQVACFQGSMRACDLIWLNERILLDSWLGQYGRTCGGRVDIRVIRRANLNCTEAFPGHE*
Ga0111539_107859462 | 3300009094 | Populus Rhizosphere | MLAVPMVAGVCLLAGCHAKAQTAGHLPPAEEPWLAEQGEWAADFNQAQIACYEGSMNACDAIWLNNRVLLDSWLHQYGRTCGGRVDLRAIRRANVDCTEAFPGHE*
Ga0075418_112877711 | 3300009100 | Populus Rhizosphere | MLLGPFVAGVCLFWQCDARAQSLPPAEQPWLAEPDDWAADFNQAQIACYQGSMRACDLIWLNQRVLLDSWLGQYGRTCGGRVDLRTIRRANLNCTEAFPGYE*
Ga0114129_124346481 | 3300009147 | Populus Rhizosphere | LSGKPDWPKLPQRWKDGEMSNHARRLAFPVLTVPFVAGVWLFGGCATWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSNRILMDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE*
Ga0105094_101766601 | 3300009153 | Freshwater Sediment | MGTLLNHARRRASRMLVVPFVASVCLFWECDARAQSAGNLPAAEEPWLAEAGDRYADFNQAQVACFQGSMRACDLIWLNERILLDSWLGQYGRTCGGRVDIRVIRRANLNCTEAFPGHE*
Ga0075423_121753401 | 3300009162 | Populus Rhizosphere | MLVVPLVGGVCLFGECDTRAQSAGDLPNAQEPWLAEPGDWAADFNDAQVACYQGSMSACDSIWLNQRVLLDSFLGKYGRTCGGRVDIREIDRSDLTCTGAFP
Ga0105100_102043752 | 3300009166 | Freshwater Sediment | MLSHARRRASRMLVVPFVASVCLFWECDARAQSAGNLPAAEEPWLAEAGDRYADFNQAQVACFQGSMRACDLIWLNERILLDSWLGQYGRTCGGRVDIRVIRRANLNCTEAFPDHE*
Ga0105242_131857531 | 3300009176 | Miscanthus Rhizosphere | ALEGGQMLVQARRRASRMLAVPMIAGVGLFAGCHAKAQSGPDLPAAEQPWLAEQGEWAADFNEAQMACYEGSMAACDAIWLNNRVLLDSWLHQYGRTCGGRVDLRAIRRANVDCTEAFPGHE*
Ga0126307_100175194 | 3300009789 | Serpentine Soil | MAKMLIHGRRLAFPALAIPFVAGVCLFGGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDAIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGND
Ga0126307_105722822 | 3300009789 | Serpentine Soil | MLIHVRRLAFTALTVPFVAGVCLFGGCATWAQSAGNLPPAAAEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILVDSWLGQYGRTCGGRADLRAIRRANLTCAEA
Ga0126313_105748872 | 3300009840 | Serpentine Soil | MLIHARRLAFPVLTVPFVAGVCLFGGCATWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE*
Ga0126313_118418641 | 3300009840 | Serpentine Soil | FCLFAEYDTKAQTAGALPAAAEPWLAEQGDWAADFNQAQIACYEGSMNACDAIWLNDRVLRDSWLGQYSRTCAGRADLRAIRRANLSCTEAFPGNE*
Ga0131092_100844464 | 3300009870 | Activated Sludge | MSSHARHPTARMFAALLVMSGGLFRELEATAQSTSDLPEAAEPYLAEEGDWAADFNQAQIACYQGSMRACDSIWLSQRVLIDSWLSEYGRSCGGRVDPRAIRRANLTCVEAFPDYE*
Ga0120997_1008023 | 3300010005 | Sediment | MSNDASRLRAILSAASVYLFFEAGLAAQTVDELPDPQEPMLAEPGDWAADFNEAQSACYQGSMRACDSIWLSERVLLDSFLDHYGRTCGGRVDLREIRRANLNCTEAFPGHD*
Ga0126305_112194711 | 3300010036 | Serpentine Soil | MLIHVRRLAFTALTVPFVAGVCPFGGCATWAQSAGNLPPAAAEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGND*
Ga0126304_106067992 | 3300010037 | Serpentine Soil | MPIHARRLAFAVLAIPFVAGVCLFAGVAVWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE*
Ga0126315_101586382 | 3300010038 | Serpentine Soil | MLAVPFVAGVCLFGGCDASAQSAGNLPPAVEPWLAEPGNYYADFNDAQIACFEGSMRACDLIWLSDRILLDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE*
Ga0126315_102719803 | 3300010038 | Serpentine Soil | WAGLGGMGKMLIHAHRRASRMLAFPIVAGLCLLAGPHTKAQTAGNLPAAAEPSLAQQGDWAADFNQAQIACYEGSMRACDAIWLNDRVLLDSWLGQYGRTCGGRADLRAIRRANLSCIEAFPGNE*
Ga0126308_107022432 | 3300010040 | Serpentine Soil | MLIHAHRRASRMLSLPIVAGLCLLAGPHTKAQTAGNLPAAAEPSLAQQGDWAADFNQAQIACYEGSMKACDAIWLNDRVLLDSWLGQYGRTCGGRADLRAIRRANLSCTEAFPGNE*
Ga0126312_104094961 | 3300010041 | Serpentine Soil | MLIHAHRRASRMLALPIVAGLCLLAGPHTKAQTAGNLPAAAEPSLAQQGDWAADFNQAQIACYEGSMKACDAIWLNDRVLLDSWLGQYGRTCGGRADLRAIRRANLSCTEAFPGNE*
Ga0126312_109418241 | 3300010041 | Serpentine Soil | KDGTMLNHARRRASRMLLVPLVAGVYLFWQGDTRAQSAGNLPRATEPWLAEPGDWAADFNQAQIECYQGSLRACDSIWLHRRVLLDSSLGQYGRTCGGRADIRAIRRANLTCADAFPGHD
Ga0126312_112555042 | 3300010041 | Serpentine Soil | MAKMLIHGRRLAFPVLTVPFVAGVCLFGGCATWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE
Ga0126314_107118311 | 3300010042 | Serpentine Soil | MLIHARRLAFPMLTVPFVAGVCLFGGCATWAQSAGNLPPAAAEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGND*
Ga0126310_100684614 | 3300010044 | Serpentine Soil | MRVVDGLSGKPDWPKLPQRWKDGEMSNHARRLAFRVLTVPFVAGVCLFGGCATWAQSAGNLPPAAEEPRLAEQGNYYADFNEAQIACYEGSMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE*
Ga0126311_111860631 | 3300010045 | Serpentine Soil | MLIHAHRRASRMLAFPIVAGLCLLAGPHTKAQTAGNLPAAAEPSLAQQGDWAADFNQAQIACYEGSMKACDAIWLNDRVLLDSWLGQYGRTCGGRADLRAIRRANLSCTEAFPGNE*
Ga0133944_10089075 | 3300010052 | Liquid | MLVVPLAAGIHLFLQCDARAQAADALPAAAEPWLAEPGDWAADFNEAQVACYEGSMTACDSIWMNDRVLFDTFLGDYGRSCGGRVDIREIRRANLTCAEAFPGYE*
Ga0133944_10118562 | 3300010052 | Liquid | MSNNASRLWAIPLAASACLFFEAGLVAQTADELPNAQEPLLAEPDDWAADFNEAQLACFQGSMRACDSIRLNERILLDSFLAQYGRTCGGRVDLREIRRANLNCTEAFPGHD*
Ga0126306_116894781 | 3300010166 | Serpentine Soil | MEDGQLSNHVRRRSCRILVVPFAVSVCLMWARDTRAQSAGDLPREAEPRLAESGDLSAHFNEAQIACYQGSMRACDSIWLNERVLLDTFLGQYGRTCGGRVDLRAIRRANVTCIEAFPGHE*
Ga0137365_102540202 | 3300012201 | Vadose Zone Soil | IIWAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRFLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0150985_1023167491 | 3300012212 | Avena Fatua Rhizosphere | MLIAPFIVGVYLFCECETRAQSADELPNAEEPWLGEPGDRFAEFNEAQIACYQGSMRACDSIWLNEGVLLDSFLGQYGRTCGGRVDIRTIRRANANCTEAFPGHE*
Ga0150984_1215997251 | 3300012469 | Avena Fatua Rhizosphere | MLIAPFIAGVYLFCECETRAQSADELPNAEEPWLGEPGDRFAEFNEAQIACYQGSMRACDSIWLNEGVLLDSFLGQYGRTCGGRVDIRTIRRANANCTEAFPGHE*
Ga0157378_115051611 | 3300013297 | Miscanthus Rhizosphere | MLIHTRRLAFPALAIPFVAGVCLFGGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE*
Ga0075308_10558171 | 3300014264 | Natural And Restored Wetlands | MSRHARRRAFQMSVVPFIVGVYLFSDCNARAQYADDLPDAEEPWLAESGDWTADFNELQIACYQGSMRACDAIWLDNRVLLDSPLGQYGRTCGGRVDIVEMRRADLSCTEAFPGHE*
Ga0075325_12280671 | 3300014270 | Natural And Restored Wetlands | GASPILVVPFAAIVCFFLEWDTTRAQSADDLPSAEEPWLAEPGDWAADFNELQIACYQGSMSACDSIWLNGRVLLDTFLFQYGRSCGGRVDPREISRANITCTEAFPGYE*
Ga0075326_10390141 | 3300014271 | Natural And Restored Wetlands | MLVVPLVAGIDLFWACDTRAQYADDLPSAEEPWLAEPGDWAADFNDLQIACYQGSMSACDAIWLDGRVLLDSLLGQYGRTCGGRVDIDEIRRADLSCTEAFPGHE*
Ga0075327_10073231 | 3300014272 | Natural And Restored Wetlands | TFDLLWACDARAQYADDLPSAEEPWLAEPGDWSADFNDLQIACYQGSMSACDAIWLDGRVLLDSFLGQYGRTCGGRVDIDEIRRADLSCTEAFPGHE*
Ga0075327_10317422 | 3300014272 | Natural And Restored Wetlands | VVIEPTIVDGVLWAMKRLAIASNHARGASHILVVPFAAIVCFFLEWDTTRAQSADDLPSAEEPWLAEPGDWAADFNELQIACYQGSMSACDSIWLNGRVLLDTFLFQYGRSCGGRVDPREISRANITCTEAFPGYE*
Ga0132258_103381893 | 3300015371 | Arabidopsis Rhizosphere | MSNSARFISRMLAITVVASILFCGCQISAQSANDLPPPGDEPWLAEPGNYYADFNEAQIACYAGSMEACDLIGFSERILMDTWLSRYGRTCGGRVDLRAIMRANLSCTEVFPGH*
Ga0132258_105726652 | 3300015371 | Arabidopsis Rhizosphere | MLAVPMVAGVCLLAGCHAKAQTAGNLPPAEEPWLAEQGEWAADFNQAQIACYEGSMNACDAIWLNNRVLLDSWLHQYGRTCGGRVDLRAIRRANVDCTEAFPGHE*
Ga0132258_112330212 | 3300015371 | Arabidopsis Rhizosphere | MSNDAPRRAFRIWLVPAIAGASLFWTFAARAQSADDLPTPEEPWLAEPGDWAADFNDAQVACYQGSMSACDSIWMSQRVLFDSFLSKYGRTCGGRADARQLTFANMTCTEAFPGHE*
Ga0132258_130067752 | 3300015371 | Arabidopsis Rhizosphere | MAPVFCHVGRLAFRVSVIPLVVGVLLLWECHANAQKPGGDLPPAQQPWLAEAGDWAADFNQAQMACYEGSMRACDSLWANHRVLIDSFLWTYGRTCGGRVDVHEIMQANLTCTDAFPGHE
Ga0132256_1008374303 | 3300015372 | Arabidopsis Rhizosphere | MLNHARRRASQMLVAPLVASVCLFWECETRAQTADDLPSPQEPWMAEPGDWSADFNEAQTACYQGSMRACDSIWVSDRVLLDSFLYEYGRTCGGRVEMREIRSANVTCTEAFPGHE*
Ga0132256_1017861142 | 3300015372 | Arabidopsis Rhizosphere | GTMSNDAPRRAFRIWLVPAIAGASLFWTFAARAQSADDLPTPEEPWLAEPGDWAADFNDAQVACYQGSMSACDSIWMSQRVLFDSFLSKYGRTCGGRADARQLTFANMTCTEAFPGHE*
Ga0132255_1022232521 | 3300015374 | Arabidopsis Rhizosphere | MSNSARFISRMLAITVVASILFCGCQISAQSANDLPPPGDEPWLAEPGNYYADFNEAQIACYAGSMEACDLIGFSERILMDTWLSRYGRTCGGRVD
Ga0132255_1037328561 | 3300015374 | Arabidopsis Rhizosphere | ARRLAFPALAIPFVAGVCLFGGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAAAFPGNE*
Ga0184604_100323021 | 3300018000 | Groundwater Sediment | MLIHARRLACSVLTVPFVAGVCLFGGCATWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANLTCAAAFPGNE
Ga0184617_10471922 | 3300018066 | Groundwater Sediment | MLIHARRLAFPVLTVPFVAGVCLFGGCATWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANLTCAAAFPGNE
Ga0190265_119417832 | 3300018422 | Soil | MLNPVCRRASLMLVVPFVAGVYLFWECDAGAQSVGNLPRAAEPWLAEPGDWAADFNQAQTACYQGSMRACDLIWLNDRVLLDSGLGQYGRTCGGRVDLRAIRRANLTCTEAFPGHE
Ga0190272_101795002 | 3300018429 | Soil | MLIHARRLAFPALTVRFVAGVCLFGGCATWAQSGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSNRILMDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE
Ga0190275_123782211 | 3300018432 | Soil | ARCRVSRMLVVPFVASVSLFWECDVGAQSAGELPEAAEPLLAEPSDRYADYNEAQIACYRGSMSACDSIWLDEGVLMDSILGQYGRTCGGRVDLRAIRRENVTCTEVFPAHE
Ga0190275_133804861 | 3300018432 | Soil | MLIHARRLAFPMLTVPFVAGVWLFGGCAIWAQSAGNLPPAAEEPRLAEPGNYYAGFNEAQIACYEGSMRACDVIWLSDRILLDSWLGQYGRTCGGRADLRA
Ga0190268_105153372 | 3300018466 | Soil | MSNHADRRAFRMLMVPFVAGVCLFQDCDAKAQSAGDLPAPEEPWLAEPGDWSAAFNEAQIACYRGSMTACDSIWLNERVLLDSVLGQYGRTCGGRVDIRAIRRANVTCVEAFPGHE
Ga0190268_105787182 | 3300018466 | Soil | MLIHARRLAFRVLAVPFLAGICLFGGCDTWAQSASNLPPAAEEPRLAEPGNYYADFNEAQVACYEGSMRACDVIWLSNRILMDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE
Ga0190270_119060792 | 3300018469 | Soil | RRRVFRMVVAPCVAGICLFSEGEARAQSADELPNAEEPWLGEPGDRFAEFNEAQIACYQGSMRACDSIWLNEGVLLDSFLGQYGRTCGGRVDIRTIRRANANCTEAFPGHE
Ga0190271_104622812 | 3300018481 | Soil | MLALPFAAGICVFSECGALAQSADDLPMAAEPLLAEPGDWSADFNQAQIACYQGSMSACDSIWLDDRVLLDSLLGQYGRTCGGRADRREISLANLTCREAFPGND
Ga0190273_106962532 | 3300018920 | Soil | MDHARRRAARMLALPLVAGLCLLCRSDAMAQSAGDLPEAQEPWLAEPDDRHAEFNEAQLACYRGSMTACDLIWLNEGILLDSPLAQYGRTCGGRVELR
Ga0190273_119013081 | 3300018920 | Soil | MLNHARRASRMLMVPFVASICLFWETHARAQSAGDLPPAEEPWLAEPGDRYAEFNDAQIACYQGSMRACDAIWLNERLLLDSLLAQYGRTCGGRVDRR
Ga0190264_117747631 | 3300019377 | Soil | MLIHARRLAFAMLTVPFVAGVCLFGGCATWAQSARNLPPASEEPRLAEPGNCYADFNEAQIACYEGSMRACDVIWLSDRILMDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE
Ga0193741_10754132 | 3300019884 | Soil | MSNHARRLAFRVLTVPFVAGVCLFGGCTTWAQSAGNLPPAAEEPRLAELGNYYADFNEAQIACYEGSMRACDVIWLSNRILMDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE
Ga0196964_105126961 | 3300020202 | Soil | MLIHARRLAFPMLTVPFVAGVCLFGGCATWAQPAGNLPPAAEEPRLAEPGTYYADFNEAQIACYEGSMRACDVIWLSDRILLDSWLGQYGRTCGG
Ga0196963_100102504 | 3300020215 | Soil | MLIHARRLAFPMLTVPFVAGVCLFGGCATWAQPAGNLPPAAEEPRLAEPGTYYADFNEAQIACYEGSMRACDVIWLSDRILLDSWLGQYGRTCGGRADLRAIRRANLRCAEAFPGNE
Ga0207642_100503502 | 3300025899 | Miscanthus Rhizosphere | MKQFRARTLCVVMISALFGGCATWAQSAGSLPPAAEEPRLAEPGNYYADFNAAQIACYEGAMRACDVIWLSDRILIDSWLGQYGRTCGGRADLRAIRRANLTCAEAFPGNE
Ga0207712_117552232 | 3300025961 | Switchgrass Rhizosphere | MLNHACHRAFRIFVVPLVGSVCLFGECDTRAQSADDLPAAQEPWLAEPGDWAADFNDAQVGCYQGSMSACDSIWLDQRVLLDTFLDKYGRSCGGRVDIREIQRANLTCTEYFPGHD
Ga0208291_10263272 | 3300026111 | Natural And Restored Wetlands | VVIEPTIVDGVLWAMKRLAIASNHARGASHILVVPFAAIVCFFLEWDTTRAQSADDLPSAEEPWLAEPGDWAADFNELQIACYQGSMSACDSIWLNGRVLLDTFLFQYGRSCGGRVDPREISLANITCT
Ga0207428_110366442 | 3300027907 | Populus Rhizosphere | MLIQARRRASRMLAVPMVAGVCLLAGCHAKAQTAGNLPPAEEPWLAEQGEWAADFNQAQIACYEGSMNACDAIWLNNRVLLDSWLHQYGRTCGGRVDLRAIRRANVDCTEAFPGHE
Ga0209382_108574832 | 3300027909 | Populus Rhizosphere | ARMLVIALAAGVYLFSQSDTRAQSAGDLPNAAEPWLAEPGDWAADFNEAQIACYQGSMRACDLIWLNQRVLLDSPLGQYGRTCGGRADLRAIRRASLSCTEAFPGNE
Ga0247823_103474742 | 3300028590 | Soil | MLIPAGRRASRMLALPFAAGICVFSECDATAQSVDDLPPASEPWLAEAGDWAADFNEAQVACYQGSMSACDSIWLSDRVLLDTFLSEYGRSCGGRVDAREIRRANLTCAEAFPGHD
Ga0268386_101362122 | 3300030619 | Soil | MSNLARRRASRMLVVPLVASFDLFWACDTRAQYGDDLPSAEEPWLAEPGDWSADFNDLQIACYQGSMSACDAIWLDGRVLLDSLLGQYGRTCGGRVDIDEIRGADLSCTEAFPGHE
Ga0307405_120905841 | 3300031731 | Rhizosphere | EMLIHARCLAFPVLTVPFVVGVCLFGECATWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSNRILMDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE
Ga0307413_101278421 | 3300031824 | Rhizosphere | FSECGALAQSTDELPMAAEPLLAEPGDWYADFNQDQIACYQGSMSACDSIWLDDRVLLDSLLGQYGRTCGGRADRREISLENLTCKEAFPGHD
Ga0307413_117938641 | 3300031824 | Rhizosphere | MLIHARRLAFRVLAVAFVAGVCLFGGCTTWAQSAGNLPPAAEEPRLAEPGNYYADFNEAQIACYEGSMRACDVIWLSNRILMDSWLGQYGRTCGGRADLRAIRRANVTCAEAFPGNE
Ga0307412_100734832 | 3300031911 | Rhizosphere | MLSPAGRRASVMLALPFAAGICVFSERGATAQTADELPPAAEPVPAEPGDWAADFNEAQVGCYEGSMSACDSIWLDDRVLLDSLLGQYGRTCGGRADHREISLASLTCREAFPGHE
Ga0307416_1032634791 | 3300032002 | Rhizosphere | MLSPAGRRASVMLALPFAAGICVFSERGATAQTADELPPAAEPVLAEPGDWAADFNEAQVGCYEGSMSACDSIWLDDRVLLDSLLGQYGRTCGGRADHREISLASLTCREAFPGHE
Ga0326721_100224632 | 3300032080 | Soil | MSNHADRRAFRMLMVPFVAGVCLFQDCDAKAQSAGDLPAAEEPWLAEPGDWSAAFNEAQIACYRGSMRACDSIWLNERVLLDSVLGQYGRTCGGRVDIRTIRRANVTCAEAFPGHE
Ga0370498_100670_1_327 | 3300034155 | Untreated Peat Soil | ASRMLVVPFVASVYLFWESDTRAQSTDDLPTAEEPWLAEPGDVYADFNEAQIACYQGSMTACDSIWLDERLLLDSPLSQYGRTCGGRVDHRAISLANVTCTEAFPGHE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice