NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F095267



Overview

Basic Information
Family ID F095267
Family Type Metagenome / Metatranscriptome
Number of Sequences 105
Average Sequence Length 126 residues
Representative Sequence MNRLIAALLVLGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK
Number of Associated Samples 90
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 67.62 %
% of genes near scaffold ends (potentially truncated) 34.29 %
% of genes from short scaffolds (< 2000 bps) 82.86 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.53
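
The quality percentages can be turned back into approximate gene counts. The following is a minimal sketch, assuming each percentage is computed over the 105 family sequences listed under Basic Information (the page does not state the denominator explicitly). The name `pct_to_count` is illustrative and is not part of NMPFamsDB.

```python
# Assumption: every "% of genes" value is taken over the family's 105 sequences.
FAMILY_SIZE = 105


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a percentage of the family into the nearest whole gene count."""
    return round(pct / 100 * total)


# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 67.62,
    "near scaffold end": 34.29,
    "short scaffold (< 2000 bps)": 82.86,
}

counts = {label: pct_to_count(pct) for label, pct in quality.items()}
for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption the values correspond to 71, 36, and 87 genes respectively.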

Hidden Markov Model: HMM logo (rendered with Skylign)

Most Common Taxonomy
Group Bacteria (64.762 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere
(11.429 % of family members)
Environment Ontology (ENVO) Unclassified
(24.762 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(40.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 7.01%, β-sheet: 21.66%, Coil/Unstructured: 71.34%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.53

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
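
The confidence rules stated on this page can be sketched in code. This is a hedged illustration, not NMPFamsDB's implementation: the 0.7 pTM cutoff and the four pLDDT bands come from the text above, while the function names and the handling of non-integer pLDDT values between bands are assumptions.

```python
# Cutoff taken from this page: models with pTM < 0.7 are flagged as low confidence.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm: float) -> bool:
    """Return True when a model falls below the pTM cutoff."""
    return ptm < PTM_CUTOFF


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score to one of the four bands shown on the page."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    # Assumption: a fractional score belongs to the first band whose
    # upper bound it does not exceed (e.g. 50.5 falls in "51-70").
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# This family's pTM-score is 0.53, so it is flagged as low confidence.
print(is_low_confidence(0.53))
```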



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 105 Family Scaffolds
PF01557	FAA_hydrolase	12.38
PF02678	Pirin	11.43
PF01047	MarR	7.62
PF01253	SUI1	6.67
PF12802	MarR_2	1.90
PF13493	DUF4118	1.90
PF06439	3keto-disac_hyd	1.90
PF03712	Cu2_monoox_C	0.95
PF07635	PSCyt1	0.95
PF13229	Beta_helix	0.95
PF13453	zf-TFIIB	0.95
PF08974	DUF1877	0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 105 Family Scaffolds
COG1741	Redox-sensitive bicupin YhaK, pirin superfamily	General function prediction only [R]	11.43
COG0023	Translation initiation factor 1 (eIF-1/SUI1)	Translation, ribosomal structure and biogenesis [J]	6.67



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	65.71 %
Unclassified	root	N/A	34.29 %


Associated Scaffolds


Scaffold	Taxonomy	Length
3300000364|INPhiseqgaiiFebDRAFT_101475297	All Organisms → cellular organisms → Bacteria	1654
3300000364|INPhiseqgaiiFebDRAFT_101560057	All Organisms → cellular organisms → Bacteria	1523
3300000955|JGI1027J12803_101420521	All Organisms → cellular organisms → Bacteria	691
3300000955|JGI1027J12803_106664857	Not Available	609
3300000956|JGI10216J12902_110723783	All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium → unclassified Symbiodinium → Symbiodinium sp. KB8	747
3300004157|Ga0062590_100630353	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	949
3300004157|Ga0062590_101387012	Not Available	698
3300004463|Ga0063356_100748992	All Organisms → cellular organisms → Bacteria	1353
3300005340|Ga0070689_101701365	Not Available	574
3300005441|Ga0070700_101774087	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → Desulfatibacillum → Desulfatibacillum aliphaticivorans	531
3300005456|Ga0070678_100548285	All Organisms → cellular organisms → Bacteria	1026
3300005456|Ga0070678_101136056	Not Available	722
3300005468|Ga0070707_100921927	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	838
3300005531|Ga0070738_10096093	All Organisms → cellular organisms → Bacteria	1598
3300005534|Ga0070735_10115554	All Organisms → cellular organisms → Bacteria	1680
3300005534|Ga0070735_10263512	All Organisms → cellular organisms → Bacteria	1043
3300005536|Ga0070697_101975084	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → Desulfosarcina → unclassified Desulfosarcina → Desulfosarcina sp. BuS5	522
3300005538|Ga0070731_10002094	All Organisms → cellular organisms → Bacteria	19555
3300005541|Ga0070733_11151611	Not Available	520
3300005542|Ga0070732_10236823	All Organisms → cellular organisms → Bacteria	1092
3300005544|Ga0070686_100581978	All Organisms → cellular organisms → Bacteria	880
3300005549|Ga0070704_101689456	Not Available	585
3300005719|Ga0068861_102429269	Not Available	527
3300005836|Ga0074470_10998373	All Organisms → cellular organisms → Bacteria	94903
3300006046|Ga0066652_100096494	All Organisms → cellular organisms → Bacteria → Proteobacteria	2374
3300006791|Ga0066653_10334414	All Organisms → cellular organisms → Bacteria → Proteobacteria	765
3300006844|Ga0075428_101331414	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	755
3300006845|Ga0075421_100994169	Not Available	950
3300006846|Ga0075430_100232678	All Organisms → cellular organisms → Bacteria	1528
3300006852|Ga0075433_10263122	All Organisms → cellular organisms → Bacteria	1529
3300006854|Ga0075425_100936350	Not Available	990
3300006871|Ga0075434_100532533	All Organisms → cellular organisms → Bacteria	1195
3300006904|Ga0075424_101668811	Not Available	675
3300006954|Ga0079219_10889716	All Organisms → cellular organisms → Bacteria	718
3300007004|Ga0079218_12807942	Not Available	583
3300007076|Ga0075435_101386389	Not Available	616
3300009012|Ga0066710_100925863	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	1342
3300009094|Ga0111539_10047161	All Organisms → cellular organisms → Bacteria	5151
3300009146|Ga0105091_10219880	All Organisms → cellular organisms → Bacteria	909
3300009147|Ga0114129_10826676	All Organisms → cellular organisms → Bacteria	1179
3300009156|Ga0111538_10332433	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	1923
3300009609|Ga0105347_1447782	Not Available	559
3300010397|Ga0134124_12605398	Not Available	548
3300010397|Ga0134124_12869133	Not Available	526
3300010398|Ga0126383_12536651	Not Available	597
3300010399|Ga0134127_10014130	All Organisms → cellular organisms → Bacteria	6019
3300010400|Ga0134122_10015558	All Organisms → cellular organisms → Bacteria	5683
3300010400|Ga0134122_10090164	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	2412
3300010400|Ga0134122_10111796	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	2177
3300010400|Ga0134122_10304855	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Anaeromyxobacteraceae → Anaeromyxobacter → unclassified Anaeromyxobacter → Anaeromyxobacter sp. Fw109-5	1369
3300010401|Ga0134121_10000081	All Organisms → cellular organisms → Bacteria	104675
3300010401|Ga0134121_10987317	All Organisms → cellular organisms → Bacteria	825
3300010403|Ga0134123_10127113	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	2074
3300011435|Ga0137426_1120272	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	754
3300011436|Ga0137458_1056262	All Organisms → cellular organisms → Bacteria	1067
3300011437|Ga0137429_1123592	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	795
3300012041|Ga0137430_1150000	All Organisms → cellular organisms → Bacteria	671
3300012212|Ga0150985_107046937	Not Available	559
3300012685|Ga0137397_11095973	Not Available	580
3300012893|Ga0157284_10128970	Not Available	695
3300012944|Ga0137410_10000245	All Organisms → cellular organisms → Bacteria	38682
3300012971|Ga0126369_12856884	Not Available	565
3300016270|Ga0182036_11025157	Not Available	681
3300016371|Ga0182034_11647717	Not Available	564
3300017965|Ga0190266_10128266	All Organisms → cellular organisms → Bacteria	1099
3300018482|Ga0066669_10461066	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	1093
3300019360|Ga0187894_10138457	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	1251
3300019362|Ga0173479_10235705	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	795
3300020140|Ga0179590_1138349	Not Available	664
3300020202|Ga0196964_10058223	All Organisms → cellular organisms → Bacteria	1642
3300021384|Ga0213876_10236799	All Organisms → cellular organisms → Bacteria	970
3300023168|Ga0247748_1005356	All Organisms → cellular organisms → Bacteria	1606
3300025922|Ga0207646_10081511	All Organisms → cellular organisms → Bacteria	2893
3300025936|Ga0207670_10711963	All Organisms → cellular organisms → Bacteria	832
3300026121|Ga0207683_10781708	All Organisms → cellular organisms → Bacteria	886
3300026121|Ga0207683_11706638	Not Available	579
3300027869|Ga0209579_10000884	All Organisms → cellular organisms → Bacteria	26717
3300027869|Ga0209579_10411835	Not Available	733
3300027907|Ga0207428_10261718	All Organisms → cellular organisms → Bacteria	1288
3300027986|Ga0209168_10217247	All Organisms → cellular organisms → Bacteria	953
3300031093|Ga0308197_10247479	Not Available	631
3300031538|Ga0310888_10504838	Not Available	724
3300031716|Ga0310813_10003695	All Organisms → cellular organisms → Bacteria	9317
3300031716|Ga0310813_10100913	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	2243
3300031716|Ga0310813_10175909	All Organisms → cellular organisms → Bacteria	1739
3300031716|Ga0310813_11622816	Not Available	604
3300031720|Ga0307469_10806305	Not Available	862
3300031740|Ga0307468_100561506	Not Available	922
3300031820|Ga0307473_10281037	All Organisms → cellular organisms → Bacteria	1037
3300031858|Ga0310892_11102842	Not Available	563
3300031908|Ga0310900_10532939	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Anaeromyxobacteraceae → Anaeromyxobacter → unclassified Anaeromyxobacter → Anaeromyxobacter sp. Fw109-5	918
3300031912|Ga0306921_10954298	All Organisms → cellular organisms → Bacteria	970
3300031954|Ga0306926_12685703	Not Available	541
3300031962|Ga0307479_10203229	All Organisms → cellular organisms → Bacteria	1951
3300032059|Ga0318533_11378039	Not Available	515
3300032075|Ga0310890_10388155	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1033
3300032163|Ga0315281_10063006	All Organisms → cellular organisms → Bacteria	4439
3300032421|Ga0310812_10021357	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	2288
3300033412|Ga0310810_10208494	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium	2199
3300034659|Ga0314780_224202	All Organisms → cellular organisms → Bacteria	501
3300034660|Ga0314781_094772	Not Available	593
3300034661|Ga0314782_157269	All Organisms → cellular organisms → Bacteria	562
3300034662|Ga0314783_155060	Not Available	532
3300034667|Ga0314792_189465	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Anaeromyxobacteraceae → Anaeromyxobacter → unclassified Anaeromyxobacter → Anaeromyxobacter sp. Fw109-5	572
3300034668|Ga0314793_151450	Not Available	525
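
Each scaffold identifier in the table joins two parts with a `|`: a numeric prefix that matches a Taxon OID in the Associated Samples table, and a scaffold name. The following is a minimal sketch of splitting the two, assuming that reading of the format. `ScaffoldRef` and `parse_scaffold_id` are hypothetical helper names, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # numeric sample identifier (matches "Taxon OID" on this page)
    scaffold: str   # scaffold name within that sample


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, scaffold = scaffold_id.partition("|")
    # Assumption: the prefix is always all digits and the name is non-empty.
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold ID: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold)


# First scaffold listed for this family.
ref = parse_scaffold_id("3300000364|INPhiseqgaiiFebDRAFT_101475297")
print(ref.taxon_oid, ref.scaffold)
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later used as a lookup key against the sample table.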




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	11.43%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	9.52%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	9.52%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	9.52%
Surface Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil	8.57%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	6.67%
Soil	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil	4.76%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	4.76%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	3.81%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	3.81%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere	3.81%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	2.86%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere	2.86%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	1.90%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	1.90%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	1.90%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	1.90%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment	0.95%
Sediment	Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment	0.95%
Sediment (Intertidal)	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)	0.95%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	0.95%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	0.95%
Soil	Environmental → Terrestrial → Soil → Sand → Desert → Soil	0.95%
Microbial Mat On Rocks	Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks	0.95%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere	0.95%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	0.95%
Arabidopsis Thaliana Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere	0.95%
Plant Roots	Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots	0.95%




Associated Samples

Taxon OID	Sample Name	Habitat Type
3300000364	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300000955	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300000956	Soil microbial communities from Great Prairies - Kansas, Native Prairie soil	Environmental
3300004157	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2	Environmental
3300004463	Combined assembly of Arabidopsis thaliana microbial communities	Host-Associated
3300005340	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG	Environmental
3300005441	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG	Environmental
3300005456	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG	Host-Associated
3300005468	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG	Environmental
3300005531	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2	Environmental
3300005534	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005538	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1	Environmental
3300005541	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1	Environmental
3300005542	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1	Environmental
3300005544	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG	Environmental
3300005549	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG	Environmental
3300005719	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2	Host-Associated
3300005836	Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBB	Environmental
3300006046	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101	Environmental
3300006791	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102	Environmental
3300006844	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2	Host-Associated
3300006845	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5	Host-Associated
3300006846	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4	Host-Associated
3300006852	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2	Host-Associated
3300006854	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4	Host-Associated
3300006871	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3	Host-Associated
3300006904	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3	Host-Associated
3300006954	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control	Environmental
3300007004	Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost	Environmental
3300007076	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4	Host-Associated
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300009094	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)	Host-Associated
3300009146	Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015	Environmental
3300009147	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)	Host-Associated
3300009156	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)	Host-Associated
3300009609	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890	Environmental
3300010397	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4	Environmental
3300010398	Tropical forest soil microbial communities from Panama - MetaG Plot_35	Environmental
3300010399	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3	Environmental
3300010400	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2	Environmental
3300010401	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1	Environmental
3300010403	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3	Environmental
3300011435	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2	Environmental
3300011436	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT642_2	Environmental
3300011437	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT736_2	Environmental
3300012041	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT754_2	Environmental
3300012212	Combined assembly of Hopland grassland soil	Host-Associated
3300012685	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG	Environmental
3300012893	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S059-202B-1	Environmental
3300012944	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)	Environmental
3300012971	Tropical forest soil microbial communities from Panama - MetaG Plot_1	Environmental
3300016270	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080	Environmental
3300016371	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172	Environmental
3300017965	Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 220 T	Environmental
3300018482	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118	Environmental
3300019360	White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG	Environmental
3300019362	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)	Environmental
3300020140	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly)	Environmental
3300020202	Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10	Environmental
3300021384	Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9	Host-Associated
3300023168	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S064-202C-5	Environmental
3300025922	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)	Environmental
3300025936	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes)	Environmental
3300026121	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG (SPAdes)	Host-Associated
3300027869	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)	Environmental
3300027907	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)	Host-Associated
3300027986	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)	Environmental
3300031093	Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome)	Environmental
3300031538	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1	Environmental
3300031716	Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3	Environmental
3300031720	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515	Environmental
3300031740	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05	Environmental
3300031820	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515	Environmental
3300031858	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2	Environmental
3300031908	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1	Environmental
3300031912	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)	Environmental
3300031954	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)	Environmental
3300031962	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515	Environmental
3300032059	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27	Environmental
3300032075	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3	Environmental
3300032163	Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G07_0	Environmental
3300032421	Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3	Environmental
3300033412	Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NC	Environmental
3300034659	Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R1 (Metagenome Metatranscriptome)	Environmental
3300034660	Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R2 (Metagenome Metatranscriptome)	Environmental
3300034661	Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R3 (Metagenome Metatranscriptome)	Environmental
3300034662	Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R4 (Metagenome Metatranscriptome)	Environmental
3300034667	Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R1 (Metagenome Metatranscriptome)	Environmental
3300034668	Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0R1 (Metagenome Metatranscriptome)	Environmental

Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_1014752972 | 3300000364 | Soil | MKNLVGALMLCSLAACASHGEQPEIVPKPHTKIAKASILPSKDIPSERDGVPADQTGAIMFVCANSEKHEDKEVFISKCPSCSELNYFYWNSHNSEFVCFACTKAMDSSAVKCPECGRPPRLVRTRPQAK*
INPhiseqgaiiFebDRAFT_1015600572 | 3300000364 | Soil | MKNMPVALLILGLLGCASSGEPHIVQKPHTRIAKAGILPGKDIPNERNGIPLEATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSPNSQFVCFACTKPVDNALVKCGECGKQPLKVRTRAIPK*
JGI1027J12803_1014205212 | 3300000955 | Soil | MKNLVGALMLCSLAACASHGEQPEIVPKPHTKIAKASILPSKDIPSERDGVPADQTGAIMFVCANSEKHEDKEVFISKCPSCSELNYFYWNSHNSEFVCFACTKAMDSSAVKCPECGR
JGI1027J12803_1066648571 | 3300000955 | Soil | MKNMPVALLILGLLGCASSGEPHIVQKPHTRIAKAGILPGKDIPNERNGIPLEATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSPNSQFVCFACTKPVDNALV
JGI10216J12902_1107237832 | 3300000956 | Soil | MNRITAALLVLGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPVDATGAIMFVCANSEKHEDKEVLISKCPSCSENNYFYWDSDASQFNCFACTKAVDNAYVKCPDCGKTPLKVRTRATGK*
Ga0062590_1006303531 | 3300004157 | Soil | MKKLIGVLSLLVLASCASSGEPTIVKKPHTRVAKAGILPSKDIPAERDGIPIDATGAIMFVCAGSEKHEDKEVLISKCPSCSEQNYFYWDSPGQHFTCFACTKALDNAVVKCPDCGKQPHKVRTRAQAK*
Ga0062590_1013870121 | 3300004157 | Soil | HNCINRGISMKKLIGVLSLLVLASCASSGDPTIVKKPHTRVAKAGILPSKDIPSERNGIPIDSTGAIMFICSGSEKHEDKEVLISKCPSCSEQNYFYWDGAGSQFICYACTKALDNAAVKCPDCGKQPHKVRTRATGK*
Ga0063356_1007489922 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MKNLPVALLILGLVGCASSGEPTIVKKPHTRIAKAGILPGKDIPNERDGIPLDATGAIMFVCAGSEKHEDKEVLISKCPSCSEQNYFYWDSAGTQFVCFACTKPVDNAVVKCGDCGKQPLKVRTRATPK*
Ga0070689_1017013651 | 3300005340 | Switchgrass Rhizosphere | KKPHTRVAKAGILPSKDIPSERDGIPIDATGAIMFVCAGSEKHEDKEVLISKCPDCSEQNYFYWDSANSQFVCFACTKPVDNAIVKCGECGKQPLKVRTRATPK*
Ga0070700_1017740871 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | MKNLPVALLILGLVGCASSGEPTIVKKPHVRVAKAGILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFICFACTKPVDNAIVKCGECG
Ga0070678_1005482852 | 3300005456 | Miscanthus Rhizosphere | MKRLAGALLFCTLAACASRGEQPEIVAKPHTKIAKAGILPSKEIPAERDGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGEVNYFYWNSHNSEFVCYACTKAIDSSVVKCPECGRPPRIVRTRPQAK*
Ga0070678_1011360561 | 3300005456 | Miscanthus Rhizosphere | MKKLIGVLSLLVLASCASSGDPTIVKKPHTRVAKAGILPSKDIPSERDGIPIDATGAIMFVCSGSEKHEDKEVLITKCPSCSEQNYFYWDSPGQHFTCFACTKALDNAVVKCPDCGKQPHKVRTRAQAK*
Ga0070707_1009219271 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MRNLCGALSLLVLASCASSGEPQIVKKPHTRIAKAGILPSKEIPSERDGIPIDATGAIMFVCANSEKHEDKEVLISKCPSCSEQNYFYWDSAGSHFTCFACTKVLDNAAVKCPDCGKQPHKVRTRAQAK*
Ga0070738_100960933 | 3300005531 | Surface Soil | MIKTTVGAALLCALAACASHGEQPEIVVKPHTKIARAGILPSREIPDERNGIPKDQTGAIMFVCAGSDKHEDKEVFISKCPSCGELNYFYWDTHASEFVCYACMKAIDSAQVKCPECGRPPRLVRTRPQAK*
Ga0070735_101155543 | 3300005534 | Surface Soil | MKPKLMAGAWVLCSLAACASHGEQPEIVAKPHTKIAKASILPTKEIPSERDGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWDSHNSEFVCFACTKAIDSALVKCPECGRPPRLVRTRPQAK*
Ga0070735_102635122 | 3300005534 | Surface Soil | LYEDINYGGSRMIKTTAGAFLLCALAACASHGEQPEIVVKPHTKIAKAGILPSREIPAERNGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWDSNASEFVCYACTKPVDSSIVKCPECGKPPRIVRTRPQAK*
Ga0070697_1019750841 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MKRLISSLAILSLAACATTGEEPSIVKKPHLKVNKAGLFPSRDIPAERDGIPADATGAIMFVCSGSDKHEDKEVLISKCPSCSENNYFYWDSHNSQFVCFACTKAVDNALVKCSECGRPPHKVRTRATGK*
Ga0070731_1000209412 | 3300005538 | Surface Soil | MIQKIAGILLLCSLAACAAHGDQPEVVPKPHTKIAKAGLFPSKEIPAERDGIPADSTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWNSTTSEFICFACTKAIDSSLVKCPECGRPPRLVRTRPQAK*
Ga0070733_111516112 | 3300005541 | Surface Soil | GILTSLAALFLASCATTGEEPTIVHKPHARIAQASILPSREIPAERDGIPVDATGAIMFVCSGSDKHEDKEVLITRCPSCFESNYFYWDSANAQFVCFACTKPVDNALIRCPDCGRPPHKVRTRATAK*
Ga0070732_102368231 | 3300005542 | Surface Soil | MKPKLMAGAWVLCSLAACASHGEQPEIVAKPHTKIAKASILPTKEIPSERDGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWNSQNSEFVCFACTKAIDNSAVKCPECGRPPRLVRTRPQAK*
Ga0070686_1005819782 | 3300005544 | Switchgrass Rhizosphere | MKNLPVALLILGLVGCSTSEPTIVKKPHTRVAKAGILPGKDIPNERDGIPIDATGAIMFVCAGSDKHEDKEVLISKCPACSEVNYFYWDAANSQFVCFACTKPVDNALVKCGECGKQPLKVRTRATPK*
Ga0070704_1016894562 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | ALLILGLVGCASSGEPTIVKKPHVRVAKAGILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFICFACTKPVDNAVVKCGECGKQPLKVRTRATPK*
Ga0068861_1024292691 | 3300005719 | Switchgrass Rhizosphere | ILGLVGCASSGEPHIVQKPHTRIAKASILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFICFACTKPVDNAIVKCGECGKQPLKVRTRATPK
Ga0074470_1099837343 | 3300005836 | Sediment (Intertidal) | MKRSVAVLLISTLAACASHGEQPEIVAKPHTKIAKASILPSREIPAERDGIPADQTGAIMFVCAGSDKHEDKEVFISKCPSCGEMNYFYWDTHNSEFVCFACTKAIDSSAVKCPECGKPPRIVRTRPQTK*
Ga0066652_1000964944 | 3300006046 | Soil | MKKLIGALSLLVLASCASSGEPTIVKKPHTRVAKAGILPSKDIPSERDGIPIDATGSIMFVCSGSEKHEDKEVLITKCPSCSEQNYFYWDSAGSHFTCYACTKALDNAAVKCPDCGKQPHKVRTRAQAK*
Ga0066653_103344142 | 3300006791 | Soil | LLVLASCASSGEPTIVKKPHTRVAKAGILPSKDIPSERDGIPIDATGSIMFVCSGSEKHEDKEVLITKCPSCSEQNYFYWDSAGSHFTCYACTKALDNAAVKCPDCGKQPHKVRTRAQAK
Ga0075428_1013314141 | 3300006844 | Populus Rhizosphere | PTIVRKPHTRVAKAGILPGKDIPNERNGIPLDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDAAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0075421_1009941692 | 3300006845 | Populus Rhizosphere | MKNLPVALLILGLVGCASSGEPTIVKKPHARVAKAGILPGKDIPSERDGIPLDATGAIMFVCAGSEKHENKEVLITKCPSCSESNYFYWDSGNSQFVCFACTKPVDNAVVKCGDCGKQPLKVRTRATPK*
Ga0075430_1002326781 | 3300006846 | Populus Rhizosphere | MSRIVAALLVVGLVGCASSGDPTIVKKPHTRVAKAGILPGKDIPNERNGIPLDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDAAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0075433_102631223 | 3300006852 | Populus Rhizosphere | MKNLPVALLILGLVGCASSGDPAIVKKPHTKIAKAGILPGKDIPDERNGIPVDATGAIMFVCSGSDKHEDKEVLISKCPACSELNYFYWDSANSQFVCFACTKAVDNAVVKCGECGKQPLKVRTRATPK*
Ga0075425_1009363502 | 3300006854 | Populus Rhizosphere | MKNLPVALLILGLVGCASSGDPAIVKKPHTKIAKAGILPGKDIPDERNGIPVDATGAIMFVCSGSDKHEDKEVLISKCPACSELNYFYWDSANSQFVCFACTKPVDNAVVKCGECGKQPLKVRTRATPK*
Ga0075434_1005325332 | 3300006871 | Populus Rhizosphere | MKNLPVALLILGLVGCASSGEPHIVQKPHTRIAKASILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFVCFACTKPVDNAIVKCGECGKQPLKVRTRATPK*
Ga0075424_1016688112 | 3300006904 | Populus Rhizosphere | KPHTRIAKASILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFVCFACTKPVDNAIVKCGECGKQPLKVRTRATPK*
Ga0079219_108897162 | 3300006954 | Agricultural Soil | MIMKNLPVALLILGLVGCASSGEPHIVQKPHTRIAKASILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFVCFACTKPVDNAIVKCGECGKQPLKVRTRATPK*
Ga0079218_128079421 | 3300007004 | Agricultural Soil | MKKLIGALSLLVLASCASSGEPTIVKKPHTRVAKAGILPSKDIPSERDGIPIDATGAIMFICANSEKHEDKEVLISKCPSCSEVNYFYWDAAGSHFTCFACTKALDNAAVKCPDCGKQPHKVRTRASAK*
Ga0075435_1013863891 | 3300007076 | Populus Rhizosphere | MKNLPVALLILGLVGCASSGDPAIVKKPHTKIAKAGILPGKDIPDERNGIPVDATGAIMFVCSGSDKHEDKEVLISKCPACSELNYFYWDSANSQFVCFACTKPVDNAVVKCGECGKQPLKVRTRATP
Ga0066710_1009258633 | 3300009012 | Grasslands Soil | VLALLGFASCGTTGEEPTIVQKSHPRINKAAILPSRDIPAERDGIPIDATGAIMFVCSGSEKHEDKEVLISKCPSCSENNYFYWDSHNSQFICFACTKAVDNAIVKCSECGRPPNKVRTRATAK
Ga0111539_100471616 | 3300009094 | Populus Rhizosphere | MNRLIAALLVLGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0105091_102198801 | 3300009146 | Freshwater Sediment | TIVKKPHTKVAKASILPGKDIPSERNGIPVDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSEVAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0114129_108266762 | 3300009147 | Populus Rhizosphere | MNRLIAALLVLGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSETNYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0111538_103324333 | 3300009156 | Populus Rhizosphere | MKNLPVALLILGLVGCASSGDPAIVKKPHTRVAKAGILPGKDIPDERNGIPLDATGAIMFVCAGSEKHEDKEVLISKCPACSEVNYFYWDAANSQFVCFACTKPVDNALVKCGECGKQPLKVRTRATPK*
Ga0105347_14477821 | 3300009609 | Soil | MNRIVAALLVVGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPVDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSEVAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGKEAFGNSAQRP
Ga0134124_126053982 | 3300010397 | Terrestrial Soil | MIMKNLPAALLILGLVGCASSGEPHIVQKPHTRIAKASILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFICFACTKPVDNAIVKCGECGKQPLKV
Ga0134124_128691331 | 3300010397 | Terrestrial Soil | ASHGEQPEIVPKPHTKIAKASTLPSKDIPSERDGVPADQTGAIMFVCANSEKHEDKEVFISKCPSCSELNYFYWNSHNSEFVCYACTKAVDSAAVKCPECGRPPRIVRTRPQAK*
Ga0126383_125366511 | 3300010398 | Tropical Forest Soil | EKLIGALALLTLASCASTDDPTIVKKPHTRVAKAGILPSKDIPSERNGIPIDSTGAIMFVCSGSDKHEDKEVLISKCPSCSEVNYFYWDSAGSHFTCFACEKALDNAAVKCPDCGKQPHKVRTRAIGK*
Ga0134127_100141305 | 3300010399 | Terrestrial Soil | MKNLPVALLILGLVGCASSGDPAIVKKPHTRVAKAGILPGKDIPDERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPACSEANYFYWDSANAQFVCFACTKPVDNAVVKCGECGKQPLKVRTRATPK*
Ga0134122_100155585 | 3300010400 | Terrestrial Soil | MKKLIGALVLCSMAACASHGEQPEIVAKPHTKIAKAGILPSKDIPEERDGIPKDQTGAIMFVCAGSEKHEDKEVFISKCPSCSELNYFYWNSHNSEFVCYACTKAIDSASVKCPECGRPPRIVRTRPQAK*
Ga0134122_100901643 | 3300010400 | Terrestrial Soil | MKKLLGALALLTLASCATETEPTIVKKPHTRVAKAGILPSKDIPSERDGIPIDATGAIMFVCSNSDKHEDKEVLISKCPSCSEQNYFYWDAANSQFTCFACTKAVDNAVVKCPECGKQPHKVRTRATAK*
Ga0134122_101117961 | 3300010400 | Terrestrial Soil | MKNLPVALLVLGLVGCATSEPTIVKKPHTRVAKAGILPGKDIPNERDGIPLDATGAIMFVCAGSDKHEDKEVLISKCPACSEVNYFYWNAADSQFVCFACTKPVDNSLVKCGECGKQPLKVRTRATPK*
Ga0134122_103048552 | 3300010400 | Terrestrial Soil | MKRLTAALLCLGLVSCASSGEPTIVQKPHATRVAKASILPGGKDIPSERNGIPVDATGAIMFVCANSDKHEDKEVLISKCPACSEINYFYWDSAGSQFVCFACTKAVDNAVVKCPECGAQPHKVRTRATGK*
Ga0134121_1000008117 | 3300010401 | Terrestrial Soil | MKNLPVALLILGLVGCATGEPTIVKKPHTRVAKAGILPGKDIPNERDGIPLDATGAIMFVCAGSDKHEDKEVLISKCPACSEVNYFYWNAADSQFVCFACTKPVDNSLVKCGECGKQPLKVRTRATPK*
Ga0134121_109873171 | 3300010401 | Terrestrial Soil | MKRLAGVLLFSTLAACASRGEQPEIVAKPHTKIAKAGILPSREIPSERDGIPADQTGAIMFVCAGSDKHEDKEVFISKCPSCGEVNYFYWNSHNSEFVCFACTKAIDSSAVKCPECGRPPRLVRTRPQAK*
Ga0134123_101271131 | 3300010403 | Terrestrial Soil | AACATTGEEPSIVKKPHLKVNKAGLFPSRDIPAERDGIPADATGAIMFVCSGSDKHEDKEVLISKCPSCSENNYFYWDSHNSQFVCFACTKAVDNALVKCSECGRPPHKVRTRATGK*
Ga0137426_11202721 | 3300011435 | Soil | RITNDIWGRVMSRIVAALLVVGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPVDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSESAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0137458_10562621 | 3300011436 | Soil | MTMTRILSALLILGLAACASTGQEPSIVKKPHPRIAKAGILPSRDIPAERDGIPLDATGAIMFVCSNSDKHEDKEVLISKCPSCSENNYFYWDAAGSQFSCFACTKAVDNASVKCPECGKQPHKVRTRATAK*
Ga0137429_11235922 | 3300011437 | Soil | MSRIVAALLVVGLVGCASSGDPTIVKKPHTRIAKAGILPGKDIPNERNGIPLDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSESAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0137430_11500002 | 3300012041 | Soil | MSRIVAALMVVGLVGCASSGDPTIVKKPHTRVAKAGILPGKDIPNERNGIPLDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSEAAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK*
Ga0150985_1070469371 | 3300012212 | Avena Fatua Rhizosphere | VEYRFSTTVSEPPGAHAPWPAAKLLCASHGEQPEIVAKPHTKIAKAGILPSKEIPAERDGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWNSHNSEFVCFACTKAIDSAAVKCPECGRPPRIVRTRPQAK*
Ga0137397_110959731 | 3300012685 | Vadose Zone Soil | MKLKLMAGALVRCSLAACASRGEQPEIVAKPHTKIAKASILPSKEIPAERDGMPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWNSHNSEFVCFACTKATDSAAVKCPECGRPPRIVRTRPQAK*
Ga0157284_101289701 | 3300012893 | Soil | MNRFTAALLVLGLVGCASSGDPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDADNSGFLCYACTKAVDNAYVKCPDCGKQPLKVRTPEFAAWIREQEADVALVIA
Ga0137410_1000024515 | 3300012944 | Vadose Zone Soil | MKRIAALLTICSLAACASKGEQPEVVAKPHTKIAKASILPSREIPEERDGIPKDQTGAIMFVCANSDKHEDKEVFISKCPSCGELNYFYWNSGNSEFVCFACTKAIDSAQVKCPECGKPPRIVRTRPQAK*
Ga0126369_128568842 | 3300012971 | Tropical Forest Soil | DDPTIVKKPHTRVAKAGILPSKEIPSERNGIPIDSTGAIMFVCSGSEKHEDKEVLISKCPSCSEQNYFYWDSAGSQFTCFACEKPLNNALVKCPDCGKQPHKVRTRATAK*
Ga0182036_110251572 | 3300016270 | Soil | MKKLLSGLSLLLLASCASTDDPTIVKKPHTRVAKAGILPSKDIPAERDGIPIDATGAIMFVCSGSEKHEDKEVLISKCPSCSEQNYFYWDSAGSHFTCFACTKALDNALVKCPDCGKQPSRVRTRAQAK
Ga0182034_116477172 | 3300016371 | Soil | MKNLPVALLILGLVACASSGEPTIVPKPHTRVAKAGLLPGKDLPSERDGIPLEATGAIMFVCAGSDRHEDKEVLITKCPSCSEQNYFYWDSANSQFVCFACTKPVDNAVVKCGECGKQPL
Ga0190266_101282662 | 3300017965 | Soil | MKNLVGALMLCSLAACASHGEQPEIVPKPHTKIAKASILPSKDIPAERDGVPADQTGAIMFVCANSEKHEDKEVFISKCPACSELNYFYWNSHNSEFVCFACTKAMDSTAVKCPECGRPPRIVRTRPQAK
Ga0066669_104610662 | 3300018482 | Grasslands Soil | LVLASCASSGEPTIVKKPHTRVAKAGILPSKDIPSERDGIPIDATGSIMFVCSGSEKHEDKEVLITKCPSCSEQNYFYWDSAGSHFTCYACTNALDNAAVKCPDCGKQPHKVRTRAQAK
Ga0187894_101384573 | 3300019360 | Microbial Mat On Rocks | MKNLPVALLILGLVGCASSAEPTIVKKPHTRIAKAGILPSKELPNERDGIPLDATGAIMFVCAGSDKHEDKEVLISKCPTCSEQNYFYWDSGNSQFVCFACTKPVDNAVVKCGDCGKQPLKVRTRATPK
Ga0173479_102357052 | 3300019362 | Soil | MNRFTAALLVLGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATSK
Ga0179590_11383491 | 3300020140 | Vadose Zone Soil | MKLKLMAGALVLCSLAACASHGEQPEIVAKPHTKIAKASILPSKEIPVERDGIPADQTGAIMFVCSGSEKHEDKEVFISKCPSCGELNYFYWNSHNSEFVCYACTKAIDSAAVKCPECGRPPRIVRTRPQAK
Ga0196964_100582232 | 3300020202 | Soil | MNRLIAALLVLGLASCASTGEPTIVKKPHTRVAKASLLPSRDIPNERNGIPVDATGAIMFVCANSDKHEDKEVLISKCPACSENNYFYWDSDASQFNCFACTKPVDNAHVKCPDCGKPPMKVRTRATSK
Ga0213876_102367991 | 3300021384 | Plant Roots | MKIKSLVGALVLGSLAACASHGEQPEIVPKPHTKIAKASILPSKDIPSERDGIPADQTGAIMFVCSGSEKHEDKEVFISKCPSCGELNYFYWNSHNSEFVCFACTKAIDNAV
Ga0247748_10053563 | 3300023168 | Soil | MKRLAGALLLCSLAACASRGEQPDIVVKPHTKIAKAGILPSKEIPAERDGIPADQTGAIMFVCAGSDKHEDKEVFISKCPSCSEVNYFYWNSHNSEFVCYACTKAIDSSVVKCPECGRPPRIVRTRPQAK
Ga0207646_100815114 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MRNLCGALSLLVLASCASSGEPQIVKKPHTRIAKAGILPSKEIPSERDGIPIDATGAIMFVCANSEKHEDKEVLISKCPSCSEQNYFYWDSAGSHFTCFACTKVLDNAAVKCPDCGKQPHKVRTRAQAK
Ga0207670_107119632 | 3300025936 | Switchgrass Rhizosphere | MKNLPVALLILGLVGCATGEPTIVKKPHTRVAKAGILPGKDIPNERDGIPLDATGAIMFVCAGSDKHEDKEVLISKCPACSEVNYFYWNAADSQFVCFACTKPVDNSLVKCGECGKQPLKVRTRATPK
Ga0207683_107817082 | 3300026121 | Miscanthus Rhizosphere | MKRLAGALLFCTLAACASRGEQPEIVAKPHTKIAKAGILPSKEIPAERDGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGEVNYFYWNSHNSEFVCYACTKAIDSSVVKCPECGRPPRIVRTRPQAK
Ga0207683_117066381 | 3300026121 | Miscanthus Rhizosphere | MKKLIGVLSLLVLASCASSGDPTIVKKPHTRVAKAGILPSKDIPSERDGIPIDATGAIMFVCSGSEKHEDKEVLITKCPSCSEQNYFYWDSPGQHFTCFACTKALDNAVVKCPDCGKQPHKVRTRAQAK
Ga0209579_1000088412 | 3300027869 | Surface Soil | MIQKIAGILLLCSLAACAAHGDQPEVVPKPHTKIAKAGLFPSKEIPAERDGIPADSTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWNSTTSEFICFACTKAIDSSLVKCPECGRPPRLVRTRPQAK
Ga0209579_104118352 | 3300027869 | Surface Soil | MKPKLMAGAWVLCSLAACASHGEQPEIVAKPHTKIAKASILPTKEIPSERDGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWDSHNSEFVCFACTKAIDSALVKCPECGRPPRLVRTRPQAK
Ga0207428_102617181 | 3300027907 | Populus Rhizosphere | MNRLIAALLVLGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATSK
Ga0209168_102172472 | 3300027986 | Surface Soil | MIKTTAGAFLLCALAACASHGEQPEIVVKPHTKIAKAGILPSREIPAERNGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWDSNASEFVCYACTKPVDSSIVKCPECGKPPRIVRTRPQAK
Ga0308197_102474791 | 3300031093 | Soil | MKLKLTVGALVLSSLAACASRGEQPEIVAKPHTKIAKASILPSRDIPAERDGIPADQTGAIMFVCAGSEKHEDKEVFISKCPSCAELNYFYWNSHNSEFVCFACTKAIDSAAVKCPECGRPPRIVRTRPQAK
Ga0310888_105048381 | 3300031538 | Soil | MNRFTAALLVLGLVGCASSGDPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSETNYFYWDSDASQFNCFACTKAVGNAVVKCPDCGKQPLKVRTRATSK
Ga0310813_100036955 | 3300031716 | Soil | MKKLIGVLSLLVLASCASSGEPTIVKKPHTRVAKAGILPSKDIPAERDGIPIDATGAIMFVCSGSEKHEDKEVLISKCPSCSEVNYFFWDSAGSHFTCYACTKALDNALVKCPDCGKQPHKVRTRAQAK
Ga0310813_101009135 | 3300031716 | Soil | MKKLIGALSLLVLASCASSGEPTIVKKPHTRVAKAGILPSKDIPAERDGIPIDATGSIMFVCSGSDKHEDKEVLITKCPSCSEQNYFYWDSAGSHFTCYACTKALDNALVKCPDCGKQPHKVRTRAQAK
Ga0310813_101759094 | 3300031716 | Soil | IVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDADNSGFLCYACTKAVDNAYVKCPDCGKQPLKVRTRATSK
Ga0310813_116228161 | 3300031716 | Soil | MKRLAGALLLCSLAACASRGEQPDIVVKPHTKIAKAGILPSKEIPSERDGIPADQTGAIMFVCAGSDKHEDKEVFISKCPSCSEVNYFYWNSHNSEFVCYACTKAIDSSVVKCPECGRPPRIVRTRPQAK
Ga0307469_108063052 | 3300031720 | Hardwood Forest Soil | MKRMICALSILAVAGCASSSEPHIVQKPHTRIAKAGILPSKEIPSERDGIPIDATGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWNSHNSEFVCFACTKAIDSAAVKCPECGRPPRIVRTRPQAK
Ga0307468_1005615062 | 3300031740 | Hardwood Forest Soil | MKQMLAALLCLGLVSCASSGEPTIVKKPHTRVAKAGILPSRDIPSERDGIPIDATGAIMFVCAGSDKHEDKEVLISKCPACSELNYFYWDSGNSQFVCFACAKPVDNAVVKCGDCGKQPLKVRTRATPK
Ga0307473_102810372 | 3300031820 | Hardwood Forest Soil | MKRTIGALLILALGACASSGDPSIVAKPHPKIAKAGILPSKELPEERDGVPKDATGAIMFICANSDKHEDKEVLISRCPSCSETNYFYMDHHTSYYVCFACTKTLDPAVIKCPDCGKQPRLLRTRAINK
Ga0310892_111028421 | 3300031858 | Soil | MNRFTAALLVLGLVGCASSGDPTIVKKPHTKVAKASILPGKDIPSERNGVPVDATGAIMFVCANSEKHEDKEVLISKCPACSEINYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK
Ga0310900_105329391 | 3300031908 | Soil | VVGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSEINYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATSK
Ga0306921_109542981 | 3300031912 | Soil | IVPKPHTKVARAGILPSREIPAERDGIPADQTGAIMFVCSGSERHEDKEVFISKCPSCGELNYFYWNTHNSEFVCYACTKAVDSTVVKCPECGRPPRIVRTRPQAK
Ga0306926_126857032 | 3300031954 | Soil | MKNLPVALLILGLVACASSGEPTIVPKPHTRVAKAGLLPGKDLPSERDGIPLEATGAIMFVCAGSDRHEDKEVLITKCPSCSEQNYFYWDSANSQFVCFACTKPVDNAVVKCGDCGKQPLKVRTRAIPK
Ga0307479_102032293 | 3300031962 | Hardwood Forest Soil | MKLKLIAGALVLSCLAACASHGEQPEIVAKPHTKIAKASILPTKEIPSERDGIPVDQTGAIMFVCAGSEKHEDKEVFISKCPSCGELNYFYWNSQNSEFVCFACTKAIDNSAVKCPECGRPPRLVRTRPQAK
Ga0318533_113780391 | 3300032059 | Soil | MKNLPVALLILGLVACASSGEPTIVPKPHTRVAKAGLLPGKDLPSERDGIPLEATGAIMFVCAGSDRHEDKEVMITKCPSCSEQNYFYWDSANSQFVCFACTKPVDNAVVKCGDC
Ga0310890_103881552 | 3300032075 | Soil | MNRFTAALLVLGLVGCASSGDPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDAAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATGK
Ga0315281_100630064 | 3300032163 | Sediment | MKRLLGTMLVLGLASCATTGEEPTIVKKPHLKVAKAGILPSREIPAERDGIPADATGAIMFVCSGSEKHEDKEVLISKCPSCSENNYFYWDTHNSQFICFACTKAVDNAIVKCPDCGRPPHKVHTRPTGK
Ga0310812_100213573 | 3300032421 | Soil | MKNLPVALLILGLVGCASSGEPHIVQKPHTRIAKASILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFVCFACTKPVDNAIVKCGECGKQPLKVRTRATPK
Ga0310810_102084943 | 3300033412 | Soil | LSPAVYIVQKPHTRIAKAGILPGKDIPNERNGIPIDATGAIMFVCAGSDKHEDKEVLISKCPDCSEQNYFYWDSANSQFVCFACTKPVDNAIVKCGECGKQPLKVRTRATPK
Ga0314780_224202_163_501 | 3300034659 | Soil | SSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSEINYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATSK
Ga0314781_094772_188_577 | 3300034660 | Soil | MNRFVAALLVVGLVGCASSGEPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSETNYFYWDSDASQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATSK
Ga0314782_157269_5_394 | 3300034661 | Soil | MNRFTAALLVLGLVGCASSGDPTIVKKPHTKVAKASILPGKDIPSERNGVPVDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDADNSGFLCYACTKAVDNAYVKCPDCGKQPLKVRTRATGK
Ga0314783_155060_13_402 | 3300034662 | Soil | MNRFTAALLVLGLVGCASSGDPTIVKKPHTKVAKASILPGKDIPSERNGVPVDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDADNSGFLCYACTKAVDNAYVKCPDCGKQPLKVRTRATSK
Ga0314792_189465_235_570 | 3300034667 | Soil | SGDPTIVKKPHTKVAKASILPGKDIPSERNGVPVDATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDADNSGFLCYACTKAVDNAYVKCPDCGKQPLKVRTRATSK
Ga0314793_151450_9_398 | 3300034668 | Soil | MNRFTAALLVLGLVGCASSGDPTIVKKPHTKVAKASILPGKDIPSERNGIPADATGAIMFVCANSEKHEDKEVLISKCPACSENNYFYWDSDAAQFNCFACTKAVDNAVVKCPDCGKQPLKVRTRATSK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice