NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F086706


Overview

Basic Information
Family ID F086706
Family Type Metagenome
Number of Sequences 110
Average Sequence Length 70 residues
Representative Sequence MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAPGEVVAAPNAGVRVAR
Number of Associated Samples 102
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 88.18 %
% of genes near scaffold ends (potentially truncated) 18.18 %
% of genes from short scaffolds (< 2000 bps) 73.64 %
Associated GOLD sequencing projects 98
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.42
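The percentages in the quality assessment can be turned back into approximate gene counts. The sketch below assumes each percentage is taken over the 110 family sequences (one gene per sequence); that denominator is an assumption, not something the page states, and the function name `genes_from_percent` is purely illustrative.

```python
def genes_from_percent(percent: float, total: int) -> int:
    """Convert a reported percentage back to a whole-gene count."""
    return round(percent / 100 * total)


# Assumption: the denominator is "Number of Sequences" from Basic Information.
FAMILY_SIZE = 110

# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 88.18,
    "near scaffold end": 18.18,
    "short scaffold (< 2000 bps)": 73.64,
}

counts = {
    label: genes_from_percent(pct, FAMILY_SIZE)
    for label, pct in reported.items()
}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to whole-number gene counts, which is a quick sanity check that the percentages share the same denominator.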


Most Common Taxonomy
Group Bacteria (69.091 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(17.273 % of family members)
Environment Ontology (ENVO) Unclassified
(44.545 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(38.182 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification Globular
Signal Peptide No
Secondary Structure distribution α-helix: 43.43%, β-sheet: 0.00%, Coil/Unstructured: 56.57%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.42

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
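The confidence rule quoted above can be expressed as a small check. This is only a sketch: the 0.7 cutoff and the pLDDT band edges are taken from this page, while the function names and the handling of values that fall exactly on a band edge (treated here as belonging to the lower band) are assumptions.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the "Low Quality Model" note


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value (0-100) to the bands listed on the page.

    Assumption: band edges are upper-inclusive, e.g. 50.0 is in "0-50".
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The pTM-score reported for family F086706.
print(is_low_confidence(0.42))
```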



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name            % Frequency in 110 Family Scaffolds
PF02805  Ada_Zn_binding  26.36
PF08238  Sel1            4.55
PF08281  Sigma70_r4_2    4.55
PF08240  ADH_N           1.82
PF02566  OsmC            0.91
PF00593  TonB_dep_Rec    0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name  Functional Category  % Frequency in 110 Family Scaffolds
COG2169  Methylphosphotriester-DNA--protein-cysteine methyltransferase (N-terminal fragment of Ada), contains Zn-binding and two AraC-type DNA-binding domains  Replication, recombination and repair [L]  26.36
COG1764  Organic hydroperoxide reductase OsmC/OhrA  Defense mechanisms [V]  0.91
COG1765  Uncharacterized OsmC-related protein  General function prediction only [R]  0.91



Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
All Organisms  root  All Organisms  69.09 %
Unclassified   root  N/A            30.91 %


Associated Scaffolds


Scaffold  Taxonomy  Length  IMG/M Link
3300001661|JGI12053J15887_10396751Not Available663Open in IMG/M
3300003994|Ga0055435_10005572All Organisms → cellular organisms → Bacteria2145Open in IMG/M
3300004058|Ga0055498_10138468Not Available523Open in IMG/M
3300004156|Ga0062589_100076398Not Available2020Open in IMG/M
3300004157|Ga0062590_100046512All Organisms → cellular organisms → Bacteria2368Open in IMG/M
3300005205|Ga0068999_10026440All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium914Open in IMG/M
3300005336|Ga0070680_100529652All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1009Open in IMG/M
3300005343|Ga0070687_100498859Not Available819Open in IMG/M
3300005345|Ga0070692_10084866All Organisms → cellular organisms → Bacteria1712Open in IMG/M
3300005345|Ga0070692_11353996Not Available512Open in IMG/M
3300005468|Ga0070707_100311191All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1531Open in IMG/M
3300005471|Ga0070698_100724599All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium937Open in IMG/M
3300005546|Ga0070696_100058246All Organisms → cellular organisms → Bacteria2698Open in IMG/M
3300005829|Ga0074479_10598820All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium994Open in IMG/M
3300006881|Ga0068865_101611414Not Available584Open in IMG/M
3300007004|Ga0079218_10258113All Organisms → cellular organisms → Bacteria1381Open in IMG/M
3300007255|Ga0099791_10154006All Organisms → cellular organisms → Bacteria1074Open in IMG/M
3300009038|Ga0099829_10005672All Organisms → cellular organisms → Bacteria7612Open in IMG/M
3300009089|Ga0099828_10002811All Organisms → cellular organisms → Bacteria12065Open in IMG/M
3300010399|Ga0134127_10028831All Organisms → cellular organisms → Bacteria4408Open in IMG/M
3300011270|Ga0137391_10606372All Organisms → cellular organisms → Bacteria917Open in IMG/M
3300011419|Ga0137446_1102838All Organisms → cellular organisms → Bacteria680Open in IMG/M
3300011427|Ga0137448_1089855All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium822Open in IMG/M
3300011427|Ga0137448_1112180All Organisms → cellular organisms → Bacteria742Open in IMG/M
3300011436|Ga0137458_1023980Not Available1520Open in IMG/M
3300012035|Ga0137445_1103801All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium577Open in IMG/M
3300012040|Ga0137461_1032232All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1375Open in IMG/M
3300012133|Ga0137329_1054764Not Available525Open in IMG/M
3300012171|Ga0137342_1129704Not Available537Open in IMG/M
3300012225|Ga0137434_1019275All Organisms → cellular organisms → Bacteria879Open in IMG/M
3300012226|Ga0137447_1033876Not Available860Open in IMG/M
3300012355|Ga0137369_10515973All Organisms → cellular organisms → Bacteria842Open in IMG/M
3300012360|Ga0137375_10119741All Organisms → cellular organisms → Bacteria2633Open in IMG/M
3300012685|Ga0137397_10064403All Organisms → cellular organisms → Bacteria2647Open in IMG/M
3300012922|Ga0137394_10046539All Organisms → cellular organisms → Bacteria3569Open in IMG/M
3300012927|Ga0137416_11607775All Organisms → cellular organisms → Bacteria592Open in IMG/M
3300012929|Ga0137404_11112400All Organisms → cellular organisms → Bacteria725Open in IMG/M
3300012930|Ga0137407_10031440All Organisms → cellular organisms → Bacteria → Proteobacteria4141Open in IMG/M
3300012930|Ga0137407_10529241All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1102Open in IMG/M
3300014269|Ga0075302_1026120All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1074Open in IMG/M
3300014308|Ga0075354_1032991All Organisms → cellular organisms → Bacteria889Open in IMG/M
3300014324|Ga0075352_1095172Not Available774Open in IMG/M
3300014873|Ga0180066_1057638All Organisms → cellular organisms → Bacteria774Open in IMG/M
3300014881|Ga0180094_1076819All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium745Open in IMG/M
3300014885|Ga0180063_1239447Not Available581Open in IMG/M
3300015241|Ga0137418_10600030Not Available864Open in IMG/M
3300015254|Ga0180089_1010990Not Available1574Open in IMG/M
3300015256|Ga0180073_1013965Not Available1395Open in IMG/M
3300015259|Ga0180085_1060944All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1086Open in IMG/M
3300017997|Ga0184610_1211497Not Available649Open in IMG/M
3300018000|Ga0184604_10198230All Organisms → cellular organisms → Bacteria687Open in IMG/M
3300018027|Ga0184605_10020471All Organisms → cellular organisms → Bacteria2647Open in IMG/M
3300018028|Ga0184608_10090914Not Available1264Open in IMG/M
3300018031|Ga0184634_10275358All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium772Open in IMG/M
3300018054|Ga0184621_10150176Not Available839Open in IMG/M
3300018059|Ga0184615_10308723All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium881Open in IMG/M
3300018075|Ga0184632_10031529All Organisms → cellular organisms → Bacteria2264Open in IMG/M
3300018078|Ga0184612_10260922All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium893Open in IMG/M
3300018079|Ga0184627_10010010All Organisms → cellular organisms → Bacteria4427Open in IMG/M
3300018084|Ga0184629_10339648All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium791Open in IMG/M
3300018422|Ga0190265_10023611All Organisms → cellular organisms → Bacteria4961Open in IMG/M
3300018422|Ga0190265_10255857All Organisms → cellular organisms → Bacteria1800Open in IMG/M
3300018422|Ga0190265_10643195All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium RIFCSPHIGHO2_02_FULL_69_131181Open in IMG/M
3300018429|Ga0190272_10780171All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium876Open in IMG/M
3300019360|Ga0187894_10342566All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium678Open in IMG/M
3300019458|Ga0187892_10026875All Organisms → cellular organisms → Bacteria4954Open in IMG/M
3300019487|Ga0187893_10067838All Organisms → cellular organisms → Bacteria3409Open in IMG/M
3300019869|Ga0193705_1084915All Organisms → cellular organisms → Bacteria604Open in IMG/M
3300019879|Ga0193723_1039816All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1399Open in IMG/M
3300019881|Ga0193707_1026090All Organisms → cellular organisms → Bacteria1921Open in IMG/M
3300019883|Ga0193725_1009671All Organisms → cellular organisms → Bacteria2744Open in IMG/M
3300019997|Ga0193711_1005286Not Available1614Open in IMG/M
3300020002|Ga0193730_1042731Not Available1312Open in IMG/M
3300020003|Ga0193739_1004809All Organisms → cellular organisms → Bacteria3642Open in IMG/M
3300020018|Ga0193721_1140584All Organisms → cellular organisms → Bacteria → Proteobacteria591Open in IMG/M
3300020060|Ga0193717_1128783All Organisms → cellular organisms → Bacteria769Open in IMG/M
3300022694|Ga0222623_10033325All Organisms → cellular organisms → Bacteria1960Open in IMG/M
3300025324|Ga0209640_10978324Not Available653Open in IMG/M
3300025899|Ga0207642_10668448Not Available652Open in IMG/M
3300025907|Ga0207645_10039386All Organisms → cellular organisms → Bacteria3028Open in IMG/M
3300025917|Ga0207660_10020226All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4464Open in IMG/M
3300025922|Ga0207646_10399459All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1241Open in IMG/M
3300025957|Ga0210089_1020836Not Available767Open in IMG/M
3300026285|Ga0209438_1000260All Organisms → cellular organisms → Bacteria15406Open in IMG/M
3300026320|Ga0209131_1003137All Organisms → cellular organisms → Bacteria11048Open in IMG/M
3300026360|Ga0257173_1000884All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2236Open in IMG/M
3300026371|Ga0257179_1001694Not Available1653Open in IMG/M
(restricted) 3300027799|Ga0233416_10018477All Organisms → cellular organisms → Bacteria2294Open in IMG/M
3300027815|Ga0209726_10086404All Organisms → cellular organisms → Bacteria1955Open in IMG/M
3300028536|Ga0137415_10224927All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1690Open in IMG/M
3300028596|Ga0247821_10710299Not Available657Open in IMG/M
3300028791|Ga0307290_10365223Not Available528Open in IMG/M
3300028792|Ga0307504_10368264All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium558Open in IMG/M
3300028812|Ga0247825_10063150All Organisms → cellular organisms → Bacteria2471Open in IMG/M
3300028819|Ga0307296_10124802Not Available1385Open in IMG/M
3300028828|Ga0307312_10587250All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium737Open in IMG/M
3300028828|Ga0307312_10910199All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium583Open in IMG/M
3300030606|Ga0299906_10506827Not Available924Open in IMG/M
(restricted) 3300031150|Ga0255311_1128381Not Available557Open in IMG/M
(restricted) 3300031197|Ga0255310_10000380All Organisms → cellular organisms → Bacteria10219Open in IMG/M
3300031720|Ga0307469_10017468All Organisms → cellular organisms → Bacteria3786Open in IMG/M
3300031720|Ga0307469_10465066All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1099Open in IMG/M
3300031740|Ga0307468_101815956All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium578Open in IMG/M
3300033233|Ga0334722_10387653All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1010Open in IMG/M
3300033417|Ga0214471_10001650All Organisms → cellular organisms → Bacteria17610Open in IMG/M
3300033417|Ga0214471_11308622Not Available548Open in IMG/M
3300033513|Ga0316628_101031819Not Available1092Open in IMG/M
3300034155|Ga0370498_119815Not Available622Open in IMG/M
3300034164|Ga0364940_0054708Not Available1079Open in IMG/M
3300034177|Ga0364932_0265858Not Available649Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
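Each scaffold identifier above has the form `<number>|<scaffold name>`, and the numeric prefix appears to match the Taxon OID column of the Associated Samples table (for example, 3300001661). The sketch below splits a few identifiers copied from the table and tallies scaffolds per sample; the helper name `split_scaffold_id` is hypothetical, and reading the prefix as a Taxon OID is an inference from this page rather than a documented format.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name


# A handful of identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300001661|JGI12053J15887_10396751",
    "3300005345|Ga0070692_10084866",
    "3300005345|Ga0070692_11353996",
    "3300018422|Ga0190265_10023611",
    "3300018422|Ga0190265_10255857",
    "3300018422|Ga0190265_10643195",
]

# Count how many family scaffolds come from each sample.
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, n in per_sample.most_common():
    print(taxon_oid, n)
```

Several samples contributing more than one scaffold is consistent with the page listing 110 scaffolds but only 102 associated samples.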




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  17.27%
Soil  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil  14.55%
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  12.73%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  10.00%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  5.45%
Natural And Restored Wetlands  Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands  3.64%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  2.73%
Natural And Restored Wetlands  Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands  2.73%
Sediment  Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment  1.82%
Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil  1.82%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  1.82%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  1.82%
Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil  1.82%
Soil  Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil  1.82%
Corn Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere  1.82%
Sandy Soil  Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil  1.82%
Microbial Mat On Rocks  Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks  1.82%
Sediment  Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment  1.82%
Miscanthus Rhizosphere  Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere  1.82%
Groundwater  Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater  0.91%
Sediment (Intertidal)  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)  0.91%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  0.91%
Terrestrial Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil  0.91%
Agricultural Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil  0.91%
Soil  Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil  0.91%
Untreated Peat Soil  Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil  0.91%
Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil  0.91%
Switchgrass Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere  0.91%
Soil  Environmental → Terrestrial → Soil → Loam → Unclassified → Soil  0.91%
Bio-Ooze  Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze  0.91%
Miscanthus Rhizosphere  Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere  0.91%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID  Sample Name  Habitat Type  IMG/M Link
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300003994Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004058Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300005205Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005343Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaGEnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005829Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBCEnvironmentalOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011419Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT357_2EnvironmentalOpen in IMG/M
3300011427Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT418_2EnvironmentalOpen in IMG/M
3300011436Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT642_2EnvironmentalOpen in IMG/M
3300012035Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2EnvironmentalOpen in IMG/M
3300012040Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT746_2EnvironmentalOpen in IMG/M
3300012133Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT121_2EnvironmentalOpen in IMG/M
3300012171Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT466_2EnvironmentalOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012226Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2EnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014269Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D1EnvironmentalOpen in IMG/M
3300014308Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1EnvironmentalOpen in IMG/M
3300014324Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D1EnvironmentalOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014881Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1DaEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015254Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10DEnvironmentalOpen in IMG/M
3300015256Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT333_16_10DEnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019869Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3m2EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2EnvironmentalOpen in IMG/M
3300019997Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m2EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2EnvironmentalOpen in IMG/M
3300020018Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2EnvironmentalOpen in IMG/M
3300020060Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025957Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028596Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glycerol_Day14EnvironmentalOpen in IMG/M
3300028791Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300030606Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT145D125EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300034155Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17EnvironmentalOpen in IMG/M
3300034164Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17EnvironmentalOpen in IMG/M
3300034177Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17EnvironmentalOpen in IMG/M





Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences below requires a license from the corresponding author(s) of the respective datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12053J15887_103967512 | 3300001661 | Forest Soil | MDFNAYATEKIVAGRLADLRAARARTALLESAGVRPRGVGAAVGSAFIRLGRWLARGEAGAAPNAGVRVAR*
Ga0055435_100055724 | 3300003994 | Natural And Restored Wetlands | MDLNVYVTEKIVAVRLADLRAAGARAALIESARVGPRGMGPALGGALIRLGRWLAQGEVVAASNAGVRVAR*
Ga0055498_101384682 | 3300004058 | Natural And Restored Wetlands | MDLNIYATEMIAAGRLVELRAAGARAALADSARGGPRGVGPVVGSALIRLGRWLAQGEVVAASNAGVRVAR*
Ga0062589_1000763984 | 3300004156 | Soil | MDFNLYATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALIRLGHWLASDEAGAVSNPGVRVARRA*
Ga0062590_1000465122 | 3300004157 | Soil | MDFNPYATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALIRLGHWLASDEAGAVSNPGVRVARRA*
Ga0068999_100264401 | 3300005205 | Natural And Restored Wetlands | MDLNIYATEMIAAGRLVELRAAGARAALADSARGGPRGVGPVVGSALIRLGRWLAQGEVVAAS
Ga0070680_1005296521 | 3300005336 | Corn Rhizosphere | MDFNAYVVEKLTETRLADLRAAGARAALAESARVEPRGMGLAVGSALIRLGRWLAPGDVAAPNRGVRVGR*
Ga0070687_1004988591 | 3300005343 | Switchgrass Rhizosphere | ATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALIRLGHWLASDEAGAVSNPGVRVARRA*
Ga0070692_100848661 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | YATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALIRLGHWLASDEAGAVSNPGVRVARRA*
Ga0070692_113539962 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | MDINLYVTEKIVAERLAGLRAAGAQAALIRSARIGPRGVGPVLGEALIRLGHWLAPDDAGVPPNAGVRVARRA*
Ga0070707_1003111912 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MDFNTYAAEKMAEARLTDLRAAAARAALIESARVGPRGVGPAVGGALIRLGRWLPQGETVAAPNAGVRVAR*
Ga0070698_1007245992 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MDFNAYVVEKLAETRLADLRAARARAALAESARVEPHGVGPAVGSALIRLGRWLAQGEAVAAPNGGVRAGR*
Ga0070696_1000582464 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MDFNAYVVEKLTETRLADLRAAGARAALAESARVEPRGMGLAVGSALIRLGRWLAPGDVAAPNGGVRVGR*
Ga0074479_105988202 | 3300005829 | Sediment (Intertidal) | MDLNIYASEKIAAGRLVELRAAGARAALIELARVGPRGIGPALGGALIRVGRWLAQGEVVAASNAGVRVAR*
Ga0068865_1016114142 | 3300006881 | Miscanthus Rhizosphere | MDFNLYATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALLRLGHWLASDEAGAVSNPGVRVARRA
Ga0079218_102581133 | 3300007004 | Agricultural Soil | MDINTYALEKIANGRLTELRAAGARVALIESARIGPRGVGVAVGSALIRLGHWLAPAEEAAAPNA
Ga0099791_101540062 | 3300007255 | Vadose Zone Soil | MDFNLYATEKIAAGRLADLRAASAQVALLESAGVGPRGVGAAVGSAFIRLGRWLAPGEGVAPNAGVRVAR*
Ga0099829_100056727 | 3300009038 | Vadose Zone Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAPGEVVAAPNAGVRVAR*
Ga0099828_1000281110 | 3300009089 | Vadose Zone Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR*
Ga0134127_100288317 | 3300010399 | Terrestrial Soil | MDFNAYVVEKLTETRLADLRAAGARAALAEAARVEPRGMGPASGSALIRLGRWLAPGDVAAPNGGVRVGR*
Ga0137391_106063722 | 3300011270 | Vadose Zone Soil | MDFNTYATEKIVAGRLADLRATSAQVALLESAGVGPRGVGAAVGSAFIRLGRWLAPGEGVAAPNAGVRVAR*
Ga0137446_11028382 | 3300011419 | Soil | MDFNTYAVEKMMEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVGAAPNAGVRVAR*
Ga0137448_10898551 | 3300011427 | Soil | MDFNTYAVEKMMEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR*
Ga0137448_11121801 | 3300011427 | Soil | NTYAAEKMVEGRLADLRAAGARAALIASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR*
Ga0137458_10239803 | 3300011436 | Soil | DFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGKVGAAPNAGVRVAR*
Ga0137445_11038011 | 3300012035 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAPGEVVAAPNAGVRV
Ga0137461_10322321 | 3300012040 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALIASARVGPRGVGPAVGGALIRLGRWLAPGE
Ga0137329_10547641 | 3300012133 | Soil | MDFNTYAAEKMVEVRLADLRAAGAREALLASARVGPRGVGPAVGGALIRLGRWLAQGKVGAAANAGVRVAR*
Ga0137342_11297042 | 3300012171 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGVLIRLGRWLAPGEVVAAPNAGVRVAR*
Ga0137434_10192752 | 3300012225 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAPGEGVAAPNAGVRVAR*
Ga0137447_10338761 | 3300012226 | Soil | ERRAVMDFNTYTAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGKVGAGPNAGVRVAR*
Ga0137369_105159731 | 3300012355 | Vadose Zone Soil | MDFNIYAIEKIVAGRLADLRAAGARVVLLESAGVGPRGLGAAVGSTFIRLGRWLAPGERAAAPNAGVRVAR*
Ga0137375_101197412 | 3300012360 | Vadose Zone Soil | MDFNIYATEKIVAGRLADLRAAGARVVLLESAGVGPRGLGAAVGSTFIRLGRWLAPGEGAAAPNAGVRVAR*
Ga0137397_100644035 | 3300012685 | Vadose Zone Soil | MDFNLYATEKIAAGRLADLRAASAQAALIRTARIGPRGVGPVLGGALIRLGHWLAPDEAGAASNPGVRVARRA*
Ga0137394_100465392 | 3300012922 | Vadose Zone Soil | MDFNAYATEMIVAGRLADLRAVRARTALLESAGVRPRGVGAAVGSAFIRLGRRLARGEAGAAPNAGVRVAR*
Ga0137416_116077752 | 3300012927 | Vadose Zone Soil | ATEKIVAGRLADLRAAGARTALLESAGVRPRGVGAAVGSAFIRLGRWLARGEAGAAPNAGVRVAR*
Ga0137404_111124002 | 3300012929 | Vadose Zone Soil | MDFNAYATEKIVAGRLADLRAAGARTALLESAGVRPRGVGSAVGSAFIRLGRWLARGEAGAAPNAGVRVAR*
Ga0137407_100314406 | 3300012930 | Vadose Zone Soil | MDFNTYATEKIVAGRLADLRAARARAALLESAGVGPRGVGAAVGSAFIRLGRWLAPDEGVATPNAGVRVAR*
Ga0137407_105292412 | 3300012930 | Vadose Zone Soil | MDFNAYATEKIVAGRLADLRAVRARTALLESAGVRPRGVGSAVGSAFIRLGRWLARGEAGAAPNAGVRVAR*
Ga0075302_10261202 | 3300014269 | Natural And Restored Wetlands | MDLNVYVTEKIVAVRLADLRAAGARAALIESARVGPRGMGPALGGALIRLGRWLAQGEVVAASNAGVRGAR*
Ga0075354_10329912 | 3300014308 | Natural And Restored Wetlands | MDLNIYATEMIAAGRLVGLRAAGARAALADSARGGPRGVGPVVGSALIRLGRWLAQGEVVAASNAGVRVAR*
Ga0075352_10951722 | 3300014324 | Natural And Restored Wetlands | MDLNVYITEKIVAVRLADLRAAGARAALIESARVGPRGMGPALGGALIRLGRWLAQGEVVAASNAGVRVAR*
Ga0180066_10576382 | 3300014873 | Soil | MDFNTYAAEKMVEVRLADLRAAGAREALLASARVGPRGVGPAVGGALIRLGRWLAQGKVGAAPNAGVRVAR*
Ga0180094_10768192 | 3300014881 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGRVLIRLGRWLAEGEVVAVANAGVRAAR*
Ga0180063_12394472 | 3300014885 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGVLIRLGRWLAEGEVV
Ga0137418_106000302 | 3300015241 | Vadose Zone Soil | IVAGRLADLRAVRARTALLESAGVRPRGVGSAVGSAFIRLGRWLARGEAGAAPNAGVRVAR*
Ga0180089_10109902 | 3300015254 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVGAAPNAGVRVAR*
Ga0180073_10139653 | 3300015256 | Soil | MVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGKVGAGPNAGVRVAR*
Ga0180085_10609444 | 3300015259 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALIASARVGPRGVGPAVGGALIRLGRWLAPGEGVAAPNAGVRVAR*
Ga0184610_12114972 | 3300017997 | Groundwater Sediment | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0184604_101982301 | 3300018000 | Groundwater Sediment | MDFNLYATEKIAAGRLADLRAASARVVLLESAGVGPRELGAAVGSTFIRLGRWLAPGEGAAAPNAGVRVAR
Ga0184605_100204713 | 3300018027 | Groundwater Sediment | MDFNIYATEKIVAGHLADLRAAGARVVLLESAGVGPRGLGAAVGSAFIRFGRWLAPGEGAAAPNAGVRVAR
Ga0184608_100909141 | 3300018028 | Groundwater Sediment | MDFNTYATEKIVAGRLADLRAAGARVVLLESAGVGPRGLGAAVGSTFIRLGRWLAPGEVVAAPNAGVRVAR
Ga0184634_102753581 | 3300018031 | Groundwater Sediment | MDFNTYAVEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGSALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0184621_101501762 | 3300018054 | Groundwater Sediment | MDFNTYAAEKMVEVRLADLRAAGARAALIASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0184615_103087232 | 3300018059 | Groundwater Sediment | MDFNTYAVEKMMEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0184632_100315294 | 3300018075 | Groundwater Sediment | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPTVGGALIRLGRWLAPGEVVAAPNAGVRVAR
Ga0184612_102609221 | 3300018078 | Groundwater Sediment | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGRAVGGALIRLGRWLAPGEVVAAPNAGVRVAR
Ga0184627_100100106 | 3300018079 | Groundwater Sediment | MDFNTYAAEKMVEVRLADLRAAGARAGLLASARVGPRGVGPAVGSALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0184629_103396482 | 3300018084 | Groundwater Sediment | MEDNVYAIEVMVAQRLADLRAAGARAALVESGGSGHRGAAVVVGAGLIRLGRWLAQGEGVAAPNAGVRVAR
Ga0190265_100236114 | 3300018422 | Soil | MDGNLYATEKMAAARLAELRADRTRAALVESARGGRRGAGVAVGSALIRLGRWFAGGEAVAAPNAGVRVGR
Ga0190265_102558574 | 3300018422 | Soil | QLVQVRLADLRAAGARASLIESARVGPRGVRSALGGALIRLGHWLAPGEGVAAPNAGVRVAR
Ga0190265_106431953 | 3300018422 | Soil | MDINTYALEKIANGRLTELRAAGARVALIESVRIGPRGVGVAVGSALIRLGHWLAPAEEAAAPNAGVRVAR
Ga0190272_107801712 | 3300018429 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALIASARVGPRGVGPAVGGALIRLGRWLAQGDVVAAPNAGVRVAR
Ga0187894_103425662 | 3300019360 | Microbial Mat On Rocks | MDFNVYSVEKLVEVRLAELRAAGRRAALIESVRVGPRGLGPAVGGALIRLGHWLAQGEAVTAPNAGVRVAP
Ga0187892_100268757 | 3300019458 | Bio-Ooze | MDLNVYSVEKLVEVRLAELRAAGRRAALIESVRVGPRGMGAAVGGALIRLGHWLAQREAVTAPNAGVRVAP
Ga0187893_100678382 | 3300019487 | Microbial Mat On Rocks | MDLNVYSVEKLVEVRLAELRAAGRRAALIESVRVGPRGVGPAVGGALIRLGHWLAQREAVTAPNAGVRVAP
Ga0193705_10849152 | 3300019869 | Soil | DFNIYATEKIVAGHLADLRAAGARVVLLESAGVGPRGLGAAVGSTFIRLGRWLAPGEGAAAPNAGVRVAR
Ga0193723_10398162 | 3300019879 | Soil | MDFNLYATEKIAAGRLADLRSASAQAALVRAARIGPRGVAPVLGGALIRLGHWLAPDDAGVASNPGVRVARRA
Ga0193707_10260903 | 3300019881 | Soil | MDFNTYATEKIVAGRLADLRAASAQVALLEAAGVGPRGVGAAVGSAFIRLGRWLAPGEGVAAPNAGVRVAR
Ga0193725_10096713 | 3300019883 | Soil | MDFNTYATEKIVAGRLADLRAASAQVALLESAGVGPRGVGAAVGSAFIRLGRWLAPGEGVAAPNAGVRVAR
Ga0193711_10052863 | 3300019997 | Soil | MDFNLYATEKIAAGRLADLRSASAQAALVRAARIGPRGAALVLGGALIRLGHWLAPDEAGVASNPGVRVARRA
Ga0193730_10427312 | 3300020002 | Soil | MDFNIYATEKIVAGHLADLRAAGARVVLLESAGVGPRGLGAAVGSTFIRLGRWLAPGEGAAAPNAGVRVAR
Ga0193739_10048092 | 3300020003 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAPGEVVAAPNAGVRVAR
Ga0193721_11405842 | 3300020018 | Soil | MDFNTYATEKIVAGRLADLRAAGARVVLLESAGVGPRGLGAAVGSTFIRLGRWLAPGEGAAAPNAGVRVAR
Ga0193717_11287831 | 3300020060 | Soil | MDVNTYAIEKIADGRLADLRAAGARAALLESARLGPRGVGPAVGSALIRLGHWLAPDDAGAAPNAGVRVAR
Ga0222623_100333252 | 3300022694 | Groundwater Sediment | MDLNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0209640_109783242 | 3300025324 | Soil | MDFNTYAVEKMVEVRLADLRAAGARAALLASARVGPRGVGPAIGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0207642_106684482 | 3300025899 | Miscanthus Rhizosphere | MDFNLYATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALIRLGHWLASDEAGAVSNPGVRVARRA
Ga0207645_100393863 | 3300025907 | Miscanthus Rhizosphere | MDFNPYATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALIRLGHWLASDEAGAVSNPGVRVARRA
Ga0207660_100202267 | 3300025917 | Corn Rhizosphere | MDFNAYVVEKLTETRLADLRAAGARAALAESARVEPRGMGLAVGSALIRLGRWLAPGDVAAPNRGVRVGR
Ga0207646_103994592 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MDFNTYAAEKMAEARLTDLRAAAARAALIESARVGPRGVGPAVGGALIRLGRWLTQGETVAAPNAGVRVAR
Ga0210089_10208362 | 3300025957 | Natural And Restored Wetlands | MDLNIYATEMIAAGRLVELRAAGARAALADSARGGPRGVGPVVGSALIRLGRWLAQGEVVAASNAGVRVVR
Ga0209438_100026012 | 3300026285 | Grasslands Soil | MDFNLYATEKIVAGRLADLRAASAQAALIRTARIGPRGVGPVLGGALIRLGHWLAPDEAGAPSNPGVRVARRA
Ga0209131_10031378 | 3300026320 | Grasslands Soil | MDFNLYATEKIVAGRLADLRAASAQAALIRTARIGPRGVGPVLGGALIRLGHWLAPDEAGAASNPGVRVARRA
Ga0257173_10008844 | 3300026360 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAAVLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0257179_10016944 | 3300026371 | Soil | EKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR
(restricted) Ga0233416_100184772 | 3300027799 | Sediment | MDYNVYATEKIAAGRLADLRAVRERIALVESARGRRRGVGAAVGAALIRVGRWLAQDEGVAAANAGVRLAR
Ga0209726_100864042 | 3300027815 | Groundwater | MDFNIYAAEKMVEVRLADLRAAGARAALITSARVGPRGVGPAVGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0137415_102249273 | 3300028536 | Vadose Zone Soil | MDFNAYATEKIVAGRLADLRAARARTALLESAGVRPRGVGAAVGSAFIRLGRWLARGEAGAAPNAGVRVAR
Ga0247821_107102991 | 3300028596 | Soil | MDYNLYATEKIAAGRLADLRAASAQAALIRTARTGPRGVGPALGGALIRLGHWLASDEAGAVSNPGVRVARRA
Ga0307290_103652232 | 3300028791 | Soil | MDFNIYATEKIVAGHLADLRAAGARVVLLESAGVGPRGLGAAVGSAFIRLGRWLAPGEGAAAPNAGVRVAR
Ga0307504_103682641 | 3300028792 | Soil | MDFNLYATEKIAAGRLADLRAASAQAALIRTARVGPRGVGPVLGGALIRLGHWLAPDEVDVASNPGVR
Ga0247825_100631504 | 3300028812 | Soil | MDINLYVTEKIVAERLAGLRAAGAQAALIRSARIGPRGVGPVLGEALIRLGHWLAPDDAGVPPNAGVRVARRA
Ga0307296_101248023 | 3300028819 | Soil | MDFNTYATEKIVAGRLADLRAAGARVVLLESAGVGPRGLGAAVGSAFIRLGRWLAPGEGAAAPNAGVRVAR
Ga0307312_105872501 | 3300028828 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAPGEV
Ga0307312_109101991 | 3300028828 | Soil | MDFNTYATEKIVAGRLADLRAARARAALLESAGVGPRGLGAAVGSAFIRLGRWLAPDEGVAAPNAGVRVAR
Ga0299906_105068272 | 3300030606 | Soil | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAPGEGVAAPNAGVRVAR
(restricted) Ga0255311_11283811 | 3300031150 | Sandy Soil | MDLNTYATEKIAAGRLADLRAACARVALIESAGVGPRGVGPVVGSALIRLGHWLAPDEAVAAPNAGVRVAR
(restricted) Ga0255310_100003809 | 3300031197 | Sandy Soil | MDINLYVTEKIVADRLAGLRAAGAQAGLIRSARIGPRGVGPALGQALIRLGHWLAPDEAGGPPNPGVRVARRA
Ga0307469_100174683 | 3300031720 | Hardwood Forest Soil | MDFNLYATEKIAAGRLADLRSASAQAALVSAARIGPRGVALVLGGALIRLGHWLAPDEAGVASNPGVRVARRA
Ga0307469_104650662 | 3300031720 | Hardwood Forest Soil | MDLNTYAIEKIANGRLAELRAAGARVALIEAARIGPRGVGATVGSALIRLGHWLAPTEVAAPPNAGVRVVR
Ga0307468_1018159562 | 3300031740 | Hardwood Forest Soil | MDFNAYVVEKLAETRLADLRAACAQATLAGSARAEPRGVRPAVGSALIRLGRWLAPGEVAAPNGGVRVGR
Ga0334722_103876532 | 3300033233 | Sediment | MDVNTYAIEKLVAGRLADLRAVSARVALLRSARIGRRGVGVAVGAALIRLGHWLAPGEAVSVPNAGVRVAR
Ga0214471_100016509 | 3300033417 | Soil | MDFNTYAVEKMVEGRLADLRAAGARAALLASARVGPRGVGPAIGGALIRLGRWLAQGEVVAAPNAGVRVAR
Ga0214471_113086221 | 3300033417 | Soil | DFNTYAVEKMVEVRLADLRAAGARAALVESARVGPRGVGPAVGGALIRLGRWLASGEVVAAPNAGVRVAP
Ga0316628_1010318193 | 3300033513 | Soil | MDLNMYATEKIVAARLADLRVASARVALIESAGVGPRGVGPVVGSALIRLGHWLAPDEAVAAPNAEVRVAR
Ga0370498_119815_360_575 | 3300034155 | Untreated Peat Soil | MELNTYAIEKIAAGRLADLRAAGARAVLVESARIGPRGVGPAVGSALIRLGRWLAPDAVVAAPNAGVRVAR
Ga0364940_0054708_757_972 | 3300034164 | Sediment | MDFNTYAAEKMVEVRLADLRAAGARAALLASARVGPRGVGPAVGGALIRLGRWLAQGEVVAPPNAGVRVAR
Ga0364932_0265858_205_420 | 3300034177 | Sediment | MDFNTYAAEKMVEVRLADLRAAGAREALLASARVGPRGVGPAVGGALIRLGRWLAQGKVGAAANAGVRVAR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice