NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome Family F083152

Overview

Basic Information
Family ID F083152
Family Type Metagenome
Number of Sequences 113
Average Sequence Length 43 residues
Representative Sequence FGGREGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD
Number of Associated Samples 103
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.88 %
% of genes near scaffold ends (potentially truncated) 98.23 %
% of genes from short scaffolds (< 2000 bps) 85.84 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.24

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
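
As a rough cross-check, the quality percentages above can be converted back to whole gene counts. The sketch below assumes each percentage is taken over the 113 family sequences (one gene per sequence), which the page does not state explicitly. `pct_to_count` and the dictionary labels are illustrative names and not part of NMPFamsDB.

```python
FAMILY_SIZE = 113  # "Number of Sequences" reported under Basic Information

def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a rounded percentage back to the nearest whole gene count."""
    return round(pct * total / 100)

# Percentages copied from the Quality Assessment block.
quality_pct = {
    "valid RBS motif": 0.88,
    "near scaffold end": 98.23,
    "short scaffold (< 2000 bp)": 85.84,
}

counts = {label: pct_to_count(pct) for label, pct in quality_pct.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
# valid RBS motif: 1 of 113
# near scaffold end: 111 of 113
# short scaffold (< 2000 bp): 97 of 113
```

Under that assumption, only one gene carries a recognizable RBS motif and all but two sit near a scaffold end, which is consistent with the short average length of 43 residues.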

Most Common Taxonomy
Group Bacteria (66.372 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (18.584 % of family members)
Environment Ontology (ENVO) Unclassified (25.664 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (44.248 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 11.76%, Coil/Unstructured: 88.24%

Predicted 3D Structure

pTM-score: 0.24

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 113 Family Scaffolds
PF00890	FAD_binding_2	33.63
PF00990	GGDEF	8.85
PF13487	HD_5	5.31
PF13189	Cytidylate_kin2	2.65
PF01590	GAF	2.65
PF02518	HATPase_c	2.65
PF01266	DAO	1.77
PF13185	GAF_2	1.77
PF00226	DnaJ	0.88
PF13365	Trypsin_2	0.88
PF13492	GAF_3	0.88
PF00881	Nitroreductase	0.88
PF13633	Obsolete Pfam Family	0.88
PF02694	UPF0060	0.88
PF01467	CTP_transf_like	0.88
PF00589	Phage_integrase	0.88
PF00520	Ion_trans	0.88
PF02720	DUF222	0.88
PF16158	N_BRCA1_IG	0.88
PF00027	cNMP_binding	0.88
PF00578	AhpC-TSA	0.88
PF05598	DUF772	0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 113 Family Scaffolds
COG1742	Uncharacterized inner membrane protein YnfA, drug/metabolite transporter superfamily	General function prediction only [R]	0.88



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	66.37 %
Unclassified	root	N/A	33.63 %


Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
2189573000|GPBTN7E01EXWTU	Not Available	517
3300000891|JGI10214J12806_10586680	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1761
3300002121|C687J26615_10088993	All Organisms → cellular organisms → Bacteria	772
3300002123|C687J26634_10313590	Not Available	525
3300002916|JGI25389J43894_1030117	All Organisms → cellular organisms → Bacteria	940
3300004114|Ga0062593_103368238	Not Available	513
3300005174|Ga0066680_10875546	All Organisms → cellular organisms → Bacteria	534
3300005181|Ga0066678_10098157	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1768
3300005332|Ga0066388_100517338	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1827
3300005444|Ga0070694_101046476	Not Available	679
3300005467|Ga0070706_100729601	Not Available	918
3300005518|Ga0070699_101060194	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	743
3300005540|Ga0066697_10679040	Not Available	565
3300005552|Ga0066701_10047323	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2343
3300005552|Ga0066701_10873737	All Organisms → cellular organisms → Bacteria	533
3300005557|Ga0066704_10613837	Not Available	697
3300005568|Ga0066703_10214614	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1169
3300005598|Ga0066706_11522076	All Organisms → cellular organisms → Bacteria	503
3300005615|Ga0070702_100275690	Not Available	1152
3300005618|Ga0068864_100017872	All Organisms → cellular organisms → Bacteria	5915
3300005764|Ga0066903_108584269	Not Available	520
3300006173|Ga0070716_100462496	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	927
3300006796|Ga0066665_10067169	All Organisms → cellular organisms → Bacteria → Proteobacteria	2533
3300006845|Ga0075421_100418959	All Organisms → cellular organisms → Bacteria	1605
3300006854|Ga0075425_100354648	Not Available	1689
3300006854|Ga0075425_102566181	Not Available	563
3300007265|Ga0099794_10007020	All Organisms → cellular organisms → Bacteria	4597
3300009089|Ga0099828_11027475	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	734
3300009137|Ga0066709_100561226	All Organisms → cellular organisms → Bacteria	1619
3300009147|Ga0114129_10498100	All Organisms → cellular organisms → Bacteria	1592
3300009147|Ga0114129_10610982	All Organisms → cellular organisms → Bacteria	1412
3300009157|Ga0105092_10402266	All Organisms → cellular organisms → Bacteria	779
3300009444|Ga0114945_10653317	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → Mesorhizobium japonicum → Mesorhizobium japonicum MAFF 303099	639
3300009815|Ga0105070_1132672	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	506
3300010359|Ga0126376_11409641	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	722
3300010362|Ga0126377_13582188	All Organisms → cellular organisms → Bacteria	502
3300010366|Ga0126379_12606654	Not Available	603
3300010375|Ga0105239_11732719	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	723
3300010397|Ga0134124_10647867	Not Available	1040
3300010398|Ga0126383_11925947	Not Available	679
3300011269|Ga0137392_10003019	All Organisms → cellular organisms → Bacteria	10288
3300011419|Ga0137446_1104613	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	675
3300012198|Ga0137364_10151558	All Organisms → cellular organisms → Bacteria	1676
3300012204|Ga0137374_10662634	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	788
3300012209|Ga0137379_10962885	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	758
3300012211|Ga0137377_11695786	All Organisms → cellular organisms → Bacteria	554
3300012351|Ga0137386_10237948	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1309
3300012357|Ga0137384_10866722	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	728
3300012685|Ga0137397_10100026	All Organisms → cellular organisms → Bacteria	2125
3300012685|Ga0137397_11048759	Not Available	597
3300012907|Ga0157283_10189961	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	639
3300012923|Ga0137359_11277946	All Organisms → cellular organisms → Bacteria	622
3300012929|Ga0137404_10076183	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2648
3300012930|Ga0137407_11021056	Not Available	783
3300012948|Ga0126375_11042330	Not Available	669
3300012977|Ga0134087_10170267	All Organisms → cellular organisms → Bacteria	959
3300014157|Ga0134078_10221955	All Organisms → cellular organisms → Bacteria	780
3300014321|Ga0075353_1183521	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium RIFCSPHIGHO2_02_FULL_69_13	549
3300014883|Ga0180086_1173374	Not Available	561
3300015053|Ga0137405_1041128	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1081
3300015357|Ga0134072_10197109	All Organisms → cellular organisms → Bacteria	694
3300016319|Ga0182033_10401079	All Organisms → cellular organisms → Bacteria	1159
3300018031|Ga0184634_10125998	All Organisms → cellular organisms → Bacteria	1136
3300018061|Ga0184619_10332675	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	694
3300018433|Ga0066667_10145707	All Organisms → cellular organisms → Bacteria	1662
3300021081|Ga0210379_10248449	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	772
3300021170|Ga0210400_10107774	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2212
3300021178|Ga0210408_10903019	Not Available	687
3300025899|Ga0207642_10961696	Not Available	549
3300025910|Ga0207684_10538619	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	999
3300025910|Ga0207684_10993941	Not Available	702
3300025922|Ga0207646_10106427	All Organisms → cellular organisms → Bacteria	2516
3300025945|Ga0207679_11218400	Not Available	691
3300025961|Ga0207712_11352414	Not Available	637
3300026023|Ga0207677_12316477	Not Available	500
3300026329|Ga0209375_1092710	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1373
3300026334|Ga0209377_1214213	Not Available	637
3300026342|Ga0209057_1018915	All Organisms → cellular organisms → Bacteria	3937
3300026369|Ga0257152_1013397	All Organisms → cellular organisms → Bacteria	865
3300026507|Ga0257165_1002012	All Organisms → cellular organisms → Bacteria	2561
3300026528|Ga0209378_1207527	Not Available	640
3300026529|Ga0209806_1011254	All Organisms → cellular organisms → Bacteria	4809
3300026529|Ga0209806_1019280	All Organisms → cellular organisms → Bacteria	3522
3300026532|Ga0209160_1332939	All Organisms → cellular organisms → Bacteria	519
3300026540|Ga0209376_1050055	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2413
3300026548|Ga0209161_10218918	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1031
3300026548|Ga0209161_10257273	Not Available	900
3300026550|Ga0209474_10544392	All Organisms → cellular organisms → Bacteria	590
3300026759|Ga0207527_102018	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	691
3300027646|Ga0209466_1065498	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	733
3300027655|Ga0209388_1045675	All Organisms → cellular organisms → Bacteria	1264
3300027835|Ga0209515_10115140	All Organisms → cellular organisms → Bacteria	1761
3300027874|Ga0209465_10054680	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1915
3300028792|Ga0307504_10357202	Not Available	564
3300028828|Ga0307312_10996105	Not Available	555
(restricted) 3300031197|Ga0255310_10144781	Not Available	651
3300031198|Ga0307500_10200509	Not Available	595
3300031562|Ga0310886_10436296	Not Available	779
3300031719|Ga0306917_10125002	All Organisms → cellular organisms → Bacteria → Proteobacteria	1879
3300031723|Ga0318493_10440128	Not Available	716
3300031770|Ga0318521_10009985	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	4026
3300031781|Ga0318547_10816670	Not Available	581
3300031805|Ga0318497_10090830	All Organisms → cellular organisms → Bacteria	1629
3300031820|Ga0307473_10957365	Not Available	622
3300031820|Ga0307473_11089043	Not Available	588
3300031910|Ga0306923_10571448	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1272
3300032180|Ga0307471_102593584	Not Available	642
3300032180|Ga0307471_103882382	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	528
3300032205|Ga0307472_100094433	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2036
3300032205|Ga0307472_102568006	Not Available	519
3300033433|Ga0326726_11105240	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	771
3300034114|Ga0364938_134778	All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii	502
3300034165|Ga0364942_0040332	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1497

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
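
The scaffold identifiers listed above appear to join a numeric taxon OID and a scaffold name with a `|`, and restricted datasets carry a `(restricted)` prefix. The sketch below splits one identifier under that assumption so the taxon OID can be matched against the Associated Samples table. `ScaffoldId` and `parse_scaffold_id` are hypothetical names, not an NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple

RESTRICTED_PREFIX = "(restricted)"

class ScaffoldId(NamedTuple):
    taxon_oid: str    # numeric sample identifier, e.g. "3300000891"
    scaffold: str     # scaffold name within that sample
    restricted: bool  # True when the entry is flagged as restricted

def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Split '<taxon OID>|<scaffold name>' (assumed layout) into its parts."""
    text = raw.strip()
    restricted = text.startswith(RESTRICTED_PREFIX)
    if restricted:
        text = text[len(RESTRICTED_PREFIX):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, scaffold, restricted)

# Identifiers copied from the Associated Scaffolds list.
examples = [
    "3300005552|Ga0066701_10047323",
    "3300005552|Ga0066701_10873737",
    "(restricted) 3300031197|Ga0255310_10144781",
]
parsed = [parse_scaffold_id(s) for s in examples]

# Distinct samples contributing these scaffolds.
sample_oids = sorted({p.taxon_oid for p in parsed})
print(sample_oids)
```

Keeping the taxon OID as a string avoids losing leading zeros if the identifier scheme ever includes them; that is a defensive choice, not something the page specifies.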




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	18.58%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	14.16%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	7.08%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	7.08%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	5.31%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	4.42%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	4.42%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	3.54%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	3.54%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	2.65%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	2.65%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	2.65%
Soil	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil	1.77%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	1.77%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	1.77%
Soil	Environmental → Terrestrial → Soil → Loam → Unclassified → Soil	1.77%
Sediment	Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment	1.77%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere	1.77%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment	0.89%
Groundwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment	0.89%
Groundwater	Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater	0.89%
Thermal Springs	Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs	0.89%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.89%
Grass Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil	0.89%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	0.89%
Natural And Restored Wetlands	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands	0.89%
Groundwater Sand	Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand	0.89%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	0.89%
Peat Soil	Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil	0.89%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	0.89%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere	0.89%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere	0.89%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere	0.89%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
2189573000	Grass soil microbial communities from Rothamsted Park, UK - July 2010 direct MP BIO 1O1 lysis 0-21cm (T0 for microcosms)	Environmental
3300000891	Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soil	Environmental
3300002121	Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1	Environmental
3300002123	Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_3	Environmental
3300002916	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm	Environmental
3300004114	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5	Environmental
3300005174	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129	Environmental
3300005181	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005444	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005518	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG	Environmental
3300005540	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146	Environmental
3300005552	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150	Environmental
3300005557	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153	Environmental
3300005568	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152	Environmental
3300005598	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155	Environmental
3300005615	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG	Environmental
3300005618	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2	Host-Associated
3300005764	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)	Environmental
3300006173	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG	Environmental
3300006796	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114	Environmental
3300006845	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5	Host-Associated
3300006854	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4	Host-Associated
3300007265	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1	Environmental
3300009089	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG	Environmental
3300009137	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158	Environmental
3300009147	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)	Host-Associated
3300009157	Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015	Environmental
3300009444	Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3	Environmental
3300009815	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10	Environmental
3300010359	Tropical forest soil microbial communities from Panama - MetaG Plot_15	Environmental
3300010362	Tropical forest soil microbial communities from Panama - MetaG Plot_22	Environmental
3300010366	Tropical forest soil microbial communities from Panama - MetaG Plot_24	Environmental
3300010375	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG	Host-Associated
3300010397	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4	Environmental
3300010398	Tropical forest soil microbial communities from Panama - MetaG Plot_35	Environmental
3300011269	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG	Environmental
3300011419	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT357_2	Environmental
3300012198	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG	Environmental
3300012204	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG	Environmental
3300012209	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG	Environmental
3300012211	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG	Environmental
3300012351	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG	Environmental
3300012357	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG	Environmental
3300012685	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG	Environmental
3300012907	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S044-104R-1	Environmental
3300012923	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300012930	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)	Environmental
3300012948	Tropical forest soil microbial communities from Panama - MetaG Plot_14	Environmental
3300012977	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG	Environmental
3300014157	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG	Environmental
3300014321	Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D1	Environmental
3300014883	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT760_16_10D	Environmental
3300015053	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)	Environmental
3300015357	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG	Environmental
3300016319	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H	Environmental
3300018031	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1	Environmental
3300018061	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1	Environmental
3300018433	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116	Environmental
3300021081	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redo	Environmental
3300021170	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M	Environmental
3300021178	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M	Environmental
3300025899	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)	Host-Associated
3300025910	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)	Environmental
3300025922	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)	Environmental
3300025945	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes)	Host-Associated
3300025961	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)	Host-Associated
3300026023	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)	Host-Associated
3300026329	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)	Environmental
3300026334	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)	Environmental
3300026342	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)	Environmental
3300026369	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-A	Environmental
3300026507	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-B	Environmental
3300026528	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)	Environmental
3300026529	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)	Environmental
3300026532	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)	Environmental
3300026540	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)	Environmental
3300026548	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)	Environmental
3300026550	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)	Environmental
3300026759	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A3w-12 (SPAdes)	Environmental
3300027646	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBio (SPAdes)	Environmental
3300027655	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)	Environmental
3300027835Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031198Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 14_SEnvironmentalOpen in IMG/M
3300031562Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3EnvironmentalOpen in IMG/M
3300031719Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2)EnvironmentalOpen in IMG/M
3300031723Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f23EnvironmentalOpen in IMG/M
3300031770Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17EnvironmentalOpen in IMG/M
3300031781Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20EnvironmentalOpen in IMG/M
3300031805Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f23EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300034114Sediment microbial communities from East River floodplain, Colorado, United States - 9_s17EnvironmentalOpen in IMG/M
3300034165Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17EnvironmentalOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
N55_10571540 2189573000 Grass Soil AAAAGVAEPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKP
JGI10214J12806_105866801 3300000891 Soil HCRVFSEGELYQRFGHEKNFHLQKIPGAWKHGELPACFEPIEFGSFFVSYTK*
C687J26615_100889931 3300002121 Soil RRPGFRLVKLPGEWKHGELPDCFEAIEFGAFFVAYAKEG*
C687J26634_103135903 3300002123 Soil QTDFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYSKG*
JGI25389J43894_10301172 3300002916 Grasslands Soil ELRARFGGRKGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD*
Ga0062593_1033682381 3300004114 Soil HCRVFSEGELYQRFGHEQNFQVQKIPGEWGHGKLPDCFEPIEFGSFFVSYSKA*
Ga0066680_108755462 3300005174 Soil EAELRARFGGRKGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD*
Ga0066678_100981571 3300005181 Soil GNEPGFHLAKIPGAWKHGEIPACFEPVEFGSFFVSFSKP*
Ga0066388_1005173381 3300005332 Tropical Forest Soil QTFGPRKDFRLVKVPGAWKHGELPACFEPIEFGSFFVSYAKDGG*
Ga0070694_1010464762 3300005444 Corn, Switchgrass And Miscanthus Rhizosphere KELRQTFGPRKDFRLVKVPGAWKHGELPSCFEPIEFGSFFVSYAKDGG*
Ga0070706_1007296011 3300005467 Corn, Switchgrass And Miscanthus Rhizosphere ELYQRFGHQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYSK*
Ga0070699_1010601943 3300005518 Corn, Switchgrass And Miscanthus Rhizosphere RFGHQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYSK*
Ga0066697_106790401 3300005540 Soil LVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG*
Ga0066701_100473231 3300005552 Soil FGNEPGFHLAKIPGAWKHGEIPACFEPVEFGSFFVSFSKP*
Ga0066701_108737371 3300005552 Soil RLVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG*
Ga0066704_106138371 3300005557 Soil LRQTFGHRKDFRLVKVPGAWKHGELPDCFEPIEFGSFFVSYAKDGG*
Ga0066703_102146142 3300005568 Soil GNEPGFHLAKIPGAWKHGEIPACFEPVEFGSFFVSFAKP*
Ga0066706_115220762 3300005598 Soil EAELRARFGGREGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD*
Ga0070702_1002756902 3300005615 Corn, Switchgrass And Miscanthus Rhizosphere GELYQRFGHRPNFRLQKIPGAWKHGELPACFEPIEFGSFFVAFAK*
Ga0068864_1000178721 3300005618 Switchgrass Rhizosphere ELYQRFGHLPNFRLQKIPGAWKHGELPACFEPIEFGSFFVAFAK*
Ga0066903_1085842691 3300005764 Tropical Forest Soil RWGNAPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0070716_1004624962 3300006173 Corn, Switchgrass And Miscanthus Rhizosphere GGEPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKP*
Ga0066665_100671694 3300006796 Soil REGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD*
Ga0075421_1004189591 3300006845 Populus Rhizosphere GELYQRFGHLPNFRLQKIPGAWKHGELPACFEPIAFGSFFVAFAK*
Ga0075425_1003546481 3300006854 Populus Rhizosphere PNFRLQKIPGAWKHGELPACFEPIEFGSFFVAFAK*
Ga0075425_1025661811 3300006854 Populus Rhizosphere RRWGGEPGFRLAKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0099794_100070201 3300007265 Vadose Zone Soil SEGELYQRFGHQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYAK*
Ga0099828_110274752 3300009089 Vadose Zone Soil RWGGEPGFRLAKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0066709_1005612261 3300009137 Grasslands Soil RKDFRLVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG*
Ga0114129_104981001 3300009147 Populus Rhizosphere FTEKELRQTFGHRKDFRLVKVPGAWKHGELPDCFEPIEFGSFFVSYTKDGS*
Ga0114129_106109821 3300009147 Populus Rhizosphere FTEKELRQTFGHRKDFRLVKVPGAWKHGELPDCFEPIEFGSFFVSYTKDGG*
Ga0105092_104022661 3300009157 Freshwater Sediment SEGELYQRVGHELNFHVQKIPGTWGHGKLPDCFEPIEFGSFFVSYSKP*
Ga0114945_106533172 3300009444 Thermal Springs ELRERFGRYAGFRVVKIPGAWKHGELPDCFEPIEFGSFFVAFGRP*
Ga0105070_11326722 3300009815 Groundwater Sand VFTEGELHRRFGQRKNFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYAKS*
Ga0126376_114096411 3300010359 Tropical Forest Soil ADLHKRWGNAPGFRVVNIPGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0126377_135821881 3300010362 Tropical Forest Soil RRWGGQIGFRLVKIPGVIKADEHGEFPSCLEPIEFGSFFVALTK*
Ga0126379_126066541 3300010366 Tropical Forest Soil RVFSEADLHKRWGHAPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0105239_117327192 3300010375 Corn Rhizosphere WGGEPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKP*
Ga0134124_106478674 3300010397 Terrestrial Soil GHEQNFQVQKIPGEWGHGKLPDCFEPIEFGSFFVSYSKA*
Ga0126383_119259471 3300010398 Tropical Forest Soil PEFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFTKA*
Ga0137392_100030191 3300011269 Vadose Zone Soil RFGNEPGFHLSKIPGAWKHGEIPACFEPVEFGSFFVSFSKP*
Ga0137446_11046131 3300011419 Soil DLRRRWSGEPGFRVVKIPGAWKHGEIPDCFEPDEFGSFFVSFAKA*
Ga0137364_101515583 3300012198 Vadose Zone Soil ARFGERPGFRLAKIPGAWKHGELPDCFEPIEFGSFFVSYATER*
Ga0137374_106626342 3300012204 Vadose Zone Soil VFSEADLRRRWGGEPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFARP*
Ga0137379_109628851 3300012209 Vadose Zone Soil QRFGHQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYAK*
Ga0137377_116957861 3300012211 Vadose Zone Soil TFGPRKDFRLVKVPGAWKHGELPSCFEPIEFGSFFVSYAKDGG*
Ga0137386_102379481 3300012351 Vadose Zone Soil DLRRRWGGEPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFARP*
Ga0137384_108667221 3300012357 Vadose Zone Soil GFRVAKISGAWKHGEIPDCFEPVEFGSFFVSFAKP*
Ga0137397_101000263 3300012685 Vadose Zone Soil ELYQRFGHERNFQVQKIPGAWGHGKLPDCFEPIEFGSFFVSYSK*
Ga0137397_110487591 3300012685 Vadose Zone Soil PGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0157283_101899611 3300012907 Soil EKNFHLQKIPGAWKHGELPACFEPIEFGSFFVSYTK*
Ga0137359_112779462 3300012923 Vadose Zone Soil ELYQRFGHEKNFQVQKIPGAWGHGKLPDCFEPIEFGSFFVSYSK*
Ga0137404_100761831 3300012929 Vadose Zone Soil ELRERFGGRPDFRLVKIPGAWKHGELPDCFEPVEFGSFFVSYSKA*
Ga0137407_110210562 3300012930 Vadose Zone Soil ELYQRFGHEENFHVQKIPGAWGHGKLPDCFEPIEFGSFFVSYSKA*
Ga0126375_110423302 3300012948 Tropical Forest Soil GFRVVKIPGAWKHGEIPDCFEAVEFGSFFVSFTKA*
Ga0134087_101702671 3300012977 Grasslands Soil ELRQTFGHRKDFRLVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG*
Ga0134078_102219551 3300014157 Grasslands Soil FGGREGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD*
Ga0075353_11835213 3300014321 Natural And Restored Wetlands PGFRIVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0180086_11733741 3300014883 Soil PGFLVVKISGAWKHGEIPDCFEPVEFGSFFVSFAKA*
Ga0137405_10411282 3300015053 Vadose Zone Soil LRKRWGGEPGFRVVKIPGAWKHGEIPECFEPVEFGSFFVSFARP*
Ga0134072_101971091 3300015357 Grasslands Soil TEAELRARFGGREGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD*
Ga0182033_104010791 3300016319 Soil TFGPRKDFRLVKVPGAWKHGELPACFEPIEFGSFFVSYAKDGG
Ga0184634_101259981 3300018031 Groundwater Sediment SELRQRFGHRKDFRLVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG
Ga0184619_103326752 3300018061 Groundwater Sediment DLRRRWGGEPEFRLVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA
Ga0066667_101457072 3300018433 Grasslands Soil GFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATSG
Ga0210379_102484493 3300021081 Groundwater Sediment EPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFDKA
Ga0210400_101077741 3300021170 Soil PNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYSK
Ga0210408_109030192 3300021178 Soil QPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYSK
Ga0207642_109616962 3300025899 Miscanthus Rhizosphere EKELRQTFGPRKDFRLVKVPGAWKHGELPSCFEPIEFGSFFVSYAKDGG
Ga0207684_105386192 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere CRVFSEGELYQRFGHQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVAYSK
Ga0207684_109939412 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere YQRFGHEKNFRMEKIPGAWKHGELPDCFEPIEFGSFFVSYSK
Ga0207646_101064273 3300025922 Corn, Switchgrass And Miscanthus Rhizosphere GNEPGFHLTKIPGAWKHGEIPACFEPVEFGSFFVSLKKP
Ga0207679_112184002 3300025945 Corn Rhizosphere QTFGPRKDFRLVKVPGAWKHGELPACFEPIEFGSFFVSYAKDGG
Ga0207712_113524141 3300025961 Switchgrass Rhizosphere ELRQTFGPRKDFRLVKVPGAWKHGELPSCFEPIEFGSFFVSYAKDGG
Ga0207677_123164771 3300026023 Miscanthus Rhizosphere RWGGEPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKP
Ga0209375_10927101 3300026329 Soil FSETELRRRFGNEPGFHLAKIPGAWKHGEIPACFEPVEFGSFFVSFAKP
Ga0209377_12142132 3300026334 Soil TGKELWQTFGHRKDFRLVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG
Ga0209057_10189155 3300026342 Soil EAELRARFGGRAGFRLVKIPGAWKHGELPDCFEPIEFGSFFVAYATSG
Ga0257152_10133971 3300026369 Soil GELYQRFGHQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYSK
Ga0257165_10020123 3300026507 Soil ETELRRRFASEPGFHLAKIPGAWKHGEIPACFEPVEFGSFFVSFSKP
Ga0209378_12075271 3300026528 Soil KELRQTFGHRKDFRLVKVPGAWKHGELPDCFEPIEFGSFFVSYAKDGG
Ga0209806_10112541 3300026529 Soil HRRDFRLVKVPGAWKHGELPDCFEPIEFGSFFVSYAKDGG
Ga0209806_10192803 3300026529 Soil ETELRRRFGNEPGFHLAKIPGAWKHGEIPACFEPVEFGSFFVSFAKP
Ga0209160_13329391 3300026532 Soil FGGREGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATD
Ga0209376_10500552 3300026540 Soil VFTEKELRQTFGHRKDFRLVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG
Ga0209161_102189182 3300026548 Soil FSETELRRRFGNEPGFHLAKIPGAWKHGEIPACFEPVEFGSFFVSFSKP
Ga0209161_102572732 3300026548 Soil DFRLVKVPGAWKHGELPNCFEPIEFGSFFVSYAKDGG
Ga0209474_105443922 3300026550 Soil GGRAGFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYATSG
Ga0207527_1020182 3300026759 Soil HEKNFHLQKIPGAWKHGELPACFEPIEFGSFFVSYTK
Ga0209466_10654981 3300027646 Tropical Forest Soil EADLHTRWGRALGFRVVKIPGAWKHGEIPDCFEAVEFGSFFVSFTKA
Ga0209388_10456755 3300027655 Vadose Zone Soil VFSEGELYQRFGHQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYSK
Ga0209515_101151404 3300027835 Groundwater GFRLVKIPGEWKHGELPDCFEPIEFGSFFVAYGKP
Ga0209465_100546801 3300027874 Tropical Forest Soil DLHKRWGHAPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA
Ga0307504_103572021 3300028792 Soil HRKDFRLVKVPGAWKHGELPDCFEPIEFGSFFVSYAKDGG
Ga0307312_109961051 3300028828 Soil WGGEPGFRLVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA
(restricted) Ga0255310_101447813 3300031197 Sandy Soil VFSETELERRFGGEKGFSLVKIPGAWKHGELPDCFEPIEFGSFFVSYSKS
Ga0307500_102005091 3300031198 Soil TEKELRQTFGPRKDFRLVKVPGAWKHGELPSCFEPIEFGSFFVSYAKDGG
Ga0310886_104362962 3300031562 Soil GHRKDFRLVKVPGAWKHGELPDCFEPIEFGSFFVSYTKDGS
Ga0306917_101250023 3300031719 Soil PRKDFRLVKVPGASKHGELPACFEPIEFGSFFVSYAKDGG
Ga0318493_104401281 3300031723 Soil GHRKDFRLVKVPGAWKHGELPACFEPIEFGSFFVSYAKDGG
Ga0318521_100099855 3300031770 Soil RVFSEAELHTRWGSAPEFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFTKA
Ga0318547_108166702 3300031781 Soil VFTEKELRAAFGHRKDFRLVKVPGAWKHGELPACFEPIEFGSFFVSYAKDGG
Ga0318497_100908301 3300031805 Soil EKELRGAFGHRKDFRLVKVPGAWKHGELPACFEPIEFGSFFVSYAKDGG
Ga0307473_109573651 3300031820 Hardwood Forest Soil FRLVKVPGAWKHGELPACFEPIEFGSFFVSYAKDGG
Ga0307473_110890431 3300031820 Hardwood Forest Soil GFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKP
Ga0306923_105714482 3300031910 Soil HTRWGSAPEFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFTKA
Ga0307471_1025935841 3300032180 Hardwood Forest Soil TEAELHERFGRRPAFRLAKLPGEWKHGKLPDCFEPVEFGAFFVSYAKEG
Ga0307471_1038823821 3300032180 Hardwood Forest Soil RVFSQDELYQRFGHEKNFRLEKIPGAWKHGELPDCFEPIEFGSFFVSYSK
Ga0307472_1000944331 3300032205 Hardwood Forest Soil GEPGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKP
Ga0307472_1025680061 3300032205 Hardwood Forest Soil HQPNFRLQKIPGAWKHGELPACFEPIEFGSFFVSYSK
Ga0326726_111052401 3300033433 Peat Soil PGFRVVKIPGAWKHGEIPDCFEPVEFGSFFVSFAKA
Ga0364938_134778_373_501 3300034114 Sediment ARFGGEPDFRLVKIPGRFKPGEFPAALEPVEFGSFFVAFAKR
Ga0364942_0040332_3_155 3300034165 Sediment VFSAAELDRRFGGQTDFRLVKIPGAWKHGELPDCFEPIEFGSFFVSYSKG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice