NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F069398




Overview

Basic Information
Family ID F069398
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 46 residues
Representative Sequence MTTRIPLLELGSLQSRLPELIAKKGEPPWSEALVLTDDIQAFLIC
Number of Associated Samples 120
Number of Associated Scaffolds 124
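
As a quick sanity check, the representative sequence can be compared against the reported average length of 46 residues. The sketch below counts its residues (45) and confirms that it uses only the 20 standard one-letter amino acid codes. The helper name `summarize_sequence` is hypothetical and is not part of NMPFamsDB.

```python
# Standard 20-letter amino acid alphabet (one-letter codes).
STANDARD_AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


def summarize_sequence(sequence: str) -> dict:
    """Return the sequence length and any letters outside the standard alphabet.

    Hypothetical helper for illustration; not an NMPFamsDB function.
    """
    cleaned = sequence.strip().upper()
    unexpected = sorted(set(cleaned) - STANDARD_AMINO_ACIDS)
    return {"length": len(cleaned), "unexpected": unexpected}


# Representative sequence copied from the Basic Information block above.
representative = "MTTRIPLLELGSLQSRLPELIAKKGEPPWSEALVLTDDIQAFLIC"
summary = summarize_sequence(representative)
print(summary["length"], summary["unexpected"])
```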

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 95.97 %
% of genes near scaffold ends (potentially truncated) 94.35 %
% of genes from short scaffolds (< 2000 bps) 91.94 %
Associated GOLD sequencing projects 119
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.34


Most Common Taxonomy
Group Bacteria (95.161 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (13.710 % of family members)
Environment Ontology (ENVO) Unclassified (30.645 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (36.290 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 24.66%, β-sheet: 19.18%, Coil/Unstructured: 56.16%

Predicted 3D Structure

pTM-score: 0.34

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF02653 | BPD_transp_2 | 41.13
PF00496 | SBP_bac_5 | 8.87
PF13632 | Glyco_trans_2_3 | 8.87
PF01717 | Meth_synt_2 | 5.65
PF01042 | Ribonuc_L-PSP | 3.23
PF07690 | MFS_1 | 3.23
PF08240 | ADH_N | 2.42
PF13641 | Glyco_tranf_2_3 | 1.61
PF03952 | Enolase_N | 1.61
PF01039 | Carboxyl_trans | 0.81
PF02558 | ApbA | 0.81
PF04321 | RmlD_sub_bind | 0.81
PF00079 | Serpin | 0.81
PF03069 | FmdA_AmdA | 0.81
PF00005 | ABC_tran | 0.81
PF13594 | Obsolete Pfam Family | 0.81
PF14907 | NTP_transf_5 | 0.81
PF08028 | Acyl-CoA_dh_2 | 0.81
PF06325 | PrmA | 0.81
PF07152 | YaeQ | 0.81
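
The frequencies above are percentages of the 124 family scaffolds, so they can be converted back to approximate scaffold counts. The sketch below assumes each value is simply count / 124 rounded to two decimals. That is an assumption about how the page computes them, not something the page states. The function name `scaffolds_from_percent` is hypothetical.

```python
TOTAL_SCAFFOLDS = 124  # "Number of Associated Scaffolds" for this family


def scaffolds_from_percent(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a percentage frequency corresponds to.

    Assumes percent == 100 * count / total, rounded for display.
    """
    return round(percent / 100 * total)


# A few rows copied from the Pfam table above.
pfam_percent = {
    "PF02653": 41.13,
    "PF00496": 8.87,
    "PF01717": 5.65,
    "PF07152": 0.81,
}
pfam_counts = {pfam: scaffolds_from_percent(p) for pfam, p in pfam_percent.items()}
print(pfam_counts)
```

Under that assumption, the most frequent neighbor (PF02653, BPD_transp_2) would appear on about 51 scaffolds, and each 0.81 % entry would correspond to a single scaffold.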

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG0620 | Methionine synthase II (cobalamin-independent) | Amino acid transport and metabolism [E] | 5.65
COG0251 | Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 family | Defense mechanisms [V] | 3.23
COG0451 | Nucleoside-diphosphate-sugar epimerase | Cell wall/membrane/envelope biogenesis [M] | 1.61
COG0702 | Uncharacterized conserved protein YbjT, contains NAD(P)-binding and DUF2867 domains | General function prediction only [R] | 1.61
COG0148 | Enolase | Carbohydrate transport and metabolism [G] | 1.61
COG1086 | NDP-sugar epimerase, includes UDP-GlcNAc-inverting 4,6-dehydratase FlaA1 and capsular polysaccharide biosynthesis protein EpsC | Cell wall/membrane/envelope biogenesis [M] | 1.61
COG4826 | Serine protease inhibitor | Posttranslational modification, protein turnover, chaperones [O] | 0.81
COG4799 | Acetyl-CoA carboxylase, carboxyltransferase component | Lipid transport and metabolism [I] | 0.81
COG4681 | Uncharacterized conserved protein YaeQ, suppresses RfaH defect | Function unknown [S] | 0.81
COG3897 | Protein N-terminal and lysine N-methylase, NNT1/EFM7 family | Posttranslational modification, protein turnover, chaperones [O] | 0.81
COG2890 | Methylase of polypeptide chain release factors | Translation, ribosomal structure and biogenesis [J] | 0.81
COG2421 | Acetamidase/formamidase | Energy production and conversion [C] | 0.81
COG2264 | Ribosomal protein L11 methylase PrmA | Translation, ribosomal structure and biogenesis [J] | 0.81
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.81
COG1091 | dTDP-4-dehydrorhamnose reductase | Cell wall/membrane/envelope biogenesis [M] | 0.81
COG1090 | NAD dependent epimerase/dehydratase family enzyme | General function prediction only [R] | 0.81
COG1089 | GDP-D-mannose dehydratase | Cell wall/membrane/envelope biogenesis [M] | 0.81
COG1088 | dTDP-D-glucose 4,6-dehydratase | Cell wall/membrane/envelope biogenesis [M] | 0.81
COG1087 | UDP-glucose 4-epimerase | Cell wall/membrane/envelope biogenesis [M] | 0.81
COG0825 | Acetyl-CoA carboxylase alpha subunit | Lipid transport and metabolism [I] | 0.81
COG0777 | Acetyl-CoA carboxylase beta subunit | Lipid transport and metabolism [I] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 95.16 %
Unclassified | root | N/A | 4.84 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000443|F12B_10352288 | All Organisms → cellular organisms → Bacteria | 550
3300003994|Ga0055435_10075459 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 857
3300004009|Ga0055437_10037375 | All Organisms → cellular organisms → Bacteria | 1244
3300004024|Ga0055436_10036172 | All Organisms → cellular organisms → Bacteria | 1286
3300004463|Ga0063356_102731210 | Not Available | 760
3300005186|Ga0066676_10442373 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 880
3300005204|Ga0068997_10003765 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1841
3300005206|Ga0068995_10094666 | All Organisms → cellular organisms → Bacteria | 601
3300005332|Ga0066388_102433229 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 950
3300005336|Ga0070680_100333201 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1289
3300005343|Ga0070687_100624894 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 743
3300005353|Ga0070669_101828800 | All Organisms → cellular organisms → Bacteria | 530
3300005363|Ga0008090_14593958 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 584
3300005367|Ga0070667_101734300 | All Organisms → cellular organisms → Bacteria | 587
3300005406|Ga0070703_10353762 | All Organisms → cellular organisms → Bacteria | 628
3300005457|Ga0070662_101930924 | All Organisms → cellular organisms → Bacteria | 510
3300005536|Ga0070697_100155071 | All Organisms → cellular organisms → Bacteria | 1932
3300005544|Ga0070686_100831889 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 746
3300005545|Ga0070695_100489150 | All Organisms → cellular organisms → Bacteria | 950
3300005575|Ga0066702_10736766 | All Organisms → cellular organisms → Bacteria | 587
3300005616|Ga0068852_102354247 | All Organisms → cellular organisms → Bacteria | 553
3300005713|Ga0066905_102072404 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 529
3300005764|Ga0066903_102960104 | Not Available | 920
3300006046|Ga0066652_101394039 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 656
3300006163|Ga0070715_10131838 | All Organisms → cellular organisms → Bacteria | 1204
3300006800|Ga0066660_10133555 | All Organisms → cellular organisms → Bacteria | 1814
3300006845|Ga0075421_100073685 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4308
3300006847|Ga0075431_101919158 | All Organisms → cellular organisms → Bacteria | 548
3300007004|Ga0079218_12925478 | All Organisms → cellular organisms → Bacteria | 574
3300009078|Ga0105106_10577239 | All Organisms → cellular organisms → Bacteria | 806
3300009101|Ga0105247_10879968 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 690
3300009143|Ga0099792_10806691 | All Organisms → cellular organisms → Bacteria | 615
3300009148|Ga0105243_11452788 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 708
3300009148|Ga0105243_12572529 | All Organisms → cellular organisms → Bacteria | 548
3300009171|Ga0105101_10217483 | All Organisms → cellular organisms → Bacteria | 924
3300009553|Ga0105249_10665493 | All Organisms → cellular organisms → Bacteria | 1099
3300009610|Ga0105340_1578463 | All Organisms → cellular organisms → Bacteria | 501
3300009792|Ga0126374_10845077 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 704
3300010046|Ga0126384_12327192 | Not Available | 517
3300010362|Ga0126377_12303545 | All Organisms → cellular organisms → Bacteria | 615
3300010366|Ga0126379_10432068 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1373
3300010376|Ga0126381_100530387 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1665
3300010398|Ga0126383_11183582 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 854
3300010398|Ga0126383_11394848 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 790
3300010399|Ga0134127_10123336 | All Organisms → cellular organisms → Bacteria | 2309
3300011425|Ga0137441_1132111 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 617
3300011429|Ga0137455_1149793 | All Organisms → cellular organisms → Bacteria | 695
3300012096|Ga0137389_10400401 | All Organisms → cellular organisms → Bacteria | 1172
3300012199|Ga0137383_10558185 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 838
3300012202|Ga0137363_11065129 | All Organisms → cellular organisms → Bacteria | 687
3300012353|Ga0137367_10320879 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1106
3300012357|Ga0137384_10929896 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 700
3300012357|Ga0137384_11008365 | All Organisms → cellular organisms → Bacteria | 669
3300012362|Ga0137361_11525316 | All Organisms → cellular organisms → Bacteria | 590
3300012532|Ga0137373_10015373 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 7794
3300012582|Ga0137358_10885089 | All Organisms → cellular organisms → Bacteria | 587
3300012895|Ga0157309_10281656 | All Organisms → cellular organisms → Bacteria | 554
3300012923|Ga0137359_11172532 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 656
3300012927|Ga0137416_11956245 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 537
3300012948|Ga0126375_10236614 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1227
3300013297|Ga0157378_10813675 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 961
3300014326|Ga0157380_11561359 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 715
3300014885|Ga0180063_1267811 | All Organisms → cellular organisms → Bacteria | 544
3300015052|Ga0137411_1120705 | All Organisms → cellular organisms → Bacteria | 1228
3300015077|Ga0173483_10954011 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 510
3300015262|Ga0182007_10036666 | All Organisms → cellular organisms → Bacteria | 1649
3300015373|Ga0132257_100325751 | All Organisms → cellular organisms → Bacteria | 1851
3300017657|Ga0134074_1279473 | All Organisms → cellular organisms → Bacteria | 605
3300018000|Ga0184604_10320614 | All Organisms → cellular organisms → Bacteria | 549
3300018075|Ga0184632_10221995 | All Organisms → cellular organisms → Bacteria | 830
3300018422|Ga0190265_10737986 | All Organisms → cellular organisms → Bacteria | 1107
3300018429|Ga0190272_12504475 | All Organisms → cellular organisms → Bacteria | 562
3300018469|Ga0190270_13356114 | All Organisms → cellular organisms → Bacteria | 508
3300020579|Ga0210407_10186664 | All Organisms → cellular organisms → Bacteria | 1608
3300020580|Ga0210403_10487041 | All Organisms → cellular organisms → Bacteria | 1003
3300020581|Ga0210399_10332113 | All Organisms → cellular organisms → Bacteria | 1269
3300021073|Ga0210378_10115078 | All Organisms → cellular organisms → Bacteria | 1044
3300021170|Ga0210400_11574910 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 519
3300021560|Ga0126371_12970299 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 574
3300022726|Ga0242654_10319992 | Not Available | 576
3300025899|Ga0207642_10338412 | All Organisms → cellular organisms → Bacteria | 884
3300025910|Ga0207684_10359503 | All Organisms → cellular organisms → Bacteria | 1253
3300025915|Ga0207693_11402630 | All Organisms → cellular organisms → Bacteria | 519
3300025931|Ga0207644_10435053 | All Organisms → cellular organisms → Bacteria | 1076
3300025933|Ga0207706_11707665 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 507
3300026361|Ga0257176_1021074 | All Organisms → cellular organisms → Bacteria | 943
3300026369|Ga0257152_1025001 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 640
3300026371|Ga0257179_1044997 | All Organisms → cellular organisms → Bacteria | 566
3300026377|Ga0257171_1070159 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 614
3300026482|Ga0257172_1033018 | All Organisms → cellular organisms → Bacteria | 936
3300026530|Ga0209807_1140539 | All Organisms → cellular organisms → Bacteria | 941
3300026551|Ga0209648_10606361 | All Organisms → cellular organisms → Bacteria | 602
3300027181|Ga0208997_1041563 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 678
3300027577|Ga0209874_1151480 | All Organisms → cellular organisms → Bacteria | 522
3300027617|Ga0210002_1069742 | All Organisms → cellular organisms → Bacteria | 620
3300027722|Ga0209819_10171376 | Not Available | 758
3300027765|Ga0209073_10008786 | All Organisms → cellular organisms → Bacteria | 2656
3300027787|Ga0209074_10205745 | All Organisms → cellular organisms → Bacteria | 742
3300027954|Ga0209859_1030327 | All Organisms → cellular organisms → Bacteria | 913
3300028381|Ga0268264_11828006 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 617
3300028673|Ga0257175_1037509 | All Organisms → cellular organisms → Bacteria | 861
3300028803|Ga0307281_10275040 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 623
3300028811|Ga0307292_10234491 | All Organisms → cellular organisms → Bacteria | 760
3300028812|Ga0247825_10587908 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 797
3300028885|Ga0307304_10315968 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 693
3300030336|Ga0247826_11097771 | All Organisms → cellular organisms → Bacteria | 635
(restricted) 3300031150|Ga0255311_1022000 | All Organisms → cellular organisms → Bacteria | 1314
(restricted) 3300031248|Ga0255312_1010021 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2256
3300031455|Ga0307505_10466357 | All Organisms → cellular organisms → Bacteria | 606
3300031564|Ga0318573_10032088 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2465
3300031720|Ga0307469_10770903 | All Organisms → cellular organisms → Bacteria | 879
3300031740|Ga0307468_102179707 | All Organisms → cellular organisms → Bacteria | 535
3300031751|Ga0318494_10015618 | All Organisms → cellular organisms → Bacteria | 3681
3300031819|Ga0318568_10012941 | All Organisms → cellular organisms → Bacteria | 4369
3300031847|Ga0310907_10535540 | All Organisms → cellular organisms → Bacteria | 631
3300031893|Ga0318536_10179472 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1077
3300031897|Ga0318520_10169589 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1275
3300032063|Ga0318504_10199371 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 934
3300032261|Ga0306920_103252085 | Not Available | 607
3300033004|Ga0335084_10053891 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4152
3300033004|Ga0335084_10174827 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2232
3300033551|Ga0247830_10422189 | All Organisms → cellular organisms → Bacteria | 1040
3300033813|Ga0364928_0056746 | All Organisms → cellular organisms → Bacteria | 870
3300034818|Ga0373950_0025919 | All Organisms → cellular organisms → Bacteria | 1061

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
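
Each scaffold identifier in the table above appears to join a numeric prefix and a scaffold name with a `|` character. The prefix matches the Taxon OID column of the Associated Samples table. The sketch below splits an identifier on that basis and records whether the entry carries the `(restricted)` marker. This reading of the identifier is inferred from the listing, and the names `ScaffoldRef` and `parse_scaffold_id` are hypothetical, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str       # numeric prefix, e.g. "3300000443"
    scaffold_name: str   # part after "|", e.g. "F12B_10352288"
    restricted: bool     # True if the entry was marked "(restricted)"


def parse_scaffold_id(text: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold_name>' as listed in Associated Scaffolds.

    Assumes the layout inferred from the table; raises ValueError otherwise.
    """
    marker = "(restricted)"
    cleaned = text.strip()
    restricted = cleaned.startswith(marker)
    if restricted:
        cleaned = cleaned[len(marker):].strip()
    taxon_oid, separator, scaffold_name = cleaned.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


print(parse_scaffold_id("3300000443|F12B_10352288"))
print(parse_scaffold_id("(restricted) 3300031150|Ga0255311_1022000"))
```

Keeping the restricted flag alongside the parsed fields makes it easy to exclude those entries from any downstream use that would require a license.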




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 13.71%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 7.26%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 7.26%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.84%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 4.03%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.03%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 3.23%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.23%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.42%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.42%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.42%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 2.42%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.61%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.61%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.61%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.61%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.61%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.61%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.61%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.61%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.61%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.61%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.81%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.81%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.81%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.81%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.81%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.81%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.81%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.81%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.81%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.81%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.81%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.81%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.81%
Tropical Rainforest Soil | Environmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Tropical Rainforest Soil | 0.81%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000443 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.2B clc assemly | Environmental
3300003994 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 | Environmental
3300004009 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2 | Environmental
3300004024 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005204 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D2 | Environmental
3300005206 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005343 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG | Environmental
3300005353 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG | Host-Associated
3300005363 | Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome F II A100 (Metagenome Metatranscriptome) | Environmental
3300005367 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG | Host-Associated
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005457 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG | Host-Associated
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005544 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009101 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG | Host-Associated
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009171 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015 | Environmental
3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated
3300009610 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300011425 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT244_2 | Environmental
3300011429 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012895Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S208-509C-2EnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015077Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2)EnvironmentalOpen in IMG/M
3300015262Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-113_1 MetaGHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026369Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-AEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027181Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027577Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027617Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027722Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027954Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300030336Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031564Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f21EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031751Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f24EnvironmentalOpen in IMG/M
3300031819Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21EnvironmentalOpen in IMG/M
3300031847Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D4EnvironmentalOpen in IMG/M
3300031893Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f28EnvironmentalOpen in IMG/M
3300031897Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f16EnvironmentalOpen in IMG/M
3300032063Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M
3300034818Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_3Host-AssociatedOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]

Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences listed below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
F12B_103522882 | 3300000443 | Soil | MTTTRIPLLEIGSLQSRLPDLIAKKGELPWSEAVVLTDDIQAFLICHPPGQP
Ga0055435_100754592 | 3300003994 | Natural And Restored Wetlands | MADRIPLMEIGKLQARVSELKAKKGPPPWSEALVMTDDVQ
Ga0055437_100373751 | 3300004009 | Natural And Restored Wetlands | MTTPPRIPLLQLGSLQSRLPDLIAKKGAPPWSEPLVLTDDIQAFLICHPPGQPNDTH
Ga0055436_100361721 | 3300004024 | Natural And Restored Wetlands | MTTPPRIPLLQLGSLQSRLPDLIAKKGAPPWSEPLVLTDDIQAFLICHPPGQPNDT
Ga0063356_1027312103 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MTDRVPLLEVGKLLAGLEELKAKKGEPPWSDAVVLTDDIQ
Ga0066676_104423731 | 3300005186 | Soil | MADRVPLMEIGSLQARLEDLKEKKGPPPWSEAMVMTDDIQAFIICHAP
Ga0068997_100037653 | 3300005204 | Natural And Restored Wetlands | MTDRIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAF
Ga0068995_100946662 | 3300005206 | Natural And Restored Wetlands | MTDRIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPPGQ
Ga0066388_1024332291 | 3300005332 | Tropical Forest Soil | MADRVPLLEIGKLHARVSELKAKKGAPPWSEAVVLTDDIQ
Ga0070680_1003332013 | 3300005336 | Corn Rhizosphere | VARAPTGSLNDLTDRIPLLQLGSLRSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPPGQPND
Ga0070687_1006248942 | 3300005343 | Switchgrass Rhizosphere | MTTTRIPLLELGSLQSRLPDLIAKKGKPPWSEALVLTDDIQAFLICHPPG
Ga0070669_1018288002 | 3300005353 | Switchgrass Rhizosphere | MTERIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPPGQPNDTHYHH
Ga0008090_145939582 | 3300005363 | Tropical Rainforest Soil | MPERVPLLEVGKLLASLEDLKAKKGEPPWSDAVVLTDDIQAFIIC
Ga0070667_1017343002 | 3300005367 | Switchgrass Rhizosphere | MTTTRIPLLELGSLQSRLPDLIAKKGKPPWSEALVLTDDI
Ga0070703_103537621 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTRIPLLELGSRQSCLPDLIAKKGEPPWSEALVLTDDIQAFLICHP
Ga0070662_1019309241 | 3300005457 | Corn Rhizosphere | MTERIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICH
Ga0070697_1001550711 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTRVPLLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQ
Ga0070686_1008318892 | 3300005544 | Switchgrass Rhizosphere | MTTTRIPLLELGSLQSRLPDLIAKKGKPPWSEALVLTDDIQAFLICHP
Ga0070695_1004891502 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MTERIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPPGQPNDTHYHHHD
Ga0066702_107367662 | 3300005575 | Soil | MADRMPLLDIGKLRARVDELRARKGPPPWSDTLVMTDD
Ga0068852_1023542471 | 3300005616 | Corn Rhizosphere | MTTTRIPLLELGSLQSRLPDLIAKKGKPPWSEALVLTDDIQAFLICHPPGQPN
Ga0066905_1020724041 | 3300005713 | Tropical Forest Soil | MPDRIPLLEIGSLQARLEDLKEKKGPPPWSEAMVMTDDIQAFIICH
Ga0066903_1029601041 | 3300005764 | Tropical Forest Soil | MTERMPLLEIGKLLASIHDLEAKHGEPPWSDPVVLTDDIQAFIIC
Ga0066652_1013940392 | 3300006046 | Soil | MADRVPLMEIGSLQARLEDLKEKKGPPPWSEAMVMTDDIQAFIICH
Ga0070715_101318382 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTRIPRLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAFLIFGPKHL*
Ga0066660_101335553 | 3300006800 | Soil | MADRMPLLDIGKLRARVDELRARKGPPPWSDTLVMTD
Ga0075421_1000736855 | 3300006845 | Populus Rhizosphere | MAVRVPLMETGRLRARVDELRARKGPPPWSDCLVLTDDIQAF
Ga0075431_1019191581 | 3300006847 | Populus Rhizosphere | MTERVPLLEVGALLARLDDLKAKKGEPPWSDALVLTDDIQAFIICH
Ga0079218_129254781 | 3300007004 | Agricultural Soil | MTERIPLLQIGSLQSRLPELIARKGAPPWSEAVVLTDDIQA
Ga0105106_105772391 | 3300009078 | Freshwater Sediment | MTERVPLLQLGSLQSRLPELIAKKGTPPWSEAVVLTDDIQAFLICHPPGQPNDTHYHHHD
Ga0105247_108799681 | 3300009101 | Switchgrass Rhizosphere | MTTTRIPLLELGSLQSRLPDPIAKKGEPPWSEALVLTDDIQAFLI
Ga0099792_108066912 | 3300009143 | Vadose Zone Soil | MPLLDIGKLRARVDELRARKGPPPWSDTLVMTDDIQAFIICH
Ga0105243_114527881 | 3300009148 | Miscanthus Rhizosphere | MTERIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPP
Ga0105243_125725292 | 3300009148 | Miscanthus Rhizosphere | MTTRVPLLELGSLQSRLPDLIAKKGEPPWSEALVMTDDIQAFIICHAPGHAND
Ga0105101_102174831 | 3300009171 | Freshwater Sediment | MTERVPLLQLGSLQSRLPELIAKKGTPPWSEAVVLTDDIQAFLICHPPG
Ga0105249_106654932 | 3300009553 | Switchgrass Rhizosphere | MAVRMPLLETGRLWTRVDELRARNGPAPWSEALVLTDDIQAFIICHPPGH
Ga0105340_15784632 | 3300009610 | Soil | MAIRVPLLETGRLWTRVEELRARKGAAPWSEALVLTD
Ga0126374_108450771 | 3300009792 | Tropical Forest Soil | MADRAPLMEIGKLHARVSELKAKKGAPPWSEAVVLTDDIQAFIICHPPGQPNDTH
Ga0126384_123271922 | 3300010046 | Tropical Forest Soil | MTERVPLLEVGKLLASVHDLEAKHGEPPWSHPVVLTDDIQAFIICHA
Ga0126377_123035452 | 3300010362 | Tropical Forest Soil | METGRLRARVDELRARKGPPPWSDVLVLTDDIQAFVICHPPGQP
Ga0126379_104320681 | 3300010366 | Tropical Forest Soil | MTERMPLLEIGKLLASVHDLEAKHGEPPWSDPVVLTDDIQAFIICHPPGHPNDT
Ga0126381_1005303871 | 3300010376 | Tropical Forest Soil | MTERVPLLEVGQLLAGPADLKAKHGEPPWSHPVVLTDDIQAFIICH
Ga0126383_111835822 | 3300010398 | Tropical Forest Soil | MADRVPLLEIGKLHARVSELKATKGAPPWSEAVVLTDDIQAFIICHLPG
Ga0126383_113948482 | 3300010398 | Tropical Forest Soil | MVDRVPLLEIGKLHARVSELKAKKGAPPWSEAVVL
Ga0134127_101233364 | 3300010399 | Terrestrial Soil | LTDRIPLLQLGSLRSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPPGQPNDTHY
Ga0137441_11321112 | 3300011425 | Soil | MTERIPLLQLGSLQSRLPDLIAKKGAPPWSEAVVLTD
Ga0137455_11497931 | 3300011429 | Soil | MAIRVPLMETGRLWTRVEELRARKGAAPWSEALVLTDDIQAFIICHPP
Ga0137389_104004012 | 3300012096 | Vadose Zone Soil | MTTRIPLLELGSLQSRLPELIAKKGEPPWSEALVLTDDIQAFLICHPPGQPNDT
Ga0137383_105581851 | 3300012199 | Vadose Zone Soil | MADRIPLMEIGKLQMRVSELKTKKGAAPWSEALVMTDDTQAFIICQAP
Ga0137363_110651291 | 3300012202 | Vadose Zone Soil | LDVGKLRARVEELRARKGAPPWSDTLVMTDDIQAFIICHLPGHPN
Ga0137367_103208792 | 3300012353 | Vadose Zone Soil | MADRIPLMEIGKLQMRVSELKTKKGAAPWSEALVMTD
Ga0137384_109298962 | 3300012357 | Vadose Zone Soil | MADRIPLMEIGKLQMRVSELKTKKGAAHWSEALVMTDNTQAFIICQAPGHP
Ga0137384_110083652 | 3300012357 | Vadose Zone Soil | MTTRIPLLELGSLQSRLPELIAKKGEPPWSEALVLTDDIQAFLICHPP
Ga0137361_115253161 | 3300012362 | Vadose Zone Soil | MADRLPLLDIGKLRAPVEDLRARKGAPPWSDTLVMTDDIQA
Ga0137373_100153731 | 3300012532 | Vadose Zone Soil | MAVRIPCLEVGRLQSRLDEIKRSKGQPPWSETLVMTDDIQAFVICHPAGQPNDTHYHLHD
Ga0137358_108850892 | 3300012582 | Vadose Zone Soil | MTDRIPLLQLGSLQSRLPELIAKKGAPPWSEPVVLTDDIQAFLICHPPGQPNDTHYHHHDERWVV
Ga0157309_102816561 | 3300012895 | Soil | MTTTRIPLLEIGSLQSRLPDLIAKKGEPPWSEAVVLTDDIQAFLICHSP
Ga0137359_111725322 | 3300012923 | Vadose Zone Soil | MTTRIPLLELGSLQSRLPELIAKKGEPPWSEALVLTDDIQAFLICHPPG*
Ga0137416_119562452 | 3300012927 | Vadose Zone Soil | MADHVPLMEIGSLQVRLEDLKEKKGPPPWSEAMVMTDDIQAFIICHAP
Ga0126375_102366141 | 3300012948 | Tropical Forest Soil | MTERMPLLEVGKLLASVHDLEAKHGEPPWSHPVVLTDDIQAFTICHA
Ga0157378_108136752 | 3300013297 | Miscanthus Rhizosphere | MADRIPLMEMGKLQMRMSELKTKKGTAPWSEALVMTD
Ga0157380_115613592 | 3300014326 | Switchgrass Rhizosphere | MSERVPLLEVGKLLAGLEDLKAKKGEPPWSDAVVLTDDIQAFIIC
Ga0180063_12678112 | 3300014885 | Soil | MTTPRIPLLQLGSLQSRLPELIAKRGAPPWSEAVVLTDDIQAF
Ga0137411_11207051 | 3300015052 | Vadose Zone Soil | MTDRIPLLQLGSLQSRLPELIAKKSAPPWSEPVVLTDDISRPS*
Ga0173483_109540112 | 3300015077 | Soil | MTTTRIPLLEIGSLQSRLPDLIAKKGEPPWSEAVVLTDDIQAFLIC
Ga0182007_100366661 | 3300015262 | Rhizosphere | MTTTRIPLLELGSLQSRPPDLIAKKGKPPWSEALV
Ga0132257_1003257511 | 3300015373 | Arabidopsis Rhizosphere | MTERVPLLEVGKLLAGLEDLNAKHGEPPWSHPVVLTDDIQAFIICHAPGHAN
Ga0134074_12794731 | 3300017657 | Grasslands Soil | MADRMPLLDIGKLRARVDELHAHKGPPPWSDTLVMTDDIQAFII
Ga0184604_103206141 | 3300018000 | Groundwater Sediment | MAVRVPLLETGRLWTRVEELRARKGAPPWSEALVLTDDIQ
Ga0184632_102219951 | 3300018075 | Groundwater Sediment | VADRMPLLDVGKLRARVDELRGRKGAPPWSDTLVMTD
Ga0190265_107379862 | 3300018422 | Soil | MAVRVPLLETGRLWTRVEELRARKGTPPWSEALVLTDDIQAFIICQPP
Ga0190272_125044752 | 3300018429 | Soil | MAVRVPLLETGRLWTRVEELRARKGTPPWSEALVLTDDIQAFIICQPPGHPN
Ga0190270_133561142 | 3300018469 | Soil | MAVRAPLLETGRLWTRAEELRARKGPAPWSEALILTDDIQAFIICHPPGHPNDTH
Ga0210407_101866643 | 3300020579 | Soil | MTTRVPLLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAFLIFGPKHL
Ga0210403_104870412 | 3300020580 | Soil | MTTRIPLLELGSLQSRLPDLIAKKGEPPWSETLVLTDDIQAFLICQG
Ga0210399_103321131 | 3300020581 | Soil | MTTRIPLLELGSLQSRLPDLIAKKGEPPWSETLVLT
Ga0210378_101150782 | 3300021073 | Groundwater Sediment | MTERNPLLQLGSLQSRLPELIARKGAPPWSEAVVLTDDIQAFVICHPPGQAN
Ga0210400_115749101 | 3300021170 | Soil | MTTRIPLLELGSLQSRLPDLIAKKGEPPWSETLVLTDDIQAFLI
Ga0126371_129702991 | 3300021560 | Tropical Forest Soil | MADRAPLMEIGKLHARVSELKAKWGAPPWSEPVVLT
Ga0242654_103199922 | 3300022726 | Soil | AATEGPPPWSDDMTTRIPRLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAFLIFGPKHL
Ga0207642_103384121 | 3300025899 | Miscanthus Rhizosphere | MAVRMPLLETGRLWTRVDELRARKGPAPWSEALVLTDDIQAFI
Ga0207684_103595031 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVRLPLLETGRLWTRVDELRARKGPAPWSEALVLT
Ga0207693_114026302 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTRIPLLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAFLIFGPKHL
Ga0207644_104350532 | 3300025931 | Switchgrass Rhizosphere | MTTTRIPLLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAFLICHP
Ga0207706_117076651 | 3300025933 | Corn Rhizosphere | MSERVPLLEVGKLLAGLEDLKAKKGEPPWSDPVVLTD
Ga0257176_10210742 | 3300026361 | Soil | VADPMPLLDVGKLRARVEDLRARKGAPPWSDALVMTDDIQAFIICHLLGH
Ga0257152_10250011 | 3300026369 | Soil | MTTRVPLLELGSLQSRLPELIAKKGEPPWSEALVLTD
Ga0257179_10449972 | 3300026371 | Soil | MTTRVPLLELGSLQSRLPELIARKGEPPWSEALVLTDDIQAFLICHP
Ga0257171_10701591 | 3300026377 | Soil | MTTRVPLLELGSLQSRLPELIAKKGEPPWSEALVLTDDIQ
Ga0257172_10330182 | 3300026482 | Soil | MTTRVPLLELGSLQSRLPELIARKGEPPWSEALVLTDDI
Ga0209807_11405391 | 3300026530 | Soil | MADRMPLLDIGKLRARVDELRAHKGPPPWSDTLVMTDDIQAFIICHLPGHLNDT
Ga0209648_106063611 | 3300026551 | Grasslands Soil | MTTRIPLLELGSLQSRLPELIAKKGEPPWSEALVLTDDIQAFLIC
Ga0208997_10415631 | 3300027181 | Forest Soil | MTDRIPLLQLGSLQSRLPELIARKGAPPWSEPVVLTDDIQAFLICHPPGQPNDT
Ga0209874_11514802 | 3300027577 | Groundwater Sand | MTTRIPLLELGSLWSRLPDLIAKKGEPPWSEALVLTDDIQA
Ga0210002_10697421 | 3300027617 | Arabidopsis Thaliana Rhizosphere | MTTTRIPLLEIGSLQSRLPDLIAKKGEPPWSEAVVLTDDIQAFLICHSPGQPNDTH
Ga0209819_101713762 | 3300027722 | Freshwater Sediment | MTDRVPLLEVGKLLAGLEELKAKKGEPPWSDAVVLTDDI
Ga0209073_100087864 | 3300027765 | Agricultural Soil | MTTTRIPLLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAFLICHTPG
Ga0209074_102057452 | 3300027787 | Agricultural Soil | MTTTRIPLLELGSLQSRLPDLIAKKGKPPWSEALVLT
Ga0209859_10303271 | 3300027954 | Groundwater Sand | MTTRIPLLELGSLWSRLPDLIAKKGEPPWSEALVLTDDIQAF
Ga0268264_118280062 | 3300028381 | Switchgrass Rhizosphere | MSERVPLLEVGKLLAGLEDLKAKKGEPPWSDAVVLTDDI
Ga0257175_10375091 | 3300028673 | Soil | VADPMPLLDVGKLRARVEDLRARKGAPPWSDTLVMTDDIQAFIICHLPGHPN
Ga0307281_102750402 | 3300028803 | Soil | MTTRNPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFVI
Ga0307292_102344911 | 3300028811 | Soil | MTDRIPLLQLGSLQSRLPELIAKKGAPPWSEPVVLTDDIQAFLICHPPG
Ga0247825_105879081 | 3300028812 | Soil | MATQATAPAPARIPLLDPGTLQARLETIKAKRATPPWSEAIVLTDDIQAFLICHAPGQPNDTHYHLHDEWW
Ga0307304_103159681 | 3300028885 | Soil | MTTRIPLLELGSLQSRLPELIAKTGEPPWSDALVLTDDI
Ga0247826_110977711 | 3300030336 | Soil | MTDRIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPPGQPNDTHYHH
(restricted) Ga0255311_10220001 | 3300031150 | Sandy Soil | MTDRIPLLQLGTLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICH
(restricted) Ga0255312_10100213 | 3300031248 | Sandy Soil | MSTPPRIPLLQLGSLQSRLPELIARKGAPPWSEAV
Ga0307505_104663571 | 3300031455 | Soil | MTERIPLLQLGSLQSRLPELIARKGAPPWSEAVVLTDDIQAFLICHP
Ga0318573_100320884 | 3300031564 | Soil | MADRVPLLEIGKLHARVRELKATKGAPPWSEAVVLTDDI
Ga0307469_107709033 | 3300031720 | Hardwood Forest Soil | LELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAFLIFGPKHL
Ga0307468_1021797072 | 3300031740 | Hardwood Forest Soil | MTTRIPRLELGSLQSRLPDLIAKKGEPPWSETLVLTDDIQAFLIFGPKHL
Ga0318494_100156181 | 3300031751 | Soil | MADRVPLLEIGKLHARVRELKATKGAPPWSEAVVLT
Ga0318568_100129411 | 3300031819 | Soil | MADRVPLLEIGKLHARVRELKATKGAPPWSEAVVLTDDIQAFIICHLPG
Ga0310907_105355401 | 3300031847 | Soil | MTTRVPLLELGSLQSRLPDLIAKKGEPPWSEALVL
Ga0318536_101794721 | 3300031893 | Soil | MADRVPLLEIGKLHARVRELKATKGAPPWSEAVVLTDDIQAFIICHLPGQ
Ga0318520_101695891 | 3300031897 | Soil | MADRVPLLEIGKLHARVRELKATKGAPPWSEAVVLTDDIQAFIICHLPGQPN
Ga0318504_101993711 | 3300032063 | Soil | MADRVPLLEIGKLHARVRELKATKGAPPWSEAVVL
Ga0306920_1032520852 | 3300032261 | Soil | MTERVPLLEVGKLLAGLEDLNAKHGESPWSHPVVLTDDI
Ga0335084_100538912 | 3300033004 | Soil | MTTTRIPLLEIGSLQSRLPDLIAKKGEPPWSVLTDDIL
Ga0335084_101748274 | 3300033004 | Soil | MADRIPLLEIGKLQARVSELREKKGEPPWSEALVMTDDIQAFIIC
Ga0247830_104221891 | 3300033551 | Soil | MTTTRIPLLELGSLQSRLPDLIAKKGEPPWSEALVLTDDI
Ga0364928_0056746_1_156 | 3300033813 | Sediment | LTERIPLLQLGSLQSRLPELIAKKGAPPWSEAVVLTDDIQAFLICHPPGQPN
Ga0373950_0025919_936_1061 | 3300034818 | Rhizosphere Soil | MTTRVPLLELGSLQSRLPDLIAKKGEPPWSEALVLTDDIQAF




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice