NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F078917

Overview

Basic Information
Family ID F078917
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 43 residues
Representative Sequence VGKDPTARQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG
Number of Associated Samples 106
Number of Associated Scaffolds 116
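
The representative sequence can be checked against the reported average length directly. The following is a minimal Python sketch; the constant and function names are illustrative only and are not part of NMPFamsDB.

```python
# Values copied from the Basic Information block above.
REPRESENTATIVE_SEQUENCE = "VGKDPTARQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG"
AVERAGE_LENGTH = 43  # residues, as reported for the family


def length_offset(sequence: str, average: int) -> int:
    """Return how many residues the sequence is longer (+) or shorter (-) than the average."""
    return len(sequence) - average


print(len(REPRESENTATIVE_SEQUENCE), length_offset(REPRESENTATIVE_SEQUENCE, AVERAGE_LENGTH))
```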

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 28.57 %
% of genes near scaffold ends (potentially truncated) 4.31 %
% of genes from short scaffolds (< 2000 bps) 5.17 %
Associated GOLD sequencing projects 100
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.48


Most Common Taxonomy
Group Unclassified (93.966 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (10.345 % of family members)
Environment Ontology (ENVO) Unclassified (25.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (42.241 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 26.39%, β-sheet: 11.11%, Coil/Unstructured: 62.50%

Predicted 3D Structure

pTM-score: 0.48

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
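
The rule stated in the note above (pTM < 0.7 means low confidence) can be written as a one-line check. This is only a sketch: the function name is hypothetical, and the strict "less than" comparison is taken from the wording of the note rather than from any NMPFamsDB code.

```python
LOW_CONFIDENCE_PTM = 0.7  # threshold quoted in the note above


def is_low_confidence(ptm_score: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when the pTM-score is strictly below the threshold."""
    return ptm_score < threshold


# pTM-score reported for family F078917
print(is_low_confidence(0.48))
```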


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 116 Family Scaffolds
PF02943 | FeThRed_B | 68.97
PF01042 | Ribonuc_L-PSP | 7.76
PF01958 | Asp_DH_C | 3.45
PF00206 | Lyase_1 | 1.72
PF09674 | DUF2400 | 1.72
PF06938 | DUF1285 | 0.86
PF02668 | TauD | 0.86
PF03070 | TENA_THI-4 | 0.86
PF00378 | ECH_1 | 0.86
PF12867 | DinB_2 | 0.86
PF01569 | PAP2 | 0.86
PF01547 | SBP_bac_1 | 0.86
PF13181 | TPR_8 | 0.86
PF14518 | Haem_oxygenas_2 | 0.86
PF02515 | CoA_transf_3 | 0.86
PF04909 | Amidohydro_2 | 0.86
PF03473 | MOSC | 0.86
PF01593 | Amino_oxidase | 0.86
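
Because the frequencies in the table above are relative to the 116 family scaffolds, each percentage can be turned back into an approximate scaffold count. The sketch below assumes the percentages were rounded to two decimals; `pct_to_count` is an illustrative helper, not an NMPFamsDB function.

```python
TOTAL_SCAFFOLDS = 116  # from the table header


def pct_to_count(pct: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Return the nearest whole number of scaffolds for a percentage of `total`."""
    return round(pct / 100 * total)


# A few rows copied from the Pfam table above (Pfam ID -> % frequency).
pfam_pct = {
    "PF02943": 68.97,
    "PF01042": 7.76,
    "PF01958": 3.45,
    "PF00206": 1.72,
    "PF06938": 0.86,
}

for pfam_id, pct in pfam_pct.items():
    print(pfam_id, pct_to_count(pct))
```

Under that rounding assumption, 0.86 % corresponds to a single scaffold, so most of the listed domains were seen next to only one family member.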

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 116 Family Scaffolds
COG4802 | Ferredoxin-thioredoxin reductase, catalytic subunit | Energy production and conversion [C] | 68.97
COG0251 | Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 family | Defense mechanisms [V] | 7.76
COG1712 | L-aspartate dehydrogenase, NAD(P)-dependent | Amino acid transport and metabolism [E] | 3.45
COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 0.86
COG2175 | Taurine dioxygenase, alpha-ketoglutarate-dependent | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.86
COG3816 | Uncharacterized conserved protein, DUF1285 family | Function unknown [S] | 0.86


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 93.97 %
All Organisms | root | All Organisms | 6.03 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300025912|Ga0207707_10227592 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1623
3300027526|Ga0209968_1010963 | All Organisms → cellular organisms → Bacteria | 1398
3300027671|Ga0209588_1067360 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1157
3300027886|Ga0209486_10139384 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1326
3300027907|Ga0207428_10164322 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1684
3300031548|Ga0307408_100004122 | All Organisms → cellular organisms → Bacteria | 9902
3300032126|Ga0307415_101261700 | All Organisms → cellular organisms → Bacteria | 698

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
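
The scaffold identifiers in the table above appear to join two parts with `|`: a numeric prefix that matches a Taxon OID listed under Associated Samples, and a scaffold name. A minimal Python sketch for splitting them follows; the `ScaffoldRef` type and `parse_scaffold_id` function are hypothetical names introduced for illustration, and the two-part layout is inferred from this page rather than from a published specification.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # numeric sample identifier, e.g. "3300025912"
    scaffold: str   # scaffold name within that sample


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold>' into its two parts, validating the layout."""
    taxon_oid, sep, scaffold = raw.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold)


ref = parse_scaffold_id("3300025912|Ga0207707_10227592")
print(ref.taxon_oid, ref.scaffold)
```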



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.34%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.62%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 7.76%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.17%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.17%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 5.17%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.17%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 5.17%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.31%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.31%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.45%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 3.45%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.59%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.72%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.72%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.72%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.72%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.72%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 1.72%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.86%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.86%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.86%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 0.86%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.86%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.86%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.86%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.86%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.86%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.86%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.86%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.86%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.86%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.86%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.86%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.86%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.86%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.86%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.86%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.86%
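
Each habitat in the table above carries a full GOLD ecosystem path, so the distribution can be rolled up to a coarser level by truncating the path and summing the percentages. The sketch below uses four rows copied from the table; `aggregate_by_level` is an illustrative name, and this is one plausible way to regroup the data, not necessarily how NMPFamsDB computes its own level views.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

SEP = " → "  # separator used in the GOLD ecosystem paths

# (ecosystem path, % of family members); a small subset of the table rows.
ROWS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 10.34),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 8.62),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 7.76),
    ("Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment", 1.72),
]


def aggregate_by_level(rows: Iterable[Tuple[str, float]], level: int) -> Dict[str, float]:
    """Sum percentages over the first `level` components of each ecosystem path."""
    if level < 1:
        raise ValueError("level must be at least 1")
    totals: Dict[str, float] = defaultdict(float)
    for path, pct in rows:
        key = SEP.join(path.split(SEP)[:level])
        totals[key] += pct
    return dict(totals)


for key, pct in aggregate_by_level(ROWS, 2).items():
    print(f"{key}: {pct:.2f}%")
```

Passing `level=1` groups the same rows into the top-level categories (Environmental and Host-Associated).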


Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300003267 | Sugarcane bulk soil Sample L1 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005718 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006194 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 | Host-Associated
3300006604 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLMB (Metagenome Metatranscriptome) | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300006876 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 | Environmental
3300006894 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control | Environmental
3300009011 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-4 metaG | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009166 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm May2015 | Environmental
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300009821 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 | Environmental
3300010107 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300010857 | Boreal forest soil eukaryotic communities from Alaska, USA - W1-3 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011119 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaG | Host-Associated
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012224 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012379 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012398 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012405 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental
3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental
3300015372 | Soil combined assembly | Host-Associated
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300017792 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaG | Host-Associated
3300017939 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MG | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019228 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT790_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental
3300020063 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300025558 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 (SPAdes) | Environmental
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental
3300025935 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes) | Host-Associated
3300026032 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D1 (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental
3300026323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental
3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026535 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq) | Environmental
3300027277 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes) | Environmental
3300027511 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 (SPAdes) | Environmental
3300027526 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 AM (SPAdes) | Host-Associated
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027886 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes) | Environmental
3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300027950 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50 (SPAdes) | Environmental
3300027952 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes) | Environmental
3300028043 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MG | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental
3300031547 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4 | Environmental
3300031548 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3 | Host-Associated
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300032063 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17 | Environmental
3300032126 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-2 | Host-Associated
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12053J15887_104614431 | 3300001661 | Forest Soil | NVGRDPGAREELMALGLTSLPVLLIGDKRLTGFNPTQIDAALSAST*
JGI25382J43887_103012741 | 3300002908 | Grasslands Soil | VGKDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQG*
soilL1_100203332 | 3300003267 | Sugarcane Root And Bulk Soil | VGRDPGAREELMEIGLTSLPVILIGEHKLAGFNPKKIDEALADGADHG*
Ga0063356_1007050253 | 3300004463 | Arabidopsis Thaliana Rhizosphere | VGRDAGAREELMEIGLTSLPVILIGEHKLAGFNPKKIDEALAGSADHG*
Ga0062595_1025520742 | 3300004479 | Soil | GKDPTARQELMELGLTSLPVILIGDKRLTGFNPTAIDAALAG*
Ga0066677_104872062 | 3300005171 | Soil | VGRDPEAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQS*
Ga0066388_1052383921 | 3300005332 | Tropical Forest Soil | VGRDPEARQELIALGLLSLPVLLIGEQKLTGFNPNAIDSALAART*
Ga0070703_105489131 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | TEKNVGRDPEARQELMALGLLSLPVLLIGDKKLTGFNPKAIDAALAES*
Ga0070705_1003273071 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | PKAREELMALGLLSLPVIIIGDKRLTGFNPKAIDAALGAP*
Ga0070698_1000659354 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | VGRDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQG*
Ga0070695_1018139611 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | RNVGRDPGAREELMEIGLTSLPVILIGEHKLAGFNPKKIDEALATLHD*
Ga0070696_1001323243 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | DPEARQELMALGLLSLPVLLIGDKKLTGFNPKAIDAALSEG*
Ga0066695_102219431 | 3300005553 | Soil | GKDPVARQELMAIGLTSLPVLLIGEKKLTGFNPNQIDEAINALNG*
Ga0066695_102464842 | 3300005553 | Soil | VGRDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQS*
Ga0066693_101616583 | 3300005566 | Soil | GKDPVARQELMDIGLTSLPVLLIGEKKLTGFNPNQIDEAINALNG*
Ga0066905_1000362633 | 3300005713 | Tropical Forest Soil | VGRDPEARQELIALGLLSLPVLLIGEQKLTGFNPTAIDAALAALT*
Ga0066905_1000493824 | 3300005713 | Tropical Forest Soil | VGRDPEARQELIALGLLSLPVLLIGEQKLTGFNPNAIDAALAALT*
Ga0066905_1018804201 | 3300005713 | Tropical Forest Soil | VGKDPTARQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG*
Ga0068866_102739271 | 3300005718 | Miscanthus Rhizosphere | PTARQELMELGLMSLPVILIGDKRLTGFNPNAIDAALAE*
Ga0066903_1052427952 | 3300005764 | Tropical Forest Soil | RQELMEIGITSLPVIIIGETRLAGFNPSKIDEALAQVQP*
Ga0075427_100269181 | 3300006194 | Populus Rhizosphere | DPKAREELMALGLLSLPVIIIGDKRLTGFNPKAIDAALSAS*
Ga0074060_102972602 | 3300006604 | Soil | ARQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG*
Ga0079222_126349961 | 3300006755 | Agricultural Soil | VGKDPTARQELMELGLTSLPVILIGDKRLTGFNPTAIDAALAG*
Ga0066658_100629463 | 3300006794 | Soil | GKDPKGREELMALGLLSLPVIIIGDKRLTGCNPNAIDGALNAS*
Ga0075428_1021613251 | 3300006844 | Populus Rhizosphere | VGRDPGAREELMEIGLTSLPVILIGEHKLAGFNPKKIDEALAESST*
Ga0075421_1015823731 | 3300006845 | Populus Rhizosphere | REELMALGLLSLPVILIGDKRLTGFNPKAIDAALSAS*
Ga0075421_1022776611 | 3300006845 | Populus Rhizosphere | NIGKDPGAREELMATGLMSLPVLIIGDKKITGFNPKMIDATIAEQSGG*
Ga0075433_108253323 | 3300006852 | Populus Rhizosphere | TEKNVGRDPEARQELMALGLLSLPVLIIGDKKLTGFNPKAIDAALADT*
Ga0075420_1000955214 | 3300006853 | Populus Rhizosphere | KDPAARQELATMGLMSLPVLLIGDKRLTGFNPAQIDAALAEAGS*
Ga0079217_111598121 | 3300006876 | Agricultural Soil | NVGRDPQARQELMDLGLLSLPVLLIGDKKLTGFNPAQIDAALAAAGGGSST*
Ga0079215_100964902 | 3300006894 | Agricultural Soil | VGKDPTARQELMELGLMSLPVILIGDKRLTGFNPNAIDAALAAE*
Ga0105251_105411871 | 3300009011 | Switchgrass Rhizosphere | ARKELMAIGIMSLPVIIIGETRLAGFNPAKIDEALAKAEG*
Ga0066710_1014410231 | 3300009012 | Grasslands Soil | PFTERNVGRDPGAREELMALGLTSLPVLLIGDKRLTGFNPAQIDAALSAS
Ga0099829_112949912 | 3300009038 | Vadose Zone Soil | KDPTARQELMDLGLTSLPVILIGDKRLTGFNPTAIDAALAGN*
Ga0114129_103766454 | 3300009147 | Populus Rhizosphere | VGKDPKAREELMALGLLSLPVIIIGDKRLTGFNPKAIDAALSAS*
Ga0105243_105340211 | 3300009148 | Miscanthus Rhizosphere | VGKDPRAGQELMELGLTSLHVILIGDKRLTGFNPTAIDAALAG*
Ga0105100_108005841 | 3300009166 | Freshwater Sediment | RKELMAIGIMSLPVIIIGETRLAGFNPAKIDEALAKAQG*
Ga0105242_112910051 | 3300009176 | Miscanthus Rhizosphere | ERNVGKDPTARQELMELGLMSLPVILIGDKRLTGFNPNAIDAALAG*
Ga0114945_106090921 | 3300009444 | Thermal Springs | VGRDPDARKELISLGVTSLPVLLIGDSRLHGFNPAQIDAALAALAE*
Ga0126374_102315472 | 3300009792 | Tropical Forest Soil | MEIGLLSLPVIIIGETRLTGFNPTKIDEALAQVKG*
Ga0105064_10742242 | 3300009821 | Groundwater Sand | VGRDPEARQELISLGLLSLPVLLIGDQKLTGFNPNAIDAALAALT*
Ga0127494_10895251 | 3300010107 | Grasslands Soil | RNVGKDPKAREALMALGLLSLPVIIIGDKRLTGFNPTQIDAALNAS*
Ga0126376_103228913 | 3300010359 | Tropical Forest Soil | VGRDPKAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQIQG*
Ga0126372_132376051 | 3300010360 | Tropical Forest Soil | ARQELMTLGLLSLPVLLIGDKKLTGFNPKAIDAALAEG*
Ga0126377_113422131 | 3300010362 | Tropical Forest Soil | PKAREELMALGLLSLPVIIIGDKRLTGFNPKAIDAALSAS*
Ga0126377_123191082 | 3300010362 | Tropical Forest Soil | EARQELMEIGITSLPVIIIGETRLAGFNPMKIDEALAREQG*
Ga0134123_126939501 | 3300010403 | Terrestrial Soil | ARQELMELGLTSLPVILIGDKRLTGFNPTAIDAALAG*
Ga0126354_11674191 | 3300010857 | Boreal Forest Soil | REELIGLGLTSLPVLIIDGQKLAGFNPNAIDAALSR*
Ga0105246_114151163 | 3300011119 | Miscanthus Rhizosphere | MAIGIMSLPVIIIGETRLAGFNPAKIDEALAKAET*
Ga0137365_109053603 | 3300012201 | Vadose Zone Soil | DPEARQELMELGLLSLPVLLIGEQKLTGFNPTKIDEALKTLDA*
Ga0137362_109214483 | 3300012205 | Vadose Zone Soil | REELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQVQG*
Ga0134028_10619891 | 3300012224 | Grasslands Soil | VGKDPTARQELMELGLTSLPVILIGDKRLTGFNPAQIDAALSAST*
Ga0137361_111718721 | 3300012362 | Vadose Zone Soil | ERNVGRDPGAREELMALGLTSLPVLLIGDKRLTGFNPAQIDAALSAS*
Ga0134058_10706862 | 3300012379 | Grasslands Soil | FTERNVGRDPKAREELMALGLTSLPVLLIGDKRLTGFNPAQIDAALSAS*
Ga0134051_12569412 | 3300012398 | Grasslands Soil | ERNVGRDPKAREELMALGVTSLPVLLIGDKRLTGFNPAQIDAALGAS*
Ga0134041_12154612 | 3300012405 | Grasslands Soil | REELMALGLLSLPVIIIGDKRLTGFNPNAIDAALNAS*
Ga0137358_103457672 | 3300012582 | Vadose Zone Soil | MAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQS*
Ga0137404_102953273 | 3300012929 | Vadose Zone Soil | VGRDPTAREELMALGLTSLPVLLIGDKRLTGFNPAQIDAALSAS*
Ga0137410_102669983 | 3300012944 | Vadose Zone Soil | EELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQG*
Ga0126375_100019141 | 3300012948 | Tropical Forest Soil | NVGKDPKAREELMALGLLSLPVIIIGDKRLTGFNPKAIDAALGAA*
Ga0180063_11214872 | 3300014885 | Soil | VGRDPEARQELISLGLLSLPVLLIGEQKLTGFNPNAIDAALAALT*
Ga0180085_10494384 | 3300015259 | Soil | PGAREELMEIGLTSLPVILIGEHKLAGFNPKKIDEALASL*
Ga0180085_11563821 | 3300015259 | Soil | RDPEARQELISVGLLSLPVLLIGEQKLTGFNPNAIDAALAALT*
Ga0132256_1013427321 | 3300015372 | Arabidopsis Rhizosphere | TARQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG*
Ga0134069_11631802 | 3300017654 | Grasslands Soil | VGRDPQAREELMAIGMTSLPVIIISETRLAGFNPAKIDEALAQAQS
Ga0163161_103359033 | 3300017792 | Switchgrass Rhizosphere | PKARGELMALGLLSLPVIIIGDKRLTGFNPKAIDAALGAP
Ga0187775_101898122 | 3300017939 | Tropical Peatland | VGKDPQARQELMEIGLLSLPVILIGDLRLTGFNPKKIDEALAQVEG
Ga0184619_102766033 | 3300018061 | Groundwater Sediment | RQELMELGLTSLPVILIGDKRLTGFNPNAIDAALAAE
Ga0066667_102847172 | 3300018433 | Grasslands Soil | VGRDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQVQS
Ga0066669_109545433 | 3300018482 | Grasslands Soil | VGKDPVARQELMEIGLTSLPVLLIGEKKLTGFNPNQIDEAINALNG
Ga0180119_12574463 | 3300019228 | Groundwater Sediment | VGRDPEARQELISLGLLSLPVLLIGEQKLTGFNPNAIDAALAALT
Ga0137408_10285702 | 3300019789 | Vadose Zone Soil | VGRDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQG
Ga0180118_13839051 | 3300020063 | Groundwater Sediment | VGRDPEARQELISLGLLSLPVLLIGEQKLTGFNPKAIDAALAAQT
Ga0210139_11239882 | 3300025558 | Natural And Restored Wetlands | DPQARQELMEIGITSLPVIIIGETRLAGFNPMKIDEALAAQG
Ga0207707_102275921 | 3300025912 | Corn Rhizosphere | VGKDPTARQELMELGLMSLPVILIGDKRLTGFNPNAIDAALAE
Ga0207709_105471293 | 3300025935 | Miscanthus Rhizosphere | KDPTARQELMELGLTSLPVILIGDKRLTGFNPTAIDAALAG
Ga0208419_10429191 | 3300026032 | Natural And Restored Wetlands | NVGRDPGAREELMAIGLTSLPVILIGDHKLAGFNPKKIDEALAAQGS
Ga0209235_11211563 | 3300026296 | Grasslands Soil | ELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQS
Ga0209761_11237172 | 3300026313 | Grasslands Soil | VGRDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQS
Ga0209686_11935821 | 3300026315 | Soil | AREELMALGLLSLPVIIIGDKRLTGFNPNAIDAALNAS
Ga0209472_12268361 | 3300026323 | Soil | VGKDPGARQELMEIGLTSLPVLLIGEKKLTGFNPNQIDEAINALNG
Ga0209803_11684061 | 3300026332 | Soil | VGKDPKAREELMALGLLSLPVIIIGDKRLTGFNPNAIDAALKAS
Ga0209158_12207592 | 3300026333 | Soil | VGKDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQG
Ga0209804_13007052 | 3300026335 | Soil | ELMALGLLSLPVIIIGDKRLTGFNPNAIDAALNAS
Ga0209690_12025821 | 3300026524 | Soil | ELMALGLLSLPVIIIGDKRLTGFNPNAIDAALKAS
Ga0209806_11385362 | 3300026529 | Soil | VGKDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQS
Ga0256867_101382513 | 3300026535 | Soil | ERNVGKDPTARQELMELGLMSLPVILIGDKRLTGFNPNAIDAALAE
Ga0209846_106559023300027277Groundwater SandVGRDPEARQELISLGLLSLPVLLIGGQKLTGFNPNAIDAALAALT
Ga0209843_107878123300027511Groundwater SandVGRDPEARQELISLGLLSLPVLLIGDQKLTGFNPNAIDAALA
Ga0209968_101096333300027526Arabidopsis Thaliana RhizosphereVGRDPGAREELMEIGLTSLPVILIGEHKLAGFNPKKIDEALAEGADHG
Ga0209588_106736043300027671Vadose Zone SoilGAREELMELGLTSLPVILIGELRLSGFNPKKIDEALAGS
Ga0209180_1001398933300027846Vadose Zone SoilMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQG
Ga0209283_1057802713300027875Vadose Zone SoilELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQG
Ga0209486_1013938413300027886Agricultural SoilPTARQELMELGLMSLPVILIGDKRLTGFNPKAIDAALAGE
Ga0207428_1016432253300027907Populus RhizosphereNVGKDPTARQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG
Ga0209382_1159038823300027909Populus RhizosphereKDPTARQELMELGLMSLPVILIGDKRLTGFNPNAIDAALAE
Ga0209885_103235923300027950Groundwater SandPKAREELMALGLTSLPVLLIGDKRLTGFNPAQIDAALSAS
Ga0209889_100031913300027952Groundwater SandERNVGRDPKAREELMALGLTALPVLLIGDKRLTGFNPAQIDAALSAS
Ga0209889_103646713300027952Groundwater SandVGRDPKAREELMALGLTSLPVLLIGDKRLTGFNPAQIDAALSAS
(restricted) Ga0233417_1041804323300028043SedimentVGRDPEARQELIALGLLSLPVLLIGEQKLTGFNPNAIDAALAALG
Ga0209526_1023833523300028047Forest SoilVGRDPQAREELMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQAQ
Ga0137415_1013727123300028536Vadose Zone SoilMAIGMTSLPVIIIGETRLAGFNPAKIDEALAQVQG
Ga0307504_1036303123300028792SoilPKAREELMALGLLSLPVIIIGDKRLTGFNPKAIDAALGAS
(restricted) Ga0255312_104364233300031248Sandy SoilEARQELMALGLLSLPVLLIGDKKLTGFNPKAIDAALAGG
Ga0310887_1008398353300031547SoilERNVGKDPTARQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG
Ga0307408_10000412283300031548RhizosphereVGKDPTARQELMELGLMSLPVILIGDKRLTGFNPKAIDAALAGE
Ga0307469_1003262043300031720Hardwood Forest SoilVGKDPTARQELMELGLTSLPVILIGDKRLTGFNPTAIDAALAG
Ga0307468_10004422813300031740Hardwood Forest SoilRQELMELGLLSLPVILIGDKRLTGFNPKAIDAALAEG
Ga0318504_1056267413300032063SoilARQELMEIGITSLPVIIIGETRLAGFNPAKIDEALAQVKS
Ga0307415_10126170033300032126RhizosphereAGAREELMAIGLTSLPVILIGEHKLAGFNPKKIDEALAGVSDG
Ga0307470_1061311613300032174Hardwood Forest SoilVGRDPEARQELIALGLLSLPVLLIGEQKLTGFNPKAIDTALAALA
Ga0307472_10017822123300032205Hardwood Forest SoilVGRDPEARQELMALGLLSLPVLLIGDKRLTGFNPKAIDAALAEG
Ga0307472_10118185323300032205Hardwood Forest SoilVGKDPQARQELMEIGITSLPVIIIGETRLAGFNPMKIDEALAAQG
Ga0307472_10241903323300032205Hardwood Forest SoilEARQELMALGLLSLPVLLIGDKKLTGFNPKAIDAALAEG
Ga0326726_1076098813300033433Peat SoilGKDPQARQELMEIGILSLPVIIIGETRLAGFNPMKIDEALAKMQS
Ga0326726_1115739223300033433Peat SoilMEIGILSLPVIIIGETRLAGFNPMKIDEALAKMQP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice