NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F086878



Overview

Basic Information
Family ID F086878
Family Type Metagenome / Metatranscriptome
Number of Sequences 110
Average Sequence Length 50 residues
Representative Sequence MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGD
Number of Associated Samples 107
Number of Associated Scaffolds 110
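
As a rough illustration of how a family member relates to the representative sequence, the sketch below compares the representative with one member from this page (the corn rhizosphere sequence from sample 3300001990) position by position, without gaps. This is an assumption made for illustration only: it is not how NMPFamsDB clusters or aligns families, and the function name `ungapped_identity` is hypothetical.

```python
# Representative sequence and one family member, copied from this page.
REPRESENTATIVE = "MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGD"
MEMBER = "MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVV"


def ungapped_identity(a: str, b: str) -> float:
    """Fraction of identical residues over the shared prefix length.

    Both sequences are compared from their first residue with no gaps,
    which is a simplification and not a real alignment.
    """
    shared = min(len(a), len(b))
    if shared == 0:
        return 0.0
    # zip() stops at the end of the shorter sequence.
    matches = sum(x == y for x, y in zip(a, b))
    return matches / shared


print(len(REPRESENTATIVE))                                  # 50
print(f"{ungapped_identity(REPRESENTATIVE, MEMBER):.2f}")   # 0.96
```

The representative is 50 residues long, matching the average sequence length reported above, and differs from this member at two of the 50 compared positions.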

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 6.36 %
% of genes from short scaffolds (< 2000 bps) 6.36 %
Associated GOLD sequencing projects 104
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66


Most Common Taxonomy
Group Unclassified (93.636 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(12.727 % of family members)
Environment Ontology (ENVO) Unclassified
(33.636 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(34.545 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 44.87%, β-sheet: 0.00%, Coil/Unstructured: 55.13%

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
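
The cutoff quoted above can be expressed as a one-line check. The sketch below is a minimal illustration that assumes only the threshold stated on this page (pTM < 0.7); the names `PTM_THRESHOLD` and `is_low_confidence` are hypothetical and not part of any NMPFamsDB tooling.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted on this page for a low confidence model


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when the pTM-score falls strictly below the cutoff."""
    return ptm < threshold


print(is_low_confidence(0.66))  # True: this family's model (pTM-score 0.66)
```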



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 110 Family Scaffolds
PF03259 | Robl_LC7 | 75.45
PF00025 | Arf | 14.55
PF00071 | Ras | 4.55
PF02518 | HATPase_c | 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 110 Family Scaffolds
COG2018 | Predicted regulator of Ras-like GTPase activity, Roadblock/LC7/MglB family | Signal transduction mechanisms [T] | 75.45
COG1100 | GTPase SAR1 family domain | General function prediction only [R] | 14.55



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 93.64 %
All Organisms | root | All Organisms | 6.36 %
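
The percentages on this page can be read back as member counts. The sketch below assumes each percentage is a share of the 110 family sequences, rounded to two decimals; that interpretation is an assumption, and the helper name `members_from_percent` is hypothetical.

```python
FAMILY_SIZE = 110  # "Number of Sequences" reported in the overview


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a rounded percentage back to the nearest whole member count."""
    return round(percent / 100 * total)


print(members_from_percent(93.64))  # 103 members with unclassified taxonomy
print(members_from_percent(6.36))   # 7 members with an assigned taxonomy
```

Under this assumption the two groups add up to 110, and the count of 7 agrees with the seven scaffolds that carry a taxonomy under Associated Scaffolds.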


Associated Scaffolds


Scaffold | Taxonomy | Length
3300002245|JGIcombinedJ26739_100228780 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1750
3300019880|Ga0193712_1016607 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1535
3300020002|Ga0193730_1025923 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1696
3300020003|Ga0193739_1016318 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1938
3300022531|Ga0242660_1009446 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1626
3300022724|Ga0242665_10015695 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1657
3300025965|Ga0210090_1004467 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1870

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.73%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 10.91%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 3.64%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.64%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.64%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.64%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.73%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.73%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.73%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.73%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 2.73%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.82%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.82%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.82%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.82%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.82%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.82%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.82%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.91%
Hot Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Hot Spring | 0.91%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.91%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.91%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.91%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.91%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.91%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.91%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.91%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.91%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.91%
| Environmental → Unclassified → Unclassified → Unclassified → Unclassified → | 0.91%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.91%
Corn, Switchgrass And Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.91%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.91%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.91%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.91%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.91%




Associated Samples


Taxon OID | Sample Name | Habitat Type
2124908016 | Sample 642 | Environmental
3300001990 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C3 | Host-Associated
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004052 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004145 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300005347 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG | Host-Associated
3300005441 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005563 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 | Host-Associated
3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated
3300005875 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 | Environmental
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009814 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 | Environmental
3300009836 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010313 | Hot spring microbial communities from South Africa to study Microbial Dark Matter (Phase II) - Sagole hot spring metaG | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300013102 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaG | Host-Associated
3300013307 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaG | Host-Associated
3300014326 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaG | Host-Associated
3300014873 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10D | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017973 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_20_MG | Environmental
3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental
3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental
3300019880 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a1 | Environmental
3300019999 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a1 | Environmental
3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2 | Environmental
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1 | Environmental
3300020003 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2 | Environmental
3300020010 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1s2 | Environmental
3300020067 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT47_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300022506 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-26-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022531 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022724 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2) | Environmental
3300023057 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S136-409B-6 | Environmental
3300025165 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1 | Environmental
3300025167 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2 (SPAdes) | Environmental
3300025901 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4 (SPAdes) | Host-Associated
3300025935 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes) | Host-Associated
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated
3300025959 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 (SPAdes) | Environmental
3300025965 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 (SPAdes) | Environmental
3300026355 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-A | Environmental
3300026360 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-B | Environmental
3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental
3300026371 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-B | Environmental
3300026446 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-B | Environmental
3300026469 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-B | Environmental
3300026496 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-A | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300027424 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S PM (SPAdes) | Host-Associated
3300027583 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes) | Environmental
3300027614 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co S AM (SPAdes) | Host-Associated
3300027650 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeq | Environmental
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028673 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-B | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300028793 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159 | Environmental
3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031854 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1 | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032828 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4 | Environmental
3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental
3300034155 | Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17 | Environmental
3300034165 | Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
OU_00374440 | 2124908016 | | MAQIGGKAGLSEEERQSLSESARLCEMIIEANPADTGALETLKEVYTKLGDRERLSQVV
JGI24737J22298_10042841 | 3300001990 | Corn Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVV
JGIcombinedJ26739_1002287801 | 3300002245 | Forest Soil | MAQIESXPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARV
Ga0055490_100256834 | 3300004052 | Natural And Restored Wetlands | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLAR
Ga0062593_1003053263 | 3300004114 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIY
Ga0055489_100762411 | 3300004145 | Natural And Restored Wetlands | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLA
Ga0062589_1000761414 | 3300004156 | Soil | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALE
Ga0070668_1006308431 | 3300005347 | Switchgrass Rhizosphere | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRESLARVVARL
Ga0070700_1002351601 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKL
Ga0070708_1008858291 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MTQGSRLTSFSDDERQSLAESARLCEMIVEANPSDTGALETLKEI
Ga0066697_103698281 | 3300005540 | Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKETYTKLGDRERLGQVVG
Ga0068855_1016760182 | 3300005563 | Corn Rhizosphere | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALETLKEIYTKLG
Ga0068852_1000972671 | 3300005616 | Corn Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENL
Ga0068861_1000062381 | 3300005719 | Switchgrass Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKE
Ga0075293_10086621 | 3300005875 | Rice Paddy Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGD
Ga0075023_1000872181 | 3300006041 | Watersheds | MAQIESKPLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTK
Ga0066665_112256462 | 3300006796 | Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQVVGRMA
Ga0079221_109825312 | 3300006804 | Agricultural Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLA
Ga0075421_1021495441 | 3300006845 | Populus Rhizosphere | MAQIENKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVA
Ga0075425_1017608931 | 3300006854 | Populus Rhizosphere | MIHHDGLMALSDDDRQSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRDRL
Ga0075435_1020129582 | 3300007076 | Populus Rhizosphere | MAQVNAKAPISDEERQSLVESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGHVMARLATL
Ga0099830_116224641 | 3300009088 | Vadose Zone Soil | MPKAEPKASSLSSEEKRSLSESARLCEMIVQAIPSDTGALET
Ga0111539_107819193 | 3300009094 | Populus Rhizosphere | MPDVNSKTSFSDEERHSLVESARLCEMIIEANPSDTGALETLKEIY
Ga0105082_10817541 | 3300009814 | Groundwater Sand | MAQIESKTLSTEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLG
Ga0105068_10423563 | 3300009836 | Groundwater Sand | MAQASGRTSLSDEERQALAESAQVCEMIVEANPSDTGALETLKE
Ga0126382_108298141 | 3300010047 | Tropical Forest Soil | MPDVNSKPSFSDEERHSLVESASLCEMIVEANPSDTGALETLKEIYTKLGDRERLAQVM
Ga0116211_11340211 | 3300010313 | Hot Spring | MAQPSGKPAFSDEERQALAESARLCEMIIEANPSDTG
Ga0126377_110374493 | 3300010362 | Tropical Forest Soil | MPDVNSKPSFSDEERHSLVESASLCEMIVEANPSDTGALETLKEIYTKLGDRERLSQVMA
Ga0126383_109572372 | 3300010398 | Tropical Forest Soil | MAPPSGKPSFSDEERQSLAESARLCEMIIEANPADTGALETLKEI
Ga0150983_134103703 | 3300011120 | Forest Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRE
Ga0137389_104978321 | 3300012096 | Vadose Zone Soil | MAQASGKASFSDEARQSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQVVG
Ga0137383_103424783 | 3300012199 | Vadose Zone Soil | MAQIVSKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYT
Ga0137363_115385532 | 3300012202 | Vadose Zone Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGAL
Ga0137399_105436311 | 3300012203 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVA
Ga0137376_115689872 | 3300012208 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVV
Ga0137372_112417172 | 3300012350 | Vadose Zone Soil | MAQASGRTSLSQEERQALAESAQVCEMIVEANPSDTGALETLKEIYTKLGDRER
Ga0137369_102847191 | 3300012355 | Vadose Zone Soil | MAQTDNKTSLSSEEKRSLVESVRLCEMIVEANPLDIGALEILKEIYTKLGS
Ga0137358_102692243 | 3300012582 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLSG
Ga0137398_107814602 | 3300012683 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLGRVVARLAG
Ga0137359_116719981 | 3300012923 | Vadose Zone Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTG
Ga0137404_110242531 | 3300012929 | Vadose Zone Soil | MAPVSGKAGLSEVERQSLTESAQLCEMIVEVIPSDTGALENMKEIYTKL
Ga0164301_103465453 | 3300012960 | Soil | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLAG
Ga0157371_109234231 | 3300013102 | Corn Rhizosphere | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALETLKEIYTKLADRDNLARVVARLA
Ga0157372_130587901 | 3300013307 | Corn Rhizosphere | MAKTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALET
Ga0157380_102639234 | 3300014326 | Switchgrass Rhizosphere | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALE
Ga0180066_10021711 | 3300014873 | Soil | MPQSEPTVSLSSEEKRSLFESARLCEMIVQANPSDTGALETLKEIYSKLGDPENLS
Ga0134085_105562461 | 3300015359 | Grasslands Soil | MAQASGKASFSAEERQSLMESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGPVV
Ga0134112_102979002 | 3300017656 | Grasslands Soil | MAQASDKASFSAEERQSLVESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQVVGRMA
Ga0134083_104700082 | 3300017659 | Grasslands Soil | MAQASDKASFSAEERQSLVESARLCEMIVEANPSD
Ga0187824_101073613 | 3300017927 | Freshwater Sediment | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDT
Ga0187780_105667381 | 3300017973 | Tropical Peatland | MAQIDNKTSLSPEEKRSLSESARLCEMIVEANPSDTG
Ga0184620_101006381 | 3300018051 | Groundwater Sediment | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVAR
Ga0184609_104575891 | 3300018076 | Groundwater Sediment | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLAR
Ga0066667_120803732 | 3300018433 | Grasslands Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKETYTRL
Ga0190270_111350551 | 3300018469 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDT
Ga0184646_15049063 | 3300019259 | Groundwater Sediment | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLA
Ga0137408_11712901 | 3300019789 | Vadose Zone Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRVGRAWPG
Ga0193712_10166074 | 3300019880 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTK
Ga0193718_11004231 | 3300019999 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLG
Ga0193731_10129581 | 3300020001 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRESL
Ga0193730_10259231 | 3300020002 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRESLARVV
Ga0193739_10163184 | 3300020003 | Soil | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLA
Ga0193749_10208041 | 3300020010 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKE
Ga0180109_12435373 | 3300020067 | Groundwater Sediment | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIY
Ga0179592_101146761 | 3300020199 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRE
Ga0210403_114789001 | 3300020580 | Soil | MAQIEKTSLSQEEKRSLSESARLCEMIVEANPSDTGALETLKEIYSKLGDQE
Ga0242648_10345013 | 3300022506 | Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKE
Ga0242660_10094461 | 3300022531 | Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDR
Ga0242665_100156951 | 3300022724 | Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGD
Ga0247797_10114351 | 3300023057 | Soil | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGA
Ga0209108_102707223 | 3300025165 | Soil | MAQKDSKGSMSTEEKHSLEESARLCEMIVEANPSDTGALETLKEIYTKLGDREKLAKIVVRLA
Ga0209642_101018434 | 3300025167 | Soil | MAQKDSKGSMSTEEKHSLEESARLCEMIVEANPSDTGALETLKEIYTKLGDR
Ga0207688_101407244 | 3300025901 | Corn, Switchgrass And Miscanthus Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLA
Ga0207709_102546133 | 3300025935 | Miscanthus Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLK
Ga0207711_102518324 | 3300025941 | Switchgrass Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVTPATVITTDT
Ga0210116_11140691 | 3300025959 | Natural And Restored Wetlands | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLAG
Ga0210090_10044671 | 3300025965 | Natural And Restored Wetlands | MAPIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKL
Ga0257149_10097051 | 3300026355 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTK
Ga0257173_10002764 | 3300026360 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLGRVV
Ga0257176_10309123 | 3300026361 | Soil | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGA
Ga0257179_10212711 | 3300026371 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALET
Ga0257178_10478832 | 3300026446 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLGR
Ga0257169_10804882 | 3300026469 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDREN
Ga0257157_10039444 | 3300026496 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEI
Ga0209056_106134802 | 3300026538 | Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKETYTRLGDRERLGQVVG
Ga0209161_103729022 | 3300026548 | Soil | MAQAGGKASLSDEERQSLAESARLCEMIVEANPSDT
Ga0209984_10021221 | 3300027424 | Arabidopsis Thaliana Rhizosphere | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALETLKEIYTKLADRDN
Ga0209527_103905833300027583Forest SoilMAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETL
Ga0209970_100799813300027614Arabidopsis Thaliana RhizosphereMAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTG
Ga0256866_111378213300027650SoilMAPSDNKTLLSSEEKRSLAESARLCEMIVEANPSDT
Ga0209583_1077350323300027910WatershedsMAQIEKTSLSQEEKRSLSESARLCEMIVEANPSDTGALETLKEIYSK
Ga0209069_1067286523300027915WatershedsMAQIEKTSLSQEEKRSLSESARLCEMIVEANTSDT
Ga0209526_1025055613300028047Forest SoilMAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALET
Ga0257175_103197033300028673SoilMAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLK
Ga0307504_1005875033300028792SoilMAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGAL
Ga0307504_1008623433300028792SoilMARKDTGDSLSSDERSSLLESARLCEMIVEANPADTGALETLKEIYTKLDDREKLAKI
Ga0307299_1035083313300028793SoilMAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVV
Ga0308187_1007945713300031114SoilMAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLAGT
(restricted) Ga0255311_106674833300031150Sandy SoilMAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLS
Ga0307473_1144462423300031820Hardwood Forest SoilMSQTSRAAPLSEAERQSLGESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQ
Ga0310904_1091588623300031854SoilMPDVNSKTSFSDEERHSLVESARLCEMIIEANPSDTGALET
Ga0306921_1144230113300031912SoilMPDVNSKAPFSDEERHSLVESARLCEMIVEANPSDTGALETLKEIYTKL
Ga0214473_1199185723300031949SoilMAQIDSKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEI
Ga0214473_1212005223300031949SoilMTQADSRSSIAPEERDSLLESARLCEMIVEANPSDTGALETLKEIYTKLGDRD
Ga0307471_10003480813300032180Hardwood Forest SoilMAQIEKTSLSQEEKRSLSESARLCEMIVEANPSDTGALETLKEIYSKLGE
Ga0307471_10344023823300032180Hardwood Forest SoilMTQTSGTASFSEEERQSFAESARLCEMIIEANPFDTGALETLKEIYTKLGDRAR
Ga0335080_1217942413300032828SoilMAPPTSKAPLSDDERQSLAESARLCEMILEANASDTGALETLKEIYTKL
Ga0364930_0097245_819_10043300033814SedimentMAQKDSKGSMSTEEKHSLEESARLCEMIVEANPSDTGALETLKEIYTKLGDREKLAKIVVRL
Ga0370498_130849_470_5983300034155Untreated Peat SoilMAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLK
Ga0364942_0059717_2_1483300034165SedimentMAQTENKTSLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice