NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F097964

Metagenome / Metatranscriptome Family F097964

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F097964
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 49 residues
Representative Sequence IVERIALAHDALRARLIVPEVGVFRFFIQFGKATRRGINVKDASSAAVPTA
Number of Associated Samples 98
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 96
AlphaFold2 3D model prediction Yes
3D model pTM-score0.33

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil
(4.808 % of family members)
Environment Ontology (ENVO) Unclassified
(26.923 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(35.577 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 50.63%    β-sheet: 0.00%    Coil/Unstructured: 49.37%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.33
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF05974DUF892 27.88
PF00905Transpeptidase 19.23
PF03462PCRF 8.65
PF17092PCB_OB 7.69
PF01520Amidase_3 1.92
PF00239Resolvase 0.96
PF11741AMIN 0.96
PF01551Peptidase_M23 0.96
PF00575S1 0.96
PF00378ECH_1 0.96
PF13924Lipocalin_5 0.96
PF13520AA_permease_2 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 104 Family Scaffolds
COG3685Ferritin-like metal-binding protein YciEInorganic ion transport and metabolism [P] 27.88
COG0216Protein chain release factor RF1Translation, ribosomal structure and biogenesis [J] 8.65
COG1186Protein chain release factor PrfBTranslation, ribosomal structure and biogenesis [J] 8.65
COG0860N-acetylmuramoyl-L-alanine amidaseCell wall/membrane/envelope biogenesis [M] 1.92
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 0.96
COG2452Predicted site-specific integrase-resolvaseMobilome: prophages, transposons [X] 0.96


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil4.81%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil4.81%
Arctic Peat SoilEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil3.85%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.85%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland3.85%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere3.85%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere3.85%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds2.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil2.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil2.88%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands2.88%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere2.88%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil1.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Soil1.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.92%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.92%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.92%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.92%
SoilEnvironmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil1.92%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.92%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.92%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere1.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.92%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.92%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.96%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.96%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland0.96%
Salt MarshEnvironmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh0.96%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands0.96%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.96%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)0.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.96%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.96%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.96%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.96%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.96%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.96%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil0.96%
Permafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil0.96%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil0.96%
Prmafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil0.96%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.96%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.96%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.96%
SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Soil0.96%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.96%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere0.96%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.96%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.96%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere0.96%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.96%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere0.96%
Switchgrass, Maize And Mischanthus LitterEngineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter0.96%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2124908032Permafrost microbial communities from permafrost in Bonanza Creek, Alaska - Perma_allEnvironmentalOpen in IMG/M
2140918006Permafrost microbial communities from permafrost in Bonanza Creek, Alaska - Permafrost Layer P1EnvironmentalOpen in IMG/M
2170459019Litter degradation MG4EngineeredOpen in IMG/M
3300000044Arabidopsis rhizosphere microbial communities from the University of North Carolina - sample from Arabidopsis soil oldHost-AssociatedOpen in IMG/M
3300000580Forest soil microbial communities from Amazon forest - 2010 replicate II A01EnvironmentalOpen in IMG/M
3300003321Sugarcane bulk soil Sample H1EnvironmentalOpen in IMG/M
3300004055Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Muzzi_PWB_D2EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300005365Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3H metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005833Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.174_CBKEnvironmentalOpen in IMG/M
3300005834Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C1-2Host-AssociatedOpen in IMG/M
3300005874Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_404EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300005980Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil leachate replicate DNA2013-203EnvironmentalOpen in IMG/M
3300006057Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006606Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHMA (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300009029Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 1 DNA2013-189EnvironmentalOpen in IMG/M
3300009153Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 10-12cm March2015EnvironmentalOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300009660Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil DNA_2013-058EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011000Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t6i015EnvironmentalOpen in IMG/M
3300011427Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT418_2EnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012904Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S029-104C-1EnvironmentalOpen in IMG/M
3300012941Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t4i015EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300014256Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_TuleB_D2EnvironmentalOpen in IMG/M
3300014322Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - WestPond_CattailA_D1EnvironmentalOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300014497Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-129_1 metaGHost-AssociatedOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300018029Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP06_20_MGEnvironmentalOpen in IMG/M
3300018032Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MGEnvironmentalOpen in IMG/M
3300018057Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_150EnvironmentalOpen in IMG/M
3300018060Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2EnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300022850Peat soil microbial communities from Stordalen Mire, Sweden - 717 S2 1-5EnvironmentalOpen in IMG/M
3300022898Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S109-311C-5EnvironmentalOpen in IMG/M
3300023062Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S081-202R-4EnvironmentalOpen in IMG/M
3300023266Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S220-509R-4EnvironmentalOpen in IMG/M
3300025321Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025482Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210 shallow-092012 (SPAdes)EnvironmentalOpen in IMG/M
3300025619Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-1 deep-072012 (SPAdes)EnvironmentalOpen in IMG/M
3300025829Arctic peat soil microbial communities from the Barrow Environmental Observatory site, Barrow, Alaska, USA - NGEE Permafrost159B-16B (SPAdes)EnvironmentalOpen in IMG/M
3300025891Arctic peat soil microbial communities from the Barrow Environmental Observatory site, Barrow, Alaska, USA - NGEE Permafrost154B-one (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026054Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026743Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A3-10 (SPAdes)EnvironmentalOpen in IMG/M
3300026887Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 49 (SPAdes)EnvironmentalOpen in IMG/M
3300027371Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027482Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05.2A2w-12 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027898Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300028708Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_152EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031241Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-14-20 metaGHost-AssociatedOpen in IMG/M
3300031274Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - WE1604-30EnvironmentalOpen in IMG/M
3300031547Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4EnvironmentalOpen in IMG/M
3300031819Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21EnvironmentalOpen in IMG/M
3300031996Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032091Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f25EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033158Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.1EnvironmentalOpen in IMG/M
3300033480Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_BEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300034268Forest soil microbial communities from Eldorado National Forest, California, USA - SNFC_MG_FRD_1.2EnvironmentalOpen in IMG/M
3300034354Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
Perma_A_C_013851502124908032SoilPVLQRVALAHHALAARLVAPETGVLRLFVQLGEAAGRGVDVKDASSAAAATA
P1_C_007619802140918006SoilRVALAHHALAARLVAPETGVLRLFVQLGEAAGRGVDVKDASSAAAATA
4MG_013848402170459019Switchgrass, Maize And Mischanthus LitterRELILELLLDTANRLELIVQRVTLAHHALRPHLIVPEIGAFRFFVQFGEASGRGVDVKDASSAAARTA
ARSoilOldRDRAFT_01339723300000044Arabidopsis RhizosphereVERIALAHDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVRTA*
AF_2010_repII_A01DRAFT_106578713300000580Forest SoilTHDPLCARLIVPEIGIFRLFIQFGEAPRRGVDIKDASSAAAQTA*
soilH1_1013422123300003321Sugarcane Root And Bulk SoilLRAGLIAPEIGVFGFSVQFGETPLRGIDVKDASSAAAPTA*
Ga0055480_1018684823300004055Natural And Restored WetlandsHPLGARLIVPEVGILGRLVQFGEAPLRGFDVKDASSAVRATA*
Ga0062589_10190497123300004156SoilADRLELILQRVTFAHHALRPCLVVPEIGIFRFFVQFGEASGRGIDVKDASSAAARTA*
Ga0070688_10019970933300005365Switchgrass RhizosphereLELIVERIALAHDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVPTA*
Ga0070706_10034650713300005467Corn, Switchgrass And Miscanthus RhizosphereLIFELLLDMADRLELILQRVTFAHHALRPRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAAQTA*
Ga0066905_10097122423300005713Tropical Forest SoilLELIVQCVALAHNALRARLIVPEIGVFRFFVQFGEAPRRGIDVKDASSAAARTA*
Ga0074472_1078913513300005833Sediment (Intertidal)HALGARLVIPQIGVFGFFVQFGEAARRGIDVKDASSAAAPTA*
Ga0068851_1038125723300005834Corn RhizosphereVERIALAHDALRARLIVPEVGVFRFFIQFGKATRSGINVKDASSAAVPTA*
Ga0075288_104078413300005874Rice Paddy SoilQRVALAHHALAARLIAPQRGVFGFFVQFGEAARRGFDVKDASSAAATTA*
Ga0081455_1091939913300005937Tabebuia Heterophylla RhizosphereLVVPEVGVLGLLIQLGEADLCLVDVKDASSAVRATA*
Ga0066798_1008026113300005980SoilELLLDAADGVELIVQRVALAHGALGARRIAPEIGGFSRFVQLGKAALRGIDVKDASSAVVPTA*
Ga0075026_10032826023300006057WatershedsKLLLDAADGLKLVVERITLAHHTLRSYLIVPEIGVFRLLVQLGKAPRRGVDVKDASSAAALTA*
Ga0075018_1063109613300006172WatershedsLNATDGFKLIVERIALTHNALSTGLVVPEIGVFRLFVQFGEALRRGIDVKDASSAAALTA
Ga0074062_1179457913300006606SoilIAFTHNTLRAGLVVPEIRVFRLSIQFSEALRRGVDVKDASSAAARTA*
Ga0075430_10007710533300006846Populus RhizosphereVPEFGIFGLLVQLGQTHLRGIDVKDASSAAQGTA*
Ga0075434_10012571333300006871Populus RhizosphereDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVPTA*
Ga0075434_10189294223300006871Populus RhizosphereDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVRTA*
Ga0075426_1043923313300006903Populus RhizosphereLRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAA
Ga0066793_1048640113300009029Prmafrost SoilDGVELIVQRVALAHGALGARRIAPEIGGFSRFVQLGKAALRGIDVKDASSAVVPTA*
Ga0105094_1076782113300009153Freshwater SedimentFAHDALRARLVVPQIGVFRFLVQLGETPRRGIDVKDASSAAVPTA*
Ga0105248_1265594813300009177Switchgrass RhizosphereDGLELIVERIALAHDALRARLIVPEVGVFRFFIQFGKATRRGINVKDASSAAVPTA*
Ga0105237_1111131113300009545Corn RhizosphereHDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVRTA*
Ga0105237_1263914213300009545Corn RhizosphereHDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVPTA*
Ga0105854_112420913300009660Permafrost SoilSVALAHGALGLGLVVPECGVFSRLVQFGEAARRSIDVKDASSAGRSTA*
Ga0126384_1012594013300010046Tropical Forest SoilIVPEIGVFRLFIQLGKATRSGINVKDASSAAAPTA*
Ga0126373_1189302623300010048Tropical Forest SoilPRLIVPEIGVFRFFIQLGKATRSGINVKDASSAAVPTA*
Ga0126377_1224968613300010362Tropical Forest SoilDTANRFELVVQRVPLAHHALRARLIVPQIRVFRILVQLCEASGRGVDVKDASSAAGLTA*
Ga0126383_1349023113300010398Tropical Forest SoilLLDSANGLKLVIERVAFAHDTLRASLIVPEIGIFRFLIQFGKAAHRSVDVKDASSAA*
Ga0134127_1024479623300010399Terrestrial SoilFELLLDMADRLELILQRVTFAHHALRPRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAAQTA*
Ga0138513_10006864623300011000SoilPHDALRTRLIVPEIGIFRLFVQFGQTPRRSVNVKDASLAAAPTA*
Ga0137448_109725913300011427SoilQRVALTHGALGTRLIVPQIGVFGFFVQFGEAARRGIDVKDASSAVVSTA*
Ga0137381_1155450223300012207Vadose Zone SoilAHNTLRTRLIAPEVGIFGLFIQLCEAPGRGIDVKDASSAAESTA*
Ga0137386_1121885913300012351Vadose Zone SoilLVVKRIALAHNTLRTRLIAPEVGIFGLFIQLCEAPGRGIDVKDASSAAES
Ga0150984_10926685313300012469Avena Fatua RhizosphereLRARLVVPEIGVFGFFIQFGETPLRGIDVKDASSAAAPTA*
Ga0157282_1000320513300012904SoilDMADRLELILQRVTFAHHALRPRLIVPEIGVSRFLVQFGEASRRGVDVKDASSAAARTA*
Ga0162652_10003751423300012941SoilALRTRLIVPEIGIFRLFVQFGQTPRRSVNVKDASLAAAPTA*
Ga0126375_1127781913300012948Tropical Forest SoilLLDAADGFQLIVERITLAHDTLRPRSIVPKIGVFRVFIQLGKATRSAINVKDASSAAVPTA*
Ga0157374_1080173423300013296Miscanthus RhizosphereLIVERIALAHDALRARLIVPEVGVFRFFIQFGKATRRGINVKDASSAAVPTA*
Ga0075318_107927313300014256Natural And Restored WetlandsHHPLGARLVAPQRGILGLFVQLGQAPFRGIDVKDASSAARATA*
Ga0075355_126164213300014322Natural And Restored WetlandsDGLELVFQCVALAHGALGARLVAPEARIFGGFVQFGETPLRGIDVKDASSAVAPTA*
Ga0157380_1021578033300014326Switchgrass RhizosphereLIVERIALAHDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVPTA*
Ga0182008_1009259623300014497RhizosphereVLRARLIVPEIGIFGFFIQFGETPLRCIDVKDASSAAAPTA*
Ga0182008_1028267413300014497RhizosphereLELLLDASDRFKLIVERVTFAHHALSPRLIIPEVGVFRFLVQFGEASRRSVDVKDASSAAARTA*
Ga0157379_1183767123300014968Switchgrass RhizosphereSADGLELIVERIALAHDALRARLIVPEVGVFRFFIQFGKATRSGINVKDASSAAVPTA*
Ga0157376_1239950013300014969Miscanthus RhizosphereIVERIALAHDALRARLIVPEVGVFRFFIQFGKATRRGINVKDASSAAVPTA*
Ga0157376_1248099923300014969Miscanthus RhizosphereLIVERIALAHDALRARLIVPEVGVFRFFIQFGKATRSGINVKDASSAAVPTA*
Ga0132258_1150416613300015371Arabidopsis RhizosphereLRPHLIVPEIGVFGFFVQFGEASGRGVDVKDASSAAA
Ga0132256_10300842613300015372Arabidopsis RhizosphereRLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVPTA*
Ga0132257_10090354413300015373Arabidopsis RhizosphereLIVQRITLAHHVLRARLIVPEIGVFRFLVQFGEASRRGVDVKDASSAAARTA
Ga0132255_10061780223300015374Arabidopsis RhizosphereLELLLYTTDRLELIVERVTFAHHALRSRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAAQTA*
Ga0132255_10556174113300015374Arabidopsis RhizosphereIVERIALAHDALRARLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVRTA*
Ga0187787_1004506023300018029Tropical PeatlandAADGLKLLVERITLAHYTLRSCLIVPEVGVFRLLVQLGKAPRRSVDVKDASSAAALTA
Ga0187788_1053321313300018032Tropical PeatlandPVLQRIALAHHALAARRIAPELGVFGLFVQLGEAALRGLDVKDASSAA
Ga0187858_1064285513300018057PeatlandHHALAARRIAPEIGVLGLFVQLGEAALRGVDVKDASSAVAATA
Ga0187765_1007297613300018060Tropical PeatlandCARLIVPQVGTFRFFVQFGEASSRGIDIKDASSAAAPTA
Ga0187765_1107865613300018060Tropical PeatlandLAARRIAPELGILGLFVQLGEAALRGLDVKDASSAA
Ga0184635_1000208613300018072Groundwater SedimentLLFDPADCLQLIIKSVALPHDALRTRLIVPEIGIFRLFVQFGQTPRRSVNVKDASLAAAPTA
Ga0193731_107603023300020001SoilAHGALRLALIAPESGVFSRLVQFGQAARRGIDVKDASSAVQPTA
Ga0210380_1023834523300021082Groundwater SedimentHDALSARLIVPEIGVFRLFIQFGKATRSGVNVKDASSAAVPTA
Ga0224552_100610513300022850SoilLVVQGVALAHGALRASLIVPECRVFGRLVQFGQAARRGIDVKDASSAGAPTA
Ga0247745_100121933300022898SoilRSRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAAQTA
Ga0247791_100281913300023062SoilELIIELLLYTTDRLELIVERVTFAHHALRSRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAAQTA
Ga0247789_100113533300023266SoilLILELLLYTTDRLELIVERVTFAHHALRSRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAAQTA
Ga0207656_1018678623300025321Corn RhizosphereVERIALAHDALRARLIVPEVGVFRFFIQFGKATRSGINVKDASSAAVPTA
Ga0208715_101927023300025482Arctic Peat SoilDGVELIVQRVALAHGALGARRIAPEIGGFSRFVQLGKAALRGIDVKDASSAVVPTA
Ga0207926_112828723300025619Arctic Peat SoilDAADGVELIVQRVALAHGALGARRIAPEIGGFSRFVQLGKAALRGIDVKDASSAVVPTA
Ga0209484_1016985913300025829Arctic Peat SoilVLQRVAFAHHALAARLVAPETGVLRLFVQLGEAAGRGVDVKDASSAAAATA
Ga0209585_1015370213300025891Arctic Peat SoilLRLGLVVPESGIFGKFVQFGEAARRSIDVKDASSAARPTA
Ga0207693_1016589323300025915Corn, Switchgrass And Miscanthus RhizosphereRIAFTHHALRARLVVPEIRVFRLSIQFSEALRRGVDVKDASSAAARTA
Ga0207660_1098347613300025917Corn RhizosphereGLVVPEIWVFGLFVQFGETPLRGIDVKDASSAAAPTA
Ga0207687_1007488713300025927Miscanthus RhizosphereRARLIVPEVGVFRFFIQFGKATRRGINVKDASSAAVPTA
Ga0207706_1048292413300025933Corn RhizosphereRLIVPEVGVFRFFIQLGKATRSGINVKDASSAAVRTA
Ga0207703_1194952713300026035Switchgrass RhizosphereLELIVERIALAHDALRARLIVPEVGVFRFFIQFGKATRSGINVKDASSAAVPTA
Ga0208659_101294823300026054Natural And Restored WetlandsVTLAHHALRPHLIVPEIGVFRFFVQFGETSGRGVDVKDASSAAARTA
Ga0207641_1072648923300026088Switchgrass RhizosphereDRLELILQRVTFEHHALRPRLIVPEIGVFRFLVQFGEASRRGVDVKDASSAAARTA
Ga0207613_10141613300026743SoilDALRRSRPDRLELILQRVTFAHHALRPRLIVPEIGVSRFLVQFGEASRRGVDVKDASSAAARTA
Ga0207805_102632023300026887Tropical Forest SoilRARLVAPKIGIFGLFVQLGEPARRGVDVKDASSAAALTA
Ga0209418_108478013300027371Forest SoilRLVAPEIRILGLFVQLGETLLRGIDVKDASSAAAPTA
Ga0207460_10178023300027482SoilLELLLYTTDRLELIVERVTFAHHALRSRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAARTA
Ga0209009_101454013300027667Forest SoilLQRVALAHHALSTRLIAPETGVFRLFVQLGEAAGRGIDVKDASSAA
Ga0209074_1011081813300027787Agricultural SoilLGARLIVPKVGVFGLFVQLGEAPRRGVDIKDASSAAG
Ga0209067_1058148223300027898WatershedsLAHHALAARLVAPERGILGLFVQFGETPLRGIDVKDASSAAPSTA
Ga0307295_1025879923300028708SoilESITLAHDALSARLIVPEIGVFRLFIQFGKATRSGVNVKDASSAAVPTA
(restricted) Ga0255311_101832023300031150Sandy SoilRLELILQRVTFAHHALRPRLIVPEIGVFRFFVKFGEAPRRGVDVKDASSAAARTA
Ga0265325_1022240013300031241RhizosphereAHGALRLGLVVPECGIFGEFVQFGEAARRSIDVKDASSAARLTA
Ga0307442_115684613300031274Salt MarshRARLVVPEGGILGFFVQLGQAACRGVDVKDASSAVQATA
Ga0310887_1001595123300031547SoilILELLLYTTDRLELIVERVTFAHHALRSRLIVPEIGVFRFLAQFGETSGRGVDVKDASSAAAQTA
Ga0318568_1027114443300031819SoilVERITLAHDALRPRLIVPKIGVFRVFIQLGKATRSGINVKDASSAAVPTA
Ga0308176_1107553113300031996SoilAHGALRALRIVPEFGLLGLFVQFGEALLRDIDVKDASSAAAPTA
Ga0318533_1071824423300032059SoilLLFNAADSLELIIKRIAFTHHPLRAGLIVPEIRILRFFIQFGQAARRSINVKDASSAASPTA
Ga0318577_1040731623300032091SoilHYALRPRLIVPKIGVFRVFIQLGKATRSGINVKDASSAAVPTA
Ga0307472_10007547633300032205Hardwood Forest SoilLIVERIALAHDALRARLIVPEVGVFRFFIQFGKATRSGINVKDASSAAVPTA
Ga0335078_1133043723300032805SoilFAHHALRARLVVPEIGIFRLFVQLGEAAFRGVDVKDASSAVASTA
Ga0335084_1128304213300033004SoilALRARLVVPEIGIFRLFVQFGEALRRDVDVKDASSAA
Ga0335077_1165580713300033158SoilLDTTDGAETVLQRVALPHDALRTRLIAPEMGVFGLFVQFGEAPLRGVDVKGASSAVRATA
Ga0316620_1107554823300033480SoilLGARLIVPKIGILGLFVQLGQASLRGIDVKDASSAVLATA
Ga0316628_10226412813300033513SoilARLVVPEIGVFRFLVQFGEAPRRGIDVKDASSAAAPTA
Ga0372943_0213998_700_8223300034268SoilLRARLIVPEVGVFGLFVQFGETPLRGVDVKDASSAAASTA
Ga0364943_0051805_1_1683300034354SedimentGFEPILQRIALAHGALGASLIVPETGVFGRLVQFGQTARRGIDVKDASSAAAPTA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.