NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101759



Overview

Basic Information
Family ID F101759
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 43 residues
Representative Sequence MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDYGAESETAEMRA
Number of Associated Samples 84
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 63.16 %
% of genes near scaffold ends (potentially truncated) 10.78 %
% of genes from short scaffolds (< 2000 bps) 7.84 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.37

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model

Most Common Taxonomy
Group Unclassified (81.373 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(25.490 % of family members)
Environment Ontology (ENVO) Unclassified
(37.255 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.020 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 49.35%, β-sheet: 0.00%, Coil/Unstructured: 50.65%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.37

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
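
The confidence figures quoted in this section (the pTM cutoff of 0.7 and the four pLDDT bands) can be expressed as a small lookup. The following is a minimal Python sketch, not code from NMPFamsDB: the function names are hypothetical, and the handling of non-integer pLDDT values at band edges is an assumption.

```python
# Illustrative only: these names are hypothetical and not part of NMPFamsDB.
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted on this page (pTM < 0.7)


def is_low_confidence(ptm: float) -> bool:
    """Return True when a model's pTM-score falls below the quoted cutoff."""
    return ptm < LOW_CONFIDENCE_PTM


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four bands listed above.

    Assumption: fractional values between bands (e.g. 50.5) fall into the
    next higher band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# pTM-score reported for family F101759
print(is_low_confidence(0.37))
```

With the pTM-score of 0.37 reported for this family, the check flags the model as low confidence, which matches the note above.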



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF00589 | Phage_integrase | 10.78
PF13620 | CarboxypepD_reg | 3.92
PF07969 | Amidohydro_3 | 2.94
PF01844 | HNH | 1.96
PF01381 | HTH_3 | 0.98
PF04397 | LytTR | 0.98
PF00593 | TonB_dep_Rec | 0.98
PF02517 | Rce1-like | 0.98
PF07676 | PD40 | 0.98
PF02896 | PEP-utilizers_C | 0.98
PF12681 | Glyoxalase_2 | 0.98
PF10282 | Lactonase | 0.98
PF07638 | Sigma70_ECF | 0.98
PF11535 | Calci_bind_CcbP | 0.98
PF01252 | Peptidase_A8 | 0.98
PF00512 | HisKA | 0.98
PF07470 | Glyco_hydro_88 | 0.98
PF05649 | Peptidase_M13_N | 0.98
PF07883 | Cupin_2 | 0.98
PF00882 | Zn_dep_PLPC | 0.98
PF02899 | Phage_int_SAM_1 | 0.98
PF01425 | Amidase | 0.98
PF05015 | HigB-like_toxin | 0.98
PF13470 | PIN_3 | 0.98
PF00294 | PfkB | 0.98
PF01557 | FAA_hydrolase | 0.98
PF02604 | PhdYeFM_antitox | 0.98
PF07729 | FCD | 0.98
PF03403 | PAF-AH_p_II | 0.98
PF08281 | Sigma70_r4_2 | 0.98
PF00578 | AhpC-TSA | 0.98
PF00892 | EamA | 0.98
PF03190 | Thioredox_DsbH | 0.98
PF15937 | PrlF_antitoxin | 0.98
PF00486 | Trans_reg_C | 0.98
PF15780 | ASH | 0.98
PF02586 | SRAP | 0.98
PF02954 | HTH_8 | 0.98
PF12728 | HTH_17 | 0.98
PF00150 | Cellulase | 0.98
PF00144 | Beta-lactamase | 0.98
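
The frequencies in the Pfam table are percentages of the 102 family scaffolds, so each value can be converted back into an approximate scaffold count. The following is a minimal Python sketch, assuming the percentages were rounded to two decimals from whole-scaffold counts; the function and variable names are hypothetical and not part of NMPFamsDB.

```python
# Illustrative only: names are hypothetical, not an NMPFamsDB API.
FAMILY_SCAFFOLDS = 102  # number of scaffolds associated with family F101759


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a '% frequency' value into the nearest whole scaffold count."""
    return round(percent * total / 100)


# A few rows copied from the Pfam table above.
pfam_percent = {
    "PF00589": 10.78,  # Phage_integrase
    "PF13620": 3.92,   # CarboxypepD_reg
    "PF07969": 2.94,   # Amidohydro_3
    "PF01844": 1.96,   # HNH
    "PF01381": 0.98,   # HTH_3
}

pfam_counts = {pfam_id: percent_to_count(p) for pfam_id, p in pfam_percent.items()}
print(pfam_counts)
```

Under that assumption, a frequency of 0.98 corresponds to a domain seen next to a single family scaffold, and 10.78 to eleven scaffolds.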

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG0597 | Lipoprotein signal peptidase | Cell wall/membrane/envelope biogenesis [M] | 1.96
COG4974 | Site-specific recombinase XerD | Replication, recombination and repair [L] | 0.98
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 0.98
COG4973 | Site-specific recombinase XerC | Replication, recombination and repair [L] | 0.98
COG4449 | Predicted protease, Abi (CAAX) family | General function prediction only [R] | 0.98
COG4188 | Predicted dienelactone hydrolase | General function prediction only [R] | 0.98
COG4118 | Antitoxin component of toxin-antitoxin stability system, DNA-binding transcriptional repressor | Defense mechanisms [V] | 0.98
COG3934 | Endo-1,4-beta-mannosidase | Carbohydrate transport and metabolism [G] | 0.98
COG3590 | Predicted metalloendopeptidase | Posttranslational modification, protein turnover, chaperones [O] | 0.98
COG3549 | Plasmid maintenance system killer protein | Defense mechanisms [V] | 0.98
COG2730 | Aryl-phospho-beta-D-glucosidase BglC, GH1 family | Carbohydrate transport and metabolism [G] | 0.98
COG2367 | Beta-lactamase class A | Defense mechanisms [V] | 0.98
COG2186 | DNA-binding transcriptional regulator, FadR family | Transcription [K] | 0.98
COG2161 | Antitoxin component YafN of the YafNO toxin-antitoxin module, PHD/YefM family | Defense mechanisms [V] | 0.98
COG2135 | ssDNA abasic site-binding protein YedK/HMCES, SRAP family | Replication, recombination and repair [L] | 0.98
COG1802 | DNA-binding transcriptional regulator, GntR family | Transcription [K] | 0.98
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis [M] | 0.98
COG1680 | CubicO group peptidase, beta-lactamase class C family | Defense mechanisms [V] | 0.98
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.98
COG1331 | Uncharacterized conserved protein YyaL, SSP411 family, contains thioredoxin and six-hairpin glycosidase-like domains | General function prediction only [R] | 0.98
COG1266 | Membrane protease YdiL, CAAX protease family | Posttranslational modification, protein turnover, chaperones [O] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 81.37 %
All Organisms | root | All Organisms | 18.63 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300005533|Ga0070734_10000245 | All Organisms → cellular organisms → Bacteria | 141206
3300010360|Ga0126372_10219338 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1603
3300010360|Ga0126372_12122067 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 610
3300010376|Ga0126381_100030547 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatobacter → Candidatus Sulfotelmatobacter kueseliae | 6422
3300012199|Ga0137383_10451306 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 942
3300020579|Ga0210407_10722775 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 771
3300020583|Ga0210401_10020642 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6327
3300021401|Ga0210393_10089198 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2448
3300021432|Ga0210384_10061846 | All Organisms → cellular organisms → Bacteria | 3383
3300021560|Ga0126371_10000486 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 32493
3300021560|Ga0126371_10040705 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4402
3300022557|Ga0212123_10000028 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 457915
3300027826|Ga0209060_10000263 | All Organisms → cellular organisms → Bacteria | 114594
3300028536|Ga0137415_10104258 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2687
3300031573|Ga0310915_10925989 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 610
3300031681|Ga0318572_10984840 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 501
3300031718|Ga0307474_10000037 | All Organisms → cellular organisms → Bacteria | 104595
3300031720|Ga0307469_12200913 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 537
3300031823|Ga0307478_11822939 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 500




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 25.49%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.75%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 9.80%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 7.84%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.90%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 4.90%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 3.92%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.92%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 2.94%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 1.96%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 1.96%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.96%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.96%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.98%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.98%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 0.98%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.98%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.98%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.98%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005533 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005548 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005890 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300007982 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009616 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_8_100 | Environmental
3300009640 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_16_40 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010379 | Sb_50d combined assembly | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300014152 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin11_60_metaG | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300017822 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2 | Environmental
3300017823 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300017995 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_1 | Environmental
3300018017 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_40 | Environmental
3300018026 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_100 | Environmental
3300018057 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_150 | Environmental
3300018062 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP15_20_MG | Environmental
3300018086 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_10_MG | Environmental
3300018088 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_10_MG | Environmental
3300018090 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_20_MG | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300020078 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-5 (Metagenome Metatranscriptome) (v2) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300022557 | Paint Pots_combined assembly | Environmental
3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026330 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300027826 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental
3300031681 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20 | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031833 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178 | Environmental
3300031946 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172 | Environmental
3300032893 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1 | Environmental

Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0062386_1005714081 | 3300004152 | Bog Forest Soil | MRTMGHRDVKTAMHYQHPELEVVRAALDYAAPNDIAETRA*
Ga0066395_106584972 | 3300004633 | Tropical Forest Soil | MRTENLAAVMKTMGHRDVKTAMHYQHPELEIVRAALDNADVAIAKMSV*
Ga0066684_100167601 | 3300005179 | Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDYGAASEPAEMRA*
Ga0066388_1012863761 | 3300005332 | Tropical Forest Soil | AVMKTMGHRDVKTAMHYQHPELDVVRAALDYGAASETAEMRA*
Ga0066388_1032587541 | 3300005332 | Tropical Forest Soil | VMRTMGHRDVRTAMHYQHPELDVVRAALDFDAPADGSEKKV*
Ga0070713_1001373244 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | GNLAAVMRTMGHRDVRTAMHYQHPELEIVRAALDYEAVADATETTL*
Ga0070707_1000939661 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VMRTMGHRDVKTAMHYQHPELEVVRAALDYGMPKDIAEMRG*
Ga0070698_1011041342 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MRTMGHRDVRTAMHYQHPELELVRAALDYGAVVESNETSVSTE*
Ga0070734_10000245113 | 3300005533 | Surface Soil | MRTGNLVAVMRTMGHRHVKTAMHYQHPELETVRAALD*
Ga0070697_1017856171 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LAAVMKTMEHRDVKTAMHYQHPELEVVRAALDYGAASETAEIRT*
Ga0070665_1003464413 | 3300005548 | Switchgrass Rhizosphere | MLAMGHKDVKTAMQYQHPEIEIVRATINEGESVTA*
Ga0066903_1083769841 | 3300005764 | Tropical Forest Soil | MRTGNLAAVMKTMGHRDVKSAMHYQRPELDVVRAALDWGAESETAETRA*
Ga0075285_10115022 | 3300005890 | Rice Paddy Soil | MRTGNLAAVMRTMGHRDVKTAMHYQNPELEVVRAALDYSMPNNIAEI*
Ga0070717_100015225 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MGHRDVKTAMHYQHPELDVVRAALDCGAATETAEMRA*
Ga0070716_1003062621 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MGHRDVKTAMHYQHPELDVVRAALDCGAATETSEMRA*
Ga0070712_1007223032 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MRTGNLAAVMKTMGHRDAKTAMHYQHPELDVVRAALDYRAASETTEMRA*
Ga0066665_101308611 | 3300006796 | Soil | MKTMGHRDVKTAMHYQHPEQEIVRAALDYNASVKRAEIIRV*
Ga0075433_110030672 | 3300006852 | Populus Rhizosphere | RVLMRTGNLAAVMKTMGHRDVKTAMQYQHPELEVVRSALDYIAESEAVEMRA*
Ga0075425_1000415373 | 3300006854 | Populus Rhizosphere | MKTMGHRDVKTAMQYQHPELEVVRSALDYIAESEAVEMRA*
Ga0102924_100170211 | 3300007982 | Iron-Sulfur Acid Spring | MKTMGHRDVKTAMHYQHPELDVVHAALDYGAAGEAAEMRR*
Ga0099829_100284221 | 3300009038 | Vadose Zone Soil | TMGHRDVKTAMHYQHPELEIVRDALDYNAGTKGAEITV*
Ga0116111_10764431 | 3300009616 | Peatland | MRTGNLAAVMKTMGHRDVKTAMHYQHPELGVVRAALDYGATTETAEIRA*
Ga0116126_12754901 | 3300009640 | Peatland | MGHRDVKTAMRYQHPELEVVRAALDYGTASETAEM
Ga0126374_103574631 | 3300009792 | Tropical Forest Soil | MRTMGHRDVRTAMHYQHPELDVVRAALDFDAPADGSEKKV*
Ga0126384_102600851 | 3300010046 | Tropical Forest Soil | TMGHRDVKTAMQYQHPELEVVRAALDYGMPNDVVAVRA*
Ga0126373_100064937 | 3300010048 | Tropical Forest Soil | MRTGNLAAVMKTMGHRDVKPAMHFQHPELDVVRAALDYGVASETAEMRA*
Ga0126373_1001424214 | 3300010048 | Tropical Forest Soil | MRTGNLAAVMSTMGHRDVKTAMHYQHPEVEVVRVALDYGMPNNIGEMRV*
Ga0126373_100326452 | 3300010048 | Tropical Forest Soil | MKTMGHRDVKTAMHYQHPELDVVRAALDYGAESETAEMQV*
Ga0126373_100986882 | 3300010048 | Tropical Forest Soil | MGHRDVKTAMHYPHPEVEVVRAALDYGMPNNIGEMRV*
Ga0126373_119994942 | 3300010048 | Tropical Forest Soil | MKTMGHRDVKTAMNYQHPELDVVRDALDNGPASEVAEMRA*
Ga0126373_120909352 | 3300010048 | Tropical Forest Soil | MRTGNLAAVMKAMGHRDVKTAMNYQHPELDIVRAAPDYTANSAQIRV*
Ga0126370_100176798 | 3300010358 | Tropical Forest Soil | GHRDVKTAMHYPHPEVEVVRVALDYGMPNNIGEMRV*
Ga0126372_102193381 | 3300010360 | Tropical Forest Soil | MRTGNLAAVMKTMGHSDVKAMHYQHPELDVVRAALD
Ga0126372_109403402 | 3300010360 | Tropical Forest Soil | TGNLAAVMRTMGHRDVKTAMHYQHPELEVVRAALDYGMPNDVVAVRA*
Ga0126372_121220671 | 3300010360 | Tropical Forest Soil | AVLTKTGNLPAVMKTMGHKDVRAAMHYQHPDLEIVRAL*
Ga0126378_101816564 | 3300010361 | Tropical Forest Soil | MRTGNLAAVMRTMGHRDVKTAMHYQHPELEVVRVALDYGMPNDVVAVRA*
Ga0126378_123850342 | 3300010361 | Tropical Forest Soil | MKTMGHHDVKTAMHYQHPELEVVRAALDYDVPVDKTEMKGAGKRLAAHSM
Ga0126377_110100241 | 3300010362 | Tropical Forest Soil | TGNLAAVMRTMGHRDVRAALHYQHPEVEVVRAALDGMPNNIGEMRV*
Ga0126379_102032923 | 3300010366 | Tropical Forest Soil | MRTGNLAAVMRTMGHRNVKTAMQYQHPEIEVVRAGLDYAAPNDIAETRV*
Ga0126379_132008121 | 3300010366 | Tropical Forest Soil | GNLAAVMRTMGHRDVRTAMHYQHPELEVVRAALDFDAPADSSEKKV*
Ga0126381_1000305474 | 3300010376 | Tropical Forest Soil | MRTGNLAAVMKTMGHSDVKAMHYQHPELDVVRAALDC
Ga0126381_1009851281 | 3300010376 | Tropical Forest Soil | GTRVLMRTRNLAAVMKTMGHGDVKTATHCQHPELDVVRAALDYGAANETVEMRE*
Ga0136449_1043044841 | 3300010379 | Peatlands Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELEIVRAALDYGAATDTTETSV*
Ga0126383_100202253 | 3300010398 | Tropical Forest Soil | MRTGNLAAVMKTMGHSDVKAMHYQHPELDVVRAALDCDAPADTTESGLSAKS*
Ga0126383_100606732 | 3300010398 | Tropical Forest Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDYGAESETAEMRA*
Ga0137388_100881331 | 3300012189 | Vadose Zone Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELEIVRAALDYNADAATADIRV*
Ga0137383_104513061 | 3300012199 | Vadose Zone Soil | TGNLAAVMKTMGHRDVKTAMHYQHPELEIVRAALDNTPGAPNAEVRV*
Ga0137380_116760401 | 3300012206 | Vadose Zone Soil | LAAVMKTMGHRDVRTAMHYQDPELEIVRAALDYNAGAKGAELSV*
Ga0137379_111928372 | 3300012209 | Vadose Zone Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELEIVRAALDYNASPAAVEI
Ga0137384_103379061 | 3300012357 | Vadose Zone Soil | RTMGHRDVKTAMHYQHPELEIVRAALDYGVPNEVAEARV*
Ga0137384_111721391 | 3300012357 | Vadose Zone Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELEIVRAALDYNASPAAVEISV*
Ga0137361_101763654 | 3300012362 | Vadose Zone Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDYNTANETAEMRA*
Ga0137390_114687472 | 3300012363 | Vadose Zone Soil | MKTMGHRDVKTAMHYQHPELEIVRAALDYNESVNGAETRV*
Ga0137373_112384331 | 3300012532 | Vadose Zone Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPDLEIVRAALDYSAGAKGAEMTV*
Ga0137359_101116563 | 3300012923 | Vadose Zone Soil | MKTMGHRDVKPAMHYQHPELEIVRAALDYGATSVIGET*
Ga0126375_108616992 | 3300012948 | Tropical Forest Soil | MRTGNLAAVMRTMGHRDVKTAMQYQHPELEVVRAALDYGMPNDVVAVRA*
Ga0126369_136371341 | 3300012971 | Tropical Forest Soil | MGHRDVKTAIDYQHPELEIVRAALDYSAGASNAEMRM*
Ga0181533_100491310 | 3300014152 | Bog | MGHRDVKTAMRYQHPELEVVRAALDYGTASETAEMRA*
Ga0182038_115424831 | 3300016445 | Soil | YRTRVLMRTGNLAAVMRTMGHRDVKTAMHYQHPELEVVRAALDYGTESSAVEMRV
Ga0187802_101469043 | 3300017822 | Freshwater Sediment | MGHRDVKTAMHYQHPELEVVRAALDHGMPNDIAETRV
Ga0187818_102459272 | 3300017823 | Freshwater Sediment | MRTGNLAAVMKTMGHRDVKTAMHYQHTELDVVRAALDYDAASETAEI
Ga0187817_107564331 | 3300017955 | Freshwater Sediment | MRTMGHRDVRAAMHYQHPELEIVRAALDKSEATDAIETRV
Ga0187816_101116291 | 3300017995 | Freshwater Sediment | VMKTMGHRDVKTAMHYQHPELEVVRAALDSGPANETAALRV
Ga0187872_100247033 | 3300018017 | Peatland | MGHRDVKTAMRYQHPELEVVRAALDYGTASETAEMRA
Ga0187857_104889882 | 3300018026 | Peatland | MRTGNLAAVMKTMGHRDVKTAMHYQHPELGVVRAALDYGATTETAEIRA
Ga0187858_102303101 | 3300018057 | Peatland | MRTGNLAAVMKTMGHRDVKTAMHYQHPELGVVRAALYYGATTETAEIRA
Ga0187784_100173338 | 3300018062 | Tropical Peatland | MRTMGHRDVKTAMHYQHPELEVVRAALDYGMPNDTAMQE
Ga0187769_100025046 | 3300018086 | Tropical Peatland | MKTMGHRDVKTAMHYQHPELEIVRAALDYGEEANTTETGA
Ga0187771_100709393 | 3300018088 | Tropical Peatland | VMKTMGHRDVKTAMHYQHPELEIVRAALDYGEEANTTETGA
Ga0187771_108498641 | 3300018088 | Tropical Peatland | MKTMGHRDVKTAMHYQHPELDVVRAALDYGAANEPAEMRA
Ga0187770_108850771 | 3300018090 | Tropical Peatland | MGHRDVKTAMHYQHPELDVVRAALDWGSASETAELRV
Ga0066667_107635692 | 3300018433 | Grasslands Soil | MKTMGHRDVKTAMHYQHPEQEIVRAALDYNASVKRAEIIRV
Ga0206352_100027363 | 3300020078 | Corn, Switchgrass And Miscanthus Rhizosphere | MKVMGHKDVRTAMRYQHPELDIVRDALNNQTAVVQ
Ga0210407_107227752 | 3300020579 | Soil | MRTGNLAAVMRTMGHRDVKTAMPYQHPELEIVRAAL
Ga0210401_100206422 | 3300020583 | Soil | MRTGNLAAVMRTMGHRDVKTAMHYQHPELEIVRAALDTVRQSLK
Ga0210393_100891984 | 3300021401 | Soil | MRTGNLAAVMRTMGHRDVKTAMHYQHPELVIVRAALD
Ga0210384_100618464 | 3300021432 | Soil | MRTGNLAAVMRTMGHRDVRTAMHYQHPELEIVGAALDYGAPPDSNEPTVLAG
Ga0126371_100004864 | 3300021560 | Tropical Forest Soil | MRTGNLAAVMKTMGHRDVKSAMHYQRPELDVVRAALDWGAESETAETRA
Ga0126371_100407055 | 3300021560 | Tropical Forest Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELEIVRAALDSALVSE
Ga0126371_131234561 | 3300021560 | Tropical Forest Soil | MRTGNLAAVMKAMGHRDVKTAMNYQHPELDIVRAAPDYTANSAQIRV
Ga0212123_10000028248 | 3300022557 | Iron-Sulfur Acid Spring | MKTMGHRDVKTAMHYQHPELDVVHAALDYGAAGEAAEMRR
Ga0207692_101061994 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | MRTGNLAAVMKTMGHRDAKTAMHYQHPELDVVRAALDYSAES
Ga0207693_102144953 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MGHRDVKTAMHYQHPELDVVRAALDCGAATETAEMRA
Ga0207646_100381521 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | NLAAVMRTMGHRDVKTAMHYQHPELEVVRAALDYGMPKDIAEMRG
Ga0209154_100036612 | 3300026317 | Soil | MRTGNLAAVMKTMGHRDVKAAMHYQHPELDVVRAALDYGAASETVERRA
Ga0209473_10271384 | 3300026330 | Soil | TRVLMRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDYGAASEPAEMRA
Ga0209161_102962011 | 3300026548 | Soil | TGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDYGAASEPAEMRA
Ga0209060_1000026313 | 3300027826 | Surface Soil | MRTGNLVAVMRTMGHRHVKTAMHYQHPELETVRAALD
Ga0209180_102309402 | 3300027846 | Vadose Zone Soil | MKTMGHRDVKTAMHYQHPELEIVRAALDYNESAKGAEMRV
Ga0137415_101042585 | 3300028536 | Vadose Zone Soil | MRTGNLAAVMRTMGHRDVKTAMRYQHPELETVRAALRRL
Ga0310915_109259891 | 3300031573 | Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRVARITARQ
Ga0318572_109848402 | 3300031681 | Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRVAR
Ga0307476_1000042330 | 3300031715 | Hardwood Forest Soil | MGHHDVKTAMQYQHPELEIVRAALDYGASSTKSAA
Ga0310813_115376401 | 3300031716 | Soil | MGHRDVKIAMHYQHPELDVVRAALDYGTASETGGMRA
Ga0307474_1000003798 | 3300031718 | Hardwood Forest Soil | MRIGNLAVVMRTMGHRDVKTATHYQHPELEVVRAALDYGAPNDAAEARA
Ga0307474_1000017454 | 3300031718 | Hardwood Forest Soil | MGHRDVKTAMHYQHPELEIVRAALDYGASSTKSAA
Ga0307474_100257246 | 3300031718 | Hardwood Forest Soil | MRTGNLSAVMKTMGHRDVKTAMHYQPPELDVARAALDYGTASESAEIQA
Ga0307469_122009132 | 3300031720 | Hardwood Forest Soil | MRTGNLAAVMRTMGHRDVKTAMHYQHPELEIVRAALD
Ga0307478_118229392 | 3300031823 | Hardwood Forest Soil | MRTGNLAAAMRTMGHRDVKTAMHYQHPELEIVRADARL
Ga0310917_107110901 | 3300031833 | Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDYGGAGETAEMRA
Ga0310910_110337681 | 3300031946 | Soil | HRDVKTAMHYQHPELEIVRAALDYRAGANSAQIRV
Ga0335069_102665692 | 3300032893 | Soil | MRTGNLAAVMKTMGHRDVKTAMHYQHPELDVVRAALDFGAAGETAEMRA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice