NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F102875



Overview

Basic Information
Family ID F102875
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 45 residues
Representative Sequence LESSYEEALEFESYLQEAQAASPEFAEGVQAFLARRAKK
Number of Associated Samples 86
Number of Associated Scaffolds 101
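The length figures above can be checked directly against the representative sequence. The following is a minimal sketch, not part of NMPFamsDB: the constant and function names are illustrative, and the trailing `*` that appears on some family sequences is assumed to be a stop marker that should not count as a residue.

```python
# Values copied from the Basic Information block for family F102875.
REPRESENTATIVE = "LESSYEEALEFESYLQEAQAASPEFAEGVQAFLARRAKK"
AVERAGE_LENGTH = 45  # residues, as reported for the family


def sequence_length(seq: str) -> int:
    """Count residues, ignoring a trailing '*' (assumed to be a stop marker)."""
    return len(seq.rstrip("*"))


rep_length = sequence_length(REPRESENTATIVE)
# Difference between the representative sequence and the family average.
delta = rep_length - AVERAGE_LENGTH
print(rep_length, delta)
```

The representative sequence is shorter than the family average, which is consistent with many members carrying additional N-terminal residues.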

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 98.02 %
% of genes from short scaffolds (< 2000 bps) 82.18 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.55
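The percentages above can be turned back into approximate gene counts. This is a sketch under one assumption: that each percentage is computed over the 101 sequences of the family. The helper name is hypothetical.

```python
N_SEQUENCES = 101  # "Number of Sequences" reported for the family


def percent_to_count(percent: float, total: int = N_SEQUENCES) -> int:
    """Convert a reported percentage to the nearest whole number of genes.

    Assumes the percentage was computed over `total` family members.
    """
    return round(percent * total / 100)


near_scaffold_ends = percent_to_count(98.02)   # genes near scaffold ends
on_short_scaffolds = percent_to_count(82.18)   # genes on scaffolds < 2000 bps
with_valid_rbs = percent_to_count(0.00)        # genes with valid RBS motifs
print(near_scaffold_ends, on_short_scaffolds, with_valid_rbs)
```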


Most Common Taxonomy
Group Bacteria (81.188 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (28.713 % of family members)
Environment Ontology (ENVO) Unclassified (41.584 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (64.356 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 49.25%, β-sheet: 0.00%, Coil/Unstructured: 50.75%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.55

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
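The confidence rule stated above can be written as a small check. This is an illustrative sketch only: the function names are hypothetical, the 0.7 cutoff is the one quoted in the note, and the handling of non-integer pLDDT values (assigning them by upper bin limit) is an assumption.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the Low Quality Model note

# Upper limits and labels of the pLDDT bins listed for the structure viewer.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float) -> bool:
    """Return True when a model falls below the pTM cutoff."""
    return ptm < PTM_THRESHOLD


def plddt_bin(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its bin label.

    Assumption: fractional scores go to the first bin whose upper limit
    is greater than or equal to the score.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT out of range: {score}")
    for upper, label in PLDDT_BINS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score already validated")


print(is_low_confidence(0.55), plddt_bin(65))
```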



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF00108 | Thiolase_N | 58.42
PF02803 | Thiolase_C | 23.76
PF00440 | TetR_N | 3.96
PF01209 | Ubie_methyltran | 3.96
PF00348 | polyprenyl_synt | 1.98
PF01850 | PIN | 1.98
PF04964 | Flp_Fap | 0.99
PF10531 | SLBB | 0.99
PF14559 | TPR_19 | 0.99
PF00378 | ECH_1 | 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG0183 | Acetyl-CoA acetyltransferase | Lipid transport and metabolism [I] | 82.18
COG2226 | Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG | Coenzyme transport and metabolism [H] | 3.96
COG2227 | 2-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol methylase | Coenzyme transport and metabolism [H] | 3.96
COG0142 | Geranylgeranyl pyrophosphate synthase | Coenzyme transport and metabolism [H] | 1.98
COG3847 | Flp pilus assembly protein, pilin Flp | Extracellular structures [W] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 81.19 %
Unclassified | root | N/A | 18.81 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000880|AL20A1W_1220087 | All Organisms → cellular organisms → Bacteria | 557
3300001536|A1565W1_10582380 | All Organisms → cellular organisms → Bacteria | 744
3300001536|A1565W1_10871859 | All Organisms → cellular organisms → Bacteria | 955
3300002028|A17_1001803 | All Organisms → cellular organisms → Bacteria | 697
3300005179|Ga0066684_10009474 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4640
3300005179|Ga0066684_10739183 | All Organisms → cellular organisms → Bacteria | 655
3300005184|Ga0066671_10229175 | All Organisms → cellular organisms → Bacteria | 1133
3300005468|Ga0070707_100646132 | All Organisms → cellular organisms → Bacteria | 1020
3300005540|Ga0066697_10074996 | All Organisms → cellular organisms → Bacteria | 1946
3300005540|Ga0066697_10653561 | All Organisms → cellular organisms → Bacteria | 578
3300005546|Ga0070696_100706717 | All Organisms → cellular organisms → Bacteria | 822
3300005547|Ga0070693_101360537 | All Organisms → cellular organisms → Bacteria | 551
3300005553|Ga0066695_10061707 | All Organisms → cellular organisms → Bacteria | 2249
3300005554|Ga0066661_10348930 | All Organisms → cellular organisms → Bacteria | 906
3300005554|Ga0066661_10776897 | All Organisms → cellular organisms → Bacteria | 561
3300005555|Ga0066692_11034947 | All Organisms → cellular organisms → Bacteria | 502
3300005556|Ga0066707_10943480 | All Organisms → cellular organisms → Bacteria | 528
3300005558|Ga0066698_10239124 | Not Available | 1251
3300005568|Ga0066703_10171525 | All Organisms → cellular organisms → Bacteria | 1309
3300005575|Ga0066702_10446819 | All Organisms → cellular organisms → Bacteria | 792
3300005575|Ga0066702_10877355 | All Organisms → cellular organisms → Bacteria | 535
3300005576|Ga0066708_10008991 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4589
3300005576|Ga0066708_10035686 | All Organisms → cellular organisms → Bacteria | 2691
3300005598|Ga0066706_10250927 | All Organisms → cellular organisms → Bacteria | 1377
3300006800|Ga0066660_10419239 | All Organisms → cellular organisms → Bacteria | 1111
3300006804|Ga0079221_11090651 | All Organisms → cellular organisms → Bacteria | 610
3300006806|Ga0079220_10178487 | All Organisms → cellular organisms → Bacteria | 1203
3300006854|Ga0075425_101715232 | All Organisms → cellular organisms → Bacteria | 706
3300006864|Ga0066797_1051134 | All Organisms → cellular organisms → Bacteria | 1454
3300006914|Ga0075436_100621967 | All Organisms → cellular organisms → Bacteria | 796
3300007265|Ga0099794_10477847 | Not Available | 655
3300009088|Ga0099830_11157490 | All Organisms → cellular organisms → Bacteria | 642
3300009089|Ga0099828_10011954 | All Organisms → cellular organisms → Bacteria | 6509
3300009089|Ga0099828_10040062 | All Organisms → cellular organisms → Bacteria | 3832
3300009089|Ga0099828_10892473 | Not Available | 794
3300009090|Ga0099827_10181105 | All Organisms → cellular organisms → Bacteria | 1743
3300010301|Ga0134070_10269785 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 641
3300010326|Ga0134065_10230052 | Not Available | 683
3300010335|Ga0134063_10168045 | Not Available | 1023
3300010336|Ga0134071_10020984 | All Organisms → cellular organisms → Bacteria | 2767
3300010358|Ga0126370_11507557 | All Organisms → cellular organisms → Bacteria | 639
3300010361|Ga0126378_12156640 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 636
3300010400|Ga0134122_11015575 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 814
3300011269|Ga0137392_10025075 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4250
3300011270|Ga0137391_11134559 | All Organisms → cellular organisms → Bacteria | 629
3300011270|Ga0137391_11286302 | All Organisms → cellular organisms → Bacteria | 578
3300011271|Ga0137393_10454293 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1098
3300011271|Ga0137393_10759121 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 830
3300011998|Ga0120114_1010942 | All Organisms → cellular organisms → Bacteria | 2052
3300011998|Ga0120114_1023661 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1269
3300011999|Ga0120148_1044995 | Not Available | 909
3300012096|Ga0137389_11755822 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas | 516
3300012189|Ga0137388_11363721 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 648
3300012199|Ga0137383_10159480 | All Organisms → cellular organisms → Bacteria | 1652
3300012208|Ga0137376_10071220 | All Organisms → cellular organisms → Bacteria | 2893
3300012208|Ga0137376_10391236 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1207
3300012209|Ga0137379_11287745 | All Organisms → cellular organisms → Bacteria | 637
3300012351|Ga0137386_10371685 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Mycobacteriaceae → Mycobacterium → Mycobacterium pseudokansasii | 1029
3300012359|Ga0137385_10020263 | All Organisms → cellular organisms → Bacteria | 5942
3300012359|Ga0137385_11117146 | All Organisms → cellular organisms → Bacteria | 648
3300012363|Ga0137390_12012282 | Not Available | 503
3300012917|Ga0137395_10163739 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1530
3300012918|Ga0137396_10282133 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1227
3300012927|Ga0137416_11592057 | All Organisms → cellular organisms → Bacteria | 595
3300013503|Ga0120127_10157880 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 545
3300013764|Ga0120111_1042306 | All Organisms → cellular organisms → Bacteria | 1169
3300013772|Ga0120158_10075515 | All Organisms → cellular organisms → Bacteria | 2150
3300015358|Ga0134089_10227084 | Not Available | 757
3300015373|Ga0132257_104190511 | All Organisms → cellular organisms → Bacteria | 525
3300017654|Ga0134069_1089503 | All Organisms → cellular organisms → Bacteria | 997
3300018433|Ga0066667_11153497 | Not Available | 672
3300018468|Ga0066662_12910767 | All Organisms → cellular organisms → Bacteria | 508
3300018482|Ga0066669_10010359 | All Organisms → cellular organisms → Bacteria | 4715
3300021046|Ga0215015_10420603 | All Organisms → cellular organisms → Bacteria | 690
3300022691|Ga0248483_164366 | All Organisms → cellular organisms → Bacteria | 2852
3300025910|Ga0207684_10502473 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1039
3300025913|Ga0207695_11010701 | All Organisms → cellular organisms → Bacteria | 711
3300026298|Ga0209236_1014833 | All Organisms → cellular organisms → Bacteria | 4493
3300026298|Ga0209236_1203321 | All Organisms → cellular organisms → Bacteria | 743
3300026309|Ga0209055_1201154 | Not Available | 609
3300026313|Ga0209761_1344284 | Not Available | 505
3300026324|Ga0209470_1077451 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas | 1530
3300026333|Ga0209158_1228739 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Rubrobacteria → Rubrobacterales → Rubrobacteraceae → environmental samples → uncultured Rubrobacteraceae bacterium | 640
3300026335|Ga0209804_1220551 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Rhizobium → Rhizobium freirei → Rhizobium freirei PRF 81 | 759
3300026343|Ga0209159_1036669 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 2548
3300026527|Ga0209059_1060755 | All Organisms → cellular organisms → Bacteria | 1644
3300026529|Ga0209806_1161777 | Not Available | 840
3300026530|Ga0209807_1003691 | All Organisms → cellular organisms → Bacteria | 7517
3300026530|Ga0209807_1138210 | Not Available | 954
3300026537|Ga0209157_1052802 | All Organisms → cellular organisms → Bacteria | 2145
3300026538|Ga0209056_10162507 | All Organisms → cellular organisms → Bacteria | 1689
3300026551|Ga0209648_10406676 | Not Available | 887
3300027587|Ga0209220_1035094 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas | 1344
3300027643|Ga0209076_1125042 | Not Available | 726
3300027738|Ga0208989_10137027 | Not Available | 825
3300027910|Ga0209583_10203481 | Not Available | 847
3300028047|Ga0209526_10624062 | All Organisms → cellular organisms → Bacteria | 687
3300028536|Ga0137415_10324460 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1343
3300028819|Ga0307296_10749301 | Not Available | 533
3300031754|Ga0307475_11580041 | Not Available | 501
3300032160|Ga0311301_11767246 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas | 739
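Each scaffold identifier above has two parts joined by `|`: a numeric prefix that matches a Taxon OID in the Associated Samples table, and a scaffold name. A minimal sketch of splitting the two follows; the class and function names are hypothetical and not an NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # numeric sample identifier before the '|'
    scaffold_name: str  # scaffold name after the '|'


def split_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two components."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = split_scaffold_id("3300000880|AL20A1W_1220087")
print(ref.taxon_oid, ref.scaffold_name)
```

Keeping the Taxon OID as a string avoids losing information if identifiers are later compared with text fields from other tables.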




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 28.71%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 25.74%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 8.91%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.93%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.94%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.96%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.97%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.98%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.98%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.98%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.98%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.99%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.99%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 0.99%
Permafrost And Active Layer Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost And Active Layer Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.99%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil | 0.99%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.99%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.99%
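The habitat distribution above can be rolled up to a coarser ecosystem level by summing percentages over a shared path prefix. The sketch below is illustrative: the rows are copied from the table, while the function name and the choice of grouping level are assumptions.

```python
from collections import defaultdict

# (GOLD ecosystem path, % of family members), copied from the habitat table.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 28.71),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 25.74),
    ("Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost", 8.91),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 6.93),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 5.94),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 3.96),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 2.97),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 1.98),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil", 1.98),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil", 1.98),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 1.98),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 0.99),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil", 0.99),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil", 0.99),
    ("Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost And Active Layer Soil", 0.99),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 0.99),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 0.99),
    ("Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil", 0.99),
    ("Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere", 0.99),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere", 0.99),
]


def totals_by_level(rows, level=0):
    """Sum percentages grouped by the path component at `level` (0 = top level)."""
    totals = defaultdict(float)
    for path, percent in rows:
        totals[path.split(" → ")[level]] += percent
    # Round to two decimals to match the precision of the reported values.
    return {key: round(value, 2) for key, value in totals.items()}


top_level = totals_by_level(HABITATS)
overall = round(sum(percent for _, percent in HABITATS), 2)
print(top_level, overall)
```

Because each row is rounded to two decimals, the overall total falls slightly short of 100 %.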




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000880 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A35-65cm-20A)- 1 week illumina | Environmental
3300001536 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A15-65cm-8A)- 1 week illumina | Environmental
3300002028 | Permafrost and active layer soil microbial communities from McGill Arctic Research Station (MARS), Canada, for enrichment studies - Sample_A17 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005184 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005547 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaG | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006864 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 3 DNA2013-193 | Environmental
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011998 | Permafrost microbial communities from Nunavut, Canada - A30_35cm_6M | Environmental
3300011999 | Permafrost microbial communities from Nunavut, Canada - A28_65cm_6M | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300013503 | Permafrost microbial communities from Nunavut, Canada - A23_5cm_12M | Environmental
3300013764 | Permafrost microbial communities from Nunavut, Canada - A28_35cm_6M | Environmental
3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300022691 | Soil microbial communities from Calhoun CZO, South Carolina, United States - 60cm depth | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025913 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes) | Host-Associated
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental
3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026530 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes) | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027587 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes) | Environmental
3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental
3300027738 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes) | Environmental
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028819 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300032160 | Sb_50d combined assembly (MetaSPAdes) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
AL20A1W_12200872 | 3300000880 | Permafrost | PRGALAGAKRSVVHALESTYEEALEFESYLQEAQAASPEFSEGVRTFLAKRAKK*
A1565W1_105823803 | 3300001536 | Permafrost | VHALESTYEEALEFESYLQEAQAASPEFSEGVRTFLAKRAKK*
A1565W1_108718592 | 3300001536 | Permafrost | LVHALESSYEEALEFESYLQEAQAASSEFTEGVSAFLAKRAKK*
A17_10018032 | 3300002028 | Permafrost And Active Layer Soil | HALESSYEEALEFESYLQEAQAASPEFAEGVAAFLARGAKK*
Ga0066684_100094747 | 3300005179 | Soil | AAAKRAVNHALDSTFEQALEFESYLQEAQAASPEFAEGVASFLARRASKK*
Ga0066684_107391831 | 3300005179 | Soil | MAGAKRAVNHALTSTYEEAMEFESYLQEAQAGSPEFAEGVRSFLARRAVKK*
Ga0066671_102291752 | 3300005184 | Soil | AIAAAKRAVNHALDSTFEQALEFESYLQEAQAASPEFAEGVANFLARRASRK*
Ga0070707_1006461321 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | AAKRAVNHALESSYEEALEFESYLQEAQAWSPEFAEGVQKFLARRSKK*
Ga0066697_100749961 | 3300005540 | Soil | SQPRQAVAAAKRAVIHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRKRN*
Ga0066697_106535612 | 3300005540 | Soil | SSYEGALEFESYLQEAQAGSQEFADGVQAFLARRKK*
Ga0070696_1007067172 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | LESSYEEALEFESYLQEAQAASPEFAEGVSAFLAKRSKK*
Ga0070693_1013605372 | 3300005547 | Corn, Switchgrass And Miscanthus Rhizosphere | NHALTSSYEDAMEFESYLQEAQAGSAEFAEGVQAFLARRASKK*
Ga0066695_100617074 | 3300005553 | Soil | AGAKRAVNHALTSTYEEAMEFESYLQEAQAGSSEFAEGVQKFLESRRKK*
Ga0066661_103489302 | 3300005554 | Soil | AGAKRAVNHALNSTFEEAMEFESYLQEAQAASPEFAEGVQRFLESRKKK*
Ga0066661_107768972 | 3300005554 | Soil | GALAAAKRAVNHALDSTFEQALEFESYLQEAQAASPEFAEGVANFLARRASRK*
Ga0066692_110349471 | 3300005555 | Soil | TFEQALEFESYLQEAQAASAEFAEGVQAFLSRRSAKQG*
Ga0066707_109434801 | 3300005556 | Soil | NHALESSYEGALEFESYLQEAQAGSQEFADGVQAFLARRKK*
Ga0066698_102391241 | 3300005558 | Soil | YEEAMEFESYLQEAQAASPEFAEGVRNFLARRAAKKK*
Ga0066703_101715252 | 3300005568 | Soil | STYEEAMEFESYLQEAQAASPEFAEGVQNFLARRATKK*
Ga0066702_104468191 | 3300005575 | Soil | AAAKRAVNHALESSFEDALEFESYLQESQAASPEFAEGVQAFLARRSKK*
Ga0066702_108773551 | 3300005575 | Soil | ANQLAAQPRQAMAGAKRAVIHALESSYEEALVFESYLQEAQAASSEFAEGVQSFLARRAAKQS*
Ga0066708_100089917 | 3300005576 | Soil | AAAKRAVNHALDSTFEQALEFESYLQEAQAASPEFAEGVANFLARRASRK*
Ga0066708_100356864 | 3300005576 | Soil | AKRAVNHALESSYEGALEFESYLQEAQAGSQEFADGVQAFLARRKK*
Ga0066706_102509272 | 3300005598 | Soil | GAKRAVLHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAAKK*
Ga0066660_104192391 | 3300006800 | Soil | LAAQPRGALAGAKRAVTHALESTFEQALEFESYLQEAQAASPEFAEGVQNFLARRAAKQT
Ga0079221_110906512 | 3300006804 | Agricultural Soil | LNSTFEEAMEFESYLQEAQAASPEFVEGVQNFLARRASKK*
Ga0079220_101784872 | 3300006806 | Agricultural Soil | AKRAVNHALTSSFEDAMEFESYLQEAQAGSSEFAEGVQNFLARRASKK*
Ga0075425_1017152321 | 3300006854 | Populus Rhizosphere | TSTYEEAMEFESYLQEAQAGSSEFAEGVQKFLQSRSKK*
Ga0066797_10511342 | 3300006864 | Soil | LESSFEEALEFESYLQEAQAASPEFAEGVSAFLAKRTKK*
Ga0075436_1006219672 | 3300006914 | Populus Rhizosphere | VNHALGSSYEEAMEFESYLQEAQAGSSEFAEGVQNFLARRAARK*
Ga0099794_104778471 | 3300007265 | Vadose Zone Soil | IHALESSYEEALEFESYLQEAQSASPEFAEGVAAFLAKRGKK*
Ga0099830_111574902 | 3300009088 | Vadose Zone Soil | AAAKRAVNHALESSYEEALEFESYLQEAQAWSPEFAEGVQKFLARRSKK*
Ga0099828_1001195410 | 3300009089 | Vadose Zone Soil | AVNHALESSYEEALEFESYLQEAQAASPEFAEGVQAFLARRAKRP*
Ga0099828_100400626 | 3300009089 | Vadose Zone Soil | ESSYEGALEFESYLQEAQAATPEFAEMVQAFLARRAAKK*
Ga0099828_108924731 | 3300009089 | Vadose Zone Soil | KRAVNHALESSYEEALEFESYLQEAQAWSPEFAEGVQKFLARRSEK*
Ga0099827_101811053 | 3300009090 | Vadose Zone Soil | MAAAKRAVNHALESSYEEALEFESYLQEAQAWSPEFAEGVQKFLARRKKS*
Ga0134070_102697851 | 3300010301 | Grasslands Soil | LAAAKRAVNHALESSYEDALEFESYLQEAQAGTSEFAELVQAFLARRAAKK*
Ga0134065_102300521 | 3300010326 | Grasslands Soil | RAVNHALESSYEGALEFESYLQEAQAGSQEFADGVQAFLARRKK*
Ga0134063_101680451 | 3300010335 | Grasslands Soil | LTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAKK*
Ga0134071_100209841 | 3300010336 | Grasslands Soil | KRAVNHALTSTYEEAMEFESYLQEAQAGSSEFAEGVQKFLESRSKK*
Ga0126370_115075571 | 3300010358 | Tropical Forest Soil | NSTFEEAMEFESYLQEAQAGSTEFVEGVQNFLARRAAKQ*
Ga0126378_121566402 | 3300010361 | Tropical Forest Soil | VNHALSSTFEEAMEFESYLQEAQVGSAEFAEGVQNFLARRSAKK*
Ga0134122_110155751 | 3300010400 | Terrestrial Soil | VNHALTSTYEEAMEFESYLQEAQAGSSEFAEGVQNFLARRAKK*
Ga0137392_100250756 | 3300011269 | Vadose Zone Soil | EGALEFESYLQEAQAATPEFAEMVQAFLARRAAKK*
Ga0137391_111345591 | 3300011270 | Vadose Zone Soil | AKRAVNHALESSYEEALEFESYLQEAQAWSPEFAEGVQKFLARRSKK*
Ga0137391_112863021 | 3300011270 | Vadose Zone Soil | RAVNHALESSYEEALEFESYLQEAQAASPEFAEGVAAFLARRSKKQ*
Ga0137393_104542932 | 3300011271 | Vadose Zone Soil | AAAKRAVNQALESNFEEALEFESYLQEGQAWSPEFAEGVQKFLARRTQK*
Ga0137393_107591211 | 3300011271 | Vadose Zone Soil | TFEQALEFESYLQEAQAASPEFAEGVQNFLARRAAKK*
Ga0120114_10109423 | 3300011998 | Permafrost | LESSYEEALEFESYLQEAQAASPEFAEGVQAFLARRAKK*
Ga0120114_10236612 | 3300011998 | Permafrost | NHALESSYEEALEFESYLQEAQAASQEFVDGVQAFLARRAKK*
Ga0120148_10449951 | 3300011999 | Permafrost | VTHALESSYEEALEFESYLQEAQAASPEFAEGVQAFLAKRSKK*
Ga0137389_117558222 | 3300012096 | Vadose Zone Soil | EQAMEFESYLQEAQVATPEFAEGVQAFLARRAAKQK*
Ga0137388_113637211 | 3300012189 | Vadose Zone Soil | LAAQPRGAMAGAKRAVNHALESTFEQALEFESYLQEAQAASPEFAEGVQNFLARRAAKK*
Ga0137383_101594803 | 3300012199 | Vadose Zone Soil | EEALEFESYLQEAQAGSQEFRDGVSAFLAKRSKK*
Ga0137376_100712205 | 3300012208 | Vadose Zone Soil | HALESSYEGALEFESYLQEAQAGSQEFADGVQAFLARRKK*
Ga0137376_103912361 | 3300012208 | Vadose Zone Soil | AQPRQAMAGAKRAVNHALTSSFEEAMEFESYLQEAQAGSSEFAEGVQNFLARRKK*
Ga0137379_112877452 | 3300012209 | Vadose Zone Soil | EEALEFESYLQEVQAGSQEFADGVQAFLARRAKK*
Ga0137386_103716851 | 3300012351 | Vadose Zone Soil | LESSYEDALEFQSYLQEAQAASPEFAEGVANFLARRSQK*
Ga0137385_1002026313300012359Vadose Zone SoilAVNHALVSSYEDAMEFESYLQEAQAASSEIAEGVQKVHARRSSSKK*
Ga0137385_1111714613300012359Vadose Zone SoilQALAAAKRAVNHALESSYEDALEFESYLQEAQAGTSEFAELVQAFLARRAAKK*
Ga0137390_1201228223300012363Vadose Zone SoilAGQLAKQPRQALAAAKRAVNHALESSYEGALEFESYLQEAQAATPEFAEMVQAFLARRAAKK*
Ga0137395_1016373923300012917Vadose Zone SoilRQALAAAKRAVNHALESSYEDALEFESYLQEAQAATPEFAEMVQAFLARRAAKK*
Ga0137396_1028213323300012918Vadose Zone SoilPRQAVAGAKRAVLHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAAKK*
Ga0137416_1159205723300012927Vadose Zone SoilVVHALQSSFEEALEFESYLQEAQAASPEFADGVQAFLARRNKKK*
Ga0120127_1015788023300013503PermafrostEEALEFESYLQEAQAGSQEFADGVQAFMARRAKK*
Ga0120111_104230623300013764PermafrostYEEALEFESYLQEAQAASQEFVDGVQAFLARRAKK*
Ga0120158_1007551543300013772PermafrostHALESSYEEALEFESYLQEAQAASQEFVDGVQAFLARRAKK*
Ga0134089_1022708413300015358Grasslands SoilVNHALTSTYEEAMEFESYLQEAQAGSSEFAEGVQKFLESRSKK*
Ga0132257_10419051113300015373Arabidopsis RhizosphereNHLVSQPRQALAGAKRAVNHALTSSFEDAMEFESYLQEAQAGSAEFAEGVQNFLARRASKK*
Ga0134069_108950323300017654Grasslands SoilESSYEGALEFESYLQEAQAGSQEFADGVQAFLARRKK
Ga0066667_1115349723300018433Grasslands SoilPRQAVAAAKRAVIHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRKRN
Ga0066662_1291076713300018468Grasslands SoilESSFEEALEFESYLQEAQVASPEFVEGVQAFLARRSKKP
Ga0066669_1001035983300018482Grasslands SoilQLGAQPRQAVAGAQRAGLHALSSTYGEAMEFESYLQEAQAASPEFAEGVQNFLARRAKK
Ga0215015_1042060313300021046SoilGAKRARVHALDSSFEQAMEFESYLQEAQAASPEFAEGVQAFLSKRARK
Ga0248483_16436643300022691SoilYEEALEFESYLQEAQAASPEFAEGVAAFLAKRAKR
Ga0207684_1050247323300025910Corn, Switchgrass And Miscanthus RhizosphereEDALEFESYLQEAQAASPEFAEGVQAFLARRAQKK
Ga0207695_1101070123300025913Corn RhizosphereSQPPNAMASAKRAVNNALNSTYDEAMEFESYLQEAQAGSQEFVDGVQAFIARRAKK
Ga0209236_101483313300026298Grasslands SoilALESSFEEALEFESYLQEAQVASPEFAEGVQAFLARRSKKP
Ga0209236_120332123300026298Grasslands SoilSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAAKK
Ga0209055_120115423300026309SoilLHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRKRN
Ga0209761_134428413300026313Grasslands SoilYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAAKK
Ga0209470_107745113300026324SoilALAAAKRAVNHALESSYEGALEFESYLQEAQAGSQEFADGVQAFLARRKK
Ga0209158_122873923300026333SoilSSFEEALEFESYLQEAQAASPEFAEGVQAFLARRKKS
Ga0209804_122055123300026335SoilVLHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRKRD
Ga0209159_103666913300026343SoilVSSYEDAMEFESYLQEAQAASSEFAEGVQNFLARRSTKK
Ga0209059_106075533300026527SoilHALESTFEQALEFESYLQEAQAASPEFAEGVQAFLARRAAKK
Ga0209806_116177713300026529SoilALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRATKK
Ga0209807_1003691113300026530SoilQPRQAVAAAKRAVIHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAKK
Ga0209807_113821013300026530SoilQPRQAVAAAKRAVIHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAAKK
Ga0209157_105280223300026537SoilVNHALESSYEDALEFESYLQEAQAGTSEFAELVQAFLARRAAKK
Ga0209056_1016250733300026538SoilAGAKRAVLHALTSTYEEAMEFESYLQEAQAASPEFAEGVQNFLARRAAKK
Ga0209648_1040667613300026551Grasslands SoilNHALESSYEEALEFESYLQEAQAWSPEFAEGVQKFLARRSKK
Ga0209220_103509423300027587Forest SoilVHALESSYEEALEFESYLQEAQAASPEFREGVSAFLAKRGKR
Ga0209076_112504213300027643Vadose Zone SoilMAAAKRAVNHALESSFEEALEFESYLQESQAWSPEFAEGVQAFLARRTKK
Ga0208989_1013702723300027738Forest SoilSFEEALEFESYLQEAQAASPEFADGVQAFLARRNKKK
Ga0209583_1020348123300027910WatershedsVNHALESSYEEALEFESYLQEAQAASQEFVDGVQAFMARRAKK
Ga0209526_1062406223300028047Forest SoilSYEEALEFESYLQEAQAASPEFAEGVQAFLAKRVKK
Ga0137415_1032446023300028536Vadose Zone SoilSSFEEALEFESYLQEAQAASPEFAEGVQAFLARRKK
Ga0307296_1074930113300028819SoilALESSYEEALEFESYLQEAQAASPEFVEGVQAFLAKRSKK
Ga0307475_1158004113300031754Hardwood Forest SoilHALESNYEEALEFESYLQEAQAASPEFIEGVQNFMARRASKK
Ga0311301_1176724613300032160Peatlands SoilVTHALEASFEEALEFESYLQEAQAASAEFAEGVQAFLARRSARK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice