NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F072151

Overview

Basic Information
Family ID F072151
Family Type Metagenome
Number of Sequences 121
Average Sequence Length 45 residues
Representative Sequence MMKDEELRTLLRKRGYTILELSYSSYSDKKRDELYREILNGLGK
Number of Associated Samples 76
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 24.00 %
% of genes near scaffold ends (potentially truncated) 28.10 %
% of genes from short scaffolds (< 2000 bps) 32.23 %
Associated GOLD sequencing projects 66
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.53


Most Common Taxonomy
Group Unclassified (68.595 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(44.628 % of family members)
Environment Ontology (ENVO) Unclassified
(49.587 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.413 % of family members)
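
The percentages in the Overview can be turned back into approximate member counts. The sketch below assumes each percentage is taken over the 121 family members listed under Basic Information; the page does not state the denominator for every row, so treat the counts as estimates. The function name and dictionary labels are illustrative, not part of NMPFamsDB.

```python
# Sketch: recover approximate member counts from the reported percentages.
# Assumption: every percentage is computed over the 121 family members.
FAMILY_SIZE = 121  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a percentage."""
    return round(percent / 100 * total)


# Values copied from the Overview; the labels are shortened for readability.
reported = {
    "genes near scaffold ends": 28.10,
    "genes from short scaffolds (< 2000 bps)": 32.23,
    "taxonomy: Unclassified": 68.595,
    "GOLD: Vadose Zone Soil": 44.628,
    "ENVO: Unclassified": 49.587,
    "EMPO: Soil (non-saline)": 50.413,
}

for label, percent in reported.items():
    print(f"{label}: about {members_from_percent(percent)} of {FAMILY_SIZE}")
```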




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 37.50%, β-sheet: 0.00%, Coil/Unstructured: 62.50%

Predicted 3D Structure

pTM-score: 0.53

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
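
The rule in the note above can be written as a one-line check. This is a minimal sketch: only the 0.7 cutoff and the 0.53 score come from the page, and the function name is hypothetical.

```python
# Sketch of the low-confidence rule quoted above (pTM < 0.7).
LOW_CONFIDENCE_PTM = 0.7  # cutoff stated in the note


def is_low_confidence(ptm: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


# Family F072151 reports a pTM-score of 0.53.
print(is_low_confidence(0.53))
```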



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 121 Family Scaffolds
PF01909 | NTP_transf_2 | 4.13
PF04307 | YdjM | 2.48
PF01850 | PIN | 2.48
PF13432 | TPR_16 | 1.65
PF05168 | HEPN | 1.65
PF04242 | DUF424 | 1.65
PF03965 | Penicillinase_R | 1.65
PF08241 | Methyltransf_11 | 1.65
PF00801 | PKD | 0.83
PF01351 | RNase_HII | 0.83
PF13361 | UvrD_C | 0.83
PF10137 | TIR-like | 0.83
PF01939 | NucS | 0.83
PF13456 | RVT_3 | 0.83
PF13649 | Methyltransf_25 | 0.83
PF00227 | Proteasome | 0.83
PF01548 | DEDD_Tnp_IS110 | 0.83
PF11867 | DUF3387 | 0.83
PF00254 | FKBP_C | 0.83
PF08774 | VRR_NUC | 0.83
PF01844 | HNH | 0.83
PF02182 | SAD_SRA | 0.83
PF00211 | Guanylate_cyc | 0.83
PF04480 | DUF559 | 0.83
PF02371 | Transposase_20 | 0.83
PF00557 | Peptidase_M24 | 0.83
PF05050 | Methyltransf_21 | 0.83
PF00145 | DNA_methylase | 0.83
PF01546 | Peptidase_M20 | 0.83
PF13439 | Glyco_transf_4 | 0.83
PF13482 | RNase_H_2 | 0.83
PF00534 | Glycos_transf_1 | 0.83
PF05866 | RusA | 0.83
PF01555 | N6_N4_Mtase | 0.83
PF08843 | AbiEii | 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 121 Family Scaffolds
COG1988 | Membrane-bound metal-dependent hydrolase YbcI, DUF457 family | General function prediction only [R] | 2.48
COG3682 | Transcriptional regulator, CopY/TcrY family | Transcription [K] | 1.65
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 1.65
COG2412 | Uncharacterized conserved protein, DUF424 domain | Function unknown [S] | 1.65
COG1846 | DNA-binding transcriptional regulator, MarR family | Transcription [K] | 1.65
COG1895 | HEPN domain protein, predicted toxin of MNT-HEPN system | Defense mechanisms [V] | 1.65
COG2250 | HEPN domain protein, predicted toxin of MNT-HEPN system | Defense mechanisms [V] | 1.65
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 0.83
COG5405 | ATP-dependent protease HslVU (ClpYQ), peptidase subunit | Posttranslational modification, protein turnover, chaperones [O] | 0.83
COG4570 | Holliday junction resolvase RusA (prophage-encoded endonuclease) | Replication, recombination and repair [L] | 0.83
COG3484 | Predicted proteasome-type protease | Posttranslational modification, protein turnover, chaperones [O] | 0.83
COG2253 | Predicted nucleotidyltransferase component of viral defense system | Defense mechanisms [V] | 0.83
COG2189 | Adenine specific DNA methylase Mod | Replication, recombination and repair [L] | 0.83
COG0164 | Ribonuclease HII | Replication, recombination and repair [L] | 0.83
COG1637 | Endonuclease NucS, RecB family | Replication, recombination and repair [L] | 0.83
COG1041 | tRNA G10 N-methylase Trm11 | Translation, ribosomal structure and biogenesis [J] | 0.83
COG1039 | Ribonuclease HIII | Replication, recombination and repair [L] | 0.83
COG0863 | DNA modification methylase | Replication, recombination and repair [L] | 0.83
COG0638 | 20S proteasome, alpha and beta subunits | Posttranslational modification, protein turnover, chaperones [O] | 0.83
COG0270 | DNA-cytosine methylase | Replication, recombination and repair [L] | 0.83
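
One way to read the COG table is to total the frequencies per functional category letter. The sketch below does that with values copied from the table; the variable and function names are illustrative. A scaffold may carry more than one neighboring COG, so a category total is a sum of row frequencies and not the share of distinct scaffolds.

```python
from collections import defaultdict

# (COG ID, functional category letter, % frequency), copied from the table.
NEIGHBOR_COGS = [
    ("COG1988", "R", 2.48), ("COG3682", "K", 1.65), ("COG3547", "X", 1.65),
    ("COG2412", "S", 1.65), ("COG1846", "K", 1.65), ("COG1895", "V", 1.65),
    ("COG2250", "V", 1.65), ("COG2114", "T", 0.83), ("COG5405", "O", 0.83),
    ("COG4570", "L", 0.83), ("COG3484", "O", 0.83), ("COG2253", "V", 0.83),
    ("COG2189", "L", 0.83), ("COG0164", "L", 0.83), ("COG1637", "L", 0.83),
    ("COG1041", "J", 0.83), ("COG1039", "L", 0.83), ("COG0863", "L", 0.83),
    ("COG0638", "O", 0.83), ("COG0270", "L", 0.83),
]


def frequency_by_category(rows):
    """Sum the % frequency of every COG row that shares a category letter."""
    totals = defaultdict(float)
    for _cog_id, category, percent in rows:
        totals[category] += percent
    return dict(totals)


totals = frequency_by_category(NEIGHBOR_COGS)
# List categories from the largest summed frequency to the smallest.
for category, percent in sorted(totals.items(), key=lambda item: -item[1]):
    print(f"[{category}] {percent:.2f}")
```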



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 68.60 %
All Organisms | root | All Organisms | 31.40 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002562|JGI25382J37095_10035147 | Not Available | 1963
3300002562|JGI25382J37095_10195670 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 613
3300005166|Ga0066674_10131979 | All Organisms → cellular organisms → Archaea → Euryarchaeota | 1173
3300005167|Ga0066672_10942182 | All Organisms → cellular organisms → Archaea | 531
3300005174|Ga0066680_10401784 | Not Available | 870
3300005176|Ga0066679_10570796 | All Organisms → cellular organisms → Bacteria | 737
3300005447|Ga0066689_10037838 | All Organisms → Viruses → Predicted Viral | 2513
3300005450|Ga0066682_10418577 | Not Available | 856
3300005552|Ga0066701_10108992 | All Organisms → cellular organisms → Archaea | 1627
3300005554|Ga0066661_10836912 | All Organisms → cellular organisms → Archaea | 538
3300005556|Ga0066707_10882634 | Not Available | 549
3300005558|Ga0066698_10339538 | All Organisms → cellular organisms → Archaea | 1039
3300005568|Ga0066703_10741582 | Not Available | 563
3300005568|Ga0066703_10864838 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 516
3300005586|Ga0066691_10279094 | Not Available | 984
3300007265|Ga0099794_10005723 | All Organisms → cellular organisms → Archaea | 5009
3300009822|Ga0105066_1048090 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 892
3300010304|Ga0134088_10105432 | All Organisms → cellular organisms → Archaea | 1328
3300010329|Ga0134111_10475575 | All Organisms → cellular organisms → Archaea | 545
3300010333|Ga0134080_10030663 | Not Available | 2047
3300010333|Ga0134080_10168679 | All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → Thermoplasmata → Methanomassiliicoccales → Methanomassiliicoccaceae → Methanomassiliicoccus → Methanomassiliicoccus luminyensis | 935
3300010333|Ga0134080_10238287 | Not Available | 799
3300010398|Ga0126383_12527957 | Not Available | 598
3300011271|Ga0137393_10921273 | All Organisms → cellular organisms → Bacteria | 745
3300012203|Ga0137399_10000465 | All Organisms → cellular organisms → Archaea | 16004
3300012203|Ga0137399_10449609 | All Organisms → cellular organisms → Archaea | 1078
3300012206|Ga0137380_10047402 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 3944
3300012206|Ga0137380_10338923 | All Organisms → cellular organisms → Archaea | 1342
3300012206|Ga0137380_10591838 | All Organisms → cellular organisms → Archaea → TACK group | 971
3300012209|Ga0137379_11316036 | Not Available | 628
3300012349|Ga0137387_10034091 | All Organisms → cellular organisms → Archaea | 3308
3300012349|Ga0137387_10034291 | All Organisms → cellular organisms → Archaea | 3299
3300012351|Ga0137386_10555930 | All Organisms → cellular organisms → Archaea | 826
3300012359|Ga0137385_10033348 | All Organisms → cellular organisms → Bacteria | 4618
3300012359|Ga0137385_10426475 | All Organisms → cellular organisms → Bacteria | 1129
3300012359|Ga0137385_10658973 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 876
3300012359|Ga0137385_11061525 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 667
3300012930|Ga0137407_10337520 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1385
3300017654|Ga0134069_1309963 | All Organisms → cellular organisms → Archaea → TACK group | 561
3300018468|Ga0066662_11018651 | Not Available | 821
3300025174|Ga0209324_10284646 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1071
3300025313|Ga0209431_10523834 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 899
3300026296|Ga0209235_1165997 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 845
3300026313|Ga0209761_1029439 | All Organisms → cellular organisms → Archaea | 3319
3300026317|Ga0209154_1136873 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1027
3300026317|Ga0209154_1283946 | All Organisms → cellular organisms → Archaea | 550
3300026328|Ga0209802_1085981 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota | 1447
3300026548|Ga0209161_10062448 | All Organisms → Viruses → Predicted Viral | 2377
3300027655|Ga0209388_1013594 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 2225
3300032180|Ga0307471_101556588 | Not Available | 818
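
The taxonomy strings in the scaffold table are lineages joined by " → ". The sketch below reduces one lineage to a top-level group such as Archaea, Bacteria or Viruses. The function name is hypothetical, and the rule (take the rank after "cellular organisms", otherwise the rank after "All Organisms") is an assumption based only on the lineages shown in this table.

```python
# Sketch: reduce a lineage string from the scaffold table to a top-level group.
SEPARATOR = " → "


def top_level_group(taxonomy: str) -> str:
    """Return the domain-like rank of a lineage, or the string itself."""
    ranks = taxonomy.split(SEPARATOR)
    if ranks[0] != "All Organisms" or len(ranks) < 2:
        # Covers "Not Available" and any lineage with an unexpected root.
        return taxonomy
    if ranks[1] == "cellular organisms" and len(ranks) > 2:
        return ranks[2]
    return ranks[1]


# Lineages copied from the table above.
examples = [
    "Not Available",
    "All Organisms → cellular organisms → Archaea → TACK group",
    "All Organisms → cellular organisms → Bacteria",
    "All Organisms → Viruses → Predicted Viral",
]
for lineage in examples:
    print(top_level_group(lineage))
```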




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 44.63%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 23.14%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 18.18%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.26%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.83%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.83%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.83%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.83%



Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009822Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300025174Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 3EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1016660813300002558Grasslands SoilDEELRSLLRKRGYRILELSYGSYTDKKRDELYREILNGLGK*
JGI25384J37096_1015804813300002561Grasslands SoilMKDEELRTLLRKKGYRLLELCYDNYSDKKRDELYEEILANLGMT*
JGI25382J37095_1003514723300002562Grasslands SoilMTKDEELRTLLRKRGYRILELYYDIYSDKKRDQLYEQILKDLSMQ*
JGI25382J37095_1016049913300002562Grasslands SoilHLKTAQMMKDEELRSLLRKRGYRILELSYGTYSDKKRDELYREILNGLGR*
JGI25382J37095_1019567013300002562Grasslands SoilMTKDEELRTLLRKRGYRILELSYSSYSDKKRDELYREIMSGLGK*
JGI25386J43895_1002601923300002912Grasslands SoilMMKDEELRTLLRKRGYTILELSYSSYSDKKRDELYREILNGLGK*
Ga0066674_1013197913300005166SoilMTKDEELRTLLRKRGYRILELFYDSYTDKKREQLYEAILDSLGRQ*
Ga0066674_1050186713300005166SoilLKTTQIVKDEELRSLLGKRGYRVLELAYSSYSDKKRDELYEKILDNLGRS*
Ga0066672_1094218213300005167SoilMSKDEELRTLLRKRGYRILELFYESYSDKKRDQLYEEIQN
Ga0066680_1005375653300005174SoilMMKDEELWTLLRKRGYRILELSYSSYTNKKGDELFREIVNKVGRE*
Ga0066680_1040178413300005174SoilKDEELRSLLRKRGYRVLELCYNSYSDKKRDQLYEGILTGLGKQ*
Ga0066679_1045264913300005176SoilLRTLLRKRGYKVMELVYGSYSDKKRDELYQEIVMQLGKE*
Ga0066679_1057079623300005176SoilTAQMMKDEELRTLLRKKGYRVLEPYYDSYSDKKRDQLYEEILTNLARI*
Ga0066685_1086442523300005180SoilMAKDEEPRTLLRKRGYKVLELIYRSYSDKKRDELYQEIIKELGRE*
Ga0066689_1003783813300005447SoilMAKDEELRTLLRKRGYRILELFYESYSDKKRDQLYEEIQNSLAKLGT*
Ga0066682_1041857713300005450SoilSQMTKDEETRTLLRKKGYRILELYYESYSDKKRDQLYREILAAIGMSSPL*
Ga0070699_10005851933300005518Corn, Switchgrass And Miscanthus RhizosphereMMKDEKLRSLLRKRGYRVLELSYTGYSDKKRDELYREIMSQLGK*
Ga0070699_10035228223300005518Corn, Switchgrass And Miscanthus RhizosphereMSQMIRDGETRSLLRKRGYRALELSYTGYTDKKRDELCREIMSHLGK*
Ga0066701_10002824113300005552SoilKDEELRTLLRKRGYRILELVYGSYTDKKGDELYREILNGLG*
Ga0066701_1010899213300005552SoilDEELRLLLRKRGYRILELYHDSYSDKKRDELYNEIRNGLRIARS*
Ga0066661_1083691213300005554SoilDEELRGLLRKRGYRILELYYDSYSDKKRDELYEEMMNSLGRE*
Ga0066707_1088263423300005556SoilKDEELRSLLRKRGYRVLELRYDSYSDKKRDQLYEQILSDLRRT*
Ga0066698_1033953813300005558SoilMSKDEELRTLLRKRGYRTLELFYESYSDKKRDQLYEEIQNSLAKLAT*
Ga0066703_1074158223300005568SoilHLNTAQMMKDEELRTLLRKKGYRVLEPYYDSYSDKKRDQLYEEILTNLARI*
Ga0066703_1086483813300005568SoilVKDEELRSLLRKRGYRIIELFYDSYSDKKRDELYEQILSDLNRR*
Ga0066708_1024015213300005576SoilQMVKDEELRSLLRKRGYRILKLSYSSYTDKKRDELYREILNGLGK*
Ga0066691_1027909423300005586SoilMAKDEEVRSLLRKRGYRVLELTYNSYSDKKRDDLYEEISNSL
Ga0066656_1003114013300006034SoilTLLRKRGYKVMELIYRSYSDKKRDELYQEIVTQLGRE*
Ga0066665_1114743823300006796SoilHLARAQMIRDKETRLLLRKRGYGILELSYGSYTDKKRDRLYREILNGLGK*
Ga0099791_1013325623300007255Vadose Zone SoilMKDEELRTLLRKRGYRVLELSYTGYTDKKRDELYREILNGLGK*
Ga0099794_1000572343300007265Vadose Zone SoilMVKDEELRSLLRKRGYRVLELLYIKYSEKKRDQLYEEILDNLVGSGAIH*
Ga0066710_100003221123300009012Grasslands SoilMAKDEELRTLLRKRGYKVVELVYRSYSGKKRDELYQEIVKQLGRE
Ga0066710_10257800213300009012Grasslands SoilDEELRTLLRKRGYRVLELSYTGYTDKKRDELYRELVERLGRE
Ga0099829_1105131223300009038Vadose Zone SoilLRTLLRKRGYRVLELSYSSYSDKKRDKLYREMVSALGK*
Ga0099830_1170882913300009088Vadose Zone SoilMSQMKDEELRTILRKGGYRILELSYSSYSDKRRDELYR
Ga0099828_1008687953300009089Vadose Zone SoilMSQMKDEELRTILRKGGYRILELSYSSYSDKKRDELYREIMSGLGK*
Ga0099827_1005258333300009090Vadose Zone SoilMSQMKDEELRTILRKRGYRILEPSYSSYSDKKRDELYREIMSGLGK*
Ga0066709_10010670423300009137Grasslands SoilMAKDEELRTLLRNRGYKVMELVYRSYSDKKRDELFQEIVKQLGREESG*
Ga0066709_10176410613300009137Grasslands SoilMMKDEELRTLLRKRGYRILELRYSSYSDKKRDELYREIMSGLGK*
Ga0066709_10245736923300009137Grasslands SoilDEELRTLLRKRGYRILELTYDNYSDKKRDELYEEILDSLGREI*
Ga0105066_104809033300009822Groundwater SandVKDEELRSLLRKRGYRVLELYYNSYSNKKRDQLYSEILNSLGREYP*
Ga0134088_1010543243300010304Grasslands SoilAQMAKDEELSSLLRKRGYRILELCYNNYSDKKRDELYKEIRHSLHS*
Ga0134088_1019598223300010304Grasslands SoilLKTSQMMKDEELRTLLRKRGYGILELSYGSYTDKKRDRLYREILNGLGK*
Ga0134111_1040135913300010329Grasslands SoilEELRTLLRKRGYKVMELIYRNYSDKKRDELYQELVQQLGREQGGNT*
Ga0134111_1047557513300010329Grasslands SoilAQMAKDEELRSLLRKRGYRILELFYDSYSDKKRDQLYEEIRKGLHSSSS*
Ga0134111_1055428623300010329Grasslands SoilMAKDEELRTLLRKRGYKVMELIYRSYSDKKRDELYEEIVTQLGRE*
Ga0134080_1003066313300010333Grasslands SoilKDEETRTLLRKKGYRILELSYSNYSDKKRDQLYQDILAGLGREL*
Ga0134080_1016867923300010333Grasslands SoilDEELRSLLRKRGYRILELFYDSYSDKKRDMMYEQILYDLGTE*
Ga0134080_1023828713300010333Grasslands SoilKDEELRTLLRKRGYRILELYYNSYSDKKRDQLFKELLDNLGKL*
Ga0126383_1252795723300010398Tropical Forest SoilMKDEEVRSLLRKKGYKVLEPYDESYSEKKRHELYLEIIKTLGRE*
Ga0137393_1092127313300011271Vadose Zone SoilTSQMTKDEETRTLLRKKGYRILELPYDSYSDKKRDELYQEILDSLGTNEVD*
Ga0137388_1103989623300012189Vadose Zone SoilHLKTAQMMKDEELRTLLRKRGYRVLELSYSSYTDKKTDELYREILNGLGK*
Ga0137364_1099849123300012198Vadose Zone SoilELRTLLRKRGYKIVELAYTDYSDKKRDELFQQLLAGLGQE*
Ga0137399_1000046553300012203Vadose Zone SoilMVKDEELRSLLRKRGYRVLELLYIKYSEKKRDQLYEEILDNLVGSGPIR*
Ga0137399_1044960923300012203Vadose Zone SoilMVKDEELRTLLRKRGYRIIELVYSSYTDKKRDELYREILNGLGK*
Ga0137374_1102641913300012204Vadose Zone SoilRTLLRKRGYKVMELIYRSYSDKKRDALYQEIVRQLERK*
Ga0137362_1113357513300012205Vadose Zone SoilMVKDEELRALLRKRGYRVLELSYSGYSDKKRDELYEEILNGRGRSLAAR
Ga0137380_1001817513300012206Vadose Zone SoilLRTLLRKRGYKVMELVYHNYSDKKRDELYQEIVKELGRE*
Ga0137380_1004740213300012206Vadose Zone SoilQMAKDGELRILLRKKGYRVLELYYENYTDKKRDQLYHEILDKLGME*
Ga0137380_1033892313300012206Vadose Zone SoilMSKDEELRTLLRKRGYRILEIFYESYSDKKRDQLYEEIQNSLAKLGT*
Ga0137380_1051230133300012206Vadose Zone SoilMMKDEELRTLLRKRGYRVLELNYTGYTDKKRDELYREIVKQLGRE*
Ga0137380_1059183813300012206Vadose Zone SoilQMMKDEELRILLRKRGYRVLELYYDSYSNLKRDKLYRQLLGSLGKE*
Ga0137380_1146971523300012206Vadose Zone SoilKDEELRTLLRNRGYMVLELYYASYSDKKRDELYREILTSLGRS*
Ga0137381_1041883813300012207Vadose Zone SoilPHLRTIQMAKDEELRTLLRKRGYKVMELIYNSYSDKKRDELYQEILTQLGRE*
Ga0137381_1049107933300012207Vadose Zone SoilGLPHLRTIQMAKDEELRTLLRKRGYEVMELIYLSYSDKKRDELYQEIVKQLGRE*
Ga0137381_1070973533300012207Vadose Zone SoilQMAKDEELRSLLRKRGYRILELCYECYSDKKRDELYEGILSGLGNQ*
Ga0137379_1088649713300012209Vadose Zone SoilKDEELRSLLRKRGYRILELCYECYSDKKRDELYEGILSGLGNQ*
Ga0137379_1131603613300012209Vadose Zone SoilRSLLRRRGYRILELFYESYSDKKRDHLYEEILASLNRLTR*
Ga0137379_1132722023300012209Vadose Zone SoilRKKGYRVLELFYHNYSDKKRDELYEEILASLGRAGLW*
Ga0137379_1171634023300012209Vadose Zone SoilRTLLRKRGYRVLELSYTGYTDKKRDELYREILNGLGK*
Ga0137378_1047471413300012210Vadose Zone SoilIQMAKDEELRTLLRKRGYKVMELIYRSYSDKKRDELYQELVEQLGRE*
Ga0137378_1049939623300012210Vadose Zone SoilMMKDEELRTLLRKRGYRVLELSYTGYTDKKRDEFYREIANGLGK*
Ga0137378_1103151713300012210Vadose Zone SoilEELRSLLRKRGYRILELRYDRYSDKKRDKLYEEILEGLGIG*
Ga0137378_1174358423300012210Vadose Zone SoilQMVKDEELRTLLRKRGYRILELNYDSYSNKRRDELYQEILSNLGKE*
Ga0137387_1003409123300012349Vadose Zone SoilMSKDEELRTLLRKRGYRILEIFYESYSDKKRDQLYEEIQNSLAKLATSRV*
Ga0137387_1003429113300012349Vadose Zone SoilMKDEELRSLLLKRGYRILELSYNNYSDSKRDQLYEEIQNSLAKLTH*
Ga0137387_1132943123300012349Vadose Zone SoilQMIKDEELRTLLRKRGYRVVELGYANYSDKKRDELYQQLLSSLGRE*
Ga0137386_1001758443300012351Vadose Zone SoilMAKDEELRTLLRKRGYKVMELVYHNYSDKKRDELYQEIVKELGRE*
Ga0137386_1006217053300012351Vadose Zone SoilTEKDEELRTLLRKRGYRVLELCYDNYSDKKRDELCEEILANQGTN*
Ga0137386_1008008633300012351Vadose Zone SoilMRAIQMAKDEELRTLLRKRGYKIMELVYRSYSDKKRDELYQEIVKQLGRE*
Ga0137386_1025183313300012351Vadose Zone SoilQMMKDEELRTLLRKRGYRILEVSYSSYSDKKRDELYREILNGLGKQ*
Ga0137386_1055593023300012351Vadose Zone SoilMMKDEELRTLLRRRGYRVLELSYTGYTDKKRDEFYREIMNGLGK*
Ga0137366_1112695213300012354Vadose Zone SoilKDEELRTLLRKRGYRILELSYSSYTDRKRDELYQEIFQQLGRE*
Ga0137369_1019566313300012355Vadose Zone SoilRSLLRKRGYRILELVYSSYTDKKRDELYQEIFQQLGRE*
Ga0137384_1001854413300012357Vadose Zone SoilEELRSLLRKRGYRILELSYDYYSDKKRDELYEEVLNSLNRR*
Ga0137384_1145432513300012357Vadose Zone SoilVMKDEELRTLLRKKGHTLLELCYDNYSDKKRDQLYDEILANLGII*
Ga0137385_1003334833300012359Vadose Zone SoilMSKDEELRTLLRKRGYRILEIFYESYSDKKRDQLYEEIQNSLA
Ga0137385_1042647533300012359Vadose Zone SoilQMTKDEETRTLLRKRGYRVLELRYNDYSDKKRDQLYEELLDNLGKEYPVNSEN*
Ga0137385_1065897313300012359Vadose Zone SoilDEELRTLLRKRGYRILEIFYESYSDKKRDQLYEEIQNSLAKLGT*
Ga0137385_1106152513300012359Vadose Zone SoilDEELRTLLRKRGYRILEIFYESYSDKKRDQLYEEIQNSLAKLATSRV*
Ga0137361_1081861313300012362Vadose Zone SoilMMKDEELWPLLRKRGYRILELSYSSYTDKKRDEFYREIMNGLGK*
Ga0137419_1060390813300012925Vadose Zone SoilVKDEELRTLLRKRGYTILELSYSSYTDKKRDELYREILNGIGK*
Ga0137416_1099860633300012927Vadose Zone SoilMKDEELRTLLRKRGYRVLELSYTGYTDKKRDELCREILNGLGR*
Ga0137407_1033752033300012930Vadose Zone SoilMKGEELRTLLRNKGYRLLELCYDNYSDKKRDELYDEILANLGII*
Ga0134087_1050137113300012977Grasslands SoilRSLLRKRGYRILELYYDSYSGQKRDELYEQILDDLRRK*
Ga0134069_130996313300017654Grasslands SoilSRIVKDEELRSLLRKRGYRVSELCYESYSDKIRDHLYEEIRNSLAKLAT
Ga0066655_1077874113300018431Grasslands SoilEELRTLLRKRGYRILELVYGSYTDKKGDALYREILNGLG
Ga0066667_1093892423300018433Grasslands SoilMAKDGELRTLLRKRGYKVIELVYRSYSGKKRDELYQEIVKQLGR
Ga0066662_1004965943300018468Grasslands SoilMMKDEELWTLLRKRGYRILELSYSSYTNKKGDELFREIVNKVGRE
Ga0066662_1101865113300018468Grasslands SoilMAKDEELRSLLRKRGYRVLELCYNSYSDKARDRLYQQVLNDLR
Ga0066662_1177024913300018468Grasslands SoilRTKNYRSLLRKRGYRILELSYSSYSDKKRNELYREILNGLGK
Ga0066662_1217569213300018468Grasslands SoilMAKDEELRTLLRKRGYKVMELIYRSYSDKKRDELYQEIVKQLGRE
Ga0209324_1028464623300025174SoilGKDEELRSLLRKRGYRVLELYYDNYSEKKRDKLYEEILDGLGKE
Ga0209431_1052383413300025313SoilKDEELRSLLRKRGYRVLELYYNSYSDKKRDQLYGEILNGLGKE
Ga0209234_102357933300026295Grasslands SoilMAKDEELRTLLRNRGYKVMELVYRSYSDKKRDELFQEIVKQLGREESG
Ga0209235_116599713300026296Grasslands SoilKTSQMTKDEETRTLLRKKGYRILELYYSNYSDKKRDQLYQDILAGLGRES
Ga0209236_106809833300026298Grasslands SoilAQMMKDEELRTLLRKKGYRVLELSYTGYSDKKRDELYGEIMSHIGR
Ga0209055_112716633300026309SoilEELRTLLLKKGYRLLELFYDNYSDKKRDELYDEILANLGII
Ga0209761_102943953300026313Grasslands SoilMKDEELRTLLRKKGYRLLELCYDNYSDKKRDELYEEILANLGMT
Ga0209761_105266413300026313Grasslands SoilDEELRTLLRKRGYRILELSYGSYTDKKRDELYREILNGLGK
Ga0209154_113687323300026317SoilMSKDEELRTLLRKRGYRILELFYESYSDKKRDQLYEEIQNSLAKLGT
Ga0209154_128394613300026317SoilMSKDEELRTLLRKRGYRILELFYESYSDKKRDQLYEEIQNSL
Ga0209472_106147113300026323SoilEKSQMLKDEELRSLLRKRGYRVLELSYNSYSDKKRDELHEEIRSGLARLAS
Ga0209802_108598113300026328SoilMAKDEELRTLLRKRGYRILELFYESYSDKKRDQLYEEIQNSLAKLGT
Ga0209056_1003734363300026538SoilDEELRTILRKRGYRILELVYSSYSDKKRDELYREVLNGLGK
Ga0209161_1006244843300026548SoilQMTKDEELRSLLRKRGYRILELFYDNYSDKKRDQLYQQVLDDLR
Ga0209388_101359433300027655Vadose Zone SoilMKDEELRTLLRKRGYRVLELSYTGYTDKKRDELYREILNGLGK
Ga0209283_1074015213300027875Vadose Zone SoilKDEELRTLLRKRGYQILELYYSSYSEKKRDELYQEIVKQLGRE
Ga0209590_1002803613300027882Vadose Zone SoilMSQMKDEELRTILRKRGYRILELSYSSYSDKKRDELYREIMSGLGK
Ga0137415_1008678943300028536Vadose Zone SoilMKDEELRTLLRKRGYRVLELSYTGYTDKKRDELCREILNGLGR
Ga0307471_10155658823300032180Hardwood Forest SoilMMKDEELRGILRKRGYRILELYYDSYSGKKRDELYNEILNGLGKENSGS
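
Each row of the Family Sequences table runs its four columns together without delimiters. The sketch below splits such a row with a regular expression. It rests on two assumptions drawn from this page only: every Sample Taxon ID is a 10-digit number starting with 3300, and the habitat is one of the names in the Associated Habitat Types table. The page does not explain the trailing "*" that some sequences carry, so the sketch records its presence without interpreting it. All names in the code are illustrative.

```python
import re

# Habitat names from the Associated Habitat Types table, longest first so
# that "Grasslands Soil" is tried before the shorter "Soil".
HABITATS = [
    "Corn, Switchgrass And Miscanthus Rhizosphere",
    "Tropical Forest Soil",
    "Hardwood Forest Soil",
    "Vadose Zone Soil",
    "Grasslands Soil",
    "Groundwater Sand",
    "Soil",
]

# Assumption: taxon IDs are 10 digits beginning with 3300, as on this page.
ROW_PATTERN = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>" + "|".join(re.escape(name) for name in HABITATS) + r")"
    r"(?P<sequence>[A-Z]+)(?P<star>\*?)$"
)


def parse_sequence_row(row: str) -> dict:
    """Split one fused table row into its four columns."""
    match = ROW_PATTERN.match(row.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {row!r}")
    fields = match.groupdict()
    # Keep the trailing "*" as a flag; its meaning is not stated on the page.
    fields["has_trailing_star"] = fields.pop("star") == "*"
    return fields


row = ("JGI25385J37094_1016660813300002558Grasslands Soil"
       "DEELRSLLRKRGYRILELSYGSYTDKKRDELYREILNGLGK*")
print(parse_sequence_row(row))
```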




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice