NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102933

Metagenome Family F102933

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102933
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 43 residues
Representative Sequence RARAERSFSHLVMAEEYVRMYRSVLETGTLPPGRPTP
Number of Associated Samples 81
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 3.00 %
% of genes near scaffold ends (potentially truncated) 97.03 %
% of genes from short scaffolds (< 2000 bps) 80.20 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score0.62

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.010 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(26.733 % of family members)
Environment Ontology (ENVO) Unclassified
(56.436 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(61.386 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 36.92%    β-sheet: 0.00%    Coil/Unstructured: 63.08%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.62
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF13524Glyco_trans_1_2 44.55
PF13692Glyco_trans_1_4 7.92
PF00005ABC_tran 4.95
PF00664ABC_membrane 3.96
PF09594GT87 3.96
PF13439Glyco_transf_4 2.97
PF12996DUF3880 1.98
PF13489Methyltransf_23 1.98
PF00534Glycos_transf_1 1.98
PF05685Uma2 0.99
PF08665PglZ 0.99
PF05050Methyltransf_21 0.99
PF09557DUF2382 0.99
PF13231PMT_2 0.99
PF01797Y1_Tnp 0.99
PF03901Glyco_transf_22 0.99
PF01075Glyco_transf_9 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG0859ADP-heptose:LPS heptosyltransferaseCell wall/membrane/envelope biogenesis [M] 0.99
COG1943REP element-mobilizing transposase RayTMobilome: prophages, transposons [X] 0.99
COG4636Endonuclease, Uma2 family (restriction endonuclease fold)General function prediction only [R] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.01 %
UnclassifiedrootN/A0.99 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002560|JGI25383J37093_10032197All Organisms → cellular organisms → Bacteria1749Open in IMG/M
3300002562|JGI25382J37095_10133772All Organisms → cellular organisms → Bacteria829Open in IMG/M
3300002915|JGI25387J43893_1050591All Organisms → cellular organisms → Bacteria587Open in IMG/M
3300002916|JGI25389J43894_1073599All Organisms → cellular organisms → Bacteria593Open in IMG/M
3300005172|Ga0066683_10364283All Organisms → cellular organisms → Bacteria896Open in IMG/M
3300005175|Ga0066673_10028595All Organisms → cellular organisms → Bacteria2655Open in IMG/M
3300005179|Ga0066684_10047842All Organisms → cellular organisms → Bacteria2435Open in IMG/M
3300005181|Ga0066678_10013984All Organisms → cellular organisms → Bacteria4021Open in IMG/M
3300005186|Ga0066676_10037272All Organisms → cellular organisms → Bacteria2688Open in IMG/M
3300005336|Ga0070680_101936579All Organisms → cellular organisms → Bacteria511Open in IMG/M
3300005447|Ga0066689_10057163All Organisms → cellular organisms → Bacteria2115Open in IMG/M
3300005471|Ga0070698_100871832All Organisms → cellular organisms → Bacteria845Open in IMG/M
3300005529|Ga0070741_10575182All Organisms → cellular organisms → Bacteria1009Open in IMG/M
3300005545|Ga0070695_101801534All Organisms → cellular organisms → Bacteria513Open in IMG/M
3300005555|Ga0066692_10517065All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300005559|Ga0066700_10492419All Organisms → cellular organisms → Bacteria856Open in IMG/M
3300005560|Ga0066670_10079607All Organisms → cellular organisms → Bacteria1784Open in IMG/M
3300005568|Ga0066703_10087905All Organisms → cellular organisms → Bacteria1808Open in IMG/M
3300005569|Ga0066705_10032282All Organisms → cellular organisms → Bacteria2825Open in IMG/M
3300005569|Ga0066705_10566554All Organisms → cellular organisms → Bacteria702Open in IMG/M
3300005833|Ga0074472_10775229All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium793Open in IMG/M
3300005836|Ga0074470_10858142All Organisms → cellular organisms → Bacteria3729Open in IMG/M
3300006796|Ga0066665_10167257All Organisms → cellular organisms → Bacteria1682Open in IMG/M
3300006796|Ga0066665_10234765All Organisms → cellular organisms → Bacteria1442Open in IMG/M
3300006796|Ga0066665_10258463All Organisms → cellular organisms → Bacteria1382Open in IMG/M
3300006797|Ga0066659_10804179All Organisms → cellular organisms → Bacteria778Open in IMG/M
3300006914|Ga0075436_100211620All Organisms → cellular organisms → Bacteria1374Open in IMG/M
3300006914|Ga0075436_101210553All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300006954|Ga0079219_12033164All Organisms → cellular organisms → Bacteria548Open in IMG/M
3300009012|Ga0066710_100993177All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1295Open in IMG/M
3300009012|Ga0066710_101206960All Organisms → cellular organisms → Bacteria1172Open in IMG/M
3300009012|Ga0066710_102136883All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium821Open in IMG/M
3300009012|Ga0066710_104312611All Organisms → cellular organisms → Bacteria531Open in IMG/M
3300009137|Ga0066709_100299194All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes2184Open in IMG/M
3300009137|Ga0066709_100891829All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1295Open in IMG/M
3300009137|Ga0066709_102413336All Organisms → cellular organisms → Bacteria714Open in IMG/M
3300009143|Ga0099792_10315650All Organisms → cellular organisms → Bacteria932Open in IMG/M
3300009147|Ga0114129_12055240All Organisms → cellular organisms → Bacteria690Open in IMG/M
3300009162|Ga0075423_10942567All Organisms → cellular organisms → Bacteria916Open in IMG/M
3300009162|Ga0075423_11495584All Organisms → cellular organisms → Bacteria724Open in IMG/M
3300010301|Ga0134070_10177935All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium772Open in IMG/M
3300010301|Ga0134070_10292899All Organisms → cellular organisms → Bacteria619Open in IMG/M
3300010333|Ga0134080_10016352All Organisms → cellular organisms → Bacteria2709Open in IMG/M
3300010336|Ga0134071_10076497All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1561Open in IMG/M
3300010337|Ga0134062_10603734All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium565Open in IMG/M
3300010362|Ga0126377_10093288All Organisms → cellular organisms → Bacteria2729Open in IMG/M
3300012198|Ga0137364_10420585All Organisms → cellular organisms → Bacteria1001Open in IMG/M
3300012201|Ga0137365_10352220All Organisms → cellular organisms → Bacteria1088Open in IMG/M
3300012353|Ga0137367_10458100All Organisms → cellular organisms → Bacteria900Open in IMG/M
3300012355|Ga0137369_10775168All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium655Open in IMG/M
3300012355|Ga0137369_10985638All Organisms → cellular organisms → Bacteria560Open in IMG/M
3300012358|Ga0137368_10108674All Organisms → cellular organisms → Bacteria2132Open in IMG/M
3300012359|Ga0137385_10337577All Organisms → cellular organisms → Bacteria1291Open in IMG/M
3300012359|Ga0137385_11013650All Organisms → cellular organisms → Bacteria685Open in IMG/M
3300012361|Ga0137360_11416747All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium598Open in IMG/M
3300012361|Ga0137360_11760829All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium525Open in IMG/M
3300012532|Ga0137373_10015841All Organisms → cellular organisms → Bacteria → Proteobacteria7664Open in IMG/M
3300012927|Ga0137416_10074060All Organisms → cellular organisms → Bacteria2481Open in IMG/M
3300012944|Ga0137410_10005089All Organisms → cellular organisms → Bacteria8958Open in IMG/M
3300012944|Ga0137410_10856635All Organisms → cellular organisms → Bacteria766Open in IMG/M
3300012976|Ga0134076_10128311All Organisms → cellular organisms → Bacteria1025Open in IMG/M
3300014157|Ga0134078_10117797All Organisms → cellular organisms → Bacteria1012Open in IMG/M
3300015356|Ga0134073_10213763All Organisms → cellular organisms → Bacteria647Open in IMG/M
3300015359|Ga0134085_10073048All Organisms → cellular organisms → Bacteria1397Open in IMG/M
3300015359|Ga0134085_10412171All Organisms → cellular organisms → Bacteria608Open in IMG/M
3300017656|Ga0134112_10194260All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium792Open in IMG/M
3300017656|Ga0134112_10222785All Organisms → cellular organisms → Bacteria742Open in IMG/M
3300017927|Ga0187824_10281390All Organisms → cellular organisms → Bacteria584Open in IMG/M
3300018468|Ga0066662_10362840All Organisms → cellular organisms → Bacteria1253Open in IMG/M
3300019878|Ga0193715_1008308All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes2223Open in IMG/M
3300020010|Ga0193749_1002446All Organisms → cellular organisms → Bacteria3495Open in IMG/M
3300025922|Ga0207646_11603956All Organisms → cellular organisms → Bacteria561Open in IMG/M
3300026277|Ga0209350_1159952All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium510Open in IMG/M
3300026296|Ga0209235_1261311All Organisms → cellular organisms → Bacteria532Open in IMG/M
3300026297|Ga0209237_1122531All Organisms → cellular organisms → Bacteria1089Open in IMG/M
3300026297|Ga0209237_1250952All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300026298|Ga0209236_1101927All Organisms → cellular organisms → Bacteria1290Open in IMG/M
3300026308|Ga0209265_1038226All Organisms → cellular organisms → Bacteria1485Open in IMG/M
3300026313|Ga0209761_1284060All Organisms → cellular organisms → Bacteria589Open in IMG/M
3300026325|Ga0209152_10048850All Organisms → cellular organisms → Bacteria1482Open in IMG/M
3300026332|Ga0209803_1151387All Organisms → cellular organisms → Bacteria897Open in IMG/M
3300026334|Ga0209377_1191051All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium697Open in IMG/M
3300026335|Ga0209804_1139031All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1093Open in IMG/M
3300026343|Ga0209159_1110434All Organisms → cellular organisms → Bacteria1180Open in IMG/M
3300026530|Ga0209807_1021455All Organisms → cellular organisms → Bacteria3142Open in IMG/M
3300026530|Ga0209807_1143387All Organisms → cellular organisms → Bacteria926Open in IMG/M
3300026540|Ga0209376_1380197All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium529Open in IMG/M
3300026542|Ga0209805_1195734All Organisms → cellular organisms → Bacteria876Open in IMG/M
3300026542|Ga0209805_1344432All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium567Open in IMG/M
3300026548|Ga0209161_10119851All Organisms → cellular organisms → Bacteria1558Open in IMG/M
3300027875|Ga0209283_10030389All Organisms → cellular organisms → Bacteria3367Open in IMG/M
3300027903|Ga0209488_11149847All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium526Open in IMG/M
3300031753|Ga0307477_10450453All Organisms → cellular organisms → Bacteria878Open in IMG/M
3300031949|Ga0214473_11770737All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium612Open in IMG/M
3300032180|Ga0307471_100220404All Organisms → cellular organisms → Bacteria1920Open in IMG/M
3300032783|Ga0335079_11651891All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium628Open in IMG/M
3300032897|Ga0335071_10536911All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus1122Open in IMG/M
3300033502|Ga0326731_1006059All Organisms → cellular organisms → Bacteria3120Open in IMG/M
3300033803|Ga0314862_0159791All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium549Open in IMG/M
3300033813|Ga0364928_0058957All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium856Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil26.73%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil17.82%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil16.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.88%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.95%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.97%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)1.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.98%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.98%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.99%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.99%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.99%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.99%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland0.99%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.99%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.99%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002915Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005833Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.174_CBKEnvironmentalOpen in IMG/M
3300005836Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBBEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2EnvironmentalOpen in IMG/M
3300020010Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s2EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026308Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032897Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.5EnvironmentalOpen in IMG/M
3300033502Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fractionEnvironmentalOpen in IMG/M
3300033803Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_0_10EnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1003219733300002560Grasslands SoilLAEWDPHACRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP*
JGI25382J37095_1013377223300002562Grasslands SoilACRAHAERYFTHITMAEQYVSLYRNLLATGALGPGRPTPYTIM*
JGI25387J43893_105059113300002915Grasslands SoilIETRSAAACRTHAERYFSHITMAEQYVRLYRNLLATGALGPGQSIPYTTS*
JGI25389J43894_107359913300002916Grasslands SoilNPHACRARAERYFSHLVMAEEYVRMYRSLLDTGKLPPGRPTP*
Ga0066683_1036428323300005172SoilWDPHACRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP*
Ga0066673_1002859543300005175SoilRSPEACRAHAQRFFTHIVMAEEYLRMYRHLIETGGLPSGRSTPSVPA*
Ga0066684_1004784213300005179SoilYACRAHAERHFSHLVMTQEYLRMYRGLLETGRLPPGRPAGHLARTG*
Ga0066678_1001398413300005181SoilDPEACRAHAERNFSHLVMAQEYERMYHAVLETGTLPPGRPAPHLATAG*
Ga0066676_1003727253300005186SoilRSPQACRAFAERQFTHLVMAEEYVRMYRCLLDTGKLPPGRTLAPPAA*
Ga0070680_10193657913300005336Corn RhizosphereRDPAACRAYAEQYFSHVVMAEEYLRMYRSLLDTGSLPPGRVTPHAPTATSP*
Ga0066689_1005716333300005447SoilLRGRLADWDPRACRARAERSFSHLVMAEEYVRMYRSVLEIGTLPPGRPTP*
Ga0070698_10087183213300005471Corn, Switchgrass And Miscanthus RhizosphereCRAHAARFFTHAVMAEEYVRVYGHLLATGTLPPGRPTPYAPA*
Ga0070741_1057518213300005529Surface SoilPLACRAHAERHFTHIVMATEYERMYRHLLDTGTLPAGMATSGA*
Ga0070695_10180153413300005545Corn, Switchgrass And Miscanthus RhizosphereLADWDPRACRARAERSFSHLVMADEYLRMYRAVLETGTLPPGRPTA*
Ga0066692_1051706513300005555SoilRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP*
Ga0066700_1049241913300005559SoilARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP*
Ga0066670_1007960713300005560SoilQTRSPAVCRAHAERYFSHRVMAQEYVRMYRSLLETGRLPAGRAMPHLATAG*
Ga0066703_1008790533300005568SoilFFTHAVMAEEYVRVYGHLLATGTLPPGRPTPYAPA*
Ga0066705_1003228213300005569SoilTRDPSACRAHAERYFTHLVMAEEYTRLYRRLLETGRLPPGRPAPHLATAG*
Ga0066705_1056655423300005569SoilCRAHAERHFSHLVMAQEYLRMYRGLLEAGRLPPGRPAPHLATAG*
Ga0074472_1077522913300005833Sediment (Intertidal)QKDPHACRARAERYFSHLVMADAYVRCYRHFLAEGRLPEGVSSEQ*
Ga0074470_1085814243300005836Sediment (Intertidal)RRPETCRAHAERYFTQRVMAEEYVRVYRHLAETGEVPAGRATPWAP*
Ga0066665_1016725733300006796SoilALRGRLADWDPRACRARAERSFSHLVMAEEYVRMYRSVLEIGTLPPGRPTP*
Ga0066665_1023476533300006796SoilALRGRLADWDPRACRARAERSFSHLVMAEEYVRMYRSVLETGTLPPGRSTP*
Ga0066665_1025846313300006796SoilCRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP*
Ga0066659_1080417923300006797SoilEWDPHACRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP*
Ga0075436_10021162033300006914Populus RhizosphereTRSPEACRAHATRYFSHLVMAAEYVRLYRHLIAHGELPPGRPTPYTTT*
Ga0075436_10121055323300006914Populus RhizosphereAEACRAHAERYFSHVRMAEEYVRVYRHLIATGTLPPGRATPYTTT*
Ga0079219_1203316423300006954Agricultural SoilRAHAARYFTHAVMAEEYVRVYGHLLATGTLPPGRPTPYAPA*
Ga0066710_10099317723300009012Grasslands SoilALRGRLADWDPRACRARAERSFSHLVMAEEYVRMYRSVLETGTLPPGRSTP
Ga0066710_10120696023300009012Grasslands SoilRGRLAEWDPHACRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP
Ga0066710_10213688323300009012Grasslands SoilGACRAHAERFFTHLVMAGEYLRMYRHVLETGTLPAGRTTSFVAA
Ga0066710_10431261123300009012Grasslands SoilRGRLAEWDPHACRARAERSFSHLVMTQEYVRMYRAYLETGGLPPGRPTP
Ga0066709_10029919413300009137Grasslands SoilACRALAERRFTHVVMAEEYVRMYRCLLDTGKLAPGRPVAGPNGS*
Ga0066709_10089182923300009137Grasslands SoilALRGRLADWDPRACRARAERSFSHLVMAEEYVRMYRSVLETGTLPPGRPTP*
Ga0066709_10241333623300009137Grasslands SoilDPHACRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP*
Ga0099792_1031565023300009143Vadose Zone SoilRFFTHAVMAEEYVRLYGHLLATGTLPPGKPTPGAPA*
Ga0114129_1205524013300009147Populus RhizosphereGRLADWDPRACRARAERSFSHLVMADEYVRMYRAVLETGTLPPGRATA*
Ga0075423_1094256713300009162Populus RhizosphereHAARFFTHAVMAEEYVRLYGHLLATGTLPPGKPTPGAPA*
Ga0075423_1149558413300009162Populus RhizosphereERYFSHLAMAEEYVRMYRALLDTGKLPPGRPTAG*
Ga0134070_1017793523300010301Grasslands SoilRARAERSFSHLVMAEEYVRMYRSVLETGTLPPGRPTP*
Ga0134070_1029289913300010301Grasslands SoilGRLADWDPRACRARAERSFSHLVMTSEYVRMYRSLLESGTLPPGRPTP*
Ga0134080_1001635243300010333Grasslands SoilACRARAERLFTHVRMAEEYVRMYDALKKKGTLPPGRTVD*
Ga0134071_1007649733300010336Grasslands SoilRAERYFSHLVMAEEYVRMYRSLLDTGKLPPGRPTP*
Ga0134062_1060373413300010337Grasslands SoilHTRNPHACRARAERYFSHLVMAEEYVRMYRSLLDTGKLPPGRPTP*
Ga0126377_1009328843300010362Tropical Forest SoilRDPLACRAHAERYFTHVVMADEYVRMYRQLLASGTLPAGRATPAA*
Ga0137364_1042058523300012198Vadose Zone SoilRFFTHAVMAEEYVRVYGHLLATGTLPPGRPTPNAPA*
Ga0137365_1035222023300012201Vadose Zone SoilTRSPQACRTLAERQFTHLVMAQEYVRMYRCLLDTGKLPPGRPTAAA*
Ga0137367_1045810013300012353Vadose Zone SoilVRRARAERFFTHVVMAEEYLRMYRHLLETGTLPPGRVTPCAPT*
Ga0137369_1077516813300012355Vadose Zone SoilVRRARAERFFTHVVMAEEYLRMYRHLLETGTLPPGRV
Ga0137369_1098563813300012355Vadose Zone SoilAHAERYFTHLTMAERYLRMYRCLVETGVLPPGEPTPRTAA*
Ga0137368_1010867433300012358Vadose Zone SoilERYFTHLVMAEEYVRMYRAVIETGTLPAGRPTPSLQASS*
Ga0137385_1033757713300012359Vadose Zone SoilRAHAERCFTHVVMAEEYLRMYHHVLDTGVLPPGRPTPGVAQ*
Ga0137385_1101365013300012359Vadose Zone SoilERYFSHHVMAQEYVRMYRSLLETGRLPAGRAMPHLATAG*
Ga0137360_1141674713300012361Vadose Zone SoilFTHWVMAEEYVRVYRAVIETGKLPAGRPTPYASS*
Ga0137360_1176082923300012361Vadose Zone SoilEACRAHATRYFSHVVMAEEYVRVYRHLLAHGELPPGRPTPYTTT*
Ga0137373_1001584123300012532Vadose Zone SoilVRRARAERFFTHVVMAEEYLRMYRHLLETGTLPPGRVAPCAPT*
Ga0137416_1007406013300012927Vadose Zone SoilRAERSFSHLVMADEYLRMYRAVLETGTLPPGRPTA*
Ga0137410_10005089113300012944Vadose Zone SoilRAHAERYFTHRAMAEGYVRAYRAVIETGNPPAGRPTPYASS*
Ga0137410_1085663513300012944Vadose Zone SoilACRARAERSFSHLVMADEYLRMYRAVLETGSLPPGRPTA*
Ga0134076_1012831123300012976Grasslands SoilCRALAERHFTHVAMAEEYLRMYRALLDTGKLPAGRATPHAASAVPR*
Ga0134078_1011779713300014157Grasslands SoilIHTRSPQACRTLAERQFTHLVMAQEYVRMYRCLLDTGKLPPGRPTAAA*
Ga0134073_1021376313300015356Grasslands SoilHAERYFTHLVMAEEYTRLYRRLLETGRLPPGRPAPHLATAG*
Ga0134085_1007304833300015359Grasslands SoilAHAERYFTHLVMAEEYVRMYRAVIETGTLPAGRPTPSLQASS*
Ga0134085_1041217113300015359Grasslands SoilPHACRARAERYFSHLVMAEEYVRMYRSLLDTGKLPPGRPTP*
Ga0134112_1019426013300017656Grasslands SoilAHATRYFTHITMAEEYVRVYHHLIANGALPPGRPTS
Ga0134112_1022278513300017656Grasslands SoilAACRAHAERYFSHVVMAEEYVRMYRALLDTGTLPPGRPTP
Ga0187824_1028139013300017927Freshwater SedimentTRSAAACRAHAERYFTHRVMAAAYVRMYTSILERGALPPGCPTPDAASAG
Ga0066662_1036284013300018468Grasslands SoilTRDPGACRAHAARFFTHAVMAEEYVRLYGQLLATGTLPPGRPTPDAPA
Ga0193715_100830813300019878SoilCRARAERSFSHLVMADEYVRMYRALLATGTLPPGRPTPG
Ga0193749_100244613300020010SoilYFTHRTMAEEYVRVYRAVIETGNPPAGRPTPYASS
Ga0207646_1160395623300025922Corn, Switchgrass And Miscanthus RhizosphereAARFFTHAVMAEEYVRVYGHLLATGTLPPGRPTPYAPA
Ga0209350_115995213300026277Grasslands SoilHTRNPEACRAHAERHFSHLVMAQEYVRMYRSVLETGTLPPGRLAPHLATAR
Ga0209235_126131123300026296Grasslands SoilAETIHTREPHACRARAERYFSHLVMAEEYVRMYRAVLETGTLPPGRPTP
Ga0209237_112253123300026297Grasslands SoilCRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP
Ga0209237_125095213300026297Grasslands SoilERYFSHHVMAQEYVRMYRSLLETGRLPAGRAMPHLATAG
Ga0209236_110192713300026298Grasslands SoilFFTHAVMAEEYVRVYGHLLATGTLPPGRPTPYAPA
Ga0209265_103822633300026308SoilVCRAHAERYFSHHVMAEEYVRMYRSLLETGRLPAGRAMPHLATAG
Ga0209761_128406023300026313Grasslands SoilYFTHLVMAEEYTRLYRRLLETGRLPPGRPAPHLATAG
Ga0209155_112283223300026316SoilIHTRDPAACRARAERHFTHLVMAEEYVRVYGHLLATGTLPPGRPTPNAPA
Ga0209152_1004885033300026325SoilACRALAERRFTHVVMAEEYVRMYRCLLDTGKLAPGRPVAGPTGS
Ga0209803_115138713300026332SoilERRFTHVVMAEEYVRMYRCLLDTGKLAPGRPVAGPNGS
Ga0209377_119105123300026334SoilRNFSHLVMAQEYERMYHAVLETGTLPPGRPAPHLATAG
Ga0209804_113903113300026335SoilEACRAHAERNFSHLVMAQEYVRMYHTLLETGTLPPGRPAPHLATAG
Ga0209159_111043423300026343SoilLRGRLAEWDPHACRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP
Ga0209807_102145543300026530SoilACRAHAERYFTHLVMAEEYTRLYRRLLETGRLPPGRPAPHLATAG
Ga0209807_114338713300026530SoilFSHLVMTEEYLRMYRGLLETGRLPPGRPAGHLARTG
Ga0209376_138019723300026540SoilRSPEACRAHAERYFTHITMAEQYVSLYRNLLATGALGPGRPTPYTIM
Ga0209805_119573423300026542SoilCRALAEQRFTHVVMAEEYVRMYRSLLDTGKLAPGRPATAA
Ga0209805_134443223300026542SoilDPGACRAHAERFFTHLVMAGEYLRMYRHVLETGTLPAGRTTSFVAA
Ga0209161_1011985133300026548SoilACRARAERSFSHLVMTDEYIRMYRSVLETGTLPPGRPTP
Ga0209283_1003038913300027875Vadose Zone SoilHTRDPHACRGRAERYFSHLAMAEEYVRMYRALLDTGNLPPGRPTAG
Ga0209488_1114984723300027903Vadose Zone SoilRAHATRYFSHVVMAEEYVRVYRHLLAHGELPPGRPTPYTTT
Ga0307477_1045045323300031753Hardwood Forest SoilHAERYFSHGVMAAAYVRMYVGLLEQGTLPAGCPTPWAPSAT
Ga0214473_1177073723300031949SoilACRAHAERYFTHRVMAEEYLRMYGAVIAAGTLPPGRPTPYTSS
Ga0307471_10022040413300032180Hardwood Forest SoilAEQYFSHVVMAEEYLRMYRSLLDTGSLPPGRVTPHAPSATSP
Ga0335079_1165189123300032783SoilAHAERYFTHLVMAEEYLRVYRHLIETGSLPAGRPTPYTRV
Ga0335071_1053691113300032897SoilKPEACRAHAERYFSHIVMAEAYVRMYRGLLEAGTLPAGIATPYAP
Ga0326731_100605933300033502Peat SoilAERYFSHLVMAEEYVRFYRGFLETGALPEGRRTAM
Ga0314862_0159791_2_1513300033803PeatlandIDPEACRARAERHFSHLAMATAYVRMYEQLRQTGTLPPGIPVPGSAAAS
Ga0364928_0058957_719_8563300033813SedimentPEACRAHAERYFTHCAMAEEYIRVYRAVIETGALPAGRPTPYAKS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.