NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F075964


Overview

Basic Information
Family ID F075964
Family Type Metagenome / Metatranscriptome
Number of Sequences 118
Average Sequence Length 43 residues
Representative Sequence VIEGATAAAFTVSVAALLVTLPAELLTTTVNCAPLSELVVAGVV
Number of Associated Samples 96
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 20.59 %
% of genes near scaffold ends (potentially truncated) 18.64 %
% of genes from short scaffolds (< 2000 bps) 24.58 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.50


Most Common Taxonomy
Group Unclassified (73.729 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(43.220 % of family members)
Environment Ontology (ENVO) Unclassified
(41.525 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(45.763 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: Yes
Secondary Structure distribution:
α-helix: 45.83%
β-sheet: 0.00%
Coil/Unstructured: 54.17%

Predicted 3D Structure

pTM-score: 0.50

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 115 Family Scaffolds
PF13927 | Ig_3 | 8.70
PF03721 | UDPG_MGDP_dh_N | 3.48
PF07679 | I-set | 2.61
PF13895 | Ig_2 | 1.74
PF04185 | Phosphoesterase | 1.74
PF08530 | PepX_C | 0.87
PF00083 | Sugar_tr | 0.87
PF14532 | Sigma54_activ_2 | 0.87
PF02817 | E3_binding | 0.87
PF00781 | DAGK_cat | 0.87
PF01381 | HTH_3 | 0.87
PF13551 | HTH_29 | 0.87
PF00664 | ABC_membrane | 0.87
PF16656 | Pur_ac_phosph_N | 0.87
PF00872 | Transposase_mut | 0.87
PF02350 | Epimerase_2 | 0.87
PF00012 | HSP70 | 0.87
PF10531 | SLBB | 0.87
PF15780 | ASH | 0.87
PF02965 | Met_synt_B12 | 0.87
PF03480 | DctP | 0.87
PF13561 | adh_short_C2 | 0.87
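
The frequencies above are percentages of the 115 family scaffolds, so each value should correspond to a whole number of scaffolds. The following is a minimal sketch of that conversion, assuming the listed percentages are rounded to two decimals; `scaffold_count` is a hypothetical helper written for illustration and is not part of NMPFamsDB.

```python
TOTAL_SCAFFOLDS = 115  # "Number of Associated Scaffolds" from the Overview


def scaffold_count(percent, total=TOTAL_SCAFFOLDS):
    """Convert a percentage frequency back to the nearest whole scaffold count."""
    return round(percent * total / 100)


# A few rows copied from the Pfam table above.
rows = [
    ("Ig_3", 8.70),
    ("UDPG_MGDP_dh_N", 3.48),
    ("I-set", 2.61),
    ("Ig_2", 1.74),
    ("PepX_C", 0.87),
]

for name, pct in rows:
    print(f"{name}: {pct:.2f} % ~ {scaffold_count(pct)} of {TOTAL_SCAFFOLDS} scaffolds")
```

Read this way, the many 0.87 entries each reflect a single scaffold, so they are weak evidence of a recurring gene neighborhood.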

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 115 Family Scaffolds
COG0240 | Glycerol-3-phosphate dehydrogenase | Energy production and conversion [C] | 3.48
COG0677 | UDP-N-acetyl-D-mannosaminuronate dehydrogenase | Cell wall/membrane/envelope biogenesis [M] | 3.48
COG1004 | UDP-glucose 6-dehydrogenase | Cell wall/membrane/envelope biogenesis [M] | 3.48
COG1250 | 3-hydroxyacyl-CoA dehydrogenase | Lipid transport and metabolism [I] | 3.48
COG1893 | Ketopantoate reductase | Coenzyme transport and metabolism [H] | 3.48
COG1597 | Phosphatidylglycerol kinase, diacylglycerol kinase family | Lipid transport and metabolism [I] | 1.74
COG3511 | Phospholipase C | Cell wall/membrane/envelope biogenesis [M] | 1.74
COG0381 | UDP-N-acetylglucosamine 2-epimerase | Cell wall/membrane/envelope biogenesis [M] | 0.87
COG0443 | Molecular chaperone DnaK (HSP70) | Posttranslational modification, protein turnover, chaperones [O] | 0.87
COG0508 | Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component | Energy production and conversion [C] | 0.87
COG0707 | UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase | Cell wall/membrane/envelope biogenesis [M] | 0.87
COG1410 | Methionine synthase I, cobalamin-binding domain | Amino acid transport and metabolism [E] | 0.87
COG2936 | Predicted acyl esterase | General function prediction only [R] | 0.87
COG3328 | Transposase (or an inactivated derivative) | Mobilome: prophages, transposons [X] | 0.87



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 73.73 %
All Organisms | root | All Organisms | 26.27 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300006755|Ga0079222_10038663 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2116
3300007258|Ga0099793_10449204 | All Organisms → cellular organisms → Bacteria | 637
3300009038|Ga0099829_10666568 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 865
3300009089|Ga0099828_11425352 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 612
3300009137|Ga0066709_102731691 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 657
3300009143|Ga0099792_11244980 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 506
3300009792|Ga0126374_10359876 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1001
3300010043|Ga0126380_10077143 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1914
3300010043|Ga0126380_10077143 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1914
3300010125|Ga0127443_1142346 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 539
3300010321|Ga0134067_10465003 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 518
3300010343|Ga0074044_10793317 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 619
3300010359|Ga0126376_11299093 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 747
3300010360|Ga0126372_10091605 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2252
3300010366|Ga0126379_11144537 | Not Available | 884
3300012189|Ga0137388_11584318 | All Organisms → cellular organisms → Bacteria | 591
3300012189|Ga0137388_11945561 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 517
3300012199|Ga0137383_11023461 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 601
3300012351|Ga0137386_10494241 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 881
3300012359|Ga0137385_11322645 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 583
3300012361|Ga0137360_10974628 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 731
3300012406|Ga0134053_1144131 | Not Available | 1011
3300012918|Ga0137396_10822819 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 683
3300012923|Ga0137359_10649876 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 922
3300014156|Ga0181518_10484601 | Not Available | 587
3300014502|Ga0182021_13130891 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 553
3300015054|Ga0137420_1368431 | All Organisms → cellular organisms → Bacteria | 4493
3300018007|Ga0187805_10569926 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 534
3300021180|Ga0210396_10811809 | All Organisms → cellular organisms → Bacteria | 802
3300026304|Ga0209240_1268632 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 524
3300026555|Ga0179593_1237879 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3429
3300027915|Ga0209069_10684587 | All Organisms → cellular organisms → Bacteria | 600
3300031753|Ga0307477_10688302 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 684
3300031962|Ga0307479_10000099 | All Organisms → cellular organisms → Bacteria | 87386




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 43.22%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.47%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 6.78%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 5.08%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.08%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.39%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.54%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 1.69%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 1.69%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.69%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.85%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.85%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 0.85%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.85%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.85%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.85%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.85%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 0.85%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.85%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.85%
Bog Forest Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil | 0.85%
Fen | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Fen | 0.85%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.85%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.85%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.85%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.85%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.85%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002916 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009521 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaG | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010125 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental
3300010343 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010379 | Sb_50d combined assembly | Environmental
3300010857 | Boreal forest soil eukaryotic communities from Alaska, USA - W1-3 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012019 | Permafrost microbial communities from Nunavut, Canada - A7_5cm_12M | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012390 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012405 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012406 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012882 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300014156 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin01_60_metaG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014164 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin11_30_metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300014502 | Permafrost microbial communities from Stordalen Mire, Sweden - 612E3M metaG (Illumina Assembly) | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300018007 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5 | Environmental
3300018088 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_10_MG | Environmental
3300018090 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_20_MG | Environmental
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022509 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-O (Metagenome Metatranscriptome) (v2) | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025917 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes) | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026314 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes) | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027703 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 81 (SPAdes) | Environmental
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300030058 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Palsa_E3_1 | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25389J43894_10257071 | 3300002916 | Grasslands Soil | VICRAGIEALTVKTPALLVTLPAELLTITVNCAPLSEVLVAGVV*
Ga0062389_1023818943 | 3300004092 | Bog Forest Soil | VIFNVDVTAFTVSMAAVLVALPAELLTVTVNSSPLSEVAVAGVV*
Ga0066690_101755951 | 3300005177 | Soil | VIEGDTGPVVTVRVAALLVTTPDELLTVTVNSAPLSEVTVGGVV*
Ga0066678_105401601 | 3300005181 | Soil | WLAGCIVIDGATAAALTVNMAALLVALPAALLTTTVNCAPLSELVVAGVV*
Ga0070709_109104121 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MIAGATSVALTLSVAVLLVALPAELLATTLNCDPLSDAVVA
Ga0070730_103819372 | 3300005537 | Surface Soil | VIVGATAAAFTVRVAALLVTLPAELLTSTVNCSPLSVLVVAGVV*
Ga0066695_104359742 | 3300005553 | Soil | VGGGETTVRVAALLVMLPAPLLTTTVNCAPLSELVVVGV
Ga0066705_100063361 | 3300005569 | Soil | AALTVSVAALLVTLPAVLLTTTVNCEPLSAEVVAGVV*
Ga0079222_100386633 | 3300006755 | Agricultural Soil | AALTVNVAALLVAVPAVLLTTTSNVEPLSEVVVTGVA*
Ga0066660_107937643 | 3300006800 | Soil | ALTVNVATLLVTLPAVLLTATENCAPLSELVVAVVV*
Ga0073928_109118811 | 3300006893 | Iron-Sulfur Acid Spring | TSGPGRGAVAVMVSVAAWLVTLPAELVTTTVNVAPLSEAVVAAVV*
Ga0099791_102773332 | 3300007255 | Vadose Zone Soil | MTGAGTAFTVSVAAALVTDPAVLLTTVPKVAPLSEIAVAGVV*
Ga0099793_103918041 | 3300007258 | Vadose Zone Soil | LEIVGATAAGLTVSVAALLVTLPAELLTTAANVAPLSPLVVAGVV*
Ga0099793_104492041 | 3300007258 | Vadose Zone Soil | VVSGVELFTVSVAALLVMLPAVLLTTTRNVAPLSDVVVAGVV*
Ga0099829_106665683 | 3300009038 | Vadose Zone Soil | VIEGATGAALTVRVAALLVTLPAVLLTATRNVDPLSEVVVAGVV*
Ga0099828_114253523 | 3300009089 | Vadose Zone Soil | APFTVSVAAALVAVPALLLTTTRKVLPESAVAVDGVV*
Ga0066709_1000519945 | 3300009137 | Grasslands Soil | IDGATAAALTVNMAALLVALPAALLTTTVNCAPLSELVVAGVV*
Ga0066709_1027316912 | 3300009137 | Grasslands Soil | VIVGATAAALTVSVAAALVTVPAELLTTTRNVAPLSAVVVAGVV*
Ga0099792_112449801 | 3300009143 | Vadose Zone Soil | VVIVGATDAAPTVSAAPLLVTLPTVLLTTTLNCAPLSELVVAPVV*
Ga0116222_10827513 | 3300009521 | Peatlands Soil | VALFGETATVTTAALTERMAALLVTLPAELLTTTVNCVLLSDVAVAGVV*
Ga0126374_103598761 | 3300009792 | Tropical Forest Soil | VIDGATAAAFTVNGAALLVTLPAELVTTTEKEVPLSVVVVAGVV*
Ga0126380_100771432 | 3300010043 | Tropical Forest Soil | MEGAVAAAFTVSVAALLVAEPAELETTTEKVEPLSEVVVAGVV*
Ga0126380_100771433 | 3300010043 | Tropical Forest Soil | TVSVAALLVAEPAELETTTEKVEPLSEVVVAGVV*
Ga0126384_111348682 | 3300010046 | Tropical Forest Soil | VIVGAAAAGFTISVAALLVILPAELLATTWNVEPLSVLVVAG
Ga0127443_11423461 | 3300010125 | Grasslands Soil | AAALTVNVAALLVELPATLLTTTVNCAPLSELVVAGVV*
Ga0134067_104650032 | 3300010321 | Grasslands Soil | VIVGATAAALTVSVAAALVAVPAELLTTTRNVAPLSAVVVAGVV*
Ga0074044_107933172 | 3300010343 | Bog Forest Soil | MLAGCEVKVGATAAALTVSIAALLVIEPAELLTTTWKADPLFEVVVAGVV*
Ga0126376_112990932 | 3300010359 | Tropical Forest Soil | MAGAGAGFTVSVAALLVAVPAELLTTTVNVEPLSAVVVAVVV*
Ga0126372_100916054 | 3300010360 | Tropical Forest Soil | VIDVATAAAFTVNGAALLVTLPAELVTTPEKAVPLTVVVVAGVV*
Ga0126378_115581611 | 3300010361 | Tropical Forest Soil | CVVIVGATGAAFTVRVAVLLVAFPALLLTRTVNCAPLSDVIARGVV*
Ga0126379_111445372 | 3300010366 | Tropical Forest Soil | MLGTTGAAFTVSVAAALVTLPTELLTVAVKLDPLSAVVVAAVVKLDPEAPL
Ga0136449_1035208352 | 3300010379 | Peatlands Soil | WLLIEGAVGARFTVNVAALLVTLPAASDTVTVNWAPLSDAVVAGVV*
Ga0126354_11700702 | 3300010857 | Boreal Forest Soil | PVAAPTLRVAVLLVALPALLLTTTENSARSSEAVVAGVV*
Ga0137392_107184401 | 3300011269 | Vadose Zone Soil | MEGATAAAVTERMAALLVTLPAALLTTAVNCAPLSEVDVTGVV*
Ga0137392_115699292 | 3300011269 | Vadose Zone Soil | MEGPTVAAVTVRVAALLVTLPPALLTIAVNCAPLSEVDVAGVV*
Ga0137391_104562331 | 3300011270 | Vadose Zone Soil | FKIANAETTVSVAAALVTVPAVLLTTTSKVEPLSDIVVDGVM*
Ga0120139_11257802 | 3300012019 | Permafrost | MVNVAALLVTLPKLLVTVTVNCAPLSELVVAGVV*
Ga0137388_103850692 | 3300012189 | Vadose Zone Soil | VVIWTVEAAFTVRVAALLVALPDALLTTTENCVPLSELIVTGVV*
Ga0137388_115843181 | 3300012189 | Vadose Zone Soil | VIEGASGAGFTESVAALLVALPAPLLTTAVNSAPLSELIVAAMV*
Ga0137388_119455612 | 3300012189 | Vadose Zone Soil | VGATAAAFTVSVAAALVTDPTVLLTTTAKVAPLSAVVVAGVV*
Ga0137364_105772602 | 3300012198 | Vadose Zone Soil | CVVIEGATAAALTVSVAVLLVTLPAVLLTTTVNCEPLSAEVVAGVV*
Ga0137383_110234612 | 3300012199 | Vadose Zone Soil | VIVGATAAAFTVSVAALLVTIPAVLLTTTRKVAPLSAVVAAGVV*
Ga0137363_101233821 | 3300012202 | Vadose Zone Soil | LAGCVVIEGTTGAEFTVSVAAPLVMLPAVLLTTTVNCAPLSELVVAGVL*
Ga0137399_101508181 | 3300012203 | Vadose Zone Soil | AAALTVKVAVLLVTLPAVLLTATVNCAPLSEIAVTGVV*
Ga0137399_107553541 | 3300012203 | Vadose Zone Soil | VICNGEVAAFTVKTAALLETLPAELLTTTANCEPLSEATVAGVA*
Ga0137399_113808072 | 3300012203 | Vadose Zone Soil | VVIEGATADVLTVSVAALLVTLPTVLLTATVNCAPLSELVVAGVV*
Ga0137378_105071282 | 3300012210 | Vadose Zone Soil | LAGCVVIEGATAAALTVSVAVLLVTLPAVLLTTTVNCEPLSAEVVAGVV*
Ga0137386_104942412 | 3300012351 | Vadose Zone Soil | VIVGATAAAFTVSVAALLVTVPAVLLTTTANVDPLSAVVV
Ga0137385_113226452 | 3300012359 | Vadose Zone Soil | VIVGATAAAFTVSVAALLVTVPAVLLTTTAKEDPLSAVVVAGVV*
Ga0137360_100080031 | 3300012361 | Vadose Zone Soil | TVSVAGLLVTLPAVLLIDAVNCAPLSELVEAGVV*
Ga0137360_109746281 | 3300012361 | Vadose Zone Soil | IEGATGAALTVRMAALLVTLPAVLLITTEKSAPLSELAVAGVV*
Ga0137390_115964802 | 3300012363 | Vadose Zone Soil | EGATAAALTVRVPAPLVTLPAVLLIMTVNCAPLSELVVAGVV*
Ga0134054_12806482 | 3300012390 | Grasslands Soil | MEGAVAALFTVSVAALLVTLPAVLLTTTTNLALLSAEVDAGVV*
Ga0134041_11635832 | 3300012405 | Grasslands Soil | VIEGATAAALTVSVAALLVTLPAVLLTTTVNCEPLSAEVVAGVV*
Ga0134041_11635833 | 3300012405 | Grasslands Soil | GCVVIEGATAAALTVSVAALLVTLPAVLLTTTVNCEPLSAEVVAGVV*
Ga0134053_11441311 | 3300012406 | Grasslands Soil | PAGWLVIVGATAAAFTVSVAPALVTEPALLLTTTPKLLPESVVLVDGVV*
Ga0137358_108604801 | 3300012582 | Vadose Zone Soil | MEGATGAAFTVSVAAALVILPAVFVTTTLNCAPLSELVVDGVV*
Ga0137358_108604802 | 3300012582 | Vadose Zone Soil | EGATAAAFTVSVAAALVILPAVFVTTTVNCAPLSELVVAGVV*
Ga0137397_103386311 | 3300012685 | Vadose Zone Soil | VVIEGVTGAALTVSVAALLVMLPAELLTATVNCAPVSVAMVAGVV*
Ga0157304_10619352 | 3300012882 | Soil | VVDDVGAFTVSVAALLVTDPALFLTTTVNCAPLSEVVVAAVV*
Ga0137396_106988401 | 3300012918 | Vadose Zone Soil | VWLAGCVVIVGATADALTVSVAALLVALPVALLTTTVNCAPLSELVVAGVV*
Ga0137396_108228192 | 3300012918 | Vadose Zone Soil | VIVGATAAAFTDSVAAALVAEPTELLTTTRNVEPLSDVVVAGVL*
Ga0137394_109456591 | 3300012922 | Vadose Zone Soil | VVIEGVTGAALTVSVAALLVMLPAELLTATVNCAPVSVAMVVGVV*
Ga0137359_106498764 | 3300012923 | Vadose Zone Soil | YVFCVPAFTVKTAALLVALPAELLTTTVNVDPLSDVAVAGVV*
Ga0137413_113177391 | 3300012924 | Vadose Zone Soil | VALTVSAAARLVTFPAELLTTTTNEEPLSEDVVAGAV*
Ga0137419_107854791 | 3300012925 | Vadose Zone Soil | AALTVRVAALLVTLPVELLTVTVNCAPLSELVVAGVV*
Ga0137419_109291881 | 3300012925 | Vadose Zone Soil | LAGWPVIEGATGAALTVSVAGLLVTLPAVLLFDAVNCAPLSELVEAGVV*
Ga0137419_109514083 | 3300012925 | Vadose Zone Soil | MEGTDEVALTVSVAALLVAFPEELLTTTVNCDPLSELVVAGVV*
Ga0137419_115387201 | 3300012925 | Vadose Zone Soil | WVAGCVVTVGATGVALIVKATALLVTLPAALLIVTANCAPLSELVVAGVV*
Ga0137416_119365191 | 3300012927 | Vadose Zone Soil | TGAAFTVSVAAALVIAPAVLLTTTVNCAPLSELVVAGVL*
Ga0137404_114411731 | 3300012929 | Vadose Zone Soil | IEGATAAALTVSVAAPLVALLAVLLTATVNCAPLSEIAVAGVV*
Ga0137404_122724051 | 3300012929 | Vadose Zone Soil | MDGATGAAVTVRLAALLVTLPGLLLTVTVNEAPLSEVDVAEVVYDAEVAPLI
Ga0134087_100577932 | 3300012977 | Grasslands Soil | MEGAVAALFTVSVAALLVTLPATLLTVTVNCEPLSAEVVAGVV*
Ga0181518_104846011 | 3300014156 | Bog | TVSVAALLVMLPTELLTTTVNCVPLAEVVSAGVV*
Ga0134078_102775831 | 3300014157 | Grasslands Soil | KLTVVVVALTVSVAARLVAFPAELLTTTTNEEPLSEEVVAGVV*
Ga0181532_105361651 | 3300014164 | Bog | MFMEMVTGAAFTVRVAALLVMFPAGLLTTAVNCDPLSAVTVAGVVY
Ga0134079_100109972 | 3300014166 | Grasslands Soil | VIEGVTAAALTVSVAVLLVTLPAVLLTTTVNCEPLSAEVVAGVV*
Ga0182021_131308913 | 3300014502 | Fen | DGVTFTVKVAALLVTLPVLLLTVTVNCAPLSEAVVAEVV*
Ga0137420_13097082 | 3300015054 | Vadose Zone Soil | VLIEGATGAAFTVRVAAVLVTLPAVLLTTTVNSAPLSEVAVAGVV*
Ga0137420_13684311 | 3300015054 | Vadose Zone Soil | AEFTVSVAWLLVMLPAVLLTTTSNLEPLSAVVVAAWCS*
Ga0137418_106962131 | 3300015241 | Vadose Zone Soil | LAGCVVIEGGTVAVVTESVAALLVTVPAVLLTTTVNCAPLSELVVAGVL*
Ga0137418_107293681 | 3300015241 | Vadose Zone Soil | TVCVEGMLVPLPAVLLIDAVNCAPLSELVEAGVV*
Ga0137409_114731112 | 3300015245 | Vadose Zone Soil | TAFAVSVAALLVALPAELLIVTVNSAPLSAVVVAGVV*
Ga0134089_101335581 | 3300015358 | Grasslands Soil | IEGATAAAFTVSVAAPLLTVPAELLTTTLNWAPLSEAVIAGVV*
Ga0187805_105699261 | 3300018007 | Freshwater Sediment | KVVWFEGCEAIEGATAAAVTARVAALLVTVPAVLLTTTANFAPLADVVSAGVV
Ga0187771_1169288723300018088Tropical PeatlandVGAVGAAFTVSVAALLVALPAELLTTTTNVAPLSPLVVAGVV
Ga0187770_1040157813300018090Tropical PeatlandKFSVPGAAALTVSVAALLVALPAELLTTTVNFDPFAEVVSAGVV
Ga0187770_1052913013300018090Tropical PeatlandFLTVNVAALLITVPAELLTVTVKEAPLSLEDVAGVV
Ga0137408_147193463300019789Vadose Zone SoilVIEGATAAALTVSVAAVLVTLPVELLTVTVNCAPLSELVVAGVV
Ga0210403_1152270813300020580SoilMAGTPAAFTARDAALLVTLPAELLTTTLNVEPLSVLAVAGVV
Ga0210404_1068806923300021088SoilMAGAPDAAFTVSVAALLVAPPAELLTNTAYVDPLFAAVVAGVV
Ga0210406_1005069313300021168SoilTGAAAALTVSVAALLVTLPAELLTITVNCAPLSELVGAGVV
Ga0210396_1081180923300021180SoilMAGAPDAAFTVSVAALLVAPPAELLTNTAYVDPLFAAVVAGV
Ga0210409_1167019513300021559SoilAELTASVAALLVALPAPLVTTTVNREPLSELVVGAVV
Ga0242649_106979013300022509SoilLAGCVLIDGGAFTVSVAASLLILPALLLMITLNCAPLSALVVGGVV
Ga0137417_131466913300024330Vadose Zone SoilAALTVSVAALLVTLPAVLLTTTVSCAPLSELVVAGVV
Ga0207684_1016466223300025910Corn, Switchgrass And Miscanthus RhizosphereSVAAFTVSTATSLVAVPAGLLTTTLNCDPLSEAVVAGVV
Ga0207660_1026217933300025917Corn RhizosphereVIEGATAAAFTVSVAALLVTLPAELLTTTVNCAPLSELVVAGVV
Ga0209240_126863213300026304Grasslands SoilMAGAGAAFTVRVAAALVTDPAVLLTTAPKVAPLSEIAVA
Ga0209268_113431623300026314SoilVWLAGCVVIDGATAPAFTVSMAALLVVVPAVLLTTTVNCAPLSELVVAAVV
Ga0179593_104695313300026555Vadose Zone SoilAVIEGPAVAEVIVSVAALLVTLPVALLTTAVNCAPLSELVVAGVT
Ga0179593_104768833300026555Vadose Zone SoilVICTGCALTVSVAALLVTVLAVLLTATVNCAPLSKLVVAGVR
Ga0179593_111896623300026555Vadose Zone SoilMVGAAALTVSVAAPLLTFPAELLTTTTNEEPLSDEMVTGVV
Ga0179593_123787943300026555Vadose Zone SoilMTGARAAFTVRVAAALVTDPAVLLTTAPKVAPLSEIAVAGVV
Ga0209118_111301933300027674Forest SoilMAAALTVRVAALLVALPAELLTTTVNCEPLSEAVAAGVL
Ga0207862_100711243300027703Tropical Forest SoilAWLAGSVVTVGATAAALTVSVAVLLVTLPLELLTTTSKLEPLSVVLVAGVV
Ga0209069_1068458713300027915WatershedsTAAFTVNVAALLVMLPAVLLTTTRYVDPLSPLVVAGVV
Ga0137415_1107564513300028536Vadose Zone SoilCAVIEGGTAAAVTASVAALLVTVPAVLLTTTVNCAPLSELVVDGVL
Ga0302179_1008313623300030058PalsaCVIEGATGEAGVTVSRAALLVTLPAESLTTTLNFEPLSEVVVAAVV
Ga0170823_1734782213300031128Forest SoilVGGGKATVSVAGLLVALPALLLTTTVNCAPLSEVVVAAVV
Ga0307476_1003314143300031715Hardwood Forest SoilMEGATAAALTVSVAALLVALPAELLTITENCAPLSELVVEGVE
Ga0307477_1068830233300031753Hardwood Forest SoilVIEGATAAAFTVRAAALLVTLPAVLLTSTVNCAPLSELVVAGVV
Ga0307473_1122532913300031820Hardwood Forest SoilCVVMEGATGAALTVRTAALLVTLPAVLVTTTVNCAPLSEVVVAGVV
Ga0307478_1129425823300031823Hardwood Forest SoilMEGATGAGVTVSTALLLVAVPAELLTTTANCAPLSEVVS
Ga0307479_1000009923300031962Hardwood Forest SoilVICTGGGFTVSAAALLVTVPAVLLTTTVNCAPLSELLVAGVL
Ga0307471_10009956333300032180Hardwood Forest SoilCMVIDGATAAAFTVSMAALLVALPAVLLTTTVNCAPLSELVVAAVV
Ga0307472_10265888223300032205Hardwood Forest SoilGATGAAFTVRVAALLVTLPAALLIAAVNCAPLSELVVAGVV
Ga0335085_1000251913300032770SoilALTLSVAGLLVTLPALLLTTTLNWEPSSDVAVAGVV




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice