NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F102801

Metagenome / Metatranscriptome Family F102801

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102801
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 43 residues
Representative Sequence VTESSSQNANVVAECLRSWNPKAVVEVIEFRGETTIVVPRE
Number of Associated Samples 87
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 65.35 %
% of genes near scaffold ends (potentially truncated) 97.03 %
% of genes from short scaffolds (< 2000 bps) 92.08 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (53.465 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(22.772 % of family members)
Environment Ontology (ENVO) Unclassified
(23.762 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(58.416 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 21.74%    β-sheet: 15.94%    Coil/Unstructured: 62.32%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF00507Oxidored_q4 88.12
PF02597ThiS 5.94
PF06580His_kinase 0.99
PF00329Complex1_30kDa 0.99
PF00346Complex1_49kDa 0.99
PF076945TM-5TMR_LYT 0.99
PF00925GTP_cyclohydro2 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG0838NADH:ubiquinone oxidoreductase subunit 3 (chain A)Energy production and conversion [C] 88.12
COG1977Molybdopterin synthase sulfur carrier subunit MoaDCoenzyme transport and metabolism [H] 5.94
COG2104Sulfur carrier protein ThiS (thiamine biosynthesis)Coenzyme transport and metabolism [H] 5.94
COG3275Sensor histidine kinase, LytS/YehU familySignal transduction mechanisms [T] 1.98
COG0649NADH:ubiquinone oxidoreductase 49 kD subunit (chain D)Energy production and conversion [C] 0.99
COG0807GTP cyclohydrolase IICoenzyme transport and metabolism [H] 0.99
COG0852NADH:ubiquinone oxidoreductase 27 kD subunit (chain C)Energy production and conversion [C] 0.99
COG2972Sensor histidine kinase YesMSignal transduction mechanisms [T] 0.99
COG3261Ni,Fe-hydrogenase III large subunitEnergy production and conversion [C] 0.99
COG3262Ni,Fe-hydrogenase III component GEnergy production and conversion [C] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A53.47 %
All OrganismsrootAll Organisms46.53 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001084|JGI12648J13191_1029757Not Available538Open in IMG/M
3300001545|JGI12630J15595_10101024Not Available568Open in IMG/M
3300001867|JGI12627J18819_10406828Not Available554Open in IMG/M
3300003220|JGI26342J46808_1002810All Organisms → cellular organisms → Bacteria1517Open in IMG/M
3300003224|JGI26344J46810_1004581All Organisms → cellular organisms → Bacteria1023Open in IMG/M
3300004135|Ga0058884_1342536Not Available539Open in IMG/M
3300004479|Ga0062595_100149049All Organisms → cellular organisms → Bacteria1357Open in IMG/M
3300004631|Ga0058899_11634214Not Available543Open in IMG/M
3300004631|Ga0058899_11686438Not Available529Open in IMG/M
3300004633|Ga0066395_10409336All Organisms → cellular organisms → Bacteria → Acidobacteria766Open in IMG/M
3300005174|Ga0066680_10027428All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3186Open in IMG/M
3300005177|Ga0066690_10443073All Organisms → cellular organisms → Bacteria880Open in IMG/M
3300005451|Ga0066681_10094683All Organisms → cellular organisms → Bacteria1701Open in IMG/M
3300005468|Ga0070707_101144873Not Available744Open in IMG/M
3300005554|Ga0066661_10364880All Organisms → cellular organisms → Bacteria884Open in IMG/M
3300005561|Ga0066699_10941236All Organisms → cellular organisms → Bacteria601Open in IMG/M
3300006041|Ga0075023_100503377Not Available546Open in IMG/M
3300006057|Ga0075026_100542021Not Available676Open in IMG/M
3300006806|Ga0079220_10111520All Organisms → cellular organisms → Bacteria1439Open in IMG/M
3300006914|Ga0075436_101242241All Organisms → cellular organisms → Bacteria563Open in IMG/M
3300007255|Ga0099791_10070143All Organisms → cellular organisms → Bacteria1588Open in IMG/M
3300007788|Ga0099795_10184806Not Available872Open in IMG/M
3300009089|Ga0099828_11331730Not Available635Open in IMG/M
3300010046|Ga0126384_11444216Not Available643Open in IMG/M
3300010046|Ga0126384_12358610Not Available514Open in IMG/M
3300010322|Ga0134084_10213765Not Available680Open in IMG/M
3300010376|Ga0126381_103778328Not Available592Open in IMG/M
3300011120|Ga0150983_14353275Not Available505Open in IMG/M
3300011270|Ga0137391_11488062Not Available522Open in IMG/M
3300012096|Ga0137389_10577574All Organisms → cellular organisms → Bacteria → Acidobacteria966Open in IMG/M
3300012198|Ga0137364_10561232All Organisms → cellular organisms → Bacteria860Open in IMG/M
3300012202|Ga0137363_10317709All Organisms → cellular organisms → Bacteria → Acidobacteria1282Open in IMG/M
3300012202|Ga0137363_10638766All Organisms → cellular organisms → Bacteria → Acidobacteria900Open in IMG/M
3300012203|Ga0137399_10503150All Organisms → cellular organisms → Bacteria → Acidobacteria1016Open in IMG/M
3300012203|Ga0137399_10711415Not Available845Open in IMG/M
3300012205|Ga0137362_10109751All Organisms → cellular organisms → Bacteria → Acidobacteria2331Open in IMG/M
3300012211|Ga0137377_11936467Not Available506Open in IMG/M
3300012389|Ga0134040_1159033Not Available505Open in IMG/M
3300012469|Ga0150984_119551790Not Available509Open in IMG/M
3300012922|Ga0137394_10077650All Organisms → cellular organisms → Bacteria → Proteobacteria2775Open in IMG/M
3300012977|Ga0134087_10321935Not Available731Open in IMG/M
3300014154|Ga0134075_10095846All Organisms → cellular organisms → Bacteria → Acidobacteria1251Open in IMG/M
3300014166|Ga0134079_10004244All Organisms → cellular organisms → Bacteria → Acidobacteria4084Open in IMG/M
3300014166|Ga0134079_10604072Not Available546Open in IMG/M
3300015051|Ga0137414_1033263Not Available579Open in IMG/M
3300015051|Ga0137414_1119163Not Available766Open in IMG/M
3300015242|Ga0137412_10742848All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae726Open in IMG/M
3300017943|Ga0187819_10854343Not Available510Open in IMG/M
3300017961|Ga0187778_10427260Not Available872Open in IMG/M
3300017966|Ga0187776_10347743All Organisms → cellular organisms → Bacteria → Acidobacteria978Open in IMG/M
3300020199|Ga0179592_10303091Not Available710Open in IMG/M
3300020580|Ga0210403_10607753All Organisms → cellular organisms → Bacteria → Acidobacteria882Open in IMG/M
3300020581|Ga0210399_11331748Not Available564Open in IMG/M
3300021088|Ga0210404_10219078All Organisms → cellular organisms → Bacteria → Acidobacteria1026Open in IMG/M
3300021088|Ga0210404_10311775Not Available867Open in IMG/M
3300021088|Ga0210404_10490083Not Available694Open in IMG/M
3300021170|Ga0210400_11391837Not Available559Open in IMG/M
3300021178|Ga0210408_10306372All Organisms → cellular organisms → Bacteria1266Open in IMG/M
3300021406|Ga0210386_10761408All Organisms → cellular organisms → Bacteria → Acidobacteria833Open in IMG/M
3300021407|Ga0210383_10232496All Organisms → cellular organisms → Bacteria1580Open in IMG/M
3300021411|Ga0193709_1118987Not Available543Open in IMG/M
3300021432|Ga0210384_11265526Not Available642Open in IMG/M
3300021432|Ga0210384_11689869Not Available538Open in IMG/M
3300021478|Ga0210402_11226268Not Available677Open in IMG/M
3300021479|Ga0210410_10045844All Organisms → cellular organisms → Bacteria3806Open in IMG/M
3300021479|Ga0210410_10538027All Organisms → cellular organisms → Bacteria → Acidobacteria1041Open in IMG/M
3300021560|Ga0126371_10689093All Organisms → cellular organisms → Bacteria1170Open in IMG/M
3300021861|Ga0213853_10590506Not Available719Open in IMG/M
3300022715|Ga0242678_1073350Not Available533Open in IMG/M
3300022722|Ga0242657_1220608Not Available532Open in IMG/M
3300022724|Ga0242665_10198742Not Available660Open in IMG/M
3300022724|Ga0242665_10378629Not Available513Open in IMG/M
3300022873|Ga0224550_1050219All Organisms → cellular organisms → Bacteria597Open in IMG/M
3300024330|Ga0137417_1380412All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia1971Open in IMG/M
3300024330|Ga0137417_1448405All Organisms → cellular organisms → Bacteria3571Open in IMG/M
3300025916|Ga0207663_10553238All Organisms → cellular organisms → Bacteria → Acidobacteria900Open in IMG/M
3300025922|Ga0207646_10700246Not Available906Open in IMG/M
3300025928|Ga0207700_11554348Not Available586Open in IMG/M
3300026319|Ga0209647_1053944All Organisms → cellular organisms → Bacteria2140Open in IMG/M
3300026327|Ga0209266_1274890Not Available533Open in IMG/M
3300026515|Ga0257158_1085308Not Available614Open in IMG/M
3300026557|Ga0179587_10494195Not Available802Open in IMG/M
3300027460|Ga0207506_1002266All Organisms → cellular organisms → Bacteria → Acidobacteria1380Open in IMG/M
3300027565|Ga0209219_1140817Not Available583Open in IMG/M
3300027660|Ga0209736_1036141All Organisms → cellular organisms → Bacteria1449Open in IMG/M
3300027765|Ga0209073_10216021Not Available734Open in IMG/M
3300027846|Ga0209180_10418169Not Available758Open in IMG/M
3300027846|Ga0209180_10710345Not Available546Open in IMG/M
3300027857|Ga0209166_10114989All Organisms → cellular organisms → Bacteria1491Open in IMG/M
3300027869|Ga0209579_10257277All Organisms → cellular organisms → Bacteria938Open in IMG/M
3300027882|Ga0209590_10761352Not Available617Open in IMG/M
3300028906|Ga0308309_10288193All Organisms → cellular organisms → Bacteria → Acidobacteria1385Open in IMG/M
3300028906|Ga0308309_10663700All Organisms → cellular organisms → Bacteria → Acidobacteria906Open in IMG/M
3300029951|Ga0311371_10536437All Organisms → cellular organisms → Bacteria1529Open in IMG/M
3300031028|Ga0302180_10455077Not Available633Open in IMG/M
3300031231|Ga0170824_111882112All Organisms → cellular organisms → Bacteria → Acidobacteria1091Open in IMG/M
3300031716|Ga0310813_10322376Not Available1309Open in IMG/M
3300031718|Ga0307474_10342262All Organisms → cellular organisms → Bacteria → Acidobacteria1157Open in IMG/M
3300031962|Ga0307479_11213524Not Available717Open in IMG/M
3300032180|Ga0307471_100094219All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia2691Open in IMG/M
3300033412|Ga0310810_10696129All Organisms → cellular organisms → Bacteria → Acidobacteria946Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil18.81%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil8.91%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil5.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil5.94%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil3.96%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.97%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.98%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.98%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.98%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.98%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil1.98%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa1.98%
WatershedsEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds0.99%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.99%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.99%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.99%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.99%
SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Soil0.99%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.99%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001084Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_O1EnvironmentalOpen in IMG/M
3300001545Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1EnvironmentalOpen in IMG/M
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300003220Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1EnvironmentalOpen in IMG/M
3300003224Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM3EnvironmentalOpen in IMG/M
3300004135Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF204 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006057Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012389Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015051Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300017966Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MGEnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021411Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3c2EnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300021861Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - ABR_2016 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022715Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022722Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022873Peat soil microbial communities from Stordalen Mire, Sweden - 717 P3 10-14EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027460Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-ROWE17-C (SPAdes)EnvironmentalOpen in IMG/M
3300027565Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029951III_Palsa_N1 coassemblyEnvironmentalOpen in IMG/M
3300031028Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Palsa_E3_2EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12648J13191_102975713300001084Forest SoilVTESDSQNTNVVVAGLRSWSANAIEEVIEFRGETTVVVARK
JGI12630J15595_1010102423300001545Forest SoilVAENSNTNVVVERLRSWSPNAISEVIEFRGETTIV
JGI12627J18819_1040682823300001867Forest SoilVTESPSQKVNTVVESLRSWKPGAVLDVVFFRDETTIIV
JGI26342J46808_100281013300003220Bog Forest SoilVTENTSQNANVVVEHLRAWNPNAIEEVIEFRGETTIVLPRKI
JGI26344J46810_100458133300003224Bog Forest SoilVSENTSQNANFVVEHLRAWNGRAVAEVTGDRGETTIVVPRELIVETA
Ga0058884_134253623300004135Forest SoilVTESSSQSANVVAEFLRSWNAKAVGEVTEFRGETTIVVPRELLRA
Ga0062595_10014904913300004479SoilVAEDSKQTGNVVVERLRGWNPNAISEVIEFRGETTLVIPRNVVRE
Ga0058899_1163421413300004631Forest SoilVRFEIVTESSSKNANVVAECLRSWNPKAVLEVIEFRGETTIVVPRALLRAT
Ga0058899_1168643823300004631Forest SoilVTETSSQNASVVAELLRSWNPKAVAEVSEFRGETTIVVPRELLRAAA
Ga0066395_1040933613300004633Tropical Forest SoilVAENVNQNANAVALHLRAWNPQAVEQVIEFHGETTIVVPHALLRATARECRDALQFDLL
Ga0066680_1002742813300005174SoilVSESGSQNASVVAEQLRTWNANAVSEVIEFHGETTIVVPRELLRV
Ga0066690_1044307313300005177SoilVTESSSQNASIVVECLRSWSQQAVAEVIEFRGETTVVVPRDLLRAVAERCRADA
Ga0066681_1009468353300005451SoilLGFEIVTESSSQNANVVAECLRSWNPQAVVEVIEFRGETTIVVP
Ga0070707_10114487313300005468Corn, Switchgrass And Miscanthus RhizosphereVADNTNQSANVVVEHLRAWNAKAVEEVIEFHGETTIVIPHGLLRAAAQE
Ga0066661_1036488013300005554SoilVSESGGQNASVVAEQLRTWNANAVSEVIEFHGETTIVVPRELLRVTA
Ga0066699_1094123623300005561SoilVADNTNQSANVVVEHLRAWNAKAVEEVIEFHGETTIVVPHGLLRAAAQECRD
Ga0075023_10050337723300006041WatershedsVTESSSKNANVVAECLRSWNPKAVLEVIEFRGETTIVVPRALLRATAEH
Ga0075026_10054202123300006057WatershedsVSETVSESTNQSAHVVVEHLRSWNPKAVAEVIESRGETTIVVPRELLRAAAARCR
Ga0079220_1011152013300006806Agricultural SoilVTESSSQNANVVVESLRSWSPQAVSEVIEFRGETTIVVPRNLLRA
Ga0075436_10124224113300006914Populus RhizosphereVAENTNQNANAVVEHLRAWNAKAVEEIIEFRGETTVVV
Ga0099791_1007014313300007255Vadose Zone SoilVGFQIVTESPSQNANVVAECLRSWNPKAVVEVIAFRGETTIVVPRELLRAT
Ga0099795_1018480633300007788Vadose Zone SoilVAENSNQNTNVVVERLRGWSPNAISEVIEFRGETTIVVP
Ga0099828_1133173013300009089Vadose Zone SoilVGFEIVTESSSKNANVVAECLRSWNPKAVAEVLEFRGETTIVV
Ga0126384_1144421613300010046Tropical Forest SoilLEIVTESSNQKANVVAEGLRSWNAKAVAQVVEFRGET
Ga0126384_1235861023300010046Tropical Forest SoilVTENANQRGEVVVQRLRGWNAAAVAEVILFRGDRTVVVPREYLRALC
Ga0134084_1021376533300010322Grasslands SoilVTESSRQNANVVAGCLRSWNPKAVVDVLEFRGETTIVVPRELLRATA
Ga0126381_10377832823300010376Tropical Forest SoilVAANENQNANAVVEHLRAWNPKAAEEVIEFHGETTIVVPYGLLRATAQECRDA
Ga0150983_1435327523300011120Forest SoilLETVSENTSQSANVVVEHLRAWNPKAVAEVIQFRGET
Ga0137391_1148806223300011270Vadose Zone SoilVGFEIVTESSSKSANVVAECLRSWNPKAVAEVLEFRGE
Ga0137389_1057757433300012096Vadose Zone SoilVTETSSQNANVVAECLRSWNPKAVAEVIEFRGETTIVVPRELLRA
Ga0137364_1056123213300012198Vadose Zone SoilVADNTNQSANVVVEHLRAWNAKAVEEVIEFHGETT
Ga0137363_1031770913300012202Vadose Zone SoilVADNSNQVTNVVVERLRGWNPNAISEVIEFRGETTIVAPRNSLRDV
Ga0137363_1063876633300012202Vadose Zone SoilVGFEIVTESSSQKANLVAECLRSWNPKAVVEVIEFRGETTIVVPRELLRATA
Ga0137399_1050315013300012203Vadose Zone SoilVTESSSKNTNVVAEFLRSWNPKAVAEVIEFRGETTI
Ga0137399_1071141533300012203Vadose Zone SoilVTESSSQKANVVAEHLRSWNSKAVAEVIELRGDTTIIIPREFL
Ga0137362_1010975163300012205Vadose Zone SoilVGFEIVTESSSQNSNVVAECLRSWNPKAVVEVIEFRGETTIVVPRELLR
Ga0137377_1193646713300012211Vadose Zone SoilVGFEIVTESSSQNANVVAECLRSWNPKAVAEVIEFRGETTIVVPR
Ga0134040_115903313300012389Grasslands SoilLEIVTESSSQNANVVAERLRVWNSKAVAEVIQFRDE
Ga0150984_11955179013300012469Avena Fatua RhizosphereVTESGNQNANVVVERLRAWNANAIAEVIEFRGETT
Ga0137394_1007765073300012922Vadose Zone SoilVADNTNQNANAVVEHLRAWNPKAVEDVIEFRGETTVVLPNGLLR
Ga0134087_1032193513300012977Grasslands SoilVGFEIVTESSSQNANVVAESLRSWNPKAVVEVIEFRRETTIVVPRELLRATAERCCK
Ga0134075_1009584633300014154Grasslands SoilVTEPASKNANVVAECLRSWNPKAVAEVLEFRGETTIVVPRELLR
Ga0134079_1000424413300014166Grasslands SoilLGFEIVTESSSQNANVVAECLRSWNPQAVVEVIEFRGETTIVV
Ga0134079_1060407223300014166Grasslands SoilVADNTNQSANVVVEHLRAWNAKAVEEVIEFHGETTIVVPHGLLRAA
Ga0137414_103326313300015051Vadose Zone SoilMWDSKIVTESSSQKANVVAECLRSWNPKAVLEVIEFRGET
Ga0137414_111916313300015051Vadose Zone SoilVGFEIVTESSSQKANVVAECLRSWNPKAVVEVIEFRGETT
Ga0137412_1074284823300015242Vadose Zone SoilVAENSKQAGSVVVDRLRAWSPNAISEVLEFRGETTIVVARNVLREVAARC
Ga0187819_1085434313300017943Freshwater SedimentVTESASQNANVVAESLRSWNPQAAAEVIEYRGETTIVVPRELLRAA
Ga0187778_1042726013300017961Tropical PeatlandVTENASQVANVVVDTLRAWSANAIEEVIEFRGETTL
Ga0187776_1034774333300017966Tropical PeatlandVTESPSQNANVVVEILRSWNPQAVAEVIEYRGETTIVVPRDLLRAAAERCRS
Ga0179592_1030309113300020199Vadose Zone SoilVTESGSQSTHVVVAGLRAWSTNAIEEVIEFRGETTVVVAR
Ga0210403_1060775333300020580SoilVGFEIVTESSSQNANVVAECLRSWNPKAVVEVIEFRGETTIVV
Ga0210399_1133174813300020581SoilLDIVTESPSQKASVVAEHLRSWNAKAVADVLEFRGETTIV
Ga0210404_1021907813300021088SoilVTESSSQNANVVAECLRSWNPKAVVEVIEFRGETTIVVPRE
Ga0210404_1031177513300021088SoilVAENSNQNTNVVVERLRAWSPNAISEVIEFRGETTIVVPR
Ga0210404_1049008323300021088SoilVTENTSQRASVVAEQLRAWNPNAIEEVIEFRGETTLVLP
Ga0210400_1139183713300021170SoilVGFEIVTESSSQNANVVAECLRSWNPKAVVEVIEFRGETTI
Ga0210408_1030637233300021178SoilVTGNASQNAAVVVERLRGWNAAAIAEVIEFRGDTT
Ga0210386_1076140833300021406SoilVTENTSQKTSVVAEHLRAWNPNAIEEVIEFRGETT
Ga0210383_1023249613300021407SoilVTENTSQRASVVAEHLRAWNPNAIEEVIEFRGETTI
Ga0193709_111898723300021411SoilVADVPSQSANVVAEHLRSWNPESVTEVIEFRGETTIVVPRALIR
Ga0210384_1126552623300021432SoilVTGNASQNAAVVVERLRGWNAAAVVEVIEFRGDTTIVVPRELLRELC
Ga0210384_1168986913300021432SoilVTESSSQNANVVAEQLRSWNAKAVAEVLEFRGETTIVV
Ga0210402_1122626813300021478SoilVTESSSLSAHVVAEHLRSWNPKAVAEVIEFHGETTIVVPREF
Ga0210410_1004584413300021479SoilLETVTENTSQRASVVAEHLRAWNPNAIEEVIEFRGETTIVLPRKI
Ga0210410_1053802713300021479SoilVAENSNQNTNVVVERLRSWNPNAISEVIEFRGETTI
Ga0126371_1068909333300021560Tropical Forest SoilVTESPSQNAHVVAEALRSWNAQAVAEVIEYRGDTTIVVPR
Ga0213853_1059050613300021861WatershedsVTESSSQSANVVAEQLRSWNPQAVADVLEFRGETTIMVPRELLRATAARCRED
Ga0242678_107335013300022715SoilVTENTSQKTSVVAEHLRAWNPNAIEEVIEFRGETTVVVPRKIL
Ga0242657_122060823300022722SoilVTENTSQRASVVAEHLRAWNPNAIEEVIEFRGETTIVLPRKILR
Ga0242665_1019874213300022724SoilVGFEIVTESSSKNANVVAECLRSWNPKAVAEVLEFRGETTIVVPRE
Ga0242665_1037862913300022724SoilVTESSSQSANVVAECLRSWNAKAVAEVVEFRGETTIVV
Ga0224550_105021923300022873SoilVSENTSQNANVVVEHLRAWNSKAVEDVIEFRGETTVVVPRQMLR
Ga0137417_138041263300024330Vadose Zone SoilVADNQNQGASAVVEHLRAWNAKAVEEVIEFHGETTS
Ga0137417_1448405113300024330Vadose Zone SoilVGFEIVTESSSQNANVVAECLRSWNPKAVVEVIEFRGETTIVVPRELLQPRRALLQR
Ga0207663_1055323833300025916Corn, Switchgrass And Miscanthus RhizosphereVADNTNQNTNVVAEHLRAWNAKAVEEVIEFHGETTIVVPHGLLRAAAREC
Ga0207646_1070024613300025922Corn, Switchgrass And Miscanthus RhizosphereVTENSNQNTNVVVKRLRSWSPNAVSEVIEFRGETTI
Ga0207700_1155434823300025928Corn, Switchgrass And Miscanthus RhizosphereVADNTNQSANVVVEHLRAWNAKAVEEVIEFHGETTIVIPR
Ga0209647_105394453300026319Grasslands SoilVADNTNQSANVVVEHLRAWNAKAVEEVIEFRGETTVVVPHGLLRA
Ga0209266_127489023300026327SoilVTESSSQNANVVVECLRSWSQQAVAEVIEFRGETTVVVP
Ga0257158_108530823300026515SoilVTEATSQNANVVADALRSWSANAISEVSEFRGETTIVVARNVLR
Ga0179587_1049419513300026557Vadose Zone SoilVAENSKQAGSVVVDRLRAWSPNAISEVLEFRGETTIVVARNVLREVAAR
Ga0207506_100226633300027460SoilVTENTSQKANVVVEHLRAWNPNAIEEVIEFRGETTLVLPRKTL
Ga0209219_114081713300027565Forest SoilVTENTSQRASVVAEHLRAWNPNCIEEVIEFRGETTLVLSRKI
Ga0209736_103614113300027660Forest SoilVTENTSQSANVVVAALRSWSANAISEVIEFRGETTIV
Ga0209073_1021602133300027765Agricultural SoilVTESSSQNANVVTESLRSWNPGAVADVFTFRDETTIVVPRELLRA
Ga0209180_1041816933300027846Vadose Zone SoilVTDSTNQSANVVVEHLRAWNPKAVAEVIQFRGETTVVVPRELL
Ga0209180_1071034513300027846Vadose Zone SoilVTENSNQNTNVVVERLRAWSPNAVSEVIEFRGETTIVVSRNVLR
Ga0209166_1011498943300027857Surface SoilVAEDSKQPGNVVVERLRGWNPNAISEVIEFRGETTV
Ga0209579_1025727733300027869Surface SoilVAESSNQNASVVAEHLRSWNAKAVAEVIEYRGETTIVVPRELLRATA
Ga0209590_1076135223300027882Vadose Zone SoilMGFETVTESSSKNANVVAECLRSWNPKAVMEVIEFRGETTI
Ga0308309_1028819313300028906SoilVTENTSQRASVVAEHLRAWNPNAIEEVIEFRGETTLV
Ga0308309_1066370013300028906SoilVTENTSQQANVVVENLRAWNPNAIEEVIEFRGETTVVLPR
Ga0311371_1053643743300029951PalsaVSENTSQVANVVVENLRAWNGKAVAEVLEFRGEATLV
Ga0302180_1045507713300031028PalsaVSENTSQVANVVVENLRAWNGKAVAEVLEFRGEATLVVPR
Ga0170824_11188211213300031231Forest SoilVTESGVAESGSQNAGVVAEQLRAWNAMCVSEVIEFHGETTIVVPRELLRATAEY
Ga0310813_1032237613300031716SoilVTEKSSQSSYPAVDVLRAWSANAIEEVIEFRGETTLVVPRK
Ga0307474_1034226213300031718Hardwood Forest SoilVGFEIVTESSSKNANLVAERLRSWNPKAVAEVLEF
Ga0307479_1121352433300031962Hardwood Forest SoilVGFEIVTESSSKNANLVAERLRSWNPKAVVEVMEF
Ga0307471_10009421913300032180Hardwood Forest SoilVGFEIVTESSSKNANVVAECLRSWNAKAVSEVIEFRGETTIVVPCELLR
Ga0310810_1069612933300033412SoilVTENTSQKANVVVEHLRAWNPNAIEEVIEFRGETT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.