NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F086079


Overview

Basic Information
Family ID F086079
Family Type Metagenome / Metatranscriptome
Number of Sequences 111
Average Sequence Length 44 residues
Representative Sequence LRHANVVILTDETEFARLLTACWQAERQAPNITVLSSDSWR
Number of Associated Samples 97
Number of Associated Scaffolds 111

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 19.82 %
% of genes from short scaffolds (< 2000 bps) 16.22 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.47


Most Common Taxonomy
Group Unclassified (79.279 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(24.324 % of family members)
Environment Ontology (ENVO) Unclassified
(24.324 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(63.063 % of family members)
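
The percentages in this overview appear to be simple fractions of the 111 family members. As a minimal sketch under that assumption (the helper name `implied_count` is hypothetical and not part of NMPFamsDB), the member counts behind each reported percentage could be recovered like this:

```python
FAMILY_SIZE = 111  # "Number of Sequences" from Basic Information


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole number of members implied by a reported percentage.

    Assumes the percentage is count / total * 100, rounded for display.
    """
    return round(percent / 100 * total)


# Percentages copied from the Overview section of this page.
reported = {
    "genes near scaffold ends": 19.82,
    "genes from short scaffolds": 16.22,
    "Unclassified taxonomy": 79.279,
    "GOLD forest soil ecosystem": 24.324,
    "EMPO soil (non-saline)": 63.063,
}

for label, pct in reported.items():
    print(f"{label}: {pct} % is about {implied_count(pct)} of {FAMILY_SIZE}")
```

Rounding to the nearest integer absorbs the display rounding of the percentages; if a value were computed over a different denominator, the implied count would not be meaningful.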




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.54%, β-sheet: 0.00%, Coil/Unstructured: 72.46%

Predicted 3D Structure

pTM-score: 0.47

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
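
The note above applies a fixed cutoff to the pTM-score. A minimal sketch of that rule follows; the function name `is_low_confidence` is hypothetical, and only the 0.7 cutoff and the 0.47 score come from this page:

```python
PTM_CUTOFF = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


# pTM-score reported for family F086079.
print(is_low_confidence(0.47))
```

The comparison is strict because the note states the condition as pTM < 0.7, so a score of exactly 0.7 would not be flagged.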



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name   % Frequency in 111 Family Scaffolds
PF02954   HTH_8   5.41
PF03640   Lipoprotein_15   0.90

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name   Functional Category   % Frequency in 111 Family Scaffolds
COG4315   Predicted lipoprotein with conserved Yx(FWY)xxD motif (function unknown)   Function unknown [S]   0.90



Phylogeny

NCBI Taxonomy

Name   Rank   Taxonomy   Distribution
Unclassified   root   N/A   79.28 %
All Organisms   root   All Organisms   20.72 %


Associated Scaffolds


Scaffold   Taxonomy   Length (bp)
3300002912|JGI25386J43895_10141046   All Organisms → cellular organisms → Bacteria → Acidobacteria   599
3300005447|Ga0066689_10077296   All Organisms → cellular organisms → Bacteria → Acidobacteria   1859
3300006173|Ga0070716_100749971   All Organisms → cellular organisms → Bacteria → Acidobacteria   751
3300006755|Ga0079222_11969246   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   573
3300006794|Ga0066658_10145767   All Organisms → cellular organisms → Bacteria → Acidobacteria   1192
3300010364|Ga0134066_10382507   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   531
3300011271|Ga0137393_10531143   All Organisms → cellular organisms → Bacteria → Acidobacteria   1009
3300012351|Ga0137386_10046219   All Organisms → cellular organisms → Bacteria → Acidobacteria   2997
3300012362|Ga0137361_11878817   All Organisms → cellular organisms → Bacteria → Acidobacteria   516
3300021086|Ga0179596_10070696   All Organisms → cellular organisms → Bacteria → Acidobacteria   1513
3300021401|Ga0210393_10134687   All Organisms → cellular organisms → Bacteria → Acidobacteria   1981
3300021420|Ga0210394_10135356   All Organisms → cellular organisms → Bacteria → Acidobacteria   2137
3300021559|Ga0210409_10549974   All Organisms → cellular organisms → Bacteria → Acidobacteria   1020
3300026281|Ga0209863_10006718   All Organisms → cellular organisms → Bacteria   3801
3300026304|Ga0209240_1048996   All Organisms → cellular organisms → Bacteria → Acidobacteria   1593
3300026320|Ga0209131_1411052   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   507
3300026528|Ga0209378_1035672   All Organisms → cellular organisms → Bacteria   2564
3300026542|Ga0209805_1275991   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   644
3300026551|Ga0209648_10329089   All Organisms → cellular organisms → Bacteria → Acidobacteria   1069
3300027605|Ga0209329_1009339   All Organisms → cellular organisms → Bacteria → Acidobacteria   1835
3300027857|Ga0209166_10008353   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis   6854
3300031753|Ga0307477_10383152   All Organisms → cellular organisms → Bacteria → Acidobacteria   964
3300031754|Ga0307475_11360224   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   548




Environmental Properties

Associated Habitat Types

Habitat   Taxonomy   Distribution
Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil   24.32%
Vadose Zone Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil   23.42%
Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil   11.71%
Grasslands Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil   5.41%
Forest Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil   4.50%
Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil   4.50%
Hardwood Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil   3.60%
Watersheds   Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds   2.70%
Surface Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil   2.70%
Tropical Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil   1.80%
Grasslands Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil   1.80%
Agricultural Soil   Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil   1.80%
Corn, Switchgrass And Miscanthus Rhizosphere   Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere   1.80%
Sediment   Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment   0.90%
Soil   Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil   0.90%
Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil   0.90%
Bog Forest Soil   Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil   0.90%
Soil   Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil   0.90%
Prmafrost Soil   Environmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil   0.90%
Palsa   Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa   0.90%
Plant Roots   Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots   0.90%
Populus Rhizosphere   Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere   0.90%
Corn Rhizosphere   Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere   0.90%
Rhizosphere   Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere   0.90%




Associated Samples

Taxon OID   Sample Name   Habitat Type
3300002681   Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF120 (Metagenome Metatranscriptome, Counting Only)   Environmental
3300002912   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm   Environmental
3300002914   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm   Environmental
3300004479   Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs   Environmental
3300005447   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138   Environmental
3300005468   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG   Environmental
3300005531   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2   Environmental
3300005538   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1   Environmental
3300005556   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156   Environmental
3300005560   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119   Environmental
3300005569   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154   Environmental
3300005591   Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1   Environmental
3300005712   Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4   Environmental
3300006173   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG   Environmental
3300006174   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014   Environmental
3300006755   Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter   Environmental
3300006794   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107   Environmental
3300006871   Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3   Host-Associated
3300007788   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2   Environmental
3300009089   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG   Environmental
3300010343   Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1   Environmental
3300010358   Tropical forest soil microbial communities from Panama - MetaG Plot_3   Environmental
3300010362   Tropical forest soil microbial communities from Panama - MetaG Plot_22   Environmental
3300010364   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015   Environmental
3300011271   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG   Environmental
3300012096   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG   Environmental
3300012189   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG   Environmental
3300012199   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG   Environmental
3300012206   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG   Environmental
3300012351   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG   Environmental
3300012361   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG   Environmental
3300012362   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG   Environmental
3300012683   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG   Environmental
3300012924   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)   Environmental
3300012925   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)   Environmental
3300015241   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)   Environmental
3300016371   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172   Environmental
3300017656   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015   Environmental
3300019789   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)   Environmental
3300020170   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)   Environmental
3300020199   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)   Environmental
3300020579   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M   Environmental
3300020580   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M   Environmental
3300020583   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M   Environmental
3300021086   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)   Environmental
3300021088   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M   Environmental
3300021168   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M   Environmental
3300021170   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M   Environmental
3300021171   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M   Environmental
3300021178   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M   Environmental
3300021181   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O   Environmental
3300021384   Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9   Host-Associated
3300021401   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O   Environmental
3300021402   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O   Environmental
3300021403   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O   Environmental
3300021406   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O   Environmental
3300021420   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M   Environmental
3300021432   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M   Environmental
3300021474   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O   Environmental
3300021477   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O   Environmental
3300021479   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M   Environmental
3300021559   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M   Environmental
3300022533   Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)   Environmental
3300024330   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)   Environmental
3300024331   Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09   Environmental
3300025914   Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes)   Host-Associated
3300026281   Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 1 DNA2013-046 (SPAdes)   Environmental
3300026294   Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050 (SPAdes)   Environmental
3300026304   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)   Environmental
3300026320   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)   Environmental
3300026325   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)   Environmental
3300026331   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)   Environmental
3300026332   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)   Environmental
3300026508   Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-A   Environmental
3300026528   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)   Environmental
3300026538   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)   Environmental
3300026542   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)   Environmental
3300026551   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)   Environmental
3300027590   Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_O1 (SPAdes)   Environmental
3300027605   Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M3 (SPAdes)   Environmental
3300027674   Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)   Environmental
3300027787   Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)   Environmental
3300027853   Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)   Environmental
3300027855   Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)   Environmental
3300027857   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)   Environmental
3300027894   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)   Environmental
3300027903   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)   Environmental
3300027908   Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)   Environmental
3300028536   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)   Environmental
3300028800   Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-21-26 metaG   Host-Associated
3300029636   Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)   Environmental
3300030399   II_Palsa_E2 coassembly   Environmental
3300031753   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515   Environmental
3300031754   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515   Environmental
3300031945   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082   Environmental
3300031962   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515   Environmental
3300032275   Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_bottom   Environmental


Family Sequences

Protein ID   Sample Taxon ID   Habitat   Sequence
Ga0005471J37259_1169352   3300002681   Forest Soil   VRNASVLILSDEPEFARLLTACWQAERLAPGITLLSSDLWKDHEA
JGI25386J43895_101410462   3300002912   Grasslands Soil   LRHANIVILTDETEFARLLTACWQGERQMPAITVLGSDLWNKQEAPAHDL
JGI25617J43924_100589372   3300002914   Grasslands Soil   LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDLWR
Ga0062595_1018543642   3300004479   Soil   LPNFNVLILTDETEFARLLTACWQAERQTPAITVLASDLWKEHQDTP
Ga0066689_100772961   3300005447   Soil   LPHANVVILTDESEFARLLTACWQAERQAPNITVLNSNSWREQDAPDHDL
Ga0066689_102980972   3300005447   Soil   LRHANIVILTDETEFARLLTACWQGERQMPAITVLGSDLWNK
Ga0070707_1010575051   3300005468   Corn, Switchgrass And Miscanthus Rhizosphere   LRHANVVILTDETEFARLLTACWQAERQAPLVTVLTSDLWQEQQA
Ga0070738_101863491   3300005531   Surface Soil   LRHASLLILTDDAEFARLLSACWQAERQAPRITVLSSDL
Ga0070731_105921591   3300005538   Surface Soil   VRSASVLILTDEPEFARLLTACWQAERHAPGITVL
Ga0066707_103568721   3300005556   Soil   LRHANVVILTDDSEFARLLTACWQAERQAPNITVL
Ga0066670_101383201   3300005560   Soil   LRHANVIILTDETEFARLLTACWQAERQAPVITVLT
Ga0066705_106717411   3300005569   Soil   LRSASVLILTDETEFARLLTACWRAERQAPGITVLGTDLWKDH
Ga0070761_110599922   3300005591   Soil   VRSASVLILTDEPEFARLLTACWQAERHAPGITVLSSELW
Ga0070764_100592261   3300005712   Soil   LQNVSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNEH
Ga0070716_1007499711   3300006173   Corn, Switchgrass And Miscanthus Rhizosphere   LPHANVVILTDESEFARLLTACWQAERQAPNITVLNSNSWREQDAPDHDLV
Ga0075014_1001339272   3300006174   Watersheds   LRNANVVILTDETEFARLLTACWQAERLAPNIAVINSDSWREQDAPAH
Ga0075014_1006725251   3300006174   Watersheds   LRHASVVILTDEAEFARLLTACWQAERRAPAVTVLTSDLWKAQEAPVGDLMVLG
Ga0079222_119692461   3300006755   Agricultural Soil   LRQANVVILTDETEFARLLTACWQAERQAPVVTVLTSELWQEQEAPARDLI
Ga0066658_101457671   3300006794   Soil   LRHANVVILTDESEFARLLTACWQAERQAPNITVLNSNSWREQDAPDHDLV
Ga0075434_1005381271   3300006871   Populus Rhizosphere   LPNFTVLILTDETEFARLLTACWQAERQPPAISVLASNLW
Ga0099795_103117841   3300007788   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSGSWREQD
Ga0099828_102228641   3300009089   Vadose Zone Soil   LRNANVVILTDETEFARLLTACWQAERQAPNITVLNSDS
Ga0074044_105434652   3300010343   Bog Forest Soil   LRNANVLILTDEAEFARLLTACWQAERHAPRVTVLNSDVWTAQSGPT
Ga0126370_115554622   3300010358   Tropical Forest Soil   LKNANVVILTDESEFARLLSACWHAERHAPAITVLN
Ga0126377_117627831   3300010362   Tropical Forest Soil   LKHASVLILTDETEFARLLTSCWQTERQAPRITVLNS
Ga0134066_103825072   3300010364   Grasslands Soil   LRHANVVILTDDSEFARLLTACWQAERQAPNIAVLNSDSWHEPNAPAHDLV
Ga0137393_105311431   3300011271   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQTERQAPNITVLNSDSWREQDAPAHDLV
Ga0137389_104538121   3300012096   Vadose Zone Soil   LRHANVIILTDETEFARLLTAGWQAERQAPRVTVLTSDLWQEQEAPARD
Ga0137389_105466882   3300012096   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQAERQAPVVTVLTSDLWQEQEAPARD
Ga0137388_119041432   3300012189   Vadose Zone Soil   LRHANVLILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPA
Ga0137383_110216872   3300012199   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQTERQAPGVTVLNSDSWQDQDA
Ga0137380_111923242   3300012206   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQTERQAPGVTVL
Ga0137386_100462192   3300012351   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVINSDSWQEQNAQEHDLVVV*
Ga0137360_102590312   3300012361   Vadose Zone Soil   LRHANVLILTDETEFARLLTACWQAERQAPSITVIS
Ga0137360_118443121   3300012361   Vadose Zone Soil   LRSASVLILTDETDFARLLTACWQAERHAPGITVL
Ga0137361_118788171   3300012362   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPAHDLVVVG
Ga0137398_107391572   3300012683   Vadose Zone Soil   LILTDETDFARLLTACWQAEKHAPGITVLGSDLWRDHETLPH
Ga0137413_113553501   3300012924   Vadose Zone Soil   LVNSNVLIVTDETEFARLLTSCWQAERQAPGITILGSE
Ga0137419_109281872   3300012925   Vadose Zone Soil   LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDLWRDHE
Ga0137419_112659581   3300012925   Vadose Zone Soil   LRHANVLVLTDETEFARLLTACWQAERQAPGITVIGSELWREQDVPR
Ga0137418_104507961   3300015241   Vadose Zone Soil   LRHANVVILTDDTEFARLHTACWQAERQPPNITVLNSDLWQEQNTTA
Ga0182034_119674071   3300016371   Soil   LRHANVVILTDETEFARLLTACWQAERHAPRVTVL
Ga0134112_105259591   3300017656   Grasslands Soil   LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNSDSWHEQNAPSH
Ga0137408_14816241   3300019789   Vadose Zone Soil   LANSNVLIVTDETEFARLLTSCWQAERQAPGITILGSELWSEHEEIA
Ga0179594_102845061   3300020170   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLSSDSWR
Ga0179592_102438251   3300020199   Vadose Zone Soil   LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDL
Ga0210407_101110291   3300020579   Soil   LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDLWK
Ga0210403_111209382   3300020580   Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLN
Ga0210401_106460762   3300020583   Soil   VRNASVLILSDEPEFARLLTACWQAERVAPGITVLSSDLWK
Ga0179596_100706961   3300021086   Vadose Zone Soil   LRNANVIILTDETEFGRLLTACWQTERQAPNITVLSSDLWREQDAPAHDPVVLGP
Ga0210404_100610291   3300021088   Soil   LRHANVVILTDETEFARLLTACWQTERQAPGVTVLTSD
Ga0210404_100640172   3300021088   Soil   LRHANVVILTDETEFARLLTACWQTERQAPNITVLNSDSWREQDAP
Ga0210406_105076701   3300021168   Soil   VRNASVLILSDEPEFARLLTACWQAERLAPGITVLSSDLWK
Ga0210400_107987931   3300021170   Soil   LRHANVVILTDETEFARLLTACWQTERQAPNITVLNSDSWRE
Ga0210400_114847161   3300021170   Soil   LQNSSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNEHEAVAHEL
Ga0210405_104858781   3300021171   Soil   VRNASVLILSDEPEFARLLTACWQAERLAPGITVL
Ga0210408_106286942   3300021178   Soil   LRNANVVILTDETEFARLLTACWQAERQAPNIAVVNSDSW
Ga0210388_115289502   3300021181   Soil   VRNASVLILSDEPEFARLLTACWQAERLAPGITVLSSDLWKDHE
Ga0213876_100415601   3300021384   Plant Roots   LQNSSVLIVTDEPEFARLLTACWRAEREVPAITVLGSDVWNEHEGVAHELAVVG
Ga0210393_101346873   3300021401   Soil   LRNANVLILTDESEFARLLTACWQAERQAPGITVLGSESWKQHEAL
Ga0210393_116919312   3300021401   Soil   VRNASVLILSDEPEFARLLTACWQAERLAPGITLLSS
Ga0210385_100640284   3300021402   Soil   LQNFNVLIVTDEPEFARLLTACWRAEREVPAITVLAS
Ga0210397_104970362   3300021403   Soil   LQNVSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNEHEG
Ga0210386_101074323   3300021406   Soil   LPNSNALIVTDETEFARLLTSCWQAERQAPAITVLGSD
Ga0210394_101353561   3300021420   Soil   LRHANVVILTDETEFARLLTACWQAERQAPAITVLGSSLWREHEGTSHDLVVV
Ga0210384_105280032   3300021432   Soil   LRSASVLILTDETDFARLLTACWRAEKHAPGITVL
Ga0210390_113156822   3300021474   Soil   LRNANVLILTDDAEFARLLTACWQTERQAPRVAVLSSDL
Ga0210398_101031741   3300021477   Soil   LQNFNVLIVTDEPEFARLLTACWRAEREVPAITVLASDVWNEHEGVAHELAVV
Ga0210410_111430091   3300021479   Soil   LRHANVVILTDETEFARLLTACWQAEPQAPNITVLNSESWQE
Ga0210409_105499741   3300021559   Soil   LRHANVVILTDETEFARLLTACWQAEPQAPNITVLNSESWQKQDAPAHDLV
Ga0242662_101301022   3300022533   Soil   LRNANVLILTDESEFARLLTACWQAERQAPGITVLGSESWK
Ga0137417_10998861   3300024330   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQAERQAPNQAPNITVLN
Ga0137417_14397063   3300024330   Vadose Zone Soil   MRQQSVILTDETEFARFADGCWQAERQAPNITVLIAIVA
Ga0247668_10052301   3300024331   Soil   MPGSFSPLRHANLLILTDDAEFARLLSACWQAERQAPRITVL
Ga0207671_109072801   3300025914   Corn Rhizosphere   LQNASVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWGEH
Ga0209863_100067181   3300026281   Prmafrost Soil   LANSNVLIVTDETEFARLLTSCWQAERQAPGITVLGSDLWS
Ga0209839_102317881   3300026294   Soil   LRNANVLILTDEAEFARLLTACWQTERQAPRVTVLNSDVWHGQSGPT
Ga0209240_10489961   3300026304   Grasslands Soil   LRNANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPAHDLV
Ga0209131_14110521   3300026320   Grasslands Soil   LRNANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPAHDLVVV
Ga0209152_103055991   3300026325   Soil   LRNASVLILTDETDFARLLTSCWQADRHAPAITVLGSELWKNHEVA
Ga0209267_11020332   3300026331   Soil   LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNSDSWHE
Ga0209803_11281492   3300026332   Soil   LRHANIVILTDETEFARLLTACWQGERQMPAITVLGSDLWNKQE
Ga0257161_10890912   3300026508   Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWPEQDTP
Ga0209378_10356724   3300026528   Soil   LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNSDSWHEQNAPSHDLVV
Ga0209056_105739501   3300026538   Soil   LRHASVVILTDETEFARLLTACWQAERHAPAVTVLTSDL
Ga0209805_11572131   3300026542   Soil   LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNS
Ga0209805_12759912   3300026542   Soil   LRHANVVILTDDSEFARLLTACWQAERQAPNIAVLNSDSWHEPNAPAHDLVVVGP
Ga0209648_103290892   3300026551   Grasslands Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPAHDLV
Ga0209648_104976511   3300026551   Grasslands Soil   LRNANVIILTDETEFGRLLTACWHAERQAPNITVLNSDLWR
Ga0209116_10640663   3300027590   Forest Soil   LPNSNVLIVTDETEFARLLTACWQTERQAPGITVLGSDL
Ga0209329_10093391   3300027605   Forest Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPAHDLVVVGP
Ga0209118_11080381   3300027674   Forest Soil   LRSANVLILSDETDFARLLTACWQAERHAPAITVL
Ga0209074_101728491   3300027787   Agricultural Soil   LRHANVVILTDETEFARLLTACWQAERQAPVVTVLTS
Ga0209274_102020091   3300027853   Soil   VRNASVLILTDEPEFARLLTACWQTERQAPGITVLSSELWKDHEATP
Ga0209274_106769771   3300027853   Soil   VRSASVLILTDEPEFARLLTACWQAERHAPGITVLSSELWK
Ga0209693_101933011   3300027855   Soil   LQNFSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWKEHEGVAHELAVV
Ga0209166_100083536   3300027857   Surface Soil   LRHANVVILTDETEFARLLTACWQADRQAPVVTVLTSDLWQEQQ
Ga0209068_102547271   3300027894   Watersheds   LRNANVVILTDETEFARLLTACWQAERLVPNIAVLNSDSWREQDALVLA
Ga0209488_105684361   3300027903   Vadose Zone Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPTHD
Ga0209006_110486231   3300027908   Forest Soil   LQNSSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNE
Ga0137415_105830631   3300028536   Vadose Zone Soil   LRHANVLVLTDETEFARLLTACWQAERQAPGITVIGSE
Ga0137415_110083351   3300028536   Vadose Zone Soil   LANSNVLIVTDETEFARLLTSCWQAERQAPGITILGSELWSEHE
Ga0265338_105921912   3300028800   Rhizosphere   LRNANVLILTDEAEFGRLLSTCWQAERQAPRVTIVDSEH
Ga0222749_106329342   3300029636   Soil   LRHANVVILTDETEFGRLLTACWQAERQAPNITVLNSDSW
Ga0311353_107942451   3300030399   Palsa   LRNANVLILTDEAEFGRLLTTCWRTERQPPRMTILDSDNWHGQE
Ga0307477_103831522   3300031753   Hardwood Forest Soil   LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPAHDLVVV
Ga0307475_113602241   3300031754   Hardwood Forest Soil   LRHANVVILTDDTEFARLLTACWQAERQAPNITVLNSDSWHEQDAPALDLIVVGP
Ga0310913_111945921   3300031945   Soil   LRQANVVILTDETEFARLLTACWQAERQAPVVTVLTSDLCQEQQ
Ga0307479_104826012   3300031962   Hardwood Forest Soil   LRSASVLILTDEPDFARLLTACWQAEKHAPGITVLGSDLW
Ga0307479_112779582   3300031962   Hardwood Forest Soil   LRSASVLILTDETDFARLLTACWQAEKHGPGITVMGSD
Ga0315270_111881231   3300032275   Sediment   LRHTNVIVLTDETEFARLLTACWQAERHVPAIAVLNSDLWLRQQHLPCDLLV
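
For downstream tools, the family members listed above are often easier to handle as FASTA. The sketch below shows one way to do that for two entries copied from the list. It assumes that the Protein ID is the text preceding the 10-digit Sample Taxon ID, and that a trailing `*` marks a translated stop codon (a common convention, not something this page states). The function name `to_fasta` and the header layout are hypothetical.

```python
def to_fasta(records, width=60):
    """Format (protein_id, taxon_id, habitat, sequence) tuples as FASTA text."""
    lines = []
    for protein_id, taxon_id, habitat, seq in records:
        # Assumption: a trailing "*" is a stop-codon marker, not a residue.
        seq = seq.rstrip("*")
        lines.append(f">{protein_id} taxon={taxon_id} habitat={habitat}")
        # Wrap the sequence at a fixed width, as FASTA files usually do.
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"


# Two members copied from the Family Sequences list; the first is the
# representative sequence given in Basic Information.
records = [
    ("Ga0179594_102845061", "3300020170", "Vadose Zone Soil",
     "LRHANVVILTDETEFARLLTACWQAERQAPNITVLSSDSWR"),
    ("Ga0137386_100462192", "3300012351", "Vadose Zone Soil",
     "LRHANVVILTDETEFARLLTACWQAERQAPNITVINSDSWQEQNAQEHDLVVV*"),
]

fasta = to_fasta(records)
print(fasta, end="")
```

Keeping the taxon ID and habitat in the header line preserves the link back to the Associated Samples table without needing a separate lookup file.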




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice