NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F081716

Metagenome / Metatranscriptome Family F081716

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F081716
Family Type Metagenome / Metatranscriptome
Number of Sequences 114
Average Sequence Length 56 residues
Representative Sequence SQYDFDGRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Number of Associated Samples 100
Number of Associated Scaffolds 114

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 21.93 %
% of genes from short scaffolds (< 2000 bps) 17.54 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score0.30

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (78.070 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(23.684 % of family members)
Environment Ontology (ENVO) Unclassified
(38.596 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(55.263 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 17.50%    β-sheet: 2.50%    Coil/Unstructured: 80.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.30
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 114 Family Scaffolds
PF00578AhpC-TSA 7.02
PF08534Redoxin 5.26
PF13407Peripla_BP_4 2.63
PF03551PadR 1.75
PF08281Sigma70_r4_2 1.75
PF07228SpoIIE 1.75
PF00903Glyoxalase 1.75
PF12704MacB_PCD 1.75
PF10267Tmemb_cc2 0.88
PF13229Beta_helix 0.88
PF12680SnoaL_2 0.88
PF03062MBOAT 0.88
PF14534DUF4440 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 114 Family Scaffolds
COG1695DNA-binding transcriptional regulator, PadR familyTranscription [K] 1.75
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 1.75
COG1846DNA-binding transcriptional regulator, MarR familyTranscription [K] 1.75


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A78.07 %
All OrganismsrootAll Organisms21.93 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005591|Ga0070761_10115199All Organisms → cellular organisms → Bacteria → Acidobacteria1558Open in IMG/M
3300005610|Ga0070763_10444212All Organisms → cellular organisms → Bacteria → Acidobacteria735Open in IMG/M
3300005921|Ga0070766_10383509All Organisms → cellular organisms → Bacteria → Acidobacteria918Open in IMG/M
3300006176|Ga0070765_100971620All Organisms → cellular organisms → Bacteria → Acidobacteria803Open in IMG/M
3300009089|Ga0099828_10570681All Organisms → cellular organisms → Bacteria → Acidobacteria1019Open in IMG/M
3300009143|Ga0099792_10205230All Organisms → cellular organisms → Bacteria → Acidobacteria1123Open in IMG/M
3300009143|Ga0099792_10780586All Organisms → cellular organisms → Bacteria → Acidobacteria624Open in IMG/M
3300010343|Ga0074044_10295761All Organisms → cellular organisms → Bacteria → Acidobacteria1065Open in IMG/M
3300011271|Ga0137393_10725897All Organisms → cellular organisms → Bacteria → Acidobacteria850Open in IMG/M
3300012205|Ga0137362_10548502All Organisms → cellular organisms → Bacteria → Acidobacteria998Open in IMG/M
3300020580|Ga0210403_10306994All Organisms → cellular organisms → Bacteria1300Open in IMG/M
3300020580|Ga0210403_10699514All Organisms → cellular organisms → Bacteria → Acidobacteria812Open in IMG/M
3300020583|Ga0210401_10138548All Organisms → cellular organisms → Bacteria → Acidobacteria2277Open in IMG/M
3300021168|Ga0210406_10797501All Organisms → cellular organisms → Bacteria → Acidobacteria719Open in IMG/M
3300021170|Ga0210400_10063030All Organisms → cellular organisms → Bacteria2892Open in IMG/M
3300021170|Ga0210400_10266267All Organisms → cellular organisms → Bacteria1403Open in IMG/M
3300021178|Ga0210408_11108614All Organisms → cellular organisms → Bacteria → Acidobacteria608Open in IMG/M
3300026320|Ga0209131_1003187All Organisms → cellular organisms → Bacteria10961Open in IMG/M
3300027034|Ga0209730_1022178All Organisms → cellular organisms → Bacteria → Acidobacteria688Open in IMG/M
3300027737|Ga0209038_10060021All Organisms → cellular organisms → Bacteria1140Open in IMG/M
3300027853|Ga0209274_10196758All Organisms → cellular organisms → Bacteria → Acidobacteria1025Open in IMG/M
3300031753|Ga0307477_10002028All Organisms → cellular organisms → Bacteria16046Open in IMG/M
3300031754|Ga0307475_10034784All Organisms → cellular organisms → Bacteria → Acidobacteria3689Open in IMG/M
3300031962|Ga0307479_11334424All Organisms → cellular organisms → Bacteria → Acidobacteria677Open in IMG/M
3300032180|Ga0307471_102291822All Organisms → cellular organisms → Bacteria → Acidobacteria681Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil23.68%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil20.18%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil10.53%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil7.89%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil7.02%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil5.26%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.51%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.63%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.75%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.75%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.75%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.75%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.75%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland0.88%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.88%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.88%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.88%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil0.88%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.88%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa0.88%
Plant LitterEnvironmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter0.88%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.88%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.88%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.88%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004635Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017944Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MGEnvironmentalOpen in IMG/M
3300018034Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_11_10EnvironmentalOpen in IMG/M
3300018090Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_20_MGEnvironmentalOpen in IMG/M
3300020140Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021361Rhizosphere microbial communities from Vellozia epidendroides in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R2Host-AssociatedOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300024287Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK31EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026467Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-AEnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027034Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027635Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027692Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300027727Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027737Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027795Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027829Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM1 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027853Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027879Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300028146Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK23EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030054Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Palsa_N3_1EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032515FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)EnvironmentalOpen in IMG/M
3300032828Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4EnvironmentalOpen in IMG/M
3300032954Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1047813513300001593Forest SoilGRRLFVSFGMHERTSYSQYRRIGPPKEALQAIRAELGKTDSANADP*
JGIcombinedJ26739_10080327223300002245Forest SoilDFDGRRLFVSFSIHERTFYTNYHRIGLPKEALATIRAELGKPGAINADP*
JGI25382J43887_1047173223300002908Grasslands SoilQYDFDGRKFFSSMAVHERTFYSHYRRIGPPKEALALIRAELGKMAGADADP*
JGI25615J43890_102917123300002910Grasslands SoilLWFATYSQYDFDGRRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP*
Ga0062389_10232367513300004092Bog Forest SoilRRLFVSFSIHERTFYTQYRRIGPPKEALATIRAELGKPPAVNGDP*
Ga0062389_10453285413300004092Bog Forest SoilLFVSFSIHERTFYTQYRRIGAPKEALATIRAELGKPAAVNGDP*
Ga0062595_10051263113300004479SoilDGRKLFVTFSIHERTFYSQYRRIGPPKEALADIRAELSKLKAAAGDP*
Ga0062388_10017683843300004635Bog Forest SoilRRLFVSFSIHERTFYTQYRRIGAPKEALATIRAELGKPAAVNGDP*
Ga0066678_1025709413300005181SoilWLPSFSQYDFDGRKFFSSMAVHERTFYSHYRRIGPPKEALALIRAELSKAAGADADP*
Ga0066388_10083763733300005332Tropical Forest SoilYDFDGRKFFSSFSVHERTFYSHYRFIGPPKQALAALRAELDKSIPAASDP*
Ga0070730_1091481923300005537Surface SoilQYDFDGRRFFMSFAIHERTFYSKYQRIGPPKEALQAIRAELGKSTSARADP*
Ga0070695_10050535323300005545Corn, Switchgrass And Miscanthus RhizosphereRLFVSFSIHEHTIYSQYRRIGPPKEALQAIRAELGKTSSANADP*
Ga0066702_1079591923300005575SoilTYSQYDFDGRRLFMSFGVHERTTYSQYRRIGPPKEALQAIRAELGKSTTASAYH*
Ga0070761_1011519933300005591SoilMPSFSQYDFDGRKLFSTISIHERTFYSQYRRIGPPKEALAVIRAELGKPAGAIADP*
Ga0070763_1031097913300005610SoilERWELAPGLWLPTYAQYDFDGRRLFMSFSVHEKTFYTNYKRIGPPKEALAEIRAELSKPDLTTGDP*
Ga0070763_1044421213300005610SoilFSQYDFDGRKLFSAISIHERTFYGQYRRIGPPKEALAAIRTELGKPGGAVADP*
Ga0070766_1038350913300005921SoilYDFDGRRLFSSISIHERTFYGQYRRIGPPKEALAMIRAELGKPAGVVANP*
Ga0075018_1026496513300006172WatershedsQYDFDGRRLFVSFAIHERTFYSQYRRIGPPKEALAAIRTELGKLNTAAGDP*
Ga0070716_10069575913300006173Corn, Switchgrass And Miscanthus RhizosphereQYDFDGRRLFVSFGVHERTFYTQYRYVGPPKEALVAVRAELGKLRAAAADP*
Ga0070765_10097162013300006176SoilMLPGLWVPSFSQYDFDGRKLFSAISIHERTFYGQYHRIGPPKEALAMIRAELGKPGGAVADP*
Ga0099791_1046241113300007255Vadose Zone SoilERYEMAPGLWFATYSQYDFDGRRLFMSFGVHERTSYSQYRRIGPPKEALQAIRAELGKSTTANAYP*
Ga0099793_1023423033300007258Vadose Zone SoilLWFATYAQYDFDGRRFFVSFGIHERTLYSQYRRIGPPKEALQAIRAELGKSTSANADP*
Ga0099794_1020651833300007265Vadose Zone SoilAQYDFDGRRFFVSFGIHERTLYSQYRRIGPPKEALQAIRAELGKSTSANADP*
Ga0099795_1009829633300007788Vadose Zone SoilATYSQYDFEGRRLFVSFGIHERTSYSQYRRIGTPKEALEAIRAELGKTDSANADP*
Ga0099830_1018402843300009088Vadose Zone SoilGRRLFVSFGIHERTFYSQYRRIGPPKEALQAIRAELGKTDSANADP*
Ga0099828_1057068113300009089Vadose Zone SoilYEMAPGLWFSTYSQYDFDGRRLFVTFSVHEHTFYSQYRRIGPPREALQAIRAELGKSTSADAAP*
Ga0099792_1020523013300009143Vadose Zone SoilMAPGLWFVTYSQYDFDGRRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANAAP*
Ga0099792_1078058613300009143Vadose Zone SoilSFSIHERTLYSQYRRIGPPKEALQAIRAELGKSTSANADP*
Ga0105238_1161135813300009551Corn RhizosphereTYSQYDFDGRRLFLNFSVHERTFYTQYKRIGPPKEAVQAIRAELSKPAPSTGDP*
Ga0126373_1069497133300010048Tropical Forest SoilHFMQERYEIAPGLWLPTFSQYDFDGRRLFSSFSVHERTFYSNYRLLGPPKDALAALRAELDKPAPPAADP*
Ga0074044_1029576133300010343Bog Forest SoilDGRKLFSAISIHERTFYTQYHRIGPPKEALAMIRAELGKPAGITADP*
Ga0126372_1044171713300010360Tropical Forest SoilAPGLWLPSYSQYDFEGRKLFVTFVIHERTFYSQYRRIGPPKEALAAIRAELGKSGSASADP*
Ga0137393_1072589713300011271Vadose Zone SoilYSQYDFDGRRLFMSFSIHERTLYSQYRRIGPPKEALQAIRAELGKSTSANADP*
Ga0137389_1161672223300012096Vadose Zone SoilFSTYSQYDFDGRRLFVTFSVHEHTFYSQYRRIGPPREALQAIRAELGKSTSADAAP*
Ga0137388_1184693323300012189Vadose Zone SoilDGRRLFVTFSVHEHTFYSQYRRIGPPREALQAIRAELGKSTSADAAP*
Ga0137363_1115505113300012202Vadose Zone SoilPGLWLPSYSQYDFDGRRLFVSFSIHERTLYTNYRRIGPPKEAVATIRSELGRAELGKRSDADGDP*
Ga0137399_1057716213300012203Vadose Zone SoilLWFATYSQYDFDGRRLFMSFGVHERTSYSQYRRIGPPKEALQAIRAELGKSTTANAYP*
Ga0137362_1054850213300012205Vadose Zone SoilDGRRLFMRFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP*
Ga0137379_1015325843300012209Vadose Zone SoilFDGRKFFSSMAVHERTFYSHYRRIGPPKEALALIRAELSKTAGADADP*
Ga0137385_1006936513300012359Vadose Zone SoilSFSQYDFDGRKFFSSMAVHERTFYSHYRRIGPPKEALALIRAELSKTAGADADP*
Ga0137358_1030370123300012582Vadose Zone SoilWFPTYSQYDFDGRRLFVSFAIHEHTSYSQYRRIGPPKEALQAIRAELGKSTSANADP*
Ga0137419_1139085613300012925Vadose Zone SoilFATYAQYDFDGRRLFVSFGLHERTSYSQYRRIGPPKEALQAIRAELGKSTTADVFP*
Ga0137416_1151485423300012927Vadose Zone SoilEMAPGLWFATYSQYDFEGRRLFMSFGLHERTSYSQYRRIGPPKEALQAIRAELGKSTTADVFP*
Ga0137405_111837633300015053Vadose Zone SoilLPSYSQYDFDGRRLFVSFSIHERTLYTNYRRIGPPKEAVATIRSELGRAELGKRSDADGDP*
Ga0137418_1058512123300015241Vadose Zone SoilLMQERYEMAPGLWFATYSQYDFDGRRLFMSFGVHERTSYSQYRRIGPPKEALQAIRAELGKSTTANAYP*
Ga0137403_1125551613300015264Vadose Zone SoilTYSQYDFDGRRLFVSFGIHEHTTYSQYRRIGPPKEALQAIRAELGKSTTANADP*
Ga0187825_1019931713300017930Freshwater SedimentRYEVQPGLWMPTFSQYDFDGRKFFSSISIHEQTYYSQYRRIGPPKEALTEIRAELGMPGAPVTDP
Ga0187786_1031505813300017944Tropical PeatlandDGRRLFMSFGIHERTFYSQYRYVGPPKEALVMVRAELGKLHALAADP
Ga0187863_1005304143300018034PeatlandLPSYSQYDFDGRRLFVSFSIHERTLYTRYHRIGPPKEALATIRSELGRAELGKPGTANAD
Ga0187770_1145216223300018090Tropical PeatlandLWLPSFSQYDFDGRRLFSGFSIHEGTFYWNYRYIGPPQEALEVVRKELGHADLGKPVP
Ga0179590_102647613300020140Vadose Zone SoilGIWLPTYSQYDFDGRKFFVPFSVHERTFYTKYRYIGQPAEALSTIRAELGKSSPSVADR
Ga0179592_1018857923300020199Vadose Zone SoilSPGLWFPTYSQYDFDGRRLFVSFGIHEHTTYSQYRRIGPPKEALQAIRAELGKSTTANAD
Ga0210403_1030699413300020580SoilYDFDGRKLFSTISIHERTFYGQYHRIGPPKEALAMIRAELGKPGGAVADR
Ga0210403_1069951423300020580SoilRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0210399_1121627613300020581SoilWMPSFSQYDFDGRRLFSSISIHERTFYGQYHRIGPPKEALAMIRAELGKPGGAVANP
Ga0210401_1013854813300020583SoilFSQYDFDGRKLFSSISIHERTFYGQYHRIGPPKEALAMIRAELGKPGGAVADP
Ga0210401_1026064143300020583SoilWLPSYSQYDFDGRRLFVSFSIHERTFYTQYRRIGAPKEALATIRAELGKPAAVNGDP
Ga0210406_1079750113300021168SoilYEMAPGLWFATYSQYDFDGRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0210406_1097878223300021168SoilRYEMLPGLWMPSFSQYDFDGRKLFSTISIHERTFYGQYHRIGPPKEALAMIRAELGKPGGAVADR
Ga0210400_1006303043300021170SoilYEVAPGLWFATYSQYDFDGRRFFLSFSVHERTTYSQYHRIGPPKEALQAIRAELGKSTSAKAYP
Ga0210400_1026626733300021170SoilGLWFATYAQYDFDGRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0210400_1139783823300021170SoilTYSQYDFDGRRLFVSFGIHEHTSYSQYRRIGPPKEALQAIRAELGKSTTAKAEP
Ga0210408_1110861423300021178SoilLFVSFSIHERTLYSQYRRIGPPKEALQAIRAELGKSPSASADP
Ga0213872_1031757513300021361RhizospherePGLWLPTYAQYDFDGRRLFMNFSVHERTFYTQYKRIGPPKEALAEIRAELSKPARSTGDP
Ga0210397_1027869433300021403SoilWFPTYSQYDFDGRRLFVNFSVHERTFYTQYKRIGPPKEAVEEIRAELSKPAPSTGDP
Ga0210387_1112211813300021405SoilGLWLPSYSQYDFDGRRLFVSFSIHERTFYTQYRRIGPPKEALATIRAELGKPAAVNADP
Ga0210390_1013151653300021474SoilPGLWLPSYSQYDFDGRRLFVSFSIHERTFYTQYRRIGPPKEALATIRAELGKPAAVNGDP
Ga0210392_1056912323300021475SoilLFVNFSVHERTFYTQYKRIGPPKEAVEEIRAELSKPAPSTGDP
Ga0210398_1078702713300021477SoilYDFDGRRLFVAFSIHERTLYTRYHRIGPPKEALATIRSELGRAESGKPGAADADP
Ga0210402_1057801033300021478SoilSFAIHERTFYSQYRRIGPPKEALAAIRAELGKLHTAAGDP
Ga0210409_1104805723300021559SoilLIQDRYEMAPGLWFATYSQYDFDGRRLFVSFAVHERTTYSQYRRIGPPKEALQAIRAELGKSTIASSAP
Ga0247690_101210913300024287SoilLWMPSFSQYDFDGRKLFSSISIHERTFYSQFHRIGPPKEALAAIRAELGKPGAALANP
Ga0207642_1011568013300025899Miscanthus RhizosphereGVWFPSYAQYDFDGRRLFVSFGIHERTFYSQYHYVGPPKEALVTIRAELGKLRATVAEP
Ga0207699_1016905913300025906Corn, Switchgrass And Miscanthus RhizosphereDFDGRKLFSSISIHERTFYSQFHRIGPPKEALAAIRAELGKPGAALANP
Ga0209235_103670143300026296Grasslands SoilPSFSQYDFDGRKFFSSMAVHERTFYSHYRRIGPPKEALALIRAELSKAAGADADP
Ga0209131_1003187113300026320Grasslands SoilRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0257154_103155723300026467SoilSYSQYDFDGRRLFVSFSIHERTFYTRYRRIGPPKEALAAIRAELGKTGAANADP
Ga0179587_1058780613300026557Vadose Zone SoilMAPGLWFATYSQYDFDGRRLFMSFGVHERTSYSQYRRIGPPKEALQAIRAELGKSTTANAYP
Ga0209730_102217823300027034Forest SoilQDRYEMAPGLWFATYSQYDFDGRRFFLSLSVHERTTYSQYRRIGPPKEALQAIRAELGKSTSAKAYP
Ga0209625_105801623300027635Forest SoilEMAPGLWFATYSQYDFDGRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0209217_120943123300027651Forest SoilSQYDFDGRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0209588_114773413300027671Vadose Zone SoilQYDFDGRRFFVSFGIHERTLYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0209118_119674213300027674Forest SoilGLWFATYAQYDFEGRRLFVSFAIHERTSYSQYRRIGPPKEALEAIRAELGKSTSANADP
Ga0209530_103970833300027692Forest SoilLAPGLWLPTYAQYDFDGRRLFMSFSVHEKTFYSQYKRIGPPKEALAEIRAELSKPALPTGDP
Ga0209328_1006163633300027727Forest SoilAPGLWFATYSQYDFDGRRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANAD
Ga0209038_1006002133300027737Bog Forest SoilDFDGRKLFSSISIHERTFYGQYHRIGPPKEALAMVRAELGKPGGAVADR
Ga0209073_1020614213300027765Agricultural SoilYEVLPGLWMPTFSQYDFDGRKLFSSINVHERTFYSQYHRIGPPKEALLSIRAELGKPATQASDP
Ga0209139_1004269643300027795Bog Forest SoilDFDGRRLFVSFSIHERTFYTQYRRIGPPKEALATIRAELNKPAAVNGDP
Ga0209773_1008243013300027829Bog Forest SoilMLPGLWMPSFSQYDFDGRKLFSAISIHERTFYGQYHRIGPPKEALATVRAELGKPGGTVADP
Ga0209580_1061326223300027842Surface SoilYSQYDFDGRRLFVSFSMHERTLYSQYRRIGPPKEALQAIRAELGKSPSASADP
Ga0209274_1019675813300027853SoilMPSFSQYDFDGRKLFSTISIHERTFYSQYRRIGPPKEALAVIRAELGKPAGAIADP
Ga0209693_1004297843300027855SoilERWELAPGLWLPTYAQYDFDGRRLFMSFSVHEKTFYTNYKRIGPPKEALAEIRAELSKPDLTTGDP
Ga0209701_1000477193300027862Vadose Zone SoilDGRRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKPTSANADP
Ga0209169_1018367613300027879SoilYKGSHFMQERWELAPGLWLPTYAQYDFDGRRLFMSFSVHEKTFYTQYKRIGPPKEALAEIRAELSKPALTTGDP
Ga0209006_1152285913300027908Forest SoilQYDFDGRRLFMSFSVHEKTFYTNYKRIGPPKEALAEIRAELSKPDLTTGDP
Ga0247682_108447623300028146SoilHEVLPGLWMPSFSQYDFDGRKLFSSISIHERTFYSQFHRIGPPKEALAAIRAELGKPGAALANP
Ga0222749_1011330833300029636SoilAPGLWFATYSQYDFDGRRLFVSFGIHERTSYSQYRRIGPPKEALQAIRAELGKSTTANAD
Ga0222749_1030597423300029636SoilLPSYSQYDFDGRRLFVSFSIHERTFYTRYRRIGPPKEALAAIRAELGKTGAANADP
Ga0302182_1000394813300030054PalsaLMQERYEMAPGLWFPSYAQYDFDGRKLFMSFAIHDRTFYSQYKRVGPPKEALSAIRAELGKPDSAQADR
Ga0307474_1001037213300031718Hardwood Forest SoilEMAPGLWFATYSQYDFDGRRLFVSFGIHERTSYTQYRRIGPPKEALQAIRAELGKSTTANADP
Ga0307477_1000202813300031753Hardwood Forest SoilPGLWFATYSQYDFDGRRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0307477_1033646113300031753Hardwood Forest SoilRLFVSFSMHERTLYSQYRRIGPPKEALQAIRAELGKSPSASADP
Ga0307475_1003478453300031754Hardwood Forest SoilMQERYEMAPGLWFASYSQYDFDGRRLFVSFSIHERTLYSQYRRIGPPREALQAIRAELGKSPSANADP
Ga0307478_1107008613300031823Hardwood Forest SoilYSQYDFDGRRLFVSFSIHERTFYTQYRRIGPPKEALATIRAELGKPAAVNADP
Ga0307479_1098264723300031962Hardwood Forest SoilGLWMPTFSQYDFDGRKLFSSISIHERTFYGQYHRIGPPKEALAMIRAELGKPGGNVADP
Ga0307479_1133442413300031962Hardwood Forest SoilGRRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0307471_10003559113300032180Hardwood Forest SoilRRLFVSFGIHERTFYSNYRRIGPPKEALQAIRAELGKVTTAKADP
Ga0307471_10024297343300032180Hardwood Forest SoilLMQERYEMAPGVWFPSYAQYDFDGRRLFVSFAIHERTFYSQYRYVGPPKEALVTIRAELGKLRANVAEP
Ga0307471_10088664713300032180Hardwood Forest SoilRRLFMSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0307471_10229182213300032180Hardwood Forest SoilSHFMQERYEMAPGLWFASYSQYDFDGRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0307472_10164289313300032205Hardwood Forest SoilMQERYDMAPGLWFASYSQYDFDGRRLFVSFSIHERTFYSQYRRIGPPKEALQAIRAELGKSTSANADP
Ga0348332_1441337013300032515Plant LitterFDGRKLFSSFSIHERTFYGQYHRIGSPKEALATIRAELGKPGGSLTDP
Ga0335080_1189638413300032828SoilSFSQYDFDGRKLFSSISIHERTFYRQYRRIGPPKEALASIRAELNKSGPATADP
Ga0335083_1068285623300032954SoilWELAPGLWFPTYSQYDFDGRRLFMSFSLHERTFYTQYKRIGPPKEALAEIRAELSKPALTTGDP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.