NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F069393



Overview

Basic Information
Family ID F069393
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 47 residues
Representative Sequence APLGIPEVPPIPQLPVQVSNGKLAAAVAVLQPDAAARHNGDDADSGA
Number of Associated Samples 96
Number of Associated Scaffolds 124
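
The representative sequence can be checked against the stated average length with a few lines of Python. This is only an illustrative sketch: the variable names are hypothetical, and the composition count is an extra convenience that the page itself does not report.

```python
from collections import Counter

# Representative sequence copied from the Basic Information block.
REPRESENTATIVE = "APLGIPEVPPIPQLPVQVSNGKLAAAVAVLQPDAAARHNGDDADSGA"

# Length in residues; the page lists 47 as the family average.
length = len(REPRESENTATIVE)

# Residue composition, most frequent first.
composition = Counter(REPRESENTATIVE)

print(f"length: {length} residues")
for residue, count in composition.most_common(5):
    print(f"{residue}: {count} ({100 * count / length:.1f} %)")
```

Alanine and proline dominate the sequence, which fits the fully coil/unstructured prediction reported in the Structure & Topology section but does not by itself prove it.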

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 2.42 %
% of genes from short scaffolds (< 2000 bps) 2.42 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.21


Most Common Taxonomy
Group Unclassified (98.387 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere (16.129 % of family members)
Environment Ontology (ENVO) Unclassified (25.806 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (40.323 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 0.00%, Coil/Unstructured: 100.00%

Predicted 3D Structure

pTM-score: 0.21

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF00106 | adh_short | 5.65
PF01242 | PTPS | 4.84
PF13466 | STAS_2 | 4.03
PF02511 | Thy1 | 3.23
PF02861 | Clp_N | 3.23
PF13602 | ADH_zinc_N_2 | 2.42
PF02954 | HTH_8 | 2.42
PF00248 | Aldo_ket_red | 2.42
PF12802 | MarR_2 | 1.61
PF00313 | CSD | 1.61
PF14691 | Fer4_20 | 1.61
PF01740 | STAS | 0.81
PF04545 | Sigma70_r4 | 0.81
PF00211 | Guanylate_cyc | 0.81
PF00378 | ECH_1 | 0.81
PF00589 | Phage_integrase | 0.81
PF01590 | GAF | 0.81
PF01061 | ABC2_membrane | 0.81
PF12840 | HTH_20 | 0.81
PF08241 | Methyltransf_11 | 0.81
PF07883 | Cupin_2 | 0.81
PF08240 | ADH_N | 0.81
PF07730 | HisKA_3 | 0.81
PF02702 | KdpD | 0.81
PF08669 | GCV_T_C | 0.81
PF06445 | GyrI-like | 0.81
PF01636 | APH | 0.81
PF00440 | TetR_N | 0.81
PF07228 | SpoIIE | 0.81
PF00107 | ADH_zinc_N | 0.81
PF12695 | Abhydrolase_5 | 0.81
PF13376 | OmdA | 0.81
PF02909 | TetR_C_1 | 0.81
PF14690 | zf-ISL3 | 0.81
PF00196 | GerE | 0.81
PF02604 | PhdYeFM_antitox | 0.81
PF13191 | AAA_16 | 0.81
PF14520 | HHH_5 | 0.81
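
The frequencies are percentages of the 124 family scaffolds, so each value corresponds to a whole number of scaffolds. The sketch below recovers those counts by rounding; the function name `scaffold_count` is hypothetical, and the assumption that every percentage was computed as count / 124 is inferred from the column header rather than stated by the database.

```python
# Total number of scaffolds in the family, as given on the page.
FAMILY_SCAFFOLDS = 124

def scaffold_count(percent, total=FAMILY_SCAFFOLDS):
    """Convert a rounded percentage back to the nearest whole scaffold count."""
    return round(percent / 100 * total)

# A few (Pfam ID, name, % frequency) rows taken from the table.
rows = [
    ("PF00106", "adh_short", 5.65),
    ("PF01242", "PTPS", 4.84),
    ("PF13466", "STAS_2", 4.03),
    ("PF02511", "Thy1", 3.23),
    ("PF13602", "ADH_zinc_N_2", 2.42),
    ("PF12802", "MarR_2", 1.61),
    ("PF01740", "STAS", 0.81),
]

for pfam_id, name, percent in rows:
    print(f"{pfam_id} {name}: {scaffold_count(percent)} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under that assumption the most frequent neighbor, adh_short, appears on only 7 scaffolds, and every 0.81 % entry is a single scaffold.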

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG0720 | 6-pyruvoyl-tetrahydropterin synthase | Coenzyme transport and metabolism [H] | 4.84
COG0542 | ATP-dependent Clp protease, ATP-binding subunit ClpA | Posttranslational modification, protein turnover, chaperones [O] | 3.23
COG1351 | Thymidylate synthase ThyX, FAD-dependent family | Nucleotide transport and metabolism [F] | 3.23
COG1309 | DNA-binding protein, AcrR family, includes nucleoid occlusion protein SlmA | Transcription [K] | 0.81
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 0.81
COG2161 | Antitoxin component YafN of the YafNO toxin-antitoxin module, PHD/YefM family | Defense mechanisms [V] | 0.81
COG2205 | K+-sensing histidine kinase KdpD | Signal transduction mechanisms [T] | 0.81
COG3850 | Signal transduction histidine kinase NarQ, nitrate/nitrite-specific | Signal transduction mechanisms [T] | 0.81
COG3851 | Signal transduction histidine kinase UhpB, glucose-6-phosphate specific | Signal transduction mechanisms [T] | 0.81
COG4118 | Antitoxin component of toxin-antitoxin stability system, DNA-binding transcriptional repressor | Defense mechanisms [V] | 0.81
COG4564 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 0.81
COG4585 | Signal transduction histidine kinase ComP | Signal transduction mechanisms [T] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 98.39 %
All Organisms | root | All Organisms | 1.61 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300020579|Ga0210407_10389137 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1091
3300021559|Ga0210409_11240935 | Not Available | 621
3300025929|Ga0207664_10189160 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1771




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 16.13%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 13.71%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 9.68%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 5.65%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.65%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 4.03%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 4.03%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.03%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.23%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 3.23%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.23%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.23%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.23%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 3.23%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 2.42%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.61%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 1.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.61%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.81%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 0.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.81%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.81%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.81%
Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.81%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.81%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.81%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2170459010 | Grass soil microbial communities from Rothamsted Park, UK - December 2009 direct MP BIO1O1 lysis 0-9cm (no DNA from 10 to 21cm!!!) | Environmental
2170459024 | Grass soil microbial communities from Rothamsted Park, UK - FD1 (NaCl 300g/L 5ml) | Environmental
2199352025 | Soil microbial communities from Rothamsted, UK, for project Deep Soil - DEEP SOIL | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006174 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009521 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaG | Environmental
3300009698 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_3_AS metaG | Environmental
3300010085 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010866 | Boreal forest soil eukaryotic communities from Alaska, USA - C1-3 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300014325 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG | Host-Associated
3300014969 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG | Host-Associated
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300017821 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW-S_2 | Environmental
3300017924 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_5 | Environmental
3300017928 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_1 | Environmental
3300017932 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW-S_4 | Environmental
3300017933 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1 | Environmental
3300017937 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_4 | Environmental
3300017942 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_3 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300018001 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW-S_5 | Environmental
3300018025 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_20_100 | Environmental
3300018026 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_100 | Environmental
3300018043 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_7_10 | Environmental
3300019189 | Soil microbial communities from Bohemian Forest, Czech Republic - CSU3 metaT (Metagenome Metatranscriptome) | Environmental
3300020012 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s1 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300024279 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK33 | Environmental
3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental
3300025905 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025929 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes) | Environmental
3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated
3300026864 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_O3 (SPAdes) | Environmental
3300027641 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_8_FC metaG (SPAdes) | Environmental
3300027648 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_O1 (SPAdes) | Environmental
3300028715 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203 | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028819 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153 | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031543 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f20 | Environmental
3300031544 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f26 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031897 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f16 | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032756 | Forest Soil Metatranscriptomics Site 2 Humus Litter Mineral Combined Assembly | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental
3300032954 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2 | Environmental
3300032955 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5 | Environmental
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
F62_06626240 | 2170459010 | Grass Soil | GAPLEIPEVSQIPQLPAQVSNGKLAAAVAVLEPDAVARRNGGDTHPGT
FD1_10736680 | 2170459024 | Grass Soil | VAKEEAEIESGAPLEIPQVPPIPTLPTQVGNGKLTAAAVAPNNDDDTDPGA
deepsgr_02543890 | 2199352025 | Soil | SDQVAVTAEAEIPGGAPLEIPEVPPIPQLPVQVSNGKAAAAVAVLEPDAVARHNGDDAHSGG
Ga0066677_104298661 | 3300005171 | Soil | DVATEEAEIESGAPLEIPEVPPIPQLPVQVSNGKQAAAVAVLPDAASGHNGDDAHPDA*
Ga0066683_105312291 | 3300005172 | Soil | PLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDAAPGA*
Ga0070714_1020387992 | 3300005435 | Agricultural Soil | APLEIPEVPPIPQLPVQVSNGKQAAAVAVLPDAASGHNGEDAHPDA*
Ga0070710_104088062 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | DVAKEEAEIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHKDDDADSGA*
Ga0070711_1008969541 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | GAPLEIPEVPQIPQLPVQVSNGKVAAAVAVLEPDAVARHNGDDVHSGV*
Ga0070706_1000758246 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | EDVAKEEAEIESGAPLQIPEVPPIPTLPMQAGNGKQAAAAVARHNDDDADPGA*
Ga0070698_1002874183 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | EITSGAPLEIPEVPQIPQLPVQVSNGKVAAAVAVAPHEDEAPSGA*
Ga0070698_1009683132 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | GNEDVAKEEAEIESGAPLQIPEVPPIPTLPMQAGNGKQAAAAVARHNDDDADPGA*
Ga0070698_1009750021 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | AAEAEITSGAPLEIPEVPPIPELPVQASNGTLAAAVAVPQPDAVARHNDDATDSGA*
Ga0070698_1011454451 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | PLEIPEVPQIPQLPVQVSNGKVAAAAVPQPDAVARHNGDDAHLGV*
Ga0070698_1011836592 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | AEAEAEITGGAPLEMPEVPPIPQLPIQVSNGELAAAVAVPQPDAVARHSDDEHPGA*
Ga0070697_1001815921 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | IPEVPPVPELPMQVSNGKLAAATVPQPDAVARHNDDAADSGA*
Ga0070696_1001996514 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | APLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDTDPGA*
Ga0066695_103488311 | 3300005553 | Soil | AEIESGAPLEIPEVPPIPTLPTQVGNGKLAAPAVPQPDAVARRNDDDADSGA*
Ga0066706_108718971 | 3300005598 | Soil | IESGAPLQIPQVPPIPTLPTQVGNGKLAAPAVPQPDAVARRNDDDADSGA*
Ga0070763_109399701 | 3300005610 | Soil | APLEIPEVPPIPELPTQASNGKLPVAAAAVPQPDAAARHNDHATDPGE*
Ga0068864_1010077761 | 3300005618 | Switchgrass Rhizosphere | APLEIPQVPPIPTLPTQVGNGKLAAAAVARHKDDDADSGA*
Ga0070717_101534084 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | EAEIESGAPLQIPEVPPIPTLPTQAGNGKLAAAVARHNDDDADSGA*
Ga0070717_115049862 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | ITGGAPLGIPEVPPIPRPSVQVSNGKLAAVAAQSDATARDVGDAYPGA*
Ga0066652_1008949211 | 3300006046 | Soil | EEAEIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDAAPGA*
Ga0070715_102098612 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | IPEVPPIPTLPMQVGNGKLAAGAVARNNDDDADSGA*
Ga0070716_1010091843 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | IESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDTDPGA*
Ga0075014_1006213141 | 3300006174 | Watersheds | IPQVPPIPTLPMQTGNGKLAATTVPRPGAAARHNDDAADPGA*
Ga0070712_1014076021 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | ADFGDEDVAKEEAEIESGAPLQIPEVPPIPTLPTQAGNGKLAAAVARHNDDDADSGA*
Ga0079222_109243391 | 3300006755 | Agricultural Soil | PQLPVQVSNGKLAAAVAVLEPDAVAGHNGDDTHPGT*
Ga0079222_109405731 | 3300006755 | Agricultural Soil | ITSEAPLEIPEVPPIPQLPVQVSNGKQAAAVAVLPDAASGHNGEDAHPDA*
Ga0079221_101231471 | 3300006804 | Agricultural Soil | IPEVPPIPTSPMRAGNGKVAAAAVARHSGDDADSGA*
Ga0079221_114923801 | 3300006804 | Agricultural Soil | PPIPTLPIQAGNGKQAAAAAPRPDAVARHNDDDADPGA*
Ga0075425_1021113341 | 3300006854 | Populus Rhizosphere | PEVPPIPQLPVQVSNGQQTAPVAVLPDAPARHNGDDTHSDA*
Ga0075425_1028764781 | 3300006854 | Populus Rhizosphere | IESGAPLQIPEVPPIPTLPIQAGNGKQAAAAAPRPDAVARRNDDDADPGA*
Ga0075435_1012405211 | 3300007076 | Populus Rhizosphere | PLEIPQVPPIPTLPTPVGNGKLAAAAVARHNNDDTAPGA*
Ga0075435_1016170191 | 3300007076 | Populus Rhizosphere | DEDVATEEAEIESGAPLQIPEVPPIPTLPPQAGNGNMAAAAAPQPGTTPRHNNNDADPGA
Ga0099828_110349851 | 3300009089 | Vadose Zone Soil | PLQIPEVPPIPTLPAHAGNASLAPAAVPQPATMAQHNDDNADSGP*
Ga0066709_1041623171 | 3300009137 | Grasslands Soil | EEAEIESGAPLQIPQVPPIPTLPTQVGNGELAAAAVAQPGAVARHNDDSADPGA*
Ga0075423_119488881 | 3300009162 | Populus Rhizosphere | VPPIPALPAQAGNGKVAAAAAPRPDAAVRRNDDDADPGA*
Ga0075423_123626071 | 3300009162 | Populus Rhizosphere | EEAEIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHKDDDADSGA*
Ga0075423_129225371 | 3300009162 | Populus Rhizosphere | TEEAEIESGAPLQIPEVPPIPTLPPQAGNGNMAAAAAPQPGTTPRRNNNDADPGA*
Ga0116222_12312731 | 3300009521 | Peatlands Soil | IPQLPAQVSNGKPAAAVAVLQPDAAARHNGDDADSGA*
Ga0116216_102869602 | 3300009698 | Peatlands Soil | APLGIPEVPPIPQLPVQVSNGKLAAAVAVLQPDAAARHNGDDADSGA*
Ga0116216_109131051 | 3300009698 | Peatlands Soil | AAAEAEITGGAPLEIPEVPPIPQLPIPESNGKLAAAAVLQPDTVAQHSDDDAHSGA*
Ga0127445_10818351 | 3300010085 | Grasslands Soil | EDVAKEEAEIESGAPLEIPQVPPIPTLSTPVGNGKLAAAAAARHTDDDAAPGA*
Ga0134126_111849111 | 3300010396 | Terrestrial Soil | PQLPVQVSNGKVAAAVAVLEPDAVARHNGDDAHSGA*
Ga0134124_103263573 | 3300010397 | Terrestrial Soil | IPEVPQIPVQVSNGKLTAAVAVPQPDAVARRNGDAHSDV*
Ga0126344_10387751 | 3300010866 | Boreal Forest Soil | LPVQVSNGKLGTAAVPQPDAVPQPDAVARHSDDDAHSGS*
Ga0150983_142618871 | 3300011120 | Forest Soil | EEAAIESGAPLEIPAVPPIPTLPVQAANGELAAAVVPQPGAVARHADDADSGA*
Ga0150983_155596003 | 3300011120 | Forest Soil | EIPDVPQIPQLPVQVSNGKLAAAVAVLEPDAVARHNGDDTHLGT*
Ga0150983_160088721 | 3300011120 | Forest Soil | PQLPAQVSNGKQTAAVAVLPDAPARHNGDDAHSDA*
Ga0137389_100605041 | 3300012096 | Vadose Zone Soil | AAEAEITSGAPLEIPAVPPIPQPSIRVSNGNLAAAAVPEPNAVARHSDDDAHSGA*
Ga0137364_108087002 | 3300012198 | Vadose Zone Soil | VAREEAEIESGAPLEIPEVPPIPMQAANGELAAAAVAVPQPGAVARHNDDADSGA*
Ga0137378_108619521 | 3300012210 | Vadose Zone Soil | DVAAAEAEITSGAPLEIPEVPQIPQIPQLPVQVSNGKQTAAVAVLQPDAMARHNGDDAHSGA*
Ga0137390_107644482 | 3300012363 | Vadose Zone Soil | LEIPAVPPIPQPSIHVSNGSLAAAAVPQPDAVARHGDDDARSGA*
Ga0164309_107886041 | 3300012984 | Soil | EAESTSGAPLEIPEAPPVPQLPVQVSNRTQTVAVAVQPDAAARHNGGDAHSDA*
Ga0134078_103486131 | 3300014157 | Grasslands Soil | EEAQIESGAPLQIPQVPPIPTLPAQVGNSELVAAVARHNDDADPGA*
Ga0134079_104376001 | 3300014166 | Grasslands Soil | EDVAKEEAEIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDAAPGA*
Ga0134079_106431461 | 3300014166 | Grasslands Soil | SGAPLQIPEVPPIPTLPMQVGNGKVAAAAVPQPGAAGRHSDDADSGA*
Ga0163163_129527111 | 3300014325 | Switchgrass Rhizosphere | FGNEDVAKEEAEIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDTDPGA*
Ga0157376_127167382 | 3300014969 | Miscanthus Rhizosphere | AIPEVPQIPVQVSNGKLTAAAAVPQPDAVARRNGDAHSGV*
Ga0132258_114088523 | 3300015371 | Arabidopsis Rhizosphere | EIESGAPLQIPEVPPIPTLPLQAGNGNMAAAAAPQPGTTPRRNNNDADPGA*
Ga0182034_117466901 | 3300016371 | Soil | AAEAEAAITGGGPLEVPEAPPIPQLPIQVSNGKPAAAAAVPQPDAAARHSDGDAHSGG
Ga0187812_12632551 | 3300017821 | Freshwater Sediment | APLEIPEVPPIPQPPIQVSNGNLAAAAVPQPDAVAQHSGDDAHSSAKAPPA
Ga0187820_11134231 | 3300017924 | Freshwater Sediment | PQLPVQVSNGNLAAAAVPQPDAVAQHSGDDAHSSAKAPPA
Ga0187806_13304722 | 3300017928 | Freshwater Sediment | EIPEVPPIPQLPVQVSNGKLAAAVAVPQPDAVALHNGDNADLGASAPPA
Ga0187814_100622011 | 3300017932 | Freshwater Sediment | AAITGGAPLEIPEVPPIPQLPAQVSNGKLAAAAAAPQPDAAARHSDDDR
Ga0187814_100645101 | 3300017932 | Freshwater Sediment | DVAEAEAEITSGAPLEVPEVPPIPQLPIQVSNGKLAAAAMPQPGGAARHSDGDEHPGE
Ga0187814_104304611 | 3300017932 | Freshwater Sediment | AAEAEITGGAPLEIPEVPPIPQLPIQAGNGQVAAAAVLQPDVVAGRGDDDAQPGV
Ga0187801_101781592 | 3300017933 | Freshwater Sediment | GGAPLEIPEVPPIPQLPISESNGKLAAAAVLQPDTVARHSDDDTHSGA
Ga0187809_101945621 | 3300017937 | Freshwater Sediment | APLEIPEVPPIPQLPISESNGKLAAAAVLQPDTVARHSDDDTHSGA
Ga0187808_101740511 | 3300017942 | Freshwater Sediment | AAEAEITSGAPLEIPEVPPIPQLPIQVSNGKLAAAAAVLQPDAAAPHNDDAHPGA
Ga0187808_105083172 | 3300017942 | Freshwater Sediment | FGGEDVAAAEAQITGGVPLEIPEVPPIPQLPIQVSNGKLAAAAVLQPDAVARHSGDDAHPGA
Ga0187817_107086392 | 3300017955 | Freshwater Sediment | ITGGAPLEIPEVPPLPQLPVQAGNGKLAAAAVTQPDAAARHSDDGAH
Ga0187815_100480821 | 3300018001 | Freshwater Sediment | PPIQVSNGNLAAAAVPQPDAVAQHSGDDAHSSAKAPPA
Ga0187885_101350171 | 3300018025 | Peatland | PPIPELPLQVSNGKLAATAAVPQPARHDDDAAESGA
Ga0187857_103361763 | 3300018026 | Peatland | APLEIPEVPPIPELPLQVSNGKLAATAAVPQPARHDDDAAESGA
Ga0187887_104338451 | 3300018043 | Peatland | IESGAPLEIPEVPPIPELPLQVSNGKLAATAAVPQPARHDDDAAESGA
Ga0184585_1353502 | 3300019189 | Soil | EAEVQSGAPLEIPDVPLIPQLPLQASNGKLAAAAAGPVPDPVTRPSDDAADSGA
Ga0193732_10566862 | 3300020012 | Soil | PQIPQLPVQVSNGKVAAAVAVLEPDAVARHNGDDAHSGA
Ga0210407_103891371 | 3300020579 | Soil | PQLPAQVSNGKQTAAVAVLPDAPARHNGDDAHSDA
Ga0210399_103170834 | 3300020581 | Soil | TSGAPLEIPEVPPVPQLPAQVSNGKQTAAVAVLPDAPARHNGDDAHSDA
Ga0210399_106154421 | 3300020581 | Soil | PPIPTLPMQVGNGRLAATAVPQPGAVAGHDDDDADSGA
Ga0210383_102034773 | 3300021407 | Soil | DFGNEDVDKEEAEIQSGAPLQIPEVPPIPTSPMQVGNGKLTAAVVAPHNDDADPGA
Ga0210383_105094532 | 3300021407 | Soil | AEAEIENGAPLEIPEVPPIPELPRHASNGNLAAAAAVPQPATAARHNHDAADPGA
Ga0210383_113802382 | 3300021407 | Soil | EVAPIPQLPIQVSNGKVAAAAMPQPDAAARHSDGDAHPGA
Ga0210384_107272851 | 3300021432 | Soil | AEAEITSGAPLEIPEVPPIPQLPVQVSNGKLVAAGAVLQPDTAARHDDDDAHPGA
Ga0210402_104347691 | 3300021478 | Soil | PIPQLPVQVSNGKLTAAVAVLQPDAVAQHNGDDPHSDV
Ga0210402_105502961 | 3300021478 | Soil | LEIPEVPPIPTPPMQVGNGKLAVAAVARHNNDAADPGA
Ga0210402_107558151 | 3300021478 | Soil | ITSEAPLEIPEVPPIPQLPVQVSNGKQAAAVAVLPDAASGHNGEDAHPDA
Ga0210409_112409351 | 3300021559 | Soil | AEITSGAPLEIPEVPQLPQLPVQVSNGELAAAGAVLQPDAVARHNGGDAHSDV
Ga0247692_10394231 | 3300024279 | Soil | VPQIPVQVSNGKLTAAAAVPQPDAVARRNGDAHSDV
Ga0207692_103047681 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | NEDVEKEEAEITSGALEIPEVPPIPRLPTHASNGKLAAAVPRPDAAAQHNNDTVDSGA
Ga0207685_104658451 | 3300025905 | Corn, Switchgrass And Miscanthus Rhizosphere | EIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDTDPGA
Ga0207684_100571785 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | EIPEVLPIPELPMQASNGKLAAAAAVPQPDASARDNGDAADSGA
Ga0207684_101160633 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | PPLEIPEVPPIPQLPMQASDGQLAAAAAAPEPDAVARHDDTAADSGA
Ga0207693_110877982 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | ADFGDEDVAKEEAEIESGAPLQIPEVPPIPTLPTQAGNGKLAAAVARHNDDDADSGA
Ga0207664_101891601 | 3300025929 | Agricultural Soil | LEIPEVPRIPQLPVQVSNGQQTAPVAVLPDAPARHNGDDAHSDA
Ga0207641_109448471 | 3300026088 | Switchgrass Rhizosphere | GAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDTDPGA
Ga0209621_10115241 | 3300026864 | Forest Soil | EVPQIPVQVGNGKLTAAVAVPQPDAVARRNGDAHSDV
Ga0208827_10223644 | 3300027641 | Peatlands Soil | TSGAPLGIPEVPPIPQLPVQVSNGKLAAAVAVLQPDAVAPHNGDADSGA
Ga0209420_10024686 | 3300027648 | Forest Soil | ESGAPIEIPEVPPIPQLPLQATNGKLATVAAGPPPDAVARPNDDAADSGA
Ga0307313_102166311 | 3300028715 | Soil | SDQVAVTAEAEIPGGAPLAIPEVPPIPQLPVQVSNGKVAAAVAVLEPDAVARHNGDDTHSGA
Ga0307282_104858501 | 3300028784 | Soil | APLEIPQVPPIPTLPTPVGNGKLAAAAMARHNDDDAAPGA
Ga0307296_100758294 | 3300028819 | Soil | EEAEIESGAPLAIPQVPPIPTLPTPVGNGKLAAAAAVPQPDAAARHSDDADSGA
Ga0307296_107745961 | 3300028819 | Soil | LAIPEVPQIPQLPVPASNGKLTAAVAVPQPDAAARRNGDDAHLGV
Ga0170824_1244718451 | 3300031231 | Forest Soil | LEIPEVPPIPQLPVQVSNGKLAAAAAVPQPDPVARHNNDDAHSGA
Ga0170818_1013608751 | 3300031474 | Forest Soil | PIPQLPAQVSNGKLAAAAAVPQPDPVARHNNDDAHSGA
Ga0318516_102017493 | 3300031543 | Soil | VPPIPELPGQASNGKLAAAAVPQPDAAARHNHDAADSGA
Ga0318534_101106541 | 3300031544 | Soil | PLEIPEVPPIPELPGQASNGKLAAAAVPQPDAAARHNHDAADSGA
Ga0307475_110761371 | 3300031754 | Hardwood Forest Soil | EVPPIPQLPVQVSNGKQAAAVAVLPDAASGHNGEDAHPDA
Ga0307473_102156252 | 3300031820 | Hardwood Forest Soil | EIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDADSGA
Ga0318520_102869901 | 3300031897 | Soil | APLEIPEVPPIPELPGQASNGKLAAAAVPQPDAAARHNHDAADSGA
Ga0318520_106996071 | 3300031897 | Soil | PQLPIQVSNGKLAAAAVAQPDAVARHNDDDADSGAQAPPA
Ga0306921_112765392 | 3300031912 | Soil | PLEVPEVPPIPQLPVQVSNGKPTAAAVPQPDAAARHSDGDAHSGG
Ga0310916_116785271 | 3300031942 | Soil | AEAEAAITGGAPLEVPEVPPIPQLPIQVSNGKPAAAAVPQPGAAARHSDGDAHSGG
Ga0307470_116468511 | 3300032174 | Hardwood Forest Soil | NEDVAKEEAEIESGAPLEIPQVPPIPTLPTQVGNGKLAAAAVARHKDDDADSGA
Ga0307471_1043134412 | 3300032180 | Hardwood Forest Soil | PLEIPEVPPIPQLPVQVSNGKLTAAVAVLQPDAVAQHNGDDPHSDV
Ga0315742_112947133 | 3300032756 | Forest Soil | QSGAPVEIPEVPLIPQLPLQATNGKLAAAAAGPAPDAVARPNDDAADSGA
Ga0315742_119914511 | 3300032756 | Forest Soil | TGAAALEIPEVPPIPQLLVQVGNGKLAAVAAAQPDALARSDDGDADSGAQTPLA
Ga0315742_122726731 | 3300032756 | Forest Soil | AEIDNEAPLEIPEVPAIPQLPIQASNGKLAAATVGPVPDATAQHNDDAAEPGA
Ga0335078_117570431 | 3300032805 | Soil | GAAITSGAPLEVPEVPPIPQLPIQASNGKLAAATAGLQPDAAARHSDDDAHPGA
Ga0335083_104728753 | 3300032954 | Soil | ESGAPLQIPQVPPIPTLPAQVGNGELAAATVARHNDDADPGA
Ga0335076_100666457 | 3300032955 | Soil | TEEAEIESGAPLEIPEVPPIPALPTPAGNGKLAAAAAAVPQPDTAAQPSDDTADPGA
Ga0335084_104819621 | 3300033004 | Soil | AEAEIMGGAPPEISEIPAIPQLPRQISNGKLAAAAVPQPDAMARHSDDDAHSGA
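
For downstream tools it is often convenient to hold family members as FASTA records. The sketch below converts three rows of the table into FASTA text. It rests on assumptions not stated on the page: the protein ID and the 10-digit taxon ID are split where the taxon IDs listed under Associated Samples begin, the trailing `*` is treated as the conventional stop-codon marker and stripped, and the header layout (`taxon=`, `habitat=`, `length=`) as well as the function name `to_fasta` are hypothetical.

```python
# Three rows from the table: (protein ID, taxon ID, habitat, sequence).
ROWS = [
    ("F62_06626240", "2170459010", "Grass Soil",
     "GAPLEIPEVSQIPQLPAQVSNGKLAAAVAVLEPDAVARRNGGDTHPGT"),
    ("Ga0066683_105312291", "3300005172", "Soil",
     "PLEIPQVPPIPTLPTQVGNGKLAAAAVARHNDDDAAPGA*"),
    ("Ga0116216_102869602", "3300009698", "Peatlands Soil",
     "APLGIPEVPPIPQLPVQVSNGKLAAAVAVLQPDAAARHNGDDADSGA*"),
]

def to_fasta(rows):
    """Render rows as FASTA text, dropping a trailing '*' (assumed stop marker)."""
    lines = []
    for protein_id, taxon_id, habitat, sequence in rows:
        residues = sequence.rstrip("*")
        lines.append(
            f">{protein_id} taxon={taxon_id} habitat={habitat} length={len(residues)}"
        )
        lines.append(residues)
    return "\n".join(lines) + "\n"

fasta = to_fasta(ROWS)
print(fasta, end="")
```

Once the `*` is removed, the third record is identical to the family's representative sequence from the Overview.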




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"