NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F103371

Metagenome Family F103371

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103371
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 60 residues
Representative Sequence MSYDDWKTHNPDDDRCEFCGAHPRECRGGWQPSACTGECGKGWRDPDAEYEKMRDEA
Number of Associated Samples 88
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score0.33

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (97.030 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(12.871 % of family members)
Environment Ontology (ENVO) Unclassified
(25.743 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(33.663 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 21.18%    β-sheet: 0.00%    Coil/Unstructured: 78.82%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.33
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF00589Phage_integrase 5.00
PF04851ResIII 4.00
PF01555N6_N4_Mtase 3.00
PF13392HNH_3 3.00
PF12684DUF3799 2.00
PF13560HTH_31 2.00
PF14354Lar_restr_allev 2.00
PF00145DNA_methylase 2.00
PF00805Pentapeptide 2.00
PF04466Terminase_3 1.00
PF01844HNH 1.00
PF09588YqaJ 1.00
PF05345He_PIG 1.00
PF04448DUF551 1.00
PF01381HTH_3 1.00
PF00436SSB 1.00
PF08299Bac_DnaA_C 1.00
PF09374PG_binding_3 1.00
PF13518HTH_28 1.00
PF02592Vut_1 1.00
PF00959Phage_lysozyme 1.00
PF07120DUF1376 1.00
PF00196GerE 1.00
PF13362Toprim_3 1.00
PF14284PcfJ 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG0863DNA modification methylaseReplication, recombination and repair [L] 3.00
COG1041tRNA G10 N-methylase Trm11Translation, ribosomal structure and biogenesis [J] 3.00
COG2189Adenine specific DNA methylase ModReplication, recombination and repair [L] 3.00
COG0270DNA-cytosine methylaseReplication, recombination and repair [L] 2.00
COG1357Uncharacterized conserved protein YjbI, contains pentapeptide repeatsFunction unknown [S] 2.00
COG0593Chromosomal replication initiation ATPase DnaAReplication, recombination and repair [L] 1.00
COG0629Single-stranded DNA-binding proteinReplication, recombination and repair [L] 1.00
COG1738Queuosine precursor transporter YhhQ, DUF165 familyTranslation, ribosomal structure and biogenesis [J] 1.00
COG1783Phage terminase large subunitMobilome: prophages, transposons [X] 1.00
COG2965Primosomal replication protein NReplication, recombination and repair [L] 1.00
COG3756Uncharacterized conserved protein YdaU, DUF1376 familyFunction unknown [S] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A97.03 %
All OrganismsrootAll Organisms2.97 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300024310|Ga0247681_1000024All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae30541Open in IMG/M
3300025900|Ga0207710_10000889All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae15992Open in IMG/M
3300025941|Ga0207711_10000553All Organisms → cellular organisms → Bacteria → Proteobacteria38209Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil12.87%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.94%
AqueousEnvironmental → Aquatic → Marine → Coastal → Unclassified → Aqueous4.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.95%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil4.95%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere4.95%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil3.96%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere3.96%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil2.97%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.97%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere2.97%
Landfill LeachateEngineered → Solid Waste → Landfill → Unclassified → Unclassified → Landfill Leachate2.97%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.98%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil1.98%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.98%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost1.98%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere1.98%
Deep Subsurface AquiferEnvironmental → Terrestrial → Deep Subsurface → Aquifer → Unclassified → Deep Subsurface Aquifer1.98%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.98%
EctomycorrhizaHost-Associated → Plants → Roots → Unclassified → Unclassified → Ectomycorrhiza1.98%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.98%
WastewaterEngineered → Wastewater → Unclassified → Unclassified → Unclassified → Wastewater1.98%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.99%
Freshwater LakeEnvironmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake0.99%
Freshwater LakeEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake0.99%
AquaticEnvironmental → Aquatic → Freshwater → Lotic → Unclassified → Aquatic0.99%
FreshwaterEnvironmental → Aquatic → Freshwater → Drinking Water → Unclassified → Freshwater0.99%
FreshwaterEnvironmental → Aquatic → Freshwater → Creek → Unclassified → Freshwater0.99%
MarineEnvironmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine0.99%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.99%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.99%
Ore Pile And Mine Drainage Contaminated SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Ore Pile And Mine Drainage Contaminated Soil0.99%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.99%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.99%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.99%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.99%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.99%
Sugarcane Root And Bulk SoilHost-Associated → Plants → Rhizome → Unclassified → Unclassified → Sugarcane Root And Bulk Soil0.99%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.99%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere0.99%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.99%
Activated SludgeEngineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge0.99%
Anaerobic Digestor SludgeEngineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge0.99%
Wastewater EffluentEngineered → Wastewater → Nutrient Removal → Unclassified → Unclassified → Wastewater Effluent0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001078Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_O2EnvironmentalOpen in IMG/M
3300001963Marine microbial communities from Nags Head, North Carolina, USA - GS013EnvironmentalOpen in IMG/M
3300002461Freshwater microbial communities from a drinking water treatment plant in Ann Arbor, Michigan, USAEnvironmentalOpen in IMG/M
3300003312Ore pile and mine drainage contaminated soil microbial communities from Mina do Sossego, Brazil - P1 sampleEnvironmentalOpen in IMG/M
3300003320Sugarcane root Sample H2Host-AssociatedOpen in IMG/M
3300003354Arabidopsis root microbial communities from the University of North Carolina, USA - plate scrape MF_Cvi_mMSHost-AssociatedOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005367Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaGHost-AssociatedOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005563Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2Host-AssociatedOpen in IMG/M
3300005614Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2Host-AssociatedOpen in IMG/M
3300005664Freshwater viral communities from Emiquon reservoir, Havana, Illinois, USAEnvironmentalOpen in IMG/M
3300005987Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 9/18/14 B DNAEngineeredOpen in IMG/M
3300006354Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012EnvironmentalOpen in IMG/M
3300006358Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2Host-AssociatedOpen in IMG/M
3300006802Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18EnvironmentalOpen in IMG/M
3300006805Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNAEnvironmentalOpen in IMG/M
3300007351Combined Assembly of Gp0115775, Gp0115815EnvironmentalOpen in IMG/M
3300007352Deep subsurface aquifer microbial community from Lead, South Dakota (DUSEL-D aquifer)EnvironmentalOpen in IMG/M
3300008065Wastewater microbial communities from the domestic sewers in Singapore - Site 3EngineeredOpen in IMG/M
3300008507Wastewater microbial communities from the domestic sewers in Singapore - Site 2EngineeredOpen in IMG/M
3300009101Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaGHost-AssociatedOpen in IMG/M
3300009151Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130820_MF_MetaGEnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010356AD_USDEcaEngineeredOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011435Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2EnvironmentalOpen in IMG/M
3300012001Permafrost microbial communities from Nunavut, Canada - A24_80cm_12MEnvironmentalOpen in IMG/M
3300012008Permafrost microbial communities from Nunavut, Canada - A39_80cm_12MEnvironmentalOpen in IMG/M
3300012892Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014205Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 162 metaGEngineeredOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300018055Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_coexEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300020020Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a1EnvironmentalOpen in IMG/M
3300020034Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1c2EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021440Freshwater microbial communities from McNutts Creek, Athens, Georgia, United States - 3-17 MGEnvironmentalOpen in IMG/M
3300022173Freshwater viral communities from Lake Michigan, USA - Sp13.VD.MM15.D.DEnvironmentalOpen in IMG/M
3300024181Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK34EnvironmentalOpen in IMG/M
3300024310Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK22EnvironmentalOpen in IMG/M
3300025302Arabidopsis root microbial communities from the University of North Carolina, USA - plate scrape MF_Cvi_mMS (SPAdes)Host-AssociatedOpen in IMG/M
3300025896Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNA (SPAdes)EnvironmentalOpen in IMG/M
3300025900Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025914Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025937Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025949Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026078Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027037Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027513Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes)EnvironmentalOpen in IMG/M
3300027548Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027573Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028800Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-21-26 metaGHost-AssociatedOpen in IMG/M
3300028804Activated sludge microbial communities from WWTP in Nijmegen, Gelderland, Netherland - WWTP WeurtEngineeredOpen in IMG/M
3300031507Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 10_EMHost-AssociatedOpen in IMG/M
3300031681Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20EnvironmentalOpen in IMG/M
3300031730Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 19_EMHost-AssociatedOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032828Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M
3300032893Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300034149Sediment microbial communities from East River floodplain, Colorado, United States - 20_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12640J13246_10077623300001078Forest SoilMELPGYDDWKTHNPDDDRCEFCGMHPREWSAGWQPTRCTGECGLSWRDPDFEYEQARDDAQFFGNDIQANDDEY*
GOS2229_100411373300001963MarineSISLYAGESMSSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNSCTGECRQSWRDPDAEYERMRDEE*
AADWTP_1000175913300002461FreshwaterMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPNGCTGKCNTSWRDPDHEYEKMRDEPHF*
P12013IDBA_105449513300003312Ore Pile And Mine Drainage Contaminated SoilGYDDWKTHNPDDDRCEFCGVHPRECRAGWQPDRCTGECGRAWRDPDYEYDQMRDDLTLQ*
rootH2_10110905363300003320Sugarcane Root And Bulk SoilMNYDDWKTHDPDDDRCEFCGVAPWQCRGGWQPDQCTGECQRGWRDPDAEYEKMRDEG*
JGI25160J50197_100029463300003354Arabidopsis RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNKCSGECGTGWRDPDHEYEKMRDEA*
JGI25160J50197_100178033300003354Arabidopsis RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNNCTGECGKGWRDPDHEYEKMRDEA*
Ga0065707_1009404763300005295Switchgrass RhizosphereMSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNACTGECGRGWRDPDAEYEKRRDEQ*
Ga0066388_10789869123300005332Tropical Forest SoilWKTHNPDDDRCEFCGVHPGRQLPGWMPALCTGECGRSWRDPDAEYEAMRDDADF*
Ga0070674_10158897123300005356Miscanthus RhizosphereMTNLPGYDEWKTHNPDDDRCEFCGAHPNESRHGWAPQACVGKCRTSWRDPDDEYDRMRDERDAP*
Ga0070667_10141903623300005367Switchgrass RhizosphereMSNGMSLPGYDDWKTHNPDDDRCEFCGVHPRECRDGWEPLGCTGECGTVWRDPDFEYDQMRENS*
Ga0070700_100000412263300005441Corn, Switchgrass And Miscanthus RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYERMRDDHE*
Ga0070681_1010748353300005458Corn RhizosphereVSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPSQCTGECGRGWRDPDAEYERMKDET*
Ga0070707_10028793843300005468Corn, Switchgrass And Miscanthus RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYEKMRDDHE*
Ga0070707_10099155113300005468Corn, Switchgrass And Miscanthus RhizosphereSMSYDDWKTHNPDDDRCEFCSAGPREYRGGWQPGSCTGECNKSWRDPDAEYEKMRDEA*
Ga0070698_10021053943300005471Corn, Switchgrass And Miscanthus RhizosphereLDGIDERPDEESMSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNACTGECGRGWRDPDAEYEKMRDDHE*
Ga0068855_10221906233300005563Corn RhizosphereMNYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCNGECGKGWRDPDHEYEKMRDEA*
Ga0068856_10096715823300005614Corn RhizosphereMSYDDWKTHNPDDDRCEFCGVHPRDCRGGWQPNACTGECGKGWRDPDYEYEKMRDEA*
Ga0073685_101666143300005664AquaticMATETLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDLCTGECGRSWRDPDYEYDKMRDERDHYI*
Ga0075158_1026087223300005987Wastewater EffluentMATETLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDLCTGECGLSWRDPDYEYDKMRDERDHYI*
Ga0075021_1073919623300006354WatershedsMMTLPGYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGECKTSWRDPDFEYEKMRDEDNG*
Ga0068871_10092454523300006358Miscanthus RhizosphereMSYDDWKTHNPDDDRCEFCGAHPRECRGGWQPSACTGECGKGWRDPDAEYEKMRDEA
Ga0070749_1019439253300006802AqueousVNLPGYDDWKTHNPDDDRCEYCGAYPWQCRGGWEPDCCTGECGRKWRDPDAERDARMDR*
Ga0075464_10007330133300006805AqueousMSYDDWKTHNPDDDRCEFCGVAPWECRGGWQPDRCTGECNRGWRDPDAEYEKMRDEA*
Ga0075464_1051163433300006805AqueousMSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPSQCTGDCGRGWRDPDAEYEAMRDER*
Ga0104751_1009765153300007351Deep Subsurface AquiferVSDMNLPGYDDWKTHNPDDDCGCEFCGASRWECRAGWQPEFCTGECGCSWRDPDYERDKMRDERMER*
Ga0104756_102446553300007352Deep Subsurface AquiferMISHPGYDIWKTSNPDDGRCEFCGAYPRECRGGWQPDCCTGECGCSWRDPDYERDLRNDQ
Ga0110935_103418733300008065WastewaterMSYDDWKTRNPDDDRCEFCGVHPRECRAGWQPNSCTGECGQSWRDPDYEYEKMRDEDR*
Ga0110934_100466533300008507WastewaterMSYDDWKTHNPDDDRCEFCGAAPWEFKGGWQPNRCNGECRQSFRDPDAEYDRMRDEG*
Ga0105247_10001381143300009101Switchgrass RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRISWRDPDAEYERMRDDHE*
Ga0114962_1040734923300009151Freshwater LakeMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGKCRTSWRDPGYEYEKMRDDRD*
Ga0126382_1137930213300010047Tropical Forest SoilMDGLPGYDDWKTHNPDDDCCEFCGVHPRECRDGWQPDRCTGECGRKWRDPDYEYDQMRDEGR*
Ga0116237_1052752543300010356Anaerobic Digestor SludgeMATETLPGYDDWKTHNPDVDRCEFCGVHPRECRGGRQPDLCTGECGRSWRDPDYEYDKMR
Ga0126377_1143995423300010362Tropical Forest SoilMNLPGYDAWKTHNPDDDRCEYCGVHIRETRSGWKPDRCTRECGIGWFDPDRLYEEKRDEPPPPAAPEDYDT*
Ga0134125_1142491913300010371Terrestrial SoilMSLPGYDEWKLASPDDGYCEFCGVHERRCRDGWRPDECTGECRQSWRDPDAEYDRMRDEQ
Ga0134128_1058601943300010373Terrestrial SoilMTYDDWKTHNPDDDRCEFCGVAPWECRGGWQPDSCTGECGKGWR
Ga0134126_1167761923300010396Terrestrial SoilMELPGYDDWKTHNPDDDRCEHCGADPRVYRAGWQPTPCTGECGTVWRDPDAEYEAARDNANDALT*
Ga0134124_10018112153300010397Terrestrial SoilPGYDDWKTHDPDDERCEYCGVHPREIYGGWQPTRCTGQCGLVWKDPDAEYEKMRDEGDQW
Ga0134127_1028669863300010399Terrestrial SoilVSYDDWKTHNPDDDRCEFCGAAPWEYRGGWQPDKCSGECGKGWRDPDQEYEKMRDDHE*
Ga0137426_102143023300011435SoilMDLPGYDAWKTHNPDDDRCEFCGADPRYCKGWQPDACTGECGIGWRDPDAEYDRRRDDPPP*
Ga0120167_104833123300012001PermafrostMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCTGECGKGWRDPDAEYEKMRDEE*
Ga0120174_108089323300012008PermafrostMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCTGECGKGWRDPDAEYEKMRDDA*
Ga0157294_1019722223300012892SoilMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYEK
Ga0137404_1009469063300012929Vadose Zone SoilMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGKCNTSCRDPDAEYERMRDEG*
Ga0137404_1010409663300012929Vadose Zone SoilMSYDDWKTHNPDDDRCEFCGVAPWECRGGYQPNNCTGECGKGWRDPDAEYERMRDEA*
Ga0172380_1008044163300014205Landfill LeachateMDGLPGYDDWKTHNPADDCCEFCGADPRYCRAGWEPDGCTGECGKVWRDPDAERERIRDESY*
Ga0172380_1076682713300014205Landfill LeachateMSYDDWKTHNPDDDRCEFCGVAPWQCRGGWQPDQCTGECCKGWRDPDAEYEKMRDEA*
Ga0172380_1104551823300014205Landfill LeachateMSYDDWKTYNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPD
Ga0132258_1022823693300015371Arabidopsis RhizosphereMSELPGYDEWKTHNPDDDRCEYCGAHPNESRQGWAPENCTGKCKTSWRDPDYEYDRMRDEEDRGYA*
Ga0132258_1214720223300015371Arabidopsis RhizosphereMSDLPGYDDWKTHNPDDDRCEHCGAHPNESRAGWAPQNCTGKCGTSWRDPDYEYDRMRDRRDFGR*
Ga0184616_1015904833300018055Groundwater SedimentMSDLPGYDDWKTHNPDDDRCEFCGANPNASKHGWAPDECTGKCDTSWRDPDYEYDRKRDEEDRGYA
Ga0190271_1002400373300018481SoilMSDLPGYDDWKTHNPDDDRCEFCGANPNASKHGWAPDECTGKCETSWRDPDYEYDRKRDEEDRGYA
Ga0190271_1298491813300018481SoilRKKEGNRTMNLPGYDAWKTHNPDDDRCEFCGANPNASKHGWAPEECTGKCNTSWRDPDFEYDRMRDEEMQERNR
Ga0193738_113976123300020020SoilMSLPGYDDWKTHNPDDDRCEFCGVHPRECRAGWQPDRCTGECGQKWRDPDAEYDQMRDERDAGLE
Ga0193753_1000038923300020034SoilMTYDDWKTYNPDDDRCEFCGAALWESRGGWQPNGCNGECHISWRDPDYEYEKMRDDA
Ga0210407_1130615023300020579SoilMTNYDDWKTHNPDDDRCEHCGVAPWQCRGGWQPDCCTGECGRSWRDPDYEYERRRDDG
Ga0210403_1008295263300020580SoilMSELPASYDQWRTHNPDDDRCEHCGAAPWECRGGWQPNNCTGECGKGFRDPDAEYEKMRDEA
Ga0210403_1076290623300020580SoilMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPSNCTGACNTSWRDPDHEYEKMRDEG
Ga0210395_1041271633300020582SoilMTNYDDWKTHNPDDDHCEHCGVAPWQCRGGWQPDCCTGECGRSWRDPDYEYERRRDDG
Ga0210401_10000628183300020583SoilMSLPAYDDWKTHNPDDDRCEFCGVAPWECRGGWQPSSCTGECKKAWRDPDYEYDKMRDEA
Ga0210380_1011167933300021082Groundwater SedimentMSDLPGYDDWKTHNPDDDRCEFCGANPSVSKHGWAPDECTGKCDTSWRDPDYEYDRKRDEEDGRDA
Ga0210404_1052858213300021088SoilMSYDDWKTHNPDDDRCEFCGAAPWESRGGWMPSGCTGKCNTSWRDPDYEYEKMRDDDNG
Ga0210406_1060898233300021168SoilMSYDDWKTHNPDDDRCEHCGVAPWQCRGGWQPDCCTGECGRSWRDPDYEYKKMREEGP
Ga0210400_1019981333300021170SoilMTYDDWKTHDPDDDRCEFCGAAPWEFEGGWQPDRCNGECRQSFRDPDREYDEMRDEG
Ga0210396_1002747723300021180SoilMSYDDWKTHNPDDDRCEFCGAAPWESKGGWQPSACTGECRTSWRDPDAEYERMRDEETQ
Ga0210394_1010913523300021420SoilMSYDDWKTHNPDDDRCEFCGAHPREFHGGWQPSACTGECRCSFRDPDYEYEKMRDET
Ga0213919_102038613300021440FreshwaterMSYDDWKTHNPADDCCEFCGADPRYCRGGWQPNSCTGECGKGWRDPDYEYEKMRDEA
Ga0181337_102107813300022173Freshwater LakeTERAALMSNYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSSCTGECGKGWRDPDAEYEKMRDDHE
Ga0247693_100453563300024181SoilMDLPGYDDWKTHDRDAERCEYCGVHPTETPRGGWQPHCCTGQCGLIWKDPDAEYEKMRDE
Ga0247681_100002433300024310SoilMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCSGECGKSWRDPDYEYEKMRDEA
Ga0207426_1000609263300025302Arabidopsis RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNNCTGECGKGWRDPDHEYEKMRDEA
Ga0207426_1000609323300025302Arabidopsis RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNKCSGECGTGWRDPDHEYEKMRDEA
Ga0208916_1001822423300025896AqueousMSYDDWKTHNPDDDRCEFCGVAPWECRGGWQPDRCTGECNRGWRDPDAEYEKMRDEA
Ga0208916_1039746013300025896AqueousTRPHDSRCSMATETLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDLCTGECGRSWRDPDYEYDKMRDERDHYI
Ga0207710_10000889273300025900Switchgrass RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRISWRDPDAEYERMRDDHE
Ga0207671_1037930513300025914Corn RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCTGDC
Ga0207660_1107489023300025917Corn RhizosphereSSVSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPSQCTGECGRGWRDPDAEYERMKDET
Ga0207646_1025111133300025922Corn, Switchgrass And Miscanthus RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYEKMRDDHE
Ga0207646_1061763623300025922Corn, Switchgrass And Miscanthus RhizosphereMNYDDWKTHNPDDDRCEFCGAGPREYRGGWQPGSCTGECNKSWRDPDAEYEKMRDEA
Ga0207669_1183200313300025937Miscanthus RhizosphereMTNLPGYDEWKTHNPDDDRCEFCGAHPNESRHGWAPQACVGKCRTSWRDPDDEYDRMRDERDAP
Ga0207711_10000553393300025941Switchgrass RhizosphereMSLPGYDDWKTHNPDDDRCEFCGVHPRECRDGWEPLGCTGECGTVWRDPDFEYDQMRENS
Ga0207689_1096473423300025942Miscanthus RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYERMRDDHE
Ga0207667_1186484123300025949Corn RhizosphereMSSYDDWKTHNPDDDRCEFCGAAPLECRGGWQPDKCNGECGKGWRDPDHEYEKMRDDA
Ga0207702_1173070223300026078Corn RhizosphereMSYDDWKTHNPDDDRCEFCGVHPRDCRGGWQPNACTGECGKGWRDPDYEYEKMRDEA
Ga0207702_1214675623300026078Corn RhizosphereMSYDDWKTHNPADDCCEFCGADPRYCRGGWQPNACTGECNRGWLDPDAEYERMRDEA
Ga0209005_100100753300027037Forest SoilMELPGYDDWKTHNPDDDRCEFCGMHPREWSAGWQPTRCTGECGLSWRDPDFEYEQARDDAQFFGNDIQANDDEY
Ga0208685_109827323300027513SoilVSLPGYDDWKTHNPDDDRCEFCGVDPRGNNGWQPADCSGKCGIIWRDPDYEYERQRDDRA
Ga0209523_102406643300027548Forest SoilMDLPGYDDWKTHNPDDDRCEFCGVHPRECSSGWQPARCTGECGLSWRDPDFEYEQARDDAQFFGNDIQVNDDEY
Ga0208454_110336213300027573SoilPVRQPPRDSDREDLVSLPGYDDWKTHNPDDDRCEFCGVDPRGNNGWQPADCSGKCGIIWRDPDYEYERQRDDRA
Ga0209068_1065735423300027894WatershedsMMTLPGYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGECKTSWRDPDFEYEKMRDEDNG
Ga0265338_1024814823300028800RhizosphereMSYDDWKTHNPDDDRCEFCGAAPWENRGGWMPSGCTGKCNTSWRDPDYEYEKMRDDGND
Ga0268298_1032547633300028804Activated SludgeMSYDDWKTRNPDDDRCEFCGVHPRECRAGWQPNSCTGECGQSWRDPDYEYEKMRDEDR
Ga0307509_10020908253300031507EctomycorrhizaMPGYDAWKTHNPDDDRCEFCGVHPREYRGGWQPNACTGECRKGWRDPDAEYEKMRDEP
Ga0318572_1071170323300031681SoilVSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNACTGECNRGWRDPDAEYERMRDDA
Ga0307516_1064558323300031730EctomycorrhizaMSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDHEYEKMRDDQ
Ga0307471_10052523563300032180Hardwood Forest SoilVIDMDLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDRCTGECGQSWRDPDFEYEQARDDAQFF
Ga0335080_1116559533300032828SoilMSDLPGYDSWKTHNPDDDRCEFCGAHERECRAGWQPDCCTGECRRTWRDPDAEYEKSRDEVSTAMELDE
Ga0335070_1029587433300032829SoilMSRSNGLPGYDAWLTHDPDDDGCEFCGVGKAECRSGWRPEECTGECRRVWRDPDEEYERRREEPRE
Ga0335069_1028111263300032893SoilMSRLNGLPGYDDWLTHNPDDDRCEFCGVGKAERRSSWRPEECTGECRRVWRDPDEEYERRREEPRE
Ga0335084_1112551333300033004SoilMTDLPGYDSWKTHNPDDDRCEFCGAHERECRAGWQPDCCTGECQRIWRDPDYEYEKARDE
Ga0364929_0316921_2_1873300034149SedimentGYDDWKTHNPDDDRCEFCGANPSVSKHGWAPDECTGKCDTSWRDPDYEYDRKRDEEDGRD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.