NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F103796



Overview

Basic Information
Family ID F103796
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 55 residues
Representative Sequence AGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP
Number of Associated Samples 93
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.52


Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(10.891 % of family members)
Environment Ontology (ENVO) Unclassified
(33.663 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(39.604 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 26.19%, β-sheet: 0.00%, Coil/Unstructured: 73.81%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.52

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
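
The confidence rule stated above can be written out as a small Python sketch. The 0.7 pTM cutoff and the four pLDDT bins are taken from this page; the function names and the handling of non-integer pLDDT values at bin edges are assumptions made for illustration only.

```python
# Cutoff quoted on this page: models with pTM < 0.7 are flagged as low confidence.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7


def is_low_confidence_model(ptm: float) -> bool:
    """Return True when the pTM score falls below the page's 0.7 cutoff."""
    return ptm < PTM_LOW_CONFIDENCE_CUTOFF


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four bins listed on the page.

    Assumption: fractional values are assigned to the bin whose upper
    edge they do not exceed (e.g. 50.5 falls in "51-70").
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


if __name__ == "__main__":
    # pTM-score reported for family F103796.
    print(is_low_confidence_model(0.52))
    print(plddt_bin(65))
```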



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF00072 | Response_reg | 19.80
PF00664 | ABC_membrane | 15.84
PF13633 | Obsolete Pfam Family | 8.91
PF12697 | Abhydrolase_6 | 7.92
PF04199 | Cyclase | 3.96
PF00296 | Bac_luciferase | 2.97
PF13544 | Obsolete Pfam Family | 2.97
PF08334 | T2SSG | 2.97
PF07963 | N_methyl | 1.98
PF02515 | CoA_transf_3 | 1.98
PF12146 | Hydrolase_4 | 1.98
PF01145 | Band_7 | 1.98
PF01522 | Polysacc_deac_1 | 0.99
PF00005 | ABC_tran | 0.99
PF00656 | Peptidase_C14 | 0.99
PF04794 | YdjC | 0.99
PF00004 | AAA | 0.99
PF02779 | Transket_pyr | 0.99
PF00529 | CusB_dom_1 | 0.99
PF00441 | Acyl-CoA_dh_1 | 0.99
PF00561 | Abhydrolase_1 | 0.99
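
The frequencies above are expressed relative to the 101 family scaffolds, so each one can be mapped back to an approximate scaffold count. A minimal Python sketch, assuming each percentage is simply count / 101 rounded to two decimals (the page does not state this explicitly, and `frequency_to_count` is a hypothetical helper):

```python
def frequency_to_count(percent: float, total_scaffolds: int = 101) -> int:
    """Recover the scaffold count behind a '% Frequency' value.

    Assumption: percent == 100 * count / total_scaffolds, rounded to 2 decimals.
    """
    return round(percent / 100 * total_scaffolds)


if __name__ == "__main__":
    # A few rows taken from the Pfam table on this page.
    for pfam_id, percent in [("PF00072", 19.80), ("PF00664", 15.84), ("PF00561", 0.99)]:
        print(pfam_id, frequency_to_count(percent))
```

Under that assumption, the most frequent neighbor (Response_reg, 19.80 %) corresponds to 20 of the 101 scaffolds, and every 0.99 % entry corresponds to a single scaffold.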

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG1878 | Kynurenine formamidase | Amino acid transport and metabolism [E] | 3.96
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 2.97
COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 1.98
COG0726 | Peptidoglycan/xylan/chitin deacetylase, PgdA/NodB/CDA1 family | Cell wall/membrane/envelope biogenesis [M] | 0.99
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.99
COG3394 | Chitooligosaccharide deacetylase ChbG, YdjC/CelG family | Carbohydrate transport and metabolism [G] | 0.99
COG4249 | Uncharacterized conserved protein, contains caspase domain | General function prediction only [R] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.89%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 7.92%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 6.93%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.93%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 5.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.95%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 3.96%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.97%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.97%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.97%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.98%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.98%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.98%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.98%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.98%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.99%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.99%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.99%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.99%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.99%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.99%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.99%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.99%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.99%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.99%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.99%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.99%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.99%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.99%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.99%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.99%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type | IMG/M Link
2189573000Grass soil microbial communities from Rothamsted Park, UK - July 2010 direct MP BIO 1O1 lysis 0-21cm (T0 for microcosms)EnvironmentalOpen in IMG/M
3300001545Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1EnvironmentalOpen in IMG/M
3300003319Sugarcane bulk soil Sample L2EnvironmentalOpen in IMG/M
3300004020Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005353Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaGHost-AssociatedOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005457Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaGHost-AssociatedOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005547Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaGEnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005840Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2Host-AssociatedOpen in IMG/M
3300005874Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_404EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009821Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300014882Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231B'_16_10DEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300025560Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025914Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026047Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D1_rd (SPAdes)EnvironmentalOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027383Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027681Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027995 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_1_MGEnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028715Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203EnvironmentalOpen in IMG/M
3300028716Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198EnvironmentalOpen in IMG/M
3300028787Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028796Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141EnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031765Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032008Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f18EnvironmentalOpen in IMG/M
3300032143Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G13_0EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032211Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D1EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M
3300034820Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_2Host-AssociatedOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
N55_003207202189573000Grass SoilMVVYEFDSEESLRDFAASDTLKAMTEDYEARFGGAGDRARFTYRQVFP
JGI12630J15595_1009891623300001545Forest SoilYVPVPVDVGAHPGSEPWQYMVVYEFDSEEALRAFAASDTLKAMTQDYEARFGGAGDRVRFTYRQVFP*
soilL2_1022747113300003319Sugarcane Root And Bulk SoilYMVCYEFDSEASLQAFVASDTLQAMTRDYDSRFRGDRARFAYRQIHP*
Ga0055440_1017300313300004020Natural And Restored WetlandsESPHAGSEPWQYMVCYEFDSEESLQAFVDRTLRAMTTDYNARFGGAGDRARLAYRQIYP*
Ga0062589_10079911223300004156SoilGSEPWQYMVCYEFDSEASLEAFVRSDTLRAMTGDYNARFGGAGDRTRLAYRQIYP*
Ga0070690_10072114413300005330Switchgrass RhizosphereMVCYEFDSEASLQAFVQSDTLRAMTKDYDTRFANTSTRARFAYRQIYP*
Ga0070690_10073789623300005330Switchgrass RhizosphereEPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDARFRGERARFAYQQIFP*
Ga0068869_10001723063300005334Miscanthus RhizosphereEFDSEASLQAFVRSETLQDMTRDYNARFGGAGDRARLAYRQIYP*
Ga0070669_10025686023300005353Switchgrass RhizosphereSEPWQYMVCYEFDSEASLQAFVRSETLQDMTRDYNARFGGAGDRARLAYRQIYP*
Ga0070694_10002395113300005444Corn, Switchgrass And Miscanthus RhizosphereHAGSEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFGGSGDRARLAYRQIYP*
Ga0070708_10101575433300005445Corn, Switchgrass And Miscanthus RhizosphereTPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP*
Ga0070662_10099435713300005457Corn RhizosphereEPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGERARFAYQQIFP*
Ga0068867_10017049113300005459Miscanthus RhizosphereVCYEFDSEASLQAFVHSDTLRAMTKDYDTRFANTSTRARFAYRQIYP*
Ga0068867_10024458633300005459Miscanthus RhizosphereCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP*
Ga0070699_10035792433300005518Corn, Switchgrass And Miscanthus RhizosphereMRRYAALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNERFGGSGARARLAYRQIYP*
Ga0070693_10036801813300005547Corn, Switchgrass And Miscanthus RhizosphereSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGDRARLAYRQIYP*
Ga0068859_10282559513300005617Switchgrass RhizosphereLETPHAGSEPWQYMVCYEFDSEASLEAFVRSDTLRAMTGDYNARFGGAGDRTRLAYRQIYP*
Ga0066905_10018580133300005713Tropical Forest SoilVPIALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASETLVAMTRDYEARFGGAGHRARFTYRQVFP*
Ga0066905_10051613313300005713Tropical Forest SoilVPIALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASETLAAMTRDYEARFGGAGHRVRFTYRQVFP*
Ga0066903_10059205933300005764Tropical Forest SoilMVVYEFDSEEALRDFAASDTLKAMTQDYEARFGGAGDRVRFTYRQIFP*
Ga0068870_1111033523300005840Miscanthus RhizosphereEPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDARFRGERARFAYQQIFP*
Ga0075288_106499423300005874Rice Paddy SoilYAALELDSAHAGAEPWQYMVCYEFDSEASLQAFVRSDTLRAMTQDYDSRFAGARARFAYRQIFP*
Ga0070716_10012628613300006173Corn, Switchgrass And Miscanthus RhizosphereYMVCYEFDSEESLRAFVDSDTLRAMTRDYDARFGGARARLAYRQIYP*
Ga0079221_1128880423300006804Agricultural SoilYAALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP*
Ga0075436_10116804913300006914Populus RhizosphereGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP*
Ga0105249_1036305533300009553Switchgrass RhizosphereVDVGVHPGSEPWQYMVVYEFDSEESLRDFAASDTLKAMTEDYEARFGGAGDRARFTYRQVFP*
Ga0105249_1174342513300009553Switchgrass RhizosphereGREPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDSRFRGERARFAYQQIFP*
Ga0105064_105942323300009821Groundwater SandWQYMVCYEFDSEESLRAFVNSDTLRAMTKDYNARFGGAGERARLAYRQIYP*
Ga0126382_1099796723300010047Tropical Forest SoilWQYMVCYEFDSEDSMQAFVRSDTLRAMTKDYDSRFRGDRARFAYRQIFP*
Ga0126373_1253814323300010048Tropical Forest SoilALDVGAHPGSEPWQYMVVYEFDSEQALRNFTASETLAAMTRDYEARFGGAGNRARFTYRQVFP*
Ga0126379_1337032013300010366Tropical Forest SoilEPWQYMVVYEFDSEEALRNFTASETLVAMTRDYEARFGGAGHRARFTYRQVFP*
Ga0126381_10340017513300010376Tropical Forest SoilSEPWQYMVVYEFDSEQALRNFTASETLAAMTRDYEARFGGAGHRARFTYRQVFP*
Ga0134127_1180344713300010399Terrestrial SoilTVCYEFDSEASLQAFVQSDTLKAMTRDYDSRFGGDRARFAYRQIYP*
Ga0137374_1084525323300012204Vadose Zone SoilSLDVGVHAGSEPWQYMVVYEFDSEESLREFAASDTLKAMTQDYEARFGGAGDRARLTYRQVFP*
Ga0137381_1165583223300012207Vadose Zone SoilVCYEFDSEESLQAFVRSDMLRAMTRDYNARFGGAGDRARLAYRQIYP*
Ga0137367_1063786813300012353Vadose Zone SoilGSEPWQYMVVYEFDSEESLREFAASETLKAMTQDYEARFGGAGDRARLAYRQVFP*
Ga0137397_1002980813300012685Vadose Zone SoilEPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGDHARFAYRQIYP*
Ga0137394_1014446133300012922Vadose Zone SoilAGSEPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGDRARFAYRQIYP*
Ga0137410_1006335213300012944Vadose Zone SoilEPWQYMVCYEFDSEASLQAFVRSDTLQAMTRDYNARFGGAGDRARLAYRQIYP*
Ga0126375_1058682613300012948Tropical Forest SoilGSEPWQYMVVYEFDSEEALRDFAASDTLKAMTEDYEARFGGAGDRVHFTYRQIFP*
Ga0126375_1063786013300012948Tropical Forest SoilALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASDTLRAMTRDYEARFGDAGHRVRFTYRQVFP*
Ga0126369_1257254323300012971Tropical Forest SoilALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASETLVAMTRDYEARFGGAGHRARFTYRQVFP*
Ga0164306_1082761313300012988SoilALPIETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP*
Ga0180069_103214523300014882SoilMRRYAPIPLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP*
Ga0134089_1008623923300015358Grasslands SoilYEFESEAALHAFVHSDTLKAMTRDYNARFAGAGERARFTYRQIFP*
Ga0187775_1005243713300017939Tropical PeatlandEPWQYMVCYEFDSEESLRAFVASDTLRAMTRDYNARFGGAGERARLAYRQIYP
Ga0184638_109890413300018052Groundwater SedimentMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP
Ga0184632_1046960613300018075Groundwater SedimentSPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP
Ga0190265_1142918823300018422SoilCYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGERARLAYRQIYP
Ga0187893_1020248913300019487Microbial Mat On RocksAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP
Ga0210401_1113598313300020583SoilAIPLDSPHAGSGPWQYMVCYEFDSEESLRAFVVSDTLRAMTKDYDSRFGGGKRARLAYRQIYP
Ga0210382_1056861023300021080Groundwater SedimentGGRLESAHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP
Ga0179596_1013038533300021086Vadose Zone SoilAATRPFPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGARARLAYRQIYP
Ga0210404_1003787843300021088SoilSEPWQYMVCYEFDSEESLRAFVVSDTLRAMTKDYDSRFGGGGERARLAYRQIYP
Ga0224452_103270813300022534Groundwater SedimentEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP
Ga0210108_107451623300025560Natural And Restored WetlandsPWQYMVVYEFDSEEALRAFTASDTLKAMTRDYEARFGGAGDRVRFTYRQVFP
Ga0207671_1098576213300025914Corn RhizosphereIETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP
Ga0207693_1017351913300025915Corn, Switchgrass And Miscanthus RhizosphereWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP
Ga0207646_1075000813300025922Corn, Switchgrass And Miscanthus RhizosphereTPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP
Ga0207681_1031615113300025923Switchgrass RhizosphereYEFDSEAALQAFVRSETLQDMTRDYNARFGGAGDRARLAYRQIYP
Ga0207681_1174465923300025923Switchgrass RhizosphereLPLESAHAGREPWQYMVCYEFDSEASLQAFVRSDTLRAMKKDYDSRFRGERARFAYQQIF
Ga0207644_1052632113300025931Switchgrass RhizosphereAALPIETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP
Ga0207706_1016117733300025933Corn RhizosphereMALDTPHAGSEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFGGSGDRARLAYRQIYP
Ga0207704_1032050833300025938Miscanthus RhizosphereMVCYEFDSEASLQAFVQSDTLRAMTKDYDTRFANTSTRARFAYRQIYP
Ga0208658_102283423300026047Natural And Restored WetlandsESDHAGAEPWQYMVCYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP
Ga0207648_1001426993300026089Miscanthus RhizosphereCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP
Ga0257169_100946423300026469SoilPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP
Ga0209378_103060433300026528SoilMVIYEFESEAALHAFVHSDTLKAMTRDYNTRFAGAGERARFTYRQIFP
Ga0179587_1074384513300026557Vadose Zone SoilQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGERARLAYRQIYP
Ga0209213_110626213300027383Forest SoilHAGSEPWQYMVCYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP
Ga0208991_114881213300027681Forest SoilMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGERARLAYR
Ga0209178_138267813300027725Agricultural SoilPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP
Ga0209177_1002527633300027775Agricultural SoilGGSEPWQYMVCYEFDSEESLRAFVQSDTLRAMTRDYNARFTGDRARFAYRQIYP
Ga0209180_1055593913300027846Vadose Zone SoilCYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGARARLAYRQIYP
Ga0209701_1014087813300027862Vadose Zone SoilYAPISLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP
Ga0209488_1025054933300027903Vadose Zone SoilRRYAALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGARARLAYRQIYP
Ga0207428_1019151413300027907Populus RhizosphereHAGREPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDSRFRGERARFAYQQIFP
(restricted) Ga0233418_1011872013300027995SedimentYAALSLDTPHAGSEPWQYMVCYEFDSEESLEAFIASDTLRAMTRDYNARFGGAGERARLAYRQIYP
Ga0268265_1013182013300028380Switchgrass RhizosphereETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP
Ga0307313_1011947123300028715SoilALPLESAHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP
Ga0307311_1016149613300028716SoilVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP
Ga0307323_1017639823300028787SoilPWQYMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP
Ga0307504_1014980823300028792SoilAHAGSEPWQYMVCYEFDSEASLQAFVGSDTLQAMTRDYNARFGGAGDRARLAYRQIYP
Ga0307287_1012388113300028796SoilYMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP
Ga0307281_1002689913300028803SoilPIPLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP
(restricted) Ga0255310_1016995123300031197Sandy SoilGSEPWQYMVCYEFDSEESLRAFVTSDTLRAMTKDYNARFGGGSDRARLAYRQIHP
Ga0307469_1028244613300031720Hardwood Forest SoilVPVPVDVGVHPGSEPWQYMVVYEFDSEESLRDFAASNTLRAMTEDYEARFGGAGDRARFTYRQVFP
Ga0307468_10036695933300031740Hardwood Forest SoilVGVHPGAEPWQYMVVYEFDSEESLRDFAASDTLRAMTVDYEARFGGAGDRARLTYRQVFP
Ga0307468_10084757223300031740Hardwood Forest SoilEPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGERARFAYRQIYP
Ga0307468_10245173513300031740Hardwood Forest SoilREPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDARFRGERARLAYRQIFP
Ga0318554_1022852213300031765SoilQYMVCYEFDSEASLRAFAASDTLRAMTEDYERRFGGAGERVRLAYRQIYP
Ga0307473_1042130923300031820Hardwood Forest SoilWQYMVCYEFDSEASLQAFVSSDTLRAMTKDYNSRFRGERARFAYRQIYP
Ga0318562_1054632013300032008SoilPHAGAEPWQYMVCYEFDSEASLRAFAASDTLRAMTEDYERRFGGAGERVRLAYRQIYP
Ga0315292_1085093923300032143SedimentVALDVGVHQGSEPWQYMVVYEFDSEEALRAFAASDTLKAMTQDYEARFGGAGDRVRLTYRQVFP
Ga0307472_10153064613300032205Hardwood Forest SoilAALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP
Ga0310896_1070840023300032211SoilALPIETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP
Ga0335085_1037002623300032770SoilSEPWQYMVCYAFDSEAALQAFVRSDTLRAMTKDYDSRFRGERARFAYRQIFP
Ga0214472_1175591013300033407SoilIPLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQVYP
Ga0247830_1133035313300033551SoilMRRYAALPLESAHAGSEPWQYMVCYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP
Ga0326723_0437433_415_5943300034090Peat SoilMRRYVPVPLDVGAHPGSEPWQYMVVYEFDSEEALRAFTASDTLKAMTRDYEARFGGAGDR
Ga0373959_0093183_16_1563300034820Rhizosphere SoilMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP
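
In the rows above, the four fields (protein ID, sample taxon ID, habitat, sequence) run together without separators. A minimal Python sketch for splitting one row follows. It rests on assumptions that are not stated on the page: that the sample taxon ID is always the 10 digits immediately before the habitat name, that habitat names start with an uppercase letter and end with a lowercase one, and that sequences use only uppercase letters with an optional trailing `*`. `parse_row` is a hypothetical helper, not part of any NMPFamsDB tooling.

```python
import re
from typing import Optional

# Assumed row layout: [(restricted) ]<protein ID><10-digit taxon ID><Habitat><SEQUENCE>[*]
ROW_PATTERN = re.compile(
    r"^(?P<restricted>\(restricted\)\s*)?"
    r"(?P<protein_id>.+?)"                   # shortest prefix that lets the rest match
    r"(?P<taxon_id>\d{10})"                  # assumption: taxon IDs are 10 digits
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"    # e.g. "Corn, Switchgrass And Miscanthus Rhizosphere"
    r"(?P<sequence>[A-Z]+\*?)$"              # uppercase residues, optional trailing '*'
)


def parse_row(line: str) -> Optional[dict]:
    """Split one concatenated row into its four fields, or return None if it does not fit."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["restricted"] = fields["restricted"] is not None
    return fields


if __name__ == "__main__":
    # First row of the listing on this page.
    row = "N55_003207202189573000Grass SoilMVVYEFDSEESLRDFAASDTLKAMTEDYEARFGGAGDRARFTYRQVFP"
    print(parse_row(row))
```

Rows that do not fit the assumed layout return `None`, so they can be reviewed by hand rather than silently mis-split.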




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice