NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F086733



Overview

Basic Information
Family ID F086733
Family Type Metagenome
Number of Sequences 110
Average Sequence Length 46 residues
Representative Sequence MRSLIRAVYRFRDATLVAEKKDDFGGALQNLYRELERAENLVGPS
Number of Associated Samples 61
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 55.14 %
% of genes near scaffold ends (potentially truncated) 39.09 %
% of genes from short scaffolds (< 2000 bps) 72.73 %
Associated GOLD sequencing projects 49
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46


Most Common Taxonomy
Group Unclassified (50.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(22.727 % of family members)
Environment Ontology (ENVO) Unclassified
(43.636 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(60.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 50.68%, β-sheet: 0.00%, Coil/Unstructured: 49.32%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
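The cutoff in the note above can be expressed as a small check. The following is a minimal sketch, not the database's own code: the function names are hypothetical, the 0.7 cutoff is the one quoted in the note, and the band labels are the four pLDDT ranges listed for the structure viewer. How fractional values between bands (for example 50.5) are handled is an assumption; here they fall into the next band up.

```python
PTM_LOW_CONFIDENCE_CUTOFF = 0.7  # cutoff quoted in the note above


def is_low_confidence_model(ptm: float, cutoff: float = PTM_LOW_CONFIDENCE_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four bands shown on the page.

    Assumption: values strictly between two bands (e.g. 50.5) are assigned
    to the higher band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The pTM-score reported for family F086733 is 0.46.
print(is_low_confidence_model(0.46))
print(plddt_band(65))
```

With the reported pTM-score of 0.46 the check returns True, which matches the "Low Quality Model" label on this family.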



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF00069 | Pkinase | 8.26
PF00589 | Phage_integrase | 3.67
PF00106 | adh_short | 1.83
PF03797 | Autotransporter | 1.83
PF05977 | MFS_3 | 1.83
PF02586 | SRAP | 1.83
PF08241 | Methyltransf_11 | 1.83
PF03413 | PepSY | 1.83
PF13371 | TPR_9 | 0.92
PF00924 | MS_channel | 0.92
PF14534 | DUF4440 | 0.92
PF12681 | Glyoxalase_2 | 0.92
PF04679 | DNA_ligase_A_C | 0.92
PF13480 | Acetyltransf_6 | 0.92
PF13676 | TIR_2 | 0.92
PF11185 | DUF2971 | 0.92
PF04185 | Phosphoesterase | 0.92
PF07040 | DUF1326 | 0.92
PF07366 | SnoaL | 0.92
PF04079 | SMC_ScpB | 0.92
PF02782 | FGGY_C | 0.92
PF02129 | Peptidase_S15 | 0.92
PF00392 | GntR | 0.92
PF00535 | Glycos_transf_2 | 0.92
PF00805 | Pentapeptide | 0.92
PF13578 | Methyltransf_24 | 0.92
PF08281 | Sigma70_r4_2 | 0.92
PF02229 | PC4 | 0.92
PF13392 | HNH_3 | 0.92
PF00201 | UDPGT | 0.92
PF08401 | ArdcN | 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 33.03
COG1819 | UDP:flavonoid glycosyltransferase YjiC, YdhE family | Carbohydrate transport and metabolism [G] | 1.83
COG2135 | ssDNA abasic site-binding protein YedK/HMCES, SRAP family | Replication, recombination and repair [L] | 1.83
COG2814 | Predicted arabinose efflux permease AraJ, MFS family | Carbohydrate transport and metabolism [G] | 1.83
COG0668 | Small-conductance mechanosensitive channel | Cell wall/membrane/envelope biogenesis [M] | 0.92
COG1357 | Uncharacterized conserved protein YjbI, contains pentapeptide repeats | Function unknown [S] | 0.92
COG1386 | Chromosome segregation and condensation protein ScpB | Transcription [K] | 0.92
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 0.92
COG3264 | Small-conductance mechanosensitive channel MscK | Cell wall/membrane/envelope biogenesis [M] | 0.92
COG3511 | Phospholipase C | Cell wall/membrane/envelope biogenesis [M] | 0.92
COG4227 | Antirestriction protein ArdC | Replication, recombination and repair [L] | 0.92
COG5588 | Uncharacterized conserved protein, DUF1326 domain | Function unknown [S] | 0.92
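
The frequencies in the Pfam and COG tables are percentages of the 109 family scaffolds, so they can be turned back into scaffold counts. The sketch below assumes each value is a fraction of 109 rounded to two decimals; the helper name `scaffold_count` is hypothetical and is not part of NMPFamsDB.

```python
FAMILY_SCAFFOLDS = 109  # denominator stated in the table headers


def scaffold_count(percent_frequency: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a '% Frequency' value back to a whole number of scaffolds.

    Assumption: the published percentage is count / total * 100, rounded
    to two decimals, so rounding the product recovers the count.
    """
    return round(percent_frequency / 100 * total)


# A few values copied from the tables above.
examples = {
    "PF00069 Pkinase": 8.26,
    "PF00589 Phage_integrase": 3.67,
    "PF13371 TPR_9": 0.92,
    "COG0515 Serine/threonine protein kinase": 33.03,
}

for name, percent in examples.items():
    print(f"{name}: {scaffold_count(percent)} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under that assumption, 0.92 % corresponds to a single scaffold and 1.83 % to two, so most neighboring domains in these tables were seen only once or twice.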



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 50.00 %
Unclassified | root | N/A | 50.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
2088090014|GPIPI_17234820All Organisms → cellular organisms → Bacteria4377Open in IMG/M
2228664022|INPgaii200_c0914159Not Available759Open in IMG/M
2228664022|INPgaii200_c1177013All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium954Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100273939Not Available575Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101248819Not Available648Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101592834All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1412Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101639831Not Available501Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_104801271Not Available501Open in IMG/M
3300000789|JGI1027J11758_12359169Not Available853Open in IMG/M
3300000789|JGI1027J11758_12948312All Organisms → cellular organisms → Bacteria → Proteobacteria1517Open in IMG/M
3300000789|JGI1027J11758_13075031Not Available1372Open in IMG/M
3300000955|JGI1027J12803_100062849Not Available1634Open in IMG/M
3300000955|JGI1027J12803_100997285Not Available508Open in IMG/M
3300000955|JGI1027J12803_101304272Not Available585Open in IMG/M
3300000955|JGI1027J12803_101384518Not Available588Open in IMG/M
3300000955|JGI1027J12803_101959584Not Available736Open in IMG/M
3300000955|JGI1027J12803_101985014Not Available621Open in IMG/M
3300000955|JGI1027J12803_102173296All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria716Open in IMG/M
3300000955|JGI1027J12803_102741368Not Available522Open in IMG/M
3300000955|JGI1027J12803_103709065Not Available525Open in IMG/M
3300000955|JGI1027J12803_103763699Not Available1040Open in IMG/M
3300000955|JGI1027J12803_103828369Not Available629Open in IMG/M
3300000955|JGI1027J12803_108653348Not Available2423Open in IMG/M
3300001150|JGI12672J13324_104437Not Available533Open in IMG/M
3300001545|JGI12630J15595_10018143All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1496Open in IMG/M
3300001545|JGI12630J15595_10069553All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium693Open in IMG/M
3300001661|JGI12053J15887_10064547All Organisms → cellular organisms → Bacteria2030Open in IMG/M
3300002245|JGIcombinedJ26739_100000610All Organisms → cellular organisms → Bacteria20466Open in IMG/M
3300002245|JGIcombinedJ26739_100151336All Organisms → cellular organisms → Bacteria2192Open in IMG/M
3300002245|JGIcombinedJ26739_100381294All Organisms → cellular organisms → Bacteria1290Open in IMG/M
3300002245|JGIcombinedJ26739_100517268Not Available1071Open in IMG/M
3300002245|JGIcombinedJ26739_100543558All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1039Open in IMG/M
3300002245|JGIcombinedJ26739_101068359All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium693Open in IMG/M
3300002245|JGIcombinedJ26739_101582832Not Available552Open in IMG/M
3300002906|JGI25614J43888_10000328All Organisms → cellular organisms → Bacteria13047Open in IMG/M
3300002906|JGI25614J43888_10054949Not Available1183Open in IMG/M
3300002917|JGI25616J43925_10043246All Organisms → cellular organisms → Bacteria1968Open in IMG/M
3300002917|JGI25616J43925_10138307All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium980Open in IMG/M
3300002917|JGI25616J43925_10154533Not Available915Open in IMG/M
3300005445|Ga0070708_100137361All Organisms → cellular organisms → Bacteria2265Open in IMG/M
3300005445|Ga0070708_100137361All Organisms → cellular organisms → Bacteria2265Open in IMG/M
3300005467|Ga0070706_101224956Not Available689Open in IMG/M
3300005467|Ga0070706_101877505Not Available544Open in IMG/M
3300005468|Ga0070707_100281303All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1617Open in IMG/M
3300005471|Ga0070698_100000876All Organisms → cellular organisms → Bacteria33074Open in IMG/M
3300005518|Ga0070699_100035108All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4333Open in IMG/M
3300005518|Ga0070699_100063531All Organisms → cellular organisms → Bacteria3201Open in IMG/M
3300005518|Ga0070699_102095016Not Available517Open in IMG/M
3300005536|Ga0070697_100318254All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1339Open in IMG/M
3300006028|Ga0070717_10028110All Organisms → cellular organisms → Bacteria4498Open in IMG/M
3300006854|Ga0075425_101500140Not Available761Open in IMG/M
3300010048|Ga0126373_13046281All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium522Open in IMG/M
3300012200|Ga0137382_10763864Not Available694Open in IMG/M
3300012202|Ga0137363_11248409Not Available630Open in IMG/M
3300012202|Ga0137363_11489081Not Available568Open in IMG/M
3300012203|Ga0137399_10049653All Organisms → cellular organisms → Bacteria3037Open in IMG/M
3300012205|Ga0137362_10372309All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1237Open in IMG/M
3300012205|Ga0137362_11061817Not Available689Open in IMG/M
3300012361|Ga0137360_11850481All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium509Open in IMG/M
3300012582|Ga0137358_10021204All Organisms → cellular organisms → Bacteria4187Open in IMG/M
3300012582|Ga0137358_11055952Not Available521Open in IMG/M
3300012683|Ga0137398_10091739All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Chthoniobacter → Chthoniobacter flavus1895Open in IMG/M
3300012918|Ga0137396_10079339Not Available2308Open in IMG/M
3300012918|Ga0137396_10187829Not Available1516Open in IMG/M
3300012918|Ga0137396_11240046Not Available522Open in IMG/M
3300012923|Ga0137359_10175578All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1913Open in IMG/M
3300012923|Ga0137359_10672191Not Available904Open in IMG/M
3300012923|Ga0137359_11128900Not Available670Open in IMG/M
3300012925|Ga0137419_10694694Not Available825Open in IMG/M
3300012927|Ga0137416_11943009Not Available539Open in IMG/M
3300016294|Ga0182041_11680529Not Available587Open in IMG/M
3300016357|Ga0182032_11456484Not Available594Open in IMG/M
3300020199|Ga0179592_10075684Not Available1543Open in IMG/M
3300020199|Ga0179592_10346955Not Available654Open in IMG/M
3300023046|Ga0233356_1006184All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1222Open in IMG/M
3300024288|Ga0179589_10334853All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae686Open in IMG/M
3300024347|Ga0179591_1042600All Organisms → cellular organisms → Bacteria2772Open in IMG/M
3300024347|Ga0179591_1186215All Organisms → cellular organisms → Bacteria2736Open in IMG/M
3300025910|Ga0207684_10068155All Organisms → cellular organisms → Bacteria3024Open in IMG/M
3300025922|Ga0207646_10430687All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1190Open in IMG/M
3300026304|Ga0209240_1066040Not Available1350Open in IMG/M
3300026319|Ga0209647_1001073All Organisms → cellular organisms → Bacteria24147Open in IMG/M
3300026481|Ga0257155_1024104All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium885Open in IMG/M
3300026498|Ga0257156_1007672All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2023Open in IMG/M
3300026498|Ga0257156_1119609All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium548Open in IMG/M
3300026551|Ga0209648_10056835All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Edaphobacter → Edaphobacter lichenicola3330Open in IMG/M
3300026551|Ga0209648_10066778All Organisms → cellular organisms → Bacteria3037Open in IMG/M
3300026557|Ga0179587_10878826Not Available591Open in IMG/M
3300027383|Ga0209213_1101842Not Available531Open in IMG/M
3300027537|Ga0209419_1009823All Organisms → cellular organisms → Bacteria1609Open in IMG/M
3300027583|Ga0209527_1003469All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3183Open in IMG/M
3300027603|Ga0209331_1009644All Organisms → cellular organisms → Bacteria → Acidobacteria2541Open in IMG/M
3300027603|Ga0209331_1045654All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales1111Open in IMG/M
3300027651|Ga0209217_1015088All Organisms → cellular organisms → Bacteria2511Open in IMG/M
3300027684|Ga0209626_1056142All Organisms → cellular organisms → Bacteria989Open in IMG/M
3300028047|Ga0209526_10001888All Organisms → cellular organisms → Bacteria14048Open in IMG/M
3300028047|Ga0209526_10013137Not Available5719Open in IMG/M
3300028047|Ga0209526_10240040All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1243Open in IMG/M
3300028047|Ga0209526_10306211All Organisms → cellular organisms → Bacteria1074Open in IMG/M
3300028047|Ga0209526_10478114Not Available815Open in IMG/M
3300028536|Ga0137415_11318244All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300031753|Ga0307477_10513180Not Available814Open in IMG/M
3300031754|Ga0307475_11023235Not Available649Open in IMG/M
3300031912|Ga0306921_11513928All Organisms → cellular organisms → Bacteria733Open in IMG/M
3300031945|Ga0310913_10770795Not Available679Open in IMG/M
3300031962|Ga0307479_10604063Not Available1079Open in IMG/M
3300031962|Ga0307479_10866144All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Tenacibaculum → unclassified Tenacibaculum → Tenacibaculum sp.877Open in IMG/M




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 22.73%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 21.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 20.91%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 11.82%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 8.18%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.55%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.55%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.64%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.91%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.91%




Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001150Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M1EnvironmentalOpen in IMG/M
3300001545Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1EnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002906Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300023046Soil microbial communities from Shasta-Trinity National Forest, California, United States - GEON-SFM-MSEnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300024347Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026481Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-AEnvironmentalOpen in IMG/M
3300026498Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-AEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027061Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM1H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027383Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027537Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027583Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027603Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027684Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_025359102088090014SoilVIRSLIRVVYRFRDATLVDEKKDDFGSTLQKLCRDLEGAKELVGPGS
INPgaii200_091415912228664022SoilVRQSSAYRFRDAALIAEKKDDFGSAPQNLYRELERAENLVGPS
INPgaii200_117701312228664022SoilGRDAVMRNLIGAVYRFRDATLVAEKKDDFGNALQNLYRELERAENVVGPS
INPhiseqgaiiFebDRAFT_10027393923300000364SoilIRAVYRFRDAALVAEKKDDFGSALQNLCRELEQAEDLVGPP*
INPhiseqgaiiFebDRAFT_10124881923300000364SoilMAIRNLIRAAYRFRDATLVAEKKHDCGSALQNLYRELERAKT*
INPhiseqgaiiFebDRAFT_10159283423300000364SoilVIRSLIRVVYRFRDATLVDEKKDDFGSTLQKLCRDLEGAKELVGPGS*
INPhiseqgaiiFebDRAFT_10163983133300000364SoilRFRDAALVDEKKDDSGSALQNLYRELERAENLLGPS*
INPhiseqgaiiFebDRAFT_10480127113300000364SoilMSNLIRAAYRFRDATLVAEKKEDFANALQNLYHALKHAENMVGPS*
JGI1027J11758_1235916933300000789SoilYRFRDATLVAENNDDFGNALQNLYRELERAEDLLGSCSH*
JGI1027J11758_1294831213300000789SoilMVMRSLVRAVYRFRDATLVADKKDNFGNALQNLYRELEPAEDVVGPAE*
JGI1027J11758_1307503123300000789SoilGRDMVMRSLVRAVYRFRDATLVAEKKDDFGNALQNLYRELERAENLVGPVG*
JGI1027J12803_10006284923300000955SoilMRSLIRAVYRFRDAALVAEKKDDFGSALQNLCRELEQAEDLVGPP*
JGI1027J12803_10099728513300000955SoilMRSLIRAVYRFRDAALVDEKKDDFGNALQNLYRELVRAENLVGPS*
JGI1027J12803_10130427223300000955SoilLRRDVVRSLVRAAYRFRDAALIAEKKNDFGNALQNLYRELERAENLVGPS*
JGI1027J12803_10138451813300000955SoilAVMRSLIRAVYRFRDATLVDEKKDDFGSALQNLYRELERAENLVGPS*
JGI1027J12803_10195958413300000955SoilMMRSLIRAVYRFRDATLVAEKKDDFGNALQNPYRELERAEDLVGPPG*
JGI1027J12803_10198501423300000955SoilMLSLIRAVYRFRDATLVDEKKGDFGNALQNLYRELERAEDLVGPLSRQG*
JGI1027J12803_10217329623300000955SoilMRSLIQAVYRFRDAALVNEKKGDFGSALQNLYRELER
JGI1027J12803_10274136823300000955SoilMRNLIQAVYRFRDATLVDAKKDDFGSALQNLYRELERAESLVG
JGI1027J12803_10370906513300000955SoilMLSLIRAVYRFRDATLVEEKKADFGSALQNLYRELERAEELVGPAS*
JGI1027J12803_10376369923300000955SoilRDAVMRSLVHAIYRIRDATLIAEKKEDFGNALQNLYRQLERAENLVGPVG*
JGI1027J12803_10382836933300000955SoilMRSLIRAVYRFRDATLVVEKKEDFGSALQNLYRELERAEDLVGPG*
JGI1027J12803_10865334873300000955SoilLIGAVYRFRDATLVAEKKDDFGNALQNLYRELERAENVVGPS*
JGI12672J13324_10443713300001150Forest SoilRSLIRAVYRFRDATLVAEKKDDFGNALQNLYRELERAEDRVGPGA*
JGI12630J15595_1001814313300001545Forest SoilMRSLIQAVYRFRDAALVDEKRGDFGGALQNLYRELERAEDLVGPSE*
JGI12630J15595_1006955323300001545Forest SoilAVMRSLIRAVYRFRDATLVAEKKDDFGGALQNLYRELERAENLVGPS*
JGI12053J15887_1006454723300001661Forest SoilMRSLVRAVYRFRDATLVAEKKDDFGSALQNLYWELERAEDLVGPG*
JGIcombinedJ26739_100000610363300002245Forest SoilRDIVMRSLIQAVYRFRDAALVDEKRGDFGGALQNLYRELERAEDLVGPSE*
JGIcombinedJ26739_10015133623300002245Forest SoilMRSLIQAVYRFRDATLVAEKKDDFGNALQNLYRELERAEDLVGPGA*
JGIcombinedJ26739_10038129423300002245Forest SoilMRSLIRAVYRFRDATLVAEKKDDFGGALQNLYRELERAENLVGPS*
JGIcombinedJ26739_10051726813300002245Forest SoilMRSLIRAVYRFRDAALVAEKKDDFGNALQNLYRELERAEDLVGPG*
JGIcombinedJ26739_10054355813300002245Forest SoilAVYRFRDATLVAEKKDDFGGALQNLYRELERAENLVGPS*
JGIcombinedJ26739_10106835913300002245Forest SoilMRSLIQAVYRFRDAALVDEKRGDFGGALQNLYRELERAEDLVGPP*
JGIcombinedJ26739_10158283213300002245Forest SoilMRSLIQAVYRFRDAALVDEKRGDFGSALQNLYRELERAENLVGPS*
JGI25614J43888_10000328173300002906Grasslands SoilMRSLIGAVYRFRDATLVEEKKGDFGSALQNLYRELERAEELVGPS*
JGI25614J43888_1005494933300002906Grasslands SoilMRSLIRAVHRFRDAGLVAQKKDDFGSALQNLYRELERAENLVGPP*
JGI25616J43925_1004324633300002917Grasslands SoilMRSLVRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENLVGPP*
JGI25616J43925_1013830733300002917Grasslands SoilVMRSLIRAVHRFRDAGLVAQKKDDFGSALQNLYRELERAENLVGPP*
JGI25616J43925_1015453333300002917Grasslands SoilSRDMVMRSLVRAVYRFRDATLVTEKKDDFGNALQNLYRELERGEDLIGGSGSR*
Ga0070708_10013736113300005445Corn, Switchgrass And Miscanthus RhizosphereMVMRSLVRAVYRFRDATLVAEKKDDFGNALQHLYRELERADDLVGPPG*
Ga0070708_10013736143300005445Corn, Switchgrass And Miscanthus RhizosphereMVIRSLIRAVCWFRDPELVAEKKDDFDSALQNLYRELKRAEELVGPG*
Ga0070706_10122495623300005467Corn, Switchgrass And Miscanthus RhizosphereIEGRETVFRSLIRAVYRFRDATLVAEKKDDFGNALQNLYRELERAEEVVGPAA*
Ga0070706_10187750523300005467Corn, Switchgrass And Miscanthus RhizosphereMLSLLRVVYRLRDAALVAEKKDRFGSALQNLYGELEQAENLVRPS*
Ga0070707_10028130313300005468Corn, Switchgrass And Miscanthus RhizosphereTIAGRDAVMRNLIGAVYRFRDATLVAEKKDDFGNALQSLYRELERAENVVGPS*
Ga0070698_100000876293300005471Corn, Switchgrass And Miscanthus RhizosphereMRGLIRAVYRFRDAALVAEKKDDFGNALQNLYRELEPAEDLVGSG*
Ga0070699_10003510823300005518Corn, Switchgrass And Miscanthus RhizosphereMRSLIRAVYRFRDAALVAEKKDDFGNALQNLYRELERAEDLVRRSG*
Ga0070699_10006353113300005518Corn, Switchgrass And Miscanthus RhizosphereLIRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENLVGPS*
Ga0070699_1020950162	3300005518	Corn, Switchgrass And Miscanthus Rhizosphere	LIRAVYRFRDATLVAEKKDDFGNALQNLYRELERPEDLVGPAS*
Ga0070697_1003182541	3300005536	Corn, Switchgrass And Miscanthus Rhizosphere	VMRSLIRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENLVGPT*
Ga0070717_100281104	3300006028	Corn, Switchgrass And Miscanthus Rhizosphere	MRSLIRAVYRFRNATLVAEKKDDFGGALQNLYRELERAENVVGPS*
Ga0075425_1015001402	3300006854	Populus Rhizosphere	MVMRSLVRAVYRFRDATLVAEKKHDFGNALQNLYRELERAEDLVGPG*
Ga0126373_130462812	3300010048	Tropical Forest Soil	MRAVYRFRDAILVGDKKGDFTNALQNLYRELSRAEELVGPPGPD*
Ga0137382_107638641	3300012200	Vadose Zone Soil	MVMRSLVRAVYRFRDATSVAEKKEDFGNALQNLYRELERAENLVGP
Ga0137363_112484091	3300012202	Vadose Zone Soil	MQGRDMVMRSLIQAVYRFRDATLVDEKKGDFGSALQNLDREWVRAGDLFRTS*
Ga0137363_114890812	3300012202	Vadose Zone Soil	MRSLVHAMYRIRDATLIAEKKEDFGNALQNLYRQLERPENLVGPVG*
Ga0137399_100496535	3300012203	Vadose Zone Soil	MRSLIRAVYRFRDATLVAEKKDDFGGALQNLYRELERAENLVGH*
Ga0137362_103723091	3300012205	Vadose Zone Soil	VVMRSLIGAVYRFRDATLVEEKKGDFGSALQNLYRELERAEELVGPS*
Ga0137362_110618172	3300012205	Vadose Zone Soil	MQGRDMVMRSLIQAVYRFRDATLVDEKKGDFGSALQNLDRELVRAGDLFRPS*
Ga0137360_118504812	3300012361	Vadose Zone Soil	RRTVESRDMVMRSLIRAVYRFRDATLVAEKKDDFATALQNLYRELERAENLIGPS*
Ga0137358_100212046	3300012582	Vadose Zone Soil	MRGSLRAVYRFRDATLVAEKKDDFGNALENLYRELERAENLVGPAN*
Ga0137358_110559522	3300012582	Vadose Zone Soil	MRSLIRAVYRFRDATLVAEKKDDFGNALQNLYRELERAEDSVGPAA*
Ga0137398_100917393	3300012683	Vadose Zone Soil	ETRVMRNLVRAVYRFRDAALVAEKKDDFGSALQNLYRELERAENVVGPS*
Ga0137396_100793391	3300012918	Vadose Zone Soil	MRSLIRAAYRFRDAALIAEKKEDFGSALQNLYRELERAENTVGRS*
Ga0137396_101878292	3300012918	Vadose Zone Soil	MRNLVRAVYRFRDAALVAEKKDDFGSALQNLYRELERAENLVGPP*
Ga0137396_112400461	3300012918	Vadose Zone Soil	MVMRSLIRAVYRFRDATLVAEKKDDFGNALQNLYRELERAEDLVLPAG*
Ga0137359_101755782	3300012923	Vadose Zone Soil	MRSLVQAVYRFRDATLVAEKKDDFGNALQNLYRELERAESVVGPS*
Ga0137359_106721912	3300012923	Vadose Zone Soil	MRSLIRAVYRFRDATLVAEKKDDFGGALQNLYRELERAENLVGQ*
Ga0137359_111289001	3300012923	Vadose Zone Soil	AIRNLIGAVYRFRDATLVDEKKDDFGNALQNLYRELERAENLVGPS*
Ga0137419_106946943	3300012925	Vadose Zone Soil	VMRSLIRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENLVGPSG*
Ga0137416_119430092	3300012927	Vadose Zone Soil	FRDAALITEKRDDFGNPLQNLYRELERAEDLVRPPR*
Ga0182041_116805292	3300016294	Soil	MRAVYRFREAILVDDKKGDFTNALQNLYRELSRAEELVGTPGPD
Ga0182035_105513022	3300016341	Soil	MRDLIRAVYRYRDATLVDGKKDDFGAAQQNLYPELAQAEESVGPVD
Ga0182032_114564841	3300016357	Soil	MRAVYRFREAILVDDKKGDFTNTLQNLYRELSRAEELVGPPGPD
Ga0182040_102337213	3300016387	Soil	MRDLIRAVYRYRDATLVDGKKDDFGAAQQNLYRELAHAEESVGPVD
Ga0179592_100756841	3300020199	Vadose Zone Soil	MRSLIRAVYRFRDATLVAEKKDDFGGALQNLYRELERAENLVGH
Ga0179592_103469552	3300020199	Vadose Zone Soil	MRGSLRAVYRFRDATLVAEKKDDFGNALENLYRELERAENLVGPAN
Ga0233356_10061841	3300023046	Soil	SGRDALMRSLVQAVYRFRDATLVAEKKDDFGNALQNLYRELERAESVVGPS
Ga0179589_103348532	3300024288	Vadose Zone Soil	GLIKAVYRFRDATLVAEKKDDFGTALQNLYRELERAEDLVGPPEK
Ga0179591_10426005	3300024347	Vadose Zone Soil	MVMRSLVRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENLVGPP
Ga0179591_11862154	3300024347	Vadose Zone Soil	VMRSLIRAVYRFRDATLVAERKDDFGSALQNLYPELERAENLVGPP
Ga0207684_100681551	3300025910	Corn, Switchgrass And Miscanthus Rhizosphere	MVMRSLVRAVYRFRDATLVAEKKDDFGNALQHLYRELERADDLVGPPG
Ga0207646_104306871	3300025922	Corn, Switchgrass And Miscanthus Rhizosphere	TVMRSLIRAVYRFRDAALVAEKKDDFGNALQNLYRELERAENVVGPS
Ga0209240_10660402	3300026304	Grasslands Soil	MVMRSLVRAVYRFRDATLVTEKKDDFGNALQNLYRELERGEDLIGGSGSR
Ga0209647_100107319	3300026319	Grasslands Soil	MRSLIGAVYRFRDATLVEEKKGDFGSALQNLYRELERAEELVGPS
Ga0257155_10241042	3300026481	Soil	MRSHVQAVYRFRDATLVAEKKDDFGNALQNLYRELERAESVVGPS
Ga0257156_10076723	3300026498	Soil	RAVYRFRDAALITEKRDDFGNPLQNLYRELERAEDLVRPPR
Ga0257156_11196092	3300026498	Soil	GRDVVMRSLIRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENLIGPS
Ga0209648_100568355	3300026551	Grasslands Soil	MRSLIKAVYRFRDGTLVAEKKDDFGNALQNLYRELERAEDLVGPAS
Ga0209648_100667781	3300026551	Grasslands Soil	MRSLVPAVYRFRDAALVAEKKDDFGNALQNLYRELERAEDLVGRAG
Ga0179587_108788261	3300026557	Vadose Zone Soil	MRSLIRAVYRFRDATLVDEKKDDFGSALQNLYRELERAENLVGPS
Ga0209729_10546411	3300027061	Forest Soil	RTIAGREAVMRSLIQAVYRFRDATLVAEKKDDFGNALQNLYRELERAESVVGPS
Ga0209213_11018422	3300027383	Forest Soil	MQGHDMVMRSLIQAVYRFRDATLVDEKKGDFGSALQNLDRELVRAGTYLD
Ga0209419_10098232	3300027537	Forest Soil	MRSLIQAVYRFRDATLVAEKKDDFGNALQNLYRELERAEDLVGPGA
Ga0209527_10034695	3300027583	Forest Soil	MRSLIQAVYRFRDAALVDEKRGDFGGALQNLYRELERAEDLVGPSE
Ga0209331_10096442	3300027603	Forest Soil	MVMRSLVRAVYRFRDATLIAEKKDDFANALQNLYRELERAEDLVGPG
Ga0209331_10456543	3300027603	Forest Soil	VESRDMVMRSLIRAVYRFRDATLVAEKKDDFGNALQNLYRELERAEDRVGPGA
Ga0209217_10150881	3300027651	Forest Soil	AVYRFRDATLAAEKKDDFGNALQNLYWELERAENLIGPAS
Ga0209626_10561422	3300027684	Forest Soil	MVMRSLVRAVYRFRDATLIAEKKDDFANALQNLYRELERAEDLVGPAS
Ga0209526_1000188822	3300028047	Forest Soil	VMRSLIQAVYRFRDAALVDEKRGDFGGALQNLYRELERAEDLVGPSE
Ga0209526_100131374	3300028047	Forest Soil	MQGRDMVMRSLIQAVYRFRDATLVDEKKGHFGSALQNLDRELVRAGDLFRPS
Ga0209526_102400401	3300028047	Forest Soil	WNRRRAVYRFRDAALIVEKKDDFGGALQNLYRELERAENLVGPS
Ga0209526_103062111	3300028047	Forest Soil	SLIRAVYRFRDATLVAEKKDDFGNALQNLYRELERAEDSVGPAA
Ga0209526_104781143	3300028047	Forest Soil	AVYRFRDATLVAEKKDDFGSALQNLYWELERAEDLVGPG
Ga0137415_113182442	3300028536	Vadose Zone Soil	RAVYRFQDATLIVEKKDDFGGAHQNLYRELERAEDLAGPCGDF
Ga0307477_105131801	3300031753	Hardwood Forest Soil	MRSLIRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENVVG
Ga0307475_110232351	3300031754	Hardwood Forest Soil	MVMRSLFRAVYGFRYATRVAEKKDDFGNALQNLCRALEREEDLLGPG
Ga0306921_115139282	3300031912	Soil	MRAVYRFRDAILVDDKKGDFTNALQNLYRELSRAEELVGTPGPD
Ga0310913_107707951	3300031945	Soil	MRAVYRFREAILVDDKKGDFTNALQNLYRELSRAEELVGPPGPD
Ga0307479_106040631	3300031962	Hardwood Forest Soil	KRTIAGRDAVMRSLIRAVYLFRDATLVAEKKDDFGSALQNLYRELERVEDLVGHS
Ga0307479_108661441	3300031962	Hardwood Forest Soil	RDTVMRSLIRAVYRFRDATLVAEKKDDFGSALQNLYRELERAENVVGPS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice