NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F100506

Overview

Basic Information
Family ID F100506
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 63 residues
Representative Sequence MRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGPTNMTSP
Number of Associated Samples 68
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 72.94 %
% of genes near scaffold ends (potentially truncated) 38.24 %
% of genes from short scaffolds (< 2000 bps) 80.39 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.31

Most Common Taxonomy
Group Unclassified (73.529 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(36.274 % of family members)
Environment Ontology (ENVO) Unclassified
(23.529 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(35.294 % of family members)
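
The percentages in this overview are fractions of the 102 family members, so they can be converted back into approximate member counts. The following is a minimal sketch in Python, assuming each displayed percentage was computed as count / 102 before being shortened for display; the helper name `pct_to_count` is hypothetical and not part of NMPFamsDB.

```python
FAMILY_SIZE = 102  # "Number of Sequences" from Basic Information


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole-number count behind a displayed percentage."""
    return round(pct / 100 * total)


# Percentages copied from the overview above.
overview = {
    "Unclassified taxonomy": 73.529,
    "GOLD: Soil": 36.274,
    "ENVO: Unclassified": 23.529,
    "EMPO: Subsurface (non-saline)": 35.294,
}

for label, pct in overview.items():
    print(f"{label}: about {pct_to_count(pct)} of {FAMILY_SIZE} members")
```

Rounding to the nearest integer absorbs the small error introduced when the site shortens the percentage for display.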



Structure & Topology

Predicted Secondary Structure and Topology

Classification Fibrous
Signal Peptide Yes
Secondary Structure distribution α-helix: 31.63%, β-sheet: 0.00%, Coil/Unstructured: 68.37%

Predicted 3D Structure

pTM-score: 0.31

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
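
The rule stated in this note can be written as a one-line check. This is a sketch only: the 0.7 cutoff is the one quoted above, while the function name `is_low_confidence` is hypothetical.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """True when a model's pTM-score falls strictly below the cutoff."""
    return ptm < threshold


# This family's model has a pTM-score of 0.31.
print(is_low_confidence(0.31))
```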


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF00072 | Response_reg | 11.76
PF01243 | Putative_PNPOx | 1.96
PF01904 | DUF72 | 0.98
PF09557 | DUF2382 | 0.98
PF02735 | Ku | 0.98
PF05559 | DUF763 | 0.98
PF00127 | Copper-bind | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG1273 | Non-homologous end joining protein Ku, dsDNA break repair | Replication, recombination and repair [L] | 0.98
COG1415 | Uncharacterized conserved protein, DUF763 domain | Function unknown [S] | 0.98
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 0.98


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 73.53 %
All Organisms | root | All Organisms | 26.47 %

Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
3300004157|Ga0062590_102474176All Organisms → cellular organisms → Archaea549Open in IMG/M
3300004463|Ga0063356_100955062All Organisms → cellular organisms → Archaea1218Open in IMG/M
3300004463|Ga0063356_102376638All Organisms → cellular organisms → Archaea810Open in IMG/M
3300004479|Ga0062595_100945472Not Available733Open in IMG/M
3300004479|Ga0062595_101329048Not Available649Open in IMG/M
3300004643|Ga0062591_100439432Not Available1095Open in IMG/M
3300004643|Ga0062591_100994676Not Available797Open in IMG/M
3300005093|Ga0062594_100531891All Organisms → cellular organisms → Archaea1006Open in IMG/M
3300005093|Ga0062594_102500991Not Available567Open in IMG/M
3300005093|Ga0062594_102537373Not Available564Open in IMG/M
3300005289|Ga0065704_10245429All Organisms → cellular organisms → Archaea1003Open in IMG/M
3300005290|Ga0065712_10054540All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Paraburkholderia → Paraburkholderia heleia677Open in IMG/M
3300005293|Ga0065715_10895059Not Available561Open in IMG/M
3300005356|Ga0070674_101067051All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Pseudanabaenales → Prochlorotrichaceae → Prochlorothrix → Prochlorothrix hollandica712Open in IMG/M
3300005456|Ga0070678_101663312Not Available600Open in IMG/M
3300005518|Ga0070699_102185904Not Available505Open in IMG/M
3300005545|Ga0070695_100643416Not Available836Open in IMG/M
3300005546|Ga0070696_100983442Not Available704Open in IMG/M
3300005549|Ga0070704_102054323All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Paraburkholderia → Paraburkholderia heleia531Open in IMG/M
3300005616|Ga0068852_102373129Not Available551Open in IMG/M
3300005843|Ga0068860_102019440Not Available598Open in IMG/M
3300005886|Ga0075286_1040178Not Available637Open in IMG/M
3300006904|Ga0075424_102610656Not Available528Open in IMG/M
3300010371|Ga0134125_10872957Not Available988Open in IMG/M
3300010371|Ga0134125_13091467Not Available504Open in IMG/M
3300010375|Ga0105239_10490701Not Available1396Open in IMG/M
3300010396|Ga0134126_12263853Not Available592Open in IMG/M
3300010399|Ga0134127_10424488Not Available1322Open in IMG/M
3300010400|Ga0134122_12891236Not Available534Open in IMG/M
3300011003|Ga0138514_100013239All Organisms → cellular organisms → Bacteria1385Open in IMG/M
3300011119|Ga0105246_10094523All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium2162Open in IMG/M
3300011119|Ga0105246_12482422Not Available510Open in IMG/M
3300012668|Ga0157216_10394549Not Available614Open in IMG/M
3300012882|Ga0157304_1096821Not Available532Open in IMG/M
3300012913|Ga0157298_10364793All Organisms → cellular organisms → Archaea537Open in IMG/M
3300012938|Ga0162651_100067362Not Available585Open in IMG/M
3300012941|Ga0162652_100000855All Organisms → cellular organisms → Archaea2362Open in IMG/M
3300012951|Ga0164300_10807917Not Available582Open in IMG/M
3300012955|Ga0164298_10670443Not Available723Open in IMG/M
3300012957|Ga0164303_10096839All Organisms → cellular organisms → Archaea1450Open in IMG/M
3300012958|Ga0164299_10600670Not Available752Open in IMG/M
3300012958|Ga0164299_11245655Not Available566Open in IMG/M
3300012960|Ga0164301_10072608All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae → Candidatus Nitrosocosmicus → Candidatus Nitrosocosmicus oleophilus1871Open in IMG/M
3300012960|Ga0164301_10848338Not Available704Open in IMG/M
3300012961|Ga0164302_10624813All Organisms → cellular organisms → Archaea785Open in IMG/M
3300012961|Ga0164302_11223060Not Available602Open in IMG/M
3300012985|Ga0164308_10440205Not Available1076Open in IMG/M
3300012985|Ga0164308_12129092Not Available523Open in IMG/M
3300015077|Ga0173483_10194775Not Available929Open in IMG/M
3300015372|Ga0132256_100031574All Organisms → cellular organisms → Archaea4797Open in IMG/M
3300015373|Ga0132257_102997634Not Available615Open in IMG/M
3300018027|Ga0184605_10104141All Organisms → cellular organisms → Archaea1255Open in IMG/M
3300018028|Ga0184608_10192065Not Available893Open in IMG/M
3300018051|Ga0184620_10013378All Organisms → cellular organisms → Archaea1911Open in IMG/M
3300018051|Ga0184620_10192229Not Available675Open in IMG/M
3300018051|Ga0184620_10203186Not Available659Open in IMG/M
3300018056|Ga0184623_10225166Not Available860Open in IMG/M
3300018056|Ga0184623_10239563All Organisms → cellular organisms → Archaea829Open in IMG/M
3300018061|Ga0184619_10090742All Organisms → cellular organisms → Archaea1366Open in IMG/M
3300018061|Ga0184619_10111775Not Available1233Open in IMG/M
3300018061|Ga0184619_10187808All Organisms → cellular organisms → Archaea948Open in IMG/M
3300018061|Ga0184619_10516240Not Available526Open in IMG/M
3300018061|Ga0184619_10557256Not Available501Open in IMG/M
3300018076|Ga0184609_10359275Not Available679Open in IMG/M
3300018920|Ga0190273_12320108Not Available508Open in IMG/M
3300019867|Ga0193704_1042940Not Available897Open in IMG/M
3300019868|Ga0193720_1009890Not Available1321Open in IMG/M
3300019868|Ga0193720_1018685Not Available979Open in IMG/M
3300019873|Ga0193700_1008784All Organisms → cellular organisms → Archaea1601Open in IMG/M
3300019875|Ga0193701_1056451Not Available788Open in IMG/M
3300019879|Ga0193723_1025041All Organisms → cellular organisms → Archaea1815Open in IMG/M
3300020006|Ga0193735_1088573Not Available877Open in IMG/M
3300020006|Ga0193735_1121778Not Available708Open in IMG/M
3300020006|Ga0193735_1181640Not Available518Open in IMG/M
3300020012|Ga0193732_1084666All Organisms → cellular organisms → Archaea511Open in IMG/M
3300020018|Ga0193721_1033743All Organisms → cellular organisms → Archaea1352Open in IMG/M
3300021078|Ga0210381_10366870Not Available528Open in IMG/M
3300021418|Ga0193695_1016549All Organisms → cellular organisms → Archaea1522Open in IMG/M
3300022694|Ga0222623_10052476All Organisms → cellular organisms → Archaea → TACK group → Crenarchaeota → environmental samples → uncultured crenarchaeote1569Open in IMG/M
3300022694|Ga0222623_10125605Not Available999Open in IMG/M
3300022694|Ga0222623_10148096Not Available914Open in IMG/M
3300022694|Ga0222623_10307394Not Available608Open in IMG/M
3300025901|Ga0207688_11060737Not Available512Open in IMG/M
3300027717|Ga0209998_10109074Not Available691Open in IMG/M
3300031943|Ga0310885_10878511Not Available513Open in IMG/M
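
Each row in the scaffold listing above runs its columns together: a taxon OID and scaffold ID joined by `|`, a taxonomy path, the scaffold length, and the IMG/M link text. The following Python sketch shows one way to split such a row. It assumes that no taxonomy name begins or ends with a digit, which holds for the rows shown here but is not guaranteed in general; `ScaffoldRow` and `parse_scaffold_row` are hypothetical names.

```python
import re
from typing import List, NamedTuple


class ScaffoldRow(NamedTuple):
    taxon_oid: str
    scaffold_id: str
    taxonomy: List[str]
    length: int


# taxon OID | scaffold ID, then taxonomy text, then length, then link text.
# \D forces the scaffold ID to absorb all of its trailing digits.
ROW_RE = re.compile(r"^(\d+)\|(Ga\d+_\d+)(\D.*?)(\d+)Open in IMG/M$")


def parse_scaffold_row(line: str) -> ScaffoldRow:
    """Split one fused scaffold row into its four data columns."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized scaffold row: {line!r}")
    taxon_oid, scaffold_id, taxonomy, length = match.groups()
    ranks = [rank.strip() for rank in taxonomy.split("→")]
    return ScaffoldRow(taxon_oid, scaffold_id, ranks, int(length))


row = parse_scaffold_row(
    "3300004157|Ga0062590_102474176All Organisms → cellular organisms"
    " → Archaea549Open in IMG/M"
)
print(row.scaffold_id, row.taxonomy[-1], row.length)
```

Scaffolds with no assigned lineage come back with the single-element path `["Not Available"]`, so they can be filtered out before any taxonomic tally.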



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 36.27%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 12.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 8.82%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 6.86%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.88%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 4.90%
Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 2.94%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 2.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.96%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.96%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 1.96%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.98%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.98%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.98%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.98%
Corn, Switchgrass And Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere | 0.98%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.98%

Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005290Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Rhizosphere Soil Replicate 1: eDNA_1Host-AssociatedOpen in IMG/M
3300005293Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005456Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaGHost-AssociatedOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005616Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2Host-AssociatedOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005886Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205EnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011003Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300012668Arctic soils microbial communities. Combined Assembly of 23 SPsEnvironmentalOpen in IMG/M
3300012882Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2EnvironmentalOpen in IMG/M
3300012913Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S043-104R-2EnvironmentalOpen in IMG/M
3300012938Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t2i015EnvironmentalOpen in IMG/M
3300012941Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t4i015EnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012985Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MGEnvironmentalOpen in IMG/M
3300015077Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2)EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018920Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 ISEnvironmentalOpen in IMG/M
3300019867Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m1EnvironmentalOpen in IMG/M
3300019868Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s1EnvironmentalOpen in IMG/M
3300019869Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m2EnvironmentalOpen in IMG/M
3300019873Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s1EnvironmentalOpen in IMG/M
3300019875Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s2EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020008Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1m2EnvironmentalOpen in IMG/M
3300020012Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s1EnvironmentalOpen in IMG/M
3300020018Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s2EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025901Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4 (SPAdes)Host-AssociatedOpen in IMG/M
3300027717Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0062590_10247417613300004157SoilMRTQLVLIVPLLAAMIGMLAIVGLNTVDANMTSSEGNYTSGTNATSGNMTSGNATSDNMAGP
Ga0063356_10095506233300004463Arabidopsis Thaliana RhizosphereMRTQLVLMIPLLAAAIGMLAIVGMNSVNAQNMSTPETNMTAGSSMANLTMGGNMTGAGNMTNE
Ga0063356_10237663833300004463Arabidopsis Thaliana RhizosphereLIIPILAAMIGMIAIVGLNSANAQNMSTPEGNMTSGNATSGNATSGNMAGPTNMTSP*
Ga0062595_10094547213300004479SoilMRKQLVLIIPILAAIIGMLAIVGLNSVNAQNMTTPETNYTGGTNATSGNMTSGNATS
Ga0062595_10132904823300004479SoilMTKQLVLIIPILAAMIGMFAIVGLNSVNAQNMSTPETNMSRMSNATMGGNMTSGNATSENMADPLNITTP*
Ga0062591_10043943223300004643SoilMRKQLVLIIPILAAMIGMVTIFRLNSVNAQNMSTPETNMTGMTNMTMDGNMTDGNATSGNMSGPTNMTSP*
Ga0062591_10099467613300004643SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMTTPETNYTGGTNATSGNMTIGNATSDNMAGPTNMTTP*
Ga0062591_10242794323300004643SoilMRTKLVLVISLLAEMMGMFAIVGVTFANAQNMSTPETNMTSGNATSGNMTSGNSS
Ga0062594_10053189113300005093SoilKAMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMTTPETNYTGGTNATSGNMTSGNATSDNMAGPTNMTTP*
Ga0062594_10250099113300005093SoilMRKQLVLIIPILAAMIGMIAIVGVSSVNAQNMSTPETNMTGMTNANMSGNMTGNMTNSSSPMMNSTQ*
Ga0062594_10253737313300005093SoilMRTQSVLIIPILAAMIGMIAIVGLNSANAQNMSTPETNMTGMSNATMSGNMTGS
Ga0065704_1024542913300005289Switchgrass RhizosphereMRKQLALIIPILAAMIGMIAIVGLNSVNAQNMSTPEDNMTSGNATSGNATSGNMAGPTNMTSP*
Ga0065712_1005454013300005290Miscanthus RhizosphereMRKQLVLIIPILAAMIGMLAIVGLNSAXAQNMSTPETNMTGGNATMSGNMTGGNATMSGNMTGPTNMTSS*
Ga0065715_1089505923300005293Miscanthus RhizosphereMRKQLVLIIPILAAMIGMLAIVGLNSANAQNMSTPETNMTGGNATMSGNMTGGNATMSGNMTGPTNMTSS*
Ga0070674_10106705113300005356Miscanthus RhizosphereMRTQLVLIIPILAAMIGMIAIVGLNSANAQNVSTPETNMTGETNATMGGNMTSSDNMTTSSGS
Ga0070708_10186925913300005445Corn, Switchgrass And Miscanthus RhizosphereAIVGATYANAQNMSTPETNMSNMSNATMSGNMTSGNSSYLGPTNMTSP*
Ga0070678_10166331223300005456Miscanthus RhizosphereMRKQLVLIIPLIAAMIGMLVIVGLNSANAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMAGSTNLTSP*
Ga0070699_10218590413300005518Corn, Switchgrass And Miscanthus RhizosphereMRKQLVLIIPILAAMIGMLAIVGLNSANAQNMSTPETNMTGGNATMSGNMTGPTNMTSS*
Ga0070695_10064341623300005545Corn, Switchgrass And Miscanthus RhizosphereMRKQLVLIIPILAAMIGMLAIVGLNSANAQNMSTPETNMTGGNATMSSNMTGPTNMTSS*
Ga0070696_10098344213300005546Corn, Switchgrass And Miscanthus RhizosphereMRKQLVLIIPILAAMIGMLAIVGLNSANAQNMSASETNMTGGTNMTNSTMAGNMTSSGNM
Ga0070696_10175175723300005546Corn, Switchgrass And Miscanthus RhizosphereMRTKLVLVISLLAAMMGMFAIVGATYANAQNMSTPETNMSNMSNATMSGNMTSGNSSYLGPTNMTSP*
Ga0070704_10205432313300005549Corn, Switchgrass And Miscanthus RhizosphereMIGMLAIVGLNSANAQNMSTPETNMTGGNATMSGNMTGGNATMSGNMTGGNATM
Ga0068852_10237312913300005616Corn RhizosphereMRKQLVLIIPILAAIIGMLAIVGLNSVNAQNMTTPETNYTGGTNATSGNMTSGNATSDNMAGPTNLTSP*
Ga0068860_10201944023300005843Switchgrass RhizosphereMRKQLVLIIPLIAAMIGMLAIVGLNSANAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGPTNMTSP*
Ga0075286_104017813300005886Rice Paddy SoilMRKQLVLIIPILAALIGMFAIAGLNAVNAQNMSTPETNMTSGNMTSGNSSYLGPTNLTSSP*
Ga0075424_10261065623300006904Populus RhizosphereMRRQLVFIIPILAAMIGMIAIVGLNSANAQSMSTQETNATSMSNATMGGNMTGSENMT
Ga0134125_1087295723300010371Terrestrial SoilMRKQLVLIIPLIAAMIGMFAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTAPTNMTSP*
Ga0134125_1309146713300010371Terrestrial SoilMRKQLVLIIQILAAMIGMIAIVGLNSVNAQNMSTPEGNMTSGNATSENMTSGNMAGPTNMTSP*
Ga0134128_1184280523300010373Terrestrial SoilPILAAMIGMFAIAGLNSANGQNMSTPEGNMTSGNATSGNMTSGNTSYLGPTNITSP*
Ga0105239_1049070123300010375Corn RhizosphereMKKQLVLIIPLIAAMIGMLAIVGLNSANAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGPTNMTSP*
Ga0134126_1226385313300010396Terrestrial SoilMRKQLALIIPILAAMIGMIAIVGLNSVNAQNMSTPEGNMTSGNATSENMTSGNMAGPTNMTSP*
Ga0134127_1042448823300010399Terrestrial SoilMRKQLVLIIPILAAMIGMIAIVGLNTVDANMTSSGGNYTSGANATSGNMTSGNASDN
Ga0134122_1070028913300010400Terrestrial SoilMRKQLVLIIPILAAMIGMVTIFRLNSVNAQNMSTPETNMTGMTNMTMDGNMTDG
Ga0134122_1289123613300010400Terrestrial SoilMRKQLVLIILILAAIIGMLAIVGLNSVNAQNMTTPETNYTGGTNATSGNRKSGNATSDNM
Ga0138514_10001323923300011003SoilMRKQLVLIIPLLAAMIGMFAIVGLNSVNAQNMSTPETNMTDGNATMGGNMTSGNANMGGDMTAQTNMTSP*
Ga0105246_1009452333300011119Miscanthus RhizosphereMRKQLVLIIPILAAMIGMLAVVGLNSVNAQNMSTPETNMTGGNATMSGNMTGGNA
Ga0105246_1248242213300011119Miscanthus RhizosphereMRTQLVFIIPILAAMIGMIAIVGLYSANAQNVSTPETNMTGGTNATMGGNMTSSDNMTTSSG
Ga0157216_1039454923300012668Glacier Forefield SoilMRKQLVLIIPILAAMIGMFAIVGLNSVNGQNMSTPETNMTAGTNATMGGNMTSGNATSNNMAGPTNMTSSP*
Ga0157304_109682113300012882SoilVRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTGSENMT
Ga0157298_1036479323300012913SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNAT
Ga0162651_10006736213300012938SoilMRKQLVLIIPILAAMIGMIAIIGLNSVNAQNMSTPEGNMTSGNATSGNMTGPTNMTSP*
Ga0162652_10000085543300012941SoilMRKQLVLIIPILAAMIGMLTIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGPTNMTSP*
Ga0164300_1015236433300012951SoilMRTQLAFIIPILAAMIGMIAIVGLNSANAQNVSTPETNMTGGTNATMGGNMT
Ga0164300_1080791713300012951SoilMRKQLILIIPILAAMIGMIAIVGLNSVNAQNMSTPETNMTGGTNASMGGNMTSSDNT
Ga0164298_1067044313300012955SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTSPTNMTTP*
Ga0164303_1009683923300012957SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMSGNMTSPTNMTTS*
Ga0164303_1134343713300012957SoilMRKQLVLIIPILAAMIGMFAIAGLNSVNGQNMSTPEGNMTSGNATSGNMTSGNSSYLGPTNMTSP*
Ga0164299_1060067023300012958SoilMRTKLVFIIPILAAMIGMIAIVGLNSANAQNVSTPETNMTGGTNATMGGNMTSSDN
Ga0164299_1124565513300012958SoilMRKQFVLIIPLLAAMIGMFAIVGLNSANAQNMSTPETNMTSGNETSGNMTSGNMAGLT
Ga0164301_1007260813300012960SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMSGGNATMGGNMTSGNATMGGN
Ga0164301_1084833813300012960SoilQLVLIIPILAAMIGMLAIVGLDSANAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTSPTNMTTS*
Ga0164302_1044735123300012961SoilMRKQLVLIIPILAAMIGMIATVGLNTANAQNMSTPETNMTSMSNSTMGEDMTGQIHQVQ*
Ga0164302_1062481313300012961SoilMTKQLVLIIPILAAMLGMIAIIGLNSVNAQNMSTPETNMTSGNATSGNMTSGNATSGNMAGPTNMT
Ga0164302_1122306023300012961SoilIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMSGNMTSGNATMSGNMTSPTNMTTS*
Ga0164308_1044020523300012985SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMAGPTNMTSP*
Ga0164308_1212909213300012985SoilMRTQLVLIIPIIAAMIGMIAIVGLNSANAQNMSTPETNMTGMSNATMSGNMTGSEN
Ga0173483_1019477513300015077SoilKQLVLIIPILAAMIGMFAIVGLSSANAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMAGSTNMTSP*
Ga0132256_10003157463300015372Arabidopsis RhizosphereMKTQLVFIIPILAAMIGMIAIVGLNSVNAQNASMPETNMSDMSNATMGGNMTGSE
Ga0132256_10122502713300015372Arabidopsis RhizosphereMRTKLVLVISLLAAMMGMFAIVGATYANAQNMSTPETNMSNMSNATMSGNMTSGNSSYLG
Ga0132257_10299763413300015373Arabidopsis RhizosphereMRTKLVFIIPILAAMIGMIAIVGLNSVNAQNTSMPETNMTGGTNSTMGGGITGNMTNSSSPMMTNSS*
Ga0184605_1010414133300018027Groundwater SedimentMRKQLVLIIPILAAMIGMIAIVGLDSVNAQNMSTPETNMTSGNATSGNMTGSTNMTTP
Ga0184608_1019206513300018028Groundwater SedimentMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGPTNMTSP
Ga0184620_1001337833300018051Groundwater SedimentMRKQLVLIIPILAAMIGMFAITGLNAVNAQNMSTPETNMTSENATSGNMTSGN
Ga0184620_1019222923300018051Groundwater SedimentMRKQLVLIIPLLAAMIGMFAIVGLNSANAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGP
Ga0184620_1020318613300018051Groundwater SedimentMRKQLVLIIPILAAMIGMIAIVGLNSVNAQNMSTPEGNMTSGNATSGNMTGPINMTSP
Ga0184623_1022516623300018056Groundwater SedimentMRRQIVLIIPLLAAMVSVLAFVGLNTVNAQNMSIPETNMTSLTNATIGNMTSGNMTSGNMTSGNMTSGNMTSP
Ga0184623_1023956313300018056Groundwater SedimentMRTQLVLIIPILAAMTGMFAIIGLNSVNAQNMSIPETNMTSLTNATLDGNMTSGNMTSGNMTVDLNMTTP
Ga0184619_1009074223300018061Groundwater SedimentMISVILIALIIPILAAMIGMIAIVGLNSVNAQNMSTPEGNMTSGNMTSGNATSGNMAGPTNMTSP
Ga0184619_1011177533300018061Groundwater SedimentMKKQLVLVIPLLAALIGMIAIVGLNTVNAQNMSTPETNMTSMSNATMGGNMTGSENM
Ga0184619_1018780823300018061Groundwater SedimentMRKQLVLIIPILAAMIGMIAIVGLDSVNAQNMSTPETNMTSGNATSGNMTSGNMTGSTNMTTL
Ga0184619_1051624013300018061Groundwater SedimentMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTAPTNMTSP
Ga0184619_1055725613300018061Groundwater SedimentMRKQLVLIIPILAAIIVVFAIVGLNSVNAENMSTPETNMTSGTNATMEGNMTSGNATSGNMAGPTNMTTP
Ga0184609_1035927523300018076Groundwater SedimentMRKQLVLIIPILAAMIGMIAIVGLNTVNAQNMSTPESNYTIGTNATSGNMAGPTNMTSSP
Ga0190273_1232010813300018920SoilMRKQLVLAIPLLAAMIGMFAIVGLNSVNAQNMSAPESNMTGGNTTIGNITGGNVGNMTSPTNMTSPTNMTTP
Ga0193704_100882433300019867SoilMRKQLVLIIPILAALIGMFAIAGLNAVNAQNMSTPETNMTSGNATSGNMTSGNSSYLGPTNLTSSP
Ga0193704_104294023300019867SoilMKKQLVLIIPILSVMIGMLAIVGLNSVNAQNVSAPETNMTGGTNMTNSTMAGNMTSSGNMTTTSGSMYGK
Ga0193720_100989013300019868SoilMRKQLVLIIPILAAMIGMLAIVGLNAVNAQNMSTPETNMTGGNATSGNMTSSDNMTTSSG
Ga0193720_101868513300019868SoilMRKQLVLIIPLLAAMIGMFAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTAPTNMTSP
Ga0193705_101815223300019869SoilMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGPTNMTSP
Ga0193700_100878423300019873SoilMRKQLVLIIPILAAMIGMIAIVGLNSVNAQNMSTPEGNMTSGNMTSGNATSGNMAGPTNMTSPYQLSVT
Ga0193700_106235023300019873SoilMRKQLVLIIPLLAAMIGMLAIVGLNSVNAQNVSAPETNMTGGNATMGGNMTGG
Ga0193701_105645113300019875SoilMRKQLVLIIPILAAMIGMIAIIGLNSVNAQNMSTPEGNMTSGNMTSGNATSGNMAGPTNMTSP
Ga0193723_102504123300019879SoilMRKQLVLIIPILAAMIGMIAIVGLNSVNAQNMSTPEGNMTSGNATSGNMAGPTNMTSP
Ga0193735_108857313300020006SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTSGNATMGGNMTSGNATMGGNMTAPTNMTSP
Ga0193735_112177813300020006SoilMRKQLVLIIPILAAMIGMIAIVGLNFVNAQNMSTPEGNMTSGNATSGNMAGPTNMTSP
Ga0193735_118164013300020006SoilMRTQLVLIIPLLAAIVSVLAVVGLNSVNAQNMSTPETNMTSETNATMGGNMTGTGNM
Ga0193757_100734523300020008SoilMIGMSAIVGPNSVNAQNMSTPETNMTAGTNATMGGNMTSGNATSNNMAGPTNMTSSP
Ga0193732_108466623300020012SoilMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGTNMTNSTMTGNMTSSGNMTT
Ga0193721_103374323300020018SoilRKQLVLIIPILAAMIGMIAIIGLNSVNAQNMSTPEGNMTSGNATSGNMAGPTNMTSP
Ga0193721_104356123300020018SoilMKKQLVLVIPLLAALIGMIAIVGLNTVNAQNMSTPETNMTSGNATSGNMTSGNSSYLGPT
Ga0210381_1036687023300021078Groundwater SedimentMRKQLVLIIPLIAAMIGMFTIVGLNSANAQNMSTPETNMTSMSNATMGGNMTGSENMTN
Ga0193695_101654923300021418SoilMRKQLVLIIPILAAMIGMIAIIGLNSVNAQNMSTPEGNMTSGNATSGNMAGPTNMTSP
Ga0222623_1005247623300022694Groundwater SedimentMKKQLVLIIPILAAMLGMLAVLGVNSVNAQNMSTPETNMTGGTNATMGGNMTSGNATSGNMAGPTNMTSP
Ga0222623_1012560513300022694Groundwater SedimentMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTGGNATMGGNMTGPTNMTSP
Ga0222623_1014809613300022694Groundwater SedimentMRTQLVLIIPLLAAMIGMIAILGLSSVNSQNMSTPESNMTSGTNMTNSTMGGNMTT
Ga0222623_1030739413300022694Groundwater SedimentMRTQLVLIVPLLAAMLGMLAIVGLNTVDANMTSAKGNYTSGTNATSGNMTSGNATSGNIA
Ga0222622_1055619523300022756Groundwater SedimentRKAMRKQLVLIIPILAAMIGMFAIAGLNAVNAQNMSTPETNMTSGNATSGNMTSGNSSYLGPTNLTSSP
Ga0207688_1106073713300025901Corn, Switchgrass And Miscanthus RhizosphereMRKQLVLIIPILAAMIGMLAIVGLNSVNAQNMSTPETNMTGGNATMGGNMTGSENMTNSS
Ga0209998_1010907413300027717Arabidopsis Thaliana RhizosphereMRKQLVLIIPILAAMIGMIAIVGLNTVDANMTSSGGNYTSGANATSGNMTSGNASDNMAGPTNMTTP
Ga0307312_1055848213300028828SoilMKKQLVLVIPLLAALIGMIAIVGLNTVNAQNMSTPETNMTSGNATSGNMTSGNSSYLGPTNLTSSP
Ga0307277_1036614213300028881SoilVFFFKDKAMRKQLVLIIPILAALMGMFAIAGLNAVNAQNMSTPETNMTSGNATSGNMTSGNSSYLGPTNLTSSP
Ga0310885_1087851113300031943SoilMRKQLVLIIPILAAMIGMLAIVGLNSANAQNMSTPETNMTGGNATMGGNMTSGNATMGGNMTGPTNMTTS
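
The sequence rows above fuse the protein ID, sample taxon ID, habitat, and sequence into one string. The Python sketch below splits them under two assumptions that hold for this listing but are not confirmed by NMPFamsDB: every sample taxon ID is exactly 10 digits, and the habitat is one of the names that appear in this family's rows. `HABITATS` and `parse_sequence_row` are hypothetical names.

```python
import re

# Habitat names that occur in this family's sequence rows.
HABITATS = [
    "Soil",
    "Terrestrial Soil",
    "Rice Paddy Soil",
    "Glacier Forefield Soil",
    "Groundwater Sediment",
    "Corn Rhizosphere",
    "Populus Rhizosphere",
    "Miscanthus Rhizosphere",
    "Switchgrass Rhizosphere",
    "Arabidopsis Rhizosphere",
    "Arabidopsis Thaliana Rhizosphere",
    "Corn, Switchgrass And Miscanthus Rhizosphere",
]

# Greedy protein ID, then the last 10 digits of the digit run as taxon ID.
ROW_RE = re.compile(r"^(Ga\d+_\d+)(\d{10})(.+)$")


def parse_sequence_row(line: str) -> dict:
    """Split one fused row into protein ID, taxon ID, habitat, and sequence."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized sequence row: {line!r}")
    protein_id, taxon_id, rest = match.groups()
    # Try longer habitat names first so a short name never shadows a long one.
    for habitat in sorted(HABITATS, key=len, reverse=True):
        if rest.startswith(habitat):
            return {
                "protein_id": protein_id,
                "taxon_id": taxon_id,
                "habitat": habitat,
                "sequence": rest[len(habitat):],
            }
    raise ValueError(f"unknown habitat in row: {line!r}")


record = parse_sequence_row(
    "Ga0062590_10247417613300004157Soil"
    "MRTQLVLIVPLLAAMIGMLAIVGLNTVDANMTSSEGNYTSGTNATSGNMTSGNATSDNMAGP"
)
print(record["protein_id"], record["taxon_id"], record["habitat"])
```

The sketch leaves any trailing `*` on the sequence untouched; strip it before measuring length if it is not meant to count as a residue.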


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice