NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300025850

3300025850: Groundwater microbial communities from aquifer - Crystal Geyser CG22_combo_CG10-13_8/21/14_all (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300025850
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111384 | Gp0110945 | Ga0210050
Sample Name: Groundwater microbial communities from aquifer - Crystal Geyser CG22_combo_CG10-13_8/21/14_all (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: Y
Use Policy: Open

Dataset Contents
Total Genome Size: 733976360
Sequencing Scaffolds: 32
Novel Protein Genes: 42
Associated Families: 33

Dataset Phylogeny
Taxonomy Group | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 5
All Organisms → cellular organisms → Archaea → DPANN group | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1
Not Available | 16
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Saprospiria → Saprospirales → unclassified Saprospirales → Saprospirales bacterium | 1
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 6
All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Lokiarchaeota → unclassified Lokiarchaeota → Candidatus Lokiarchaeota archaeon | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 1
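
The per-group scaffold counts listed above can be cross-checked against the Sequencing Scaffolds total given under Dataset Contents. The following is a minimal Python sketch of that arithmetic, with the counts transcribed by hand from this page. The short group labels and variable names are illustrative only and are not part of NMPFamsDB.

```python
# Scaffold counts per taxonomy group, transcribed by hand from the
# Dataset Phylogeny listing (labels shortened to the last lineage element).
scaffolds_per_group = {
    "Bacteria": 5,
    "DPANN group": 1,
    "Deltaproteobacteria bacterium": 1,
    "Not Available": 16,
    "Saprospirales bacterium": 1,
    "Candidatus Huberarchaeum crystalense": 6,
    "Candidatus Lokiarchaeota archaeon": 1,
    "Gallionellales bacterium RBG_16_57_15": 1,
}

# Total reported under Dataset Contents ("Sequencing Scaffolds").
reported_scaffolds = 32

total_scaffolds = sum(scaffolds_per_group.values())
unclassified_share = scaffolds_per_group["Not Available"] / total_scaffolds

print(f"sum of groups: {total_scaffolds}, reported: {reported_scaffolds}")
print(f"share without taxonomy: {unclassified_share:.0%}")
```

On these figures the groups add up to the reported total, and half of the scaffolds carry no taxonomic assignment.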

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater biome, aquifer, groundwater
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Utah: Grand County
Coordinates: Lat. (°) 38.9383, Long. (°) -110.1342, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001003 | Metagenome / Metatranscriptome | 808 | Y
F001911 | Metagenome / Metatranscriptome | 618 | Y
F002857 | Metagenome / Metatranscriptome | 525 | Y
F004273 | Metagenome | 445 | Y
F009876 | Metagenome | 311 | Y
F010381 | Metagenome / Metatranscriptome | 304 | Y
F011135 | Metagenome / Metatranscriptome | 294 | Y
F014684 | Metagenome / Metatranscriptome | 261 | Y
F017599 | Metagenome / Metatranscriptome | 239 | Y
F021923 | Metagenome / Metatranscriptome | 216 | N
F028419 | Metagenome / Metatranscriptome | 191 | Y
F036087 | Metagenome | 170 | N
F037060 | Metagenome / Metatranscriptome | 168 | Y
F043221 | Metagenome | 156 | N
F043249 | Metagenome / Metatranscriptome | 156 | Y
F044301 | Metagenome | 154 | N
F050143 | Metagenome / Metatranscriptome | 145 | N
F054576 | Metagenome / Metatranscriptome | 139 | N
F057888 | Metagenome / Metatranscriptome | 135 | N
F060595 | Metagenome / Metatranscriptome | 132 | N
F060596 | Metagenome / Metatranscriptome | 132 | N
F065251 | Metagenome / Metatranscriptome | 128 | Y
F076867 | Metagenome | 117 | N
F079604 | Metagenome / Metatranscriptome | 115 | N
F083695 | Metagenome / Metatranscriptome | 112 | Y
F091327 | Metagenome | 107 | N
F091328 | Metagenome | 107 | N
F094905 | Metagenome | 105 | N
F094906 | Metagenome | 105 | N
F096625 | Metagenome | 104 | N
F096627 | Metagenome | 104 | N
F098657 | Metagenome | 103 | N
F102531 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0210050_1018135 | All Organisms → cellular organisms → Bacteria | 4217
Ga0210050_1019899 | All Organisms → cellular organisms → Bacteria | 3943
Ga0210050_1041116 | All Organisms → cellular organisms → Bacteria | 2362
Ga0210050_1044055 | All Organisms → cellular organisms → Bacteria | 2253
Ga0210050_1048960 | All Organisms → cellular organisms → Archaea → DPANN group | 2097
Ga0210050_1054729 | All Organisms → cellular organisms → Bacteria | 1947
Ga0210050_1074762 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1582
Ga0210050_1121761 | Not Available | 1139
Ga0210050_1126368 | Not Available | 1111
Ga0210050_1137579 | Not Available | 1050
Ga0210050_1138383 | Not Available | 1046
Ga0210050_1208057 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Saprospiria → Saprospirales → unclassified Saprospirales → Saprospirales bacterium | 799
Ga0210050_1219590 | Not Available | 771
Ga0210050_1221217 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 767
Ga0210050_1224350 | Not Available | 760
Ga0210050_1228445 | Not Available | 751
Ga0210050_1228478 | Not Available | 751
Ga0210050_1233583 | All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Lokiarchaeota → unclassified Lokiarchaeota → Candidatus Lokiarchaeota archaeon | 740
Ga0210050_1235993 | Not Available | 735
Ga0210050_1255540 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 697
Ga0210050_1259964 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 689
Ga0210050_1260780 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 688
Ga0210050_1266448 | Not Available | 678
Ga0210050_1276144 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 662
Ga0210050_1316725 | Not Available | 604
Ga0210050_1319298 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 601
Ga0210050_1332587 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 585
Ga0210050_1352841 | Not Available | 562
Ga0210050_1395226 | Not Available | 521
Ga0210050_1403602 | Not Available | 514
Ga0210050_1409373 | Not Available | 509
Ga0210050_1416602 | Not Available | 503

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0210050_1018135 | Ga0210050_10181355 | F076867 | MDEKNQNLVELAKKKRYIALVEKLGRGSLGPKELKELEEFEKVEQAKETGVIDGAVDLGTISIYLEKSSRMVRRYVSQGMPVIRDSSGELSRFKVNDVFKWVYGNKGKDAEDKDYWENEYRKNRAKLSELELKQKEGELIPFADHVSVVKNQIRGIRAGLLRLPKHVAPKLYQQDPKLICEMLDQEIRYIINQFAGVKSNDKANKRGA
Ga0210050_1019899 | Ga0210050_10198991 | F060595 | GFISQKKFNQQFFFNTKTKKALLHEISFLEWTHLLLQK
Ga0210050_1041116 | Ga0210050_10411161 | F009876 | MSAKFLGSAPNSGRGILPRRSGWKPLPLFRWIPTEHFKAGIET
Ga0210050_1044055 | Ga0210050_10440553 | F102531 | TSGLFADPIGTLQGLQASAQQVQSVAQQGLTELSFASSILNLPNADNPAGWTLGTYGGQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK
Ga0210050_1048960 | Ga0210050_10489601 | F065251 | VMYLRAFKKGGKKYYYIAKVVRKGLRVIQKSILYIGTADTLYEKLIKLKKRA
Ga0210050_1054729 | Ga0210050_10547294 | F102531 | PQKTIAQINQEIDNLVARGVDELEAIRQVGSISIPNYATTPEQIAALRLADQARTADEILQCPYTWCRHNSAISDAILESRDYQPYRAAMTMQTTEGLSAGGHQTSAIIINGEPVFIDLTNNLIITGQQALEQVLINSEKQLTALEMIRLTTNNVWDVINLIPK
Ga0210050_1074762 | Ga0210050_10747621 | F043249 | MPKIVGTRERVHQPFYDSLIRVDGSGDLRQGNVGVFGAVQSRSQLFVRQGADVAVSNLTTGGFFPSDQTFVTLAVRVWTYFRFNVESQRTDAQNTTGPVASLAGVTADRIQRVHKLYHQAENQLFWQFIAGDKPQLTTFTAYTPAAGGLDGFFSDTRLPRANNGVPTSAALMRLARPILVPPRQGFQVVAIASPIGQAQGASIIEQLNGAVPNNDPWG
Ga0210050_1100349 | Ga0210050_11003491 | F096627 | ARQKNGHASLLTRGLIRPSPGFGWQVERLPAALLQWISSAPSLLRLERIPKDGAAQRIHLFTAGDQGLSLEMDDDTARFVIFQTRRLLQESAVRWLALPAGAKKSTTAHDLPQPITFLPTTWKDSQLASRILKEQGMNTKTAKSTLAWAVSLEWVTALSKVKLEGQRNVLANQFVLCGNAKSIWGGRDERTKVSFVHIAEKTINATIGEML
Ga0210050_1104435 | Ga0210050_11044351 | F094905 | MTTLYGYIAAIVLIVFLGAGWAHEHDKRIVFEAQVEQAGKDAAKHTAETDAKHREEMQNAEQNTIIATNSIADWYRAHPAVRVRYANTDCSAVPSTDNNP
Ga0210050_1121761 | Ga0210050_11217611 | F010381 | GSADKLNALSTLKRIENKKKEANALVAQDNILANFSPVCVNFLKEYFFIQHTFFSARKYINLKLTEISKDPRFIANASLKNNTLTEVKNACERLCESEEIRGITLTEEEDNRCFGISIPDVCPCV
Ga0210050_1123880 | Ga0210050_11238801 | F096625 | SALAHHRNLSLWYALKRLEVHTTPDEIQAKFTEWYQGSSAHSTSQLAATFAMHHSQHNQDKEPIFEEDGTYTFKLVLPCHCLDEHLLEQMADQLKDSWIYNPSSETDEVSNPELLPLYLLWYYQTVHPGPHLEFSEYLHALSSPAFTPDHLRLSQHWFLHALPYPVLPCFESGEGRME
Ga0210050_1126368 | Ga0210050_11263681 | F004273 | MKNNEKAVMLGAGVVLAYFFLQQKKEKKMNVPDLSISPPVFYA
Ga0210050_1137579 | Ga0210050_11375791 | F079604 | MTKARRGITLIKNRKKFKLDTFLLDIRKSPVAFTEICQYQGRQVQKFKQTKRILRLISENNKSIAVLPRGASKSFSLAIIALWYFYTCENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIVLADECAFFGDDSFFRMVVLPMQSGVRTIEKIPKISLVSTIDQQKGFFYDVWKNPEKYGYTKLKMTWQECDGYT
Ga0210050_1137579 | Ga0210050_11375792 | F091327 | RGWKPKTLICKDITISDDEEAFANEFSKKVSEENDTYVDTFLIELAIAQILQVHRVYVYAKEKKISRDASRMIGTVLSTLREMNATKNARKEDNIKVTVNSDIMTLIQQNLNLISNDESKKGNNIDKK
Ga0210050_1138383 | Ga0210050_11383831 | F091328 | VGRKSVTILVLACLLVGGVAGGVFLYLKHQRTVLSNVAVEKSGDLPLELSVDFDPTVNSPKFMIESVDRQTKTFNLKSVFPPTFEGKSLTSRITCQEIKIVGPGDSVGEYVVYDVLMERMEGVSKEMMIFSGLCSDNTCAEINQSCRLYLAKVAP
Ga0210050_1151808 | Ga0210050_11518081 | F094906 | TTYFVIKAYTATENVQSAQYSVAVSNGDSKVYDVYFTVVSGQLRLTWKYFVASPPNNFSAMYRKNGGSWNSWSVDPPTGRQYYTNYAGSGWQSGDQLDIEIYNPSNRSETNIQDSVLYGNDCF
Ga0210050_1166035 | Ga0210050_11660352 | F094906 | SLFVKYVKPNTTTYFVLKASTATENVQSAQYSVAVSNGDSKVYDVYMNIASGQIRLRFKYFVTSPPDNWEARCKKNGGSWNSWIVDPPSGQQYFTDYVGSGWQTGDQLDIDFYNPSNRSETNIQDTILYGNTCF
Ga0210050_1208057 | Ga0210050_12080571 | F060595 | IKKYYTAFLGFISQKKFNQQFFFNTKTKKALLHEISFLEWTHKYFK
Ga0210050_1219590 | Ga0210050_12195902 | F050143 | MAKCDWMTETDVKDAMVRGISNAGAAHTFGVNYPRQDPIQSAISAIQKGIWEKNFRDSIGKWEKKLAMVTIEEWKAATIAAASMYAEKASTIGAEEWGKYYDKAKSVIESAATEYVKSDKKKENMIKFWTDMQKLKNL
Ga0210050_1221217 | Ga0210050_12212171 | F001911 | IRYLHFYLIIMMETVINFEKFKQEYKKDFDIPVLSAKEKDIYIYLFLSLRRKMSKGMFPELYNSEIMFSKSDLKALVLKNIVIFQNHKKGWIISMNPKYITKTTECSFCGAKFNEIVYFRRNSITCPGCGFRMHGSTTAKRVNEYSVAITNIEKVEAIPKVTTSVVKVPIEHVITDVPGNLKITDIVLAPTAMERLKQIANEEVIPFKTGSADKLNALSTLKRIQNKKKEANALVAQDNILANFSPVCVNFLKEY
Ga0210050_1224350 | Ga0210050_12243501 | F036087 | RTVGLRHCDSVPRRRESPHLCRPSELRHRVDLWARAIASALESQTATYWLICHAYPVHPRTGRADHGLLAEYILAVHPEAPLCIGYMAHDWVVDPSAHAGRAPGDRRSLLVCLRCGKRHEYREQADPLAPWPWQSYGMEWDPELA
Ga0210050_1228445 | Ga0210050_12284451 | F021923 | MNLTDFFNNYEFIKKLQNHLTVNKENLLNDFPNNNIEKSLNQLLLSPSNAYLSKKIIPQTFADMTWKFKLLEGQQTLTSPSPREFTLTYESQTSRSIFYYETPISGSCYVECDFVDVEPGTTNCIFLGNAENITSKTYLAIQTFPLSSFQGNKIGDGWVAIMEFNNSNIHNPNVLAWSYDVETTGIFRLSKNTSTLKLHHNRTYSL
Ga0210050_1228478 | Ga0210050_12284781 | F002857 | MKNTKENTGVFKDDETTEMKASMKFLTDRNFVERISLRDGMKRKFTMVKDERTEIIGFDGNTKQGITYTVTENGNLRNFFTASMKCISMLSKFSKDDMFEIELRTKKVGNNIVSFYVVKK
Ga0210050_1233583 | Ga0210050_12335832 | F014684 | MKWHRYYYRVIGFGLAGGGAGLMLDELIHGPFTLSIGNHEFWGLIISIIGVILISKKPHGKD
Ga0210050_1235993 | Ga0210050_12359932 | F036087 | DEDRGYRLTVYRTVGLRHCDSVPRRRESPHRCRPSELEHRVSLWARAIAAELEPQAATYWLICHAYPVHPRTGRADHGLLAEYILAVHPEVPLCIGYMAHDWVVDPSARAGRAPGDRRSLLVCLRCGKRHEYREQADPQAPWPWQSYGMEWDPELA
Ga0210050_1255540 | Ga0210050_12555401 | F001911 | MMKRPIEFEKFKQEYKKDFDIPVLSTKEKDIFMYLFLLLRRKMSKGIFPELYNSEIMFSKSDLKTLVLKNIVIFQNYKKGWIISMNPNCITKNAECSFCGAKFNEMVYFRQNSISCPGCGFRMHGLATAKRVNDYSVAITNIEKVEAIPKVTTSVVKVPIEHVITDVPGNLKITDIVLAPTAMERTIQIANQEVIPFQTGKADKLNALSTLKSIENKKKEANALVVQDNILA
Ga0210050_1259964 | Ga0210050_12599642 | F054576 | MEFQLGLPEIIAFVLANIGGGWALLRISFAQFELRLDDRFELLDNAMADVKRIELEIVRADTRNAQTYVTQTSHDKVLERIFNVLSSMEQKLDGKADAADCEAKHLRHMERK
Ga0210050_1260780 | Ga0210050_12607801 | F001003 | MEKFKKFKQEYKKDFIIPSLSNTKEKDIFIHIFQLLREKINRGIFPEIYNNEIIYTKSDLKNLISKGIILFGRYKKGWIITINPTYATKNVECSWCGAKFNEKIYFRQKRIRCPSCMSGMNGSTVTEKFEEINDVSIISTTENKTKTEKV
Ga0210050_1266448 | Ga0210050_12664482 | F002857 | MENKKRIWGTTQMDEAYDEMIRMEKEKQNGSVFKNDEKNDKKEMEASMKFLSDKNFIERISLRDGMKRKFKLVKDEATEIIGFDGNKKQGITYTVTENGNLRSFFTTSLKCISMLSKLQKDDVFEIELRTK
Ga0210050_1268091 | Ga0210050_12680911 | F098657 | TTREVITFVARVRELKVGLGITDDERMAQFHTTVLTLLHGSAKEAYEVAAKEAATERREELAEQQTETWYLTHAEPTVAARAAKKETYLDEIPEFIEEDFEVGLTAICDQLLPFQALIKVKRYLRRHCHKPANMTIRQYVSRIRKINEHELPLLVPMSPDNQLPFDEVKELLFYGIPNRYRKKLQEMDVDMTNCPDYATFIQRAERVEEAERMDSNSNTSASAS
Ga0210050_1268125 | Ga0210050_12681251 | F098657 | TTREVITFVARVRELKVGLGITDDERMAQFHTTVLTLLHGSAKEAYEVAAKEAATERREELAEQQTETWYLTHAEPTEAVRAAKKKTYLDEIPEFIEEDFEVGLTAICDQLLPFQALIKVKRYLRRHCHKPANMTIRQYVSRIRKINEHELPLLVPMSPDNQLPFDEVKELLFYGIPNRYRKKLQEMDVDMTNCPDYATFIQRAERVEEAERMDSNSNTSASAS
Ga0210050_1276144 | Ga0210050_12761442 | F057888 | CSPAQANAMRKHSQFTDASQLGTADVIETGIIQNFLGVPFIVSTNCPQYPVGGSDINEPNIHWGTAGNTGILISKRKSGQKCSGGIFWKQKEKIDYIQNVEEMLHKFTLIMAFKCTLLQPTSICLVKTSKV
Ga0210050_1316725 | Ga0210050_13167252 | F044301 | MAVTKEIAVGKYKDKISSDSFNKGVSAVTKSPAQQAVAQKDAFTKNTIAGKEVLIKNLSKVTVEEWKAKTIAGFDKLQTKVVRAVETGKWNAAKTLTAGKNAHAAVANMKKGTLSDSYERYLAAQKAVVAVYA
Ga0210050_1319298 | Ga0210050_13192982 | F060596 | MENKEASVGAIPIPDENVAKKKMAKNRTMVLLDRKIVNRLCCCKKKIGNTYNSVISNLLDKYENEQX
Ga0210050_1332587 | Ga0210050_13325871 | F011135 | MILLIEMKKITMEKFKKFKIENKDFVLPQNLSPKEKDIFIYIFRLLRQKINKGIYPELYNNEIIYSKSDMKNLISKGIILFARYKKGWVITLNPIHITKNVECSWCGAKFSEIVYFRQNSITCPGCGFRMHNSTTAEKVDNHSIVEKVDNNSIAEKAVINVPSRHVISDISGNI
Ga0210050_1352841 | Ga0210050_13528411 | F017599 | MKNKQKMEDDIKKRIPPWSENVEKYRISAVYTLISYTTPHYLIFRYYCNSEIYMCETHKSIVLWWISPDEKEKLLAFETKKKEKYKGLKVYDGTTME
Ga0210050_1366772 | Ga0210050_13667721 | F094905 | MTTIYGYIAAIVLIVFLGAGWAHEHDKRITYQAKVEQAGKDAAKHTAEINTKHQE
Ga0210050_1374816 | Ga0210050_13748161 | F094906 | NFSKYKTTSGTSHILPIKYLKPNTTTYFVIKAYTATENVQSSQYSVAVPNGDSKVYDALLSVSSGQTKISWYYFVASPPDNFSARYRKNGGSWNPWSISRTGQRYYTDWVNTGWTTGDQLDIDIYNPSNRSETNIQDSILYGNQCF
Ga0210050_1395226 | Ga0210050_13952261 | F083695 | QDSSVTQRVYGCSDETSPNNSVADLGNGDILTPKLFADAMLKIEESGYVPYCFLCSPAQANAMRKHSQFTDASQLGTTDVIKTGIIQNFLGVPFIVSTNCPQYNNNDTDVNETGVHWGTAGNTGILISKRKSGQKCSGGIFWKQKGKIDYIQNVEEMLHKFTLIMAFKCTLLQ
Ga0210050_1403602 | Ga0210050_14036021 | F043221 | MWGSIEKVRNICGITKEQINDVVIHDILEQTDRKVRDRVYIFRRLTMQIDKIGTNKIRFDINKIGDGNRDTSIDYTDISIYRKNNGVYELFPIQKLDILSNEITFQTDIGENYEMIVEYFEDNFHFSIDTLSDAS
Ga0210050_1409373 | Ga0210050_14093731 | F028419 | IMNDNMVTTRHYLSLNYWDLRNLGSPSNKFLLYEPIITKLSYLYQNNYMSDKFSLSSDPTGKVIITGGYNNMFHIIDADQKLNTQIIIDENNEKIMNTNVIRKINSKGSCFYKKDDPSLTNINFDKRILHQTYSPVENFCHLILLNCIYSYTGALAKKSK
Ga0210050_1416602 | Ga0210050_14166021 | F037060 | NKLELTRMNEVIVDKQIEVKDEDWYGAEIAVTFQDAVNARVDKMKLAKYFLQEQLTDIPDKKLAIALQDTSVTQRVYGGTGNNSVADLDIGDILTPGVFADAMLKVEDSGYVPYCFLCSPAQANAMRKHSQFTDASQLGTADVIKTGIIQNFLGVPFIVSTNCPQYL
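
Sequences listed like those above are often converted to FASTA before further analysis. The following is a minimal Python sketch under stated assumptions: the two records are transcribed by hand from this table, the header layout (protein ID followed by family) is an arbitrary choice, and `to_fasta` is a hypothetical helper, not something NMPFamsDB provides.

```python
import textwrap

# Two rows transcribed by hand from the Sequences table:
# (protein ID, family, amino-acid sequence).
records = [
    ("Ga0210050_10489601", "F065251",
     "VMYLRAFKKGGKKYYYIAKVVRKGLRVIQKSILYIGTADTLYEKLIKLKKRA"),
    ("Ga0210050_10411161", "F009876",
     "MSAKFLGSAPNSGRGILPRRSGWKPLPLFRWIPTEHFKAGIET"),
]


def to_fasta(rows, width=60):
    """Render (protein_id, family, sequence) tuples as FASTA text.

    Hypothetical helper: the header is '>protein_id family' and each
    sequence is wrapped to at most `width` characters per line.
    """
    lines = []
    for protein_id, family, sequence in rows:
        lines.append(f">{protein_id} {family}")
        lines.extend(textwrap.wrap(sequence, width))
    return "\n".join(lines) + "\n"


fasta_text = to_fasta(records)
print(fasta_text, end="")
```

Putting the family ID in the header keeps the link between each protein and its family available to downstream tools that only read the FASTA file.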




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice