NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300004265

3300004265: Groundwater microbial communities from aquifer in Utah, USA - Crystal Geyser 4/10/14 3 um filter



Overview

Basic Information
IMG/M Taxon OID: 3300004265
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111384 | Gp0097054 | Ga0051981
Sample Name: Groundwater microbial communities from aquifer in Utah, USA - Crystal Geyser 4/10/14 3 um filter
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: Y
Use Policy: Open

Dataset Contents
Total Genome Size: 813327234
Sequencing Scaffolds: 31
Novel Protein Genes: 35
Associated Families: 30

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Predicted Viral | 1
Not Available | 14
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 6
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota | 1
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 2
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 2
All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium RBG_13_41_22 | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus pasadenensis | 1
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → unclassified Nitrososphaeria → Nitrososphaeria archaeon | 1
All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Leptospirillum → Leptospirillum sp. Group II → Leptospirillum sp. Group II 'C75' | 1
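
The per-lineage scaffold counts in the Dataset Phylogeny table can be cross-checked against the "Sequencing Scaffolds" total of 31. The following is a minimal sketch, not part of NMPFamsDB: the lineages and counts are transcribed by hand from the table, and the helper names `domain_of` and `tally_by_domain` are hypothetical. It groups each lineage by the first of Bacteria, Archaea, or Viruses that appears as a node and sums the counts.

```python
from collections import Counter

# (lineage, number of scaffolds), transcribed from the Dataset Phylogeny table.
ROWS = [
    ("All Organisms → Viruses → Predicted Viral", 1),
    ("Not Available", 14),
    ("All Organisms → cellular organisms → Bacteria", 1),
    ("All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla", 6),
    ("All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota", 1),
    ("All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense", 2),
    ("All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon", 2),
    ("All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium RBG_13_41_22", 1),
    ("All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus pasadenensis", 1),
    ("All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → unclassified Nitrososphaeria → Nitrososphaeria archaeon", 1),
    ("All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Leptospirillum → Leptospirillum sp. Group II → Leptospirillum sp. Group II 'C75'", 1),
]


def domain_of(lineage: str) -> str:
    """Return the domain-level node of a lineage, or 'Unassigned' if none is present."""
    nodes = [node.strip() for node in lineage.split("→")]
    for name in ("Bacteria", "Archaea", "Viruses"):
        if name in nodes:  # exact node match, so 'Bacteria candidate phyla' alone would not count
            return name
    return "Unassigned"


def tally_by_domain(rows):
    """Sum scaffold counts per domain."""
    totals = Counter()
    for lineage, count in rows:
        totals[domain_of(lineage)] += count
    return totals


totals = tally_by_domain(ROWS)
for name, count in totals.most_common():
    print(f"{name}: {count}")
print("Total scaffolds:", sum(totals.values()))
```

Under this grouping the rows sum to 31, matching the "Sequencing Scaffolds" value in Dataset Contents.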

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater biome | aquifer | groundwater
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Utah
Coordinates: Lat. (°) 38.9383 | Long. (°) -110.1342 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F002184 | Metagenome | 585 | Y
F002652 | Metagenome | 539 | Y
F003801 | Metagenome / Metatranscriptome | 467 | Y
F004273 | Metagenome | 445 | Y
F004515 | Metagenome | 434 | Y
F010662 | Metagenome | 300 | Y
F011639 | Metagenome | 288 | Y
F012552 | Metagenome / Metatranscriptome | 279 | Y
F014811 | Metagenome | 260 | Y
F019952 | Metagenome / Metatranscriptome | 226 | Y
F030345 | Metagenome | 185 | Y
F043771 | Metagenome | 155 | Y
F053063 | Metagenome | 141 | Y
F058693 | Metagenome / Metatranscriptome | 134 | Y
F060597 | Metagenome / Metatranscriptome | 132 | Y
F060838 | Metagenome / Metatranscriptome | 132 | Y
F069687 | Metagenome | 123 | Y
F071959 | Metagenome / Metatranscriptome | 121 | Y
F073189 | Metagenome | 120 | Y
F078228 | Metagenome | 116 | Y
F079604 | Metagenome / Metatranscriptome | 115 | N
F080880 | Metagenome / Metatranscriptome | 114 | Y
F080881 | Metagenome | 114 | N
F086649 | Metagenome / Metatranscriptome | 110 | Y
F093503 | Metagenome | 106 | Y
F094905 | Metagenome | 105 | N
F094906 | Metagenome | 105 | N
F096626 | Metagenome / Metatranscriptome | 104 | N
F098657 | Metagenome | 103 | N
F102531 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0051981_10046109 | All Organisms → Viruses → Predicted Viral | 1874
Ga0051981_10069159 | Not Available | 1460
Ga0051981_10096640 | All Organisms → cellular organisms → Bacteria | 1185
Ga0051981_10131612 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 974
Ga0051981_10133469 | Not Available | 965
Ga0051981_10150744 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 892
Ga0051981_10176083 | Not Available | 805
Ga0051981_10180623 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota | 792
Ga0051981_10194724 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 755
Ga0051981_10196858 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 749
Ga0051981_10212307 | Not Available | 713
Ga0051981_10215628 | Not Available | 706
Ga0051981_10225835 | Not Available | 685
Ga0051981_10231292 | Not Available | 674
Ga0051981_10240652 | Not Available | 657
Ga0051981_10252527 | Not Available | 637
Ga0051981_10257283 | Not Available | 629
Ga0051981_10270247 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 609
Ga0051981_10275200 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium RBG_13_41_22 | 602
Ga0051981_10276593 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 600
Ga0051981_10280626 | Not Available | 594
Ga0051981_10299590 | Not Available | 568
Ga0051981_10301867 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus pasadenensis | 566
Ga0051981_10304309 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 563
Ga0051981_10306440 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 560
Ga0051981_10319644 | Not Available | 545
Ga0051981_10322438 | Not Available | 542
Ga0051981_10325695 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → unclassified Nitrososphaeria → Nitrososphaeria archaeon | 538
Ga0051981_10332400 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Leptospirillum → Leptospirillum sp. Group II → Leptospirillum sp. Group II 'C75' | 531
Ga0051981_10361389 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 502
Ga0051981_10361402 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 502

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0051981_10046109 | Ga0051981_100461094 | F093503 | KPMKKYKVKIKHTDKEEIIKADSELEARVKFCEQNNLNYRHLAGKLEITLNNKPLQNNL*
Ga0051981_10069159 | Ga0051981_100691593 | F079604 | MTKARRGITLIQNRKKFKLDTFFQDIHKSPVAFMEICHYQGKQVQKFKQTKRLLRLISECDKSIAVLPRGASKSFSFAIIALWYFYTHENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIVLADECAFFGDDSFFRKVILPMQSGVRTIYKIPKISLVSTIDQDEGFFYDVWKNPEKYGYTKLKMTWQQCDGYTKEDMQKKKI
Ga0051981_10092047 | Ga0051981_100920471 | F094905 | MNALYGYIAVFVVIVFLGAGWAVEHDKRITYQAKVEQAGADALAQTEKINAKHREEMQNAEQNTIIATNSI
Ga0051981_10096640 | Ga0051981_100966402 | F078228 | MSLSEEMMKAIKEEKKKRKLGSIQETIRSILAEYFAKKS*
Ga0051981_10131612 | Ga0051981_101316122 | F102531 | LSFASSILNLPNADNPAGWTLGTYGGQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK*
Ga0051981_10133469 | Ga0051981_101334691 | F043771 | LRGWLWRGDSHLIAGSSPAVIAGIAGRKAVMDAVRLTKCECGCGCGEDATTSDGGVALCAACAEYAVDPESGEVVCSRDPRAEEITECCGAGGQTRSYWRIRPPVAPAVASDGEWACYWNTVGDGSRVVSRHATEAEAARAVVTRRQWVVAGGHPHYLYRYEVRQWDGEAWVAPYEAE*
Ga0051981_10150744 | Ga0051981_101507443 | F002184 | MKYIAKYSRVWDDDVKPIQKPLQPTLYTCEELCSILNKDNYS
Ga0051981_10176083 | Ga0051981_101760831 | F071959 | MASRNYLVMSNDMTLSDKKEYRLNALSAGLERCGLRGIGDIKADIPGLAGIPDANKVARVKLIHNYLITGQWPRSIDQRELTTGTDLVVAPAVDSLLTAPMAAVGNIVSCFQGVAAPQLVQGKLMVCYAVSVESSAVPMPVSRLIFRRGAAGNVQAQFDMEPMGIRWEVDAFFSEVVVIDPQDVFAIQVRCRNATAVPEIVHIHNFLFESAGLVVA*
Ga0051981_10180623 | Ga0051981_101806231 | F010662 | MNTHKLLMSIAEAFMIYGLIIVGYGILAIHVTKTWFGDWHVDRYLPWLTLDMFMMISFALSFFGFIVWRYLKYIEETK*
Ga0051981_10194724 | Ga0051981_101947241 | F019952 | MAQTVKSMRISSMGRVILLEKPRALRRIFYGIKVLTDLTMGHRSHISFDDPTFLSYYILDGQVQQLEAKGEGICQGDIWARNVSPAELIFDMTEILV*
Ga0051981_10196858 | Ga0051981_101968581 | F080880 | LRVEKVGSCEDCVIKAKMNEQTKLREALDLTSLGRKTLDGLLEVAVGDKTIQDFIKEQKAESKNIIEILRGTIEWKTPSNFKEFTIILEEIRNLTQMFDIASHSEIENTVILRPKAFKRLPEVVAFQTAVMLEGVGAPFEIRMMGEDIAVKMIRQEIYPLRKKEFGESLDQQIEKRLATSRPGLFKNSLMLVGPGFMNWAEKHLEEPVTDLGSIIEDVRIALGVDELPREPKEFVMGLLSACVKMNWFK
Ga0051981_10212307 | Ga0051981_102123072 | F060838 | MPTFQYVQPTDEQKQTMQTFRDKYESLAKEIGKLPTSRGLSLAFTKLEESSFWLNKAITQNS*
Ga0051981_10215628 | Ga0051981_102156281 | F069687 | MEELARKIEQILNQFAQEETKNRLSQFSMIALKEIILNEIKSYKPKIKE
Ga0051981_10220085 | Ga0051981_102200851 | F094906 | ENVQSAQYSVAVSNGDSKVYDVEMDIFGSLIRLHWKYFVASPPDNFSARYRINGGSWWSWSITRSGQQYHASDTGGWQTGDQLEINIYNPASGSETNIQDSILYGNDCF*
Ga0051981_10225835 | Ga0051981_102258351 | F080881 | MSKNSKSSVALSELAVLPQFVAPVVEIAPEVLSMIVMDIDPALIEAAEIKEARLDAVKAKKDDAKRLLAQAHEIAKTLNPLCDEQDKVKEECRTLKDVLKDALLATRVALANHPEVLENKAQIVKAAPNLMAEFNEKFNAAVIKQTQIAQADNIARMKALDASFAELDKKTAQLSKQYDEYKALSTRLFAEADQLRVEYGFA*
Ga0051981_10231292 | Ga0051981_102312921 | F086649 | NLTSESTDQQVQDAISAEIELCMNQPPPPGAENQQKYCAGKAYGMAREKTGKELNLGK*
Ga0051981_10240652 | Ga0051981_102406521 | F096626 | LASSATKRMTSAMLLGMWRNSSKYVMDVAPKLTTMTGSQLLRYKANIERHEKRLTALCSRNPELALSASDLDKIMINLADDDADTAFGAYLADRTEEVRSKLTEDSEAL*
Ga0051981_10252527 | Ga0051981_102525271 | F004273 | MKNDEKAVIAGAGVMLAYFLLKKKEKKVDIPNLSISPPVFYA*
Ga0051981_10257283 | Ga0051981_102572831 | F030345 | MELKNERILIQTIEKRDRENESPWYAITGNQGRRFSCFEEEVAKKLQINKVNLCKVRYFGKYANVMSVEGYEDKPEGADANSEIRKERNIESLRILKCVALKCAAQCFDGQSASAGEVIAKSNAFLKWLFSIEDKDAI*
Ga0051981_10270247 | Ga0051981_102702472 | F004515 | MTQTIRSVTVLPGQQRVLLEKPRVLYRIFFSIRALADQTVWYQSKISFDDPLFHSYYVLDGPGKYFEAYGEGIFQGDIWVRNASPVN
Ga0051981_10275200 | Ga0051981_102752001 | F053063 | MTKDEISARIELLKKENDEDNTKLINLNQIRESVIQGMLVRNGRILELEEILKN*
Ga0051981_10276593 | Ga0051981_102765932 | F004515 | MVQTVYSVTVFAGKKRIILEKPRVLRRVFFSIRALADQKVWLKSMLSFGDPLFRSFYVLNGPAKYFEAKGEGIFQGDVWIRNASDQ
Ga0051981_10280626 | Ga0051981_102806262 | F002652 | MKKAKASVGAIPTQVNENRNKTFEKFFEKMRRVKALEIDIQKRENDFFWEYIQKPISTQEERDDRDILKRAAHLGFAGEWRAVFQERNEILEISTDIRELAGAQGKKNDAEKFAKWKEILEKIEKTNKNTV*
Ga0051981_10299590 | Ga0051981_102995902 | F014811 | MWIYDYKTKKEYSTSSPRSKFYNYVIIEIEPIPGKPCIILVKKGMEEGRKPAQVLEQLDIFKNLSVPSAT*
Ga0051981_10301867 | Ga0051981_103018671 | F003801 | RWKEIDDGKEEVDWLMANLHASIEREGNQQLEKKKNTKESVYIFSVPIDEIKKHILNRSVVKIEQVITMTNKIREDIYEFDQQIGIASPETDVEQEKKTKKIDDFQEEIRKRWEDVWRIKEEIHTTTDIIYEISHIRRKYIGVAEEEQMSWADILQEIKHKMKNSFCF*
Ga0051981_10304309 | Ga0051981_103043092 | F058693 | LLNKPLVLNRIFFTVKVLTDPTIDYKLYISLGDPGFSSYFTLDGQVPYFEAKGEGIFQGDIWAQNVSAADLLVSMSEILV*
Ga0051981_10306440 | Ga0051981_103064401 | F073189 | MLSTIDILSIIEEQEIYDVKYLATKLQIPLEQLKEILTNMRKHTLIEYDPRTGKVTLPTWLINIEKKMEKIKPTTGEIILPKYQEIKIQDITIGNFTKNDLELKLRFKAKQKEIAI
Ga0051981_10319644 | Ga0051981_103196442 | F058693 | YSGWKVLLFEKPLVLKRIFYSVKVIVDPTTDYRSYISLGDPRFHSYYTLGGSVSFFEAKGEGIFQGDVWAQNVSSSELLFVVTEILV*
Ga0051981_10319767 | Ga0051981_103197671 | F098657 | TREVITLIARGRELKAGLGITEDAQMAQFHTTVLTLLHGSAKEAYQVAVSHTAETRRQTEAEEATVEFFADLDKPTALVRFNRKRMLYNETLEIINDDFETGLNAICDQLLPFQALIKVKRYLRRHCHKPANMTIRQYVSCIRKINEHELPLLVPMSPDNQLPFDEVKELLFYGIPNRYR
Ga0051981_10322438 | Ga0051981_103224381 | F012552 | MKKKGASVGAIPPQVEEIMKKTFEKLFQKIKYLKDMEVDVQKNENDFFWEYVQKPISTREERD
Ga0051981_10325695 | Ga0051981_103256951 | F011639 | VWIKKKYVRPRLSPDELWLIYRLIDNQYWLLRKHPHPFRSLEEVSRLRRKILRLVNRCRKPKHQISLRQRLY*
Ga0051981_10328505 | Ga0051981_103285051 | F060597 | MVQTVKSQTVYTGWKVILLEKPLASCRVFFSIKTLADPAAWRRSMISFDDPSFVSHYVFDGPLQQLEAKGEGIFQGTIWAWNVSPVDILFTVTEILL*
Ga0051981_10332400 | Ga0051981_103324001 | F080881 | EIKAKKDDAKRLIVLCHEIAKTLNPLCDEWDRVKEERALLKGVLKDALLASRIALAGHPKVRENKAQIIKAAPNLMAEFNKRFDSAVIKQTRIVQADNIARMKALDQSFAELDKKTAQLSKQYDDCKALSTRLFAEADQLKVEYGFA*
Ga0051981_10361389 | Ga0051981_103613891 | F004515 | MVQTLLSKTILPGQKLKLLEKPHVLNRVFFSIRALADDTVWYKSLVSFGDPLFHSFFVLNGPGKYFEARGEGIFQGDVWVRNLSNQNLEYTATEILI*
Ga0051981_10361402 | Ga0051981_103614021 | F004515 | MVQTVISKTILAGQKRIFLEKPIALKRVFFSITALADPGMWYKSEVSFDDPLFLSFFVLNGPARHFEARGDGIFQGNVWVSNLSDYDLEYTATEILV*
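
Rows of the Sequences table can be converted to FASTA for use with downstream tools. The following is a minimal sketch under stated assumptions, not an NMPFamsDB export feature: the function name `to_fasta` and the `family=` header tag are hypothetical, and the trailing `*` is treated as a stop-codon marker (a common convention for translated sequences that the page itself does not document) and stripped. Only two rows are included here for brevity.

```python
# (protein ID, family, sequence) tuples copied from two rows of the Sequences table.
RECORDS = [
    ("Ga0051981_100966402", "F078228", "MSLSEEMMKAIKEEKKKRKLGSIQETIRSILAEYFAKKS*"),
    ("Ga0051981_102525271", "F004273", "MKNDEKAVIAGAGVMLAYFLLKKKEKKVDIPNLSISPPVFYA*"),
]


def to_fasta(records, width=60):
    """Format (protein_id, family, sequence) tuples as FASTA text.

    A trailing '*' is assumed to mark a stop codon and is removed.
    Sequence lines are wrapped at `width` characters.
    """
    lines = []
    for protein_id, family, sequence in records:
        residues = sequence.rstrip("*")
        lines.append(f">{protein_id} family={family}")
        for start in range(0, len(residues), width):
            lines.append(residues[start:start + width])
    return "\n".join(lines) + "\n"


fasta = to_fasta(RECORDS)
print(fasta, end="")
```

Sequences that do not begin with M or do not end in `*` may be partial genes truncated at a scaffold edge, so keeping the original string alongside the stripped one may be worthwhile if that distinction matters.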

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice