NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000224

7000000224: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 763840445



Overview

Basic Information
IMG/M Taxon OID7000000224 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052742 | Ga0031241
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 1, subject 763840445
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size127512101
Sequencing Scaffolds22
Novel Protein Genes26
Associated Families24

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
Not Available1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides thetaiotaomicron1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1
All Organisms → Viruses → Predicted Viral1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip21
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4163
All Organisms → cellular organisms → Bacteria3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales5
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → unclassified Candidatus Nanosynbacter → Candidatus Nanosynbacter sp. TM7-0571

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F028722Metagenome / Metatranscriptome190Y
F033081Metagenome178Y
F040149Metagenome162N
F043235Metagenome156N
F045567Metagenome152N
F046433Metagenome151N
F047508Metagenome149N
F053092Metagenome141N
F054110Metagenome140N
F066860Metagenome126N
F071328Metagenome122N
F077404Metagenome117N
F080166Metagenome115N
F089057Metagenome109N
F092229Metagenome107N
F092230Metagenome107N
F092232Metagenome107N
F099452Metagenome103N
F099453Metagenome103N
F099454Metagenome103N
F103435Metagenome101N
F105376Metagenome100N
F105379Metagenome100N
F105380Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C3641947Not Available519Open in IMG/M
C3673991All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria642Open in IMG/M
C3712625All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides thetaiotaomicron955Open in IMG/M
C3718125All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1035Open in IMG/M
C3746811All Organisms → Viruses → Predicted Viral2043Open in IMG/M
C3762969All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip27332Open in IMG/M
C3764837All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41628668Open in IMG/M
SRS014573_WUGC_scaffold_11376All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1301Open in IMG/M
SRS014573_WUGC_scaffold_18709All Organisms → cellular organisms → Bacteria13434Open in IMG/M
SRS014573_WUGC_scaffold_30034All Organisms → cellular organisms → Bacteria761Open in IMG/M
SRS014573_WUGC_scaffold_4353All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41667406Open in IMG/M
SRS014573_WUGC_scaffold_46727All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas1166Open in IMG/M
SRS014573_WUGC_scaffold_48685All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1465Open in IMG/M
SRS014573_WUGC_scaffold_48879All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1300Open in IMG/M
SRS014573_WUGC_scaffold_49562All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas701Open in IMG/M
SRS014573_WUGC_scaffold_49804All Organisms → cellular organisms → Bacteria10021Open in IMG/M
SRS014573_WUGC_scaffold_50780All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales2813Open in IMG/M
SRS014573_WUGC_scaffold_55003All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1117Open in IMG/M
SRS014573_WUGC_scaffold_55105All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1271Open in IMG/M
SRS014573_WUGC_scaffold_55379All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41650650Open in IMG/M
SRS014573_WUGC_scaffold_57150All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1831Open in IMG/M
SRS014573_WUGC_scaffold_57180All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → unclassified Candidatus Nanosynbacter → Candidatus Nanosynbacter sp. TM7-05720962Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C3641947C3641947__gene_144992F028722ERHNEIVRNDYNIENQYGPTHKDAISDGDVQGKGTGHGGHTHYLPDCTKPTGMIDYSNFDTEHGGGKYDIEGRNNIGGRNRTLAYSLYNKENMYGQNLIDTKINKEDGQYYVGQTLKRS
C3673991C3673991__gene_159501F066860MTTKKQKLQKQQAIDTWIIIALWVSAIWFSLARGFITGIGGWVLALLAPWALIVSCICLAIISRQVKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVTFGIAEDRE
C3712625C3712625__gene_178538F077404TEAGKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTPNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFR
C3718125C3718125__gene_181517F092230MRQAHKRTVDKLKAYLLKVFFPLFIVCIILVAFFRQIGCGSDGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEAQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLVIIL
C3746811C3746811__gene_199822F054110PTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKEKQLYKKESEYSSTKQHEVLLFLVRTYKGGD
C3762969C3762969__gene_217263F105380MTIVELDTSQYVKQGRIFKKFESNLLDSYMDGRQTKYNINLADLDDQISDGIVYADKTGKMIYKFSAKKIVQTAITKDLTISGLADEFKMDYYSFWVPDIYLLSYSGFNPGNGLCLAYRKKYEEYICLTNIFPDRREQENSYFPNGKKLETKSICTGSMMADIDSAEYAAWKNDAVTRASQYVNKFLSARGNADLNFVSSSLRSKVPSHDMKKFAEFLGSITKEQENVNTYEEFIEWTKNTKWLK
C3764837C3764837__gene_221930F099452MDKTYEELLQETLSKIYELKDLDNRDRGKALTIFIGERLNRELLLSSRHIFTLYKDIINLDDVSLLTDLRKTDWYKDWFTDDRNNANLINLSRFNFKTLARFEKEEYLRDAEHYDFEGVIEVDSYGLFDTLIEDKDVELFKLAAENILINHGFFHNTDYNFYDVPDEYMEDKEVCAYMCLLNIGNMDFVDKKTLDTTVLYNIVKDRICGSIYFTLFDSLNKDTRTIAR
SRS014573_WUGC_scaffold_11376SRS014573_WUGC_scaffold_11376__gene_11003F033081MHTDITVVYRPKKGVMAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRLKFSILAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPAGWAKVTAWVVGTTGVFLIGSFTSSYTPSQRLLYSLEATGCGVYDTLLLLALIEEQAFRSGSERWNWRERVRASVCFGLLHITNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIAISLIAVVLAIDIAKLL
SRS014573_WUGC_scaffold_18709SRS014573_WUGC_scaffold_18709__gene_19045F071328MSRKHWTFTNIIRYIEEYERNPLLIERMKWKFIPEGECIVEFVELCKHLVLERTIDSKEPLTTAIYLRYSSQLLLKKKRAIRRLGIGKKNVSAILRLCGIHYREYGDDEHRVFFLDTDINIYFCKHYQLPMYIIQRIEFSNKESRPFILKVLPVKRSEW
SRS014573_WUGC_scaffold_30034SRS014573_WUGC_scaffold_30034__gene_33046F047508RAVEAIATDAVLVIEFVGEPIHIGMLGHRLVEGRVKYPYLRRIWEYLRHSFDTEDVGWVVKRSELCALMEHIYYLWGDTYALSKALCTVYEAVTDGVDLIEGLYEVLFFENVEDNLYAACVVRNVKVALDLLSFGIAEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGAATVEDQDFHKVLYYMVRCELILSSP
SRS014573_WUGC_scaffold_4353SRS014573_WUGC_scaffold_4353__gene_3904F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGIGPEKSYFQTTSYMVDLSPHINNLLVNISDLKNLGKITQLEPSKDNPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIITRTDEFHNTDCYNELMAGSASTGAFRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIVVNQSMLFLPY
SRS014573_WUGC_scaffold_4353SRS014573_WUGC_scaffold_4353__gene_3924F092232MNSQSKFIAEYNDRNRPKFNDRFFCKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYAEVQKLLIGEETPSISIKDSDLKLLKVTYYVGCTKDEETFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTATSAKTQSITLKTNSNAVKMLRNFVDLNTTKEESIRLAMFSVYLFDHKVTLFEYYLARFGWYDTLEKFNFQDIIRITDYDIDDPEYYTFAIANSHMKSPFYISAVKSFVDNDRILQSFIASFQRAIMLFATKKTTLDQIYTTQFWIQKLGFNFVSSETSTFTKGNAIIESLENSYDIPTKKRLRLPDEIKADIYSVLKWMACEFSSIRLKNNLDASTKRIRWSEYIAAMYIMLINIKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNHGVYEWNSFTNEEEPNVWDDNFSKMLNVYREEKGYTSAIMLAEDAGLELTDTRDPDAVAFDAQLLGQTIAMVARTREFETQLRPALINMEDSCSIYFEEA
SRS014573_WUGC_scaffold_46727SRS014573_WUGC_scaffold_46727__gene_55813F099454MKASKLLWAVVMALTFVLTSCDRLTDEPTLEDRGYYKYFDSTAQHKSFRVVTASGKPYNHKIDWHIIGIRDSKSDTYLTKKVDTLSNGDLKISYDWVSFTVRENKSVIDVEVQENETGKVRAVYLNTNTSGRHITLPDMRVTQRAK
SRS014573_WUGC_scaffold_48685SRS014573_WUGC_scaffold_48685__gene_58823F046433MIELPTSPDALSELSPVAPPKLLSQAQDTSRDNLMVYVKADNYLGTETSDPSFMESRCETTEYEAINDFVQFIEMTERYLPDYMKGCAKELIDELAFLGVPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQCRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVKERIAGFEVDNDPEDHGASVLVMAASGDYLDNGISAYSQYGGTIYPVEACYVLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIGELSLPALANIVRPYRNGEDFDGLSRFRQLLERE
SRS014573_WUGC_scaffold_48879SRS014573_WUGC_scaffold_48879__gene_59136F040149VADEGAEELRWEVLIEEQGIPVLFVEVVAWYDGRVSSSEILRSVGVALEREPWLAPVWSHDSEDAIHDFIYDTSVPKGHTLTAVRERETVV
SRS014573_WUGC_scaffold_49562SRS014573_WUGC_scaffold_49562__gene_60250F099454GIKTLMKAYKLLWAVVMALTFVLTSCDRVTDEPTIEGKMNKFFDSQAQRKSFRVLTASGKPYNHKIDWHIIGILDPKSETYLTKKVDTLSNGDLKISYDWVAFIVRENKSVIDVEVQNNETGQDRSVDFVAQDNHKGLASPSMTVIQRAK
SRS014573_WUGC_scaffold_49804SRS014573_WUGC_scaffold_49804__gene_60670F089057MTNIIPIIAKKYNRKGDTSGSLKSLVSDLNCIDNVDDSLLFLSSIPRETKYTLDEVFDIITSDDIYIKIFGNVLTFLNMDLDYHRLLLNAIKSESYKIISIINESIPTPDLFLAKNNYECLSVALDKPFVIFDKILGMVVSQLLHTASSKEERIFGIFMTICIINREINKLASLCTGYLAITRDEVLVKDLMNESAMVAFQYMSTEDINNVVSDINSRTVLSRYLSNM
SRS014573_WUGC_scaffold_50780SRS014573_WUGC_scaffold_50780__gene_62313F077404SGTLPKGTYSDWKYKNPFAMNILKTAFAFFFALCFMMGANSYAQKTESINAEASKNELKRNAVYLPPALEEYADTTLLHQRFIVENKGNYLYIPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKEDKGFIILTNYLVVLDDKYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTAYDRVELSNFVQSYGRQAALETANAWVMAGYPFSLQSTKFENLYTRGRKLILTDGKTTLYLYFLMTDSVALNFDTEVLPYIKGVFRFNRIQ
SRS014573_WUGC_scaffold_55003SRS014573_WUGC_scaffold_55003__gene_70931F045567VHTSTHRVEDALLAVDGDILTPRDGTHIVQTERVVVVLVSQEDRIDTIDTETCGLVVEVRATVDEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKEKPHLTEREGAW
SRS014573_WUGC_scaffold_55105SRS014573_WUGC_scaffold_55105__gene_71250F043235MDEVVPSDEGHLLIDLCDDDPRSLCGGLGIVTRYPEGAIALFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVDTLIQVSGAAILVKEVKDGMYMPHHLWAEVPRLGKVQHVEGFHVREALAIIVEGFGETAGGCHSMAKDQEVPALYCRSHGFEGGRSMA
SRS014573_WUGC_scaffold_55379SRS014573_WUGC_scaffold_55379__gene_71921F080166MEVTFNGILKRLGNDVRENFHTEYIVNSGSLTTNVKYRYRMKLSPKGEQTCVYIDWDNYDDLFNVLEESIKICDPENPRTPFKRTYSDKGDLLDIRCNSLQVKYQHLNDRFGNTIDLIPFVLVDEQSGLLTEAIRFRFNNELVYDVPISRLKGFRRFLMTYNPLLHAGAMARYMAMTPLLGSNRQNMMRS
SRS014573_WUGC_scaffold_55379SRS014573_WUGC_scaffold_55379__gene_71923F099453MNRFDIIELAQQTITFVHSAFNGKVNALDPYTRLNFVSGYLDKKTNIARTTPYGCIYVSLEAFADTVEAYKFIDTDQIRNLALEIIIHELTHVDQLIDYRYIKFNNGYREEIERQCVKQSCQWILDNIQFIRSLGLVVIPEVYEERLVGLSDVTYAFKNPAVIAMSKLEHMIGKKFKEFNSNDIEIVYVDRLKNYYKIPVCINRMYHNSQNLNDLGERLLNDKQYTIEYMEYGNSKLVIKITQGA
SRS014573_WUGC_scaffold_55379SRS014573_WUGC_scaffold_55379__gene_71945F092229MSENYRFDHIPEVVLRNVKFIRENNIDIGTGDDVLDCMMEINPVLRQRIYDDYDLAKDVAERRFRSTIEELDLATVLQKCTTRPYIAILNNIFFRYFNSKLIDDMFKLGESTKVLDLAIEYECEYYTINSAKTNIRRYMQQAYFDKYAADSNIISSHRVLNDPQVNAVKSAEFTYDLFTAARSEKFNPEMVRDIFLKYGLKTNSSRNLYTRMNNNLSLYYYMEDYLTEYMLKGSFTYGSQVYSTIKEFKCLPLMNVLTQLTRHNPSGYVLDSNLELVKG
SRS014573_WUGC_scaffold_57150SRS014573_WUGC_scaffold_57150__gene_77907F053092MQSDQGLILCLTHALLVLGALILEPAEMEDTMDDHTVQLFGILIAKELGIATHRIKADEHVPRDHIPLTLVEGDDIGIVVMIEKVLIGLQDALITTELVAELADTTVIASSDLTDPVAKDTLSEARLLDVFVSIVSYKLRFFRHK
SRS014573_WUGC_scaffold_57180SRS014573_WUGC_scaffold_57180__gene_78025F105376MNEKPEVSAKEFGALQAKVEYIKDGVDKHTVMLERIENIARDNVTQAQLKTYIAEHEKESEEKYVKRTEIEGVMNFWKLVTSNLAKLFAIALVGLAIYATNNLIQQNKAVTELKEEVQQTQVRRK
SRS014573_WUGC_scaffold_57675SRS014573_WUGC_scaffold_57675__gene_81255F103435MKFVFCTEPIYQYYRAHLYDADKDKLDKQLLVEYGDYKDIWDLKQQQDALPENIFKAELTSRDYPRNPWNYVSQLINKLTYQYLIDSPDFEDIFSEILFNQSEREFYEFYKAIDRFYNGSEIFITVANDDYSDMVTQMVCSVLRRTYGIRPQIIYDIDDVHSIRDDIDFSPEGAQIAYLQRHTYLALESKSTIEPLRVWYPFDMNSYTNALE

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.