NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300020237

3300020237: Marine microbial communities from Tara Oceans - TARA_A100001011 (ERX291767-ERR318621)



Overview

Basic Information
IMG/M Taxon OID: 3300020237
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0117946 | Gp0115933 | Ga0211478
Sample Name: Marine microbial communities from Tara Oceans - TARA_A100001011 (ERX291767-ERR318621)
Sequencing Status: Permanent Draft
Sequencing Center: CEA Genoscope
Published?: Y
Use Policy: Open

Dataset Contents
Total Genome Size: 16383351
Sequencing Scaffolds: 24
Novel Protein Genes: 26
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → Viruses → Predicted Viral | 4
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → Candidatus Pelagibacter → unclassified Candidatus Pelagibacter → Candidatus Pelagibacter sp. IMCC9063 | 2
Not Available | 14
All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine → Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome | marine water body | sea water
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: TARA_030
Coordinates: Lat. (°) 33.93 | Long. (°) 32.7322 | Alt. (m) N/A | Depth (m) 70


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000107 | Metagenome / Metatranscriptome | 2222 | Y
F001756 | Metagenome / Metatranscriptome | 641 | Y
F004819 | Metagenome / Metatranscriptome | 422 | Y
F013773 | Metagenome / Metatranscriptome | 268 | N
F020823 | Metagenome / Metatranscriptome | 222 | Y
F025308 | Metagenome | 202 | N
F026125 | Metagenome / Metatranscriptome | 199 | N
F026579 | Metagenome / Metatranscriptome | 197 | N
F027868 | Metagenome / Metatranscriptome | 193 | Y
F028201 | Metagenome / Metatranscriptome | 192 | Y
F029784 | Metagenome / Metatranscriptome | 187 | N
F032678 | Metagenome / Metatranscriptome | 179 | N
F033459 | Metagenome | 177 | Y
F034541 | Metagenome / Metatranscriptome | 174 | N
F036279 | Metagenome / Metatranscriptome | 170 | N
F038721 | Metagenome / Metatranscriptome | 165 | N
F048369 | Metagenome / Metatranscriptome | 148 | N
F051208 | Metagenome | 144 | N
F056670 | Metagenome / Metatranscriptome | 137 | Y
F057435 | Metagenome / Metatranscriptome | 136 | N
F078814 | Metagenome / Metatranscriptome | 116 | N
F079191 | Metagenome | 116 | N
F082792 | Metagenome / Metatranscriptome | 113 | Y
F085576 | Metagenome / Metatranscriptome | 111 | N
F089153 | Metagenome / Metatranscriptome | 109 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0211478_100490 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2454
Ga0211478_100514 | All Organisms → Viruses → Predicted Viral | 2374
Ga0211478_100728 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → Candidatus Pelagibacter → unclassified Candidatus Pelagibacter → Candidatus Pelagibacter sp. IMCC9063 | 1817
Ga0211478_100844 | Not Available | 1612
Ga0211478_101241 | All Organisms → Viruses → Predicted Viral | 1245
Ga0211478_101633 | All Organisms → Viruses → Predicted Viral | 1038
Ga0211478_101665 | All Organisms → Viruses → Predicted Viral | 1023
Ga0211478_101680 | Not Available | 1017
Ga0211478_101752 | Not Available | 988
Ga0211478_102231 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → Candidatus Pelagibacter → unclassified Candidatus Pelagibacter → Candidatus Pelagibacter sp. IMCC9063 | 849
Ga0211478_102813 | Not Available | 742
Ga0211478_102877 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 733
Ga0211478_103220 | Not Available | 680
Ga0211478_103471 | Not Available | 649
Ga0211478_103685 | Not Available | 627
Ga0211478_104008 | Not Available | 596
Ga0211478_104129 | Not Available | 586
Ga0211478_104247 | Not Available | 577
Ga0211478_104505 | Not Available | 557
Ga0211478_104562 | All Organisms → cellular organisms → Bacteria | 553
Ga0211478_104638 | Not Available | 548
Ga0211478_105030 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 523
Ga0211478_105316 | Not Available | 508
Ga0211478_105436 | Not Available | 502

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0211478_100490 | Ga0211478_1004902 | F025308 | MISDEDFRFLLQESNGCKKALEIGTGTGKSSAALKLNCEVYSIDKDDIFEYNIDINRFNCESKEYWLNYMHYDFDFVFIDGSIEKIDCEEILKRTKNSFKIVFHDYMPKEDKDPGKNKGWYNMKVFKESALLNYDMKETQGGSHCGMLVLNKDK
Ga0211478_100514 | Ga0211478_1005141 | F034541 | MKAVEAMTWEELESALTEYQVNGNGGGMRVKDIQILHSIEDEISWRREQGYTDLLPREIEIELLEQGKIRERYL
Ga0211478_100514 | Ga0211478_1005146 | F057435 | MQKNIERQILLFTEATGLTKRAEVVHGTLFVKFNNPQDKWFFRRALRNFCRIFINRDTGINANDVGDEYAYDIVPEKDEKNWNRTEPLASQVDTQLTMMEDK
Ga0211478_100728 | Ga0211478_1007282 | F001756 | MLELNRPRKKLCPKLEKNVKINPNRITFKLKLLNIDKNYDF
Ga0211478_100844 | Ga0211478_1008443 | F048369 | MKEDRDSFIEDLADNTPNEGQFDKFMEAEVHDAEEDLLAGKTFKKLKDEVKKGDKNV
Ga0211478_101109 | Ga0211478_1011091 | F000107 | KGIGGEEDQIVADAEELESGNNESKVSLIECYYQIKGTGTLTISAVSETNNLTFTGRGKYGLRPDQLKFGDDKQILLTTDSNVDSYLLITEFRRNN
Ga0211478_101241 | Ga0211478_1012412 | F036279 | MKGTRHSLKTLFIIFSFVGLQACSVPFANGVTGQDIIKIANAGKNIKNITKEGINEELITETKNILRDIQYGGKAQR
Ga0211478_101633 | Ga0211478_1016332 | F013773 | MAKRIVLAGDSFGCEWPNGEGWPLMLAQQHGVNNIAQAGVGEYKILKQLWDLSARDAYWVNNYDCVIVCHTSPSRIHTTEHPVHKEGLHKDCDLIYTDIMDKFDWFNPRLRTAKNWFHHHYDDEYAIDIYNMIRAEIKKFINIPYLAVDHFEISNFYAKEDNVLNLSTTWPKYKGKVNHYSDEGNQIVYNQIIDKLDKIC
Ga0211478_101665 | Ga0211478_1016651 | F004819 | MQYYPQDKDPRLDERSARFHARVLKEDLATLPFVLDTCNRDINIARASTYVTWDHDKEMWAEVDHLMMNFYVQARTSETRDELEDKINRGVVELLKGPRYYEQAKVYCMIDMDYPEDESIYDIVKVPKKDRKKAGWGISGGEGEIVYHVTLHVQECNNVDLTIYDNEDRGDFFDLNGNNLESPMADLERIVNG
Ga0211478_101680 | Ga0211478_1016801 | F038721 | MAYIGNNTKQTAVDTVDERFDEFKETSIDASKVQTIFLGGDESGVADSPTDAFGVSLNVITTDCNHKTFRRIDMGTVVAQVGVVDFGYVANSN
Ga0211478_101752 | Ga0211478_1017522 | F027868 | MDYLALKIPADIEQKITSHTVTDQPEPEEGGGYPFKGDNGAYELCGCDMLQIVPAAYTDKKGRLHLEGDLYCDEEGLLKAAPVHNWRASQMRYWYMHPRSEQLVPDWREWCNVAGDACFVVPATDDNLKIMEDILDS
Ga0211478_102231 | Ga0211478_1022312 | F001756 | SLILELNSPIKKLCPKHEKNVKIKPKIITFKLKLLNI
Ga0211478_102813 | Ga0211478_1028131 | F085576 | MKQWYLRFKAGNWENGYYEIGENFDLVKEVCKAEFIKDCKALGFTPVTAELTLCEEHRI
Ga0211478_102877 | Ga0211478_1028771 | F056670 | MDRINTMNIDKFVEAFTFIGKGPCQEFNCPRQQECAEEKVECKAFRYWVNNDSYDTMRKGKKTSIAIDMERLL
Ga0211478_103220 | Ga0211478_1032202 | F078814 | AEKQAVKDVFTMLGNTKGEIISKIDGIIKQVAKKRNVKVSAIEDYFDNEILS
Ga0211478_103471 | Ga0211478_1034713 | F029784 | MCTQIYAHPVEGYKCFANANKSQGTYYTCCDLDTKEIRYVTYIYDGYFMGYYLVQSAVKATENYGNCQEKIFLSSASNKWPYYKGNDDEYTIDRQYPMQYDIPSQDEIEYSYLDSLLTTGTNKXILLKIKYSH
Ga0211478_103685 | Ga0211478_1036852 | F079191 | MTPNPNMKTVCDNCGATYVVRHDLPDEYIEQYCPFCGEEHENIDDMDEVNWDDED
Ga0211478_104008 | Ga0211478_1040081 | F032678 | WQRQNSLDVYNWLVKSFPNIEFKRHENFIAPDLEWGSKGPNIIDEYGKLKSGNQIELRSHAEYVAHTEKLDAWYCGVTQNPDKEFDERLADRDVFIDSLGDKTLDRLIKPHMGGYACHPFTYVKKDWIVAQYKKLGIMDLFDLTRSCEGDANIYPDVFGDLDYKTYVPGSPVPTCGKCFWCKEREWGVANSDEE
Ga0211478_104129 | Ga0211478_1041291 | F020823 | MITNLILFYLTIYVFFIWGQNIARTPLDTKVFLII
Ga0211478_104247 | Ga0211478_1042471 | F026579 | MAISVTQPSFSETISDVKIIFADKEGEGKPTDDEEPDCE
Ga0211478_104505 | Ga0211478_1045051 | F051208 | FYGATVSSGMVGASKANVMIGPNVESHGQFADTTGDEGATIDIYIIGSSNQVHMASWGNDNYQVHDVIGDSNILDVHPDAIGSHVRMVQYGDNNYMKTVTSGNNNTIRYYGSGGSNNAQIYLYTSGSIVELKQTGGSNTANLTVNGDSIYDYTLLVDQDGSDTCTYSFNRNDQTSDTTVQLTNSG
Ga0211478_104562 | Ga0211478_1045622 | F026125 | MFNYINSWKSRKSEKWNIEVRLGRITLLQLNYDAKKAKFRFMLLNFGLEIGGK
Ga0211478_104638 | Ga0211478_1046382 | F028201 | QIFFYPHFKMIEYNQKQSKEIYKALADSKVDLMEYFLGPDPRKTSYYKSAVRRNSILNDF
Ga0211478_105030 | Ga0211478_1050302 | F082792 | MTKKIAKPVQSKKYFHEAIEEEQKILDIGLKMSRQHKKERTESEKLQEELEPIDENI
Ga0211478_105316 | Ga0211478_1053161 | F089153 | MAISDYSSHDWRKHTDSAVVVDKNKAMLKVNECKV
Ga0211478_105436 | Ga0211478_1054361 | F033459 | MNISRQQVDKLIVGTNDVSYVAPDTSPTGTAVLNGPVYVGKTGASPGYEALL


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"