NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300009331

3300009331: Microbial communities of water from the North Atlantic ocean - ACM11



Overview

Basic Information
IMG/M Taxon OID3300009331 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0117984 | Gp0126413 | Ga0103824
Sample NameMicrobial communities of water from the North Atlantic ocean - ACM11
Sequencing StatusPermanent Draft
Sequencing CenterUniversity of Georgia
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size33387495
Sequencing Scaffolds31
Novel Protein Genes34
Associated Families33

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Autographiviridae1
All Organisms → Viruses → Predicted Viral5
All Organisms → cellular organisms → Eukaryota → Sar1
All Organisms → cellular organisms → Bacteria → Proteobacteria1
Not Available10
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Cryomorphaceae2
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → Charybdisvirus → Charybdisvirus scam31
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae2
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Oligotrichia → Strombidiidae → Strombidium → Strombidium inclinatum1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → unclassified Kyanoviridae → Synechococcus phage S-SRM011
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Tamkungvirus → Tamkungvirus ST41
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameAquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean
TypeEnvironmental
TaxonomyEnvironmental → Aquatic → Freshwater → Unclassified → Unclassified → River Water → Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean

Alternative Ecosystem Assignments
Environment Ontology (ENVO)marine biomemarine water bodysurface water
Earth Microbiome Project Ontology (EMPO)Free-living → Saline → Water (saline)

Location Information
LocationNorth Pacific Ocean
CoordinatesLat. (o)N/ALong. (o)N/AAlt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000237Metagenome / Metatranscriptome1498Y
F000744Metagenome / Metatranscriptome909Y
F000820Metagenome / Metatranscriptome878Y
F001707Metatranscriptome648N
F001968Metagenome / Metatranscriptome610Y
F002392Metatranscriptome564Y
F002556Metagenome / Metatranscriptome548Y
F003495Metagenome / Metatranscriptome483Y
F004898Metagenome / Metatranscriptome419Y
F005478Metagenome / Metatranscriptome399Y
F005629Metagenome / Metatranscriptome394Y
F010768Metagenome / Metatranscriptome299Y
F011300Metagenome / Metatranscriptome292Y
F017649Metagenome / Metatranscriptome239N
F025041Metatranscriptome203Y
F027881Metagenome / Metatranscriptome193Y
F028361Metagenome / Metatranscriptome192Y
F028825Metagenome / Metatranscriptome190N
F030378Metagenome / Metatranscriptome185Y
F031080Metagenome / Metatranscriptome183Y
F031118Metagenome / Metatranscriptome183N
F041233Metagenome / Metatranscriptome160N
F052453Metagenome / Metatranscriptome142Y
F053258Metagenome / Metatranscriptome141N
F056566Metagenome / Metatranscriptome137N
F073632Metagenome / Metatranscriptome120N
F077327Metagenome / Metatranscriptome117N
F081369Metagenome / Metatranscriptome114N
F082390Metagenome / Metatranscriptome113Y
F082393Metagenome / Metatranscriptome113N
F093788Metagenome / Metatranscriptome106N
F096844Metagenome / Metatranscriptome104N
F101001Metagenome / Metatranscriptome102Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0103824_100033All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Autographiviridae7268Open in IMG/M
Ga0103824_100985All Organisms → Viruses → Predicted Viral1604Open in IMG/M
Ga0103824_101018All Organisms → Viruses → Predicted Viral1579Open in IMG/M
Ga0103824_101932All Organisms → cellular organisms → Eukaryota → Sar1200Open in IMG/M
Ga0103824_102013All Organisms → cellular organisms → Bacteria → Proteobacteria1179Open in IMG/M
Ga0103824_102044All Organisms → Viruses → Predicted Viral1172Open in IMG/M
Ga0103824_102087All Organisms → Viruses → Predicted Viral1161Open in IMG/M
Ga0103824_102224Not Available1122Open in IMG/M
Ga0103824_102486All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Cryomorphaceae1069Open in IMG/M
Ga0103824_102850All Organisms → Viruses → Predicted Viral1004Open in IMG/M
Ga0103824_103001All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae978Open in IMG/M
Ga0103824_103359Not Available930Open in IMG/M
Ga0103824_103395All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae926Open in IMG/M
Ga0103824_103996All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales858Open in IMG/M
Ga0103824_105010Not Available767Open in IMG/M
Ga0103824_105108Not Available761Open in IMG/M
Ga0103824_105289Not Available748Open in IMG/M
Ga0103824_105310Not Available747Open in IMG/M
Ga0103824_105782All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → Charybdisvirus → Charybdisvirus scam3716Open in IMG/M
Ga0103824_106316Not Available688Open in IMG/M
Ga0103824_106368All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae686Open in IMG/M
Ga0103824_106746All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Oligotrichia → Strombidiidae → Strombidium → Strombidium inclinatum667Open in IMG/M
Ga0103824_107449All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae637Open in IMG/M
Ga0103824_107760All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → unclassified Kyanoviridae → Synechococcus phage S-SRM01624Open in IMG/M
Ga0103824_108503All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Cryomorphaceae598Open in IMG/M
Ga0103824_108726All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Tamkungvirus → Tamkungvirus ST4591Open in IMG/M
Ga0103824_109365All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae572Open in IMG/M
Ga0103824_110253Not Available549Open in IMG/M
Ga0103824_110272Not Available548Open in IMG/M
Ga0103824_111624All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes517Open in IMG/M
Ga0103824_112213Not Available504Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0103824_100033Ga0103824_10003314F082393MNYMNLLWITLCLAILGDKRLQETWFHPAGNVIKKRGLATGYPGCELRLAAIRIENN*
Ga0103824_100985Ga0103824_1009855F004898EEFRESEIYETDPQDWRGYLADDDYWLPDPELVY*
Ga0103824_101018Ga0103824_1010181F093788RRWLHYPLNLSITTMTEFNPRSYILTQLAYAEEQLMIADDMTSKLTWGNRCDALEAALADLEAA*
Ga0103824_101036Ga0103824_1010363F028825MSSVALQEALESFAKLTDTLQECIKYQDIERAMALAKERHDALVNLLEDADVDQTQRANCADTTLEHLRKEQLLAKSNSDQNRSDFIARKSSYRAYALKAA*
Ga0103824_101932Ga0103824_1019321F010768MKEVLNGGYNVHPVPPPNSEIKERITKRYERKRSKIEKLFTLGYTTSGDP*
Ga0103824_102013Ga0103824_1020131F077327FVLIFVFLTPVFGLADTSLFACGEPGKSKTENLSIDWARQSASYKSYDSVEASYWSSMIIQWTHPMQTEDTRLHEKNLSLQLNMNDMTLMYTILDEGPVLPKSDDDWAVFLPVYMVCREYF*
Ga0103824_102044Ga0103824_1020443F101001MITFNWSLSAMIELWIDDEYLEDVTNSNIPEADIHYEAEHSCYRVIIRHINEMYWLTQDWGTIAEHFGFQPESVVYCH*
Ga0103824_102087Ga0103824_1020871F011300MVMLLLATMMLGMWLMISALGSNDADDDDDFGGGMMIPAYNPTN*
Ga0103824_102224Ga0103824_1022241F027881ERVRGVLTHASRTRIITRIDLSGVRVSNALLTFGKLTLKVNRYLVGAAA*
Ga0103824_102320Ga0103824_1023201F000237IF*SLFNNIYKTYYIIFTNKHLNTDQLTRLMILHYFTP*YYLYLVKLHVLFCHES*DTDSGENTYEDKSGSYVS*FYDAFLKEIQDA*Y*TLYVFVYFFLHHFNGATVNYFFFER*NISELDEVRFYGVAPH*YFRPLMGILVISPTHYEGLM*MGLFFILLAFLPIFYNFYNVYHKYVSTIPMQNSLLQTSFFVFFMMSLYCTASMLPCGRYYYEPEGGYVGNP*VKFSYQYAYLYMA*ILHHLDLVEHYGFQYVQHLMRRHSSLKLYAQARQLPR*SSNSYLSRFSKAKVRHLRFSDAYVDVSVLNTSVKNENR*
Ga0103824_102486Ga0103824_1024862F041233MKKVWIFGLLLSLAGCSYPYEEDLTVLEADTQSALTALQGLYEEGIEADYADLERHASIARTKRFDSVHEPFFRNEFEVLKYHSRQTSRWFELQTHSEWEAQLIYGLEQVQALKHDAEKGLIAEEALRQALESEKNALLPLIAEVETSCAAMRELMAEHDSLDNHWQVMWD
Ga0103824_102850Ga0103824_1028501F056566IVIRLTIGLSRDGLNPITGTARVRWFNSPLQEYRHTHVSRRAILKMLWFSGDTSKGQWVNCHCLAS*
Ga0103824_103001Ga0103824_1030012F003495MKENNEKKIKNEDEISIITKAPPKDTAALDKRSVLAKVNSQGSGT*
Ga0103824_103359Ga0103824_1033591F001707RAQSLTFGARYARTVANPWGAEGWRHNKTDFAHWVKEASTQPNSKAKREFYGYLQMSFGDVDVDKDGKINAQEFDYLCEKVAAMPRRFGFAPSWEAEYGGSIEKRTAARKAMFDAVDSKQGAARGWIGAAQFTDWATTHIAGKIAEIDTASEVDFYHVANYSEADFLKAIEVAVTNKNSREYASLYEFLLTAFVETDATCRGEITYAEFNKLIERAAAVPRTFGLAPPDGTVEARKAIFESMDDTKTGLITFRKFLEWTVTHTAGKVEAHKAGKGYKK*
Ga0103824_103395Ga0103824_1033951F030378MTSNPLSSDFSYKKYSLEQLDNWVNDAINCEDLTPQDIYDTIIKCVDESVEYHKKYLTKSIELLSLLKGHRPVEFDYTEIDEPWDYTATGEKFPRVSNTDWENFWAAEDNSQYTEEEMDAMCERAATENDKEQCREYNLREAEYYNKRAQLDADYEAIKAAGGYDWTPLPTEDKVKKWRLPVQQ
Ga0103824_103996Ga0103824_1039963F005478MYDDYDLDYTFGNDQILDEDTYYEYHAQCDVDLDEDYAHNTQDYDTLAYRHYA*
Ga0103824_105010Ga0103824_1050102F081369MNNGGFGAAWLDEAPMLFIIFALVFLNIIFLYVAWVQNRQRRKVERDLLAVQTFGSASPFAAKQPLSTGENHLRKETHKTIADDESLLKIDKAIAMLKIGSPLEEIRTALDIETSYLKILASHHTK*
Ga0103824_105010Ga0103824_1050103F031118ARCEALTEILVEAEKLKMYRSDGTAYCEKCDNLVVRIQGVFTDVKIEDLSADEIEIIKAIKENVLSEEVFAQSSSILRNPKEWLGTLNE*
Ga0103824_105108Ga0103824_1051081F031080MNEFFEIADAPGEIFDIPELQELDDENKFDVNEYLNSNYDY*
Ga0103824_105289Ga0103824_1052892F001968MFDELWSEIQETPGEIFDMDIPELRDEKFDVNEYLKGDYDY*
Ga0103824_105310Ga0103824_1053101F002392RTSTRQSAALRTFATVPSPWGSDTWRYDKNSFLAWCTEATSKKDSAARRELYGMLAMRFGDTDVDKDGKINAAEFDGLCEDVASLPRRFGLAPSWEKEYGTVERRTASRKAMFDMLDLRQGPPRGWIGLEQFVNWASDHLITKVATIDIHADVDYYHIEQYGEHQYLSHLEEAVSNPTSRAHASLYEFLLAIFTECDSRSTGVLTFAEFDTLLSRAAEVPRTFGLAPPEASKETRKKFFDSMEDKQMG
Ga0103824_105782Ga0103824_1057822F082390MVTVNLTEQQLALMEELVGKKFDEVAQAWLPAERTKDMNKLCYDTLLNLRCVRLSKEYDETYANWDKDFYSLDKKLFLKDVGLVTDEELAQQGIC*
Ga0103824_106316Ga0103824_1063162F052453MKKQPDLAAIMATYTQQHNAMMARSAANREAFAEGRPQPYQAVQSTNWHISDRD*
Ga0103824_106368Ga0103824_1063682F053258MNLTNEQTELIIDAIWKRQHHFIAGDRRYREYGDLLETLEASLPYKYTRDEFK*
Ga0103824_106746Ga0103824_1067461F000820IQVGKLNGENVTKSALVNLIKDTDAPITQALVQTESEVEKSDADGLFKLMDANVTRGNLKNLVTDKPAPITVALAQTEKSDADGLFKLMDANVTRGNLKNLVTDKPAPITVALAQKSDADGLFKLMDANVTRGNLKNLVTDKPAPITVALAQKSDADGLFKLMDANVTRGNLKNLVTDKPAPITVAL*
Ga0103824_107449Ga0103824_1074493F093788LDPPNLSITTMTEFNPRSYILTQLAYAEEQLMIADDMYSKITWGNRCDALEAALADLEAA
Ga0103824_107760Ga0103824_1077603F096844MKLSHSSVTKIADAIKPAIIEYLIADDRVTQALQDGVADGIREVMGDMDEDLFFEIGMLVFDRIEFK*
Ga0103824_108503Ga0103824_1085031F073632ALSAQTTTRHHELGVQLEQSYGFMYGAPLNFKTLYKCGKTDKAVWRMQLGNIQFGHNYYEPSQQKSSWVSMGGALGREWRKDLNSNMRFFHGPLVGIEMSMSRSVSEVPPGTDNSYYYIMPNLIYVLGIQYRINQDLYVAAEIHPGFNTQFNYNDGEWSPNRYMSGGLNGQAALLSVAYQFETYKKGKSKKARGVVRIP
Ga0103824_108726Ga0103824_1087262F005629MKKDIRNDSSSYYVRRTWTTFTGRVVKVDHVGPYMENQDYAVALQRQHDMRRQVPATEQITKWEWVDGVTAIADLVFD*
Ga0103824_109365Ga0103824_1093651F028361MTYEQFIHKGTEFYMNMVKLIDIKLMYRMELTDDEKEINDHILEFQK
Ga0103824_110253Ga0103824_1102531F002556LEQFVNWATDHLITKVATIDVHADVDYYHIEQYGEHQYLSHLEEAVTNPTSRAHASLYEFLLAIFTECDSRSTGVLTFGEFDQLLSRAAEVPRTFGLAPPEASKETRKKFFDSMEDKQMGGVTFRLLLAWTIEHSKGKIAAQKAGKGYKK*
Ga0103824_110272Ga0103824_1102722F017649MILPLLLALSTPDPKLLLTCEQFDWLVERTLDTKSLSLPQKIDFIQSYSRWTDPSCFEVEE*
Ga0103824_111624Ga0103824_1116242F000744MIEAEVKLNVHELGVILSALQLLELSEEKYIAREYGSASTLYNKLNTLMEQMDTSQTGIREFEEASF*
Ga0103824_112213Ga0103824_1122131F025041QIGLKTTLGLVTPAVTVTVDATASTSFALSQPSGKAPSLKATLTLDSFSQRDVVFSIGEISSDDVNRDIEAILTSLLDTINADIPALPILSLPGVHYENPEFIVDNHVLLVKADFSRASVAPVIMV*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.