NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F098199

Metagenome Family F098199

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F098199
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 50 residues
Representative Sequence QSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Number of Associated Samples 5
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 4.81 %
% of genes from short scaffolds (< 2000 bps) 0.96 %
Associated GOLD sequencing projects 4
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.038 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Tunicates → Ascidians → Unclassified → Unclassified → Ecteinascidia Turbinata
(100.000 % of family members)
Environment Ontology (ENVO) Unclassified
(100.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Animal → Animal corpus
(100.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 80.39%    β-sheet: 0.00%    Coil/Unstructured: 19.61%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF11598COMP 2.88
PF05699Dimer_Tnp_hAT 1.92
PF00664ABC_membrane 0.96
PF01138RNase_PH 0.96
PF00209SNF 0.96
PF13087AAA_12 0.96
PF14529Exo_endo_phos_2 0.96
PF00648Peptidase_C2 0.96
PF00462Glutaredoxin 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 104 Family Scaffolds
COG0689Ribonuclease PHTranslation, ribosomal structure and biogenesis [J] 0.96
COG0733Na+-dependent transporter, SNF familyGeneral function prediction only [R] 0.96
COG1185Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase)Translation, ribosomal structure and biogenesis [J] 0.96
COG2123Exosome complex RNA-binding protein Rrp42, RNase PH superfamilyIntracellular trafficking, secretion, and vesicular transport [U] 0.96


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.04 %
All OrganismsrootAll Organisms0.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001463|JGI24021J15306_10005280Not Available15318Open in IMG/M
3300001463|JGI24021J15306_10011669Not Available10915Open in IMG/M
3300001463|JGI24021J15306_10112768Not Available1894Open in IMG/M
3300001539|JGI24022J15233_10008481Not Available11482Open in IMG/M
3300001913|JGI24158J21664_10096498Not Available2061Open in IMG/M
3300027611|Ga0210059_1013219All Organisms → cellular organisms → Eukaryota → Opisthokonta12374Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Ecteinascidia TurbinataHost-Associated → Tunicates → Ascidians → Unclassified → Unclassified → Ecteinascidia Turbinata100.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001463Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 1Host-AssociatedOpen in IMG/M
3300001539Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 2Host-AssociatedOpen in IMG/M
3300001689Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 4Host-AssociatedOpen in IMG/M
3300001913Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 3Host-AssociatedOpen in IMG/M
3300027611Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 2 (SPAdes)Host-AssociatedOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI24021J15306_10005280243300001463Ecteinascidia TurbinataQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24021J15306_10011669263300001463Ecteinascidia TurbinataTLINYCAIVFSFALMFANDWLTACDGFYDWLSHMKQ*
JGI24021J15306_10018337203300001463Ecteinascidia TurbinataPSLFTLINYCAIVFSFALMFANDWLTTCDGFYGASAYWLSHMKQ*
JGI24021J15306_1002177513300001463Ecteinascidia TurbinataQRPSLFTLINYSAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24021J15306_10026946113300001463Ecteinascidia TurbinataRPSLFTLINYCAIVFSFALMFANDWLTACDGFYNASACWLSHMKQ*
JGI24021J15306_10031105163300001463Ecteinascidia TurbinataQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTAYDGFYDASACWLSHIKQ*
JGI24021J15306_1004515913300001463Ecteinascidia TurbinataSLFTLINYCAIVFSFALMFANGWLTACNGFYDAFACWLSHMKQ*
JGI24021J15306_10045444103300001463Ecteinascidia TurbinataRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLAVCDGFYDASACWLSHMKQ*
JGI24021J15306_10048848133300001463Ecteinascidia TurbinataRQPYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACCLSHMKQ*
JGI24021J15306_1008141053300001463Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFILINYCAIVSSLALMFANGWFSACDGFYDASACWLSHMKQ*
JGI24021J15306_1008700213300001463Ecteinascidia TurbinataRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24021J15306_1010897513300001463Ecteinascidia TurbinataTQRPSLFTLINYCAIVFSFALMFANDWLTACNGFYDASACWLSHMKQ*
JGI24021J15306_1011276853300001463Ecteinascidia TurbinataLFTLINYCAIVFSFALMFANDWLTACDGFYDAFACWLSHMKQ*
JGI24021J15306_1013212323300001463Ecteinascidia TurbinataMTATAFCQSYMTQRPSLFTLINYCAIVFSFALMFANDWFTACDGFYGASACWLSHMKQ*
JGI24021J15306_1013247833300001463Ecteinascidia TurbinataSLFTLINYCVIVFSFALMFANGWLTACDGFYDAFACWLSHMKQ*
JGI24021J15306_1015426343300001463Ecteinascidia TurbinataFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24021J15306_1016108513300001463Ecteinascidia TurbinataTLINYCAIVFSFALMFANDWLTACGGFYGTSACWLSHMKQ*
JGI24021J15306_1016377933300001463Ecteinascidia TurbinataRPSLFTLINYCAIVFSFALMFANDWLTTCDGFYDASACWLSHMKQ*
JGI24021J15306_1017422643300001463Ecteinascidia TurbinataRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACGGFYDASACWRSHMKQ*
JGI24021J15306_1026235213300001463Ecteinascidia TurbinataSLFTLINYCAIVFSFALMFANDWLTACYGFYDASACWLSHMKQ*
JGI24022J15233_1000274713300001539Ecteinascidia TurbinataQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWRSHMKQ*
JGI24022J15233_1000848113300001539Ecteinascidia TurbinataPLCTLINYCAIVFSFALMFANDWLTACDGFYDAPACWLSHMKQ*
JGI24022J15233_1001180613300001539Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFTNGWLTACDGFYDASACWLSHMKQ*
JGI24022J15233_1002941993300001539Ecteinascidia TurbinataTVFRQSYMTQRPSLFTLINYCAIVFSFALMFANMCLTACDGFYDASAC*
JGI24022J15233_1007445613300001539Ecteinascidia TurbinataRPSLFTLINYCAIVFSFALMFANGWLTACDGFFDAFACWLSHMKQ*
JGI24022J15233_1007886413300001539Ecteinascidia TurbinataQRPSLFTLINYCAIVFSFALMFANDWLTACDGYYDASACWLRHMKQ*
JGI24022J15233_1008028473300001539Ecteinascidia TurbinataTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFSNGWHTACDGFYDASACWLSHVKQ*
JGI24022J15233_1008462613300001539Ecteinascidia TurbinataLFTLINYCAIVFSFALMFANDWLSACDGFYDASACWLSHMKQ*
JGI24022J15233_1009150013300001539Ecteinascidia TurbinataTQRPSLFTLINYCAIVFSFALMFANDWLIACDGFYDASACWLSHMKQ*
JGI24022J15233_1009732713300001539Ecteinascidia TurbinataRPSLFTLINYCAIVFSFALMFANDRLTACDGFYEASACWLSHMKQQTITITQ*
JGI24022J15233_1010755383300001539Ecteinascidia TurbinataNYCAIVFSFALMFANGWLTACDDFYDAFACWLSHMKQ*
JGI24022J15233_1011359213300001539Ecteinascidia TurbinataTQRPSLFTLINYCAIVFSFALMFANDWLTACNGLYDASACWLSHMKQ*
JGI24022J15233_1013640113300001539Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDVSACWLSHMKQ*
JGI24022J15233_1013964413300001539Ecteinascidia TurbinataTVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDWLSHMKQ*
JGI24022J15233_1019255913300001539Ecteinascidia TurbinataAIVFSFALMFANGWLTACDGFYDAFACWLSHIKQ*
JGI24159J19872_1000125013300001689Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24159J19872_10022408153300001689Ecteinascidia TurbinataPSLFTLINYCAIVFSFALMFANDWLSACDGFYDASACWLSHMKQ*
JGI24159J19872_10031203153300001689Ecteinascidia TurbinataYMTQRPSLFTLINYCAIVFSFALMFANDWLTAYDGFYDASACWLSHIKQ*
JGI24159J19872_1010564413300001689Ecteinascidia TurbinataTQRPSLFTLINYCAIVFSFALMFANDWLTACDGYYDASACWLRHMKQ*
JGI24159J19872_1010836243300001689Ecteinascidia TurbinataRQSYMTQRLSLFTLINYCAIVFSFALMFANDCLTACDGFYDASACWLSHMKQ*
JGI24159J19872_1011461523300001689Ecteinascidia TurbinataTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANGWLTAWDGFHNASAYWLSHMKQLTISG*
JGI24159J19872_1012397913300001689Ecteinascidia TurbinataSLFTLINYCAIVFSFALMFANDWLTACDGFYDVSACWLSHMKQ*
JGI24159J19872_1012549233300001689Ecteinascidia TurbinataMTAAVFRQSYMTQRPSLFILINYCAIVFSFALMFANGWLSACDGFYDASACW
JGI24159J19872_1013222813300001689Ecteinascidia TurbinataPSLFTLINYCAIVFSFALMFANDWLAVCDGFYDASACWLSHMKQ*
JGI24159J19872_1014469843300001689Ecteinascidia TurbinataQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDTSACWLSHMKQ*
JGI24159J19872_1014905723300001689Ecteinascidia TurbinataMTATVFRQSYVTHRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24159J19872_1015167323300001689Ecteinascidia TurbinataGLVDQMTATVFRQSYVTQRPSLFTLIIYCAIVFSFAPMFANDCLTACDGFYDASACWLSHMKQ*
JGI24159J19872_1016203833300001689Ecteinascidia TurbinataINYCAIVFSFALMFANDWLTACDGFYDASACWLIHMKQ*
JGI24159J19872_1016653613300001689Ecteinascidia TurbinataSLFTLINYCAIVFSFALMFANDRLTACDGFYEASACWLSHMKQQTITITQ*
JGI24159J19872_1016756913300001689Ecteinascidia TurbinataFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWITACDGFYDAPACWLSHMKQ*
JGI24159J19872_1017730223300001689Ecteinascidia TurbinataSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24159J19872_1021373323300001689Ecteinascidia TurbinataATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDRLTACDGFYDASACWLSRMKQ*
JGI24159J19872_1021745813300001689Ecteinascidia TurbinataMTATVFRQSYVTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKH*
JGI24158J21664_10010454283300001913Ecteinascidia TurbinataMTATVFCQSYMTQRPSLFTLVNYRAIVFSFALMFANDWLTACDGFYDASAXWLSHMKQ*
JGI24158J21664_1003215613300001913Ecteinascidia TurbinataSYMTQRPSLFTLINYCAIVFSFALMFANDWLTAYDGFYDASACWLSHIKQ*
JGI24158J21664_1003606213300001913Ecteinascidia TurbinataSLFTLINYCAIVFSFALMFANDWLTACDGFYDAPACWLSHMKQ*
JGI24158J21664_1005722213300001913Ecteinascidia TurbinataVFRQSYMTQRPSLFTLINYCAIVFSFALMFANSWFTACDGFYGASACWLSHMKQ*
JGI24158J21664_1006508543300001913Ecteinascidia TurbinataPYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACCLSHMKQ*
JGI24158J21664_1006716383300001913Ecteinascidia TurbinataLFTLINYCAIVFSFALMFANDWLTACDGFFDAFACWLSHMKQ*
JGI24158J21664_1008102813300001913Ecteinascidia TurbinataVFRQSYMTQRPSLFTLINYCAIVFSFALMFANMCLTACDGFYDASAC*
JGI24158J21664_1008438363300001913Ecteinascidia TurbinataNYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24158J21664_1008637913300001913Ecteinascidia TurbinataVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTVCDGFYDASACGLSHMKQ*
JGI24158J21664_1008956863300001913Ecteinascidia TurbinataLFTLINYCAIVFSFALMFANDWLTACNGLYDASACWLSHMKQ*
JGI24158J21664_1009082023300001913Ecteinascidia TurbinataRQSYMTQRPSLFTLINYCAIVFSFALMFANDCITACDGFYDASACWLSHMKQ*
JGI24158J21664_1009276343300001913Ecteinascidia TurbinataPSLFTLINYCAIVFSFALMFANDWLTACDGFYDAFACWLSHMKQ*
JGI24158J21664_1009617613300001913Ecteinascidia TurbinataINYCAIVFSFALMFANDWLTACDGFYDASTCWLSHMKQ*
JGI24158J21664_1009649863300001913Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFAIMFANDWLTACDGFYDASACWLSHMKQ*
JGI24158J21664_1009985453300001913Ecteinascidia TurbinataMTAAVFPQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWRSHMKQ*
JGI24158J21664_1014697013300001913Ecteinascidia TurbinataVFRQSYMTQRPSLFTLINYCAIVFSFALMFANGWLTAWDGFHNASAYWLSHMKQLTISG*
JGI24158J21664_1017143633300001913Ecteinascidia TurbinataVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ*
JGI24158J21664_1017443013300001913Ecteinascidia TurbinataTATVFRQSYMTQRHSLFTLINYCAIVFSFALMSANDWLTACDGFYDASACWLSHMKQ*
JGI24158J21664_1020216413300001913Ecteinascidia TurbinataTLINYCAIVFSFALMFANDWLTPCDGFYDASVFWLSHMKQ*
JGI24158J21664_1022778913300001913Ecteinascidia TurbinataVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLSACDGFYDASACWLSHMKQ*
Ga0210059_100320013300027611Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWFTACDGFYDASACWLSHMKQ
Ga0210059_101147013300027611Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWFTACDGFYDASVCWLSYMKQ
Ga0210059_1013219173300027611Ecteinascidia TurbinataQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_102241133300027611Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANGWLTACDGFYDAYACWLSHMKQ
Ga0210059_102662683300027611Ecteinascidia TurbinataMTTTVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACNGLYDASACWLSHMKQ
Ga0210059_102815083300027611Ecteinascidia TurbinataTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDCLTACDGFYDASACWLSHMKQ
Ga0210059_103776193300027611Ecteinascidia TurbinataMTATVFRQPYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASA
Ga0210059_104077013300027611Ecteinascidia TurbinataATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGIYDASACWLSHMKQ
Ga0210059_104165513300027611Ecteinascidia TurbinataMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_104177533300027611Ecteinascidia TurbinataRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDATACWLSHMKQ
Ga0210059_104337673300027611Ecteinascidia TurbinataMQEFPHHMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASAC
Ga0210059_104678043300027611Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_105655013300027611Ecteinascidia TurbinataSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASAYWLSHMKQ
Ga0210059_106663043300027611Ecteinascidia TurbinataMTQRPSLFTLINYFAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_106832113300027611Ecteinascidia TurbinataPSLFTLINYCAIVFSFALMFANDWLSACDGFYDASACWLSHMKQ
Ga0210059_107763543300027611Ecteinascidia TurbinataMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWVSHMKQ
Ga0210059_108307213300027611Ecteinascidia TurbinataMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACCLNHMKQ
Ga0210059_108631333300027611Ecteinascidia TurbinataFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDWLSHMKQ
Ga0210059_109747523300027611Ecteinascidia TurbinataTVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLSACDGFYDPSACWLSHMKQ
Ga0210059_110652733300027611Ecteinascidia TurbinataLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_111728113300027611Ecteinascidia TurbinataSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMNYHYGSTVLL
Ga0210059_112014423300027611Ecteinascidia TurbinataTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_113467413300027611Ecteinascidia TurbinataRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_114913223300027611Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYD
Ga0210059_115128113300027611Ecteinascidia TurbinataRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDVSACWLSHMKQ
Ga0210059_116802613300027611Ecteinascidia TurbinataMTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASA
Ga0210059_117295313300027611Ecteinascidia TurbinataFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLPACDGFYDASVCWLSHMKQ
Ga0210059_117378113300027611Ecteinascidia TurbinataNYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_117971413300027611Ecteinascidia TurbinataYMTQRPSLFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ
Ga0210059_119432613300027611Ecteinascidia TurbinataTATVFRQSYMTQRPSLFTLINYCAIVFSFALMFANDWLTACGGFYDTSACWLSHMKQ
Ga0210059_119614323300027611Ecteinascidia TurbinataFTLINYCAIVFSFALMFANDWLTACDGFYDASACWLSHMKQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.