NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F089500



Overview

Basic Information
Family ID F089500
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 43 residues
Representative Sequence MYYEVKNKISSTLVQLLVHFFNETKRTATKFEGLEKKRKIRTV
Number of Associated Samples 5
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 4
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.49


Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Tunicates → Ascidians → Unclassified → Unclassified → Ecteinascidia Turbinata
(100.000 % of family members)
Environment Ontology (ENVO) Unclassified
(100.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Animal → Animal corpus
(100.000 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 57.75%, β-sheet: 0.00%, Coil/Unstructured: 42.25%

Predicted 3D Structure

pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name              % Frequency in 109 Family Scaffolds
PF00078   RVT_1             1.83
PF00754   F5_F8_type_C      0.92
PF14291   DUF4371           0.92
PF00665   rve               0.92
PF01391   Collagen          0.92
PF05699   Dimer_Tnp_hAT     0.92
PF14529   Exo_endo_phos_2   0.92
PF00084   Sushi             0.92
PF00753   Lactamase_B       0.92
PF13843   DDE_Tnp_1_7       0.92
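
The frequencies above are percentages of the 109 family scaffolds, so each value corresponds to a small whole number of scaffolds. The sketch below shows that arithmetic. It assumes the percentages are simply count / 109 × 100 rounded to two decimals, which the page does not state explicitly; the function names are hypothetical.

```python
# Assumption: each percentage in the table is (scaffold count / 109) * 100,
# rounded to two decimals. The helper names are illustrative only.
TOTAL_SCAFFOLDS = 109


def scaffolds_from_percent(percent, total=TOTAL_SCAFFOLDS):
    """Recover the nearest whole number of scaffolds from a percentage."""
    return round(percent / 100 * total)


def percent_from_scaffolds(count, total=TOTAL_SCAFFOLDS):
    """Convert a scaffold count back to a percentage with two decimals."""
    return round(100 * count / total, 2)


# Two rows taken from the Pfam table above.
for pfam_id, pct in [("PF00078", 1.83), ("PF00754", 0.92)]:
    print(pfam_id, pct, scaffolds_from_percent(pct))
```

Under this assumption, 1.83 % corresponds to two scaffolds and 0.92 % to a single scaffold, so every neighboring domain listed here was observed on at most two of the 109 scaffolds.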

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name                                                    Functional Category                    % Frequency in 109 Family Scaffolds
COG2801   Transposase InsO and inactivated derivatives            Mobilome: prophages, transposons [X]   0.92
COG2826   Transposase and inactivated derivatives, IS30 family    Mobilome: prophages, transposons [X]   0.92
COG3316   Transposase (or an inactivated derivative), DDE domain  Mobilome: prophages, transposons [X]   0.92
COG4584   Transposase                                             Mobilome: prophages, transposons [X]   0.92



Phylogeny

NCBI Taxonomy

Name          Rank  Taxonomy  Distribution
Unclassified  root  N/A       100.00 %


Associated Scaffolds


Scaffold                            Taxonomy       Length
3300001463|JGI24021J15306_10101917  Not Available  2123




Environmental Properties

Associated Habitat Types

Habitat                  Taxonomy                                                                                        Distribution
Ecteinascidia Turbinata  Host-Associated → Tunicates → Ascidians → Unclassified → Unclassified → Ecteinascidia Turbinata  100.00%




Associated Samples

Taxon OID   Sample Name                                                                  Habitat Type
3300001463  Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 1            Host-Associated
3300001539  Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 2            Host-Associated
3300001689  Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 4            Host-Associated
3300001913  Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 3            Host-Associated
3300027611  Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 2 (SPAdes)   Host-Associated





Family Sequences

Protein ID  Sample Taxon ID  Habitat  Sequence
JGI24021J15306_1000033645  3300001463  Ecteinascidia Turbinata  MYYEVKSKISSTLVQLPANFFNETKRKATKFESIEKREE*
JGI24021J15306_100077298  3300001463  Ecteinascidia Turbinata  MKNLLEGKNKISSTLPQLTVNIFFQTKRIAMKFEGLEKKE*
JGI24021J15306_1000917211  3300001463  Ecteinascidia Turbinata  MYYEVKNKISSKLVQLIVDIFNKEKRAATKFEGLEKEIRIIRTV*
JGI24021J15306_1000985611  3300001463  Ecteinascidia Turbinata  MYXEVKSKLSSTLIQLAVHFFNETKRTATKFEGLEKKRRIIRTV*
JGI24021J15306_100104714  3300001463  Ecteinascidia Turbinata  MYYEVKSKISSTLVQLTVHFLNETKRTATKFEGLEKREK*
JGI24021J15306_100111653  3300001463  Ecteinascidia Turbinata  MHYEVKSKISSTLVQLTVHFLNETKRIAMKFEGLE*
JGI24021J15306_1001175911  3300001463  Ecteinascidia Turbinata  MYYEVKNKISSTVVQLTVHFFNETKRTATKCEGHEKREE*
JGI24021J15306_100134499  3300001463  Ecteinascidia Turbinata  MYYEVKNEISTTLVQSTVFFIETKRTATEFERLEKKRIYRIV*
JGI24021J15306_100140206  3300001463  Ecteinascidia Turbinata  MYYKVESKISTTRTVTVTFFTETKRTATKFEGLEKREEKRQHFQ*
JGI24021J15306_100151218  3300001463  Ecteinascidia Turbinata  MYYEVKSEISSTLVQLAVHFFNETKRTATKFEGLEKKRRIMRTV*
JGI24021J15306_1001776818  3300001463  Ecteinascidia Turbinata  MCYEVRSKISSTLVQLTVNFFTETKRTATKFEGIEKKRRII*
JGI24021J15306_100276186  3300001463  Ecteinascidia Turbinata  MYHEVKSKISSTLVQLAVHFFNGTKRTATKFEGFEKKRRIIRTV*
JGI24021J15306_100297946  3300001463  Ecteinascidia Turbinata  MYYEVKGKISSTLVQLTVHFFNETKQTASKFEGREKKRRIIRTA*
JGI24021J15306_100298236  3300001463  Ecteinascidia Turbinata  MNCEVKSKISNTLVPLTVIFLNHTKRTATKFEGLEKKRRIIQTV*
JGI24021J15306_100327907  3300001463  Ecteinascidia Turbinata  MCYEVKSEISSTLVQLAVNFFNETKRTATKFEGLEKK*
JGI24021J15306_100379227  3300001463  Ecteinascidia Turbinata  MYYEVQNKISSTLIQLAVHFLNETKRTETKFEGLEKKRRIIRTV*
JGI24021J15306_100441173  3300001463  Ecteinascidia Turbinata  MHYEVKSKISSTLLQLTVHFFNETKRTATKFEDPDKKRRIIRTL*
JGI24021J15306_100445152  3300001463  Ecteinascidia Turbinata  MYYEVKSKISSALVQLTVHFFNETKRTATKLEGR*
JGI24021J15306_100599555  3300001463  Ecteinascidia Turbinata  MYYEVKSKISSTLVRLAVHFFNETKQTATKFEGLEKKRRIIRTV*
JGI24021J15306_100613962  3300001463  Ecteinascidia Turbinata  MFYEVKSTISSTLVQLTVNFFDETERTATKVEGFEKKKIIIRTV*
JGI24021J15306_100655527  3300001463  Ecteinascidia Turbinata  MYYEVKNKISSTLVQLTLHFFNETKRTATKFQELEKKRRMIRTV*
JGI24021J15306_100656324  3300001463  Ecteinascidia Turbinata  MSYEVNNKTSSTLVQLTTFFNEAKQTATKFEGLENKRRIIRTV*
JGI24021J15306_100664464  3300001463  Ecteinascidia Turbinata  MYYELKRKVSSTLVQLDVDFFNETKRTATKFEGLENKEK*
JGI24021J15306_100731335  3300001463  Ecteinascidia Turbinata  MYGEVKSKISSTVVQLTANFFNETKQTATKFEGLEKKRRIIRMV*
JGI24021J15306_100767078  3300001463  Ecteinascidia Turbinata  MCYEVKSKISSTLVQLAVHFLNETKRTATNFEGL*
JGI24021J15306_100771642  3300001463  Ecteinascidia Turbinata  MYYEAKSEVSNTLARLAVHFFNDTKRTATKFEGLEKKRRIIRTA*
JGI24021J15306_100781284  3300001463  Ecteinascidia Turbinata  MYYEVKSKINSTLVQLTANFFNETKRIATKFEGLEKKEKNN*
JGI24021J15306_100801596  3300001463  Ecteinascidia Turbinata  MYYEVKSKIRSTLVQLPVQLFYETKRTATKFEGLDKKRRKVRTV*
JGI24021J15306_100853504  3300001463  Ecteinascidia Turbinata  VKREISSTLAQLMQIIFIETKRTATKFEGLEKREK*
JGI24021J15306_100857557  3300001463  Ecteinascidia Turbinata  VKSKIRSTLVQLSVQFFNETKRTATKFEGLEKKRRTVIRTV*
JGI24021J15306_101019174  3300001463  Ecteinascidia Turbinata  MYYEVKSKLSSTLVRLTVNFLNETKRTATKFEXLAKKRKIMRTV*
JGI24021J15306_101146305  3300001463  Ecteinascidia Turbinata  MYYEIKSKISSTLVQLTVIFFNETKRTATKFEEPGKKRRILQKL*
JGI24021J15306_101225902  3300001463  Ecteinascidia Turbinata  MYYEVKSKIGSTLVQLTANFFNETKRTATMFE*IETREE*
JGI24021J15306_101323442  3300001463  Ecteinascidia Turbinata  MHYEVKSKISSTLVQLTVQFFKETKRTATKFKGLEKKRRIIRTV*
JGI24021J15306_101326113  3300001463  Ecteinascidia Turbinata  MYYEVKSKISSTLVQLQQIIFNETKRTATTFEGLEKREQ*
JGI24021J15306_101331392  3300001463  Ecteinascidia Turbinata  MKYEVKSKISGTLVQLTVGLNFFNETKRTATKFEGLEKMRRIIRTV*
JGI24021J15306_101356274  3300001463  Ecteinascidia Turbinata  MYYEVKSKMSSTLVQLSEILFNETKPTATRFEELEKKRRIIRTA*
JGI24021J15306_101545315  3300001463  Ecteinascidia Turbinata  MYYEVKSKRSSTLVQLDVHFFNETKRTATKFEGLEKREK*
JGI24021J15306_101603402  3300001463  Ecteinascidia Turbinata  MYYAVKSKISSTLVQLTLHFFIETKRTATKFEGLEKKRRITNTWE*
JGI24021J15306_102004581  3300001463  Ecteinascidia Turbinata  VKIYYEAKRKISITLVQLDVHFFNETKRTATKFEGLEKREK*
JGI24021J15306_102107112  3300001463  Ecteinascidia Turbinata  LKNVAYYEVKSKIRSTLIQLTGHFFNETKRTATKFEGLEKREEEFEQIW*
JGI24022J15233_1000834213  3300001539  Ecteinascidia Turbinata  MCYEVKSKISSTLVHLTVNFFNETKRTATNFEGLEKKRRIIRTV*
JGI24022J15233_100215924  3300001539  Ecteinascidia Turbinata  MYHEVKNKISTTLLQLTVNFFNETKQTATKFEGLEKKRRIIRIV*
JGI24022J15233_100343967  3300001539  Ecteinascidia Turbinata  MYYEVKGKISSTLVQLIENFFNKTKRTVTKFEGLDKNRRIIRTV*
JGI24022J15233_1003708212  3300001539  Ecteinascidia Turbinata  MYYELKKIKSSTLVQLIVHFFNERKRTATKFEGLEKKRRIIGTV*
JGI24022J15233_100402503  3300001539  Ecteinascidia Turbinata  MYYEVKSKLSSTLIQLGVHFFNETKRTATKFEGLEKKRRIIRTV*
JGI24022J15233_1004071412  3300001539  Ecteinascidia Turbinata  MYYEVKNKTNSTIVHLTVDLHFFNETKRTATKFEGLEKKRRIIRTV*VNSTGY
JGI24022J15233_100506276  3300001539  Ecteinascidia Turbinata  MYYEGKSKISSTLVQLTANFFEDTKRTATKFEGLVKKRTIEQFE*
JGI24022J15233_100517955  3300001539  Ecteinascidia Turbinata  MYYQVKSKISSTLVQLNVHFFNETKRTATKFERLEKKRRKIRTV*
JGI24022J15233_100543217  3300001539  Ecteinascidia Turbinata  MXYEVKSEISSTLVQLAVNFFNETKRTATKFEGLEKK*
JGI24022J15233_100612522  3300001539  Ecteinascidia Turbinata  MYYEVKKNKQHTYTVDLHFSNEKKRTATKFEELEKKRRIIRKV*
JGI24022J15233_100825084  3300001539  Ecteinascidia Turbinata  MYYEVKSKRSSTVVQLTVIFFNETKPTATKFEGLDKKRRKIRTV*
JGI24022J15233_100831333  3300001539  Ecteinascidia Turbinata  MHYEVKSKISSTLLQLTVHFFNETKRTATKFEDPNKKRRIIRTL*
JGI24022J15233_100837024  3300001539  Ecteinascidia Turbinata  MYYEVKNKISSTLLQLTAHFFNETKRTATKFEGLEKKRKII*
JGI24022J15233_100961873  3300001539  Ecteinascidia Turbinata  MYCEVVGRLSSTLVQLTEIFFNETKRTATKCEGLEKKEEKLQQFE*
JGI24022J15233_101089534  3300001539  Ecteinascidia Turbinata  MYYEVKRKISSTLVQLPVIFFYETKRTATKFEGLENKRRMIGIV*
JGI24022J15233_101144374  3300001539  Ecteinascidia Turbinata  MYYEVKSKLSSTLVRLTVNFLNETKRTATKFEGLAKKRKIMRTV*
JGI24022J15233_101214712  3300001539  Ecteinascidia Turbinata  MHYEVESKIRSTLVQLTVQFSKKTKRTATKFEGLEKR*
JGI24022J15233_101218833  3300001539  Ecteinascidia Turbinata  MYYEVKSKISSTLVQLPANFFNETKRKATKFEAIEKREE*
JGI24022J15233_101486353  3300001539  Ecteinascidia Turbinata  MYYEVKNKISSTLVQLPVHFFNETKRTAMKFEGIEI*
JGI24022J15233_101658693  3300001539  Ecteinascidia Turbinata  MYFEIKNKTSSTLVQLTIHFLNEKKRTATKCEGLEKKRRTIRTV*
JGI24022J15233_101935241  3300001539  Ecteinascidia Turbinata  MNYEVKSKISSTLVQLIANLFNERERTATKFNRREKKRRIILTV*
JGI24159J19872_100085762  3300001689  Ecteinascidia Turbinata  MYYEVKNKQQHVQLIVHFFNETKRTATKSEGLKNRQE*
JGI24159J19872_100353523  3300001689  Ecteinascidia Turbinata  MYYEVKYKISIILVQLTVHFFNETKRTATKCEGLEKKRRIVLTI*
JGI24159J19872_100425464  3300001689  Ecteinascidia Turbinata  MCYEVKNKTSSTLIQLTVHLFNETKRTATKFERLEKKRTIIRTV*
JGI24159J19872_100618074  3300001689  Ecteinascidia Turbinata  VKMCYEVKSEISSTLVQLAVNFFNETKRTATKFEGLEKK*
JGI24159J19872_100654816  3300001689  Ecteinascidia Turbinata  MHYEVKSKISSTLVQLTVQFFNETKRTATKFKGLEKKRRIIRTV*
JGI24159J19872_100714482  3300001689  Ecteinascidia Turbinata  MYYEGKSKISSTLVQLTANFLNDTKRTATKFEGLVKKRTIEQFE*
JGI24159J19872_100743378  3300001689  Ecteinascidia Turbinata  MYYELKRKVSSTLVQLDVDXFNETKRTATKXEGXENKEK*
JGI24159J19872_100779234  3300001689  Ecteinascidia Turbinata  MYYEVKSEVSNTLARLAVHFFNDTKRTATKFEGLEKKRRIIRTA*
JGI24159J19872_100801878  3300001689  Ecteinascidia Turbinata  SKIRSTLVQLSVQFFNETKRTATKFEGLEKKRRTVIRTV*
JGI24159J19872_100823085  3300001689  Ecteinascidia Turbinata  MYYEVKNKISSTLVQLLVHFFNETKRTATKFEGLEKKRKIRTV*
JGI24159J19872_100861373  3300001689  Ecteinascidia Turbinata  MYHEVKSKISSTLVQLAVQFFNETKRTATKFEGLEKKRRIIRTV*
JGI24159J19872_100994383  3300001689  Ecteinascidia Turbinata  MYYEVKSKISRTLVQLMQIFYNETERTATKFEGLEKTKRIIRTV*
JGI24159J19872_101068294  3300001689  Ecteinascidia Turbinata  MKNHDDVKSKISSALVQLTVQFFNETKRTATEFDGLEKKRRII*
JGI24159J19872_101085881  3300001689  Ecteinascidia Turbinata  MYYEVKNKISSILVQLTVHFFNETKQTATKFERFDETRRIIRTV*
JGI24159J19872_101440802  3300001689  Ecteinascidia Turbinata  MYYEVKSKISSTLVQLDVHFFNETKRTATKFERLEKREK*
JGI24159J19872_102274121  3300001689  Ecteinascidia Turbinata  MYHEVKSKISSTLVQLLVHFFNETKRTATKFEGFEKAKNNSNSLS
JGI24158J21664_1001328015  3300001913  Ecteinascidia Turbinata  MYYEVKSEISSTLVQLAVHFFNETKRTATKFEGLEKKRRIMRTA*
JGI24158J21664_100422291  3300001913  Ecteinascidia Turbinata  MYYEVKNKISSTLVQLSVHFFNETKLTATKLEGLEKKRRIIRK
JGI24158J21664_100530985  3300001913  Ecteinascidia Turbinata  MYYEVKNKISSTLLQLTVHFFNETKRTATKFEGLEKKRKIIRTV*
JGI24158J21664_100609175  3300001913  Ecteinascidia Turbinata  MHYEVKSKISSALVQLTVHFFHETKRTATKFEGLEKKRRIIRTI*
JGI24158J21664_100650743  3300001913  Ecteinascidia Turbinata  MHYEVKSKISSTLVQLTVQFFNETKRTATKFKGLEKKRRXIRTV*
JGI24158J21664_100663738  3300001913  Ecteinascidia Turbinata  MYYEVKNKTNSTIVHLTVDLHFFNETKRTATKFEGLEKKRRIIRTV*VNSTGYL
JGI24158J21664_100774597  3300001913  Ecteinascidia Turbinata  VKMCYEVNSKTSSTLVQLAVHFLNETKLTATKFEGL*
JGI24158J21664_101182852  3300001913  Ecteinascidia Turbinata  MYYEVKNTISSTLVQLLVHFFNETKRTATKFEGLEKKEKKNSTV*
JGI24158J21664_101240913  3300001913  Ecteinascidia Turbinata  MYYEVKIKIRSTLVQLTAHFFNETKRTATKFEGLEKKEEFEQFG*
JGI24158J21664_102325492  3300001913  Ecteinascidia Turbinata  MYYEVKKIISSTLVQLIVHFFNERKRTATKFEGLEKKRRIIGTV*
Ga0210059_100762813  3300027611  Ecteinascidia Turbinata  MYYEVKRKISSTLVQLPVIFFYETKRTATKFEGLENKRRMIGIV
Ga0210059_10078517  3300027611  Ecteinascidia Turbinata  MYYEVKSKISSTLVQLPANFFNETKRKATKFEAIEKREE
Ga0210059_10088225  3300027611  Ecteinascidia Turbinata  MYYEVKSKRSSTVVQLTVIFFNETKPTATKFEGLDKKRRKIRTV
Ga0210059_10117332  3300027611  Ecteinascidia Turbinata  MYYELKKIKSSTLVQLIVHFFNERKRTATKFEGLEKKRRIIGTV
Ga0210059_10135847  3300027611  Ecteinascidia Turbinata  MYYEVKGKISSTLVQLTVHFFNETKQTASKFEGREKKRRIIRTA
Ga0210059_10152115  3300027611  Ecteinascidia Turbinata  MYYEVKNKISSKLVQLIVDIFNKEKRAATKFEGLEKEIRIIRTV
Ga0210059_10213434  3300027611  Ecteinascidia Turbinata  MYYEVKSKINSTLVQLTANFFNETKRIATKFEGLEKKEKNN
Ga0210059_10233092  3300027611  Ecteinascidia Turbinata  MYYEVKSKISKTLVQLTANFMNEIKRTATKFEGLEKKRTIIRTV
Ga0210059_10270872  3300027611  Ecteinascidia Turbinata  MCYEVRSKISSTLVQLTVNFFTETKRTATKFEGIEKKRRII
Ga0210059_10303506  3300027611  Ecteinascidia Turbinata  MYYEVKNKISSTLLQLTAHFFNETKRTATKFEGLEKKRKII
Ga0210059_10316991  3300027611  Ecteinascidia Turbinata  MYYEVKGKISSTLVQLIENFFNKTKRTVTKFEGLDKNRRIIRTV
Ga0210059_10319202  3300027611  Ecteinascidia Turbinata  MFYEVKSTISSTLVQLTVNFFDETERTATKVEGLEKKKIIIRTV
Ga0210059_10434714  3300027611  Ecteinascidia Turbinata  EVKNKISSTLVQLPVHFFNETKRTATKFERLEKKRRIIRTV
Ga0210059_10621043  3300027611  Ecteinascidia Turbinata  MYYEVKSKRSSTLVQLDVHFFNETKRTATKFEGLEKREK
Ga0210059_10626185  3300027611  Ecteinascidia Turbinata  MYYEIKSKISSTLVQLTVIFFNETKRTATKFEEPGKKRRILQKL
Ga0210059_10671212  3300027611  Ecteinascidia Turbinata  MYYELKRKVSSTLVQLDVDFFNETKRTATKFEGLENKEK
Ga0210059_10717773  3300027611  Ecteinascidia Turbinata  MYGEVKSKISSTVVQLTANFFNETKQTATKFEGLEKKRRIIRMV
Ga0210059_10941812  3300027611  Ecteinascidia Turbinata  MYYEVKNKINSTLVQLSVHFFNETKLTATKLEGLEKKRRIIRKV
Ga0210059_11008513  3300027611  Ecteinascidia Turbinata  MYLEVKSKISSTLVQLLVHFFNETKRTATKFEGFEK
Ga0210059_11241752  3300027611  Ecteinascidia Turbinata  MYYEVKSKISSTLVRLAVHFFNETKQTATKFEGLEKKRRIIRTV
Ga0210059_11864232  3300027611  Ecteinascidia Turbinata  MYYEVKNKISSTLVQLPVHFFNETKRTATKFERLEKKRRIIRTV
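
Many of the sequences above end in `*`, while the representative sequence in the overview is listed without it. The sketch below counts residues while ignoring a trailing `*`, on the assumption that it is the conventional stop-codon marker rather than a residue; the page itself does not say so. The function names are hypothetical, and internal `*` characters (present in a few members) are left untouched.

```python
# Assumption: a trailing '*' marks the translation stop and is not a residue.
# Function names are illustrative, not part of any NMPFamsDB tooling.

def residue_length(seq):
    """Length of a protein sequence, ignoring trailing '*' characters."""
    return len(seq.rstrip("*"))


def mean_length(seqs):
    """Average residue length over a non-empty list of sequences."""
    lengths = [residue_length(s) for s in seqs]
    return sum(lengths) / len(lengths)


# The family member identical to the representative sequence.
representative = "MYYEVKNKISSTLVQLLVHFFNETKRTATKFEGLEKKRKIRTV*"
print(residue_length(representative))
```

For this member the count is 43 residues, which matches the average sequence length reported in the overview.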




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice