NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102042

Metagenome Family F102042

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102042
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 45 residues
Representative Sequence VVEDLISFGIEDHTFGPTKVREHFPEEELTLGKNRLLEVASLVE
Number of Associated Samples 5
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.98 %
Associated GOLD sequencing projects 4
AlphaFold2 3D model prediction Yes
3D model pTM-score0.26

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Tunicates → Ascidians → Unclassified → Unclassified → Ecteinascidia Turbinata
(100.000 % of family members)
Environment Ontology (ENVO) Unclassified
(100.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Animal → Animal corpus
(100.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 43.06%    β-sheet: 0.00%    Coil/Unstructured: 56.94%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.26
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF00078RVT_1 10.78
PF03137OATP 3.92
PF00145DNA_methylase 0.98
PF136402OG-FeII_Oxy_3 0.98
PF00520Ion_trans 0.98
PF01227GTP_cyclohydroI 0.98
PF13358DDE_3 0.98
PF13419HAD_2 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG0270DNA-cytosine methylaseReplication, recombination and repair [L] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001689|JGI24159J19872_10140537Not Available1252Open in IMG/M
3300027611|Ga0210059_1018783Not Available9694Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Ecteinascidia TurbinataHost-Associated → Tunicates → Ascidians → Unclassified → Unclassified → Ecteinascidia Turbinata100.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001463Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 1Host-AssociatedOpen in IMG/M
3300001539Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 2Host-AssociatedOpen in IMG/M
3300001689Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 4Host-AssociatedOpen in IMG/M
3300001913Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 3Host-AssociatedOpen in IMG/M
3300027611Ecteinascidia turbinata endosymbiont from Florida, USA - Sample 2 (SPAdes)Host-AssociatedOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI24021J15306_10000777203300001463Ecteinascidia TurbinataLKVNVVEDLVLFGIEDHTLGPTKVREHFPEEEVTLGKNRLLEVSSLVE*
JGI24021J15306_10009098153300001463Ecteinascidia TurbinataVKVVENLISFGIEDDTFGPTNVGEHFPEEELTLVKNRLLEVASLVD*
JGI24021J15306_10024831123300001463Ecteinascidia TurbinataHLKVKVVENLISSGIEDHTFGPMNLREHFPEEKLTLGKNRFLEVASLVE*
JGI24021J15306_1002500633300001463Ecteinascidia TurbinataVVEDLISFGIDDNTFGPTKVREHFPEEELTLGKNRLLEVASLVE*
JGI24021J15306_1003382363300001463Ecteinascidia TurbinataMVEDLISFGIKTTLLGTKVREHFPEEELTLGQNRLLDVASLVE*
JGI24021J15306_1003402053300001463Ecteinascidia TurbinataMVEDFISFGIEDHTFGPTKVREHFPEEESTLGENMLLKVASLVE*
JGI24021J15306_1004243623300001463Ecteinascidia TurbinataVVENLISSGIEHYTFGPINLREHFPEEELTLGKNRFLEAASLVE*
JGI24021J15306_1004964913300001463Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKVREHFPMEELTPSKNRLLEVASLVE*
JGI24021J15306_1005357833300001463Ecteinascidia TurbinataVVEFLISFGIEYHTFRPTKVREYFPEEKLTLGKNRLLGVASLVE*
JGI24021J15306_1006207293300001463Ecteinascidia TurbinataVVEDLISFGIEDHTLGTIKVREPFLEEELTLGKNRLLEVASLVE*
JGI24021J15306_1008116143300001463Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKVREHFAEEELTLGKNRLAEVASLVE*
JGI24021J15306_1008733233300001463Ecteinascidia TurbinataVVENLISFGIEDHTFGPTKVREHFPEEKLTLGKKRLLEVASLVE*
JGI24021J15306_1010369543300001463Ecteinascidia TurbinataMVEDVISFGIEDHTLGPIKVREHFPEKELTLGKNGSLEVASLME*
JGI24021J15306_1010600613300001463Ecteinascidia TurbinataVVENLILYGIEDHTFGPMNLREHFPEEKLTLGKNRFLE
JGI24021J15306_1012736323300001463Ecteinascidia TurbinataVVEDLISFGIEHHTLRPVKVREHFPEEELTLGKNRLLEVAILVE*
JGI24021J15306_1013947323300001463Ecteinascidia TurbinataLKVNVVEDLSFRIEDHTFGPTKVWKHFPEEEVTLGKNRLLRVASLVE*
JGI24021J15306_1014488513300001463Ecteinascidia TurbinataVVENLISFGIEDHTFGPINLSEQFPEEELTLGKNRFLEVASLVE*
JGI24021J15306_1017476623300001463Ecteinascidia TurbinataVKVVEDLISFGIEDHTFGPTKVREHFPVEELTLGKNRLLEVSSLVE*
JGI24021J15306_1018554423300001463Ecteinascidia TurbinataVVKYLISSGIENYTFGPVKLREHFPEEELILGKNRFLEVASLVE*
JGI24021J15306_1023625713300001463Ecteinascidia TurbinataVVESLISSRIEDHTFGPINFPEEELSLGKNRFLEVASLVE*
JGI24022J15233_10002390123300001539Ecteinascidia TurbinataLKVNVVEDLVLFGIEDHTLGPTKVREHFPEEELTLGKNRLLEVSSLVE*
JGI24022J15233_10013146113300001539Ecteinascidia TurbinataMWLKTFNSFGIEDHTFGATKVSEHFAEEELPLGKIRLLEVASLVE*
JGI24022J15233_1002353713300001539Ecteinascidia TurbinataHXKVKVVENLISSGIEDHTFGPMNLREHFPEEKLTLGKNRFLEVASLVE*
JGI24022J15233_10037798103300001539Ecteinascidia TurbinataMVEDFILFGIEDHTFGPTKVREHFPEEESTLGENMLLKVASLVE*
JGI24022J15233_1004347073300001539Ecteinascidia TurbinataVVQDLISFGIEDHTFGPTNVGEHFPEEELTLGKNRLLEVASLVE*
JGI24022J15233_1005850333300001539Ecteinascidia TurbinataLFEALISFGIEDHTLGPTKVREHFPDEELTLGKNRLSEVASLVE*
JGI24022J15233_1006305143300001539Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKAREHFPEEELTLGKNRLFEVASLKLS*
JGI24022J15233_1006330453300001539Ecteinascidia TurbinataVVEDLTSFGIEDHTLGPIKVREHFPEEELTLGKNRLLERKTHSR*
JGI24022J15233_1008300533300001539Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKITENVPEEELTLGKNRLLEVASLVE*
JGI24022J15233_1008366513300001539Ecteinascidia TurbinataEYLISFGIEDHTFGPTKVREHFPEEELTLGKNRLLEVASLVE*
JGI24022J15233_1008898463300001539Ecteinascidia TurbinataVVEDLSFRIEDHTFGPTMVWKHFPEEEVTLGNNRLLRVASLVE*
JGI24022J15233_1010049513300001539Ecteinascidia TurbinataVVEDLISFGIEDHTIGPTKVREHFPEEELTLGKNRLLEVA
JGI24022J15233_1010183233300001539Ecteinascidia TurbinataVVEDLISFGIEDHTLGPTKVRKNFPEEELTLGKNRLLEVASLVE*
JGI24022J15233_1010497323300001539Ecteinascidia TurbinataVADYLISFGIEDHTFGPTKVREHFPEEELTLGKNKLLKVASLVE*
JGI24022J15233_1010840043300001539Ecteinascidia TurbinataMSSRIEDHTFGPINLREHFPEEELTQDKNRLFEVASLVE*
JGI24022J15233_1016315413300001539Ecteinascidia TurbinataVVEDLISHDIEDHTFGLTKVREHFPEEELTLGKNRLLEVASLVE*
JGI24022J15233_1016394723300001539Ecteinascidia TurbinataVVEDLISFGIEDHTLGPTKTREHFPEEELILGKKRLLEVASLVE*
JGI24022J15233_1017317423300001539Ecteinascidia TurbinataVKVVEALISFGIEDHTFGPTKVREHFPVEELTLGKNRLLEVSSLVE*
JGI24022J15233_1021175223300001539Ecteinascidia TurbinataVVENLISSGIEGHTFGPINLRVHFPEEKLTLGKNMFLEVASLVE*
JGI24022J15233_1022457423300001539Ecteinascidia TurbinataVVENLISSGIEHHTFGPMNLREHFPEEELTLGKNRLLEVASLVE*
JGI24022J15233_1024325213300001539Ecteinascidia TurbinataVFLKFHLKVKVVANLISFGIEDHTFGPINLREHFPEEELTLGKNRFLEVSSLVE*
JGI24022J15233_1024763013300001539Ecteinascidia TurbinataFGIEYHTLGPTKVREHFPEEELILGKNRLLEVANLVE*
JGI24159J19872_10001520183300001689Ecteinascidia TurbinataLKVKVAARDFIPSGIEKYTFWPTNVREYFPEEEVTLSKNRLLEVVSLVE*
JGI24159J19872_10004278143300001689Ecteinascidia TurbinataVVEDLISSSFEDHTFGLTNVREHYPEEELTLGENRLLDLWSGNELS*
JGI24159J19872_1002428423300001689Ecteinascidia TurbinataVVDDLLSFGIEDHTLEPTNVREHFPEEELTLGKNGLLKVASLVE*
JGI24159J19872_1003602853300001689Ecteinascidia TurbinataVKVVENLISFGIEDDTFGPTNVGEHFPEEELTLGKNRLLEVASLVD*
JGI24159J19872_10039279103300001689Ecteinascidia TurbinataMSLGIEDHTFGPTKVREHFPEEEVTLVRNRNRLVEVASLVERMN*
JGI24159J19872_1004509173300001689Ecteinascidia TurbinataMSSRIEDHTFGPINLREHFPEEELTQDKNRLLEVPSLVE**
JGI24159J19872_1005433913300001689Ecteinascidia TurbinataSGIEDHTFGPINLREHFPDEELTLGKNRFLEVASLVE*
JGI24159J19872_1005995913300001689Ecteinascidia TurbinataLISFGIEYHTLGPTKVREHFPEEELILGKNRLLXVAXLVE*
JGI24159J19872_1006233153300001689Ecteinascidia TurbinataVVANLISFGIEDHTFGPINLREHFPEEELTLGKNRFLEVSSLVE*
JGI24159J19872_1009488013300001689Ecteinascidia TurbinataVFLQSPLKVNVADYLISFGIEDHTFGPTKVREHFPEEELTLGKNKLLKVASLVE*
JGI24159J19872_1009612113300001689Ecteinascidia TurbinataVVENLILYGIEDHTFGPMNLREHFPEEKLTLGKNRFLEVASPVE*
JGI24159J19872_1010759233300001689Ecteinascidia TurbinataVVEYLISFGIEDHTFGPINLREHFAEVELTLGKNRFLEVASLVES*
JGI24159J19872_1014053713300001689Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKVREHFPEEELTLGKNRLLEVASLVE*
JGI24159J19872_1019714823300001689Ecteinascidia TurbinataVVEKLISSCIEDHTFGPMNLREHFPDEELNLCKNRFLEVASFVE*
JGI24158J21664_1001474513300001913Ecteinascidia TurbinataLKVNVVEDLILFGIEDHTLGPTKVREHFPEEELTLGKNRLLEVASLVE*
JGI24158J21664_1003757413300001913Ecteinascidia TurbinataEDLISFGIEDHTYGPTKVREHLPAAELTLGANSLLEVTSLVELE*
JGI24158J21664_1004961243300001913Ecteinascidia TurbinataVVEDVISFGIEDHTFEPTKVREHFPREELTLGKNRLLEVASLVD*
JGI24158J21664_1005915113300001913Ecteinascidia TurbinataFFSFHLKVNVVEDLISFGIEDHTLGTIKVREPFPEEELTLGKNRLLEVASLVE*
JGI24158J21664_1007844333300001913Ecteinascidia TurbinataVFFSFHLKVNVVEDLISFGIEHHTLRPIKVREHFPEEELTLGKNRLLEVAILVE*
JGI24158J21664_1014024343300001913Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKVREHFPEEELTLGKNRLLEVASLVE*EE
JGI24158J21664_1014616543300001913Ecteinascidia TurbinataVNVVENLISFGIEDHTFGPTKVREHFPEEKLTXGKXRLLEVASLVX*
JGI24158J21664_1020159923300001913Ecteinascidia TurbinataVVEDLISFGIEDHTLGPTKVREHFPEEEFNSGKNRLLEVASLVE*
JGI24158J21664_1021951433300001913Ecteinascidia TurbinataVVEDLSFRIEDHTFGPTKVWKHFPEEEVTLGKNRLLRVASLVE*
JGI24158J21664_1022886923300001913Ecteinascidia TurbinataLISFGIEYHTLGPTKVREHFPEEELILGKNRLLEVANLVE*
Ga0210059_1000267303300027611Ecteinascidia TurbinataVFLQSPLKVNVADYLISFGIEDHTFGPTKVREHFPEEELTLGKNKLLKVASLVE
Ga0210059_100118533300027611Ecteinascidia TurbinataVVENLISSGIEDDTFGPMNLIEHFPEEELTLGKNRFLEVASLVE
Ga0210059_1001189193300027611Ecteinascidia TurbinataLKANVVEDLISHDIEDHTFGLTKVREHFPEEELTLGKNRLLEVASLVE
Ga0210059_1001708223300027611Ecteinascidia TurbinataVVEDLISFGIEHHTLRPIKVREHFPEEELTLGKNRLLEAASLVE
Ga0210059_1001849113300027611Ecteinascidia TurbinataLKVNVVEDLVLFGIEDHTLGPTKVREHFPEEELTLGKNRLLEVSSLVE
Ga0210059_100228443300027611Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKITENVPEEELTLGKNRLLEVASLVE
Ga0210059_1003539223300027611Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKVREHFAEEELTLGKNRLAEVASLVE
Ga0210059_1004059103300027611Ecteinascidia TurbinataVVENLISSGIEDHTFGPINLRKHFPEEELTLGKNRFLEVASLVE
Ga0210059_100630353300027611Ecteinascidia TurbinataVVEDLISFGIEDHTLGTIKVREPFLEEELTLGKNRLLEVASLVE
Ga0210059_100643113300027611Ecteinascidia TurbinataVVQDLISFGIEDHTFGPTNVGEHFPEEELTLGKNRLLEVASLVE
Ga0210059_1008267103300027611Ecteinascidia TurbinataVKVVEALISFGIEDHTFGPTKVREHFPVEELTLGKNRLLEVSSLVE
Ga0210059_101040923300027611Ecteinascidia TurbinataVVENLISSRIEDHTFGPMNLREHFPEEELTLGKNRFLEVASLVE
Ga0210059_1016361123300027611Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKAREHFPEEELTLGKNRLFEVASLKLS
Ga0210059_101798253300027611Ecteinascidia TurbinataMVEDVISFGIEDHTLGPIKVREHFPEKELTLGKNGSLEVASLME
Ga0210059_101857613300027611Ecteinascidia TurbinataVVENLISSGIEDHTFGPMNLREHFPEEKLTLGKNRFLEVASLVE
Ga0210059_1018783103300027611Ecteinascidia TurbinataVVADLISFGIEDHTLGPIKVREYFPEEELTLGKNRLLEVASLVE
Ga0210059_101952523300027611Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKVREHFPEEELTLGKNRLLDVASLVE
Ga0210059_103180233300027611Ecteinascidia TurbinataLFEALISFGIEDHTLGPTKVREHFPDEELTLGKNRLSEVASLVE
Ga0210059_103405353300027611Ecteinascidia TurbinataVVENLISFGIEDHTFGPMNLREHFPEEELTLGKNRFLEVASLVE
Ga0210059_103588123300027611Ecteinascidia TurbinataVVENLISSGIEDHTFGPINLVPEEELTLGKNRFLEVASLVE
Ga0210059_103956233300027611Ecteinascidia TurbinataVVENLISFGVEDHTFGPMNLREHFPEEELTLGKNRFLEVASLVE
Ga0210059_104430233300027611Ecteinascidia TurbinataVVENLISSGIEHYTFGPINLREHFPEEELTLGKNRFLEAASLVE
Ga0210059_104563613300027611Ecteinascidia TurbinataVVEDLISFGIEDHTLGPTKVRKNFPEEELTLGKNRLLEVASLVE
Ga0210059_105403833300027611Ecteinascidia TurbinataVVENLISFGIEDHTFGPTKVREHFPEEKLTLGKKRLLEVASLVE
Ga0210059_106263443300027611Ecteinascidia TurbinataVVDSLISSGMEDHTFGPINLREHFPEEELTLGKNRLLEVASLVE
Ga0210059_106305413300027611Ecteinascidia TurbinataVVENLILYGIEDHTFGPMNLREHFPEEKLTLGKNRFLEVASPVE
Ga0210059_106579043300027611Ecteinascidia TurbinataISFGIEYHTLGPTKVREHFPEEELILGKNRLLEVANLVE
Ga0210059_107548813300027611Ecteinascidia TurbinataVVENLISSGIEDHTFGPLNLREHFPEEKLTLGKKRFLEVASLVE
Ga0210059_108029623300027611Ecteinascidia TurbinataVVEDLISFGIEDHTLGPTKTREHFPEEELILGKKRLLEVASLVE
Ga0210059_108470123300027611Ecteinascidia TurbinataVVENLISSGIEDHTFGPMNLREPFPEEELTLGKNRFLEVASLVE
Ga0210059_108885813300027611Ecteinascidia TurbinataVVENLISSGIEHHTFGPMNLREHFPEEELTLGKNRLLEVASLVE
Ga0210059_109204243300027611Ecteinascidia TurbinataVVEDLISFGIEDHTFGPTKVREHFPEEELTLGKNRLLEVASLVE
Ga0210059_112347513300027611Ecteinascidia TurbinataVVECLISSGIEDHTFGPINLREHFPEEELTLGKNRFLKVASLVE
Ga0210059_114574613300027611Ecteinascidia TurbinataVVENLISFGIEDHTFGPINLREHFPEEELTLGKNRFLEVASLVE
Ga0210059_117799023300027611Ecteinascidia TurbinataVVENLISSGIEDHTFGPMNLREHFPDEELTLGKNRFLEVASLVE
Ga0210059_119158013300027611Ecteinascidia TurbinataVVEDLISFGIEYHTLGPTKVREHFPEEELILGKNRLLEVASLVECDFFH


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.