NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F096995


Overview

Basic Information
Family ID F096995
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 42 residues
Representative Sequence ALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA
Number of Associated Samples 76
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Archaea
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 85.58 %
Associated GOLD sequencing projects 61
AlphaFold2 3D model prediction No

Hidden Markov Model (logo rendered by Skylign)

Most Common Taxonomy
Group Archaea (83.654 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (35.577 % of family members)
Environment Ontology (ENVO) Unclassified (59.615 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (61.538 % of family members)
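
The percentages reported in the overview can be read back as member counts. The following is a minimal sketch, assuming each percentage is taken over the 104 sequences listed under "Number of Sequences"; the dictionary labels and the helper name `percent_to_count` are illustrative and not part of NMPFamsDB.

```python
# Sketch: convert reported percentages into approximate member counts.
# Assumption: every percentage is a fraction of the 104 family members.
FAMILY_SIZE = 104  # "Number of Sequences" in Basic Information

# Values copied from the overview; the labels are illustrative.
reported_percent = {
    "Archaea (most common taxonomy)": 83.654,
    "Vadose Zone Soil (GOLD)": 35.577,
    "Unclassified (ENVO)": 59.615,
    "Soil (non-saline) (EMPO)": 61.538,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a percentage of `total`."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percent.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption, the Vadose Zone Soil share corresponds to 37 of the 104 family members.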




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (interactive MSAViewer display; its 104 row labels are the Protein IDs listed under Family Sequences).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.50%, β-sheet: 0.00%, Coil/Unstructured: 72.50%
Feature Viewer (tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score; positions 1 to 40 of ALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA)
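
The secondary structure distribution above can be turned into residue counts. This is a small sketch, assuming the percentages were computed over the 40-residue representative sequence (the page does not state this explicitly); the variable names are illustrative.

```python
# Sketch: map the reported secondary-structure percentages onto residue counts.
# Assumption: percentages refer to the representative sequence shown on the page.
REPRESENTATIVE = "ALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA"

# Values copied from "Secondary Structure distribution".
distribution_percent = {
    "alpha_helix": 27.50,
    "beta_sheet": 0.00,
    "coil_unstructured": 72.50,
}

length = len(REPRESENTATIVE)

# Nearest whole number of residues assigned to each state.
residues_per_state = {
    state: round(percent / 100 * length)
    for state, percent in distribution_percent.items()
}

print(f"sequence length: {length}")
for state, n in residues_per_state.items():
    print(f"{state}: {n} residues")
```

With a 40-residue sequence the three states account for 11, 0 and 29 residues, which sum to the full length.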



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (chart rendered by ApexCharts): All Organisms 93.3%, Unclassified 6.7%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (chart rendered by ApexCharts)
Habitat labels: Soil; Vadose Zone Soil; Grasslands Soil; Soil; Grasslands Soil; Corn, Switchgrass And Miscanthus Rhizosphere
Slice values shown: 35.6%, 8.7%, 31.7%, 19.2%, 2.9%



Associated Samples


Geographical Distribution
Map powered by OpenStreetMap




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_101156192 | 3300002558 | Grasslands Soil | ALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA*
JGI25383J37093_100883881 | 3300002560 | Grasslands Soil | GQFDIVALWQAKTSEDIVKNTVERVTTLPGIFKSETLLAYAPFFKA*
JGI25382J37095_100508032 | 3300002562 | Grasslands Soil | LWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA*
JGI25382J43887_102146871 | 3300002908 | Grasslands Soil | PGQFDIVALWQAKTSEEIVKNSVERVTNLPGIFKSETLLAYAPFFKA*
JGI25390J43892_101272672 | 3300002911 | Grasslands Soil | QFDIVALWQARTSEDIVKNSFEKVTNLQGLFSSETLLAHLPFFNA*
JGI25389J43894_10663932 | 3300002916 | Grasslands Soil | VPGQFDIVALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA*
Ga0066683_107765482 | 3300005172 | Soil | DIVALWQARTSEDIVKNSFEKVTNLQGLFSSETLLAHLPFFNA*
Ga0066680_106025512 | 3300005174 | Soil | ALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA*
Ga0066685_108346281 | 3300005180 | Soil | LWQARTSEDIVKTSVERVSQLDGIFKSETLLAYAPFFKA*
Ga0066678_101528323 | 3300005181 | Soil | TVPGQFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA*
Ga0070708_1000982391 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | TVPGQFDIVALWQAKTSEDIVKNSVERVTNLQGIFKSETLLAYAPFFKA*
Ga0066686_106539312 | 3300005446 | Soil | VALWQARTSEDIVKTSVEKVSNLEGIFHSETLLAYAPFFKA*
Ga0066689_108794961 | 3300005447 | Soil | VALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYSPFFKA*
Ga0070707_1000554697 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | STVPGQFDIVALWQAKTSEDIVKNSVERVTNLQGIFKSETLLAYAPFFKA*
Ga0066661_105791071 | 3300005554 | Soil | TSEDIVKNSFEKVTNLQGLFSSETLLAHLPFFNA*
Ga0066661_108875391 | 3300005554 | Soil | WQAKTSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA*
Ga0066704_100601971 | 3300005557 | Soil | ARTSEDIVKTSVERVSHLDGIFKSETLLAYAPFFKA*
Ga0066704_102077473 | 3300005557 | Soil | RTSEDIVKTSVERVSNLEGIFKSETLLAYAPFFKA*
Ga0066698_104048681 | 3300005558 | Soil | VPGQFDIVALWQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFKA*
Ga0066700_105975761 | 3300005559 | Soil | GQFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA*
Ga0066700_109338051 | 3300005559 | Soil | QARTSEDIVKTSVERVSHLDGIFKSETLLAYAPFFKA*
Ga0066703_100782694 | 3300005568 | Soil | RTSEDIVKTSVERVSNLPGIFKSETLLAYAPFFKA*
Ga0066691_106138352 | 3300005586 | Soil | QFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLA*
Ga0066665_100166311 | 3300006796 | Soil | QFDIVALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYSPFFKA*
Ga0066665_101094035 | 3300006796 | Soil | WQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFKA*
Ga0066665_103553343 | 3300006796 | Soil | ALWQAKTSEDIVKNTVERVTTLPGIFKSETLLAYAPFFKA*
Ga0066665_109108712 | 3300006796 | Soil | ALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA*
Ga0066659_101084105 | 3300006797 | Soil | WQARTSEDIVKTSVERVSNLPGIFKSETLLAYAPFFKA*
Ga0066659_103964642 | 3300006797 | Soil | TSEDIMKNSVERITNLPGLFHSETLLAYAPFFKA*
Ga0066659_105962922 | 3300006797 | Soil | TSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA*
Ga0066659_106618641 | 3300006797 | Soil | ALWQARTSEDIVKTSVERVSHLDGIFKSETLLAYAPFFKA*
Ga0099794_100997581 | 3300007265 | Vadose Zone Soil | TVPGQFDIVALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA*
Ga0099794_102935602 | 3300007265 | Vadose Zone Soil | WQARTSEDIVKTSVERVSNLEGIFKSETLLAYAPFFKA*
Ga0066710_1002068515 | 3300009012 | Grasslands Soil | WQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFKA
Ga0066710_1007849111 | 3300009012 | Grasslands Soil | LWQARTSEDIVKTSIERVSNLEGIFKSETLLAYAPFFKA
Ga0066710_1011408251 | 3300009012 | Grasslands Soil | ALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYSPFFKA
Ga0066710_1032647092 | 3300009012 | Grasslands Soil | LWQARTSEDIVKTSVERVSQLDGIFKSETLLAYAPFFKA
Ga0099829_113670751 | 3300009038 | Vadose Zone Soil | WQARTSEDIVKTSVEKVSNLEGIFKSETLLAYAPFFKA*
Ga0099828_101318201 | 3300009089 | Vadose Zone Soil | WQAKTSEDIVKNSVERVTGLQGIFKSETLLAYAPFFKA*
Ga0099827_102155873 | 3300009090 | Vadose Zone Soil | QARTSEDIVKTSVERVSNLEGIFKSETLLAYAPFFKA*
Ga0066709_1000425597 | 3300009137 | Grasslands Soil | ALWQAKTSEEIVKNTVERVTTLPGVFKSETLLAYAPFFKA*
Ga0066709_1010316601 | 3300009137 | Grasslands Soil | ALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYSPFFKA*
Ga0066709_1041938662 | 3300009137 | Grasslands Soil | QAKTSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA*
Ga0127492_10899911 | 3300010087 | Grasslands Soil | TSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA*
Ga0127483_10220361 | 3300010142 | Grasslands Soil | ALWQARTSEEIVKTSVERVTNLPGIFKSETLLAYAPFFKA*
Ga0137391_108492511 | 3300011270 | Vadose Zone Soil | PGQFDIVALWQAKTSEDIVKNSVERVTSLQGIFKSETLLAYAPFFKA*
Ga0137391_110145011 | 3300011270 | Vadose Zone Soil | VPGQFDIVALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYSPFFKA*
Ga0137391_111152011 | 3300011270 | Vadose Zone Soil | PGQFDIVALWQAKTSEDIVKNSVERVTSLQGIFKSETLLAYSPFFKA*
Ga0137393_111818241 | 3300011271 | Vadose Zone Soil | GQFDIVALWQAKTSEDIVKNSAERATNLQGIFKSETLLAYSPFFKA*
Ga0137389_108928001 | 3300012096 | Vadose Zone Soil | VALWQAKTSEDIVKNSVERVTNLQGIFHSETLLAYAPFFKA*
Ga0137364_111406631 | 3300012198 | Vadose Zone Soil | QFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA*
Ga0137364_114392541 | 3300012198 | Vadose Zone Soil | QAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA*
Ga0137383_101357071 | 3300012199 | Vadose Zone Soil | QFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFNA*
Ga0137383_109523061 | 3300012199 | Vadose Zone Soil | QARTSEDIVKNSFEKVTNLQGLFSSETLLAHLPFFNA*
Ga0137383_111508881 | 3300012199 | Vadose Zone Soil | TSEDIVKTSVEKVSNLEGIFHSETLLAYAPFFKA*
Ga0137365_105954991 | 3300012201 | Vadose Zone Soil | ALWQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFKA*
Ga0137399_108474381 | 3300012203 | Vadose Zone Soil | PGQFDIVALWQARTSEDIVKTSVEKVSHLDGIFRSETLLAYAPFFKA*
Ga0137374_111283252 | 3300012204 | Vadose Zone Soil | DIVALWQARTSEEIMKNSVERFTNLQGLFHSETLLAFAPFFKA*
Ga0137380_107554151 | 3300012206 | Vadose Zone Soil | LWQARTSEDIVKTSVEKVSNLEGIFHSETLLAYAPFFKA*
Ga0137379_108946431 | 3300012209 | Vadose Zone Soil | ARTSEDIVKTSVEKVSNLEGIFHSETLLAYAPFFKA*
Ga0137377_103721141 | 3300012211 | Vadose Zone Soil | IVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFNA*
Ga0137377_118940591 | 3300012211 | Vadose Zone Soil | WQARTSEDIVKNSFEKVTNLQGLFSSETLLAHLPFFNA*
Ga0137387_112639941 | 3300012349 | Vadose Zone Soil | GQFDIVALWQARTSEDIMKNSVERITNLPGLFHSETLLAYAPFFKA*
Ga0137386_106001962 | 3300012351 | Vadose Zone Soil | WQAKTSEDIVKNSVERVTTLPGIFKSETLLAYAPFFKA*
Ga0137385_100518188 | 3300012359 | Vadose Zone Soil | WQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFNA*
Ga0137360_104869414 | 3300012361 | Vadose Zone Soil | QARTSEDIVKTSVERVSHLDGIFKSETILAYTPVFKA*
Ga0137360_109391991 | 3300012361 | Vadose Zone Soil | FDIVALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYSPFFKA*
Ga0137390_104044521 | 3300012363 | Vadose Zone Soil | VPGQFDIVALWQAKTSEDIVKNSVERVTNLQGIFKSETLLAYSPFFKA*
Ga0134058_12095841 | 3300012379 | Grasslands Soil | GQFDIVALWQARTSEDIVKQSVEKVSNLEGIFHGETLLAYAPFFKA*
Ga0134050_12399031 | 3300012407 | Grasslands Soil | TTVPGQFDIVALWQAKTSEEIVKNTVERVTTLPGVFKSETLLAYAPFFKA*
Ga0137396_101248344 | 3300012918 | Vadose Zone Soil | DIVALWQAKTSEDIVKNSVERVTNLQGIFHSETLLAYAPFFKA*
Ga0137396_103902791 | 3300012918 | Vadose Zone Soil | QFDIVALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA*
Ga0137416_114447651 | 3300012927 | Vadose Zone Soil | ARTSEDIVKTSVERVSNLEGIFKSETLLAYAPFFKA*
Ga0137416_115562911 | 3300012927 | Vadose Zone Soil | FDIVALWQAKTSEDIVKNSVERVTNLQGIFKSETLLAYAPFFKA*
Ga0134076_103308502 | 3300012976 | Grasslands Soil | WQAKTSEDIVKNSLERVTNLPGIFKSETLLAYAPFFKA*
Ga0134087_103142281 | 3300012977 | Grasslands Soil | QARTSEDIVKTSVERVSQLDGIFKSETLLAYAPFFKA*
Ga0134087_105196622 | 3300012977 | Grasslands Soil | STVPGQFDIVALWQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFKA*
Ga0134075_100689875 | 3300014154 | Grasslands Soil | GQFDIVALWQARTSEDIVKTSVERVSQLDGIFKSETLLAYAPFFKA*
Ga0134089_101561122 | 3300015358 | Grasslands Soil | RTSEDIVKTSVEKVTNLEGIFKSETLLAYAPFFKA*
Ga0066667_100299164 | 3300018433 | Grasslands Soil | WQAKTSEEIVKTSVERVSHLDGIFKSETVLAYTPVFKA
Ga0066667_122563051 | 3300018433 | Grasslands Soil | VALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA
Ga0066662_113664081 | 3300018468 | Grasslands Soil | QARTSEDIVKTSVERVSHLDGIFKSETLLAYAPFFKA
Ga0215015_100747333 | 3300021046 | Soil | AKTSEDIVKNSVERVTNLQGISKSETLLAYAPFFKA
Ga0207646_106332632 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | PGQFDIVALWQAKTSEDIVKNSVERVTNLQGIFKSETLLAYAPFFKA
Ga0209235_11403271 | 3300026296 | Grasslands Soil | ALWQARTSEDIVKTSVERVSNLEGIFKSETLLAYAPFFKA
Ga0209236_11916312 | 3300026298 | Grasslands Soil | VALWQARTSEDIVKTSIERVSNLQGIFKSETLLAYAPFFKA
Ga0209055_10082595 | 3300026309 | Soil | LWQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFKA
Ga0209055_10356535 | 3300026309 | Soil | VALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA
Ga0209239_11872691 | 3300026310 | Grasslands Soil | IVALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA
Ga0209761_11128592 | 3300026313 | Grasslands Soil | VALWQARTSEDIVKNSFEKVTNLQGLFSSETLLAHLPFFNA
Ga0209154_10531464 | 3300026317 | Soil | VALWQAKTSEEIVKNSVERVTTLPGIFKSETLLAYAPFFKA
Ga0209152_100083925 | 3300026325 | Soil | TVPGQFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA
Ga0209801_11051152 | 3300026326 | Soil | ARTSEDIVKNSFERVTNLQGLFSSETLLAHLPFFNA
Ga0209803_10248981 | 3300026332 | Soil | QFDIVALWQAKTSEEIVKNTVERVTTLPGIFKSETLLAYAPFFKA
Ga0209806_10583031 | 3300026529 | Soil | GQFDIVALWQARTSEDIVKNSFEKVTNLQGLFSSETLLAHLPFFNA
Ga0209160_11585602 | 3300026532 | Soil | RTSEDIVKTSVERVSNLEGIFKSETLLAYAPFFKA
Ga0209376_13729101 | 3300026540 | Soil | VPGQFDIVALWQAKTSEEIVKTSVERVSHLDGIFKSETILAYTPVFKA
Ga0209076_10900661 | 3300027643 | Vadose Zone Soil | PGQFDIVALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA
Ga0209689_13570602 | 3300027748 | Soil | VPGQFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLAYAPFFKA
Ga0209180_101040521 | 3300027846 | Vadose Zone Soil | VALWQARTSEDIVKTSVERVSNLPGIFKSETLLAYAPFFKA
Ga0209590_104545872 | 3300027882 | Vadose Zone Soil | VPGQFDIVALWQARTSEDIVKTSVERVSNLPGIFKSETLLAYAPFFKA
Ga0137415_101208136 | 3300028536 | Vadose Zone Soil | IVALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA
Ga0137415_103497331 | 3300028536 | Vadose Zone Soil | IVALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYSPFFKA
Ga0307504_103538351 | 3300028792 | Soil | KTSEDIVKNSVERVTSLQGIFKSETLLAYSPFFKA
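
The rows above can be exported to FASTA for downstream tools. The sketch below uses three rows from the table as sample data and makes two assumptions that the page does not confirm: that each row splits into protein ID, 10-digit sample taxon ID, habitat and sequence, and that a trailing "*" marks a stop codon and is not a residue. The `FamilyMember` type, the `to_fasta` helper and the `taxon=`/`habitat=` header fields are hypothetical names chosen for illustration.

```python
from typing import Iterable, NamedTuple


class FamilyMember(NamedTuple):
    """One row of the Family Sequences table (field split is an assumption)."""
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


def to_fasta(members: Iterable[FamilyMember]) -> str:
    """Format members as FASTA records, dropping a trailing '*' if present."""
    records = []
    for m in members:
        residues = m.sequence.rstrip("*")  # assumed stop-codon marker
        header = f">{m.protein_id} taxon={m.taxon_id} habitat={m.habitat}"
        records.append(f"{header}\n{residues}")
    return "\n".join(records) + "\n"


# Three sample rows copied from the table above.
members = [
    FamilyMember("Ga0066680_106025512", "3300005174", "Soil",
                 "ALWQAKTSEDIVKNSVERVTNLPGIFKSETLLAYAPFFKA*"),
    FamilyMember("Ga0066691_106138352", "3300005586", "Soil",
                 "QFDIVALWQAKTSEEIVKNSVERVTTLPGVFKSETLLA*"),
    FamilyMember("Ga0307504_103538351", "3300028792", "Soil",
                 "KTSEDIVKNSVERVTSLQGIFKSETLLAYSPFFKA"),
]

fasta = to_fasta(members)
print(fasta, end="")
```

Keeping the habitat and taxon ID in the header line is a design choice that preserves the sample context when the sequences leave the table.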




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice