NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F033108

Metagenome / Metatranscriptome Family F033108

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F033108
Family Type Metagenome / Metatranscriptome
Number of Sequences 178
Average Sequence Length 67 residues
Representative Sequence MSDPHPFDDPVPRGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ
Number of Associated Samples 148
Number of Associated Scaffolds 178

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 38.76 %
% of genes near scaffold ends (potentially truncated) 16.85 %
% of genes from short scaffolds (< 2000 bps) 74.16 %
Associated GOLD sequencing projects 136
AlphaFold2 3D model prediction Yes
3D model pTM-score0.66

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (77.528 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(7.303 % of family members)
Environment Ontology (ENVO) Unclassified
(26.404 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(33.708 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.
1ICChiseqgaiiDRAFT_07978892
2ICChiseqgaiiFebDRAFT_113617052
3fpDRAFT_10077668
4JGIcombinedJ13530_1059522801
5GOS2236_10311112
6metazooDRAFT_12371567
7B570J29032_1089815081
8metazooDRAFT_13602142
9metazooDRAFT_109055964
10B570J40625_1006093782
11Ga0063356_1019148292
12Ga0063356_1020937363
13Ga0063356_1039131232
14Ga0062383_104676872
15Ga0062380_104316722
16Ga0066388_1044323082
17Ga0070739_1000168929
18Ga0070761_100554882
19Ga0068861_1010426342
20Ga0066903_1010952732
21Ga0075156_102554872
22Ga0075012_101779313
23Ga0075023_1000372363
24Ga0075024_1001183584
25Ga0075417_100764753
26Ga0075364_107924841
27Ga0075029_1000998803
28Ga0075026_1010021652
29Ga0075432_105025612
30Ga0075427_100238951
31Ga0075422_103428361
32Ga0074055_102905372
33Ga0074060_111466992
34Ga0066658_100220555
35Ga0075430_1008344652
36Ga0075431_1009774751
37Ga0075431_1020948242
38Ga0075420_1007625872
39Ga0073934_108571651
40Ga0079218_113863271
41Ga0102532_11286193
42Ga0103959_10501915
43Ga0105044_101892911
44Ga0105044_102379485
45Ga0105048_100196383
46Ga0099829_105735902
47Ga0105047_102279421
48Ga0105047_103145913
49Ga0105047_105520521
50Ga0099828_105485112
51Ga0099828_105769941
52Ga0114980_102808682
53Ga0111538_109501792
54Ga0114977_105103351
55Ga0116854_10415041
56Ga0073899_100780952
57Ga0073899_102663072
58Ga0073899_107011322
59Ga0123355_121746262
60Ga0131092_100269979
61Ga0131092_101333753
62Ga0131077_100488576
63Ga0131077_104436703
64Ga0126309_103911133
65Ga0123356_101821263
66Ga0133939_11152022
67Ga0116245_103987743
68Ga0116243_107552722
69Ga0116253_103263622
70Ga0116244_100796984
71Ga0116236_110145392
72Ga0126381_1012327244
73Ga0133913_102869966
74Ga0133913_118335423
75Ga0120192_101538542
76Ga0136627_11092672
77Ga0136624_11102792
78Ga0137372_112251122
79Ga0137369_109431121
80Ga0138256_105805292
81Ga0168317_10153132
82Ga0168317_10261022
83Ga0164308_115346463
84Ga0170681_10545471
85Ga0172367_100866664
86Ga0172367_101104051
87Ga0172367_106282512
88Ga0172373_100094643
89Ga0157378_111062823
90Ga0181531_110705241
91Ga0182012_103088431
92Ga0172376_102141064
93Ga0182030_113164742
94Ga0137403_100490931
95Ga0163144_101223595
96Ga0163144_104985673
97Ga0182747_100066023
98Ga0187818_104295482
99Ga0187806_10539263
100Ga0187808_106137892
101Ga0187785_107901322
102Ga0187779_101057222
103Ga0187783_112349522
104Ga0187781_101015712
105Ga0187781_102695162
106Ga0187782_100984101
107Ga0187815_101549272
108Ga0187784_100582174
109Ga0187784_104046482
110Ga0187784_108598861
111Ga0187772_113541631
112Ga0190265_105683132
113Ga0190265_113729102
114Ga0190275_100843243
115Ga0190269_114286002
116Ga0190270_128149072
117Ga0190267_107081942
118Ga0193726_12478492
119Ga0210401_100191587
120Ga0196959_100119512
121Ga0207933_10442543
122Ga0209203_10670662
123Ga0209152_100102774
124Ga0255188_10810813
125Ga0207826_11925602
126Ga0247833_10399452
127Ga0209810_100140628
128Ga0209277_101832492
129Ga0209476_100644602
130Ga0209180_103206382
131Ga0209591_102634661
132Ga0209274_103718502
133Ga0209023_100153688
134Ga0209181_106652581
135Ga0209481_101624262
136Ga0209382_102286534
137Ga0209382_120093882
138Ga0209583_100380933
139Ga0209069_100346093
140Ga0209069_101895063
141Ga0255341_10190015
142Ga0247822_119538251
143Ga0255346_12588031
144Ga0302154_105340712
145Ga0239302_1140712
146Ga0311328_108846123
147Ga0311332_111805572
148Ga0299907_100705553
149Ga0311349_112457152
150Ga0311372_115238743
151Ga0268386_102511212
152Ga0299914_101179263
153Ga0299914_111724822
154Ga0302324_1011465673
155Ga0272422_10690773
156Ga0307408_1001997764
157Ga0307374_1000770812
158Ga0307374_100629344
159Ga0307374_100672554
160Ga0307374_100722287
161Ga0307372_102922642
162Ga0307372_103636801
163Ga0307477_10000029234
164Ga0315900_101102323
165Ga0315900_104747802
166Ga0307478_114501192
167Ga0307479_119686401
168Ga0326597_100671952
169Ga0315905_110971121
170Ga0335397_100972272
171Ga0335397_104587101
172Ga0335397_105141392
173Ga0335074_108362542
174Ga0335000_0024998_1001_1207
175Ga0335010_0216876_963_1154
176Ga0335014_0034268_2432_2638
177Ga0335051_0336056_370_588
178Ga0370495_0071797_413_613
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 40.63%    β-sheet: 0.00%    Coil/Unstructured: 59.38%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060MSDPHPFDDPVPRGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.66
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
77.5%22.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater
Lake
Freshwater
Freshwater And Sediment
Freshwater
Freshwater Lake
Freshwater Lake
Freshwater Microbial Mat
Bog
Wetland Sediment
Freshwater Sediment
Watersheds
Freshwater
Freshwater
Polar Desert Sand
Freshwater
Marine
Wetland
Hot Spring Sediment
Watersheds
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Serpentine Soil
Terrestrial
Surface Soil
Agricultural Soil
Arctic Peat Soil
Soil
Soil
Hardwood Forest Soil
Soil
Soil
Soil
Untreated Peat Soil
Tropical Peatland
Bog
Tropical Forest Soil
Soil
Soil
Soil
Fen
Palsa
Bog
Rock
Rock
Weathered Mine Tailings
Termite Gut
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Populus Endosphere
Populus Rhizosphere
Rhizosphere
Miscanthus Rhizosphere
Activated Sludge
Activated Sludge
Wastewater
Activated Sludge
Active Sludge
Anaerobic Digestor Sludge
Industrial Wastewater
Wastewater Effluent
Activated Sludge
Wastewater
5.6%6.2%3.9%7.3%3.9%5.6%3.4%6.7%3.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
ICChiseqgaiiDRAFT_079788923300000033SoilMPIEPFDDDPVPPGILLDLGEALRVLEALEDALNAMEGAGLAPGLQDEIATVIRVLHVRLGFDEGGLR*
ICChiseqgaiiFebDRAFT_1136170523300000363SoilMPTEPFDDDSVPPGILLDLGEALRLLEALEDALAAIEHARLASGSQDEIATVIRVLHVRLGFDEGGLR*
fpDRAFT_100776683300000730Activated SludgeMTERDPFDDPVPRRVTLSLDDAFRVLEAIEDARLELKDRGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
JGIcombinedJ13530_10595228013300001213WetlandMSDRNPFHDPVPRSVELSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLSFDEGGVQ*
GOS2236_103111123300001968MarineMLPWCHISNVKDSSGHDDPVPLVIRLSLEQTFRVLEAIEDARLVLREHNLALGLQDELATVIRMIHDKLGLDEGGTP*
metazooDRAFT_123715673300002197LakeMSDADDPVPENIVLSLDDIFRILEAIEDARLELRERQAAPGLQDELATVIRILHSRLGLDEGGAQ*
B570J29032_10898150813300002408FreshwaterMSDRDPFDDRVPRGVELSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
metazooDRAFT_136021423300002470LakeMSDADDPVPENIVLSLDDIFRILEAIEEARLELRERQAAPGLQDELATVIRILHSRLGLDEGGAQ*
metazooDRAFT_1090559643300002476LakeNRKMSDADDPVPENIVLSLDDIFRILEAIEDARLELRERQAAPGLQDELATVIRILHSRLGLDEGGAQ*
B570J40625_10060937823300002835FreshwaterMSDRNPSDDPVPLGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0063356_10191482923300004463Arabidopsis Thaliana RhizosphereMTDLHPFDDPVPRDVVLSSDEAFRVLEAIEDARLALREHGAAPGLQDELATVIRMLHGKLGLDEGGVQ*
Ga0063356_10209373633300004463Arabidopsis Thaliana RhizosphereVSTNDRTDDPVPRSVLLELDEAFRVLEALEDSRLALREHGAAPGLQDELATVIRVLHGKLGLDEGGVL*
Ga0063356_10391312323300004463Arabidopsis Thaliana RhizosphereVSTDDPTDDPVPRAVVLDLDEAFRVLEVLEDSRLALRERGAAPGRQDELATVIRVLHGKLGLDEGGAL*
Ga0062383_1046768723300004778Wetland SedimentMSDRHPFDDPVPRDVVLSLDEAFRVLEAIEDARLALREHGAAPGLQDELATVIRMLHGRLGLDEGGVQ*
Ga0062380_1043167223300004779Wetland SedimentMPTDRRPDDDPVPLGIVLDLEEALRVLEALEDARLALRDLDAAPGLRDELATMVNLLPGRLGFDPGGLR*
Ga0066388_10443230823300005332Tropical Forest SoilVSSSGSDDDPVPPGLLLDLDEASRVLEALEDARLELRDARTAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0070739_10001689293300005532Surface SoilVTAMSQGDDDRVPPGLVLNLDEAFRVLEALEDALLALEEAGLAPGLQDELATVIRLIHGSLGLDEGGLR*
Ga0070761_1005548823300005591SoilMSQPSNNLLGVSEDDGDPVPPELVLTLDEAFRVLEAMEDALLSIEEAGVAPGLQDELATVIHFVHGRPGLDEGGVL*
Ga0068861_10104263423300005719Switchgrass RhizosphereVHTQRVSTHEPNDDPVPRTLLLDLDEGFRVLEALEDARLALRERDAAPGLQDELATVIRVLHGKLGLDEGGVL*
Ga0066903_10109527323300005764Tropical Forest SoilVSRPDDDPVPPQLVLSLDEAFRVLEALEDALSALEAAGAAPGLRDELATVIRLIHGRLGLDEGGVR*
Ga0075156_1025548723300005982Wastewater EffluentMAADGEPDAIPPGLVLGLDEALRVLAALEGALWELQLAQRALGLRDELATVIRRLHGTLGLDPGGLP*
Ga0075012_1017793133300006033WatershedsVSSTDPFDDPVPHTVVLSLDDAFRVLEAIEDARLELRDQGAAPGLQDELATVIRMLHGRLGLDEGGAL*
Ga0075023_10003723633300006041WatershedsMQPADDDPVPPGMLLDLDEAFRVLEALEDARLALREATAGLGLQDELATVIRMLHVRLGFDEGGVR*
Ga0075024_10011835843300006047WatershedsVNPLDSFSDPVPSGVVLSLDEAFRVLEALEDARLALRERGAAPGLQDELATVIRMLHGKLGLDEGGVQ*
Ga0075417_1007647533300006049Populus RhizosphereVSPSGTDDDPVPPGLVLDLDEAFRVLEALEDARFALREISAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0075364_1079248413300006051Populus EndosphereTLSVSSNDPSDGPVPRTVVLSLDDAFRVLEAIEDARLAPRECGAAPGLQDELATVIRLLHARLGLDEGGVL*
Ga0075029_10009988033300006052WatershedsMNRGDDPVPPGLVLSLDEAFRLLEALEDARLAMRQVDLAPGLQDELATVIRLLHGRLGFDEGGVR*
Ga0075026_10100216523300006057WatershedsMDPTDPFDDPVPPGLLLDLDEAFRVLEALEDSRLALRDLSAARGLQDELATVIRILHGRLGFDEGGVR*
Ga0075432_1050256123300006058Populus RhizosphereGRHRGVSPSGTDDDPVPPGLVLDLDEAFRVLEALEDARFALREISAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0075427_1002389513300006194Populus RhizosphereRWTGLSGLLSPPGQRWMAGRHRGVSPSGTDDDPVPPGLVLDLDEAFRVLEALEDARFALREISAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0075422_1034283613300006196Populus RhizosphereTDDDPVPPGLVLDLDEAFRVLEALEDARFALREISAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0074055_1029053723300006573SoilMSGHGSDPAPTSIVLTLDETFRVLEALEDARLELRDARAALGLQDELATVIRILHGRLGLDEGGVR*
Ga0074060_1114669923300006604SoilVPTSIVLTLDETFRVLEALEDARLELRDARAALGLQDELATVIRILHGRLGLDEGGVR*
Ga0066658_1002205553300006794SoilMSRREDDPVPPAIVLDLDEAFRVLEALEDARLAMREADLAPGLQDELATVIRLLHGRLGLDEGGVL*
Ga0075430_10083446523300006846Populus RhizosphereVSPSGTDDDPVPPGLVLDLDETFRVLEALEDARFALREISAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0075431_10097747513300006847Populus RhizosphereVSPSGSDDDPVPPGLVLDLDEAFRVLEALEDARLALREIGAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0075431_10209482423300006847Populus RhizosphereVSPRGTDDDPVPPGLVLDLDEAFRVPAGPGGRPIRAQGDRAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0075420_10076258723300006853Populus RhizosphereVSPRGTDDDPVPPGLVLDLDEAFRVLEALEDARFALREISAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0073934_1085716513300006865Hot Spring SedimentMGRGDHFGDPVPPGLVLGLDEAFRVLEALEDARLALLERGAAPGLQDELATVIRLLHGRLGFDEGGVP*
Ga0079218_1138632713300007004Agricultural SoilYGPAMSDEPPDPVPLGMVLSLDEAFRVLEGLEEALHALEVARVAPGLRDELATVIRVLHDHLGLDEVGFA*
Ga0102532_112861933300007094Freshwater LakeMHAGPPSGVDSFDDPVPRGVGLSLGETLRVFEVIGDARSVLLEPRTTSGPQDELATVTRMLHGKLWLDEVGVR*
Ga0103959_105019153300007214Freshwater LakeMSDSNPRFDPVPKGVVLTLDEAFRVLEAFEDARLELSERGAAPGLQDELATVIRMLHGKLGLNEGGVQ*
Ga0105044_1018929113300007521FreshwaterMSDRNPFDDPVPRTVALSLDEAFRVLEAIEDARLALRERGAVPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0105044_1023794853300007521FreshwaterLIGPSDDPVPRTVVLSLDDAFRMLEATEDARLALRERGAAPGLQDELATVIRLLHGRLGLDEGGVQ*
Ga0105048_1001963833300009032FreshwaterMSDRNPFDDPVPRTVALSLDEAFRVLEAIEDARLALRERGAFPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0099829_1057359023300009038Vadose Zone SoilMSRSADDPVPPGIVLSLDEAFRVLEALEDALLIVAEAALAPGLQDELATVIRLLHSRLGLDEGGVP*
Ga0105047_1022794213300009083FreshwaterMSDRNPFDDPVPRNVALSLDEAFRVLEAIEDARLAIRERGAVPGLQDELATVIRMLHGKLGF
Ga0105047_1031459133300009083FreshwaterMSDRNPFDDPVPRTVALSLDEAFRVLEAIEDARLALRERGAFPGLQDELATAIRMLHGKLGFDEGGVQ*
Ga0105047_1055205213300009083FreshwaterMSDRNPFDDPVPRNVALSLDEAFRVLEAIEDARLALREHGAVPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0099828_1054851123300009089Vadose Zone SoilMSSFTDDPVPPGIVLDLDEAFRVLEALEDALLTVAEAALAPGLRDELATVIRLLHGRLGLDEGGVL*
Ga0099828_1057699413300009089Vadose Zone SoilMSRSADDPVPPGIVLSLDEAFRVLEALEDALLIVAEAALAPGLQDELATVIRLLHGRLGLDEGGVP*
Ga0114980_1028086823300009152Freshwater LakeMSDHDPFGDPVPRSVVLLLDETFRVVAAIEDARLALRDRGAAPGLQDELATVIRMLHGTLGLDEGGVE*
Ga0111538_1095017923300009156Populus RhizosphereMQTLRVSPHDQTDDPVPRTVVLDLDESFRVLEALEDARLALRERGAAPGLLDELATVIRVLHGKLGLDEGGAL*
Ga0114977_1051033513300009158Freshwater LakeRHARGMSDCDPFDDPVPRGVELSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0116854_104150413300009400SoilASRVSQPGDDPVPPGLVLTLDEAFRVLEALGDARLAMRETGLAPVLQDELATVIRLVHGRVGLEEGGLS*
Ga0073899_1007809523300009540Activated SludgeMSDGDPFDDPVPRRVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0073899_1026630723300009540Activated SludgeVSSTDPFDDPVPGTVALSLDDAFRVLEAIEDARLELRDRGAAPGLQDELATVIRMLHGRLGLDEGGAL*
Ga0073899_1070113223300009540Activated SludgeMSWQNPLDDPVPSGVVLSLDEAFRVLESIEDDRLALRERGATPGLQDELATVIRMVHGRLGFDEGGVQ*
Ga0123355_1217462623300009826Termite GutVNRSDDPVPRALDLELEAAFRVLEALEDARFEIREAALAPGLQDELATVIRLLHGRLGLDEGGLR*
Ga0131092_1002699793300009870Activated SludgeLHAHGMSDANPFDDPVPRGVALSLDEAFRVLEGIEDARLALRERGAAPGLQDELATVIRMLHGKLGLDEGGAQ*
Ga0131092_1013337533300009870Activated SludgeMSHGNPFDDPVPQGVALSLDEAFRVLEAIEDARLAMRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0131077_1004885763300009873WastewaterMAPSSGADHDDGDPVPLGLVLDLDESFRVLEGLEDALVALIELNAGLGLQDELATVIRLLHRRLGLDEGGVT*
Ga0131077_1044367033300009873WastewaterMSDLDPFDDPVPLDVALTLDEAFRVLEAIEDARLALRDRGAAPGLQDELATVIRMLHGKLGLDEGG
Ga0126309_1039111333300010039Serpentine SoilEPTDDPVPRSVVLDLDEAFRVLEALEDARLALRERGAAPGLHDELATVIRVLHGKLGLDEGGVL*
Ga0123356_1018212633300010049Termite GutVNRSDDPVPRALDLELEAAFRVLEALEDARFEIRGAALAPGLQDELATVIRLLHSRLGLDEGGLR*
Ga0133939_111520223300010051Industrial WastewaterMAPADDRSPDDPVPLGLVLGLDEAFRVLEALEDARFALRELATEPGLQDELVTVIRMVHRRLGLDEGGVS*
Ga0116245_1039877433300010338Anaerobic Digestor SludgeMSDGDPFHDPVPRGVALSLDDAFRVLEAIEDARLALRDRGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0116243_1075527223300010344Anaerobic Digestor SludgeMSDRTPFDDPVPRSVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0116253_1032636223300010345Anaerobic Digestor SludgeMAPSDADDDLVPLGLVLGLDEAFRVLEALEDAIYELERTSMAPGLRDEIATVIRLLHGRLGFDQGGL*
Ga0116244_1007969843300010350Anaerobic Digestor SludgeVSDLDPFDDPVPRGVALSLEEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGLDEGGVQ*
Ga0116236_1101453923300010353Anaerobic Digestor SludgeVPPADDRSPDDPVPLGLVLDLDEAFRVLEALEDARFTLRERAAEPGLQDELVTVIRMVHRRLGLEEGGVS*
Ga0126381_10123272443300010376Tropical Forest SoilMSRGDDDPVPPGFVLNLDEALRVLEALEDARLAMQQARVAPGLQDELATVIRMIHGRFGVEQGGVP*
Ga0133913_1028699663300010885Freshwater LakeLNPFDDPVPRRVALSLDEAFRVLEATEDARLVLRERGAAPGLKDELATVIRLLHGTLGLEEGGVQ*
Ga0133913_1183354233300010885Freshwater LakeMSDCDPFDDPVPRGVELSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0120192_1015385423300012021TerrestrialMTTRCRPGLVLGLDEAFRVLEALEDSRLELREAGAAPGLQDELATVIRILHSRLGFDEGGVP*
Ga0136627_110926723300012042Polar Desert SandMAPDHDDPVPPGIVLDLDEALRVLEALEEALNELERSAMGPGLRDELATVIRIVHHRLGLDQGGTP*
Ga0136624_111027923300012183Polar Desert SandMAPDHDNPVPPGIVLDLDEALRVLEALEEALNELERSAMGPGLRDELATVIRIVHHRLGLDQGGTP*
Ga0137372_1122511223300012350Vadose Zone SoilVKDDDDDPVPLGLVLNLDEAFRVLEALEDARLELRARDVAPGFQDELVTVIRILHEALGFYEGGAS*
Ga0137369_1094311213300012355Vadose Zone SoilRSPGGDDPVPPGLVLDLDEAFRVLEALEDARLALREVGAAPGLQDELATVIRLLHGRLGFDEGGVV*
Ga0138256_1058052923300012533Active SludgeMNDRNPFDDPVPRNLTLSLDEAFRVLESLEDARLALRERGAAPGLQDELATVIRLLHGKLGFDEGSVR*
Ga0168317_101531323300012982Weathered Mine TailingsMSRHGNDPVPSGIWLDLDEAFRVLEALEDSLLALEEAGLAPGLRDELATVIRLVHDRLGLDEGGLK*
Ga0168317_102610223300012982Weathered Mine TailingsVSRTDDDPVPPGIWLDLDEALRTLEALEDARLALRNAGVSPGLQDELATVIRQLHGRLGFEEGGLL*
Ga0164308_1153464633300012985SoilMTPPDDDDPVPLGLVLSLDESFRVLEALEDSRLALRSAQSAPGLQDELATVIRIIHGRLGFDERGLS*
Ga0170681_105454713300013026RockVNSGGSDDDPIPAGLILDLEEAFRVLEALEDARLALREAGTALGLQDELATVIRLLHGRLGFDEGGAV*
(restricted) Ga0172367_1008666643300013126FreshwaterMSDPRPFDDPVPRGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
(restricted) Ga0172367_1011040513300013126FreshwaterMSDPHPFDDPVPRGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKL
(restricted) Ga0172367_1062825123300013126FreshwaterMSDPRPVDDPVPRSVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
(restricted) Ga0172373_1000946433300013131FreshwaterMSDPRPFDDPVPRSVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0157378_1110628233300013297Miscanthus RhizosphereVSPSDPTDDPVPPGLVLDLDEAFRVLEALEDARLALRAVGAAPGLQDELATVIRLLHGRLGFDEGGVP*
Ga0181531_1107052413300014169BogVTQNEDPVPPGLLLTLEEAFRVLEALEDGLLAMEEADVALGLRDELATVIRLLHIRLGLDEGG
Ga0182012_1030884313300014499BogRPDDDPVPSGLRMSLDEAFRVLEALEDARFALRESGLAPGLQDELATVIRIIHGKLGIEEGGVQ*
(restricted) Ga0172376_1021410643300014720FreshwaterMSDPHPFDDPVPRGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ*
Ga0182030_1131647423300014838BogVSRPDDDPVPPGLLLSLEEAFRVPEALEDARLAIRESGLAPGLQDELATVIRIIHGTLGFEEGGVL*
Ga0137403_1004909313300015264Vadose Zone SoilMLPEGITMSEFDDPVPLGLVLDLDEAFRVLAALEDARLELGKVGAAPGLCDELATLIRLLHGRLGLDEGGVQ*
Ga0163144_1012235953300015360Freshwater Microbial MatVSPTDPFDDPVPRTVLLALDDAFRILEAIEDARLELREREAAPGLQDELATVIRMLHGRLGLAEGGAL*
Ga0163144_1049856733300015360Freshwater Microbial MatMSDRNPFDDPVPRTVALSLDEAFRVLEAIEDARLALRERGAFPGLQDELATVICILHGKLVSDEGGVQ*
Ga0182747_1000660233300017560SoilVSEFDDPVPPAILLDLDEVFRVLEALEGSLVVILDHGLAPGLLDELRTVIAMLHHRLGFDQGGWNV
Ga0187818_1042954823300017823Freshwater SedimentVSQNDDPVPPGLLLSLEEAFRVLEALDALLAMEEADVALGLRDELATVIRLLHVRLGLDEGGVG
Ga0187806_105392633300017928Freshwater SedimentVSQNDDPVPPGLLLSLEEAFRVLEALDALLAMEEADVALGLRDELATVMRLLHVRLGLDEGGVG
Ga0187808_1061378923300017942Freshwater SedimentVSQNDDPVPPGLLLSLEEAFRVLEALDALLAMEEADVALGLRGELATVMRLLHVRLGLDEGGVGWKEA
Ga0187785_1079013223300017947Tropical PeatlandVSSHDQDPVPSSVVLPLQEAFRALEALEDARLALRESGVALGLQDELATVIRIVHGRLGLDEGGVL
Ga0187779_1010572223300017959Tropical PeatlandLLATPIWQSVIVSQNDDPVPPGLRLDLEEAFRVLEALEDALLAMEEAGLTPGLQDELATVIRVLHARLGLGEGGTQ
Ga0187783_1123495223300017970Tropical PeatlandMSRSQDDPVPSGIVLNLDEAFRVLEALEDARLVMRELTLALGLQDELATVIRMLHGKLGFEEGGLS
Ga0187781_1010157123300017972Tropical PeatlandMNQGGDPVPPHLVLSLDEAFRVLEAVEDARLAMRDVGLAPGPQDELATVIGLLHSRLGFDEGGVR
Ga0187781_1026951623300017972Tropical PeatlandMSRSQDDPVPSGIVLNLDEAFRVLEALEDARLVMREIGLALGLQDELATVIRMLHGKLGFEEGGLS
Ga0187782_1009841013300017975Tropical PeatlandMSRSQDDPVPSGIVLNLDEAFRVLEALEDARLVMREIGLAPGLQDELATVIRMLHGKLGFEEAGLS
Ga0187815_1015492723300018001Freshwater SedimentVSQNDDPVPPGLLLSLEEAFRVLEALDALLAMEEADVALGLRGELATVMRLLHVRLGLDEGGVG
Ga0187784_1005821743300018062Tropical PeatlandMNQGGDPVPPNLVLSLDEAFRVLEAVEDARLAMRDVGLAPGLQDELATVIGLLHSRLGFDEGGVR
Ga0187784_1040464823300018062Tropical PeatlandMSQNDDPVAPGLLLRLDEAFRVLEALEDGLLALEEADVAPGLRDEQATVIRLVDVRLGLDEGGIR
Ga0187784_1085988613300018062Tropical PeatlandMSRSQDDPVPSGIVLNLDEAFRVLEALEDARLVMREIGLAPGLQDELATVIRMLHGKLGFEE
Ga0187772_1135416313300018085Tropical PeatlandMSRSQDDPVPSGIVLNLDEAFRVLEALEDARLVMREIGLAPGLQDELATVIRMLHGKLGFEEGGLS
Ga0190265_1056831323300018422SoilMPEEPHDPVPLRVVLSLDEAFRVLEGLEDALLELELAGAAPGLRDELATVIRVLHVHLGFSEGGGAP
Ga0190265_1137291023300018422SoilMSSYDAVGDPVPRSVVLDLDEAFRVLEALEDSRLALRERGAAPGLQDELATVIRVLHGKLGLDEGGVL
Ga0190275_1008432433300018432SoilVTPGKPDGDDDPVPLGLVLDLDEAFRVLEALEDALATLDEARLALGCQDELATVIRMVHGRLGLDEGGA
Ga0190269_1142860023300018465SoilMSERSPFDDPVPRSVALSLDDAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGAQ
Ga0190270_1281490723300018469SoilVSGGSDDPVPAQVVLELDEAFRVLEALEDARLALREGGVALGLQDELATVVRILHGRLGLDEGGVP
Ga0190267_1070819423300019767SoilVSTHEPSGDPVPRAVVLDLDEAFRVLEALEDARLELRERGAAPGLQDELATVIRVLHGKLGLDDGGVL
Ga0193726_124784923300020021SoilVSQNDDPVPPGLPFRLEEVFRVLEALEDALLVMEEADVALGLRDELATVIRLVHVRLGLD
Ga0210401_1001915873300020583SoilMVTPVRHSSPVDQNHDPVPAGLLLSLDEAFRVLEALEDALLTMEEASVSPGLQDELATVIRLLHVRLGLDEGGVG
Ga0196959_1001195123300021184SoilVTASDDPVPYGLVLDLEQAWRVLAALEDALLELENLAAAPGLRDELATVIRIIHGNLGLDEGGLA
Ga0207933_104425433300025679Arctic Peat SoilVPPSVTLNLDEAFRVLEALEDGLDTLRQAQLAPGFQDELVTVIRMLHGKLGLDEGGAR
Ga0209203_106706623300025702Anaerobic Digestor SludgeVSDLDPFDDPVPRGVALSLEEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGLDEGGVQ
Ga0209152_1001027743300026325SoilMSRREDDPVPPAIVLDLDEAFRVLEALEDARLAMREADLAPGLQDELATVIRLLHGRLGLDEGGVL
Ga0255188_108108133300027079FreshwaterRHMSGLDAFDDPVPDGIGLSLNEAFRVLEAIEDARLALREHGAAPGLQDELATVIRMLHGKLGLDKGGVQ
Ga0207826_119256023300027680Tropical Forest SoilMSQNGDPVPPGLVLSLEEAFRVLEALEDALLSMEEANVAPGLRDEVATVIRLLHIRLGLDEGG
(restricted) Ga0247833_103994523300027730FreshwaterMSDRNPSDDPVPLSVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIHMLHGKLGFDEGGVQ
Ga0209810_1001406283300027773Surface SoilMSQGDDDRVPPGLVLNLDEAFRVLEALEDALLALEEAGLAPGLQDELATVIRLIHGSLGLDEGGLR
Ga0209277_1018324923300027776Wastewater EffluentMAADGEPDAIPPGLVLGLDEALRVLAALEGALWELQLAQRALGLRDELATVIRRLHGTLGLDPGGLP
Ga0209476_1006446023300027802Activated SludgeMSDGDPFDDPVPRRVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0209180_1032063823300027846Vadose Zone SoilMSRSADDPVPPGIVLSLDEAFRVLEALEDALLIVAEAALAPGLQDELATVIRLLHSRLGLDEGGVP
Ga0209591_1026346613300027850FreshwaterMSDRNPFDDPVPRTVALSLDEAFRVLEAIEDARLALRERGAVPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0209274_1037185023300027853SoilMSQPSNNLLGVSEDDGDPVPPELVLTLDEAFRVLEAMEDPLLSIEEAGVAPGLQDELATVIHFVHGRPGLDEGGVL
Ga0209023_1001536883300027870Freshwater And SedimentMSDGDPFHDPVPRGVALSLDDAFRVLEAIEDARLALRDRGAAPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0209181_1066525813300027878FreshwaterMSDRNPFDDPVPRTVALSLDEAFRVLEAIEDARLALREHGAVPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0209481_1016242623300027880Populus RhizosphereVSPRGTDDDPVPPGLVLDLDEAFRVLEALEDARFALREISAAPGLQDELATVIRILHSRLGFDEGGVP
Ga0209382_1022865343300027909Populus RhizosphereVSPSGTDDDPVPPGLVLDLDEAFRVLEALEDARFALREISAAPGLQDELATVIRILHSR
Ga0209382_1200938823300027909Populus RhizosphereVSPRGTDDDPVPPGLVLDLDEAFRVPAGPGGRPIRAQGDRAAPGLQDELATVIRILHSRLGFDEGGVP
Ga0209583_1003809333300027910WatershedsMQPADDDPVPPGMLLDLDEAFRVLEALEDTRLALREATAGLGLQDELATVIRMLHVRLGFDEGGVR
Ga0209069_1003460933300027915WatershedsMQPADDDPVPPGMLLDLDEAFRVLEALEDARLALREATAGLGLQDELATVIRMLHVRLGFDEGGVR
Ga0209069_1018950633300027915WatershedsVNPLDSFSDPVPSGVVLSLDEAFRVLEALEDARLALRERGAAPGLQDELATVIRMLHGKLGLDEGGVQ
(restricted) Ga0255341_101900153300028570WastewaterMAPSDADDDLVPLGLVLGLDEAFRVLEALEDAIYELERTSMAPGLRDEIATVIRLLHGRLGFDQGGL
Ga0247822_1195382513300028592SoilMSRPGDDDPVPPGLVLGLDEAFRVLEALEDARLELREAGAAPGLQDDLATVIRILHSRLGFDEGGVP
(restricted) Ga0255346_125880313300028677WastewaterERPGARCQTLFVPCRAMAPSDADDDLVPLGLVLGLDEAFRVLEALEDAIYELERTSMAPGLRDEIATVIRLLHGRLGFDQGGL
Ga0302154_1053407123300028882BogVSRPDDDPVPPGLLLSLEEAFRVPEALEDARLAIRESGLAPGLQDELATVIRIIHGRLGFEEGGVL
Ga0239302_11407123300029314Activated SludgeVQTSRVSADDSSIDPVPDAVVLTLDEVFRVLEGLEDARLELRERGAAPGLQDELATVIRVLHGKLGLDEGGVP
Ga0311328_1088461233300029939BogVSRPDDDPVPPGLLLSLEEAFRVPEALEDARLAIRESGLAPGLQDELATVIRIIHGTLGFEEGGVL
Ga0311332_1118055723300029984FenMSDRNPFDDPVPRGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHDKLGFDEGGVQ
Ga0299907_1007055533300030006SoilMSRAGDDDPVPPGLVLGLDEAFRVLEALEDSRLELREAGAAPGLQDELATVIRILHSRLGFDEGGVP
Ga0311349_1124571523300030294FenVSTHDPTDDPVPRIIVLDLDETFRVLEALEDARLALRDRGAAPGLQDELATVIRILHGKLGLDEGGVL
Ga0311372_1152387433300030520PalsaVSQNEDPVPPGLLLSLDEAFRVLEALEDGLLAMEEADVAPGLRDELATVIRLLHVRLGLDEGGVG
Ga0268386_1025112123300030619SoilMSRPGDDDPVPPGLVLGLDEAFRVLEALEDARLELREAGAAPGLQDELATVIRILHSRLGFDEGGVP
Ga0299914_1011792633300031228SoilMSRPGDDDPVPPGLVLGLDEAFRVLEALEDARLELRGAGAAPGLQDELATVIRILHSRLGFDEGGVP
Ga0299914_1117248223300031228SoilMPEEPHDPVPLRVVLSLDEAFRVLEALEDALLELELADAAPGLRDELATVIRVLHVHLGFSEGGGPP
Ga0302324_10114656733300031236PalsaVTQNEDPVPPGLLLTLEEAFRVLEALEDGLLAMEEADVALGLRDELATVIRLLHIRLGLDEGGVG
Ga0272422_106907733300031452RockMSDDDPVPPGIFLDLDEAFRVLEALEESRLTFRDHQLAPGLRDELATLIRFLHDRLGFDSGGLV
Ga0307408_10019977643300031548RhizosphereLTDPDDPVPSGLVLDLENAWRVLAALEDALLELENLAAAPGLRDELATVIRIIHGNLGLDEGGLA
Ga0307374_10007708123300031670SoilMDDDPVPRGVPIELGEALRVLEALEDARDALRRADLLPGLQDELATVVRMLHGKLGLDEGGLR
Ga0307374_1006293443300031670SoilMSRQGDNPVPPGLWLELEDAFRVLEALEDSLLALEEAGLVPGLRDELATVVRLVHGRLGLDEGGLK
Ga0307374_1006725543300031670SoilMSREGNDPVPPGLWLDLEEAFGLLESLEDSLLALEEADLALGLRDELSTVIRHLHGSLGLREGGLR
Ga0307374_1007222873300031670SoilVSEDGDDPIPPELVLGRDEAFRVLEALEDALLSIEEAGVAPGLRDEPATVIRLLHGRLGLDEGGVP
Ga0307372_1029226423300031671SoilMNRPDDDPVPIGIFLELDETFRVLEALEDSIATFMDTEVAPGLVDELATVIRLLHGKLGLAEG
Ga0307372_1036368013300031671SoilMSPQGDDPVPPGLWLDLEEGFRVLEALEDSLLALEEAGLLPGLRDELATVIRLIHGRLGLDEGGLR
Ga0307477_100000292343300031753Hardwood Forest SoilMSEGDDDPVPAELVLGLDDAFRVLEALEDALLSMEDSGVAPGLQDELATVIRLLNSKLGLDEGGVP
Ga0315900_1011023233300031787FreshwaterVSVHDDESVPHHVVLSLDEAFRVLEAIEDARLALRERSAAPGLQDELATVVRLLHGKLGLDEGGVL
Ga0315900_1047478023300031787FreshwaterVSSTDPFDDPVPHTIVLSLDDAFRVLEAIEDARLELRERAAAPGLQDELATVIRMLHGRLGLDEGGAL
Ga0307478_1145011923300031823Hardwood Forest SoilMVTPVRHPSPVDQNHDPVPPGLLLSLDEAFRVLEALEDALLTMEEASASPGLQDELATVIRLLHVRLGLDEG
Ga0307479_1196864013300031962Hardwood Forest SoilLNGRLCAVSRSDDDPVPPPLILSLDEAFRVLEALEDALSALEVARAAPGLRDELATVVRVIHGRLGLDEGGVR
Ga0326597_1006719523300031965SoilMPIEPFDDDPVPPGILLDLGEALRVLEALEDALTAMEQAGLAPGLRDEVATVIRVLHVRLGFDEGGLR
Ga0315905_1109711213300032092FreshwaterVSDRSPFTDPVPEGVALSLDESFRVLEAIEDARLALRERGAAPGLQDELATVIRLLHGKLGLDEGGVQ
Ga0335397_1009722723300032420FreshwaterMSDRNPFDDPVPRNVALSLDEAFRVLEAIEDARLALRERGAVPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0335397_1045871013300032420FreshwaterMSDRNPFDDPVPRTVALSLDEAFRVLEAIEDARLALRERGAFPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0335397_1051413923300032420FreshwaterMSDRNPFDDPVPRNVALSLDEAFRVLEAIEDARLALREHGAVPGLQDELATVIRMLHGKLGF
Ga0335074_1083625423300032895SoilVTGGRPDDDPVPPGLVLSLDESFRVLEALEDARLAMRETGLAPGLQDELATVIRMLHGRLGLEEGGVL
Ga0335000_0024998_1001_12073300034063FreshwaterMSDRNPSDDPVPLGVALSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0335010_0216876_963_11543300034092FreshwaterPFDDRVPRGVELSLDEAFRVLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0335014_0034268_2432_26383300034094FreshwaterMSDRNPFDDPVPRGVGLSLDEAFRVLEAIEDARLALRERGATPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0335051_0336056_370_5883300034109FreshwaterMLTEMSDPHPFDDPVPRSVALSLDEAFRMLEAIEDARLALRERGAAPGLQDELATVIRMLHGKLGFDEGGVQ
Ga0370495_0071797_413_6133300034257Untreated Peat SoilMSGPEDDPVPLGLVLGLEDAFRVLEAIEDALLELIRQEAAPGLRDELATVIRLIHDSLGIDEGGPQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.