NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F043274



Overview

Basic Information
Family ID F043274
Family Type Metagenome / Metatranscriptome
Number of Sequences 156
Average Sequence Length 79 residues
Representative Sequence VRLVASAALALCLLAGCAAERWSYTKPGLTPARLDQDLEVCRRQAHRPYWFAFTRSGRVDQEALNRCMHHKGYSARRDE
Number of Associated Samples 130
Number of Associated Scaffolds 156

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 53.55 %
% of genes near scaffold ends (potentially truncated) 23.08 %
% of genes from short scaffolds (< 2000 bps) 82.05 %
Associated GOLD sequencing projects 124
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.58

Hidden Markov Model (sequence logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (99.359 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere (14.744 % of family members)
Environment Ontology (ENVO) Unclassified (23.077 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (35.897 % of family members)
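The percentages under Most Common Taxonomy and Most Common Ecosystem are stated as shares of family members, so they can be converted back into approximate member counts. The following is a minimal sketch, assuming the denominator is the 156 sequences reported under Basic Information (the page does not state the denominator explicitly). The helper name `share_to_count` and the dictionary labels are hypothetical.

```python
# Assumed denominator: "Number of Sequences" from Basic Information.
FAMILY_SIZE = 156


def share_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a percentage of family members into the nearest whole count."""
    return round(percent / 100 * total)


# Shares quoted on this page; the labels are shortened for illustration.
shares = {
    "Bacteria": 99.359,
    "GOLD: Populus Rhizosphere": 14.744,
    "ENVO: Unclassified": 23.077,
    "EMPO: Plant rhizosphere": 35.897,
}

counts = {label: share_to_count(pct) for label, pct in shares.items()}
for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} members")
```

Under that assumption, each quoted share corresponds to a whole number of members, which is a quick sanity check on the reported figures.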




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
Alignment rows: the 156 family members listed under Family Sequences, in the same order. The column ruler runs to position 108. Rendered with MSAViewer.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix 35.51%, β-sheet 11.21%, Coil/Unstructured 53.27%
Feature Viewer tracks for the representative sequence (positions 1-79): Sequence, α-helices, β-strands, Coil, SS Conf. score, Signal Peptide.

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.58

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
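The two confidence measures quoted in this section can be expressed as simple checks. The sketch below uses only values that appear on this page: the pTM cutoff of 0.7 from the low-quality note and the four pLDDT legend bands. The function names are hypothetical, and assigning non-integer values such as 50.5 to the next band up is an assumption, since the legend lists integer ranges only.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the low-quality note


def is_low_confidence(ptm: float) -> bool:
    """True when the model falls below the page's pTM cutoff."""
    return ptm < PTM_THRESHOLD


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value (0-100) to its legend band."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


print(is_low_confidence(0.58))  # this family's pTM-score
print(plddt_band(65))
```

With this family's pTM-score of 0.58, the check is consistent with the "Low Quality Model" label above.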



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy chart labels: All Organisms; Unclassified. Labelled slice share: 99.4%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types in the chart: Freshwater Sediment; Groundwater Sediment; Natural And Restored Wetlands; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Agricultural Soil; Sugarcane Root And Bulk Soil; Grasslands Soil; Hardwood Forest Soil; Tropical Peatland; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere; Groundwater Sand; Peat Soil; Sediment; Arabidopsis Thaliana Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Rhizosphere; Rhizosphere Soil; Corn Rhizosphere; Sediment Slurry.
Labelled slice shares, in chart order: 12.8%, 3.8%, 5.8%, 3.2%, 7.1%, 3.2%, 5.8%, 3.2%, 3.8%, 4.5%, 14.7%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10214J12806_106017651 | 3300000891 | Soil | ALAGGPAVSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ*
JGI10216J12902_1174734791 | 3300000956 | Soil | LGLCMLAGCASGQEWTYSRPGLTPARLDLDLEACRKQAHRPYWFAVTRSRRVDQDVLNQCMERKGYTPRRDE*
soilH2_102900672 | 3300003324 | Sugarcane Root And Bulk Soil | VRGATGAALALCLLAGCAAQRWSYTKPGLTPARLDQDLESCRREAHRPYWFAFTREGRVDQDALTRCMKHRGYDARRDD*
Ga0055468_101475021 | 3300003993 | Natural And Restored Wetlands | VRGAAWGALGLCLLAGCASERWTYSRPGLTPAGLDHDLESCRRASVRSDWLAVTREGQLDQRAIKRCMERKGYTSQPDR*
Ga0062593_1000841312 | 3300004114 | Soil | VSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ*
Ga0062589_1001584042 | 3300004156 | Soil | VRLRASAALALCLLGGCAAERWSYTKPGLTPARLDQDLETCRKQAHRPYWFAFTRSARVDQEALNLCMQHRGYSARREE*
Ga0063356_1003103853 | 3300004463 | Arabidopsis Thaliana Rhizosphere | VRPHGWVAIGFGLLLAGCATESWTYSKAGLTPARLDQDLGACRRQSVRPQWFAVTRAGRLDQEAITQCMEHKGYTSRRDR*
Ga0062591_1005551232 | 3300004643 | Soil | VRLRASAALALCLLGGCAAERWSYTKPGLTPARLDQDLETCRKQAHRPYWFAFTRSARVDQEALNLCMQHRGYSARREE
Ga0062591_1028752352 | 3300004643 | Soil | MRVVASAALALCLLAGCAAERWSYTKPGLTPARLDQDLEGCRRQAHRPYWFAFTRSGRVDQEALNRCMHHKGYSARRDE*
Ga0062594_1012345272 | 3300005093 | Soil | MRMVGLVALVLGLLAGCASEEWSYTKAGLTPARLDQDLEACRRQARRPQWFAITRDGRLDREAINQCMERKGYTSRRDQ*
Ga0066677_102023331 | 3300005171 | Soil | LALALALCTLAGCAAQRWSYTKPGMTPGRLDQDLESCRRLAHRPYWFAFTRSGRVDQEALNQCMQRRGYTAQRDE*
Ga0066673_101245352 | 3300005175 | Soil | VRTLVAAGLVLGLLAGCAGRWTYDKAGVTPGALDRDLAACRSLAHRPYWFAFTRAARVDQDALNQCMQHRGYSARRDD*
Ga0066685_104593282 | 3300005180 | Soil | MAGLALVRTALALALALCTLAGCAAQRWSYTKPGMTPGRLDQDLESCRRLAHRPYWFAFTRSGRVDQEALNQCMQRRGYTAQRDE*
Ga0066678_101505762 | 3300005181 | Soil | VRAAVVAVLALGLLAGCAERWTFEKPGLTPGRLDTDLESCRRQAHRPYWFAFTRSARVDQDALNQCMQHRGYSARRDD*
Ga0066678_102293172 | 3300005181 | Soil | MAGLALVRAAVALALALCALAGCAAQRWSYTKPGMTPGRLDQDLESCRRLAHRPYWFAFTRSGRVDQEALNQCMQRRGYTAQRDE*
Ga0070690_1001516582 | 3300005330 | Switchgrass Rhizosphere | VSAALALCLLVGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE*
Ga0066388_1000208054 | 3300005332 | Tropical Forest Soil | VKRAAAGALALCLLAGCAAERWSYTKPGLTPARLDQDLEACRRLAHRPYWFAFTRSGRVDQDALNQCMQHRGYDARRGD*
Ga0066388_1005527142 | 3300005332 | Tropical Forest Soil | VIRRLGAVLVLCLLAGCAQHWTYTRPGLLPARLDQDLEACRREAHRPYWFAFTRAARVDQDALNKCMEKRGYTPHRED*
Ga0066388_1066629132 | 3300005332 | Tropical Forest Soil | VKARAAAALALCLLAGCAERWSYTKVGMTPGRLDQDLEACRRVAHRPHWFALTRSARVDQDVLNRCMQQKGYTAHRDD*
Ga0070687_1005369172 | 3300005343 | Switchgrass Rhizosphere | VRLVARAALALCLLAGCAAERWSYTKPGLTPARLDQDLEGCRRQAHRPYWFAFTRSGRVDQEALNRCMHHKGYSARRDE*
Ga0070673_1006315871 | 3300005364 | Switchgrass Rhizosphere | MRMVGLVALVLGLLAGCASEEWSYNKPGLTPARLDQDLEACRRQARRPHWFGITREARLDREAINQCMERKGYTSQRDP*
Ga0070703_104464432 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | LGVLCLALLAGCSTGHWTYDRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ*
Ga0070701_108779122 | 3300005438 | Corn, Switchgrass And Miscanthus Rhizosphere | VRLGVSAALALCLLVGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE*
Ga0070705_1000569913 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | VRLGASAALALCLLVGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE*
Ga0070694_1008384322 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | VRGGALSAALALVLLAGCAAEQWSYKKPGLTPGRLDQDLEACRRQSRRPHWFALTRAGRVDQEALNQCMQHRGYTPHRDD*
Ga0066689_103050172 | 3300005447 | Soil | VVAVLALGLLAGCAERWTFEKPGLTPGRLDTDLESCRRQAHRPYWFAFTRSARVDQDALNQCMQHRGYSARRDD*
Ga0070662_1010950571 | 3300005457 | Corn Rhizosphere | VRLVARAALALCLLAGCAAERWSYTKPGLTPARLDQDLEVCRRQAHRPYWFAFTRSGRVDQEALNRCMHHKGYSARRDE*
Ga0070695_1006491521 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | VSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTS
Ga0070695_1009732941 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | VRGGALGAALALVLLAGCAAEQWSYKKPGLTPGRLDQDLEACRRQSRRPHWFALTRAGRVDQEALNQCMQHRGYTPHRDD*
Ga0066705_106183962 | 3300005569 | Soil | VRLRVGTALALCLLAGCAAQQWSYTKPGLTPARLDQDLEACRRQAHRPYWFAFTRSGRVDQEALNQCMQQRGYAAHRDD*
Ga0068859_1002186491 | 3300005617 | Switchgrass Rhizosphere | VRLVARAALALCLLAGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE*
Ga0068864_1012441652 | 3300005618 | Switchgrass Rhizosphere | LAGGPAMRMVGLVALVLGLLAGCASEEWSYNKPGLTPARLDQDLEACRRQARRPHWFGITREARLDREAINQCMERKGYTSQRDP*
Ga0068864_1020574951 | 3300005618 | Switchgrass Rhizosphere | MSAGGLDPLGRWLLAACAAAMLAGCVAERWSYTKPGLTPARLDLDLESCRREAHRPHWFALTRSARLDQDVLNQCMERKGYNAQRDE*
Ga0068861_1025277841 | 3300005719 | Switchgrass Rhizosphere | ALAGGPAVSRRAWAVLGFCVLAGCAAVEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ*
Ga0068862_1016179192 | 3300005844 | Switchgrass Rhizosphere | GLVALVLGLLAGCASEEWSYTKAGLTPARLDQDLEACRRQARRPQWFAITRDGRLDREAINQCMERKGYTSQRDQ*
Ga0066652_1006717692 | 3300006046 | Soil | VRLRASAALALCLLAGCAAERWSYIKPGLTPARLDQDLEACRRQAHRPNWFAFTRSARVDQEALNECMHRRGYAARRDE*
Ga0075417_100610742 | 3300006049 | Populus Rhizosphere | MLAGCASGQEWTYSRPGLTPARLDLDLEACRKQAHRPYWFAVTRSRRVDQDVLNQCMERKGYTPRRDE*
Ga0075428_1000670343 | 3300006844 | Populus Rhizosphere | VRLPGWALIALGLLAGCATERWVYSKAGVTPARLGQDLELCRRHAVRPQRFAISREGRLDQDAIRECMEHKGYTSRREE*
Ga0075421_1008191101 | 3300006845 | Populus Rhizosphere | MLAGCASGQEWTYSRPGLTPARLDLDLEACRKQAHRPYWFAVTRSRRVDQDVLNQCMERKGYT
Ga0075421_1009256972 | 3300006845 | Populus Rhizosphere | MARIALVRASVTLALVLALCALAGCAAQRWSYTKPGMTPSRLDQDLEACRRLAHRPYWFAFTRSARVDQEALNQCMQRRGYTAQRDA*
Ga0075430_1000015975 | 3300006846 | Populus Rhizosphere | MRGAGAALGLCLLAGCASAQWTYSRPGLTPARLDLDLEFCRRQAQRPDWFALSRSGRLDQDAVKRCMERKGYTAGRDE*
Ga0075433_113810322 | 3300006852 | Populus Rhizosphere | MLALVLALCALAGCAAQRWSYTKPGMTPSRLDQDLEACRRLAHRPYWFAFTRSARVDQEALNQCMQRRGYTAQRDE*
Ga0075433_119425062 | 3300006852 | Populus Rhizosphere | ARPRRRRALPGRMARIALVRATVALALVLCALAGCAAQRWSYTKPGMTPGRLDQDLEACRRLAHRPYWFAFTRSARVDQEALNQCMQRRGYTAQRDE*
Ga0075420_1002779212 | 3300006853 | Populus Rhizosphere | VRPHDWVAIGFGLLLAGCATESWTYSKAGLTPARLDQDLGACRRQSVRPQWFAVTRAGRLDQEAITQCMEHKGYTSRRDR*
Ga0075425_1001508693 | 3300006854 | Populus Rhizosphere | VRLGASAALALCLLAGCAAERWSYTKPGLTPARLDQDLEMCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE*
Ga0075425_1028875641 | 3300006854 | Populus Rhizosphere | VRGVWAALGLCMLAGCASGQEWTYSRPGLTPARLDLDLEACRKQAHRPYWFAVTRSRRVDQDVLNQCMERKGYTPRRDE*
Ga0075429_1006745792 | 3300006880 | Populus Rhizosphere | MARIALVRASVTLALVLALCALAGCAAQRWSYTKPGMTPSRLDQDLEACRRLAHRPYWFAFTRSARVDQEALNQCMQRRGYTAQRDE*
Ga0075426_100160694 | 3300006903 | Populus Rhizosphere | MVAVVALGLLAGCATERWSYEKVGLTPSGLDRDLEACRRQAHRPYWFAFTRSARVDQEALNQCMQHRGYSARRDD*
Ga0079219_105943002 | 3300006954 | Agricultural Soil | VAGRAPVRAARLGVLMLGLLAGCAPAHWTYDKPGLTPGKLDQDMAACRRLAHRPYWFALTRSGRVDQEALNQCMQHRGYTARRDD*
Ga0105098_103556292 | 3300009081 | Freshwater Sediment | MRKRAWAVLGLGLMAGCATERWSYSKPGLTPARLDRDLGACRRQAARPQWFAVTRDGQLDQAAITQCMERKGYTSHRDD*
Ga0111539_118466981 | 3300009094 | Populus Rhizosphere | MARIALVRATAMLALVLALCALAGCAAQRWSYTKPGMTPSRLDQDLEACRRLAHRPYWFAFTRSARVDQEAL
Ga0111539_122026932 | 3300009094 | Populus Rhizosphere | MVGLVALVLGLLAGCASEEWSYTKAGLTPARLDQDLEACRRQARRPQWFAITRDGRLDREAINQCMERKGYTSRRDQ*
Ga0066709_1034178432 | 3300009137 | Grasslands Soil | VAGRALVRAAVVAVLALGLLAGCAERWTFEKPGLTPGRLDTDLESCRRQAHRPYWFAFTRSARVDQDALNQCMQHRGYSARRDD*
Ga0114129_1000524621 | 3300009147 | Populus Rhizosphere | MRGAGAALGLCLLAGCASAHWTYSRPGLTPARLDLDLEFCRRQAQRPDWFALSRSGRLDQDAVKRCMERKGYTAGRDE*
Ga0114129_101540742 | 3300009147 | Populus Rhizosphere | MARIALVRATAMLALVLALCALAGCAAQRWSYTKPGMTPSRLDQDLEACRRLAHRPYWFAFTRSARVDQEALNQCMQRRGYTAQRDE*
Ga0105092_101516852 | 3300009157 | Freshwater Sediment | MRAWAVLGLGLLAGCATERWSYSKPGLTPARLDRDLGACRRQSARPQWFAVTRDGQLDQAAITQCMERKGYTSHRDD*
Ga0075423_100703713 | 3300009162 | Populus Rhizosphere | VARRPSVRLGASAALALCLLAGCAAERWSYTKPGLTPARLDQDLEMCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE*
Ga0075423_107396642 | 3300009162 | Populus Rhizosphere | MARIALVRATVALALVLCALAGCAAQRWSYTKPGMTPGRLDQDLEACRRLAHRPYWFAFTRSARVDQEALNQCMQRRGYTAQRDE*
Ga0105087_11203022 | 3300009819 | Groundwater Sand | VRLGTWTVLGLCLLAGCATERWSYNRPGLTPGRLDQDLESCRKQAHRPHWFALTHAARVDQEALNQCMERKGYTAR
Ga0126380_105124822 | 3300010043 | Tropical Forest Soil | VRARAAAALALCLLAGCAERWSYTKAGMTPGKLDQDLEACRRVAHRPHWFALTRSARVDQDVLNRCMQQKGYTAHRDD*
Ga0126380_117247092 | 3300010043 | Tropical Forest Soil | MLAGCASGQEWTYSRPGLTPARLDLDLEACRKQAHRPHWFGLTRSGRVDQDVLNQCMERKGYTARRDE*
Ga0126376_108862642 | 3300010359 | Tropical Forest Soil | VASRRLVKRAAAGALALCLLAGCAAERWSYTKPGLTPARLDQDLEACRRLAHRPYWFAFTRSGRVDQDALNQCMQHRGYDARRGD*
Ga0126377_127681971 | 3300010362 | Tropical Forest Soil | LVRARAAAVLALCLLAGCAERWSYTKAGMTPGKLDQDLEACRRVAHRPHWFALTRSARVDQDVLNRCMQQKGYTAHRDD*
Ga0134125_124509081 | 3300010371 | Terrestrial Soil | MLGLVALVLGLLAGCASEEWSYTKAGLTPARLDQDLEACRRQARRPQWFAITRDGRLDREAINQCMERKGYTSQRDQ*
Ga0134124_100354415 | 3300010397 | Terrestrial Soil | VARRPSVRLGVSAALALCLLVGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE*
Ga0134127_108514931 | 3300010399 | Terrestrial Soil | MVGLVALVLGLLAGCASEEWSYNKPGLTPARLDQDLEACRRQARRPHWFGITREARLDREAINQCMERKGYTSQRDP*
Ga0134127_115450302 | 3300010399 | Terrestrial Soil | VARRPAVRLVARAALALCLLAGCAAERWSYTKPGLTPARLDQDLEGCRRQAHRPYWFAFTRSGRVDQEALNRCMHHKGYSARRDE*
Ga0105246_119098861 | 3300011119 | Miscanthus Rhizosphere | DHNLALADALASLAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ*
Ga0137369_101632242 | 3300012355 | Vadose Zone Soil | MRPRVWTVLGLCLLAGCAAERWSYTRPGLTPARLDLDLEVCRKQAHRPHWFALTRSARVDREAFNQCMERKGYAARRDD*
Ga0137397_100043996 | 3300012685 | Vadose Zone Soil | VARPALVRLRASAALVLCLLAGCAAQRWSFTKPGLTPARLDQDLEACRRQAHRPYWFAFTRSGRVDQEALNQCMQQRGYAAHRDD*
Ga0157298_103788621 | 3300012913 | Soil | MRALCGVTMLGLLGGCAAERWSYTKPGLTPARLDQDLETCRKQAHRPYWFAFTRSARVDQEALNLCMQHRGYSARREE
Ga0137394_100031448 | 3300012922 | Vadose Zone Soil | VARPALVRLGASAALALCLLAGCAAQRWSFTKPGLTPARLDQDLEACRRQAHRPYWFAFTRSGRVDQEALNQCMQQRGYAAHRDD*
Ga0137394_105352332 | 3300012922 | Vadose Zone Soil | VLALCLLAGCAAERWSYTRAGLTPARLDLDLEVCRKHAQRPHWFALTRSGRLDLEALNQCMERKGYTARRDE*
Ga0137394_112257252 | 3300012922 | Vadose Zone Soil | VARTALVRAGVTAALALCLLAGCGPQRWSYTRPGLTPSRLDQDLETCKRQAHRAYWFAFTRSARVDQEALNQCMQHKGYTAQRDD*
Ga0137359_103623332 | 3300012923 | Vadose Zone Soil | VRPQALAALGLCVLAGCAAERWSYTRPGLTPARLDIDLEACRRQAHRPYWFAFTRSARLDQDALNQCMERKGYTGRREE*
Ga0157378_102333742 | 3300013297 | Miscanthus Rhizosphere | VRLRASAALALCLLAGCAAERWSYNKPGLTPARLDQDLETCRRQAHRPHWFAFTRSARVDQEALNECMHRRGYSARRDE*
Ga0075312_11106891 | 3300014254 | Natural And Restored Wetlands | RRAAEALGLCLLAGCASERWTYSRPGLTPAGLDHDLESCRRASVRSDWLAVTREGQLDQRAIKRCMERKGYTSQPDR*
Ga0075309_10850591 | 3300014268 | Natural And Restored Wetlands | VRGAAWGALGLCLLAGCASERWTYSRPGLTPAGLDHDLESCRRASVRSDWLAVTREGQLDQRAIKRCMERK
Ga0075351_11346641 | 3300014318 | Natural And Restored Wetlands | VARAALVRPDVAAALALCLLAGCAAERWSYTKPGLTPGKLDQDLESCRRLAHRPYWFAFTRSARVDQVALN
Ga0137411_10030042 | 3300015052 | Vadose Zone Soil | ALALCLLAGCAAQRWSFTKPGLTPARLDQDLEACRRQAHRPYWFAFTRSGRVDQEALNQCMQQRGYAAHRDD*
Ga0137409_112911012 | 3300015245 | Vadose Zone Soil | VALVLALCALGGCAAERWSYTKAGLTPGRLDQDLETCRRQAHRLYWFAFTRSARVDQGALNQCMERKGYAAHHDD*
Ga0187775_102062242 | 3300017939 | Tropical Peatland | VRRASGLLALCLLAGCATEEWSFTRAGATPAQLDQDLEGCRRQAQRPYTWALTRQGRVDPDVLNRCMERKGYAAHREN
Ga0187778_107665602 | 3300017961 | Tropical Peatland | VRPGTRALALCLLAGCATPQWSYTKPGLTPGRLDQDLEACKRQAHRAYWFAVTRSARVDQDALNQCMQNRGYTARQDE
Ga0184610_10066982 | 3300017997 | Groundwater Sediment | MVVGLGVLAGCAAERWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRDD
Ga0184604_103150361 | 3300018000 | Groundwater Sediment | AGRRARVSRALARGPAVRRRAWAVLAVGLLAGCAAGQWSYHKPGLTPAQLDQDLVACRRQARRPHWFALSRDARLDQETINQCMERKGYTARRDD
Ga0184605_100868232 | 3300018027 | Groundwater Sediment | VRPRTWTVLGLCLLAGCASERWSYTRAGLTPARLDLDLEVCRKQAHRPHWFALTRSARVDQEALNQCMERKGYTARRDD
Ga0184608_100565402 | 3300018028 | Groundwater Sediment | MVVGLGLLAGCAAERWSYTRSGLTPARLDLDLELCRKQAQRPHSFALTRSARVDREALNQCMERKGYTAQRDD
Ga0184634_101565562 | 3300018031 | Groundwater Sediment | VRLGTWTVLGLCLVAGCAAERWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARLDREALNQCMERKGYTAQRDD
Ga0184638_10452322 | 3300018052 | Groundwater Sediment | MVVGLGLLAGCAAERWSYTRSGLTPARLDLDLEICRKQAQRPHWFALTRSARVDRETLNQCMERKGYTAQRDD
Ga0184626_102993272 | 3300018053 | Groundwater Sediment | MVVGLGLLAGCAAERWSYTRSGLTPARLDLDLEICRKQAQRPHWFALTRSARVDREALNQCMERKGYTAQRDD
Ga0184621_100354492 | 3300018054 | Groundwater Sediment | MVVGLGVLAGCAAERWSYTRSGLTPARLDLDLEICRKQAQRPHWFALTRSARVDRETLNQCMERKGYTAQRDD
Ga0184623_102140902 | 3300018056 | Groundwater Sediment | VRLRAWTVLGLSLLASCAAEGWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRDD
Ga0184623_105119952 | 3300018056 | Groundwater Sediment | VRLRAWTVLGLSLLAGCAAEGWSYTRAGLTPARLDLDLEICRKQAQRPHWFALTRSARVDREALNQCMERKGYTAQRDD
Ga0184619_102578012 | 3300018061 | Groundwater Sediment | VRPRTWTVLGLCLLAGCASERWSYTRAGLTPARLDLDLEACRKQAHRPHWFALVRSARVDQEALNQCMERKGYTARRDD
Ga0184637_101159022 | 3300018063 | Groundwater Sediment | VRLRAWTVLGLSLLAGCAAEGWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRDD
Ga0184640_100436182 | 3300018074 | Groundwater Sediment | VRLRAWTVLGLSLLAGCAAEGWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARLDREALNQCMERKGYAARRDE
Ga0184632_100808992 | 3300018075 | Groundwater Sediment | MVVGLGVLAGCAAERWSYTRAGLTPARLDLDLEICRKQAQRPHWFALTRSARVDQETLNQCMERKGYTAQRDD
Ga0184632_104209792 | 3300018075 | Groundwater Sediment | MRPRVWTVLGLCLLAGCAAERWSYTRSGLTPARLDLDLEICRKQAHRPHWFALTRSARVDQEALNQCMERKGYTARRDD
Ga0184609_101218152 | 3300018076 | Groundwater Sediment | VRLRAWTVLGLSLLTGCAAEGWSYTRAGLTPARLDLDLEFCRKQAHRPHWFALTRSARVDQEALNQCMERKGYTARRDD
Ga0184633_100271184 | 3300018077 | Groundwater Sediment | VRLRTWTVLGLCLLAGCAAERWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRDD
Ga0184612_104253792 | 3300018078 | Groundwater Sediment | MRPRVWTVLGLCLLAGCAAERWSYTRSGLTPARLDLDLENCRKQAHRPHWFALTRSARVDQEAFNQCMERKGYAAR
Ga0184639_101294912 | 3300018082 | Groundwater Sediment | VRLGTWTVLGLCLVAGCAAERWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRDD
Ga0184629_101499532 | 3300018084 | Groundwater Sediment | VRLRTWTVLGLCLLAGCAAERWSYTRAGLTPVRLDLDLESCRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRDD
Ga0066655_106129142 | 3300018431 | Grasslands Soil | MRALALALALCTLAGCAAQRWSYTKPGMTPGRLDQDLESCRRLAHRPYWFAFTRSGRVDQEALNQCMQRRGYTAQRDE
Ga0066667_109677952 | 3300018433 | Grasslands Soil | MAGLALVRTALALALALCTLAGCAAQRWSYTKPGMTPGRLDQDLESCRRLAHRPYWFAFTRSGRVDQEALNQCMQRRGYTAQRDE
Ga0066662_108151472 | 3300018468 | Grasslands Soil | VVAVLALGLLAGCAERWTFERPGLTPGRLDTDLESCRRQAHRPYWFAFTRSARVDQDALNQCMQHRGYSARRDD
Ga0184646_10295322 | 3300019259 | Groundwater Sediment | AMSRSLAGGSPVRLRAWTVLGLSLLTGCAAEGWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRDD
Ga0193755_10561712 | 3300020004 | Soil | VRPQALAALGLCVLAGCAAERWSYTRPGLTPARLDIDLQACRHQAHRPYWFAFTRSARLDQDALNQCMERKGYTGRREE
Ga0210382_101151961 | 3300021080 | Groundwater Sediment | VRPRTWTVLGLCLLAGCASERWSYTRAGLTPARLDLDLEVCRKQAHRPHWFALTRSARVDQEVLNQCMERKGYTARRDD
Ga0210379_104354861 | 3300021081 | Groundwater Sediment | ALSRPLAGRPPVRLRTWTVLGLCLLAGCAAERWSYTRAGLTPVRLDLDLESCRKQAHRPHWFALTRSARVDQEALNQCMERKGYAARRDE
Ga0210377_100026264 | 3300021090 | Groundwater Sediment | VRFLTWTVLGLCLLAGCAADRWSYTRAGLTPVRLDLDLESCRKQAHRPHWFALTRSARVDREALNQCMERKGYTARRED
Ga0207642_105561861 | 3300025899 | Miscanthus Rhizosphere | PSVRLGASAALALCLLVGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE
Ga0207686_104363533 | 3300025934 | Miscanthus Rhizosphere | VSAALALCLLVGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE
Ga0207703_117127932 | 3300026035 | Switchgrass Rhizosphere | VRLVASAALALCLLAGCAAERWSYTKPGLTPARLDQDLEVCRRQAHRPYWFAFTRSGRVDQEALNRCMHHKGYSARRDE
Ga0207648_102811132 | 3300026089 | Miscanthus Rhizosphere | VRLGASAALALCLLVGCAAERWSYTRPGLTPARLDQDLETCRRQAHRPYWFAFTRSARVDQEALNECMHKRGYSARRDE
Ga0207648_107503382 | 3300026089 | Miscanthus Rhizosphere | VRLVARAALALCLLAGCAAERWSYTKPGLTPARLDQDLEVCRRQAHRPYWFAFTRSGRVDQEALNRCMHHKGYSARRDE
Ga0209237_10897082 | 3300026297 | Grasslands Soil | PRRRRAVPGDVAGRALVRAAVVAVLALGLLAGCAERWTFEKPGLTPGRLDTDLESCRRQAHRPYWFAFTRSARVDQDALNQCMQHRGYSARRDD
Ga0209686_10590202 | 3300026315 | Soil | MAGLALVRAAVALALALCTLAGCAAQRWSYTKPGMTPGRLDQDLESCRRLAHRPYWFAFTRSGRVDQEALNQCMQRRGYTAQRDE
Ga0209801_12477812 | 3300026326 | Soil | MAGLALVRAAVALALALCALAGCAAQRWSYTKPGMTPGRLDQDLEACRRLAHRPYWFAFTRSGRVDQEALNQCMQRRGYTAQRDE
Ga0209378_12299481 | 3300026528 | Soil | PRARPRRRRAVPGDVAGRALVRAAVVAVLALGLLAGCAERWTFEKPGLTPGRLDTDLESCRRQAHRPYWFAFTRSARVDQDALNQCMQHRGYSARRDD
Ga0209805_13763992 | 3300026542 | Soil | VSAGLVAVLALGLLTGCAAERWTYDKAGLTPGGLDRDLEVCRRQAHRPYWFAFTRSARVDQDALNQCMQHRGYSAR
Ga0209819_101088162 | 3300027722 | Freshwater Sediment | MRMRAWAVLGLGLLAGCATERWSYSKPGLTPARLDRDLGACRRQSARPQWFAVTRDGQLDQAAITQCMERKGYTSHRDD
Ga0209814_100541753 | 3300027873 | Populus Rhizosphere | MLAGCASGQEWTYSRPGLTPARLDLDLEACRKQAHRPYWFAVTRSRRVDQDVLNQCMERKGYTPRRDE
Ga0209814_102406682 | 3300027873 | Populus Rhizosphere | MLALVLALCALAGCAAQRWSYTKPGMTPSRLDQDLEACRRLAHRPYWFAFTRSARVDQEALNQCMQRRGYTAQRDE
Ga0209481_100158133 | 3300027880 | Populus Rhizosphere | MRGAGAALGLCLLAGCASAQWTYSRPGLTPARLDLDLEFCRRQAQRPDWFALSRSGRLDQDAVKRCMERKGYTAGRDE
Ga0209382_10000016125 | 3300027909 | Populus Rhizosphere | VRLPGWALIALGLLAGCATERWVYSKAGVTPARLGQDLELCRRHAVRPQRFAISREGRLDQDAIRECMEHKGYTSRREE
Ga0209382_102208243 | 3300027909 | Populus Rhizosphere | VRPHGWVAIGFGLLLAGCATESWTYSKAGLTPARLDQDLGACRRQSVRPQWFAVTRAGRLDQEAITQCMEHKGYTSRRDR
Ga0268264_117410101 | 3300028381 | Switchgrass Rhizosphere | VSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCM
Ga0137415_105773192 | 3300028536 | Vadose Zone Soil | VRLRASAALVLCLLAGCAAQRWSFTKPGLTPARLDQDLEACRRQAHRPYWFAFTRSGRVDQEALNQCMQQRGYAAHRDD
Ga0247823_101867012 | 3300028590 | Soil | VRPHDWVAIGFGLLLAGCATESWTYSKAGLTPARLDQDLGACRRQSVRPQWFAVTRAGRLDQEAITQCMEHKGYTSRRDR
Ga0307296_108295962 | 3300028819 | Soil | VRPRTWTVLGLCLLAGCASERWSYTRAGLTPARLDLDLEVCRKQAHRPHWFALVRSARVDQEALNQCMERKGYTARRDD
Ga0299907_105489831 | 3300030006 | Soil | MRMRAWAVLGLGLLAGCATERWSYSKQGLTPARLDRDLGACRRQSARPQWFAVTRDGQLDQAAITQCMERKGYTSHRDD
Ga0247826_104043431 | 3300030336 | Soil | WVAIGFGLLLAGCATESWTYSKAGLTPARLDQDLGACRRQSVRPQWFAVTRAGRLDQEAITQCMEHKGYTSRRDR
Ga0299906_100052025 | 3300030606 | Soil | VKARGWVALALGLLGGCAAQRWDYSKPGLTPASLDQDLTACRREAHRPYRFALTHSGRVDQDALNQCMTRKGYTVRPDA
Ga0310888_101426212 | 3300031538 | Soil | VSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ
Ga0310887_101497131 | 3300031547 | Soil | LAGGPAVSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ
Ga0307408_1000639263 | 3300031548 | Rhizosphere | VRARVWGAIGLGLLAGCATESWTYSRPGLTPARLDQDLEACRRQSVRPQWFAVTRSGRLDQEAINQCMERKGYTSRRDR
Ga0310813_100791503 | 3300031716 | Soil | MSAGGLDPLGRWLLAACAAAMLAGCVAERWSYTKPGLTPARLDLDLESCRREAHRPHWFALTRSARLDQDVLNQCMERKGYNAQRDE
Ga0307469_100792083 | 3300031720 | Hardwood Forest Soil | VTAALALCLLAGCGPERWSYVKPGLTPGRLDQDLETCKRQAHRPYWFAFTRSARVDQEALNQCMQHKGYTAQRDD
Ga0307469_114084542 | 3300031720 | Hardwood Forest Soil | MTAALALCLLAGCAAQRWSYTRPGLTPGRLDTDLQMCKRQAHRPYWFAFTRSARVDQDALNQCMQHKGYTAQRDD
Ga0307469_122333011 | 3300031720 | Hardwood Forest Soil | VALGLCLLAGCAAERWSYTKTGLTPGKLDQDLEACKRSAHRPYWFAFTRSARVDQEALNQCMQHKGYAAHRDD
Ga0307468_1000240043 | 3300031740 | Hardwood Forest Soil | VSRSVSAALALVLLAGCAAQRWSYTKPGLTPARLDQDMEACRRQAHRPYWFAFTRSARVDQDALNKCMQQRGYAAHRDD
Ga0307468_1000709282 | 3300031740 | Hardwood Forest Soil | VRTGVTAALALCLLAGCGPERWSYVKPGLTPGRLDQDLETCKRQAHRPYCFAFTRSARVDQEALNQCMQHKGYTAQRDD
Ga0307468_1007550202 | 3300031740 | Hardwood Forest Soil | VRLRIWTILVLCLLTGCAERWSYTRTGLTPARLDLDLEACRKQAHRPHWFALTRSARVDQDALNQCMERKGYTARRDD
Ga0307468_1011249222 | 3300031740 | Hardwood Forest Soil | VRGALSAALVLILLAGCAAQQWSYTKSGLTPGRLDQDLEACRRLAKRPYWFALSRSGRVDQNVLNQCMQHRGYTPHRDD
Ga0310892_101807372 | 3300031858 | Soil | VSRRAWAALGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDQ
Ga0214473_103336463 | 3300031949 | Soil | VRLRTWTVLGLCLLAGCAAERWSYTRAGLTPARLDLDLEICRKQAHRPHWFALTRSGRVDQEALNQCMEHKGYAARRDD
Ga0307411_101985382 | 3300032005 | Rhizosphere | AGGPPVRARVWGAIGLGLLAGCATESWTYSRPGLTPARLDQDLEACRRQSVRPQWFAVTRSGRLDQEAINQCMERKGYTSRRDR
Ga0307470_107436531 | 3300032174 | Hardwood Forest Soil | RTWTVLGLCLLAGCAAERWSYTRAGLTPARLDLDLEVCRKHAQRPHWFALTRSGRLDQEALNQCMERKGYTARRDD
Ga0307471_1001450602 | 3300032180 | Hardwood Forest Soil | MVAVVALGLLAGCATERWSYEKVGLTPGGLDRDLEACRRQAHRPYWFAFTRSARVDQEALNQCMQHRGYSARRDD
Ga0335085_103873042 | 3300032770 | Soil | VSVRAAAALALCLLAGCAAERWSYTKPGLTPGKLDQDLEACRRQAHRPYWFAFTRSARVDQEALNQCMQQKGYAARPDD
Ga0335084_100258055 | 3300033004 | Soil | VRARAAAALALCLLAGCAAERWSYTKPGLTPGKLDQDLEACRRQAHRPYWFAFTRSARVDQEALNQCMQQKGYAARPDD
Ga0326726_119071701 | 3300033433 | Peat Soil | VKPWVLAALGLCLLAGCAPERWSYTRPGLTPARLDIDLESCRRQAHRPQWFALTRSARLDQDALNQCMERKGYTGQRDE
Ga0364946_044083_703_912 | 3300033815 | Sediment | VCLLAGCAAERWSYTRAGLTPVRLDLDLESCRKQAHRPHWFALTRSARVDQEALNQCMERKGYTARRDD
Ga0373915_019862_287_514 | 3300034162 | Sediment Slurry | VKPHGWVVIGVGLLLAGCATESWTYSKPGLTPARLDQDLEACRRQSVRPQWFAVTRAGRLDQEAINQCMEHKGYTS
Ga0373959_0028024_557_796 | 3300034820 | Rhizosphere Soil | VSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDR
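Each row of the listing above carries four fields: protein ID, sample taxon ID, habitat, and sequence. The following is a minimal sketch of splitting one row with a regular expression. It rests on assumptions inferred from the rows shown here, not on any documented NMPFamsDB format: the taxon ID is exactly 10 digits, the habitat starts with an uppercase letter and ends with a lowercase one, and the sequence is uppercase with an optional trailing `*`. The pattern tolerates an optional `|` between fields, so it also accepts rows whose fields run together. `ROW` and `parse_row` are hypothetical names.

```python
import re

# Assumed row layout: <protein id><10-digit taxon id><Habitat><SEQUENCE>[*]
ROW = re.compile(
    r"^(?P<protein_id>\S+?)\s*\|?\s*"
    r"(?P<taxon_id>\d{10})\s*\|?\s*"
    r"(?P<habitat>[A-Z][A-Za-z, ]*?[a-z])\s*\|?\s*"
    r"(?P<sequence>[A-Z]+)(?P<stop>\*?)$"
)


def parse_row(line: str) -> dict:
    """Split one Family Sequences row into its fields."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    record = match.groupdict()
    # Length in residues, not counting the optional trailing '*'.
    record["length"] = len(record["sequence"])
    return record


# Last row of the listing, written here with its fields run together.
row = parse_row(
    "Ga0373959_0028024_557_7963300034820Rhizosphere Soil"
    "VSRRAWAVLGFCVLAGCAAEEWSYTRPGLTPARLDQDLEACRRQARRPQWFALTRAARLDQDAINQCMERKGYTSQRDR"
)
print(row["protein_id"], row["taxon_id"], row["habitat"], row["length"])
```

The non-greedy protein ID group is what makes this work on run-together rows: it grows until exactly 10 digits are followed by the capitalised habitat, so digits belonging to the ID are not mistaken for the taxon ID.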




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice