NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F020569


Overview

Basic Information
Family ID F020569
Family Type Metagenome / Metatranscriptome
Number of Sequences 223
Average Sequence Length 47 residues
Representative Sequence YTPAKAGGLIEEQQRTGRKFGSMNERELAETLQRHGWELLGPSPL
Number of Associated Samples 188
Number of Associated Scaffolds 223

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 2.24 %
% of genes near scaffold ends (potentially truncated) 94.17 %
% of genes from short scaffolds (< 2000 bps) 91.93 %
Associated GOLD sequencing projects 175
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.43
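
The percentages above can be turned back into approximate gene counts. The following is a minimal Python sketch, assuming each percentage is computed over all 223 family sequences (the page does not state the denominator); the dictionary labels and the helper name `percent_to_count` are illustrative only.

```python
# Number of sequences in family F020569, as listed under Basic Information.
N_SEQUENCES = 223

# Percentages copied from the Quality Assessment list.
quality_percentages = {
    "valid RBS motif": 2.24,
    "near scaffold end": 94.17,
    "short scaffold (< 2000 bp)": 91.93,
}


def percent_to_count(percent, total=N_SEQUENCES):
    """Convert a percentage of the family into the nearest whole gene count."""
    return round(total * percent / 100)


counts = {label: percent_to_count(p) for label, p in quality_percentages.items()}

for label, count in counts.items():
    print(f"{label}: ~{count} of {N_SEQUENCES} genes")
```

Under that assumption, roughly 5 genes carry a valid RBS motif, about 210 lie near a scaffold end, and about 205 come from short scaffolds.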


Most Common Taxonomy
Group Bacteria (77.130 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (23.767 % of family members)
Environment Ontology (ENVO) Unclassified (23.318 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (52.018 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all 223 sequences in the family (interactive MSAViewer display; alignment columns 1-70 shown in the viewer are not reproduced here).



Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 35.62%, β-sheet: 0.00%, Coil/Unstructured: 64.38%

Predicted 3D Structure

pTM-score: 0.43

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Chart of family members by NCBI taxonomy. Labels: All Organisms, Unclassified. Values: 77.1%, 22.9%.

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Chart of family members by associated habitat type. Distinct habitat labels: Peatland; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Surface Soil; Agricultural Soil; Forest Soil; Hardwood Forest Soil; Untreated Peat Soil; Tropical Peatland; Permafrost; Corn, Switchgrass And Miscanthus Rhizosphere; Groundwater Sand; Sandy Soil; Palsa; Exposed Rock; Termite Gut; Arabidopsis Rhizosphere; Avena Fatua Rhizosphere; Switchgrass Rhizosphere; Miscanthus Rhizosphere; Populus Endosphere; Plant Roots; Populus Rhizosphere; Rhizosphere; Anaerobic Digestor Sludge; Boreal Forest Soil.
Labelled slice values: 13.5%, 5.8%, 6.7%, 23.8%, 4.9%, 3.1%, 4.0%, 4.0%.



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10044084133300002245Forest SoilVLFLYTPAKAGGLIEEQQRTGRKFASMNERELADILERHGWELLDPSPL*
JGIcombinedJ26739_10045812013300002245Forest SoilTPAKAGGLIEEQQRTGRGFASMNEHELTELLRRHGWELLGPSPL*
C688J35102_11992468813300002568SoilKSTGAETGRVLFLYTPAKAGGLLEEQHRTERNFAAMTESELADTLQRHGWELLGPSPL*
JGI25382J43887_1004417013300002908Grasslands SoilPARAGGLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL*
JGI25382J43887_1029761223300002908Grasslands SoilGGLLEEQQRTDRTFAAMDEREAAELRQRHGWEIVGPSPL*
JGIcombinedJ51221_1030447623300003505Forest SoilLFLYTPATAGGLIEEQQRTGRKFADMSKAELADFLQRHGWELLGPSPL*
JGIcombinedJ51221_1040604813300003505Forest SoilGLIEEQQRTGRKFAMMGKEEMDDILDRHGWMIVGPSPLET*
Ga0062595_10186361513300004479SoilGAEAGLVLFLYTPAAAGGVIEEQQRTGSNFSSMSPTERAEMLRRYDWELLGPSPL*
Ga0066677_1085893623300005171SoilAETGRVLFLYTPAKAGGLIEEQQRTGRGFRSMNEHELTDLLRRHGWELLGPPAL*
Ga0066683_1007656333300005172SoilLFLYTPARAGGLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL*
Ga0066673_1081753423300005175SoilKAGGFIEEQHRTERALASMTESERAELLQRHGWELLGPSPFVT*
Ga0066679_1082036723300005176SoilRVLFLYTPAKAGRLIEEQQRTGRKFASMNERELADLLQRHGWELVGPSPL*
Ga0066388_10046371523300005332Tropical Forest SoilRVLFLYTPGKAGGLIEEQQRTGSNFSTMTEAARAEMLRRYGWELLGPSPL*
Ga0068868_10146104513300005338Miscanthus RhizosphereGAETGRVLFLYTPGKAGDLIEEQQRTRRSFSAMSEDELAGFLQRHGWEILGPSPL*
Ga0070710_1150648823300005437Corn, Switchgrass And Miscanthus RhizosphereLFLYAPAKAGGLIEEQHRTQRTFSTMSEEERADTLRRHGWELLGPSPL*
Ga0070711_10171016413300005439Corn, Switchgrass And Miscanthus RhizosphereFLYTPARAGGLIEEQQRTGRNFASMTEREAAELRQRYGWELLGPSPL*
Ga0066686_1008321413300005446SoilTGRVLFLYTPARAGGLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL*
Ga0070706_10126619823300005467Corn, Switchgrass And Miscanthus RhizosphereTPAGAGGLIEEQQRTGRPFASMNERESAQMRQRHGWEIVGPSPL*
Ga0070738_1007079813300005531Surface SoilLYTPGAAGRLIEEQERTGRKFSAMGERELADFLERHGWEILGPSPL*
Ga0070738_1034288013300005531Surface SoilLYTPGAAGRLIEEQERTGRKFSAMGERELADFLERHGWEIVGPSPL*
Ga0066700_1044938123300005559SoilYTPAGAGGLLEEQQRTHHPIASMNEREAAELRQRHGWEIVGPTPL*
Ga0066699_1120884013300005561SoilAGGLIEEQQRTGRGFASMNEHELTELLRRHGWKLLGPSPL*
Ga0066693_1004313113300005566SoilTGRVLFVYTPAKAGGLIEEQHRTQRNFASMSETELADMLQRHGWELLGPSPL*
Ga0070740_1037322823300005607Surface SoilHAWKSTGAETGKVLFLYTPGAAGRLIEEQERTGRKFSAMGERELADFLERHGWEILGPSPL*
Ga0070763_1013672613300005610SoilFLYTPAKAGGLIEEQQRTGSKFASMNERELADILGRHGWELLGPSPL*
Ga0070763_1038431413300005610SoilFLYTPARAGGLREEQQRTGRKFASMSERELAEICQRHGWELLGPPPL*
Ga0070763_1049338813300005610SoilVTARAPAAETARVLFLYTPAKAGGLIEEQQRTGRKFFSMSERESAEILERHGWELLGPSPL*
Ga0068861_10269983313300005719Switchgrass RhizosphereLYTPAKAGGLIEEQQRTGRKFGSMNERELADTLQRHGWELLGPSPL*
Ga0066903_10406552913300005764Tropical Forest SoilVLFLYTPAGAGGLIEEQQRTGSNFSSMSAPERAEMLRRYDWELLGPSPL*
Ga0066903_10566974913300005764Tropical Forest SoilLYTPAGAGGLLEEQQRTHRTITSINEDEGADLRRRHGWEIVGLTPL*
Ga0066903_10570238213300005764Tropical Forest SoilGLIEEQQRTGRKFASMNERELAALLQRHGWELLGPSPA*
Ga0066903_10615826213300005764Tropical Forest SoilLYTPGKAGGLIEEQQRTGSNFSTMTETARAEMLRRYGWELLGPSPL*
Ga0066903_10649964713300005764Tropical Forest SoilGVQHAWKSSGAETARVLFLYTPGDAGRLVEEQQRTQDAIASMSEREKAEQRQRYGWEIVGPNPL*
Ga0070766_1129747023300005921SoilFLYTPAKAGGLIEEQQRTGSKFASMNERELSGILQRHGWELLGPSPL*
Ga0070717_1011689053300006028Corn, Switchgrass And Miscanthus RhizosphereVLCLYAPAKAGGLIEEQHKIQRNFAAMTEADRTDMLRRHGWELLGPSPL*
Ga0066651_1003569913300006031SoilVLFLYTPAGAGGLLEEQQRTQGTTASRNEREAAELRQRYGWEIVGPNPL*
Ga0075365_1133943313300006038Populus EndosphereLYTPAKAGGLIEEQQESGRGFSAMSQGELADLLQRHGWELFGASPL*
Ga0070765_10016492433300006176SoilAKAGGLIEEQHQTQRNFAAMSETELADTLQRHGWELLRPSPL*
Ga0070765_10038913113300006176SoilAQAGRVLFLYTPAKAGGLIEEQQRTGSKFASMNERELADILGRHGWELLGPSPL*
Ga0070765_10108321723300006176SoilVLFLYTPAKAGGLIEEQRRTGRKFANMSEQELAAFLDRHGWEMAGESPL*
Ga0075369_1062682323300006186Populus EndosphereEEQQRIGRGFNSMTKQELADLLQRHGWELFGASPL*
Ga0079222_1103331213300006755Agricultural SoilKSTGADTGRALFLYTPAKAGGLIEEQHRIRRKFAEMSERELSDMLQRHGWELLGPSPL*
Ga0079220_1128539013300006806Agricultural SoilTGRILFLYIPAEAGRLVEEQQRTQHTTASMSDREKTEQLQRHGWQIVGPNPL*
Ga0075425_10143910613300006854Populus RhizosphereKNSGAETGRVLFLYTPARAGGLLEEQQRTDRTFAAMDEREAAELRRRHGWQIVGPTPL*
Ga0075436_10042124523300006914Populus RhizosphereVLFLYTPARAGGLIEEQQRTGRKFGSMNERELADILERHGWELLGPSPL*
Ga0099794_1015840623300007265Vadose Zone SoilAGGLIEEQQQTGRKFGSMNERELAEILQRHGWELLGPSPL*
Ga0066710_10246203313300009012Grasslands SoilLFLYTPAGAGALVEEQQRTQDTIASMNEREKAEQRQRYGWEIVGPNPL
Ga0066710_10353444913300009012Grasslands SoilKNSGAETGRVLFLYTPARAGGLLEEQQRTDRTFAAMDEREATELRRRHGWEIVGPSPL
Ga0099828_1196145413300009089Vadose Zone SoilKSTGVQAGRVLFLYTPARAGGLIEEQQRTGRKFGSMNERELADILQRHGWELLGPSPL*
Ga0099827_1118865013300009090Vadose Zone SoilGLLEEQQRTDRTFAAMDEREATELRRRHGWEIVGPSPL*
Ga0099827_1159966313300009090Vadose Zone SoilLYTPGGAGGLIEEQQRTGRTFASMDERELADMLQRHGWKIVGPSPL*
Ga0105245_1266782813300009098Miscanthus RhizosphereSTGAETGRVLFLYTPGKAGDLIEEQQRTRRSFSAMSEDELAGFLQRHGWEILGPSPL*
Ga0075418_1042888613300009100Populus RhizosphereGAGGLLEEQQRTQGTSRNEREAAELRPRYGWETVGPNPL*
Ga0099792_1008016313300009143Vadose Zone SoilPAKAGGLIEEQHRTQRNFASMSEAELADMLQRHGWELLGPSPL*
Ga0099792_1064413723300009143Vadose Zone SoilVLFLYTPAKAGGLIEEQQRTGRGFRSMNERELADTLQRHGWALLGPSPL*
Ga0116144_1024807013300009687Anaerobic Digestor SludgeGRVLILYTPAKAGGLIEEQERTGRKFGAMDAAELADILSRHGWSLLGPSPL*
Ga0105077_10039213300009793Groundwater SandGAGGLLEEQQRTQGTTASMNEHEKAEQRQRYGWVSVR*
Ga0126380_1088079423300010043Tropical Forest SoilRVLFLYTPGKAGGLIEEQQRTCSNFSTMTETARAEMLRRYGWELLGPAPL*
Ga0126384_1114698623300010046Tropical Forest SoilAAAGGLIEEQQRTGREFASINARGLADILQRHGWELLGPSPL*
Ga0126373_1167944623300010048Tropical Forest SoilGAETGRVLFLYTPAKAGGLLEEQQRTGRSLAEIGERHGWELLGPSPL*
Ga0123356_1069670333300010049Termite GutEAGRVLFLYTPGKAGGFIEEQQTTERKLSSMTERERAAICERHGWELLGPSPL*
Ga0099796_1027272513300010159Vadose Zone SoilGLIEEQHRTQRNFASMSERELSETLQRHGWELLGPSPL*
Ga0134088_1055999123300010304Grasslands SoilLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL*
Ga0134111_1007149213300010329Grasslands SoilEEQQRTQDTIASMNEREKAEQRQRYGWEIIGPNPL*
Ga0134080_1001272513300010333Grasslands SoilAETGRVLFLYTPARAGGLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL*
Ga0134063_1060137423300010335Grasslands SoilVLFLYTPAGAGALVEEQQRTQDTIASMNEREKAEQRQRYGWEIIGPNPL*
Ga0134071_1037950823300010336Grasslands SoilLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL*
Ga0126376_1072598243300010359Tropical Forest SoilEEQHRTRRGFASMNERELTDILDRHSWELLGPSPL*
Ga0126372_1034543833300010360Tropical Forest SoilLFLYTLARAGGLIEEQQRTGRKFAQMSERELAETLERHGWELLGPSPL*
Ga0126372_1327636013300010360Tropical Forest SoilETGRVLFLYTPASAGGLIEEQQRTGRKFGSMNDRELADILQRHGWELLGPSPL*
Ga0126378_1095996213300010361Tropical Forest SoilKAGGLIEEQQRTGRGFRSMDERELADFLERHGWEVLGPSPL*
Ga0126378_1146730413300010361Tropical Forest SoilRVLFLYTPARAGGLIEEQQRTGRKFAQMSESELAETLERHGWELLGPSPL*
Ga0126378_1175023923300010361Tropical Forest SoilETGRVLFLYTPAKAGGLIEEQQRTGRGFRSMNDRELADILQRHGWELLGPSPL*
Ga0126377_1096658023300010362Tropical Forest SoilYIPAGAGGLVEEQQRTHHPIASMNEREAAGLRQRHGWEIVGPTPL*
Ga0126379_1032918433300010366Tropical Forest SoilGGLIEEQQRTGRGFASMNEREAAELRQRYGWELLGPSPL*
Ga0134128_1076145523300010373Terrestrial SoilMPGRAPADTGRALFLYTPAKAGGLIEEQHRIRRKFAEMSERELSDMLQRHGWELLGPSPL
Ga0134128_1094353713300010373Terrestrial SoilGLIDEQRRTGRTFSSMNESELADILHRHGWQLLRPSPL*
Ga0126381_10059304333300010376Tropical Forest SoilAGGLVEEQQRTQHTIASMSEPEKAEQRGRYGWEIVGPNPL*
Ga0126354_104242913300010857Boreal Forest SoilLIEEQQQTGRKFGSMNERELADILQRHGWELLGPSPL*
Ga0124850_109195833300010863Tropical Forest SoilFLYTPAGAGGLIEEQQRTGSNFSSMSAPERAEMLRRYDWELLGPSPL*
Ga0126350_1223907423300010880Boreal Forest SoilLIEEQQRTGRTFGSMNERELAEILQRHGWELLGPSPL*
Ga0105246_1141976923300011119Miscanthus RhizosphereEQHRIRRKFAEMSERELSDMLQRHGWELLGPSPL*
Ga0137393_1151342723300011271Vadose Zone SoilTGAETGRVLFLYTPAKAGGLVEEQQRTGRGFASMNEHELTELLRRHGWELLGPSPL*
Ga0137388_1166625323300012189Vadose Zone SoilGAGGLIEEQQRTGRKFGSMDERELADILERHGWELLGPSPL*
Ga0137363_1029362433300012202Vadose Zone SoilVYTPAKAGGLFEEQHRTQRNFASMSEVELADMLQRHGWELLGPSPL*
Ga0137363_1139358713300012202Vadose Zone SoilTGRVLFLYTPAKAGGLIEEQQRTGRKFGSMNERELAEILQRHGWELLGPSPL*
Ga0137399_1168793723300012203Vadose Zone SoilIEEQQRTGRGFRSMNEHELTELLRRHGWELLGSSPL*
Ga0137380_1148942923300012206Vadose Zone SoilIEEQQRTGSKFGSMNEQELAEILERHGWELLGPSPL*
Ga0137379_1071602223300012209Vadose Zone SoilYTPAGAGGLIEEQQRTGRKFASMDERELADILERHGWELLGPSPL*
Ga0150985_11379994813300012212Avena Fatua RhizosphereYTPAKAGGLIEEQQRTGRGFRSMNAHELAEILQRHGWELLGPSPL*
Ga0137386_1123544023300012351Vadose Zone SoilETGRVLFLYTPARAGGLIEEQQRTGRGFASMTEREAAELRQRYGWELLGPSPL*
Ga0137361_1037171933300012362Vadose Zone SoilLIEEQQRTGRKFASMDERELADILERHGWELLGPSPL*
Ga0137361_1050848113300012362Vadose Zone SoilSTGAETGRVLFLYTPARAGGLIEEQQQTGRKFGSMNERELAEILQRHGWELLGPSPL*
Ga0137390_1199579413300012363Vadose Zone SoilGRVLFLYTPAKAGGLIEEQQRTGRKFGSMNERELAEILQRHGWELLGPSPL*
Ga0150984_10010137023300012469Avena Fatua RhizosphereVLFLYTPGKADGLIEEQQRTRRSFSAMSEEELAAFLQRHGWEVLGPSPL*
Ga0137397_1075025323300012685Vadose Zone SoilETGRVLFLYTPARAGGLIEEQQRTGRKFGSMNERELADILQRHGWELLGLSPL*
Ga0137395_1057278223300012917Vadose Zone SoilAGGLIEEQQRTGSKFSSMTETERAEMLRRYGWELLGPSPL*
Ga0137395_1130123513300012917Vadose Zone SoilGGLIEEQQQTGRGFASMTEREAAELRQRYGWELLGPSPL*
Ga0137394_1034536833300012922Vadose Zone SoilADTGRVLFLYTPGGAGGLIEEQQRTGRTFASMSEGELAEMLERHGWEIVGPSPL*
Ga0137413_1122705823300012924Vadose Zone SoilGAETGRVLFVYTPAKAGGLIEEQHRTQRNFASMSEAELADMLQRHGWELLGPSPL*
Ga0137419_1078404923300012925Vadose Zone SoilEEQQQTGSNFSSMTETERAEMLQHYGWELLGPSPL*
Ga0137416_1156730923300012927Vadose Zone SoilARAGGLIEEQQQTGRKFGSMNEGELAEILPRHGWELLGPSPL*
Ga0126369_1214673213300012971Tropical Forest SoilAKAGGLLEEQHRTRRGFASMNERELTDILDRHGWELLGRSPL*
Ga0164308_1223029813300012985SoilTETGRALFLYTPARAGGLIEEQQRTGRKFASMTKAELAEMLQRHGWELLGPSPL*
Ga0163163_1225943023300014325Switchgrass RhizosphereLFLYTPAGAGGLIEEQQRTGSGFSSMTDAERAEMLRRYGWELLGPSPL*
Ga0182024_1110090123300014501PermafrostTGAESGKVLFLYTPAKAGGLIEEQRRTGRKFAEMSESELAEILNRHGWEMVGPSPL*
Ga0137412_1043594813300015242Vadose Zone SoilVLFLYTPAKAGGLIEEQQRTGRTIGSMNERELAEILQRHGWELLGPSPI*
Ga0132258_1336717423300015371Arabidopsis RhizosphereLIEEQQRTGRKFAAMTKPELTEMLQRHGWELLGPSPL*
Ga0132256_10231375013300015372Arabidopsis RhizospherePARAGGLIEEQQRTGSGFSSMTETERAAMLQRYGWELLGPSPL*
Ga0182036_1095044413300016270SoilEEQQRTGSTFSSMSAPERAEMLRRYDWELLGPSPL
Ga0182034_1016332933300016371SoilVLFLYTPAKAGGLLEEQQRTGRKLAEIGERHGWEVLGPSPL
Ga0182037_1167072313300016404SoilTPARAGGLLEEQQRTERTFASLNDREAAELRQRHGWEIVGPSPL
Ga0182039_1012578153300016422SoilYAPAKAGGFIEEQHRSERTRASMTESERAEMLQCYGRELLGPSPL
Ga0182039_1065240613300016422SoilTPAKAGGLLEEHHRTRRGFASMNERELTGILDRHGWELLGPSPL
Ga0134074_103437943300017657Grasslands SoilGGLLEEQQRTHHPIASMGDREAAELRQRHGWEIVGPTPL
Ga0187874_1030272523300018019PeatlandADTGRVLFLYTPARAGGLIVEQQRTGRKFASMNEHELADILQRHGWELLGPSPL
Ga0187859_1003849733300018047PeatlandGLIVEQQRTGRKFASMNEHELADILQRHGWELLGPSPL
Ga0187766_1065251323300018058Tropical PeatlandGAETARVLFLYTPGKAGGLIEEQLQTGRKLFQLSERERADTLERHGWELLGPSPL
Ga0066662_1212366713300018468Grasslands SoilLYTPARAGGLLEEQQRTDRTFAAMGEREAVELRRRHGWEIVGPSPL
Ga0190274_1039335013300018476SoilADTGRVLFLYTPAKAGGLLEEQHNTQCSFAQMSEAELADVLQRHGWELLGPSPLFGPSPL
Ga0066669_1002102463300018482Grasslands SoilFLYTPAKAGGLIEEQHRTQRSFSMMTEQERTDTLQRHGWELLGPSPL
Ga0193733_109523513300020022SoilSWKNTGAATGRVLFVYTPAKAGGLIEEQHRTQRNFASMSETELADMLQRHGWELLGPSPL
Ga0210407_1076696633300020579SoilRAGGLREEQQRTGRKFASMSERELAEICQRHGWELLGPPPL
Ga0210407_1109541823300020579SoilAWKSTGAETGRVLFLYAPARAGGLIEEQQRTGRNFASMTEREATELRQRYGWKLLGPSPL
Ga0210403_1123594813300020580SoilTGKVLFLYTPAKAGGLIEEQHQTQRNFAAMSETELADTLQRHGWELLGPSPL
Ga0210399_1049610533300020581SoilVLFLYTPARAGGLIEEQQQTGRKFGSMNERELAEMLQHHGWELLGPSPL
Ga0210399_1063212123300020581SoilIEEQHQTQRNFAAMSESELADTLQRHGWDLLGPSPL
Ga0210399_1145376723300020581SoilGLIEEQQRTGRKFAMMGKEEMDDILDRHGWMIVGPSPLET
Ga0210401_1002196153300020583SoilLYTPAKAGGLIEEQHQTQRNFAAMSETELADTLQRHGWELLRPSPL
Ga0179584_142201833300021151Vadose Zone SoilLLEEQQRTNRKFASMSEREVIEICQRHGWEILGPSPL
Ga0210406_1054286413300021168SoilRAGGLIEEQQRTGRKFASMNERELAEICQRHGWELLGPPPL
Ga0210400_1137796223300021170SoilKSTGAEAGRVLFLNTPARAGGLIEEQQRTGRKFGSMNELEAAELRQRYGWELLGPSPL
Ga0210405_1045383513300021171SoilYTPAKAGGLIEEQQRTGRKFGSMNERELAETLQRHGWELLGPSPL
Ga0210408_1007391213300021178SoilAGAQQQTGRKFGSMNERELAEMLQHHGWELLGPSPL
Ga0210408_1007745763300021178SoilRALFLYTPARAGGLIEEQQRTGRKFASMTKAELAEMLQRHGWELLGPSPL
Ga0210408_1008180013300021178SoilGGLIEEQQRTGGKFASMNERELAEILQRHGWELLGPSPL
Ga0210408_1072331033300021178SoilAGGFIEEQHRTERTRASMTESERAEMLQRYGWELLGPSPL
Ga0210396_1036539323300021180SoilWKVTGPETSHVLFLYTPAKAGGLIEEQQRTGRKFAMMGKEEMDDILDRHGWMIVGPSPLE
Ga0210388_1088977423300021181SoilRVLFLYTPARAGGLIEEQQRTGRKFGAMNERELADILERHGWELLGPSPL
Ga0213882_1011595823300021362Exposed RockKAGGLIEEQHQIRRKFAEMSERELSDMLQRHGWELLGESPL
Ga0213876_1080578523300021384Plant RootsPAKAGALVEEQQRTGRKFASMSEAELAEFCRRHGWEIVGPSPL
Ga0210393_1078606733300021401SoilTVGGLLEERHRTGRNFASMNEREVTEICQRYGWEIVGPPPL
Ga0210385_1119077823300021402SoilVLFLYTPAKAGGLIEEQRRTGRKFANMSEQELAAFLDRHGWEMAGESPL
Ga0210387_1046450713300021405SoilSQVLFLYTPAKAGGLIEEQQRTGRKFGSMDERELADILERHGWQLLGPSPL
Ga0210386_1145512123300021406SoilFLYTLAKAGGLIEEQQRTGRKFGSMNERELAETLQRHGWELLGPSPL
Ga0210383_1056617823300021407SoilRGVPHAWKSTGAETARVLFLHTPAKVGGLLEERHRTGRNFASMNECEVTEICQRYGWEIVRPPPL
Ga0210402_1094290313300021478SoilEEQQQTGSNFSSMTETERAEMLQRYGWELLGPSPL
Ga0210409_1055810933300021559SoilEERHRTGRNFASMNEREVTEICQRYGWEIVGPPPL
Ga0213880_1019675123300021953Exposed RockLFLYTPAKAGGLIEEQHRIRRKFSEMSESELSAMLRRHGWELLGDSPL
Ga0207653_1017578413300025885Corn, Switchgrass And Miscanthus RhizosphereLIEEQQRTGSNFSSMTETERAEMLQRCGWELLGPSPL
Ga0207684_1033836433300025910Corn, Switchgrass And Miscanthus RhizosphereVLFLYTPAGAGGFIEEQHRMQSKLASMTQQERTDMRQRHGWGLLGPSPL
Ga0207693_1015814013300025915Corn, Switchgrass And Miscanthus RhizosphereYTPAKAGGLIEEQHRIRRKFAEMSERELSDMLRRHGWELLGPSPL
Ga0207693_1070283323300025915Corn, Switchgrass And Miscanthus RhizosphereYTPAKAGGLIEEQHRIRRKFAEMSERELSDMLQRHGWELLGPSPL
Ga0207663_1030454833300025916Corn, Switchgrass And Miscanthus RhizosphereVLFLYTPAAAGGLIEEQQQTGSNFSSMSPTERAEMLRRHDWELLGPSPL
Ga0207641_1162097223300026088Switchgrass RhizosphereKAGGLIEEQHRIRRKFAEMSERELSDMLRRHGWELLGPSPL
Ga0207675_10236151823300026118Switchgrass RhizosphereKAGGLIEEQQRTGRKFGSMNERELADTLQRHGWELLGPSPL
Ga0209266_107848213300026327SoilGLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL
Ga0209473_125662313300026330SoilAETARVLFLYTPAAAGALVEEQQRTHGTTESMNEREKDEQRRRYGWEIVGPNPL
Ga0209158_115192633300026333SoilAKAGGLIEEQYRIQRNFAAMSEAERADMLQRHGWELLGPSPL
Ga0257151_103705113300026341SoilFYTPARAGGLIEEQQRTGSKFSSMTETERAEMLRRYGWELLGPSPL
Ga0209159_104108543300026343SoilLFLYTPAGAGGLLEEQQRTQGTTASMNEREKAEQRQRYGWEIIGPNPL
Ga0209161_1047876723300026548SoilGRVLFLYTPARAGGLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL
Ga0209161_1054726913300026548SoilPAGAGGLLEEQQRTHHPIASMGDREAAELRQRHGWEIVGPTPL
Ga0209474_1037204423300026550SoilHAWKSTGIETGRVLFVYTPAKAGGLIEEQYRTKRNFSSMSEAELADTLQRHGWELLGPSP
Ga0179587_1098549613300026557Vadose Zone SoilYTPAKAGGLIEEQQRTGRGFRSMNEHELTELLRRHGWELLGPSPL
Ga0208859_103462923300027069Forest SoilKAGGLIEEQQRTGRKFASMNERELADILERHGWELLGPSPL
Ga0208983_106314613300027381Forest SoilIEEQQQTGSNFSSMTETERAEMLQRHGWQLLGPSPL
Ga0209689_132266013300027748SoilARAGGLLEEQQRTDRTFAAMDEREAAELRRRHGWEIVGPSPL
Ga0209693_1012691933300027855SoilFLYTPAKAGGLIEEQQRTGSKFASMNERELADILGRHGWELLGPSPL
Ga0209693_1038011913300027855SoilVTARAPAAETARVLFLYTPAKAGGLIEEQQRTGRKFFSMSERESAEILERHGWELLGPSP
Ga0209283_1037292423300027875Vadose Zone SoilFLYTPARAGGLIEEQQRTGRVFASMTEREAAELRQRYGWELLGPSPL
Ga0209283_1095656213300027875Vadose Zone SoilTGKVLFLYTPARAGGLIEEQQRTRRGFASMNERELAEILERHGWELLGASLL
Ga0209062_103806813300027965Surface SoilIEEQEQTGRKFSAMGERELADFLERHGWEIVGPSPL
Ga0265337_120668123300028556RhizosphereDKAGGLIEEQQWTGRKFASMNERELAEILQRHGWELLGPSLL
Ga0222749_1035017523300029636SoilRVLFLYTPARAGGLIEEMQRTGRKFADMSEAELADFLQRHGWELLGPSPL
Ga0302184_1017113423300030490PalsaRVLFLYTPAKAGGLLEEQHRTQRKFASMTECEVAEICQRHGWEIVGPSPL
Ga0102770_1032758313300030981SoilGLLEEQQRTNRKFASMNEHELMEICQRHGWEILGPSPL
Ga0170834_10336932623300031057Forest SoilGVEPGRVLFLDTPARAGGLIEEQQRTGRKFAQMNERELAETLQRYGWELLAPSPL
Ga0170824_12386180223300031231Forest SoilGLIEEQQQTGSGFSSMTEIERVEMLQRHGWELLGPSPL
Ga0265340_1022631213300031247RhizosphereEQQRTGRTFVSMNECELAGILQRHGWELLGPSPLFAL
(restricted) Ga0255312_108669623300031248Sandy SoilFLYTPAKAGGLIEEQRRTGRTFSSMTERELAEMLERHGWKLLGPSPL
Ga0318534_1013860733300031544SoilAGLVLFLYTPAGAGGLIEEQQRTGSNFSSMSAAARAEMLRRYDWELLGPSPL
Ga0318538_1065175713300031546SoilYTPAGAGGLIEEQQRTGSNFSSMSAAARAEMLRRYDWELLGPSPL
Ga0318573_1007447733300031564SoilLYTPAKAGGLLEEQQRTGRKLAEIGERHGWEVLGPSPL
Ga0318542_1033886323300031668SoilKSTGAETGRVLFLYTPAKAGGLLEEQQRTGRKLAEIGERHGWEVLGPSPL
Ga0310686_10487561923300031708SoilVTARAPAAETARVLFLYTPAKAGGLIEEQQLTGRKFFSMSERESAEILERHGWELLGPSP
Ga0318493_1038809423300031723SoilYTPAKAGGLIEEQQRTGRGFASMNEREAAELRQRYGWELLGPSPL
Ga0306918_1050646733300031744SoilGGFIEEQHRSERTRASMTESERAEMLQCYGRELLGPSPL
Ga0318502_1018321023300031747SoilKSTGAETGRGLFLYTPAKAGGLLEEHHRTRRGFASMNERELTGILDRHGWELLGPSPL
Ga0318502_1072920733300031747SoilLFLYTPAQAGGLIEEQQRTGRKFSSMNKRELADMLERHGWELLGPSPL
Ga0318492_1048552023300031748SoilMPGKAPVLFLYTPAGAGGLIEEQQRTGSNFSSMSAAARAEMLRRYDWELLGPSPL
Ga0318552_1009176333300031782SoilLYTPAGAGGLIEEQQRTGSNFSSMSAAARAEMLRRYDWELLGPSPL
Ga0318548_1048548933300031793SoilFLYTPAKAGGLIEEQQRTGRKFSSMNKRELADMLERHGWELLGPSPL
Ga0307473_1155554123300031820Hardwood Forest SoilILFLYTPAKAGGLLEEQHRIQRNFAAMSEPELADMLQRHGWELLGPSPL
Ga0307478_1100668023300031823Hardwood Forest SoilHAWKATGPQTSHVLFLYTPAKAGGLIEEQQRTGRKFAMMGKEEMDDILDRHGWMIVGPSPLKT
Ga0310917_1051910413300031833SoilVLFLYAPAKAGGFIEEQHRSERTRASMTESERAEMLQCYGRELLGPSPL
Ga0318511_1053029423300031845SoilLETARVLFLYTPAKAGGFIEEQHRTERTLASMTESERAEMLRRHGWELLGPSPL
Ga0310892_1132748013300031858SoilGGLFEEQQRTHRTIASMSEKEAAELRQRHGWEIVGPTPL
Ga0306919_1064933513300031879SoilFLYTPAQAGGLIEEQQRTGRKFSSMNKRELADMLERHGWELLGPSPL
Ga0306925_1046992033300031890SoilGLVLFLYTPAGAGGLIEEQQRTGSNFSSMSAAARAEMLRRYDWELLGPSPL
Ga0318520_1062789613300031897SoilADGGRALFLYTPGKAGGLIEEQQRTGRGFRSMDERELADILQRHGWELLGPSPL
Ga0306921_1194442913300031912SoilAGSGRVLFLYTPARAGGLLEEQQRMDRTVASMDEREAAELRQRHGWEIVGPSPL
Ga0310912_1070386713300031941SoilGGLIEEQQRTGRGFASMNEREAAELRQRYGWELLGPSPL
Ga0310910_1002672663300031946SoilTPAKAGGLIEEQQRTGRKFSSMNKRELADMLERHGWELLGPSPL
Ga0310909_1083708423300031947SoilKAGGFIEEQHRSERTRASMTESERAEMLQRYGWELLGPSPL
Ga0318530_1026040213300031959SoilTGAETGRVLFLYTPAKAGGLLEEQQRTGRKLAEIGERHGWEVLGPSPL
Ga0318531_1052702733300031981SoilNTRAETGRALFLYTPAKAGGLIEEQQRTGRKFSSMNKRELADMLERHGWELLGPSPL
Ga0318507_1051747333300032025SoilLYTPTKAGGLIEEQQRTGRKFSSMNKRELADMLERHGWELLGPSPL
Ga0318556_1002391843300032043SoilPAKAGGLIEEQQRTGRGFASMNEREAAELRQRYGWELLGPSPL
Ga0318556_1056047713300032043SoilNTGAETGRALFLYTPAKAGGLIEEQQRTGRKFSSMNKRELADMLERHGWELLGPSPL
Ga0318575_1008041633300032055SoilFLYTPAGAGGLIEEQQRTGSNFSSMSAAARAEMLRRYDWELLGPSPL
Ga0318533_1092976813300032059SoilLLEEQQRMDRTVASMDEREAAELRQRHGWEIVGPSPL
Ga0318513_1047222923300032065SoilAEAGLVLFLYTPAGAGGLIEEQQRTGSNFSSMSAAARAEMLRRYDWELLGPSPL
Ga0318553_1040198213300032068SoilLYTPGKAGGLIEEQQRTGRGFRSMDERELADILQRHGWELLGPSPL
Ga0306924_1089351533300032076SoilGGLIEEQQRTGRKFAQMSERELAETLQRHGWELLGPSPL
Ga0318525_1029424713300032089SoilLYTPTKAGGLIEEQQRTGRKFGSMNERELADILQRHGWELLGPSPL
Ga0318540_1018004013300032094SoilRSIGPETARVLFLYTPAKAGGFIEEQHRTERTLASMTESERAEMLRRHGWELLGPSPL
Ga0307470_1038149223300032174Hardwood Forest SoilETGRVLFLYTPARAGGLIEEQQRTGRKFDSMNERELADILQRHGWELLGPSPL
Ga0307471_10174960923300032180Hardwood Forest SoilAGAGGLIEEQQRAGHALASMTEGELTEMRQRHGWEIVGPSPL
Ga0307472_10029760213300032205Hardwood Forest SoilGRVLFLYTPAGAGGLLEEQQRTHHPIASVNEREAAELRQRHGWEIVGPTPL
Ga0306920_10287441013300032261SoilAETARVLFLYTPGKAGGLIEEQQRTGSNFSTMTEIARAEMLRRHGWELLGPSPL
Ga0335073_1067079513300033134SoilGLIEEQQRTGRRFASMSEEELAGILRRHGWELLGPSPL
Ga0370515_0441239_2_1573300034163Untreated Peat SoilGRVLFLYTPAKAGGLLEEQHRTQRKFASMTECEVAEICQRHGWEIVGPSPL
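
In the rows above, the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) run together without separators. The following is a minimal Python sketch of one way to split them. It rests on assumptions drawn only from the rows listed here, not from any documented NMPFamsDB format: the Sample Taxon ID is a 10-digit number beginning with `3300`, the habitat name ends in a lowercase letter, and the sequence consists of uppercase residue letters with an optional trailing `*` (conventionally a stop-codon marker in translated sequences). The names `ROW_PATTERN` and `parse_row` are hypothetical.

```python
import re

# Assumed layout of one fused row (inferred from the listing, not documented):
#   [(restricted) ]<protein id><10-digit taxon id 3300xxxxxx><Habitat><SEQUENCE[*]>
ROW_PATTERN = re.compile(
    r"^(?P<restricted>\(restricted\)\s*)?"   # optional access flag
    r"(?P<protein_id>.+?)"                   # shortest prefix before the taxon ID
    r"(?P<taxon_id>3300\d{6})"               # assumed: 10 digits starting with 3300
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"    # title-case text ending in lowercase
    r"(?P<sequence>[A-Z]+)"                  # residues are all uppercase
    r"(?P<stop>\*)?$"                        # optional trailing '*'
)


def parse_row(line):
    """Split one fused table row into its four columns plus two flags."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"row does not fit the assumed layout: {line!r}")
    return {
        "protein_id": match.group("protein_id"),
        "taxon_id": match.group("taxon_id"),
        "habitat": match.group("habitat"),
        "sequence": match.group("sequence"),
        "ends_with_stop": match.group("stop") is not None,
        "restricted": match.group("restricted") is not None,
    }


example = parse_row(
    "JGIcombinedJ26739_10044084133300002245Forest Soil"
    "VLFLYTPAKAGGLIEEQQRTGRKFASMNERELADILERHGWELLDPSPL*"
)
print(example["protein_id"], example["taxon_id"], example["habitat"])
```

Rows that do not fit these assumptions raise a `ValueError` rather than being split silently, so any exception to the inferred layout is visible.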




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice