NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome Family F049520



Overview

Basic Information
Family ID F049520
Family Type Metagenome
Number of Sequences 146
Average Sequence Length 84 residues
Representative Sequence LLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGLLTLDK
Number of Associated Samples 89
Number of Associated Scaffolds 146

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 8.90 %
% of genes near scaffold ends (potentially truncated) 87.67 %
% of genes from short scaffolds (< 2000 bps) 86.30 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Archaea (54.795 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (36.301 % of family members)
Environment Ontology (ENVO) Unclassified (58.219 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (63.014 % of family members)
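
The percentages in the Overview can be turned back into approximate member counts. The sketch below assumes each percentage is simply the member count divided by the 146 sequences in the family; the page does not state this explicitly. The helper name `members_from_percent` and the dictionary labels are illustrative, not part of NMPFamsDB.

```python
# Minimal sketch: recover the member count behind a reported percentage.
# Assumption (not stated on the page): percent = count / family size * 100.
FAMILY_SIZE = 146  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the integer count whose share of `total` is closest to `percent`."""
    return round(percent / 100 * total)


# Percentages copied from the Overview; labels are shortened for readability.
reported = {
    "Archaea (most common taxonomy)": 54.795,
    "Vadose Zone Soil (GOLD)": 36.301,
    "Unclassified (ENVO)": 58.219,
    "Soil, non-saline (EMPO)": 63.014,
}

counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} members")
```

Under that assumption, the four figures correspond to whole numbers of members, which is a quick sanity check that they share the same denominator.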




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family (146 rows; the viewer ruler runs to column 118). The interactive MSAViewer display is not reproduced here. Row labels are the Protein IDs listed under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 11.01%, β-sheet: 20.18%, Coil/Unstructured: 68.81%
Feature Viewer (representative sequence; axis ticks at positions 10 to 80): tracks for Sequence, α-helices, β-strands, Coil, and SS Conf. score. The interactive display is not reproduced here.

Predicted 3D Structure

Per-residue confidence (pLDDT) color bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.40

Low Quality Model: This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
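
The two confidence rules quoted here, the pTM cutoff of 0.7 and the four pLDDT bands, can be expressed as a small sketch. The function names are hypothetical. How scores that fall between band edges (for example 50.5) are assigned is an assumption, since the page only lists integer ranges.

```python
# Minimal sketch of the confidence rules quoted on this page.
PTM_CUTOFF = 0.7  # models with pTM below this are flagged as low confidence


def is_low_confidence_model(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """True when the pTM-score falls below the cutoff named in the note."""
    return ptm < cutoff


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four listed bands.

    Assumption: each band includes its upper edge, so 50.5 falls in "51-70".
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The model for this family has pTM-score 0.40, so it is flagged.
print(is_low_confidence_model(0.40))
```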



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomy chart (interactive, not reproduced here). Labels: All Organisms, Unclassified. Displayed values: 63.7%, 36.3%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat chart (interactive, not reproduced here). Labels, in page order: Soil; Vadose Zone Soil; Tropical Forest Soil; Grasslands Soil; Soil; Grasslands Soil; Soil; Hardwood Forest Soil; Tropical Peatland; Soil. Displayed values: 36.3%, 13.0%, 30.1%, 15.1%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_100958143 | 3300002558 | Grasslands Soil | VYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL*
JGI25383J37093_101198421 | 3300002560 | Grasslands Soil | RINDAVSLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGLLTLDK*
JGI25383J37093_101392233 | 3300002560 | Grasslands Soil | PRMNDAVSLLMEIRLPDGKWMLDGVYRGWRHPHAMHGEETVSRPEERELITQGWATERSLQLEEAGKPSKWITLQALLVLKRLGLLALT*
JGI25384J37096_101311191 | 3300002561 | Grasslands Soil | LYDFLHGLRILTETGIKDDHRMNDAVRVLMAKRLPDRTWPLDGVYRGWRYSHPMHGLETVSRPEERDLVTEGWGSDRSIQLEEAGKSSRWITLQALLILKRIGLLSLAST*
JGI25384J37096_101573592 | 3300002561 | Grasslands Soil | LQEGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGVLNIGGSELAGVRAH*
JGI25384J37096_101862361 | 3300002561 | Grasslands Soil | AVSLLLEKRLPDGKWVLDGVYRGWRHPHPMHGEETVSRPEERELIAQGWGTERSLQLEEAGKPSKWITLQALLVLKRLGLLSLE*
JGI25382J37095_1000076111 | 3300002562 | Grasslands Soil | VSLLMEKRLPDGKWMLDGVYRGWRHPHAMHGEETVSRPEERELITQGWATERSLQLEEAGKPSKWITLQALLVLKRLGLLALA*
JGI25382J43887_105050051 | 3300002908 | Grasslands Soil | ILDGVYRGWRHSHAMHGEETVSRPEERELITQGWGTERSLQLEEAGKPGKWITLQALLVLKRLGLLSLE*
JGI25386J43895_100256503 | 3300002912 | Grasslands Soil | YRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAAKPSKWVTLQALLVLKRLGLLVPGL*
Ga0066677_101689453 | 3300005171 | Soil | VALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL*
Ga0066683_103698711 | 3300005172 | Soil | PRMDAAVSLLLEKRMPDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGMLNPK*
Ga0066683_108548931 | 3300005172 | Soil | LDGVYRGWRHPHAMHGEETVSRPEERELVTQGWGTERALQLEEAGKPSKWITLQALTALKRLGMLNPGIAQTVNV*
Ga0066680_104228892 | 3300005174 | Soil | LGAKYDPRMGDAVNLLRQKRLPDGKWVLEGVYRGWRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQALLTLKRLGILERLS*
Ga0066680_109593652 | 3300005174 | Soil | LGAKYDPRMGDAVNLLRQKRLPDGKWVLEGVYRGWRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQALLTLKRLGILEGLS*
Ga0066690_102147231 | 3300005177 | Soil | PLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLLESAS*
Ga0066688_105974461 | 3300005178 | Soil | MGDAVNLLRQKRLPDGKWVLEGVYRGRRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQA
Ga0066685_102907462 | 3300005180 | Soil | LLEKRMPDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELVTQGWGTERALQLEEAGKPSKWVTLQALLVLKRLGLLNWESIRP*
Ga0066685_105025631 | 3300005180 | Soil | LLEKRMPDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGMLNPK*
Ga0066678_107873121 | 3300005181 | Soil | GKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAAKPSKWVTLQALLVLKRLGLLVPGL*
Ga0066676_100731535 | 3300005186 | Soil | LDGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRVGLLHLE*
Ga0066676_100948701 | 3300005186 | Soil | PRMDAAVSLLLEKRMPDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELVTQGWGTERALQLEEAGKPSKWVTLQALLVLKRLGLLNWESIRP*
Ga0066675_100347085 | 3300005187 | Soil | LLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERDLITQGWGTERALQIEETGKPSKWLTLQALLVLKRLGLLVSGL*
Ga0066675_102121061 | 3300005187 | Soil | LLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL*
Ga0066686_108965681 | 3300005446 | Soil | GIKNDPRMNDAVALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL*
Ga0066697_102526411 | 3300005540 | Soil | NDPRINDAVSLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGLLTLDK*
Ga0066701_101634064 | 3300005552 | Soil | DGKWALDGVYRGWRTKRPLHGVGAFRPEENEVITHGWDSDTTLQLEEAGKPSKWITLQALLALRRLGLLDQRMPMS*
Ga0066692_103404052 | 3300005555 | Soil | RTDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERDLVIEGWGTDHTLQLEEAGKPSKWVTLQALLILKRLGLLYSWS*
Ga0066704_100884131 | 3300005557 | Soil | NLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAAKPSKWVTLQALLVLKRLGLLVPGL*
Ga0066704_102709851 | 3300005557 | Soil | RMNDAVALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELIVQGWGTERSLQLEEAGKPSKWITLQALLVLKRLGLLAPGSIA*
Ga0066698_100334011 | 3300005558 | Soil | RMSDAANLLLQKRFPDGKWALEGVYRGWRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQALLTLKRLGILERLS*
Ga0066698_108402911 | 3300005558 | Soil | LAELGVPRDDRMDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGIDRTLQLEEAGKPSKWVTLQALLVLKRLGLLEQSS*
Ga0066700_100004941 | 3300005559 | Soil | VYRGWRQSHPMHGTETVSRPEERELVTEGWGTDHTLQLEEAGKPSKWVTLQAFLVLKRLGLLESAP*
Ga0066700_110716602 | 3300005559 | Soil | GRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLIQKRLGLLGPSS*
Ga0066652_1007817651 | 3300006046 | Soil | LLEKRLPDGKWILDGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGMLNPEIAQTVNV*
Ga0066665_102664801 | 3300006796 | Soil | DGKWLLDGVYRGWRHPHAMHGEETVSRPEERELVTQGWGTERALQIEEAGKPSKWITLQALLVLKRLGMLNPK*
Ga0066665_105172321 | 3300006796 | Soil | ELGVPRDERIDDALRMLRAKQLVDGRWPLEAVYRGWRQSHPMHGTETVSRPEERELVTEGWGTDHTLQLEEAGKPSKWVTLQAFLVLKRLGLLESAP*
Ga0066659_106553562 | 3300006797 | Soil | LEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGKDRTLQLEEAGKPSKWVTLQALLVMKRLGLLEPSS*
Ga0066659_110549242 | 3300006797 | Soil | DAVNLLRQKRLPDGKWVLEGVYRGWRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQALLTLKRLGILERLS*
Ga0066660_104705212 | 3300006800 | Soil | LRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDHTLQLEEAGKPSKWVTLQALLTLKRLGLLYSWS*
Ga0099794_100102621 | 3300007265 | Vadose Zone Soil | ALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLPNPTS*
Ga0099794_106687672 | 3300007265 | Vadose Zone Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQTLLILKRLGLPNPTS*
Ga0099829_115473911 | 3300009038 | Vadose Zone Soil | RDERMDDALRMLRAKQLVDGRWPLEAVYRGWRQSHPMHGTETVSRPEERELVTDGWGTDHTLQVEEAGKPSKWVTLQALLILKRLGTPVS*
Ga0099830_100264651 | 3300009088 | Vadose Zone Soil | ELEVPRDERMDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLLNPTF*
Ga0099830_101993991 | 3300009088 | Vadose Zone Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEASKPSKWVTLQALLILKRLGFLESSS*
Ga0099830_102566691 | 3300009088 | Vadose Zone Soil | PREERMDDALRMLRAKPLVDGRWPLEAVYRGWRQSHPMHGTETVSRPEERELVTEGWGTDHTLQVEEAGKPSKWVTLQALLVLKRLGPAGSES*
Ga0099830_109522181 | 3300009088 | Vadose Zone Soil | MDDALRMLRAKQLVDERWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLLESLS*
Ga0099830_113192161 | 3300009088 | Vadose Zone Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGIETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLLYS*
Ga0099828_101202591 | 3300009089 | Vadose Zone Soil | ESGVDDPRMDDAVRILLAKRLPDGKWPLEGVYRGWRHAHPMHGLETVSRPEEREIITEGWGTERAIQLEEAGKPSKWITLQALLVLKRIGLLSLN*
Ga0099827_100409525 | 3300009090 | Vadose Zone Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELITEGWGTDRTLQLEEAGKPSKWVTLQALLILIRLGLSEPSS*
Ga0099827_105798913 | 3300009090 | Vadose Zone Soil | EKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGEPSKWITLQALLVLKRLGLLAIEQ*
Ga0099827_110233161 | 3300009090 | Vadose Zone Soil | DAVRVLMAKRLPDGRWPLEGAYRGWRHAHPMHGLETVSRPEEREIVTEGWGTERAIQLEEAGKPSKWITLQALLVLKRMGLLSIDSGA*
Ga0066709_1022209571 | 3300009137 | Grasslands Soil | MGDAVNLLRQKRLPDGKWVLEGVYRGWRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQALLTLRRLGILERLS*
Ga0066709_1026270022 | 3300009137 | Grasslands Soil | VYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVPGL*
Ga0134082_101057351 | 3300010303 | Grasslands Soil | RLPDGKWTLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQSLLVLKRLGLLSLK*
Ga0134088_101884771 | 3300010304 | Grasslands Soil | KDDHRMNDAVRVLMAKRLPDGRWPLDGVYRGWRHPHPMHGLETVSRPEERDLVTEGWGSDRSIQLEEAGKSSRWITLQALLILKRMGLLSLAST*
Ga0134088_102243041 | 3300010304 | Grasslands Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSYPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLVLKRLGLLEQSS*
Ga0134088_105909501 | 3300010304 | Grasslands Soil | ILTETGIKDDHRMNDAVRVLMAKRLPDGRWPLDGVYRGWRYSHPMHGLETVSRPEERDLVTEGWGSDRSIQLEEAGKSSRWITLQALLILKRIGLLSLAST*
Ga0134111_101119191 | 3300010329 | Grasslands Soil | PRDDRMDDALRMLRAKQLVDGRWPLEAVYRGWRHSYPMHGTETVSRPEERELVTEGWGIDRTLQLEEAGKPSKWVTLQALLVLKRLGLLEQSS*
Ga0134071_100591954 | 3300010336 | Grasslands Soil | GLRILTETGIKDDHRMNDAVRVLMAKRLPDGRWPLDGVYRGWRYSHPMHGLETVSRPEERDLVTEGWGSDRSIQLEEAGKSSRWITLQALLILKRIGLLSLAST*
Ga0134071_102741522 | 3300010336 | Grasslands Soil | DGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGMLNPK*
Ga0126372_117858412 | 3300010360 | Tropical Forest Soil | QPDGKWILDGVYRGWRHAHSMHGEETVSRPEERELITEGWGTEHTLQIEEAGKPSKWITLQGLIVLKRLGLLHPGYLHLPSP*
Ga0137391_101696271 | 3300011270 | Vadose Zone Soil | MEDALRMLRAKQLVDGRWPLEAVYRGWRQSHPMHGTETVSRPEERELVTEGWGTDHTLQVEEAGKPSKWVTLQALLVLKRLGPAESQL*
Ga0137391_114254671 | 3300011270 | Vadose Zone Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLLNPSS*
Ga0137393_108229261 | 3300011271 | Vadose Zone Soil | RLPDGKWVLDGVYRGWRHPHAMHGQETVSRPEERELITTGWGTERTLQLEEAGKPSKWITLQALLILKRLGILDIE*
Ga0137393_117324801 | 3300011271 | Vadose Zone Soil | RVLMAKRLPDGRWPVDGVYRGWRHPHPMHGLETVSRPEERDLVTEGWGSERSIQLEEAGKASKWITLQALLILKRIGLLSLAAP*
Ga0137389_104312281 | 3300012096 | Vadose Zone Soil | AVSLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERASQIEEAGKPSKWITLQALLVLKRLGLLALEQ*
Ga0137389_118328532 | 3300012096 | Vadose Zone Soil | ERMGDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLELLESQS*
Ga0137399_101018723 | 3300012203 | Vadose Zone Soil | VLSELEVPKDERMDDALRMLRAKQLIDGRWPLEAVYRGWRQSHPMHGTETVSRPEERELVTEGWGTDRTLQLEDAGKPSKWVTLQALLILKRLGLLESTS*
Ga0137399_103294022 | 3300012203 | Vadose Zone Soil | RMLRAKQLVDGRWPLEAVYRGWRHSHPMHGAETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGSPYS*
Ga0137380_103512801 | 3300012206 | Vadose Zone Soil | ALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLLPGL*
Ga0137380_103558172 | 3300012206 | Vadose Zone Soil | SLLLEKRLPDGKWLLDGVYRGWRHPHAMHGQETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGMLNLGVAQIVNV*
Ga0137380_103780653 | 3300012206 | Vadose Zone Soil | EGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAAKPSKWVTLQALLVLKRLGLLVPGL*
Ga0137380_104669833 | 3300012206 | Vadose Zone Soil | SPEGKWILDGVYRGWRHPHAMHGGEFVARPEERELITQGWGSERSLQLEEAGKPSKWITLQSLLVLKRLGLLSLK*
Ga0137380_115094002 | 3300012206 | Vadose Zone Soil | SLLLEKRLPDGKWLLDGVYRGWRHPHAMHGQETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGMLNLGVAQTVNV*
Ga0137381_102159701 | 3300012207 | Vadose Zone Soil | RDERMDDALRMLRAKQLVDGRWPLEAVYRGRRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLLESAS*
Ga0137381_102261033 | 3300012207 | Vadose Zone Soil | GIKNDPRMNDAVALLLEKRTPEGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAAKPSKWVTLQALLVLKRLGLLVPGL*
Ga0137381_103728594 | 3300012207 | Vadose Zone Soil | ALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL*
Ga0137381_109557641 | 3300012207 | Vadose Zone Soil | QRMNDAVRVLMAKRLPDGRWPLDGVYRGWRHPHPMHGLETVSRPEERDLVTEGWGSDRSIQLEEAGKASKWITLQALLILKRIGLLSLAAP*
Ga0137378_101828041 | 3300012210 | Vadose Zone Soil | DNAISLLLEKRLPDGKWLLDGVYRGWRHPHAMHGQETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGMLNLGVAQTVNV*
Ga0137378_111200281 | 3300012210 | Vadose Zone Soil | MLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWITLQALLIMRRLGLLNSSS*
Ga0137377_116122031 | 3300012211 | Vadose Zone Soil | GVYRGWRHPNAMHGEETVSRPEERELITQGWGTEKALQVEEAGKPSKWITLQALLVLKRLGLFKPE*
Ga0137366_100675125 | 3300012354 | Vadose Zone Soil | ELGVKNDPRMNDAVSLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKSSKWITLQALLVLKRLGLLVLEG*
Ga0137384_102766403 | 3300012357 | Vadose Zone Soil | LLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVPGS*
Ga0137384_113719332 | 3300012357 | Vadose Zone Soil | MNDAVSLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKSSKWITLQALLVLKRLGLLVLEG*
Ga0137368_102124151 | 3300012358 | Vadose Zone Soil | VRNDQRMDDAVSLLLEKKLPDGKWPLEGVYRGWRHPNAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQAFLVLKRLGLFTPE*
Ga0137385_103345581 | 3300012359 | Vadose Zone Soil | LEKRLPDGKWLLDGVYRGWRHPHAMHGQETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGMLNLGVAQTVNV*
Ga0137385_106850051 | 3300012359 | Vadose Zone Soil | RMNDAIALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLLPGL*
Ga0137385_111766471 | 3300012359 | Vadose Zone Soil | QRMNDAVKVLMAKRLQDGRWPLDGVYRGWRHPHPMHGLETVSRPEERDLIAEGWGSERSIQLEEAGKPSKWITLQALLILKRMGLLNLAAP*
Ga0137360_110701342 | 3300012361 | Vadose Zone Soil | MNDAVFLLLEKRLADGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLALKRLGLLTLDK*
Ga0137390_112152631 | 3300012363 | Vadose Zone Soil | MDDALRVLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELITEGWGTDRTLQLEEAGKPSKWVTLQALLILRRLGLSEPSS*
Ga0137396_100319914 | 3300012918 | Vadose Zone Soil | MEDALRMLRAKQLVDGRWPLEAVYRGWRQFHPMHGTETVSRPEERELVTEGWGTDRTIQLEEAGKPSKWVTLQALLVLKRLGLVESQS*
Ga0137396_100496894 | 3300012918 | Vadose Zone Soil | DDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGKDRTLQLEEAGKPSKWVTLQALLVMKRLGLLEPSS*
Ga0137396_110440912 | 3300012918 | Vadose Zone Soil | LRMLRAKQLVDGRWPLEAVYRGWRQSHPMHGTETVSRPEERELVTEGWGTDHTSQLEEAGKPSRWVTLQALLVLKRLGPLEAQS*
Ga0137416_108375541 | 3300012927 | Vadose Zone Soil | KNDPRMNDAVALLLEKRTPEGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAAKPSKWVTLQALLVLKRLGLLVPGL*
Ga0134077_104848122 | 3300012972 | Grasslands Soil | RHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGMLNPEIAQTVNV*
Ga0134077_105875091 | 3300012972 | Grasslands Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSYPMHGTETVSRPEERELVTEGWGIDRTLQLEEAGKPSKWVTLQALLVLKRLGLLEQSS*
Ga0134076_104347773 | 3300012976 | Grasslands Soil | MEKRLPDGKWILDGVYRGWRHPHAMHGEETVSRPEERELITQGWGSERSLQLEEAGKPSKWITLQSLLVLKRLGLLSLK*
Ga0134076_106392052 | 3300012976 | Grasslands Soil | VPRDERMDDALRMLRAKQLVDGRWPLEAVYRGWRHCYSMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLVLKRLGLLG*
Ga0134087_100993613 | 3300012977 | Grasslands Soil | GKWLLDGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRVGLLHLE*
Ga0134081_101604341 | 3300014150 | Grasslands Soil | DPRMNDAVALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERDLITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL*
Ga0134075_102506801 | 3300014154 | Grasslands Soil | LDGVYRGWRHPHAMHGEETVSRPEERELITQGWATERSLQLEEAGKPSKWITLQALLVLKRLGLLALT*
Ga0134073_101481821 | 3300015356 | Grasslands Soil | MDDAVSLLLEKRLPDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRVGLLHLE*
Ga0134089_100996201 | 3300015358 | Grasslands Soil | LLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGLLTLDK*
Ga0134069_11207413 | 3300017654 | Grasslands Soil | LLEKRTPDGNWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL
Ga0134112_103152222 | 3300017656 | Grasslands Soil | SLLMEKRSPEGKWILDGVYRGWRHPYAMHGEETVSRPEERELITQGWGSERSLQLEEAGKPSKWITLQALLVLKRLGLLSLE
Ga0134083_101304492 | 3300017659 | Grasslands Soil | LLDGVYRGWRHPHAMHGEETVSRPEERELITQGRGTERALQIEEAGKPSKWITLQALLVLKRLGMLNPK
Ga0187771_105052141 | 3300018088 | Tropical Peatland | KRGVWNLDGTYRGWTHPHSANGDWVERPEEYEVVEQGWGGGRTLQLEEAGEPSKWVTLQCLIVLKRLGLLQIRPSKR
Ga0066655_103100041 | 3300018431 | Grasslands Soil | NDPRMDAAVSLLLEKRMPDGKWLLDGVYRGWRHTHAMHGEETVSRPEERELVTQGWGTERALQLEEAGKPSKWITLQALLVLKRLGLLNWESIRP
Ga0066655_111587772 | 3300018431 | Grasslands Soil | GTKYDPRMSDAVNLLLQKRFPDGKWALEGVYRGWRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQALLTLKRLGILERLS
Ga0066662_101516231 | 3300018468 | Grasslands Soil | MGDAVNLLRQKRLPDGKWLLEGVYRGWRQSVGIHVVKAVSRPEEKEAFTEGWGDGHTLQLEEAGKPSKWITLQALLILKRLGILERLS
Ga0066662_106196801 | 3300018468 | Grasslands Soil | GKWVLEGVYRGWRQSVGIHGGKAVSRPEEREAFAEGWGDGHTLQLEEAGKPSKWITLQALLTLKRLGILEGLS
Ga0066662_106271033 | 3300018468 | Grasslands Soil | VALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL
Ga0137417_11717931 | 3300024330 | Vadose Zone Soil | VKNDPRMNDAISLLLLEKRLPDGRWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTEKALQIEEAGKPSKWVTLQALLVLKRLGLLALER
Ga0137417_12238921 | 3300024330 | Vadose Zone Soil | LRAKLRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLPESQS
Ga0209350_11628812 | 3300026277 | Grasslands Soil | LRAKQLVDGRWPLEAVYRGWRHSYPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLVLKRLGLLEQSS
Ga0209238_12428922 | 3300026301 | Grasslands Soil | DGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWLTLQALLVLKRLGLLVSGL
Ga0209761_11403592 | 3300026313 | Grasslands Soil | AVRVLMAKRLPDGRWPLDGVYRGWRYSHPMHGLETVSRPEERDLVTEGWGSDRSIQLEEAGKSSRWITLQALLILKRIGLLSLAST
Ga0209761_11805213 | 3300026313 | Grasslands Soil | GVYRGWRHAHAMHGEETVSRPEERELIVQGWGTERSLQLEEAGKPSKWITLQALLVLKRLGLLAPGSIA
Ga0209761_12031783 | 3300026313 | Grasslands Soil | PDGKWVLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQAILVLKRLGLLAFGS
Ga0209761_12200571 | 3300026313 | Grasslands Soil | RLTDGKWLQEGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRLGVLNIGGSELAGVRAH
Ga0209154_10018451 | 3300026317 | Soil | TPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWVTLQALLVLKRLGLLVPGL
Ga0209472_12388142 | 3300026323 | Soil | RMDDAVSLLLEKRLPDGKWILDGVYRGWRHPHAMHGEETVSRPEERELITQGWGTERALQLEEAGKPSKWITLQALLVLKRVGLLHLE
Ga0209801_10549531 | 3300026326 | Soil | AVSLLLEKRLPDGKWMLDGVYRGWRHPHAMHGEETVSRPEERELITQGWATERSLQLEEAGKPSKWITLQALLVLKRLGLLALT
Ga0209801_11004113 | 3300026326 | Soil | MGDAVNLLRQKRLPDGKWLLEGVYRGWRQSVGIHGGKAVSRPEEREAFTEGWGDGHTLQLEEAGKPSKWITLQALLTLKRLGILERLS
Ga0209803_11280851 | 3300026332 | Soil | VYRGWRHPHAMHGEETVSRPEERELIVQGWGTERSLQLEEAGKPSKWITLQALLVLKRLGLLAPGSIA
Ga0209158_10229605 | 3300026333 | Soil | LEKRTPEGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAAKPSKWVTLQALLVLKRLGLLVPGL
Ga0209159_12127072 | 3300026343 | Soil | KRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKSSKWITLQALLVLKRLGLLVLEG
Ga0257179_10020103 | 3300026371 | Soil | ISLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERASQIEEAGKPSKWITLQALLVLKRLGLLALEQ
Ga0257177_10033001 | 3300026480 | Soil | MDDALRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLILKRLGLPNPTS
Ga0257177_10128303 | 3300026480 | Soil | LLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERASQIEEAAKPSKWITLQALLVLKRLGLLALEQ
Ga0209378_12532412 | 3300026528 | Soil | DDAFRMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWITIQALLIMRRLGLLNSSS
Ga0209806_12539403 | 3300026529 | Soil | GVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWVTLQALLVLKRLGLLVPGL
Ga0209157_12863322 | 3300026537 | Soil | LRILTETGIKDDHRMNDAVRVLMAKRLPDGRWPLDGVYRGWRHPHPMHGLETVSRPEERDLVTEGWGSDRSIQLEEAGKSSRWITLQALLILKRMGLLSLAST
Ga0209056_100309161 | 3300026538 | Soil | PDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELVTQGWGTERALQLEEAGKPSKWVTLQALLVLKRLGLLNWESIRP
Ga0209056_102214732 | 3300026538 | Soil | PDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELVTQGWGTERALQIEEAGKPSKWITLQALLVLKRLGMLNPK
Ga0209376_12972471 | 3300026540 | Soil | PRMDAAVSLLLEKRMPDGKWLLDGVYRGWRHPHAMHGEETVSRPEERELVTQGWGTERALQLEEAGKPSKWITLQALLVLKRLGLLNWESIRP
Ga0209388_10112344 | 3300027655 | Vadose Zone Soil | DGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERASQIEEAGKPSKWITLQALLVLKRLGLLALEQ
Ga0209689_11706431 | 3300027748 | Soil | DAVALLLEKRTPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELIVQGWGTERSLQLEEAGKPSKWITLQALLVLKRLGLLAPGSIA
Ga0209689_13744111 | 3300027748 | Soil | AKQLVDRRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEEAGKPSKWVTLQALLIQKRLGLLGPSS
Ga0209180_100480733 | 3300027846 | Vadose Zone Soil | MDDALRLLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDQTLQMEEAGKPSKWVTLQALLILKRLGQLDSPA
Ga0209180_107066882 | 3300027846 | Vadose Zone Soil | RDERMDDALRMLRAKQLVDGRWPLEAVYRGWRQSHPMHGTETVSRPEERELVTDGWGTDHTLQVEEAGKPSKWVTLQALLILKRLGTPVS
Ga0137415_102129821 | 3300028536 | Vadose Zone Soil | DDALGMLRAKQLVDGRWPLEAVYRGWRHSHPMHGTETVSRPEERELVTEGWGTDRTLQLEDAGKPSKWVTLQALLILKRLGLLESTS
Ga0137415_102384331 | 3300028536 | Vadose Zone Soil | VKNDPRMNDAVSLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGLLALEE
Ga0137415_102407463 | 3300028536 | Vadose Zone Soil | VKNDPRMNDAVSLLLEKRLPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERASQIEEAGKPSKWITLQALLVLKRLGLLAPGQ
Ga0307504_102189232 | 3300028792 | Soil | LPDGKWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLALLALEQ
Ga0307471_1007210693 | 3300032180 | Hardwood Forest Soil | GIWNLDGVYRGWRHAHAMHGEETVSRPEERELITQGWGTERALQIEEAGKPSKWITLQALLVLKRLGLLALEQ




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice