NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F044413

Metagenome / Metatranscriptome Family F044413

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F044413
Family Type Metagenome / Metatranscriptome
Number of Sequences 154
Average Sequence Length 78 residues
Representative Sequence MNICNHGVYFATDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRASL
Number of Associated Samples 109
Number of Associated Scaffolds 154

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 38.96 %
% of genes near scaffold ends (potentially truncated) 56.49 %
% of genes from short scaffolds (< 2000 bps) 71.43 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score0.65

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.351 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(42.857 % of family members)
Environment Ontology (ENVO) Unclassified
(38.312 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.948 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.
1JGI12635J15846_100407102
2JGI12635J15846_105289451
3JGI12635J15846_108449112
4JGI25613J43889_100095245
5JGI25390J43892_101068892
6Ga0066680_102436391
7Ga0066690_100142446
8Ga0066688_107898572
9Ga0066684_102891932
10Ga0066684_103393161
11Ga0066685_103966232
12Ga0070711_1010073971
13Ga0066689_107208471
14Ga0066661_107868062
15Ga0066692_101469852
16Ga0066692_102154061
17Ga0066692_108355902
18Ga0066705_100048792
19Ga0066702_101830232
20Ga0066708_109617232
21Ga0066691_108808532
22Ga0066654_100110364
23Ga0079222_100712353
24Ga0066658_104515742
25Ga0066659_100106916
26Ga0066660_103002122
27Ga0066660_107528181
28Ga0079221_104959431
29Ga0099791_104384721
30Ga0099793_102134362
31Ga0099794_103888252
32Ga0099794_103958362
33Ga0099794_106113552
34Ga0099829_102911581
35Ga0127473_11025251
36Ga0134088_100068144
37Ga0134088_104396161
38Ga0134109_100878132
39Ga0134067_100347641
40Ga0134067_101398401
41Ga0134084_100132702
42Ga0134080_101916042
43Ga0126378_112811291
44Ga0126377_134199802
45Ga0126379_108117692
46Ga0126383_135902432
47Ga0137392_100366505
48Ga0137392_107881411
49Ga0137391_103055931
50Ga0137391_108376451
51Ga0137391_113166552
52Ga0137393_116280611
53Ga0137389_100085913
54Ga0137389_101642373
55Ga0137389_108960812
56Ga0137364_108330782
57Ga0137364_108867841
58Ga0137383_100232234
59Ga0137382_107694472
60Ga0137382_111885531
61Ga0137363_100033975
62Ga0137363_100429432
63Ga0137363_110293011
64Ga0137399_101308642
65Ga0137399_102950592
66Ga0137399_105878692
67Ga0137399_110982941
68Ga0137362_102768252
69Ga0137362_106947491
70Ga0137380_102301103
71Ga0137380_114241461
72Ga0137376_114471521
73Ga0137387_110478901
74Ga0137386_109693782
75Ga0137366_100267545
76Ga0137360_110152231
77Ga0137360_117778412
78Ga0137390_102931531
79Ga0137390_111711892
80Ga0137358_101386711
81Ga0137358_102410951
82Ga0137358_102785152
83Ga0137398_100299583
84Ga0137398_102896681
85Ga0137398_108173231
86Ga0137397_108080732
87Ga0137396_100476654
88Ga0137396_100861731
89Ga0137413_105259901
90Ga0137419_100505261
91Ga0137419_110130651
92Ga0137416_102352641
93Ga0137416_119918782
94Ga0137410_112691052
95Ga0134110_101535902
96Ga0134110_102516191
97Ga0134081_103938431
98Ga0137420_133043616
99Ga0134112_104815232
100Ga0066669_100016816
101Ga0137408_14540562
102Ga0179592_100032985
103Ga0179592_100067763
104Ga0210407_1000055542
105Ga0210407_113638432
106Ga0210403_101782303
107Ga0210399_115699652
108Ga0179596_102718742
109Ga0210404_100036944
110Ga0210404_102132571
111Ga0210404_105062432
112Ga0210406_102119132
113Ga0210408_109241361
114Ga0210409_107658252
115Ga0210409_116936751
116Ga0126371_100332211
117Ga0222728_10814531
118Ga0209350_11665122
119Ga0209238_10018696
120Ga0209239_10225823
121Ga0209155_10041338
122Ga0209471_10394473
123Ga0209131_100092712
124Ga0209131_10241605
125Ga0209152_100628863
126Ga0209267_100007713
127Ga0209803_10074335
128Ga0209377_12614402
129Ga0209804_10143636
130Ga0257176_10427612
131Ga0257181_10204752
132Ga0209808_13054012
133Ga0209378_12496731
134Ga0209648_100034866
135Ga0209577_1000001645
136Ga0179593_10380009
137Ga0179593_10725043
138Ga0179587_101774132
139Ga0209588_10293141
140Ga0209588_11657631
141Ga0209118_10023395
142Ga0209011_10173545
143Ga0209011_11520112
144Ga0209178_13039711
145Ga0137415_102677852
146Ga0137415_105095422
147Ga0222749_106332761
148Ga0307474_101074223
149Ga0307477_101521262
150Ga0307473_101957002
151Ga0307478_102451793
152Ga0307479_100393273
153Ga0307479_102920073
154Ga0307471_1000060133
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 6.80%    β-sheet: 35.92%    Coil/Unstructured: 57.28%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

10203040506070MNICNHGVYFATDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRASLSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.65
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
99.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
42.9%3.2%7.8%18.8%5.8%9.7%4.5%3.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1004071023300001593Forest SoilMNICNHGVYFATDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEALGATSDKLGVGVQFLYYEVPRPSL*
JGI12635J15846_1052894513300001593Forest SoilRPMQAPVDSEREAASMNICNHGIYFVTDQRVQEGVMIQVHLKMPREVVGVDVTEWCFTGRVAHVQPLRAANDKLGVGVQFLYYEVP*
JGI12635J15846_1084491123300001593Forest SoilCNHGVYFATDQRMPKGVMIQVHLKMPREVVGDDVKEWCFTGRVAHVELLGATNDKSGVGVQFLYYEVPRASP*
JGI25613J43889_1000952453300002907Grasslands SoilMNICNYGVYFATDHRLPRGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAKNDKLGVGVQFIYYEVPRASL*
JGI25390J43892_1010688923300002911Grasslands SoilCNHGVYFVTDQRLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGTANDKLGVGVQFLYYEVPRISL*
Ga0066680_1024363913300005174SoilMHEPGDPEWEAATMNICNYGVYFATDHRLPRGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAKNDKLGVGVQFIYYEVPRASL*
Ga0066690_1001424463300005177SoilMNICNHGVYFATDQRVPQGLMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPR
Ga0066688_1078985723300005178SoilMNICNHGVYFATDQRVHKGVMVQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL*
Ga0066684_1028919323300005179SoilMNICNHGVYFATEQSLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGTANDKLGVGVQFLYYEVPRISL*
Ga0066684_1033931613300005179SoilAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVELLGPKGSKLGVGVQFLYYEVPRPSL*
Ga0066685_1039662323300005180SoilHEAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVELLGPKGGKLGVGVQFLYYEVPRPSL*
Ga0070711_10100739713300005439Corn, Switchgrass And Miscanthus RhizosphereGVYFATDQKVPEGLVVQLHLKMPKEIVGDDVEEWSFTGRVAHVEPLSRQNRKSGVGVQFLFYEVPRPAAK*
Ga0066689_1072084713300005447SoilMQAPADPEQEAASMNICNHGVYFATDQRVPQGLMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRVSL*
Ga0066661_1078680623300005554SoilQASTDSEREAASMNICNHGVYFATDQRVHKGVMVQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL*
Ga0066692_1014698523300005555SoilRPMKQASIPEESAASMNISTHGVYFATATKVSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGLFSHSR*
Ga0066692_1021540613300005555SoilQSANAMNICSHGVYFATEQKLPKGVLIRVHLKMPREVLGEAVGEDVKEWCFTGRVAHVESLGATNDKLGVGVHFLYYEVPRPVL*
Ga0066692_1083559023300005555SoilRPMKQASIPEESAASMNISTHGVYFATATKVSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGLFSNSR*
Ga0066705_1000487923300005569SoilMNICNHGVYFATEQSLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGAANDKLGVGVQFLYYEVPRISL*
Ga0066702_1018302323300005575SoilMQAPADPEQEAASMNICNHGVYFATDQRVPQGLMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRISL*
Ga0066708_1096172323300005576SoilEHEAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVESLGPQGNKLGVGVQFLYYEVPRASL*
Ga0066691_1088085323300005586SoilTADPEHEAASMNICNHGVYFATDWKLPEGGMIQVHLKMPREVVGEDVAEWCFTGRVAHVELLGPKGSKLGVGVQFLYYEVPRPSL*
Ga0066654_1001103643300005587SoilATDQNVAEGLMVQLHLKMPKEIVGDEVEEWSFTGRVAHVEPLSRKNGKSGVGVQFLFYEVPRPAMK*
Ga0079222_1007123533300006755Agricultural SoilVLKEESVSSMNISTHGVYFATDQKVREGLMVQLHLKMPKEIVGDEVEEWSFTGRVAHVEPLSRQNGKSGVGVQFLFYEVPRPAAK*
Ga0066658_1045157423300006794SoilMNICNHGVYFATDQRVPQGLMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGTANDKLGVGVQFLYYEVPRVSL*
Ga0066659_1001069163300006797SoilMQAPADPEQQAASMNICNHGVYFATDRRLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRVSL*
Ga0066660_1030021223300006800SoilSMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVESLGPQGNKLGVGVQFLYYEVPRASL*
Ga0066660_1075281813300006800SoilNICNHGVYFATEQSLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGAANDKLGVGVQFLYYEVPRISL*
Ga0079221_1049594313300006804Agricultural SoilLKEEAVSSMNISTHGVYFATDQKVREGLMVQLHLKMPKEIVGDDVEEWSFTGRVAHVEPLSRQNGKSGVGVQFLFYEVPRPAAK*
Ga0099791_1043847213300007255Vadose Zone SoilEAASMNICHHGVYFATDRKLPEELMIQVHLKMPREVVGDDVKEWCFTGRVAHVEPLGANNGKSGVGVQFLYYEVPRPSL*
Ga0099793_1021343623300007258Vadose Zone SoilMNICHHGVYFATDRKVPKGVMLQVHLKLPREVAGDDVSEWCFTGRVAHVEPLGDPNGKSGVGVQFLCYEVPRALV*
Ga0099794_1038882523300007265Vadose Zone SoilPLQAPDYPEGEAATMNICNHGVYFATDQSVPKGVMIQVHLKMPREVVGDDVTEWRFTGRVAHVEPLGTTNHKSGVGVQFLYYEVPRPFL*
Ga0099794_1039583623300007265Vadose Zone SoilICNHGVYFATDQMLPTGVIIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGNPDGKSGVGVQFLCYEVPRASF*
Ga0099794_1061135523300007265Vadose Zone SoilSMNICSHGVYFATDQRVHEGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL*
Ga0099829_1029115813300009038Vadose Zone SoilMNICNHGVYFATDQSVPKGVMIQVHLKMPREVVGDDVTEWRFTGRVAHVEPLGTTNHKSGVGVQFLYYEVPRPFL*
Ga0127473_110252513300010096Grasslands SoilQETADPEHEAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVESLGPQGNKLGVGVQFLYYEVPRASL*
Ga0134088_1000681443300010304Grasslands SoilMQETANPEHEAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVELLGPKGGKLGVGVQFLYYEVPRPSL*
Ga0134088_1043961613300010304Grasslands SoilMKQASIPEESAASMNISTHGVYFATATKVSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGLFSHSR*
Ga0134109_1008781323300010320Grasslands SoilYFVTDQRLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGAANDKLGVGVQFLYYEVPRISL*
Ga0134067_1003476413300010321Grasslands SoilTHGVYFATDQNVAEGLMVQLHLKMPKEIVGDEVEEWSFTGRVAHVEPLSRKNGKSGVGVQFLFYEVPRPAMK*
Ga0134067_1013984013300010321Grasslands SoilMQETADPEHEAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVELGGPKGSKLGVGVQFLYYEVP
Ga0134084_1001327023300010322Grasslands SoilMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVESLGPQGNKLGVGVQFLYYEVPRASL*
Ga0134080_1019160423300010333Grasslands SoilMKQASIPEESAASMNISTHGVYFATGTKVSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGLFSNSR*
Ga0126378_1128112913300010361Tropical Forest SoilHGVYFATDQRVREGLMVQLHLKMPKEIVGDDVEEWSFTGRVAHVEPLSRQNGKSGVGVQFLFYEVPRPAAK*
Ga0126377_1341998023300010362Tropical Forest SoilISTHGVYFATDQKVREGLMVQLHLKMPKEIVGDEVEEWSFTGRVAHVERLNWKNGKSGVGVQFLFYEAPRPAMK*
Ga0126379_1081176923300010366Tropical Forest SoilAVSSMNISAHGVYFATDQNVAEGLMVQLHLRMPKEIVGDVVEEWSFTGSVAHVESLSQKNGKSGVGVQFLFYEVPRPVLE*
Ga0126383_1359024323300010398Tropical Forest SoilYFATDQKVREGLMVELHLKMPKEIVGDEVAEWSFTGRVAHVEPLSRQNGKSGVGVQFLFYEVPRPAMK*
Ga0137392_1003665053300011269Vadose Zone SoilVYFATDHRLSEGLMIQVHLKMPREVVGDDVKEWCFTGRVAHVEPLGANNGKSGVGVQFLYYEVPRPSL*
Ga0137392_1078814113300011269Vadose Zone SoilTDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRASL*
Ga0137391_1030559313300011270Vadose Zone SoilDSEQSANAMNICSHGVYFATDQKLPKGVLIQVHLKMPREVLGDDVTEWCFTGRVAHVESLGPTNDKLGVGVHFVYYEVPRPVL*
Ga0137391_1083764513300011270Vadose Zone SoilNICNHGVYFATDQSVPKGVMIQVHLKMPREVVGDDVTEWRFTGRVAHVEPLGTTNHKSGVGVQFLYYEVPRPSL*
Ga0137391_1131665523300011270Vadose Zone SoilANAMNICSHGVYFATDQKLPKGVLIQVHLKMPREVVGDVVGDDVTEWCFTGRVAHVESLGATKDKLGVGVHFLYYEVPRPLL*
Ga0137393_1162806113300011271Vadose Zone SoilNHGVYFATDQSVPKGVMIQVHLKMPREVVGDDVTEWRFTGRVAHVEPLGTTNHKSGVGVQFLYYEVPRPFL*
Ga0137389_1000859133300012096Vadose Zone SoilMNICNHGVYFATDQKVPMGVMLQVHLKMPREVAGDDVVEWCFTGRVTHVEPLGAPNGKLGVGVQFLYYEVPRPSL*
Ga0137389_1016423733300012096Vadose Zone SoilMNICNHGVYFATDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRASL*
Ga0137389_1089608123300012096Vadose Zone SoilYFATDQMLPTGVIIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGNPDGKSGVGVQFLCYEVPRASF*
Ga0137364_1083307823300012198Vadose Zone SoilEAASMNICNHGVYFATDQRVHEGVMIQVHLRMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL*
Ga0137364_1088678413300012198Vadose Zone SoilEAASMNICNHGVYFATDQRVHEGVMIQVHLRMPREVVGDDVTEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRPSL*
Ga0137383_1002322343300012199Vadose Zone SoilNHGVYFATDRKLPEGGMIQVHLKMPREVVGEDVAEWCFTGRVAHVELLGPRGDKLGVGVQFLYYEVPRPSL*
Ga0137382_1076944723300012200Vadose Zone SoilNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVESLGPQGNKLGVGVQFLYYEVPRASL*
Ga0137382_1118855313300012200Vadose Zone SoilDQRVHEGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRPSL*
Ga0137363_1000339753300012202Vadose Zone SoilMNICNHGVYFATDQKVPEGGMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGIASDKSGVGVQFLYYEVPRASL*
Ga0137363_1004294323300012202Vadose Zone SoilMQETADPEHEAASMNICNHGVYFATDRKLPEGGMIQVHLKMPREVVGEDVAEWCFTGRVAHIELLGPKGSKLGVGVQFLYYEVPRPSL*
Ga0137363_1102930113300012202Vadose Zone SoilMNICNHGVYFATDQKVPEGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGIASDKSGVGVQFLYYEVPRAS
Ga0137399_1013086423300012203Vadose Zone SoilMQATADPEHEAASMNICNHGVYFATDRKLPEGGMIQVHLKMPREVVGEDVAEWCFTGRVAHVELLGPKGTKLGVGVQFLYYEVPRPSL*
Ga0137399_1029505923300012203Vadose Zone SoilMQAPPDSEREAASMNICNHGVYFATDQRVHKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVQPLGAANDILGVGVQFLYYEVPRASL*
Ga0137399_1058786923300012203Vadose Zone SoilMNICNHGVYFATDQRLAQGVLIQVHLKMPREVVGDDVVEWCFTGRVAHVEPLGVSNDKSGVGVQFLYYEVPRASL*
Ga0137399_1109829413300012203Vadose Zone SoilAASMNICNQGVYFATEQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATSDKLGVGVQFLYYEVPRPSL*
Ga0137362_1027682523300012205Vadose Zone SoilGVYFATDHRLPRGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAKNDKLGVGVQFIYYEVPRASL*
Ga0137362_1069474913300012205Vadose Zone SoilEKDAASMNICNQGVYFATDEKVRKGVMIQVHLKLPREVVGDDVAEWCFTGRVAHVESLGANHDKLGVGVQFLYYETPRTSI*
Ga0137380_1023011033300012206Vadose Zone SoilAASMNISTHGVYFATATKVSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGLFSHSR*
Ga0137380_1142414613300012206Vadose Zone SoilMKQASIPEESAASMNISTHGVYFATGTKVSEGVLVQVHLKMPREIVGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGLFSHSR*
Ga0137376_1144715213300012208Vadose Zone SoilDSEREAASMNICNHGVYFATDQRVHEGVMIQVHLRMPREVVGDDVTEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRPSL*
Ga0137387_1104789013300012349Vadose Zone SoilMKQASIPEETAASMNISTHGVYFATDAKMSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQF
Ga0137386_1096937823300012351Vadose Zone SoilEAASMNICNHGVYFATDRKLPEGGIIQVHLKMPREVVGEDVAEWCFTGRVAHVELLGPRGNKLGVGVQFLYYEVPRPSL*
Ga0137366_1002675453300012354Vadose Zone SoilMKQASIPEESAASMNISTHGVYFATATKVSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGFFSHSR*
Ga0137360_1101522313300012361Vadose Zone SoilNICDHGVYFATDQSVPKGVMIQVHLKMPREVVGDDVTEWRFTGRVAHVEPLGTTNHKSGVGVQFLYYEVPRPFL*
Ga0137360_1177784123300012361Vadose Zone SoilVYFATDQMVHEGVMIQVHLKMPREVIGDDVTEWCFTGRVAHVQPLGTANDQLGVGVQFLYYEVPRASL*
Ga0137390_1029315313300012363Vadose Zone SoilGNNGDSEQSANAMNICSHGVYFATDQKLPKGVLIQVHLKMPREVVGDVVGDDVTEWCFTGRVAHVESLGATKDKLGVGVHFLYYEVPRPLL*
Ga0137390_1117118923300012363Vadose Zone SoilNNGDSEQSANAMNICSHGVYFATDQKLPKGVLIQVHLKMPREVLGDDVTEWCFTGRVAHVESLGPTNDKLGVGVHFVYYEVPRPVL*
Ga0137358_1013867113300012582Vadose Zone SoilAASMNICNHGVYFATDQRLAQGVLIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGVSNDKSGVGVQFLYYEVPRASL*
Ga0137358_1024109513300012582Vadose Zone SoilQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRASL*
Ga0137358_1027851523300012582Vadose Zone SoilFATEQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATSDKLGVGVQFLYYEVPRPSL*
Ga0137398_1002995833300012683Vadose Zone SoilMNICNHGVYFATDQKVPEGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGIASDKSGVGIQFLYYEIPRASI*
Ga0137398_1028966813300012683Vadose Zone SoilNSEREAASMNICNHGVYFATDQKVHEGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVQPLGTANDQLGVGVQFLYYEVPRASL*
Ga0137398_1081732313300012683Vadose Zone SoilVYFATDRKLAKGVLIQVHLKMPREVLGEVVGDHVEEWCFTGRVAHVESLRATSDKLGVGVHFLYYEVPRPVL*
Ga0137397_1080807323300012685Vadose Zone SoilERDDASMNSYNHRVYFATDHMVHEGVMIQVHLKMPRQVSGDDVTEWCFTGRVAHVQPLGTANDQLGVGVQFLYYEVPRASL*
Ga0137396_1004766543300012918Vadose Zone SoilVQAPDGSEEDAASMNICHHGVYFATDRKVPKGVMLQVHLKLPREVAGDDVSEWCFTGRVAHVEPLGDPNGKSGVGVQFLCYEVPRALV*
Ga0137396_1008617313300012918Vadose Zone SoilMLATADPEHEAASMNICHHGVYFATDHRLPEGLMIQVHLKMPREVVGDDVKEWCFTGRVAHVEPLGANNGKSGVGVQFLYYEVPRPSL*
Ga0137413_1052599013300012924Vadose Zone SoilMNICNHGVYFATDQKVPEGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGIASVMSGVGVQFLYYEVPRLSL*
Ga0137419_1005052613300012925Vadose Zone SoilASMNICNHGVYFATDQRVHEGVMIQVHLKMPREVIGDDATEWCFTGRVAHVQPLGASSDKLGVGVQFLYYEVPQASL*
Ga0137419_1101306513300012925Vadose Zone SoilMQASDDPEREAASMNICHHGVYFATDQTVPKGVMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAANDKSGVGVQFLYYEVPRPSL*
Ga0137416_1023526413300012927Vadose Zone SoilQRVPKGVMIQVHLKMPREVVGDGVTEWCFTGRVAHVEPLGAANDKSGVDVQFVYYEVPRTSL*
Ga0137416_1199187823300012927Vadose Zone SoilANSEREAASMNICNHGVYFATDQRVHEGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVQPLGTANDQLGVGVQFLYYEVPRASL*
Ga0137410_1126910523300012944Vadose Zone SoilASMNICNHGVYFATDQRLAQGVLIQVHLKMPREVVGDDVVEWCFTGRVAHVEPLGVSNDKSGVGVQFLYYEVPRASL*
Ga0134110_1015359023300012975Grasslands SoilMQETADPEHEAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVELLGPKGSKLGVGVQFLYYEVPRPSL*
Ga0134110_1025161913300012975Grasslands SoilGVYFATDQRVHKGVMVQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL*
Ga0134081_1039384313300014150Grasslands SoilCNHGVYFATEQSLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGTANDKLGVGVQFLYYEVPRISL*
Ga0137420_1330436163300015054Vadose Zone SoilMNICNNGVYSHRSEVHEGVMIQVHLKMPREVIGDDVTEWCFTGASRMSNVRTANDQLGVGVPVLYYESAPSLALAVAR*
Ga0134112_1048152323300017656Grasslands SoilIPEESAASMNISTHGVYLATATKVSEGVLVQVHLKMPREIAGDDVEEWSFTGRVAHVEPLGTTNGKSGIGVQFLYYEVPPATGLFSNSR
Ga0066669_1000168163300018482Grasslands SoilMNICNHGVYFATDQRVHKGVMVQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL
Ga0137408_145405623300019789Vadose Zone SoilMQEPGDPEWEAATMNICNYGVYFATDHRLPRGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAKNDKLGVGVAVHLL
Ga0179592_1000329853300020199Vadose Zone SoilMNICNHGVYFATDQRVHEGVMIQVHLKMPREVIGDDVTEWCFTGRVAHVQPLGASSDKLGVGVQFLYYEVPQASL
Ga0179592_1000677633300020199Vadose Zone SoilMNICHHGVYFATDQTVPKGVMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAANDKSGVGVQFLYYEVPRPSL
Ga0210407_10000555423300020579SoilMNINNHGVYFATDQRLPEGVMIQVHLRLPREVAGDDVTEWCFTGRVAHVESLGPTNGKSGVGVQFLYYEVPRPSL
Ga0210407_1136384323300020579SoilFATDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL
Ga0210403_1017823033300020580SoilMNICHHGVYFATDQSVPKGVMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAASDKSGVGVQFLYYEVPRASF
Ga0210399_1156996523300020581SoilYFATDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL
Ga0179596_1027187423300021086Vadose Zone SoilMNICNHGVYFATDQRVHEGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRASL
Ga0210404_1000369443300021088SoilMQAPADPEQAAASMNICNHGVYFATDQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGSTNDKLGVGVQFLYYEVPRASL
Ga0210404_1021325713300021088SoilGVYFATDQMMPKGVIIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGNPDGKSGVGVQFLCYEVPRASFQRSV
Ga0210404_1050624323300021088SoilADVEQTAASMNICHHGVYFSTDQRVPKGVMIQVHLKMPREVVGDDVREWCFTGRVAHVESVGTANDKLGVGVQFLYYEVPRASL
Ga0210406_1021191323300021168SoilMNINNHGVYFATDQRLPEGVMIQVHLRLPREVAGDDVTEWCFTGRVAHVESLGLTNGKSGVGVQFLYYEVPRPSL
Ga0210408_1092413613300021178SoilNHGVYLATDQRGPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL
Ga0210409_1076582523300021559SoilPMQAPDDPEREAASLNICNHGVYFATDQSVPKGVMIQVHLKMPREVVGEDVTVWCFTGRVAHVEPLGAASEKSGVGVQFLYYEVPRPSL
Ga0210409_1169367513300021559SoilMNICNHGVYFATDQRLPQGVLIQVHLKMPREVVGDDVVEWCFTGRVAHVEPLGVSNDKSGVGVQFLYYEVPRTSL
Ga0126371_1003322113300021560Tropical Forest SoilLGEESVSSMNISSHGVYFATDQKVREGLMVQLHLKMPKEIVGDDVAEWSFTGRVAHVEPLSRQNGKSGVGVQFLFYEVPRPAMK
Ga0222728_108145313300022508SoilMAFFFATDQKVPKGLMLQVHLKMPREVVGDEVTEWCFTGRVAHVESLGATKDKSGVGVQFLYYEVPPVSLEQYS
Ga0209350_116651223300026277Grasslands SoilTDQRLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGTANDKLGVGVQFLYYEVPRVSL
Ga0209238_100186963300026301Grasslands SoilMNICNHGVYFVTDQRLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGTANDKLGVGVQFLYYEVPRISL
Ga0209239_102258233300026310Grasslands SoilMNICNHGVYFVTDQRLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGTANDKLGVGVQFLYYEVPRISL
Ga0209155_100413383300026316SoilYFVTDQRLPQGVMIQVHLRMPREVLGDDVAEWCFTGRVVHVESLGTANDKLGVGVQFLYYEVPRISL
Ga0209471_103944733300026318SoilETADPEHEAASMNICNHGVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVELLGPKGSKLGVGVQFLYYEVPRPSL
Ga0209131_1000927123300026320Grasslands SoilMNICNHGVYFATDQRVHEGVMIQVHLKMPREVIGDDVTEWCFTGRVAHVQPLGASSDKLGVGVQFLYYEVPQVSL
Ga0209131_102416053300026320Grasslands SoilMHEPGDPEWEAATMNICNYGVYFATDHRLPRGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAKNDKLGVGVQFIYYEVPRASL
Ga0209152_1006288633300026325SoilMNICNHGVYFATDQRVHEGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVESLGATNDKLGVGVQFLYYEIPRSSL
Ga0209267_1000077133300026331SoilVQTPADPERTAAFMNICNHGVYFATEQSLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGAANDKLGVGVQFLYYEVPRISL
Ga0209803_100743353300026332SoilMNICNHGVYFATEQSLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGAANDKLGVGVQFLYYEVPRISL
Ga0209377_126144023300026334SoilYFATEQKLPKGVLIRVHLKMPREVLGEAVGEDVKEWCFTGRVAHVESLGATNDKLGVGVHFLYYEVPRPVL
Ga0209804_101436363300026335SoilMQAPADPEQEAASMNICNHGVYFATDQRVPQGLMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRP
Ga0257176_104276123300026361SoilMNICNQGVYFATEQRVPKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATSDKLGVGVQFLYYEVPRPSL
Ga0257181_102047523300026499SoilMNICNHGVYFATDQRVPKGVMIQVHLKMPREVVGDGVTEWCFTGRVAHVEPLGAANDKSGVDVQFVYYEVPRTSL
Ga0209808_130540123300026523SoilFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVESLGPQGNKLGVGVQFLYYEVPRASL
Ga0209378_124967313300026528SoilVYFATDRKLPEGGMIQVHLRMPREVVGEDVAEWCFTGRVAHVELLGPKGSKLGVGVQFLYYEVPRPSL
Ga0209648_1000348663300026551Grasslands SoilMNICNHGVYFATDQRVHKGVMIQVHLKMPREVVGDDVTEWCFTGRVAHVEPLGATNDKLGVGVQFLYYEVPRPSL
Ga0209577_10000016453300026552SoilMQAPADPEQEAASMNICNHGVYFATDQRVPQGLMIQVHLKMPREVVGDDVAEWCFTGRVAHVESLGATNDKLGVGVQFLYYEVPRISL
Ga0179593_103800093300026555Vadose Zone SoilMNICSHGVYFATEQKLPKGVLIQVHLKMPSGSGGRRGGRRGNGVWCFTGRVAHVESLGPTNNKLGVGVHFLYYEVPRPSALDRPDNPIQFRASMTYK
Ga0179593_107250433300026555Vadose Zone SoilMNICSHGVYFATEQKLPKGVLIQVHLKMPREVVGDVVGDEATEWCFTGRVAHVESLGPTNNKLGVGVHFLTTKFPAQCSRLAG
Ga0179587_1017741323300026557Vadose Zone SoilMNICNHGVYFATDQKVPEGGMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGIASDKSGVGVQFLYYEVPRASL
Ga0209588_102931413300027671Vadose Zone SoilSANAMNICSHGVYFATEQKLPKGVLIQVHLKMPREVVGDVVGDEATEWCFTGRVAHVESLGPTNNKLGVGVHFLYYEVPRPVL
Ga0209588_116576313300027671Vadose Zone SoilEAATMNICDHGVYFATDQSVPKGVMIQVHLKMPREVVGDDVTEWRFTGRVAHVEPLGTTNHKSGVGVQFLYYEVPRPFL
Ga0209118_100233953300027674Forest SoilMQASADSEQTAASMNICNHGVYFATDQRMPKGVMIQVHLKMPWEVVGDDVKEWCFTGRVAHVESLGATNDKSGVGVQFLYYEVPRTSL
Ga0209011_101735453300027678Forest SoilMNICNHGVYFVTDQRVQEGVMIQVHLKMPREVIGDDVTEWCFTGRVAHVQPLGAANDKLGVGVQFLYYEVP
Ga0209011_115201123300027678Forest SoilMNICNHGVYFATDQRVHEGVMIQVHLKMPREVNGDDVTEWCFTGRVAHVQPLGAANDKLGVGVQFLYYEVPRASL
Ga0209178_130397113300027725Agricultural SoilVYFATDQKVREGLMVQLHLKMPKEIVGDDVEEWSFTGRVAHVEPLSRQNGKSGVGVQFLFYEVPRPAAK
Ga0137415_1026778523300028536Vadose Zone SoilMLATADPEHEAASMNICHHGVYFATDHRLPEGLMIQVHLKMPREVVGDDVKEWCFTGRVAHVEPLGANNGKSGVGVQFLYYEVPRPSL
Ga0137415_1050954223300028536Vadose Zone SoilMNICNHGVYFATDQRVPKGVMIQVHLKMPREVVGDGVTEWCFTGRVAHVEPLGAANDKSGVGVQFVYYEVPRTSL
Ga0222749_1063327613300029636SoilATHPEETAASMNICTHGVFFATDQKVPKGLMLQVHLKMPREVVGDDVTEWCFTGRVAHVESLGATKDKSGVGVQFLYYEVPPASLEQYS
Ga0307474_1010742233300031718Hardwood Forest SoilMNICNHGVYFATDQSVPKGVMIQVHLRMPREVVGDDVAEWCFTGRVAHVEPLGPTNYKTGVGVQFLYYEVPRPSL
Ga0307477_1015212623300031753Hardwood Forest SoilMNICNHGVYFATDQAVPKGMMIQVHLRMPREVVGDDVMEWCFTGRVAHVEPLGPTNYKTGVGVQFLYYEVPRPSL
Ga0307473_1019570023300031820Hardwood Forest SoilMNICNHGVYFATDHRLPRGEMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLGAKNDKLGVGVQFIYYEVPRASL
Ga0307478_1024517933300031823Hardwood Forest SoilMNICNHGVYFATDQSVPKGVMIQVHLKMPREVVGGDVAEWCFTGRVAHVEPLGPTNYKTGVGVQFLYYEVPRPSL
Ga0307479_1003932733300031962Hardwood Forest SoilMNICHHGVYFAMDQSVPKGVMIQVHLKMPREVVGDDVAEWCFTGRVAHVEPLAAASYKSGVGVQFLYYEVPRASF
Ga0307479_1029200733300031962Hardwood Forest SoilMNICNHGVYFATDQAVPKGMMIQVHLRMPREVVGDDVVEWCFTGRVAHVEPLGPTNYKTGVGVQFLYYEVPRPSL
Ga0307471_10000601333300032180Hardwood Forest SoilMNICNHGVYFATDQRLPQGVMIQVHLKMPREVVGDDVAEWCFTGRVVHVESLGAANDKLGVGVQFLYYEVPRISL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.