NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F042211

Metagenome Family F042211

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F042211
Family Type Metagenome
Number of Sequences 158
Average Sequence Length 81 residues
Representative Sequence MTGTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFRWLKAQPLVGHPIRVKRATRSRVKKSR
Number of Associated Samples 124
Number of Associated Scaffolds 158

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 35.03 %
% of genes near scaffold ends (potentially truncated) 27.85 %
% of genes from short scaffolds (< 2000 bps) 72.78 %
Associated GOLD sequencing projects 115
AlphaFold2 3D model prediction Yes
3D model pTM-score0.37

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (59.494 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(22.785 % of family members)
Environment Ontology (ENVO) Unclassified
(31.646 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.
1KansclcFeb2_05226480
2JGI10216J12902_1124097462
3JGI25613J43889_100225701
4soilL2_102182802
5Ga0062590_1026383481
6Ga0066672_100023835
7Ga0066690_101124941
8Ga0065705_102616941
9Ga0070691_105536021
10Ga0070705_1018622501
11Ga0066689_105095361
12Ga0066681_100535402
13Ga0070698_1001679092
14Ga0070698_1019064811
15Ga0070696_1014021552
16Ga0070693_1006195412
17Ga0066692_101169474
18Ga0066704_104566113
19Ga0066698_106513062
20Ga0066703_100931224
21Ga0066691_109689451
22Ga0068864_1017557311
23Ga0075417_100168284
24Ga0075417_100169171
25Ga0075417_101653613
26Ga0075432_103628612
27Ga0075428_1000366095
28Ga0075428_1006682211
29Ga0075421_1000646241
30Ga0075421_1004318342
31Ga0075421_1014674361
32Ga0075430_1000727323
33Ga0075430_1013401932
34Ga0075431_1001186073
35Ga0075431_1010732061
36Ga0075433_118623502
37Ga0075425_1019547432
38Ga0075425_1020988781
39Ga0075429_1000919112
40Ga0075426_104009772
41Ga0075436_1001113253
42Ga0097620_1001070854
43Ga0075435_1001321041
44Ga0066710_1011690252
45Ga0099830_102642392
46Ga0099828_117958552
47Ga0111539_113854362
48Ga0075418_1000300617
49Ga0075418_100337077
50Ga0066709_1000831734
51Ga0066709_1012844963
52Ga0066709_1039529801
53Ga0114129_103270512
54Ga0105092_101321183
55Ga0075423_103188144
56Ga0134127_100238303
57Ga0137393_100264633
58Ga0137442_11257911
59Ga0137338_11537261
60Ga0137388_104424892
61Ga0137383_109085963
62Ga0137363_101278693
63Ga0137399_108055792
64Ga0137399_110633661
65Ga0137374_108929362
66Ga0137362_103049752
67Ga0137362_105290261
68Ga0137381_105974832
69Ga0137376_100881603
70Ga0137379_105015492
71Ga0137379_111064791
72Ga0137377_100864071
73Ga0137372_102609211
74Ga0137372_105298751
75Ga0137367_100881995
76Ga0137366_106292811
77Ga0137369_105303941
78Ga0137385_108919281
79Ga0137375_104302461
80Ga0137361_111131911
81Ga0137373_106662201
82Ga0137394_101029586
83Ga0137419_108915101
84Ga0137404_100493275
85Ga0137404_120013081
86Ga0137407_110094352
87Ga0137407_120558611
88Ga0137407_121749181
89Ga0162653_1000142452
90Ga0162650_1000882342
91Ga0163162_126706801
92Ga0180084_11077351
93Ga0137403_1000883914
94Ga0134089_101297992
95Ga0184604_100084102
96Ga0184608_101799323
97Ga0184620_100478752
98Ga0184638_10236292
99Ga0184638_10336491
100Ga0184638_11263623
101Ga0184626_100046593
102Ga0184626_100319533
103Ga0184626_100958962
104Ga0184621_102255281
105Ga0184621_102870542
106Ga0184623_100588853
107Ga0184623_102413442
108Ga0184619_101619051
109Ga0184619_102337081
110Ga0184617_10262123
111Ga0184618_101291002
112Ga0184618_103973891
113Ga0184635_100446071
114Ga0184635_101363581
115Ga0184624_100394305
116Ga0184632_100126035
117Ga0184609_100175134
118Ga0184609_101033742
119Ga0184612_102387652
120Ga0184625_100365431
121Ga0184625_102576171
122Ga0184629_100745372
123Ga0190265_120142831
124Ga0190275_102442962
125Ga0066662_108814482
126Ga0066669_104943891
127Ga0190273_108685352
128Ga0193747_10309952
129Ga0193743_10426462
130Ga0193739_10049023
131Ga0193749_10210792
132Ga0193716_10133213
133Ga0193709_11061971
134Ga0224452_11788261
135Ga0212128_102066202
136Ga0222622_100248925
137Ga0222622_109628401
138Ga0207684_110691741
139Ga0209154_10172053
140Ga0209131_10036197
141Ga0209801_10214802
142Ga0209804_10683601
143Ga0256867_100259045
144Ga0209819_100451101
145Ga0209701_101477753
146Ga0209488_109796081
147Ga0209382_100415677
148Ga0209382_102594961
149Ga0209382_107658952
150Ga0209382_108877302
151Ga0137415_105318643
152Ga0299906_107059651
153Ga0268386_100172734
154Ga0299913_107209171
155Ga0307479_107211332
156Ga0315910_101355151
157Ga0370498_188366_3_287
158Ga0364943_0368180_88_327
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 34.26%    β-sheet: 1.85%    Coil/Unstructured: 63.89%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

1020304050607080MTGTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFRWLKAQPLVGHPIRVKRATRSRVKKSRSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.37
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
59.5%40.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Thermal Springs
Soil
Groundwater Sediment
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Switchgrass Rhizosphere
Soil
Sugarcane Root And Bulk Soil
Soil
Grasslands Soil
Hardwood Forest Soil
Soil
Untreated Peat Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Sediment
Soil
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
17.7%8.2%22.8%8.9%5.1%4.4%18.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
KansclcFeb2_052264802124908045SoilLGELAELFGLSEESIKRFAKKNGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRGSRVSRSRNISVPKHRR
JGI10216J12902_11240974623300000956SoilMYNAATTPMTWTIDRWLQLAELAKIFGLSEESIKRLAKNRGLPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRALASSRSRRKTSR*
JGI25613J43889_1002257013300002907Grasslands SoilMTGKTDQWLQLTEIVKMFGLSEESIKRLAKAQGFPLRRLTPYATPGVIESELIAWLKAQPNIGAPVRAKRSTKSKRSMGKR*
soilL2_1021828023300003319Sugarcane Root And Bulk SoilMSRDRIIETIDRWLQLGEISRMFGLSEESIKRLAKTHDLPLRRVTPFATPGTLESELMLWLKTQPRVGPPVRGRKKSKR*
Ga0062590_10263834813300004157SoilPGSNFTKSRCALNRMYNVATTPMTWIIDRWLQLAELAKIFGLSEESIKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAMASNRSRRKTSR*
Ga0066672_1000238353300005167SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPRRGQPIRPRPTIKSRRKKSRKRD*
Ga0066690_1011249413300005177SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPRRGQPIRPRPT
Ga0065705_1026169413300005294Switchgrass RhizosphereMADTVDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPLVGKPIRAKRATRSRAKKSR*
Ga0070691_1055360213300005341Corn, Switchgrass And Miscanthus RhizosphereMTGLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHS
Ga0070705_10186225013300005440Corn, Switchgrass And Miscanthus RhizosphereMTGLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNVKKDDKSGSDR*
Ga0066689_1050953613300005447SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPLRGQPIR
Ga0066681_1005354023300005451SoilMTGKSDQWLQLPEIAKMFGLSEESIKRLAKTQGFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHSIGKR*
Ga0070698_10016790923300005471Corn, Switchgrass And Miscanthus RhizosphereCPPKLMYNAATTPMTWINDRWLQLAELAKIFGLSEESIKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAMAGNRSRRKASR*
Ga0070698_10190648113300005471Corn, Switchgrass And Miscanthus RhizosphereMTGLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNA
Ga0070696_10140215523300005546Corn, Switchgrass And Miscanthus RhizosphereMYNVATTPMTWIIDRWLQLAELAKIFGLSEESIKRLAKNRGLPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAMAGNRSRRKASR*
Ga0070693_10061954123300005547Corn, Switchgrass And Miscanthus RhizosphereMTGLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSN
Ga0066692_1011694743300005555SoilMTGIIDRWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESEFVRWLKAQPLVGKPIRAKRVTRSRGKKSRSRP*
Ga0066704_1045661133300005557SoilMTGTGDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPRVGRPIRVKRAKRSRGKKSRQRP*
Ga0066698_1065130623300005558SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPLRGQ
Ga0066703_1009312243300005568SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPLRGQPIRARPTIKSRRKKSRKRD*
Ga0066691_1096894513300005586SoilMTGKSDQWLQLPEIAKMFGLSEESIKRLAKTQGFPLRRLTPYATPGVLKSELVSWLKAQPRVGAPVRAKRTVRSKHSIGK
Ga0068864_10175573113300005618Switchgrass RhizosphereMTGLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIACFKAQPKVGPPIRVNRPRTRKHSN
Ga0075417_1001682843300006049Populus RhizosphereMTSTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPRVGHPIRVKRATQSRAKKSR*
Ga0075417_1001691713300006049Populus RhizosphereMTSTIDRWIQLGELAKLFGLSEESMKRRVAKNHGFPLRRLTPCATPGVLESELFTWLKAQPRVGNAIRVKRATRSRVKKSR*
Ga0075417_1016536133300006049Populus RhizosphereMNRKASGIIDHWLTLGELAKLFGLSEESIKRLAKKNGFPLRRLTPYATPGVIESEFFRWLKTQPLVGRAVRAKRVTRSRNTSQPKNRR*
Ga0075432_1036286123300006058Populus RhizosphereLGELAELFGLSEESIKRFAKKNGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRGSRVSRSRNISVPKHRR*
Ga0075428_10003660953300006844Populus RhizosphereMTSTMDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFTWLKAQPRVGNAIRVKRATRSRVKKSR*
Ga0075428_10066822113300006844Populus RhizosphereQLAELSELFGLSEESIKRLSKSHGFPLRRLTPYATPGVIQSELLRWLKAQPLVGPAVRVKKPRVRKSRSGSVSQGNR*
Ga0075421_10006462413300006845Populus RhizosphereMTWIIDRWLQLAELAKLFGLSEESIKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAMANKRLRRKTSTR*
Ga0075421_10043183423300006845Populus RhizosphereMNRKATGVIDQWMPLAELAELFGLSEESIKRFAKKNGFPLRRLTPYATPGVLESELFSWLKAQPPVGPAVRASRVTRSRDFSVPKHRR*
Ga0075421_10146743613300006845Populus RhizosphereMDQWLTLGELAELFGLSEESIKRLAKKNGFPLRRLTPYATPGVLESELFRWLKAQPPVGRAVRARRVLGARACLLINPFLA*
Ga0075430_10007273233300006846Populus RhizosphereMTSTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFTWLKAQPRVGNAIRVKRATRSRVKKSR*
Ga0075430_10134019323300006846Populus RhizosphereMTWIIDRWLQLAELAKLFGLSEESIKRLAKNRGLPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAMANKRLRRKTSTR*
Ga0075431_10011860733300006847Populus RhizosphereMTGTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFRWLKAQPRVGHPIRVKRATRSRVKKSR*
Ga0075431_10107320613300006847Populus RhizosphereMTGSIDQWLQLAELSELFGLSEESIKRLSKSHGFPLRRLTPYATPGVIQSELLRWLKAQPLVGPAVRVKKPRVRKSRSGSVSQGNR*
Ga0075433_1186235023300006852Populus RhizosphereDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFTWLKAQPRVGNAIRVKRATRSRVKKSR*
Ga0075425_10195474323300006854Populus RhizosphereMTSTIDRWIQLGELAKLFGLSEESMKRRVAKNHGFPLRRLTPYATPGVLESELFTWLKAQPRVGNAIRVKRATRSRVKKSR*
Ga0075425_10209887813300006854Populus RhizosphereMTGVVDRWMQLGELAKLFGLSEESIKRLAKNQGFPLRRLTPYATPGALESELVPWLKVQPLVGQPVRAKHAPRSGAKKFRQGS*
Ga0075429_10009191123300006880Populus RhizosphereMTGTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFRWLKAQPLVGHPIRVKRATRSRVKKSR*
Ga0075426_1040097723300006903Populus RhizosphereMTGKSDQWLQLPEIAKMFGLSEESIKRLAKTQGFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHS
Ga0075436_10011132533300006914Populus RhizosphereLGELAELFGLSEESIKRFAKKNGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRGSRVS*
Ga0097620_10010708543300006931Switchgrass RhizosphereMTGLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNAKKDDKSGSDR*
Ga0075435_10013210413300007076Populus RhizosphereHAPMTSTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPRVGHPIRVKRATQSRAKKSR*
Ga0066710_10116902523300009012Grasslands SoilMTGIIDRWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESEFVRWLKAQPLVGKPIRVKRPPMRKNKNRRYR
Ga0099830_1026423923300009088Vadose Zone SoilMRTIDGWLQLAELSIMFGLSEESIKRLAKKHGFPLPRITPYAKPGVLESELVRWMKAQPLTGRAVRKKAHSK*
Ga0099828_1179585523300009089Vadose Zone SoilMGITDGWLQLAELSKMFGLSEESIKRLVKNHGFPLRRITPYATPGVLESELVRWMKAQSLAGRRIRAKKKSR*
Ga0111539_1138543623300009094Populus RhizosphereKAGILFAASRQATMTRTSDRWIQLGELAELFGLSEESIKRFAKKNGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRGSRVSRSRNISVPKHRR*
Ga0075418_10003006173300009100Populus RhizosphereMTGSIDQWIQLAELSELFGLSEESIKRLSKSHGFPLRRLTPYATPGVIQSELLRWLKAQPLVGPAVRVKKPRVRKSRSGSVSQGNR*
Ga0075418_1003370773300009100Populus RhizosphereMTRTSDRWIQLGELAELFGLSEESIKRFAKKNGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRGSRVSRSRNISVPKHRR*
Ga0066709_10008317343300009137Grasslands SoilMTGKSDQWLQLPEIAKMFGLSEESIKRLAKTQVFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHSIGKR*
Ga0066709_10128449633300009137Grasslands SoilMTGIIDRWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESELVRWLKAQPLVGKPIRV
Ga0066709_10395298013300009137Grasslands SoilMTGSTDGWLQLAELSELFGLSEESIKRLAKSHGFPLRRLTPYATPGVIQSELLRWLKAQPLVGPAVRAKKPRVRKSPSGSVSQGNR*
Ga0114129_1032705123300009147Populus RhizosphereMYNAATTPMTWIIDRWLQLAELAKIFGLSEESIKRLAKNRGLPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAIAGNRSRRKKSKIHTA*
Ga0105092_1013211833300009157Freshwater SedimentMTGTIDRWIQLGELAKLFGLSEEGMKRLAKNHGFPLRRLTPYATPGVLESEFLRWLKAQPLVGQPIRAKRATRSRGKKSR*
Ga0075423_1031881443300009162Populus RhizosphereFKAGILFAASRQATMTRTSDRWIQLGELAELFGLSEESIKRFAKKNGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRGSRVSRSRNISVPKHRR*
Ga0134127_1002383033300010399Terrestrial SoilAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNAKKDDKSGSDR*
Ga0137393_1002646333300011271Vadose Zone SoilMWITDRWLQLAELSKMFGLSEESIKRLAKKHGFPLPRITPYAKPGVLESELVRWMKAQPLTGRAVRKKAHSK*
Ga0137442_112579113300011414SoilMDVEVSQIMTGKTDQWLQLTEIVKMFGLSEESIKRLAKAQGFPLRRLTPYATPGVLESELIAWLKAQPNIGAPVRAKRSTKSKRSMGKR*
Ga0137338_115372613300012174SoilMIGMLDQWLPLTELARMFGLSEESIKRLAKTHGFPLRRLTPYATPGVLESEMVSWLKAQPHVGQPVRVKRANSRPKNRRKHA*
Ga0137388_1044248923300012189Vadose Zone SoilMRTIDGWLQLAELSIMFGLSEESIKRLAKKHGFPLRRITPYAKPGVLESELVRWMKAQPLTGRAVRKKAHSK*
Ga0137383_1090859633300012199Vadose Zone SoilDRWLRLDELAKLFGLTEESIKRVAKNHGFPLRRLTPYATPGVLESELVHWLKAQPLVGQPVRIKRPSSRKKMNR*
Ga0137363_1012786933300012202Vadose Zone SoilMTGIIDQWLQLAELAKLFGLSEESIKRVAKNHGLPLRRLTPYATPGVLESELLHWLKAQPLVGKPIRVKRPPMRKKKNRR*
Ga0137399_1080557923300012203Vadose Zone SoilMSGIVDQWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESELVHWLKTQPLVGKPIRVKRPMRKKKNRR*
Ga0137399_1106336613300012203Vadose Zone SoilMTGKSDQWLQLPEIAKMFGLSEESIKRLAKAQGFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHSIGKR*
Ga0137374_1089293623300012204Vadose Zone SoilMTWIIDRWLQLAELAKLFGLSEESIKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAIAGNQSRRKKTR*
Ga0137362_1030497523300012205Vadose Zone SoilMTGIIDQWLQLAELAKLFGLSEESIKRVAKNHGLPLRRLTPYATPGVLESELVHWLKAQPLVGKPIRVKRPPMRKKKNRR*
Ga0137362_1052902613300012205Vadose Zone SoilMTGTGDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPLVGKPIRAKRVTRSRGKNSR*
Ga0137381_1059748323300012207Vadose Zone SoilMTGTIDRWIQLGELAKSFGLSEDSIKRLAKNNGFPLRRLTPYATPGVLESELVRWLKAQPLVGKPVRTSRISRSRKFSLPKHRR*
Ga0137376_1008816033300012208Vadose Zone SoilMTGSTDGWLQLAELSELFGLSEESIKRLAKSHGFPLRRLTPYATPGVIQSELLRWLKAQPLVGPAVRAKKPRVRKSPSDSVSQGNR*
Ga0137379_1050154923300012209Vadose Zone SoilMTRTSDRWIQLGDLAELFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPLVGRAVRASRVSRSRNISVPKHRR*
Ga0137379_1110647913300012209Vadose Zone SoilDQWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESELVHWLKAQPLVGQPVRIKRPSSRKKMNR*
Ga0137377_1008640713300012211Vadose Zone SoilMTGSTDGWLQLAELSELFGLSEESIKRLAKSHGFPLRRLTPYATPGVIQSELLRWLKAQPFVGPAVRAKKPRVRKSPSDSVSQGNR*
Ga0137372_1026092113300012350Vadose Zone SoilTMTRTSDRWIQLGELAELFGLSEESIKRFAKKKGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGQAVRAKRVTRSRNISVPEHRR*
Ga0137372_1052987513300012350Vadose Zone SoilMTRTSDRWIQLGELAELFGLSEESIKRLAKNGFPLRRLTPYATPGVLESELFSWLKAQPLVGRAVRASRVSRSRNISVPKHRR*
Ga0137367_1008819953300012353Vadose Zone SoilMTGTIDRWIQLGELAKSFGLSEDSIKRLAKNNGFPLRRLTPYATPGVLESELVRWLKAQPLVGK
Ga0137366_1062928113300012354Vadose Zone SoilLGELAELFGLSEESIKRFAKKKGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRASRVSRSRNISVPKHRR*
Ga0137369_1053039413300012355Vadose Zone SoilMTWINDRWLQLAELAKLFGLSEESMKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAIAGNQSRRKKTR*
Ga0137385_1089192813300012359Vadose Zone SoilTHTRMTGTGDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPLVGKPIRGKRATRSRGKKSRSRP*
Ga0137375_1043024613300012360Vadose Zone SoilMTWIIDRWLQLAELAKLFGLSEESIKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPI
Ga0137361_1111319113300012362Vadose Zone SoilMTGIIDQWLQLAELAKLFGLSEESIKRVAKNHGLPLRRLTPYATPGVLESELFSWLKAQPLVGKPIRAKRVTRSRGKKSRSRP*
Ga0137373_1066622013300012532Vadose Zone SoilMNRKATGIIDQWLTLDELAKLFGLSEESIKRFAKKNGFPLRRLTPYATPGVLESELFRWLKAQPLVGPAVRASRVSRSR*
Ga0137394_1010295863300012922Vadose Zone SoilMSGIIDQWLQLAELAKLFGLSEESNKRVTKNHGFPLRRLTPYATPGVLESELVHWLKAQPLVGKPIRVK
Ga0137419_1089151013300012925Vadose Zone SoilMTGKSDQWLQLPEIAKMFGLSEESIKRLAKAQGFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHSIGK
Ga0137404_1004932753300012929Vadose Zone SoilMTGSTDGWLQLAELSELFGLSEESIKRLAKSHGFPLRRLTPYATPGVIQSELLRWLKAQPLVGPAVRAKKQRVRKSPSGSVSQGNR*
Ga0137404_1200130813300012929Vadose Zone SoilMTGIIDQWLQLAELAKLFGLSEESIKRVAKNHDFPLRRLTPYATPGVLESELVHWLKAQPLVGKPIRVKRPPMRKKKNRR*
Ga0137407_1100943523300012930Vadose Zone SoilMTHTSDRWIQLGELAELFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPRVGRPIRVKRAKRSRGKKSRQRP*
Ga0137407_1205586113300012930Vadose Zone SoilMTGIIDRWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESELVHWLKAQPLVGKPIRVKRPPMRKKESTVRGET*
Ga0137407_1217491813300012930Vadose Zone SoilMSRKATGIIDQWLTLGELAKLFGLSEESIKRLAKKNGFPLRRLTPYATPGVLESELFRWLKAQPPVGPRPKKPSVRKSRSGSLSQGKR*
Ga0162653_10001424523300012937SoilMSPNLRSPGVIDKWLQLAELAELFGLSEESIKRMAKKNGFPLRRLTPYATPGVIKSELLRWLKAQPLVGRAVRASRVSRSRNTSQPQYRRSIRSEAITGKTSSVE*
Ga0162650_10008823423300012939SoilGRLQLAELAKLFGLSEGSIRRLVKKNGFPLRRLTPYAAPGVLESELVRWISPQTCRN*
Ga0163162_1267068013300013306Switchgrass RhizosphereMTGQLDQWLQLAEIAKMFGLSEESIKRLARAHDFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNAKKDDKSGSDR*
Ga0180084_110773513300014874SoilMTGKIDQWLQLAEIAKMFGLSEESIKRLAKAHGFPLRRLTPYATPGALESELVAWLKAQPPVGAPVRSKRSVRHKHSSAKR*
Ga0137403_10008839143300015264Vadose Zone SoilMSRKATGIIDQWLTLGELAKLFGLSEESIKRLAKTNGFPLRRLTPYAPGVIESKFLRWLKAQPLMGRAARTKRVTR*
Ga0134089_1012979923300015358Grasslands SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPLRGQPIRARPTIKSRRKK
Ga0184604_1000841023300018000Groundwater SedimentMTGQLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTPKHSKLKR
Ga0184608_1017993233300018028Groundwater SedimentMSPNLRNTGVIDKWLQLAELAKLFGLSEESIRRLVKKNGFPLRRLTPYAAPGVLESELVRWISPQTCRN
Ga0184620_1004787523300018051Groundwater SedimentMTGQLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNAKKLKDDKLGSDR
Ga0184638_102362923300018052Groundwater SedimentMRTTDRWLQLAELSKMFGLSEESIKGLVKNHGFPLRRITPYATPGVLESELFSWLKAQPRNGTAVRKKLYSK
Ga0184638_103364913300018052Groundwater SedimentMTWVIDRWLHLAELAELFGLSEGSIKRVAKNHGFPLRRLTPYATPGVLESELLSCLKTQPLVGQPIRAKRVTRSHSKKR
Ga0184638_112636233300018052Groundwater SedimentMSSDLRNTGVIDKWLQLAELAKLFGLSEGSIRRLAKKNDFPLRRLTPYAAPGVLESELVRWISPQTCRN
Ga0184626_1000465933300018053Groundwater SedimentMTGVIDQWLQLSELSKLFGLSEESIKRRAKKNGFPLRRLTPYATPGVLVSEFLRWLKAQPLVGQPVRSKRPSIRKKMKKG
Ga0184626_1003195333300018053Groundwater SedimentMSGIIDQWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESELVHWLKAQPLVGKPIRVKRPPMRKKESTVRCETKKGKRLLA
Ga0184626_1009589623300018053Groundwater SedimentMSPNLRSPGVVDKWLQLAELAELFGLSEESIKRLAKSQCFPLRRLTPYATPGVIKSELLRWLKAQPLVGRAVRIKRKNE
Ga0184621_1022552813300018054Groundwater SedimentMYNVATTPMTWIIDRWLQLAELAKLFGLSEESIKRLAKNRGFPLRRLTPYATPGVLESELVLWLKAQPLIGQPIRAMASSRSRRKTSR
Ga0184621_1028705423300018054Groundwater SedimentMTRTSDRWIQLGELAELFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFRWLKAQPLVGPAVRASRVTRSRNFSVPEHRR
Ga0184623_1005888533300018056Groundwater SedimentMIETIDRWLQLTEVAKMFGLSEESIKRLAKTHDLPLRRVTPFATPGTLESELVRWLKTQPRVGPPVRSRNGNGAHARDKI
Ga0184623_1024134423300018056Groundwater SedimentMTGTVDRWLQLAELARMFGLSEESIKRLAKTHGFPLRRLTPYATPGVLESELVSWLKAQPHVGQPVRARRANSRPKKSRK
Ga0184619_1016190513300018061Groundwater SedimentMTHTSDRWIQLGELAELFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPLVGRAVRASRVSRSRNPPQ
Ga0184619_1023370813300018061Groundwater SedimentMTGKSDQWLQLPEIAKMFGLSEESIKRLAKTQGFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHSIGKR
Ga0184617_102621233300018066Groundwater SedimentMTVQLDQWLQLAEIAKMVGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNAKKLKDDKLGSDR
Ga0184618_1012910023300018071Groundwater SedimentMTWITDRWLQLAELAKLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELIRWLKAQPLIGQAIRATARRKKSG
Ga0184618_1039738913300018071Groundwater SedimentMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNAKKVKDDKFGSGH
Ga0184635_1004460713300018072Groundwater SedimentDLRNTGVIDKWLQLAELAKLFGLSEGSIRRLVKKNGFPLRRLTPYAAPGVLESELVRWISPQTCRN
Ga0184635_1013635813300018072Groundwater SedimentMSPNLRSPGVVDKWLQLAELAELFGLSEESIKRMAKKNGFPLRRLTPYATPGVIKSELLRWLKAQPLVGRAVRTKRVTRSRNISLQKHRR
Ga0184624_1003943053300018073Groundwater SedimentMSPNLRDTGVIDKWLQLAELAKLFGLSEGSIRRLVKKNGFPLRRLTPYAAPGVLESELVRWIRPQTCRN
Ga0184632_1001260353300018075Groundwater SedimentMSPNLRDTGVIDKWLQLAELAKLFGLSEGSIGRLAKKNGFPLRRLTPYAAPGVLESELVRWISPQTCRN
Ga0184609_1001751343300018076Groundwater SedimentMTWTTDRWLQLAELAKLFGLSEESIKRLAKNRGFPLRRLTPYATPGVLESELVLWLKAQPLIGQPIRAIASNRSRRKTSR
Ga0184609_1010337423300018076Groundwater SedimentMTGIMDRWLQISELAKLFGLSEESIKRLAKNHMLPLRRLTPYATPGVLESELVSWLKAQPLVGLPVRATQTARSQG
Ga0184612_1023876523300018078Groundwater SedimentMSPNLRNTGVIDKWLQLAELAKLFGLSEGSIRRLAKENGFPLRRLTPYAAPGVLESELVRWISPQTCRN
Ga0184625_1003654313300018081Groundwater SedimentMSSDLRNTGVIDKWLQLAELAKLFGLSEGSIRRLVKKNGFPLRRLTPYAAPGVLESELVRWISPQTCRN
Ga0184625_1025761713300018081Groundwater SedimentMTHTSDRWIQLGELAELFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFRWLKAQPLVGPAVRASRVTRSRNFS
Ga0184629_1007453723300018084Groundwater SedimentMTGTVDRWIQLAELARMFGLSEESIKRLAKTHGFPLRRLTPYATPGVLESELVSWLKDQPHVGPPVRTISRAKSRPKNRRKHA
Ga0190265_1201428313300018422SoilMTETVDGWLQLAELARMFGLSEESIKRLAKTHGFPLCRLTPYATPGVLESELVSWLKFQPAVGRPIRARRANLRPKKSR
Ga0190275_1024429623300018432SoilMIEQIDRWLQLPEIAKMFGLSEESIKRLAKTHCLPLRRVTPFATPGALESELVRWLKAQPQVGAPVRSKTSGVSKFTNTKKSRNV
Ga0066662_1088144823300018468Grasslands SoilMTGIIDRWLQLAELAKLFGLSEESIKRVAKNHGFPLRRLTPYATPGVLESEFVRWLKAQPLVGKPIRVKRPPMRKKKNPRWRGKT
Ga0066669_1049438913300018482Grasslands SoilIMTGKSDQWLQLPEIAKMFGLSEESIKRLAKTQGFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHSIGKR
Ga0190273_1086853523300018920SoilMYNAATTPMTWTIDRWLQLAELAKIFGLSEESIKRLAKNRGLPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRALASSRSRRKTSR
Ga0193747_103099523300019885SoilSQIMTGKSDQWLQLPEIAKMFGLSEESIKRLAKTQGFPLRRLTPYATPGVLESELVSWLKAQPRVGAPVRAKRTVRSKHSIGKR
Ga0193743_104264623300019889SoilMTGQLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKARPKVGPPIRVNRPRTRKHSNAKKLKDDKLGSDR
Ga0193739_100490233300020003SoilMTWITDRWLQLAELAKLFGLSEESMKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRATAGNRSRRKTSR
Ga0193749_102107923300020010SoilMDVEVSQIMTGKTDQWLQLTEIVKMFGLSEESIKRLAKAQGFPLRRLTPYATPGVLESELIAWLKAQPNIGAPVRAKRSTKSKRSMGKR
Ga0193716_101332133300020061SoilMTGQLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRSRTRKHSNAKKLKDDKLGSDR
Ga0193709_110619713300021411SoilMSGQLDQWLQLAEIAKMFGLSEESIKRLARAHGFPLRRLTPHATPGVFQSELIAWFKAQPKVGPPIRVNRPRTRKHSNAKKD
Ga0224452_117882613300022534Groundwater SedimentMSPNLRSPGVIDKWLQLAELAELFGLSEESIKRLAKSQCFPLRRLTPYATPGVIKSELLRWLKAQPLVGRAVRIKRKNE
Ga0212128_1020662023300022563Thermal SpringsMNGLIDRWLQLPEIAKMFGLSEESIKRLAKTHGLPLRRVTPFATPGVLESEMVKWLKAQPLVGPPVRSKGRIKRVR
Ga0222622_1002489253300022756Groundwater SedimentMTHTSDRWIQLGELAELFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFRWLKAQPLVGPAVRASRVTRSRNFSVPEHRR
Ga0222622_1096284013300022756Groundwater SedimentMYNAATTPMTWTIDRWLQLAELAKIFGLSEESIKRLAKNRGLPLRRVTPYATPGILESELVLWLKAQPLIGQPIRALASSRARRKTFK
Ga0207684_1106917413300025910Corn, Switchgrass And Miscanthus RhizosphereMTGVVDRWMQLGELAKLFGLSEESIKRLAKNQGFPLRRLTPYATPGALESELVPWLKVQPLVGQPVRAKRAPRSGAKKFRQSS
Ga0209154_101720533300026317SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPRRGQPIRPRPTIKSRRKKSRKRD
Ga0209131_100361973300026320Grasslands SoilMTGKTDQWLQLTEIVKMFGLSEESIKRLAKAQGFPLRRLTPYATPGVIESELIAWLKAQPNIGAPVRAKRSTKSKRSMGKR
Ga0209801_102148023300026326SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPLRGQPIRARPTIKSRRKKSRKRD
Ga0209804_106836013300026335SoilMNWIVDRWLQLAELARLFGLSEESIKRLAKNHGFPLRRLTPYATPGVLESELVPWLKAQPLRGQPIRARPTIKS
Ga0256867_1002590453300026535SoilMKSDSNDGDPSSMTGTIDRWLQLAELAKLFGLSEESIKRLAKSHGFPLRRLTPYATPGALESELIPWLKAQPLVGQPIRLNRHAKSPRKTLRKRPNE
Ga0209819_1004511013300027722Freshwater SedimentMTGTIDRWIQLGELAKLFGLSEEGMKRLAKNHGFPLRRLTPYATPGVLESEFLRWLKAQPLVGQPIRAKRATRSRGKKSR
Ga0209701_1014777533300027862Vadose Zone SoilMRTIDGWLQLAELSIMFGLSEESIKRLAKKHGFPLRRITPYAKPGVLESELVRWMKAQPLTGRAVRKKAHSK
Ga0209488_1097960813300027903Vadose Zone SoilMTGTGDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPLVGKPIRAKRVT
Ga0209382_1004156773300027909Populus RhizosphereMTRTSDRWIQLGELAELFGLSEESIKRFAKKNGFPLRRLTPYAKPGVIESEFLRWLKAQPLVGRAVRGSRVSRSRNISVPKHRR
Ga0209382_1025949613300027909Populus RhizosphereMTSTIDRWIQLGELAKLFGLSEESMKRVAKNHGFPLRRLTPYATPGVLESELFSWLKAQPRVGHPIRVKRATQSRAKKSR
Ga0209382_1076589523300027909Populus RhizosphereMDQWLTLGELAELFGLSEESIKRLAKKNGFPLRRLTPYATPGVLESELFRWLKAQPPVGRAVRARRVLGARACLLINPFLA
Ga0209382_1088773023300027909Populus RhizosphereMYNTTTTAMTWIIDRWLQLAELAKLFGLSEESIKRLAKNRGFPLRRVTPYATPGVLESELVLWLKAQPLIGQPIRAMANKRLRRKTSTR
Ga0137415_1053186433300028536Vadose Zone SoilMTGIIDQWLQLGELAKLFGLSEESIKRLAKSHGFPLRRLTPYATPGVIQSELLRWLKAQPLVGPAVRAKKPRVRKSPSGSVSQGNR
Ga0299906_1070596513300030606SoilMVILASSMTGTIDRWLQLAELAKLFGLSEESIKRLAKSHGFPLRRLTPYATPGALESELIPWLKAQPLVGQPIRLNRHTKSPRKTLRKRPNE
Ga0268386_1001727343300030619SoilMTGTIDRWLQLAELAKLFGLSEESIKRLAKSHGFPLRRLTPYATPGALESELIPWLKAQPLVGQPIRLNRHAKSPRKTLRKRPNE
Ga0299913_1072091713300031229SoilMKSDSNDGDPSSMTGTIDRWLQLAELAKLFGLSEESIKRLAKSHGFPLRRLTPYATPGALESELIPWLKAQPLVGQPIRLNRRTKSPRKTLRKRPNE
Ga0307479_1072113323300031962Hardwood Forest SoilMTGTIDRWLQIGELAKLFGLSEESIKRLAKNRGFPLRRLTPYATPGVLESELMPWLKDQLPGGQPVRAKPASARSQKD
Ga0315910_1013551513300032144SoilMNGTFDKWIRISELSNLFGVSEESIRRLAKSHGFPLRRLTPYAIPGVLESELLSWLNAQPRIGRPIRAKASRRKRR
Ga0370498_188366_3_2873300034155Untreated Peat SoilMGSMTGTLDQWLQLPEIAKMFGLSEDSIKRLAKAHGFPLRRLTPYATPGVLESELIDWLKAQPQVGAPVRVKSPVRRKDSSAKKAKDDGFGSVR
Ga0364943_0368180_88_3273300034354SedimentMTWVIDRWLHLAELAELFGLSEGSIKRVAKNHGFPLRRLTPYATPGVLESELLSWLKTQPLVGQPIRPKRVTRSHSKKR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.