NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F032972

Metagenome / Metatranscriptome Family F032972

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F032972
Family Type Metagenome / Metatranscriptome
Number of Sequences 178
Average Sequence Length 70 residues
Representative Sequence DLNAPQVIQRLLEPGRKLDAEVIVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVEAMREALRRRKAA
Number of Associated Samples 138
Number of Associated Scaffolds 178

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 1.12 %
% of genes near scaffold ends (potentially truncated) 98.88 %
% of genes from short scaffolds (< 2000 bps) 77.53 %
Associated GOLD sequencing projects 127
AlphaFold2 3D model prediction Yes
3D model pTM-score0.65

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (93.258 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil
(30.899 % of family members)
Environment Ontology (ENVO) Unclassified
(68.539 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(73.596 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96
1JGI12053J15887_101481261
2JGI25385J37094_101664771
3JGI25384J37096_101650472
4JGI25382J43887_100801922
5Ga0066674_100076441
6Ga0066674_100130761
7Ga0066672_101306671
8Ga0066673_100958181
9Ga0066690_100233381
10Ga0066690_100236961
11Ga0066685_110412331
12Ga0066675_100013721
13Ga0066675_100222901
14Ga0066682_104877641
15Ga0066681_107292811
16Ga0070699_1008195592
17Ga0070699_1008841381
18Ga0070697_1000999381
19Ga0070697_1013093381
20Ga0066701_102846992
21Ga0066701_108359151
22Ga0066701_109506821
23Ga0066692_101382621
24Ga0066707_104705061
25Ga0066698_103827201
26Ga0066705_100095853
27Ga0066708_110390791
28Ga0066654_108192701
29Ga0066654_108262101
30Ga0066706_102468431
31Ga0066651_100532231
32Ga0066651_101596242
33Ga0066651_107359481
34Ga0066656_101094723
35Ga0066656_108178381
36Ga0066658_101898521
37Ga0066665_100198221
38Ga0066665_102194881
39Ga0066665_112224741
40Ga0066659_112060201
41Ga0066660_100718431
42Ga0066660_102333732
43Ga0075434_1000165406
44Ga0075426_100315493
45Ga0079219_102085972
46Ga0066710_1000775723
47Ga0066710_1026371751
48Ga0099827_102172073
49Ga0066709_1013165422
50Ga0066709_1024491391
51Ga0066709_1037376761
52Ga0127461_10304941
53Ga0127473_10971751
54Ga0127501_11148532
55Ga0127501_11296981
56Ga0127470_11117591
57Ga0127494_10349391
58Ga0127474_11075431
59Ga0127491_10356091
60Ga0127460_11439342
61Ga0127465_10991811
62Ga0127498_10182641
63Ga0127493_11109981
64Ga0127455_10990681
65Ga0127459_11916491
66Ga0127456_10324112
67Ga0134070_101181872
68Ga0134070_103608551
69Ga0134088_100126203
70Ga0134088_100144061
71Ga0134109_100505561
72Ga0134084_101559522
73Ga0134080_101876572
74Ga0134063_100044313
75Ga0134071_103404401
76Ga0134071_105294791
77Ga0134071_106877311
78Ga0134066_100359932
79Ga0137391_105760251
80Ga0137393_106760891
81Ga0137393_108675612
82Ga0137399_113597021
83Ga0137374_100575171
84Ga0137380_100724393
85Ga0137380_111151462
86Ga0137376_100521251
87Ga0137376_103989331
88Ga0137376_111760291
89Ga0137377_100610893
90Ga0134028_10000731
91Ga0134028_12703601
92Ga0137387_103779852
93Ga0137372_100938753
94Ga0137386_100690131
95Ga0137386_101333732
96Ga0137367_101443942
97Ga0137366_100729681
98Ga0137366_103364152
99Ga0137369_105091412
100Ga0137371_102853252
101Ga0137384_105707131
102Ga0137375_111812611
103Ga0137360_102682931
104Ga0134037_11184562
105Ga0134025_11799811
106Ga0134058_10654211
107Ga0134058_12137711
108Ga0134033_10321091
109Ga0134036_12397271
110Ga0134030_10322532
111Ga0134030_11207701
112Ga0134031_11386023
113Ga0134056_12641571
114Ga0134048_10796681
115Ga0134024_13629822
116Ga0134041_11489242
117Ga0134045_10136221
118Ga0134045_12942162
119Ga0134060_11493612
120Ga0137373_106651272
121Ga0137373_111742791
122Ga0137396_100051541
123Ga0137396_112811381
124Ga0137394_114736041
125Ga0137404_110302292
126Ga0137407_100815071
127Ga0134110_100470282
128Ga0134087_103414822
129Ga0134078_105740331
130Ga0137411_13520621
131Ga0137420_12175061
132Ga0134072_100109673
133Ga0134085_101163822
134Ga0134085_101745052
135Ga0132256_1006257202
136Ga0134069_13186861
137Ga0134112_102079372
138Ga0134074_10512252
139Ga0134083_101454702
140Ga0066655_102245923
141Ga0066667_100655913
142Ga0066667_102871751
143Ga0066667_111747381
144Ga0066669_101775432
145Ga0184641_14719432
146Ga0248483_1758811
147Ga0209641_101031573
148Ga0207663_100018979
149Ga0209235_10421573
150Ga0209235_11979081
151Ga0209236_10208533
152Ga0209027_10467561
153Ga0209468_10639913
154Ga0209265_10097863
155Ga0209761_10861982
156Ga0209686_11428151
157Ga0209155_10671502
158Ga0209801_10227651
159Ga0209266_12059672
160Ga0209266_12143771
161Ga0209802_10436914
162Ga0209375_10236831
163Ga0209375_10240051
164Ga0209803_11493933
165Ga0209690_12904011
166Ga0209378_10693663
167Ga0209058_12143171
168Ga0209157_10807521
169Ga0209376_13354441
170Ga0209156_100275523
171Ga0208995_10267491
172Ga0209689_10881921
173Ga0209177_100211011
174Ga0209180_100620161
175Ga0209283_104613642
176Ga0307308_100309933
177Ga0307480_10157681
178Ga0307473_106714062
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Fibrous Signal Peptide: No Secondary Structure distribution: α-helix: 38.14%    β-sheet: 8.25%    Coil/Unstructured: 53.61%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060DLNAPQVIQRLLEPGRKLDAEVIVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVEAMREALRRRKAASequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.65
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
93.3%6.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Soil
Vadose Zone Soil
Grasslands Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Populus Rhizosphere
Arabidopsis Rhizosphere
19.7%30.9%27.5%10.1%2.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12053J15887_1014812613300001661Forest SoilSLPDLNAPEVIRRLLEPGRRLDAEVIVVTGGMPDQASVELRQMGVKVILNKAAGMPAVVDAMREALRRRKAA*
JGI25385J37094_1016647713300002558Grasslands SoilPDLNAPQVLQRLLEPGRKLDAEIIVVTGGMPQAATAQLKDMGVKVIVNKSEGMPAVVDAMRQALQRRKVA*
JGI25384J37096_1016504723300002561Grasslands SoilGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
JGI25382J43887_1008019223300002908Grasslands SoilTLPDLNAPQVLQRLLEPGRKLDAEIIVVTGGIPETATKQLKDMGVKVIVNKSEGMPAVMEAVQEALRRRKVA*
Ga0066674_1000764413300005166SoilLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVNKTDGMPAVVEAMRQALKRRKAA*
Ga0066674_1001307613300005166SoilLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVSKTDGMPGVVEAMRQALKRRKAA*
Ga0066672_1013066713300005167SoilRLLEPGRRLDAEVMVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVDAMREALRRRKAA
Ga0066673_1009581813300005175SoilPEVIRRLLEPGRKLDAEVMVVTGGMPEQAAVELRQMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0066690_1002333813300005177SoilVHPTLIVLDYTLPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0066690_1002369613300005177SoilQVLQRLLEPGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0066685_1104123313300005180SoilDYSLPDLNAPEVIRRLLEPGRRLDAEVMVVTGGMPDQAAVALRQMGVKVILNKAAGMPAVVEAMGEALRRRKAA*
Ga0066675_1000137213300005187SoilTLPDLNATQVLQRLLEPGRRLDAEIIVVTGGIPEAASAQLRDMGVKVIVTKAEGMPAVVEAMRQALQKRKVA*
Ga0066675_1002229013300005187SoilLIVLDYSLPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0066682_1048776413300005450SoilQVIQRLLEPGRRSDAEVVVITGGMPDQAAQRLREVGVRVILNKGEGMAAVVEAIRQALRRRKAA*
Ga0066681_1072928113300005451SoilVVVITGGMPDQAAQRLREVGVRVILNKGEGMAAVVEAIRQALRRRKAA*
Ga0070699_10081955923300005518Corn, Switchgrass And Miscanthus RhizospherePSLIVLDYSLPDLNAPQVIRRLLEPGRQLDAEVLVVTGGMPEAAGKELREMGVKVILNKVEGMPAVVEAMRQALKRRKAA*
Ga0070699_10088413813300005518Corn, Switchgrass And Miscanthus RhizosphereQVIQRLLEPGRRLDAEVVVITGGMPDQAAQQLREMGVRVILNKGEGMAAVVEAIRQALRRKKAA*
Ga0070697_10009993813300005536Corn, Switchgrass And Miscanthus RhizosphereDYSLPDINAPEVIRRLLEPGRQLDVEVIVVTGGMPEAAAVELRQMGVKTIVNKAEGMQSVVEAMQQALKRRKVA*
Ga0070697_10130933813300005536Corn, Switchgrass And Miscanthus RhizosphereLLEPGRRLDAEVVVITGGMPDQAAQQLREMGVRVILNKGEGMAAVVEAIRQALRRKKAA*
Ga0066701_1028469923300005552SoilDYTLPDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAAAAQLRGMGVKVIVNKADGMPGVVEAMRQALKRRKAA*
Ga0066701_1083591513300005552SoilSLPDLNAPEVIRRLLEPGRKLDAEVMVVTGGMPEQAAVELRQMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0066701_1095068213300005552SoilLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0066692_1013826213300005555SoilSLIVLDYSLPDLNASQVIQRLLEPGRRLDAEVMVVTGGMPAEAGVQLRQMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0066707_1047050613300005556SoilLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVSKTDGMPGVVEAMRQALKRRKAA*
Ga0066698_1038272013300005558SoilYSLPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0066705_1000958533300005569SoilTLPDLNATQVLQRLLEPGRRLDAEIIVVTGGIPEAASAQLRDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0066708_1103907913300005576SoilDYSLPDLNAPEVIRRLLEPGRQLDVEVIVVTGGMPEKAAVELREMGVKTIVNKVDGMQAVVEAMQQALKRRKVA*
Ga0066654_1081927013300005587SoilLIVLDYSLPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPVEAAVELRQMGVKVILNKSAGMSAVVEAMREALRRRKAA*
Ga0066654_1082621013300005587SoilLIVLDYSLPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSDGMPAVVDAMREALRRRKAA*
Ga0066706_1024684313300005598SoilLEPGRKLDSEIIVVTGGIPEAASSQLKDMGVKVIVSKAEGMPAVVDAMRKALLRRKVA*
Ga0066651_1005322313300006031SoilQPSLIVLDYSLPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0066651_1015962423300006031SoilPEVIRRLLEPGRKLDAEVMVVTGGMPEQAAVELRQMGVKVILNKSAGMSAVVEAMREALRRRKAA*
Ga0066651_1073594813300006031SoilPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0066656_1010947233300006034SoilRLLEPGRRLDAEVIVVTGGMPEAAAAQLRQMGVKVIVNKADGMPGVVDAMRQALKRRKAA
Ga0066656_1081783813300006034SoilLEPGRQLNVEVIVVTGGMPERAAVELREMGVKTIVNKVEGMQAVVEAMQQALKRRKVA*
Ga0066658_1018985213300006794SoilDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0066665_1001982213300006796SoilDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVNKTDGMPAVVEAMRQALKRRKAA*
Ga0066665_1021948813300006796SoilPEVIRRLLEPGRKLDAEVMVVTGGMPEQAAVELRQMGVKVILNKSAGMPAVVEAMREALRRRKAA*
Ga0066665_1122247413300006796SoilRVHPTLIVLDYTLPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0066659_1120602013300006797SoilQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0066660_1007184313300006800SoilDYTLPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0066660_1023337323300006800SoilLQRLLEPGRKVDAEIIVVTGGIPEAASAQLKSMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0075434_10001654063300006871Populus RhizosphereKLDAEVIVVTGGMPAEAAVQLRQMGVKMILNKAEGMPAVVDAMREALRRRKAA*
Ga0075426_1003154933300006903Populus RhizosphereIQRLLEPGRKLDAEVIVVTGGMPAEAAVQLRQMGVKMILNKAEGMPAVVDAMREALRRRKAA*
Ga0079219_1020859723300006954Agricultural SoilIGRVQPSLIVLDYSLPDLNAPQVIRRLLEPGRQLDAEVLVVTGGMPDAAGKELREMGVKVILNKVEGMPAVVEAMRQALKRRKAA*
Ga0066710_10007757233300009012Grasslands SoilVIVLDFTLPDLNATQVIERLLEPGRRLDAEVIVVTGGMPEAAAAQLRQMGVKVIVNKADGMPGVVDAMRQALKRRKAA
Ga0066710_10263717513300009012Grasslands SoilVIRRLLEPGRRLDAEVMVVTGGMPEQAAGELRQMGVKVILNKTAGMPAVVEAMREALRRRKAA
Ga0099827_1021720733300009090Vadose Zone SoilRLLEPERRLDAEVIVVTGGVPGEEEAALRKLGVRVILNKVEGMTAVLEAMREALRRRKAA
Ga0066709_10131654223300009137Grasslands SoilLLGIGRVQRALIVLDYGLPDLNAAQVIQRLLEPGRRLDTEVVVITGGMPDQAAQQLREVGVQVIVNKADGMAAVVEAIRQALGRRKAA*
Ga0066709_10244913913300009137Grasslands SoilAPQVIQRLLEPGRRLDAEVMVVTGGMPEQAAVALRQMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0066709_10373767613300009137Grasslands SoilVLDYTLPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0127461_103049413300010084Grasslands SoilPGRKLDAEIIVVTGGMPQAATAQLRDMGVKVIVNKSEGMPAVVEAMRQALKRRKVA*
Ga0127473_109717513300010096Grasslands SoilLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0127501_111485323300010097Grasslands SoilIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0127501_112969813300010097Grasslands SoilLLEPGRKLDAEIIVVTGGIPEMATKQLKDMGVKVIVNKSEGMPAVMDAVRHGLRKGKAA*
Ga0127470_111175913300010105Grasslands SoilQRLLEPGRNLDAEVMVVTGGMPEQAAVALRQMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0127494_103493913300010107Grasslands SoilYTLPDLNATQVIQRLLEPGRRLDAEVIVVTGGMPDAAARRLRDLGVKVILNKADGMSAVVEAMRAALRRRKAA*
Ga0127474_110754313300010108Grasslands SoilLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0127491_103560913300010111Grasslands SoilSLIVLDYTLPDLNATQVLQRLLEPGRKLDAEIIVVTGGMPQAATAQLRDMGVKVIVNKSEGMPAVVEAMRQALKRRKVA*
Ga0127460_114393423300010114Grasslands SoilLNAPQVIERLLEPGRKLDAEVIVVTGGMPEQAVVELRRMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0127465_109918113300010118Grasslands SoilEVIRRLLDPGRRLDAEVIVVTGGMPERAAVELREMGVKTIVNKVEGMQAVVEAMQQALKRRKVA*
Ga0127498_101826413300010124Grasslands SoilIVLDYTLPDLNAPQVLQRLLEPGRKLDAEIIVVTGGIPETATKQLKDMGVKVIVNKSEGMPAVMDAVRHGLRKGKAA*
Ga0127493_111099813300010130Grasslands SoilSLIVLDYTLPDLNAPQVLQRLLEPGRKLDAEIIVVTGGMPQAATAQLRDMGVKVIVNKSEGMPAVVEAMRQALKRRKVA*
Ga0127455_109906813300010132Grasslands SoilPDLNAAQVIQRLLEPGRRSDTEVVVITGGMPDQAAQQLREMGVRVILNKGEGMAAVVEAIRQALRRKKAA*
Ga0127459_119164913300010133Grasslands SoilLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0127456_103241123300010140Grasslands SoilDYTLPDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRGMGVKVIVNKTDGMPAVVEAMRQALKRRKAA*
Ga0134070_1011818723300010301Grasslands SoilRLLEPGRRLDAEVMVVTGGMPDQAAVALRQMGVKVILNKSAGMPAVVEAMGEALRRRKAA
Ga0134070_1036085513300010301Grasslands SoilPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0134088_1001262033300010304Grasslands SoilGRKLDAEIIVVTGGMPQAATAQLRDMGVKVIVNKSEGMPAVVEAMRQALKRRKVA*
Ga0134088_1001440613300010304Grasslands SoilGVDGLLEIGRAQPTVIVLDYSLPDLNAAQVIQRLLEPGRRLDAEVVVVTGGMPEAASAQLRGMGVKVIVNKADGMPAVVEAMRQALKRRKAA*
Ga0134109_1005055613300010320Grasslands SoilLPDLNAPEVIRRLLEPGRQLDVEVIVVTGGMPETAAVELREMGVKTIVNKVAGMQAVVEAMQQALKRRKVA*
Ga0134084_1015595223300010322Grasslands SoilDYSLPDLNAPEVIRRLLEPGRQLDVEVIVVTGGMPETAAVELREMGVKTIVNKVAGMQAVVEAMQQALKRRKVA*
Ga0134080_1018765723300010333Grasslands SoilLEPGRKLDAEIIVVTGGMPQTATAQLKDMGVKVIVNKSEGMPAVVDAMRQALQRRKVA*
Ga0134063_1000443133300010335Grasslands SoilLNAPQVLQRLLEPGRKLDAEIIVVTGGIPETATKQLKDMGVKVIVNKSEGMPAVMEAVQEALRRRKVA*
Ga0134071_1034044013300010336Grasslands SoilLNAPQVLQRLLEPGRNLDAEIIVVTGGIPETATKQLKDMGVKVIVNKSEGMPAVMEAVQEALRRRKVA*
Ga0134071_1052947913300010336Grasslands SoilLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRGMGVKVIVNKADGMPAVVEAMRQALKRRKAA*
Ga0134071_1068773113300010336Grasslands SoilLAPGRLDAEVIVVTGGMPDAAAAVLRRFGVKVILNKAEGMPAVVEALGTALKRQRGKAA*
Ga0134066_1003599323300010364Grasslands SoilRLLEPGRQLDVEVIVATGGMPDKAAVELREMGVKTIVNKVEGMQAVVEAMQQALKRRKVA
Ga0137391_1057602513300011270Vadose Zone SoilAPQVIQRLLEPGRKLDAEVIVVTGGMPAEAAVQLRHMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0137393_1067608913300011271Vadose Zone SoilDLNAPQVIQRLLEPGRKLDAEVIVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0137393_1086756123300011271Vadose Zone SoilDLNAPQVIQRLLEPGRKLDAEVIVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0137399_1135970213300012203Vadose Zone SoilVIQRLLEPGRRLDAEVIVVTGGMPDDASVELRQMGVKVILNKSDGMPAVVDAMREALRRRKAA*
Ga0137374_1005751713300012204Vadose Zone SoilLEIGRAQPTLIVLDYSLPDLNAAQVIRRLLEPGRRLDAEVIVVTGGMPAEATAPLREMGVKVIVNKAAGIPAVVEAMREALQRRQAA*
Ga0137380_1007243933300012206Vadose Zone SoilAAQVIQRLLEPGRRSDAEVVVITGGMPDQAAQRLREMGVRVILNKGEGMAAVVEAIRQALRRRKAA*
Ga0137380_1111514623300012206Vadose Zone SoilSQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0137376_1005212513300012208Vadose Zone SoilLLEIGRVRPALIVLDYTLPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASAQLRDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0137376_1039893313300012208Vadose Zone SoilLDYSLPDINAPGVIRRLLEPGRQLNVEVIVVTGGMPEQAAVQLREMGVKTIVNKVEGMQAVVEAMQQALKRRKVA*
Ga0137376_1117602913300012208Vadose Zone SoilATQVLQRLLEPGRKLDAEIIVVTGGIPEAASAQLRDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0137377_1006108933300012211Vadose Zone SoilLLEPGRKLDSEIIVVTGGIPEAASSQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0134028_100007313300012224Grasslands SoilVDGLLEIGRVRPALIVLDYTLPDLNATQVLQRLLEPGRKLDAEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0134028_127036013300012224Grasslands SoilMLPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASAQLRDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0137387_1037798523300012349Vadose Zone SoilIQRLLEPGRRLDAEVMVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0137372_1009387533300012350Vadose Zone SoilVVIVLDYSLPDLNAPQVIQRLLEPGRRLDAEVVVVTGGMPEQAAVALREMGVKTIVNKIEGMPAVVEAMRQALARRKAA*
Ga0137386_1006901313300012351Vadose Zone SoilQRLLEPGRKLDAEIIVVTGGMPQTATAQLKDMGVRVIVNKSEGMPAVVDAMRQGLQSRKVA*
Ga0137386_1013337323300012351Vadose Zone SoilEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMPAVVEAMREALRRRKAA*
Ga0137367_1014439423300012353Vadose Zone SoilRRLDAEVIVVTGGMPEQAAVALRQMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0137366_1007296813300012354Vadose Zone SoilDLNAAQVIQRLLEPGRRSDAEVVVITGGMPDQAAQRLREMGVRVILNKGEGVAAVVEAIRQALPRRKAA*
Ga0137366_1033641523300012354Vadose Zone SoilLEPGRRLDAEVMVVTGGMPEQAAVALREMGVKTIVSKIEGMPAVGEAMRQALARRKAA*
Ga0137369_1050914123300012355Vadose Zone SoilVIRRLLEPARRLDAEVIVVTGGMPAEATAPLQEMGVRVIVNKSEGIPAVVEAMREALQRRQAA*
Ga0137371_1028532523300012356Vadose Zone SoilAAQVIQRLLEPGRRSDAEVVVITGGMPDQAAQRLREVGVRVILNKGEGMAAVVEAIRQALRRRKAA*
Ga0137384_1057071313300012357Vadose Zone SoilEPGRRSDAEVVVITGGMPDQAAQRLREMGVRVILNKGEGVAAVVEAIRQALRRRKAA*
Ga0137375_1118126113300012360Vadose Zone SoilPGRKLDAEVIVVTGGMPEQAAVELRRMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0137360_1026829313300012361Vadose Zone SoilRRLLEPGRQLDAGVMVVTGGMPDVAAVELRRMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0134037_111845623300012372Grasslands SoilLQRLLEPGRKLDAEIIVVTGGMPQAATAQLRDMGVKVIVNKSEGMPAVVEAMRQALKRRKVA*
Ga0134025_117998113300012378Grasslands SoilPDLNATQVLQRLLEPGRRLDAEIIVVTGGIPEAASAQLRDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0134058_106542113300012379Grasslands SoilQRLLEPGRRLDAEVIVATGGLPADAAAQLREMGVKVIVNKSAGMPAVVEAMRDALQRRKVA*
Ga0134058_121377113300012379Grasslands SoilEPGRRLDAEVIVVTGGMPEAASAQLRGMGVKVIVNKADGMPALVDAMRQALKRRKAA*
Ga0134033_103210913300012383Grasslands SoilVDGLLEIGRVRPALIVLDYTLPALNATQVLQRLLEPGRRLDSEIIVVTGGIPEAASAQLRDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0134036_123972713300012384Grasslands SoilYSLPDLNAPEVIRRLLEPGRQLDVEVIVATGGMPDKAAVELREMGVKTIVNKVDGMQAVVEAMQQALKRRKVA*
Ga0134030_103225323300012387Grasslands SoilVRPALIVLDYTLPDLNATQVLQRLLEPGRRLDAEIIVVTGGIPEAASAQLRDMGVKVIVTKAEGMPAVVEAMRQALQKRKVA*
Ga0134030_112077013300012387Grasslands SoilLIVLDYTLPDLNATQVLQRLLEPGRRLDSEIIVVTGGIPEAASAQLRDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA*
Ga0134031_113860233300012388Grasslands SoilPQVIQRLLEPGRNLDAEVMVVTGGMPEQAAVALRQMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0134056_126415713300012397Grasslands SoilDLNAAQVIQRLLEPGRRSDVEVVVITGGMPDQAAQQLREMGVRVILNKGEGMAAVVEAIRQALRRKKAA*
Ga0134048_107966813300012400Grasslands SoilATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVNKTDGMPAVVEAMRQALKRRKAA*
Ga0134024_136298223300012404Grasslands SoilDLNAPQVLQRLLEPGRKLDAEIIVVTGGMPQAATAQLRDMGVKVIVNKSEGMPAVVEAMRQALKRRKVA*
Ga0134041_114892423300012405Grasslands SoilVLQRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA*
Ga0134045_101362213300012409Grasslands SoilGRVQPSVILLDYTLPDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRGMGVKVIVNKADGMPAVVEAMRQALKRRKAA*
Ga0134045_129421623300012409Grasslands SoilDGLLEIGRVRPALIVLDYTLPDLNATQVLQRLLEPGRKLDAEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQKRKVAQD*
Ga0134060_114936123300012410Grasslands SoilATQVIQRLLEPGRQLDAEVIVVTGGMPDAAARRLRDLGVKVILNKADGMSAVVDAMRAALRRRKAA*
Ga0137373_1066512723300012532Vadose Zone SoilLEIGRVQPGLIVLDYSLPDLNAAQVIQRLLEPGRWLDAEVIVVTGGMPSEATAPLQEMGVRVIVNKSEGIPAVVEAMREALQRRKAA*
Ga0137373_1117427913300012532Vadose Zone SoilPDLNAPQVIQRLLEPGRRLDAEVIVVTGGMPEQAAVALRQMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0137396_1000515413300012918Vadose Zone SoilDLNATQVLQRLLEPGRKVDAEIIVVTGGIPEAASAQLKGMGVKVIVSKAEGMPAVVDAMRQALQRRKVA*
Ga0137396_1128113813300012918Vadose Zone SoilAPQVIQRLLEPGRKLDAEVIVVTGGMPEQATGELRRMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0137394_1147360413300012922Vadose Zone SoilIGRVQPSLIVLDYSLPDLNAPQVIQRLLEPGRQLDAEVIVVTGGMPEPAAGELRRMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0137404_1103022923300012929Vadose Zone SoilLDYSLPDLNASQVIQRLLEPGRRLDAEVMVVTGGMPAEVTLELRRMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0137407_1008150713300012930Vadose Zone SoilLDYSLPDLNAPEVIRRLLEPGRRLDAEVMVVTGGMPDEAAVELRRMGVKVILNKSAGMPAVVDAMREALRRRKAA*
Ga0134110_1004702823300012975Grasslands SoilLPDLNASQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0134087_1034148223300012977Grasslands SoilDLNAPEVIRRLLEPGRRLDAEVMVVTGGMPEQAAGELRQMGVKVILNKAAGMPAVVEAMREALRRRKAA*
Ga0134078_1057403313300014157Grasslands SoilSLPDLNAPEVIRRLLEPGRQLDVEVIVVTGGMPETAAVELREMGVKTIVNKVAGMQAVVEAMQQALKRRKVA*
Ga0137411_135206213300015052Vadose Zone SoilIVLDFSLPDLNAPQVIQRLLEPGRKLDAEVIVVTGGMPEQATGELRRMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0137420_121750613300015054Vadose Zone SoilDYSLPDLNAPQVIRRLLEPGRRLDAEVMVVTGGMPDQAAVELRQMGVKVILNKAEGMPAVVDAMREALRRRKAA*
Ga0134072_1001096733300015357Grasslands SoilQPSLIVLDYSLPDLNAPEVIRRLLEPGRRLDAEVMVVTGGMPDQAAVALRQMGVKVILNKAAGMPAVVEAMGEALRRRKAA*
Ga0134085_1011638223300015359Grasslands SoilLLEPGRRLDAEVMVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVDAMREALRRRKAA*
Ga0134085_1017450523300015359Grasslands SoilDYTLPDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAAAAQLRQMGVKVIVNKADGMPGVVDAMRQALKRRKAA*
Ga0132256_10062572023300015372Arabidopsis RhizosphereAALVQPALIVLDYSLPDLNAPQVIRRLLEPGRQLDAEVIVVTGGMPEQAAVELRQMGVKTIVNKIAGMPAVVEAMQQALARRKAA*
Ga0134069_131868613300017654Grasslands SoilAPQVIQRLLEPGRRLDAEVMVVTGGMPDQAAVALRQMGVKVILNKAAGMPAVVEAMGEALRRRKAA
Ga0134112_1020793723300017656Grasslands SoilLEIGRVQPSVIVLDYSLPDLNAPQVIERLLAPGRLDAEVIVVTGGMPDAAAAVLRWFGVKVILNKAEGMPAVVEALGTALKRQRGKAA
Ga0134074_105122523300017657Grasslands SoilPSLIVLDYSLPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMPAVVEAMREALRRRKAA
Ga0134083_1014547023300017659Grasslands SoilPQVLQRLLEPGRKLDAEIIVVTGGMPQAATAQLKDMGVKVIVNKSEGMPAVVEAMRQALKRRKVA
Ga0066655_1022459233300018431Grasslands SoilDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVNKTDGMPAVVEAMRQALKRRKAA
Ga0066667_1006559133300018433Grasslands SoilLQRLLEPGRKLDAEIIVVTGGMPQAATAQLRDMGVKVIVNKAEGMPAVVEAIRQAVRRRKVA
Ga0066667_1028717513300018433Grasslands SoilRLLEPGRKLDSEIIVVTGGIPEAASTQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKVA
Ga0066667_1117473813300018433Grasslands SoilSLIVLDYSLPDLNASQVIQRLLEPGRRLDAEVMVVTGGMPVEAAVELRQMGVKVILNKSEGMPAVVDAMREALRRRKAA
Ga0066669_1017754323300018482Grasslands SoilGRVRPALIVLDYTLPDLNATQVLQRLLEPGRKLDAEIIVVTGGIPEAASAQLRDMGVKVIVTKAEGMPAVVEAMRQALQKRKVA
Ga0184641_147194323300019254Groundwater SedimentLEIGRVQPALIVLDYSLPDLNAPEVIRRLLEPGRRLDAEVMVVTGGMPEAAGQELRRMGVKVILDKADGMPAVVDAMRQALQRRKAA
Ga0248483_17588113300022691SoilVIERLLEPGRQLDAEIIVVTGGMPDEATEALRRMGVKVIVNKADGMPAVVDAMRQALRRREAA
Ga0209641_1010315733300025322SoilLPDLNAAQVIQRLLAPGRRLDAEVIVVTGGMPEEATARLLEMGVQVIVNKAEGMTAVVEAMRQALRRRKAA
Ga0207663_1000189793300025916Corn, Switchgrass And Miscanthus RhizosphereGRVQPSLIVLDYSLPDLNAPQVIRRLLEPGRQLDAEVLVVTGGMPEAAGKELREMGVKVILNKVEGMPAVVEAMRQALKRRKAA
Ga0209235_104215733300026296Grasslands SoilQPSVIVLDFTLPDLNATQVIERLLEPGRRLDAEVIVVTGGMPEAAAAQLRQMGVKVIVNKADGMPGVVEAMRQALKRRKAA
Ga0209235_119790813300026296Grasslands SoilEPGRKLDSEIIVVTGGIPEAASSQLKDMGVKVIVSKAEGMPAVVDAMRKALLRRKVA
Ga0209236_102085333300026298Grasslands SoilDLNATQVIERLLEPGRRLDAEVIVVTGGMPEAASAQLRGMGVKVIVNKADGMPALVDAMRQALKRRKAA
Ga0209027_104675613300026300Grasslands SoilQPSLIVLDYSLPDLNAPQVIRRLLEPGRRLDAEVIVVTGGMPEQAAVELRQMGVKVILNKSAGMPAAVEAMREALRRRKAA
Ga0209468_106399133300026306SoilPDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVNKTDGMPAVVEAMRQALKRRKAA
Ga0209265_100978633300026308SoilEVIRRLLDPGRRLDAEVIVVTGGMPERAAVELREMGVKTIVNKVEGMQAVVEAMQQALKRRKVA
Ga0209761_108619823300026313Grasslands SoilPGRKLDAEIIVVTGGMPQAATAQLKDMGVKVIVNKSEGMPAVVDAMRQALQRRKVA
Ga0209686_114281513300026315SoilTLPDLNATQVLQRLLEPGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQKRKVA
Ga0209155_106715023300026316SoilQPSLIVLDYSLPDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMPAVVDAMREALRRRKAA
Ga0209801_102276513300026326SoilNAPEVIRRLLEPGRQLDVEVIVVTGGMPEAAAVELRQMGVRTIVNKVEGMQAVVEAMQQALKRRKVA
Ga0209266_120596723300026327SoilDLNAPQVIQRLLEPGRRLDAEVMVVTGGMPAEAAVELRQMGVKVILNKSEGMAAVVEAMREALRRRKAA
Ga0209266_121437713300026327SoilGRVQPALIVLDYALPDLNAAQVIQRLLEPGRRSDAEVVVITGGMPDQAAQRLREVGVRVILNKGEGMAAVVEAIRQALRRRKAA
Ga0209802_104369143300026328SoilEPGRRLDAEVIVVTGGMPEAAAAQLRGMGVKVIVNKADGMPALVDAMRQALKRRKAA
Ga0209375_102368313300026329SoilQRLLEPGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQKRKV
Ga0209375_102400513300026329SoilQRLLEPGRKLDSEIIVVTGGIPEAASAQLKDMGVKVIVSKAEGMPAVVEAMRQALQRRKV
Ga0209803_114939333300026332SoilGLLEIGRAQPSVIVLDYTLPDLNATQVIQRLLEPGRRLDAEVIVVTGGMPEAASAQLRGMGVKVIVNKADGMPALVDAMRQALKRRKAA
Ga0209690_129040113300026524SoilAPEVIRRLLDPGRQLNVEVIVVTGGMPEKAAVELREMGVKTIVNKVDGMQAVVEAMQQALKRRKVA
Ga0209378_106936633300026528SoilVIVLDFTLPDLNATQVIERLLEPGRRLDAEVIVVTGGMPEAASAQLRGMGVKVIVNKADGMPALVDAMRQALKRRKAA
Ga0209058_121431713300026536SoilPGRRLDAEVMVVTGGMPAEAAVQLREMGVKVILNKSEGMPAVVEAMREALRRRKAA
Ga0209157_108075213300026537SoilQRLLEPGRRLDAEVIVVTGGMPEAASAQLRQMGVQTIVSKTDGMPGVVEAMRQALKRRKA
Ga0209376_133544413300026540SoilGRLGAEVIVVTGGMPDAAAAVLRRFGVKVILNKAEGMPAVVEALGTALKRQRGKAA
Ga0209156_1002755233300026547SoilPEVIRRLLEPGRQLDVEVIVVTGGMPETAAVELREMGVKTIVNKVDGMQAVVEAMQQALKRRKVA
Ga0208995_102674913300027388Forest SoilLEIGRVQPSLIVLDYSLPDLNAPEVIRRLLEPGRRLDAEVIVVTGGMPDQASVELRQMGVKVILNKAAGMPAVVDAMREALRRRKAA
Ga0209689_108819213300027748SoilLLEPGRNLDAEIIVVTGGIPETATKQLKDMGVKVIVNKSEGMPAVMEAVQEALRRRKVA
Ga0209177_1002110113300027775Agricultural SoilDLNAPQVIRRLLEPGRQLDAEVLVVTGGMPDAAGKELREMGVKVILNKVEGMPAVVEAMRQALKRRKAA
Ga0209180_1006201613300027846Vadose Zone SoilQVIQRLLEPGRKLDAEVIVVTGGMPEQATGELRRMGVKVILNKSEGMPAVVDAMREALRRREAA
Ga0209283_1046136423300027875Vadose Zone SoilVLDFSLPDLNAPQVIQRLLEPGRKLDAEVIVVTGGMPAEAAVQLRQMGVKVILNKSEGMPAVVDAMREALRRRKAA
Ga0307308_1003099333300028884SoilEPGRQLDAEVIVVTGGMPEAAGQELRGMGVKVILDKTDGMPAVVEAMRQALQRRKAA
Ga0307480_101576813300031677Hardwood Forest SoilLNATEVIRRLLEPGRQLNAEVLVVTGGMPEAAGEELREMGVKVILNKVEGMPAVVEAMRQALKRRKAA
Ga0307473_1067140623300031820Hardwood Forest SoilLLEIGRVQPSLIVLDYSLPDLNAPQVIQRLLEPGRKLDADVIVVTGGMPDEAAVELRRMGVKVILNKSEGMPAVVDAMREALRRRKAA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.