NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F027477

Metagenome / Metatranscriptome Family F027477

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F027477
Family Type Metagenome / Metatranscriptome
Number of Sequences 194
Average Sequence Length 47 residues
Representative Sequence WAPLADLKVGPDAAPIPDDIATRLREIERTAAWATPKLELATP
Number of Associated Samples 155
Number of Associated Scaffolds 194

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 99.48 %
% of genes from short scaffolds (< 2000 bps) 91.75 %
Associated GOLD sequencing projects 142
AlphaFold2 3D model prediction Yes
3D model pTM-score0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.485 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil
(19.072 % of family members)
Environment Ontology (ENVO) Unclassified
(45.876 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(55.155 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.
1JGI25382J43887_101367663
2JGI25386J43895_100999391
3JGI25617J43924_102152162
4JGI25617J43924_102565222
5Ga0066673_105007842
6Ga0066685_100684991
7Ga0066678_100428581
8Ga0070703_102464681
9Ga0070709_104737841
10Ga0070705_1001067981
11Ga0066682_101279921
12Ga0066681_101265291
13Ga0066681_104711292
14Ga0070706_1011913402
15Ga0066697_100867421
16Ga0070704_1000158646
17Ga0066661_108520971
18Ga0066707_100447934
19Ga0066707_102772731
20Ga0066704_108580892
21Ga0066698_110308662
22Ga0066670_103319141
23Ga0066705_105724112
24Ga0066691_105176381
25Ga0066706_101274623
26Ga0066706_108206411
27Ga0075299_10420132
28Ga0066651_104371182
29Ga0066651_104892261
30Ga0066656_100641554
31Ga0079222_103564041
32Ga0066659_100288035
33Ga0066659_116742431
34Ga0079220_100976383
35Ga0075428_1013318682
36Ga0075421_1005415043
37Ga0075430_1008466552
38Ga0075431_1007959692
39Ga0075426_108487982
40Ga0075436_1002916021
41Ga0079218_103703853
42Ga0099793_106724172
43Ga0099794_100687563
44Ga0099829_101622563
45Ga0099830_100597374
46Ga0075418_106844992
47Ga0066709_1007200541
48Ga0114129_116938971
49Ga0127456_10049242
50Ga0127483_10948701
51Ga0134070_100200751
52Ga0134088_100694644
53Ga0134088_107004101
54Ga0134084_100026686
55Ga0134084_100181641
56Ga0134084_101293462
57Ga0134084_104099142
58Ga0134086_100369761
59Ga0134086_101775072
60Ga0134086_102035371
61Ga0134086_102857461
62Ga0134064_100460793
63Ga0134065_101895111
64Ga0134071_101058593
65Ga0134062_101941342
66Ga0126376_100383303
67Ga0126372_112640222
68Ga0134127_114440471
69Ga0134127_117282691
70Ga0134122_122369302
71Ga0134123_122515742
72Ga0137391_102901913
73Ga0137455_10747571
74Ga0137364_109555412
75Ga0137382_101632741
76Ga0137381_103293533
77Ga0137376_106255422
78Ga0137370_103625681
79Ga0137387_109218122
80Ga0137386_106028281
81Ga0137367_101226583
82Ga0137367_110335982
83Ga0137366_101253241
84Ga0137366_110652501
85Ga0137371_103170563
86Ga0137371_111716212
87Ga0137361_101066911
88Ga0134043_10342531
89Ga0134051_11808141
90Ga0134048_11940222
91Ga0134045_13895802
92Ga0137398_101164403
93Ga0137396_104949651
94Ga0137359_117787261
95Ga0137413_111860262
96Ga0137413_114947161
97Ga0137419_118674781
98Ga0137404_118916271
99Ga0153915_103633791
100Ga0137410_100478884
101Ga0137410_110859181
102Ga0134077_103743321
103Ga0134076_100470803
104Ga0134076_104147631
105Ga0157378_111019932
106Ga0134075_103315682
107Ga0134079_101221432
108Ga0137405_13454601
109Ga0137403_112763331
110Ga0134072_104398732
111Ga0134089_102317081
112Ga0134089_102371392
113Ga0134085_100206784
114Ga0134085_101967022
115Ga0134085_102006281
116Ga0134085_102304921
117Ga0134069_10280253
118Ga0134069_10711992
119Ga0134069_11657862
120Ga0134112_105204481
121Ga0187776_101301171
122Ga0187776_111534242
123Ga0187766_108390331
124Ga0184637_102213641
125Ga0184633_103891822
126Ga0066655_110253582
127Ga0066667_105872962
128Ga0066667_123537121
129Ga0066669_101498763
130Ga0193723_11336771
131Ga0210379_100722291
132Ga0179596_106846201
133Ga0222625_13605871
134Ga0222623_100379303
135Ga0137417_14572403
136Ga0209519_107943571
137Ga0207653_102183141
138Ga0207699_104368511
139Ga0207684_109670161
140Ga0208285_10237202
141Ga0209438_10027837
142Ga0209235_10597641
143Ga0209236_10914083
144Ga0209027_10669033
145Ga0209238_10509061
146Ga0209238_12103212
147Ga0209469_10379321
148Ga0209268_10754461
149Ga0209131_13226161
150Ga0209472_12266792
151Ga0209472_12859311
152Ga0209470_10144551
153Ga0209267_12907122
154Ga0209808_11185663
155Ga0209806_11540031
156Ga0209058_11848541
157Ga0209157_10847693
158Ga0209157_11919691
159Ga0209376_12567312
160Ga0209161_105403571
161Ga0209577_103651921
162Ga0208474_1001361
163Ga0209886_10695612
164Ga0209845_10769121
165Ga0209177_105066881
166Ga0209074_103699291
167Ga0233416_102678282
168Ga0209180_100878141
169Ga0209180_106746511
170Ga0209701_100844061
171Ga0137415_101914631
172Ga0307299_101992082
173Ga0307281_101797642
174Ga0302046_104623402
175Ga0073996_112042801
176Ga0307501_102604531
177Ga0307496_100863311
178Ga0308194_100899582
179Ga0308194_101520872
180Ga0307469_101637153
181Ga0307469_102044871
182Ga0307469_105555751
183Ga0307469_114397192
184Ga0307469_120135922
185Ga0307469_121602531
186Ga0307468_1012352601
187Ga0214473_120639722
188Ga0307471_1034794992
189Ga0307471_1039146651
190Ga0335082_110980312
191Ga0214472_108336482
192Ga0364928_0124242_455_619
193Ga0364932_0148316_3_116
194Ga0364934_0223588_1_120
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 28.17%    β-sheet: 0.00%    Coil/Unstructured: 71.83%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540WAPLADLKVGPDAAPIPDDIATRLREIERTAAWATPKLELATPSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.47
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
99.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Sediment
Freshwater Wetlands
Soil
Groundwater Sediment
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Soil
Rice Paddy Soil
Tropical Peatland
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Groundwater Sand
Sediment
Populus Rhizosphere
Miscanthus Rhizosphere
4.1%18.6%19.1%18.0%8.2%4.6%4.1%4.1%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI25382J43887_1013676633300002908Grasslands SoilKVGAGAAPIPDDIASRLRELERTAGWATPKASLAESPGP*
JGI25386J43895_1009993913300002912Grasslands SoilGVIIPSADWAPLADLKVGPEAAPIPDDIAARLREVERTAGWATPKLDLATPSPQAPNP*
JGI25617J43924_1021521623300002914Grasslands SoilGPDAAAIPDDIAAGLRAIERSAGWAAAKLELVREPAAPDR*
JGI25617J43924_1025652223300002914Grasslands SoilFRELKRAYDARSILNPDVILPAVDWAPLADLKVGPDAASIPDDIATQLRAIERSAGWAVPKLELAREPAAPDS*
Ga0066673_1050078423300005175SoilLNPGVILPAAGWEGAPLADLKVGADAAGIPDDIARRLREIERTAAWATPKTSLADT*
Ga0066685_1006849913300005180SoilKRAYDPLSIFNPGVIIPANDWSPVAALKMGAEAAVIPDDIATRLREVERSAAWATPKLELTR*
Ga0066678_1004285813300005181SoilNPGVIIPSADWAPLVDLKVGPEAAPIPDDIARRLRELEQTAGWATPKLDLTTPSS*
Ga0070703_1024646813300005406Corn, Switchgrass And Miscanthus RhizosphereAYDAAAIFNPGVIIPSADWAPLADLKVGPTAAPIPDDIAARLRDVERTAGWAIPKLDLAVPTP*
Ga0070709_1047378413300005434Corn, Switchgrass And Miscanthus RhizosphereSADWAPLADLKVGPTAAPIPDDIAARLRDVERTAGWAISKLDLVNP*
Ga0070705_10010679813300005440Corn, Switchgrass And Miscanthus RhizosphereAPLAALKVGDGAAAIPDDIASRLRDVERNAAWATPKLELTHPTPST*
Ga0066682_1012799213300005450SoilGVIIPSADWAPLAELKVGPDRAPMPDDIATRLREIERTAGWATPKLELARPSGHPTPDT*
Ga0066681_1012652913300005451SoilAYDPLSIFNPGVIIPANDWSPVAALKVGAEAAAIPDDIATRLREVERSAAWATPKLELTR
Ga0066681_1047112923300005451SoilSPLDALKVGDGAAAIPDDIATRLRDVERSAAWATPKPELARQTPET*
Ga0070706_10119134023300005467Corn, Switchgrass And Miscanthus RhizosphereGVIIPATDWSPVGALKVGDQAAAIPDDIATRLREVERSAAWATPKLELAH*
Ga0066697_1008674213300005540SoilNPGVIIPTADWAPLADLKVGPAAAPIPDDIAARLREIERTAGWATPKLELARFPEPRAPSS*
Ga0070704_10001586463300005549Corn, Switchgrass And Miscanthus RhizosphereWSPVAALKVGVDAAAIPDDIATRLREVERSAAWDTPKLDLTR*
Ga0066661_1085209713300005554SoilAALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAH*
Ga0066707_1004479343300005556SoilVIIPTADWSPLAALKVGDETAIIPEDIATRLREVERSAAWATPKLELAL*
Ga0066707_1027727313300005556SoilLKVGPDAAPIPDDIAARLREIERTAGWSVPKLDLAREPLTPGT*
Ga0066704_1085808923300005557SoilPGVILPAADWAPLAELKVGPDAAPIPDDIAARLRELERAAAWGTPKLELARAAPSP*
Ga0066698_1103086623300005558SoilGVILPAADWTPLADLKVGPAAAPIPDDIAARLRDLERQGGWATPKPELAR*
Ga0066670_1033191413300005560SoilADWAPFAGLKVGADAAPIPDDIAARLRDIERTAGWATPKLDLALAP*
Ga0066705_1057241123300005569SoilGADAAGIPDDIARRLREIERTAAWATPKTSLADT*
Ga0066691_1051763813300005586SoilDWSPLDALKVGDGAAAIPDDIATRLRDVERSAAWATPKPELARQTPET*
Ga0066706_1012746233300005598SoilAYDPLSIFNPGVIIPANDWSPVAALKMGAEAAVIPDDIATRLREVERSAAWATPKLELTR
Ga0066706_1082064113300005598SoilKVGPDAAPIPGDIAARLREIERTAGWATPKLELARSSSP*
Ga0075299_104201323300005883Rice Paddy SoilFADVKRAYDPLSIFNPGVIIPASDWSPIAALKVGDDAAAIPEDIATRLREVERNAAWATTKLQLAR*
Ga0066651_1043711823300006031SoilWSPVAALKVGADAAAIPDDIATRLREVERRAAWDTPKLELTR*
Ga0066651_1048922613300006031SoilGVILPAADWAPLADLKVGPDAAPIPDDIAARLREIERTAGWATPKLELARSSSP*
Ga0066656_1006415543300006034SoilNPGVIIPATDWSPVAALKVGAGAAVIPDDIATRLRDVERNAAWATPKLELTR*
Ga0079222_1035640413300006755Agricultural SoilGVIIPATDWSPVAALKVGAGAAAIPDDIATRLRDVERSAAWATPKAELAQ*
Ga0066659_1002880353300006797SoilGPDRAPMPDDIATRLREIERTAGWATPKLELARPSGHPTPDT*
Ga0066659_1167424313300006797SoilIPSADWAPLADLKVGPTAAPIPDDIAARLREIERTAAWATPKLELARSSSPEPRAPNP*
Ga0079220_1009763833300006806Agricultural SoilDWTALADLKVGPAAAQLPDDIAVRLRDVERTAGWAIPKSELARPRHPTPDT*
Ga0075428_10133186823300006844Populus RhizosphereADWAPLADLKVGPEAAPIPDDIAARLREIERRAAWATPKLDLAR*
Ga0075421_10054150433300006845Populus RhizosphereGVIIPATGWSPLAALKIGDEAAAIPDDIATRLRDVERNAAWATSKLQLAR*
Ga0075430_10084665523300006846Populus RhizosphereKVGPEAAPIPDDIAGRLREVERAGAWSTPKLDLAR*
Ga0075431_10079596923300006847Populus RhizosphereATDWLPLAALKVGAGAAAIPDDIATRLREVERSAAWGTPKLELTRQTPNP*
Ga0075426_1084879823300006903Populus RhizosphereLADLKVGPAAAQLPDDIAVRLRDVERTAGWATPKSELARPRHPTPDT*
Ga0075436_10029160213300006914Populus RhizosphereHPQPGCYNPLGQWAPLADLKVGPDAAPIPDDIAARLREVERAAGWAVPKVELARESPTTDP*
Ga0079218_1037038533300007004Agricultural SoilLTIFNPGVIIPAPGWSPLAALKVGDEAAVIPDDIATRLRDVERSAAWGIPKLELTQPTPVT*
Ga0099793_1067241723300007258Vadose Zone SoilPLADLKVGPEAAPIPADIAARLRDVERNAGWATPKLDLATRAPKP*
Ga0099794_1006875633300007265Vadose Zone SoilPAADWAPLTELKVGPDAAAIPDDIAARLRDTERNAAWGVLKLDLARADTPNPAPDT*
Ga0099829_1016225633300009038Vadose Zone SoilGVILPAADWAPLAELKVGPDVAPIPDDIAARLRELERTAAWGTPKLELARATP*
Ga0099830_1005973743300009088Vadose Zone SoilWAPLADLKVGPDAAPIPDDIATRLREIERTAAWATPKLELATP*
Ga0075418_1068449923300009100Populus RhizosphereIPATGWSPLAALKIGAEAAAIPDDIATRLRDVERNAAWAIPKLELARPSP*
Ga0066709_10072005413300009137Grasslands SoilLKVGAGAAAIPEDIATRLREVERSAAWATPKLELAH*
Ga0114129_1169389713300009147Populus RhizosphereADWAPLASLKVGEDAAAIPDDIAARLRETERNAAWATPKLELAR*
Ga0127456_100492423300010140Grasslands SoilVGADAAGIPDDIARRLREIERTAAWATPKTSLADT*
Ga0127483_109487013300010142Grasslands SoilVGAGAAAIPDDIAMRLREVERSAAWATPKLELTRQTP*
Ga0134070_1002007513300010301Grasslands SoilALKVGAEAAAIPDDIATRLREVERSAAWATPKLELTR*
Ga0134088_1006946443300010304Grasslands SoilDWSPVAALKMGAEAAVIPDDIATRLREVERSAAWATPKLELTR*
Ga0134088_1070041013300010304Grasslands SoilNVKRAYDPLTIFNPGVIIPATDWSPLDALKVGDGAAAIPDDIATRLRDVERSAAWATPKPELARQTPET*
Ga0134084_1000266863300010322Grasslands SoilVILPAADWAPLADLKVGPDAAPIPDDIAARLREIERTAGWATPKLELARSSSPESQAPSP
Ga0134084_1001816413300010322Grasslands SoilSIFNPGVIIPSADWAPLADLKVGPAAAAIPDDIAARLREVERTAGWAIPKLDLVKP*
Ga0134084_1012934623300010322Grasslands SoilSPVAALKVGADAAAIPDDIATRLRELERSAAWDTAKLELTR*
Ga0134084_1040991423300010322Grasslands SoilLKVGADAAPIPDDIGDRLRDIERTAGWATPKLDLVRRSLTPDT*
Ga0134086_1003697613300010323Grasslands SoilADWAPLADVKVGAGAAPIPDDIAGRLRELERTAAWSTPKTTLAGT*
Ga0134086_1017750723300010323Grasslands SoilVIIPASDWSPVTALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAY*
Ga0134086_1020353713300010323Grasslands SoilVAALKVGAGAAVIPDDIATRLRDVERNAAWATPKLELTR*
Ga0134086_1028574613300010323Grasslands SoilLKVGPEAAPIPDDIAARLREVERTAGWATPKLDLATPGP*
Ga0134064_1004607933300010325Grasslands SoilDWAPLADLKVGPAAAPIPDDIAARLREIERTAGWATPKLELDRSPEPRAPSP*
Ga0134065_1018951113300010326Grasslands SoilPVAALKMGAEAAVIPDDIATRLREVERSAAWATPKLELTR*
Ga0134071_1010585933300010336Grasslands SoilLKVGAGAADIPEDIATRLREVERSAAWATPKLELAH*
Ga0134062_1019413423300010337Grasslands SoilVIIPATDWSPLATLKVGADAAAIPDDIAGKLRDMERSAAWATPKLELIR*
Ga0126376_1003833033300010359Tropical Forest SoilVGDGAAPIPDDIAQRLREIERNAGWAIPKTDVSRTP*
Ga0126372_1126402223300010360Tropical Forest SoilWSPLGDLKVGDGAALIPDDIAHRLRDIERNAGWATPKSELARLP*
Ga0134127_1144404713300010399Terrestrial SoilPLAALKVGDGAAAIPDDIASRLRDVERNAAWATPKLELTHPTPST*
Ga0134127_1172826913300010399Terrestrial SoilIPATDWSPLAALKVGEDAAAIPDDIATQLRDVERSAAWATPKRQLAR*
Ga0134122_1223693023300010400Terrestrial SoilGVIIPAPDWNPLVDLKVGTDAAPIPDDIAARLRDVERSAGWGISKRELTRRL*
Ga0134123_1225157423300010403Terrestrial SoilEWAPLADLKVGPEATPIPDDIAARLREMERTAGWATPKLDLATPSP*
Ga0137391_1029019133300011270Vadose Zone SoilLKVGPEAASIPDDIAARLREVERTAGWATPKLDLATPNP*
Ga0137455_107475713300011429SoilKVGEDAAAIPDDIAARLRDMERNAGWATPKPELAR*
Ga0137364_1095554123300012198Vadose Zone SoilDLKVGPDAAPIPDDIAARLREIEQTAGWATPKLELARSSSP*
Ga0137382_1016327413300012200Vadose Zone SoilIPASDWSPVTALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAH*
Ga0137381_1032935333300012207Vadose Zone SoilIIPSAEWAPLADLKVGPGAAPLPNDIAAGLREIERTAGWATPKLDLARASDT*
Ga0137376_1062554223300012208Vadose Zone SoilTALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAH*
Ga0137370_1036256813300012285Vadose Zone SoilILPSPEWAPLADLKVGADAAAIPGDIAARLREVERAAAWAVPKVELARESPTPDP*
Ga0137387_1092181223300012349Vadose Zone SoilLADLKVGAGAAPIPDDIAARLRELERTATWSTPKTTLAGT*
Ga0137386_1060282813300012351Vadose Zone SoilPPPDWTPLADLKVGREVMQIPEDIAHRLREVERTAGWATPKLELASPNP*
Ga0137367_1012265833300012353Vadose Zone SoilGVIIPATDWSPVAALKVGEGAAVIPEDIATRLREVERSAAWATPKLELAH*
Ga0137367_1103359823300012353Vadose Zone SoilAALKIGAGAAVIPEDIATRLREVERSAAWATPKRELAR*
Ga0137366_1012532413300012354Vadose Zone SoilPLAALKVGEGAAPIPDDIAARLREVERTASWATPKLDLVRSLTPDP*
Ga0137366_1106525013300012354Vadose Zone SoilLPAADWAPLADVKVGAGAAPIPDDIAGRLRELERTAAWSTPKTTLAGT*
Ga0137371_1031705633300012356Vadose Zone SoilWAPLADLKVGPEAAPIPDDIAARLREVERTAGWATPKLDLATPSPQAPNP*
Ga0137371_1117162123300012356Vadose Zone SoilADWAPLADLKVGPEAAPIPDDIAARLREVERTAGWATPKLDLATSSPQAPNP*
Ga0137361_1010669113300012362Vadose Zone SoilLKVGPEAAPIPHDIAARLREVERTAGWAIPKLDLATPSP*
Ga0134043_103425313300012392Grasslands SoilFNPGVIIPATDWSPVAALKVGAGAAAIPDDIAMRLREVERSAAWATPKLELTRQTP*
Ga0134051_118081413300012398Grasslands SoilSPVTALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAH*
Ga0134048_119402223300012400Grasslands SoilLKVGAGAAAIPEDIATRLREVERSAAWATPKPELAH*
Ga0134045_138958023300012409Grasslands SoilLKVGAEAAVIPDDIATRLREVERSAAWATPKLELTR*
Ga0137398_1011644033300012683Vadose Zone SoilILPAADWQPLAELKVGAAAASIPDDIALRLRDVERSGGWAVPKPELAR*
Ga0137396_1049496513300012918Vadose Zone SoilWSPLATLKVGDGAPAIPEDIATRLREVERSAAWATPKFELAH*
Ga0137359_1177872613300012923Vadose Zone SoilGVIIPSADWAPLADLKVGPAAAPIADDIAARLRDVERTAGWATPKLDLATPSP*
Ga0137413_1118602623300012924Vadose Zone SoilDLKVGPEAMPIPDDIARRLRELERNAGWATPKLELATPSP*
Ga0137413_1149471613300012924Vadose Zone SoilGVIIPASDWSPVTALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAH*
Ga0137419_1186747813300012925Vadose Zone SoilAADWAPLAALKVGEDAAVIPDDIAARLRELERAAGWATPKLELAR*
Ga0137404_1189162713300012929Vadose Zone SoilPLADLKVGPEATPIPDDIARRLRELERTAGWATPKLDLARAADT*
Ga0153915_1036337913300012931Freshwater WetlandsKVGPDAAPIPDDIAARLRDIERTGGWATPKQDLARLP*
Ga0137410_1004788843300012944Vadose Zone SoilVAALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAH*
Ga0137410_1108591813300012944Vadose Zone SoilAADWQPLAELKVGAAAASIPDDIALRLRDVERSGGWAVPKPELAR*
Ga0134077_1037433213300012972Grasslands SoilALKVGDGAAAIPDDIATRLRDVERSAAWATPKPELARQTPET*
Ga0134076_1004708033300012976Grasslands SoilGVIIPATDWSPLDALKVGDGAAAIPDDIATRLRDVERSAAWATPKPELARQTPET*
Ga0134076_1041476313300012976Grasslands SoilLPSPEWAPLVDLKVGADVAPIPDDIAARLREIERTAGWPAPKLELARSSNPEPPAPSP*
Ga0157378_1110199323300013297Miscanthus RhizosphereVILPAADWAPLRDLKVGPDAAPIPDDIAGRLRDTERNATWSVPKLELAAPRHRTPDT*
Ga0134075_1033156823300014154Grasslands SoilSIFNPGVIIPATDWSPVAALKVGAGAADIPEDIATRLREVERSAAWATPKLELAH*
Ga0134079_1012214323300014166Grasslands SoilFNPGVIIPAADWSPVAALKVGAGAAAIPEDIATRLREVERSAAWATPKLELAR*
Ga0137405_134546013300015053Vadose Zone SoilNPLADWAPLADLRSGPRRRRSHDDIAARLREVERTAGWATPKLDFATPSP*
Ga0137403_1127633313300015264Vadose Zone SoilVGPEATPIPDDIAARLRQVEQSANWAIPKLELATRAANP*
Ga0134072_1043987323300015357Grasslands SoilDWSPVAALKVGAGAAAIPEDIAMRLREVERSAAWAIPKLELAH*
Ga0134089_1023170813300015358Grasslands SoilPPPDWTPLADLKVGREVMQIPENIAHRLREVERTAGWATPKLELASPNP*
Ga0134089_1023713923300015358Grasslands SoilAPLADLKVGPDAAPIPDDIAARLREIERTAGWSVPKLDLAREPLTPGT*
Ga0134085_1002067843300015359Grasslands SoilVIIPATDWSPVAALKVGAGAAVIPEDIATRLREVERSAAWATPKLELAH*
Ga0134085_1019670223300015359Grasslands SoilPGVIIPAPDWTPLADLKVGPATAAIPDDIAARLREVERTAGWATPKYELARPDT*
Ga0134085_1020062813300015359Grasslands SoilPLADLKVGPEAAPIPDDIAARLREVERTAGWATPKLDLATPGPQAPNP*
Ga0134085_1023049213300015359Grasslands SoilDLQVGPDAAPSPSATAARLRDVERTAAWASPKVELARESPTPDP*
Ga0134069_102802533300017654Grasslands SoilLPAADWAPLADVKVGAGAAPIPDDIAGRLRELERTAAWSTPKTTLAGT
Ga0134069_107119923300017654Grasslands SoilWAPLADLKVGPEAAPIPDDIAARLREVERTAGWATPKLDLATSSPQAPNP
Ga0134069_116578623300017654Grasslands SoilDWAPLAELKVGPDRVPMPDDIATRLREIERTAGWATPKLELARPSGHPTPDT
Ga0134112_1052044813300017656Grasslands SoilVAALKVGDRAADIPEDIATRLRDVERNAAWATPKLALTQ
Ga0187776_1013011713300017966Tropical PeatlandADLKVGPDAARIPDDIAARLRDVERNAAWSVPKLELATLTPET
Ga0187776_1115342423300017966Tropical PeatlandLTHLKVGDGAVPIPDDIALRLRETERTAGWATPKTDLARAP
Ga0187766_1083903313300018058Tropical PeatlandAADWSPLADLKVGPDAAPIPADIAARLRDTERNAAWGAPKLDLATLTPHT
Ga0184637_1022136413300018063Groundwater SedimentLKVGERAAAIPDDIAARLREVERSAAWATPKVALATPEPLAPSP
Ga0184633_1038918223300018077Groundwater SedimentAAAADWAPLAALKVGERAAAIPDDIAARLREVERSAAWATPKVALATPEPLAPSP
Ga0066655_1102535823300018431Grasslands SoilDLKVGADAAGIPDDIARRLREIERTAAWATPKTSLADT
Ga0066667_1058729623300018433Grasslands SoilLADLKVGADAAPIPGDIAARLREVERAATWAVPKVELARESPTPDP
Ga0066667_1235371213300018433Grasslands SoilAGSMFNPGVIIPSADWAPLADLKVGPGAAPIPDDIAARLREVERTAGWAIPKLDLVKP
Ga0066669_1014987633300018482Grasslands SoilGPEATPIPDDIAARLREVERTAGWATPKLDLTTPSS
Ga0193723_113367713300019879SoilVILPAADWGPLHDLKVGPDAAPIPDDIAARLRDTERNASWNVPKLDLTQPVVPDT
Ga0210379_1007222913300021081Groundwater SedimentADLKVGPSAAPIPEDIAARLRAVEREGGWATPKTELAR
Ga0179596_1068462013300021086Vadose Zone SoilLSIFNPGVIIPATDWSPVAALKIGAGAAVIPEDIATRLREVERSAAWATPKRELAR
Ga0222625_136058713300022195Groundwater SedimentPVAALKIGAGAAVIPEDIATRLREVERSAAWATPKLELAH
Ga0222623_1003793033300022694Groundwater SedimentLKVGEDAAAIPDDIAARLRDVERTAGWATPKLELAR
Ga0137417_145724033300024330Vadose Zone SoilVGPEAMPIPDDIARRLRELERNAGWATPKLELATPSP
Ga0209519_1079435713300025318SoilVIIPAADWAPLAALKVGEDAAPIPDDIAARLRELERAAGWATPKPELAR
Ga0207653_1021831413300025885Corn, Switchgrass And Miscanthus RhizosphereAYDAAAIFNPGVIIPSADWAPLADLKVGPTAAPIPDDIAARLRDVERTAGWAIPKLDLAVPTP
Ga0207699_1043685113300025906Corn, Switchgrass And Miscanthus RhizosphereDLKVGPTAAPIPDDIAARLRDVERTAGWAISKLDLVNP
Ga0207684_1096701613300025910Corn, Switchgrass And Miscanthus RhizosphereATDWSPVGALKVGDQAAAIPDDIATRLREVERSAAWATPKLELAH
Ga0208285_102372023300026005Rice Paddy SoilADLKVGDGAAAIPDDIAGRLRGLERNAAWSTPKLDLTRQP
Ga0209438_100278373300026285Grasslands SoilVAADWAPLAELKVGEHAAAIPDDIAARLREMERTAGWATPKLELAR
Ga0209235_105976413300026296Grasslands SoilPSAEWAPLADLKVGPDAAPIPDDIAQRLRDIERTAGWATPKLELARSPSPEPLAPSP
Ga0209236_109140833300026298Grasslands SoilAPLADLKVGPEAAPIPHDIAARLREVERTAGWAIPKLDLATPSP
Ga0209027_106690333300026300Grasslands SoilDAGSMFNPGVIIPSADWAPLADLKVGPGAAPIPDDIAARLREVERTAGWAIPKLDLVKP
Ga0209238_105090613300026301Grasslands SoilILPSPEWAPLADLKVGPDAAPIPGDIATRLREIEQTAGWATPKLELARSSSP
Ga0209238_121032123300026301Grasslands SoilLADLKVGPGAAPIPDDIAARLREIERTAGWAISKLDLVKP
Ga0209469_103793213300026307SoilATDWAPLANLKVGPDATPIPDDIAHRLRALEQGAGWATPKTELAN
Ga0209268_107544613300026314SoilIIPSADWAPLAELKVGPDRAPMPDDIATRLREIERTAGWATPKLELARPSGHPTPDT
Ga0209131_132261613300026320Grasslands SoilPAADWQPLADLKVGAAAAPIPDDIALRLRDVERSGGWAVPKPELAR
Ga0209472_122667923300026323SoilVGPATAAIPDDIAARLREVERTAGWATPKYELARPDT
Ga0209472_128593113300026323SoilPVAALKVGDRAADIPEDIATRLRDVERNAAWATPKLALTQ
Ga0209470_101445513300026324SoilAAPIPGDIAARLRDVERTAAWAVPKVELARESPTPDP
Ga0209267_129071223300026331SoilVIIPATDWSPVAALKVGDDAAAIPEDIATRLREVERSAAWATPKLELAH
Ga0209808_111856633300026523SoilPGVIIPATDWSPVAALKVGDDAVAIPEDIATRLREVERSAAWATPKLELAH
Ga0209806_115400313300026529SoilDWAPLADLKVGPDAAPIPDDIAARLREIERTAGWSVPKLDLAREPLTPGT
Ga0209058_118485413300026536SoilKVGPEATPIPDDIAARLREVERTAGWATPKLDLAAASL
Ga0209157_108476933300026537SoilPDAAPIPEDVAVRLREIERTAGWATPKLDLVRNTLTPDT
Ga0209157_119196913300026537SoilPSADWAPLADLKVGPEAAPIPDDIARRLRDTERNAAWGVPKFELATPSP
Ga0209376_125673123300026540SoilPSADWAPLADLKVGPEAAPIPHDIAARLREVERTAGWAIPKLDLATPSP
Ga0209161_1054035713300026548SoilPGVIIPSAEWAPLADLKVGPDAAPIPGDIAARLRDVERTAAWAVPKVELARESPTPDP
Ga0209577_1036519213300026552SoilADLKVGPEATPIPDDIAARLREVERTAGWATPKLDLTTPSS
Ga0208474_10013613300026700SoilDAAAIFNPGVIIPSADWAPLADLKVGATAAPIPDDIAARLRDVERTAGWAIPKLDLVNP
Ga0209886_106956123300027273Groundwater SandDWAPLADLKVGPSAAPIPDDIAARLRAVEREGGWATPKTELAR
Ga0209845_107691213300027324Groundwater SandPFADLKVGPSAAPIPDDIAARLRAVEREGGWATPKTELAR
Ga0209177_1050668813300027775Agricultural SoilQPLADLKVGDAAAPIPDDIALRLRDVERSGGWAVPKPELAR
Ga0209074_1036992913300027787Agricultural SoilKVGPAAAQLPDDIAVRLRDVERTAGWATPKSELASPRHPTPDT
(restricted) Ga0233416_1026782823300027799SedimentPLAALKVGPAAAAIPDDIAARLRHLERAAAWHTPKTELAR
Ga0209180_1008781413300027846Vadose Zone SoilPGVIIPSADWAPLADLKVGPEAAPIPADIAARLRDVERNAGWATPKLDLATRAPKP
Ga0209180_1067465113300027846Vadose Zone SoilIIPSAEWAPLADLKVGPEATPIPDDIAARLREVERTAGWATPKLDLATPSP
Ga0209701_1008440613300027862Vadose Zone SoilWAPLADLKVGPDAAPIPDDIATRLREIERTAAWATPKLELATP
Ga0137415_1019146313300028536Vadose Zone SoilPAADWAPLVDLKVGAAATPIPDDIAARLRELERTASWATPKTSLADSPGT
Ga0307299_1019920823300028793SoilKVGEDAAAIPDDIAARLRDVERAAGWATPKLELAR
Ga0307281_1017976423300028803SoilLKVGEGAAPIPDDIAVRLRAVERGAEWAILKTGLAD
Ga0302046_1046234023300030620SoilTDWAPLVDLKVGDRVAPIPDDIASRLREIERAAAWATPKTELAGPPARSGG
Ga0073996_1120428013300030998SoilALKVGDEAAAIPEDIATRLREVERSAAWATPKLELAH
Ga0307501_1026045313300031152SoilPGVIIPAADWAPLAALKVGEDAAAIPDDIAARLRELERSAGWATPKLELAR
Ga0307496_1008633113300031200SoilKVGDGAAPIPEDIATRLREVEHTAGWSTPKLDLARAPTPDT
Ga0308194_1008995823300031421SoilALKVGAGAAAIPEDIATRLREVERNAAWATPKLALAH
Ga0308194_1015208723300031421SoilSPVTALKVGAGAAAIPDDIATRLREVERSAAWATPKRELAR
Ga0307469_1016371533300031720Hardwood Forest SoilDWSPLDALKVGDGAAAIPDDIATRLRDVERNAAWATPKPELARRTPET
Ga0307469_1020448713300031720Hardwood Forest SoilAALKVGEDAAAIPDDIATQLRDVERSAAWATPKRQLAR
Ga0307469_1055557513300031720Hardwood Forest SoilPAADWAPLADVKAGADAAAIPDDIARRLREIERTAAWATPKTSLADT
Ga0307469_1143971923300031720Hardwood Forest SoilPLAALKVGEDAAAIPDDIAARLRELERAAGWATPKLELAR
Ga0307469_1201359223300031720Hardwood Forest SoilAADWAPLLDLKVGPDAAPIPDDIAGRLRDTERNAAWSIPKLDLI
Ga0307469_1216025313300031720Hardwood Forest SoilSIFNPGVIIPATDWSPVAALKVGAGAADIPEDIATRLREVERSAAWATQKLELAR
Ga0307468_10123526013300031740Hardwood Forest SoilIIPATDWSPVAALKVGADAAAIPDDIATRLREMERSAAWDTPKLDLTR
Ga0214473_1206397223300031949SoilWAPLADLKVGEAAAPIPDDIAARLRAVERTAEWAVPKTGLAD
Ga0307471_10347949923300032180Hardwood Forest SoilPRCSFNPGVIIPTANWAPLADLKVGPEATPIPDDIAGRLRELERTAGWATPKLDLARAPD
Ga0307471_10391466513300032180Hardwood Forest SoilDWAPLADLKVGPTAAPIPDDIAARLRDVERTAGWAIPKLDLAVPSP
Ga0335082_1109803123300032782SoilKVGPEAAPIPDDIARRLREIERSGGWAVPKLDLARSSDP
Ga0214472_1083364823300033407SoilSPLAELKVGNGAAAIPEDIATRLREVERSAAWATPKLEMARPNP
Ga0364928_0124242_455_6193300033813SedimentVFNPGVIVPAAAWAPLADLKVGEAAAPIPDDIAARLRAVERGAEWAVPKTGLAD
Ga0364932_0148316_3_1163300034177SedimentDLKVGPSAAPIPDDIAARLRAVEREGGWATPKTELAR
Ga0364934_0223588_1_1203300034178SedimentLADLKVGPEAAPIPDDIAARLRDVERRAAWATPKLELAR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.