NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F047886


Overview

Basic Information
Family ID F047886
Family Type Metagenome / Metatranscriptome
Number of Sequences 149
Average Sequence Length 91 residues
Representative Sequence VTAPAQAAETASAAPADYDEFVFVRRDGTVFFAVGYAWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFRLPA
Number of Associated Samples 106
Number of Associated Scaffolds 149

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 12.24 %
% of genes near scaffold ends (potentially truncated) 56.38 %
% of genes from short scaffolds (< 2000 bps) 55.03 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.57

Hidden Markov Model: sequence logo (rendered with Skylign)

Most Common Taxonomy
Group Bacteria (65.772 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (49.664 % of family members)
Environment Ontology (ENVO) Unclassified (48.993 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (50.336 % of family members)
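
The percentages in this overview look consistent with whole-number counts over the family's 149 sequences. The sketch below assumes (the page does not state this) that each percentage is a member count divided by 149. It recovers the implied counts and prints the percentage each count rounds back to. The labels and the function name `implied_count` are hypothetical.

```python
FAMILY_SIZE = 149  # "Number of Sequences" from Basic Information

# Percentages reported in the overview (labels are shorthand, not page text)
reported = {
    "Bacteria (NCBI taxonomy)": 65.772,
    "Vadose Zone Soil (GOLD)": 49.664,
    "Unclassified (ENVO)": 48.993,
    "Plant rhizosphere (EMPO)": 50.336,
}

def implied_count(percent, total=FAMILY_SIZE):
    """Nearest whole number of members for a percentage.

    Assumes percent = count / total * 100, which is not confirmed by the page.
    """
    return round(percent / 100 * total)

implied = {label: implied_count(p) for label, p in reported.items()}

for label, n in implied.items():
    # Recompute the percentage from the count to compare with the reported value
    print(f"{label}: {n} of {FAMILY_SIZE} ({n / FAMILY_SIZE * 100:.3f} %)")
```

Under that assumption the four values correspond to 98, 74, 73 and 75 members respectively.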




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
The viewer lists the 149 member sequences by protein ID, in the same order as the Family Sequences table (rendered with MSAViewer).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 12.04%, β-sheet: 23.15%, Coil/Unstructured: 64.81%
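
To illustrate how a distribution like this can be derived, the sketch below computes state percentages from a per-residue three-state string (H = α-helix, E = β-strand, C = coil). The encoding and the function name `ss_distribution` are assumptions. The page does not say how its values were produced. The toy input uses 13 H, 25 E and 70 C positions, which is one combination that reproduces the reported figures.

```python
from collections import Counter

def ss_distribution(states):
    """Percentage of each state in a per-residue secondary-structure string.

    `states` is assumed to use H (helix), E (strand) and C (coil).
    """
    total = len(states)
    if total == 0:
        raise ValueError("empty secondary-structure string")
    counts = Counter(states)
    return {state: round(100 * counts.get(state, 0) / total, 2) for state in "HEC"}

# Toy prediction: 13 helix, 25 strand and 70 coil positions (108 in total)
toy = "H" * 13 + "E" * 25 + "C" * 70
distribution = ss_distribution(toy)
print(distribution)
```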
Feature Viewer: tracks for sequence, α-helices, β-strands, coil, and SS confidence score along the representative sequence (positions 1-80).

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.57
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
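
The cutoff in this note can be expressed as a simple check. This is a minimal sketch that assumes only the pTM < 0.7 rule quoted above. The function name `model_confidence` and the "higher confidence" label are hypothetical and not part of NMPFamsDB.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the note above

def model_confidence(ptm, cutoff=LOW_CONFIDENCE_PTM):
    """Label a predicted model by its pTM score (labels are illustrative)."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM is expected to lie in [0, 1]")
    return "low confidence" if ptm < cutoff else "higher confidence"

# pTM-score reported for family F047886
label = model_confidence(0.57)
print(label)
```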



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 65.8%
Unclassified: 34.2%
Powered by ApexCharts

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types (chart legend, in page order): Soil; Watersheds; Soil; Vadose Zone Soil; Tropical Forest Soil; Grasslands Soil; Peatlands Soil; Soil; Grasslands Soil; Soil; Hardwood Forest Soil; Forest Soil; Soil; Corn, Switchgrass And Miscanthus Rhizosphere
Slice labels (in page order): 3.4%, 49.7%, 4.0%, 4.7%, 9.4%, 7.4%, 8.7%, 3.4%, 6.0%
Powered by ApexCharts



Associated Samples


Geographical Distribution
Powered by OpenStreetMap




Family Sequences

Protein ID  Sample Taxon ID  Habitat  Sequence
JGI12635J15846_100319751  3300001593  Forest Soil  ETPQPEATDAEVRETDRRYRTPQPAPAPAAETASSAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYVTSQGLRRTVMQDTLDLDATRQFNEQRGLNF*
JGI12053J15887_106284142  3300001661  Forest Soil  AAETASSAASNNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVTQDTLDLDATRQFNEQRGLNFHSPA*
JGI25381J37097_10131501  3300002557  Grasslands Soil  TRRRYRAPEPATAPLAETASSAPPDNEEFVFVRRDGTVFFAVAYSWEKGTLHYITSQGLPRTITEDALDLDATRQFNEQRGLSFRLPA*
JGI25390J43892_100134273  3300002911  Grasslands Soil  GQAPAAEAENAASSDNDGFVFVRRDGTIFFAVAYSWENGALRYVTSQGLRHTVTLDALDLDATRQFNEQRGLTFRLPA*
JGI25616J43925_101228971  3300002917  Grasslands Soil  RRNRVPQPATEPKAEAADSAPTDNEEFVFVRRDGTVFFAVAYAWERGTLRYITSQGLRRTVKQDALDLDATRQFNEQRGLNFRSPA*
Ga0066679_104488331  3300005176  Soil  TAPVAETASSGPPDHEEFVFVRRDGTVFFAVAYSWEKGTLHYITSQGLPRTITEDTLDLDATRQFNEQRGLSFRLPP*
Ga0066690_103200491  3300005177  Soil  DAEVRETDRRYRTPEPVTAPAAETASSAASDNEEFVFVRRDGTLFFAVAYAWEKGTLRYITSQGLRRTITQDTLDLDATRQFNEQRGLNFHSPA*
Ga0066692_108249422  3300005555  Soil  PAAETASSAASDNEEFVFVRRDGTLFFAVAYAWEKGTLRYITSQGLRRTITQDTLDLDATRQFNEQRGLNFHSPA*
Ga0066704_100448154  3300005557  Soil  PPQVEEASAAPRQTEEFVFVRRDGTVFFAVAYAWENGVLRYVTSEGLRRSVARETLDLNATQQFNEQRGLNFRLPA*
Ga0066698_100000877  3300005558  Soil  VANPAPASTNEAASVTSSDNDEFVFVRRDGTIFFAVAYSWENGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0066702_100383914  3300005575  Soil  EVPVAETSQAEDADIEVRETSRRHRVSDAATNLAPAATNETASATSSDNDEFVFVRRDGTIFFAVAYSWENGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0066706_114257301  3300005598  Soil  ETSRRRRVSEPVTNPAPAGETAGAASTDNDGFVFVRRDGTIFFAVAYSWENGTLRYVTSQGLRHMVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0075019_104613051  3300006086  Watersheds  VDDTATAESAQPKITDSENRDRGRRYRVAEPQPAPAVEAANGEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYVTSQGLRHTVTQDALDMDATRQFNEQRGLNFRLPA*
Ga0075015_1004067832  3300006102  Watersheds  SGPEAPPPVEAANAGPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYVTSQGLRHTVTQDVLDLDATRQFNEQRGLNFRLPA*
Ga0075030_1005878371  3300006162  Watersheds  FYVPGAAQTVDDTATAESAQPKITDSENRDRGRRYRVAEPQPAPAVEAANGEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYVTSQGLRHTVTQDALDMDATRQFNEQRGLNFRLPA*
Ga0075018_104862272  3300006172  Watersheds  PEILDSEVRERGRRYRITGPAAPAAVEAASSAPVDNDEFVFVRRDGTVFFAVAYSWEKGTLRYITSRGLRYTVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0070716_1008802321  3300006173  Corn, Switchgrass And Miscanthus Rhizosphere  SAPPDNEEFVFVRRDGTVFFAVAYSWEKGTLHYITSQGLPRTITGDALDLDATRQFNEQRGLSFRLPA*
Ga0070765_1015235071  3300006176  Soil  VTDTEVREADRRNRTPQPVTAPAAETASPAASDNDEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFHSPA*
Ga0066665_107832902  3300006796  Soil  VSDAATNLAPAATNETASATSSDNDEFVFVRRDGTIFFAVAYSWENGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0063777_13870502  3300006861  Peatlands Soil  DASQPGAADTDVSDAGRRYRASQTVTEQTEEAANTGPADNDEFVFVRRDGTLFFAVAYTWEKGTLRYVTSEGLRQTITKDALDLDATRQFNEQRGFNFTSPA*
Ga0099791_105623182  3300007255  Vadose Zone Soil  MTAPAAALVPVVETASTVLGESDQFVFVRRDGMVFFAVAYAWENGTLRYVTGEGLRHTVAADTLDLDATRQFNEQRGLSFRLPA*
Ga0099793_106226562  3300007258  Vadose Zone Soil  VAETAPPDVTDAELRGRGRRYRAPEPETAPVPEAPGAAPADNEPYVFVKRDGTVFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0099794_102246552  3300007265  Vadose Zone Soil  TARPEATDAEARETTRRHRATQPVTEPAPTVEAPSAAAADNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVMQDALDLDATRQFNEQRGLNFRLPV*
Ga0066710_1000923911  3300009012  Grasslands Soil  APARAAETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDALDLAATQQFNEQRGLNFRLPA
Ga0099829_104010781  3300009038  Vadose Zone Soil  SAAPTDNEGFAFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVTQDALDLDATRQFNEQRGLNFLSPT*
Ga0099829_109246561  3300009038  Vadose Zone Soil  VTAPAQAAETASAAPADYDEFVFVRRDGTVFFAVGYAWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0099830_100070994  3300009088  Vadose Zone Soil  VSGSSVSGDEGSVAESAQPDVTDAELRGRGRRYRASEPETAPTPAAESPSGTPAEDDPFVFVRRDGTVFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRSPA*
Ga0099830_101404321  3300009088  Vadose Zone Soil  MAAPAPTPAVETANAAPPDSDEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVAQGALDLEATRQFNEQRGLIFRSPA*
Ga0099830_106468721  3300009088  Vadose Zone Soil  VAPAPAPAIETASPAPPSNDEFVFVRRDGTLFFAVAYSWENGTLRYVTSEGLRRTVKQDALDMGATQQFNEQ
Ga0099830_115088502  3300009088  Vadose Zone Soil  AETAGSALPSNDEFVFVRRDGTVFFAVAYSWEGGTLRYVTNQGLRRTVKQDALDMGATQQFNEQRGLSFRSPA*
Ga0099828_106216111  3300009089  Vadose Zone Soil  MAAPAPTLAVETANAVPPGNDEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLGHTVAQGALDLEATRQFNEQRGLIFRSPA*
Ga0099828_108454461  3300009089  Vadose Zone Soil  PEATDGEVRETSRRNRVPQPATEPKAEAADSAPTDNEEFVFVRRDGTVFFAVAYAWERGTLRYITSQGLRRTVKQDALDLDATRQFNEQRGLNFRSPA*
Ga0099828_111384191  3300009089  Vadose Zone Soil  ASEPVMAPPVETASSATPDDEQFVFVRRDGSVFFAVAYAWEKGTLRYITSQGLRRTVTEDALDLDATRQFNEQRGLNFRLPA*
Ga0099827_102209132  3300009090  Vadose Zone Soil  VTAAAETASSAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDTLDLDATRQFNEQRGLNFHSPV*
Ga0127503_102517541  3300010154  Soil  PGSAVATDDSSVVEDSQQEDIDAENREYARRHRNSQSAQVSAPVVETDSAVLQDTAEFVFVRRDGTLFFAVAYTWENGTLRYINSQGLKQTVTQDALDLGATQQFNEQRGLNFHSPA*
Ga0134070_102909581  3300010301  Grasslands Soil  ATDEGGADAQERMKVSQAAPARAAETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVSQDALDLAATQQFNEQRGLNFRLPA*
Ga0134067_100360241  3300010321  Grasslands Soil  AVDEGPAAETPRPETTDAEAAKTRRRYRAPEPATAPLAETASSAPPDNEEFVFVRRDGTVFFAVAYSWEKGTLHYITSQGLPRTITEDALDLDATRQFNEQRGLSFRLPA*
Ga0134063_106823802  3300010335  Grasslands Soil  PETTDAEAAETRRRYLAPEPATAPVAETASSGPPDHEEFVFVRRDGTVFFAVAYSWEKGTLHNITSQGLPRTITEDTLDLDATRQFNEQRGLSFRLPP*
Ga0126376_120917802  3300010359  Tropical Forest Soil  VASEPPPVATEVAAAAGPTDEYVFVRRDGTVFFAVAYSWENGALRYITSQGLRGTVTRDTLDLNATQQFNDQRGLSFHSPA*
Ga0136449_1000387725  3300010379  Peatlands Soil  MADTASAGPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYITSEGLRHSVMQDALDLGATQQFNEQRGLNFRLPA*
Ga0136449_1022048701  3300010379  Peatlands Soil  VDDSAAADTVQPDAVDADLRDYGRRYRAAQTAPAPAVETASSEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYITNQGLRHTVTQDALDLDATREFNEQRGLNFRLPA*
Ga0138553_1686681  3300011047  Peatlands Soil  MADTASAGPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYITSEGLRHSVMQDALDLGATQQFNEQR
Ga0138565_10643731  3300011078  Peatlands Soil  VPGAAANVDDSAVAETAQPEIIDSENRERGRRYRITEPETGPAVEAASAEPADNEEFVFVRRDGTVFFAMAYSWEKGTLQYITSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPV*
Ga0138564_12014481  3300011086  Peatlands Soil  MADTASAGPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYITSEGLRHSVMQDALDLGATQQFNEQRGLNFR
Ga0150983_117825042  3300011120  Forest Soil  GGFFFTIFDGGFPVAGSPVSGDEGPVAESAPPDVTDAEQRGRGRRYRAPEPETAPAAESASAAADNEPYVFVKRDGTVFFAVAYSWEKGTLRYITNEGLRRVVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0137392_100194892  3300011269  Vadose Zone Soil  VSGSSVSGDEGSVAESAPPDVTDAELRGRGRRYRASEPETAPTPAAESPSGTPAEDDPFVFVRRDGTVFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRSPA*
Ga0137392_101896843  3300011269  Vadose Zone Soil  ERRKVSQAAPARAAETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVSQDALDLAATQQFNEQRGLNFRLPA*
Ga0137392_105515402  3300011269  Vadose Zone Soil  PTIEAPSAAPSDDEEFVFVRRDGTVFFAVGYAWEKGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRWPS*
Ga0137391_114147972  3300011270  Vadose Zone Soil  QAAPARAAETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDSLDLAATQQFNEQRGLNFRLPA*
Ga0137393_101292871  3300011271  Vadose Zone Soil  DAEVRERGRRYRASEPVMAPPVEAASSATPDDEQFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFRLPV*
Ga0137393_101385771  3300011271  Vadose Zone Soil  VAAPTIEAPSAAPSDDEEFVFVRRDGTVFFAVGYAWEKGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPS*
Ga0137393_103340101  3300011271  Vadose Zone Soil  AQPEGTDADVRETRRHRAAQPLTEPTSTVETSNAAATDNEGFVFVRRDGTLFFAVGYTWENGTLRYVTDQGLRRTVTQDALDLDATRQFNEQRGLNFRLPS*
Ga0137393_106572211  3300011271  Vadose Zone Soil  DAEVRERGRRYRASEPVMAPPVEAASSATPDDEQFVFVRRDGSVFFAVAYAWEKGTLRYITSQGLRHTVTQDALDLDATRQFNEQRGLNFLSPT*
Ga0137393_109980101  3300011271  Vadose Zone Soil  VTAPAAETASSAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDALDLDAT
Ga0137393_117811592  3300011271  Vadose Zone Soil  RGRGRRYRAPEPDPAPASEAPSATPADNEPYVFVKRDGTVFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0137389_103596142  3300012096  Vadose Zone Soil  RYRASLEAAAPTPAVETTNAAPTENDEFVFVRRDGTVFFAVAYSWENGTLRYVTSEGLRRTVKQDALDMGATQQFNEQRGLNFRLPA*
Ga0137388_103796411  3300012189  Vadose Zone Soil  VAAPTIEAPSAAPSDDEEFVFVRRDGTVFFAVGYAWEKGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRWPS*
Ga0137364_112154062  3300012198  Vadose Zone Soil  RETDRRYRTPEPVAAPAAETAGSAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDTLDLDATRQFNEQRGLNFHSPA*
Ga0137383_100711263  3300012199  Vadose Zone Soil  AAESPQPEAPDAEVRETDRRNRTPQPVTAPAAETASTAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYVTSQGLRRTVTQDTLDLDATRQFNEQRGLNFRSPA*
Ga0137363_100295194  3300012202  Vadose Zone Soil  MRYRVTEAETTPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137363_105402082  3300012202  Vadose Zone Soil  VRETRRHRAAQPLTEPTSTVETSNAAATDHEGFVFVRRDGTLFFAVGYTWENGTLRYVTDQGLRRTVTQDALDLDATRQFNEQRGLNFRLPS*
Ga0137379_105328822  3300012209  Vadose Zone Soil  EGGPAVETPEPETTDAEAGETRRRYRDPEPATAPVTETASSALPDNEEFVFVRRDGTLFFAVAYSWEKGTLHYITSQGLPRTITEDTLDLDATRQFNEQRGLSFRLPA*
Ga0137377_117229532  3300012211  Vadose Zone Soil  APERRKVSQAAPARTAETANAAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDALDLAATQQFNEQRGLNFRLPA*
Ga0137370_105493942  3300012285  Vadose Zone Soil  ETSRRHRDSDAVTNPAPASTNETAGVTSSDNDEFVFVRRDGTIFFAVAYSWENGTLRYITSQGLRHMVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0137387_106813562  3300012349  Vadose Zone Soil  AETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDALDLAATQQFNEQRGLNFRLPA*
Ga0137387_109092952  3300012349  Vadose Zone Soil  MKVSQAAPARAAETASPAPQESEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVSQDALDLAATQQFNEQRGLNFRLPA*
Ga0137372_100746641  3300012350  Vadose Zone Soil  QEAPARAAETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDSLDLAATQQFNEQRGLNFRLPA*
Ga0137384_115857891  3300012357  Vadose Zone Soil  MTAPQPAAEAASAATPDDQQFVFVRRDGTLFFAVAYSWENGTLHYVTSQGLRHTVTQDALDLDATRQFNEQ
Ga0137385_106954991  3300012359  Vadose Zone Soil  VTAPQPAAEAASAATPDDEQFVFVRRDGTLFFAVAYSWENGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0137360_101005681  3300012361  Vadose Zone Soil  PARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137360_104029082  3300012361  Vadose Zone Soil  RASQAAAAPAPAAETAGSALPSNDEFVFVRRDGTVFFAVAFAWEGGTLRYVTNQGLRRTVKQDALDMGATQQFNEQRGLNFRSPA*
Ga0137360_109794882  3300012361  Vadose Zone Soil  RASQAAAAPAPAAETAGSALPSNDEFVFVRRDGTVFFAVAYSWEGGTLRYVTNQGLRRTVKQDALDMGATQQFNEQRGLNFRSPA*
Ga0137360_118384202  3300012361  Vadose Zone Soil  GRRYRASEPVMAPPVETASSATPDDEQFVFVRRDGSVFFAVAYAWEKGTLRYITSQGLRRTVTEDALDLDATRQFNEQRGLNFRLPA*
Ga0137361_105054311  3300012362  Vadose Zone Soil  PAPTPAVETNKAAPPENDEFVFVRRDGTVFFAVAYSWENGTLRYVTSEGLRRTVKQDALDMGATQQFNEQRGLNFRLPA*
Ga0137390_107158611  3300012363  Vadose Zone Soil  LQAAAPAPAPAIETANAGPPSNDEFVFVRRDGTVFFAVAYSWENGTLRYVTSEGLRRTVKQDALDMGATQQFNEQRGVIFRAPA*
Ga0137390_107489902  3300012363  Vadose Zone Soil  AAETASPAPQEAEQFVFVRRDGTLFFAVAYVWENGALRYITSEGLRRTVSQDALDLAATQQFNEQRGLNFRLPA*
Ga0137390_115751852  3300012363  Vadose Zone Soil  VSQAAPARAAETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDSLDLAATQQFNEQRGLNFRLPA*
Ga0137390_116031201  3300012363  Vadose Zone Soil  AAPARAAETASPAPQEAEQFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDALDLAATQQFNEQRGLNFRLPA*
Ga0134025_11636551  3300012378  Grasslands Soil  ADDGSAAESPQPEAPDAEVRETDRRNRTPQPVTAPAAETASTAASDNEGFVFVRRDGTVFFAVAYAWEKGTLRYVTSQGLRRTVTQDTLDLDATRQFNEQRGLNFRSPG*
Ga0134031_12046311  3300012388  Grasslands Soil  ETDRRNRTPQPVTAPAAETASTAASDNEGFVFVRRDGTVFFAVAYAWEKGTLRYVTSQGLRRTVTQDTLDLDATRQFNEQRGLNFRSPG*
Ga0137358_100244871  3300012582  Vadose Zone Soil  EFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137358_101906982  3300012582  Vadose Zone Soil  VTEPAPTVEAPSAAAADNEEFVFVRRDGTVFFAVGYVWEEGTLRYVTNQGLRRTVTQDALDLDATRQFNEQRGLNFRLPS*
Ga0137398_100192404  3300012683  Vadose Zone Soil  VPEAATGVDESATAESGQPEIIESENRDRGRRYRVTEAETAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137398_105406281  3300012683  Vadose Zone Soil  ATDAEVREADRRNRTPQPVTAPAAETVGPAASDNDEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFHSPA*
Ga0137398_112141341  3300012683  Vadose Zone Soil  PTPAIETANPAPPSNDEFVFVRRDGTVFFAVAYAWENGMLRYVTSEGLRRTVKQDALDMGATQQFNEQRGVIFRAPA*
Ga0137397_100235951  3300012685  Vadose Zone Soil  SAASNNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVTQDTLDLDATRQFNEQRGLNFHSPA*
Ga0137397_100433802  3300012685  Vadose Zone Soil  VPGAATGVDESATAESGQPEIIESENRDRGRRYGVTEAETAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137397_103519532  3300012685  Vadose Zone Soil  DDGSAAPIPQPEGTDAQAQGTGRGYQAPQPAPVPAPAAETANSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHSVTKDALDLDATREFNEQRGMNFRLPA*
Ga0137359_102876372  3300012923  Vadose Zone Soil  VDESATAESGQPEIIESENRDRGRRYRVTEAETAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137359_115271562  3300012923  Vadose Zone Soil  GSAAEAPQPEATDAEVREADRRNRTPQPVTAPAAETVGPAASDNDEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFHSPA*
Ga0137419_108587282  3300012925  Vadose Zone Soil  FFPIFDGGFPVAGSSVGGDEGPVAETAPPDVTDAELRGRGRRYRAPEPETAPVPEAPGAAPADNEPYVFVKRDGTVFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRLPA*
Ga0137416_105530481  3300012927  Vadose Zone Soil  PEATDAEARETTRRHRATQPVTEPAPTVEAPSAAAADNEEFVFVRRDGTVFFAVGYVWEEGTLRYVTNQGLRRTVTQDALDLDATRQFNEQRGLNFRLPS*
Ga0137404_110942661  3300012929  Vadose Zone Soil  VEEASAAPRQTEEFVFVRRDGTVFFAVAYAWENGTLRYVTSEGLRRSVARETLDLNATQQFNEQRGLNFRLPA*
Ga0137404_119742691  3300012929  Vadose Zone Soil  TDADAREATRRHRAAQPVMESAPTIEAASAAPADNEEFVFVRRDGTLFFAVGYAWENGTLRYVTNQGLRRTVTQDALDLDATRQFNEQRGLNFRLPS*
Ga0137404_122525992  3300012929  Vadose Zone Soil  ARVMQAAPAPPQVEEASAVPRQTEEFVFVRRDGTVFFAVGYAWENGTLRYVTSEGLRRSVARETLDLNATQQFNEQRGLNFRLPA*
Ga0134078_100579892  3300014157  Grasslands Soil  VAAEDGSAAETSQEATDAEVRETDRRYRTPQPVTAPAAETASSAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDTLDLDATRQFNEQRGLNFRSPG*
Ga0137414_10540592  3300015051  Vadose Zone Soil  SAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDTLDLEATRQFNEQRGLNFHSPA*
Ga0137420_10249751  3300015054  Vadose Zone Soil  SRRRCQHRPVPAPAPAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHTVTKDALDLDATREFNEQRGMNFRLPA*
Ga0137420_13085502  3300015054  Vadose Zone Soil  VPGAATDVDESATAEGGQPEIIESENRDRGRRYRVTEAETAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137418_100006297  3300015241  Vadose Zone Soil  VPGAATDVDESATAEGGQPEIIESENRDRGRRYRVSEAETAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA*
Ga0137412_1000072620  3300015242  Vadose Zone Soil  TSSAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDTLDLEATRQFNEQRGLNFHSPA*
Ga0066669_111287691  3300018482  Grasslands Soil  GSAAETSQEATDAEVRETDRRNRTPQPVTAPAAETASTAASDNEGFVFVRRDGTVFFAVAYAWEKGTLRYVTSQGLRRTVTQDTLDLDATRQFNEQRGLNFRSPG
Ga0179592_102650861  3300020199  Vadose Zone Soil  EAETAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA
Ga0179592_105124111  3300020199  Vadose Zone Soil  FPVAGSSVSGDEGPVAETAPPDVTDAEVRGRGRRYRAPEPDPAPASEAPSATPADNEPYVFVKRDGTVFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRLPA
Ga0210407_112467282  3300020579  Soil  AQAQGTGRGYQAPQPAAVPAPAPAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHTVTKDALDLDATREFNEQRGMNFRLPA
Ga0210403_102845771  3300020580  Soil  RPRRTRVYASAQRQENEPASAGPSESQEFVFVRRDGTVFFAVAYSWESGTLRYITTEGLRRSVARDSLDLVATQQFNEQRGMNFRLPA
Ga0210403_105570441  3300020580  Soil  PAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHTVTKDALDLDATREFNEQRGMNFRLPA
Ga0215015_107761941  3300021046  Soil  PAPAAQREAEQFVFVRRDGTVFFAVAYSWDSGMLRYVTQEGLRKSVAGNALDLGATQQFNEQRGLSFHAPA
Ga0210400_115113991  3300021170  Soil  VSEPEMAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFRLPA
Ga0210400_116580831  3300021170  Soil  VPEPAPAPTAESASSAPAENEQFVFVRRDGTVFFAVAYAWENGTLRYVTSQGLRRTVTQDALDLDATRQFNEQRGLNFQLPA
Ga0210408_113016272  3300021178  Soil  AEAPQPEATDAEVREPDRRNRTPQPVTAPAAETAGPAASDDDEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFHSPA
Ga0210402_118691771  3300021478  Soil  QPAQVSAPVVGTDSAALQDTAEFVFVRRDGTLFFAVAYTWENGTLRYINSQGLKQTVTQDALDLGATQQFNEQRGLNFHSPA
Ga0210409_102544061  3300021559  Soil  PIFDGGFPVAGSSLSGDEGPVTETAPPDVTDAEVRGRGRRYGAAEPETAATPVAESASAAPADNEPYVFVKRDGTVFFAVAYSWERGSLRYITSEGLRRVVTQDALDLDATREFNEQRGLNFRSPA
Ga0210409_115542602  3300021559  Soil  PVPAPAPAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYVTPQGLRHTVTKDALDLDATQKFNEERGLNFRLPA
Ga0242656_10344681  3300022525  Soil  GSSVSGDEGPVAESAPPDVTDAEQRGRGRRYRAPEPETAPAAESASAAADNEPYVFVKRDGTVFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRSPA
Ga0179589_103460462  3300024288  Vadose Zone Soil  VSEAETAPPVGPARAEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQFNEQRGLNFRLPA
Ga0137417_11459741  3300024330  Vadose Zone Soil  EDGSAAEAQQPESTDAEVRETDRRYRTPQPVPAPAAETASSAASNNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVTQDMLDLDATRQFNEQRGLNFHSPA
Ga0209240_10420383  3300026304  Grasslands Soil  QPEGTDAQAQGTGRGYQAAQPAPVPAPAPAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHTVTKDALDLDATREFNEQRGMNFRLPA
Ga0209240_12921672  3300026304  Grasslands Soil  EEAEAESRERGRRYRASLQAAAPAPTPAVETANAAPPDNDEFVFVRRDGTVFFAVAYAWEKGTLRYITGQGLRRTVTQDALDLDATRQFNEQRGLSFRSPA
Ga0209239_12835862  3300026310  Grasslands Soil  PEAPDAEVRETDRRNRTPQPVTAPAAETASTAASDNEGFVFVRRDGTVFFAVAYAWEKGTLRYVTSQGLRRTVTQDTLDLDATRQFNEQRGLNFRSPG
Ga0209375_11911342  3300026329  Soil  ETRRRYRAPEPATAPVAETASSGPPDHEEFVFVRRDGTVFFAVAYSWEKGTLHNITSQGLPRTITEDTLDLDATRQFNEQRGLSFRLPA
Ga0257176_10882842  3300026361  Soil  GSAAPIPQPEGTDVQAQGTGRGYQAPQPAPVPAPAAETASSAAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHTVTKDALDLDATREFNEQRGMNFRLPA
Ga0257168_10552852  3300026514  Soil  APAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHTVTKDALDLDATREFNEQRGMNFRLPA
Ga0209059_12053302  3300026527  Soil  DADIEVRETSRRHRVSDAATNLAPAATNETASATSSDNDEFVFVRRDGTIFFAVAYSWENGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPA
Ga0209378_11115621  3300026528  Soil  TAETASPAPQEAEEFVFVRRDGTVFFAVAYVWENGALRYITSEGLRRTVAQDSLDLAATQQFNEQRGLNFRLPA
Ga0209161_100605041  3300026548  Soil  QAPAAEAENAASSDNDGFVFVRRDGTIFFAVAYSWENGALRYVTSQGLRHTVTLDALDLDATRQFNEQRGLTFRLPA
Ga0209161_102276972  3300026548  Soil  TASATSSDNDEFVFVRRDGTIFFAVAYSWENGTLRYVTSQGLRHTVTQDALDLDATRQFNEQRGLNFRLPA
Ga0209161_102568411  3300026548  Soil  SRRRRVSEPVTNPAPAGETAGAASTDNDGFVFVRRDGTIFFAVAYSWENGTLRYVTSQGLRHMVTQDALDLDATRQFNEQRGLNFRLPA
Ga0209648_100162939  3300026551  Grasslands Soil  RRRTRVSQAAPAPVPATETASSALPSNDEFVFVRRDGTVFFAVAYSWENGTLRYVTSQGLRRTVKQDALDMSATQQFNEQRGLNFQSPA
Ga0209648_101287473  3300026551  Grasslands Soil  SDAEPRDTGRRDRAPQPAAGPAPAAETASSAPAESEPFVFVRRDGTVFFAVAYTWEKGTLRYITSQGLRQTVTKDALDLDATRQFNEQRGLNIQLPA
Ga0209648_102587681  3300026551  Grasslands Soil  RYRASLQSVTPGPAAPEVEAANPAPRENDEFVFVRRDGTVFFAVAYSWENGTLRYVTSEGLRRTVKQDALDMGATQQFNEQRGVIFRSPA
Ga0179587_108274132  3300026557  Vadose Zone Soil  GYQAPQPTPVPAPAAETANSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHSVTKDALDLDATREFNEQRGMNFRLPA
Ga0207949_10225712  3300026999  Forest Soil  AELRGRGRHYYRAPEPETGPTPAVESASDAPADNEPYVFVKRDGTVFFAVAYSWEKGTLRYITNEGLRRVVTQDALDLDATRQFNEQRGLNFRSPA
Ga0209735_10568611  3300027562  Forest Soil  ETDRRYRTPQPAPAPAAETASSAASDNEEFVFVRRDGTVFFAVAYAWEKGTLRYVTSQGLRRTVMQDTLDLDATRQFNEQRGLNFRLPA
Ga0209220_10781812  3300027587  Forest Soil  IPQPEGTDAQAQGTGRGYQAPQPVPAPAPAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYVTPQGLRHTVTKDALDLDATREFNEQRGLNFRLPA
Ga0209733_10810562  3300027591  Forest Soil  APAPAPAAETASSAPAENEEFVFVRRDGTVFFAVAYTWDRGTLRYITPQGLRHTVTKDALDLDATREFNEQRGLNFRLPA
Ga0209117_10790381  3300027645  Forest Soil  TDRRYRTPQPVAAPAAETASSAVSDNEEFVFVRRDGTLFFAVAYAWEKGTLRYITSQGLRRTVTQDVLDLDATRQFNEQRGLNFHLPA
Ga0208990_11721792  3300027663  Forest Soil  TPQPAPAPAAETASSAASNNEEFVFVLRDGTVFFAVAYAWEKGTLRYITSQGLRHTVTQDTLDLDATRQFNEQRGLNFHSPA
Ga0209180_107061532  3300027846  Vadose Zone Soil  TTDADARERGRRNRIPEPVTAPAAEPASAAPTDNEGFAFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVTQDALDLDATRQFNEQRGLNFLSPT
Ga0209283_102704791  3300027875  Vadose Zone Soil  VVRRARAPEPPAAAPAEAAPAPVRDEDEFVFVRRDGTVFFAVAYAWENGALRYVTPEGLRRVVARDALDLAATEQFNEQRGLVFQRPV
Ga0209283_102881701  3300027875  Vadose Zone Soil  APGPAAPEVEAANPAPRENDEFVFVRRDGTVFFAVAYAWENGTLRYVTSEGLRRTVKQDALDMGATQQFNEQRGVIFRSPA
Ga0209067_105142331  3300027898  Watersheds  VDDTATAESAQPKITDSENRDRGRRYRVAEPQPAPAVEAANGEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYVTSQGLRHTVTQDALDMDATRQFNEQRGLNFRLPA
Ga0138298_10655561  3300031015  Soil  MPGAAAPVEESSAAESGQPEVMESENRDRGRRYRVTEAETAPPVEPARAEPVDNDEFVFVRRDGTVFFAVAYSWEKGTLRYINSQGLRHTVTQDALDLDATKQF
Ga0307477_105136931  3300031753  Hardwood Forest Soil  RGRRNRVAEPVTAPPVEASRAMPADDEQFVFVRRDGTVFFAVGYAWENGTLRYVTNQGLRRTVTQDALDLDATREFNEQRGLNFRLPS
Ga0307477_111692582  3300031753  Hardwood Forest Soil  TDAEFRGRGRRYREPETAPTQAPESANAAPADNEPYVFVKRDGTLFFAVAYSWEKGSLRYITSEGLRRVVTQDALDLDATRQFNEQRGLNFRLPA
Ga0307473_104478611  3300031820  Hardwood Forest Soil  AAEAPQPEATDAEVREADRRNRTPQPVTAPAAETAGPAASDNDEFVFVRRDGTVFFAVAYVWEKGTLRYITSQGLRRTVTQDALDLDATRQFNEQRGLNFHSPA
Ga0307479_106194612  3300031962  Hardwood Forest Soil  ESENRDRGRRYRVTEAETAPPAQPASSEPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYINNQGLRHTVTQDALDLEATKQFNEQRGLNFRLPA
Ga0307479_120383411  3300031962  Hardwood Forest Soil  EDSSAADSGQPEDTDVDARGASRRRVAAAPLSTPPTEASNATPEESEEFVFVRRDGTLFFAVGYAWESGTLRYVTNQGLRRTVTQDALDLDATRQFNEQRGLNFRLPS
Ga0311301_100738315  3300032160  Peatlands Soil  MADTASAGPADNDEFVFVRRDGTVFFAVAYSWEKGTLRYITSEGLRHSVMQDALDLGATQQFNEQRGLNFRLPA
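
A minimal sketch for splitting one row of the table above into its four fields, whether or not the columns are separated by whitespace. It relies on patterns observed in the rows shown here, which are assumptions rather than documented NMPFamsDB conventions: the sample taxon ID is exactly 10 digits, the habitat name ends in a lowercase letter, and the sequence is an uppercase string with an optional trailing `*`. The names `ROW` and `parse_row` are hypothetical.

```python
import re

# Protein ID, 10-digit sample taxon ID, habitat (ends in a lowercase letter),
# sequence (uppercase residues, optional trailing '*').
# These patterns are inferred from the rows of this table only.
ROW = re.compile(
    r"^(\S+?)\s*(\d{10})\s*([A-Z][A-Za-z, ]*?[a-z])\s*([A-Z]+\*?)$"
)

def parse_row(line):
    """Split one table row into a dict of its four fields."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {line[:40]!r}")
    protein_id, taxon_id, habitat, sequence = match.groups()
    return {
        "protein_id": protein_id,
        "sample_taxon_id": taxon_id,
        "habitat": habitat,
        "sequence": sequence,
    }

# Second row of the table, with the columns run together
fused = (
    "JGI12053J15887_1062841423300001661Forest Soil"
    "AAETASSAASNNEEFVFVRRDGTVFFAVAYAWEKGTLRYITSQGLRHTVTQDTLDLDATRQFNEQRGLNFHSPA*"
)
record = parse_row(fused)
print(record["protein_id"], record["sample_taxon_id"], record["habitat"])
```

The lazy match on the protein ID matters here: the ID itself ends in digits, so the taxon ID is taken to be the 10 digits immediately before the first letter of the habitat name.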




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice