NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F066661


Overview

Basic Information
Family ID: F066661
Family Type: Metagenome
Number of Sequences: 126
Average Sequence Length: 81 residues
Representative Sequence: MTPRAYIFGIACVLIAARFGNDRLNAAEPSVQPEPPWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYE
Number of Associated Samples: 107
Number of Associated Scaffolds: 126

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 64.08%
% of genes near scaffold ends (potentially truncated): 80.16%
% of genes from short scaffolds (< 2000 bps): 66.67%
Associated GOLD sequencing projects: 95
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.28


Most Common Taxonomy
Group: Bacteria (81.746% of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (32.540% of family members)
Environment Ontology (ENVO): Unclassified (41.270% of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (58.730% of family members)
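
The shares reported above can be turned back into approximate member counts. The following is a minimal Python sketch, assuming each percentage is simply a member count divided by the family's 126 sequences and rounded for display (the page does not state this explicitly). The helper name `members_from_percent` and the dictionary labels are illustrative, not part of NMPFamsDB.

```python
# Convert the percentages reported for family F066661 into member counts.
# Assumption: each share is count / 126, rounded for display on the page.
FAMILY_SIZE = 126  # "Number of Sequences" from Basic Information

reported_percent = {
    "Bacteria (most common taxonomy)": 81.746,
    "GOLD: Grasslands soil path": 32.540,
    "ENVO: Unclassified": 41.270,
    "EMPO: Soil (non-saline)": 58.730,
}


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a reported share."""
    return round(percent / 100 * family_size)


counts = {label: members_from_percent(p) for label, p in reported_percent.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption, the taxonomy share corresponds to 103 of the 126 members.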




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family.
[Interactive alignment view: 126 sequences, with a column ruler running to position 146. The sequence labels are the Protein IDs listed under Family Sequences.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 23.21%, β-sheet: 15.18%, Coil/Unstructured: 61.61%

Predicted 3D Structure

pTM-score: 0.28

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
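
The cutoff in the note above can be written as a one-line check. This is only a sketch: the 0.7 threshold and the 0.28 score come from this page, while the function name `is_low_confidence_model` is hypothetical and not an NMPFamsDB or AlphaFold2 API.

```python
# Hypothetical helper mirroring the rule stated in the note above:
# a model whose pTM-score is below 0.7 is treated as low confidence.
PTM_THRESHOLD = 0.7  # cutoff quoted on this page


def is_low_confidence_model(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when the pTM-score falls below the cutoff."""
    return ptm_score < threshold


family_ptm = 0.28  # pTM-score reported for family F066661
print(is_low_confidence_model(family_ptm))
```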



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomic distribution of family members (chart): All Organisms 81.7%, Unclassified 18.3%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Groundwater Sediment; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Corn Rhizosphere; Arabidopsis Rhizosphere.
Labelled slice shares, in chart order: 3.2%, 3.2%, 7.1%, 21.4%, 7.9%, 32.5%, 4.0%, 3.2% (the extracted text does not tie these values to individual habitats).



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPIPI_02310640 | 2088090014 | Soil | MTPRAYIFGIACVLIAARFGNDRLNAAEPSVQPEPPWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYE
INPgaii200_08949441 | 2228664022 | Soil | MTPRAYIFGIGCMLIAGEFASDQLIAAERSPQREPGWIEIGSEKAVIARDSKSADARNALAWTVDSSEPVDWSLLDPVKARARKMREVS
INPgaii200_08984252 | 2228664022 | Soil | MTPRAYIFGIGCMLIAGEFASDQLIAAERSPQREPGWIEIGSEKAVIARDSKSADARNALAWTVDSSEPVDWSLLEEDPNR
INPgaii200_09791391 | 2228664022 | Soil | MTRHAYILGIGCMLIAGEFASHQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDX
ICChiseqgaiiDRAFT_04919241 | 3300000033 | Soil | MRPCAYILGIGCMLIAARLSDDRLIATEPSIQPEPRWVEIGSEKAVIARDSKSADGRXALAWTVDSSEPV
ICChiseqgaiiFebDRAFT_108205301 | 3300000363 | Soil | MLIAARLSDDRLIATEPSIQPEPRWVEIGSEKAVIARDSKSADGRXALAWTVDSSEPVD
INPhiseqgaiiFebDRAFT_1018987572 | 3300000364 | Soil | MLVAAEFVCAKVISAESSPPAERRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPL
INPhiseqgaiiFebDRAFT_1018996642 | 3300000364 | Soil | MLVAAEFACDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWSLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVGDKG
INPhiseqgaiiFebDRAFT_1019002172 | 3300000364 | Soil | MLVAAEFVCDKLISAESSPPAEPRWIEIGSERAAIVRDSKSADGRNALAWTXDSSEPIDWPLLEK
INPhiseqgaiiFebDRAFT_1019005361 | 3300000364 | Soil | MTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALA
INPhiseqgaiiFebDRAFT_1019005752 | 3300000364 | Soil | MTPRAYIFGIACMLIAARFGNDRLTAAEPSVQQEPPWIEIGSEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFY
F14TC_1000291081 | 3300000559 | Soil | MTLRAYIFGIGCMLIAARFGNDRLTAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSNEPVDWSLLE
JGI11643J11755_116474161 | 3300000787 | Soil | MKPCAYILGIGCVLIAARFVGDRLIGAEPSAQPEPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPV
JGI11643J11755_116708981 | 3300000787 | Soil | MLIAARLSDDRLIATEPSIQPEPRWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVD
JGI1027J12803_1001824721 | 3300000955 | Soil | MRKRVTTFPQRNSIVLALRSTYHQKTLKRHSYILGIGCTLIAARFAGDRLLAAEPSAQPEPRWIEIGSERAVIARDSKSADGRNALAWIADSSEPIDWSLLEKDPN
JGI1027J12803_1004127501 | 3300000955 | Soil | MLIAGEFASDQLIAAERSPQREPGWIEIGSEKAVIARDSKSADARNALAWTVDSSE
JGI1027J12803_1035481762 | 3300000955 | Soil | MLVAAEFACDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDADHFYEQYDVKEIWV
JGI24036J26619_100792171 | 3300002128 | Corn, Switchgrass And Miscanthus Rhizosphere | MTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWS
JGI25381J37097_10924272 | 3300002557 | Grasslands Soil | VKPRTYLLGIACVLVAAEFVCXKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADKGGYVRPGSH
JGI25617J43924_103042801 | 3300002914 | Grasslands Soil | MTPRAYILGIGCTLIAGGFASDQLIAAEPSPQREPGWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLL
Ga0062595_1011583282 | 3300004479 | Soil | MLIAARFGNDRLTAAEPSVQQEPPWIEIGSEKAVIARDSKSADGRNALAWTVD
Ga0062592_1003649752 | 3300004480 | Soil | MTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAI
Ga0062594_1004028582 | 3300005093 | Soil | MKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAI
Ga0066673_105609791 | 3300005175 | Soil | VLPSRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDAD
Ga0066688_101939662 | 3300005178 | Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDVERFYEQYDVKEIWVVNLSD
Ga0066684_108396591 | 3300005179 | Soil | MNERACAFGIGCILIVAQLSGDRIVAADSSAQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDNSEPVDWSLLEKDPNHFYEQYD
Ga0066685_109139121 | 3300005180 | Soil | MTLRVYIFGIAWVLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEP
Ga0066678_102018761 | 3300005181 | Soil | VKPRTYLLGIACVLVAAEFVCAKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVN
Ga0066676_104378242 | 3300005186 | Soil | MERPCYTFTVACMLLSGVLASSNAMAAEPSGESRWIEIGSAKAVILRDSKSADGRNALAWTIDSTEPVDWSLLEKDADHFYEQYDVKEI
Ga0066675_102344952 | 3300005187 | Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADKGG
Ga0065712_108270962 | 3300005290 | Miscanthus Rhizosphere | MTPRAYIFGIACMLIAARFGNDRLNAAEPSVQPEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVD
Ga0070711_1017612171 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MTLRAYIFGIACMLIAARFGNDRLTAAEPSVQPESPWIEIGPEKAAIARDSKSADGRNALAWTVDSNEPVDWSLLEKD
Ga0066689_106518772 | 3300005447 | Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLE
Ga0066682_104109462 | 3300005450 | Soil | MTLRAYTFGIACMLIAARFGNHRLAAAAEPSVQPDRPWIEIGSEKAVIARDSKSADGRNALAWIIDSSEPVDWSLLEKDPNRFY
Ga0070662_1007185941 | 3300005457 | Corn Rhizosphere | MKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPN
Ga0066692_100117556 | 3300005555 | Soil | VLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVERFYEQYDVKEI
Ga0066698_108318302 | 3300005558 | Soil | MTLRAYTFGIACMLIAARFGNHRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTV
Ga0070664_1008022342 | 3300005564 | Corn Rhizosphere | MTPRAYIFGIACMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNR
Ga0066693_101774392 | 3300005566 | Soil | VLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVER
Ga0066705_100096767 | 3300005569 | Soil | VLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGT
Ga0066706_102045702 | 3300005598 | Soil | MTLRVYIFGIAWVLIAARFGNDRLAAAAEPSVQPDRPWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDP
Ga0066903_1031414572 | 3300005764 | Tropical Forest Soil | MLVAAKFVGHELISAESSPPAEPRWIETGSERAVIVRDSRSADGRNALAWTIDSGEPIDWSLLEKDVDHFYEQYEVKEIWVLNLSDKKKIGTVGDKGGY
Ga0066652_1000806203 | 3300006046 | Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDV
Ga0066652_1017300772 | 3300006046 | Soil | MTLRVYIFGIAWVLIAARFGNDRFAAAAEPSVQPDRPWVEIGLEKAVIARDSKSADGRNALAWTIDSSEPVDWSLLEKDPNRFY
Ga0066653_100261841 | 3300006791 | Soil | MLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFY
Ga0066665_100002541 | 3300006796 | Soil | VLVAAEFVCAKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGT
Ga0066659_100029581 | 3300006797 | Soil | VLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSESADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGT
Ga0066660_111928412 | 3300006800 | Soil | MTPRAYILGIGCMLIAGELASDQLIAAEPSPQREPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKD
Ga0075428_1011989661 | 3300006844 | Populus Rhizosphere | MKPCAYVLGIGCVLIAAWFVGDGLIGAEPSAQPEPRWIEIGSEKAVIARDSKSADGRNALAWIVDSSEPVDWSLLEKDPN
Ga0075433_104291962 | 3300006852 | Populus Rhizosphere | MLVAEFVCDKLISAESSPSAEPRWIEIGSERAAIVRDSKSADGRNALAWTIDSSEPIDWP
Ga0075435_1009688312 | 3300007076 | Populus Rhizosphere | MLVTGQLANDSVVAAEPSQPGESRWIEIGPEKAVIARDSKSADGRNAMAWT
Ga0075418_114699542 | 3300009100 | Populus Rhizosphere | MKPCAYVLGIGCVLIAAWFVGDGLIGAEPSAQPEPRWIEIGSEKAVIARDSKSADGRNALAWIVDSSEPVDWSLLEKDP
Ga0099792_105632152 | 3300009143 | Vadose Zone Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADKG
Ga0126380_113621462 | 3300010043 | Tropical Forest Soil | MKRRTHLLEVACILVAAEFVCDKLILAESSPAAEPRWIELGSERAAIVRDSKSADGRNALAWTIDSSEPIDWSLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVGDKGG
Ga0134065_100629051 | 3300010326 | Grasslands Soil | MILRVYIFGIAWVLIAARFGNDRLAAATEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVEWSLLEKDPNRFYEQYEVKAIWVINLT
Ga0134062_103359802 | 3300010337 | Grasslands Soil | VKPRTYLLGIACVLVAAEFVCAKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVE
Ga0126372_115859992 | 3300010360 | Tropical Forest Soil | MKRRTYLLEVACMLVAAEFVCDKLTLAESSPPSEPGWSEVGSERAAIVRDSKSADGRNALAWTIDSSEPVDWSLLEK
Ga0126377_124866821 | 3300010362 | Tropical Forest Soil | MKRRTYLLEVACMLVAAGFVCDKLISAESSPSAEPRWIEIGSERAAIVRDSKSADGRNALAWTIDSSEPI
Ga0134066_100899761 | 3300010364 | Grasslands Soil | VKPRTYLLGIACVLVAADFVSDKLMSAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEP
Ga0134128_130806091 | 3300010373 | Terrestrial Soil | MTLRAYIFGIACMLIAARFGNDRLTAAEPSVQPESPWIEIGPEKAAIARDSKSADGRNALAWTVDSNEPVDWSLLEK
Ga0134126_129680682 | 3300010396 | Terrestrial Soil | MTPRVYIFGIACMLIAARFGNDRLTAAEPSVQPESPWIEIGPEKAAIARDSKSADGRNALAW
Ga0134124_128853712 | 3300010397 | Terrestrial Soil | MNERACVFGLGCILIVAQAADDRIVAADPSPQSGPRSIDIGSEKVVIARDSMSADGRNALAWTVDSSQPVDWSLLGKDPNHFYEEFDVKEIWVVNIPDKKKVGTVAD
Ga0137364_102034681 | 3300012198 | Vadose Zone Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDVERFYEQYDVK
Ga0137383_1000640311 | 3300012199 | Vadose Zone Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKE
Ga0137383_103999632 | 3300012199 | Vadose Zone Soil | MKPLAYILGLSCLLVAGQFAADKLAAAESAVQSEPRWIQIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDSDHFYEHYEVKEIWVLNIPD
Ga0137382_100147261 | 3300012200 | Vadose Zone Soil | MNECACAFGIGCILIVAQLSGDRIVAADSSAQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPNHFYE
Ga0137374_100181867 | 3300012204 | Vadose Zone Soil | MTPRAYIFGIACLLIAARFGNDRLAAATEPSVQPELPWIEIGSEKAVIARDSKSADGRNALAWTVDS
Ga0137374_102075651 | 3300012204 | Vadose Zone Soil | MTLRAYTFGIACMLIAARFGSDRLAAAAESSVQPDRPWVEIGSEKAVIARDSKSADGRNA
Ga0137374_106058502 | 3300012204 | Vadose Zone Soil | MTLRAYTFGIACMLIAARFGNDRLAAAAESSVQPDRPWVEIGSEKAVIARDSKSADGRNA
Ga0137374_107183541 | 3300012204 | Vadose Zone Soil | MKPCAYILGIGCVLIAARFVGDRLIGAEPSGQPETPWIEIGSEKAVIARDSKSADGRNALAWTIDSNEPVDWSL
Ga0137377_105866331 | 3300012211 | Vadose Zone Soil | MTPRAYILGIGCMLIAGELASDQLIAAEPSPQREAGWIEIGSEKAVMARDSKSADGRNALAWTVDSREPVD
Ga0137370_101014431 | 3300012285 | Vadose Zone Soil | MWQRVTAFPQRNCIVLPSRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEKGSEKVVIARDSKSADGRNALGWTVDSNEPVDWSLLEKDADHFYEQYD
Ga0137387_100140931 | 3300012349 | Vadose Zone Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYD
Ga0137369_100798211 | 3300012355 | Vadose Zone Soil | MKPCAYILGIGCVLIAARFVGDRLIGAEPSGQPETPWIEIGSEKAVIARDSKSADGRNALAWTIDSNEP
Ga0137369_102439581 | 3300012355 | Vadose Zone Soil | MSPRAYIFGIACMLIAARFGNYTLTAAAESSVQQEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSSESIDWSL
Ga0137371_109772141 | 3300012356 | Vadose Zone Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDVERFYEQYDVKEIWVVNLSDK
Ga0137375_109327642 | 3300012360 | Vadose Zone Soil | MTLRAYTFGIACMLIAAGSGNDRLAAAAEPPVQPEPPWIEIGSEKAVIARDSKSADDRNALAWTVDSDEPVDWSLLEKDPNRFYEQYDVKA
Ga0137373_102372412 | 3300012532 | Vadose Zone Soil | MTLRAYTFGIACMLIAAGSGNDRLAAAAEPPVQPEPPWIEIGSEKAVIARDSKSADDRNALAWTVDSDEPVDWSLLEKDPNRFYEQYDVKAIWVINL
Ga0137373_108747192 | 3300012532 | Vadose Zone Soil | VTPRAYIFGIACLLIAPGSGNDGLAAAAEPSVQPEPPWIEIGSEKLVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDP
Ga0137395_100897442 | 3300012917 | Vadose Zone Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADKGGYVRPGS
Ga0137419_100295604 | 3300012925 | Vadose Zone Soil | MKPCAYTLAIGYMLIAARLAGDRLIAAEPSVQPESRWIEIGSEKLIIARDSKSADGRNALAWAVDSNEPVDWSLLEKDP
Ga0137416_101117242 | 3300012927 | Vadose Zone Soil | MLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDW
Ga0137404_113491511 | 3300012929 | Vadose Zone Soil | MKPCAYVLGIGCMLIAGDFASDQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSADGRNALAW
Ga0137407_113063591 | 3300012930 | Vadose Zone Soil | MTLRAYTFGIACMLITARFGNDRLAAEAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVD
Ga0134076_101936562 | 3300012976 | Grasslands Soil | MTLRAYILGIACVLIAAPFGNDRLAAAAEPSAQPESRWIEIGSEKAVIARDSKSADGRNALA
Ga0134087_100284882 | 3300012977 | Grasslands Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDIEHFYEQYDVKEIWVVNLSDKKKIGTVGDKGGYVRPG
Ga0134087_102374822 | 3300012977 | Grasslands Soil | MNECACAFGIGCILIVAQLSGDRIAAADSSPQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPN
Ga0164308_100211214 | 3300012985 | Soil | MNKRACVFGLGCILIVAQAADDRIVAADPSPQSGPRSIDIGSEKVVIARDSMSADGRNALAWTVDSSQPVDWSLLEKDPNHFYEEFDVKEIWVVNIPDKKKVG
Ga0134081_100242642 | 3300014150 | Grasslands Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGAVADKGGYVR
Ga0134078_103936182 | 3300014157 | Grasslands Soil | MWQRVTAFPQRNCIVLPSRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALGWTVDSNEPVDW
Ga0157377_104163291 | 3300014745 | Miscanthus Rhizosphere | MTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYE
Ga0137412_1000062524 | 3300015242 | Vadose Zone Soil | MKPCAYTLAIGYMLIAALLAGERLIAAEPSVQPELRWIEIGSEKLIIARDSKSADGRNALAWAVDSNEPVDWSLLEKDPNR
Ga0137409_105367322 | 3300015245 | Vadose Zone Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDVERFYEQ
Ga0134072_100071151 | 3300015357 | Grasslands Soil | VKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVE
Ga0132256_1012935642 | 3300015372 | Arabidopsis Rhizosphere | MTLRAYIFGIACMLITARFGNDTLTAAPESSAQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYE
Ga0134074_11122412 | 3300017657 | Grasslands Soil | MTLRAYTFGIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTIDSSEPVDWSLLEKD
Ga0184608_102096842 | 3300018028 | Groundwater Sediment | MLIAGRFAGDRLAAAAEPSAPSEPRWIEIDSEKVVIARDSKSADGRNALAWIVDSSEPIDWSLLEKDADRFYEQYDLKAIRVINLPDKKKVG
Ga0184619_104122951 | 3300018061 | Groundwater Sediment | MTPRAYIFGIACMLIAGEFASDSLIAAEPSPQPEPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYEVKAIW
Ga0184617_11790391 | 3300018066 | Groundwater Sediment | VLALRLTYHEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEARWIEIGSEKAVIARDSKSADGRNILAWTVDSNEPVDWSLLEKDPDRFYER
Ga0184617_11881061 | 3300018066 | Groundwater Sediment | MTLRACIFGIACILIAARFGNDALTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWLLLEKDPNRF
Ga0193715_10600892 | 3300019878 | Soil | MTLRAYIFGIVCMLIAARFGNDTLTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKD
Ga0193725_11285152 | 3300019883 | Soil | MLVAGQFAANKLAATETSAQSEPRWIQIGSEKVVIARDSKSADGRNALAWTVDS
Ga0193697_10301902 | 3300020005 | Soil | MKPCAYTLAVGYMLIAVRLAGDGLIAAEPSIQPEQRWIEIGSEKAVIVRDSKSADGRNALAWTVDSTEPVDWSLLEKDPNRFYEQYDVKAIWVN
Ga0179596_100535221 | 3300021086 | Vadose Zone Soil | VLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDV
Ga0193695_10045671 | 3300021418 | Soil | MTARAYILGIACMLIAGEFASDSLIAAEPSPQPEPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQ
Ga0224452_11773122 | 3300022534 | Groundwater Sediment | MTRGAYIFGIACMLITARFGNDTLTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSN
Ga0222623_100362212 | 3300022694 | Groundwater Sediment | MAARFGNDRLAAAAEPSVQPEPPWIEIGSEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVIN
Ga0222622_103484951 | 3300022756 | Groundwater Sediment | MLVSARFGGDRLIAAELSAQPEARWIEIGSEKAVIVRDSKSADGRNALAWTVDSNEPIDWSLLEKDPDRFYEQYDVKAIWVIN
Ga0222622_110027971 | 3300022756 | Groundwater Sediment | MTPRAYIFCVACVLMAARFGNDRLAAAAEPSVQPEPPWIEIGSEKAVIARDSKSADGRNALA
Ga0207689_107191711 | 3300025942 | Miscanthus Rhizosphere | MKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFY
Ga0207679_117904481 | 3300025945 | Corn Rhizosphere | MTPRAYIFGIACMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTV
Ga0209234_10770881 | 3300026295 | Grasslands Soil | VLVAAEFVCAKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVK
Ga0209238_11932742 | 3300026301 | Grasslands Soil | VLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDVERFYEQYDVKEIWVVNLSDKKKIGTV
Ga0209688_10469932 | 3300026305 | Soil | VLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLERDVEHFYEQYDVKEIWVVNLSDKKKIGTVA
Ga0209239_10490022 | 3300026310 | Grasslands Soil | MTLRAYTFGIACMLIAARFGNHRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALA
Ga0209470_10675591 | 3300026324 | Soil | MTPRAYIFGVACVLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNE
Ga0209473_10057311 | 3300026330 | Soil | VLVAAEFVCAKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYNVKEIWVVNLSD
Ga0257156_11023512 | 3300026498 | Soil | MLIAGEFASDQLIAAEPSPQREPGWIEIGSEKAVIARDSKSADGRNALAWTVDSREPV
Ga0209690_10134906 | 3300026524 | Soil | MLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIGRDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIP
Ga0209376_12108331 | 3300026540 | Soil | VKPRTYLLGIACVLVAAEFVCAKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADK
Ga0209156_100163961 | 3300026547 | Soil | VLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLS
Ga0209388_10998412 | 3300027655 | Vadose Zone Soil | MTLRAYTFGIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDW
Ga0307307_101103102 | 3300028718 | Soil | MTRGAYIFGIACMLITARFGNDTLTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPTRFYEQY
Ga0307284_104881362 | 3300028799 | Soil | MKPCAYVLGIGCMLIAGDFASDQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSADGRNALVW
Ga0307277_100814702 | 3300028881 | Soil | MNEHACAFGIGCILIVAQLSGDRIVAADSSPQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPNHFYEQYEVKEIWI
Ga0307308_101079192 | 3300028884 | Soil | MNARACVLGLGCILIVAQVAGDRIVAADSSPQSGPRSIDIGSEKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPNHFYEQYDVKKI
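
Rows like those above map naturally onto FASTA records. The sketch below serialises three members copied from this table. The `Member` type, the `to_fasta` helper, and the header layout (`>id taxon=... habitat=...`) are illustrative assumptions, not an export format defined by NMPFamsDB.

```python
# Sketch: serialise a few family members to FASTA text.
# The records are copied from the Family Sequences table; the header layout
# (">id taxon=... habitat=...") is an illustrative assumption.
from typing import NamedTuple


class Member(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


members = [
    Member("GPIPI_02310640", "2088090014", "Soil",
           "MTPRAYIFGIACVLIAARFGNDRLNAAEPSVQPEPPWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYE"),
    Member("Ga0062595_1011583282", "3300004479", "Soil",
           "MLIAARFGNDRLTAAEPSVQQEPPWIEIGSEKAVIARDSKSADGRNALAWTVD"),
    Member("Ga0075435_1009688312", "3300007076", "Populus Rhizosphere",
           "MLVTGQLANDSVVAAEPSQPGESRWIEIGPEKAVIARDSKSADGRNAMAWT"),
]


def to_fasta(records, width=60):
    """Return FASTA text, wrapping each sequence at `width` residues."""
    lines = []
    for r in records:
        lines.append(f">{r.protein_id} taxon={r.taxon_id} habitat={r.habitat}")
        lines.extend(r.sequence[i:i + width] for i in range(0, len(r.sequence), width))
    return "\n".join(lines) + "\n"


fasta = to_fasta(members)
print(fasta)
```

Wrapping at 60 residues is a common convention rather than a requirement; pass a larger `width` to keep each sequence on one line.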




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice