NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F066951



Overview

Basic Information
Family ID F066951
Family Type Metagenome / Metatranscriptome
Number of Sequences 126
Average Sequence Length 96 residues
Representative Sequence MGLGGMDTLHGKRNCKVGSCGSAAITTLDRQALCLNHFLLRCYERLEGLDPRGRKFTAEPVDLASMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDIL
Number of Associated Samples 89
Number of Associated Scaffolds 126

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 75.20 %
% of genes near scaffold ends (potentially truncated) 96.83 %
% of genes from short scaffolds (< 2000 bps) 87.30 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.44

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (HMM logo, powered by Skylign)

Most Common Taxonomy
Group Bacteria (99.206 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(53.175 % of family members)
Environment Ontology (ENVO) Unclassified
(49.206 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(53.968 % of family members)
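Because these values are given as percentages of family members, they can be converted back to approximate member counts. The following is a minimal sketch, assuming each percentage is taken over the 126 sequences listed under Basic Information; the helper name `members_from_percent` is illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 126  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value to the nearest whole member count."""
    return round(percent / 100 * family_size)


# Percentages as reported in the Most Common Taxonomy / Ecosystem blocks.
reported = {
    "Bacteria (NCBI Taxonomy)": 99.206,
    "Vadose Zone Soil (GOLD)": 53.175,
    "Unclassified (ENVO)": 49.206,
    "Plant rhizosphere (EMPO)": 53.968,
}

for label, pct in reported.items():
    print(f"{label}: about {members_from_percent(pct)} of {FAMILY_SIZE} members")
```

Under that assumption the four values correspond to 125, 67, 62 and 68 members respectively.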




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family (interactive display powered by MSAViewer; alignment columns 1-148, with one row per protein ID as listed under Family Sequences).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 46.46%, β-sheet: 0.00%, Coil/Unstructured: 53.54%
Feature Viewer tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score
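The secondary structure distribution is consistent with the 99-residue representative sequence: 46 of 99 residues gives 46.46% and 53 of 99 gives 53.54%. The sketch below shows how such a distribution can be computed from a per-residue assignment string. The residue counts are an inference from the reported percentages, and the `H`/`E`/`C` string is a hypothetical stand-in, since the page does not expose the actual per-residue prediction as text.

```python
from collections import Counter


def ss_distribution(assignment: str) -> dict:
    """Return the percentage of each secondary-structure state in a per-residue string."""
    if not assignment:
        raise ValueError("assignment string is empty")
    counts = Counter(assignment)
    return {state: round(100 * n / len(assignment), 2) for state, n in counts.items()}


# Hypothetical assignment: H = alpha-helix, C = coil. Only the counts matter here;
# the real positions of helices along the sequence are not known from the text.
example = "H" * 46 + "C" * 53
print(ss_distribution(example))
```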

Predicted 3D Structure

Structure Viewer


Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.44
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
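The two confidence rules on this page, the pLDDT colour bands and the pTM < 0.7 cutoff for a low-quality model, can be written down as a short sketch. The function names are illustrative, and the handling of non-integer pLDDT values that fall between the listed bands (for example 50.5) is an assumption, since the legend only gives integer ranges.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to the band labels shown in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:  # assumption: values between 50 and 51 fall into the next band up
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


def is_low_quality_model(ptm: float, cutoff: float = 0.7) -> bool:
    """Apply the page's rule: a model with pTM below the cutoff is low confidence."""
    return ptm < cutoff


print(is_low_quality_model(0.44))  # pTM-score reported for family F066951
print(plddt_band(85))
```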



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (chart, powered by ApexCharts): labels All Organisms, Unclassified; value shown 99.2%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (chart, powered by ApexCharts)
Habitat labels: Watersheds, Vadose Zone Soil, Grasslands Soil, Soil, Hardwood Forest Soil, Tropical Forest Soil, Forest Soil, Populus Rhizosphere
Values shown: 53.2%, 4.0%, 13.5%, 7.9%, 5.6%, 5.6%, 5.6%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID	Sample Taxon ID	Habitat	Sequence
JGI12635J15846_101920703	3300001593	Forest Soil	MDTMLGKRDCSVGSCGSAAITTLDHHALCLNHFLLRCYEKLEGLDPRGRKFSVEPVDLVSMRAFVEECSRKALDVSLQSENLSNLQRGRLLDILLWA
JGI20242J16303_1085851	3300001648	Forest Soil	MGLGGMDTKQRERNCFVDSCSSAPVTSLGQQDLCLNHFLLRCYEKLERLDPRGGRFCSETLDAAAMRAFIEECSRKALDVSLHCKDLSNLERGRLLDILL
JGI25617J43924_100413073	3300002914	Grasslands Soil	MDLGGMDTMLGKRNCRMGSCGSAAITTLDRQALCLNHFLLRCYEKLEGLDPRGRKFSAEPVDLASMRAFIEECSRKALDISLQSKKLSNLQRGRLLDILL
JGI25616J43925_102248161	3300002917	Grasslands Soil	MLGKRKCRVGSCSGAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKCSAEPLEVVSMRAFIEECSRKALDVSLQSENLTNLQRGRLLDILLWAGELF
Ga0058896_12663341	3300004101	Forest Soil	MGLCGMDMEYRERNCDINPCASAAITALDHQDLCLNHFLLRCYERLESLDPRGRQFCAEPLDAAAMRAFIEECSRKALDVSLHSENLSNLERGRLLDILLWAGELFL
Ga0066672_102191583	3300005167	Soil	MDPKENETVHRGESVGLGEMAAMLEKRNCRMGSCSRSAITTLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAGE
Ga0066690_106098701	3300005177	Soil	MDPKENEIARRGESVGLGEMAAMLEKRNCRMGSCSRSAITTLDRQALCLNHFLLRCYEELEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRLL
Ga0066684_105163942	3300005179	Soil	MDTMLRKRNCRMGSCGGSAITALDHQALCLKHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLD
Ga0066671_108907412	3300005184	Soil	MGSCGGSAITALDHQALCLKHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWASELFLLLRAPRLTLA
Ga0066675_104208351	3300005187	Soil	MDTMLRKRNCRMGSCGGSAITALDHQALCLKHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDIL
Ga0066686_111193022	3300005446	Soil	LAVELNQRGLGGMDTLHRKRNCTVASCGRAAITSLGQQVLCVNHFLLRCYEKLDGLDPRGRKFTGEPIDLAAMRGFIEECSRKALDVSLQCEELSNLER
Ga0066682_109361872	3300005450	Soil	LAVELNQRGLAGMDTMHRKRNCSVASCGRAAITSLGQQILCVNHFLLRCYEKLDGLDPRGRNFTSEPLDLAAMRGFIEEC
Ga0066682_109639861	3300005450	Soil	MGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWASELFLLLRAPRLTLAQ
Ga0066661_102780572	3300005554	Soil	VGLGEMAAMLEKRNCRMGSCSRSAITTLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRLL
Ga0066692_107248512	3300005555	Soil	MGLGGMDTMLGKRNCRVDSCSRAAITSLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAGE
Ga0066707_106211071	3300005556	Soil	MLGKRNCRMGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLW
Ga0066698_101629461	3300005558	Soil	VELNQRGLGGMDTLHRKRNCTVASCGRAAITSLGQQVLCVNHFLLRCYEKLDGLDPRGRKFTGEPIDLAAMRGFIEECS
Ga0066903_1031923312	3300005764	Tropical Forest Soil	MKNDTDRVIMGLGGVDTLLEKRTCTADSCDGAATTTLDHYPLCLNHFLLCCYERLEELDPRGRQFSDGRVDVISMRAFIEECSRKALDISLQSESLSNLQRARLL
Ga0066656_106458341	3300006034	Soil	MKLAVELNQRGLGGMDTLHRKRNCTVASCGRAAITSLGQQVLCVNHFLLRCYEKLDGLDPRGRKFTGEPIDLAAMRGFIEECSRKALDVSLQCE
Ga0075015_1009334841	3300006102	Watersheds	MGLGGMDTLHGKRNCKVGSCGSAAITTLDRQALCLNHFLLRCYERLEGLDPRGRKFTAEPVDLASMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDIL
Ga0070765_1004163383	3300006176	Soil	MHRERNCSVASCSGAPVTQLDHQDFCLNHFLLRCYDKLEGLDPRGRRFCSETLDAPAMRAFIEECSRKALDVSLHSDDLTNLQRGRLLDILLWAGELFLLLR
Ga0075426_106046691	3300006903	Populus Rhizosphere	VGLGEMFTMLEKRNCRMGSCGSAAITTLDRQAFCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLIAMRAFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAGELF
Ga0099791_106315972	3300007255	Vadose Zone Soil	MGSCGSAAITTLDRQALCLNHFLLRCYEKLESLDPRGRKFSAEPIDLASMRAFIEECSRKALDVSLQSKNLSNL
Ga0099794_101075681	3300007265	Vadose Zone Soil	MDTMLGKRNCRVRSCSSAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKCSAEPLEVVSMRAFIEECSRKALDVSLQSENLTNLQR
Ga0099794_104236831	3300007265	Vadose Zone Soil	MDSMLGKRNCTVGSCNSAATITLDHQALCLNHFLIRCYERLDGLDPRGRKFSAERIDMVSMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDILLWGGELFLL
Ga0066710_1000187961	3300009012	Grasslands Soil	MLGKRNCRVGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWAGELFLLLRA
Ga0066710_1007557083	3300009012	Grasslands Soil	VGLGEMVAMLEKRNCRMGSCSRSAITTLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRLL
Ga0099829_100739041	3300009038	Vadose Zone Soil	MGLGGMDTMLGKRNCRVGSCSRAAITSLDRQALCLNHFLVRCYERLESDDPRGRKFSGEPINLASMRAFIEECSRKALDVSLQSEGLSNLQRGRLLDILLWAGELFLL
Ga0099828_102989573	3300009089	Vadose Zone Soil	MDTMLGKRNCRVRSCSSAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKCSAEPLEVVSMRAFIEECSRKALDVSLQSENLTNLQRGRLLDILL
Ga0099828_111875561	3300009089	Vadose Zone Soil	MDTMHRKRNCSVGSCGRATITSLGQQILCVNHFLLRCYEKLDGLDPRGRKFTSEPIDLAAMRGFIEECSRKALDVSLQCEEL
Ga0099828_113311782	3300009089	Vadose Zone Soil	MDTMHRKRNCSVGSCGRAAITSLGQQILCVNHFLLRCYEKLDGLDPRGRKFTSEPIDLAAMRGFIEECSRKALDVSLQCEE
Ga0099827_118241221	3300009090	Vadose Zone Soil	MKLAVELNQGGLAGMDTKHRKRNCSVGSCGRAAITSLGQQVLCVNHFLLRCYEKLDGLDPRGRKFTGEPIDLAAMRGFIEECSRKALDVSLQCEELSNLERGRLLDILLWAGELF
Ga0099792_101763711	3300009143	Vadose Zone Soil	LKKDPKKSETGRTVKSMGLGGMDTKVGKRSCWVGSCGSAAITTLDRQALCLSHFLMRCYERLDGLDPRGRKFSAEPVDVVSMRAFIEECSRRALDVSLQSQNLSNLQRGRL
Ga0099792_112669161	3300009143	Vadose Zone Soil	MGLGGMDTMLGKRDCSMGSCGSAAITTLDRQTLCLNHFLLRCYEKLEGLDPRGRKFSAEPMDLVSMRAFVEECSRKALDVSLQSENLSNLQRGRL
Ga0134070_102070622	3300010301	Grasslands Soil	MKLAVELNQRGLAGMDTLHRKRNCTVASCGRAAITSLGQQVLCVNHFLLRCYEKLDGLDPRGRKFTGEPIDLAAMRGFIEECSRKALDVSLQ
Ga0134064_101565442	3300010325	Grasslands Soil	MDPKENETVHRGESVGLGEMAAMLEKRNCRMGSCSRSAITTLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRLL
Ga0134071_105072511	3300010336	Grasslands Soil	MDPKENDTVRRGKSVGLGEMVAMLEKRNCRMGSCSRSAITTLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQR
Ga0150983_130635052	3300011120	Forest Soil	MASMHRERICSVASCSGAPITELDHQDLCLNHFLLHCYDKLEGLDPRGRRFCSETLDAPAMRAFIEECSRKALDVSLHSDELTNLQRGRLLD
Ga0150983_137808272	3300011120	Forest Soil	MGLGGMDAKYRERNCAVDSCAGAAITALDHQDLCLNHFLLRCYERLESLDPRGRRFCAEPLDAAAMRAFIEECSRKALDVSLHSKNLSNLERGRLLDILLWA
Ga0150983_154446021	3300011120	Forest Soil	VASCGGAAITSLDHQALCLNHFLLGCYEKLEGLDPRGRRFSAQPVDLVYMRGFIEECSRKALDISLHSQSLTNLQRARLLDILLWAGELFLLLRAPRL
Ga0137392_103025193	3300011269	Vadose Zone Soil	MGLGGVDTMSGKRNCGVGSCSGAAIATLDRQALCLNHFLLRCYAQLERLDPRGGKSSAEPVDLAAMRAFIEDCSRKALDISLQSKNLSNLQRGRLLDILLWAGELFLLL
Ga0137392_104774381	3300011269	Vadose Zone Soil	MDTLRNKRNCRVDSCGVAATTALDREALCLNHFLLRCYEELERLDPRGRKFSAELVDVVSMRAFIEECSRKALDISLQSKTLSNLQRGRLLDILLWAGELFLFLRAPR
Ga0137392_109066262	3300011269	Vadose Zone Soil	MDTMRRKRNCSVRSCGRAAITSLGQQILCVNHFLLRCYQKLDGLDPRGRKFTAEPLDLAAMRGFIEECSRKALDVSLQCEELSNLERGRLLDIL
Ga0137392_109416182	3300011269	Vadose Zone Soil	MGTMLGKRNCRVGSCGGAAITTLDHHALCLNHFLLRCYEKLEGLDPRGRRFSPEAVDLVYMRAFIEECSRKALDISLQSQSLTNLQRGRLLDILLWAGE
Ga0137392_113274881	3300011269	Vadose Zone Soil	VGSCSGAAITTLDHQALCLNHFLLRCYDKLERLDPRGRRSSAEPADLASMRAFIEECSRKALDVSLQSKNLSNLQRGRLLDILLWAGELFLLLRAPRPA
Ga0137392_114439792	3300011269	Vadose Zone Soil	MDTLRRKRNCGVGSCSGAAITSLDRQALCLNHFLHRCYEKLDGFDPRGRKFSDGPVDLAAMRAFVEECSRKALDVSLRSEDLSNL
Ga0137391_100870015	3300011270	Vadose Zone Soil	MDTMHRKRNCSVGSCGRAAITSLGQQILCVNHFLLRCYEKLDGLDPRGRKFTSEPIDLAAMRGFIEECSRKALDVSLQCEELS
Ga0137391_107131022	3300011270	Vadose Zone Soil	MGSCVSAAVTTLDHQAFCLNHFLLRCYAELEGFDPRGRQFSSETVDLVSMRAFIEECSRKALDISLQSENLTNLQRGRLLDILLWAG
Ga0137391_107808811	3300011270	Vadose Zone Soil	MDTMRRKRNCSVRSCGRAAITSLGQQILCVNHFLLRCYQKLDGLDPRGRKFTAEPLDLAAMRGFIEECSRKALDVSLQCE
Ga0137393_101883971	3300011271	Vadose Zone Soil	MDLGGMDTMLANRNCRVGSCGGAAVATLDHQALCLSHFLSRCYEKLEGFDPRGRKFSAAEPVDLTSMRAFIEECSRKALDISLHSQSLTNLQRGR
Ga0137393_105827992	3300011271	Vadose Zone Soil	MGLGGMDTKLEKRDCSVGSCGSAAITTLDRQTLCLNHFLLRCYEKLEGLDPRGRKFSAEPLDLVSMRAFVEECSRKALDVSLQSENLSNLQRGRLLDILLW
Ga0137393_108184181	3300011271	Vadose Zone Soil	MDTMLGKRNCSVGSCASAAITALDHQDLCLNHFLLCCYERLEGVDPRGRKSSAEPLDLISTRAFIEECSRKALDVSLQSENLSNLQRGRLLDIL
Ga0137388_101934441	3300012189	Vadose Zone Soil	LDHQALCLNHFLSRCYEKLEGLDPRGRKFSAEPVDMASMRSFIEECSRKALDISLQSRNLTNLQRGRLL
Ga0137364_100171853	3300012198	Vadose Zone Soil	MKKDPKKSETGRRVKTMGLAGMDTKVGKRSCWVGSCGSAAITTLGHQSLCLNHFLIRCYERLDGLDPRARKFSAEPVDVVSMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDILL
Ga0137364_100257073	3300012198	Vadose Zone Soil	MMKPFGSHRASNTLKKDPKKGETGRRVKTMGLAGMDTKVGKRSCWVGSCGSGAITTLGHQSLCLNHFLIRCYERLDGLDPRGRKFSAEPVDVVSMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDILL
Ga0137383_101122391	3300012199	Vadose Zone Soil	MDTMLGKRNCRVGSCSRAAITSLDRQALCLNHFLLRCYEGLESLDPRGRKFSIEPINLASMRAFIEECSRKALDVSLHSEDLSNLQRGRLL
Ga0137399_112302872	3300012203	Vadose Zone Soil	MDTMLGKRDCSVGSCGSAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKFSAESVDLVSMRAFVEKCSRKALDVSLQSESLSNLQRGRLLDILLW
Ga0137362_108618852	3300012205	Vadose Zone Soil	MGSCSRAAITTLDHQAFCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAG
Ga0137381_112158601	3300012207	Vadose Zone Soil	MGLAGMDTKVGKRSCRVGSCGSAAITTLDHRALCLNHFLSRCYEKLEKLEPRGRKFSAAEPVDPASMRAFIEECSRKALDISLHSQSLTNLQRGR
Ga0137376_106544842	3300012208	Vadose Zone Soil	MGLAGMDTKVGKRSCWAGSCGSAAITTLGHQSLCLNHFLMRCYERLDGLDPRGRKFSAEPVDVVSMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDILLWAGE
Ga0137379_108059721	3300012209	Vadose Zone Soil	MRLAVELNQGGLAGMDTLHRKRNCSVGSCGRAAITSLGQQILCVNHFLLRCYEKLDGLDPRGRKFTSEPIDLAAMRGFIEECSRKALDVS
Ga0137377_103249501	3300012211	Vadose Zone Soil	MGLAGMDTKVGKRSCRVGSCGSAAITTLDHQALCLNHFLMHCYERLDGLDPRGRKFSAEPIDVVSMRAFIEECSRKALDVSLQSQNLSN
Ga0137387_101558621	3300012349	Vadose Zone Soil	MDTMRRKRNCSVGSCGRAAITSLGQQILCVNHFLLRCYQKLDGLDPRGRKFTAEPLDLAAMRGFIEECSRKALDVSLQCEEL
Ga0137386_107750742	3300012351	Vadose Zone Soil	MAAMLEKRNCSMGSCSRAAITTLDRQAFCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRDFIEDCSRKALDVSLRSENLSNLQRGRLLDILLWAGELFLLLRAPRL
Ga0137386_112119131	3300012351	Vadose Zone Soil	MVTMLGKRKCQAGSCSGTAITSLDRQTLCLNHFLLRCYEKLEGLDPRGRKFSAEPIDLVSMRAFVEECSRKALDVSLQSENLSN
Ga0137371_112873442	3300012356	Vadose Zone Soil	MDTMRRKRNCSVGSCGRAAITSLGQQILCVNHFLLRCYEKLDGLDPRGRKFTGEPIDLAAMRGFIEEC
Ga0137358_106231792	3300012582	Vadose Zone Soil	MGLAGMDLKVGKRSCWEVSCGRAAITTLDHQSLCLNHFLMRCYETLDSLDPRGGKFSTEPIDVVSMRAFIEECSRKALDVSLQSQTLTNLQRGRLLDIL
Ga0137398_103297541	3300012683	Vadose Zone Soil	MDTMLRKRDCSVGSCGSAAITTLDHQPLCLNHFLLRCYERLEGLDPRGRKFSAEPLDLVSMRAFVEECSRKALDVSLQSENLSNLQRGRLLDILLWAGELFL
Ga0137398_104849941	3300012683	Vadose Zone Soil	MGLGGMDTMLASRNCRVGSCGSAAITTLDHRALCLNHFLSRCYEKLEKLEPRGRKFSAAEPVDLASMRAFIEECSRKALDISLHSQSLTNLQRGRLLD
Ga0137396_101655063	3300012918	Vadose Zone Soil	MGLAGMDPKVGKRSCWEVSCGRAAITALDHQSLCLNHFLMRCYETLDSLDPRGRKFSTEPIDVVSMRAFIEECSRKALDVSL
Ga0137396_108513681	3300012918	Vadose Zone Soil	MDTMLGKRNCRVRSCSSAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKCSAEPLEVVSMRAFIEECSRKALDVSLQSEHLTNLPRGRLLDILLWAGELFLLLRAPRL
Ga0137359_116149552	3300012923	Vadose Zone Soil	MKLAVELNQGGLAGMDTMHRKRNCSVGSCGRATITSLGQQILCVNHFLLRCYEKLDGLDPRGRKFTSEPIDLAAMRGFIEECSRKALDVSLQCEELSNLERGRLLDILLWA
Ga0137419_100102256	3300012925	Vadose Zone Soil	MGSCGSAAVTTLDHQAFCLNHFLLRCYAELEGLDPRGRQFRSETVDLVSMRAFIEECSRKALEISLQSENLTNLQRGRLLDILLWAGEL
Ga0137419_103361371	3300012925	Vadose Zone Soil	MGSCSGAAVATLDHQALCLNHFLSRCYEKLEKLEPRGRKFSAAEPLDLTSMRVFIEECSRKALDISLHSQTLTNLQRGRLLDILLW
Ga0137419_106133331	3300012925	Vadose Zone Soil	MGLGGMVTKVGKRSCRVGSCGGAAITTLDHQALCLDHFLMSCYERLDALDPRGRKFSAEPIDVVSMRAFIEECSRKALDVSLQYQNLSTLQRGR*
Ga0137416_106005843	3300012927	Vadose Zone Soil	MGLAGMDPKVGKRSCWEVSCGSAAITTLDHQSLCLNHFLMRCYETLDSLDPRGRKFTTEPIDVVSMRAFIEECSRKALDVSLQSQ
Ga0137416_109638812	3300012927	Vadose Zone Soil	MDTMLGERNCSVGSCASAAITALDHQNLCLNHFLLCCYERLEGVDPRGRKSSAEPLDLISTRAFIEECSRKALDVSLQSENLSNLQRGRLLD
Ga0137416_115219991	3300012927	Vadose Zone Soil	MDTMLGKRNCRMGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPIDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWAGELFL
Ga0137416_116674261	3300012927	Vadose Zone Soil	MGSCSRAAITTLDHQAFCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRGFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAGELFLLL
Ga0134081_100117101	3300014150	Grasslands Soil	MRLFVEVNQWAWGGMVAMLGKRNCRVGSCSGAAITTLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVAIRAFVEECSRKALDVSLQSENLSNLQRGRLLDILLWAGELFLL
Ga0134078_100104504	3300014157	Grasslands Soil	MGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSDEPVDVLSMRAFVEECSRKALDVSLQSKNLNNLERGRLLYFALGR*
Ga0137420_14864576	3300015054	Vadose Zone Soil	MGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPIDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWAG
Ga0066667_107429351	3300018433	Grasslands Soil	MGLAGMDTKVGKRSCWVGSCGSAGVTKLGHQSLCLNHFLIRCYERLDGLDPRGRKFSAEPVDVVSMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDILLWAGELF
Ga0179592_101667631	3300020199	Vadose Zone Soil	MGLGGMDALLGKRNCKVGSCGSAPVTSLDRKALCLNHFLQRCYERLEGLDPRGRKFTAEPVDLAARRAFIEECSRKALDVSLQSQNLSNLQRGRLLDILLWAGELFLL
Ga0179592_104304551	3300020199	Vadose Zone Soil	MLGKRNCRMGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWAGELL
Ga0179596_106901631	3300021086	Vadose Zone Soil	MDTLRNKRNCRVDSCGVAATTALDREALCLNHFLLRCYEELERLDPRGRKFSAELVDVVSMRAFIEECSRKALDISLQSKILSNLQRGRLLDILLW
Ga0210396_100103441	3300021180	Soil	MHRERNCSVASCSGAPVTQLDHQDFCLNHFLLRCYDKLEGLDPRGRRFCSETLDAPAMRAFIEECSRKALDVSLHSDDLTNLQRGRLLDILLW
Ga0179585_10235181	3300021307	Vadose Zone Soil	MGLGGMDTMLGKRDCSAGSCGSAAITTLDHQALCLNHFLLRCYERLEGLDPWGRKFSAEPLDLVSMRAFVEECSRKALDVSLQSENLSNLQRGRLLDI
Ga0210384_104165651	3300021432	Soil	MGLCGMDMKYRERNCDINPCASAAITALDHQDLCLNHFLLRCYERLESLDPRGRQFCAEPLDAAAMRAFIEECSRKALDVSLHSENLSNLERGRLLDILL
Ga0222728_10362191	3300022508	Soil	MGLGGMDAKYQERNCAVDSCAGAAITALDHQDLCLNHFLLRCYERLESLDPRGRRFCTEPLDAAAMRAFIEECSRKALDVSLHSKNLSNLERGRLLDIL
Ga0242669_10181762	3300022528	Soil	MHRERNCSVASCSGAPVTQLDHQDFCLNHFLLRCYDKLEGLDPRGRRFCSEALDAPAMRAFIEECSRKALDVSLHSDDLTNLQRGRLLDILL
Ga0242662_102428281	3300022533	Soil	MGLGGMDTKQRERNCFVDSCSSAPVTSLGQQDLCLNHFLSRCYEKLERLDPRGRRFCSETLDAAAMRAFIEECSRKALDVSLHSKDLSNLER
Ga0242666_10355292	3300022721	Soil	MASMHRERICSVASCSGAPITELDHQDLCLNHFLLHCYDKLEGLDPRGRRFCSETLDAPAMRAFIEECSRKALDVSLHSDDLTNLQRGRLLDILLWAGE
Ga0242654_100718011	3300022726	Soil	MGLGGMDTLRGKRNCTVGSCSSAAVTSLDRKALCLNHFLQRCYERLEGLDPRGRKFSAEPVDLASMRAFIEECSRKALDVSLQSQSLSNLQRGRLLDILLWAGELFL
Ga0137417_10924781	3300024330	Vadose Zone Soil	MLGKRDCSVRSCGSAAITTLDRQTLCLNHFLLRCYERLEGLDPRGRKFSAEPLDLVSMRAFVEECSRKALDVSLQSENLSNLQR
Ga0137417_11228521	3300024330	Vadose Zone Soil	MGLGGMDTMLGKRDCSAGSCGSAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKFSAEPLDLVSMRAFVEKCSRKALDVSLQSESLSNLQRGRLLDILLLGR
Ga0209155_11147981	3300026316	Soil	MDTMLRKRNCRMGSCGGSAITALDHQALCLKHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNL
Ga0209154_10452381	3300026317	Soil	VGLGEMAAMLEKRNCRMGSCSRSAITTLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDVSLQSENLSNLQRGRL
Ga0209158_10349091	3300026333	Soil	MGLAGMDTKVGKRSCWAGSCGSAAITTLGHQSLCLNHFLMCCYERLDGLDPRGRKFSAEPVDVVSMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDIL
Ga0209804_11422531	3300026335	Soil	MLAKRNCRMGSCGGSAITALDHQALCLKHFLLCCYERLEGLDPRGRKFSAEPVDVPSMRAFVEECSRKALDVSLQSKNLNNLERGRLL
Ga0209157_11019741	3300026537	Soil	VELNQRGLGGMDTLHRKRNCTVASCGRAAITSLGQQVLCVNHFLLRCYEKLDGLDPRGRKFTGEPIDLAAMRGFIEECSRKALDVSLQCEELSNLER
Ga0209648_100151478	3300026551	Grasslands Soil	MGLGGMDPMLGKRNCWVDSCGGVAITTLDHQALCLNHFLLRCYEKLEGLDPRGRQLSAKPVDLASMRAFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAGELF
Ga0209648_101542281	3300026551	Grasslands Soil	MSRKRNCGVGSCAGAAITALDRQALCLNHFLLRCYERLEGLDPRCQKFAIEPVDVAAMRAFVEECSRKALDVSLQSKNLDNLQRGR
Ga0209648_104132251	3300026551	Grasslands Soil	MDTMLGKRNCRVRSCSSAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKCSAEPLDVVSMRAFIEECSRKTLDVSLQSENLTNLQRGRLLDILLWAGELFLLLRAPR
Ga0209648_105895402	3300026551	Grasslands Soil	MGRRVKSRGLGGMDPLRNKRNCRVDSCGVAATTALDREALCLNHFLLRCYEELERLDPRGRKFSADLVDVVSMRAFIEECSRKALDISLQSKTLSNL
Ga0209648_108457742	3300026551	Grasslands Soil	MLGKRKCQVGSCGGAAIASLDRQALCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRAFIEECSRKALDISLQSQNLSNLQRGRLLDILL
Ga0179587_105613892	3300026557	Vadose Zone Soil	MDTMLGKRDCSMGSCGSAAITTLDHQPLCLNHFLLRCYEKLEGLDPRGRKFNPEPMDLVSMRAFVEECSRKALDVSLQSENLSNLQ
Ga0209733_11736861	3300027591	Forest Soil	MGLVGVDTMHGERNCRVGSCDGAAITALDHQALCLNHFLLRCYEKLEGLDPRGRRFSAEPVDLASMRAFIEECSRKALDISLQSQSLTNLQRGRLLDI
Ga0209588_10248401	3300027671	Vadose Zone Soil	MLGERNCSVGSCASAAITALDHQDLCLNHFLLCCYERLEGVDPRGRKSSAEPLDLISTRAFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAGELFLLLRAP
Ga0209588_12477852	3300027671	Vadose Zone Soil	MGLGGMDTMLANRNCRMGSCSGAAVATLDHQALCLNHFLSRCYEKLENLEPRGRKFSSEPVDIASMRAFIEECSRKALDISLHSQSLTNL
Ga0209180_103141373	3300027846	Vadose Zone Soil	MGLGGVDTMSGKRNCGVGSCSGAAIATLDRQALCLNHFLLRCYAQLERLDPRGGKSSAEPVDLAAMRAFIEDCSRKALDISLQSKN
Ga0209701_104590402	3300027862	Vadose Zone Soil	MDTMLGKRNCRVRSCSSAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKCSAEPLEVVSTRAFIEECSRKALDVSLQSENLTNLQRGRLLDILL
Ga0209283_100675731	3300027875	Vadose Zone Soil	MDTMLGKRKCRVGSCSGAAITTLDHQALCLNHFLLRCYERLEGLDPRGRKCSAEPLEVVSMRAFIEECSRKALDVSLQSENLTNLQRGRLLDILLWAGELFLLL
Ga0209590_109529242	3300027882	Vadose Zone Soil	MLGKRKCQAGSCSGTAITSLDRRTFCLNHFLLRCYEKLEGFDPRGRKFSAEPIDLVSMRAFVEECSRKALDVSLQSENLSNLQRGRLLDILLW
Ga0209068_101626742	3300027894	Watersheds	MLGERNCRVGSCGGAAIITLDHQALCLNHFLVRCYERLEGLDPRGRKFSAEPVDVASMRAFIEECSRKALDVSLQSENLSNLQRGRLLDILLWAGELLLLL
Ga0137415_101985553	3300028536	Vadose Zone Soil	MLGERNCSVGSCASAAITALDHQNLCLNHFLLCCYERLEGVDPRGRKSSAEPLDLISTRAFIEECSRKALDVSLQSENLSNLQRGRL
Ga0137415_105459813	3300028536	Vadose Zone Soil	MGLAGMDTKVGKRSCWMGSCGSAAIATLSHQSLCLNHSLMRRYERLDGLDPRGRKFSAEPIDVVSMRAFIEECSRKALDVSLQSQNL
Ga0137415_113363482	3300028536	Vadose Zone Soil	MLGKRNCRMGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPVDVVSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWAGELFL
Ga0137415_113469281	3300028536	Vadose Zone Soil	MVAMLEKRNCRMGSCSRAAITTLDHQAFCLNHFLLRCYEKLEGFDPRGRKFSAEPVDLVSMRGFIEECSRKALDVSLQSENLSNLQRG
Ga0307484_1038452	3300031663	Hardwood Forest Soil	MDTMRGKRNCGVGSCSSAAITTLDRQALCLNHFLSRCYEKLEHVDPRGRRFTAEPIDPASMRAFIEECSRKALDASLHSENLSNLQRGRLLDILLWAGELFVL
Ga0307469_101688193	3300031720	Hardwood Forest Soil	MLGKRNCGVDSCRRSTITTLDRQSLCLNHFLLCCYDRLEGLDPRGRKFSDESVDLASMRAFIEECSRKALDVSLQSQNLSNLQRGRLLDI
Ga0307477_107837252	3300031753	Hardwood Forest Soil	MLGKRNCDVDACGGAAITSLDQQALCLNHFLPRCYEKLEALDPRGRKLSAETVDLAAMRAFVEECSRKALDVSLQSGNLNNLQRGRLLDILLWAGELFLW
Ga0307475_100106467	3300031754	Hardwood Forest Soil	MGLGTMDMLRGKRNCRVGSCGGAAITSMDRQALCVNHFVSRCYEELDRVDPRGRKFTAEPVDVASMRAFIEECSRKALDISLQSQALSNLQRGRLLDILLWAGELFLLL
Ga0307475_105633072	3300031754	Hardwood Forest Soil	MGLGGMDTLLGKRNCTVGSCSSAEVASLDRKALCLNHFLQRCYERLEGLDPRGRKFTAEPVDLASMRAFIEECSRKALDVSLQSRNLSNLQRGRL
Ga0307471_1000273226	3300032180	Hardwood Forest Soil	MKNELKIDELCRVVQMGPGQMDTKLLERNCGARSCHGAAITSLDQQALCLNHFLLRCYERLEALDPRCRKLRAEPLDLASMRAFIEECSRKALDVSLH
Ga0307472_1015818231	3300032205	Hardwood Forest Soil	MLGKRNCRVGSCGGSAITALDHQALCLNHFLLCCYERLEGLDPRGRKFSAEPVDVVSMRAFVEECSRKALDVSLQSKNLNNLERGRLLDILLWAGELF
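When this table is copied from the web page as plain text, the four columns can run together with no delimiter (for example `Ga0075015_10093348413300006102WatershedsMGLGG...`). The following is a minimal sketch for splitting such a row back into its fields. It rests on assumptions drawn only from the rows on this page: the Sample Taxon ID is always exactly 10 digits, habitat names consist of capitalised words separated by single spaces, and sequences contain only uppercase letters and an optional `*`. The names `FamilyRow` and `parse_row` are illustrative.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Lazy protein ID, then a 10-digit taxon ID that must be followed directly by a
# capitalised habitat name, then an all-uppercase sequence to the end of the line.
ROW_RE = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>\d{10})"
    r"(?P<habitat>[A-Z][a-z]+(?: [A-Z][a-z]+)*)"
    r"(?P<sequence>[A-Z*]+)$"
)


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one run-together table row; return None if it does not fit the pattern."""
    match = ROW_RE.match(line.strip())
    return FamilyRow(**match.groupdict()) if match else None


raw = (
    "Ga0099791_1063159723300007255Vadose Zone Soil"
    "MGSCGSAAITTLDRQALCLNHFLLRCYEKLESLDPRGRKFSAEPIDLASMRAFIEECSRKALDVSLQSKNLSNL"
)
row = parse_row(raw)
print(row.protein_id, row.taxon_id, row.habitat, len(row.sequence))
```

Rows whose habitat name contains digits, hyphens or lowercase-initial words would not match and would need a looser habitat pattern.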




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice