NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F097977

Metagenome Family F097977

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F097977
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 147 residues
Representative Sequence MSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLASVPDRNSALFDFSLKLAKHGPWVSGRDVLTLKDSGFDERTILEAIVTTGVG
Number of Associated Samples 87
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 32.04 %
% of genes near scaffold ends (potentially truncated) 99.04 %
% of genes from short scaffolds (< 2000 bps) 91.35 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score0.31

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (75.962 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(25.961 % of family members)
Environment Ontology (ENVO) Unclassified
(30.769 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(33.654 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.168.170.172.174.176.178.180.182.184.186.188.190.192.194.196.198.200.202.204.206.208.210.212.214.216.218.220.222.224.226.228.230.232.234.236.238.240.242.244.246.248.250.252.254.256.258.260.
1JGI12270J11330_102518521
2Ga0066690_106802082
3Ga0070708_1003228953
4Ga0075024_1003225181
5Ga0075028_1002698981
6Ga0075029_1000847791
7Ga0075029_1011033081
8Ga0075017_1000293011
9Ga0075017_1005003272
10Ga0075019_103517682
11Ga0075030_1001669744
12Ga0075030_1014687222
13Ga0075018_103467141
14Ga0075018_104479491
15Ga0070716_1011894821
16Ga0075014_1004991782
17Ga0075014_1009246201
18Ga0066710_1012773011
19Ga0099829_106466062
20Ga0099830_111180561
21Ga0099792_112690591
22Ga0105242_101289754
23Ga0116219_106065062
24Ga0074045_107246911
25Ga0074044_103604822
26Ga0126383_128354811
27Ga0137392_105401311
28Ga0137392_105676352
29Ga0137391_110312542
30Ga0137393_110228641
31Ga0137363_104339261
32Ga0137363_107427292
33Ga0137363_110570461
34Ga0137380_102447701
35Ga0137381_101956231
36Ga0137379_106910892
37Ga0137378_106829692
38Ga0137378_107710491
39Ga0137378_110467881
40Ga0137386_110072101
41Ga0137390_108975582
42Ga0137390_110196181
43Ga0137397_100481121
44Ga0137395_100898341
45Ga0137394_108011811
46Ga0137419_113992551
47Ga0137416_113099542
48Ga0137404_102966341
49Ga0137410_109046311
50Ga0182019_107347151
51Ga0132255_1036269891
52Ga0187802_101943841
53Ga0187802_102580792
54Ga0187819_104632692
55Ga0187817_102254301
56Ga0187817_106097922
57Ga0187816_100945001
58Ga0187816_101167402
59Ga0187816_104247051
60Ga0187816_104940011
61Ga0187804_102934792
62Ga0187869_102465991
63Ga0179592_102653681
64Ga0207699_113618901
65Ga0207664_107686451
66Ga0207686_106676271
67Ga0207665_110428741
68Ga0209804_12758741
69Ga0207781_10078222
70Ga0207858_10304741
71Ga0207783_10305981
72Ga0207824_10163541
73Ga0207803_10188381
74Ga0207815_10341052
75Ga0207819_10352451
76Ga0207855_10380551
77Ga0207726_10424612
78Ga0207762_10132822
79Ga0207777_10177641
80Ga0207761_10179771
81Ga0208043_10640381
82Ga0208324_10124164
83Ga0208827_12096401
84Ga0207826_11181612
85Ga0207862_11608321
86Ga0209517_105109191
87Ga0209068_101910802
88Ga0209067_101616201
89Ga0209583_101231481
90Ga0209698_106437711
91Ga0209698_106567732
92Ga0302166_100330332
93Ga0302203_11216411
94Ga0302163_100530012
95Ga0302272_10499482
96Ga0302207_10165355
97Ga0310037_102576652
98Ga0310039_101858792
99Ga0302297_10821991
100Ga0310909_115704091
101Ga0306920_1028735062
102Ga0335079_100922731
103Ga0335076_100825831
104Ga0335077_119708441
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 59.39%    β-sheet: 0.00%    Coil/Unstructured: 40.61%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

20406080100120MSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLASVPDRNSALFDFSLKLAKHGPWVSGRDVLTLKDSGFDERTILEAIVTTGVGSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.31
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
76.0%24.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Freshwater Sediment
Watersheds
Vadose Zone Soil
Tropical Forest Soil
Peatlands Soil
Soil
Grasslands Soil
Soil
Soil
Soil
Bog Forest Soil
Fen
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Fen
Bog
Arabidopsis Rhizosphere
Miscanthus Rhizosphere
9.6%17.3%26.0%7.7%2.9%13.5%3.8%4.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12270J11330_1025185213300000567Peatlands SoilMSQPATISNDPARSYPQLVAAYGFLPNLFRAQSAIPQAIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRVSDYCRALFGHSLPTMPDHSSALLDFSLKLATHGPWISGSDLLRLKSSGFDEKAVLEAIVTTGIGVMLCTVADGLRPLLDPELSSPA
Ga0066690_1068020823300005177SoilMSQFATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAQQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSSVVPGNSALFDFSLKLAKHGPWVSG
Ga0070708_10032289533300005445Corn, Switchgrass And Miscanthus RhizosphereMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRDQKDAILNGVATVRGSDYCRALFGHSLAAVPDRNSALFDFSVKLAKHGPWVSGRDVLTLKNSLFDEK
Ga0075024_10032251813300006047WatershedsMSQSATISNDPPKSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQRLIDTVVVRQGRLSRDQKDAILDGVATVRASDYCRALFGHSLGTVPDRNSALFDFCFKLAKHGPWVSGRDVLTLKDSGFDERAILEAIVTTGVGLMLCTLADGLRPRLDTELRSPAPAELLNVPDPLDWPKAS
Ga0075028_10026989813300006050WatershedsMSQSSTISNDPPGSYPQLIAVYGFLPNLFRVQSAIPHAIEAEQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSTAVPDRNSALFDFSLKLAKHSPWVSERDVLTLKNSGFDERAI
Ga0075029_10008477913300006052WatershedsMSQPATISNDPAGSCPQLVAAYGFLPNLFRAQSALPDAIEAEQRLIDAVVVRHNGLTRNQKGAILNGVATVRGSDYCRALFGHSFAAEPDHSSALLDFSLKLAKHGPWVAKSDLLR
Ga0075029_10110330813300006052WatershedsMNQSATISNDPARSYPQLVAAYGFLPNLFRAQSAIPHAIEAEQGFIDSVLVRQSRLSRNQKDAILSGVATVRGSDYCRALFGRSFATMPGNSSALLDFSLKLAKHGPWVSESDLLRLKSSKFDGKAI
Ga0075017_10002930113300006059WatershedsMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRDQKDAILNGVATVRGSDYCRALFGHSFAAVPDRNSALFDFSVKLAKHGPWVSGRDVLTLKNSLFDEKAILEAIVTTGVGLMLCTLAEGLRP
Ga0075017_10050032723300006059WatershedsMSQPATISNDPPGSYAQLVAAYGFLPNLFRVQSAIPHAVEAEQRLVDTVVVRQSRLSRNQKDTILNGVATVLGSDYCRALFGHSLAAVPDHSSALLDFSLKLAKHGPWVSESDHLRLKSSGFDEKAVLEAIVTTGIGVMFCTVADGLRPLLDAELRLPAPAELLNVPEPLDWPKPSG
Ga0075019_1035176823300006086WatershedsMSQSATIFNDPPRSYPQLIATYGFLPNLFRAQSTIPRAIEAEERLIDTVVVCQGRLSRGHKDAILNGVATVRGSDYCRALFGHSLAAVPDRNFALFDFSLKLAKHGPWVSGRDVLTLKNSGFDERAILEAIVTTGVGLMLCTLADGLRPRLDTE
Ga0075030_10016697443300006162WatershedsMSKSATISNDPTRSYPHLVAAYGFLPNLFRVQSVVPHAVEGEQRLVDTVLVRQGRLSKNQKDAILNGVATVRGNDYCRALFGHSLASVPEHSSALLDFSLKLAKHCPWVSGSDILTLKNSGFDEK
Ga0075030_10146872223300006162WatershedsMNQSATISNDPARSYPQLVAAYGFLPNLFRAQSAIPHAIEAEQGFIDSVLVRQSRLSRNQKDAILSGVATVRGSDYCRALFGRSFATMPGHSSALLDFSLKLAKHGPWVSESDLLRLKSSKFDGKAI
Ga0075018_1034671413300006172WatershedsMSQSATISNDPPKSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLAAVSDRNSALFDFALKLAKHGPWVSGHDLLSLKNSLFDEKAI
Ga0075018_1044794913300006172WatershedsMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQTAIPRAIEAEQGLIDTVVVRQGRLSRNQKDVILNGVATVRGSDYCRALFGHSLGAVADRNSALFYFSLRLAKHGPWISGH
Ga0070716_10118948213300006173Corn, Switchgrass And Miscanthus RhizosphereMTQPSTISNDLAGSHPQLVAVYGFLPNLFRVQSAVPPAIEAEQRLIDIVVVRQSRISRDQKDAILYGVATVRANDYCRALFGHALRAVPDRDSGLFDFCLKLAKHGPWVSERDVLTLKDSGFDERAILEAIVTTGVGLMLCTLADGLRPRLDTELRSPAPAELLNVPDPLDWPKASG
Ga0075014_10049917823300006174WatershedsMNQSSTTSNDLPKSHPQLVAVYGFLPNLFQVQSAIPPAIEAEQGLIDSVLVRQGRLSRNQKDAILNGVATVRGSDYCRALFGHSFAAVPDHNSVLLDFSLKLARHGPWVSERDILILKSSGFDEKAVLEAIVTTGIG
Ga0075014_10092462013300006174WatershedsMSQPATISNEPSRSYPQLIAAYGFLPNLFQAQSALPQAIEAEQQLINIVVVRQGRLSRSQKDVILNAVATVRGCDYCRALFGDSLAALPDRNSGLFDFTLKLTKHGPWVSERDILTLKHSGFDERAILEAIVTIGVGLMLCT
Ga0066710_10127730113300009012Grasslands SoilMSQFATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLGAVPDRNPALFDFCLKLAKHGPWVSG
Ga0099829_1064660623300009038Vadose Zone SoilMSQFATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAQQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSTVVPGNSALFDFSLKLAKHGPWVSGRDVLTLKNSGFDERAILEAIVTTGVGLMLCTLADGLRPRLDAELRSPAPSELLNVPEPLDWP
Ga0099830_1111805613300009088Vadose Zone SoilMSQFPTISSDPPRSYPQLVAVYGFLPNLFRAQSALPRAVEAEHQLIDTVVLRQGRLSRDQKDAILEGVAIVRDNDYCRALFGHALPAVPDCKSALFDFSLKLAKYGPWVSKHDVVALKNSGFDETAIQEAVVTTGIGHML
Ga0099792_1126905913300009143Vadose Zone SoilQSATISNDPPRSYPQLIAAYGFLPSLFRVQIAIPRAIEAEQRLIDTVVVRQGRLSRDQKDAILSGVATVRGSDYCRALFGHSRTAVPDRNSALFDFSLKLAKHGPWVSERDVLTLKNSGFDERAILEAIVTTGVGLMLCTLADGLRPQLDAELRSPASSGLLNVPEP
Ga0105242_1012897543300009176Miscanthus RhizosphereMSQPANTSDDPQGSYPQLVAAYGFLPNLFQAQSALPHVVDAEQRLVDTVLVRQGRLSRTQKDTILRGVATVRGSDYCRALFGQALHALPDRSSALFDFSLKLAKHGPWVSGSDVEALKASGFDERAILEAVVTTALGL
Ga0116219_1060650623300009824Peatlands SoilMSQPATISNDPARSYPQLVATYGFLPNVFRAQSAIPQAIEAEQRFIDTVVVRQNRLSRNQKDAILNGVATVRGSDYCRALFGHSLAAMPDHSFALLDFSLKLAKHGPWVSGSDVLTLKKSGFDERAVLEAIVT
Ga0074045_1072469113300010341Bog Forest SoilMSQPATISNDLPGSYPQLVAAYGFLPNLFRVQSAIPHAVEAEQRLIETVVVGQNSLSRNQKDAILNGVATVRGSDYCRALFGHSLAGASDHGSALHSALFDFSLKLAKHGPWVSKSDV
Ga0074044_1036048223300010343Bog Forest SoilMNQSSTTSNDLPTSHPLLVAVYGFLPNLFRVQSAIPHAIEAEQGLIDSVLVRQGRLGRNQKDAILNGVATVRGSDYCRALFGHSFAAVPDHNSVLLDFSLKLARHGPWVSESDLLILKSSGFDETAVLEVIVTAGIGVMFCTVADGLHPPLDTQLRSAPAEVLNVPEP
Ga0126383_1283548113300010398Tropical Forest SoilMNQSAIVLNDQPRSYPQLIATYGFVPNLFQAQRAMPQAIEAEQQLNTAVVREGKLSRNQKDMILNGVATVWGSDYCRALFGHSLVDAPDRNFALFDFSLKLAKHGPWVSGRDVLALRNSGVDERAILEAIVTTCLGLMFCTLADGLRPQLDTELGSPAASELLNVP
Ga0137392_1054013113300011269Vadose Zone SoilMSQFPTISSDPPRSYPQLVAVYGFLPNLFRAQSALPRAVEAEHQLIDTVVVRQGRLSRDQKDAILEGVAIVRDNDYCRALFGHALPAVPDCKSALFDFSLKLAKYGPWVSKHDVVALKNSGFDETAIQEAVVTTGIGHMLCTLADGLNPRLDAELRS
Ga0137392_1056763523300011269Vadose Zone SoilMSQFATTSNGPSRSYPQLIAVYGVLPNLFRAQSVLPRAIEAEQRLIDAVLVQQRRLSRNQKDTILSGVATVRGSDYCRALFAQAVPALPDRNSAVFDFSLRDFSLRLAKHGPRVSGRDVEALKSSGFDEQAILEAVATTSLGLMLCTLADGLRPHLDAELASPAPCEVLNVPEPTEWPKA
Ga0137391_1103125423300011270Vadose Zone SoilMSQFPTISSDPPRSYPQLVAVYGFLPNLFRAQSALPRAVEAEHQLIDTVVVRQGRLSRDQKDAILEGVAIVRDNDYCRALFGHALPAVPDCKSALFDFSLKLAKYGPWVSKHDVVALKNSGFDETAIQEAVVTTGIGHMLCTLADGLNPRLDAEL
Ga0137393_1102286413300011271Vadose Zone SoilMSQCATTSNGPSRSYPQLIAVYGVLPNLFRAQSVLPRAIEAEQRLIDAVLVQQRRLSRNQKDTILSGVATVRGSDYCRALFAQAVPALPDRNSALFDFSLRDFSLRLAKHGPRVSAHDVEALKSSGFDEQAILEAVATTSLGLMLCTLADGLRPHPDPKLASPAPGEVLNVSEPTSWPKASGPYLNLSFALDSCADSPASVVLREQYGFL
Ga0137363_1043392613300012202Vadose Zone SoilMSRFATSINDPLRSYPQLIAVYGVLPSLFRAQSVLPRAIEAEQRLIDAVLVQQRRLSRNQKDTILSGVATVRGSDYCRALFAQGVPVLPDRNSAVFDFSLRDFSLRLAKHGPRVSAHDVEALKSSGFDEQAILEAVATTSLGLMLCTLADGLRPHPDPKLASPSPGEVLNVSEPTSW
Ga0137363_1074272923300012202Vadose Zone SoilMSQSATIFNDPPRSYPQLIATYGFLPNLFQAQSAIPQAIEAEQQLMNTVLVRQGRLSRSQKDAILNGVATVRGCDYCRALFGHSLAALPDRNSGLVDFSLKLAKHGPWVSQSDVLILKNSGFDERAILEAIVTTAAGLMLCTLADGLSP
Ga0137363_1105704613300012202Vadose Zone SoilMSQFATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAQQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSTVVPGNSALFDFSLKLAKHGPWVSGRDVLTLKNSGFDERAILEAIVTTGVGLMLCTLAEGLRPRLDTELRSPDPCELLDVPEPLNWPK
Ga0137380_1024477013300012206Vadose Zone SoilMSQFPTISSDLPSSYPRLVAVYGFLPNLFRAQSALPRAVEAEHQLIDAVVVRQGRLSRDQKDAILKGVARVRNNDYCRALFGHALSAVPDCNSALFDFSLKLAKHGPWVSKHDVVTLKNSGFGETGILEAVVTTGIGHMLCTLADGLNPRLDAELRSPTPGELLN
Ga0137381_1019562313300012207Vadose Zone SoilMSQSATISNDPPKSYPQLVAAYGFLPNLFRVQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLASVPDRNSALFDFSLKLAKHGPWVSGHDLLSLKNCLVDEKAILEAIVTTGVGLMLCTLAEGLRPRLDTELRSPDPCELLDVPEPLNWPKAPGPHLGLPSDS
Ga0137379_1069108923300012209Vadose Zone SoilMSQSATIFNDPPRSYPQLIATYGFLPNLFRAQGTIPRAIEAEERLIETVVVRQGRLSRDQKDAILNGVATVWGSDYCRALFGHSLAAVPDRNFALFDFSLKLAKHGPCVSGRDVLTLKNSGFDERAILEAIVTTGVGLMLCTLADGLRP
Ga0137378_1068296923300012210Vadose Zone SoilMSQSATISNDPPKSYPQLVAAYGFLPNLFRVQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLASVPDRNSALFDFSLKLAKHGSWVSARDVLTLKNSGFD
Ga0137378_1077104913300012210Vadose Zone SoilMSQSATIFNDPPRSYPQLIATYGFLPNLFRAQGTIPRAIEAEERLIETVVVRQGRLSRDQKDAILNGVATVWGSDYCRALFGHSLAAVPDRNFALFDFSLKL
Ga0137378_1104678813300012210Vadose Zone SoilMSQFPTISSDPPRSYPQLVAVYGFLPNLFRAQSALPRAVEAEHQLIDAVVVRQGRLSRDQKDAILKGVASVRDNDYCRALFGHALPAVPDCNSALFDFSLKLAKHGPWVSKHDVVTLKNSGFDETAILEAVVTTGIGHMLCTLADGLNPRLDAELRSPTPGELLN
Ga0137386_1100721013300012351Vadose Zone SoilMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLASVPDRNSALFDFSLKLAKHGPWVSGRDVLTLKDSGFDERTILEAIVTTGVG
Ga0137390_1089755823300012363Vadose Zone SoilMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILNGVATVRGNDYCRALFGHSLGAVPDRNSALFDFCLKLAKHGPWVSGRDVLTLKDSGFDERTILEAIVTTGVGLMLCTLADGLRPRLDTELRSPA
Ga0137390_1101961813300012363Vadose Zone SoilMSQFATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAQQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSTVVPGNSALFDFSLKLAKHGPWVSGRDVLTLKNSGFDERAILEAIVTTGVGLM
Ga0137397_1004811213300012685Vadose Zone SoilMSQFPTISSDPPRSYPQLVAVYGFLPNLFRAQSALPRAVEAEHQLIDAVVVRQGRLSRDQKGALLKGVASVRDNDYCRALFGHALPAVPDCNSALFDFSLKLAKHGPWVSKHDVVTLKNSGFDETGILEAVVTTGIGHMLCTLADGLNPRLDAELRSPTPGELLNVPEPLSWP
Ga0137395_1008983413300012917Vadose Zone SoilMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAQQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSTVVPGNSALFDFSLKLAKHGPWVSGRDVLTLK
Ga0137394_1080118113300012922Vadose Zone SoilMSQFATTSEDLPRSYTNLVAAYGFLPNLFRAQSDLPHAIEAEQRLIEVVVVRSGRLSREEKDAILFGVATVRGSDYCRALFEHSLPAVSHRNSALLDFSLKLAKHGPRVSGRDVASLQNCGFNDRAILEAIVTTGVALMLCTLADGLQPELDLELGLVVPGEPPDLPEPSDWP
Ga0137419_1139925513300012925Vadose Zone SoilMSQSSTISNEQPRSHLQLVAVYGFLPNLFRAQSALPHAIDAEQRLIDTVVVRQGRLSRDQKDAILNGVATVRGSDYCRALFGHSLATLPKGNSALVDFSIKLAKHGPWVAAHDLLTLRNSGLDERAILEAIVTTGVGLMLCTLADGLRPRLDTELNSPAPAELPNVPEPVDWPE
Ga0137416_1130995423300012927Vadose Zone SoilMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSTVVPGNSALFDFSLKLAKHG
Ga0137404_1029663413300012929Vadose Zone SoilMSQFPTISSDPPRSYPQLVAVYGFLPNLFRAQSALPRAVEAEHQLIDAVVVRQGRLSRDQKDAILEGVASIRDNDYCRALFGHALPAVPDCKSALFDFSLKL
Ga0137410_1090463113300012944Vadose Zone SoilMSQSATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAQQGLIDTVVVRQGRLSRDQKDAILSGVATVRGSDYCRALFGHSRTAVPDRNSALFDFSLKLAKHGPWVSGHDVRTLKNSGFDERAILETIVTTGVGLMLCTLADGL
Ga0182019_1073471513300014498FenMSQPSTISNEPSRSYPQLIAAYGFLPNLFRAQSAVPHAIEAEQLLIDTVVVRQGRLSRDQKGAILNGVATVWGSDYCRALFGHSLAAVPDRNTALSDFSLKLAKQGPWVSGRDIGALREA
Ga0132255_10362698913300015374Arabidopsis RhizosphereMSQSATIFNDPPRSYPQLIATYGFLPNLFHAQSAIPQAIEAEQRLINTVLVRQGRLSRSQKDAILNAVATIRGCDYCRALFGHSLATLPDRNSGLFDFTLKLAKHSPWVSESDVLILKNFGFDERAILEAIVTTGLGIMLCTLADGLRPRLDTELRSPASS
Ga0187802_1019438413300017822Freshwater SedimentMSQSATISNDPARSYPQLVAAYGFLPNLFQVQSAMPDAIEAEQRLIDTVVVRQNILGRNQKDAILNGVATVRGSDYCRALFGHPLAAVPDHSSALLDFSLKLAKYGPWISEGDLLRLKSSGFDEKAVLEAIVTTGI
Ga0187802_1025807923300017822Freshwater SedimentMNQSSTTSNALTRSYPQLVAAYGFLPNLFRVQSAMPDVIEAEQQLIDTVVVRQGRLSKNQKDAILNGVATVRGSDYCRALFGHSLAAVADHSSALLDFSLKLAKYSPWVSESDILTLKSSGLDEEAVLEAIVTTGIGAMLCTLADGLRPLHD
Ga0187819_1046326923300017943Freshwater SedimentYPQLVAAYGFLPNLFRVQSAIPHAIEAEQRLIETVVVRQGRLSKHQKDAILNGVATVRGSDYCRALFGHSFSAAPDDSSALLAFSLKLAKRGPWVSGPDILTLKSFGFDEKAILEAIVTTGVGVMFCTLADGLRPRLDTELGSPAAPTT
Ga0187817_1022543013300017955Freshwater SedimentMSQPATISNDPARSYPQLVAAYGFLPNLFRAQSAIPQTIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRGGDYCRALFGPSLATVPDHSSALLDFSLKLATHGPWVSGSDLLGLKSSGFDEKAVLEAIVTTGIG
Ga0187817_1060979223300017955Freshwater SedimentMSQPATIPNHPASSYPQLVAVYGFLPNLFQLQSAIPHAMEAEQRLIEAVVVRQNRLSRNQKDAILNGVATVLGSDYCRALFGHSLAAAPDHSSALLDFSLKLARHGPWVSESDILRLKSSGFD
Ga0187816_1009450013300017995Freshwater SedimentLPMSQSATISNGPARSYPQLVAAYGFLPNLFQVQSAMPDAIEAEQRLIDTVVVRQNKLGRNQKDAILNGVATVRGSDYCRALFGHPLAAVPDHSSALLDFSLKLAKHGPWISEGDLLRLKSSGFDEKAVLEAIVTTGI
Ga0187816_1011674023300017995Freshwater SedimentMSQPATISNHPASSYPQLVAVYGFLPNLFQLQSAIPHAMEAEQRLIEAVVVRQNRLSRNQKDAILNGVATVLGSDYCRALFGHSLAAAPDHSSALLDFSLKLAKHGPWISEGDLLRLKSSGFDEKAVLEAIVTTGIGVMFCTLADGLRPLLDTELRSAPAEVLNVPEPLDWPKSSGAHLSLSSDSG
Ga0187816_1042470513300017995Freshwater SedimentMSQPATISNSPARSYPQLVAAYGFLPNLFRAQSAIPQTIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRGGDYCRALFGPSLATVPDHSSALLDFSLKLATHGPWVSGSDLLGLKSSGFDEKAVLEAIVTTGIGVMLCTVADGLRPLLDPELRSAAPAELLNVPEP
Ga0187816_1049400113300017995Freshwater SedimentMSQSATISNDPAKSYPQLVAAYGFLPNLFRVQSAIPHAIEAEQRLIETVVVRQGRLSKHQKDAILNGVATVRGSDYCRALFGHSFSAAPDDSSALLAFSLKLAKRGPWVSGPDILTLKSFGFDEKAILEAIVTTGV
Ga0187804_1029347923300018006Freshwater SedimentMSQPATISNHPASSYPQLVAVYGFLPNLFQLQSAIPHAIEAEQRLIEAVVVRQNRLSRNQKDAILNGVATVLGSDYCRALFGHSLAAAPDCNSVLIDFSLKLAKHGPWVSGSDALTLKNSGFDEMAILETVATT
Ga0187869_1024659913300018030PeatlandMSQPATISNDPARSYPQLVAAYGFLPNLFRAQSAIPQAIEAEQRLIDTVLVRQDGLSRNQKDAILNGVASVRGCDYCRALFGHSLTAVPDYSSALLDFSVKLARHGPWVSESDILRLKSSGFDEKAVLEAIVTTGIGVMFCTVADGLRPQLDTELRSP
Ga0179592_1026536813300020199Vadose Zone SoilMSQFATTSEDLPRSYPNLVAAYGFLPNLFRAQSDLPHAIEAEQRLIEVVVVRSGRLSREEKDAILFGVATVRGSDYCRALFEHSLPAVSHRNSALLDFSLKLAKHGPRVSGRDVASLQNCGFNDRAILEAIVTTGVALMLCTLADGLQPELDLELGLVVPGEPPDLPEPSDWPKA
Ga0207699_1136189013300025906Corn, Switchgrass And Miscanthus RhizosphereMTQPSTISNDLAGSHPQLVAVYGFLPNLFRVQSAVPPAIEAEQRLIDIVVVRQGRISRDQKEAILYGVATVRANDYCRALFGHALRAVPDRDSGLFDFCLKLAKHGPWVSERDVLTLKDSGFDERAILEAIVTT
Ga0207664_1076864513300025929Agricultural SoilMRQPATISNKPSRSYPQLIAAYGFLPNLFQVQSTLPQAIEAEERLIDTVVVRQGRLSRSQKNVILNAVATVRGCDYCRALFGDSLAALPDRNSGLFDFTLKLTKHGPWVSERDILTLKHS
Ga0207686_1066762713300025934Miscanthus RhizosphereMSQPANTSDDPQGSYPQLVAAYGFLPNLFQAQSALPHVVDAEQRLVDTVLVRQGRLSRTQKDTILRGVATVRGSDYCRALFGQALHALPDRSSALFDFSLKLAKHGPWVSGSDVEALKASGFDERAILEAVVTTAL
Ga0207665_1104287413300025939Corn, Switchgrass And Miscanthus RhizosphereMTQPSTISNDLAGSHPQLVAVYGFLPNLFRVQSAVPPAIEAEQRLIDIVVVRQSRISRDQKDAILYGVATVRANDYCRALFGHALRAVPDRDSGLFDFCLKLAKHGPWVSERDVLTLKDSGFDERAILEAIVTTGVGLMLCTLADGLRPRLDTELRSPAPAELLNVLRRFLVDDLDDVVDGDDALHAPLG
Ga0209804_127587413300026335SoilMSQFATISNDPPRSYPQLVAAYGFLPNLFRAQIAIPRAIEAQQGLIDTVVVRQGRLSRDQKDAILNGVATVRRSDYCRALFGHSSSVVPGNSALFDFSLKLAKHG
Ga0207781_100782223300026890Tropical Forest SoilMSQPATISDDPPTSYPQLVATYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQDRLSRHQKAAILNGVATVRGSDYCRALFGQSLTAMPDRNSALFAFSLKLVKYGPWVSENDVLGLKNCGFDEKAVLEAIATTAIGVMLCTVA
Ga0207858_103047413300026909Tropical Forest SoilMSQPATISDDPPTSYPQLVATYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQNRLSRHQKAAILNGVATVRGSDYCRALFGHSFTPAADPSSALLDFCLKLAKHGPWVSGADLLKPKTSGFDEKAILEA
Ga0207783_103059813300026942Tropical Forest SoilMSQPATISDDPPTSYPQLVAAYGFLPNLFQVQSTIPQAIEAEQRLIETVVVHQNRLSRHQKAAILNGVATVRGSDYCRALFGHSFTPAADPSSALLDFCLKLVKHGPWVSGADFLKPKTSGFDEKAILDVLGPDAKQVVSSGDDVEDRQNRAKFVQK
Ga0207824_101635413300026990Tropical Forest SoilMSQPATIPNDPTRSYPQLVSAYGFLPNLFQAQSDIPNAIKAEQRLIETVVVCPNRLSRNQKDAILNGVATVRGSDYCRALFGRSLAAVPDHGSALLDMCLKLAKHGPWLSGSDFL
Ga0207803_101883813300027000Tropical Forest SoilMSQPATIPNDPTRSYPQLVAAYGFLPNLFQVQSAMPDAIEAEQRLIETVVVRQDRLSRHQKAAILNGVATVRGSDYCRALFWHSFTPAADPSSALLDFCLKLAKHGPWVSGADFLKPKTSGFDEKTILEAIVTTGLGVMFCTVADG
Ga0207815_103410523300027014Tropical Forest SoilMSQPATISDDPPTSYPQLVAAYGFLPNLFQVQSTIPQAIEAEQRLIETVVVCPNRLSRNQKDAILNGVATVRGSDYCRALFGHSFTPAADPSSALLDFCLKLAKHGPWVSGADF
Ga0207819_103524513300027024Tropical Forest SoilMSQPATISDDPPTSYPQLVATYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQDRLSRHQKAAILNGVATVRGSDYCRALFWHSFTPAADPSSALLDFCLKLAKHGPWVSGSDLLILKSSGFDEKAVLETIVTTGIGVMLCTVADGLHPLLDTELKPPAP
Ga0207855_103805513300027039Tropical Forest SoilMSQPATISDDPPTSYPQLVAAYGFLPNLFQVQSTIPQAIEAEQRLIETVVVCPNRLSRNQKDAILNGVATVRGSDYCRALFGHSFTPAADPSSALLDFCLKLVKHGPWVSGADFLKPKTSGFDEKTILEAIVTTGLGVMFCTVADGL
Ga0207726_104246123300027045Tropical Forest SoilMSQPATISDDPPTSYPQLVATYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQDRLSRHQKAAILNGVATVRGSDYCRALFWHSFTPAADPSSALLDFCLKLAKRGPWVSGADFLKPKTSGFDEKAIL
Ga0207762_101328223300027063Tropical Forest SoilMSQPATISDDPPTSYPQLVAAYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQNRLSRHQKAAILNGVATVRGSDYCRALFGHSFTPAPDPSSALLDFCLKLGKHGPWVSGADLLKPKTSGFDEKAILEAIVTTGLGVMFCTVADGLRPRIDTEL
Ga0207777_101776413300027330Tropical Forest SoilMSQPATISDDPPTSYPQLVAAYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQDRLSRHQKAAILNGVATVRGSDYCRALFWHSFTPAADPSSALLDFCLKLAKHGPWVSGADFLKPKTSGFDEKTILEAIVTTGLGVMFCTVA
Ga0207761_101797713300027516Tropical Forest SoilMSQPATISDDPPTSYPQLVATYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQDRLSRHQKAAILNGVATVRGSDYCRALFWHSFTPAADPSSALLDFCLKLAKHGPWVSGADLLKPKTSGFDEKAILEAIVTTGLGVMFCTVADGLRPRIDTELNVPAPA
Ga0208043_106403813300027570Peatlands SoilMSQPATISNDPARSYPQLVAAYGFLPNLFRAQSAIPQAIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRVSDYCRALFGHSLPTMPDHSSALLDFSLKLATHGPWISGSDLLRLKSSGFDEKAVLEAIVTTGIGVMLCTVADGLRPLLDPELSSPAPAELLNVP
Ga0208324_101241643300027604Peatlands SoilMSQPATISNDPARSYPQLVAAYGFLPNLFRAQSAIPQAIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRVSDYCRALFGHSLPTMPDHSSALLDFSLKLATHGPWISGSDLLRLKSSGFDEKAVLEAIVTTGIGVMLCTVADG
Ga0208827_120964013300027641Peatlands SoilDPARSYPQLVAAYGFLPNLFRAQSAIPQAIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRVSDYCRALFGHSLPTMPDHSSALLDFSLKLATHGPWISGSDLLRLKSSGFDEKAVLEAIVTTGIGVMLCTVADGLRPLLDPELSSPAPAELLNVPEPVEWPKPSGAHL
Ga0207826_111816123300027680Tropical Forest SoilMSPPATISNDPPRSYPQLVSAYGFLPNLFQVQSAIPHAIEAEQRLIETVVVRQNRLSRNQKDAILIGVATVRGSDYCRALFGHSFTAVPDLSSALLDFSLKLAKHGPWVSGSDLLKLKSSGYDEKAVLE
Ga0207862_116083213300027703Tropical Forest SoilMSQPATISDDPPTSYPQLVAAYGFLPNLFQVQSTIPQAIEAEQRLIETVVVRQNRLSRHQKAAILNGVATVRGSDYCRALFWHSFTPAADPSSALLDFCLKLVKHGPWVSGADLLKPKTSGFDEKAILEAIVTTGLGVMFCTVADGLRPRIDTELNVPAPAEPPNIPE
Ga0209517_1051091913300027854Peatlands SoilMSQPATISNDPARSYPQLVATYGFLPNVFRAQSAIPQAIEAEQRFIDTVVVRQNRLSRNQKDAILNGVATVRGSDYCRALFGHSLAAMPDHSFALLDFSLKLAKHGPWVSGSDVLTLKKSGFDERAVLEAIVTTGIGVMFCTLADGLRPLLDTELRPPAPAEV
Ga0209068_1019108023300027894WatershedsMNQSSTTSNDLPTSHPQLVAVFGFLPNLFQVQSAIPPAIEAEQGLIDSVLVRQGRLSRNQKDAILNGVATVRGSDYCRALFGHSFAAVPDHNSVLLDFSLKLARHGPWVSERDILILKSSGFDEKAVLEAIVTTGIG
Ga0209067_1016162013300027898WatershedsMNQSATISNDPARSYPQLVAAYGFLPNLFRAQSAIPHAIEAEQGFIDSVLVRQSRLSRNQKDAILSGVATVRGSDYCRALFGRSFATMPGHSSALLDFSLKLAKHGPWVSESDLLR
Ga0209583_1012314813300027910WatershedsMSQSATISNDPPKSYPQLVAAYGFLPNLFRAQIAIPRAIEAEQGLIDTVVVRQGRLSRNQKDAILDGVATVRGNDYCRALFGHSLAAVPDRNSALFDFSLKLAKHGPWVSERDVLTLKNSGFDERAILEAIVTTGVGLMLCTLADGLRPRLDAELRSPAPGEL
Ga0209698_1064377113300027911WatershedsMSQSATISNDPARSYPQLVAAYGFLPNLFRVQSAVPDAIEAEQGLIDSVLVRQGRLSRHQKDAILNGVSTVRGNDYCRALFGHSFAAMPHHSSALLDFSLKLAKHGPWVSGRDVLTLKNSGFDDKTILEAIVTTGIAVMFCSVADGLRPLLDRELSTPAPAEVLNVSEPLDWPNPSGAHL
Ga0209698_1065677323300027911WatershedsMNQSATISNDPARSYPQLVAAYGFLPNLFRAQSAIPHAIEAEQGFIDSVLVRQSRLSRNQKDAILSGVATVRGSDYCRALFGRSFATMPGHSSALLDFSLKLAKHGPWVSESDLLRLKSSKFDGKAILEAIVTTGIGVMFCTLAEGLHPLLD
Ga0302166_1003303323300028652FenMLQGLAVSVRLSRYTCNPIVIVYECMRQPATISNDPLRSYPGLIAAYGFLPNLFRVQSALPGVIEAEQQLIEAVVVRPGRLSRTEKEAILNSVVTVRGSDYCRALFGHSLAAVPDRNTVLSDFSLKLAKHGLWVSGRDIGELREAAFDDTAILEAVATTAVAVMLCTLADGLRPPLDTDLRS
Ga0302203_112164113300028853FenATISNDPLRSYPGLIAAYGFLPNLFRVQSALPGVIEAEQQLIEAVVVRPGRLSRTEKEAILNSVATVRGSDYCRALFGHSLAAVPDRNTVLSDFSLKLAKHGLWVSGRDIGELREAAFDDTAILEAVATTAVAVMLCTLADGLRPPLDTDLRSPGSYEPPKIPEPLD
Ga0302163_1005300123300028868FenMLQGLAVSVRLSRYTCNPIVIVYECMRQPATISNDPLRSYPGLIAAYGFLPNLFRVQSALPGVIEAEQQLIEAVVVRPGRLSRTEKEAILNSVATVRGSDYCRALFGHSLAAVPDRNTVLSDFSLKLAKHGLWVSGRDI
Ga0302272_104994823300030001BogMLQGLAVSVRLSRYTCNPIVIVYECMRQPATISNDPLRSYPGLIAAYGFLPNLFRVQSALPGVRPGRLSRTEKEAILNSVATVRGSDYCRALFGHSLAAVPDRNTVLSDFSLKLAKHGLWVSGRDIGELREAAFDDTAILEAVATTAVAVMLCTLADGLRPPLDTDLRSPGSYEPPKIPEPLDWPEGAGPHLGSSSGSASDSHA
Ga0302207_101653553300030230FenMLQGLAVSVRLSRYTCNPIVIVYECMRQPATISNDPLRSYPGLIAAYGFLPNLFRVQSALPGVIEAEQQLIEAVVVRPGRLSRTEKEAILNSVVTVRGSDYCRALFGHSLAAVPDRNTVLSDFSLKLAKHGLWVSGRDIGELREAAFDDTAILEAVATTAVAVMLCTLADGLRPPL
Ga0310037_1025766523300030494Peatlands SoilMSQPATISNDPARSYPQLVAAYGFLPNLFRAQSAIPQAIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRVSDYCRALFGHSLPTMPDHSSALLDFSLKLATHGPWISGSDLLRLKSSGFDEKA
Ga0310039_1018587923300030706Peatlands SoilMSQPATISNDPARSYPQLVAAYGFLPNLFRAQSAIPQAIEAEQRLIDTVLVRQDGLSRNQKDAILNAVATVRVSDYCRALFGHSLPTMPDHSSALLDFSLKLA
Ga0302297_108219913300031244FenTCNPIVIVYECMRQPATISNDPLRSYPGLIAAYGFLPNLFRVQSALPGVIEAEQQLIEAVVVRPGRLSRTEKEAILNSVATVRGSDYCRALFGHSLAAVPDRNTVLSDFSLKLAKHGLWVSGRDIGELREAAFDDTAILEAVATTAVAVMLCTLADGLRPPLDTDLRSPGSYEPPKIPEPLDWPEGAGPHLGSSSGS
Ga0310909_1157040913300031947SoilANGRISRYTGKPLFFMSQSAANFNDPPRSYPQLIAAYGFLPNLFQAQSTMPQALAAEQRLIDTIVVRQGRLSRDQKDSILDAVATVRANEYCRALFGHSIAGVPHRNPALLDFSLKLAKHGPWVSGNDRLALKNSGFDERAILEAIVTTGVALMLCTLADGLRPTLDTELMAP
Ga0306920_10287350623300032261SoilMSQPATIPNDPTRSYPQLVAAYGFLPNLFQAQSDIPDAIKAEQRLIETVVVCPNRLSRNQKDAILNGVATVRGSDYCRALLEQSLTAMPDRNSALFAFSLKLVKHGPWVSESDVLGLKNCGFDEKAVLEAIATTGIG
Ga0335079_1009227313300032783SoilMSQSATIFNDPPGSYPQLVALHGFLPNLFRAQSAIPRAIEAEERLIDTVVVRQGRLSRDQKDAILNGVATVRGSDYCRALFGPSLAAVPDRNFALFDFSLKLAKHGPWVSGNDLLTLKNSGFDERAILEAIVT
Ga0335076_1008258313300032955SoilMSQPSTISNDLAGSHPQLVAAYGFLPDLFRVQSALPRALEAEQQLINTVVVREGRLSREQKDAILNDVATVWGSDYCRALFGHSFAAVPDRNFALFDFCLKLAKHGPWVSGRDVLTLKNSGFDETAILEAIVTTGVGLMLCTLAEGLRPRLDTELKSPAPAELLNVPEPLDWPNTPGPHLGSLSDSV
Ga0335077_1197084413300033158SoilMSQSATIFNDPPRSYPQLIATYGFLPNLFRAQSTIPRAIEAEERLIDTVVVHQGRLSRDQKDAILNGVATVWGSDYCRALFGHSLAAVPDRNFALFDFSLK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.