NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F049903

Metagenome / Metatranscriptome Family F049903

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F049903
Family Type Metagenome / Metatranscriptome
Number of Sequences 146
Average Sequence Length 87 residues
Representative Sequence MQRLNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSAEGQCDSGA
Number of Associated Samples 130
Number of Associated Scaffolds 146

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 80.14 %
% of genes near scaffold ends (potentially truncated) 96.58 %
% of genes from short scaffolds (< 2000 bps) 89.04 %
Associated GOLD sequencing projects 124
AlphaFold2 3D model prediction Yes
3D model pTM-score0.33

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (83.562 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(31.507 % of family members)
Environment Ontology (ENVO) Unclassified
(31.507 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.575 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150
1NODE_05107941
2INPhiseqgaiiFebDRAFT_1007171012
3INPhiseqgaiiFebDRAFT_1018248101
4AF_2010_repII_A01DRAFT_10158781
5AF_2010_repII_A1DRAFT_100396793
6JGI1027J11758_125613441
7AP72_2010_repI_A100DRAFT_10273921
8AP72_2010_repI_A001DRAFT_10432262
9F14TB_1003990542
10JGI25614J43888_100740573
11JGI25617J43924_101105393
12JGI25616J43925_100457224
13Ga0066396_100796831
14Ga0063356_1005378554
15Ga0066388_1017429983
16Ga0066388_1049852611
17Ga0008090_102479821
18Ga0070714_1018234551
19Ga0066681_106555081
20Ga0066701_100915994
21Ga0066701_102589853
22Ga0066707_100607294
23Ga0066698_105372633
24Ga0066705_104755611
25Ga0066905_1003913123
26Ga0066903_1012256994
27Ga0066903_1033568503
28Ga0066903_1041828352
29Ga0066652_1007723441
30Ga0075028_1008047451
31Ga0075432_104577571
32Ga0075018_103819091
33Ga0070716_1012334451
34Ga0070712_1018635992
35Ga0075428_1018967332
36Ga0075421_1002795651
37Ga0075419_110995981
38Ga0075435_1008529841
39Ga0099794_100047001
40Ga0066710_1001127562
41Ga0099829_110455262
42Ga0099830_103398041
43Ga0066709_1026034491
44Ga0111538_119584732
45Ga0111538_120646822
46Ga0126380_113646581
47Ga0126382_100403864
48Ga0134070_101713532
49Ga0134086_103673631
50Ga0126378_119532391
51Ga0126377_134910881
52Ga0126379_118175632
53Ga0126379_136021691
54Ga0126381_1013668161
55Ga0124844_10307654
56Ga0124844_10311301
57Ga0124844_10317414
58Ga0153974_11001712
59Ga0137399_116162241
60Ga0137377_103054644
61Ga0137385_104848061
62Ga0137375_110985021
63Ga0137413_118357431
64Ga0137404_115092001
65Ga0137407_100045519
66Ga0137407_110374831
67Ga0164302_105124411
68Ga0126369_103440391
69Ga0126369_129798992
70Ga0134110_101078633
71Ga0164305_108094281
72Ga0134078_100451971
73Ga0137412_104464752
74Ga0137403_105039223
75Ga0132256_1022759692
76Ga0182036_107538771
77Ga0182041_106939581
78Ga0182033_116992661
79Ga0182032_101429454
80Ga0182039_114529482
81Ga0182038_101874614
82Ga0066655_101775561
83Ga0066669_115543671
84Ga0210408_111358451
85Ga0213872_102168461
86Ga0210397_116371911
87Ga0210410_110549831
88Ga0210410_116510162
89Ga0242662_101327601
90Ga0207693_109765172
91Ga0209055_11909602
92Ga0209239_10135145
93Ga0209647_10000231
94Ga0209647_10071287
95Ga0209131_10008691
96Ga0257176_10555852
97Ga0179587_109893632
98Ga0209465_102177483
99Ga0209465_102749963
100Ga0307286_101979391
101Ga0308189_102108942
102Ga0308181_10424911
103Ga0318534_100129035
104Ga0318541_103916012
105Ga0318538_102404473
106Ga0318571_102259502
107Ga0318515_102624503
108Ga0318555_100907151
109Ga0318555_107431782
110Ga0318561_101525771
111Ga0318574_101630623
112Ga0318574_105385522
113Ga0318560_102022061
114Ga0307469_123078782
115Ga0318493_100981301
116Ga0318501_100542241
117Ga0318502_100359802
118Ga0318494_101357614
119Ga0318537_102697662
120Ga0318526_101252843
121Ga0318546_106646981
122Ga0318508_11485391
123Ga0318548_100342514
124Ga0318503_101769021
125Ga0318557_104254242
126Ga0318523_102074001
127Ga0318565_103583392
128Ga0318568_109101901
129Ga0307473_113295492
130Ga0318567_102074631
131Ga0318499_100509684
132Ga0318527_103952961
133Ga0318495_100652154
134Ga0318544_101325981
135Ga0318536_100405821
136Ga0310916_100132086
137Ga0214473_123962401
138Ga0318530_101487481
139Ga0318563_106072662
140Ga0318569_105099202
141Ga0318549_102460602
142Ga0318545_101975562
143Ga0318575_107258191
144Ga0318505_102831792
145Ga0318577_101468021
146Ga0268251_105117682
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 29.91%    β-sheet: 11.21%    Coil/Unstructured: 58.88%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

10203040506070MQRLNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSAEGQCDSGASequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.33
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
83.6%16.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Soil
Grasslands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Arabidopsis Thaliana Rhizosphere
Populus Rhizosphere
Rhizosphere
Arabidopsis Rhizosphere
Agave
Attine Ant Fungus Gardens
Sugar Cane Bagasse Incubating Bioreactor
Tropical Rainforest Soil
3.4%9.6%6.2%7.5%7.5%31.5%4.1%8.2%4.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
NODE_051079413300000156Sugar Cane Bagasse Incubating BioreactorMQGQRQIVLRLIAVAAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFTLRPVTAADLVNAAGQCTGDGEQAGTDASAA
INPhiseqgaiiFebDRAFT_10071710123300000364SoilMVLHALAMAALVLALGGCSSDLSLNNLTLAPKPETLMRKPDWATFSAGKNDLGQRPVTAADLINRDGQCP
INPhiseqgaiiFebDRAFT_10182481013300000364SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEGQCDGGAGQAAAGPVAGGI
AF_2010_repII_A01DRAFT_101587813300000580Forest SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWANFSGGKNNFELRPVTAADLVSAEGQCNSGDGQAAADPTA
AF_2010_repII_A1DRAFT_1003967933300000597Forest SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLTGVTLVPKPETLLRKPDWANFSGSKNDFELRPVTAADLVSAEGQCDSGAGQA
JGI1027J11758_1256134413300000789SoilMQRGNQILRVLAMTALGSAVAACSSDLSLNNVTLVPKPETLLXKPDWATFSGGKNDFALRPVTAADLVNAAGQCSGETEQAGNDSTTAGAAPITGG
AP72_2010_repI_A100DRAFT_102739213300000837Forest SoilMQRVNQIVLRMIAMATLAFAMAACSSDLGLNGVTLVPKPETLLRKPDWTTFSGGRNDFELRPVTAADLVSAEGQCNSGAGQAAADSTAAG
AP72_2010_repI_A001DRAFT_104322623300000893Forest SoilMQTVNQIRLRMIAVAAFCCALAACSSDLSLNNVTLVPKPETLLRKPDWATFSGSKNEFTLRPVTAADLVNAAGQCAGEGEQAGSDPTTGGAAPVAGGIALQMTE
F14TB_10039905423300001431SoilMQRLDQIVLRMIAMATLGSTMAACSSDLGLTNVTLPKPDTLLRKPDWATFSGGKHDFTLRPVTAADLVNAAGQCSGESAQAGTDPTAGAAPPVAGGIALQM
JGI25614J43888_1007405733300002906Grasslands SoilMQRLNQIVLRMIAMATLACAVAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVGAEGQCDSGTGQAAADPTAAG
JGI25617J43924_1011053933300002914Grasslands SoilMQRLNQIVLRMIAMATLACAVAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVGAEGQCDSGTGQAAADPT
JGI25616J43925_1004572243300002917Grasslands SoilMQRPNQILRLLAMMALGSAVAACSSDLGLNNVTLVPKPETLLRKPDWATFSGGKSDFVLRPVTAGDLVNAAGQCAGEA
Ga0066396_1007968313300004267Tropical Forest SoilVAMQRVIQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAG
Ga0063356_10053785543300004463Arabidopsis Thaliana RhizosphereMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEGQCDGGAGQAAAEPVAGGIALQMTE
Ga0066388_10174299833300005332Tropical Forest SoilMQRLNQIMLRVIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKSDFELRPVTAADLVSAEGQCDSGAGQAAADPTAAGGTVAGGIA
Ga0066388_10498526113300005332Tropical Forest SoilMNRSGWRMALIAALAAGVAACSTDLSLTNVTLPKADSVLRKPDWATYSGGAKDFELRPISAADLVGPDGQCAAAGEA
Ga0008090_1024798213300005363Tropical Rainforest SoilMQGQRQIVLRLIAVAAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFTLRPVTAADLVNAAGQCTGDGEQAGTDASAAGQVSGGIALQMTECD
Ga0070714_10182345513300005435Agricultural SoilMQRLNQIVLRMIAMATLACAMSACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCDSGAGQAAADPTAAGGTVAGGIALQMTEC
Ga0066681_1065550813300005451SoilMQRFVRIGTALIAVAVLAGGLGACSSDLGLNNVTLVPKPETLLRKPDWATFSGGKNEFGLRPVTAADLIDSQGQCSGGPE
Ga0066701_1009159943300005552SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEGQCDGGAGQAAA
Ga0066701_1025898533300005552SoilMIAMATLGCALAACSSDLSLNNVTLPKPDTLLRKPDWATFSGGKHDFTLRPVTAADLVNAAGQCSSESAQAGTDPTVGAAPP
Ga0066707_1006072943300005556SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEGQC
Ga0066698_1053726333300005558SoilMQRLNQIVLRMIAMATLGCAVAACSSDLSLNNVTLPKPETLLRKPDWATFSGGKHDFTLRPVTAADLVNAAGQCSSESVQAGTDPTVGAAPPAAGGIALQMTECDVVRRAGPVEKIDFAS
Ga0066705_1047556113300005569SoilMQTVNQIRLRMIAVAAFCCALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGSKNEFALRPVTAADLVNAAGQCAGEGEQAGSDPTTGG
Ga0066905_10039131233300005713Tropical Forest SoilMAALVPALGACSSDLSLNNLTLVPKPETLMRKPDWATFSAGKNDITLRPLTSADFVNQDG
Ga0066903_10122569943300005764Tropical Forest SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLTGVTLIPKPETLLRKPDWANFSGGKNDFELRPVTAADLVSADGQCNSAEGQ
Ga0066903_10335685033300005764Tropical Forest SoilLCPIASRPKALRLLALAALAPAIVACSTDLSLNNVTLAPKPDQLFRKPDWATFSGGKNDFELRPITPADLVAPDGSCPI
Ga0066903_10418283523300005764Tropical Forest SoilMQTVNQIRLRTITVAAFCCALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGSKNEFTLRPVTAADLVNAA
Ga0066652_10077234413300006046SoilMRRILMRMLAVAALASGPGACSTDLSLSNVTLVPKPETLTRKPDWATFSGGKTDFQLRPITAADLVGPEGQCGGGNPAAAAGFADSQAGGGAPP
Ga0075028_10080474513300006050WatershedsMQRLNQIVLRMIAMATLACAVAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVGAEGQCDSGTGQAAAD
Ga0075432_1045775713300006058Populus RhizosphereMQRLNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSAEGQCDSGA
Ga0075018_1038190913300006172WatershedsMQRLNQIVLRMIAMATLACAMSACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEGQCD
Ga0070716_10123344513300006173Corn, Switchgrass And Miscanthus RhizosphereMQRSNGVAWRVGAAAALAVAVGACSSDLSLNNVTLAPKPDNLLRKPDWATYSGGKSDFELRPVTAADLVDRDGQCS
Ga0070712_10186359923300006175Corn, Switchgrass And Miscanthus RhizosphereMQRLNQIVLRMIAMATLACAMSACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCDSGAGQAAADPTAAGGTV
Ga0075428_10189673323300006844Populus RhizosphereMIAMATLGAAVAACSTDLGLNNVTLPKPETLLRKPDWATFSGGKHDFTLRPVTAADLVNAAGQCSSESAQAGSDPTV
Ga0075421_10027956513300006845Populus RhizosphereMAALVQALAACSSDLSLNNLTLVPKPETLMRKPDWATFSAGKNDSSLRPLTSADFVNQDGQCA
Ga0075419_1109959813300006969Populus RhizosphereMRRSSRNLLRIVAATAVASAAAACSSNLSLTDVTLAPKPSTMMTRPDWATFSGGKNDFELRPITAADLVSPEGQCAAAPGQAAGFADS
Ga0075435_10085298413300007076Populus RhizosphereMQRLNQIGLRVIAAAAFCCALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGGKDGFTLRPVTAADLVNAAGQCAEGEQAG
Ga0099794_1000470013300007265Vadose Zone SoilMQRLNQIVLRMIAMATLACAVAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKSDELRPVTAADLVGAEGQCDSGGQAAADPTAAGGSAAGGIA
Ga0066710_10011275623300009012Grasslands SoilMIAMATLGCVVAACSSDLSLNNVTLPKPETLLRKPDWATFSGGKHDFTLRPVTAADLVNAAGQCTSESP
Ga0099829_1104552623300009038Vadose Zone SoilMQRLNQIVLRMIAMATLACAVAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSE
Ga0099830_1033980413300009088Vadose Zone SoilMQRVNRIALRIVVLAALASAIGACSSDLSLSNVTLVPKPETLLRKPDWATFSGGKNDFELRPITAADLVGPEGQCGADATGQAAGFADPTAAGGAQPAA
Ga0066709_10260344913300009137Grasslands SoilVLAITAFGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFTLRPVTAADLVNAAGQCAGEAEQAGNDATTAG
Ga0111538_1195847323300009156Populus RhizosphereMRRSSRNLLRIVAATAVASAAAACSSNLSLTDVTLAPKPSTMMTRPDWATFSGGKNDFELRPITAADLVGPEGQCAAAPGQAAGVADS
Ga0111538_1206468223300009156Populus RhizosphereMQRVNQIGLRVIAAAAFCCALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGGKDGFTLRPVTAADLVNAAGQCAEGEQAGNDSTATGALVAGG
Ga0126380_1136465813300010043Tropical Forest SoilMQRLNQIMLRVIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKSDFELRPVTAADLVSAEGQC
Ga0126382_1004038643300010047Tropical Forest SoilMTALGSAVAACSSDLSLNNVTLVPKPETLMRKPDWATFSGGKNDFVLRPVTAADLVNAAGQCAG
Ga0134070_1017135323300010301Grasslands SoilVLAITAFGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFTLRPVTAADLVNAAGQCAGEAE
Ga0134086_1036736313300010323Grasslands SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEG
Ga0126378_1195323913300010361Tropical Forest SoilMQRLKQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKSDFELRPVTAADLVSAEGQCDSGAGQAAADPTAAG
Ga0126377_1349108813300010362Tropical Forest SoilMLVLTQKFRTMAPWSRPAVEAVAMQGQRQIVLRLIAVAAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFALRPVTAADLVNAAGQCTGDGEQAGTDASAAVSGGIALQMTECDVV
Ga0126379_1181756323300010366Tropical Forest SoilLIRLLWVEDLTMQRFKANALRMIAMAALTAAVGACSSDLSLSNVTLVPKPETMLRKPDWATFSGGKNDFELRPITAADLVSPEGQCGAGAQGFADPAAPG
Ga0126379_1360216913300010366Tropical Forest SoilMIAVAVFSSAVAACSSDLGLNNVTLVPKPDTLLRKPDWATFSGGKNEFTLRPVTAADLVNSAGQCAGDGGQAGSDPATAGAGLVAGGIALQMTECDVV
Ga0126381_10136681613300010376Tropical Forest SoilMPAIAACSTDLSLNNVTLAPKPDQLFRKPDWATFSGGKNDFELRPITPADLVAPDGSCP
Ga0124844_103076543300010868Tropical Forest SoilMQRGNQILRVLAMTALGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFVLRPVTAADLVNAAGQCAGETDQAGNDSTTAGAAPATGGIA
Ga0124844_103113013300010868Tropical Forest SoilMQRGNQILRVLAMTALGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFVLRPVTAADLVNAAGQCAGETDQAGNDSTTAGAAPATGGI
Ga0124844_103174143300010868Tropical Forest SoilMQRGNQILRVLAMTALGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFVLRPVTAADLVNAAGQCAGETDQAGNDSTTAGAAPATG
Ga0153974_110017123300012180Attine Ant Fungus GardensMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKAEFELRPVTAADLVSAEGQCDSG
Ga0137399_1161622413300012203Vadose Zone SoilMQRLNQIVLRMIAMATLACAVAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKSDELRPVTAADLVGAEGQCDSGGQ
Ga0137377_1030546443300012211Vadose Zone SoilMTVFGSAVAACSSDLSLNNVTLVPKPETLLRKSDWATFSGGKNDFTLRPVTAADLV
Ga0137385_1048480613300012359Vadose Zone SoilMQRLNQIVLRMIAMATLGCALAACSSDLSLNNVTLPKPETLLRKPDWATFSGGKHDFTLRPVTAADLVNAAGQCSSESAQAGTDPTVGAAPPV
Ga0137375_1109850213300012360Vadose Zone SoilVVVLAALGAAVGACSSDLSLNNVTLAPKPETLLRKPDWATFSGGKNDFDLRPITPAD
Ga0137413_1183574313300012924Vadose Zone SoilMRRSNRILLCVIALASAVGACSSDLSLSNVTLVPKPETLMRKPDWATFSGGKNDFELRPITAADLVGPEGQCGGGAP
Ga0137404_1150920013300012929Vadose Zone SoilMRRSNRTLVRVVAVAALASGIGACSSDLSLSNVTLVPKPETLLKKPDWATFSGGATDFQLRPITAADLVGPD
Ga0137407_1000455193300012930Vadose Zone SoilMQRLNQIVLRMIAMATLGCALAACSSDLSLNNVTLPKPETLLRKPDWATFSGGKHDFTLRPVTAADLVNAAG
Ga0137407_1103748313300012930Vadose Zone SoilVKDLAMRRILMRMLAVAALASGPGACSTDLSLSNVTLVPKPETLTRKPDWATFSGGKTDFQLRPITAA
Ga0164302_1051244113300012961SoilMQRLNQIGLRVIAAAAFCCALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGGKDGFTLRPVTAADLVNAAGQRAEGEQAGNDSTATGAPV
Ga0126369_1034403913300012971Tropical Forest SoilMIAVAVFSSAVAACSSDLGLNNVTLVPKPDTLLRKPDWATFSGGKNEFTLRPVTAADPVNSAGQCAGDGGQAGS
Ga0126369_1297989923300012971Tropical Forest SoilMQRLNQIMLRVIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCNSGDGQAAAD
Ga0134110_1010786333300012975Grasslands SoilVLAITAFGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFTLRPVTAADLVNAAGQCAGEAEQ
Ga0164305_1080942813300012989SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSSEGQCDGAAGQAAAEPVA
Ga0134078_1004519713300014157Grasslands SoilMTVFGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFTLRPVTAADLVNAAGQCAGEAEQAGNDTTTAGSA
Ga0137412_1044647523300015242Vadose Zone SoilMQRPNQILRLLAMMALGSAVAACSSDLGLNNVTLVPKPETLLRKPDWATFSGGKSDFVLRPVTAGDLVNAAGQCAGEADQAGNDSTTYFPC*
Ga0137403_1050392233300015264Vadose Zone SoilMRRSNRTLVRVVAVAALASGIGACSSDLSLSNVTLVPKPETLLKKPDWATFSGGATDFQLRPITAADLV
Ga0132256_10227596923300015372Arabidopsis RhizosphereMQRLNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSAEGQ
Ga0182036_1075387713300016270SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAGPVEKIDI
Ga0182041_1069395813300016294SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLTGVTLVPKPETLLRKPDWANFSGSKNDFELRPVTAADLVSAEGQCNSGDGQAATDSTAAG
Ga0182033_1169926613300016319SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWANFSGGKNNFELRPVTAADLVSAEGQCNSGDGQAATDSTAAG
Ga0182032_1014294543300016357SoilMQGQRQIVLRLIAVTAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFALRPVTAADLVNAAGQCTGEGEQAGT
Ga0182039_1145294823300016422SoilMQGQRQIVLRLIAVTAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFALRPVTAADLVNAAGQCTGEGEQAGTDASAAVSGG
Ga0182038_1018746143300016445SoilMQGQRQIVLRLIAVTAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFALRPVTAADLVNAAGQCTGEGEQAGTDASAAVSGGIALQMTECDVVRRAGQVDKIDFAS
Ga0066655_1017755613300018431Grasslands SoilMIAMATLGCVVAACSSDLSLNNVTLPKPETLLRKPDWATFSGGKHDFTLRPVTAADLVNAAGQCSS
Ga0066669_1155436713300018482Grasslands SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEGQCDGGAGQA
Ga0210408_1113584513300021178SoilMQRSNGLAWRVGAAAALAVAVGACSSDLSLNNVTLAPKPDNLLRKPDWATYSGGKSDFELRPVTAAVLVGRDGQCSGGDQSAG
Ga0213872_1021684613300021361RhizosphereMQRLNQIVLRLIAMASLACAMAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCDSAAGQAAADPTAAGGTVAGGIALQM
Ga0210397_1163719113300021403SoilMQRLNQIGLRVIAAAAFCCALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGGKDGFTLRPVTAADLVNAAGQCAEGEQAGNDSTAT
Ga0210410_1105498313300021479SoilMQRLNQIGLRVIAAAAFCCALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGGKDGFTLRPVTAADLVNAAGQCAEGEQAGSDSTATGAPVAGGIALQMTECD
Ga0210410_1165101623300021479SoilMQTLNQIGLRMIAVAVFCSALAACSSDLGLNNLTLVPKPETLLRKPDWATFSGGKNEFTLRPVTAADLVNAAGQCAGEGEQAGSDSTTAG
Ga0242662_1013276013300022533SoilMQRLNQIVLRMIAMASLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSSEGQCD
Ga0207693_1097651723300025915Corn, Switchgrass And Miscanthus RhizosphereMQRLNQIVLRMIAMATLACAMSACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCDSGAGQAAADPTAPGGSVAGG
Ga0209055_119096023300026309SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCDSGA
Ga0209239_101351453300026310Grasslands SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVSSEGQCDSGAGPAAADPTAA
Ga0209647_100002313300026319Grasslands SoilMQRPNQILRLLAMMALGSAVAACSSDLGLNNVTLVPKPETLLRKPDWATFSGGKSDFVLRPVTAGDLVNAAGQCAGEADQAGNDSTTSGAAPVTGGIALQM
Ga0209647_100712873300026319Grasslands SoilMQRLNQIVLRMIAMATLACAVAACSSDLSLTGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVGAEGQCDSGTGQAAADPTAADDGMRRGAARRPGRKDRYRFR
Ga0209131_100086913300026320Grasslands SoilMQRPNQILRLLAMMALGSAVAACSSDLGLNNVTLVPKPETLLRKPDWATFSGGKSDFVLRPVTAGDLVNAAGQCAGEADQAGNDSTTSGAAPVTGGI
Ga0257176_105558523300026361SoilMQQPNQILRLLAMMALGSAVAACSSDLGLNNVTLVPKPETLLRKPDWATFSGGKSDFVLRPVTAGDLVNAAGQCAGEAD
Ga0179587_1098936323300026557Vadose Zone SoilMQRLNQIGLRMIVVAVFCSALAACSSNLSLNNLTLVPKPETLLRKPDWATFSGSKSEFTLRPVTAADLVNATGQCAGEGEQAGSDPTTAG
Ga0209465_1021774833300027874Tropical Forest SoilVAMQRVNQIVLRMIAMATLAGAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKNDFELRPVTAADLVGAEGQCNSRGGEAAADPTATGATVAGGIALQMTECDV
Ga0209465_1027499633300027874Tropical Forest SoilMQRGNQILRVLAMTALGSAVAACSSDLSLNNVTLVPKPETLLRKPDWATFSGGKNDFVLRPVTAADLVNAAGQCAGETDQAGNDSTTAGAAPATGG
Ga0307286_1019793913300028876SoilMQRSKAIALRLIAMAALVPVVAACSSDLSLNNVTLVPKPETMLRKPDWATFSGAKSDFELRPITSADLVAPDGTCPIVHGQAAGSAD
Ga0308189_1021089423300031058SoilMQRSKAIALRLIAMAALVPVVAACSSDLSLNNVTLVPKPETMLRKPDWATFSGAKSDFELRPITSADLVAPDGTCPIVQGQAAGSAD
Ga0308181_104249113300031099SoilMQRSKAIALRLIAMAALVPVVAACSSDLSLNNVTLVPKPETMLRKPDWATFSGAKSDFELRPITSADLVAPDGTCPIVQ
Ga0318534_1001290353300031544SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAG
Ga0318541_1039160123300031545SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAGPV
Ga0318538_1024044733300031546SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWANFSGGKNNFELRPVTAADLVSAEGQCNSGDGQAAVDPT
Ga0318571_1022595023300031549SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAAD
Ga0318515_1026245033300031572SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWANFSGGKNNFELRPVTAADLVSAEGQCNSGDGQAAVDPTAAG
Ga0318555_1009071513300031640SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVR
Ga0318555_1074317823300031640SoilMQRLGRICRDLTAAAIVALAVGACSTDLGLNNVTLVPKPETLLRKPDWATFSGGKNEFELRPVTAADLVNAQGQCASGLEQQ
Ga0318561_1015257713300031679SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAG
Ga0318574_1016306233300031680SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAE
Ga0318574_1053855223300031680SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCDSAAGQAAADPTAAGGTVAGGIALQMT
Ga0318560_1020220613300031682SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGET
Ga0307469_1230787823300031720Hardwood Forest SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQC
Ga0318493_1009813013300031723SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCDSAAGQAAADPTAAGGTVA
Ga0318501_1005422413300031736SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCDSAAGQAAADPTAAG
Ga0318502_1003598023300031747SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCDSAAGQAAADPTAASRCR
Ga0318494_1013576143300031751SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLSGVTLVPKPETLLRKPDWANFSGGKNNFELRPVTAADLVSAEGQC
Ga0318537_1026976623300031763SoilMQRLNQIVLRMIAMAMLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCDTATAQAAADPTAAGGSVTGG
Ga0318526_1012528433300031769SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTI
Ga0318546_1066469813300031771SoilMQGQRQIVLRLIAVTAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFALRPVTAADLVNAAGQCTGEGEQAGTDASAAVSGGIALQMTECD
Ga0318508_114853913300031780SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNS
Ga0318548_1003425143300031793SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTIA
Ga0318503_1017690213300031794SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAA
Ga0318557_1042542423300031795SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCD
Ga0318523_1020740013300031798SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAE
Ga0318565_1035833923300031799SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGE
Ga0318568_1091019013300031819SoilMQGQRQIVLRLIAVTAVGCALVACSSDLGLNNVTLVPKPETLLRKPDWTSFSGGKNEFALRPVTAADLVNAAGQCT
Ga0307473_1132954923300031820Hardwood Forest SoilMQRFKANALRMFAVAALTAAAGACSSDLSLSNVTLVPKPETMLRKPDWATFSGGKNDFELRPITAADLVGPEGQCGAAAPAQG
Ga0318567_1020746313300031821SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSREGQCSGGDGQAAADSTAGGGLVAGGIALQMT
Ga0318499_1005096843300031832SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADSTATGA
Ga0318527_1039529613300031859SoilMQRLNQIVLRMIAMAMLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCDTATAQAAADPTAAGGSV
Ga0318495_1006521543300031860SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAGPVEK
Ga0318544_1013259813300031880SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLTGVTLVPKPETLLRKPDWANFSGSKNDFELRPVTAADLVSAEGQCDSGAGQAAADPTAGGGLVAGGIALQMTECDVVR
Ga0318536_1004058213300031893SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQ
Ga0310916_1001320863300031942SoilMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAGPVE
Ga0214473_1239624013300031949SoilMSRDWAMRRSSRIVRRMVVLAALAAIVGACSSDLSLNNVTLAPKPESILRKPDWATFSGSKNDFELRPITPADLVSPEGQCASAAADQAT
Ga0318530_1014874813300031959SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDELRPVTAADLVSAEGQCDTATAQAAADPTAAGGSVTGGIALQM
Ga0318563_1060726623300032009SoilVAMQRVNQIVLRMIVMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKSDFELRPVTAADLVGAEGQCNSRGGEAAA
Ga0318569_1050992023300032010SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCDSAA
Ga0318549_1024606023300032041SoilMQRLNQIVLRMIAMATLACAMAACSSDLSLTGVTLVPKPETLLRKPDWANFSGSKNDFELRPVTAADLVSAEGQ
Ga0318545_1019755623300032042SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVR
Ga0318575_1072581913300032055SoilMQRLNQIVLRLIAMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWATFSGGKNDFELRPVTAADLVSAEGQCDSAAGQAAADPTAAGGTVAGGIALQM
Ga0318505_1028317923300032060SoilVAMQRVNQIVLRMIAMATLACAMAACSSDLSLNGFTLVPKPETLLRKPDWTTFSGGKNEFELRPVTAADLVGAEGQCNSRGGEAAADPTATGAAVSGGIALQMTECDVVRRAGPVEK
Ga0318577_1014680213300032091SoilVAMQRVNQIVLRMIVMATLACAMAACSSDLSLNGVTLVPKPETLLRKPDWTTFSGGKSDFELRPVTAADLVGAEGQCNSRGGEAAADSTATGATVSGGIALQMTECDVVRRAGPVEKIDIGSDER
Ga0268251_1051176823300032159AgaveMQRLNQIVLRIIAMATLGAAVAACSTDLGLNNVTLPKPDTLLRKPDWATFSGGKHDFTLRPVTVADLVNAAGQC


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.