NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F047936

Metagenome Family F047936

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F047936
Family Type Metagenome
Number of Sequences 149
Average Sequence Length 88 residues
Representative Sequence VTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASDLSPSVKTALTAFFGATPLLTKLIAIALLGRPTVNFLKRHSFKLFRRGSGGAD
Number of Associated Samples 108
Number of Associated Scaffolds 149

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 51.01 %
% of genes near scaffold ends (potentially truncated) 20.13 %
% of genes from short scaffolds (< 2000 bps) 89.93 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction Yes
3D model pTM-score0.32

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (97.315 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(24.161 % of family members)
Environment Ontology (ENVO) Unclassified
(28.188 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(47.651 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.
1deepsgr_02122020
2ICChiseqgaiiFebDRAFT_108364901
3INPhiseqgaiiFebDRAFT_1006131162
4INPhiseqgaiiFebDRAFT_1054709671
5F14TC_1046824932
6JGI1027J11758_122256952
7JGI12053J15887_101813362
8P5cmW16_10000204
9P5cmW16_10047502
10JGIcombinedJ26739_1007286622
11Ga0055440_101843331
12Ga0063356_1010684452
13Ga0062595_1000661061
14Ga0062595_1004461092
15Ga0062595_1004523782
16Ga0062591_1011471611
17Ga0062594_1004493911
18Ga0062594_1012687432
19Ga0068993_101200001
20Ga0065707_108681171
21Ga0066388_1016315702
22Ga0066388_1031832782
23Ga0068869_1004981172
24Ga0070682_1004982413
25Ga0070711_1005544882
26Ga0070711_1013621191
27Ga0070694_1010313592
28Ga0070663_1016478991
29Ga0070678_1012324931
30Ga0070672_1011609821
31Ga0070695_1003852542
32Ga0070704_1003728071
33Ga0070704_1009538732
34Ga0068866_101399513
35Ga0068863_1015708082
36Ga0068858_1002127696
37Ga0068858_1011547441
38Ga0068860_1012327922
39Ga0081455_108956882
40Ga0075026_1010902792
41Ga0070715_106243112
42Ga0105245_128240092
43Ga0105247_101750872
44Ga0105247_117530101
45Ga0105248_119387011
46Ga0105249_106548582
47Ga0105249_110960771
48Ga0105062_10548042
49Ga0126380_100046173
50Ga0126384_121396381
51Ga0126382_100686563
52Ga0126377_106234583
53Ga0134125_107197032
54Ga0134128_109588952
55Ga0134128_120045761
56Ga0105239_104672131
57Ga0105239_106118061
58Ga0134126_120692151
59Ga0134127_103861122
60Ga0134121_101785011
61Ga0134123_102304401
62Ga0120156_10253013
63Ga0120114_10002699
64Ga0120163_10392312
65Ga0137362_116352131
66Ga0137387_108738941
67Ga0137361_113060551
68Ga0164300_100418824
69Ga0164300_100524801
70Ga0164300_104299181
71Ga0164300_107647912
72Ga0164300_107866471
73Ga0164298_100157514
74Ga0164298_100627402
75Ga0164298_101714963
76Ga0164303_100017318
77Ga0164303_100408012
78Ga0164303_102068031
79Ga0164299_105460301
80Ga0164299_106666581
81Ga0164301_101932242
82Ga0164301_105543901
83Ga0164302_100596123
84Ga0164302_100822663
85Ga0164302_101813332
86Ga0164309_108953692
87Ga0164308_114734791
88Ga0164304_102616383
89Ga0164304_104413802
90Ga0164304_107134031
91Ga0164304_113194202
92Ga0164305_102837562
93Ga0164305_108675141
94Ga0164305_117224141
95Ga0157378_119972112
96Ga0163162_131835742
97Ga0120154_11280832
98Ga0120111_10010879
99Ga0157376_117160841
100Ga0137403_107549791
101Ga0132258_1004143713
102Ga0132258_107300233
103Ga0132258_110163845
104Ga0132256_1000201717
105Ga0184608_100240072
106Ga0184608_100654353
107Ga0184608_101011861
108Ga0184619_102588952
109Ga0184609_102001581
110Ga0184609_102330511
111Ga0190272_102833741
112Ga0224452_12488072
113Ga0222623_102237901
114Ga0207688_102216962
115Ga0207663_106296402
116Ga0207704_112128501
117Ga0207689_104651151
118Ga0207712_119680371
119Ga0207640_115968461
120Ga0207640_117096771
121Ga0207658_115232381
122Ga0207677_112440301
123Ga0207648_111321421
124Ga0207675_1006928932
125Ga0256867_101517432
126Ga0256866_10150811
127Ga0256865_10528162
128Ga0209118_10371592
129Ga0268264_107694701
130Ga0247828_111810021
131Ga0299907_102005442
132Ga0268386_104965881
133Ga0302046_104521661
134Ga0307501_101401032
135Ga0307497_105155591
136Ga0299913_108438691
137Ga0299913_112723961
138Ga0310886_100438664
139Ga0307469_101145782
140Ga0307469_120416672
141Ga0307468_1005411182
142Ga0307473_108988371
143Ga0214473_100767732
144Ga0214473_102112705
145Ga0214473_115299171
146Ga0307470_101033312
147Ga0307470_111516102
148Ga0310889_102466962
149Ga0307471_1043281001
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 66.37%    β-sheet: 0.00%    Coil/Unstructured: 33.63%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

1020304050607080VTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASDLSPSVKTALTAFFGATPLLTKLIAIALLGRPTVNFLKRHSFKLFRRGSGGADCytopl.Extracel.Cytopl.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.32
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
97.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Natural And Restored Wetlands
Groundwater Sediment
Watersheds
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Switchgrass Rhizosphere
Soil
Permafrost
Soil
Hardwood Forest Soil
Soil
Soil
Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Groundwater Sand
Arabidopsis Rhizosphere
Corn Rhizosphere
Tabebuia Heterophylla Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
4.0%24.2%4.7%4.0%4.7%4.7%3.4%5.4%4.0%4.0%4.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
deepsgr_021220202199352025SoilPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVLGSDLTPRVKTALTAFLGSTPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD
ICChiseqgaiiFebDRAFT_1083649013300000363SoilMGAVLIAGGYAAWLLIPLVVASDLSPSVKTALAAFLGATPLLTKVIAIALLGQPTINFLRRHSFKLFRRDSGGAD*
INPhiseqgaiiFebDRAFT_10061311623300000364SoilGSTPSPACEMDTAAMSRIPRLPDTFRWRLAIGAVLIAGSYLAWLMIPLVVSSGLSPKVKTALTAVLGATPLATKFIAVALLGRPTINYLKKHPLRLFRGESD*
INPhiseqgaiiFebDRAFT_10547096713300000364SoilVAAFTVSRLPRTTDTFRWRLAIGAVLIAGGYLAWLVIPLVVGSDLTPRVKTALMAFLGATPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD*
F14TC_10468249323300000559SoilMGAVLIAGGYAAWLLIPLVVASDLSPSVKTALAAFLGATPLLTKVIAIALLGQPTINFLRRHSFKLLQGFWRR*
JGI1027J11758_1222569523300000789SoilMPRAPRVTDTFRWQLAMGAVLIAGGYAAWLLIPLVVASDLSPSVKTALAAFLGATPLLTKVIAIALLGQPTINFLRRHSFKLFRRDSGGAD*
JGI12053J15887_1018133623300001661Forest SoilMTDTFRWRLAIGAVLIAGGYAAWLLIPLVVASDLSPSVKTALTALFGATPLLTKLIAIGLLGRPTINFLKSHSFKLFRRDSGNAD*
P5cmW16_100002043300001664PermafrostMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGATPLLTKFIAIGLLGRPTINFLKTHSFKLFRRDSGSAD*
P5cmW16_100475023300001664PermafrostMGHDTMTRAPRVTDTFRSLAIGAVLVAGGYAAWLIIPLIVASDLSPSVKTALTAFFGATPPLTKLIAIALLGRPTINFLKRHSSKLFRRDSSSAD*
JGIcombinedJ26739_10072866223300002245Forest SoilMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGVTPLLTKFIAIGLLGRPTINFLKTHSFKLFRRDSGS
Ga0055440_1018433313300004020Natural And Restored WetlandsPKNEVWRFARFRRRVSQLLRMAAGGSRERDEQAPRVTDTFRWRLAIGALLIVCGYAAWLIIPLVVASDLRPTAKTTLTAFLDATPLLTKLIAIGLLGRPTINFLKKHPFKLFRQDSGGAD
Ga0063356_10106844523300004463Arabidopsis Thaliana RhizosphereVTRPPRKTDTFRWRLAIGAILIAGGYGAWLIIPLVVGSDLTPSIKTALTAVLGATPLLTKLLAIALLGRPTIDFLKKHSFNLFRNNSGAAD*
Ga0062595_10006610613300004479SoilMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTAVLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0062595_10044610923300004479SoilMSGTRAAATFRRRLAIGAVLIAGGYTAWLIIPFVATSDLSPGVKTAVTAFLGATPLLTKLIAIGLLGRPTVDYLKRHSFKWFDRGSGRR*
Ga0062595_10045237823300004479SoilMPRAPRVTDTFRWRLAMGAVLIAGGYAAWLLIPLVVASDLSPSVKTALAAFLGATPLLTKVIAIALLGQPTINFLRRHSFKLFRRDSGGAD*
Ga0062591_10114716113300004643SoilRLAIGAVLIAGGYTAWLIIPFVATSDLSPGVKTAVTAFLGATPLLTKLIAIGLLGRPTVDYLKRHSFKWFGRGSGPD*
Ga0062594_10044939113300005093SoilMEAAAMSKIPRLPDTFRWRLAIGAVLIAGSYLAWLIIPVVVSSGLSPEVKTALTAVLGATPLATKFIAFALLGRPTINYLKKHPLKLFRRESGPAD*
Ga0062594_10126874323300005093SoilMNRTPRRRDTFRWRLALGAILVAGGYAAWLLIPLVVASDLNSNVKAALTACLGATPFLTKLIAIALLGRPTIMFLKRHFFGLFRGGSGST*
Ga0068993_1012000013300005183Natural And Restored WetlandsMAAGGSRERDEQAPRVTDTFRWRLAIGALLIVCGYAAWLIIPLVVASDLRPTAKTTLTAFLDATPLLTKLIAIGLLGRPTINFLKKHPFKLFRQDSGGAD*
Ga0065707_1086811713300005295Switchgrass RhizosphereMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTAVLGATPLATKFIAVALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0066388_10163157023300005332Tropical Forest SoilMTDTFRWRLAIGAVLIAGGYFAWLIIPLVAASELSPSAKTVLTAFFGATPLLTKLIAIALLGRPAINFLTKHSFNLLRRDLGGAD*
Ga0066388_10318327823300005332Tropical Forest SoilMTDTFRWRLAIGAVLIAGGYVAWLIIPLVGASELSPSAKTVLTAFFGATPLLTKLVAIALLGRPTINFLKKHSFKLLGGAD*
Ga0068869_10049811723300005334Miscanthus RhizosphereMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0070682_10049824133300005337Corn RhizosphereMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0070711_10055448823300005439Corn, Switchgrass And Miscanthus RhizosphereMSKRAVRAKFKARRGPKPPRRTDTFRWRLVIGAILIAGGYGAWLIIPLVVGSHLTPNIKTALTAVLGATPLLTKLLAIALLGRPTIDFLKRHSFKLLRNDSGNPD*
Ga0070711_10136211913300005439Corn, Switchgrass And Miscanthus RhizosphereVTRTPRRTDTFRWRLAIGAVLIAGGYGAWLTVPLVVASELSPSMKVALTAVLGATPLLTKLLAIALLGRPTVEFLKRHSFKLLPERFRRR*
Ga0070694_10103135923300005444Corn, Switchgrass And Miscanthus RhizosphereAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD*
Ga0070663_10164789913300005455Corn RhizosphereMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTAVLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0070678_10123249313300005456Miscanthus RhizosphereMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINY
Ga0070672_10116098213300005543Miscanthus RhizosphereMSGTRAAATFRRRLAIGAVLIAGGYTAWLIIPFVATSDLSPGVKTAVTAFLGATPLLTKLIAIGLLGRPTVDYLKRHSFKWFDRGSG
Ga0070695_10038525423300005545Corn, Switchgrass And Miscanthus RhizosphereMIRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD*
Ga0070704_10037280713300005549Corn, Switchgrass And Miscanthus RhizosphereRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0070704_10095387323300005549Corn, Switchgrass And Miscanthus RhizosphereMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD*
Ga0068866_1013995133300005718Miscanthus RhizosphereLGDTSKGVGFGRRPFGKVRCRLTRRPRRTDTFRWRLAIGAILIAGGYGAWLIIPFVAGANLTSGIKTALTAVLGVTPLLTKLLAIALLGRPTIDFLKRHPLKPFRNDSRRAG*
Ga0068863_10157080823300005841Switchgrass RhizosphereVLIAGGYTAWLIIPFVATSDLSPGVKTAVTAFLGATPLLTKLIAIGLLGRPTVDYLKRHSFKWFDRGSGRR*
Ga0068858_10021276963300005842Switchgrass RhizosphereLGDTSKGVGFGRRPFGKVRCRLTRRPRRTDTFRWRLAIGAILIAGGYGAWLIIPFVAGANLTSGIKTALTAVLGVTPLLTKLLAIALLGRPTIDFLKRHALKPFRNDSRRAG*
Ga0068858_10115474413300005842Switchgrass RhizosphereMTDTFRWRLAIGAALVASGYAAWLVIPVVVASDLSPSIKSGLGAFLGATPLLTKLIAVALLGRPTINFLKKHSFKLFRRSVGGGE*
Ga0068860_10123279223300005843Switchgrass RhizosphereMTDTFRWRLAIGAALVASGYAAWLVIPVVVASDLSPSIKSGLGAFLGATPLLTKLIAVVLLGRPTIDFLKKHSFKLFRRSMGGDD*
Ga0081455_1089568823300005937Tabebuia Heterophylla RhizosphereLAIGAVLIAGAWLTIPVVVASDLSPKITAALTAVLGATPLLTKLLAIALLGRPTIEFLKKHSFKLLPERFRRR*
Ga0075026_10109027923300006057WatershedsVTDAFRWRLAIGAVLIAGGYAAWLIIPIVVASDLNPSIKSGLAAFLDATPLLTKLIAVALLGRPTINFLKKHSFKLFRRSAGGGD*
Ga0070715_1062431123300006163Corn, Switchgrass And Miscanthus RhizosphereMNRAPRLSDTFRWRLAIGAAFIAGGYLAWSIIPLVVASDLSPSAKTALAAFLGATPLLTKLIAIALLGRPTIDFLKKHSFKLFHRDSGTAD*
Ga0105245_1282400923300009098Miscanthus RhizosphereGRRPFGKVRCRLTRRPRRTDTFRWRLAIGAILIAGGYGAWLIIPFVAGANLTSGIKTALTAVLGVTPLLTKLLAIALLGRPTIDFLKRHPLKPFRNDSRRAG*
Ga0105247_1017508723300009101Switchgrass RhizosphereLRVTRRPRRTDTFRWRLAIGAILIAGGYGAWLIIPFVAGANLTSGIKTALTAVLGVTPLLTKLLAIALLGRPTIDFLKRHPLKPFRNDSRRAG*
Ga0105247_1175301013300009101Switchgrass RhizosphereMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLKLFRRESGPAD*
Ga0105248_1193870113300009177Switchgrass RhizosphereMSGTRAAATFRRRLAIGAVLIAGGYTAWLIIPFVATSDLGPGVKTAVTAFLGATPLLTKLIAIGLLGRPTVDYLKRHSFKWFNRGSGRR*
Ga0105249_1065485823300009553Switchgrass RhizosphereLAIGAVIVAGGYAAWLIIPLVVASGLSPGIKSGLAIFLGTTPLLMKLIAIALLGRPTINFLKKHSFKLFRRSVGGGE*
Ga0105249_1109607713300009553Switchgrass RhizosphereMSRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVVGSDLSPRVKTALTAFLGATPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD*
Ga0105062_105480423300009817Groundwater SandMTDTFRWRLAFGAVLIAGGYGAWLIIPLVVASDLSPSVKTALTAFLGAIPLLTKLLALALLGRPTIKRSSFKLFR*
Ga0126380_1000461733300010043Tropical Forest SoilMTDTFRWRLAIGAVLIAGGYFAWLIIPLVAASELSPSAKTVLTAFFGATPLLTKLVAIALLGRPTINFLKKHSFNLLRRDLGGAD*
Ga0126384_1213963813300010046Tropical Forest SoilLAIGAVLIAGGYFAWLIIPLVAASELSPSAKTVLTAFFGATPLLTKLVAIALLGRPTINFLKKHSFKLLRRDLGGAD*
Ga0126382_1006865633300010047Tropical Forest SoilMTDTFRWRLAIGAVLIAGGYFAWLIIPLVAASELSPSAKTVLTAFFGATPLLTKLVIALLGRPTINFLKKHSFNLLRRDLGGAD*
Ga0126377_1062345833300010362Tropical Forest SoilLAIGAVLIAGGYFAWLIIPLVAASELSPSAKTVLTAFFGATPLLTKLVAIALLGRPTINFLKKHSFNLLRRDLGGAD*
Ga0134125_1071970323300010371Terrestrial SoilMNRPRRTDTFRWRLTIGAILVAGGYLAWLIIPSVVASDLAPSIKTVLTAVLGATPLLTKLLAIALLGRPTINLLKKHSFKLFRRDS
Ga0134128_1095889523300010373Terrestrial SoilMTDTFRWRLAIGAVIVAGGYAAWLIIPLVVASGLSPGIKSGLAIFLGTTPLLMKLIAIALLGRPTINFLKKHSFKLFRRSVGGGE*
Ga0134128_1200457613300010373Terrestrial SoilMEAAAMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD
Ga0105239_1046721313300010375Corn RhizosphereMEAAAMTRMPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0105239_1061180613300010375Corn RhizosphereLAIGAVIVAGGYAAWLIIPLVVASGLSPGIKSGLAIFLGTTPLLMKLIAIALLGRPTINFLKKHSFKLF
Ga0134126_1206921513300010396Terrestrial SoilLAIGAVIVAGGYAAWLIIPLVVASGLSPGIKSGLAIFLGTTPLPMKLIAIALLGRPTINFLKKHSFKLFRRSVGGGE*
Ga0134127_1038611223300010399Terrestrial SoilMDAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKTHPLRLFRGKSGPAD*
Ga0134121_1017850113300010401Terrestrial SoilMSKLPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0134123_1023044013300010403Terrestrial SoilMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLRTHPLRLFRGKSGPD*
Ga0120156_102530133300011996PermafrostMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGATPLLTKFIAIGLLGRPTINFLKTHSFKLFRGDSGSAD*
Ga0120114_100026993300011998PermafrostMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGATLTKFIAIGLLGRPTINFLKTHSFKLFRGDSGSAD*
Ga0120163_103923123300012003PermafrostMTDMFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGATPLLTKFIAIGLLGRPTINFLKTHSFKLFRRDSGSAD*
Ga0137362_1163521313300012205Vadose Zone SoilMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGVTPLLTKLIAIGLLGRPTINFLKTHSFKLFRRDSGSAD*
Ga0137387_1087389413300012349Vadose Zone SoilMTDTFRWRLAFGAVLIAGGYGAWLIIPLVVASDLSPSVKTALTAFLGATPLLTKLLAIALVGRPTIDFLKRHSFKLFRKDTGGA*
Ga0137361_1130605513300012362Vadose Zone SoilVNIDEMSRVPRMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGVTPLLTKLIAIGLLGRPTINFLKTHSFKLFRRDSGSAD*
Ga0164300_1004188243300012951SoilLAIGAVLIAGGYLAWLVIPLVLGSDLTPRVKTALTAFLGSTPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD*
Ga0164300_1005248013300012951SoilLAIGAAFIAGGYLAWLIIPLVVASDLSPSAKTALAAFLGATPLLTKLIAIALLGRPTIDFLKKHSFKLFHRDSGTAD*
Ga0164300_1042991813300012951SoilLVVRRIVWGGGAGFSRLELTRTRWRLVIIAGGYGAWLIIPLVVGSHLTPNIKTALTALLGATPLLTKLLAIALLGRPTIDFLKRHSFKLLRNDSGNPD*
Ga0164300_1076479123300012951SoilMTDTFRWRLAIGAALVASGYAAWLVIPVVVASDLSPSIKSGLGAFLGATPLLTKLIAVVLLGRPTIDFLKKHSFNLCRRSMGGDD*
Ga0164300_1078664713300012951SoilMEAAAMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQH
Ga0164298_1001575143300012955SoilVTAPTVTRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVLGSDLTPRVKTALTAFLGSTPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD*
Ga0164298_1006274023300012955SoilMEAAAMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD*
Ga0164298_1017149633300012955SoilMTDTFRWRLAIGAALVASGYAAWLVIPRLGAFLGATPLLTKLIAVVLLGRPTIDFLKKHSFKLFRRSMGGDD*
Ga0164303_1000173183300012957SoilVTRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVLGSDLTPRVKTALTAFLGSTPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD*
Ga0164303_1004080123300012957SoilLAIGAAFIAGGYLAWLIIPLVVASDLSPSAKTALAAFLGATPLLTKLIAIALLGRPTIDFLIYLSFYVYHRDSGTAD*
Ga0164303_1020680313300012957SoilMSKRAARARLEARRVSRPPRRTDTFRWRLVIGAILIAGGYGAWLIIPLVVGSHLTPNIKTALTAVLGATPLLTKLLAIALLGRPTIDFLKRHSFKLLRNDSGNPD*
Ga0164299_1054603013300012958SoilMEAAAMTRMPRLPDTFRWRLAMGAVLIAGSYLAWFIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD*
Ga0164299_1066665813300012958SoilMTDTFRWRLAIGAALVASGYAAWLVIPVVVASDLSPSIKSGLGAFLGATPLLTKLIAVVLLGRPTIDFLKKHSFKLFRRSM
Ga0164301_1019322423300012960SoilVRAKFKARRVPRPPRRTDTFRWRLVIGAILIAGGYGAWLIIPLVVGSHLTPNIKTALTALLGATPLLTKLLAIALLGRPTIDFLKRHSFKLLRNDSGNPD*
Ga0164301_1055439013300012960SoilMDTAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTAVLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD*
Ga0164302_1005961233300012961SoilLIAGGYGAWLIIPFVVGSDLTPSIKTALTAVLGATSLLTKLFAIALLGRPTIDFLKRRSFKLFRKNSRAD*
Ga0164302_1008226633300012961SoilLAIGAAFIAGGYLAWLIIPLVVASDLSSSAKTALAAFLGATPLLTKLIAIALLGRPTIDFLKKHSFKLFHRDSGTAD*
Ga0164302_1018133323300012961SoilLVIGAILIAGGYGAWLIIPLVVGSHLTPNIKTALTAVLGATPLLTKLLAIALLGRPTIDFLKRHSFKLLRNDSGNPD*
Ga0164309_1089536923300012984SoilLAIGAVLIAGGYLAWLVIPTVVGSDLSPRFKTALTACLGATPLLTKLIAIALLGRPTINLLKKHSFKPFRRAAAAD*
Ga0164308_1147347913300012985SoilPRTTATFRWRLAIGAVLIAGGYLAWLVIPTVVGSDLSPRFKTALTACLGATPLLTKLIAIALLGRQTINLLKKHSFKPFRRAAAAD*
Ga0164304_1026163833300012986SoilMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTAVLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSVPAD*
Ga0164304_1044138023300012986SoilRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKKALKAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD*
Ga0164304_1071340313300012986SoilLIIGAILIAGGYGAWLIIPLVVGSHLTPNIKTALTALLGATPLLTKLLAIALLGRPTIDFLKRHSFKLLRNDSGNSD*
Ga0164304_1131942023300012986SoilGAVLIAGGYLAWLVIPTVVGSDLSPRFKTALTACLGATPLLTKLIAIALLGRPTINFLKKHSFKLFRRAAAAD*
Ga0164305_1028375623300012989SoilLAIGAAFIAGGYLAWLIIPLVVASDLSPSAKTALAAFLGATPLLTKLIAIVLLGRPTIDFLKKHSFKLFHRDSGTAD*
Ga0164305_1086751413300012989SoilVTDAFHWRLANGAVLIVGGYAASLIIPLVVASDLSPTIKSGLAAFLAATPLLTKLMAVALLGRPTINFLKKPSFKLFRLSAGGGD*
Ga0164305_1172241413300012989SoilVLIAGSYLAWLIIPVVVSSGLSPEVKTALTAVLGATPLATKFIALALLGRPTINYLKKHPLKLFRRESGPAD*
Ga0157378_1199721123300013297Miscanthus RhizosphereMSGTRAAATFRRRLAIGAVLIAGGYTAWLIIPFVATSDLSPGVKTAVTAFLGATPLLTKLIAIGLLGRPTVDYLKRHSFKWFDRGS
Ga0163162_1318357423300013306Switchgrass RhizosphereLAIGHRAVIVAGGYAAWFIIPLVLGSDLSHQVGASAFLGATPLLTKLIAVALLGRPTINFLKKHSFKLFRRSVGGGE*
Ga0120154_112808323300013501PermafrostQVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGATPLLTKFIAIGLLGRPTINFLKTHSFKLFRRDSGSAD*
Ga0120111_100108793300013764PermafrostMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGATLTKFIAIGLLGRPTINFLKTHSFKLFRRDSGSAD*
Ga0157376_1171608413300014969Miscanthus RhizosphereLTRRPRRTDTFRWRLAIGAILIAGGYGAWLIIPFVAGANLTSGIKTALTAVLGVTPLLTKLLAIALLGRPTIDFLKRHPLKPFRNDSRRAG*
Ga0137403_1075497913300015264Vadose Zone SoilMTDTFRWRLAIGAVLVAGGYAAWLIIPLVVASGLSPGVKTALTALFGVTPLLTKLIAIGLLGRPTINFLKTHSFKLFRRDSGSAD*
Ga0132258_10041437133300015371Arabidopsis RhizosphereLNWEGVGFGRRPFGKVRCRLTRRPRRTDTFRWRLAIGAILIAGGYGAWLIIPFVAGANLTSGIKTALTAVLGVTPLLTKLLAIALLGRPTIDFLKRHPLKPFRNDSRRAG*
Ga0132258_1073002333300015371Arabidopsis RhizosphereMGALLVAGGYAAWFIIPLVVASDLSPTVKTALTAFFGATPLLSKLIAIALLGRPTINFLKRHYLKLFHRDSAGGQ*
Ga0132258_1101638453300015371Arabidopsis RhizosphereVKTNIVIETNTLGPSLSRRVTRTPRRTDTFRWRLAIGAVLIAGAYGAWLTIPLVVASDLSPSIKAALIAVLGATPLLTKLLAIALLGRPTIEFLKKHSFKLLPERFGRR*
Ga0132256_10002017173300015372Arabidopsis RhizosphereLNWEGVGFGRRPFGKVRCRLTRRPRRTDTFRWRLSIGAILIAGGYGAWLIIPFVAGANLTSGIKTALTAVLGVTPLLTKLVAIALLGRPTIDFLKRHPLKPFRNDSRRAG*
Ga0184608_1002400723300018028Groundwater SedimentMSRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVLGSDLTPRVKTALTAFLGATPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD
Ga0184608_1006543533300018028Groundwater SedimentMRRIPRLPDTFRWRLAIGPVLIAGGYLAWLIIPLVVSSGLSPKVKTALAAVLGATPLATKFIAVALLGRPTINYLKKHPLGLFRGKSGPAD
Ga0184608_1010118613300018028Groundwater SedimentVTRTPRKTDTFRWRLAFGAVLIAGGYGAWLIIPLVVASDLSPSVKTTLTALLGATPLLTKLLAIALLGRPTIDFLKRHSFKLFRKESGGAD
Ga0184619_1025889523300018061Groundwater SedimentMSRPSRVIDTFRWRLAIGALLIVGGYAAWLIIPLVVASDLTPTAKTTLTAFLGATPLLTKLIAIGLLGRPTINFLKRHPFKLFRQDSGGAD
Ga0184609_1020015813300018076Groundwater SedimentMSRAPRVTDTFRWRLAIGAVLIAGGYAAWLIIPFVVASDLSPSVKTALTAFFGATPLLTKLIAIALLGRPTINFLKRHSFKLFRRDSGGAD
Ga0184609_1023305113300018076Groundwater SedimentMSRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVVGSDLSPRVKTALTAFLGATPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD
Ga0190272_1028337413300018429SoilLAIGAVLIASGYGAWLIVPLVIASGLSPSVKTALTAFLGVTPLLTKLLAIALLGRPTINFLKLHSFKLFRKDSGAAD
Ga0224452_124880723300022534Groundwater SedimentRIPRLPDTFRWRLAIGPVLIAGSYLAWLIIPLVVSSGLSPKVKTALAAVLGATPLATKFIAVALLGRPTINYLKKHPLRLFRGKSGPAD
Ga0222623_1022379013300022694Groundwater SedimentMRRIPRLPDTFRWRLAIGPVLIAGSYLAWLIIPLVVSSGLSPKVKTALAAVLGATPLATKFIAVALLGRPTINYLKKHPLGLFRGKSGPAD
Ga0207688_1022169623300025901Corn, Switchgrass And Miscanthus RhizosphereMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD
Ga0207663_1062964023300025916Corn, Switchgrass And Miscanthus RhizosphereVTDAFHWRLANGAVLIVGGYAAWLIIPLVAASDLSPTIKSGLAAFLATTPLLTKLIAVALLGRPTINFLKKHSFKLFRLSAGGGD
Ga0207704_1121285013300025938Miscanthus RhizosphereMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTAVLGATPLATKFIAVALLGRPTINYLKKHPLRLFRGKSGPAD
Ga0207689_1046511513300025942Miscanthus RhizosphereMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTAVLGATPLATKFIAVALLGRPTINYLKTHPLRLFRGKSGPAD
Ga0207712_1196803713300025961Switchgrass RhizosphereMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD
Ga0207640_1159684613300025981Corn RhizosphereMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTINYLKQHPLRLFRGKSGPAD
Ga0207640_1170967713300025981Corn RhizosphereMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD
Ga0207658_1152323813300025986Switchgrass RhizosphereMSGTRAAATFRRRLAIGAVLIAGGYTAWLIIPFVATSDLSPGVKTAVTAFLGATPLLTKLIAIGLLGRPTVDYLKRHSFKWFERGSGRR
Ga0207677_1124403013300026023Miscanthus RhizosphereMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTLNYLKQHPLRLFRGKSGPAD
Ga0207648_1113214213300026089Miscanthus RhizosphereMTRMPRLPDTFRWRLAMGAVLIAGSYLAWLIIPLVVSSGLSPDVKTALTAVLGATPLATKFIAVALLGRPTINYLKKHPLRLFRGKSGPAD
Ga0207675_10069289323300026118Switchgrass RhizosphereMTDTFRWRLAIGAALVASGYAAWLVIPVVVASDLSPSIKSGLGAFLGATPLLTKLIAVVLLGRPTIDFLKKHSFKLFRRSMGGDD
Ga0256867_1015174323300026535SoilVTRAPRKTDTFRWRLAIGAVLIAGGYGAWLIVPLVVASNLSPSVKTALTAFLSATPLLTKLLAIALLGRPTINFLKRHSFKLFRNDSSRR
Ga0256866_101508113300027650SoilLTIGAALIAGGYAAWLIIPIVVASELNPNVKTTLTVFLGATPLLTKLIAIALLGRPTINFLKRHFKLFRRDSGGAH
Ga0256865_105281623300027657SoilMSRAPRVTDTFRWRLAIGAVLIAGGYGAWLIIPLVVASDLSPSVKTALTAFFGATPLLTKLIAIALLGRPTVNFLKRHSFKLFRRGSGGAD
Ga0209118_103715923300027674Forest SoilMTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASGLSPGVKTALTALFGVTPLLTKFIAIGLLGRPTINFLKTHSFKLFRRDSGSAD
Ga0268264_1076947013300028381Switchgrass RhizosphereMAAAMSKIPRLPDTFRWRLAIGAVLIVGSYLAWLIIPLVVSSGLSPKAKTALTALLGATPLATKFIAIALLGRPTINYLKKHPLRLFRGKSGPAD
Ga0247828_1118100213300028587SoilMSRLPRVTDTFRWRLAIGAVLIAGGYLAWLVIPLVVGSDLSPRVKTALTAFLGATPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD
Ga0299907_1020054423300030006SoilVTDTFRWRLAIGAALIAGGYAAWLIIPIVVASELNPNVKTTLTVFLGATPLLTKLIAIALLGRPTINFLKRHFKLFRRDSGGAH
Ga0268386_1049658813300030619SoilVTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASDLSPSVKTALTAFFGATPLLTKLIAIALLGRPTVNFLKRHSFKLFRRGSGGAD
Ga0302046_1045216613300030620SoilMSRVPRLTDTFRWRLAIGAGLIAGGYAAWLIIPFVVASDLSPTVKTALTAFLGATPLLTKLIAIGLLGRPTINFLKRHSFKLFRDSGGAG
Ga0307501_1014010323300031152SoilMSKRASSAKLEARRVPRPPRRTDTFRWRLAIGAILVAGGYGAWLIIPLVVGSNLTPSIKTALTAALGATPLLTKLLAIALLGRPTIDFLKKHSFK
Ga0307497_1051555913300031226SoilVLIVGGYGAWLTIPLVVASDLSPSIKAALIAVLGATPLLTKLLAIALLGRPTIEFLKKHSFKLLPERFGRR
Ga0299913_1084386913300031229SoilLTIGAALIAGGYAAWLIISIVVASELNPNVKTTLTVFLGATPLLTKLIAIALLGRPTINFLKRHFKLFRRDSGGAH
Ga0299913_1127239613300031229SoilVRWRLAIGAVLIAGGYGAWLIVPLVVASNLSPSVKTALTAFLSATPLLTKLLAIALLGRPTINFLKRHSFKLFRNDSSRR
Ga0310886_1004386643300031562SoilVTAPTVTRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVLGSDLTPRVKTALTAFLGSTPLLTKLIAIALLGRPTINFLKKHSFKLFRRAGAAD
Ga0307469_1011457823300031720Hardwood Forest SoilMSRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVVGSDLSPRVKTALTAFLGATPLLTKLIAIALLGRPTINFLKKHSFKLLRRAGAAD
Ga0307469_1204166723300031720Hardwood Forest SoilMTDTFRWRLVIGAVLVASGYAAWLVIPVVVASDLGPSIKSGLAAFLGATPLLTKLIAIALLGRPTINFLKKHSFRLFRRSVGDGD
Ga0307468_10054111823300031740Hardwood Forest SoilMKAAAMSRIPRLPDTFRWRLAIGAVLIAGSYLAWLIIPLVVSSGLSPKVKAALTAVLGATPLATKVIAVALLGRPTINYLKKHPLKLFRRESGPAD
Ga0307473_1089883713300031820Hardwood Forest SoilMKAAAMSRIPRLPDTFRWRLAIGAVLIAGSYLAWLIIPLVVSSGLSPKVKAALTAVLGATPLATKVIAVALLGRPTLNYLKKHPLKLFRRESGPAD
Ga0214473_1007677323300031949SoilMTRPPRMTDTFRWRLAFGAVLIAGGYGAWLIIPLVVASDLSPSVKTALTAFLGAIPLLTKLLALALLGRPTINFLKRRSFKLFR
Ga0214473_1021127053300031949SoilMSRAPRVTDTFRWRLAIGAVLIAGGYAAWLIIPLVVASDLSPSVKTALTAFFGATPLLTKLIAIALLGRPTVNFLKRHSFKLFRRGSGGAD
Ga0214473_1152991713300031949SoilVTDTFRWRLAIGALLIVGGYAAWLIIPLVVASDLRPTAKTTLTAFLSATPLLTKLIAIGLLGRPTINFLKRHPFKLFRQDSGGAD
Ga0307470_1010333123300032174Hardwood Forest SoilMSRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVVGSDLSPRVKTALTAFLGATPLLTKLIAIALLGRPTINFLKKHSFKLFRPAGAAD
Ga0307470_1115161023300032174Hardwood Forest SoilLAIGAVLIAGSYLAWLIIPLVVSSGLSPKVKAALTAVLGATPLATKVIAVALLGRPTINYLKKHPLKLFRRESGPAD
Ga0310889_1024669623300032179SoilVTAPTVTRLPRATDTFRWRLAIGAVLIAGGYLAWLVIPLVLGSDLTPRVKTALTAFLGSTPLLTKLIAIALFGRPTINFLKKHSFKLFRRAGAAD
Ga0307471_10432810013300032180Hardwood Forest SoilMTDAFRWRLAIGAILIAGGYAAWLIIPVVVASDLSPDIKSGLAAFLGATPLLTKLIAIALLGRPTINFLKKHSFRLFRR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.