NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F053606

Metagenome / Metatranscriptome Family F053606

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F053606
Family Type Metagenome / Metatranscriptome
Number of Sequences 141
Average Sequence Length 76 residues
Representative Sequence MMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQD
Number of Associated Samples 119
Number of Associated Scaffolds 141

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 84.40 %
% of genes near scaffold ends (potentially truncated) 20.57 %
% of genes from short scaffolds (< 2000 bps) 88.65 %
Associated GOLD sequencing projects 109
AlphaFold2 3D model prediction Yes
3D model pTM-score0.38

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (56.738 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(19.149 % of family members)
Environment Ontology (ENVO) Unclassified
(36.879 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(44.681 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.
1KansclcFeb2_05545190
2KansclcFeb2_10016130
3ICCgaii200_05043021
4ICCgaii200_10220511
5ICCgaii200_10230103
6INPgaii200_11845373
7ICChiseqgaiiFebDRAFT_111046401
8INPhiseqgaiiFebDRAFT_1014718661
9INPhiseqgaiiFebDRAFT_1014725952
10AL20A1W_11393833
11JGI11643J12802_101896571
12JGI11643J12802_102176892
13JGI10216J12902_1060942281
14F14TB_1019683712
15JGI25390J43892_100277292
16JGI25390J43892_101072311
17Ga0062593_1004005061
18Ga0062593_1026838651
19Ga0063356_1035621461
20Ga0062591_1023700002
21Ga0062594_1006143242
22Ga0062594_1007424613
23Ga0062594_1010616441
24Ga0070668_1007940221
25Ga0070659_1007816881
26Ga0070659_1017071242
27Ga0070709_108160191
28Ga0070709_112537602
29Ga0070714_1003355462
30Ga0070713_1004885351
31Ga0070711_1004044932
32Ga0070705_1006636712
33Ga0070700_1008252011
34Ga0070694_1016305871
35Ga0070708_1000576763
36Ga0070707_1013101751
37Ga0070699_1013540812
38Ga0070697_1007105853
39Ga0070696_1001337893
40Ga0066704_110189561
41Ga0068855_1004198251
42Ga0068855_1018242991
43Ga0068861_1023831141
44Ga0070715_100634644
45Ga0070716_1000741784
46Ga0070716_1001673962
47Ga0070712_1000106695
48Ga0075420_1006181211
49Ga0066710_1016653661
50Ga0066709_1015105681
51Ga0105243_101632035
52Ga0105243_126613152
53Ga0105242_104858732
54Ga0105249_109066912
55Ga0134062_103411052
56Ga0134124_127420082
57Ga0134127_124150891
58Ga0134123_106079342
59Ga0126349_12921202
60Ga0126348_10384362
61Ga0138514_1000198571
62Ga0137433_10926431
63Ga0120139_11938742
64Ga0137364_107001622
65Ga0137383_100529334
66Ga0137383_101500602
67Ga0137382_113012141
68Ga0137365_100250386
69Ga0137365_104812462
70Ga0137376_113723521
71Ga0137370_106312431
72Ga0137372_104986531
73Ga0137384_100101855
74Ga0137360_110464131
75Ga0137358_103950122
76Ga0157286_104579381
77Ga0164300_103513802
78Ga0164299_103099741
79Ga0164299_114287701
80Ga0164301_103104982
81Ga0164301_108450321
82Ga0164302_107417981
83Ga0120111_11176271
84Ga0157376_105979032
85Ga0137405_11208411
86Ga0134089_103173062
87Ga0132258_117469453
88Ga0163161_120944261
89Ga0184605_100174152
90Ga0184608_100708124
91Ga0184620_100074253
92Ga0184619_100389742
93Ga0184619_101416191
94Ga0184618_100764781
95Ga0066655_101134572
96Ga0190269_114285241
97Ga0173481_102017662
98Ga0173482_107242121
99Ga0193704_10239081
100Ga0193729_100279210
101Ga0247786_11638661
102Ga0247801_10881432
103Ga0209751_104295253
104Ga0207642_100465491
105Ga0207688_108895791
106Ga0207684_101663213
107Ga0207671_104449132
108Ga0207693_100052181
109Ga0207693_106602591
110Ga0207646_113843522
111Ga0207700_104079482
112Ga0207690_114227462
113Ga0207706_102196482
114Ga0207709_106344942
115Ga0207669_113226462
116Ga0207665_100411684
117Ga0207665_109068062
118Ga0207708_113702992
119Ga0207676_109291641
120Ga0209350_10243832
121Ga0209027_10299602
122Ga0268265_109218191
123Ga0307317_100448972
124Ga0307316_100134492
125Ga0307280_100957892
126Ga0307306_100772571
127Ga0307305_100892102
128Ga0307292_101174972
129Ga0307310_106417172
130Ga0307312_101863442
131Ga0307289_100126356
132Ga0307289_103453462
133Ga0307278_104649602
134Ga0308178_11803832
135Ga0307497_107215691
136Ga0308194_101819261
137Ga0310887_102099363
138Ga0247727_100227473
139Ga0310813_106118852
140Ga0310900_104502962
141Ga0307471_1016852882
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 21.90%    β-sheet: 14.29%    Coil/Unstructured: 63.81%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

10203040506070MMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQDSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.38
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
57.4%42.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Soil
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Soil
Soil
Permafrost
Soil
Grasslands Soil
Hardwood Forest Soil
Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Soil
Biofilm
Soil
Arabidopsis Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Boreal Forest Soil
4.3%19.1%9.2%4.3%9.2%5.0%17.0%2.8%2.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
KansclcFeb2_055451902124908045SoilMMGRIKAVGTATAVTLFLAWPASAGARVKWVCDVPGEGPVTFVSASDAARHGIDTANAHAGQTFNRNFGEVCSVVQD
KansclcFeb2_100161302124908045SoilMMGRIKAVGTATAVALFLAWPAPAGASVNWVCNVPGEGLVTFVSVSDAARHGIDTANAHAGQTFNRQFGEVCTVVQD
ICCgaii200_050430212228664021SoilMGRIKALGTATAVTLFLAWPASAGASVKWVCNVPGEGSVTFVSVADAARHGIDTANAHAGQTFNRQFDEVCTVVQD
ICCgaii200_102205112228664021SoilMXGWIKAVGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRNFGEVCTVVQD
ICCgaii200_102301032228664021SoilMMGWIKAVGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRNFGEVCTVVQD
INPgaii200_118453732228664022SoilMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGXVTFVSVSDAARHGIDTANAHAGQTFNRNFGEVCTVVQD
ICChiseqgaiiFebDRAFT_1110464013300000363SoilMRPKVLLGAMAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA*
INPhiseqgaiiFebDRAFT_10147186613300000364SoilMMGRIKAMGAAAAVALFLAWPASAGARVCDVPGEGPVTFVSVPDAAQHGIETANAHAGQTFNLRFGEVCTVVQN*
INPhiseqgaiiFebDRAFT_10147259523300000364SoilMMGRIKAMGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNRQFGEVCTVVQN*
AL20A1W_113938333300000880PermafrostMMGRIKAMGTATAVTLFLAWPASAGASVRWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEVCTVVQN*
JGI11643J12802_1018965713300000890SoilMGRIKALGTATAVTLFLAWPASAGASVKWVCNVPGEGSVTFVSVADAARHGIDTANAHAGQTFNRQFDEVCTVVQD*
JGI11643J12802_1021768923300000890SoilMMGWIKAVGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRNFGEVCTVVQD*
JGI10216J12902_10609422813300000956SoilATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG*
F14TB_10196837123300001431SoilMMGRIKAVGTATAVTLFLAWPASAGARVKWVCDVPGEGPVTFVSASDAARHGIDTANAHAGQTFNRNFGEVCSVVQD*
JGI25390J43892_1002772923300002911Grasslands SoilMMGLIKAVGTATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEVCTVVQH*
JGI25390J43892_1010723113300002911Grasslands SoilMMGRIKAVGTATAVALFLAWPASAGARVNWVCNVPGEGLVTFVSVSDAARHGIDTANAHAGQTFNRQFGEECTVVQTDH*
Ga0062593_10040050613300004114SoilLGAIAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA*
Ga0062593_10268386513300004114SoilMMGRIKAMGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0063356_10356214613300004463Arabidopsis Thaliana RhizosphereMMGWIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRNFGEVCTVVQN*
Ga0062591_10237000023300004643SoilMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRNFGEVCTVVQD*
Ga0062594_10061432423300005093SoilMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0062594_10074246133300005093SoilITAMGAAAAVTLFLAWPTSAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0062594_10106164413300005093SoilMKRRMSAAIGWIALVLLMAWPSAAGANVNWVCNVPGEGPVVFVSAADAARHGLETANAHAGQTFNRNFGEVCTVVQG*
Ga0070668_10079402213300005347Switchgrass RhizosphereVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG*
Ga0070659_10078168813300005366Corn RhizosphereMMGWIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAAGHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0070659_10170712423300005366Corn RhizosphereMKRRMSAAIGWIALVLLMAWPSAAGANVNWVCNVPGEGPVVFVSAADAARHGLETANAHAGQTFNR
Ga0070709_1081601913300005434Corn, Switchgrass And Miscanthus RhizosphereMMGWIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0070709_1125376023300005434Corn, Switchgrass And Miscanthus RhizosphereMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN*
Ga0070714_10033554623300005435Agricultural SoilMIGRIKAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVPVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN*
Ga0070713_10048853513300005436Corn, Switchgrass And Miscanthus RhizosphereGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAAQHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0070711_10040449323300005439Corn, Switchgrass And Miscanthus RhizosphereMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVKD*
Ga0070705_10066367123300005440Corn, Switchgrass And Miscanthus RhizosphereMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAAGHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0070700_10082520113300005441Corn, Switchgrass And Miscanthus RhizosphereMMQAKVLLGATAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIVTANAHAGQTFLRNFGEVCTVENA*
Ga0070694_10163058713300005444Corn, Switchgrass And Miscanthus RhizosphereMMGWIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG*
Ga0070708_10005767633300005445Corn, Switchgrass And Miscanthus RhizosphereMMGRIKALEAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0070707_10131017513300005468Corn, Switchgrass And Miscanthus RhizosphereMIGRIKAIGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0070699_10135408123300005518Corn, Switchgrass And Miscanthus RhizosphereMMGRIKALGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0070697_10071058533300005536Corn, Switchgrass And Miscanthus RhizosphereMGRIKAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGSVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN*
Ga0070696_10013378933300005546Corn, Switchgrass And Miscanthus RhizosphereMMGRIKAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN*
Ga0066704_1101895613300005557SoilMKRRIKATVGASAIVLAAAMPTSAGATVNWVCVVPGVGTVTFVSAPDAARHGIETANAHAGQTFYRNFGEVCTVVQA*
Ga0068855_10041982513300005563Corn RhizosphereAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVQG*
Ga0068855_10182429913300005563Corn RhizosphereAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG*
Ga0068861_10238311413300005719Switchgrass RhizosphereMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG*
Ga0070715_1006346443300006163Corn, Switchgrass And Miscanthus RhizosphereMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAAQHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0070716_10007417843300006173Corn, Switchgrass And Miscanthus RhizosphereMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAAGHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0070716_10016739623300006173Corn, Switchgrass And Miscanthus RhizosphereMMGRVKAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN*
Ga0070712_10001066953300006175Corn, Switchgrass And Miscanthus RhizosphereVKETKRAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAAQHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0075420_10061812113300006853Populus RhizosphereKTAMAGIVALALAVVWPSAASARVNWVCDVPGEGTVVFVSAADAARHGLNTANAHAGQTFNRQFGEVCTVVSG*
Ga0066710_10166536613300009012Grasslands SoilMKRFVISMLGSSALTLSLASPAGATVRWVCTVPEEGDVTFVSAPDAAAHGIETANSHAGQTFAARFGEECTVRP
Ga0066709_10151056813300009137Grasslands SoilMKRRIKAMVGASAIVLAAAMPTTAGATVNWVCVVPGVGTVTFVSAPDAARHGIETANAHAGQTFNRNFGEVCTVVQA*
Ga0105243_1016320353300009148Miscanthus RhizosphereMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRNFGEVCTVVQG*
Ga0105243_1266131523300009148Miscanthus RhizosphereMQAKVLLGATAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA*
Ga0105242_1048587323300009176Miscanthus RhizosphereMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG*
Ga0105249_1090669123300009553Switchgrass RhizosphereMMGWIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0134062_1034110523300010337Grasslands SoilMKRRIKALVGASAIVLAAAMPTSAGATVNWVCVVPGVGTVTFVSAPDAARHGIETANAHAGQTFNRNFGEVCTVVQA*
Ga0134124_1274200823300010397Terrestrial SoilMMGRIKAMGAAAAVTLFLAWPTSAGAKVMWVCDVPGEGSVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN*
Ga0134127_1241508913300010399Terrestrial SoilMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRNFGEVCTGVQ
Ga0134123_1060793423300010403Terrestrial SoilMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN*
Ga0126349_129212023300010861Boreal Forest SoilMMGRIRAMGAAAAVTLFLAWPASAGARVIWVCAVPGEGPVTFVSVPDAAQHGIETANAHAGQTFNRQFGEVCTVVQN*
Ga0126348_103843623300010862Boreal Forest SoilMGRIRAMGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAAQHGIETANAHAGQTFNRQFGEVCTVVQN*
Ga0138514_10001985713300011003SoilMTRCIKAMVGASAIVLAAAMPTSAGATVNWVCVVPGEGSVTFVSAPDAARHGIETANAHAGQTFNRNFGEVCTVVQA*
Ga0137433_109264313300011440SoilMKGRLKAMVGAFAIVLAVAMPTSAGATVNWVCEVPGEEPVTFVSVPDAARYGIETANAHAGQTFNRKFGEVCAVVQV*
Ga0120139_119387423300012019PermafrostMMGRIKAMGTATAVTLFLAWPASAGASVRWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVQE*
Ga0137364_1070016223300012198Vadose Zone SoilMTGRIKAVGTATAVALFLAWPASAGARVNWVCNVPGEGLVTFVSVSDAARHGIDTANAHAGQTFNRQFGEECTVVQTDH*
Ga0137383_1005293343300012199Vadose Zone SoilMMGRIKAVGTAAAVTLFLAWPASAGASVKWVCDVPGEGAVTFVSVSDAPRHGIDTANARAGQTFNRKFGEVCTVVQD*
Ga0137383_1015006023300012199Vadose Zone SoilMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEVCTVVQH*
Ga0137382_1130121413300012200Vadose Zone SoilMKGQVKASIGGIVLVLVMAWPSSARANVNWVCNVPGEGTVVFVSAADAARHGLETANAHAGQTFNRNFGEVCTVVKG*
Ga0137365_1002503863300012201Vadose Zone SoilMMGRIKAVGTAAAVTLFLAWPASAGASVKWVCDVPGEGPVTFVSVSDAARHGIDTANARAGQTFNRKFGEVCTVVQD*
Ga0137365_1048124623300012201Vadose Zone SoilMMGRIKAVGTATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEVCTVVQH*
Ga0137376_1137235213300012208Vadose Zone SoilMIGRIKAMGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAAQHGIETANAHAGQTFYRQFGEVCTVVQN*
Ga0137370_1063124313300012285Vadose Zone SoilMMGLIKAVGTATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEECTVVQTDH*
Ga0137372_1049865313300012350Vadose Zone SoilVKRQIKAMVGAIAIVLAVAMPTAAGATVNWVCVVPGEGIVTFVSAPDAARHGIDTANAHAGQTFNRNFGEVCTVVQV*
Ga0137384_1001018553300012357Vadose Zone SoilMMGLIKAVGTATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGPTFNRQFGEECTVVQTDH*
Ga0137360_1104641313300012361Vadose Zone SoilMMGRIKAIGAAGAVTLFLAWPASAGARVMWVCDVPGEGLVIFVSVPDAARHGIDTANAHAGQTFYRQFGEVCTVVQN*
Ga0137358_1039501223300012582Vadose Zone SoilMMGRIRAIGTATAVTLFLAWPASAGARVTWVCNVPGEGLVTFVSVPDAARHGIDTANAHAGQTFYRQFGEVCTVVQN*
Ga0157286_1045793813300012908SoilMQPKVLLGAIAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA*
Ga0164300_1035138023300012951SoilMMGRIRAIGIATAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNRQFGEVCTVVQN*
Ga0164299_1030997413300012958SoilETLFLAWPASAGATVMWVCNVPGVGPVTFVSVPDAARHGIETANAHAGQTFNLRFGEVCTVVQN*
Ga0164299_1142877013300012958SoilMMGRIKALGSATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0164301_1031049823300012960SoilMMGRIKALGSATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVHKTDD*
Ga0164301_1084503213300012960SoilMMGRIRAIGIATAVTLFLAWPASAGAGVKWVCDVPGEGSVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVQD*
Ga0164302_1074179813300012961SoilMTGRIKAMGAAAAVTLFLAWPASAGAGVKWVCDVPGEGSVTFVSVPDAARHGIDTANAHAGETCNRKCGEVCTVVQD*
Ga0120111_111762713300013764PermafrostMKRRIKAMVGASAIVLAAAMPTSAGATVNWVCVVPGVGTVTFVSAPDAARHGIETANAHAGQTFNRNFGEVCTVVQA*
Ga0157376_1059790323300014969Miscanthus RhizosphereLKAMVGTFAIVLAVAMPTSAGATLNWVCEVPGEGTVTFVSVPDAARHGIETANAHAGQTFNRKFGEVCTIVQG*
Ga0137405_112084113300015053Vadose Zone SoilMMGLIKAVGTATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANSHAGQTLNRQFGEVCTVVQH*
Ga0134089_1031730623300015358Grasslands SoilATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEVCTVVQH*
Ga0132258_1174694533300015371Arabidopsis RhizosphereMQPKVLLGAVAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA*
Ga0163161_1209442613300017792Switchgrass RhizosphereMQAKVLLGATAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA
Ga0184605_1001741523300018027Groundwater SedimentMRRIKAMGTATAVTLFLAWPASAGASVKWVCDVPGEGSVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0184608_1007081243300018028Groundwater SedimentMGRIKAVGTATAVILFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0184620_1000742533300018051Groundwater SedimentMGRIKAVGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0184619_1003897423300018061Groundwater SedimentMMGWIKAMGTAAAVTLFLAWPASAGASVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQD
Ga0184619_1014161913300018061Groundwater SedimentMMGRIKAVGAATAVTLFLAWPSSAGASVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQD
Ga0184618_1007647813300018071Groundwater SedimentMMGRIKAMGTATAVTLFLAWPASAGASVKWVCDVPGEGSVTFVSVSDAARHGIDTANAHAGQTFNRQFGEVCTVVQD
Ga0066655_1011345723300018431Grasslands SoilMKRRIKALVGASAIVLAAAMPTSAGATVNWVCVVPGVGTVTFVSAPDAARHGIETANAHAGQTFNRNFGEVCTVVQA
Ga0190269_1142852413300018465SoilMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIETANAHAGQTFNRKFGEVCTVVQG
Ga0173481_1020176623300019356SoilMLPKVLLGAIAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA
Ga0173482_1072421213300019361SoilMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0193704_102390813300019867SoilIKAMGTATAVTLFLAWPASAGASVKWVCDVPGEGSVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0193729_1002792103300019887SoilMTGRIKAMGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAAQHGIETANAHAGQTFNRQFGEVCTVVQN
Ga0247786_116386613300022883SoilMLPKVLLGAIAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLGNFGEVCTVENA
Ga0247801_108814323300023064SoilMQPKVLLGAIAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA
Ga0209751_1042952533300025327SoilMKRSIKAMVGIIVLTLTLTLPASAGATVKWVCVVPGVGDVTFVSVPDAALHGITQANLKAGETFRNQFGEECRVV
Ga0207642_1004654913300025899Miscanthus RhizosphereMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRNFGEVCTVVQD
Ga0207688_1088957913300025901Corn, Switchgrass And Miscanthus RhizosphereLFREGCMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0207684_1016632133300025910Corn, Switchgrass And Miscanthus RhizosphereMMGRIKAMGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLRFGEVCTVVQN
Ga0207671_1044491323300025914Corn RhizosphereMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0207693_1000521813300025915Corn, Switchgrass And Miscanthus RhizosphereIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAAGHGIDTANAHAGQTFNRKFGEVCTVVQD
Ga0207693_1066025913300025915Corn, Switchgrass And Miscanthus RhizosphereTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN
Ga0207646_1138435223300025922Corn, Switchgrass And Miscanthus RhizosphereRKRVLATFRTFQGVAMMGRIKALGAAAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLRFGEVCTVVQN
Ga0207700_1040794823300025928Corn, Switchgrass And Miscanthus RhizosphereMMGRVKAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQT
Ga0207690_1142274623300025932Corn RhizosphereMKRRMSAAIGWIALVLLMACPSAAGANVNWVCNVPGEGPVVFVSAADAARHGLETANAHAGQTFNRNFGEVC
Ga0207706_1021964823300025933Corn RhizosphereMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0207709_1063449423300025935Miscanthus RhizosphereVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0207669_1132264623300025937Miscanthus RhizosphereARSRGWVTMQAKVLLGATAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA
Ga0207665_1004116843300025939Corn, Switchgrass And Miscanthus RhizosphereMMGRVKAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNLRFGEVCTVVQN
Ga0207665_1090680623300025939Corn, Switchgrass And Miscanthus RhizosphereMMGRIKALGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAAGHGIDTANAHAGQTFNRKFGEVCTVVQD
Ga0207708_1137029923300026075Corn, Switchgrass And Miscanthus RhizosphereMMQAKVLLGATAVTLLLAWPASASARVNWVCDVPGEGRVVFVSASDAARHGIVTANAHAGQTFLRNFGEVCTVENA
Ga0207676_1092916413300026095Switchgrass RhizosphereWIALVLLMAWPSAAGANVNWVCNVPGEGPVVFVSAADAARHGLETANAHAGQTFNRNFGEVCTVVQG
Ga0209350_102438323300026277Grasslands SoilMMGLIKAVGTATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEVCTVVQH
Ga0209027_102996023300026300Grasslands SoilMMGRIKAVGTATAMAVFLAWPASAGASVSWVCQVSGEEPVTFVSVPDAARHGIDTANAHAGQTFNRQFGEVCTVVQH
Ga0268265_1092181913300028380Switchgrass RhizosphereMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNR
Ga0307317_1004489723300028720SoilVGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0307316_1001344923300028755SoilMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0307280_1009578923300028768SoilMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQD
Ga0307306_1007725713300028782SoilMMARIKAAGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTF
Ga0307305_1008921023300028807SoilMMARIKAAGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0307292_1011749723300028811SoilMRRIKAIGTATAVTLFLAWPASAGASVKWVCDVPGEGSVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0307310_1064171723300028824SoilMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEV
Ga0307312_1018634423300028828SoilMMGRIKALGTATAATLFLAWPASAGANVKWVCDVPGEGPVTFVSVPDAARHGIDTANAHAGQTFNRNFGEVCTVVQD
Ga0307289_1001263563300028875SoilKAVGTATAVILFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0307289_1034534623300028875SoilMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTF
Ga0307278_1046496023300028878SoilMMGWIKAVGTATAVILFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRNFGEVCTVVQD
Ga0308178_118038323300030990SoilFRGGLGMMSRIKAVGTVTAVTLFLAWPASASARVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0307497_1072156913300031226SoilMTGRLKAMVGTFAIVLAVAMPTSAGATLNWVCEVPGEGTVTFVSVPDAARHGIETANAHAGQTFNRKFGEVCTIVQV
Ga0308194_1018192613300031421SoilVSHFSGEGCMMGRIKAVGAATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0310887_1020993633300031547SoilMQAKVLLGALAVTILLVWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA
Ga0247727_1002274733300031576BiofilmMRRRDLIAFKAIVGVIALTLILAWPSSAGATVKWVCDVPGEGLVTFVSVPDAARHGIDTANQHAGATFNKQFGEVCTVVQD
Ga0310813_1061188523300031716SoilMMGWIKAVGTATAVTLFLAWPASAGANVKWVCDVPGEGPVTFVSVSDAARHGIDTANAHAGQTFNRKFGEVCTVVQG
Ga0310900_1045029623300031908SoilMQAKVLLGALAVTLLLVWPASASARVNWVCDVPGEGRVVFVSASDAARHGIETANAHAGQTFLRNFGEVCTVENA
Ga0307471_10168528823300032180Hardwood Forest SoilMMGRIKAMGAGAAVTLFLAWPASAGARVMWVCDVPGEGPVTFVSVPDAALHGIETANAHAGQTFNLQFGEVCTVVQN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.