NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F046938

Metagenome Family F046938

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F046938
Family Type Metagenome
Number of Sequences 150
Average Sequence Length 90 residues
Representative Sequence DMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA
Number of Associated Samples 95
Number of Associated Scaffolds 150

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 4.83 %
% of genes near scaffold ends (potentially truncated) 71.33 %
% of genes from short scaffolds (< 2000 bps) 76.00 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (42.667 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(29.333 % of family members)
Environment Ontology (ENVO) Unclassified
(64.667 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(68.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.
1JGI25381J37097_10112331
2JGI25385J37094_101644102
3JGI25385J37094_101912852
4JGI25384J37096_100750651
5JGI25384J37096_101122562
6JGI25382J37095_100023581
7JGI25382J37095_101232331
8JGI25382J37095_101684971
9JGI25382J37095_102395032
10JGI25382J43887_100320641
11JGI25382J43887_102030861
12JGI25382J43887_102852402
13JGI25382J43887_103316281
14JGI25382J43887_104571652
15JGI25388J43891_10235573
16JGI25386J43895_100690632
17JGI25386J43895_100961092
18JGI25386J43895_101564402
19JGI25386J43895_101794472
20JGI25617J43924_100750301
21Ga0066674_100830263
22Ga0066672_102337651
23Ga0066679_100193531
24Ga0066679_103338372
25Ga0066688_100553915
26Ga0066688_107039831
27Ga0066688_107817112
28Ga0066678_100406671
29Ga0066676_107184651
30Ga0066686_102457832
31Ga0066689_100371311
32Ga0066701_105718082
33Ga0066701_107863411
34Ga0066661_102297351
35Ga0066707_102173662
36Ga0066704_105866662
37Ga0066704_107759751
38Ga0066704_108088771
39Ga0066704_109171151
40Ga0066698_102473891
41Ga0066698_102575621
42Ga0066700_105770501
43Ga0066703_100833411
44Ga0066703_102842101
45Ga0066703_108067571
46Ga0066706_110176471
47Ga0066658_102747751
48Ga0066665_101116201
49Ga0066659_100379524
50Ga0066659_108374801
51Ga0079221_118078991
52Ga0079220_102737881
53Ga0099791_103722342
54Ga0099793_100036367
55Ga0099794_106653522
56Ga0066710_1012109161
57Ga0066710_1025650581
58Ga0066710_1031943601
59Ga0066710_1041642031
60Ga0099829_101883451
61Ga0099829_101916731
62Ga0099830_101997811
63Ga0075423_128698352
64Ga0134088_100207611
65Ga0134088_102343621
66Ga0134088_103669321
67Ga0134111_102296631
68Ga0134071_100807993
69Ga0137392_102560333
70Ga0137391_104731571
71Ga0137391_108900901
72Ga0137391_110914462
73Ga0137388_107311902
74Ga0137388_109501862
75Ga0137365_107071571
76Ga0137363_110781061
77Ga0137363_116496911
78Ga0137399_104422452
79Ga0137380_100352011
80Ga0137380_102889012
81Ga0137380_108457291
82Ga0137380_115902112
83Ga0137381_101353541
84Ga0137381_102737961
85Ga0137379_101831382
86Ga0137378_101416984
87Ga0137378_101991451
88Ga0137378_115045011
89Ga0137386_104086862
90Ga0137384_100992882
91Ga0137368_101825251
92Ga0137385_104367172
93Ga0137385_108990541
94Ga0137385_114334741
95Ga0137360_110553321
96Ga0137396_107766202
97Ga0137416_101270703
98Ga0137410_101466272
99Ga0137410_105865241
100Ga0134077_100520841
101Ga0134077_100742323
102Ga0134077_102408321
103Ga0134077_104023351
104Ga0134076_100159173
105Ga0134076_101466632
106Ga0134076_101920141
107Ga0134087_105173831
108Ga0134075_100182023
109Ga0137418_113262191
110Ga0137409_101378702
111Ga0134089_100347101
112Ga0134069_11198462
113Ga0134112_103334692
114Ga0134083_101615062
115Ga0187803_104006372
116Ga0066667_100449094
117Ga0066667_100895293
118Ga0066662_101720721
119Ga0210404_101555981
120Ga0207646_109314231
121Ga0209350_10214941
122Ga0209235_100324210
123Ga0209237_10226981
124Ga0209236_10084614
125Ga0209236_12607431
126Ga0209055_10061763
127Ga0209239_10766951
128Ga0209761_100191114
129Ga0209761_10067376
130Ga0209801_10742822
131Ga0209801_10998813
132Ga0209803_11803532
133Ga0209158_12206322
134Ga0257177_10802151
135Ga0257181_10861421
136Ga0209378_10914362
137Ga0209806_10126526
138Ga0209160_100625513
139Ga0209160_12428631
140Ga0209058_11708432
141Ga0209056_1000962910
142Ga0209577_101266011
143Ga0209689_10738653
144Ga0209689_11948551
145Ga0209180_100126201
146Ga0209180_102831261
147Ga0209180_103418452
148Ga0307469_109096671
149Ga0307477_108094522
150Ga0307479_105473901
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 48.67%    β-sheet: 0.00%    Coil/Unstructured: 51.33%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

1020304050607080DMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKASequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
60.0%40.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Vadose Zone Soil
Grasslands Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Populus Rhizosphere
28.0%12.0%29.3%23.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_101123313300002557Grasslands SoilTFEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEGKATEPRALLGGTPRQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25385J37094_1016441023300002558Grasslands SoilGRSTTIEKLSQTITNFRDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
JGI25385J37094_1019128523300002558Grasslands SoilGRNPEEMKWILRVHNPLEGEKATEPRALLGGTPQQAVEDLPGLKELGIDHVFYDMNHPAQVPIETQLALLRRLVRLIKP*
JGI25384J37096_1007506513300002561Grasslands SoilRDMVRRAGRNPEEMKRILRVHNPLSKEKATEPRTLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25384J37096_1011225623300002561Grasslands SoilNNFRDMVGRAGRKPEEMKWILRVHNPLVEEKATEPPALLGGTPQQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
JGI25382J37095_1000235813300002562Grasslands SoilVHNVLDEEKAAEPRALLGGTPQQAAKDLPGLKDLGIDHVFYDMNHPAQVPIDNQLLLLRRLMRLIKN*
JGI25382J37095_1012323313300002562Grasslands SoilGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEEKATEPRALLGGTPQQAAKDLPRLKELGIDHVFYDMNHPAQVPXDTQLVLLRXLVRLIKA*
JGI25382J37095_1016849713300002562Grasslands SoilFRDMVRRAGRSPEEMKWILRVHDPLDEEKASEPRALLGGTPQQAAKDLPRLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLINAS*
JGI25382J37095_1023950323300002562Grasslands SoilDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
JGI25382J43887_1003206413300002908Grasslands SoilINNFRDMVRRAGRNPEEMKWILRVHNPLGEGKATEPRALLGGTPRQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25382J43887_1020308613300002908Grasslands SoilTNFRDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
JGI25382J43887_1028524023300002908Grasslands SoilAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEEKATEPRALLGGTPQQAAKDLPRLKELGIDHVFYDMNHPAQVPVDTQLVLLRKLVRLIKA*
JGI25382J43887_1033162813300002908Grasslands SoilQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKA*
JGI25382J43887_1045716523300002908Grasslands SoilLRVHNPLGEEKATEPRALLGGTPRQAAEDLPRLKELGIEHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA*
JGI25388J43891_102355733300002909Grasslands SoilRDMVRRAGRNPEEIRWILRVHNPLSEEKGTEPRALLGGTSQQAAEDLPRLEELGIDDVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25386J43895_1006906323300002912Grasslands SoilEMKWILRVHNPLDEEKATEPRALLGGTPQQAAEDLPRLNELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25386J43895_1009610923300002912Grasslands SoilAARIADGIMPAAGRSTTIEKLSQTINNFGDMVRRAGRNPEEMKWILRVHNPLDEEKATEPRALLGGTPQQAAKDLPKLKELGIDHVFYDMNHPAQVPIDTQLVLLRKLVRLIKA*
JGI25386J43895_1015644023300002912Grasslands SoilAAGRGTTIEKLGQTINNFRDMVRRAGRHPEEMKWILRVHNPLSEEKAXEPRALXGGXPQXAAXDXPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25386J43895_1017944723300002912Grasslands SoilARIADGIMPAAGRSTTIEKLSXTINNFRDMVRRAGRSPEEMKWILRVHNPLDEEKASEPRALLGGTPQQAAEDLPRLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLINAS*
JGI25617J43924_1007503013300002914Grasslands SoilERQARIADGIMPAAGRSTTIEKLSQTINNFHDVVRRAGRNPREIRWILRVHNSLEKKTTEPRPLLGSTPQQAAKDLPRLKDIGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKRPRNALVGSDLFLVPAAKSVWLVSLGF*
Ga0066674_1008302633300005166SoilWILRVHNPLDKEKATEPRPLLGGTPQQAAEELPRLEELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0066672_1023376513300005167SoilMPAAAGSTTIEKLSQTIKDFHEKVRRAGRNPEEMKWILRVHNPLSGEKATEPRALLGGTPQQAAEDLPRLKELGIDHIFYDMNHPAQVPK*
Ga0066679_1001935313300005176SoilWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA*
Ga0066679_1033383723300005176SoilQTIKDFREKVRRAGRNPEEMKWILRVHNPLSGEKATEPRALLGGTPQQAAEDLPRLKELGIDHIFYDMNHPAQVPK*
Ga0066688_1005539153300005178SoilIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA*
Ga0066688_1070398313300005178SoilIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIMWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA*
Ga0066688_1078171123300005178SoilRNPEEMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA*
Ga0066678_1004066713300005181SoilRIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLALLRRLMRLIKA*
Ga0066676_1071846513300005186SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLV
Ga0066686_1024578323300005446SoilLEEKKAAESRALLGGTPEQAAKDLPRLKELGVDHVFYDMNHPAHVPIDTQLVLLRRLVELIKD*
Ga0066689_1003713113300005447SoilTINNFREMVRRAGRNPDEMIWILRVHNPLDEEKATEPRALLGGTPQQAAEDLTRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0066701_1057180823300005552SoilEEMKWILRVHNPLAEEKATEPPALLGGTPEQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066701_1078634113300005552SoilTINNFRDMVRRAGRHPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRKFVRLTKA*
Ga0066661_1022973513300005554SoilAARIADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRSPDEMKWILRVHNPLDEEKASEPRALLGGAPQQAATDLPRLRELGIDHVFYDMNHPAHVPVETQLVLLRRLVRLIKASGA*
Ga0066707_1021736623300005556SoilMKWILRVHNPLTEEKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0066704_1058666623300005557SoilKLSQTINSFQDMVRRAGRKPEEMKWILRVHNPLYEEKASEPRALLGGTPQQVAKDFPRVKELGIDHVFYDMNHPAHVPIDSQLVLLRRLVRLIKNN*
Ga0066704_1077597513300005557SoilADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLEEKATEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLTKA*
Ga0066704_1080887713300005557SoilLSQTINSFQDMVRRAGRKPEEMKWFLRVHNPLYEEKASEPRALLGGTPQQAAGDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066704_1091711513300005557SoilDGIMPAAGRSATIEKLSQTINTFQDMVRRAGRKPEEMKWILRVHNPLAEEKATEPHALLGSTPEQAAKDFLRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066698_1024738913300005558SoilDMVRRAGRNSKEMKWILRVHNPMYEEKAAEPRALLGGTPQQAAEDLSRVRELGIGHVFYDMNHPAHVPIDTQLVLLRRLVRLIKP*
Ga0066698_1025756213300005558SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0066700_1057705013300005559SoilDMVRRAGRKPEEMKWILRVHNPLYEEKASEPRALLGGTPQQVAKDFPRVKELGIDHVFYDMNHPAHVPIDSQLVLLRRLVRLIKNN*
Ga0066703_1008334113300005568SoilRIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA*
Ga0066703_1028421013300005568SoilLERAARIADGIMPTAGRNTTIEKLSQTINNFRDIVRRAGHNPEGIKWILRVHNPLEEEKATETRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQVPIDTQLALLRRLVRLIKA*
Ga0066703_1080675713300005568SoilGRKPEEMKWILRVHNPLAEEKATEPHALLGSTPEQAAKDFLRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066706_1101764713300005598SoilSLERAARIADGIMPAAAGSTMIEKLSQTINNFRDMVRRAGRHPEEMKWILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA*
Ga0066658_1027477513300006794SoilTIEKLSQTINSFRDMVRRAGRNPEEIMWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA*
Ga0066665_1011162013300006796SoilNNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKA*
Ga0066659_1003795243300006797SoilMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA*
Ga0066659_1083748013300006797SoilARLADGIMPAAARSATIEKLSQTINNFHDMARSAGRKPEEMKWILRVHNPLFEEKATEPGALLGGTPQQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0079221_1180789913300006804Agricultural SoilNPDELKWILRAHNTLNEEKASEPRPLLGGTPQQAVNDLPRLKELGIDHVFYDMNHPAQVPMETQLALLRRLVKLIKA*
Ga0079220_1027378813300006806Agricultural SoilMVRKAGRNPNEIKWILRVHNPLDEWKTSEPRGLLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAHVPIDTQLVLLRKLLKLAKS*
Ga0099791_1037223423300007255Vadose Zone SoilRWILRVHNPLTEEKAAEPRPLLGGTPQQAAKDLPRIKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVQLIKN*
Ga0099793_1000363673300007258Vadose Zone SoilMVRRAGRNPEELKWILRVHNTLDEEKATEPRPLLGGTPQQAAQDLPRLKELGIGHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
Ga0099794_1066535223300007265Vadose Zone SoilEVDTESPRPLSEEKAAEPRALLGGTPQQAAQDLPRLKELSVDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA*
Ga0066710_10121091613300009012Grasslands SoilRIADGIMPAAGRSTTIEKLNQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIKTQLVLLRRLVRLIKA
Ga0066710_10256505813300009012Grasslands SoilINNFRDMVRKANRNPDEMKWILRVHNVLEEKKAAESRALLGGTPEQAAKDLPRLKELGIDHVFYDMNHPAQVPIGTPLVLLRRLVELIKD
Ga0066710_10319436013300009012Grasslands SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTPLVLLRRLVRLIKA
Ga0066710_10416420313300009012Grasslands SoilEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEGKATEPRALLGGTPRQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0099829_1018834513300009038Vadose Zone SoilARVADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLAEEKVTEPPALLGGTPQQAAKDFARVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0099829_1019167313300009038Vadose Zone SoilAGRSTTIEKFSQTISNFRDMVRRAGRNPEELKWILRVHNPLDEEKAAGPRALLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKK*
Ga0099830_1019978113300009088Vadose Zone SoilAGRTTTIEKLSQTINNFRDMVRRAGRSPDEMKWILRVHNPLDEKKATEPRPLLGGTPQQAADLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLLRLIKA*
Ga0075423_1286983523300009162Populus RhizosphereEEIMRVHDILTGEKAAEPRALLGGTLQQAAEDLPRLKDLGIDPIFYNMNHPAQVPIDTQLSLLTKLIRLIKK*
Ga0134088_1002076113300010304Grasslands SoilEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0134088_1023436213300010304Grasslands SoilDMVQKTDRNPDEMKWILRVHNVPDEEKAAEHRALLGGTPEQAAEDLPRLKELGIDHVFHDMNHPAHVPIDTQLVLLRRLVRLINEWEYAA*
Ga0134088_1036693213300010304Grasslands SoilILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA*
Ga0134111_1022966313300010329Grasslands SoilNNFRDMVRRAGRHLEEMKWILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA*
Ga0134071_1008079933300010336Grasslands SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0137392_1025603333300011269Vadose Zone SoilKLSQTINNFRDMVRKTDRNPDEMKWILRVHNVLDEEKAAETRALLGGTPEQASKDLPILKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKE*
Ga0137391_1047315713300011270Vadose Zone SoilKWILRVHNPLSEEKAAEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAQVPIDTQLVLLGRLVQLIKN*
Ga0137391_1089009013300011270Vadose Zone SoilLDEEKAAGPRALLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKK*
Ga0137391_1109144623300011270Vadose Zone SoilMKWILRVHNPLEEEKASEPRALLGGTPQQASKDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRKLVRLIKH*
Ga0137388_1073119023300012189Vadose Zone SoilMPAAGRSTTIEKLSQTIKDFCDMVRRAGRNPEEMKWILRVHNPLDEKKATEPRPLLGGTPQQAADLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLLRLIKA*
Ga0137388_1095018623300012189Vadose Zone SoilMKWILRVHNVLDEEKAAEPRALLGGTPQQAAKDLPGLKDLGIDHVFYDMNHPAQVPIDTQLLLLRRLMRLIKN*
Ga0137365_1070715713300012201Vadose Zone SoilGMLSQTINNFHDMVRRAGRNPEEMKWILRVHNPLTEEKATEPRTLLGGTPEQAAEDLPRLKELGIDHVFYDMNHPAHVPINTQLVLLRKLMQIINA*
Ga0137363_1107810613300012202Vadose Zone SoilHNPLYEEKAAEPRALLGGTPQQAARDLPKLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLTKT*
Ga0137363_1164969113300012202Vadose Zone SoilERAARIGDGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLEEEKASEPRALLGGTPQQASEDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA*
Ga0137399_1044224523300012203Vadose Zone SoilLSQTINNFRDMVRRAGRDPDELKWILRVHNPLEEEKASEPRALLGGTPQQAAKDLPRLRELGKDHVFYDMNHPAHVPMETQLVLLRRLVRLIKASGS*
Ga0137380_1003520113300012206Vadose Zone SoilMVRKSGRNPDEMKWILRVHNVLDEEKAGDPRPLLGGTPEQAAKDLPRLKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLVKD*
Ga0137380_1028890123300012206Vadose Zone SoilMVRRAGRNPEEMKWILRVHNPLEEEKASEPRALLGGTPQQASEDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRRLVRLIRH*
Ga0137380_1084572913300012206Vadose Zone SoilMVRKADRNPDEMKWILRVHNVLDEEKAADPRALLGGTPEQAAKDFPRLKELGINHVFYDMNHPANVPIDTQLALLRRLVRLIKE*
Ga0137380_1159021123300012206Vadose Zone SoilDGIMPAGGRSTTIEKLSQTIKDFREKVRRAGRNPEEMKWILRVHNPLEGEKATEPRALLGGTPQHAVEDLPRLKELGIDHVFYDMNHPAQVPIETQLALLRRLVRLIKP*
Ga0137381_1013535413300012207Vadose Zone SoilLARAARIADGIMPAGGRSTTIEKLSQTINNFRDMVRRAARNPEEIRWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLMELGIDHVFYDMNHPAQVPVDTQLVLLRRLVRLMKD*
Ga0137381_1027379613300012207Vadose Zone SoilMVQGAGRNPEEMKWILRVHNPLSEEKAKEPRALLGGTPKQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRL
Ga0137379_1018313823300012209Vadose Zone SoilMKWILRVHNPLSEEKAKEPRALLGGTPKQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIRH*
Ga0137378_1014169843300012210Vadose Zone SoilSQTINNFRDMVRKADRNPDEMKWILRVHNVLDEEKAEEPRALLGGAPEQAAEDLLRLKELGIDHVFYDMNHPAHVPINTQLALLRRLVRLIQRIESAA*
Ga0137378_1019914513300012210Vadose Zone SoilMVRKADRNPDEMRWILRVHNVLEEEKAAEPRALLGGTPEQAARDLPRLKELRIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKE*
Ga0137378_1150450113300012210Vadose Zone SoilIMPAAAGSTMIEKLSQTINNFRDMVRRAGRHPEEMKWILRVHNPLTEQKATESRTLLGGMPQQAAEDFPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0137386_1040868623300012351Vadose Zone SoilMVRKADRNPDEMKWILRVHNVLEEEKAAEPRALLGGTPEQAARDLPRLKELRIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIK*
Ga0137384_1009928823300012357Vadose Zone SoilMVRKADRNPDEMKWILRVHNVLGEEKAADPRALLGGTPEQAAKDLPRLKELGIDHVFYDINHPAHVPIDTQLVLLRRLVRLIKE*
Ga0137368_1018252513300012358Vadose Zone SoilINNFRALVRRAGRNQDEMRWILRVHNPLDEEKATEDRASLGGAPEQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRKLMRLIKN*
Ga0137385_1043671723300012359Vadose Zone SoilRAARIADGIMPAAGRSTTIEKLSQTIKDFRDMVRRAGRNPEEMKWILRVHNPLDEEKATDPRPLLGGTPQQAATDLPRLKELGIDHTFYDMNHPAQVPIDTQLVLLRRLMRLIKN*
Ga0137385_1089905413300012359Vadose Zone SoilMKWILRVHNPLEEEKASEPRALLGGTPQQASEDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRRLVRLIRH*
Ga0137385_1143347413300012359Vadose Zone SoilARIADGIMPAAGRSTTIEKLNQTINDFRDMVRRAGRNPEEMKWILRVHNPLSKEKATEPRTLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0137360_1105533213300012361Vadose Zone SoilDMVRRAGRSPEEMRWILRVHNPLDEEKATEPRTLLGGTPQQAAKDLPKLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKAQGGSR*
Ga0137396_1077662023300012918Vadose Zone SoilMVRRAGRNPEEIRWILRVHNALDEGKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLALLRRLVRLINA*
Ga0137416_1012707033300012927Vadose Zone SoilMPAAAGGTTIEKLSQTINNFRDMVRIAGRNPEELKWILRVHNPLGEEKATEPRALLGGTPEQAARDLPRLKALGIDHIFYDMNHPAQVPIDTQLVLLRKLVRLIKA*
Ga0137410_1014662723300012944Vadose Zone SoilMKWILRVHNPLGEEKATEPRPLLGGTPQQAAQDLPRLKELGIDHAFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
Ga0137410_1058652413300012944Vadose Zone SoilTINNFRDMVRRAGRNPEEIKWILRVHNPLEEKATEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAHLPIDTQLVLLRKLVRLI*
Ga0134077_1005208413300012972Grasslands SoilMVRRASRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0134077_1007423233300012972Grasslands SoilMVQKADRNPDEMRWILRVHNVLEEEKATEPRALLGGAPEQAVTDLPRLKELGLDHVFYDMNHPAHVPIDTQLVLLRRLVELIKD*
Ga0134077_1024083213300012972Grasslands SoilSQTINNFRDLVRRAGRKPEEMKWILRVHNPLYEEKATEPPALLGGTPQQAAGDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIRNN*
Ga0134077_1040233513300012972Grasslands SoilAGRNPEEMKWILRVHNPLEEKATEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLTKA*
Ga0134076_1001591733300012976Grasslands SoilRAARIADGIMPAAGRSTTIEKLSQTINNFRDIVRRAGRTPEEMKWILRVHNPLAEEKATEPPALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0134076_1014666323300012976Grasslands SoilKWILRVHNPLYEEKPREPPSLLGGTPEQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0134076_1019201413300012976Grasslands SoilILRVHNPLEEKAAEPRALLGGTPQQATEDLPRVRELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLTKA*
Ga0134087_1051738313300012977Grasslands SoilRAGRNPEEIRWILRVHNPLSEEKGTEPRALLGGTSQQAAEDLPRLEELGIDDVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0134075_1001820233300014154Grasslands SoilMVRRASRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPARIPIDTQLVLLRRLVRLIKA*
Ga0137418_1132621913300015241Vadose Zone SoilQTINSFSDMVRRAGRNPEEMKWILRVHNPLTEEKATEPRPLLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDSQLALLRRLVRLIKA*
Ga0137409_1013787023300015245Vadose Zone SoilMKWILRVHNPLGEEKATEPRPLLGGTPQQAAQDLPRLKELGIDHAFYDMNHPAQVPIDTQLVLLRRLMRLIKV*
Ga0134089_1003471013300015358Grasslands SoilMPAAGRSTTIEKLSQTINNFRDLVRKADRNPDEMRWILRVHNVLEEEKATEPRALLGGAPEQAVTDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRMIR*
Ga0134069_111984623300017654Grasslands SoilAGRNPEEMKWILRVHNPLDEEKATEPRPLLGGTPRQAAKDLPRLKERGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0134112_1033346923300017656Grasslands SoilMKWILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA
Ga0134083_1016150623300017659Grasslands SoilEMKWILRVHNVLEEEKAVEPRALVGGAPEQAVTDLPRLRELGIDHVFYDMNHPAHVPINTQLVLLRRLVELIKD
Ga0187803_1040063723300017934Freshwater SedimentMVRRAGRNPAEMQWILRVHNTLDKEKATDPRPLLGGTPQQALEDLPRLKDIGIDHVFYDMNHPAHIPIDTQLLLLRRLMQLIKN
Ga0066667_1004490943300018433Grasslands SoilMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA
Ga0066667_1008952933300018433Grasslands SoilVAAARSTTLDKLSQTVNSFADMVRRAGRSPEEMKWILRVHNRLDEEKARASSVIGGGTPQKAATDLPRLKELGIDHIFYDMNHPAHVPIDTQLVLLRRLVLLIKA
Ga0066662_1017207213300018468Grasslands SoilDGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKRILRVHNPLDEEKATEPRALLGGTPQQAAEDLPRLNELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0210404_1015559813300021088SoilASMKRASRLADGILPAAGRKTTIEKLSQTINNFHDMVRRAGRNPEEMKWILRVHNPLEEKSTEPRALLGGTPQQAAKDLPRLKEIGIDHVFYDMNHPAGVPIDTQLLLLRRLMRLIKSQENSRVEKNPQA
Ga0207646_1093142313300025922Corn, Switchgrass And Miscanthus RhizosphereLDEEKAAEPRALLGGTPQQAAKDLPGLKDLGIDQVFYDMNHPAQVPIDNQLLLLRRLMRLIKN
Ga0209350_102149413300026277Grasslands SoilLSKEKATEPRMLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209235_1003242103300026296Grasslands SoilAGRSTTIEKLSQTINNFRDMVRRAGRSPEEMKWILRVHNPLDEEKASEPRALLGGTPQQAAEDLPRLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLINAS
Ga0209237_102269813300026297Grasslands SoilEKLNQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRKLVRLIKA
Ga0209236_100846143300026298Grasslands SoilMPAAGRGTTIEKLGQTINNFRDMVRRAGRHPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209236_126074313300026298Grasslands SoilPEEMKWILRVHNPLSEEKAKAPRALLGGTPKQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLTKA
Ga0209055_100617633300026309SoilMPAAAGSTTIEKLSQTIKDFHEKVRRAGRNPEEMKWILRVHNPLSGEKATEPRALLGGTPQQAAEDLPRLKELGIDHIFYDMNHPAQVPK
Ga0209239_107669513300026310Grasslands SoilAEEKATEPPALLGGTPEQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN
Ga0209761_1001911143300026313Grasslands SoilLSQTINDFRDMVRRAGRNPEEMKRILRVHNPLSKEKATEPRMLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209761_100673763300026313Grasslands SoilIADGIMPAAAKSITIERLSQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKA
Ga0209801_107428223300026326SoilMVRRAGRNPEEMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA
Ga0209801_109988133300026326SoilVDPESPQPPEEIRWILRVHNPLSEERATDPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA
Ga0209803_118035323300026332SoilTINNFREMVRRAGRNPDEMIWILRVHNPLDEEKATEPRALLGGTPQQAAEDLTRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209158_122063223300026333SoilIKDFREKVRRAGRNPEEIKWILRVHNPLEGEKATEPRALLGGTPQQAVEDLPRLKELGIDHVFYDMNHPAQVPIETQLALLRRLVRLIKP
Ga0257177_108021513300026480SoilMKWILRVHNPLEKGKATEPRALLGGTPQQSATDLPRLKELGIDHVFYDMNHPAHVPIGTHLVLLRRLVRLIKA
Ga0257181_108614213300026499SoilAGRSTTIEKLSQTVNSFSDMVRRAGRNPEEMRWILRVHNPLTEEKAAEPRPLLGGTPQQAAKDLPRIKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVQLIRN
Ga0209378_109143623300026528SoilMKWILRVHNPLTEEKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209806_101265263300026529SoilMPAAGRGTTIEKLGQTINNFRDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209160_1006255133300026532SoilMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDT
Ga0209160_124286313300026532SoilLSQTINSFQDMVRRAGRKPEEMKWFLRVHNPLYEEKASEPRALLGGTPQQAAGDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN
Ga0209058_117084323300026536SoilMKWILRVHNPMYEEKAAEPRALLGGTPQQAAEDLSRVRELGIGHVFYDMNHPAHVPIDTQLVLLRRLVRLIKP
Ga0209056_10009629103300026538SoilLTEEKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209577_1012660113300026552SoilSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA
Ga0209689_107386533300027748SoilAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA
Ga0209689_119485513300027748SoilAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIMWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA
Ga0209180_1001262013300027846Vadose Zone SoilKLSQTINNFRDMVRRAGRNPEELKWILRVHNPLEEEKAPEPQALLGGTPQQAAEVLPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA
Ga0209180_1028312613300027846Vadose Zone SoilRVHNPLDEEKAAGPRALLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKK
Ga0209180_1034184523300027846Vadose Zone SoilAARIADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLAEEKVTEPPALLGGTPQQAAKDFARVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN
Ga0307469_1090966713300031720Hardwood Forest SoilRIADGIMPAAGRSTTIEKLSQTINSFHEMVRRAGRNPEEIMWILRVHNPLEEEKALEPRALLGGTPQQAANDLPRLKELGIDHVFYDMNHPAQVPIQTQLVLLRRLMRLIKA
Ga0307477_1080945223300031753Hardwood Forest SoilRVHNSLSDEKAAEPRALLGGMPQQAVNDLPRLRELGIDHVFYDMNHPDQVPIETQLALLRRLVKLIKA
Ga0307479_1054739013300031962Hardwood Forest SoilDELRWILRVHNSLDEEKAAEPRALLAGTPQQAVNDLPRLRELGIDHVFYDMNHPAQVPIETQLALLRRLVKLIKA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.