NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F080424

Overview

Basic Information
Family ID: F080424
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 115
Average Sequence Length: 134 residues
Representative Sequence: MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLEGGSELGPAESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Number of Associated Samples: 95
Number of Associated Scaffolds: 115

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 60.87 %
% of genes near scaffold ends (potentially truncated): 47.83 %
% of genes from short scaffolds (< 2000 bps): 80.87 %
Associated GOLD sequencing projects: 87
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.22

Hidden Markov Model
[Figure: HMM sequence logo, rendered with Skylign]

Most Common Taxonomy
Group: Bacteria (100.000 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (20.000 % of family members)
Environment Ontology (ENVO): Unclassified (26.957 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (59.130 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
ID Label
1 Ga0062385_101136831
2 Ga0062387_1003211762
3 Ga0058904_10399831
4 Ga0058903_14484711
5 Ga0058887_14697532
6 Ga0058897_109529901
7 Ga0066680_1000037310
8 Ga0066690_102303521
9 Ga0066388_1053229951
10 Ga0070713_1008692331
11 Ga0066689_104305761
12 Ga0070732_100012953
13 Ga0070732_109259471
14 Ga0070732_109906311
15 Ga0066704_101792841
16 Ga0070762_103476311
17 Ga0070717_100823863
18 Ga0066696_103423401
19 Ga0075029_1007301601
20 Ga0075017_1000401503
21 Ga0075017_1007324101
22 Ga0075017_1010523471
23 Ga0075019_101386243
24 Ga0075019_103660792
25 Ga0075019_109272601
26 Ga0075015_1006358341
27 Ga0075030_1000023842
28 Ga0070716_1001828562
29 Ga0070765_1009828222
30 Ga0099793_102863351
31 Ga0123355_1000878518
32 Ga0126373_106531832
33 Ga0126373_122018011
34 Ga0150983_119328901
35 Ga0150983_122882261
36 Ga0137382_100042448
37 Ga0137399_102219051
38 Ga0137419_101515441
39 Ga0182036_108989091
40 Ga0182041_122470571
41 Ga0182033_119679331
42 Ga0182035_114095271
43 Ga0182035_119470831
44 Ga0182032_115816101
45 Ga0182039_111591561
46 Ga0182038_115018941
47 Ga0187825_101063411
48 Ga0187801_101099331
49 Ga0187801_102027541
50 Ga0187819_107739301
51 Ga0187816_100726861
52 Ga0187804_104950901
53 Ga0066662_107676321
54 Ga0210407_101756502
55 Ga0210403_100005484
56 Ga0210403_113881121
57 Ga0210395_103017761
58 Ga0210401_103675782
59 Ga0210401_111933172
60 Ga0210400_110106332
61 Ga0210405_107883901
62 Ga0210408_1000020170
63 Ga0210396_100394875
64 Ga0210396_101195511
65 Ga0210389_109267921
66 Ga0210384_106343522
67 Ga0210390_101413841
68 Ga0187846_102664011
69 Ga0210409_104650702
70 Ga0126371_112311442
71 Ga0242648_10445891
72 Ga0242663_10062881
73 Ga0242664_11221981
74 Ga0242661_10244771
75 Ga0242661_10555071
76 Ga0242665_100452611
77 Ga0224564_11216041
78 Ga0137417_12355941
79 Ga0207693_113926891
80 Ga0207700_105527692
81 Ga0209055_100065713
82 Ga0209471_11088641
83 Ga0209267_12505801
84 Ga0209803_13118281
85 Ga0209158_11811781
86 Ga0257172_11003831
87 Ga0208730_10106471
88 Ga0209004_10408341
89 Ga0209446_10331702
90 Ga0209580_100004043
91 Ga0209580_103559831
92 Ga0209275_109188931
93 Ga0209583_102585821
94 Ga0209698_1000083818
95 Ga0209698_100022376
96 Ga0209698_104341451
97 Ga0209698_105412962
98 Ga0137415_1000072629
99 Ga0308309_106100482
100 Ga0307482_11011751
101 Ga0170820_107121711
102 Ga0307476_1000006082
103 Ga0307469_100317433
104 Ga0307477_107975301
105 Ga0307475_104879461
106 Ga0307475_109221411
107 Ga0307475_113499941
108 Ga0310917_111358561
109 Ga0316049_1234961
110 Ga0306926_113658041
111 Ga0307479_100384165
112 Ga0307470_104016241
113 Ga0307471_1016481571
114 Ga0335085_1000043786
115 Ga0335082_101119981

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 31.14%, β-sheet: 0.00%, Coil/Unstructured: 68.86%
[Figure: Feature Viewer tracks plotted along the representative sequence (axis ticks at 20, 40, 60, 80, 100, 120): Sequence, α-helices, β-strands, Coil, SS Conf. score, Disordered Regions]

Predicted 3D Structure

[Figure: predicted 3D structure (PDBe Molstar viewer), colored by per-residue confidence (pLDDT) in four bins: 0-50, 51-70, 71-90, 91-100]
pTM-score: 0.22

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization
All Organisms: Unclassified (100.0%)

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization
[Figure: pie chart of associated habitat types. Legend entries, in chart order: Bog Forest Soil; Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Tropical Forest Soil; Surface Soil; Soil; Grasslands Soil; Soil; Forest Soil; Soil; Hardwood Forest Soil; Soil; Tropical Forest Soil; Forest Soil; Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Biofilm; Termite Gut. Slice percentages shown, in chart order: 5.2%, 12.2%, 5.2%, 4.3%, 8.7%, 20.0%, 7.8%, 8.7%, 7.0%, 3.5%, 4.3%]



Associated Samples


Geographical Distribution
[Map: geographical distribution of associated samples, rendered with OpenStreetMap]

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0062385_101136831 | 3300004080 | Bog Forest Soil | MSSSNKREGIGGIDFGADFSELRERPILKGADLDLPTAIEIGVQAEEPQPVSQTESREAATPESTAGLEVDPLTRLLTATERERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALMKLRQMSRTRKRPRLQ*
Ga0062387_1003211762 | 3300004091 | Bog Forest Soil | MSSSNKREGIGGIDFGADFSELRERPILKGADLDLPTAIEIGVQAEEPQPVSQTESREAAAPESTAGLEVDPLTRLLTATERERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALMKLRQMSRTRKRPRLQ*
Ga0058904_10399831 | 3300004100 | Forest Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGAELGAVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0058903_14484711 | 3300004103 | Forest Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGLVESREPVSAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0058887_14697532 | 3300004119 | Forest Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGSAELGAVESREPVTAAKAADLDDDPLTRLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0058897_109529901 | 3300004139 | Forest Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0066680_1000037310 | 3300005174 | Soil | MASGNKREGNGGIDFGADFSGLHERPILKGADLDLTTSMEIVTPAEEPVSQAEERTATPENAAEAETDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRARKRPRLQ*
Ga0066690_102303521 | 3300005177 | Soil | MASGNKREGSGGIDFGADFSGLHERPVLKGADLDLPTSMEIVTPAEEPVSKSEEMAVTPESAAEAEIDPLTKLITAAERERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALHKLRQMSRNRKRPRVQ*
Ga0066388_1053229951 | 3300005332 | Tropical Forest Soil | MAGSYKREGGSGINFGADFSGLRERPILKGADLDLPTSLEVAPPLEEEQQIDPVESREAVTAEKAVPLDDDPLTKLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLE
Ga0070713_1008692331 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | DSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0066689_104305761 | 3300005447 | Soil | FGADFSGLHERPVLKGADLDLPTSMEIVTPAEEPVSKSEEMAVTPESAAEAEIDPLTKLITAAERERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALHKLRQMSRNRKRPRVQ*
Ga0070732_100012953 | 3300005542 | Surface Soil | MASSNKREGSSGINFGADFSELRERPILKGADLDLPTSIEAAPELEGGSEILPAESREAVSAGKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0070732_109259471 | 3300005542 | Surface Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELSPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0070732_109906311 | 3300005542 | Surface Soil | DFSELRERPILKGADLDLPTPIEVAVRVEEPQPVSQPESREAVAPESTAGLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALLKLRQMSRTRKRPRLQ*
Ga0066704_101792841 | 3300005557 | Soil | ELRERPILKGADLDLPTSMEVAPPLEGGSELGPAESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0070762_103476311 | 3300005602 | Soil | MASSNKREGGSGINFGADFSELRERPILRGADLDLPTSMEVAPPLEGGSEIGPAESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0070717_100823863 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLEGVSEIGPAESRETVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0066696_103423401 | 3300006032 | Soil | MAGGKKREGNGGIDFGADFSGLHERPILKGADLDLTTSMEIVTPAEEPVSQAEERTATPENAAEAETDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRARKRPRLQ*
Ga0075029_1007301601 | 3300006052 | Watersheds | LRERPILKGADLDVPTAMEVGLRGEEPASHTASREAVAEESALELDTDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRLKKRPSRLPQ*
Ga0075017_1000401503 | 3300006059 | Watersheds | MASINKREGGGGIDFGADFSELRERPILKGADLDVPTAMEVALRGEEPASHTASREAVAEESALELDTDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRLKKRPSRLPQ*
Ga0075017_1007324101 | 3300006059 | Watersheds | MSISNKREGNGGIDFGADFSGLRERPVLKGADLDLPTPMEVAGEEPMSYGDSREAIAPETPQLDTDPLTRLVTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALLKLRQMSRSRKRARPQ*
Ga0075017_1010523471 | 3300006059 | Watersheds | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLEGGSEIGPAEIREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0075019_101386243 | 3300006086 | Watersheds | GGIDFGADFSELRERPILKGADLDVPTAMEVGLRGEEPASHTASREAVAEESALELDTDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRLKKRPSRLPQ*
Ga0075019_103660792 | 3300006086 | Watersheds | PILKGADLDLPTAMEVALRVEEPVNHTESGEGVAAERALELDMDPLTRLVTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALLKLRQMSRAKKRPSRLPQ*
Ga0075019_109272601 | 3300006086 | Watersheds | ISEGGDSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLEGGSEISPAESRETVTAGKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0075015_1006358341 | 3300006102 | Watersheds | DFGADFSELRERPILKGADLDVPTAMEVGLRGEEPASHTASREAVAEESALELDTDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRLKKRPSRLPQ*
Ga0075030_1000023842 | 3300006162 | Watersheds | MASINKREGGGGIDFGADFSELRERPILKGADLDVPTAMEVGLRGEEPASHTASREAVAEESALELDTDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRLKKRPSRLPQ*
Ga0070716_1001828562 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0070765_1009828222 | 3300006176 | Soil | SNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGAELGAVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0099793_102863351 | 3300007258 | Vadose Zone Soil | KQRGNRVRISERGDSEMASSNKREGSGGIDFGADFSELRERPILKGADLDLPTPIKVALPVEEPQPVNHTESREAVAPESTAGLDADPLTKLLTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALLKLRQMSRTRKRPRLQ*
Ga0123355_1000878518 | 3300009826 | Termite Gut | LKGVDLDAPTSLEVSPPIEVAQPVGPAEDGEVVTAERVANLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0126373_106531832 | 3300010048 | Tropical Forest Soil | INFGADFSGLRERPILKGADLDVPTSMEIAPPFEEPQAIVPAESREAVPVEKAADLEDDPLTKLLTAAERERGLPYTFRIPISMDKEFRTLAKEFDLDLSDIARSGLEMAMAKLRQMSRAKKRHLQ*
Ga0126373_122018011 | 3300010048 | Tropical Forest Soil | MAGSYKREGGSGINFGADFSGLRERPILKGADLDVPTSMEAAPPLEEPQETAPVESREVVPVEKPAEPEDDPLTKLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALAKLRQMSKARKRRLQ*
Ga0150983_119328901 | 3300011120 | Forest Soil | RGSRIHISEGGDSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLEDDPLTKLLTAAEKERGLPYTFRIAISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0150983_122882261 | 3300011120 | Forest Soil | RGSRIHISEGGDSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGSAELGAVESREPVTAAKAADLDDDPLTRLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0137382_100042448 | 3300012200 | Vadose Zone Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLEGGSELGPAESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ*
Ga0137399_102219051 | 3300012203 | Vadose Zone Soil | MASSNKREGSGGIDFGADFSELRERPILKGADLDLPTPIEVALRVEEPQPVSPTETREVVAPESSIALDADPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALFKLRQMSRTKKRPRLQ*
Ga0137419_101515441 | 3300012925 | Vadose Zone Soil | REGSGGIDFGADFSELRERPILKGADLDLPTPIEVALPVEESQLVSQTETREAVAQSTVSLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALFKLRQMSRTKKRPRLQ*
Ga0182036_108989091 | 3300016270 | Soil | MAISNKREGGSGINFGADFSGLRERPILKGADLDVATSMEAVPPVEEPQEITPAESREAVSVEKPLELEDDPLTKLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALAKLRQMSKARKRRLQ
Ga0182041_122470571 | 3300016294 | Soil | MAISNKREGGSGINFGADFSGLRERPILKGADLDVPTSMEIAPPFEEPQPVAPAESREAVPVERAPDPEDDPLTKRLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMAMAKLRQMSRAK
Ga0182033_119679331 | 3300016319 | Soil | ASSNRREGSSGVDFGADFSGLHERPALKGADLERPTPLEIVARGEGLQPDNASGGREAVVSGAAVDSDTDPLTRLLTAAEKERGLPYTFRIPISMDREFRNLAKEFDLDLSDIARSGLEMAMLRLRQMSSTKKRSLPQ
Ga0182035_114095271 | 3300016341 | Soil | MASSNRREGSSGVDFGADFSGLHERPALKGADLERPTPLEIVARGEGLQPDNASGGREAVVSGAAVDSDTDPLTRLLTAAEKERGLPYTFRIPISMDREFRNLAKEFDLDLSDIARSGLEMAMLRLRQMSSAKKRSLPH
Ga0182035_119470831 | 3300016341 | Soil | MAISNKREGGSGINFGADFSGLRERPILKGADLDVATSMEAVPPVEEPQEITPAESREAVSVEKPLELEDDPLTKLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLS
Ga0182032_115816101 | 3300016357 | Soil | MASSNRREGSSGVDFGADFSGLHERPALKGADLERPTPLEIVPRGEGLQSDNASSGREAVAPENAADSDTDPLTRLLTAAEKERGLPYTFRIPISMDREFRNLAKEFDLDLSDIARS
Ga0182039_111591561 | 3300016422 | Soil | MASSNRREGSSGVDFGADFSGLHERPALKGADLERPTPLEIVARGEGLQPDNASGGREAVVSEAAVDSDTDPLTRLLTAAEKERGLPYTFRIPISMDREFRNLAKEFDLDLSDIARSGLEMAMLRLRQMSSTKKRSLPQ
Ga0182038_115018941 | 3300016445 | Soil | KREGSSGIDFGADFSGLRERPILKGADLDVRTSMEIAPPFEEPQPVAPAESREAVPVERAPDPEDDPLTKLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMAMAKLRQMSRAKKRHLQ
Ga0187825_101063411 | 3300017930 | Freshwater Sediment | MASSNKREGSGGIDFGADFSGLRERPILKGADLDMPTSIEAALPVEEPQSVGQVEAREATAPESTAELDPDPLTRLVTAAEKERGLPYTFRIPISMDREFRSMAKEFDLDLSDIARSGLEMALMKLRQMSRTKKRRLQ
Ga0187801_101099331 | 3300017933 | Freshwater Sediment | SSNKRESIGGIDFGADFSELRERPILKDADLDVPTPIEVALRVEEPQPVDHAENREAVAPESAADLDVDPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRSKKRPRLTQ
Ga0187801_102027541 | 3300017933 | Freshwater Sediment | MSSSNKREGSGGIDFGADFSELRERPILKGADLDVPTPIEAGFRVEKPQPAGNTESREAVAPESTAGLDADPLTRLLTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALFKLRQMSRTRK
Ga0187819_107739301 | 3300017943 | Freshwater Sediment | SELRERPILKGADLDVPTPFVASFRVEKPQPAGNTESREAVAPEGTASLDADPLTRLLTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALFKLRQMSRTRKQPRLQ
Ga0187816_100726861 | 3300017995 | Freshwater Sediment | MASSNKREGSGGIDFGADFSELRERPILKDADLDLPTPIEVALRVEEQQPVDHADSREAVAPESAPDLEMDPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRSKKRPRLTQ
Ga0187804_104950901 | 3300018006 | Freshwater Sediment | MSSSNKREGSGGIDFGADFSELRERPILKGADLDVPTSIEAGFRVEKPQPVGNTESREAVAPESTAGLDADPLTRLLTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIGRAGLEIALFKLRQMSRTRKQPRLQ
Ga0066662_107676321 | 3300018468 | Grasslands Soil | VLKGADLDLPTSMEIVTPAEEPVSKSEEMAVTPESAAEAEIDPLTKLITAAERERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALHKLRQMSRNRKRPRVQ
Ga0210407_101756502 | 3300020579 | Soil | MSSSNKREGSGGIDFGADFSELRERPILKGADLDLPTSMEAALPVDGPPSVGHAETREAVAPESTVDLDADPLTKLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRPRLQ
Ga0210403_100005484 | 3300020580 | Soil | MASSNKREGGSGINFGADFSELRERPILKDADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0210403_113881121 | 3300020580 | Soil | SGGIDFGADFSELRERPILKGADLDLPTSMEAALPVDGPPSVGHAETREAVAPESTVDLDADPLTKLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRPRLQ
Ga0210395_103017761 | 3300020582 | Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGSAELGAVESREPVTAAKAADLDDDPLTRLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0210401_103675782 | 3300020583 | Soil | DSEMSSSNKREGSGGIDFGADFSELRERPILKGADLDLPTSMEAALPVDGPPSVGHAETREAVAPESTVDLDADPLTKLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRPRLQ
Ga0210401_111933172 | 3300020583 | Soil | SSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLEDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0210400_110106332 | 3300021170 | Soil | RERPILKGADLDLPTSMEAALPVDGPPSVGHAETREAVAPESTVDLDADPLTKLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTRKRPRLQ
Ga0210405_107883901 | 3300021171 | Soil | EGGDSEMSSSNKREGSGGIDFGADFSELRERPILKGADLDLPTSMEAALPVDGPPSVGHAETREAVAPESTVDLDADPLTKLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRPRLQ
Ga0210408_1000020170 | 3300021178 | Soil | MSSNNKREGSGGIDFGADFSELRERPILKGADLDLPTSMEAALPVDGAPSVGHAETREAVAPESTVDLDADPLTKLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRPRLQ
Ga0210396_100394875 | 3300021180 | Soil | ELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLEDDPLTKLLTAAEKERGLPYTFRIAISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0210396_101195511 | 3300021180 | Soil | GADFSELRERPILKGADLDLPTSMEVAPPLDGGAELGAVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0210389_109267921 | 3300021404 | Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGSAELGAVESREPVTAAKAADLDDDPLTRLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMS
Ga0210384_106343522 | 3300021432 | Soil | GGIDFGADFSELRERPILKGADLDLPTSMEAALPVDGPPSVGHAETREAVAPESTVDLDADPLTKLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRPRLQ
Ga0210390_101413841 | 3300021474 | Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGSAELGAVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQRSRARKRPRLQ
Ga0187846_102664011 | 3300021476 | Biofilm | MASSNKREGGSGIDFGADFSELRERPILKGADLDLPTSMEAAPPLEAEEQISPAESREAAPAEKGPDVDDDPLTKLLTAAEKEWGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEIALAK
Ga0210409_104650702 | 3300021559 | Soil | SNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLEDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0126371_112311442 | 3300021560 | Tropical Forest Soil | MAIGNRREGSGGIDFGADFSELRDRPILKGVDLDLPTPLETAARLDELPLVNHSIDGEAEIREASAGESTDPLTRLLTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMAMLKLRQMSIAKKKSRKQ
Ga0242648_10445891 | 3300022506 | Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGSAELGAVESREPVTAAKAADLDDDPLTRLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKR
Ga0242663_10062881 | 3300022523 | Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLEDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0242664_11221981 | 3300022527 | Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGSAELGAVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0242661_10244771 | 3300022717 | Soil | RGSRIHISEGGDSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0242661_10555071 | 3300022717 | Soil | RGSRIHISEGGDSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGPGESREPVTAAKAADLEDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0242665_100452611 | 3300022724 | Soil | RGSRIHISEGGDSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSEIGLVESREPVSAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0224564_11216041 | 3300024271 | Soil | MSSSNKREGGGGIDFGADFSELRERPILKGADLDLPTPIEVAVRVEEPQPVSQLESREVVAPESAAGLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMAL
Ga0137417_12355941 | 3300024330 | Vadose Zone Soil | MASSNKREGSGGIDFGADFSELRERPILKGADLDLPTPIKVALPVEEPQPVNHTESREAVAPESTAGLDADPLTKLLTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALLKLRQMSRTRKRPRLQ
Ga0207693_113926891 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRP
Ga0207700_105527692 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | DSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0209055_100065713 | 3300026309 | Soil | MASGNKREGNGGIDFGADFSGLHERPILKGADLDLTTSMEIVTPAEEPVSQAEERTATPENAAEAETDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRARKRPRLQ
Ga0209471_11088641 | 3300026318 | Soil | MASGNKREGSGGIDFGADFSGLHERPVLKGADLDLPTSMEIVTPAEEPVSKSEEMAVTPESAAEAEIDPLTKLITAAERERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALHKLRQMSRNRKRPRVQ
Ga0209267_12505801 | 3300026331 | Soil | MASGNKREGSGGIDFGADFSGLHERPVLKGADLDLPTSMEIVTPAEEPVSKSEEMAVTPESAAEAEIDPLTKLITAAERERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMAL
Ga0209803_13118281 | 3300026332 | Soil | FGADFSGLHERPVLKGADLDLPTSMEIVTPAEEPVSKSEEMAVTPESAAEAEIDPLTKLITAAERERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALHKLRQMSRNRKRPRVQ
Ga0209158_11811781 | 3300026333 | Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVVPPLEGGSELGPAESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0257172_11003831 | 3300026482 | Soil | REGGGGIDFGADFSELRERPILRGADLDLPTPIEVALRVEEPQPVSQPENREAVAPESTAGLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALQKLRQMSRTRKRPRPQ
Ga0208730_10106471 | 3300027047 | Forest Soil | IHISEGGDSEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGAELGAVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0209004_10408341 | 3300027376 | Forest Soil | MSSSGNKREGFGGIDFGADFSGLRERPILKGADLNLPTTMEVTLQVEEPQSVSYADSREAMVPESVPELDPDPLTRLVTAAERERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRSKKRVRTQ
Ga0209446_10331702 | 3300027698 | Bog Forest Soil | MSSSNKREGIGGIDFGADFSELRERPILKGADLDLPTAIEIGVQAEEPQPVSQTESREAATPESTAGLEVDPLTRLLTATERERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALMKLRQMSRTRKRPRLQ
Ga0209580_100004043 | 3300027842 | Surface Soil | MASSNKREGSSGINFGADFSELRERPILKGADLDLPTSIEAAPELEGGSEILPAESREAVSAGKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0209580_103559831 | 3300027842 | Surface Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELSPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0209275_109188931 | 3300027884 | Soil | MASSNKREGGSGINFGADFSELRERPILRGADLDLPTSMEVAPPLEGGSEIGPAESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0209583_102585821 | 3300027910 | Watersheds | MAIGNKREGSGGIDFGADFSELRERPILKGADLDLPTSIEVALREEEPQPASHPESREAVAPESTLDLDPDPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSKSRKRPRLQ
Ga0209698_1000083818 | 3300027911 | Watersheds | MASINKREGGGGIDFGADFSELRERPILKGADLDVPTAMEVGLRGEEPASHTASREAVAEESALELDTDPLTRLVTAAEKERGLPYTFRIPISMDREFRALAKEFDLDLSDIARSGLEMALLKLRQMSRLKKRPSRLPQ
Ga0209698_100022376 | 3300027911 | Watersheds | MSSSNKREGSGGIDFGADFSELRERPILKGADLDVPTPIEAGFRVEKPQPAGNTESREAVAPESTAGLDADPLTRLLTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALFKLRQMSRTRKQPRLQ
Ga0209698_104341451 | 3300027911 | Watersheds | MSISNKREGNGGIDFGADFSGLRERPVLKGADLDLPTPMEVAGEEPMSCGDSREVIAPERTAELDPDPLTRLVTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALLKLRQMSRSKRRARPQ
Ga0209698_105412962 | 3300027911 | Watersheds | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVTPPLEGGSEISPAEIRETVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0137415_1000072629 | 3300028536 | Vadose Zone Soil | MASSNKREGSGGIDFGADFSELRERPILKGADLDLPTPIEVALPVEEPQLVSQTETREAVAPESTVSLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALFKLRQMSRTKKRPRLQ
Ga0308309_106100482 | 3300028906 | Soil | ASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGAELGAVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0307482_11011751 | 3300030730 | Hardwood Forest Soil | GADFSELRDRPILKGADLDLPTPIEVALRVEEPQPVSQPESREAVAPEITAGLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALLKLRQMSRTRKRPRLQ
Ga0170820_107121711 | 3300031446 | Forest Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVK
Ga0307476_1000006082 | 3300031715 | Hardwood Forest Soil | MSSSNKREGGGGIDFGADFSELRERPILKGADLDLPTPIEVALRVEEPQPVSQPESREAVAPEITAGLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALLKLRQMSRTRKRPRLQ
Ga0307469_100317433 | 3300031720 | Hardwood Forest Soil | MASSNRREGSGGIDFGADFSGLRERPILKGADLDMPTSIEAALPVEEPQPVSQIETREAPAPESVPDLDPDPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALNKLRQMSRTKKRRLQ
Ga0307477_107975301 | 3300031753 | Hardwood Forest Soil | SEMASSNKREGGSGINFGADFSELRERPILKGADLDLPTSIEVAPPLDGGSEISPAESREPVTAGKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALAKLRQMSRARKRPRLQ
Ga0307475_104879461 | 3300031754 | Hardwood Forest Soil | MASSNKREGSGGIDFGADFSGLRERPILKGADLDMPTSIEAALPVEGPQSVGQVEARETTAPESVAELDPDPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRRLQ
Ga0307475_109221411 | 3300031754 | Hardwood Forest Soil | LRERPILKGADLDMPTSIEAALPAEEPQPVGQVEARETAAPESMAELDPDPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMSRTKKRPRLQ
Ga0307475_113499941 | 3300031754 | Hardwood Forest Soil | SSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSRARKRPRLQ
Ga0310917_111358561 | 3300031833 | Soil | ADFSGLRERPILKGADLDVATSMEAVPPVEEPQEITPAESREAVSVEKPLELEDDPLTKLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALAKLRQMSKARKRRLQ
Ga0316049_1234961 | 3300031866 | Soil | FGADFSELRERPILKGADLDLPTPIEVAVRVEEPQPVSQLESREVVAPESAAGLDADPLTRLVTAAEKERGLPYTFRIPISMDREFRCLAKEFDLDLSDIARSGLEMALLKLRQMSRTRKRPRLQ
Ga0306926_113658041 | 3300031954 | Soil | MAISNKREGSSGIDFGADFSGLRERPILKGADLDVPTSMEIAPRFEEPQPIAPAETREAVSVEKAPDLEDDPLTKLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMAMAKLRQMSRAKKRHLQ
Ga0307479_100384165 | 3300031962 | Hardwood Forest Soil | LKGADLDLPTPIEIAARVEELPLGSSTDSEAIAREAVGGVGTDPLTRLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMAMLKLRQMSSAKKRSSPQ
Ga0307470_104016241 | 3300032174 | Hardwood Forest Soil | MASSNKREGGSGINFGADFSELRERPILKGADLDLPTSMEVAPPLDGGSELGPVESREPVTAAKAADLDDDPLTKLLTAAEKERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQVSRARKRPRLQ
Ga0307471_1016481571 | 3300032180 | Hardwood Forest Soil | MAISNKREGSGGIDFGADFSELRERPILKGVDLDLATPMEAALRVDEPQAGNLSESREAAPESTAELEADPLTRLVTAAEKERGLPYTFRIPISMDREFRSLAKEFDLDLSDIARSGLEMALLKLRQMAKNKKRRLQ
Ga0335085_1000043786 | 3300032770 | Soil | MASSNKREGGSGINFGADFSELRERPILKGSDLDVPTAMEVAPPSEGEPQTDVIESREAVAAEKAADVENDPLTRLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSKARKRRLQ
Ga0335082_101119981 | 3300032782 | Soil | MASSNKREGGSGINFGADFSELRERPILKGSDLDVPTSMEVAPPSEGEQLQTDAVESRVAMAAEKAADVENDPLTRLLTAAERERGLPYTFRIPISMDREFRTLAKEFDLDLSDIARSGLEMALVKLRQMSKARKRRLQ




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice