NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F069926

Metagenome Family F069926

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F069926
Family Type Metagenome
Number of Sequences 123
Average Sequence Length 111 residues
Representative Sequence LTSLFLTYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFRLNQTLAVDALNDGNFMLKNRQL
Number of Associated Samples 79
Number of Associated Scaffolds 123

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 45.90 %
% of genes near scaffold ends (potentially truncated) 27.64 %
% of genes from short scaffolds (< 2000 bps) 56.10 %
Associated GOLD sequencing projects 60
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (95.935 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(44.715 % of family members)
Environment Ontology (ENVO) Unclassified
(58.537 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(68.293 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.
1JGI25383J37093_100148732
2JGI25384J37096_100015136
3JGI25382J37095_100260031
4JGI25382J43887_1000162915
5JGI25382J43887_100281872
6JGI25382J43887_102502572
7Ga0066672_100153222
8Ga0066672_100394524
9Ga0066672_104672741
10Ga0066677_100037483
11Ga0066677_100238352
12Ga0066680_100147222
13Ga0066679_100229323
14Ga0066679_102423501
15Ga0066679_103984141
16Ga0066679_105632491
17Ga0066690_100278621
18Ga0066690_100507852
19Ga0066690_100874422
20Ga0066690_102052701
21Ga0066688_101819443
22Ga0066688_104607672
23Ga0066678_100891093
24Ga0066676_106970931
25Ga0070708_1000126402
26Ga0070707_10000001959
27Ga0066697_100121282
28Ga0066701_105740851
29Ga0066661_100093752
30Ga0066661_103534041
31Ga0066707_102552011
32Ga0066700_100104692
33Ga0066699_100133002
34Ga0066699_101096792
35Ga0066699_103840252
36Ga0066703_101717002
37Ga0066703_102628802
38Ga0066705_106718162
39Ga0066903_1081144551
40Ga0066696_107850231
41Ga0066658_104770502
42Ga0066658_108341381
43Ga0066665_110876331
44Ga0066659_113485721
45Ga0079220_119065481
46Ga0066710_1020577611
47Ga0099829_100066048
48Ga0099829_100199562
49Ga0099829_100650202
50Ga0099828_102892893
51Ga0134088_101037521
52Ga0126378_121292591
53Ga0126383_101089152
54Ga0137392_108421002
55Ga0137391_107209351
56Ga0137389_100186433
57Ga0137389_114819931
58Ga0137388_100654435
59Ga0137388_102964432
60Ga0137383_102688852
61Ga0137365_102770372
62Ga0137365_104260421
63Ga0137399_100714942
64Ga0137399_115698531
65Ga0137380_101105232
66Ga0137380_102351402
67Ga0137381_100448913
68Ga0137387_101425201
69Ga0137387_103035222
70Ga0137360_101184831
71Ga0137390_100185441
72Ga0137396_112490341
73Ga0137419_106761722
74Ga0137416_102169981
75Ga0137416_112429331
76Ga0134075_101517531
77Ga0137418_100010431
78Ga0187803_102314771
79Ga0187771_118494101
80Ga0066662_100212282
81Ga0066662_112048421
82Ga0066662_113017051
83Ga0066662_118141272
84Ga0215015_102158072
85Ga0215015_105484502
86Ga0215015_107850291
87Ga0207646_10000001965
88Ga0209235_10154984
89Ga0209237_10603722
90Ga0209237_10945631
91Ga0209237_12056081
92Ga0209055_10087208
93Ga0209055_10247592
94Ga0209055_10298711
95Ga0209055_11319972
96Ga0209055_11745772
97Ga0209686_10055782
98Ga0209686_10289353
99Ga0209154_10055522
100Ga0209154_10130358
101Ga0209152_100016223
102Ga0209801_11963971
103Ga0209802_100095611
104Ga0209803_10816251
105Ga0209804_10383792
106Ga0209804_11088151
107Ga0209804_12040211
108Ga0209057_10645332
109Ga0209806_10751171
110Ga0209160_10240811
111Ga0209056_100096607
112Ga0209056_103632121
113Ga0209161_101556642
114Ga0209474_104371901
115Ga0209689_10053775
116Ga0209180_100508722
117Ga0209180_100527072
118Ga0209180_105832641
119Ga0209701_102192742
120Ga0209283_100092282
121Ga0137415_100110014
122Ga0307469_100017515
123Ga0307471_1032453132
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 36.69%    Coil/Unstructured: 63.31%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100110LTSLFLTYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFRLNQTLAVDALNDGNFMLKNRQLSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
99.2%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Agricultural Soil
Soil
Grasslands Soil
Hardwood Forest Soil
Tropical Peatland
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
26.8%44.7%12.2%3.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1001487323300002560Grasslands SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHEYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETLSVAALNNGNFMVKNRQL*
JGI25384J37096_1000151363300002561Grasslands SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHEYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETLSVAALNNGNFMVKNRQPDFA*
JGI25382J37095_1002600313300002562Grasslands SoilLSLSYWFDPGRVIWTKNGTVAAVHEYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETLSVAALNNGNFMVKNRQL*
JGI25382J43887_10001629153300002908Grasslands SoilLTSLFLAYWFDPGRVIWTKDGTVAAVHQYPGSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNEGNGKCLNAPLSFRLNQTLTVEALNNGDFMLKNRQP*
JGI25382J43887_1002818723300002908Grasslands SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETLSVAALNNGNFMVKNRQL*
JGI25382J43887_1025025723300002908Grasslands SoilMRKRLIIGTAVTAIMVSSLFLVYWYDPGRAIWTKEGTVAAVQEHWDSMSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIEGHGNYCWNDGNGNCLNSPLSFRLNQTLTVVAMNNGDFVITNRQL*
Ga0066672_1001532223300005167SoilVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFRLNQTLAVDALNDGNFMLKNRQL*
Ga0066672_1003945243300005167SoilLTSLFLTYWFDPGRVIWTKDGTVAAVHQHPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGDGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066672_1046727413300005167SoilMAIIVSSLFLVYWCEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYEQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066677_1000374833300005171SoilMAIIVSSLFLVYWYEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYEQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLGFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066677_1002383523300005171SoilVIWTRDGTVAAVHQYQDSNTGTNWNVLVYIYQQNDMGYLLNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMVKNRQL*
Ga0066680_1001472223300005174SoilMKKLVIGIAVLAALLTSLFLSYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGFGTQGHGNYCWNDGNGKCLNVPLSFSLNQTLAVDALNDGNFMIRNSHI*
Ga0066679_1002293233300005176SoilMVSSLFLVYWYDPGRAIWTKDGTVAAVHEHSDSMSRTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIEGHGNYCWNNGNGNCLNSPLSFRLNQTLTVVALNNGDFMITNRQL*
Ga0066679_1024235013300005176SoilVMAIIVSSLFLVYWYEPGRVIWTKNGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066679_1039841413300005176SoilVLTSLFLTYWFDPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFRLNQTLAVDALNDGNFMLKNRQL*
Ga0066679_1056324913300005176SoilVIWTKDGTVAAVHQYSDSNTGTNWNALVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI*
Ga0066690_1002786213300005177SoilIWTKDGTVAAVHEHSDSMSRTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIEGHGNYCWNNGNGNCLNSPLSFRLNQTLTVVALNNGDFMITNRQL*
Ga0066690_1005078523300005177SoilLTSLFITYWFDPGRVIWTRDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI*
Ga0066690_1008744223300005177SoilLTSLFLTYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFRLNQTLAVDALNDGNFMLKNRQL*
Ga0066690_1020527013300005177SoilMTSLFLTYWFDPGRVIWTKDGTVAAVHQFLDSDTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGRGNYCWNDGNGKCLSAPLSFSLNQTLTVEALNNGNFMLKNRQL*
Ga0066688_1018194433300005178SoilMWTKDGTVAAVHQSPDSNTGTNWNVLVYIYQQNDMGYIVNTGYVALNGPGTQGHGNYCWNDGNGKCLTVPLSFSLNQTLAVDALNDGNFMLKNRQQ*
Ga0066688_1046076723300005178SoilMAIIVSSLFLVYWYEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYEQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066678_1008910933300005181SoilMAIIVSSLFLVYWYEPGRVIWTKNGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066676_1069709313300005186SoilIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0070708_10001264023300005445Corn, Switchgrass And Miscanthus RhizosphereMRRRRIVGIAILAIVTSSLVLAYWFDPNRVIWTRKGTVATVHQFLGSNNLTSWNVLVFVYQQNDRGYLVDTGYLALNGPGTAGHRNFCWNDGNGGCLKVPLNFSLNQTLTVDALNDGNFMIRNRQP*
Ga0070707_100000019593300005468Corn, Switchgrass And Miscanthus RhizosphereMIGITVLAIVLSSLSLTYWLDPGRVIWTKDGTVTAVHQYSDPNGGTNWNVLVYIYQQNDMGYLVNTGYVPLNGLGTQGHGNYCWDDGTSNCLNAPLSFRLNETLSVAALNNGNFMLKNRQL*
Ga0066697_1001212823300005540SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNCQL*
Ga0066701_1057408513300005552SoilMWTKDGTVAAVHQSPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066661_1000937523300005554SoilMAIIVSSLFLVYWYEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066661_1035340413300005554SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHEYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETL
Ga0066707_1025520113300005556SoilMAILAVVVTSLFLTYLFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFILRTVNCECA*
Ga0066700_1001046923300005559SoilMRKRLIIGIAVTAIIVSSLFLSYWYYPGRILWTKDGTVAAVHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIESHGNYCWNDGNGNCLISPLSFRLNQTLTVVAMNNGDFMIMNHQL*
Ga0066699_1001330023300005561SoilMAIIVSSLFLVYWCEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYEQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLGFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066699_1010967923300005561SoilVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI*
Ga0066699_1038402523300005561SoilVIWTRDGTVAAVHQYQDSNTGTNWNVLVYIYQQNDMGYLLNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFM
Ga0066703_1017170023300005568SoilLVIVLSSLFLVYWFDPGRAIWAKDGIVAAVQEQSGSNGGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGNCLNSPLSFRLNETLTVDALNNGDFMISNRQPSTGFRLRALTISL*
Ga0066703_1026288023300005568SoilVIWTKDGTVAAVHQFLDSDTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGRGNYCWNDGNGKCLSAPLSFSLNQTLTVEALNNGNFMLKNRQL*
Ga0066705_1067181623300005569SoilLTSLFITYWFDPGRVIWTRDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNF
Ga0066903_10811445513300005764Tropical Forest SoilESNSNNGTNWNVLVYIYQQTSMGYLVNTGYVALNGPGTPGNGNYCWDNGNGSCLNAPLYFSPNQALTVEALNNGNFMIINRRP*
Ga0066696_1078502313300006032SoilIVLTSLFITYWFDPGRVIWTRDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI*
Ga0066658_1047705023300006794SoilLTSLFLTYWFDPGRVIWTKDGTVAAVHQHPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI*
Ga0066658_1083413813300006794SoilVYWYEPGRVIWTKNGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL*
Ga0066665_1108763313300006796SoilVIWTKDGTVAAVHQYQDSNTGTDWNVLVYIHQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFILRTVNCECA*
Ga0066659_1134857213300006797SoilAVHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIESHGNYCWNDGNGNCLISPLSFRLNQTLTVVAMNNGDFMIMNHQL*
Ga0079220_1190654813300006806Agricultural SoilVSSLFLVYWYSPGRVVWTKDGTVAAVHQQTDSNNVPNWNVLVYIYQQNDMGYLVNTGYVALNGMGIAGRGNYCWNDGNGNCLSSPLSFRLNQTLMVVAMNNGDFLISNRQL*
Ga0066710_10205776113300009012Grasslands SoilRLSRQLPVFFRGRPKPTRKKLVIGMAILAVVVTSLFLTYLFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLVVDALNDGNFMLKNRQL
Ga0099829_1000660483300009038Vadose Zone SoilMWTKDGTVAAVHQYSNSNNGVNWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRLNETLTVAALNNGNFMLKNRQP*
Ga0099829_1001995623300009038Vadose Zone SoilMRRRLIIGVAVLVIVLSNLFLIYWLDPSRVEWTRNGTVAAVHQHSNSKGETNWNVLIYIYQQNDMGYLVNTGYVALNGLGTQEHGNYCWNDGNGNCLYSPLSFRLNETLTVDALNNGDFMISNRQP*
Ga0099829_1006502023300009038Vadose Zone SoilVHQYSDSNSGANWNVLVYIYQQNDMGYLVNTGFVALNGLGIPGHGNFCWNDGHSSCLGRPLGFSLNQTLTVEALKNGNFMLKNRQL*
Ga0099828_1028928933300009089Vadose Zone SoilVIWTKNGTVAAVHQYSDSNSGANWNVLVYIYQQNDMGYLVNTGFVALNGLGIPGHGNFCWNDGHSSCLGRPLGFSLNQTLTVEALKNGNFMLKNRQL*
Ga0134088_1010375213300010304Grasslands SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNCQL*
Ga0126378_1212925913300010361Tropical Forest SoilMLSSFFLVYWTYPGRVVWTKDGIVMAVHEESNSNNGTNWNVLVYIYQQTSMGYLVNTGYVALNGLGTPGHGNYCWDKGNGRCLNAPLNFSSNQALTVEALNNGNFMIINRQL*
Ga0126383_1010891523300010398Tropical Forest SoilMIGITALTFVISSLFLFYWTFPDRVVWTKDGIVTAVHQQSDSKIGTNWNVLVYIYQQTSLGYIVNTGYVALNGLGTPGHGNYCWDKGNGRCLNAPLNFSSNQALTVEALNNGNFMIINRQL*
Ga0137392_1084210023300011269Vadose Zone SoilMWTKDGTVVAVHQYSNSNNGVNWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRLNETLTVAALNNGNFMLKNRQP*
Ga0137391_1072093513300011270Vadose Zone SoilMTSLFLTYWFDPGRVIWTKDGTVAAAHQYSNSNNGANWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRLNETLTVAALNNGNFMLKNRQP*
Ga0137389_1001864333300012096Vadose Zone SoilMWTKDGTVAAVHQYSNSNNGVNWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRPNETLTVAALNNGNFMLKNRQP*
Ga0137389_1148199313300012096Vadose Zone SoilVIWTEYGTVAAVHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIQSHGNYCWNDGNGNCLNSPLSFRLNQTLTVVAMNNGDFMIRNHQL*
Ga0137388_1006544353300012189Vadose Zone SoilMAILAVLMTSLFLTYWFEPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWDDGYGNCLNAPLSFSLNETLTVAALNNGKFILVNRHL*
Ga0137388_1029644323300012189Vadose Zone SoilTVAAVHQHSNSKGETNWNVLIYIYQQNDMGYLVNTGYVALNGLGTQEHGNYCWNDGNGNCLYSPLSFRLNETLTVDALNNGDFMISNRQP*
Ga0137383_1026888523300012199Vadose Zone SoilMRKRLIIGIAVMAIIMSSLFLVYWYDPGRVIWTKDGTVAALHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYIALNGIGIEGHGNYCWSDGDGNCLNSPLSFRLNETLTVDALNNGYFVISNRQP*
Ga0137365_1027703723300012201Vadose Zone SoilVRRRLTIRIAVPVIIVSSLFFVHWYDPGRAIWTKDGTVATVHQQSDSNIGTNWNVFVYIYQQNEMGFLVNTGYIALNGMGVEGHGNYCWDNGSGNCLNSPLSFRLNQTLTVVASNNGDFKISNRQV*
Ga0137365_1042604213300012201Vadose Zone SoilMRKRLIIGIAVMAIIMSSLFLVYWYDPGRVIWTKDGTVAALHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYIALNGIGIEGHGNYCWSDGDGNCLNSPLSFRLNETLTVDALNNGYFMISNRQP*
Ga0137399_1007149423300012203Vadose Zone SoilVSSLFLVYWYDPERAIWTKEGIVAAVHEHSNSMSGTSWNILVYIYQHNEMGFLVNTGYVALNGMGIEGHGNYCWNDGNGNCLNSPLSFRLNQTLTVVALNNGDFMITNRRL*
Ga0137399_1156985313300012203Vadose Zone SoilVTAIIVSSLFLVYWYNPGRVMWTKDGTIAAVHEYSDSTSGTSWNVLVYIHQQNEMGFLVNTGYVALNGMGIEGHGNYCWNDGNGDCLNSPLSFQLNETLTVVALSNGGFIINNRQLYTGFSLHLYQSPSNLGFAITTKKL*
Ga0137380_1011052323300012206Vadose Zone SoilMAILAVVVTSLFLTYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLVLDALNDGNFMLKNRQL*
Ga0137380_1023514023300012206Vadose Zone SoilMLSDLFLIYWLDPSRVEWTRNGTVAAVHQHSDSNSGTNWNVLIYIYQQNDMGYLVNTGYVALNGLGTHGHGNYCWNDGHGNCLNSPLSFRLNETLTVDALNNGDFMISNRKP*
Ga0137381_1004489133300012207Vadose Zone SoilVIWTKDGTVAAVHQYQDSNTGTDWNVLVYIHQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLVLDALNDGNFMLKNRQL*
Ga0137387_1014252013300012349Vadose Zone SoilMLSDLFLIYWLDPSRVEWTRNGTVAAVHQHSDSNSGTNWNVLIYIYQQNDMGYLVNTGYVALNGLGTHGHGNYCWNDGHGNCLNSPLSFRLNETLTV
Ga0137387_1030352223300012349Vadose Zone SoilMRKRLIIGIAVMAIIMSSLFLVYWYDPGRVIWTKDGTVAALHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYIALNGIGIEGHGNYCWSDGDGNCLNSPLSFRLNETLTVDALNNGDFMISNRKP*
Ga0137360_1011848313300012361Vadose Zone SoilFDPGRVIWTKDGTVAAVHQYPGSNTRTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNEGNGKCLNAPLSFSLNQTLTVEALNNGDFMLTNRQP*
Ga0137390_1001854413300012363Vadose Zone SoilVIWTKNGTVAAVHQYSNSNNGVNWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRLNETLTVAALNNGNFMLKNRQP*
Ga0137396_1124903413300012918Vadose Zone SoilVHEYSDSTSGTSWNVLVYIHQQNEMGFLVNTGYVALNGMGIEGHGNYCWNDGNGDCLNSPLSFQLNETLTVVALSNGGFIINNRQL*
Ga0137419_1067617223300012925Vadose Zone SoilMWTKDGTIAAVHEYSDSTSGTSWNVLVYIHQQNEMGFLVNTGYVALNGMGIEGHGNYCWNDGNGNCLNSPLSFRVNQTLTVVALN
Ga0137416_1021699813300012927Vadose Zone SoilMSSLFLVYCYNPGRVIWTKDGTVAAVHEYSDSMSGTSWNVLVYIYQQNEMDFLLNTGYVALNGMGIESHGNYCWNDGNGKCLISPLSFRLNQTLTVVALSNGDFVIGNRQL*
Ga0137416_1124293313300012927Vadose Zone SoilGTVAAVHEHSDSTSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIESHGNYCWNDGNGNCLNSPLSFRLNQTLTVVALNNGDFMITNRRL*
Ga0134075_1015175313300014154Grasslands SoilIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNCQL*
Ga0137418_1000104313300015241Vadose Zone SoilMWTKDGTIAAVHEYSDSTSGTSWNVLVYIHQQNEMGFLVNTGYVALNGMGIQSHGNYWWNDGNGNCIDSPLSLRLNQTLTVVAMNNGDFVIRNRQL*
Ga0187803_1023147713300017934Freshwater SedimentMAIVLSSLFLVYWFDPGRAIWTKDGIVAAVQEQSGSNSGTNWNVLVYIYQQTNMGYLVNTGYVALNGLGTQGSGNYCWNDGKGSCLNSPLSFRLNQTLTVDALNNGDFVISNR
Ga0187771_1184941013300018088Tropical PeatlandAVIAIALSGLFVVYWIDPGRVVWTRNGTVAAVHQQSDPNDGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTHGHGNYCWNDGNGNCLSTPLTFRLNQTLTIEALNDGVFIITNSQR
Ga0066662_1002122823300018468Grasslands SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHEYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETLSVAALNNGNFMVKNRQPDFA
Ga0066662_1120484213300018468Grasslands SoilAVMAIIVSSLFLVYWYDPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL
Ga0066662_1130170513300018468Grasslands SoilRKRLVIGIAILAIVLTSLFLTYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFRLNQTLAVAALNDGNFMLKNRQL
Ga0066662_1181412723300018468Grasslands SoilMRKRLIIGTAVMAIIVSSLFLVYWYDPGRVIWTKDGTVAALHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIEGHGNYCWNNGNGNCLNSPLSFRLNQTLTVVALNNGDFMITNRQL
Ga0215015_1021580723300021046SoilLAVVITSLFLTYWFDPARVIWTKDGTVAAVHQYSNSNNEANWNVLVYIYQQNDIGYLVNTGYIALSGLGIQGHGNYCWNDGNGNCLNAPLSFRLNETLTVAALNNGNFMLENRQL
Ga0215015_1054845023300021046SoilLSRQLPVFFRGRPKPRGKRLIIGITVTAIIMSSLFLVYLSDPGRTIWTKDGTVTAVHEYSDSISGTSWNVLVYIYQQNEMGFLLNTGYVALNRMGIEGHGNYCWNNGNGNCLNSPLSFRLNQTLAVVALNDGDFMITNRQL
Ga0215015_1078502913300021046SoilMRRIRIVGIVVLATVLSSLLLVYWFDPNRVIWSRDGTVATVHQYSNSNNVTNWNVLAFVYQRNDMGYLVDSGYVALNGLGTEGHGNFCWNDGNGSCLAAPLSFRLNQTLTVDALNDGNFMIRNRQP
Ga0207646_100000019653300025922Corn, Switchgrass And Miscanthus RhizosphereMIGITVLAIVLSSLSLTYWLDPGRVIWTKDGTVTAVHQYSDPNGGTNWNVLVYIYQQNDMGYLVNTGYVPLNGLGTQGHGNYCWDDGTSNCLNAPLSFRLNETLSVAALNNGNFMLKNRQ
Ga0209235_101549843300026296Grasslands SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHEYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETLSVAALNNGNFMVKNRQ
Ga0209237_106037223300026297Grasslands SoilMRKRLIIGTAVTAIMVSSLFLVYWYDPGRAIWTKEGTVAAVQEHWDSMSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIEGHGNYCWNDGNGNCLNSPLSFRLNQTLTVVAMNNGDFVITNRQL
Ga0209237_109456313300026297Grasslands SoilLTSLFLAYWFDPGRVIWTKDGTVAAVHQYPGSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNEGNGKCLNAPLSFRLNQTLTVEALNNGDFMLKNRQP
Ga0209237_120560813300026297Grasslands SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNDIGYLVNTGYVALNGLGTQGHGNYCWNSGSGNCLNAPLRFRLNETLSVAALNNGNFMVKNRQPDFA
Ga0209055_100872083300026309SoilMAIIVSSLFLVYWYEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL
Ga0209055_102475923300026309SoilMVSSLFLVYWYDPGRAIWTKDGTVAAVHEHSDSMSRTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIEGHGNYCWNNGNGNCLNSPLSFRLNQTLTVVALNNGDFMITNRQL
Ga0209055_102987113300026309SoilVIWTRDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI
Ga0209055_113199723300026309SoilMWTKDGTVAAVHQSPDSNTGTNWNVLVYIYQQNDMGYIVNTGYVALNGPGTQGHGNYCWNDGNGKCLTVPLSFSLNQTLAVDALNDGNFMLKNRQQ
Ga0209055_117457723300026309SoilVIWTKDGTVAAVHQHPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLVVDALNDGNFMLKNRQ
Ga0209686_100557823300026315SoilMAIIVSSLFLVYWCEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYEQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLGFSLNQTLAVDALNDGNFMLKNRQL
Ga0209686_102893533300026315SoilVIWTRDGTVAAVHQYQDSNTGTNWNVLVYIYQQNDMGYLLNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMVKNRQL
Ga0209154_100555223300026317SoilVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFRLNQTLAVDALNDGNFMLKNRQL
Ga0209154_101303583300026317SoilMAIIVSSLFLVYWYEPGRVIWTKNGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL
Ga0209152_1000162233300026325SoilMAIIVSSLFLVYWYEPGRVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYEQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLGFSLNQTLAVDALNDGNFMLKNRQL
Ga0209801_119639713300026326SoilKTLVIGIAVLAIVLTSLFLTYWFDPGRVIWTKDGTVAAVHQYPDSNTWTSWNVLVYIYLQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGTGKCLNAPLGFSLNQTLAVDALNDGNFMIWNRQL
Ga0209802_1000956113300026328SoilMKKLVIGIAVLAALLTSLFLSYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGFGTQGHGNYCWNDGNGKCLNVPLSFSLNQTLAVDALNDGNFMIRNSHI
Ga0209803_108162513300026332SoilVIWTKDGTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL
Ga0209804_103837923300026335SoilLTSLFLTYWFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFRLNQTLAVDALNDGNFMLKNRQL
Ga0209804_110881513300026335SoilMTSLFLTYWFDPGRVIWTKDGTVAAVHQFLDSDTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGRGNYCWNDGNGKCLSAPLSFSLNQTLTVEALNNGNFMLKNRQL
Ga0209804_120402113300026335SoilTVAAVHQYQDSNTGTNWNVLVYTYQQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL
Ga0209057_106453323300026342SoilMIGIALLAIVLSFLSLSYWFDPGRVIWTKNGTVAAVHQYSDPGGGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNCQ
Ga0209806_107511713300026529SoilHQYQDSNTGTNWNVLVYTYEQNDMGYLVNTGYVALNGPGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLAVDALNDGNFMLKNRQL
Ga0209160_102408113300026532SoilVIWTKDGTVAAVHQYPDSNTWTSWNVLVYIYLQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGTGKCLNAPLGFSLNQTLAVDALNDGNFMIWNRQL
Ga0209056_1000966073300026538SoilLTSLFITYWFDPGRVIWTRDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI
Ga0209056_1036321213300026538SoilMAILAVVVTSLFLTYLFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLVVDALNDGNFMLKNRQL
Ga0209161_1015566423300026548SoilMAILAVVVTSLFLTYLFDPGRVIWTKDGTVAAVHQYPDSNTGTNWNVLVYIYQQNEMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLTAPLSFSLNQTLVVDALNDGNFMLKNRQ
Ga0209474_1043719013300026550SoilAIVLTSLFITYWFDPGRVIWTRDGTVAAVHQYPDSNTGTNWNVLVYIYQQNDMGYLVNTGYVALNGLGTQGHGNYCWNDGNGKCLNAPLSFSLNQTLAVDALNDGNFMIRNSHI
Ga0209689_100537753300027748SoilMRKRLIIGIAVTAIIVSSLFLSYWYYPGRILWTKDGTVAAVHEYSDSMSGTSWNVLVYIYQHNEMGFLVNTGYVALNGMGIESHGNYCWNDGNGNCLISPLSFRLNQTLTVVAMNNGDFMIMNHQL
Ga0209180_1005087223300027846Vadose Zone SoilLVIVLSNLFLIYWLDPSRVEWTRNGTVAAVHQHSNSKGETNWNVLIYIYQQNDMGYLVNTGYVALNGLGTQEHGNYCWNDGNGNCLYSPLSFRLNETLTVDALNNGDFMISNRQP
Ga0209180_1005270723300027846Vadose Zone SoilVIWTKNGTVAAVHQYSDSNSGANWNVLVYIYQQNDMGYLVNTGFVALNGLGIPGHGNFCWNDGHSSCLGRPLGFSLNQTLTVEALKNGNFMLKNRQL
Ga0209180_1058326413300027846Vadose Zone SoilMWTKDGTVAAVHQYSNSNNGVNWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRLNETLTVAALNNGNFMLKNRQP
Ga0209701_1021927423300027862Vadose Zone SoilMAILAVLMTSLFLTYWFDPGRVMWTKDGTVAAVHQYSNSNNGVNWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRLNETLTVAALNNGNFMLKNRQP
Ga0209283_1000922823300027875Vadose Zone SoilMWTKDGTVAAVHQYSNSNNGVNWNVLVYIYQQNDMGFLVNTGYVALNGLGTQGHGNYCWNDGKGKCLNAPLSFRPNETLTVAALNNGNFMLKNRQP
Ga0137415_1001100143300028536Vadose Zone SoilMSSLFLVYCYNPGRVIWTKDGTVAAVHEYSDSMSGTSWNVLVYIYQQNEMDFLLNTGYVALNGMGIESHGNYCWNDGNGKCLISPLSFRLNQTLTVVALSNGDFVIGNRQL
Ga0307469_1000175153300031720Hardwood Forest SoilMAVLAILVSSLFLTYWFEPNRLIWTKDGTVAALHQYSDSNSATNWNVLVYIYQQNDMGYLVNTGFVALNGLGVPGHGNFCWNDGHNSCLGTPLGFNLNQTLTIEALNNGNFMVKNRQLNVSTDSFMSLPRVSTRGV
Ga0307471_10324531323300032180Hardwood Forest SoilNRLIWTKDGTVAALHQYSDSNSATNWNVLVYIYQQNDMGYLVNTGFVALNGLGVPGHGNFCWNDGHNSCLGTPLGFNLNQTLTIEALNNGNFMVKNRQLNVSTDSFMSLPRVSTRGV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.