NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F102949

Metagenome / Metatranscriptome Family F102949

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102949
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 92 residues
Representative Sequence LRSVYEDIEGRYLTGGQGDVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNRERVEAVIAAIRDRVHQYAPVG
Number of Associated Samples 83
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 13.86 %
% of genes from short scaffolds (< 2000 bps) 10.89 %
Associated GOLD sequencing projects 74
AlphaFold2 3D model prediction Yes
3D model pTM-score0.68

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (86.139 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(52.475 % of family members)
Environment Ontology (ENVO) Unclassified
(64.356 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(81.188 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 33.61%    β-sheet: 20.17%    Coil/Unstructured: 46.22%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.68
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF00378ECH_1 36.63
PF13561adh_short_C2 3.96
PF00106adh_short 0.99
PF00441Acyl-CoA_dh_1 0.99
PF13452MaoC_dehydrat_N 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG1960Acyl-CoA dehydrogenase related to the alkylation response protein AidBLipid transport and metabolism [I] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A86.14 %
All OrganismsrootAll Organisms13.86 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005171|Ga0066677_10024310All Organisms → cellular organisms → Bacteria2833Open in IMG/M
3300005172|Ga0066683_10050276All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium2457Open in IMG/M
3300005447|Ga0066689_10477086All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium785Open in IMG/M
3300005450|Ga0066682_10136710All Organisms → cellular organisms → Archaea → Euryarchaeota1555Open in IMG/M
3300005454|Ga0066687_10409107All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium786Open in IMG/M
3300005552|Ga0066701_10725212All Organisms → cellular organisms → Bacteria → Terrabacteria group596Open in IMG/M
3300005556|Ga0066707_10234867All Organisms → cellular organisms → Archaea1193Open in IMG/M
3300005568|Ga0066703_10018898All Organisms → cellular organisms → Bacteria3511Open in IMG/M
3300009137|Ga0066709_102019594All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium798Open in IMG/M
3300010139|Ga0127464_1131117All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium643Open in IMG/M
3300010303|Ga0134082_10156381All Organisms → cellular organisms → Bacteria → Terrabacteria group923Open in IMG/M
3300010325|Ga0134064_10025349All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1709Open in IMG/M
3300026305|Ga0209688_1028096All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1087Open in IMG/M
3300026548|Ga0209161_10179482All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1197Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil52.48%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.84%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.90%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost5.94%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.98%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.98%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.98%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.99%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.99%
Exposed RockEnvironmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000886Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-65cm-3A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300001538Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-PF 4A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300001545Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1EnvironmentalOpen in IMG/M
3300001664Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - 5cm_reassembledEnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010139Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012014Permafrost microbial communities from Nunavut, Canada - A10_80cm_6MEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012374Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012389Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300013501Permafrost microbial communities from Nunavut, Canada - A35_65cm_0.25MEnvironmentalOpen in IMG/M
3300014056Permafrost microbial communities from Nunavut, Canada - A20_5cm_0MEnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021362Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R09EnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026305Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027583Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AL3A1W_124645113300000886PermafrostAELRAVYQAIEARYLLGHHGDVVRELRYRMVPESAFHDIMQALEREFPDVSLGSYPQTETRELILRATGQASERVEAVLNAIRGGVSQYRPVG*
A10PFW1_1154879613300001538PermafrostSVYGAIQERYLLGEHGDVVRELRYQAVPESAFHDIMQALEAEFPDVSLGSYPQTETRELILRASGRTAERVEAVLTAIRQAMTQYRPVG*
JGI12630J15595_1008992813300001545Forest SoilGLAFDLGQDRYVFALPGVPHELRSVYADIEIRYLTGSHGDAVRELHYKLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPDRVEAVINAIRDRVKQFTPVG*
P5cmW16_103120913300001664PermafrostFELGEDRYLFALPGVPAELRAVYQAIEARYLLGHHGDVVRELRYRMVPESAFHDIMQALEREFPDVSLGSYPQTETRELILRATGQASERVEAVLNAIRGGVSQYRPVG*
JGIcombinedJ26739_10089401723300002245Forest SoilLRSVYEDIEGRYLTGGQGDVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNRERVEAVIAAIRDRVHQYAPVG*
Ga0066672_1018164233300005167SoilAFDLGQDRYLFALPGVPHELRSVYEDIESRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTQTRELILRASGPDADRVEAVITAIRDRVRQFSPLGS*
Ga0066672_1041968023300005167SoilRYLTGSHGDAVREVHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDADRVEAVIRAIRDRVRQFSPLR*
Ga0066677_1002431053300005171SoilVPHELRSIYEDIEIRYLTGGQADVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRQLILRASGPNAERVDAVITAIRDRVKQYDPVA*
Ga0066683_1005027643300005172SoilMAPGLAFDLDQGRYLFALPGVPHELRSIYEDIEIRFLSGGQGDVVRELHYRLAPESMFHDVMQALEHEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR*
Ga0066680_1012988333300005174SoilVYDDIEVRYLGGGQADVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPEQVEAVISAIRERVTQYAPIG*
Ga0066673_1083017913300005175SoilRYLFALPGVPHELRSIYEDIEIRYLSGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVPLPPP*
Ga0066688_1032625113300005178SoilRYLFALPGVPHELRAVYDDIEVRYLGGGQADVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPEQVEAVISAIRERVTQYAPIG*
Ga0066684_1081134923300005179SoilALPGVPHELRAVYEDVELRYLTGGRADVVRELHYRMAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDAQHVEAVIKAIRERITQYAPAG*
Ga0066671_1037106713300005184SoilGSHGDAVRELRYRLAPESLFHDVMRDLEGEYPDVSLGSYPQTETRELILRASGPDTDRVEAVINAIRDRVRQFSPLGS*
Ga0066676_1058112123300005186SoilDAVRELRYRLAPESLFHDVMRDLEGEYPDVSLGSYPQTETRELILRASGPDTDRVEAVINAIRDRVRQFSPLGS*
Ga0066689_1018885333300005447SoilQGRYLFALPGVPHELRAVYGDIEVRYLGGGQADVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPEQVEAVISAIRERVTQYAPIG*
Ga0066689_1047708623300005447SoilGRYLFALPGVPHELRSIYEDIEIRFLSGGHGDVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLE*
Ga0066682_1013671033300005450SoilGRYLFALPGVPHELRSIYEDIEIRFLSGGHGDVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR*
Ga0066682_1022490033300005450SoilELRSIYEDIEIRYLSGGQADIVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGLSVERVEAVIAAIRDRVKQYQPVA*
Ga0066682_1084127413300005450SoilPHELRSVYEDIESRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDADRVESVINAIRDRVRQFLPLGS*
Ga0066687_1026681833300005454SoilYLTGSHGDAVRELRYRLAPESLFHDVMRDLEGEYPDVSLGSYPQTETRELILRASGPDTDRVEAVINAIRDRVRQFSPLGS*
Ga0066687_1031554833300005454SoilVPHELRTVYEDIEAIYLTGGQADIVGELHYRLAPESMFHDVMASLEREYPDVSLGSYPQTESRELIIRASGPDAQRVGAVLQAIRARVTQYSPLS*
Ga0066687_1040910713300005454SoilVPHELRAVYEDIERTYLTGGRGDVVRELHYRLAPESMFHDVMQSLEAEYPDVSLGSYPQTETRELILRASGPSPEHVDAVIAAIRDRVKQFQPVA*
Ga0066697_1011246043300005540SoilGDVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR*
Ga0066701_1072521223300005552SoilRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPVA*
Ga0066692_1077823313300005555SoilERADVVRELHYRLAPESMFHDVMQTLEAEYPDVSLGSYPQTETRELILRASGPNPERVDAVIAAIRERITQYQPVA*
Ga0066707_1023486733300005556SoilEIRYLSGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVP*
Ga0066704_1080203723300005557SoilVPHELRSVYEDIESRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTQTRELILRASGPDADRVEAVITAIRDRVRQFSPLGS*
Ga0066698_1004484013300005558SoilELRSIYEDIEIRYLSGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVP*
Ga0066698_1013312333300005558SoilPGLAFDLDQGRYLFALPGVPHELRAIYEDIEIRYLSGGQADVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPNVERVEAVIAAIRNQVSQYTPLH*
Ga0066700_1004468613300005559SoilVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPEQVEAVISAIRERVTQYAPIG*
Ga0066699_1102203313300005561SoilVYEDIESRYLTGEQGDVVRELRYRLAPESMFHDVMKELEREYPDVSLGSYPQTETRELILRASGPNRERVEAVIAAIRDRVRQYAPVG*
Ga0066693_1031167613300005566SoilPGVPHELRSVYEDIESRYLTGSHGDAVRELHYHLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDADRVESVINAIRDRVRQFSPLGS*
Ga0066703_1001889853300005568SoilELHYRLAPESMFHDVMQSLEVEYPDVSLGSYPQTETRELILRASGPNPQHVDAVIAAIRDRVKQFQPVA*
Ga0066703_1007102413300005568SoilYEDIESRYLTGEQGDVVRELRYRLAPESMFHDVMKELEREYPDVSLGSYPQTETRELILRASGPNRERVEAVIAAIRDRVRQYAPVG*
Ga0066702_1005396813300005575SoilYDDIERSYLSGERADVVRELHYRLAPESMFHDVMQTLEAEYPDVSLGSYPQTETRELILRASGPNPERVDAVIAAIRERITQYQPVA*
Ga0066702_1029947833300005575SoilVYEDIELRYLAGAQADVVRELHYRLAPESMFHDVMQALEREYPDVSLGSYPQTETRQLILRASGPNPERVESVIRAIRDRVTQYTPIS*
Ga0066702_1032826233300005575SoilRSVFEQEIEPRFLLGSQADAVAELHYASAPESALHDVMQALEREYPDVSLGSYPQTERRELILRATGPDPARVKAVLKAIRIRMTRYTPLEA*
Ga0066652_10078878313300006046SoilGVPHELRAVYEDIEVRYLTGSRGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDSERVDAVIKAIRDRIRQYSPVG*
Ga0066658_1054603323300006794SoilGAGMAPGLACELGQGRLLFALPGVPHELRTVYDDIEAIYLYGGTADTVRELRYRLAPESMFHDAMQALEGEFPDVSLGSYPQTETRELIIRASGPDRTRVDAVIHAIRERVTQYSPLI*
Ga0066658_1088866423300006794SoilAVRELHYRLAPESMFHDVMRDVEREYPDVSLGSYPQTQTRELILRASGPDADRVEAVITAIRDRVRQFSPLGS*
Ga0066659_1158486713300006797SoilADVVRELHYRLAPESMFHDVMQTLEAEYPDVSLGSYPQTETRELILRASGPNPERVDAVIAAIRERITQYQPVA*
Ga0079219_1141143623300006954Agricultural SoilARYLTGSHGDAVRELRYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDADRVEAVINAIRDRVRQFSPLG*
Ga0099828_1182304313300009089Vadose Zone SoilHELRSVYEDIEARYLLGGRADVVRELHYRLAPESMFHDVMRDLETEYPDVSLGSYPQTETRELILRASGPDPERVEAVIKAIRDRVRQYSPVG*
Ga0099827_1001169413300009090Vadose Zone SoilRADVVRELHYRMAPESMFHDVMQALEREFPDVSLGSYPQTETRELILRASGPDAVRVEAVIRAIRERITQYAPVG*
Ga0099827_1039087813300009090Vadose Zone SoilVYDDIEVRYLGGGQADVVRELHYRLAPESMFHDVMQDLEREYPDVSLGSYPQTETRELILRASGPDPGRVEAVIEAIRDRVRQFTPIS*
Ga0066709_10201959413300009137Grasslands SoilLFALPGVPHELRSIYEDIEIRYLSGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVP*
Ga0127464_113111713300010139Grasslands SoilRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR*
Ga0134082_1015638133300010303Grasslands SoilGQGDVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR*
Ga0134064_1002534913300010325Grasslands SoilELRAIYEDIEIRYLSGGQADVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILLASGPDVDRVDAVITAIRNRISQFTPLR*
Ga0134064_1030707823300010325Grasslands SoilLAFDLGQGRYLFALPGVPHELRSIYEDIEIRYLSGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVP*
Ga0134063_1063634613300010335Grasslands SoilPDLDVLVDRPKLVWHARYLFALPGVPHELRSIYEDIEIRYLSGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVPLPPP*
Ga0134066_1045264813300010364Grasslands SoilLIRNGAGMAPGLAFDLDQGRYLFALPGVPHELRSIYEDIEIRFLSGGQGDVVRELHYRLAPESMFHDVMQALEHEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR*
Ga0137392_1032893913300011269Vadose Zone SoilYLLGGRGDVVRELHYRLAPESMFHDVMRDLEGEYPDVSLGSYPQTETRELILRASGPDPERVEAVINAIRDRVRQYSPIG*
Ga0137391_1113262313300011270Vadose Zone SoilLGGRGDVVRELHYRLAPESMFHDVMRDLEGEYPDVSLGSYPQTETRELILRASGLDPERVEAVINAIRDRVRQYSPVG*
Ga0137393_1096397123300011271Vadose Zone SoilPHELRSVYEDIEARYLLGGRADVVRELHYRLAPESMFHDVMRDLEVEYPDVSLGSYPQTETRELILRASGPDPERVEAVIRAIRDRVRQYSPVG*
Ga0120159_103877533300012014PermafrostVAGVPADLGSVDEAIQERYLLGEHGDVVRELRYQAVPESAFHDIMQALEAEFPDVSLGSYPQTETRELILRASGRTAERVEAVLTAIRQAMTQYRPVG*
Ga0137364_1117845623300012198Vadose Zone SoilVRNGAGMAPGLAFDLAQDRYLFALPGVPHELRAVYDDIEVRYLGGGQADVVRELHYRMAPESMFHDVMRHLEREYPDVSLGSYPQTETRELILRASGPNRERVESVVRAIRDRVTQYAPVS*
Ga0137383_1089409023300012199Vadose Zone SoilRYLTGSRGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDSERVDAVIKAIRDRIRQYSPVG*
Ga0137380_1177009413300012206Vadose Zone SoilFALPGVPHELRSVYEDIEVRYLTGSRGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDSERVDAVIKAIRDRIRQYSPVG*
Ga0137376_1171615823300012208Vadose Zone SoilAGMAPGLAFDLGQDRYLFALPGVPHELRSVYEDIERRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDADRVESVINAIRDRVRQFLPLGS*
Ga0137387_1007585413300012349Vadose Zone SoilDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETGELILRASGPDSERVDAVIKAVRDRIRQYSPVG*
Ga0137371_1134923023300012356Vadose Zone SoilQGDVVRELYYRLAPESMFHDVMQALEHEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPIR*
Ga0134039_108616033300012374Grasslands SoilGMAPGLAFDLDQGRYLFALPGVPHELRSIYEDIEIRFLSGGDGDVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLE*
Ga0134040_108779113300012389Grasslands SoilAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDTDRVEAVINAIRDRVRQFSPLGS*
Ga0137395_1034961433300012917Vadose Zone SoilAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDADRVEAVINAIRDRVRQFSPLGA*
Ga0134110_1012466113300012975Grasslands SoilSRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTQTRELILRASGPDADRVEAVITAIRDRVRQFSPLGS*
Ga0134087_1012834313300012977Grasslands SoilHELRSVYEDLESRYLTGSHGDAVRELRYRLAPESLFHDVMRDLEGEYPDVSLGSYPQTETRELILRASGPDTDRVEAVINAIRDRVRQFSPLGS*
Ga0120154_114980923300013501PermafrostGVPAELRSVYEAIQERYLLGEHGDVVRELRYQAVPESAFHDIMQALEAEFPDVSLGSYPQTETRELILRASGRTAERVEAVLTAIRQAMTQYRPVG*
Ga0120125_109339323300014056PermafrostGAGMAPGLAFELANGRYLFALPGVPAELLSVYEAIQERYLLGEHGDVVRELRYQAVPESAFHDIMQALEAEFPDVSLGSYPQTETRELILRASGRTAERVEAVLTAIRQAMTQYRPVG*
Ga0137420_135732113300015054Vadose Zone SoilLAFDLGQDRYLFALPGVPHELRSVYEDIEGRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSFPQTETRELILRASGPDADRVEAVITAIRDRVRQFSPLGS*
Ga0137409_1160210413300015245Vadose Zone SoilIEARYLTGSHGDTVRELRYRQAPESMFHDVMRDLERDYPDVSLGSYPQTETRELILRASGPNVERVEAVINAIRDRIRQFSPVG*
Ga0066662_1024720213300018468Grasslands SoilDGGTADTVRELRYRLAPESMFHDAMQALEAEFPDVSLGSYPQTETRELIIRASGPDRMRVDAVIEALRERVTQYAPLK
Ga0215015_1014142713300021046SoilRDVERSRGLGDVYKRQGLAFDLGQDRYLFALPGVPHELRAVYEDIERSYLTGARADVVRELHYRLAPESMFYDVMQTLEAEYPDVSLGSYPQTETRELILRASGPNPERVDAVIAAIRDRVKQYQPVT
Ga0215015_1048266113300021046SoilVGSEMCIRDRHYKMAPESMFHDVMQALEKEFPDVSLGSYPQTETRELILRASGQDAQRVDAVIKAIRDRVTQYAPVG
Ga0213882_1039987323300021362Exposed RockARPDTVRELHYRLAPESMFYDVMDALEREYPDVSLGSYPQTEARELIIRASGPDPARVDAVLRAVRERVSQYSPLG
Ga0210410_1171019113300021479SoilYEDVELRYLTGALADVVRELHYRMAPESMFHDVMKALEQEFPDVSLGSYPQTESRELILRASGPDAGRVEAVIQAIRDRVTQYAPVE
Ga0207646_1011755843300025922Corn, Switchgrass And Miscanthus RhizosphereELRAVYEDVELRYLTGARADVVRELHYRMAPESMFHDVMQALEQEFPDVSLGSYPQTETRELILRASGPDAVRVEAVIHAIRERITQYAPVG
Ga0207665_1056695813300025939Corn, Switchgrass And Miscanthus RhizosphereLELRYLTGARADVVRELHYRMAPESMFHDAMQALEREYPDVSLGSYPQTESRELILRASGPDAQRVEAVIQALRDRVTQYAPVG
Ga0209688_102809613300026305SoilDLGQGRYLFALPGVPHELRSIYEDIEIRYLSGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVPLPPP
Ga0209055_118946613300026309SoilLGQDRYLFALPGVPHELRSVYEDIESRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTQTRELILRASGPDADRVEAVITAIRDRVRQFSPLGS
Ga0209268_110949323300026314SoilELRSVYEDIESRYLTGSHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDADRVESVINAIRDRVRQFLPLGS
Ga0209470_123830313300026324SoilTGSHGDAVRELRYRLAPESLFHDVMRDLEGEYPDVSLGSYPQTETRELILRASGPDTDRVEAVINAIRDRVRQFSPLGS
Ga0209152_1046251923300026325SoilAGMAPGLACELGQGRLLFALPGVPHELRTVYDDIEAIYLYGGTADTVRELRYRLAPESMFHDAMQALEGEFPDVSLGSYPQTETRELIIRASGPDRTRVDAVIHAIRERVTQYSPLI
Ga0209801_116456813300026326SoilGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPDSERVDAVIKAIRDRIRQYSPVG
Ga0209801_134565413300026326SoilIEVRYLGGGQADVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPEQVEAVISAIRERVTQYAPIG
Ga0209266_116806823300026327SoilFALPGVPHELRSIYEDIEIRFLSGGQGDVVRELHYRLAPESMFHDVMQALEHEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR
Ga0209375_103746323300026329SoilLALPGVPHELRSIYEDIEIRYLSGGQADIVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGLSVERVEAVIAAIRDRVKQYQPVA
Ga0209806_125859213300026529SoilARADVVRELHYRMAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDAQHVEAVIKAIRERITQYAPAG
Ga0209806_126622623300026529SoilYEDIESRYLTGEQGDVVRELRYRLAPESMFHDVMKELEREYPDVSLGSYPQTETRELILRASGPNRERVEAVIAAIRDRVRQYAPVG
Ga0209058_106061443300026536SoilYEDIEIRFLSGGQGDVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR
Ga0209157_111344413300026537SoilPGVPHELRSIYEDIEIRYLSGGQADIVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGLSVERVEAVIAAIRDRVKQYQPVA
Ga0209157_135152723300026537SoilIRFLSGGQGDVVRELHYRLAPESMFHDVMQALEQEYPDVSLGSYPQTETRELILRASGPDVDRVDAVITAIRNRISQFTPLR
Ga0209056_1021504833300026538SoilGGHADVVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVP
Ga0209056_1052050713300026538SoilVRNGAGMAPGLAFELEQDRYLFALPGVPHELRAVYDDIEVRYLGGGQADVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPEQVEAVISAIRERVTQYAPI
Ga0209161_1017948213300026548SoilVRELHYRLAPESMFHDVMKALGEEYPDVSLGSYPQTETRELILRASGPSVERVDAVIAALRDRVTQYQPVP
Ga0209527_101602933300027583Forest SoilDLGDGRYLFALPGVPHELRAVYEDLEVRYLTGARADVVRELHYKMAPESMFHDVMQALEREFPDVSLGSYPQTETRELILRASGPDAQRVEAVIKAIRDRVTQYAPVA
Ga0209118_100603673300027674Forest SoilHELRSVYEDIEARYLFGGHGDAVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGQNPQRVEAVIQAIRDRVQQYSPVG
Ga0209689_134610623300027748SoilRAVYGDIEVRYLGGGQADVVRELHYRLAPESMFHDVMRDLEREYPDVSLGSYPQTETRELILRASGPNPEQVEAVISAIRERVTQYAPIG
Ga0209580_1068136523300027842Surface SoilALPGVPHELRAVYEDIERTYLTGSRADVVRELHYRLAPESMFHDVMQTLEAEYPDVSLGSYPQTETRELILRASGPNPERVDAVIAAISERVKQYQPVG
Ga0209283_1024496113300027875Vadose Zone SoilVRELHYKMAPESMFHDVMQALEQEFPDVSLGSYPQTETRELILRASGPDAQRVEAVINAIRDRVTQYAPVA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.