NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F104906

Overview

Basic Information
Family ID: F104906
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 100
Average Sequence Length: 77 residues
Representative Sequence: VRGTVVVAFTVQSLYADTAGLKTVQNTQGPFADTRLVTVPSKFPDTTVETVALREVDALLADIVRWIGGGQ
Number of Associated Samples: 85
Number of Associated Scaffolds: 100

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 0.00 %
% of genes from short scaffolds (< 2000 bps): 0.00 %
Associated GOLD sequencing projects: 80
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.38


Most Common Taxonomy
Group: Unclassified (100.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (29.000 % of family members)
Environment Ontology (ENVO): Unclassified (32.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (44.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 24.24%, β-sheet: 12.12%, Coil/Unstructured: 63.64%

Predicted 3D Structure

pTM-score: 0.38

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF05343 | Peptidase_M42 | 80.00
PF02142 | MGS | 8.00
PF11028 | DUF2723 | 3.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG1362 | Aspartyl aminopeptidase | Amino acid transport and metabolism [E] | 80.00
COG1363 | Putative aminopeptidase FrvX | Carbohydrate transport and metabolism [G] | 80.00
COG2195 | Di- or tripeptidase | Amino acid transport and metabolism [E] | 80.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 29.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 21.00%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 9.00%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 8.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.00%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 6.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.00%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 3.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.00%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.00%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 2.00%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.00%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.00%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.00%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.00%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.00%
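
The habitat table lists each GOLD ecosystem path at its deepest level. As a rough illustration of viewing the same distribution at a coarser taxonomy level, the sketch below sums the percentages over the first few components of each path. The `HABITATS` list is transcribed by hand from the table above, and the `aggregate` helper is hypothetical; how the site itself groups rows per level is not documented here, so treat this as an assumption.

```python
from collections import defaultdict

ARROW = " → "

# (GOLD ecosystem path, % of family members), transcribed from the habitat table.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 29.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 21.0),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 9.0),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 8.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 6.0),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 6.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 5.0),
    ("Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment", 3.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 3.0),
    ("Environmental → Terrestrial → Soil → Loam → Grasslands → Soil", 2.0),
    ("Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand", 2.0),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 1.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil", 1.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil", 1.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 1.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil", 1.0),
    ("Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment", 1.0),
]


def aggregate(rows, level):
    """Sum percentages over the first `level` components of each ecosystem path."""
    totals = defaultdict(float)
    for path, pct in rows:
        key = ARROW.join(path.split(ARROW)[:level])
        totals[key] += pct
    return dict(totals)


if __name__ == "__main__":
    # Print the level-2 view, largest share first.
    for key, pct in sorted(aggregate(HABITATS, 2).items(), key=lambda kv: -kv[1]):
        print(f"{key}: {pct:.2f}%")
```

At level 2 the 17 rows collapse to three groups: Environmental → Terrestrial (82%), Environmental → Aquatic (12%), and Host-Associated → Plants (6%).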




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005438 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009804 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40 | Environmental
3300010320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015 | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental
3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019255 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome) | Environmental
3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1 | Environmental
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2 | Environmental
3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental
3300021972 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2m2 | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental
3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027490 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300033407 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175 | Environmental
3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25381J37097_10685391 | 3300002557 | Grasslands Soil | DTAGLNAVKALQGPFTESRQVSVPVKFADTAVETVTVRDIAALASDIVRWMGGSQ*
Ga0066674_102206301 | 3300005166 | Soil | VQSLYADTAGLKTVVNLQGPFTDARMVTVPSKFGETTVETVALKDVDALVADVVQWMGGAR*
Ga0066677_105091992 | 3300005171 | Soil | PRVTGTVVVAFAVQSLYADTAGLKTVVNVQGPFTESRLVTVPSKFPETMVETVGLKDVDALRADIVKWIGGSQ*
Ga0066683_108428712 | 3300005172 | Soil | NAGRRAACAALASAALSRPRVSGTVVVAFTVQSLYADTAGLKTIVNLQGPFTETRQLSVPARFPDTTVETVSLRDVDALATDVVKWMGGGR*
Ga0066679_108985641 | 3300005176 | Soil | LSRPRVRGSVVVAFTVQSLYADTAGLRTVLSTQGPFAGARPVTVPSKFAETTVETVALKDVEALVADVMRWIGGGQ*
Ga0066684_107711832 | 3300005179 | Soil | LYADTAGLNAVKALQGPFAESRQVSVPIKFADTAVETVTVRDIQALVTDVVRWIGGSQ*
Ga0066676_101973731 | 3300005186 | Soil | RGTVVVAFTVQSLYADTAGLKTVQNLQGPFEVTRMVTLVSSYRETAVETVALRDVDALVADLVRWIGGGQ*
Ga0070701_111629361 | 3300005438 | Corn, Switchgrass And Miscanthus Rhizosphere | VAFTVQGLYADSAGLKTVQNLQGPFAESRLVSVPARFADTAVETVALGDVEALKADIVRWIGGGQ*
Ga0070705_1002836892 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | RPRVSGTVVVAFAVQTLYADSAGLKTVLNLQGPFSDTRMVTVPSKFGETTVETVAMKDVSALMADVVKWMGGGQ*
Ga0070694_1002624892 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | AALASAALARPRVSGTVVVAFAVQTLYADSAGLKTIVNLQGPFSDTRMVTVPSKFGETAIETVALKDVVALTADVVNWIGGAR*
Ga0066686_104109541 | 3300005446 | Soil | VVVAFTVQSLYADSAGLKTVVNLQGPFSDARMVTVPSKFGDTTVETVALKDVDALVADVVRWIGGGQ*
Ga0066681_104574961 | 3300005451 | Soil | PNAGRRAACAALASAALSHPRVTGTVVVAFAVQSLYADTAGLKTVVNVQGPFTESRFVTVPSKFPETMVETVGLKDVDALRADIVKWIGGSQ*
Ga0070699_1013573962 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | LASAALSKPRARGSVVVAFTVQGLYADTAGLKTVHALQGPFDASRQVSLPVKFTDTAVETVTLRDVDALVTDILRWMGGGQ*
Ga0070697_1014539491 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LSKPRARGTVVVAFTVQGLYADSAGLKTVQNLQGPFAESRLVSVPARFADTAVETVALGDVEALKADIVRWIGGGQ*
Ga0070695_1016171001 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | LLAAPYAGRRAACAALAAAVLSKPRVRGTVVVAFTVQSLYADSAGLKAVQNLQGPFEVTRMVNVFSEYRETAVETVALKDVDALVADLVRWIGGGQ*
Ga0070696_1005632011 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | AACAALASAALARPRVSGTVVVAFAVQTLYADSAGLKTIVNLQGPFAEARLVSVPSKFPDTTVETVGLKDVDALRADIVKWIGGSQ*
Ga0070704_1000161621 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | ASAALSKPRARGTVVVAFTVQGLYADSAGLKTVQNLLGPFAESRLVSVPARFADTAVETVALGDVEALKADIVRWIGGGQ*
Ga0070704_1009066012 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | GTVVVAFTVQSLYADTAGLKTIVHLQGPFSEARGVTVPSMYGETTVETVALKDVSAVMADVVKWMGGAQ*
Ga0066695_104839111 | 3300005553 | Soil | AGRRAACAALASAALSKPRVRGTVVVAFTVQSLYADSAGLKTLQNTQGPFADSRLVMVQSKFPDTTVETVALREVDALVADIVRWIGGGQ*
Ga0066707_101113763 | 3300005556 | Soil | VQSLYADTAGLKTVVNLQGPFTESRLVTVPSTFLETTVETVGLKDVDALKADIVKWIGGSQ*
Ga0066704_100509953 | 3300005557 | Soil | NAGRRAACAALASAALSKPRARGSVVVAFTVQGLYADTAGLNAVKALQGPFDESRQVSVPVKFADTAVETVALRDVDALVTDIVRWIGGSQ*
Ga0066704_104119111 | 3300005557 | Soil | NAGRRAACAALASAALSKPRARGSVVVAFTVQGLYADTAGLNAVNALQGPFDESRQVSLPVKFADTAVETVALRDVDALVADIVRWIGGGQ*
Ga0066698_107684531 | 3300005558 | Soil | AALSKPRVRGTVVVAFTVQSLYADSAGLKTLQNTQGPFADSRLVMVQSKFPDTTVETVALREVDALVADIVRWIGGGQ*
Ga0066699_106789611 | 3300005561 | Soil | ADTAGLNTVKALQGPFAESRQLSLSVKFADTAVETVTLHDVDALVSDVVRWIGGGQ*
Ga0066691_101572122 | 3300005586 | Soil | TVQSLYADTAGLRTVLSTQGPFAGARPVTVPSKFAETTVETVALKDVEALVADVMRWIGGGQ*
Ga0066706_108320091 | 3300005598 | Soil | AFTVQGLYADTAGLNAVKALQGPFDESRQVSLPVKFADTAVETVALRDVDALVSDIVHWIGGGQ*
Ga0066653_100303324 | 3300006791 | Soil | VAFTVQSLYADTAGLKTIVNLQGPFTETRQLSVPARFPDTTVETVSLRDVDALATDVVKWMGGGR*
Ga0066665_115887341 | 3300006796 | Soil | AGRRAACAALVSAALSRPRVRGTVVVAFTVQSLYADTAGLKTLLNLQGPFTENRLVSVPSKFPDTAVETVGLRDVDALTADIVRWMGGGQ*
Ga0079220_112837391 | 3300006806 | Agricultural Soil | RGTVVVAFTVQSLYADTAGLKTIVHLQGPFSEARGVTVPSMYGETTVETVALKDVSALMADVVKWMGGGQ*
Ga0075425_1012276301 | 3300006854 | Populus Rhizosphere | ALSKPRVRGTVVVAFAVQSLYADTAGLKTIVHLQGPFTESRLVTVPSKFPETTVETVGLKDVADVTADIVKWIGGSQ*
Ga0075425_1026240631 | 3300006854 | Populus Rhizosphere | ALSKPRVRGTVVVAFAVQSLYADTAGLKTIVHLQGPFSEARVVTVPSMYGETTVETVALKDVAGVTADIVKWIGGSQ*
Ga0075434_1002863623 | 3300006871 | Populus Rhizosphere | TVVVAFTVQSLYADSAGLKTVMNLQGPFTETRLVTVPSKFPETTVETVGLRDVDGLLSDIVKWIGGSQ*
Ga0075424_1002113191 | 3300006904 | Populus Rhizosphere | RGTVVVAFTVQSLYADTAGLKAIVHLQGPFTESRLVTVPSKFPETTVETVGLKDVADVTADIVKWIGGLQ*
Ga0075424_1002172651 | 3300006904 | Populus Rhizosphere | ALAAAALSKPRVRGTVVVAFTVQSLYADSAGLKTVMNLQGPFADTRLVTVPSKFPETTVETVGLRDVDGLLSDIVKWIGGSQ*
Ga0099791_103719982 | 3300007255 | Vadose Zone Soil | FTVQSLYADTAGLKTVQNTQGPFADTRLVTVPSKFPDTTVETVALREVDALMADIVRWIGGGQ*
Ga0099792_106412271 | 3300009143 | Vadose Zone Soil | QTKRPHRYGDHAGLLAAPFAGRRSACAVLAAAALSRPRVNGTVVVAFTVQSLYADSAGFKTVVNLQGPFTESRLVTVQSKFPDTTVETVALREVDALVADIVRWIGGSQ*
Ga0075423_118523301 | 3300009162 | Populus Rhizosphere | NAGRRAACAALAAAALSKPRVRGTVVVAFTVQSLYADSAGLKTVMNLQGPFTETRLVTVPSKFPETTVETVGLRDVDGLLSDIVKWIGGSQ*
Ga0105063_10310872 | 3300009804 | Groundwater Sand | VRGTVVVAFTVQSLYADSAGLKTVQNLQGPFEVTRFVNLTSNYTGTAVETVSLRDVATLAADLVRWLGGTP*
Ga0134109_100824731 | 3300010320 | Grasslands Soil | LYADMAGLKTVVNLQGPFTDARMVSVPSRFGETTVETVALKDVDALVADVVQWMGGAR*
Ga0134062_101779862 | 3300010337 | Grasslands Soil | AAALAKPRVRGSVVVAFTVQGLYADTAGLNTVRNLQGPFDESRQVSVPVKYGDTAVETVALSDVEALVKDVVQWIGGSR*
Ga0134127_120195951 | 3300010399 | Terrestrial Soil | NAGRRSACAALAAAALSRPRVRGTVVVAFTVQSLYADSAGLKAVENLQGPFESARDVTVGSRFPDTAVETVSLRDVSALVAELSRWMGQ*
Ga0137392_113680341 | 3300011269 | Vadose Zone Soil | AGCAALASAALSRPRVRGTVVVAFTVRSLYADTAGLKTVLNLQGPFAESRQISMPSKFLDTTVETVGLKDVDALMADLVKWLGGS*
Ga0137382_101737431 | 3300012200 | Vadose Zone Soil | RGTVVVAFTVQSLYADTAGLKTVVNTQGPFADARMVAVPSKFGETTVETVALKDVDALVADVVKWLGGGQ*
Ga0137382_103006842 | 3300012200 | Vadose Zone Soil | MLAFTVQSLYADTAGLKTIVNLEGPFTEARMVTVTSKFGETMVETVALKDVDAVVADVVKWIGGSQ*
Ga0137363_109469601 | 3300012202 | Vadose Zone Soil | NGLLAAPNAGRRAACAALASAALARPRVRGTVVVAFTVQSLYADTAGLKTVVNLQGPFTDARMVTLPSKFPDTPVETVSLPDVDALIADVVKWIGGSQ*
Ga0137399_112362171 | 3300012203 | Vadose Zone Soil | AALASAALSKPRVRGTVVVAFTVQSLYADTAGLKTVQNTQGPFADTRLVTVPSKFPDTTVETVALREVDALLADIVRWIGGGQ*
Ga0137399_115416732 | 3300012203 | Vadose Zone Soil | LAKPRVRGTVVIAFTVQSLYATSAGLNSVKALQGPFDDTRELTVPAKFSETAVETVALADVEALVRDVVQWIGGGR*
Ga0137362_114106512 | 3300012205 | Vadose Zone Soil | AAPYAGRRAACAALAAAVLSKPRVRGTVVVAFTVQSLYADTAGLKTVQNLQGPFEVTRLVTLVSSYRETAVETVALRDVDALVADLVRWIGGGQ*
Ga0137362_116271272 | 3300012205 | Vadose Zone Soil | VVAFTVQSLYADTAGWKAVRNELGPFTEERAVTLQSRFVDTAVETVSLRDVDALVADLVRWIRGSQ*
Ga0137380_114410631 | 3300012206 | Vadose Zone Soil | PRVRGTVVVAFTVQSLYADTAGLKTIVNTQGPFADTRLVTVPSKFTETTVETVALKDVDALVADVVKWIGGAR*
Ga0137376_105725051 | 3300012208 | Vadose Zone Soil | GTVVVAFTVQSLYADTAGLKTVENLQGPFTDTRMVTVPAKFGETTVETVALKDVEALMADVVKWIGGGQ*
Ga0137376_110385811 | 3300012208 | Vadose Zone Soil | RRAACAALASAALSRPRVSGTVVVAFTVQSLYADTAGLKTVVNLQGPFTDARMVSVPSRFGETTVETVALKDVSALMADVVQWMGGAR*
Ga0137379_118126701 | 3300012209 | Vadose Zone Soil | RRAACAALASAALSRPRVSGTVVVAFTVQSLYADSAGLKTVVNLQGPFTDTRMVTVPSKFGETTVETVALKDVSALMADVVKWMGGGQ*
Ga0137377_114575531 | 3300012211 | Vadose Zone Soil | KPRVRGTVVVAFTVQSLYADSAGLKTVQNTQGPFADSRQVTVPSKFPDTSVETVSLRDVDALLADVVRWIGGSQ*
Ga0137369_102851582 | 3300012355 | Vadose Zone Soil | LYADTAGLKTVQNLQGPFTDSRMVTVPSKFGETTVETVALKDVDALVADLVKWIGGGQ*
Ga0137371_103259551 | 3300012356 | Vadose Zone Soil | RLLAALNAGRRAACAALASAALSRPRVRGSVVVAFTVQSLYADTAGLKTIVNTQGPFADTRLVTVPSKFTETTVETVALKDVDALVADLVKWMGGGQ*
Ga0137384_114881331 | 3300012357 | Vadose Zone Soil | ACAAALASAALSRPRVNGTVVVAFAVQSLYADTAGLKTVVNLQGPFTESRLVTVPSTFLETTVETVGLQDVDALKADIMNWLRGSQ*
Ga0137390_104323432 | 3300012363 | Vadose Zone Soil | AFTVQSLYADTAGLKTVQNLQGPFEVTRIVTLVSSYRETAVETVALRDVDALVADLVRWIGGGQ*
Ga0137396_103866202 | 3300012918 | Vadose Zone Soil | VRGTVVVAFTVQSLYADTAGLKTVQNTQGPFADTRLVTVPSKFPDTTVETVALREVDALLADIVRWIGGGQ*
Ga0137413_114471692 | 3300012924 | Vadose Zone Soil | RAACAALAAAALSKPRVRGSVVVAFTVQSLYADTAGSKAVRNELGPFTEERAVTLQSRFVDTAVETVSLRDVDALVADLVRWIGGSQ*
Ga0137419_100220704 | 3300012925 | Vadose Zone Soil | AACAALASAAVARPRVRGTVVVAFTVQSLYADTAGLKTVVNLQGPFTEARMVTLPSKFPDTPVETVSLPDVAALVADVVKWIGGSQ*
Ga0137419_110224752 | 3300012925 | Vadose Zone Soil | YGDRLLAAPVAGRRSACAALASAALSKPRVRGTVVVAFTVQSLYADTAGLKTVQNTQGPFADTRLVTVPSKFPDTTVETVALREVDALLADIVRWIGGGQ*
Ga0137419_114567562 | 3300012925 | Vadose Zone Soil | SLYADTAGLKTVVNLQGPFNEARMVTLPSKFPDTPVESVSLPDVDALMADVVKWIGGSQ*
Ga0137416_103494812 | 3300012927 | Vadose Zone Soil | RPRVNGTVVVAFTVQSLYADTAGLKTVVNLQGPFTDARLVTMPSKFVETTVETVTLKDVDALMADVVKWIGGGQ*
Ga0137416_121090691 | 3300012927 | Vadose Zone Soil | RAACAALASAALSRPRVNGTVVVAFAVQSLYGDTAGLKTVQNLQGPFTESRLVTVPSKFPETTVETVGLKDVDALLTDIVRWLGGSQ*
Ga0137416_121792562 | 3300012927 | Vadose Zone Soil | RPRVNGTVVVAFTVQSLYADTAGLKTVVNLQGPFTDARLVTIPSKFVETTVETVTLKDVDALMADVVKWIGGSQ*
Ga0134075_101541751 | 3300014154 | Grasslands Soil | APVRGTVVVAFTVQSLYADTAGLKTVVNTQGPFSESRLVSVPSRFPETTVETVGLRDVDALMADIARWIGGGR*
Ga0134079_105613111 | 3300014166 | Grasslands Soil | TAALSKPRARGSVVVAFTVQGLYADTAGLNAVKALQGPFAESRQVSLPVKFADTAVETVTVRDIQALVSDIVRWMGGPQ*
Ga0137403_106154112 | 3300015264 | Vadose Zone Soil | VAFTVQSLYADSAGLKTVVNLQGPFTEARLVTVPSKFAETTVETVGLKDVDALRSDIVKWIGGSQ*
Ga0134074_10104563 | 3300017657 | Grasslands Soil | AAPNAGRRAACAALASAALSRPRVSGTVVVAFTVQSLYADTAGLKTIVNLQGPFTETRQLSVPARFPDTTVETVSLRDVDALATDVVKWMGGGR
Ga0184638_11574682 | 3300018052 | Groundwater Sediment | FTVQSLYADTAGLKTVKNLQGPFDDSRQVTVPVKFADTAVETVDLRDLAALMADIVRWIGGGQ
Ga0184626_100345971 | 3300018053 | Groundwater Sediment | PAAGRRSACAALASAALSKPRVRGTVVVAFTVQSLYADTAGLKTVKNLQGPFAETRQLSVPVKFGDTAVETVALKDVDALLADILRWIGGGQ
Ga0184626_101876631 | 3300018053 | Groundwater Sediment | LSKPRVRGTVVVAFTVQSLYADTAGLKTVKNLQGPFDDSRQVTVPVKFADTAVETVDLRDLAALMADIVRWIGGGQ
Ga0184637_101838822 | 3300018063 | Groundwater Sediment | TVVVAFTVQSLYAANAGLNSVKALQGPFGDTREVTIPVKFEATAVETVALRDVEALMADLARWIGGGQ
Ga0184618_100113231 | 3300018071 | Groundwater Sediment | LAAAALSRPRVQGTVVVAFTVQSLYADTAGLKTVKNLQGPFAAAREAILTTRYTGTAVETVALRDVEALVADLARWIGAPR
Ga0184633_102086961 | 3300018077 | Groundwater Sediment | AACAALATAALSKPPRVRGTVVVAFTVQSLYAANAGLNSVKALQGPFDDTREVTIPVKLEATAVETVALRDVEALVADLARWIGGGQ
Ga0184627_103388632 | 3300018079 | Groundwater Sediment | QSLYAANAGLNSVKALQGPFDDTREVTIPVKFEATAVETVALRDVEALMADLARWIGGGQ
Ga0184639_103321651 | 3300018082 | Groundwater Sediment | VRGTVVLAFTVQSLYAANAGLNSVKALQGPFGDTREVTIPVKLEATAVETVALRDVEALVADLARWIGGGQ
Ga0066655_104120652 | 3300018431 | Grasslands Soil | AACAALASAALSRPRVSGTVVVAFTVQSLYADSAGLKTVVNLQGPFTEARLVTVPSKFPDTTVETVGLKDVDALRADLVRLIGGSQ
Ga0066669_107276261 | 3300018482 | Grasslands Soil | YADTAGLNSVKNLQGPFDEARQVSVPVRFGDTAVETVGFRDVDAVVTDIVRWIGGSQ
Ga0184643_11094971 | 3300019255 | Groundwater Sediment | QPRVRGTVVVAFTVQSLYAVNAGLNTVKNLQGPFDDTRELTIPVKFPDTAVETVALSDVEALVRDVVQWIGGGR
Ga0184643_11611331 | 3300019255 | Groundwater Sediment | GDRAGFLAAPNAGRRSACAALASAALSRPRVTGTVVVAFTVQSLYADTAGLKTIVNLQGPFTESRLVTVPSKFGETTVETVALKDVDALMADVVKWIGGGQ
Ga0193722_10221543 | 3300019877 | Soil | NTGRRAACAALASAALARPRVRGTVVVAFTVQSLYADTAGLKTVVNLQGPFTEARMVTVPSKFVETTVETVALKDVDALVADVVKWIGGSQ
Ga0193713_10432592 | 3300019882 | Soil | RFCTAFGTVVVAFTVQSLYADTAGLKAVQNLQGPFEVTRMVTLVSESRETAAETVALRDVDALVADLVRWIGGGQ
Ga0210378_102625331 | 3300021073 | Groundwater Sediment | SRGRRSACAALAAAALSKPRVRGTVVIAFTVQSLYAANAGLNTIKNLQGPFDEARELTLAVKFADTAVETVALADADALIKDVVQWIGGGR
Ga0193737_10008551 | 3300021972 | Soil | RPLSRLRVQGTVVVAFTVQSLYADTAGLKTVTNLQGPFAAAREAMLPTQYTGTAVETVALRDVEALVADLARWIGAPR
Ga0222622_106796631 | 3300022756 | Groundwater Sediment | PRVRGTVVVAFTVQSLYADSAGLKAVMNLQGPFESTRDVIVASRFPDTAVETVSLRDVNALVAELSRWMGQ
Ga0209239_10444753 | 3300026310 | Grasslands Soil | SVVVAFTVQGLYADTAGLNAVKALQGPFTESRQVSVPVKFADTAVETVALRDVDALVTDIVRWIGGSQ
Ga0209686_11679262 | 3300026315 | Soil | CAALAAAALSKPRARGSVVVAFTVQGLYADTAGLNAVKALQGPFAESRQVSVPVKFADTAVETVTARDIQALVTDVVRWMGGSQ
Ga0209471_11480902 | 3300026318 | Soil | ALSRPRVRGSVVVAFTVQSLYADTAGLRTVLSTQGPFAGARPVTVPSKFAETTVETVALKDVEALVADVMRWIGGGQ
Ga0209470_11002561 | 3300026324 | Soil | PRVRGTVAVAFTVQSLYADTAGLKTVQNLQGPFEVTRMVTLVSSYRETAVETVALRDVDALVADLVRWIGGGQ
Ga0209266_12380941 | 3300026327 | Soil | SGTVVVAFTVQSLYADTAGLKTIVNLQGPFTETRQLSVPARFPDTTVETVSLRDVDALATDVVKWMGGGR
Ga0209157_100289610 | 3300026537 | Soil | AFTVQGLYADTAGLNAVKALQGPFDESRQVSLPVKFADTAVETASLRDVAALVTDIVRWIGGSQ
Ga0209648_101003581 | 3300026551 | Grasslands Soil | RLLAAPNAGRRAACAALASAALARPRVRGTVVVAFTVQSLYADTAGLKTVVNLEGPFTEARMVTLPSKFPDTTVETVRLRDVDALMADVVKWIGGSQ
Ga0209648_102305491 | 3300026551 | Grasslands Soil | AFTVQGLYADTAGLNAVKALQGPFDESRQVSLPVKFADTAVETVSLRDIDALVTDIARWIGGGQ
Ga0209899_10986612 | 3300027490 | Groundwater Sand | RSRPRVRGTVVVAFTVQSLYADSAGLKTVQNLQGPFEVTRFVNLTSNYTGTAVETVSLRDVATLAADLVRWLGGTP
Ga0209701_100505283 | 3300027862 | Vadose Zone Soil | TAGLNTVKNVQGPFDESRQVSLPVKFADTAVETVTLHDVDALVMDIVRWIGGPQ
Ga0307473_112870731 | 3300031820 | Hardwood Forest Soil | GSVVVAFTVQGLYADTAGLNTVKNLQGPFDENRQISVPVKYGDTAVETVALSDVAALVQDVVQWIGGTR
Ga0214472_106732841 | 3300033407 | Soil | ARGTVVVAFTVQSRYAGNAGLNSVNALQGPFDETREVTLPVKFADTAVETVGLADADALVQDLVQWIGGGR
Ga0364930_0104365_666_881 | 3300033814 | Sediment | VRGTVVVAFTVQSLYADSAGLKTVQNLQGPFAETRLLSVPVKFADTAVETVVLRDVDALMTDIVRWIGGGQ
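
For downstream tools, rows of the family sequence table are often easier to handle as FASTA records. The following is a minimal sketch under stated assumptions: the two example rows are copied by hand from the table (protein ID, 10-digit sample taxon ID, habitat, sequence), the trailing `*` on some sequences is assumed to be a stop-codon marker and is stripped, and the header layout (`taxon_oid=`, `habitat=`) and the `clean` and `to_fasta` helpers are hypothetical, not an NMPFamsDB export format.

```python
# Two rows transcribed from the family sequence table:
# (protein ID, sample taxon ID, habitat, sequence).
ROWS = [
    ("Ga0137396_103866202", "3300012918", "Vadose Zone Soil",
     "VRGTVVVAFTVQSLYADTAGLKTVQNTQGPFADTRLVTVPSKFPDTTVETVALREVDALLADIVRWIGGGQ*"),
    ("Ga0184638_11574682", "3300018052", "Groundwater Sediment",
     "FTVQSLYADTAGLKTVKNLQGPFDDSRQVTVPVKFADTAVETVDLRDLAALMADIVRWIGGGQ"),
]


def clean(seq: str) -> str:
    """Drop a trailing '*' (assumed here to mark a stop codon)."""
    return seq.rstrip("*")


def to_fasta(rows, width: int = 60) -> str:
    """Render rows as FASTA text, wrapping sequence lines at `width` residues."""
    lines = []
    for protein_id, taxon_oid, habitat, seq in rows:
        # Header layout is an illustrative convention, not a database format.
        lines.append(f">{protein_id} taxon_oid={taxon_oid} habitat={habitat}")
        residues = clean(seq)
        lines.extend(residues[i:i + width] for i in range(0, len(residues), width))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_fasta(ROWS), end="")
```

The first example row is the member whose sequence matches the family's representative sequence once the trailing `*` is removed.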




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice