NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F096789

Overview

Basic Information
Family ID: F096789
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 104
Average Sequence Length: 60 residues
Representative Sequence: MLENEIDARRKREELKAKRNLLFKRFLKNPLDTRLALKIKTIDDQVAECTEQMERKRGRR
Number of Associated Samples: 66
Number of Associated Scaffolds: 104

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 100.00 %
% of genes near scaffold ends (potentially truncated): 4.81 %
% of genes from short scaffolds (< 2000 bps): 8.65 %
Associated GOLD sequencing projects: 61
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.57


Most Common Taxonomy
Group: Unclassified (87.500 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (23.077 % of family members)
Environment Ontology (ENVO): Unclassified (28.846 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (44.231 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary structure distribution: α-helix: 60.23%, β-sheet: 0.00%, Coil/Unstructured: 39.77%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.57

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
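
The confidence labels used in this section can be reproduced with two small helpers. This is only a sketch: the function names are hypothetical and not part of NMPFamsDB, the pLDDT bands are assumed to be the four ranges listed in the legend, and the 0.7 pTM cutoff is assumed to be the only criterion behind the low-quality flag.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four legend bands.

    Assumption: fractional values between two bands (for example 50.5)
    fall into the higher band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie in [0, 100]")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


def is_low_quality_model(ptm: float, cutoff: float = 0.7) -> bool:
    """Return True when the pTM-score is below the cutoff quoted on the page."""
    return ptm < cutoff


# Family F096789 reports a pTM-score of 0.57, which is below the cutoff.
print(is_low_quality_model(0.57))
print(plddt_band(65))
```

With the pTM-score of 0.57 reported for this family, `is_low_quality_model` returns `True`, which matches the low-confidence note above.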



Gene Neighborhood

Neighboring Pfam domains

| Pfam ID | Name | % Frequency in 104 Family Scaffolds |
| --- | --- | --- |
| PF00891 | Methyltransf_2 | 7.69 |
| PF07238 | PilZ | 4.81 |
| PF00903 | Glyoxalase | 1.92 |
| PF00221 | Lyase_aromatic | 1.92 |
| PF07920 | DUF1684 | 0.96 |
| PF04384 | Fe-S_assembly | 0.96 |
| PF01068 | DNA_ligase_A_M | 0.96 |
| PF00171 | Aldedh | 0.96 |
| PF00361 | Proton_antipo_M | 0.96 |
| PF07690 | MFS_1 | 0.96 |
| PF10996 | Beta-Casp | 0.96 |
| PF05649 | Peptidase_M13_N | 0.96 |
| PF00484 | Pro_CA | 0.96 |
| PF07589 | PEP-CTERM | 0.96 |
| PF02803 | Thiolase_C | 0.96 |
| PF00355 | Rieske | 0.96 |
| PF00589 | Phage_integrase | 0.96 |
| PF01740 | STAS | 0.96 |
| PF07715 | Plug | 0.96 |
| PF13466 | STAS_2 | 0.96 |
| PF00101 | RuBisCO_small | 0.96 |
| PF07228 | SpoIIE | 0.96 |
| PF04055 | Radical_SAM | 0.96 |
| PF14907 | NTP_transf_5 | 0.96 |
| PF00072 | Response_reg | 0.96 |
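
Every frequency in the Pfam table above is a share of the same 104 family scaffolds, so the underlying scaffold counts can be recovered by rounding. The sketch below uses a hypothetical helper, `scaffold_count`, and assumes that each percentage was rounded to two decimals from a whole-number count.

```python
FAMILY_SCAFFOLDS = 104  # total taken from the table header


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Recover the whole number of scaffolds behind a rounded percentage."""
    return round(percent / 100 * total)


# A few rows copied from the Pfam table.
rows = [("PF00891", 7.69), ("PF07238", 4.81), ("PF00903", 1.92), ("PF07920", 0.96)]
for pfam_id, pct in rows:
    print(pfam_id, scaffold_count(pct))
```

Under that assumption, 7.69 % corresponds to 8 of 104 scaffolds and 0.96 % to a single scaffold, so most neighboring domains in the table were observed only once.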

Neighboring Clusters of Orthologous Genes (COGs)

| COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds |
| --- | --- | --- | --- |
| COG2986 | Histidine ammonia-lyase | Amino acid transport and metabolism [E] | 1.92 |
| COG0014 | Gamma-glutamyl phosphate reductase | Amino acid transport and metabolism [E] | 0.96 |
| COG0183 | Acetyl-CoA acetyltransferase | Lipid transport and metabolism [I] | 0.96 |
| COG0288 | Carbonic anhydrase | Inorganic ion transport and metabolism [P] | 0.96 |
| COG1012 | Acyl-CoA reductase or other NAD-dependent aldehyde dehydrogenase | Lipid transport and metabolism [I] | 0.96 |
| COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 0.96 |
| COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 0.96 |
| COG2975 | Fe-S-cluster formation regulator IscX/YfhJ | Posttranslational modification, protein turnover, chaperones [O] | 0.96 |
| COG3358 | Uncharacterized conserved protein, DUF1684 family | Function unknown [S] | 0.96 |
| COG3590 | Predicted metalloendopeptidase | Posttranslational modification, protein turnover, chaperones [O] | 0.96 |
| COG4230 | Delta 1-pyrroline-5-carboxylate dehydrogenase | Amino acid transport and metabolism [E] | 0.96 |
| COG4451 | Ribulose bisphosphate carboxylase small subunit | Carbohydrate transport and metabolism [G] | 0.96 |



Phylogeny

NCBI Taxonomy

| Name | Rank | Taxonomy | Distribution |
| --- | --- | --- | --- |
| Unclassified | root | N/A | 87.50 % |
| All Organisms | root | All Organisms | 12.50 % |

Associated Scaffolds


| Scaffold | Taxonomy | Length |
| --- | --- | --- |
| 3300005439\|Ga0070711_100806473 | Not Available | 796 |
| 3300006162\|Ga0075030_100282211 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1329 |
| 3300007076\|Ga0075435_101617275 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 568 |
| 3300009038\|Ga0099829_10658786 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 870 |
| 3300017822\|Ga0187802_10011337 | All Organisms → cellular organisms → Bacteria | 2890 |
| 3300020579\|Ga0210407_10331726 | Not Available | 1189 |
| 3300020580\|Ga0210403_10004233 | All Organisms → cellular organisms → Bacteria | 12398 |
| 3300020580\|Ga0210403_10010311 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 7621 |
| 3300020581\|Ga0210399_10013389 | Not Available | 6447 |
| 3300021180\|Ga0210396_10110774 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2480 |
| 3300021479\|Ga0210410_10001067 | All Organisms → cellular organisms → Bacteria | 26632 |
| 3300021559\|Ga0210409_10220494 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1723 |
| 3300026374\|Ga0257146_1006329 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1935 |
| 3300026551\|Ga0209648_10073270 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2876 |
| 3300026551\|Ga0209648_10205677 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_20CM_4_61_6 | 1497 |
| 3300027862\|Ga0209701_10582187 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 596 |




Environmental Properties

Associated Habitat Types

| Habitat | Taxonomy | Distribution |
| --- | --- | --- |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 23.08% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 21.15% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.54% |
| Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 10.58% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.65% |
| Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 6.73% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.85% |
| Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.92% |
| Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 1.92% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.92% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.92% |
| Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.96% |
| Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.96% |
| Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.96% |
| Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.96% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.96% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.96% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.96% |



Associated Samples

| Taxon OID | Sample Name | Habitat Type |
| --- | --- | --- |
| 3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental |
| 3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental |
| 3300004631 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome) | Environmental |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental |
| 3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental |
| 3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental |
| 3300006162 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 | Environmental |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated |
| 3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated |
| 3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental |
| 3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental |
| 3300010379 | Sb_50d combined assembly | Environmental |
| 3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental |
| 3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental |
| 3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental |
| 3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental |
| 3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental |
| 3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental |
| 3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental |
| 3300014153 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_60_metaG | Environmental |
| 3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental |
| 3300017822 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2 | Environmental |
| 3300017823 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3 | Environmental |
| 3300017932 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW-S_4 | Environmental |
| 3300017933 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1 | Environmental |
| 3300017934 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3 | Environmental |
| 3300017942 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_3 | Environmental |
| 3300017943 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4 | Environmental |
| 3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental |
| 3300017959 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MG | Environmental |
| 3300017961 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MG | Environmental |
| 3300017972 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP02_20_MG | Environmental |
| 3300017973 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_20_MG | Environmental |
| 3300018001 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW-S_5 | Environmental |
| 3300018006 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4 | Environmental |
| 3300018012 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5 | Environmental |
| 3300018062 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP15_20_MG | Environmental |
| 3300018085 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_20_MG | Environmental |
| 3300018088 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_10_MG | Environmental |
| 3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental |
| 3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental |
| 3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental |
| 3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental |
| 3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental |
| 3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental |
| 3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental |
| 3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental |
| 3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental |
| 3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental |
| 3300026374 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-A | Environmental |
| 3300026377 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-B | Environmental |
| 3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental |
| 3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental |
| 3300027854 | Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes) | Environmental |
| 3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental |
| 3300028146 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK23 | Environmental |
| 3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental |
| 3300031226 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_S | Environmental |
| 3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental |
| 3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental |
| 3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental |
| 3300031771 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19 | Environmental |
| 3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental |





Family Sequences

| Protein ID | Sample Taxon ID | Habitat | Sequence |
| --- | --- | --- | --- |
| JGI25617J43924_101677592 | 3300002914 | Grasslands Soil | MSATVVDEKREREALKAKRNLLFARYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSRNCL* |
| Ga0062389_1015947122 | 3300004092 | Bog Forest Soil | VLETAIDEKKERRELKTKRSQLFERFLKNPLDIRLALEIKIIDDQVAEWVNQKEAQREVWR* |
| Ga0058899_118496702 | 3300004631 | Forest Soil | MPETGINEKRKREELKARRNLLFRIYLKNPLETRLALEIKIIDDQVAASMAR* |
| Ga0058899_118646322 | 3300004631 | Forest Soil | MPETAIDAKRELEEWKTRRNLLLRIYLKNPLETRLALEIKVIDDPVAASVAR* |
| Ga0066388_1002800404 | 3300005332 | Tropical Forest Soil | MIENEIDAKWKREGLRTARKLLFKRFLKNPLDTRLALKIKTVDDQIAECDGQIEQKRKGRN* |
| Ga0070711_1008064732 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MHQTPIDAVRKREQLKAKRNLLFRRFLQNPLETHLALEIKIIDDQVAECTANERKRGPKHKT* |
| Ga0070732_100237262 | 3300005542 | Surface Soil | MSVVITGEKKEREALKARRNLLFQRYLKAPQDTRLALEIKLIDDQVAKFTEQIDRKRASNN* |
| Ga0075030_1002822114 | 3300006162 | Watersheds | MSATVVDEKREREALKAKRNLLFARYLKAPLDTRLALEIKIIDDQVAESTKRLERKRGSRN* |
| Ga0075425_1011916752 | 3300006854 | Populus Rhizosphere | MQETPIDAMRKREELRAKKTLLFRRFLQNPLETRLALEIKAIEPSRSSPEAL* |
| Ga0075435_1016172751 | 3300007076 | Populus Rhizosphere | VLETAIEERKKREELKAKRNLLFDRYLKHPMDTRLALEIKIIDDQLAEERKLPSKRSG |
| Ga0099829_106587861 | 3300009038 | Vadose Zone Soil | MSAAVIDEKREREALKAKRNVLFERYLKAPLDTRLALEIKIIDDQVAEYTKQMERKR |
| Ga0099830_104009192 | 3300009088 | Vadose Zone Soil | MSATVVDEKREREALKAKRNLLFERYLKAPLDTRLALEIKIIDDQVAEYNKQMERKRESRN* |
| Ga0099830_116525081 | 3300009088 | Vadose Zone Soil | MSAAVIDEKREREALKAKRNVLFERYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRSSGNCF* |
| Ga0136449_1034487361 | 3300010379 | Peatlands Soil | VLENAIDVLRKREELKAQRSLLFARFLENPLETQLALKIKIIDDQVAECCEKMRQTREKRY* |
| Ga0137392_101578143 | 3300011269 | Vadose Zone Soil | MSAAVIDEKREREALKAKRNLLFARYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSRNCL* |
| Ga0137391_102645712 | 3300011270 | Vadose Zone Soil | MSAAVIDEKREREALKAKRNVLFERYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRSSGNCL* |
| Ga0137391_103279851 | 3300011270 | Vadose Zone Soil | MSATVVDEKREREALKAKRNLLFERYLKAPLDTRLALEIKIIDDQVAKYTKQMERKRSSGNCL* |
| Ga0137389_108864312 | 3300012096 | Vadose Zone Soil | VHSMSATVVDEKREREALKAKRNLLFARYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSRNCL* |
| Ga0137388_108839341 | 3300012189 | Vadose Zone Soil | AYTSYEVHSMSATVVDEKREREALKAKRNLLFARYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSRNCL* |
| Ga0137363_109617971 | 3300012202 | Vadose Zone Soil | MQATPFNAMRKREELKAQRNLLFKRFSRNPLDTSLALKIKIIDDQIAASDEQMERE |
| Ga0137390_107561722 | 3300012363 | Vadose Zone Soil | MSATVVDEKREREALKAKRNLLFERYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSRNCL* |
| Ga0153915_101120052 | 3300012931 | Freshwater Wetlands | VLETAIDEKGKRDKLKAKRNLLFERFLKNPLDTRLALEIKIIDDQVAECTDQMERKRGRRN* |
| Ga0181527_10348721 | 3300014153 | Bog | MLENEIDARRKREELKAKRNLIFARFLKNPLHTRLALEIKIIDDQIAECTDQMQQKRKGRN* |
| Ga0137412_104002093 | 3300015242 | Vadose Zone Soil | MQATPFNAMRKREELKAQRNLLFKRFSRNPLDTSLALKIKIIDDQIAASDEQMERER |
| Ga0187802_100113373 | 3300017822 | Freshwater Sediment | MQETAINAMRKREELKAQRNRLFKRFLRNPLDTHLALKIKVIDDQVAECTEQME |
| Ga0187802_100146033 | 3300017822 | Freshwater Sediment | MQETPIDAKRKREELKVVRNLLFKRFLKNPLDTHLALKIKTIDYQVAECAEQMALMTESR |
| Ga0187802_101522552 | 3300017822 | Freshwater Sediment | MLESEIDARRKREELKAKRNLLFKRFLKNPLDTRLALKIKTIDDQVAECTEQMERKRGRR |
| Ga0187818_100004579 | 3300017823 | Freshwater Sediment | MLANEVDARRKREELKAKRNLLFKWFLKNPLDTRLALKIKIIDDQVAECTEQMERKRGRR |
| Ga0187818_100032844 | 3300017823 | Freshwater Sediment | MPENAIDARKKREELKAKRNLLFRRFLKNPQDIHLALRIKRIDDRLAECAEKMEQKRAGR |
| Ga0187818_100393393 | 3300017823 | Freshwater Sediment | MLESEIDARRKREELKAKRNLLFKRFLKNPLDTRLALKIKTIDDQVAECTEQMERKRGKR |
| Ga0187814_100757301 | 3300017932 | Freshwater Sediment | MQETAINAMRKREELKAQRNLLFKRFLRSPLDTHLALKIKVIDDQVAECTEQMERKRGKR |
| Ga0187801_100736111 | 3300017933 | Freshwater Sediment | VLENVIDVLKKREELKAKRNLLFARFLNNPLETQLALKIKIIDDQVAECCEQMRQRRKER |
| Ga0187801_100889721 | 3300017933 | Freshwater Sediment | MLEEEIDARRKREELKAKRNLLFQQFLRNPRDTRLALKIKTIDDQVAKCTAQMERKRETR |
| Ga0187801_101812012 | 3300017933 | Freshwater Sediment | REILKAKRYLLFARFLKNPSDTRLALEIKIIDDQVAECVKQMQQQGEKRNRVQRLFFQI |
| Ga0187801_102899421 | 3300017933 | Freshwater Sediment | MPKNEMDARKKCEKLKATRNILFRHFLKNPLDTRLALKIKTIDDQIAECTEQMKQRREER |
| Ga0187803_101130911 | 3300017934 | Freshwater Sediment | MLENAIDVRRKREELKSKRNLLFQQFLKNPLDTRLALRIKTIDDQVAECTEQMARMKERR |
| Ga0187808_105079941 | 3300017942 | Freshwater Sediment | MLENEIDARRKREELKAKRNLLFKRFLKNPLDTRLALKIKTIDDQVAECTEQMERKRGRR |
| Ga0187819_1000246910 | 3300017943 | Freshwater Sediment | MLANEVDARRKREELKAKRNLLFKRFLKNPLDTRLALKIKIIDDQVAECTEQMERKRGRR |
| Ga0187819_102337221 | 3300017943 | Freshwater Sediment | MLEDAIDARKKREELKAKRNLLFQRFLKNPLDTRLALKIKIIDDQIAECTVQMEQKRAGR |
| Ga0187819_104705272 | 3300017943 | Freshwater Sediment | MLENEIDARRKREELKAKRNLLFKRFLKNPLDTRLALKIKTIDDQVAECTEQME |
| Ga0187817_104027231 | 3300017955 | Freshwater Sediment | MQETAIDAKKKRERLKAKRDLLFARFLKNPSDTRLALEIKIIDDQVAECVKQMQQQGEKRNRVQRLFFQI |
| Ga0187817_104325911 | 3300017955 | Freshwater Sediment | VLENVIDVLKKREELKAKRNLLFARFLNNPLETQLALKIKIIDDQVAECCVRQMRKERD |
| Ga0187779_100025387 | 3300017959 | Tropical Peatland | MLENVIDAKKKREELKSKRNLLFQRFLKNPHNIRLALRIKRIDDRIAEFTEQMEQKRAGR |
| Ga0187778_100736571 | 3300017961 | Tropical Peatland | MLENAIDVRRKREELKAKRNLLFQQFLKNPLDTRLALKIKTIDDQVAECAEQMARMKERR |
| Ga0187778_106026681 | 3300017961 | Tropical Peatland | KREELKSKRNLLFQRFLKNPHNIRLALRIKRIDDRIAEFTEQMEQKRAGRY |
| Ga0187781_100070932 | 3300017972 | Tropical Peatland | MLLELSRMLENVIDARKKREELKAKRNLLFQRFLKNPHNIRMALRINRIDDRIAEFTEQMEQKRAGGS |
| Ga0187780_100083643 | 3300017973 | Tropical Peatland | MLLELSRMLENVIDAKKKREELKSKRNLLFQRFLKNPHNIRLALRIKRIDDRIAEFTEQMEQKRAGRY |
| Ga0187780_112705801 | 3300017973 | Tropical Peatland | MLENVIDARKKREELKAKRNLLFQRFLKNPHNIRLALRIKRIDDRIAEFTEQMEQKRAGR |
| Ga0187815_103368091 | 3300018001 | Freshwater Sediment | MLESEIDARRKREELKAKRNLLFRRFLKNPQDIHLALRIKRIDDRLAECAEKMEQKRAGR |
| Ga0187804_103994222 | 3300018006 | Freshwater Sediment | MLENAIDVRRKREELKAKRNLLFQQFLKNPLDTRLALRIKTIDDQVAECTEQMARMKERR |
| Ga0187810_100818622 | 3300018012 | Freshwater Sediment | MLEEEIDARRKREELKAKRNLLFQQFLRNPRDIRLALKIKTIDDQVAKCTAQMERKRETR |
| Ga0187810_101057342 | 3300018012 | Freshwater Sediment | MLENEIDALRKREELKAKRNLLFQRFLKNPLDTRLALKIKIIDDQIAECTVQMEQKRAGR |
| Ga0187784_109360322 | 3300018062 | Tropical Peatland | MLENVIDARKKREELKAKRNLLFQRFLKNPHNIRMALRINRIDDRIAEFTEQMEQKRAGR |
| Ga0187772_100078023 | 3300018085 | Tropical Peatland | MLLELSRMLENAIDARKKREELKAKRNLLFQRFLKNPHNIRMALRINRIDDRIAEFTEQMEQKRAGRN |
| Ga0187771_100988702 | 3300018088 | Tropical Peatland | MLENVIDVRRKREELKAKRNLLFERFLKNPLDTRLALKIKTIDDQVAEYTEQMARMKDRR |
| Ga0187771_102621532 | 3300018088 | Tropical Peatland | MLEEEIDARRKREELKAKRNLLFQQFLRNPRDTRLALKIKTIDDQVAKCTEQMERKRETR |
| Ga0187771_112548001 | 3300018088 | Tropical Peatland | SPFFHAFVKELSRMLQNAIDVSRRREELKAKRNLLFQRFLKNSLDTRLALKIKTIDDPVAECTEQMERMKERRK |
| Ga0210407_100067178 | 3300020579 | Soil | MQETSIDAMRKREKLKAKRNLLFQRFLKNPLETHLALEIKIIDDQVAECSELVGRKHGPDHKT |
| Ga0210407_102599772 | 3300020579 | Soil | MQETSINAMRKREELNATRNLLFKRFLKNPMDTHLALEIKSIDDQVAECMERMRQNRKRR |
| Ga0210407_103317262 | 3300020579 | Soil | MQETPIDAMRKREELKRTRNRLFKRFRKHPMDTYLALKIKTIDDQVAECSEQMRQEKKQRDTTLVP |
| Ga0210407_103444121 | 3300020579 | Soil | LLENAIDVLRKREELKAKRNLLFARFLKNPLDTQLALKIKIIDDQVAEC |
| Ga0210403_1000342612 | 3300020580 | Soil | MQETPIDTMRKREELKRTRNLLFKQFLKNPVDTRLALKIKSIDDQVAECSERMRQNRKIR |
| Ga0210403_100042338 | 3300020580 | Soil | MLENETNAKWNREGLRAARKLLFRRFLKNPLDTRMALKIKTLDDQIAECDGQMEQKKKGR |
| Ga0210403_1001031111 | 3300020580 | Soil | MQETPIDAMRKREELKGTRNLLFKRFLKNPMDTRLALKIKSIDDQVAECSERMRQNRKRR |
| Ga0210403_105044741 | 3300020580 | Soil | MLENASDARKKCKELKAKRNLLFARFLKNPRDTHLAVKIKIIDDQIVEYTEKMERKKERR |
| Ga0210403_105409272 | 3300020580 | Soil | VLETAIDAKTKREELKAKRNLLFARFLKNPLDTHLALEIKIIDDLVAEWTEQMEPDREGL |
| Ga0210399_100133892 | 3300020581 | Soil | MLENEINAKWKREGLRAARKLLFRRFLKNPLDTRMALEIKTLDDQIAECDGQMEQKKKGR |
| Ga0210399_103916162 | 3300020581 | Soil | KPFAHAFRKELTRVLETAIDMKRKREELKAKRNPLFARFLKDPLDTRLALEIKIIDDLVAESAEQLQEERAKRD |
| Ga0210399_107256761 | 3300020581 | Soil | VLETAIDVKTKREELKAKRNLLFARFLKNPLDTHLALEIKIIDDLVAEWTEQMEPDREGL |
| Ga0210406_113541092 | 3300021168 | Soil | MQETPIDAMRKREELKRTRNRLFKRFRKHPMDTYLALKIKTIDDQVAECSEQMRQE |
| Ga0210400_101977512 | 3300021170 | Soil | MQETPLDAMRKREELKRTRNRLFKRFRKHPMDTYLALKIKTIDDQVAECSEQMRQEKKQRDTTLVP |
| Ga0210405_104068661 | 3300021171 | Soil | MLENEINAKWKREGLRAARKLLFRRFLKNPLDTRMALTIKTLDDQIAECDGQMEQQKKGR |
| Ga0210396_101107742 | 3300021180 | Soil | MQETPIDAMRKREELKRTRNRLFKRFRKHPMDTYLALIIKTIDDQVAECSEQMRQENKQRDTTLVP |
| Ga0210387_114509311 | 3300021405 | Soil | FAHAFRKKPTRVLETAIDMKRKREELKAKRNPLFARFLKDPLDTRLALEIKIIDDLVAESAEQLQEERAKRD |
| Ga0210410_1000106720 | 3300021479 | Soil | MQETPIDAMRKCEELKRTRNRLFKRFLKHPMDTYLALIIKTIDDQVVECSEQMRQENKQRDTTLVP |
| Ga0210409_102204941 | 3300021559 | Soil | MQETPFNPVRKREELKAQRKLLFRRFLRNPQDTRLALKIKIIDDQIAACTEQIE |
| Ga0210409_111502432 | 3300021559 | Soil | MQETPFNPMRKREELKAQRKLLFRRFLRNPQDTRLALKIKIIDDQIAACTEQIE |
| Ga0257146_10063295 | 3300026374 | Soil | MQETPINAMTKREELKATRNLLFKQFLKNPMDTHLALKIKSIDDQVAECMERMRQNRKRR |
| Ga0257171_10114692 | 3300026377 | Soil | MQETPINAMRKREELNATRNLLFKRFLKNPMDTHLALKIKSIDDQVAECMERMRQNRKRR |
| Ga0209648_100732704 | 3300026551 | Grasslands Soil | MSATVVDEKREREALKAKRNLLFARYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSRNCL |
| Ga0209648_101583253 | 3300026551 | Grasslands Soil | VLETAIDVKRKREELKAKRNLIFARFLKDPLDTHLALEIKIIDDLVAEWTEQMEQKREAR |
| Ga0209648_102056773 | 3300026551 | Grasslands Soil | MSAAVIDEKREREALKAKRNVLFERYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSR |
| Ga0209580_100976782 | 3300027842 | Surface Soil | MSVVITGEKKEREALKARRNLLFQRYLKAPQDTRLALEIKLIDDQVAKFTEQIDRKRASN |
| Ga0209517_104853201 | 3300027854 | Peatlands Soil | NGIDAMNKREELKAKRNLLFQRFLKNPQDIRLALKIKMIDDQIAECTEQVEQKRAGRN |
| Ga0209701_105821872 | 3300027862 | Vadose Zone Soil | MSAAVIDEKREREALKAKRNVLFERYLKAPLDTRLALEIKIIDDQVAEYTKQME |
| Ga0247682_11114201 | 3300028146 | Soil | MLENEINAGWKREGLRAARKLLFRRFLKNPLDTRMALTIKTLDDQIAECDGQMEQKKKGGTKV |
| Ga0170834_1072614231 | 3300031057 | Forest Soil | VLETAIDMKRKREELKAKRNPLFARFLKDPLDTRLALEIKIIDDLVAESAEQMQEERAKR |
| Ga0307497_103825252 | 3300031226 | Soil | VLENVIDVLKKREELKAKRNLLFARFLNNPLETQLAVKIKIIDDQVAECW |
| Ga0170824_1080682682 | 3300031231 | Forest Soil | VLRKREELKAKRNLLFARFLKNPLDTQLALTIKLIDDQVAECSEKVQQMREKRY |
| Ga0170824_1103533092 | 3300031231 | Forest Soil | VLENLIDVLKKREELKAKRNLLFARFLNNPLETQLALRIKIIDDQVAECCEQMRQMREER |
| Ga0170824_1256824301 | 3300031231 | Forest Soil | MQETPIDAMTKREELKGTRNLLFKRFLKNPMDTRLALKIKSIDDQVAECTERMRQNRKRR |
| Ga0170820_145074322 | 3300031446 | Forest Soil | VLENLIDVLKKREELKAKRNLLFARFLNNPLETQLALKIKIIDDQVAECCEQMRQMREER |
| Ga0170820_149271412 | 3300031446 | Forest Soil | VLETAIDMKRKREELKAKRNPLFARFLKDPLDTRLALEIKIIDDLVAESA |
| Ga0170818_1096547222 | 3300031474 | Forest Soil | VLETAIDMKRKREELKAKRNPLFARFLKDPLDTRLALEIKIIDDQVAECVKQKQQQREKR |
| Ga0307469_102843753 | 3300031720 | Hardwood Forest Soil | VLRKREELKAKRNLLFARFLKNPLDTQLALKIKLIDDQVAECSEKVQQMREKWY |
| Ga0318546_112420951 | 3300031771 | Soil | MQETPIDSTRKREELKAKRNLLFRRFLQNPLETHLALEIKIIDDQVAEC |
| Ga0307470_101186073 | 3300032174 | Hardwood Forest Soil | MQETPIDAMRKRAELKRTRNRLFKRFLKHPMDTYLALIIKTIDDQVAECSEQMRQANKQRDWSEPLE |
| Ga0307471_1005712821 | 3300032180 | Hardwood Forest Soil | MHGTPIDAMRKREELKAKRNLLFRRFLQNPLETHLALEIKIIDDQVAECAELMKRKHGPKHKT |
| Ga0307471_1006851611 | 3300032180 | Hardwood Forest Soil | MQETPIDAMRKREELKRTRNRLFKRFRKHPMDTYLALIIKTIDDQVAECSEQMRQANKQR |
| Ga0307471_1007991611 | 3300032180 | Hardwood Forest Soil | MLENEINAKWKREGLRAARKLLFRRFLKNPLDIGMALKIKTLDDQIAECDGQMEQKKKGR |
| Ga0307471_1010058122 | 3300032180 | Hardwood Forest Soil | EFTRVRENAIDVLRKREELKAKRNLLFARFLKNPLDTQLALKIKIIDDQVAECSEQMQQMREKRY |
| Ga0307472_1000759273 | 3300032205 | Hardwood Forest Soil | VLENAIDVLRKREELKAKRNLLFARFLKNPLDTQLALKIKIIDDQVAECSEQMQQMREKR |
| Ga0307472_1006445041 | 3300032205 | Hardwood Forest Soil | VLENVIDVLKKREELKAKRNLLFARFLNNPLETQLALKIKIIDDQVAECCEQMRQDERRTRLVRRSFFQV |
| Ga0307472_1010358561 | 3300032205 | Hardwood Forest Soil | MIENEIDAKWKREGLRAARKLLFKRFLKNPLDSRLALKIKTMDDQIAEYDGQMEQKREGR |
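
For downstream tools, a row of the sequence table can be written out as a FASTA record. The sketch below is illustrative only: `to_fasta` is a hypothetical helper, the header layout is an arbitrary choice, and it assumes the four fields of a row (protein ID, 10-digit sample taxon ID, habitat, sequence) have already been separated and that a trailing `*` marks a translated stop codon rather than a residue.

```python
def to_fasta(protein_id: str, taxon_oid: str, habitat: str,
             sequence: str, width: int = 60) -> str:
    """Format one family member as a FASTA record.

    Assumption: a trailing '*' denotes a stop codon and is dropped.
    """
    residues = sequence.rstrip("*")
    header = f">{protein_id} taxon_oid={taxon_oid} habitat={habitat}"
    # Wrap the sequence at a fixed width, as is conventional for FASTA.
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return "\n".join([header, *lines])


# First row of the table, with its fields separated by hand.
record = to_fasta(
    "JGI25617J43924_101677592",
    "3300002914",
    "Grasslands Soil",
    "MSATVVDEKREREALKAKRNLLFARYLKAPLDTRLALEIKIIDDQVAEYTKQMERKRGSRNCL*",
)
print(record)
```

Dropping the `*` keeps the record compatible with tools that reject non-residue characters; if the stop marker matters for an analysis, the `rstrip` call can simply be removed.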




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice