NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F104524


Overview

Basic Information
Family ID F104524
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 51 residues
Representative Sequence MPRESLDALENLPKEAPGQVAFGQLEDEGPRMPDEAPAGLEQPLLQARE
Number of Associated Samples 82
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 1.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.15


Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (26.000 % of family members)
Environment Ontology (ENVO) Unclassified (31.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (37.000 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 0.00%, Coil/Unstructured: 100.00%

Predicted 3D Structure

pTM-score: 0.15

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
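
The cutoff in the note above can be sketched as a simple check. This is only an illustration, assuming the rule is a strict comparison against 0.7; the constant and function names are hypothetical and are not part of NMPFamsDB.

```python
# Hypothetical helper: flags a model as low confidence using the
# pTM < 0.7 cutoff quoted in the note above (assumed strict "<").
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the pTM-score falls below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F104524 on this page
print(is_low_confidence(0.15))
```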



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF08241 | Methyltransf_11 | 3.00
PF00535 | Glycos_transf_2 | 2.00
PF13701 | DDE_Tnp_1_4 | 2.00
PF03994 | DUF350 | 2.00
PF07883 | Cupin_2 | 2.00
PF07690 | MFS_1 | 2.00
PF01527 | HTH_Tnp_1 | 2.00
PF13384 | HTH_23 | 1.00
PF10518 | TAT_signal | 1.00
PF14499 | DUF4437 | 1.00
PF12836 | HHH_3 | 1.00
PF13683 | rve_3 | 1.00
PF13649 | Methyltransf_25 | 1.00
PF04453 | LptD | 1.00
PF01553 | Acyltransferase | 1.00
PF02738 | MoCoBD_1 | 1.00
PF13676 | TIR_2 | 1.00
PF00291 | PALP | 1.00
PF11583 | AurF | 1.00
PF13188 | PAS_8 | 1.00
PF01047 | MarR | 1.00
PF02776 | TPP_enzyme_N | 1.00
PF01381 | HTH_3 | 1.00
PF13551 | HTH_29 | 1.00
PF01590 | GAF | 1.00
PF00589 | Phage_integrase | 1.00
PF01229 | Glyco_hydro_39 | 1.00
PF00892 | EamA | 1.00
PF09721 | Exosortase_EpsH | 1.00
PF00775 | Dioxygenase_C | 1.00
PF00873 | ACR_tran | 1.00
PF13561 | adh_short_C2 | 1.00
PF05368 | NmrA | 1.00
PF00196 | GerE | 1.00
PF03320 | FBPase_glpX | 1.00
PF00072 | Response_reg | 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG1452 | LPS assembly outer membrane protein LptD (organic solvent tolerance protein OstA) | Cell wall/membrane/envelope biogenesis [M] | 1.00
COG1494 | Fructose-1,6-bisphosphatase/sedoheptulose 1,7-bisphosphatase or related protein | Carbohydrate transport and metabolism [G] | 1.00
COG3485 | Protocatechuate 3,4-dioxygenase beta subunit | Secondary metabolites biosynthesis, transport and catabolism [Q] | 1.00
COG3664 | Beta-xylosidase | Carbohydrate transport and metabolism [G] | 1.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.00 %
All Organisms | root | All Organisms | 1.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300011270|Ga0137391_10410330 | All Organisms → cellular organisms → Bacteria | 1156




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 9.00%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 7.00%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 6.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.00%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 4.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.00%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 3.00%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.00%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.00%
Deep Subsurface Sediment | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment | 2.00%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.00%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 1.00%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.00%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.00%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.00%
Beach Aquifer Porewater | Environmental → Aquatic → Unclassified → Unclassified → Unclassified → Beach Aquifer Porewater | 1.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.00%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Soil | 1.00%
Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland | 1.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.00%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 1.00%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.00%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.00%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.00%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2140918007 | Permafrost microbial communities from permafrost in Bonanza Creek, Alaska - Active_all | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300004020 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009538 | Microbial community of beach aquifer porewater from Cape Shores, Lewes, Delaware, USA - H-2W | Environmental
3300009549 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_20_100 | Environmental
3300009634 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_13_150 | Environmental
3300009802 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_50_60 | Environmental
3300009807 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10 | Environmental
3300009810 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 | Environmental
3300009837 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012396 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300017975 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP15_20_MG | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018090 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_20_MG | Environmental
3300021051 | Subsurface sediment microbial communities from Mancos shale, Colorado, United States - Mancos A1 | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300025149 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 (SPAdes) | Environmental
3300025157 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 (SPAdes) | Environmental
3300025319 | Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1 | Environmental
3300025326 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes) | Environmental
3300025477 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_13_150 (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental
3300027187 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60 (SPAdes) | Environmental
3300027379 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 (SPAdes) | Environmental
3300031549 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f24 | Environmental
3300031561 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031763 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f29 | Environmental
3300031768 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22 | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300032054 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f23 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032516 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_0 | Environmental
3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental
3300033977 | Tropical peat soil microbial communities from peatlands in Loreto, Peru - SJ75 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
A_all_C_01314260 | 2140918007 | Soil | MPRAALDAPEDLPTQALRHVAFGQLEDQVPRMPDETPDGLEEPLLAARQGPALDG
JGI1027J12803_1018896333 | 3300000955 | Soil | MPRQPLDVGKDLAKERSSQVTFGELQGEVPGVPDQPPAGLEEPLLQA
JGI25385J37094_101711731 | 3300002558 | Grasslands Soil | MPGESLDALENLPKEAPRQGAFGELQGEVPGMSDEPRAGLEEPLLEA
Ga0055440_101421071 | 3300004020 | Natural And Restored Wetlands | MPRESLDAPENLPKEALRQAAVGRAGGEVARIPDETPAASPIGWCPP
Ga0066685_100574866 | 3300005180 | Soil | VPREPLDALENLPKEAPRQVAFGELQGKVPGMPDEPRAGLEPLLEAR*
Ga0070708_1013907912 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VPPEPLDALENLPEEAPRQVAFGELQGGVPRMPDEPRTGLEEPLLET
Ga0070706_10000528117 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VPREPLDALENLSKEAPRQVAFGELQGEVPRMPDQPPARLEQALLQARQ*
Ga0070706_1002680702 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VPRESLDAPEDLPKPPPRQLAFGELEREVPRVPNQPPAECEEALPFGPASQ*
Ga0070699_1005346063 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MPGESLDAPDDLPKQALCQVALGQLEHEVPGMSDQPPAGLEEPLLEARQGPAL
Ga0070696_1000236493 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MPRQPLDALENLPKERPSQVAFGELQGEGPGVPDQPRAGLEQPLLETRERPAAAC*
Ga0066692_106830721 | 3300005555 | Soil | MPRESLDAPDDLPKQALCQVALGQLEHEVPGMSDQPPAGLEEPLLE
Ga0066698_100374721 | 3300005558 | Soil | MARESLDASENLPKESPRQVAFRKPQDEVPGMSDEAPAGLE*
Ga0066698_100680164 | 3300005558 | Soil | VPCESLDAPENLPKEGPRQVALGQLQDEVPGVSNQTPTGLEQPLLQARQGPA
Ga0066698_101183322 | 3300005558 | Soil | MPRESPDAPEDLPKEAPGQVTFGQLEDEVPSMPDEPPAGLEEPLLETRQRPALDGERQDQSA*
Ga0068861_1002810462 | 3300005719 | Switchgrass Rhizosphere | MPRQPLDALENLPKERPSQVAFGELQGEVSGVPDQPRAGLEQPLLETRERPAAAC*
Ga0066903_1059881913 | 3300005764 | Tropical Forest Soil | MPRQPLDAHEDLANEGPCQVTAGELQGEVSGVPDQPRAGL*
Ga0099794_100018704 | 3300007265 | Vadose Zone Soil | MPRESLDALENLPKEAPGQVAFGQLEDEGPRMPDEAPAGLEQPLLQARE*
Ga0066710_10004816110 | 3300009012 | Grasslands Soil | VPRESRDTPENLPKEAPRQVALGQLEYEVPRMPDQPPAGLKEPLLE
Ga0066710_1003699661 | 3300009012 | Grasslands Soil | PRESPDAPEDLPKEAPGQVTFGQLEDEVPSMPDEPPAGLEEPLLETRQRPALDGERQDQS
Ga0099829_114231681 | 3300009038 | Vadose Zone Soil | VPREPLDAPEDLPKQALRQVAFAQLQDEVPHMSDEAPAVLEQALLQSRQRPIPG*
Ga0099830_104868921 | 3300009088 | Vadose Zone Soil | VPRESLDAPNDLTDEAPCQGAFSQLKDEVPRMPDQAPAGLEEPVLQ
Ga0099828_101245203 | 3300009089 | Vadose Zone Soil | MPSESLDAPENLPKERWRQVTFGQLQDEVPGMPNEAPAGLEQALLQARPRP*
Ga0099828_107257753 | 3300009089 | Vadose Zone Soil | MPRESLDAPEDLPKQVSRQVALGQLEDEVPRMPDEAPAGLEEPLLEARQGP
Ga0099828_118031921 | 3300009089 | Vadose Zone Soil | MPRESLDALENLPKEAPGLVAFGQLEDEVPSVPDEAPAGLEEPLLEARQRPALGWQTA
Ga0099827_101788151 | 3300009090 | Vadose Zone Soil | MPRESLDAPENLPKEAPRQVAVGQLEHEVPCMPDQAPAGLEI*
Ga0099827_106662702 | 3300009090 | Vadose Zone Soil | MPRESLDAPENLPKEATGQVAFGELRGEVSGMPNEAYARPEQPPLETRQG
Ga0105241_106180911 | 3300009174 | Corn Rhizosphere | PLDALENLPKERPSQVAFGELQGEGPGVPDQPRAGLEQPLLETRERPAAAC*
Ga0114945_103784102 | 3300009444 | Thermal Springs | MPRESLDAREYLPKLAPRQAALDKLEEEVSCMPDEAPASLAEPLL*
Ga0114945_105568151 | 3300009444 | Thermal Springs | MPRESLDAPEDLPKQALRQVAPGQLEPEVPGMSHETPRGLEEPLL*
Ga0129287_100181642 | 3300009538 | Beach Aquifer Porewater | VPRESLDAPEDLPKEAPRQVALGPLEAEVPRMSDEPPAGLEEPLPDHHDQRR*
Ga0116137_10124581 | 3300009549 | Peatland | MPRESLDAPEALPNEAPRQVAFSKLENVMPRMPDQPPASLEEPLLEARQGAVLDGEGEGEPAQEIAEVVGD
Ga0116124_10353953 | 3300009634 | Peatland | MPRESLDAPEALPNEAPRQVAFSKLENVMPRMPDQPPASLEEPLLEARQGAVLDGEGEG
Ga0105073_10468101 | 3300009802 | Groundwater Sand | VPRESLDAPEDLPKQALCQVALGQLEDEVSRMPDETPAGLEQPLLEARQRPTLNGEGQNKSASLGS*
Ga0105061_10332831 | 3300009807 | Groundwater Sand | VPRESLDAPEDLPKQAPRQVALGKLQDEVPSIPDEAPACLEESLLEARQ*
Ga0105088_10426311 | 3300009810 | Groundwater Sand | PPLVRPRSSCPRFVCGPRSGGEPRESLDAPEDLPKQAPRQVALGKLQDEVPSIPDEAPACLEESLLEARQ*
Ga0105058_10256953 | 3300009837 | Groundwater Sand | VCGPRSGGEPRESLDAPEDLPKQAPRQVALGKLQDEVPSIPDEAPACLEESLLEARQ*
Ga0126380_112218611 | 3300010043 | Tropical Forest Soil | MPCEPFDAAENLPKEGSRQVALGRLQGEVPDMPDQPTAGLEEPLLKARERPVVDDNG*
Ga0126382_106325402 | 3300010047 | Tropical Forest Soil | VPRESLDAPKNLPKEGSRQVALGELQDEVPGMPDQPPAGLEEPLLQFASSVS*
Ga0134071_104365542 | 3300010336 | Grasslands Soil | MPRESPDAPEDLPKEAPGQVTFGQLKDEVPSMPDEPPAGLEESLLETRQRPALDGERQD
Ga0134071_105274662 | 3300010336 | Grasslands Soil | MPCEPLDALKNLTKEVPRQVAFGKLQGEVPGMPDEASARPEQPLLEARE
Ga0126379_118180371 | 3300010366 | Tropical Forest Soil | SFDAPKDLREQTRRQVALGELQDEVPGMPDEASAGLEEPLLEARQ*
Ga0136847_114559892 | 3300010391 | Freshwater Sediment | MPRESLDALENLTKEVPRQVAFGELQGEVPGMLDEASARPEQPLLEAREGPALD
Ga0126383_136890171 | 3300010398 | Tropical Forest Soil | MPREWLDTAENLPKERRRQVGFGQLEDEVSGVSEQPPPGLEQPLLQT
Ga0134121_100801613 | 3300010401 | Terrestrial Soil | MPRQPLAALEILPKERPSQVAFGELQGEGPGVPDQPRAGLEQPLLETRERPAAAC*
Ga0137391_101950701 | 3300011270 | Vadose Zone Soil | MPRESLDAPEDLPKHVSRQVALGQLEDEVPRMPDEAPAGLEEPLLEARQGPALDGERQ
Ga0137391_104103302 | 3300011270 | Vadose Zone Soil | MPRESLDAPENLPKEFPRQVAPGELEHEVPGIADQASTGLEEPLLE
Ga0137393_113019192 | 3300011271 | Vadose Zone Soil | VPREPLDALENLPKEAPRQVALRELQDEVPGMSDGPRAGLEEPLLEARQGP
Ga0137388_101557691 | 3300012189 | Vadose Zone Soil | MPRESLDAPEDLLKHVSRQVALGQLEDEVPRMPDEAPAGLEEPLLEARQGPALD
Ga0137388_101681293 | 3300012189 | Vadose Zone Soil | MPRESRNVPEDLPKERRCQVALGQLEDEVPGMADEATAGLEQPLLEAREGPALDGERQ
Ga0137388_103127554 | 3300012189 | Vadose Zone Soil | MPREALDAPDDLPKQALCQVALGQLEHEVPGMPDQAPAGLEQPLLEARQ*
Ga0137363_102347931 | 3300012202 | Vadose Zone Soil | VPRESLDGPENLPEEGPRQAALSQLQDEEPRVPDEAAAGLEESLLEARQGPALDGTG*
Ga0137363_104787261 | 3300012202 | Vadose Zone Soil | MPRESLDALENLPKEAPGQVAFGQLEEEGPRMQDEAPAGLEQPLLQARE*
Ga0137363_112197902 | 3300012202 | Vadose Zone Soil | VPREPLDALENLPQEAPRQVTFGELQGEVPGMPDEPRAGLEPLLEAR*
Ga0137381_112707051 | 3300012207 | Vadose Zone Soil | MPRQSLDAPKDLPKQTSRQVAFGQLEDEVSRMPDEAPAGLEKSLLEARQRPVLNGQG
Ga0137378_117651462 | 3300012210 | Vadose Zone Soil | MPRELLDAAKDLPKEAPRQVAFGQLEHEVPRISDEAPAGFEQPLLRARE*
Ga0137377_105493311 | 3300012211 | Vadose Zone Soil | VPREPLDALENLPKEAPRQVAFGELQGEVPGMPDQPPARLKQALLQARE*
Ga0137377_107945452 | 3300012211 | Vadose Zone Soil | MARESLDASENLPKESPRQVAFRKPQDEVPGMSDEAPAGLE
Ga0137360_108108741 | 3300012361 | Vadose Zone Soil | VPPESLDAPDDLPEEAPCQVAFSQLKDEVPRMPDEAATSLEEPLLETRQGPAL
Ga0137361_107423811 | 3300012362 | Vadose Zone Soil | PLDALENLPKEAPRQVAFGELEGEVPGMSDAAPALLEQPLLDGS*
Ga0137390_114862593 | 3300012363 | Vadose Zone Soil | MPCESVDAPENLPEQARRQVAVGQLQDEVPRMPDETSAGLEESLLGTRQRPAFDGTGQG
Ga0134057_10083281 | 3300012396 | Grasslands Soil | PKEAPRQVTFGELQGEVPGIPDERRAGLEEPLLEAR*
Ga0137396_102512812 | 3300012918 | Vadose Zone Soil | VPCESLDARVDLPKEEPCQVAFGKVQGEVPSMPDEASARLEEPLL
Ga0126375_114568572 | 3300012948 | Tropical Forest Soil | MPRQPLDAPENLPKEAGCQMALGQLEDEAPRMSNEAPAGLEEPLLET
Ga0180063_10147241 | 3300014885 | Soil | MPREPLDSVENPPKEAPCQVALGQPEHEVPSMPDKAPAGLEQPLLET
Ga0134085_102005913 | 3300015359 | Grasslands Soil | MPSESLDAPENLPKEAPRQVALGQLKDEVPRMPDQAPTGLEQPLLETRQGPAVDGDGPP
Ga0132257_1004847342 | 3300015373 | Arabidopsis Rhizosphere | VPRQLLDACENLTKEGASQVTFGKLQGEVPRKPDQPSARLE*
Ga0182032_111027611 | 3300016357 | Soil | MPRESLDAPENLPEEAPRQVALGKLQDEVPGMPDQASAGLERSLLQ
Ga0134083_106018181 | 3300017659 | Grasslands Soil | MPRESLDALENLPKEAPRQVAFGQLEDEVPRMPDEAPAGFEQPLLHRAGE
Ga0187782_114250281 | 3300017975 | Tropical Peatland | MPREALDAPKDLPEEASRQVAFGQLQDEVPGMPDKSPAGLEYPLLQARQ
Ga0184627_102133571 | 3300018079 | Groundwater Sediment | MPREPLDAVENLPKEAPRQVALGQLEHEEPSMPDKAPASLEQPLLEI
Ga0187770_103021581 | 3300018090 | Tropical Peatland | MPREALDASKDLPEEASRQVAFGQLQDEVPGMPDKPPAGLEYPLLQARQ
Ga0206224_10178912 | 3300021051 | Deep Subsurface Sediment | MPREPLDAVENPPKEAPCQVALGQLEHEVPSMPDEAPAGLEQPLLEMREG
Ga0206224_10229442 | 3300021051 | Deep Subsurface Sediment | MPRDSLDAPKNLPNKGPRQVALGQLEHEVPCMPDQLPVGLEEPLLEARQGPS
Ga0179596_104866531 | 3300021086 | Vadose Zone Soil | MPRESLDALENLPKEAPGQVAFGQLEDEGPRMPDEAPAGLEQPLLQARE
Ga0209827_108688372 | 3300025149 | Thermal Springs | MPRESLDAREYLPKLAPRQAALDKLEEEVSCMPDEAPASLAEPLL
Ga0209399_100700621 | 3300025157 | Thermal Springs | MPRESLDAPEHLSKEALCQVAFSKLEDVVPGGPDEAPAGLEQSLLEARQG
Ga0209520_100250772 | 3300025319 | Soil | MPRESLDPPENLPTQAVRQVAFGRLEDEVPRMPDEAPAGLE
Ga0209342_107536561 | 3300025326 | Soil | MPCELLDAPQDLAKEALCQVAFGKLEDEVSRMPDEAPAGLEEPLLEARQ
Ga0208192_10285883 | 3300025477 | Peatland | MPRESLDAPEALPNEAPRQVAFSKLENVMPRMPDQPPASLEEPLLEARQGAVLDGEGEGEPA
Ga0207646_106260552 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | VPREPLDALENLSKEAPRQVAFGELQGEVPRMPDQPPARLEQALLQARQ
Ga0207646_109578401 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MPCEALDAVENPPKEAPGQVARAQLEHEVPRMPDQLPAGLEQPLLET
Ga0207681_103040131 | 3300025923 | Switchgrass Rhizosphere | MPRQPLDALENLPKERPSQVAFGELQGEVSGVPDQPRAGLEQPLLETRERPAAAC
Ga0209236_11118712 | 3300026298 | Grasslands Soil | VPREPLDALENLPKEAPRQVAFGELQGEVPGMPDEPRAGLEPLLEAR
Ga0209158_12393212 | 3300026333 | Soil | VPREPLDALENLPQEAPRQVTFGELQGEVPGMPDEPRAGLEPLLEAR
Ga0209158_12638931 | 3300026333 | Soil | PLDALENLPKEAPRQVAFGELQGKVPGMPDEPRAGLEPLLEAR
Ga0209058_10438135 | 3300026536 | Soil | MARESLDASENLPKESPRQVAFRKPQDEVPGMSDEAPAG
Ga0209869_10449371 | 3300027187 | Groundwater Sand | LPDALENLPEEAPRQVAFSELEGEAPCMPDQPSARLEQALLQA
Ga0209842_10346253 | 3300027379 | Groundwater Sand | VCGPRSGGEPRESLDAPEDLPKQAPRQVALGKLQDEVPSIPDEAPACLEESLLEARQ
Ga0318571_101555072 | 3300031549 | Soil | MPRESLDAPENLPEEAPRQVALGKLQDEVPGMPDQASAGLERSLLQTGQ
Ga0318528_102228622 | 3300031561 | Soil | MPRESLDAPENLPEEAPRQVALGKLQDEVPGMPDQASAGLEQSLLQTGQ
Ga0307469_100693572 | 3300031720 | Hardwood Forest Soil | VRRESLDAPEDLPKERWRQVAFGQLQDEVPRVSNEAPAGLEQALLEARQILRDARRGEEP
Ga0307468_1019345121 | 3300031740 | Hardwood Forest Soil | VRRESLDAPEDLPKERWRQVAFGQLQDEVPRVSNEAPAGLEQVLLEARQILRDARRGEEP
Ga0318537_103213632 | 3300031763 | Soil | MPRESLDTPKDLPKEAPRQVALGQLEHEVPRMPDQPPAGLEEPLLE
Ga0318509_108624191 | 3300031768 | Soil | MPRESLDAAENLLEEAPRQVALGELQDEVPAMPDEASAGLEESLLEARQG
Ga0306921_123971031 | 3300031912 | Soil | MPRESLDAPENLPEEAPRQVALGKLQDEVPGMPDQASAGLERSLLQT
Ga0318570_100760302 | 3300032054 | Soil | MPRESLDAPENLPEEAPRQVALGKLQDEVPGMPDQVSAGLERSLLQTGQ
Ga0307470_111903962 | 3300032174 | Hardwood Forest Soil | MPPQPLDVGKDLAKERSSQVTFGELQGEVPGVADQPPAGLEEPLLQARERPVLDGDGQHQPTE
Ga0315273_121116812 | 3300032516 | Sediment | MPRQPLDPVENPPKEAPCQVALGQLEHEVPSMPDEAPAGLEQPLL
Ga0364930_0231872_2_193 | 3300033814 | Sediment | MPREPLDAPENLPKQALRQVAFGQLEDTVPRMPDEAPAGGDEPRLEARQGPALDGERQDSRRRR
Ga0314861_0104052_961_1110 | 3300033977 | Peatland | MPREALDASKDLPEEASRQVAFGQLQDEVPGMPDKSPAGLEYPLLQARQ
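
Several sequences in the table above end in `*`. The sketch below assumes this character marks a stop codon rather than a residue (the page does not say so explicitly) and shows one way to strip it before measuring lengths. The helper names are hypothetical, and the two sample sequences are copied from the table.

```python
from statistics import mean


def strip_stop(seq: str) -> str:
    """Drop a trailing '*' (assumed here to mark a stop codon)."""
    return seq[:-1] if seq.endswith("*") else seq


def mean_length(seqs: list[str]) -> float:
    """Mean residue count over the given sequences, ignoring a trailing '*'."""
    return mean(len(strip_stop(s)) for s in seqs)


# Two family members copied from the table above (a small subset, not the full family)
sample = [
    "MPRESLDALENLPKEAPGQVAFGQLEDEGPRMPDEAPAGLEQPLLQARE*",
    "MARESLDASENLPKESPRQVAFRKPQDEVPGMSDEAPAGLE*",
]
print(mean_length(sample))
```

Because only two rows are used, the printed value is not expected to match the family-wide average reported under Basic Information.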




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice