NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F103728

Metagenome / Metatranscriptome Family F103728

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103728
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 73 residues
Representative Sequence MSESTEKAAEWWSDPDRDVPGTQWLQVPGAVENMNRRATGDPEMDWITHSAGLLAKFTKPIKALSVGCGFG
Number of Associated Samples 92
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score0.45

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(15.842 % of family members)
Environment Ontology (ENVO) Unclassified
(36.634 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(40.594 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 38.38%    β-sheet: 0.00%    Coil/Unstructured: 61.62%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.45
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF13469Sulfotransfer_3 22.77
PF13489Methyltransf_23 3.96
PF00685Sulfotransfer_1 3.96
PF07995GSDH 1.98
PF13692Glyco_trans_1_4 0.99
PF05050Methyltransf_21 0.99
PF12708Pectate_lyase_3 0.99
PF14581SseB_C 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG2133Glucose/arabinose dehydrogenase, beta-propeller foldCarbohydrate transport and metabolism [G] 1.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil15.84%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil14.85%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil8.91%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil3.96%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere3.96%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.97%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil2.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.97%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.97%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.98%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.98%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.98%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.98%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.98%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.99%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.99%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.99%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.99%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil0.99%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.99%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil0.99%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.99%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.99%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.99%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.99%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.99%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.99%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.99%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.99%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.99%
AgaveHost-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2189573000Grass soil microbial communities from Rothamsted Park, UK - July 2010 direct MP BIO 1O1 lysis 0-21cm (T0 for microcosms)EnvironmentalOpen in IMG/M
2189573004Grass soil microbial communities from Rothamsted Park, UK - FG2 (Nitrogen)EnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000890Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300003321Sugarcane bulk soil Sample H1EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005335Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaGHost-AssociatedOpen in IMG/M
3300005343Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaGEnvironmentalOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005530Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005562Agave microbial communities from Guanajuato, Mexico - As.Ma.eHost-AssociatedOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005618Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300010040Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012898Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S194-509B-1EnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300019361Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2)EnvironmentalOpen in IMG/M
3300019867Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m1EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m1EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020018Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s2EnvironmentalOpen in IMG/M
3300020062Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a1EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021339Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c1EnvironmentalOpen in IMG/M
3300021411Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3c2EnvironmentalOpen in IMG/M
3300021413Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c1EnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300021510Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_coexEnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025903Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025919Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025930Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)EnvironmentalOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300030917Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FB5 Emin (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031538Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1EnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031854Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1EnvironmentalOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300032000Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D3EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
N55_094367302189573000Grass SoilMSESSKRAAEWWSDPDRVVPGTQWLQVPGTSENMNRRATGDPEMNWITHSAALLAKFPKPIKALSLGC
FG2_079018602189573004Grass SoilMSETTEKAAEGWSAPDRDVPGTQWLQIPGAVENMNRRATGDPEMDWITHSAGLLAKFANPIKALSLGCGFGVIERVLRRCDYCQL
F14TC_10134485613300000559SoilMSETTEKAAEWWSDPDRDVPGTQWLQIPGAVKNMNRRATGDPEMDWITHSAGLLAKFAKP
JGI1027J11758_1240598723300000789SoilMVEGTKSDAAKKAGEWWSDPERQVTGTQWVEVPGTFENLNRRATGDPAIDWITHSGSLLATFTKPVKALSLGCGFGVIERILRRRDYCQLI
JGI11643J12802_1072077823300000890SoilMSEAIRKAAEWWSDPESEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASLLASFTKPINLLSVGCGFGAIERLLRRRDYCQHI
JGI1027J12803_10167014443300000955SoilMSKATKKAAEWWSDPQSEAPGTQWVQVPGVFESLNRRATGDPSIDWINHSASLLANFAKPIKALSAGCGFGGIERILRR
JGI1027J12803_10362743723300000955SoilMSEPTKKAAEWWSDPDRDVPGTQWLQIPGASENMNRRATGDPEVNWITHSAGLLAQFPKP
JGI10216J12902_10301940623300000956SoilMSEATQKAAEWWSDPESGDSETQWVRVPGVAENMNRRATGDPAIDWIHHSAGLLASFAKPIKALSLGCGFGIIERVLRRSDFCQIIH
soilH1_1040081213300003321Sugarcane Root And Bulk SoilMTEPAKKAAEWWSDPESEARETQWVRVPGVQENMNRRATGDPEMDWISHSGGLLVKFAKPVKALSLGCGFGVIERVIRRR
Ga0062595_10090973013300004479SoilMSEAVKKAAEWWSDPERGVTGTQWLDVPGAIENMNLRATGDPKLDWISHSASLLASLSKPVKALSVGCGFGVI
Ga0062592_10067037813300004480SoilMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRR
Ga0062591_10159709913300004643SoilMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDW
Ga0066683_1001298413300005172SoilMTDTSVSEAAKRSARWWNDPQSEAPGTQWVEVPGVAENINRRATGDPEIDWIGHSAGLLLKSKRPIEALSIGCGFGRI
Ga0066673_1040027113300005175SoilMAEPINTKAIKKVAERWGDPQSEPPGTQWVGVPGVAENINRRATGDPKIDWINHSGSLLARFKKPIKALSLGCGFGAIERILR
Ga0066675_1096633213300005187SoilMIETSVSEATKRAAKWWSDPQSEVPGTQWVEVPGVAETINRRATGDPEIDWISHSAGLLAKSKRPIKALSIGCGFGGIERLLRRRDYCQLIH
Ga0066388_10106195513300005332Tropical Forest SoilMSESTEKAAEWWSVPDQNPPGTQWLQIPGATENMNRRATGDPEMDWITHSAGLLSKFQKPIKALSLGCGFGVIERVLR
Ga0066388_10697454123300005332Tropical Forest SoilMNESTSEKAAEWWSDPGREIPGTQWLQVPGAIQNMNSRATGDPEMDWITHSAGLLAKFAKPVKALSLGCGFGVIERVLRRSD
Ga0070666_1136359323300005335Switchgrass RhizosphereMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALL
Ga0070687_10059832113300005343Switchgrass RhizosphereMSETTEKAAEWWSDPGRDVPGTQWLQIPGAIENMNRRATGDPEMDWITHSAGLLSKF
Ga0070674_10124119313300005356Miscanthus RhizosphereMSDSTEKAAEFWSDPGRDVPGTQWLQIPGAIENMNLRATGDPEMDWITHSAALLAKFKKP
Ga0070710_1071341823300005437Corn, Switchgrass And Miscanthus RhizosphereMSESTEKAAEWWSDPDRDVPGTQWLQVPGAVENMNRRATGDPEMDWITHSAGLLAKFTKPIKALSVGCGFGIIERVLRRCDYCQLIHGVDVAEGAIEGARKAAQDEGL
Ga0070711_10008070613300005439Corn, Switchgrass And Miscanthus RhizosphereMSKTANKAAEWWSDPETEGPETHWVRVPGVVENMNRRATGDPAIDWISHSASLLARFAKPIKALSVGCGFGGIERALRRRN
Ga0070711_10177787713300005439Corn, Switchgrass And Miscanthus RhizosphereMSETTEKAAEWWSDPGRIAAGTQWLEIPGATENMNHRATGDAEMDWITHSAGLLAKFAKPIKALSL
Ga0066689_1035792713300005447SoilMSESTEKAAEWWSDPGRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVLSLGCGFGVIE
Ga0070679_10131549513300005530Corn RhizosphereMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRRADY
Ga0066692_1067312223300005555SoilMSEATKKAAEWWSDPQSEAPETQWVRVPGVVQNMNRRATGDMAIDWINHSATLLSRFAKPIKALSIGCGFGIIERVL
Ga0058697_1036185813300005562AgaveMSEAIRKAADWWSDPQSEAPETQWVRVPGVVENMNRRATGDPAIDWINHSATLLTSLAKPIKALSIGCGFGVIERTLRRQDFCQLIHGVD
Ga0066702_1001926213300005575SoilMIETSVSEATKRAAKWWSDPQSEVPGTQWVEVPGVAENINRRATGDPEIDWISHSAGLLAKSKRPIKALSIGCG
Ga0068859_10244478923300005617Switchgrass RhizosphereMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHS
Ga0068864_10208864913300005618Switchgrass RhizosphereMSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRR
Ga0066903_10679697813300005764Tropical Forest SoilMSESTEKAAEWWSDPSHDVPGTQWLGVPGATENMNRRATGDPEMDWIAHSAALLSRFAKPIKALSLGCGFGVIE
Ga0070716_10177774413300006173Corn, Switchgrass And Miscanthus RhizosphereMSEPTKKAAEWWSDPDRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAKFAKPIKALSLGCGFGVIERV
Ga0066665_1149955123300006796SoilMSESTEKAAEWWSDPGRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVLSLGCGFGVI
Ga0075428_10041662523300006844Populus RhizosphereMSEASKKAAEWWSDPKSEAPETQWVRVPGVAENMNRRATGDPAINWINHSAGLLTSFAKPIKALSLSCGFGIIERVLRRSDFCQIIHGVDVAENAIESAR
Ga0075425_10076936613300006854Populus RhizosphereMSESTEKAAEWWSDPGRDVPGTQWLQVPGAIENMNLRATGDPEMDWITHSAGLLAKFAKPIKALSLGCGFGV
Ga0105245_1040831723300009098Miscanthus RhizosphereMSKSTNKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALL
Ga0126308_1038902323300010040Serpentine SoilMSESTKKAADFWSDPGRDVPGTQWLHIPGAVENMNLRATGDPEMDWIT
Ga0126372_1149271013300010360Tropical Forest SoilMNEPTKKAAEWWSDPDSEAPETQWVRVPGVAENMNRRATGDPEMDWITHSAGLLAKFEKPVKALSLGCGFGVIERVLRRCDYCQLIHGL
Ga0126372_1213910713300010360Tropical Forest SoilMSESTEKAAEWWSDPSHDVPGTQWLGVPGATENMNRRATGDPEMDWITHSAGL
Ga0134125_1267215123300010371Terrestrial SoilMSEASRKAAEWWSDPDRDVPGTQWLQIPGAVQNMNRRATGNPEMDWISHSAGLLAKFAKPIK
Ga0105246_1074586123300011119Miscanthus RhizosphereMSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYC
Ga0137364_1008578413300012198Vadose Zone SoilMSEASKKAAEWWSDPKSEAPETQWVRVPGVAENMNRRATGDPAINWINHSAGLLSGFAKPIKALSLGCGFGIIERVLRRSDFC
Ga0137399_1154058813300012203Vadose Zone SoilMSEATRKAAEWWSDPQSEASETQWVRVPGVAENMNRRATGDPAINWINHSAGLLSGFAKPIKALSLGCGFGIIERVL
Ga0137377_1176710023300012211Vadose Zone SoilMSESTKKAGEWWSDPDRDIPGTQWLQIPGAVENMNRRATGDPEMDWI
Ga0137371_1048517313300012356Vadose Zone SoilMIESTEKAAEWWSDSDRDVPGTQWLLVPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVL
Ga0137360_1172860223300012361Vadose Zone SoilMSEATKKAAEWWSDPQSEAPETQWVRIPGVVENMNCRATGDPAMDWINHSAGLLASFAKPVKALSVGC
Ga0157293_1015711623300012898SoilMSKSTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKP
Ga0137413_1004504033300012924Vadose Zone SoilMSEAIRKAAEWWSDPESEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASFLASFTKPINVLSVGCGFGTIERLLRRRDNCQQVNRVDIAGAVIEATTKTAEAERLEGLT*
Ga0137407_1055135413300012930Vadose Zone SoilMSESTEKAAEWWSDPDRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVL
Ga0137407_1237074513300012930Vadose Zone SoilMSESTEKAAEWWSDPGRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVL
Ga0126375_1122494023300012948Tropical Forest SoilMSESTEKAAEWWSDPDRDVPGTQWLQVPGAIENMNLRATGDPEMDWITHSAGLLAKFAKPIKALSPGCGFGVIE
Ga0164300_1084737023300012951SoilMSEATKKAAEWWSDPKSEAPETQWVRVPGVVENMNRRATGDPAIDWINHSAGLLTGFARPIKALSVGCGFGVIERTLRRHDFCQLIHGVDVAENA
Ga0164302_1065220723300012961SoilMSEPTKKAAEWWSDPESEAPETQWVRVPGVVENMNRRATGDPEMDWITHSAGLLAKFAK
Ga0134076_1047295713300012976Grasslands SoilMSEATRKAANWWSDPQSEAPETQWVRVPGISENMNRRATGDPAIDWIHHSAGLLRSFAKPIKALSIGCGFGI
Ga0164305_1106912713300012989SoilMVLFDMSETTEKAAEWWSDPGRIAAGTQWLEIPGATENMNHRATGDPEMDWITHSA
Ga0134078_1009925723300014157Grasslands SoilMAEGTKSDATKKVAEWWSDSQREVPGTQWVEVPGALENMNRRATGDPGIDWINHSASVLAHFKKPIKALSLECGFGLIERVLRRGNFCQLVHGVDVAEGAMKALGKRPKQRGWMV*
Ga0157379_1220213313300014968Switchgrass RhizosphereMSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWI
Ga0132256_10206767313300015372Arabidopsis RhizosphereMSESTEKAAEWWSDPDRDVPGTQWLQVPGAIENMNRRATGEPEMNWITHS
Ga0132257_10288380013300015373Arabidopsis RhizosphereVNVTEKAAEWWSDPDRDVPGTQWLQIPGAVQNMNRRATGNPEMDWITHSAGLLAKFAKPIKALSLGCGFG
Ga0132255_10087831413300015374Arabidopsis RhizosphereMSESSKRAAEWWSDPDRDVPGTQWLQVPGTSENMNRRATGDPEMDWITHSAGLLAK
Ga0132255_10303066013300015374Arabidopsis RhizosphereMSKSTEKAAEWWSDPDREVPGTQWLLVPGASENMNRRATGDPEMDWITHSAALLAKFAKPIKALSLGC
Ga0132255_10410817123300015374Arabidopsis RhizosphereVNVTEKAAEWWSDPDRDVPGTQWLQIPGAVQNMNRRATGNPEMDWITHSAGLLAKFAKPIKALSLGCGFGVIER
Ga0182041_1155995913300016294SoilMSTLKSDVSKKVAVSWSDPQSEAPGTQWVQVPGVKESVNRRATGDPAIEWIDHSASLLAGFTKPINVLSVGCGFGAIERLLR
Ga0134069_120710613300017654Grasslands SoilMSEATRKAANWWSDPQSEAPETQWVRVPGISENMNRRATGDPAIDWIHHSAGLLRSFAKPIKALSIGCGFGIIERTLRRRDFCQ
Ga0184620_1022365613300018051Groundwater SedimentMTKTLESDAAEKVAVWWSDPQSEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASLLASFTKPIKVLSVGCGFGAIERLL
Ga0173482_1036169323300019361SoilMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITH
Ga0193704_104200313300019867SoilMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLARFKKPIKVLSLGCGFGVIERVLRRSDSCQLIH
Ga0193707_109139223300019881SoilMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDW
Ga0193710_101892013300019998SoilMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRRSDSCQLIHG
Ga0193735_107905223300020006SoilMSEATKKAAEWWSDPQSEAPETQWVRVPGVVENMNYRATGDPAIDWINHSAGLLATFAKPVKALSA
Ga0193721_106160813300020018SoilMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAAL
Ga0193721_117318413300020018SoilMSLGAPLQNAVLSRMDEVSKKVAEWWGDPQSEAPGTQWVEVPGISENTKFRASGDPAIDWVNHSASLLSRFTRPIKALSLGCGFGVIERILRRRDYCQLIHG
Ga0193724_100119123300020062SoilMSEPTKKAAEWWSDPESEAPETQWVRVPGVLENMNRRATGDPEMDWITHSAGLLAKFAKPVKALSV
Ga0210381_1002635433300021078Groundwater SedimentMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIK
Ga0193706_120189613300021339SoilMRLKPKPLSNEALFRMSEATKKAAEYWSSAQSQAFGNNWVGVPGVVENMNRRASGDPAINWINHSAALLSRFAKPIKALSIGCGLGIIERVLRRHDFCQLIHGVDVAENSIKSARQT
Ga0193709_110345123300021411SoilMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLA
Ga0193750_103002113300021413SoilMALLHMVEDTKIEAAKKAGEWWSDPEREIPGTQWLQIPGASENMNHRATGDPEMDWITHSASLLAKFAKPIKALSLGCGFGVIER
Ga0193695_110878013300021418SoilVNVTEKAAEWWSDPERDVPGTQWLLVPGAVENMNRRATGDPAINWITHSAGLLAKFAKPIKALSLGCGFGIIERVL
Ga0222621_103792613300021510Groundwater SedimentMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRRSDSCQ
Ga0224452_110417623300022534Groundwater SedimentMSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLARF
Ga0179589_1004070513300024288Vadose Zone SoilMSEAIRKAAEWWSDPESEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASFLASFTKPINMLSVGCGFGAIERLLR
Ga0207680_1102922323300025903Switchgrass RhizosphereMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGC
Ga0207685_1043560623300025905Corn, Switchgrass And Miscanthus RhizosphereMSESTEKAAEWWSDPDRDVPGTQWLQVPGAVENMNRRATGDPEMDWITHSAGLLAKFTKPIKALSVGCGFG
Ga0207657_1132003323300025919Corn RhizosphereMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWIT
Ga0207701_1048766223300025930Corn, Switchgrass And Miscanthus RhizosphereMSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYCQFIH
Ga0207704_1146110323300025938Miscanthus RhizosphereMSETTEKAAEWWSDPGRDVPGTQWLQIPGAIENMNRRATGDPEMDWITHSAGLLSKFPKPVKVLSLGCGFGVIERALRR
Ga0207711_1180476013300025941Switchgrass RhizosphereMSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGCGFGVI
Ga0209473_107381423300026330SoilMAEGTKSDATKKVAEWWSDSQREVPGTQCVEVPGALENMNRRATGDPGIDWINHSASVLAHFKKPIKALSLGC
Ga0209158_133853113300026333SoilMIESTEKAAEWWSDPDRDVPGTQWLLVPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVLSL
Ga0209057_105362023300026342SoilMTDTSVSEAAKRSARWWNDPQSEAPGTQWVEVPGVAENINRRATGDPEIDWIGHSAGLLLKSKRPIEALSIGCGFGRIERLLRRSDYCQLIHGRGLAGFDV
Ga0257161_106386833300026508SoilMSEATKKAAEYWSSAQSHAPGNNWLGVPGVVENMNRRATGDPAIDWINHSAALLSRFAKPIKALSIGCGFGIIERVLR
Ga0209474_1000496413300026550SoilVCYNDIIHMAEGTKSDATKKVAEWWSDSQREVPGTQCVEVPGALENMNRRATGDPGIDWINHSASLLAHFKKPIKALSLGCGFGVIERV
Ga0075382_1084645413300030917SoilMSETTEKAAEWWSNPDRDVPGTQWLQIPGAVENMNRRATGNPEMDWITHSAGLLAKFAKPIKALSLGCGFGVIERVLRRCDY
Ga0310888_1068773613300031538SoilMSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYCQL
Ga0307476_1065307413300031715Hardwood Forest SoilMNKAVKKAAEWWSDPQREVPGTQWLEIPGALQNMNRRATGDPAIDWINHSASLLANFKPPVKALSLGCGFGIIERVLRRQ
Ga0310904_1059148913300031854SoilMSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGL
Ga0308175_10195390523300031938SoilMSESTKKAAEFWSDPDRDVPGTQWLHIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRSSDS
Ga0310912_1068671413300031941SoilMSEAIKKAAEWWSDPESEAPETQWVRVPGVEQNMNRRATGDPAIDWINHSASLLTSFAKPIKALSIGCGFGIIERRLRRNDFCQIIHGVDVAEN
Ga0310903_1062484713300032000SoilMSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYCQLIHG
Ga0307471_10100440413300032180Hardwood Forest SoilMSEPTKKAAEWWSDPGSEAPETQWVRVPGVDENMNRRATGDPEMDWITHSAGLLAKFPKP
Ga0307471_10396613133300032180Hardwood Forest SoilMSEVIRKAADWWSDPQSEAPETQWVRVPGVNENMNRRATGDPAIDWINHSAVLLSRFAKPIKALSVGCGFG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.