NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F099378

Metagenome Family F099378

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F099378
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 64 residues
Representative Sequence FGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGMGTGMDGSTVIFAFGVD
Number of Associated Samples 94
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction Yes
3D model pTM-score0.50

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(9.709 % of family members)
Environment Ontology (ENVO) Unclassified
(52.427 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(62.136 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 26.14%    Coil/Unstructured: 73.86%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.50
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 103 Family Scaffolds
PF02566OsmC 47.57
PF13645YkuD_2 9.71
PF02616SMC_ScpA 6.80
PF01887SAM_HAT_N 4.85
PF03976PPK2 1.94
PF13568OMP_b-brl_2 1.94
PF11138DUF2911 0.97
PF13715CarbopepD_reg_2 0.97
PF07494Reg_prop 0.97
PF01741MscL 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 103 Family Scaffolds
COG1764Organic hydroperoxide reductase OsmC/OhrADefense mechanisms [V] 47.57
COG1765Uncharacterized OsmC-related proteinGeneral function prediction only [R] 47.57
COG1354Chromatin segregation and condensation protein Rec8/ScpA/Scc1, kleisin familyReplication, recombination and repair [L] 6.80
COG1912Stereoselective (R,S)-S-adenosylmethionine hydrolase (adenosine-forming)Defense mechanisms [V] 4.85
COG2326Polyphosphate kinase 2, PPK2 familyEnergy production and conversion [C] 1.94
COG1970Large-conductance mechanosensitive channelCell wall/membrane/envelope biogenesis [M] 0.97
COG3292Periplasmic ligand-binding sensor domainSignal transduction mechanisms [T] 0.97


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil9.71%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere6.80%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere6.80%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere6.80%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere6.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil5.83%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil3.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil3.88%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere3.88%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere3.88%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere2.91%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.91%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere2.91%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil1.94%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.94%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.94%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil1.94%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.94%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.94%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.94%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.94%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.97%
Polar Desert SandEnvironmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand0.97%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)0.97%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.97%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost0.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.97%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.97%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.97%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.97%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.97%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.97%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.97%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.97%
Active SludgeEngineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Active Sludge0.97%
Sediment SlurryEngineered → Bioremediation → Metal → Unclassified → Unclassified → Sediment Slurry0.97%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005329Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaGEnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005353Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaGHost-AssociatedOpen in IMG/M
3300005354Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaGHost-AssociatedOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005367Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaGHost-AssociatedOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005539Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2Host-AssociatedOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005564Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaGHost-AssociatedOpen in IMG/M
3300005578Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2Host-AssociatedOpen in IMG/M
3300005614Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2Host-AssociatedOpen in IMG/M
3300005616Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2Host-AssociatedOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005836Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBBEnvironmentalOpen in IMG/M
3300005841Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2Host-AssociatedOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005844Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2Host-AssociatedOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006358Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2Host-AssociatedOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006876Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200EnvironmentalOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012956Active sludge microbial communities from wastewater, Klosterneuburg, Austria - Klosneuvirus_20160825_MGEngineeredOpen in IMG/M
3300012964Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 4 metaGEnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012985Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MGEnvironmentalOpen in IMG/M
3300013105Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2-5 metaGHost-AssociatedOpen in IMG/M
3300013294Permafrost microbial communities from Nunavut, Canada - A3_65cm_0MEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017789Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ322 (21.06)EnvironmentalOpen in IMG/M
3300017792Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaGHost-AssociatedOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018465Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 ISEnvironmentalOpen in IMG/M
3300018476Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 TEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300022883Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S066-202C-4EnvironmentalOpen in IMG/M
3300022899Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S016-104C-6EnvironmentalOpen in IMG/M
3300022901Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S156-409C-4EnvironmentalOpen in IMG/M
3300023057Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S136-409B-6EnvironmentalOpen in IMG/M
3300023064Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S001-104B-6EnvironmentalOpen in IMG/M
3300023261Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S166-409R-6EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025919Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025926Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025944Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026041Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026078Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026983Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 21 (SPAdes)EnvironmentalOpen in IMG/M
3300027637Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027639Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control (SPAdes)EnvironmentalOpen in IMG/M
3300028379Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300032003Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D1EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300034155Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17EnvironmentalOpen in IMG/M
3300034159Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_0210_18EnvironmentalOpen in IMG/M
3300034815Uranium-contaminated sediment microbial communities from bioreactor in Oak Ridge, Tennessee, United States - B3A4.1EngineeredOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10120755323300000364SoilSGKGVRKVLLQKAPGMNPFGSHKLQSSDKYTFSARKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGTTLFAFGVD*
F14TC_10045684933300000559SoilSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMLSMGTGMDGSTVIFAFGMD*
F14TC_10313961123300000559SoilSSEKYTFSARKVKEGYWELVVDKPLPKGEYAFTMMGMGTSGMDGSTVLFAFGVD*
JGI1027J11758_1198732423300000789SoilSKKNQSSEKFSFSVKKIRDGYWELVVDKQLPKGEYAFSMMDMTGSSGAMGAMLFAFGVD*
JGI1027J12803_10105457323300000955SoilIFLQKSGGSFSKKNQSSEKFSFSVKKIRDGYWELVVDKQLPKGEYAFSMMDMTGSSGAMGAMLFAFGVD*
JGI25613J43889_1022921613300002907Grasslands SoilMFSANKNQSSDKYTFSVKKVREGYWELLVDKTLSRGEYAFTMMNMGMGNMDGSMLLFAFAVD*
Ga0063356_10638777213300004463Arabidopsis Thaliana RhizosphereFASKKMKSSDKYTFSVKKIKEGYWELVVDKSLPKGEYAFSVMSMGMGNMDGSNLMFAFGID*
Ga0070683_100007809123300005329Corn RhizosphereNPFGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD*
Ga0068869_10116879413300005334Miscanthus RhizosphereFGSHKLQSSEKFTFSARKIKEGYWELIVDKPLPKGEYAFTMMGMGMSGMDGSTVIFAFGVD*
Ga0070680_10114249223300005336Corn RhizosphereGYFSMGKNKLQSSDKMTFSVKKIRSGYWELIVDKPLPKGEYAFTMMGMGTSGMDGSSLIFAFGVD*
Ga0070660_10094363023300005339Corn RhizosphereLQSSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD*
Ga0070668_10029357033300005347Switchgrass RhizosphereFSMGKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD*
Ga0070669_10098241613300005353Switchgrass RhizosphereGMNPFGSHKLQSSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD*
Ga0070675_10007666313300005354Miscanthus RhizosphereFGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD
Ga0070674_10105114023300005356Miscanthus RhizosphereKIYMMKTGGAFSMGKNKSSDKFTFSVRKIRDGYWELVVDKPLPKGEYAFVTMNGYSGTGGMDALLFAFGVD*
Ga0070667_10012003853300005367Switchgrass RhizosphereSAKGVRKVLLQKAPGMNPFGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD*
Ga0070667_10091496723300005367Switchgrass RhizosphereLQKAPGMNPFGSHKLQSSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD*
Ga0070694_10106616213300005444Corn, Switchgrass And Miscanthus RhizosphereAFSMGKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD*
Ga0068867_10012816413300005459Miscanthus RhizosphereVKVGGYFSMGKNKLQSSDKMTFSVKKIRNGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSTLIFAFGVD*
Ga0068853_10026257333300005539Corn RhizosphereSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD*
Ga0070696_10048231723300005546Corn, Switchgrass And Miscanthus RhizosphereDKYTFSVKKVREGYWELVVDKPLPKGEYAFTMMNMGMGSMDGSTLLFAFAID*
Ga0070665_10155243313300005548Switchgrass RhizosphereVKVGGYFSMGKNKLQSSDKMTFSVKKIRSGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSTLIFAFGVD*
Ga0070664_10040617013300005564Corn RhizosphereKLQSSEKYTFSARKIKAGYWELVVDKPLPKGEYAFTMMGMGTGMDGSTVIFAFGVD*
Ga0070664_10128984423300005564Corn RhizospherePGMNPFGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGMGTGMDGSTVIFAFGVD*
Ga0068854_10192372213300005578Corn RhizosphereSDKYTFSVKKVREGYWELVVDKPLPKGEYAFTMMNMGMGSMDGSTLLFAFAID*
Ga0068856_10251007813300005614Corn RhizosphereKSIRKIYLMKVGGYFSMGKNKLQSSDKMTFSVKKIRSGYWELIVDKPLPKGEYAFTMMGMGTSGMDGSSLIFAFGVD*
Ga0068852_10231579323300005616Corn RhizosphereFTVSVKKIRDGYYQLDVDKPLPKGEYAFVMNSMTSADGSALLFAFGVD*
Ga0068861_10260431323300005719Switchgrass RhizospherePGMNPFGSHKLQSSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD*
Ga0074470_1146076423300005836Sediment (Intertidal)GALPFSSHKIRSSDKYTFGVKKIREGYWELVVDKELPKGEYAFTMMNFMGGGPMGETLVFAFAID*
Ga0068863_10041472913300005841Switchgrass RhizospherePFGSHKLQSSDKYTFSTRKIRDGYWELIADKPLPKGEYAFTMMGTGMDVMSGTTLFAFGVD*
Ga0068863_10154221633300005841Switchgrass RhizosphereDAGKGTRKIYTMKTGGAFSMGKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD*
Ga0068860_10215319513300005843Switchgrass RhizosphereFGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGMGTGMDGSTVIFAFGVD
Ga0068862_10246379613300005844Switchgrass RhizosphereKAPGMNPFGSHKLQSSEKYTFSARKIKAGYWELVVDKPLPKGEYAFTMMGMGTGMDGSTVIFAFGVD*
Ga0066652_10093826733300006046SoilRKIYLVKVGGYFSMGKNKLQSSDKMTFSVKKIRNGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSSLIFAFGVD*
Ga0068871_10046921733300006358Miscanthus RhizosphereSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD*
Ga0075428_10045313113300006844Populus RhizosphereLMKTGGAVPFAGPKNKSSDKYTLSIKKIREGYWEMIVDKTLPKGEYAFTVMGGTMANADGSVIIFAFAVD*
Ga0075434_10191751323300006871Populus RhizosphereASKKPKSSEKYSFSVKKIRDGYWELVIDKPLPKGEYAFTVMGMGMANMDGTTNIFAFAVE
Ga0079217_1047514423300006876Agricultural SoilSKKMKSSDKYTFSVKKIKDGYWELVVDKSLPKGEYAFSVMSMGMGNMDGSTLMFAFGID*
Ga0068865_10087945423300006881Miscanthus RhizosphereMKTGGAFSMGKNKSSDKFTFSVRKIRDGYWELVVDKPLPKGEYAFVTMNGYSGTGGMDALLFAFGVD*
Ga0079215_1095270713300006894Agricultural SoilKILLMKSPGAMPFGSKKMKSSDKYTFSVKKIKDGYWELVVDKSLPKGEYAFSVMSMGMGNMDGSTLMFAFGID*
Ga0075423_1225693723300009162Populus RhizosphereGKGTRKIYTMKTGGAFSMGKNKSSDKFTFSVKKICEGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD*
Ga0105249_1117098313300009553Switchgrass RhizosphereQSSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD*
Ga0134124_1236147713300010397Terrestrial SoilVLLQKAPGMNPFGSHKLQSSEKFTFSARKIKEGYWELIVDKPLPKGEYAFTMMGMGMSGMDGSTVIFAFGVD*
Ga0105246_1050025933300011119Miscanthus RhizosphereSAKGVRKVLLQKAPGMNPFGSHKLQSSEKYTFSARKIKERHWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD*
Ga0137374_1067182513300012204Vadose Zone SoilMQKIGGAMSFGNKKMQSSYKYTFSVKKIREGYWELVIDKTLPKGEYAFSMMGMGMGNMDGSTTLFSFGID*
Ga0150985_11500286613300012212Avena Fatua RhizosphereRKIYLVKVGGYFSMGKNKLQSSDKITFSVKKIRNGYWELVIDKPLSKGEYAFTMMSMGMSGADGSTLIFAFGVD*
Ga0137372_1109209113300012350Vadose Zone SoilLLQKAPGMNPFGKHKIESSDKYTFSVKKIREGYWELVVDKPLPRGEYAFTMQGAGMGDAMTGATTLFAFGVD*
Ga0126375_1197998123300012948Tropical Forest SoilGYFSVGKNKLQSSDKMTFSVRKIRSGYWELVIDKPLSKGEYAFTMMGMGTSGMDGSTLIFAFGVD*
Ga0164298_1040400833300012955SoilFSARKIKEGYWELVVDKPLPKGEYAFTMMGMGTGMDGSTVIFAFGVD*
Ga0154020_1016403513300012956Active SludgeKNKSSDKFTFSVKKIREGYWELVIDKPLNRGEYVFSMMNMGAGAMDGSTVLFAFGVD*
Ga0153916_1276427823300012964Freshwater WetlandsYTFSVRKIRPGYLELVIDKPLPKGEYVFVVMGGFEINLDGSASLFAFEIV*
Ga0126369_1160440813300012971Tropical Forest SoilAPGMNPFGSHKIKSSDKYTFSAKKIRDGYWELVIDKPLPQGEYAFTMMGVGANMDMTGGMLVFAFGVD*
Ga0164308_1201918323300012985SoilIYLVKVGGYFSMGKNKLQSSDKMTFSVKKIRNGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSSLIFAFGVD*
Ga0157369_1220836413300013105Corn RhizosphereKSIRKIYLVKVGGYFSMGKNKLQSSDKMTFSVKKIRSGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSTLIFAFGVD*
Ga0120150_105242213300013294PermafrostMPFGSKKMQSSDKFTFSVKKIREGYWELVIDKPLPKGEYAFSMMSIGMASMDGGTTLFAFGVD*
Ga0163162_1059077213300013306Switchgrass RhizosphereAKGVRKVLLQKAPGMNPFGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD*
Ga0157379_1033234813300014968Switchgrass RhizosphereGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD*
Ga0132258_1144913413300015371Arabidopsis RhizosphereKADATKDKRKVYLMKSPGMVNPFGSKKSQSSDKYTFSTRKIKDGYWELLVDKSLPKGEYIFVMSGMSMNNMDGSMMLFAFAVD*
Ga0132255_10060944413300015374Arabidopsis RhizosphereMNPFGSHKLQSSEKYTFSARKIKAGYWELVVDKPLPKGEYAFTMMGMGTGMDGSTVIFAFGVD*
Ga0132255_10095828033300015374Arabidopsis RhizosphereKSPGMVNPFGSKKSQSSDKYTFSTRKIKDGYWELLVDKSLPKGEYIFVMSGMSMNNMDGSMMLFAFAVD*
Ga0136617_1073689013300017789Polar Desert SandKYTFSVRKIREGYWELVIDKPLPKGEYAFTMTGGMNMSMDGSVTLYAFGVD
Ga0163161_1115751423300017792Switchgrass RhizosphereSSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD
Ga0190272_1242042913300018429SoilSDKFTFSVKKIREGYWELVVDKTLPKGEYAFAVMGGYSGGGGMDALLYAFAIE
Ga0190272_1285071123300018429SoilGAFSMGKNKSSDKFTFSVKKIREGYWELFVDKPLPKGEYAFAVMAGYSGSGGMDALLYAFAID
Ga0190269_1088202123300018465SoilSDKYPFSIKKVREGYWELVPDKKLPKGEYAFVVMGYNSDGSYTLFAFGIN
Ga0190274_1346787813300018476SoilGSKKMKSSDKCSFSVKKIREGYWELIIDKPLTKGEYAFTMMGQPGAGSMDMSVTVFAFGI
Ga0190271_1304343913300018481SoilRKIYLMKTGGAFSMGKNKSSDKFTFSLKKIRGGYWELLIDKSLPKGEYAFAVQGMNMNNMDGSIMIFAFAVD
Ga0210392_1079711413300021475SoilVVMQKSPGASPFGSKKTQSSDKFTFSVKKVREGYWELVIDKPLSKGEYVFTMMNMGMGNMDGSQLLFAFAID
Ga0247786_101416633300022883SoilMGKNKLQSSDKMTFSVKKIRSGYWELIVDKPLPKGEYAFTMMGMGTSGMDGSSLIFAFGV
Ga0247795_107420623300022899SoilAGKGTRKIYTMKTGGAFSMGKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD
Ga0247788_105015613300022901SoilKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD
Ga0247797_107383913300023057SoilKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD
Ga0247801_105725613300023064SoilRKIYTMKTGGAFSMGKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD
Ga0247796_102876533300023261SoilYKTDAGKGTRKIYTMKTGGAFSMGKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD
Ga0207642_1108142823300025899Miscanthus RhizosphereKNIRKIYLMKVGGYFSMGKNKLQSSDKMTFSVKKIRNGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSTLIFAFGVD
Ga0207660_1096922123300025917Corn RhizosphereGYFSMGKNKLQSSDKMTFSVKKIRSGYWELIVDKPLPKGEYAFTMMGMGTSGMDGSSLIFAFGVD
Ga0207657_1045562033300025919Corn RhizosphereFSVKKIRSGYWELIVDKPLPKGEYAFTMMGMGTSGMDGSSLIFAFGVD
Ga0207681_1023779033300025923Switchgrass RhizosphereVLLQKAPGMNPFGSHKLQSSDKYTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD
Ga0207659_1094394123300025926Miscanthus RhizosphereSQSSDKYTFSVRKIREGYWELVIDKPLPRGEYAFSMMGMGAGNMDGSTTLFAFAID
Ga0207704_1098284923300025938Miscanthus RhizosphereRKIYMMKTGGAFSMGKNKSSDKFTFSVRKIRDGYWELVVDKPLPKGEYAFVTMNGYSGTGGMDALLFAFGVD
Ga0207704_1123732113300025938Miscanthus RhizosphereNPFGSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTLMGMGTGMDGSTVIFAFGVD
Ga0207704_1138423113300025938Miscanthus RhizosphereSGKGVRKILLQKAPGMNPFGSHKLQSSDKYTFSTRKIRDGYWELIADKPLPKGEYAFTMMGTGMDVMSGTTLFAFGVD
Ga0207661_1007279613300025944Corn RhizosphereSHKLQSSEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD
Ga0207712_1045013633300025961Switchgrass RhizosphereEGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD
Ga0207640_1162465123300025981Corn RhizosphereKKMQSSDKYTFSVKKVREGYWELVVDKPLPKGEYAFTMMNMGMGSMDGSTLLFAFAID
Ga0207639_1075725213300026041Corn RhizosphereTFSAKKIKDGYWELVVDKPLPKGEYAFTMMGMGMDAMGGGTTLFAFGVD
Ga0207702_1230039613300026078Corn RhizosphereEKSIRKIYLMKVGGYFSMGKNKLQSSDKMTFSVKKIRSGYWELIVDKPLPKGEYAFTMMGMGTSGMDGSSLIFAFGVD
Ga0207641_1008326753300026088Switchgrass RhizosphereEKYTFSARKIKEGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD
Ga0207676_1219017313300026095Switchgrass RhizosphereSTRKIRDGYWELIADKPLPKGEYAFTMMGTGMDVMSGTTLFAFGVD
Ga0209131_104342853300026320Grasslands SoilGMFSANKNQSSDKYTFSVKKVREGYWELLVDKTLSRGEYAFTMMNMGMGNMDGSMLLFAFAVD
Ga0207856_101968533300026983Tropical Forest SoilFGSHKNQSSDKYTFSVRKIREGYWELVVDKSLPKGEYIFTVVNISIGNMDGAMLLFAVD
Ga0209818_128537623300027637Agricultural SoilPGAMPFGSKKMKSSDKYTFSVKKIKDGYWELVVDKSLPKGEYAFSVMSMGMGNMDGSTLMFAFGID
Ga0209387_122961023300027639Agricultural SoilSRKILLMKSPGAMPFGSKKMKSSDKYTFSVKKIKDGYWELVVDKSLPKGEYAFSVMSMGMGNMDGSTLMFAFGID
Ga0268266_1223868623300028379Switchgrass RhizosphereVKVGGYFSMGKNKLQSSDKMTFSVKKIRSGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSTLIFAFGVD
Ga0310813_1066305333300031716SoilTFSARKIKGGYWELVVDKPLPKGEYAFTMMGIGTGMDGSTVIFAFGVD
Ga0310813_1152481023300031716SoilKVLLQKAPGMNPFGSHKLQSSEKFTFSARKIKEGYWELIVDKPLPKGEYAFTMMGMGMSGMDGSTVIFAFGVD
Ga0310897_1048691313300032003SoilITLYKTDAGKGTRKIYTMKTGGAFSMGKNKSSDKFTFSVKKIREGYWELVVDKPLPKGEYAFVTMNGYSGVGGMDALLFAFGVD
Ga0307470_1170668413300032174Hardwood Forest SoilGGYFSMGKNKMQSSDKMTFSVKKIRSGYWELVIDKPLAKGEYAFTMMGMGTSGMDGSSLIFAFGVD
Ga0335084_1159234513300033004SoilFSLKKIRDGYWELVVDKPLPSGEYAFSMGGMGGMDMTGSLTVFAFGID
Ga0310810_1087639423300033412SoilRKVLLQKAPGMNPFGSHKLQSSDKYTFSARKIRDGYWELVVDKPLPKGEYAFTAMGMGIDVMGGTTLFAFGVD
Ga0370498_163328_2_1813300034155Untreated Peat SoilSKKNKSADKYTFSVKKIREGYWELVIDKTLPGGEYIFTLVDIMSMSGGGESLLFAFAVD
Ga0370509_0146846_668_8203300034159Untreated Peat SoilDKFTFSVKKIREGYWELVIDKFLPKGEYAFTMMNMMSMNGETLLFAFAVD
Ga0373906_185961_330_5033300034815Sediment SlurryKTRESKKYTFSVRKIRPGYLELVIDKPLPKGEYIFVVMGGFEINLDGSASLFAFEIV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.