NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101788


Overview

Basic Information
Family ID: F101788
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 102
Average Sequence Length: 71 residues
Representative Sequence: MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQY
Number of Associated Samples: 91
Number of Associated Scaffolds: 102

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 100.00 %
% of genes near scaffold ends (potentially truncated): 2.94 %
% of genes from short scaffolds (< 2000 bps): 3.92 %
Associated GOLD sequencing projects: 89
AlphaFold2 3D model prediction: No
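
The quality percentages can be read back as gene counts. The sketch below assumes each percentage is taken over the 102 family members listed under Basic Information; the page does not state the denominator, so treat that as an assumption. The function name `percent_to_count` and the dictionary labels are illustrative only.

```python
# Assumption: percentages are computed over the 102 family sequences.
FAMILY_SIZE = 102  # "Number of Sequences" from Basic Information


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back to the nearest whole gene count."""
    return round(percent * total / 100)


# Percentages copied from the Quality Assessment block.
quality_percent = {
    "valid RBS motifs": 100.00,
    "near scaffold ends": 2.94,
    "short scaffolds": 3.92,
}

counts = {label: percent_to_count(p) for label, p in quality_percent.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} genes")
```

Under that assumption, 2.94 % corresponds to 3 genes and 3.92 % to 4 genes.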

Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group: Unclassified (96.078 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (26.471 % of family members)
Environment Ontology (ENVO): Unclassified (33.333 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (46.078 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Topology & Secondary Structure

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 62.12%, β-sheet: 4.55%, Coil/Unstructured: 33.33%
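
The secondary-structure percentages above are consistent with whole residue counts over the 66-residue representative sequence. The sketch below checks that arithmetic. It assumes the prediction was made on the representative sequence, which the page does not state explicitly; the variable names are illustrative.

```python
# Representative sequence copied from Basic Information.
REPRESENTATIVE = (
    "MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQY"
)

# Percentages copied from the Structure & Topology section.
reported_percent = {
    "alpha-helix": 62.12,
    "beta-sheet": 4.55,
    "coil/unstructured": 33.33,
}

length = len(REPRESENTATIVE)

# Assumption: the percentages refer to the representative sequence,
# so each one should map to a whole number of residues.
residues = {
    state: round(percent * length / 100)
    for state, percent in reported_percent.items()
}

for state, n in residues.items():
    print(f"{state}: {n} of {length} residues")
```

Under that assumption the split is 41 helix, 3 sheet, and 22 coil residues, which sums to the full sequence length.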



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF02482 | Ribosomal_S30AE | 6.86
PF00118 | Cpn60_TCP1 | 2.94
PF00881 | Nitroreductase | 2.94
PF01145 | Band_7 | 1.96
PF02696 | SelO | 0.98
PF00313 | CSD | 0.98
PF01527 | HTH_Tnp_1 | 0.98
PF05362 | Lon_C | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG1544 | Ribosome-associated translation inhibitor RaiA | Translation, ribosomal structure and biogenesis [J] | 6.86
COG0459 | Chaperonin GroEL (HSP60 family) | Posttranslational modification, protein turnover, chaperones [O] | 2.94
COG0397 | Protein adenylyltransferase (AMPylase) SelO/YdiU (selenoprotein O) | Posttranslational modification, protein turnover, chaperones [O] | 0.98
COG0466 | ATP-dependent Lon protease, bacterial type | Posttranslational modification, protein turnover, chaperones [O] | 0.98
COG1067 | Predicted ATP-dependent protease | Posttranslational modification, protein turnover, chaperones [O] | 0.98
COG1750 | Predicted archaeal serine protease, S18 family | General function prediction only [R] | 0.98
COG3480 | Predicted secreted protein YlbL, contains PDZ domain | Signal transduction mechanisms [T] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 96.08 %
All Organisms | root | All Organisms | 3.92 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300006893|Ga0073928_10603666 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 775
3300012201|Ga0137365_10148339 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1762
3300012360|Ga0137375_10838952 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 735
3300028906|Ga0308309_10426551 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1140
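
Each scaffold entry above pairs a numeric prefix with a scaffold name, joined by `|`. The prefix matches a Taxon OID in the Associated Samples table (for example, 3300006893 is the iron-sulfur acid spring sample). A minimal sketch for splitting such an identifier follows. The names `ScaffoldRef` and `parse_scaffold_ref` are hypothetical helpers, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str
    scaffold_id: str


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD_ID' into its two components."""
    taxon_oid, sep, scaffold_id = text.strip().partition("|")
    # Reject input without a separator, a non-numeric prefix, or an empty name.
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a 'taxon_oid|scaffold_id' identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold_ref("3300006893|Ga0073928_10603666")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later used as a lookup key.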




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.47%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 13.73%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.76%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.82%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.88%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.90%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.92%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.94%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.96%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.96%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.96%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.96%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 1.96%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.98%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 0.98%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.98%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.98%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 0.98%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.98%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300002910 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm | Environmental
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated
3300006172 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300010038 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011409 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT423_2 | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012899 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S058-202B-2 | Environmental
3300012908 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S089-202R-1 | Environmental
3300012911 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S088-202R-2 | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012955 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MG | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental
3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019890 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1 | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300026330 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes) | Environmental
3300026340 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-A | Environmental
3300026341 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-A | Environmental
3300026360 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-B | Environmental
3300026446 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-B | Environmental
3300026490 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-A | Environmental
3300026508 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-A | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026880 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M1 (SPAdes) | Environmental
3300027603 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes) | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300030967 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA11 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300030990 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome) | Environmental
3300031023 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil TCEFA (Eukaryote Community Metatranscriptome) | Environmental
3300031846 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19 | Environmental
3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental
3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental
3300033290 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15 | Environmental
3300034680 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_116 (Metagenome Metatranscriptome) | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
F24TB_110732031 | 3300000550 | Soil | MKICWALFSVAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLM
JGI25615J43890_10012204 | 3300002910 | Grasslands Soil | MKIRWALFSAAILIGFLPSQVTSQEAVLGQGNISCGSWIENRRDDNPLAATRTAWVL
Ga0062386_1016924771 | 3300004152 | Bog Forest Soil | MKTRWALFSAAIFFGFLSSQVTAAEAVLGQGNISCVSWTESRGDDNPLAATRTAWVLGFVTAF
Ga0066673_105503272 | 3300005175 | Soil | MKIRWALFSAAMLFGFLPSQATSEEAILGQGNISCSSWIENRRDDNPLAATRTAWVLGFITAF
Ga0066690_103437201 | 3300005177 | Soil | MNTRWALFSAAIFFGSLSIQVTAAEGVLGQGNVSCGSWIESRGDNNPLAVTRTAWVLGFVTGFNQYKSKPEGDVSDGKDTEVLMSRI
Ga0066688_102718723 | 3300005178 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFLTAFNQYGAKPQRDVSGGKDTEVLMARIDD
Ga0068868_1013065211 | 3300005338 | Miscanthus Rhizosphere | MKNRWALFSAAIFCGFLSCQVIAAEAVLGQGNISCGSWIESRADDNALAAARTAWVLGFITAFNHYRSKP
Ga0070714_1021541091 | 3300005435 | Agricultural Soil | MKTRWVLFSVTIFLGLLSSQVTAAEAVLGQGNISCASWIESRGDDNALAATRTAWVLGF
Ga0070710_104339992 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTRWVLFSVTIFLGLLSSQVTAAEAVLGQGNISCASWIESRGDDNTLAATRTAWVLGFVTAFNQYGSKPEGDVSGGKATEV
Ga0070705_1013952541 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTAFNQYRSKPERD
Ga0070708_1004979021 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTAFNQYRSKPEGDVSGGKDTEV
Ga0066692_103810422 | 3300005555 | Soil | MKKRSVRFSTSVFLGFLSSQVIAAEAVLGQGNISCNSWIEGRSDNNPLAATRTAWVLGFITAFNQYGSKPERDVSGGKETEALMARI
Ga0066704_105905062 | 3300005557 | Soil | MKKRSVRFSTSVFLGFLSSQVIAAEAVLGQGNISCNSWIEGRSDNNPLAATRTAWVLGFITAFNQY
Ga0070762_112709922 | 3300005602 | Soil | MKTRCWALFSAVILFGFLPSQVTSEEAILGQGNISCTSWIESRRDDNPLAATRTAWVLGFITAFNQYGPKPQRDVSG
Ga0070717_102728694 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTAFNQYRSKPEGDVSGGKDTEVLML
Ga0075417_102106431 | 3300006049 | Populus Rhizosphere | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNVSCESWIKSRGDGNPVVAARTAWVLGFVTAFNQYRSKP
Ga0075432_101285492 | 3300006058 | Populus Rhizosphere | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMDRLR*
Ga0075018_106362611 | 3300006172 | Watersheds | MKTRWALFSVTIFLGLLSSQVTAAEAVLGQGNISCASWVESRGDDNTLAATRTAWVLGFVTAFNQY
Ga0070712_1007333471 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNISCDSWITSRGDGNPVAATRTAWVLGFVTAFNQYRSKPE
Ga0066659_114371832 | 3300006797 | Soil | MKTRWALLSAAIFFGFLSNQMIAAEDVLGQGNVSCDSWIKSRGDGNPVVAARTAWVLGFVTAFNQYRPKPA
Ga0075425_1008940731 | 3300006854 | Populus Rhizosphere | MKNRWALFSAAIFCGFLSCQVIAAEAVLGQGNISCGSWIESRGHDNALAAARTAWVLGFI
Ga0073928_106036661 | 3300006893 | Iron-Sulfur Acid Spring | MFTAQQYRANPLAVGSWKTRWAFFSAAIFFGFVPSRAATAAEAVLGQGNISCDSWLESRQADDPLAASRTAWVLGLLTAFNQYGSKPEGGVSGGKDTE
Ga0075424_1016087611 | 3300006904 | Populus Rhizosphere | MKNRWALFSAAIFCGFLSCQVIAAEAVLGQGNISCGSWIESRGDDNALAAARTAWVL
Ga0099830_106894372 | 3300009088 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSQEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFI
Ga0099827_115801241 | 3300009090 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPVAATRTAWVLRVYNRLQSIRC*
Ga0075423_116191441 | 3300009162 | Populus Rhizosphere | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDENPLAATRTAWVL
Ga0105241_125273111 | 3300009174 | Corn Rhizosphere | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQY
Ga0105248_124694321 | 3300009177 | Switchgrass Rhizosphere | MKNRWALFSAAIFCEFLSCQVIAAEAVLGQGNISCGSWIESRADDNALAAARTAWVLGFITAFNHYRSKPEAD
Ga0126315_103496002 | 3300010038 | Serpentine Soil | MKIHWALFSAAIVIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAAQASA*
Ga0134125_102837281 | 3300010371 | Terrestrial Soil | MKTRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIDNRRDDTPLAATRTAWV
Ga0134128_113315031 | 3300010373 | Terrestrial Soil | MKNRWALFSAAIFCGFLSCQVIAAEAVLGQGNISCGSWIESRGDDNALAAARTAWVLGFITAFNHYRSKPEADVSAGKATEVLMA*
Ga0134128_123841572 | 3300010373 | Terrestrial Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLM
Ga0137391_112680991 | 3300011270 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLTATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEV
Ga0137323_11306821 | 3300011409 | Soil | MKTRWALFSAAIFFGFLSSQVTAAEAVLGQGNISCSSWIESRGDDNPLAATRTAWVLGFVTAFNHYRSKAEGDVSGGKDTEVLMARI
Ga0137382_110351841 | 3300012200 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMARID
Ga0137365_100369204 | 3300012201 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQATSEEAVLGQGNISCGSWIENRRDDNPLAAQASA*
Ga0137365_101483392 | 3300012201 | Vadose Zone Soil | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNVSCDSWIKSRGDGNPVVAARTAWVLGFVTAFNQYRSKPASTALVDELQQ*
Ga0137365_110965672 | 3300012201 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEV
Ga0137374_104791251 | 3300012204 | Vadose Zone Soil | MKIRWALFSAAILFGFLPSQVTSEEAVLGQGNISCSSWIENRRDDNPLAATRTAWVLGFI
Ga0137376_115232851 | 3300012208 | Vadose Zone Soil | MKIRWALFSAAILIEFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQ
Ga0137379_116369992 | 3300012209 | Vadose Zone Soil | MKKRSVRFSTSVFLGFLSSQVIAAEAVLGQGNISCNSWIEGRSDNNPLAATRTAWVLGFITAFNQYGS
Ga0137370_107110431 | 3300012285 | Vadose Zone Soil | SGGDFSLSRECIMKIRWALFSAAILFGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAAQASA*
Ga0137372_109834401 | 3300012350 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQATSEEAVLGQGNISCGSWNENRRDDNPLAATRTAWVL
Ga0137366_111093371 | 3300012354 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITA
Ga0137375_108389523 | 3300012360 | Vadose Zone Soil | MKTRWALFSAAILIGFLASQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMARIDD
Ga0137360_115587501 | 3300012361 | Vadose Zone Soil | MKIHWALFSVAILIGFLPSQVASEEAVLGQGNISCGSWIENRRDDNPLAATRTAW
Ga0137360_118064211 | 3300012361 | Vadose Zone Soil | MKIRWALFSAAILMGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLGATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMAR
Ga0137361_109456261 | 3300012362 | Vadose Zone Soil | MKIRWALFSAAILFGFLPSQVTSEEAVLEQGNISCSSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMA
Ga0137397_103628131 | 3300012685 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGSYARKLVTA*
Ga0157299_103457811 | 3300012899 | Soil | MKNRWALFSAAIFCGFLSCQVIAAEAVLGQGNISCGSWIESRGDDNALAAARTAWVLGFITAFNHYRSKPE
Ga0157286_102312872 | 3300012908 | Soil | MKNRWALFSAAIFCGFLSCQVIAAEAVLGQGNISCGSWIESRVDDNPLAATRTAWVLGFI
Ga0157301_102617421 | 3300012911 | Soil | LFYQKEQNMKNRWALFSAAIFCGFLSCQVIAAEAVLGQGNISCGSWIESRGDDNPLAATRTAWVLGF
Ga0137395_101272911 | 3300012917 | Vadose Zone Soil | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTAFNQYRSKPE
Ga0137395_104975951 | 3300012917 | Vadose Zone Soil | MRNRWSLFFAAILFGFLPSQVIAAEAVLGQGNISCLSWIESRGDDNPLAATRTAWVLGFVTAFNQYVSKSKGDVSRGKDTEALM
Ga0137395_110325462 | 3300012917 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSQEAVLGQGNISCGSWIENRRDDNPLGATRTAWVLGFITAFNQYGAKPQRDVAGG
Ga0164298_113563922 | 3300012955 | Soil | MKTRWALLSAAIFFGLLSNQMIAAEGVLGQGNISCDSWIKSRGDGNPVAGTRTAWVLGFVTAFNQYRSKPEGTCLEDTEVLMLRIDDH
Ga0164301_108650231 | 3300012960 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIEHRRDDNPLAATRTA
Ga0164302_104752051 | 3300012961 | Soil | MKNRWALFSATILFGFLPSQVTSEEATLGQGNISCTSWIESRRDDNPLAATRTA
Ga0164302_114491232 | 3300012961 | Soil | MKIRWALFSAAILIGFLPSQVASEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGVK
Ga0164302_117509512 | 3300012961 | Soil | MKIRWALFSAAILIGFLPSQVASEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQY
Ga0164307_118015381 | 3300012987 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYG
Ga0163162_113795921 | 3300013306 | Switchgrass Rhizosphere | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSCIENRRDDNPLAAT
Ga0132256_1013007321 | 3300015372 | Arabidopsis Rhizosphere | MKNRWALFSAAIFCEFLSCQVIAAEAVLGQGNISCGSWIESRADDNALAAARTAWVLGFITAFNHYRSKP
Ga0132256_1024103041 | 3300015372 | Arabidopsis Rhizosphere | MKNRWAPFSAAFFCGFLSCQVIAAEAVLGQGNISCGSWIESRGDDNALAAARTAWVLGFITAFNHYRSKP
Ga0132257_1007429022 | 3300015373 | Arabidopsis Rhizosphere | MKNRWALFSAAIFCEFLSCQVIAAEAVLGQGNISCGSWIESRADDNALAAARTAWVLGL*
Ga0132255_1012961191 | 3300015374 | Arabidopsis Rhizosphere | MKTRWALFSAAIFFGFLPSRVTAAESVLGQGNISCVSWIESRGDDNPLAATRTAWVLGFVTAFNQYGSK
Ga0182036_114164121 | 3300016270 | Soil | MKIRWALFSAAIVIGFLPSQVTSEEAVLGQGNISCSSWIENRQDDKPLAGTRIAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMARI
Ga0066662_129320882 | 3300018468 | Grasslands Soil | MKTRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMA
Ga0193728_12187662 | 3300019890 | Soil | MKTRWALFSAAIFFGFLSSQVTAAEAVLGQGNISCVSWIESRGNDNPLAATRTAWVLGFVTAFNQYGSKPEGDVSGGKDTEVLMG
Ga0210408_104468141 | 3300021178 | Soil | MKTRWALFSVTIFLGLLSSQVTAAEAVLGQGNISCASWVESRGDDNTLAATRTAWVLGFVTAFNQYGSKHEGDV
Ga0207684_111035401 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTRWALLSAAIFFGFLSNQMIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTAFNQYRS
Ga0209473_13221302 | 3300026330 | Soil | MNTRWALFSAAIFFGSLSIQVTAAEGVLGQGNVSCGSWIESRGDNNPLAVTRTAWVLGFVTGFNQ
Ga0257162_10100471 | 3300026340 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGTKPQR
Ga0257151_10404491 | 3300026341 | Soil | MKTPWVLFSVTIFLGLLSSQVTAAEAVLGQGNISCASWIESRGDDNTLAATRTAWVL
Ga0257173_10345812 | 3300026360 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRT
Ga0257178_10155573 | 3300026446 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVL
Ga0257153_10215991 | 3300026490 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQ
Ga0257161_11298861 | 3300026508 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAF
Ga0257168_11173171 | 3300026514 | Soil | MKIRWALFSAAILIGFLPGQVASEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGGKPQRDVS
Ga0209806_11822191 | 3300026529 | Soil | MKNRWALFSATILFGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRT
Ga0209157_12810891 | 3300026537 | Soil | YTDVAGGDFVLSKEYNMKTRWALFSAAIFFGFLSSQVTAAEGVLGQGNISCDSWVESRGNDNPLAVTRTA
Ga0209623_10130491 | 3300026880 | Forest Soil | MKTPWVLFSVTIFLGLLSSQVTAAEAVLGQGNISCASWIESRGDDNTLAATRTA
Ga0209331_10503133 | 3300027603 | Forest Soil | MKTRCWALFFAVILFGFLPSQVTSEEAILGQGNISCTSWIESRRDDNPLAATRTAWVLGFITAFNQYGPKPQRDVSGGKDTDVLMARVLD
Ga0209217_11275273 | 3300027651 | Forest Soil | MKTRCWALFFAVILFGFLPSQVTSEEAILGQGNISCTSWIESRRDDNPLAATR
Ga0209689_10510131 | 3300027748 | Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGG
Ga0209701_104764391 | 3300027862 | Vadose Zone Soil | MKIRWALFSAAILIGFLPGQVASEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMARI
Ga0209701_105420252 | 3300027862 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSQEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMARI
Ga0209283_109401882 | 3300027875 | Vadose Zone Soil | MKTRWALLSAAIFFGFLSNQVIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTA
Ga0209590_109484392 | 3300027882 | Vadose Zone Soil | MKIHWALFSVAILIGFLPSQVASEEAVLGQGNISCGSWIENRRDNNPLAATRTAWVLGFITAFNQYGAKPQR
Ga0209488_101202131 | 3300027903 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWV
Ga0209488_101349342 | 3300027903 | Vadose Zone Soil | MKIRWALFSAAILIGFLPSQVTSQEAVLGQGNISCGSWIENRRDDNPLAATRTAWV
Ga0209526_102941201 | 3300028047 | Forest Soil | MKTRCWALFSAVILFGFLPSQVTSEEAILGQGNISCTSWIESRRDDNPLAATR
Ga0308309_104265511 | 3300028906 | Soil | MKTRWALFSVTIFLGLLSSQVTAAEAVLGQGNISCASWVESRGDDNTLAATRTAWVLGFVTAFNQYRSKPEGDVSGGKDTEVLMLRIDD
Ga0222749_103968651 | 3300029636 | Soil | MKTRWALLSAAIFFGFLSNQLIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTAFNQYGSKPEGDVSGGKATEVLMARID
Ga0075399_104058651 | 3300030967 | Soil | MKIYWALFSAAILIGFLPSQVTSEEAVLGQGNISCDSWIENRRDDNPLAATRTAWVLGFITAFNQYGAKPQRDVSGGKDTEVLMAR
Ga0308178_11236302 | 3300030990 | Soil | MKTRWALFSAAIFFGFLSSQVTAAEAVLGQGNISCVSWIESRGDDNPLAATRTAWVLGFV
Ga0073998_116270721 | 3300031023 | Soil | MKTRWALFSAAIFFGFLLSQVTAAEAVLGQGNISCVSWIESRGDDNPLAATRTAWVLGFVTAFNQYGSKPEGDVSGGKDTEVLMG
Ga0318512_107032661 | 3300031846 | Soil | MKIRWALFSAAIVIGFLPSQVTSEEAVLGQGNISCSSWIENRQDDKPLAGTRIAWVLGFITAFNQYGAKPQRDVSGGKDTEV
Ga0310916_115538421 | 3300031942 | Soil | MKIRWALFSAAIVIGFLPSQVTSEEAVLGQGNISCSSWIENRQDDKPLAGTRIAWVLGFITAFNQYGAKPQRDVSGGK
Ga0306924_105876261 | 3300032076 | Soil | MKTRWALLSAATFFGLLSNQLIAAEGVLGQGNISCDSWIKSRGDGNPVAATRTAWVLGFVTAFNQYRSKPEGDVSGGKD
Ga0318519_103034491 | 3300033290 | Soil | MKIRWALFSAAIVIGFLPSQVTSEEAVLGQGNISCSSWIENRQDDKPLAGTRIAWVLGFITAFNQYGAKPQRDVSGGKDTEVLM
Ga0370541_001571_1518_1697 | 3300034680 | Soil | MKFRWALFSAAILIGFLPSQVTSEEAVLGQGNISCGSWIENRRDDNPLAATRTAWVLGFI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice