NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F074502


Overview

Basic Information
Family ID F074502
Family Type Metagenome
Number of Sequences 119
Average Sequence Length 98 residues
Representative Sequence MATKADFSEDEWKTMQKGVTGAGMLVSVSDRDFTDSFGEASALAKYLGRQREGDASELMRELATAKGTGFGFTDSPQEVETETLAALHSSVATL
Number of Associated Samples 107
Number of Associated Scaffolds 119
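
As a quick consistency check on the figures above, the length of the representative sequence can be compared with the reported family average. The Python sketch below is illustrative only: the variable and function names are assumptions, and the values are copied from the Basic Information block.

```python
# Values copied from the Basic Information block above.
REPRESENTATIVE_SEQUENCE = (
    "MATKADFSEDEWKTMQKGVTGAGMLVSVSDRDFTDSFGEASALAKYLGRQ"
    "REGDASELMRELATAKGTGFGFTDSPQEVETETLAALHSSVATL"
)
AVERAGE_LENGTH = 98  # residues, as reported for family F074502


def length_difference(sequence: str, average: float) -> float:
    """Return the sequence length minus the reported family average."""
    return len(sequence) - average


rep_length = len(REPRESENTATIVE_SEQUENCE)
diff = length_difference(REPRESENTATIVE_SEQUENCE, AVERAGE_LENGTH)
print(f"Representative length: {rep_length} residues")
print(f"Difference from family average: {diff:+} residues")
```

A representative that is slightly shorter or longer than the average is expected, since family members vary in length.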

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.84 %
Associated GOLD sequencing projects 105
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.43

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Unclassified (99.160 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(11.765 % of family members)
Environment Ontology (ENVO) Unclassified
(22.689 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(40.336 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 57.38%, β-sheet: 0.00%, Coil/Unstructured: 42.62%

Predicted 3D Structure

pTM-score: 0.43

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
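
The low-confidence rule quoted above (pTM < 0.7) can be expressed as a one-line check. This is a minimal sketch: the function and constant names are hypothetical, and only the 0.7 cutoff and the 0.43 score come from this page.

```python
LOW_CONFIDENCE_PTM_CUTOFF = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


family_ptm = 0.43  # pTM-score reported for family F074502
print(is_low_confidence(family_ptm))
```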



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 119 Family Scaffolds
PF01654 | Cyt_bd_oxida_I | 15.97
PF01863 | YgjP-like | 7.56
PF01740 | STAS | 1.68
PF01263 | Aldose_epim | 0.84
PF13419 | HAD_2 | 0.84
PF01370 | Epimerase | 0.84
PF01182 | Glucosamine_iso | 0.84
PF05185 | PRMT5 | 0.84
PF03733 | YccF | 0.84
PF14026 | DUF4242 | 0.84
PF12697 | Abhydrolase_6 | 0.84
PF01988 | VIT1 | 0.84
PF07883 | Cupin_2 | 0.84
PF05872 | HerA_C | 0.84
PF03706 | LPG_synthase_TM | 0.84
PF00355 | Rieske | 0.84
PF01471 | PG_binding_1 | 0.84
PF00230 | MIP | 0.84
PF04229 | GrpB | 0.84
PF07650 | KH_2 | 0.84
PF04024 | PspC | 0.84
PF01019 | G_glu_transpept | 0.84
PF13412 | HTH_24 | 0.84
PF00196 | GerE | 0.84
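
The % frequency values above are consistent with whole-number scaffold counts out of 119; for example, 0.84 % corresponds to a single scaffold. The following Python sketch converts percentages back to counts. The function name is hypothetical, and rounding to the nearest integer is an assumption about how the page derives its percentages.

```python
FAMILY_SCAFFOLDS = 119  # total scaffolds in family F074502


def scaffold_count(percent_frequency: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a % frequency back to the nearest whole number of scaffolds."""
    return round(percent_frequency / 100 * total)


# A few rows copied from the table above: Pfam name -> % frequency.
examples = {"Cyt_bd_oxida_I": 15.97, "YgjP-like": 7.56, "STAS": 1.68, "Rieske": 0.84}
counts = {name: scaffold_count(pct) for name, pct in examples.items()}
print(counts)
```

Read this way, the most frequent neighbor, Cyt_bd_oxida_I, appears on 19 of the 119 scaffolds.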

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 119 Family Scaffolds
COG1271 | Cytochrome bd-type quinol oxidase, subunit 1 | Energy production and conversion [C] | 15.97
COG1451 | UTP pyrophosphatase, metal-dependent hydrolase family | General function prediction only [R] | 7.56
COG0363 | 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase | Carbohydrate transport and metabolism [G] | 0.84
COG0392 | Predicted membrane flippase AglD2/YbhN, UPF0104 family | Cell wall/membrane/envelope biogenesis [M] | 0.84
COG0405 | Gamma-glutamyltranspeptidase | Amino acid transport and metabolism [E] | 0.84
COG0433 | Archaeal DNA helicase HerA or a related bacterial ATPase, contains HAS-barrel and ATPase domains | Replication, recombination and repair [L] | 0.84
COG0580 | Glycerol uptake facilitator or related aquaporin (Major Intrinsic protein Family) | Carbohydrate transport and metabolism [G] | 0.84
COG0676 | D-hexose-6-phosphate mutarotase | Carbohydrate transport and metabolism [G] | 0.84
COG1633 | Rubrerythrin, includes spore coat protein YhjR | Inorganic ion transport and metabolism [P] | 0.84
COG1814 | Predicted Fe2+/Mn2+ transporter, VIT1/CCC1 family | Inorganic ion transport and metabolism [P] | 0.84
COG2017 | Galactose mutarotase or related enzyme | Carbohydrate transport and metabolism [G] | 0.84
COG2320 | GrpB domain, predicted nucleotidyltransferase, UPF0157 family | General function prediction only [R] | 0.84
COG3304 | Uncharacterized membrane protein YccF, DUF307 family | Function unknown [S] | 0.84
COG4076 | Predicted RNA methylase | General function prediction only [R] | 0.84



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.16 %
All Organisms | root | All Organisms | 0.84 %


Associated Scaffolds


Scaffold: 3300009822|Ga0105066_1042995
Taxonomy: All Organisms → cellular organisms → Bacteria
Length: 939
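
The scaffold identifier above appears to join two parts with a `|`: a numeric prefix that matches a Taxon OID in the Associated Samples table (3300009822) and the scaffold name. That reading is an assumption drawn from this page, not a documented format, and the names in the sketch below are hypothetical.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric sample identifier (assumed to be the IMG/M Taxon OID)
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split a 'taxon_oid|scaffold_id' string into its two parts."""
    taxon_oid, sep, scaffold_id = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold reference: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold_ref("3300009822|Ga0105066_1042995")
print(ref.taxon_oid, ref.scaffold_id)
```

Rejecting malformed input rather than guessing keeps a mis-split identifier from silently pointing at the wrong sample.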

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.76%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 10.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 6.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 6.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.04%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.20%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 4.20%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 4.20%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 3.36%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.36%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.52%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 2.52%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.52%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.52%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 2.52%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 2.52%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.68%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.68%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.68%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.84%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 0.84%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 0.84%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.84%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.84%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.84%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.84%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.84%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.84%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.84%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.84%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.84%
Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 0.84%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.84%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.84%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.84%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.84%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 0.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.84%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.84%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000363 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300004079 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - WestPond_TuleC_D2 | Environmental
3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005339 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG | Host-Associated
3300005344 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaG | Host-Associated
3300005367 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG | Host-Associated
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005842 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 | Host-Associated
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006051 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-4 | Host-Associated
3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated
3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009678 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 | Environmental
3300009822 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 | Environmental
3300009840 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105A | Environmental
3300010037 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot25 | Environmental
3300010039 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot56 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010045 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental
3300010375 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG | Host-Associated
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012884 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S073-202C-2 | Environmental
3300012892 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 | Environmental
3300012905 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S013-104B-2 | Environmental
3300012939 | Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t1i015 | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014318 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D1_rd | Environmental
3300014325 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG | Host-Associated
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental
3300016404 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300017947 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_4_20_MG | Environmental
3300018466 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T | Environmental
3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental
3300018476 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 T | Environmental
3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019356 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S073-202C-2 (version 2) | Environmental
3300019884 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2s2 | Environmental
3300022563 | OV2_combined assembly | Environmental
3300025905 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes) | Environmental
3300025929 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes) | Environmental
3300025933 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes) | Host-Associated
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated
3300026827 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G09A4-11 (SPAdes) | Environmental
3300027273 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 (SPAdes) | Environmental
3300027560 | Soil and rhizosphere microbial communities from Laval, Canada - mgLPC (SPAdes) | Environmental
3300027799 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MG | Environmental
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300027995 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_1_MG | Environmental
3300028592 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30 | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental
3300028855 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Fen_N1_4 | Environmental
3300028889 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2 | Environmental
3300030002 | II_Fen_N1 coassembly | Environmental
3300030010 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Fen_N3_4 | Environmental
3300030114 | I_Fen_E2 coassembly | Environmental
3300030336 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1 | Environmental
3300031170 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 12_S | Environmental
3300031184 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 13_S | Environmental
3300031455 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_S | Environmental
3300031713 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f22 | Environmental
3300031765 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22 | Environmental
3300031798 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f19 | Environmental
3300031799 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21 | Environmental
3300031821 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20 | Environmental
3300031858 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2 | Environmental
3300031879 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2) | Environmental
3300031918 | III_Fen_N3 coassembly | Environmental
3300032010 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f22 | Environmental
3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental
3300032893 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1 | Environmental
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental
3300033158 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.1 | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental
3300033550 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4 | Environmental
3300033551 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5 | Environmental
3300034151 | Sediment microbial communities from East River floodplain, Colorado, United States - 2_s17 | Environmental
3300034358 | Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_01D_16 | Environmental





Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
ICChiseqgaiiFebDRAFT_105928272 | 3300000363 | Soil | VATKSDFTEEEWKTMQRGVAGAGTLVSVSDPDFTDSFGEAGALAKYLAEQQQNSSSILIRDLAHVHGTGIGFTASHEKAETETLDALRSA
JGI1027J12803_1029432552 | 3300000955 | Soil | VATKADFTEDEWKTMQRGVTGAGTLTSVADADFTDTFGEASALAKYLADRRRSSDSALVRDLASVHGTGFGLTASRDEVEAG
JGI10216J12902_1023278781 | 3300000956 | Soil | MATKSDFSEAEWDALHKGVTGAGMLVSVADRDFTDTFGEAGALAKHLRAEHEQSESALVRDIAAVHGSGFGFTASPQKVETETLEALRSSTATLAAKAPDE
Ga0055514_101489341 | 3300004079 | Natural And Restored Wetlands | MASKADFTQDEWATLWKGVTGAGMLVSVADRDFTDSFGEASALAKAISEERISGASELLRELAAGHGTGFGLTASPQKVETETLAALG
Ga0062590_1018029042 | 3300004157 | Soil | VATKADFTEEEWKLMQKGVTGAGALVSISDPDFTDSFGEASAIGKYLAQQHEKSDSVLIRDLAHVHGTGFGLTASLEKVETETLDALRSATVTLQAKAPDGL
Ga0063356_1057350571 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MATKADFTEEEWKTMQKGVTGAGALVSISDPDFTDSFGEASAIGKYLAEQREKSDSVLIRDLTHVHGTGFGLTASREKVETE
Ga0062595_1002838552 | 3300004479 | Soil | VATKADFTEEEWKLMQKGVTGAGALVSISDPDFTDSFGEASAIGKYLAQQHEKSDSVLIRDLAHVHGTGFGLTASLEKVETETLDALRSATVTLQAKAPD
Ga0062595_1012178001 | 3300004479 | Soil | VATKADFTEDEWKTMQRGVTGAGTLTSVADADFTDTFGEASALAKYLAEARRSSDSALVRDLASVHGTGFGMTASRDEVETGTLSALRSAVATLEAKAPDEL
Ga0062595_1024684451 | 3300004479 | Soil | MTSETETRGAGVATKADFTEEEWKTMQKGVTGAGMLVSISDADFTDSFGEASALAKHLAEEQQRSGSALIRDLAHVHGTGFGLTASREKAETETLDALRAAV
Ga0062591_1008103581 | 3300004643 | Soil | MATKADFSEDEWKTMQKGITGAGVLVSVSDRDFTDTFGEASAVAKYLGRQRESGASELIRELAHAKGTGFGFTDSPQEVETETLEALRSSVATLAA
Ga0062591_1013115511 | 3300004643 | Soil | MASKSDFTEDEWETMQKGIVGAGMLVSVSDRDFTDTFGEVGALAKYLADQHEHAESPLVRELAETHRTGFGITTSAEKVEAQTLDSLRAAVAAL
Ga0062591_1021871152 | 3300004643 | Soil | MAAKADFSEEEWDTLHKGVTGAGLLVSVGDRDFTDSFGEANALAHRLLEEHEHSESELVRELAGVRGTGFGFTTSAKKAEAETLEALRSATAILAAKAPDEADAYRKLVLDVADAVANAK
Ga0066388_1075004061 | 3300005332 | Tropical Forest Soil | MAAKADFTEDEWNALEKGVTGAGMLVSVGDRDFTDAFGEASALAKALAAQRERSGSDLVRELAGVRGTGYGFTASPQEVETEMLVSLDVAMKALAAKAPEE
Ga0066388_1075354891 | 3300005332 | Tropical Forest Soil | VATKADFTEAEWKTMQKGVTGAGMLVSASDADFTDSFGEASALAKYLAHQQQDSGSALIRDLAHMHSTGFGFTTSREKAETETFESLRSAIATLQTKAPDDLAAYQE
Ga0070660_1008031691 | 3300005339 | Corn Rhizosphere | MDASGNREKGADVATKADFTEAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASSLAKYLAEQQRNSGSDLIGDLAHVHGTGFGLTASREKAETETLEALRSAAATLQEK
Ga0070661_1013144762 | 3300005344 | Corn Rhizosphere | VATKADFTEAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASSLAKYLADQQQHSGSVLIRDLAHVHGTGFGLTASREK
Ga0070667_1013012872 | 3300005367 | Switchgrass Rhizosphere | VATKADFTEAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASSLAKYLAEQQRNSGSDLIGDLAHVHGTGFGLTASREKAETETLEALRSAAATLQEKA
Ga0070709_105121711 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKADFTEDEWKTMQKGVTGAGMFVSVSDADFTDSFGEVGALAKRLGKEHEENASELMRELAHIHGSGFGLTASRQEVEAGTIEALHSAT
Ga0070696_1010259782 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | VATKADFTEAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASSLAKYLAEQQRNSGSDLIGDLAHVHGTGFGLTASREKAETETLEAL
Ga0066698_101963441 | 3300005558 | Soil | VATKADFTEDEWGTLHKGVTGAGMLVSVSDRDFTDTFGEAGALAKRLRKEHEQSPSELVRELAGTHGTGFGFTASPQKVEAETLEA
Ga0066903_1077205101 | 3300005764 | Tropical Forest Soil | MATKADFTEEEWHDLHKGVTGAGMLVSVGDPDFTDSFGEASALARRLRGEHEHNPSPLMRELASGHGTGFGFGASPAKVEAETVEA
Ga0068858_1001498761 | 3300005842 | Switchgrass Rhizosphere | MATKADFTEDEWHALQKGVTGAGMLVSVSDADFTDSFGEASALAKFLAEQRRTNERELVREIAAVHGGGFGLTASREKVETETMAALR
Ga0081455_107421783 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MATKADFSEDEWEALHKGATGSGLMVSVGDRDFTDTFGEAGALARQMREAHEQSASELVRELAGIRGSGFGFTTSPEKA
Ga0075417_103923202 | 3300006049 | Populus Rhizosphere | MATKADFTEDEWEAMKKGVTGAGMLVSIGDRDFTDTFGEVSALAKRLSEERKEGGSELMRELAAGRPSGFGLTASPDEVESKTLEALRSATAILAAKAPDEAGA
Ga0075364_100289641 | 3300006051 | Populus Endosphere | MATKADFTEDEWEAMKKGVTGAGMLVSIGDRDFTDTFGEVGALAKRLSEERKEGGSELMRELAAGRPSGFGLTASPEEVESKTLEALRSATAILAAKAPDEAGAYR
Ga0075432_101060931 | 3300006058 | Populus Rhizosphere | MATKADFTEDEWEAMKKGVTGAGMLVSIGDRDFTDTFGEVGALAKRLSEERKEGGSELMRELAAGRPSGFGLTASPDEVESKTLEALRSATAILAAKAPDEAG
Ga0073934_106175773 | 3300006865 | Hot Spring Sediment | MATRADFTDDEWKAMERGVTGAGMLVSVGDRDFTDSFGEASALAKALAAQHEHSSSELVRDLADVRGTGFGMTVSMQEVQSETIADLGVAMGAILAKAPEEADAYR
Ga0075434_1009863582 | 3300006871 | Populus Rhizosphere | MATKADFTEDEWHALQKGVTGSGMLVSVSDADFTDSFGEASALAKFLGKQRQKSDSELVREVAAVHGGGFGLTASREKVETETMAALRSAVTTLSAK
Ga0075434_1019394762 | 3300006871 | Populus Rhizosphere | MATKADFTDAEWEALHKGVTGAGMLVSVGDRDFTDTFGEAGALAKRMREEHEQSQSELIREIAGVHGSGFGLTASPQKVETETLA
Ga0075424_1006881451 | 3300006904 | Populus Rhizosphere | VATKADFTEEEWKTMQKGVTGAGMLVSISDADFTDSFGEASALAKHLAEEQQRSESALIRDLAHVHGTGFGLTASREKAETETLDALRAAVT
Ga0075436_1014356552 | 3300006914 | Populus Rhizosphere | MATKADFTDAEWEALHKGVTGAGMLVSVGDRDFNDTFGEAGALAKRMREEHEQSQSELIREIAGVHGSGFGLTASPQKVETETLAALRTATSTLAAKAPDELDGYKKLVLDVADAVANAKGG
Ga0111539_117291331 | 3300009094 | Populus Rhizosphere | MAGKADFTEEEWDALRKGVTGAGMLVSVGDRDFTDSFGEASALAHRLLEEHEQSESELVRELAGVRSTGFGFTLSAKKAE
Ga0111539_131031611 | 3300009094 | Populus Rhizosphere | MATRSDFTDDEWEAMQHGVTGAGALVSVSDRDFTDTFGEASALAKALAAYREQSDSVVIRELAKARGAGFGLTDSPQEMEAKT
Ga0066709_1025643202 | 3300009137 | Grasslands Soil | MASKADFTEEEWKAMQKGVTGAGMLVSVSDRDFTDTFGEAGALARSLAEQHQSNESELIRELAGIHGSGFGLTASPEKVESETLSALRTAMAALASKAPIPA
Ga0114129_129837441 | 3300009147 | Populus Rhizosphere | MATKADFTEDEWEAMKKGVTGAGMLVSISDRDFTDTFGEVGALAKRLSEERKEGGSELMRELAAGRPSGFGLTASPEEVETQTL
Ga0111538_122170841 | 3300009156 | Populus Rhizosphere | MAGKADFTEEEWDALRKGVTGAGMLVSVGDRDFTDSFGEASALAHRLLEEHEQSESELVRELAGVRSTGFGFTLSAKKAEAETLEALRSATAILAAKAPDEADAYRKLVLDVADAVANAK
Ga0111538_126916171 | 3300009156 | Populus Rhizosphere | MATRSDFTDDEWEAMQHGVTGAGALVSVSDRDFTDTFGEASALAKALAAYREQSDSVVIRELAKARGAGFGLTDSPQEME
Ga0111538_130285672 | 3300009156 | Populus Rhizosphere | VATKADFTEEEWKTMQKGVTGAGMLVSISDADFTDSFGEASALAKHLAEEQQRSESALIRDLAHVHGTGFGLTASREKAETETLDALRAA
Ga0105252_102750872 | 3300009678 | Soil | MATKADFSEDEWKALQKGMTGAGMLVSVSDRDFTDSFGEASALAKYLGRQREEGASELMKELAHAKGTGFGLTDSPQEV
Ga0105066_10429952 | 3300009822 | Groundwater Sand | MATKADFTEDEWETMQKGVTGAGMLVSVGDRDFTDSFGEAGALAKYLGDQREGSESELVRELASVRGTGFGLTDSQQEVEAETLAAKAPDEAGAYRQLVLDVAEAVAEAKGGVKPGETAAVEAIKGALGPA*
Ga0126313_101454712 | 3300009840 | Serpentine Soil | MATRTDFTEDEWETIRKGVTGAGMLVSIGDRDFTDTFGEAGALAKRLNEEREQSGSQLVRQLATGRPSGFGLTDSPQEVEAKTLEALRSATAILAAKAPDEAGA*
Ga0126304_106037062 | 3300010037 | Serpentine Soil | MATKTDFTEDEWETMKKGVTGAGMFVSIGDRDFTDTFGEVGALTKRLSQEREQSESQLVRELTAGRPSGFGLTASPQEVEA
Ga0126309_106991522 | 3300010039 | Serpentine Soil | MATRADFTEDEWETMRKGVTGAGMLVSIKDRDFTDTFGEVGALAKRLSEERMDSPSALMRELASDRPKGFGLTASPDEVEAETLDALRSAIAI
Ga0126380_104662873 | 3300010043 | Tropical Forest Soil | MAAKADFTEDEWETMKKAVSGAGILVSVGDRDFTDTFGEVAALTKRLSEERAESTSALVREVASERPKGFGLTASPQEIETETIDALRSATAILAAKAPDEADAYRQFVLDV
Ga0126311_103253233 | 3300010045 | Serpentine Soil | MATRADFTEDEWETMRKGVTGAGMLVSIGDRDFTDTFGEVSALAKRLNEERGESPSALMRELASDRLKGFGLTASPDEVEAETLDALRSAIAILAAKAPDEVG
Ga0126382_120150141 | 3300010047 | Tropical Forest Soil | MATKADFSEDEWTALHKGATGAGMMVSVGDRDFTDTFGEAGALAKQMRKAHEHSQSELVRELAGIRGSGFGFTTSPQKAETETLDSLRAAMAALEAKAPEEKAAYRQFVLDVANAVASAKGGVTPSETAVIEKVTAALDAS*
Ga0126372_132446301 | 3300010360 | Tropical Forest Soil | MEGAEMATKADFSEDEWKTMHKGVTGAGMFVSASDADFTDSFGEASALAHFLAEQHQGNSSELVRELTGIHGSGFGLTASPQKVEAETLSS
Ga0134066_103659971 | 3300010364 | Grasslands Soil | MASKADFTEEEWKAMQKGVTGAGMLVSVSDRDFTDTFREAGALARSLAEQHQSNESELIRELASTHGSGFGLTASPQKVEEETLASLRSATATLASKAPDEVDAYR
Ga0105239_133638992 | 3300010375 | Corn Rhizosphere | MDSSGNREKGADVATKADFTEAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASSLAKYLAEQQRNSGSDFIGDLAHVHGTGFGLTASREKAETETLEALRSAAATLQ
Ga0126383_110804231 | 3300010398 | Tropical Forest Soil | MATKADFTEQEWDALHKGVTGAGLLVSVGDRDFTDTFGEAGALAKQLREAHERSTSELVRELAGVRGTGFGFTTSPAKAETETLDALNSAMTALA
Ga0150985_1142415911 | 3300012212 | Avena Fatua Rhizosphere | MKPTHGAGRDTGGSTDRGGPMATKADFTEEEWKALQKGVTGAGMLVSVSDRDFTDTFGEASALAKYMAAQHETNESPLIREIAASKGSGFGMTASVDKVDAQTFQALATALETLQA
Ga0137370_105287312 | 3300012285 | Vadose Zone Soil | MASKADFTEEEWKAMQKGVTGAGMLVSVSDRDFTDTFGEAGALARSLAEQHQSNESELIRELAGIHGSGFGLTASPDKVESETLSSLRTAMAALASKAPDDL
Ga0157300_11190152 | 3300012884 | Soil | MATKADFSEDEWKTMQKGITGAGVLVSVSDRDFTDTFGEASALAKYLGRQRESGASDLIRELAHAKGTGFGFTDSPQEVETETLE
Ga0157294_102245171 | 3300012892 | Soil | MATKADFSEDEWKTMQKGITGAGVLVSVSDRDFTDTFGEASAVAKYLGRQRESGASELIRELAHAKGTGFGFTDSPQEVETETLEALRSSVATLAAKASDEVDAYRQLV
Ga0157296_101100482 | 3300012905 | Soil | MATKADFSEDEWKTMQKGITGAGVLVSVSDRDFTDTFGEASAVAKYLGRQRESGASELIRELAHAKGTGFGFTDSPQEVETE
Ga0162650_1000780272 | 3300012939 | Soil | MATKADFSEDEWKTMQKGVTGAGMLVSVSDRDFTDSFGEASALAKYLGGQRESGPSELMRELAHAKGTGFGLTDSPQEVETETLAAIRSSVATMESRARETPSSRRAN
Ga0126369_109504343 | 3300012971 | Tropical Forest Soil | VATKADFTEEEWKAMQRGVTGAGMLVSVGDRDFTDSFGEASAMAKYLAGQHQQNESELLRAVAGTHGSGFGLFTSPQKMEAETFEAL
Ga0134081_103758301 | 3300014150 | Grasslands Soil | MASKADFTEEEWETMRKGVAGAGMLVSVGDRDFTDTFGEAGALAKQLGQEHEAAGSEFIRELAAGRPAGFGLTASTQEVEAETLEALRSTTAI
Ga0134078_103700202 | 3300014157 | Grasslands Soil | MIVERKVVMASKADFTDQEWKAMQKGVTGAGMLVSISDRDFTDTFGEAGALAKYLGEEHEKSGSELIRELASTHGSGFGLTASPQKVEEETLASLRSAAATLASKAPDEVDAYRQ
Ga0075351_10820912 | 3300014318 | Natural And Restored Wetlands | MANKADFTEDEWKALQKGVTGAGMLVSVGDRDFTDSFGEASALAKYLAAQKQANESTLMRGIADIHGTGFGMTDSPQKVQAETMDALRAALA
Ga0163163_102821381 | 3300014325 | Switchgrass Rhizosphere | MATKADFTEDEWHALQKGVTGAGMLVSVSDADFTDSFGEASAFGKFLAEQHQKNESELVREIAGVHGGGFGLTASREKVETETMAALRTAVA
Ga0132258_1082795643300015371Arabidopsis RhizosphereVATKADFTKEEWKTMQKGVTGSGALVSISDPDFTDSFGEASAIGKYLAQQHEKSDSVLIRDLTHVHGTGFGLTASREKVETETLDALRSATA
Ga0132256_10190136923300015372Arabidopsis RhizosphereMAGKADFSEEEWDTLHKGVTGAGLLVSVGDRDFTESFGEANALARRLVEEHEHSESELVRELAGVRGTGFGFTTSAKKAEAETLEALRSATAILAAKA
Ga0132256_10247497623300015372Arabidopsis RhizosphereMASKADFTEDEWETMRKGVAGAGMLVSVGDRDFTDTFGEVATLAKRLGQEQQAGGSELMRELAAGKPAGFGLTASAQEVEEETLEALRSATATLAAKA
Ga0132256_10305427723300015372Arabidopsis RhizosphereMDSSGNREKGADVATKADFTEAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASSMAKYLAEQQRNSGSDLIGDLAHVHGTGFGLTAS
Ga0182032_1156944223300016357SoilVATKTDFSEDEWKAMHKGVTGAGMLVSVSDADFTDSFGEAKALAKELVEERTQGTTQLVRELASGGGTGFGFGASRDKVEAETLDSLRSTM
Ga0182037_1213607123300016404SoilVATKDMFTEDEWEALHKGATGAGLLVSVSDRDFTDSFGEASELAKRLRQEHEESASELVRELAAVRGTGFGLTTSPQKAETETLDAIGSAMATLAAKAPEERDAYRQFVLDLANAVAGAKGG
Ga0134074_125977923300017657Grasslands SoilVATKADFTEDEWGTLHKGVTGAGMLVSVSDRDFTDTFGEAGALAKRLRKEHEQSPSELVRELAGTHGTGFGFTASPQKVEAETLEALRSAAAMLAARAPGEAEAY
Ga0134083_1018055023300017659Grasslands SoilMASKADFTEEEWETMRKGVAGAGMLVSVGDRDFTDTFGEAGALAKRLGQEREAGGSEFIRELAAGRPAGFGLTASAQEVEAET
Ga0187785_1038181713300017947Tropical PeatlandMATKADFTEDEWKTMQKGVAGAGALVSVSDPDFTDSFGEASALAKYLAEQQQASGSVLIRDLAHVHGTGFGFTTSREKAETETL
Ga0190268_1045356823300018466SoilMATKADFSEDEWQAMQRGITGAGMLVSVSDRDFTDSFGEASAIAKYLARQREEGSSELMKELAQAKGTGFGLTDSPQKVETETLDALSSSVATLTTKAPD
Ga0190270_1246089923300018469SoilMATKADFSEEEWKTMQKGVTGAGMLVSVSDRDFTDTFGEASAISKYLGRQREEGATELIKELAHAKGTGFGLTDSPQEVETETLDALRASAATLAAKASGEVAHYRELVL
Ga0190274_1355569413300018476SoilMATKADFSEDEWKTMQKGVTGAGMLVSVSDKDFTDSFGEASALAKYLGRQRDEGASELIKELAHAKGTGFGLTDSRQEVETETLDALHSS
Ga0190271_1058216613300018481SoilMATKADFSEDEWKAMQKGVTGAGMLVSVSDRDFTNSFGEASALAKYLGRQREEGATELIKELAHAKGTGFGLTDSPQEVET
Ga0190271_1241735813300018481SoilMATKADFTEDEWKAMQKGITGAGMLASISDRDFTDSFGEASALAKFLGAQRESNASDLIRELAAVKSTGFGFTDSPQ
Ga0066669_1126561413300018482Grasslands SoilMATKADFTEEEWQAMQKGVVGAGMLVSFSDRDFTDTFGESKALAKYLGAQHRVNPSPLIREITGGHGSGFGMTASAEKVECGALAPLRS
Ga0173481_1034444623300019356SoilMATKADFSEDEWKTMQKGITGAGVLVSVSDRDFTDTFGEASAVAKYLGRQRESGASELIRELAHAKGTGFGFTDSPQEVETETLE
Ga0193741_103632913300019884SoilMATKADFSEDEWKTMQKGVTGAGMLVSVSDRDFTDSFGEASALAKYLGRQREGDASELMRELATAKGTGFGFTDSPQEVETETLAALHSSVATL
Ga0212128_1044486023300022563Thermal SpringsMATKADFTSDEWTALEKGVTGAGMLVSVGDRDFSDSFGEASALAKALAAQRGSSSELVCELAAVRGTGFGLTASPHEVESETIAALDKIRQALGGAA
Ga0207685_1035675713300025905Corn, Switchgrass And Miscanthus RhizosphereMATKADFTEEEWKALQKGVTGAGMLVSVSDRDFTDTFGEASALAKYMAAQHETNESPLIREIAASKGSGFGMTASVDKVDAQTFQALATAL
Ga0207664_1195421013300025929Agricultural SoilVATKADFTDAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASSLAKYLAEQQQHGGSDLIRDLAHVHGTGFGLTASREKAETETLDALRSAA
Ga0207706_1076916913300025933Corn RhizosphereVATKADFTEEEWKTMQKGVTGAGMLVSISDADFTDSFGEASALAKHLAEEQQRSGSALIRDLAHVHGTGFGLTASREKAETETLDALRAAVT
Ga0207711_1169156913300025941Switchgrass RhizosphereMDSSGNREKGADVATKADFTEAEWKTMQKGVTGAGMLVSVSDPDFTDSFGEASALAKHLAEEQQRSGSALIRDLAHVHGTGFGLTASREKAETETL
Ga0207591_10844413300026827SoilMATKADFSEDEWKTMQKGITGAGVLVSVSDRDFTDTFGEASAVAKYLGRQRESGASDLIRELAHAKGTGFGFTDSPQEVETETLEALRSSVATLAAKASDEVDAYRQLVLGIGAF
Ga0209886_104378023300027273Groundwater SandMATKADFTEEEWETMQKGVTGAGMLVSVGDRDFTDSFGEAGALAKYFGEQREASQSQLVRELASVRGTGFGLTASQQEVGAETVAALRSASATLAA
Ga0207981_111281723300027560SoilMATKADFTEEEWKTMQKGVTGAGLLVSVGDRDFTDSFGEAGALAKYLAEERDQNASELIQELAHVHGSGFGFTASQQEVETETVAALGSAVATL
(restricted) Ga0233416_1001797933300027799SedimentMATRADFTEDEWKAMEKGVTGAGMLVSVGDRDLTDSFGEASALAKALAVRREQSPSELVRELAAVRGTGFGMTASPEKVEQETLASLRLAT
Ga0209382_1025522313300027909Populus RhizosphereMAGKADFTEEEWDALHKGVTGAGMLVSVGDRDFTDSFGEASALAHRLLEEHEQSESELVRELAGVRSTGFGFTLSAKKAEAETLEALRSAKAILAAKAPDEADAYRKLVLDVADAVANAK
(restricted) Ga0233418_1014987423300027995SedimentMATRADFTEDEWKAMEKGVTGAGMLVSVGDRDLTDSFGEASALAKALAVRREQSPSELVRELAAVRGTGFGMTASPQKVE
Ga0247822_1080216713300028592SoilMATKADFSEDEWKTMQKGITGAGVLVSVSDRDFTDTFGEASAVAKYLGRQRESGASDLIRELAHAKGTGFGFTDSPQEVET
Ga0307305_1042653213300028807SoilVASKADFTDEEWKTMQKGVTGAGTLVSISDPDFTDSFGEASAIGKYLAEQREQSDSTLIRDLTHVHGTGFGLTASREKVETETLDALRSA
Ga0247825_1078012213300028812SoilMAAKADFTEDEWKAMQKGVTGAAALVSISDRDFTDSFGEASAIAKYLAEQHDKSESALVRDLAKVRGTGFGLTASAQEVESETLDALHAATE
Ga0302257_111935313300028855FenMASKDDFSADEWGTLWKGVTGAGMLVSVADRDFTDSFGEASALAKRISEERVQGASELLRELAAGHGTGFGLTASPQKVETETLEALRAATATLTTKAPDELAGYRQFVLDVADAVAEAKGG
Ga0247827_1098553113300028889SoilMATKADFTEDEWKAMQKGVTGAGALVSISDRDFTDSFGEASAIAKYLAQQRETSDSPLVRDLAKVRGTGFGLTASAQEVEAQTVDALHTATETLGAKAP
Ga0311350_1046178513300030002FenMASKDDFSADEWGTLWKGVTGAGMLVSVADRDFTDSFGEASALAKRISEERVQGASELLRELAAGHGTGFGLTASPQKVETETLEALRAATA
Ga0302299_1012979033300030010FenMASKADFTADEWGTLWKGVTGAGMLVSVGDRDFTDSFGEASALAKRISEERVSGASELLRELAAGHGTGFGLVASPQKVETETLEALGSAT
Ga0311333_1006432743300030114FenMASKDDFSADEWGTLWKGVTGAGMLVSVADRDFTDSFGEASALAKRISEERVQGASELLRELAAGHGTGFGLTASPQKVETETLEALHAATATLTTKAPDELAGYRQFVLDVA
Ga0247826_1056963223300030336SoilWKTMQKGITGAGVLVSVSDRDFTDTFGEASALAKYLGRQRESGASDLIRELAHAKGTGFGFTDSPQEVETETLEALRSSVATLAAKASDEVDAYRQLVLGIGAFVAEAKGGVTDVEAATIAKLEEALGVS
Ga0307498_1026807823300031170SoilMATKADFTEDEWDALQKGVTGAGMLVSVSDADFTDSFGEASAFAKFLGEQHRTNDSELVREIAGVHGHGFGLTASREKVE
Ga0307499_1026682213300031184SoilMATKADFTEDEWKALQRGVTGAGTLVSVSDRDFTDSFGEASALAKHLAEQQKTGATELMREIAHARGTGFGFTDSPQEV
Ga0307505_1054816813300031455SoilMATKADFSEDEWKALQRGVTGAGTLVSVSDRDFTDSFGEASALAKHLAEQQKSGTSELMREIAHARGTGFGFTDSPQEVEAGTLEALRT
Ga0318496_1073129313300031713SoilVATKDMFTEDEWEALHKGATGAGLLVSVSDRDFTDSFGEASELAKRLRQEHEESASELVRELAAVRGTGFGLTTSPQKAETETLDAIGSAMATLAAKAPEE
Ga0318554_1088029523300031765SoilMATKADFTEDEWKAMQKGITGAGMLVSVSDQDFTDSFGEASALAKALAAQRQQGPSELIRELASARGTGFGLTASAHEVESETLASLASATPSTS
Ga0318523_1045367713300031798SoilVATKDMFTEDEWEALHKGATGAGLLVSVSDRDFTDSFGEASELAKRLRQEHEESASELVRELAAVRGTGFGLTTSPQKAETETLDAIGSAMATLAAKAPEERDAYRQFVLDLANAVAGAKGGVTPAETAA
Ga0318565_1041475513300031799SoilVATKDMFTEDEWEALHKGATGAGLLVSVSDRDFTDSFGEASELAKRLRQEHEESASELVRELAAVRGTGFGLTTSPQKAETETLDALGSAIATLAAKAPEERDAYRQFVLDLANAVAGAKGG
Ga0318567_1068629213300031821SoilVATKDMFTEDEWEALHKGATGAGLLVSVSDRDFTDSFGEASELAKRLRQEHEESASELVRELAAVRGTGFGLTTSPQKAETETLDAIGSAMATLAAKAPEERDAYRQFVLDLANAVAGAKGGVT
Ga0310892_1098102813300031858SoilMATKADFSEVEWKTMQKGITGAGMLVSVSDRDFTDTFGEASALAKYLGRQRESGASDLIRELAHAKGTGFGFTGSPQEVETETLEALRSSVATLAVKASDEVDAYRQVVLGIGAFVA
Ga0306919_1067729623300031879SoilVATKDMFTEDEWEALHKGATGAGLLVSVSDRDFTDSFGEASELAKRLRQEHEESASELVRELAAVRGTGFGLTTSPQKAETETLDAIG
Ga0311367_1233253323300031918FenMASKDDFTADEWGTLWKGVTGAGMLVSVADRDFTDSFGEASALAKRISEERVTSASELLRELAAGHGTGFGLTASPQKVETETLEALRAA
Ga0318569_1049147313300032010SoilVATKDMFTEDEWEALHKGATGAGLLVSVSDRDFTDSFGEASELAKRLRQEHEESASELVRELAAVRGTGFGLTTSPQKAETETLDAIGSAMATLAAKAPEERDAYRQFVLDLANAVAGAKGGVTPAETAAIE
Ga0310890_1164077913300032075SoilMATKADFTEDEWEAMKKGVTGAGMLVSIGDRDFTDTFGEAGALAKYLTAQRQQGATELVRELSEGRPPGFGFTASPQEVETETLAALGSAKAILAEKSPDEADSYAQHVLG
Ga0335069_1188655113300032893SoilMATKTDFTEDEWSALQKGVTGSGMLVSVSDADFTDSFGEASALAHYLAEQSEKGETELIRELSHVHGSGFGLTASAQKVREETLE
Ga0335084_1234483513300033004SoilMATRADFTEDEWKAMQKGVTGTGLLVSVSDRDFTDTFGEAGALAKTVAEEHEKSESALIRELASIHGSGFGLTGSQEKVENETLDALRSAVATLTAKAPDDVAAYRKLV
Ga0335077_1011482933300033158SoilMATKADFTEDEWSAMQKGVTGAGLLVSVSDADFTDSFGEASALAKYLVGQRENGETELVRELSHAHGGGFGLGASPQKVKEETLEALAPRS
Ga0326726_1075694013300033433Peat SoilMATKAAFTEDEWDALHKGVTGAGLLVSVGDRDFTDTFGEAGALAKQLREAHERSTSELVRELAGSRGTGFGFTTSPEKAEAEALAALGSAMTTLAAKAPDEADAYRQLVLDVANA
Ga0247829_1127593513300033550SoilMPTKADFSEDEWQAMQKGITGAGMLVSVTDRDFTDSFGEASAIAKYLGRQREEGSSELMKDLAHAKRSGFGLTDSLQEVETETMDALRSSVATLTAKAPDEVG
Ga0247830_1107022513300033551SoilMATKADFSEDEWKTMQKGITGAGALVSVSDRDFTDTFGEASALAKYLARQRESGASDLIRELAHAKGTGFGLTDSPQEVETETLAALRSSVATLTAKASDEVDGYRELV
Ga0364935_0335307_180_5033300034151SedimentMATKADFSEDEWKTMQKGVTGAGMLVSVSDRDLTDSFGEASALAKYLGRQRETGASELIRDLAHAKGTGFGFTDSPQEVEAETLAALRSSVATLAAKASDEVDAYREL
Ga0370485_0087299_1_3483300034358Untreated Peat SoilVRHDLSHDHANGTTGDPMASKADFTEDEWETLWKGVTGAGMLVSVGDRDFTDSFGEASALAKRISEERVSGASELLRELAAHHGTGFGLVASPQKVETETLESLGSATALLTAKAP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice