NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F073808


Overview

Basic Information
Family ID F073808
Family Type Metagenome / Metatranscriptome
Number of Sequences 120
Average Sequence Length 54 residues
Representative Sequence MRILLLNLGPESTQEVNQALSGQGYELTTGRRLTVDKILALSPEVLITEATPSDLS
Number of Associated Samples 94
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 40.00 %
% of genes near scaffold ends (potentially truncated) 4.17 %
% of genes from short scaffolds (< 2000 bps) 2.50 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.45

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
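
The quality percentages above can be read back as approximate gene counts. The following is a minimal sketch, assuming each percentage was computed against the 120 sequences listed under Basic Information and rounded to two decimals; the function name `pct_to_count` is illustrative and not part of NMPFamsDB.

```python
def pct_to_count(pct: float, total: int = 120) -> int:
    """Convert a reported percentage back to the nearest whole gene count."""
    return round(pct * total / 100)


# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motifs": 40.00,
    "near scaffold ends": 4.17,
    "short scaffolds (< 2000 bps)": 2.50,
}

# Assumption: every percentage shares the same denominator (120 genes).
counts = {label: pct_to_count(pct) for label, pct in quality.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of 120 genes")
```

Under that assumption the three values correspond to roughly 48, 5, and 3 genes.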
Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (96.667 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(28.333 % of family members)
Environment Ontology (ENVO) Unclassified
(30.833 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.167 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.38%, β-sheet: 0.00%, Coil/Unstructured: 72.62%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.45

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
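
The confidence rule quoted above (pTM < 0.7) and the per-residue pLDDT bins from the structure viewer legend can be expressed as a short check. This is a sketch only: the function names are hypothetical, and the handling of values that fall between the integer bin labels (for example 50.5) is an assumption, since the page does not state it.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < threshold


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four legend bins.

    Assumption: fractional values are assigned to the lower bin up to
    and including its upper label (e.g. 50.0 -> "0-50", 50.5 -> "51-70").
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {plddt}")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# Family F073808 reports a pTM-score of 0.45.
print(is_low_confidence(0.45))
print(plddt_bin(85.0))
```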



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 120 Family Scaffolds
PF00582 | Usp | 5.00
PF11453 | DUF2950 | 4.17
PF00106 | adh_short | 2.50
PF02852 | Pyr_redox_dim | 2.50
PF11154 | DUF2934 | 2.50
PF00296 | Bac_luciferase | 1.67
PF14378 | PAP2_3 | 0.83
PF03466 | LysR_substrate | 0.83
PF04545 | Sigma70_r4 | 0.83
PF10431 | ClpB_D2-small | 0.83
PF00291 | PALP | 0.83
PF08240 | ADH_N | 0.83
PF00069 | Pkinase | 0.83
PF13424 | TPR_12 | 0.83
PF14342 | DUF4396 | 0.83
PF02321 | OEP | 0.83
PF14720 | NiFe_hyd_SSU_C | 0.83
PF07494 | Reg_prop | 0.83
PF00675 | Peptidase_M16 | 0.83
PF13581 | HATPase_c_2 | 0.83
PF02954 | HTH_8 | 0.83
PF11737 | DUF3300 | 0.83
PF13602 | ADH_zinc_N_2 | 0.83
PF07228 | SpoIIE | 0.83
PF03264 | Cytochrom_NNT | 0.83
PF04055 | Radical_SAM | 0.83
PF00201 | UDPGT | 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 120 Family Scaffolds
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 3.33
COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 1.67
COG1819 | UDP:flavonoid glycosyltransferase YjiC, YdhE family | Carbohydrate transport and metabolism [G] | 1.67
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 1.67
COG3005 | Tetraheme cytochrome c subunit NapC of nitrate or TMAO reductase | Energy production and conversion [C] | 0.83
COG3292 | Periplasmic ligand-binding sensor domain | Signal transduction mechanisms [T] | 0.83



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 96.67 %
All Organisms | root | All Organisms | 3.33 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300005536|Ga0070697_100609446 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 960
3300011270|Ga0137391_10705152 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 838
3300015053|Ga0137405_1399469 | All Organisms → cellular organisms → Bacteria | 2016
3300027738|Ga0208989_10022916 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2151
3300030991|Ga0073994_12059523 | Not Available | 814
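
Each scaffold identifier above appears to pair a numeric taxon OID (the same values listed under Associated Samples) with a scaffold name, joined by a `|` character. A minimal sketch for splitting such identifiers follows; the `ScaffoldRef` type and `parse_scaffold_id` function are hypothetical helpers, and the identifier layout is inferred from the rows on this page rather than from a documented specification.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # numeric sample identifier, e.g. "3300005536"
    scaffold: str   # scaffold name within that sample


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the taxon OID is all digits and the separator is the
    first '|' in the string, as in the rows listed on this page.
    """
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold)


ref = parse_scaffold_id("3300005536|Ga0070697_100609446")
print(ref.taxon_oid, ref.scaffold)
```

The taxon OID obtained this way can be used to look up the sample name in the Associated Samples list.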




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 28.33%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 19.17%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 13.33%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 9.17%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.33%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.00%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 4.17%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 3.33%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.33%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.83%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 0.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.83%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.83%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.83%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.83%
Bog Forest Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil | 0.83%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001593 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002681 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF120 (Metagenome Metatranscriptome, Counting Only) | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004635 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005538 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005591 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005712 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010341 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM2 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021402 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022508 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-19-M (Metagenome Metatranscriptome) | Environmental
3300022510 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-14-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022712 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-M (Metagenome Metatranscriptome) (v2) | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026507 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-B | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027105 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF018 (SPAdes) | Environmental
3300027504 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_O3 (SPAdes) | Environmental
3300027587 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes) | Environmental
3300027616 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M3 (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027678 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes) | Environmental
3300027738 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes) | Environmental
3300027767 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM2 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027884 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes) | Environmental
3300027895 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028798 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_E2_2 | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300029943 | I_Palsa_N3 coassembly | Environmental
3300029944 | II_Palsa_E1 coassembly | Environmental
3300029999 | I_Palsa_E3 coassembly | Environmental
3300030053 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Palsa_E1_2 | Environmental
3300030399 | II_Palsa_E2 coassembly | Environmental
3300030503 | III_Palsa_E3 coassembly | Environmental
3300030618 | II_Palsa_E3 coassembly | Environmental
3300030991 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031234 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2 | Environmental
3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300034163 | Peat soil microbial communities from wetlands in Alaska, United States - Goldstream_04D_14 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12635J15846_106873701 | 3300001593 | Forest Soil | MRILLLNLGVESAQEVNQALSGQGYEIAADRRLTVDKILALSPEVLITEATPSDLSCCGVISQIKASPD
JGIcombinedJ26739_1004303271 | 3300002245 | Forest Soil | MESVQEVGRALSGQGYEISADRSLTVEEVLTLSPGVLVTEVTPSDLGSCDL
JGIcombinedJ26739_1014440912 | 3300002245 | Forest Soil | MRILLLNLGLESTPEVTQALSGQGYELATGRRLTVDKILALSPEVLIT
Ga0005471J37259_1247042 | 3300002681 | Forest Soil | VMRILLLNLGMQSTREVKQALSGQGYEITADRSLTVDEVLALSPEVLIT
Ga0062389_1008105002 | 3300004092 | Bog Forest Soil | MRILLLNLDQGSTQEVTQALSGQGYEITTGRGLSVDEILGLSPGVLITEATPSDLSCCGLISQ
Ga0062388_1026734302 | 3300004635 | Bog Forest Soil | MKILLLDLGPESGPAVHQALDGQGYEILSEHGLTVEAVLALAPEVLITEATPS
Ga0062388_1027266642 | 3300004635 | Bog Forest Soil | MRILLLDLGVESAQAVEQALAGQGYDITTEPGLTVEAILALSPEILITEATP
Ga0066672_107419712 | 3300005167 | Soil | MRILLVSLGMESTQNVKQALLGQGYEVTSEPSLTVDEILALSPELLI
Ga0066688_100859886 | 3300005178 | Soil | LTAMRILLLNLGPESTQEVNQALSGQGYELTTGRRLTVDKILALSPEVLITEATPSDLSCCGV
Ga0070706_1008846832 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MRILLLDLGLESTQEVKQALSGQGYEIITGRGLTVDEILALSPELLITEATPSDLSC
Ga0070707_1015043181 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MRILLLDLGMESTQEVPQALSSQGYEITTHSGLTVDEILARSPEVLITEATPSDLSCCGVISQIK
Ga0070699_1004226152 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MLGDKLTSMRILLLNLGMESIQEVKQALSGQGYEITTERDLTVDEIL
Ga0070697_1006094462 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MRILLLDLGMESTQEVTQALSSQGYEITTHSGLTVDEILARSPEVLITEATPSDLSCCGVISQIK
Ga0070731_102729961 | 3300005538 | Surface Soil | VRILLLNLGGESSEAVKQALSGQGYEVATTQTQAVDEVLALSPEILITEATPSDL
Ga0066703_102694501 | 3300005568 | Soil | MRILLLNLGPESTQEVNQALSGQGYELTTGRRLTVDKILALSPEVLITEATPSDLS
Ga0070761_109326341 | 3300005591 | Soil | MRILFLDLGAESAQQAELALSGQGYEITTGRGLTVDEILARSPEVLITEATPSDLSCCGLIS
Ga0070762_105049551 | 3300005602 | Soil | MRILLLNLGTESSQDVTQALSGQGYEITTERNLTVDEILALSPELLIMEATPSNLSCCGLISQIKAS
Ga0070764_107498772 | 3300005712 | Soil | MRILLLNLDQGSTEKVAEALTGQGYEITTRLGLSVDEILGLSPGVLITEATPSDLSCCGLISQIK
Ga0070765_1005173381 | 3300006176 | Soil | MRILLLNMGMESNQEVKQALSGQGYEIIADRNLTVDEILALQPELLITEATPSNLSCCGLISQIKAS
Ga0073928_105365192 | 3300006893 | Iron-Sulfur Acid Spring | MRILLLNLGPESTQEVMQALSAQGYELTTGRRLTVDKILALSPEVLITEATPSDLSCCGVISQI
Ga0099794_106333312 | 3300007265 | Vadose Zone Soil | MRILLLNLGMGSTQEVEQALSGQGYEITADRGLTIDEILALSPEVLVTEATPSDLSCCGVIS
Ga0099830_102499491 | 3300009088 | Vadose Zone Soil | MRILLLNLGVESAEQVNQALSGQGYEITTDRRLTVEKILA
Ga0099828_105876492 | 3300009089 | Vadose Zone Soil | MRILLLDLGMESTQEVKQALSGQGYEITTDTGLTVDEILARSPEVLITEATPSDLSCCGVISQIKASP
Ga0099828_114910762 | 3300009089 | Vadose Zone Soil | MRILLLNLGIESTQEVQQALSGQGYEITTDHSLTVDE
Ga0099827_110115131 | 3300009090 | Vadose Zone Soil | MRILLLNLGMESTQEVKQGLSGQGYEITTARSLKVDEILALSPEVLITE
Ga0099827_117186911 | 3300009090 | Vadose Zone Soil | MRILLLNLGVESAQEVNQALSGQGYEIITDRRLTVDKILVLSPEVLITEATPSDLSCCGV
Ga0099792_100601691 | 3300009143 | Vadose Zone Soil | MMTAMRILLLNLGTESTQEVMQGLSGQGYEITTGRRLTVDEILVLSPEVLITEATPSDLSCCG
Ga0099792_104690512 | 3300009143 | Vadose Zone Soil | MRILLLNLGMGSTQEVEQALSGQGYEITADRGLTIDEILALSPEVLVTEATPSDLSCCGVISQIKT
Ga0074045_102515622 | 3300010341 | Bog Forest Soil | MHILLLNLGAESAQEVEQALSGQGYEITTGRGLTVDEILARSPEVLITEATPSDLS
Ga0137392_101068511 | 3300011269 | Vadose Zone Soil | MRILLLNLGVESAEQVNQALSGQGYEITTDRRLTVEKILALSPEVLITEATPSDL
Ga0137391_107051522 | 3300011270 | Vadose Zone Soil | MRILLLDLGMESTQEVTQALSSQGYEITTHSGLTVDEILARSPE
Ga0137391_107410461 | 3300011270 | Vadose Zone Soil | MRILLLNLGVESAEQVNQALSGQGYEITTDRRLTVEKILALSPEVLITEATPSDLS
Ga0137393_104696442 | 3300011271 | Vadose Zone Soil | MRILLLNLGKQSAQDVTQALAGQGYEITAERFLNVDEILALSPEVLI
Ga0137388_113811271 | 3300012189 | Vadose Zone Soil | MRILLLNLGPESTQEVNQALSGQGYEITTDRRLTVDKILALSPEVLITEATPSDLSCCGV
Ga0137363_109975371 | 3300012202 | Vadose Zone Soil | MHIVLFNLGIESTQKVDQALSGQGYEITAHRSPKVDEILALSPEVLITEATPSDL
Ga0137363_117486852 | 3300012202 | Vadose Zone Soil | MRILLLNLGMGSTQAVEQALSGQGYEITADRGLTIDEILAL
Ga0137399_108716422 | 3300012203 | Vadose Zone Soil | MRILLLNLGPESTQEVTQALSGQGYELTTGRRLTVDKILALSPEVLITEATPSDLSCCGVISQIKASP
Ga0137399_110642492 | 3300012203 | Vadose Zone Soil | MRILLLDLGVESTQEVKQALSGQGYEITTGWSLSVDEVLALFPEVLITEATPSDLSCCGLISQIK
Ga0137399_114050111 | 3300012203 | Vadose Zone Soil | MRILLLNLGPESTQEVTQALSGQGYELTTGRRLTVDKILALSPEVLITEATPSDLSCCGVISQIKASPD
Ga0137399_116813671 | 3300012203 | Vadose Zone Soil | MRILLLNVGIDSTQEVKQALSGKGYEITTERDLTVDEILALSP
Ga0137362_110917921 | 3300012205 | Vadose Zone Soil | MRILLLDLGMESTREVTQALSGQGYEITTGSGLMVDKILALSPEVLITEATPSDLS
Ga0137378_106328771 | 3300012210 | Vadose Zone Soil | MRILLLNLGPESTQEVNQALSGQGYEIITDRRLTVEKILALSPE
Ga0137359_114256271 | 3300012923 | Vadose Zone Soil | MRILLLNLGMGSTQAVEQALSGQGYEITADRDLAIDEILALSPEVLVTEATPSDLSCCGVISQIKAS
Ga0137404_108658961 | 3300012929 | Vadose Zone Soil | MRVLLFNLGMESTQEVRQALSGQGYEITAEHDLTVDEILALSPEVLVTEATPSDLSCCGVISQIK
Ga0137405_13258024 | 3300015053 | Vadose Zone Soil | MRILLLNLGMGSTQAVEQALSGQGYEITADRDLAIDEILALSPEVWSQKQHHPT*
Ga0137405_13994694 | 3300015053 | Vadose Zone Soil | MRVLLFNLGMESTQEVKQALSGQGYEITAERDLTVDEI
Ga0137409_102983571 | 3300015245 | Vadose Zone Soil | MRILLLNLGTESAQAVEHALSVQGYDIAAEHGLTIDQILARSPELLITEATPSNLSCGGLISQIK
Ga0187824_101319662 | 3300017927 | Freshwater Sediment | MRVLLVNLGVESTEAVRQALSDEGYEIATIETQAVDKVLALSPEVLITEATPSDL
Ga0179592_102604421 | 3300020199 | Vadose Zone Soil | MRIFLLNLGPESTQEVTQALSGQGYELTTGRRLTVEKIL
Ga0179592_104209541 | 3300020199 | Vadose Zone Soil | MMTAMRILLLNLGMESTQVVMQALSGQGYEITTGR
Ga0210407_104127442 | 3300020579 | Soil | MRILLLNLSAESSREVNQALSGQGYEFFTESNLTVDEILLLSPDVLITEATPSDLSCCGLISQIASS
Ga0210403_100360839 | 3300020580 | Soil | MRILLLNLGTESTQDVTQALSGQGYEITAERNLTVDEVLALSPELLIME
Ga0210403_106170251 | 3300020580 | Soil | MRILLVNLGTDSNHAVEQALSGQGYDIAADQDLSIDEILARSPGLLITEATPSDLSCCGL
Ga0210401_100200481 | 3300020583 | Soil | LLGGKLTAMRILLLNLGIESNHEVKQALSGQGYEITAERNLTVDKILSLSPEILITEA
Ga0210401_107137632 | 3300020583 | Soil | MRILLLNLAEESTQEVTQALSGQGYEITTGRDLSVDEILSLSPGVLITQATPSDL
Ga0210400_101384243 | 3300021170 | Soil | MRILVLNLGTDSTREVKQALSGQGYEITTEPNLTVDEILALLPEIL
Ga0210405_104999473 | 3300021171 | Soil | MRILILNLGMDSTQEVEQALSGRGYEIIADRGLTIDEILVLSPEVLVTEAT
Ga0210408_105424551 | 3300021178 | Soil | MYVLLLNLVAESTQEAQQALSGQGYEITTERNLTVDEILALSPGVLITEATPSDLSCCGLISQIKAG
Ga0210396_100488741 | 3300021180 | Soil | MRILLLNLGMESIQEVKQALSGQGYEITPERNLTVDKILAL
Ga0210396_108959083 | 3300021180 | Soil | MRILLLNLGTESTQDVTQALSGQGYEITTERNLTVDEVLALSPELLIMEATPSNLKVG
Ga0210385_115079512 | 3300021402 | Soil | MRILLLNLGTESTQAVEHALLGQGYDVAPVRSVTINQILARSPELLITEATPSDLSCCGLISQIKAS
Ga0210389_114195411 | 3300021404 | Soil | MRILLLNLGAESTQEVKQALSGQGYEVISERSLNVDEILALSPELLIT
Ga0210383_109098212 | 3300021407 | Soil | MHILLLNLGMESIREVKQALSGQGYEITTERTLTLDEIQTLSPEILVT
Ga0210391_115215511 | 3300021433 | Soil | MRILLLNLDQGSTQEVTHALSSQGYEITTDHGLSVDEILGLSPGVLITEATPSDLSCCG
Ga0210392_113484542 | 3300021475 | Soil | MRILLLNLGIESNHEVKQALSGQGYEITAERNLTVDKILALSPEILITEA
Ga0210409_106466772 | 3300021559 | Soil | MRILLLNLGMESIQEVKQALSGQGYEITAERNLTVDKILALSPEILITEATPSDL
Ga0222728_10460971 | 3300022508 | Soil | MRILILNLGMDSTQEVEQALSGRGYEIIAVRGLTIDEILVLSPEVLVTEATPSDL
Ga0242652_10234102 | 3300022510 | Soil | MRILILNLGMDSTQEVEQALSGRGYEIIADRGLTIDEILVLSPEV
Ga0242653_11029761 | 3300022712 | Soil | MRILLLNLGVGSTQEVEQALSGQGYEVTADRGLTIDEILALSPEVL
Ga0137417_12362531 | 3300024330 | Vadose Zone Soil | MMTAMGILLLNLGTESTQEVMQALSGQGYETTTGRRLTVDEVEVLAHSPE
Ga0207684_104589531 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MRILLLDLGLESTQEVKQALSGQGYEIITGRGLTVDEILALSPELLITEATPSDLSCCGLITQIKA
Ga0207663_100929261 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MRILLLNLGMESILEVKQALSGQGYEITAERDLTVDKILALSPEILITEATPSDLN
Ga0209377_10850451 | 3300026334 | Soil | MRILLLNLGPESTQEVTQALSGQGYELTTGRRLTVEKILALSPEVLITEATPSDLSCCGV
Ga0257165_10505231 | 3300026507 | Soil | MMTAMGILLLNLGTESTQVVMQALSGQGYEITTGRRLTVDEILAHSPEVLITEATPSDLSCCGL
Ga0179593_12346047 | 3300026555 | Vadose Zone Soil | MRILLFNLGMGSTQAVEQALSGQGYEITADRDLTIDEILALSPEVLVTEATPSDLS
Ga0179587_104842402 | 3300026557 | Vadose Zone Soil | MRILLLDLGVESTQEVTQALSGQGYEITTGRSLSVDEVLALFPEVLITEATP
Ga0207944_10128091 | 3300027105 | Forest Soil | MRILLLNLGMESILEVKQALSGQGYEITAERDLTVDTILALS
Ga0209114_10761851 | 3300027504 | Forest Soil | MRILLLNLDQGSTEKVTEALSGQGYEIKTGRGLPVDEILGLSPG
Ga0209220_10955482 | 3300027587 | Forest Soil | MRILLLNLGVESTPEVTQALSGQGYELTTGRRLTVDKILTLSPEVLITEATPSDLSCCGVISQIKASP
Ga0209106_11201361 | 3300027616 | Forest Soil | MRISLVNLAMESCQEVERALSGQGYEITADRSLTVDEILALSPEVLVTEVTPSDL
Ga0209118_10207791 | 3300027674 | Forest Soil | MRILLLNLGPESTQEVTQALSGQGYELTTGRRLTVEKILALSPEVLITE
Ga0209011_11116823 | 3300027678 | Forest Soil | MRILLLNLGPESIQEVTQALSGQGYELTTGRRLTVEKILALSPEVL
Ga0208989_100229161 | 3300027738 | Forest Soil | MKVETFGDKLKVMRILLLNLGLKSTREVKQALSGQGYEITADRSLTVDEVLALSPEVLITEATPSDLSCCG
Ga0209655_101100642 | 3300027767 | Bog Forest Soil | MRILLLSLDQGSTQEVTQALAGQGYETTTGRGLSVDEILGLSPGVLITEAKPSDLSCCG
Ga0209180_104278311 | 3300027846 | Vadose Zone Soil | MYILLLNLGMESTEEVKQALSGQGYEITTERNLTVDEVLALSPEVLITEATPSDLSCCGL
Ga0209701_106492641 | 3300027862 | Vadose Zone Soil | MRILLLDLGMESTQEVTQALSSQGYEITTHSGLTVDEILARSPEVLITEATPSDLSCCGV
Ga0209701_106761791 | 3300027862 | Vadose Zone Soil | MRVLLLNLGVESTQDVKQALSGQGYEITVDRGLTV
Ga0209275_100458931 | 3300027884 | Soil | MRILLLDLGVESTGQVEQALASQGYEITAGHDLTVDQVFALSPDVLITEATPSDL
Ga0209624_103116761 | 3300027895 | Forest Soil | MHIVLLNVGMKSAQEVQRVLSGQGYEITTRRTPTVDEIHALSAEVLITEATPSDLSCC
Ga0209624_110103342 | 3300027895 | Forest Soil | MRILLVNLDQGSTHEVTQALSGQGYEITTGRDLSVDEIVGLSPGVLITEATPSDLS
Ga0209526_101446051 | 3300028047 | Forest Soil | MRVLLLNLGVESAQEVNQALSGQGYEIITDRRLTVDKILALSPEVLI
Ga0209526_105412642 | 3300028047 | Forest Soil | MRILLLNLGPESTQEVTQALSGQGYELTTGRRLTVEKILALSPEVLITEATPSDLSCCG
Ga0209526_107846851 | 3300028047 | Forest Soil | MRVLLLNLGTESTQEVQQALSGQGYEIARDRNLTVDEVLALSPEVLITEAT
Ga0302222_102573972 | 3300028798 | Palsa | MRILLLNLDQGSTLEVTRALSGQGYEITTGSGLSVDETLGLSPEVLITEATPSDLSCC
Ga0222749_104152882 | 3300029636 | Soil | MRILLLNLGMDSAREVKQALSGQGYEITADRSLSVDEILTLSPEVLVTEA
Ga0222749_105846502 | 3300029636 | Soil | MRSLLVNLAMESVQEVARALSGQGYEITADRNLTVEEMLTLSPEVIVTEVTPSDLGNC
Ga0311340_107805301 | 3300029943 | Palsa | MRILLLNLDQGSTQEVTQALEGQGYEITTGRSLSVDDILGLSP
Ga0311352_103878962 | 3300029944 | Palsa | MRILLLNLDQGSTQEVTQALSGQGYETTTGRDLSVDEVLRLSPGVLI
Ga0311339_109907172 | 3300029999 | Palsa | MRILLLNLDQGSTLEVTRALSGQGYEITTGSGLSVDETLGLSPEVLITEATPSDLSCCGLISQIK
Ga0302177_107218481 | 3300030053 | Palsa | MRILLLNLDQGSTLEVTHALSGQGYEITTGSGLSVDETLGLSPEVLITEATPSDLSCCG
Ga0311353_102578572 | 3300030399 | Palsa | MRILLLNLDQESTQKVTQALSGQGYETTTGRGLSVDEIRALSPGVLITEATP
Ga0311353_112031881 | 3300030399 | Palsa | MRILLLNLDQGSTLEVTHALSGQGYEITTGSGLSVDETL
Ga0311370_107606962 | 3300030503 | Palsa | MRILLINLDQGSTHEVTQALSGQGYEITTGRDLSVDEIVGLAP
Ga0311354_102401943 | 3300030618 | Palsa | MRILLLNLDQGSTQEVTQALSGQGYEITTGRGLSVDEILGLSP
Ga0073994_120595231 | 3300030991 | Soil | MRVLLLNLGMGSTQEVKQALSGQGYEITAERNLTVNEILALSPEVL
Ga0170823_161049702 | 3300031128 | Forest Soil | MRILLLNLGPESTQEVNQALSGQGYEITTDRRLTVDKILALSPEVL
Ga0302325_112208272 | 3300031234 | Palsa | MRILLLNLDQGTTQEVTQAVSGQGYEITTGHGLSVDEILGLSPGVLITEATPSDL
Ga0302324_1005850301 | 3300031236 | Palsa | MRILLINLDQGSTHEVTQALSGQGYEITTGRDLSVDEIVGLAPGVLITEATPSDLSCCGLISQIKAGPD
Ga0310686_1070095392 | 3300031708 | Soil | MRILLLTLDQGSTQEVTQALSGQGYEITTGRGLAVDEILGLSPEVLITEATPSDLSCCGL
Ga0307474_103577831 | 3300031718 | Hardwood Forest Soil | MRILLLDLGVESTQEVTQALSGQGYEITTGRGLSVDEI
Ga0307475_104663991 | 3300031754 | Hardwood Forest Soil | MHVLLFNLGVESTQAVTQALSDQGYEITTGRSLTADEILALSPEVL
Ga0307478_100252971 | 3300031823 | Hardwood Forest Soil | MIAMRILLLNLGTESTQDVTQALSGQGYEITTERNLTVDEILALSPELLIIEATPSNLSCCGLI
Ga0307478_104222001 | 3300031823 | Hardwood Forest Soil | MRILLLDLGVESTQEVTQALSGQGYEITTGRGLSVDEILALFP
Ga0307478_106670103 | 3300031823 | Hardwood Forest Soil | MRILLLNLGTESTQDVTQALSGQGYEITTERNLTVDEILALSPELLIMEATPSNL
Ga0307478_111313131 | 3300031823 | Hardwood Forest Soil | MRILLLNLGTESTQAVEHALLGQGYDIAPVRSVTIDQIPALSPELLITEATPSDLSCC
Ga0307478_111619662 | 3300031823 | Hardwood Forest Soil | MRILLLNLGAESTQEVKQALSGQGYEVLSERSLNVDEILALSPELLITEATP
Ga0307478_113698952 | 3300031823 | Hardwood Forest Soil | MRILLLDLGPESTTAVQQALSGQGYDIISDRGLTVEAILALSPEVLITEATPAD
Ga0307479_108865891 | 3300031962 | Hardwood Forest Soil | MRILLLNLGMESIHEVTQALSGQGYENTAERNLTV
Ga0307471_1013469751 | 3300032180 | Hardwood Forest Soil | MRILLLNLGIESNHEVKQALSGQGYEITAERNLTVDKILALSPEILITE
Ga0370515_0463484_389_535 | 3300034163 | Untreated Peat Soil | MRILLLNLDQGSTQEVTQALSGQGYEITTGRGLSVDEILVLSPGVLITE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice