NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F079151


Overview

Basic Information
Family ID F079151
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 101 residues
Representative Sequence MSAAANTLSGSVPKAAHKRVSGWAQLTRLFPYVARHKTEVLIGFVTQAGMGITGTLLPLILGVITDCIKGAETPLAQLGRLTQIALGPLLPYYHPKD
Number of Associated Samples 96
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 25.00 %
% of genes near scaffold ends (potentially truncated) 98.28 %
% of genes from short scaffolds (< 2000 bps) 89.66 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.31


Most Common Taxonomy
Group Bacteria (72.414 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (18.103 % of family members)
Environment Ontology (ENVO) Unclassified (22.414 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (37.931 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 55.20%    β-sheet: 0.00%    Coil/Unstructured: 44.80%

Predicted 3D Structure

pTM-score: 0.31

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
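
The 0.7 cutoff quoted above can be applied mechanically when scanning many family pages. The following is a minimal sketch, assuming only the threshold stated in the note; the function and constant names are hypothetical and are not part of any NMPFamsDB tooling.

```python
# Cutoff taken from the note above: models with pTM < 0.7 are treated as low confidence.
LOW_CONFIDENCE_PTM = 0.7  # hypothetical constant name


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a predicted model falls below the pTM cutoff."""
    return ptm < threshold


# pTM-score reported for family F079151 on this page.
print(is_low_confidence(0.31))
```

With the value reported here (0.31) the family falls on the low-confidence side of the cutoff, which matches the page's own label.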



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 116 Family Scaffolds
PF00664    ABC_membrane    1.72
PF00962    A_deaminase    1.72
PF00171    Aldedh    0.86
PF00230    MIP    0.86
PF00583    Acetyltransf_1    0.86
PF00903    Glyoxalase    0.86

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 116 Family Scaffolds
COG1816    Adenosine/6-amino-6-deoxyfutalosine deaminase    Nucleotide transport and metabolism [F]    1.72
COG0014    Gamma-glutamyl phosphate reductase    Amino acid transport and metabolism [E]    0.86
COG0580    Glycerol uptake facilitator or related aquaporin (Major Intrinsic protein Family)    Carbohydrate transport and metabolism [G]    0.86
COG1012    Acyl-CoA reductase or other NAD-dependent aldehyde dehydrogenase    Lipid transport and metabolism [I]    0.86
COG4230    Delta 1-pyrroline-5-carboxylate dehydrogenase    Amino acid transport and metabolism [E]    0.86
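
The neighborhood frequencies above are small, so it can help to read them as scaffold counts. The sketch below assumes each percentage is a simple fraction of the 116 family scaffolds rounded to two decimals; that interpretation is an assumption, not something the page states, and the helper name is hypothetical.

```python
FAMILY_SCAFFOLDS = 116  # total quoted in the table headers


def scaffolds_from_percent(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a '% Frequency' value back to the nearest whole number of scaffolds."""
    return round(percent / 100 * total)


# Values copied from the Pfam and COG tables above.
frequencies = {"PF00664": 1.72, "PF00962": 1.72, "PF00171": 0.86, "COG1816": 1.72, "COG0014": 0.86}

for accession, percent in frequencies.items():
    print(accession, scaffolds_from_percent(percent))
```

Under that assumption, 1.72 % corresponds to two scaffolds and 0.86 % to one, so every neighboring domain listed here is supported by at most two scaffolds.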



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
All Organisms    root    All Organisms    72.41 %
Unclassified    root    N/A    27.59 %

Associated Scaffolds


Scaffold    Taxonomy    Length
3300001661|JGI12053J15887_10570830    Not Available    538
3300002245|JGIcombinedJ26739_101036654    All Organisms → cellular organisms → Bacteria    706
3300004020|Ga0055440_10062389    Not Available    836
3300004152|Ga0062386_100155105    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1788
3300005534|Ga0070735_10472783    All Organisms → cellular organisms → Bacteria    748
3300005541|Ga0070733_10407886    All Organisms → cellular organisms → Bacteria    905
3300005610|Ga0070763_10396311    All Organisms → cellular organisms → Bacteria    775
3300005921|Ga0070766_10683847    All Organisms → cellular organisms → Bacteria    694
3300006052|Ga0075029_101149016    Not Available    541
3300006059|Ga0075017_101307265    All Organisms → cellular organisms → Bacteria    569
3300006086|Ga0075019_10150106    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1362
3300006102|Ga0075015_100149480    All Organisms → cellular organisms → Bacteria    1214
3300006102|Ga0075015_100216856    All Organisms → cellular organisms → Bacteria    1025
3300006102|Ga0075015_100389296    All Organisms → cellular organisms → Bacteria    785
3300006172|Ga0075018_10682355    Not Available    554
3300006174|Ga0075014_100243340    All Organisms → cellular organisms → Bacteria    926
3300006174|Ga0075014_100370171    All Organisms → cellular organisms → Bacteria    774
3300006237|Ga0097621_101928705    Not Available    564
3300007788|Ga0099795_10135769    All Organisms → cellular organisms → Bacteria    996
3300009029|Ga0066793_10408821    All Organisms → cellular organisms → Bacteria    779
3300009038|Ga0099829_11685228    Not Available    522
3300009090|Ga0099827_10425374    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1137
3300009523|Ga0116221_1012033    All Organisms → cellular organisms → Bacteria    4948
3300009672|Ga0116215_1059832    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1722
3300009683|Ga0116224_10295806    All Organisms → cellular organisms → Bacteria    770
3300010159|Ga0099796_10313745    All Organisms → cellular organisms → Bacteria    667
3300011120|Ga0150983_10128616    Not Available    520
3300011120|Ga0150983_15064082    All Organisms → cellular organisms → Bacteria    714
3300011271|Ga0137393_11284797    All Organisms → cellular organisms → Bacteria    620
3300012202|Ga0137363_10618409    All Organisms → cellular organisms → Bacteria    915
3300012202|Ga0137363_10823886    All Organisms → cellular organisms → Bacteria    787
3300012203|Ga0137399_11130789    All Organisms → cellular organisms → Bacteria    659
3300012206|Ga0137380_10031921    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    4856
3300012210|Ga0137378_10496667    All Organisms → cellular organisms → Bacteria    1127
3300012363|Ga0137390_10856073    Not Available    865
3300012918|Ga0137396_11319850    Not Available    500
3300012927|Ga0137416_11290841    Not Available    659
3300012931|Ga0153915_10796607    All Organisms → cellular organisms → Bacteria    1095
3300012944|Ga0137410_10033891    All Organisms → cellular organisms → Bacteria    3585
3300014168|Ga0181534_10141030    All Organisms → cellular organisms → Bacteria    1230
3300014200|Ga0181526_10832360    Not Available    581
3300014657|Ga0181522_10315469    All Organisms → cellular organisms → Bacteria    929
3300014968|Ga0157379_10410713    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1245
3300015052|Ga0137411_1334664    All Organisms → cellular organisms → Bacteria    1675
3300015054|Ga0137420_1093300    Not Available    613
3300015264|Ga0137403_10324553    All Organisms → cellular organisms → Bacteria    1431
3300015371|Ga0132258_12801792    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1214
3300017821|Ga0187812_1196124    Not Available    646
3300017822|Ga0187802_10104216    All Organisms → cellular organisms → Bacteria    1068
3300017823|Ga0187818_10045249    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1889
3300017933|Ga0187801_10026471    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2009
3300017936|Ga0187821_10169746    All Organisms → cellular organisms → Bacteria    830
3300017936|Ga0187821_10254041    All Organisms → cellular organisms → Bacteria    687
3300017943|Ga0187819_10790150    Not Available    533
3300018006|Ga0187804_10592927    Not Available    503
3300018012|Ga0187810_10270853    All Organisms → cellular organisms → Bacteria    699
3300018042|Ga0187871_10286289    All Organisms → cellular organisms → Bacteria    913
3300020580|Ga0210403_10668539    All Organisms → cellular organisms → Bacteria    834
3300020581|Ga0210399_10129851    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2074
3300020581|Ga0210399_11275353    Not Available    580
3300020583|Ga0210401_10064651    All Organisms → cellular organisms → Bacteria → Acidobacteria    3445
3300020583|Ga0210401_10542685    All Organisms → cellular organisms → Bacteria    1025
3300021086|Ga0179596_10117271    All Organisms → cellular organisms → Bacteria    1232
3300021086|Ga0179596_10227221    All Organisms → cellular organisms → Bacteria    916
3300021088|Ga0210404_10638622    Not Available    606
3300021180|Ga0210396_10877060    All Organisms → cellular organisms → Bacteria    766
3300021181|Ga0210388_11355389    All Organisms → cellular organisms → Bacteria    599
3300021404|Ga0210389_11469784    Not Available    519
3300021406|Ga0210386_11176755    All Organisms → cellular organisms → Bacteria    649
3300021432|Ga0210384_10159413    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2025
3300021474|Ga0210390_10723585    All Organisms → cellular organisms → Bacteria    828
3300021478|Ga0210402_11719407    Not Available    554
3300021559|Ga0210409_10178744    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1938
3300021559|Ga0210409_10883884    All Organisms → cellular organisms → Bacteria    767
3300022532|Ga0242655_10251254    All Organisms → cellular organisms → Bacteria    560
3300022712|Ga0242653_1011424    All Organisms → cellular organisms → Bacteria    1113
3300022724|Ga0242665_10096688    All Organisms → cellular organisms → Bacteria    870
3300024254|Ga0247661_1098843    Not Available    552
3300024330|Ga0137417_1197191    All Organisms → cellular organisms → Bacteria    1256
3300025913|Ga0207695_10109743    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2740
3300026359|Ga0257163_1048796    All Organisms → cellular organisms → Bacteria    674
3300027570|Ga0208043_1052978    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1181
3300027629|Ga0209422_1091666    All Organisms → cellular organisms → Bacteria    705
3300027681|Ga0208991_1197940    All Organisms → cellular organisms → Bacteria    584
3300027738|Ga0208989_10106779    All Organisms → cellular organisms → Bacteria    952
3300027857|Ga0209166_10495663    Not Available    628
3300027862|Ga0209701_10618341    Not Available    571
3300027911|Ga0209698_10246222    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1429
3300028047|Ga0209526_10021125    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    4545
3300028047|Ga0209526_10777081    All Organisms → cellular organisms → Bacteria    595
3300030991|Ga0073994_12067634    All Organisms → cellular organisms → Bacteria    878
3300031234|Ga0302325_11109587    All Organisms → cellular organisms → Bacteria    1065
3300031234|Ga0302325_12993878    Not Available    546
3300031235|Ga0265330_10192064    All Organisms → cellular organisms → Bacteria    864
3300031235|Ga0265330_10399615    Not Available    581
3300031239|Ga0265328_10245810    All Organisms → cellular organisms → Bacteria    684
3300031239|Ga0265328_10379514    Not Available    553
3300031239|Ga0265328_10383518    Not Available    550
3300031590|Ga0307483_1041559    Not Available    507
3300031663|Ga0307484_102534    All Organisms → cellular organisms → Bacteria    998
3300031672|Ga0307373_10017367    All Organisms → cellular organisms → Bacteria    9535
3300031708|Ga0310686_106370075    All Organisms → cellular organisms → Bacteria    579
3300031708|Ga0310686_114245231    Not Available    571
3300031715|Ga0307476_10064173    All Organisms → cellular organisms → Bacteria    2531
3300031715|Ga0307476_10641849    All Organisms → cellular organisms → Bacteria    787
3300031720|Ga0307469_10819692    All Organisms → cellular organisms → Bacteria    855
3300031754|Ga0307475_11465790    Not Available    524
3300031823|Ga0307478_10681008    All Organisms → cellular organisms → Bacteria    861
3300031962|Ga0307479_10014035    All Organisms → cellular organisms → Bacteria    7512
3300031962|Ga0307479_11476299    All Organisms → cellular organisms → Bacteria    637
3300031962|Ga0307479_11968102    Not Available    534
3300032174|Ga0307470_11101152    All Organisms → cellular organisms → Bacteria    639
3300032515|Ga0348332_10992061    Not Available    659
3300032515|Ga0348332_14053210    All Organisms → cellular organisms → Bacteria    667
3300032805|Ga0335078_10534903    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1496
3300034282|Ga0370492_0261777    Not Available    702
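
Scaffold identifiers in the table above have the form `<taxon OID>|<scaffold name>`, where the part before the pipe matches a Taxon OID in the Associated Samples table. The following is a minimal sketch of how such rows might be split and summarized; the function name, the `ROWS` sample, and the use of the 2000 bp cutoff (borrowed from the Quality Assessment block) are illustrative choices, not part of the database.

```python
from collections import Counter

# A few rows copied from the Associated Scaffolds table: (scaffold ID, taxonomy, length in bp).
ROWS = [
    ("3300001661|JGI12053J15887_10570830", "Not Available", 538),
    ("3300006102|Ga0075015_100149480", "All Organisms → cellular organisms → Bacteria", 1214),
    ("3300006102|Ga0075015_100216856", "All Organisms → cellular organisms → Bacteria", 1025),
    ("3300031672|Ga0307373_10017367", "All Organisms → cellular organisms → Bacteria", 9535),
]

SHORT_SCAFFOLD_BP = 2000  # "short scaffold" cutoff quoted in the Quality Assessment block


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, name = scaffold_id.partition("|")
    if not sep:
        raise ValueError(f"no '|' separator in {scaffold_id!r}")
    return taxon_oid, name


# Count scaffolds per sample (taxon OID) and list the short ones.
scaffolds_per_sample = Counter(split_scaffold_id(sid)[0] for sid, _, _ in ROWS)
short_scaffolds = [sid for sid, _, length in ROWS if length < SHORT_SCAFFOLD_BP]

print(scaffolds_per_sample)
print(len(short_scaffolds), "of", len(ROWS), "example rows are shorter than", SHORT_SCAFFOLD_BP, "bp")
```

Running the same grouping over all 116 rows would show which samples contribute more than one scaffold to the family.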




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    18.10%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    18.10%
Hardwood Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil    9.48%
Watersheds    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds    8.62%
Freshwater Sediment    Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment    7.76%
Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil    7.76%
Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere    4.31%
Peatlands Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil    3.45%
Bog    Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog    2.59%
Surface Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil    2.59%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    1.72%
Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil    1.72%
Palsa    Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa    1.72%
Plant Litter    Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter    1.72%
Peatland    Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland    0.86%
Bog Forest Soil    Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil    0.86%
Freshwater Wetlands    Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands    0.86%
Natural And Restored Wetlands    Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands    0.86%
Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil    0.86%
Untreated Peat Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil    0.86%
Prmafrost Soil    Environmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil    0.86%
Soil    Environmental → Terrestrial → Soil → Clay → Unclassified → Soil    0.86%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere    0.86%
Miscanthus Rhizosphere    Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere    0.86%
Corn Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere    0.86%
Switchgrass Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere    0.86%




Associated Samples

Taxon OID    Sample Name    Habitat Type
3300001661    Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)    Environmental
3300002245    Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)    Environmental
3300004020    Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2    Environmental
3300004152    Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3    Environmental
3300005534    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1    Environmental
3300005541    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1    Environmental
3300005610    Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3    Environmental
3300005921    Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6    Environmental
3300006052    Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013    Environmental
3300006059    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012    Environmental
3300006086    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013    Environmental
3300006102    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013    Environmental
3300006172    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014    Environmental
3300006174    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014    Environmental
3300006237    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)    Host-Associated
3300007788    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2    Environmental
3300009029    Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 1 DNA2013-189    Environmental
3300009038    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG    Environmental
3300009090    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG    Environmental
3300009523    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_8_FC metaG    Environmental
3300009672    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_2_FS metaG    Environmental
3300009683    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_b_LC metaG    Environmental
3300010159    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3    Environmental
3300011120    Combined assembly of Microbial Forest Soil metaT    Environmental
3300011271    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG    Environmental
3300012202    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG    Environmental
3300012203    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG    Environmental
3300012206    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG    Environmental
3300012210    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG    Environmental
3300012363    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG    Environmental
3300012918    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG    Environmental
3300012927    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)    Environmental
3300012931    Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG    Environmental
3300012944    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)    Environmental
3300014168    Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_10_metaG    Environmental
3300014200    Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_30_metaG    Environmental
3300014657    Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_10_metaG    Environmental
3300014968    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG    Host-Associated
3300015052    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)    Environmental
3300015054    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)    Environmental
3300015264    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)    Environmental
3300015371    Combined assembly of cpr5 and col0 rhizosphere and soil    Host-Associated
3300017821    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW-S_2    Environmental
3300017822    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2    Environmental
3300017823    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3    Environmental
3300017933    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1    Environmental
3300017936    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1    Environmental
3300017943    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4    Environmental
3300018006    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4    Environmental
3300018012    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5    Environmental
3300018042    Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_10    Environmental
3300020580    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M    Environmental
3300020581    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M    Environmental
3300020583    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M    Environmental
3300021086    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)    Environmental
3300021088    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M    Environmental
3300021180    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O    Environmental
3300021181    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O    Environmental
3300021404    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O    Environmental
3300021406    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O    Environmental
3300021432    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M    Environmental
3300021474    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O    Environmental
3300021478    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M    Environmental
3300021559    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M    Environmental
3300022532    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022712    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-M (Metagenome Metatranscriptome) (v2)    Environmental
3300022724    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)    Environmental
3300024254    Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK02    Environmental
3300024330    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)    Environmental
3300025913    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)    Host-Associated
3300026359    Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-A    Environmental
3300027570    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaG (SPAdes)    Environmental
3300027629    Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M3 (SPAdes)    Environmental
3300027681    Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)    Environmental
3300027738    Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)    Environmental
3300027857    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)    Environmental
3300027862    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)    Environmental
3300027911    Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes)    Environmental
3300028047    Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)    Environmental
3300030991    Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome)    Environmental
3300031234    Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2    Environmental
3300031235    Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-19-19 metaG    Host-Associated
3300031239    Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-16-24 metaG    Host-Associated
3300031590    Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 (Metagenome Metatranscriptome)    Environmental
3300031663    Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 (Metagenome Metatranscriptome)    Environmental
3300031672    Soil microbial communities from Risofladan, Vaasa, Finland - OX-2    Environmental
3300031708    FICUS49499 Metagenome Czech Republic combined assembly    Environmental
3300031715    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05    Environmental
3300031720    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515    Environmental
3300031754    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515    Environmental
3300031823    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05    Environmental
3300031962    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515    Environmental
3300032174    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05    Environmental
3300032515    FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)    Environmental
3300032805    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2    Environmental
3300034282    Peat soil microbial communities from wetlands in Alaska, United States - Eight_mile_03D_16    Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12053J15887_105708301 | 3300001661 | Forest Soil | MSAAADILTGSVVKTTEKRVSGWAQLTRLFPYVRRHKGEVLLGMLTQTGMGITGTLLPLIIGAIVDCIKGAAAPLAQLGRLTQISLGFLLPYYQPKNPHTLAIFCSALII
JGIcombinedJ26739_1010366541 | 3300002245 | Forest Soil | MSAAANTLSGSVPKTARKPVSGWAQLMRLLPYVTRHKGEIVIGFITQAGMGITGTLLPLILGVITDCLKGAEVPLAQLGHLTQIALGPL
Ga0055440_100623891 | 3300004020 | Natural And Restored Wetlands | MSATANTLNESAVTAPSKPVAGWRNLSRLIPYVAGHKGEVTLGMLTQAGMGITGTLLPLIIGVIVDCVRGTPVPLAQLGRLTHVALGFLLPY
Ga0062386_1001551053 | 3300004152 | Bog Forest Soil | MSAAANTIPGTLPKAAPKRVSGWAQLMRLMPYVARHKTEVVVGFITQVGMGISGTLIPLLLGVITDCIKGAETPLAQLGRLTQITLGPMLPYYHAKD
Ga0070735_104727831 | 3300005534 | Surface Soil | MSAAAHTISSSVPKNKAARVSGWAQLTRLLPYVARHKGEVFLGMLTQIGMGITGTLLPLMIGVVVDCLKGAAAPLEQLGRLTQISLGFLLPYYHPRSAQTLMVFC
Ga0070733_104078862 | 3300005541 | Surface Soil | MSAAANTLTGSVPKTAHKRASGWAQIIRLYPYVARHKGEVLLGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPR
Ga0070763_103963111 | 3300005610 | Soil | MSTAANTLAGSAVKGTHTRVSGYAQLTRLYPYVRRHTGEVILGMITQTGMGITGTLLPLIIGAIVDCIKGAESPLAQLGRLTQISLGFL
Ga0070766_106838471 | 3300005921 | Soil | MSAAANTLSGSVPKAAQKRVSGWAQLMRLAPYVSRHKLEVVIGFITQAGMGITGTLLPLILGVITDCLKGADTPLAQLGRLTQIALGPLLPYYHPK
Ga0075029_1011490161 | 3300006052 | Watersheds | MTTAANTLSASEPKVAPTRVSGWAHLSRLYPYVMRHKTEVVIGMITQVGMGITGTLLPLLLGVITDCIKGAETPLAQLGRLTQISLGYLLPYYHPKDP
Ga0075017_1013072651 | 3300006059 | Watersheds | MSAAANTLENPAVNTAPRRVSGWGHLARLLPYVARHKTEVLIGLVTQTGMGITGTLLPLILGVITDCIKGAETPLAQLGRLTQISLGYLLPYYHPKDPRTLAVFASALVIVCAVQGFFSYCTRQILIGLSRDIEFDLRND
Ga0075019_101501061 | 3300006086 | Watersheds | MSAAANTISGSIAKTAHKRASGWAQLMRLLPYVARHKTEVVIGFITQAGMGITGTLLPLLLGVITDCIKGAETPLAQLGRLTHITLGPLLPYYH
Ga0075015_1001494803 | 3300006102 | Watersheds | MSSAANTISGSIAKTAHKRVSGWAQLMRLLPYVARHKTEVVIGFITQAGMGITGTLLPLLLGVITDCIKGAETP
Ga0075015_1002168561 | 3300006102 | Watersheds | MSAAANTLSGSLPRTAHRRVSGWAQLTRLYPYVARHKLEVVIGFITQAGMGITGTLLPLILGVITDCIKGAETPLAQLGRLTQITLGFLLPYYHPKDPHTLAVFCSALVIICAVQGVFSYCTRQILIGL
Ga0075015_1003892961 | 3300006102 | Watersheds | MSAAAQTFANPVVKQAPKRVSGWAQLARLLPYVSRHKTEVAIGMVTQIGMGITGTLLPLIMGAIVDCIKGAAAPLEQLGQLARISLGFLLPYYHPKSPHTLAVFCSALIVICAIQGVF
Ga0075018_106823551 | 3300006172 | Watersheds | MSAAANTLSGSVPKTAQKRVSGWAQLTRLFPYVARHKTEVLIGFVTQAGMGITGTLLPLLLGAITDCLKGAEVPLAQLGRLTQIALGPLLPYYHPKDPH
Ga0075014_1002433402 | 3300006174 | Watersheds | MSAAANILSGSVPKTAHKRVSGWAQLTRLYPYVARHKLEVVIGFITQAGMGITGTLLPLILGVITDCIKGAETPLAQLGR
Ga0075014_1003701711 | 3300006174 | Watersheds | MSAAANTLSGSLPKTAHRRVSGWAQLTRLSPYVARHKLEVVIGFITQAGMGITGTLLPLILGVITDCLTGAETPLAQLGRLTHIALGPLLPYYHPKDPHTLAVF
Ga0097621_1019287051 | 3300006237 | Miscanthus Rhizosphere | MSAAANTLSGTAPATAHKRVSGWAQLTRLYPYVARHKGEVLVGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRL
Ga0099795_101357691 | 3300007788 | Vadose Zone Soil | MSAAANTLAGSAVKSTPKNASGWSQLMRLYPYVRRHTGEVILGMVTQTGMGITGTLLPLMIGAIVDCIKGAAAPLAQLGRLTQISLGFLLPYYHPKSAQTLAIFC
Ga0066793_104088211 | 3300009029 | Permafrost Soil | MSAAANTLSGSAPKTAHKRVSGWAQLTRLYPYVARHKGEVVLGLFTQIGMGITGTLLPLLIGAIVDCIKGAEAPLAQ
Ga0099829_116852281 | 3300009038 | Vadose Zone Soil | MSAATSTWPGAAAKTRRKPVSGWGHLSRLLPYFRGHGGEIILGLLAQAVMGITGTLLPLLVGAVVDCIKGAEAPLVQLGRLTRLSLGFLLPYYHPRSAQTIEIFCTAL
Ga0099827_104253743 | 3300009090 | Vadose Zone Soil | MSAAANTWPGAAAKTRRKPVSGWGHLSRLLPYFRGHGGEIILGLLAQAGMGITGTLLPLLVGAVVDCIKGAEAPLVQLGRLTRL
Ga0116221_10120331 | 3300009523 | Peatlands Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKTEVVIGFITQAGMGITGTLLPLILGVITDCIKGAEVPLAQLGRLTHITLGFLLPYYHPKDPHTLAVFCSALVIICAVQGVFSYCTRQILIGLSRDIE
Ga0116215_10598321 | 3300009672 | Peatlands Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKAEVLVGMFTQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQ
Ga0116224_102958061 | 3300009683 | Peatlands Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKAEVVIGFITQAGMGITGTLLPLILGVITDCLNGAQTPLAQLGRLTQIALGPLLPYYHPK
Ga0099796_103137452 | 3300010159 | Vadose Zone Soil | MSAAANTLANPAVKKAHKSVSGWAHLTRLLPYVSRHKTGVLVGLVTQTGMGITGTLLPLILGAITDCIKGAETPLAQLGRLTQISLGFLLPYYHPKDPHTLAVFASALVIVCAIQGFFS
Ga0150983_101286162 | 3300011120 | Forest Soil | MSAAANTFSGAVPKRVTGWAQLKRLFPYVARHKTEVVIGFITQAGMGITGTLLPLILGVITDCLKGADVPLAQLGRLTQIALGPLLPYYHPKSAHTLAVFCSALVI
Ga0150983_150640821 | 3300011120 | Forest Soil | MSAAAANTLSGSVPKAAHKRVSGWSQLMRLTPYVTRHKLEVAIGFITQAGMGITGTLLPLLLGAIADCLNGADVPLAQLGFLTHYTLGWLLPYYHPKDPRT
Ga0137393_112847972 | 3300011271 | Vadose Zone Soil | MSAAANTLSGSVPKATHKRVSGWAQLMRLSPYVARHKVEVLIGFITQAGMGITGTLLPLILGAIADCIKGAETPLAQLGRLTH
Ga0137363_106184091 | 3300012202 | Vadose Zone Soil | MSAAANTLSGSVPKATHKRVSGWAQLMRLSPYVARHKTEVLIGFITQAGMGITGTLLPLILGAIADCIKGAETPLAQLGRLTH
Ga0137363_108238862 | 3300012202 | Vadose Zone Soil | MSAAANTLAGSAAKTTHKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLT
Ga0137399_111307891 | 3300012203 | Vadose Zone Soil | MSAAANTLAGSAAKTKHKRVSGWAQLTRLYPYVRRHKGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKNAHTLAIFCSALIIVCAIQGIFSY
Ga0137380_100319211 | 3300012206 | Vadose Zone Soil | MSATANTLASSAVKTAQKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLIIGVIVDCIKGAAAPLAQLGRLTQISLGFLLPYYHPKSAHTLAIFCSALIIICTIQGIFSYAT
Ga0137378_104966672 | 3300012210 | Vadose Zone Soil | MSAAANTLAGSAAKTKHKRVSGWAQLTRLYPYVRRHKGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKAAEAPLAQLGRLTQISLGFLLPFYHPKDAHTVAIFCSALIIVCAIQGIFSYATRQIL
Ga0137390_108560732 | 3300012363 | Vadose Zone Soil | MSATANTWPGAAAKTRRKPVSGWGHLSRLLPYFRGHGGEIILGLLAQAGMGITGTLLPLLVGAVVDCIKGAEAPLVQLGRLTR
Ga0137396_113198502 | 3300012918 | Vadose Zone Soil | MSAAANTLAGSAAKTTHKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKNAH
Ga0137416_112908411 | 3300012927 | Vadose Zone Soil | MSAAANTLAGSAAKTTHKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKNAHTLAIFCSALIIVCAIQGIFSYATRQILIG
Ga0153915_107966071 | 3300012931 | Freshwater Wetlands | MSAAANILTGSAAKTAQKPVSGWAQLTRLLPYVKRHRGEVLTGMLTQAGMGITGTLLPLIIGALVDCIKGAEVPLAQLGRLTQISLGFLLPYYHPRNTQTLAVFCLALIVICAIQGI
Ga0137410_100338911 | 3300012944 | Vadose Zone Soil | MSAAANTLAGSAVKSTPKNASGWSQLMRLYPYVRRHTGEVILGMVTQTGMGITGTLLPLMIGAIVDCIKGAAAPLAQLGR
Ga0181534_101410303 | 3300014168 | Bog | MSAATQTLPGTLPKAAPKRVSGWAQLTRLFPYVAKHKLEVAIGFVTQAGMGITGTLLPLILGAITDCIKGAETPLAQLGRLTQIALGPLLPYYHPKSS
Ga0181526_108323601 | 3300014200 | Bog | MSAAANTFTGTAPKAASKRISGWSQLSRLRPYVARHKTEVVIGMLTQVGMGVTGTLLPLIMGAIVDCIKGAAVPLAQLGRLAQISLGLLLPYYHPKSSGTLAVYCSTLIVI
Ga0181522_103154692 | 3300014657 | Bog | MSAAANTFAGTTATTTQKRVSGWAQLMRLRPYVAKHKAEVFVGMVTQIGMGITGTLLPLLIGAIVDCIKGAAVPLAQLGRLAQISLGFLLPYYHPKNSHTLAIYCSALIIICAI
Ga0157379_104107133 | 3300014968 | Switchgrass Rhizosphere | MSAAANTLSGTAPATAHKRVSGWAQLTRLYPYVARHKAEVLVGLITQIGMGITGTLLPLIIGAIVDCIKGAAAPLVQLGRLTHISLGFLLPYYHPRSAHTLEIFCA
Ga0137411_13346642 | 3300015052 | Vadose Zone Soil | MSAAANTLAGSAAKTKHKRVSGWAQLTRLYPYVRRHKGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPPLRSSAG*
Ga0137420_10933002 | 3300015054 | Vadose Zone Soil | MSAAANTLAGSAAKTKHKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKNAHTLAIFCSALIIVCAIQGIFSYATRQIL
Ga0137403_103245532 | 3300015264 | Vadose Zone Soil | MSAAANTLAGSAAKTKHKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKDAHTLAIFC*
Ga0132258_128017921 | 3300015371 | Arabidopsis Rhizosphere | MSAAANTLSGTAPATAHKRVSGWAQLTRLYPYVARHKGEVLVGLITQIGMGISGTMLPLIIGAIVDCIKGAEAPLAQLGRLTHI
Ga0187812_11961242 | 3300017821 | Freshwater Sediment | MSAAANTLSGSIPKTAHKRVSGWAQLTRLYPYVARHKAEVFIGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPRNAHTLEIFCSALILICALQGIFSYCTRQILIGLSRDI
Ga0187802_101042161 | 3300017822 | Freshwater Sediment | MSAAANTLSGAVPKTAHKRVSGWAQLTRLYPYVARHKAEVFIGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPRNAHTLEIFCSALILICALQGIFSYCTRQILIG
Ga0187818_100452491 | 3300017823 | Freshwater Sediment | MSAAATTLSGSAPKAAQRRVSGWAQLTRLYPYVARHKTEVVIGFITQAGMGITGTLLPLILGVVTDCLEGAQTPLAQLGHLTQIALGPLLPYYHPKDPRTVAVFCVALVIVCAVQGVFSY
Ga0187801_100264714 | 3300017933 | Freshwater Sediment | MSAAANTLSGSIPKTAHKRVSGWAQLTRLYPYVARHKAEVFIGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLA
Ga0187821_101697462 | 3300017936 | Freshwater Sediment | MSAATQTLSGAAPKAAPKRASGFAQLLRLWPYVTRHKLEVSIGFVTQACMGITGTLLPLILGAITDCIQGAETPLAQLGRLTHVTLGFLLPYYHAKDPRTLAVFCSALVIICAI
Ga0187821_102540412 | 3300017936 | Freshwater Sediment | MSAAANTFPGAVPKTAHKRVSGWAQLTRLYPYVARHKAEVIIGLVTQTGMGITGTLIPLILGVITDSLEGAQVPLAQLGHLRQ
Ga0187819_107901501 | 3300017943 | Freshwater Sediment | MSAAANTLSGSIPKTAHKRVSGWAQLTRLYPYVARHKAEVFIGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGF
Ga0187804_105929271 | 3300018006 | Freshwater Sediment | MSAAANTLSGAVPKTAHKRVSGWAQLTRLYPYVARHKAEVFIGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPRNAHTLEIFCSALILICALQGIFSYCT
Ga0187810_102708532 | 3300018012 | Freshwater Sediment | MSAAANTLSGSVPKTAHKRVSGWAQLTRLYPYVARHKAEVFIGLITQIGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPRNAHTLEIFCSALILICALQGIFSY
Ga0187871_102862891 | 3300018042 | Peatland | MSAAANTFTGTAPKAASKRISGWSQLSRLRPYVARHKTEVVIGMLTQVGMGVTGTLLPLIMGAIVDCIKGAE
Ga0210403_106685392 | 3300020580 | Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKAEVVIGFITQAGMGITGTLLPLILGVITDCIKGAQTPLAQLGRLTRITLGFLLPYY
Ga0210399_101298511 | 3300020581 | Soil | MSAAAQTLSGSAPKAAQKRVSGWAQLKRLMPYVARHKSEVVIGFITQAGMGITGTLLPLILGVITDCIKGAETPLAQLGHLTQIALGPLLPYYHPKDPHTLAVFCSS
Ga0210399_112753532 | 3300020581 | Soil | MSATANTLESPAVKTTPKRVSGWAQLMRLRPYLGRHKGEIVIGIITQIGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPRNAQTLAIFCSALIIMCAIQGVFSYATRQ
Ga0210401_100646511 | 3300020583 | Soil | MSAAANTLSGSVPKTTHKRVSGWAQLMRLSPYVTRHKLEVLIGFITQAGMGITGTLLPLILGAIADCIKGAETPLAQLGRLTHITLGFLLP
Ga0210401_105426851 | 3300020583 | Soil | MSAAAANTLSGSVPKAAHKRVSGWSQLMRLTPYVTRHKLEVAIGFITQAGMGITGTLLPLLLGAIADCLNGADVPLAQLGFLTHYTLGWLLPYYHPKDPRTVAIFCTALVVVCAVQGVFSYLTR
Ga0179596_101172711 | 3300021086 | Vadose Zone Soil | MSAAANTLAGSAAKTTHKRVSGWAQLTRLYPYVRRHRGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKNAHTLAIFCSALIIVCAIQGIFSYATRQILIGL
Ga0179596_102272212 | 3300021086 | Vadose Zone Soil | MSAAANTLSGSVPKAAHKRVSGWAQLTRLFPYVARHKTEVLIGFVTQAGMGITGTLLPLILGVITDCIKGAETPLAQLGRLTQIALGPLLPYYHPKD
Ga0210404_106386222 | 3300021088 | Soil | MSAAANTLSGSVPKAPHKRVSGWAQLMRLSPYVTRHKLEVLIGFITQAGMGITGTLLPLILGAIADCIKGAETPLAQLGRLTHITLGFLLPYYHPKDPHTLAVFCSALVIVCAVQGVFSYCTRQILIGL
Ga0210396_108770602 | 3300021180 | Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKAEVVIGFITQAGMGITGTLLPLILGVITDCIKGAQTPLAQLGRLTRITL
Ga0210388_113553892 | 3300021181 | Soil | MSAATQTLPGTLPKTAPKRVSGWAQLTRLYPYVAKHKLEVAIGFVTQAGMGITGTLLPLLLGAITDCIKGAETPLAQLGRLTQISLGPLLPY
Ga0210389_114697841 | 3300021404 | Soil | MSTAANTISVSEPKTAPKRVSGWAHLTRLFPYVMRHKTEVAIGFITQAGMGITGTLLPLMLGVITDCIKGAATPLAQLGRLTQITLGPLLP
Ga0210386_111767551 | 3300021406 | Soil | MSTAANTISVSEPKTAPKRVSGWAHLTRLFPYVMRHKTEVAIGFITQAGMGITGTLLPLMLGVITDCIKGAATPLA
Ga0210384_101594131 | 3300021432 | Soil | MSATANTLESPAVKTTPKRVSGWAQLMRLRPYLGRHKGEIVIGIITQIGMGITGTLLPLMIGAIVDCIKGAEAPLAQL
Ga0210390_107235852 | 3300021474 | Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKAEVVIGFITQAGMGITGTLLPLILGVITDCIKGAQTPLAQLGRLTRITLGFLLPYYHPKDPHTLAVFCSALVIVCAVQGVFSYSTRQILIGL
Ga0210402_117194071 | 3300021478 | Soil | MSAAANTLSGSVSRTAPKRASGWAQLMRLSPYVARHKGELLVGLVTQIGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTHISLGFLLPYYHPRNAQTLEIFCTALIVVCALQGFFSY
Ga0210409_101787441 | 3300021559 | Soil | MSAAANTLSGSVPKTATKRVSGWAQLTRLFPYVARHKTEVAIGFVTQAGMGITGTLLPLILGVITDCLKGAETPLAQLGRLTHIALGPLLPYYHPKDP
Ga0210409_108838841 | 3300021559 | Soil | MSAAAQTLSGSAPKAAQKRVSGWAQLKRLMPYVARHKSEVVIGFLTQAGMGITGTLLPLILGVITDCIKGAQTPLAQLGRLTQIALGPL
Ga0242655_102512541 | 3300022532 | Soil | MSAAAQTLSGSAPKAAQKGVSGWAQLKRLMPYVARHKSEVVIGFITQAGMGITGTLLPLILGVITDCIKGAETPL
Ga0242653_10114243 | 3300022712 | Soil | MSTAAQTLSGSAPKAAQKGVSGWAQLKRLMPYVARHKSEVVIGFITQAGMGITGTLLPLILGVITDCIKGAETPLAQLGRLTQIALGPLLPYYHAK
Ga0242665_100966881 | 3300022724 | Soil | MSAAANTLSGSVPKTAHKRVSGWAQLMRLRPYVTRHKTEVLIGFITQAGMGITGTLLPLILGVITDCLTGAETPLAQLGRLTRIALGPLL
Ga0247661_10988432 | 3300024254 | Soil | MSAAANTLSGTAPATAHKRVSGWAQLTRLYPYVARHKAEVLVGLITQIGMGITGTLLPLIIGAIVDCIKGAAAPLVQLGRLTHISLGFLLPYYHPRNAHTLEIFCAA
Ga0137417_11971911 | 3300024330 | Vadose Zone Soil | MSAAANTLAGSAAKTTHKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKNAHTLAIFCSALIIVFVCAI
Ga0207695_101097435 | 3300025913 | Corn Rhizosphere | MSAAAQTMTGSAAKTSTRRVSGWSQLTRLFPYVARHKGEVVLGMLTQIGMGVTGTLLPLLIGAIVDCIKGAAAPLAQLGRLTEISLGFLLPYYHP
Ga0257163_10487962 | 3300026359 | Soil | MSAAANTLAGSAAKTKHKRVSGWAQLTRLYPYVRRHKGEVILGMLTQTGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYH
Ga0208043_10529781 | 3300027570 | Peatlands Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKAEVVIGFITQAGMGITGTLLPLILGVITDCIKGAQTPLAQL
Ga0209422_10916662 | 3300027629 | Forest Soil | MSAAANTLSGSVPKTAHKRVSGWSQLTRLYPYVARHKAEVVIGFITQAGMGITGTLLPLILGVITDCIKGAQTPLAQ
Ga0208991_11979401 | 3300027681 | Forest Soil | MSAAADILTGSVVKTTEKRVSGWAQLTRLFPYVRRHKGEVLLGMLTQTGMGITGTLLPLIIGAIVDCIKGAAAPLAQLGRLTQISLGFLLPYYQPKNPHT
Ga0208989_101067791 | 3300027738 | Forest Soil | MSAAADILTGSVVKTTKKRVSGWAQLTRLFPYVRRHKGEVLLGMLTQTGMGITGTLLPLIIGAIVDCIKGAAAPLAQLGRLTQISLGFLLPYYQPKNPHTLAIFCSALII
Ga0209166_104956631 | 3300027857 | Surface Soil | MSAAANTLSGSVSTTAPKRASGWAQLMRLSPYVARHKSELLIGLVTQIGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTHISLGFLLPYYHPRSAQTLEIFCTALIV
Ga0209701_106183411 | 3300027862 | Vadose Zone Soil | MSAATNTWAGAAAKTRHKPVSGWGHLSRLLPYFRGHGGEIILGLLAQAGMGITGTLLPLLVGAVVDCIKGAEAPLVQLGRLTRLSLGFLLPYYHPRSAQTI
Ga0209698_102462223 | 3300027911 | Watersheds | MSAAANTLSGSLPKTAHRRVSGWAQLTRLSPYVARHKLEVVIGFITQAGMGITGTLLPLILGVITDCIKGAQTPLAQLGRLTQITLGFLLPYYHPKDPHTLAVFCSALVIVCAV
Ga0209526_100211255 | 3300028047 | Forest Soil | MSAAANTLAGSAVKTTPKRVSGWAQLTRLYPYVRRHTGEVFLGMVTQTGMGITGTLLPLIIGVIVDCIKGAEAPLAQLGRLTQISLGFLLP
Ga0209526_107770811 | 3300028047 | Forest Soil | MSAAANTLSGSVPKATHKRVSGWAQLMRLSPYVARHKVEVLIGFITQAGMGITGTLLPLILGAIADCIKGAETPLAQLGR
Ga0073994_120676342 | 3300030991 | Soil | MSAAANTLAGSAVKSTPKRVSGYAQLTRLYPYVRRHTGEVIVGMVTQTGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHP
Ga0302325_111095873 | 3300031234 | Palsa | MSASASTFAGTAPKMAHKRISGWAQLSRLGPYVARHKTEVVIGMVTQIGMGVAGTLLPLIIGAIVDCIKGAAAPLAQLGRLTQI
Ga0302325_129938781 | 3300031234 | Palsa | MSAATQTLPGTLPKAAPKRVSGWAQLTRLYPYVAKHKLEVAIGFVTQAGMGITGTLLPLLLGAITDCIKGAETPLAQLGRLTQIALGPLLPYYHPKDPHTLAV
Ga0265330_101920641 | 3300031235 | Rhizosphere | MSAAANTFAGTPAKTAPKHVSGWAQLMRLRPYVRRHTGEVFLGMLTQIGMGIAGTLLPLLIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKSATTLAIFCSALIIMCSV
Ga0265330_103996151 | 3300031235 | Rhizosphere | VSAAANTFAGTAPKAAPKRVSGWSQLSRLRPYVARHKTEVVIGMITQIGMGITGTLLPLIMGAVVDCIKGAEAPLAQLGRLAQISLGFLLPYYHPKSPQTLAIFCSALIVICAIQGVFSYATR
Ga0265328_102458101 | 3300031239 | Rhizosphere | MSAAANTFAGTPAKTAPKHVSGWAQLMRLRPYVRRHTGEVFLGMLTQIGMGIAGTLLPLLIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPKSATTLAIFCSALIIMCSVQGVFSYATRQI
Ga0265328_103795142 | 3300031239 | Rhizosphere | VSAAANTFAGTAPKAAPKRVSGWSQLSRLRPYVARHKTEVVIGMITQIGMGITGTLLPLIMGAVVDCIKGAEAPLAQLGRLAQISLGFLLPYYHPKSPQTLAIFCSALIVICAIQGVFSYATRQILIGLSR
Ga0265328_103835181 | 3300031239 | Rhizosphere | MSATVNTFAGTPAKTAPKRVSGWAQLMRLRPYVRRHTGEVFLGMLTQIGMGISGTLLPLLIGAIVDCIKGAAAPLAQLGRLTQISLGFLLPYYHPKSATTLAIFCSALIIMCAVQGVFSYATRQILIGLS
Ga0307483_10415592 | 3300031590 | Hardwood Forest Soil | MSAAANTLAGPVPKTAQRRVSGWAQLARLLPYVSRHKLEVLIGFITQAGMGITGTLLPLILGVITDCIQGAKVPLAQLGRLAQLSLGFLLPYY
Ga0307484_1025341 | 3300031663 | Hardwood Forest Soil | MSAAANTLAGPVPKTAQRRVSGWAQLARLLPYVSRHKLEVLIGFITQAGMGITGTLLPLILGVITDCIQGAKVPLAQLGRLAQLSLGFL
Ga0307373_100173671 | 3300031672 | Soil | MSAAANTLTGSGPKAAHKRVSGWAHLARLIPYVGRHRTEVLIGMATQIGMGITGTLLPLIIGAIVDCIKGADAPLAQLGRLTQISLGFLLPYYHPKSAQTLVIFCSALIAI
Ga0310686_1063700752 | 3300031708 | Soil | MSAAANTLSGSVTNTAPKRVSGWAQLTRLFPYVARHKSEVLIGFVTQAGMGITGTLLPLILGVITDCLKGAETPLAQLGRLTHIALG
Ga0310686_1142452312 | 3300031708 | Soil | MSAAANTLSGSVPKPAQKRVSGWAQLTRLFPYVARHKTEVVIGFITQAGMGITGTLLPLILGVITDSLKGAEVPLAQLGRLTQIALGPLLPYY
Ga0307476_100641731 | 3300031715 | Hardwood Forest Soil | MSAAANTLAGPVPKTAQRRVSGWAQLARLLPYVSRHKLEVLIGFITQAGMGITGTLLPLILGVITDCIQGAKVPLAQLGRLAQ
Ga0307476_106418492 | 3300031715 | Hardwood Forest Soil | MSAAAANTQSGSVPKAAHKRVSGWSQLMRLTPYVTRHKLEVAIGFITQAGMGITGTLLPLLLGAIADCLNGADVPLAQLGFLTHYTLGWLLPYYHPKDPRTVAI
Ga0307469_108196921 | 3300031720 | Hardwood Forest Soil | MSAAANTLAGSAVPTSHKRVSGWAQLTRLYPYVRRHTGEVILGMLTQTGMGITGTLLPLIIGAIVDCIKGAAAPLAQLGRLT
Ga0307475_114657901 | 3300031754 | Hardwood Forest Soil | MSATANTLESPAVKTTPKRVSGWAQLMRLRPYLGRHKGEIVIGIITQIGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPRNAQTL
Ga0307478_106810081 | 3300031823 | Hardwood Forest Soil | MSAAANTLSGSVQSSAPKRVSGWAQLMRLSPYVARHKGEVLIGLITQIGMGISGTLLPLLLGAIADCLNGADVPLA
Ga0307479_100140351 | 3300031962 | Hardwood Forest Soil | MSAAANTLASPAVKSTPKRVSGYAQLTRLYPYVRRHTGEVIVGMITQTGMGITGTLLPLIIGAIVDCIKGAEAPLAQLGRLTQISLGFLLPYYHPGSPRTLAIFCSALIIVCTIQGFFSYATRQILI
Ga0307479_114762992 | 3300031962 | Hardwood Forest Soil | MSATANTLEGPAVKTTPKRVSGWAQLMRLRPYLGRHKGEIVIGIITQIGMGITGTLLPLMIGAIVDCIKGAEAPLAQLGRL
Ga0307479_119681021 | 3300031962 | Hardwood Forest Soil | MSATAHALTSPAVKTTQKRVSGWSQLTRLYPYVRRHTGEVILGMLTQTAMGITGTLLPLIIGVIVDCIKGAEAPLAQLGRLTHISLGFLLPYYHPKSPLTLA
Ga0307470_111011521 | 3300032174 | Hardwood Forest Soil | MSAAANTLAGSAVKSTPKSASGWSQLMRLYPYVRRHTGEVILGMVTQTGMGSTGTLLPLMIGAIVDCIKGAAAPLAQLGRLTQISLGFLLPYY
Ga0348332_109920612 | 3300032515 | Plant Litter | MSAAANTISASAPKTAHKRVSGWSQLTRLYPYVARHKMEVVIGFITQAGMGITGTLLPLILGVITDCIKGAQTPLAQLGRLTQITLGFLLPYYHPKDPHTLEVFCSALVIICAVQGIFSYCTR
Ga0348332_140532102 | 3300032515 | Plant Litter | MSAAANTLSGSVAKTATKRVSGWAQLTRLFPYVARHKSEVAIGFVTQAGMGITGTLLPLILGVITDCLKGAQTPLAQ
Ga0335078_105349031 | 3300032805 | Soil | MSAAANTFASPALKARPARVSGWAQLMRLMPYVARHKGEVVIGMITQIGMGITGTLLPLIIGAIVDCLKGAETPLAQLGRLTQISLGFLLPYYHPRNALTLEVFCSALIIICAIQGVFSY
Ga0370492_0261777_412_702 | 3300034282 | Untreated Peat Soil | MSAAAQTIPGTLPKAAPKRVSGWAQLTRLYPYVAKHKLEVAIGFVTQAGMGITGTLLPLLLGAITDCIKGAETPLAQLGRLTQIALGPLLPYYHPKS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice