NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F093390

Overview

Basic Information
Family ID F093390
Family Type Metagenome
Number of Sequences 106
Average Sequence Length 114 residues
Representative Sequence MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF
Number of Associated Samples 78
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 23.58 %
% of genes near scaffold ends (potentially truncated) 25.47 %
% of genes from short scaffolds (< 2000 bps) 66.04 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.74
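
The percentages in the quality assessment can be turned back into approximate gene counts. The following is a minimal sketch, assuming each percentage is a share of the 106 family sequences listed under Basic Information; the helper name `percent_to_count` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
# Illustrative sketch: convert the reported percentages into approximate
# gene counts. Assumption: every percentage is a share of the 106 family
# sequences ("Number of Sequences" under Basic Information).
FAMILY_SIZE = 106

reported_percentages = {
    "valid RBS motif": 23.58,
    "near scaffold end": 25.47,
    "short scaffold (< 2000 bp)": 66.04,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of genes for a percentage of `total`."""
    return round(percent / 100 * total)


approx_counts = {
    label: percent_to_count(value)
    for label, value in reported_percentages.items()
}

for label, count in approx_counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE} genes")
```

Under that assumption, roughly 25 genes have a valid RBS motif, 27 lie near a scaffold end, and 70 come from scaffolds shorter than 2000 bp.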

Most Common Taxonomy
Group Bacteria (71.698 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Peat Moss → Unclassified → Unclassified → Host-Associated (17.924 % of family members)
Environment Ontology (ENVO) Unclassified (41.509 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (40.566 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 65.28%, β-sheet: 0.00%, Coil/Unstructured: 34.72%

Predicted 3D Structure

pTM-score: 0.74

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
f.40.1.1: V-type ATP synthase subunit C | d6m0rb_ | 6m0r | 0.64094
a.25.1.1: Ferritin | d2fzfa1 | 2fzf | 0.63639
f.72.1.1: Double antiporter-like subunits from respiratory complex I | d3rkod_ | 3rko | 0.62637
d.144.1.0: automated matches | d3tl8a_ | 3tl8 | 0.61981
a.114.1.1: Interferon-induced guanylate-binding protein 1 (GBP1), C-terminal domain | d1f5na1 | 1f5n | 0.61179



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 106 Family Scaffolds
PF00553 | CBM_2 | 4.72
PF02798 | GST_N | 3.77
PF13545 | HTH_Crp_2 | 2.83
PF00106 | adh_short | 1.89
PF01063 | Aminotran_4 | 1.89
PF13410 | GST_C_2 | 1.89
PF00583 | Acetyltransf_1 | 1.89
PF06945 | DUF1289 | 1.89
PF04397 | LytTR | 1.89
PF06961 | DUF1294 | 0.94
PF17198 | AveC_like | 0.94
PF01965 | DJ-1_PfpI | 0.94
PF02735 | Ku | 0.94
PF00593 | TonB_dep_Rec | 0.94
PF13640 | 2OG-FeII_Oxy_3 | 0.94
PF13560 | HTH_31 | 0.94
PF01894 | UPF0047 | 0.94
PF03160 | Calx-beta | 0.94
PF06580 | His_kinase | 0.94
PF00561 | Abhydrolase_1 | 0.94
PF01553 | Acyltransferase | 0.94
PF02518 | HATPase_c | 0.94
PF00069 | Pkinase | 0.94
PF08240 | ADH_N | 0.94
PF00384 | Molybdopterin | 0.94
PF06250 | YhcG_C | 0.94
PF04185 | Phosphoesterase | 0.94
PF12281 | NTP_transf_8 | 0.94
PF13365 | Trypsin_2 | 0.94
PF01928 | CYTH | 0.94
PF00300 | His_Phos_1 | 0.94
PF00903 | Glyoxalase | 0.94
PF00072 | Response_reg | 0.94
PF12729 | 4HB_MCP_1 | 0.94
PF01037 | AsnC_trans_reg | 0.94
PF00873 | ACR_tran | 0.94
PF14833 | NAD_binding_11 | 0.94
PF08281 | Sigma70_r4_2 | 0.94
PF12680 | SnoaL_2 | 0.94
PF13439 | Glyco_transf_4 | 0.94
PF07859 | Abhydrolase_3 | 0.94
PF00155 | Aminotran_1_2 | 0.94
PF02472 | ExbD | 0.94
PF04240 | Caroten_synth | 0.94
PF04261 | Dyp_perox | 0.94
PF09084 | NMT1 | 0.94
PF03473 | MOSC | 0.94
PF04972 | BON | 0.94
PF07715 | Plug | 0.94
PF04389 | Peptidase_M28 | 0.94
PF03992 | ABM | 0.94
PF00589 | Phage_integrase | 0.94
PF02780 | Transketolase_C | 0.94
PF01850 | PIN | 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 106 Family Scaffolds
COG5297 | Cellulase/cellobiase CelA1 | Carbohydrate transport and metabolism [G] | 4.72
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 3.77
COG0115 | Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase | Amino acid transport and metabolism [E] | 3.77
COG3313 | Predicted Fe-S protein YdhL, DUF1289 family | General function prediction only [R] | 1.89
COG2837 | Periplasmic deferrochelatase/peroxidase EfeB | Inorganic ion transport and metabolism [P] | 0.94
COG4804 | Predicted nuclease of restriction endonuclease-like (RecB) superfamily, DUF1016 family | General function prediction only [R] | 0.94
COG4521 | ABC-type taurine transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 0.94
COG3511 | Phospholipase C | Cell wall/membrane/envelope biogenesis [M] | 0.94
COG3326 | Uncharacterized membrane protein YsdA, DUF1294 family | Function unknown [S] | 0.94
COG3275 | Sensor histidine kinase, LytS/YehU family | Signal transduction mechanisms [T] | 0.94
COG2972 | Sensor histidine kinase YesM | Signal transduction mechanisms [T] | 0.94
COG2324 | Uncharacterized membrane protein | Function unknown [S] | 0.94
COG1273 | Non-homologous end joining protein Ku, dsDNA break repair | Replication, recombination and repair [L] | 0.94
COG0848 | Biopolymer transport protein ExbD | Intracellular trafficking, secretion, and vesicular transport [U] | 0.94
COG0715 | ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 0.94
COG0657 | Acetyl esterase/lipase | Lipid transport and metabolism [I] | 0.94
COG0432 | Thiamin phosphate synthase YjbQ, UPF0047 family | Coenzyme transport and metabolism [H] | 0.94



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 71.70 %
Unclassified | root | N/A | 28.30 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300004080|Ga0062385_10155996 | Not Available | 1185
3300004091|Ga0062387_100189314 | All Organisms → cellular organisms → Bacteria | 1230
3300004092|Ga0062389_100171100 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2085
3300005529|Ga0070741_10051350 | All Organisms → cellular organisms → Bacteria | 4996
3300005602|Ga0070762_10816031 | Not Available | 632
3300005712|Ga0070764_10005199 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6165
3300006893|Ga0073928_10717462 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 696
3300009500|Ga0116229_10030605 | All Organisms → cellular organisms → Bacteria | 6054
3300009500|Ga0116229_10081804 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2979
3300009500|Ga0116229_10106415 | All Organisms → cellular organisms → Bacteria | 2519
3300009500|Ga0116229_10334400 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1273
3300009500|Ga0116229_10466467 | All Organisms → cellular organisms → Bacteria | 1048
3300009500|Ga0116229_10529546 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Oxalobacteraceae → Rugamonas → unclassified Rugamonas → Rugamonas sp. CCM 8940 | 973
3300009500|Ga0116229_10531204 | All Organisms → cellular organisms → Bacteria | 972
3300009500|Ga0116229_10993939 | Not Available | 675
3300009510|Ga0116230_10033333 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4888
3300009510|Ga0116230_10054134 | All Organisms → cellular organisms → Bacteria | 3652
3300009545|Ga0105237_10426678 | All Organisms → cellular organisms → Bacteria | 1331
3300009551|Ga0105238_10119677 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Variovorax → Variovorax paradoxus | 2613
3300009633|Ga0116129_1001564 | All Organisms → cellular organisms → Bacteria | 12530
3300009633|Ga0116129_1003304 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7771
3300009633|Ga0116129_1078337 | Not Available | 974
3300009701|Ga0116228_10041265 | All Organisms → cellular organisms → Bacteria | 3771
3300009709|Ga0116227_10035856 | All Organisms → cellular organisms → Bacteria | 5042
3300009709|Ga0116227_10348645 | All Organisms → cellular organisms → Bacteria | 1131
3300009787|Ga0116226_10033216 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5110
3300010371|Ga0134125_11730368 | Not Available | 680
3300010373|Ga0134128_11306768 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 799
3300010376|Ga0126381_103684191 | Not Available | 600
3300011270|Ga0137391_10815645 | Not Available | 768
3300012924|Ga0137413_11524300 | Not Available | 544
3300012925|Ga0137419_11590412 | Not Available | 555
3300012982|Ga0168317_1018361 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2088
3300013105|Ga0157369_12334326 | Not Available | 542
3300014168|Ga0181534_10134459 | Not Available | 1258
3300014168|Ga0181534_10304860 | Not Available | 860
3300014501|Ga0182024_10095274 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4361
3300014501|Ga0182024_11219712 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 879
3300014838|Ga0182030_10022940 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 11581
3300015241|Ga0137418_10267792 | Not Available | 1441
3300015242|Ga0137412_10342746 | All Organisms → cellular organisms → Bacteria | 1165
3300020580|Ga0210403_10634235 | All Organisms → cellular organisms → Bacteria | 861
3300021086|Ga0179596_10033669 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1999
3300021168|Ga0210406_10894107 | Not Available | 669
3300021181|Ga0210388_10026473 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4738
3300021401|Ga0210393_11094145 | Not Available | 644
3300021403|Ga0210397_11252024 | Not Available | 577
3300021405|Ga0210387_10131167 | All Organisms → cellular organisms → Bacteria | 2128
3300021405|Ga0210387_11082801 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Occallatibacter → Occallatibacter riparius | 699
3300021406|Ga0210386_10340798 | Not Available | 1289
3300021406|Ga0210386_11492385 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Occallatibacter → Occallatibacter riparius | 564
3300021420|Ga0210394_11328158 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Edaphobacter | 614
3300021475|Ga0210392_10780250 | Not Available | 713
3300021477|Ga0210398_11176767 | Not Available | 607
3300021478|Ga0210402_11826811 | Not Available | 534
3300024288|Ga0179589_10144025 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → Steroidobacter → Steroidobacter agaridevorans | 1012
3300025463|Ga0208193_1002203 | All Organisms → cellular organisms → Bacteria | 7998
3300025463|Ga0208193_1003175 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6299
3300025924|Ga0207694_10181280 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Variovorax → Variovorax paradoxus | 1708
3300026557|Ga0179587_10893441 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 586
3300027559|Ga0209222_1005690 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2668
3300027574|Ga0208982_1003519 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3203
3300027807|Ga0209208_10082868 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2326
3300027807|Ga0209208_10267272 | Not Available | 895
3300027860|Ga0209611_10047728 | All Organisms → cellular organisms → Bacteria | 3512
3300027860|Ga0209611_10235304 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium TMPK1 | 1101
3300027860|Ga0209611_10270752 | Not Available | 1008
3300027879|Ga0209169_10007523 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 6206
3300027895|Ga0209624_10071122 | All Organisms → cellular organisms → Bacteria | 2245
3300027908|Ga0209006_11304306 | All Organisms → cellular organisms → Bacteria | 561
3300028775|Ga0302231_10126478 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1067
3300029882|Ga0311368_10020849 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 6371
3300029907|Ga0311329_10371548 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae → Rhodanobacter → Rhodanobacter denitrificans | 1010
3300029910|Ga0311369_10798531 | Not Available | 764
3300029939|Ga0311328_10181572 | All Organisms → cellular organisms → Bacteria | 1648
3300029951|Ga0311371_10182681 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3146
3300029951|Ga0311371_11021895 | All Organisms → cellular organisms → Bacteria | 980
3300029951|Ga0311371_11045760 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 965
3300029954|Ga0311331_10139185 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2945
3300029999|Ga0311339_10036475 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 6887
3300029999|Ga0311339_11523656 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 594
3300030007|Ga0311338_11160080 | Not Available | 736
3300030618|Ga0311354_10189830 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2206
3300030618|Ga0311354_10397828 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1391
3300030677|Ga0302317_10160010 | All Organisms → cellular organisms → Bacteria | 1048
3300030693|Ga0302313_10277953 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 669
3300030906|Ga0302314_10246110 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Phenylobacterium → Phenylobacterium deserti | 2130
3300031027|Ga0302308_10258456 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium lablabi | 1090
3300031057|Ga0170834_104115791 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 631
3300031128|Ga0170823_13516693 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Occallatibacter → Occallatibacter riparius | 883
3300031231|Ga0170824_100338467 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Occallatibacter → Occallatibacter riparius | 1229
3300031231|Ga0170824_117093771 | Not Available | 771
3300031234|Ga0302325_13370437 | All Organisms → cellular organisms → Bacteria | 505
3300031236|Ga0302324_103063911 | Not Available | 554
3300031247|Ga0265340_10249073 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → Nevskia → Nevskia soli | 792
3300031446|Ga0170820_10622020 | All Organisms → cellular organisms → Bacteria | 1065
3300031525|Ga0302326_12250380 | Not Available | 694
3300031525|Ga0302326_13444715 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 527
3300031708|Ga0310686_100348274 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Beijerinckiaceae → Methylovirgula → Methylovirgula ligni | 904
3300031708|Ga0310686_104842146 | All Organisms → cellular organisms → Bacteria | 3334
3300031708|Ga0310686_108781105 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1743
3300031708|Ga0310686_117211576 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6121
3300031754|Ga0307475_11254154 | Not Available | 576
3300031823|Ga0307478_11666088 | Not Available | 526
3300032180|Ga0307471_103550626 | Not Available | 552
3300032896|Ga0335075_10011029 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13678
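
Each scaffold identifier in the table above appears to join a numeric prefix to a scaffold name with a `|` character. The prefix matches the Taxon OID values in the Associated Samples table (for example 3300004080). A minimal sketch of splitting such identifiers follows; the names `ScaffoldRef` and `parse_scaffold_ref` are hypothetical helpers written for illustration, not part of any NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str    # numeric sample identifier, e.g. "3300004080"
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold_ref(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the part before '|' is purely numeric, as in every
    identifier listed in the table above.
    """
    taxon_oid, separator, scaffold_id = identifier.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


example = parse_scaffold_ref("3300004080|Ga0062385_10155996")
print(example.taxon_oid, example.scaffold_id)
```

Splitting on the first `|` keeps the sample identifier available as a key for joining scaffold rows to sample rows.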




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 17.92%
Host-Associated | Host-Associated → Plants → Peat Moss → Unclassified → Unclassified → Host-Associated | 17.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 12.26%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 7.55%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 4.72%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 4.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.77%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.77%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 2.83%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.83%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 2.83%
Bog | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Bog | 2.83%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 2.83%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 1.89%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.89%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 1.89%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 0.94%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.94%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.94%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.94%
Bog | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Bog | 0.94%
Weathered Mine Tailings | Environmental → Terrestrial → Geologic → Mine → Unclassified → Weathered Mine Tailings | 0.94%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.94%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere | 0.94%


Associated Samples

Taxon OID | Sample Name | Habitat Type
3300004080 | Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300004091 | Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005712 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300009500 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MG | Host-Associated
3300009510 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fd - Sphagnum fallax MG | Host-Associated
3300009545 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG | Host-Associated
3300009551 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG | Host-Associated
3300009633 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 | Environmental
3300009701 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum fallax MG | Host-Associated
3300009709 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fb - Sphagnum magellanicum MG | Host-Associated
3300009787 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fa - Sphagnum fallax MG | Host-Associated
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012982 | Weathered mine tailings microbial communities from Hibbing, Minnesota, USA - DCWfield | Environmental
3300013105 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2-5 metaG | Host-Associated
3300014168 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_10_metaG | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300014838 | Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300024288 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal | Environmental
3300025463 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 (SPAdes) | Environmental
3300025924 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG (SPAdes) | Host-Associated
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027559 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_O3 (SPAdes) | Environmental
3300027574 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_O3 (SPAdes) | Environmental
3300027807 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fd - Sphagnum fallax MG (SPAdes) | Host-Associated
3300027860 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MG (SPAdes) | Host-Associated
3300027879 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes) | Environmental
3300027895 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300028775 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_N2_2 | Environmental
3300029882 | III_Palsa_E1 coassembly | Environmental
3300029907 | I_Bog_N1 coassembly | Environmental
3300029910 | III_Palsa_E2 coassembly | Environmental
3300029939 | I_Bog_E3 coassembly | Environmental
3300029951 | III_Palsa_N1 coassembly | Environmental
3300029954 | I_Bog_N3 coassembly | Environmental
3300029999 | I_Palsa_E3 coassembly | Environmental
3300030007 | I_Palsa_E1 coassembly | Environmental
3300030618 | II_Palsa_E3 coassembly | Environmental
3300030677 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_N3_3 | Environmental
3300030693 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_N2_2 | Environmental
3300030906 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_N2_3 | Environmental
3300031027 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_E3_3 | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031234 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2 | Environmental
3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental
3300031247 | Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-CB2-25 metaG | Host-Associated
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031525 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_3 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032896 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.4 | Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0062385_101559961 3300004080 Bog Forest Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0062387_1001893141 3300004091 Bog Forest Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRICAIASLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0062389_1001711002 3300004092 Bog Forest Soil MTPTQFMQDPLQGKLSVSRVFWLYGVVGSLVYGCLEFFIDPGNTFLIRLYTIGGCLYSAYVIVGTYRCSVNCRTSGMARFVRVSCIISLLLLPILTYFELSGAIGSDLSQLDQLNF*
Ga0070741_100513502 3300005529 Surface Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYELLEFFIDPANAFLVRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLLVFMYFELSGALDGELSRLQQLNL*
Ga0070762_108160312 3300005602 Soil VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLVYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVISLILLPVLTYFEFSGALSGDLSQIEQLNF*
Ga0070764_100051995 3300005712 Soil MTPARFFQAPLEGKLSVSRVFWLYGVVGSLVYGILEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFEFSGAFDGELSQFQQLNF*
Ga0073928_107174622 3300006893 Iron-Sulfur Acid Spring VTPIQFIQAPLQAKLSVARVFWLYGVAGSLIYGCLEFFIDPGNTFLMRLYTIGGCLYSAYVIIGTYQCAVNCRTVRMARFVRVSSILSLILLPILTYYELSGAFSSELSQLEQLNL*
Ga0116229_100306053 3300009500 Host-Associated MDMNIPAFIRAPLEGKTTVSRVFWLYGVVGSLIYSALEFVIDPGNVGLLRVYTLGGALFSLYVIVGTYRCAVNCRTEAMARFVRVSCVLSVLLLPVLTYFELTGALRTDLSQLDQLNL*
Ga0116229_100818043 3300009500 Host-Associated MTTILQFFRAPLEGKLSVSRVFVLYGIVGSLLYGCLEFLIDPLNSFLMRTYTIGGVLYSAYVIVGTHRCAVNCKTARRARWVRVCCVISLLLLPVLTYVEWNGTFDSELSQLDQLNF*
Ga0116229_101064154 3300009500 Host-Associated MSIKSFIEAPLQAKTSVSRVFWLYGIVGSLLYSGLEFLIDPGNAWLLRAYTLGGALLSLYVIVGTYRCAVNCRSAGMARFVRVSCVISVLLLPVITYLELSGALSTDLSQLQQLGL*
Ga0116229_103344001 3300009500 Host-Associated MTPTEFLRAPLQAKLSVSRVFWLYGVAGSLVYGCLEFFIDPGSTSLMRLYTIGGGLYSAYVIIGTYRCAVNCATVAMARFVRISCILSLLLLPILTYYELSGAFGSELSQLDQLKF*
Ga0116229_104664672 3300009500 Host-Associated MSIRSFIEAPLQGKTSVSRVFWLYGVVASLVYSSLEFLIDPGNAALLRAYTLGGALLTLYVIVGTYRCAVNCRSPGMARFVRVSCVISLLLLPVITYLELSGALSTDLSQLQELGL*
Ga0116229_105295462 3300009500 Host-Associated MSIKSFIEAPLQGKTSVSRVFWLYGVLLSLLYSSLEFLIDPGNAVLLRVYTLGGALLTLYVIVGTYRCAVNCRSPGMARFVRVSCVISLLLLPVITYLELSGALSTDLSQLKELGL*
Ga0116229_105312042 3300009500 Host-Associated MSIRAFIEAPLQGKTSVSRVFWLYGVVGSLMYSALEFLIDPGNAGLLRAYTLGDALFSLYVIVGTYRCAVNCRTAGMARFVRVSCVISVLLLPVITYLELSGALSADLSQLEQLNL*
Ga0116229_109939391 3300009500 Host-Associated MASIAQFFRAPLEGKLSVSRVVWVYGIVGSLLYGCLEFFIDPLNSILMRIYTLGGILYSAYVIVGTYRCAVNCKTPRMARLVRLSCVISLLLLPVLTYMEWNGAVDSELLQLQQLNF*
Ga0116230_100333335 3300009510 Host-Associated MNVAAARSPFAAVMISPMTSIAQFFRAPLQGKLSVSRVFVVYGIVGSLLYGCLEFMIDPFNLFLMRVYTIGGILYSAYVIVGTYRCAVNCKTPRMARFVRVSCVISLLLLPVITYMEWNGAVDSELLQLEQLNF*
Ga0116230_100541345 3300009510 Host-Associated MSIKSFIEAPLQAKTSVSRVFWLYGIVGSLLYSGLEFLIDPGSAWLLRAYTLGGALLSLYVIVGTYRCAVNCRSAGMARFVRVSCVISVLLLPVITYLELSGALSTDLSQLQQLGL*
Ga0105237_104266782 3300009545 Corn Rhizosphere MTTAQFVKAPLQGKLSVSRVFWLYGVVGSLVYGLVEFFIDPANALLIRLYTVGAYAYSAYVIVGTYRCAVNCKTAGMARFVRISAIVSLILLPVFMYFELSGVFDGELSQFQQLNF*
Ga0105238_101196771 3300009551 Corn Rhizosphere QFVQAPLKGELAVSRVFWLYGVVGSLVYGFLEFFIDPANVFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIVSLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0116129_100156410 3300009633 Peatland VTPTQFIQAPLQAKLSVARVFWLYGVAGSLVYGGLEFFIDPGNTFLMRIYSIGGFLYSAYVIVGTYRCSVNCSTEAMARFVRISSIISLILLPILTYFELSGAFSSDLSQLEQLNL*
Ga0116129_10033046 3300009633 Peatland VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLVYGALEFLIDPGNTFLMRLCTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCIISLILLPVLTYFEFSGALSGDLSQLEQLNF*
Ga0116129_10783372 3300009633 Peatland MSIRTFIEAPLQGKTSVSRVFWLYGIVGSLLYSCLEFLIDPGNGGLLRVYTLGGALLTLYVIVGTYRCAVNCRSPGMARFVRVSSVISLLLLPVITYLELSGALSTDLSQLQELNF*
Ga0116228_100412654 3300009701 Host-Associated MITPMTSIAQFFRAPLQGKLSVSRVFVVYGIVGSLLYGCLEFMINPFNLFLMRVYTIGGILYSAYVIVGTYRCAVNCKTPRMARFVRVSCVISLLLLPVITYMEWNGAVDSELLQLEQLNF*
Ga0116227_100358562 3300009709 Host-Associated MDMNIPAFIRAPLEGKTTVSRVFWLYGVVGSLIYSALEFVIDPGNVGLLRAYTLGGALFSLYVIVGTYRCAVNCRTEAMARFVRVSCVLSVLLLPVLTYFELTGALRTDLSQLDQLNL*
Ga0116227_103486451 3300009709 Host-Associated MTTLSEFFRAPLEGKLSVSRVFVVYGVVGSLVYGCLEFTLDPSNSFLMRAYTLGGALYSAYVIVATYRCAVNCKTPRMAQFVRVSCIISLVILPVVTYMELSGVFESELSQLDQLNF*
Ga0116226_100332163 3300009787 Host-Associated LSIRDFIQAPLQGKTSVSRVFWLYGVVGSLLYSCLELLIDPGKAGLLRAYTLGGALFTLYVIVGTYRCAVNCRSPGMARFVRVSCVISVLLLPLITYLELSGALTADLSQIEELGL*
Ga0134125_117303682 3300010371 Terrestrial Soil VQAPLQGKLTVSRVFWLYGVVGSLVYGLLEFFIDPGNVFLIRLYTVGAYAYSAYVIIGTYRCAVNCRTAGMARFVRISALISLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0134128_113067682 3300010373 Terrestrial Soil MTLAQFVKAPLEGRLSVSRVFWLYGVVGSLVYGLLEFFIDPGNALLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIVSLILLPVFMYFELSGALDGELSQFQQLNY*
Ga0126381_1036841912 3300010376 Tropical Forest Soil VSRVFWLYGVVGSLAYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTALMARFVRISAIVSLILLPIFMYFELSGALDGELSQFQQLNF*
Ga0137391_108156451 3300011270 Vadose Zone Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLAYGLLEFFIDPANAFLIRLYSVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0137413_115243001 3300012924 Vadose Zone Soil MTPDQFVQAPLQGKLSVSRVFWVYGVVGSLAYGLLEFFIDPANAFLIRLYSVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0137419_115904121 3300012925 Vadose Zone Soil MTPAQFVQAPLQGELSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0168317_10183613 3300012982 Weathered Mine Tailings MTPAQFVQAPLQGKLSVSRVFWLYGVLGSLVYGLLEFFIDPGNAFLIRLYTVGAYAYSAYVIVGTYRCAVHCRTAGMARFVRISAIASLILLPVFMYFEFSGALGSELSQLQQLNF*
Ga0157369_123343262 3300013105 Corn Rhizosphere MTAAQFIQAPLKGELTVSRVFWLYGVVGSLVYGLLEFFIDPANVFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIVSLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0181534_101344591 3300014168 Bog RVFWLYGVAGSLVYGCLEFFIDPGNTFLMRLYTIGGGLYSAYVIIGTYRCAVNCATVAMARFVRISCILSLILLPMLTYYELSGALSGELSQLEQLNL*
Ga0181534_103048602 3300014168 Bog RVFWLYGVAGSLVYGCLEFFIDPGNTVLMRLYTIGGGLYSAYVIIGTYRCAVNCATVGMARFVRISCVLSLLLLPILTYYELSGALGSDMSQIEQLNL*
Ga0182024_100952743 3300014501 Permafrost MGIRAFIEAPLQGKTSVSRVFWLYGLVGSLLYSCLEFLIDPGNARLLRAYTLGGALLTLYVIVGTYRCAVNCRSPGMARFVRVSCVVSLLLLPVITYLDLSGALSTDLSQLQELNF*
Ga0182024_112197121 3300014501 Permafrost MTPIQFLQEPLQAKLSVSRVFWLYGVAGSLVYGCLEFFIDPGNTVLMRLYTVGDGLYSAYVIVGTYRCAVNCATVGMARFVRISCVLSLLLLPVLTYVELSGALSSDLSQLDQLNL*
Ga0182030_100229406 3300014838 Bog MSIRAFIEAPLQGKTSVARVFWLYGIVGSLLYSCLEFLIDPGSAGLLRTYTLGGALLTLYVIVATYRCAVNCRSPGMARFVRVSSVISLLLLPVITYLELSGALSTDLSQLEQLNL*
Ga0137418_102677922 3300015241 Vadose Zone Soil MTPDQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0137412_103427461 3300015242 Vadose Zone Soil MTPAQFVQAPLQGKLSVSRVFWVYGVVGSLAYGLLEFFIDPANAFLIRLYSVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF*
Ga0210403_106342352 3300020580 Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSVYVIVGTYRCAVNCKTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF
Ga0179596_100336694 3300021086 Vadose Zone Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFIYFELSGALDGELSQFQQLNF
Ga0210406_108941071 3300021168 Soil MTPAQFVQAPLQGRLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIVSLILLPIFMYFELSGALDGELSQFQQLNF
Ga0210388_100264732 3300021181 Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPVNAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAVASLILLPVFMYFELSGALDGELSQFQQLNF
Ga0210393_110941451 3300021401 Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANGFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAVVSLILLPIFMYFELSGALDGELSQFQQLNF
Ga0210397_112520241 3300021403 Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF
Ga0210387_101311672 3300021405 Soil MTPAQFVQAPLKGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQHLNF
Ga0210387_110828011 3300021405 Soil MTPTQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIFGTYRCAVNCRTAGMARFVRISAIVSLILLPVFMYYELSGAFDGELSQFQQLNF
Ga0210386_103407982 3300021406 Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMALFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF
Ga0210386_114923851 3300021406 Soil SGMTPAQFLQAPLEGKLTVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRLSAIASLILLPIFMYLELSGALDGELSQFQQLNF
Ga0210394_113281582 3300021420 Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLVRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF
Ga0210392_107802501 3300021475 Soil MGIREFFEAPLQGKMSVSRVFWLYGIVGSLVYGLFEFLIDPGNVFLMRLYSIGGLLYTAYVIVATHRSAVNCKSQRMASFVRISCVASLLILPLIAYLELTGSLSLDIGGLDQLDLTGR
Ga0210398_111767672 3300021477 Soil MTPTQFVQAPLEGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFEFSGAFDGELSQFQQLNF
Ga0210402_118268111 3300021478 Soil MTPTQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSVYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLNF
Ga0179589_101440252 3300024288 Vadose Zone Soil MTPAQFVQAPLQGKLSVSRVFWLYGVVGSLAYGLLQFFIDPANAFLIRLYSVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFEFSGALDGELSQLQQLNF
Ga0208193_10022037 3300025463 Peatland MGTTYVSVAPKVSKIGGVTPTQFIQAPLQAKLSVARVFWLYGVAGSLVYGGLEFFIDPGNTFLMRIYSIGGFLYSAYVIVGTYRCSVNCSTEAMARFVRISSIISLILLPILTYFELSGAFSSDLSQLEQLNL
Ga0208193_10031753 3300025463 Peatland VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLVYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCIISLILLPVLTYFEFSGALSGDLSQLEQLNF
Ga0207694_101812801 3300025924 Corn Rhizosphere QFVQAPLKGELAVSRVFWLYGVVGSLVYGFLEFFIDPANVFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIVSLILLPVFMYCELSGALDGELSQFQQLNF
Ga0179587_108934412 3300026557 Vadose Zone Soil DSGMTPAQFVQAPLQGKLSVSRVFWLYGVVGSLAYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLVLLPVFMYFELSGALDGELSQLQQLNF
Ga0209222_10056903 3300027559 Forest Soil VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLVYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVISLILLPVLTYFEFSGALSGDLSQLEQLNF
Ga0208982_10035194 3300027574 Forest Soil MGIRQFFEAPLQGKVTVSRVFWLYGVVGSLVYGLFEFLIDPGNVFLTRLYSIGGLLYTAYVIVATHRSAVNCKSQRMASFVRISCVASLLLLPVFAYLELSGALSLDMAGLEQLDVTGR
Ga0209208_100828683 3300027807 Host-Associated VSRVFVVYGIVGSLLYGCLEFMIDPFNLFLMRVYTIGGILYSAYVIVGTYRCAVNCKTPRMARFVRVSCVISLLLLPVITYMEWNGAVDSELLQLEQLNF
Ga0209208_102672723 3300027807 Host-Associated MSIKSFIEAPLQAKTSVSRVFWLYGIVGSLLYSGLEFLIDPGSAWLLRAYTLGGALLSLYVIVGTYRCAVNCRSAGMARFVRVSCVISVLLLPVITYLELSGALSTDLSQLQQLGL
Ga0209611_100477285 3300027860 Host-Associated MTTILQFFRAPLEGKLSVSRVFVLYGIVGSLLYGCLEFLIDPLNSFLMRTYTIGGVLYSAYVIVGTHRCAVNCKTARRARWVRVCCVISLLLLPVLTYVEWNGTFDSELSQLDQLNF
Ga0209611_102353041 3300027860 Host-Associated MTPTEFLRAPLQAKLSVSRVFWLYGVAGSLVYGCLEFFIDPGSTSLMRLYTIGGGLYSAYVIIGTYRCAVNCATVAMARFVRISCILSLLLLPILTYYELSGAFGSELSQLDQLKF
Ga0209611_102707522 3300027860 Host-Associated RMDMNIPAFIRAPLEGKTTVSRVFWLYGVVGSLIYSALEFVIDPGNVGLLRVYTLGGALFSLYVIVGTYRCAVNCRTEAMARFVRVSCVLSVLLLPVLTYFELTGALRTDLSQLDQLNL
Ga0209169_100075233 3300027879 Soil MTPARFFQAPLEGKLSVSRVFWLYGVVGSLVYGILEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFEFSGAFDGELSQFQQLNF
Ga0209624_100711223 3300027895 Forest Soil MTPAQFVQAPLKGNLSVSRVFWLYGVVGSLVYGLLELFIDPANAFLIRLYSVGAYAYATYVIVGTYRCAVNCRTAGVARFVRISAIVSLILLPVFMYFDISGALDGELSQFQQLNF
Ga0209006_113043062 3300027908 Forest Soil VTPTEFINAPLQGKVSVSRVFWLYGVVGSLVYGLLEFFINPGNTLLVRLYSIGGYVYSVYVILGTYRCAVNCKTAGMARFVRISAIASLILLPVFMYFEFSGAFDGELSQFQQLNF
Ga0302231_101264782 3300028775 Palsa VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLMYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVISLILLPVLTYFEFSGALSGDLSQLEQLNF
Ga0311368_100208494 3300029882 Palsa MTPIQFLQEPLQAKLSVSRVFWLYGVAGSLVYGCLEFFIDPGNTVLMRLYTVGDGLYSAYVIVGTYRCAVNCATVGMARFVRISCVLSLLLLPVLTYVELSGALSSDLSQLDQLNL
Ga0311329_103715481 3300029907 Bog MSIKSFIEAPLQGKTSVSRVFWLYGVVLSLLYSSLEFLIDPGNTVLLRLYTLGGALLTVYVIVGTYRCAVNCRSPGMARFVRVSCVISLLLLPLITYLELSGALNTDLSQLKELG
Ga0311369_107985311 3300029910 Palsa VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGAFDGELSQFQQLNF
Ga0311328_101815723 3300029939 Bog MTHIQFLQAPLQAKISVARVFWLYGVAGSLVYGCLEFFIDPGNTFLMRLYTIGGGLYSTYVIVGTYRCAVNCATVGMARFVRISCVLSLLLLPILTYLELSGALSSDLSQLDQLDL
Ga0311371_101826813 3300029951 Palsa MTVAQFVQAPLEGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGAFDGELSQFQQLNF
Ga0311371_110218952 3300029951 Palsa MTPLQFLQAPLQAKLSVARVFWLYGVAGSLVYGCLEFFIDPGNTFLMRLYTIGGGLYSAYVIIATYRCAVNCATVAMARFVRISCILSLILLPVLTYYELSGAFSSELSQLEQLNL
Ga0311371_110457602 3300029951 Palsa MGIKAFIEAPLQGKISVSRVFWLYGVVGSLLYSCLEFLIDPGNSVLLRIYTLGGALLTVYVIVGTYRCAVNCRSPSMARFVRVSCVISMLLLPVITYLELSGALSTDLSQLQELNP
Ga0311331_101391854 3300029954 Bog MSIKSFIEAPLQGKTSVSRVFWLYGVVLSLLYSSLEFLIDPGNTVLLRLYTLGGALLTVYVIVGTYRCAVNCRSPGMARFVRVSCVISLLLLPLITYLELSGALNTDLSQLKELGL
Ga0311339_100364756 3300029999 Palsa IQAPLQAKLSVSRVFWLYGVVGSLVYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVISLILLPVLTYFEFSGALSGDLSQLEQLNF
Ga0311339_115236561 3300029999 Palsa ESTARQAHEHNRPMGIKAFIEAPLQGKISVSRVFWLYGVVGSLLYSCLEFLIDPGNSVLLRIYTLGGALLTVYVIVGTYRCAVNCRSPSMARFVRVSCVISMLLLPVITYLELSGALSTDLSQLQELNP
Ga0311338_111600801 3300030007 Palsa MGIKAFIEAPLQGKISVSRVFWLYGVVGSLLYSCLEFLIDPGNSVLLRIYTLGGALLTVYVIVGTYRCAVNCRSPSMARFVRVSCVISMLLLPVI
Ga0311354_101898303 3300030618 Palsa VSRVFWLYGVVGSLMYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVISLILLPVLTYFEFSGALSGDLSQLEQLNF
Ga0311354_103978281 3300030618 Palsa MGIKAFIEAPLQGKISVSRVFWLYGVVGSLLYSCLEFLIDPGNTKLLRVYTLGGALLTVYVVVGTYRCAVNCRSPSMARFVRVSCVISLLLLPVITYLELSGALSTDLSQLEQLNL
Ga0302317_101600101 3300030677 Palsa SRVFWLYGVAGSLVYGCLEFFIDPGNTVLMRLYTVGDGLYSAYVIVGTYRCAVNCATVGMARFVRISCVLSLLLLPVLTYVELSGALSSDLSQLDQLNL
Ga0302313_102779532 3300030693 Palsa VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLMYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVISLILLPVLTY
Ga0302314_102461101 3300030906 Palsa VTPTQFIQAPLQAKLSVSRVFWLYGVVGSLMYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVLSLILLPVLTYFEFSG
Ga0302308_102584561 3300031027 Palsa GGVTPTQFIQAPLQAKLSVSRVFWLYGVVGSLVYGALEFLIDPGNTFLMRLYTIGGCVYSAYVIVGTYRCAVNCRTAGMARFVRVSCVISLILLPVLTYFEFSGALSGDLSQLEQLNF
Ga0170834_1041157912 3300031057 Forest Soil MTPAQFVQAPLEGKLSVSRVFWLYGVVGSLVYGLIEFFIDPANALLIRLYTVGAYAYSAYVIVGTYRCAVNCRTASMARFVRISAIVSLILLPVFMYFELSGALDGELSQFQQLNF
Ga0170823_135166932 3300031128 Forest Soil MTPAQVVQAPLKGKLSVSRVFWLYGVVGSLVYGLIEFFIDPANALLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFLYLELSGAFDGELSQFQQLNF
Ga0170824_1003384672 3300031231 Forest Soil MTLAQFVQAPLEGRLSVSRVFWLYGVVGSLAYGLLEFFIDPANAFLIRLYTIGAYAYSAYVIVGTYRCAVNCRTASMARFVRISAIVSLILLPVFLYLELSGAFDGELSQFQQLNF
Ga0170824_1170937711 3300031231 Forest Soil MTPGQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAVMARFVRISAIASLILLPLFMYFEFSGALDGELSQFQQLNF
Ga0302325_133704372 3300031234 Palsa IQFLQAPLQAKLSVARVFWLYGVAGSLAYGCLEFFIDPGNTILMRLYTIGGGLYSAYVIVGTYRCSVNCATARMALLVRVSCVLSLILLPILTYVELSGALGSDLSQIEQLNL
Ga0302324_1030639111 3300031236 Palsa LQAKLSVSRVFWLYGVLGSLVYGALEFFINPGNAFLMRVYTIGDGLYSAYVIVGTYRCSVNCSSAGMARFVRVSCIISLILLPILTYFELSGAFDSELSQLEQLNL
Ga0265340_102490731 3300031247 Rhizosphere MTLAQFVQAPLKGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRSVGMERFVRISAIVSLILLPVFMYFELSGALDGELSQFQQLNF
Ga0170820_106220202 3300031446 Forest Soil MTLAQFVQAPLEGRLSVSRVFWLYGVVGSLAYGLLEFFIDPANAFLIRLYTIGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFLYLELSGAFDGELSQFQQLNF
Ga0302326_122503802 3300031525 Palsa MSIRAFIEAPLQGKTSVSRVFWLYGVVGSLLYSCLEFLIDPGNTKLLRVYTLGGALLTVYVVVGTYRCAVNCRSPSMARFVRVSCVISLLLLPVITYLELSGALSTDLSQ
Ga0302326_134447152 3300031525 Palsa VTFAQFIQAPLQAKLSVSRVFWLYGVLGSLVYGALEFFINPGNAFLMRVYTIGDGLYSAYVIVGTYRCSVNCSSAGMARFVRVSCIISLILLPILTYFELSGAFDSELSQLEQ
Ga0310686_1003482742 3300031708 Soil MTPAQFIQAPLEGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANALLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFEFSGAFDGELSQFQQLNF
Ga0310686_1048421465 3300031708 Soil GMTPAQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAILSLILLPVFMYFELSGALDGELSQFQQLNF
Ga0310686_1087811051 3300031708 Soil MTPTQFLQAPLQAKLSVTRVFWLYGVAGSLAYGCLEFFIDPGNTLLMRLYTVGGGLYSVYVIVGTYRCAVNCATVGMARFVRISCVLSLLLLPILTYFELSGALSDDLSQIEQLNL
Ga0310686_1172115764 3300031708 Soil MTPAQFVQAPLEGKLSVSRVFWVYGVVGSLVYGLVEFFIDPANALLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGAFDSELSQFQQLNF
Ga0307475_112541541 3300031754 Hardwood Forest Soil VQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIIGTYRCAVNCRTAGMARFVRICAIASLILLPVCMYFELSGALDGELSQFQQLNF
Ga0307478_116660881 3300031823 Hardwood Forest Soil MTPTQFVQAPLQGKLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIVGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGA
Ga0307471_1035506261 3300032180 Hardwood Forest Soil QAPLEGRLSVSRVFWLYGVVGSLVYGLLEFFIDPANAFLIRLYTVGAYAYSAYVIIGTYRCAVNCRTAGMARFVRISAIASLILLPVFLYFELSGAFDGELSQFQQLNF
Ga0335075_100110296 3300032896 Soil MQAPLKGELSVSRVFWLYGVVGSLVYGLLELFINPANAFLIRLYTVGAYAYSAYVIIGTYRCAVNCRTAGMARFVRISAIASLILLPVFMYFELSGALDGELSQFQQLDF

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice