NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F098667


Overview

Basic Information
Family ID F098667
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 110 residues
Representative Sequence LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRVYAVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Number of Associated Samples 80
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 62.75 %
% of genes near scaffold ends (potentially truncated) 26.21 %
% of genes from short scaffolds (< 2000 bps) 71.84 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.79


Most Common Taxonomy
Group Bacteria (78.641 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (40.777 % of family members)
Environment Ontology (ENVO) Unclassified (44.660 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (40.777 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 64.29%, β-sheet: 0.00%, Coil/Unstructured: 35.71%

Predicted 3D Structure

pTM-score: 0.79

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
a.25.1.1: Ferritin | d2fzfa1 | 2fzf | 0.6455
d.144.1.0: automated matches | d3tl8a_ | 3tl8 | 0.62055
a.25.1.1: Ferritin | d1z6om1 | 1z6o | 0.62019
a.114.1.1: Interferon-induced guanylate-binding protein 1 (GBP1), C-terminal domain | d1f5na1 | 1f5n | 0.61901
a.25.1.1: Ferritin | d3t9ja_ | 3t9j | 0.61099



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 103 Family Scaffolds
PF13599 | Pentapeptide_4 | 16.50
PF12697 | Abhydrolase_6 | 11.65
PF00892 | EamA | 7.77
PF07883 | Cupin_2 | 2.91
PF00805 | Pentapeptide | 1.94
PF04986 | Y2_Tnp | 1.94
PF01075 | Glyco_transf_9 | 0.97
PF13852 | DUF4197 | 0.97
PF01161 | PBP | 0.97
PF01061 | ABC2_membrane | 0.97
PF02776 | TPP_enzyme_N | 0.97
PF02622 | DUF179 | 0.97
PF04828 | GFA | 0.97
PF00210 | Ferritin | 0.97
PF13520 | AA_permease_2 | 0.97
PF13844 | Glyco_transf_41 | 0.97
PF13505 | OMP_b-brl | 0.97
PF13676 | TIR_2 | 0.97
PF00563 | EAL | 0.97
PF00266 | Aminotran_5 | 0.97
PF00848 | Ring_hydroxyl_A | 0.97
PF07075 | DUF1343 | 0.97
PF08922 | DUF1905 | 0.97
PF02040 | ArsB | 0.97
PF01593 | Amino_oxidase | 0.97
PF00578 | AhpC-TSA | 0.97
PF01641 | SelR | 0.97
PF08479 | POTRA_2 | 0.97
PF09176 | Mpt_N | 0.97
PF10543 | ORF6N | 0.97
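
Each frequency in the Pfam table is a share of the 103 family scaffolds, so every percentage should correspond to a whole number of scaffolds. The sketch below shows that conversion under this assumption; the helper name `scaffold_count` is hypothetical and is not part of NMPFamsDB.

```python
# Total taken from "Number of Associated Scaffolds" in Basic Information.
FAMILY_SCAFFOLDS = 103


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a reported % frequency back to a whole number of scaffolds.

    Hypothetical helper: assumes the percentage was computed as
    count / total * 100 and rounded to two decimals.
    """
    return round(percent / 100 * total)


if __name__ == "__main__":
    # A few rows copied from the Pfam table.
    for pfam_id, pct in [("PF13599", 16.50), ("PF12697", 11.65),
                         ("PF00892", 7.77), ("PF00210", 0.97)]:
        print(pfam_id, scaffold_count(pct))
```

Read this way, a frequency of 0.97 means the domain was seen next to a single family member, so most rows in the table rest on one observation.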

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 103 Family Scaffolds
COG1357 | Uncharacterized conserved protein YjbI, contains pentapeptide repeats | Function unknown [S] | 1.94
COG4638 | Phenylpropionate dioxygenase or related ring-hydroxylating dioxygenase, large terminal subunit | Inorganic ion transport and metabolism [P] | 1.94
COG0229 | Peptide methionine sulfoxide reductase MsrB | Posttranslational modification, protein turnover, chaperones [O] | 0.97
COG0471 | Di- and tricarboxylate antiporter | Carbohydrate transport and metabolism [G] | 0.97
COG0859 | ADP-heptose:LPS heptosyltransferase | Cell wall/membrane/envelope biogenesis [M] | 0.97
COG1055 | Na+/H+ antiporter NhaD or related arsenite permease | Inorganic ion transport and metabolism [P] | 0.97
COG1678 | Putative transcriptional regulator, AlgH/UPF0301 family | Transcription [K] | 0.97
COG1881 | Uncharacterized conserved protein, phosphatidylethanolamine-binding protein (PEBP) family | General function prediction only [R] | 0.97
COG2200 | EAL domain, c-di-GMP-specific phosphodiesterase class I (or its enzymatically inactive variant) | Signal transduction mechanisms [T] | 0.97
COG3434 | c-di-GMP phosphodiesterase YuxH/PdeH, contains EAL and HDOD domains | Signal transduction mechanisms [T] | 0.97
COG3791 | Uncharacterized conserved protein | Function unknown [S] | 0.97
COG3876 | Exo-beta-N-acetylmuramidase YbbC/NamZ, DUF1343 family | Cell wall/membrane/envelope biogenesis [M] | 0.97
COG4943 | Redox-sensing c-di-GMP phosphodiesterase, contains CSS-motif and EAL domains | Signal transduction mechanisms [T] | 0.97
COG5001 | Cyclic di-GMP metabolism protein, combines GGDEF and EAL domains with a 6TM membrane domain | Signal transduction mechanisms [T] | 0.97



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 78.64 %
Unclassified | root | N/A | 21.36 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300002245|JGIcombinedJ26739_100452917 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1160
3300004092|Ga0062389_100385334 | Not Available | 1514
3300004152|Ga0062386_100543259 | All Organisms → cellular organisms → Bacteria | 946
3300005537|Ga0070730_10000772 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 34423
3300005994|Ga0066789_10002384 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 8197
3300005994|Ga0066789_10025063 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2643
3300005994|Ga0066789_10140669 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1027
3300005994|Ga0066789_10195363 | Not Available | 854
3300005994|Ga0066789_10261097 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingobium | 726
3300005995|Ga0066790_10009849 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4222
3300006059|Ga0075017_100417711 | All Organisms → cellular organisms → Bacteria | 1008
3300007788|Ga0099795_10034210 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1769
3300009143|Ga0099792_10187067 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1169
3300009633|Ga0116129_1012317 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3250
3300009651|Ga0105859_1215231 | Not Available | 571
3300010159|Ga0099796_10436661 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 580
3300011269|Ga0137392_10348327 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1227
3300011270|Ga0137391_10298485 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1390
3300011271|Ga0137393_10511262 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1030
3300012189|Ga0137388_11728471 | Not Available | 559
3300012202|Ga0137363_10148728 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1838
3300012202|Ga0137363_10218407 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1537
3300012205|Ga0137362_10181912 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1807
3300012357|Ga0137384_10373571 | All Organisms → cellular organisms → Bacteria | 1180
3300012361|Ga0137360_10128319 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1982
3300012362|Ga0137361_10192092 | All Organisms → cellular organisms → Bacteria | 1844
3300012363|Ga0137390_11023255 | Not Available | 777
3300012683|Ga0137398_10075093 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2071
3300012683|Ga0137398_10112775 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1725
3300012685|Ga0137397_10080233 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2372
3300012922|Ga0137394_10192004 | All Organisms → cellular organisms → Bacteria | 1743
3300012924|Ga0137413_10019340 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3548
3300012924|Ga0137413_10066250 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 2148
3300012925|Ga0137419_10492991 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 971
3300012927|Ga0137416_10257649 | All Organisms → cellular organisms → Bacteria | 1428
3300012927|Ga0137416_11464245 | Not Available | 620
3300012927|Ga0137416_12037904 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 527
3300012929|Ga0137404_10096222 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 2379
3300012929|Ga0137404_10822010 | Not Available | 845
3300012930|Ga0137407_10407477 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1260
3300012944|Ga0137410_10002667 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 12137
3300014495|Ga0182015_10017454 | All Organisms → cellular organisms → Bacteria | 6010
3300014838|Ga0182030_10022887 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 11599
3300015164|Ga0167652_1028138 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1119
3300015193|Ga0167668_1053179 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 842
3300015203|Ga0167650_1032787 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1376
3300015241|Ga0137418_10103776 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 2555
3300015242|Ga0137412_10625680 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingobium | 811
3300015245|Ga0137409_10002044 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 21740
3300015264|Ga0137403_10253465 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1670
3300015264|Ga0137403_10423262 | All Organisms → cellular organisms → Bacteria | 1209
3300017948|Ga0187847_10662227 | Not Available | 586
3300020001|Ga0193731_1153589 | Not Available | 563
3300020021|Ga0193726_1215890 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingobium | 798
3300020061|Ga0193716_1054317 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1855
3300020170|Ga0179594_10312119 | Not Available | 596
3300020199|Ga0179592_10000340 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 18451
3300020579|Ga0210407_10161362 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1731
3300020580|Ga0210403_10163804 | Not Available | 1821
3300021086|Ga0179596_10332027 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 762
3300021086|Ga0179596_10679122 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 521
3300021168|Ga0210406_10287999 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1339
3300021478|Ga0210402_10197914 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1846
3300021478|Ga0210402_10384909 | All Organisms → cellular organisms → Bacteria | 1302
3300021478|Ga0210402_11938806 | Not Available | 515
3300021479|Ga0210410_10043179 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3926
3300024283|Ga0247670_1010219 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1700
3300025463|Ga0208193_1005643 | All Organisms → cellular organisms → Bacteria | 4362
3300025463|Ga0208193_1008239 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3398
3300026291|Ga0209890_10002994 | All Organisms → cellular organisms → Bacteria | 7368
3300026291|Ga0209890_10053531 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 1483
3300026291|Ga0209890_10204175 | Not Available | 632
3300026294|Ga0209839_10004824 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6276
3300026557|Ga0179587_10020252 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3581
3300027678|Ga0209011_1208420 | Not Available | 530
3300027857|Ga0209166_10015220 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4888
3300027895|Ga0209624_10465282 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 843
3300027903|Ga0209488_10130509 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1894
3300027908|Ga0209006_10007975 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9564
3300028047|Ga0209526_10298237 | All Organisms → cellular organisms → Bacteria | 1091
3300028536|Ga0137415_10112489 | All Organisms → cellular organisms → Bacteria | 2570
3300028536|Ga0137415_10819090 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 741
3300028536|Ga0137415_11260504 | Not Available | 556
3300028783|Ga0302279_10084312 | All Organisms → cellular organisms → Bacteria | 1742
3300029943|Ga0311340_10142917 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2523
3300029987|Ga0311334_11301790 | Not Available | 611
3300029989|Ga0311365_11246441 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 640
3300031128|Ga0170823_16848159 | Not Available | 1008
3300031231|Ga0170824_108058824 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 504
3300031231|Ga0170824_119847731 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2086
3300031231|Ga0170824_121562543 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 1028
3300031231|Ga0170824_122001690 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingobium | 655
3300031231|Ga0170824_126700232 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1065
3300031232|Ga0302323_100761705 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 1062
3300031446|Ga0170820_16412936 | Not Available | 803
3300031474|Ga0170818_105956860 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales | 945
3300031524|Ga0302320_11383865 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiales genera incertae sedis → Ideonella → unclassified Ideonella → Ideonella sp. A 288 | 702
3300031722|Ga0311351_10908894 | Not Available | 673
3300031730|Ga0307516_10000874 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 41396
3300031788|Ga0302319_11062799 | Not Available | 764
3300031823|Ga0307478_10509166 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1004
3300031918|Ga0311367_12172681 | Not Available | 533
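
The scaffold identifiers above appear to combine two parts joined by `|`: a numeric prefix that matches a Taxon OID in the Associated Samples table, and a scaffold name. The sketch below splits an identifier on that assumption. `ScaffoldRef` and `parse_scaffold` are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str    # numeric sample identifier, e.g. "3300002245"
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumes the layout seen in the Associated Scaffolds table; raises
    ValueError when the identifier does not follow it.
    """
    taxon_oid, sep, scaffold_id = identifier.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


if __name__ == "__main__":
    ref = parse_scaffold("3300002245|JGIcombinedJ26739_100452917")
    print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the two parts separate makes it easy to group scaffolds by sample, for example to see that several scaffolds share the Taxon OID 3300005994.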




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 40.78%
Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil | 9.71%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 7.77%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 7.77%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.85%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 4.85%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 2.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.91%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 2.91%
Bog | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Bog | 2.91%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.94%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.94%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.97%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.97%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.97%
Permafrost Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil | 0.97%
Bog | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Bog | 0.97%
Palsa | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Palsa | 0.97%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.97%
Ectomycorrhiza | Host-Associated → Plants → Roots → Unclassified → Unclassified → Ectomycorrhiza | 0.97%
Host-Associated | Host-Associated → Plants → Peat Moss → Unclassified → Unclassified → Host-Associated | 0.97%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005994 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049 | Environmental
3300005995 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050 | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009633 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 | Environmental
3300009651 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil DNA_2013-063 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014495 | Permafrost microbial communities from Stordalen Mire, Sweden - 712P3M metaG | Environmental
3300014838 | Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly) | Environmental
3300015164 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-4b, rock/ice/stream interface) | Environmental
3300015193 | Arctic soil microbial communities from a glacier forefield, Rabots glacier, Tarfala, Sweden (Sample Rb6, proglacial stream) | Environmental
3300015203 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-3c, vegetated patch on medial moraine) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300017948 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_4_10 | Environmental
3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300020061 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c1 | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300024283 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK11 | Environmental
3300025463 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 (SPAdes) | Environmental
3300026291 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049 (SPAdes) | Environmental
3300026294 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050 (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027678 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027860 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MG (SPAdes) | Host-Associated
3300027895 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028783 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Bog_N3_3 | Environmental
3300029943 | I_Palsa_N3 coassembly | Environmental
3300029987 | I_Fen_E3 coassembly | Environmental
3300029989 | III_Fen_N1 coassembly | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031232 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_3 | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031524 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Bog_T0_3 | Environmental
3300031722 | II_Fen_N3 coassembly | Environmental
3300031730 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 19_EM | Host-Associated
3300031788 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Bog_T0_2 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031918 | III_Fen_N3 coassembly | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1004529172 | 3300002245 | Forest Soil | LNLWPALTDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRAYSVGGLLFSIYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLDWTGALSLAMLGEQ*
Ga0062389_1003853341 | 3300004092 | Bog Forest Soil | MPTPPSPGVAAFLTDPLRGKTSLSKVIWLYGLVGSLLYGAIELFLDPENLLSMRIYAIGGLVYTLYVIVAMYRCAANSATPARARMARISAIICLLLLPVIAYLDLTGSLTLSALGVDPNLLSPQ*
Ga0062386_1005432592 | 3300004152 | Bog Forest Soil | ALADPLQGKTSLSRVVWGYGLLGSVIYGAVELLLDPGNEFAMRAYTVGGLIFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLDLTGALSLAMLGEQ*
Ga0070730_1000077214 | 3300005537 | Surface Soil | VKISPALTDPLKGKTSLSTVVWGYGLLGSMVYGALELFLDPGNEFAMRAYAVGGLLLTVYVTVATYRCAGNCGSKFWGGMARISAVLSLLLLPLLAYLELTGALSLAMMGEQ*
Ga0066789_100023842 | 3300005994 | Soil | MSLTTALSDPLQGKTSLSRVVWIYGLLGSVLYGAIELLLDPGNEFAMRVYSVGGLLFSVYVAVATYRCAGNCSSKFWGRLAQISAILSILLLPVIAYLAFTGALSLALMGEQ*
Ga0066789_100250634 | 3300005994 | Soil | MDSQRVWSALSDPLLGKTSLSTVVWGYGLLGSIGYGAIELFLDPENEFAMRMYIVGGLIFSVYVTVATYRCAANCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0066789_101406692 | 3300005994 | Soil | WVYGLLGSVVWSAAGLLNIAGNEFTTRIYTVCGLLFSVYVTVATYRCADNCSSKFWARMARISAVLSLLLLPVMAYLDLTGALDLALLGEQ*
Ga0066789_101953632 | 3300005994 | Soil | VKVWSALSDPLQGKTSLSTVVWGYGLLGSIGYGAIELFLDPGNEFAMRMYTMGGLIFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0066789_102610971 | 3300005994 | Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRAYTVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGAISLAMLGEQ*
Ga0066790_100098496 | 3300005995 | Soil | MSLTTALSDPLQGKTSLSRVVWIYGLLGSVLYGAIELLLDPGNEFAMRVYSVGGLLFSVYVAVATYRCAGNCSSKFWGRLAQISAILSILLLPVIAYLEFTGALSLALMGEQ*
Ga0075017_1004177112 | 3300006059 | Watersheds | VTLGSLLRDPLEGKTSLAKVVWVYGLLGSLLYGALELFLDPTNELEMRVYSVGGLLLTAYVSVATYRCAGNSSSKFWGRMARVSAVASMVLLPLLAYLEFSGVLSLALMGEQ*
Ga0099795_100342103 | 3300007788 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0099792_101870671 | 3300009143 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNDFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0116129_10123173 | 3300009633 | Peatland | MRNFLDSSLWAAVADPLRGRTSLSRVVWVYGLLGSLLYGALELFLDPGNALVTGIYSVVGLLFSIYVTVATYRCADNCASKFWGRMARVSAVLSLLLLPVLAYLDFTGALSLALMGEQ*
Ga0105859_12152311 | 3300009651 | Permafrost Soil | MSLTTALSDPLQGKTSLSRVVWIYWLLGSVLYGAIELLLDPGNEFAMRVYSVGGLLFSVYVAVATYRCAGNCSSKFWGRLAQISAILSILLLPVIAYLAFTGALSLALMGEQ*
Ga0099796_104366612 | 3300010159 | Vadose Zone Soil | LKLWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRMYTVGGLLFSVYVTVAVYRCAGNCASKFWGRMARISAVLSLLLLPVLAYWELTGALSLAMLGEQ*
Ga0137392_103483272 | 3300011269 | Vadose Zone Soil | LSDPLLGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRVYTVGGLLFSVYVTVAMYRCASNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137391_102984852 | 3300011270 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRAYTVGGLLFSVYVTVAIYRCADNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137393_105112622 | 3300011271 | Vadose Zone Soil | LSDPLLGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRVYTVGGLLFSVYVTVAMYRCAGNCASKFWGRIARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137388_117284711 | 3300012189 | Vadose Zone Soil | KGADLKLWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRVYTVGGLLFSVYVTVAMYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137363_101487281 | 3300012202 | Vadose Zone Soil | SALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0137363_102184073 | 3300012202 | Vadose Zone Soil | GLLGSILYGAIELFLDPGNEFAMRMYTVGGLLFSIYVTVATYRCAGNCASRFWGRMARISAVLSLLLLPVLVYLELTGALSLAMLGEQ*
Ga0137362_101819122 | 3300012205 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRMYTVGGLLFSIYVTVATYRCAGNCKSKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137384_103735711 | 3300012357 | Vadose Zone Soil | KTSLSTVVWGYGLLGSIAYGAIELFLDPANEFAMRVYTMGGLLFSVYVTVATYRCAGNCASKFWGRTARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137360_101283192 | 3300012361 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNELAMRMYTVGGLLFSIYVTVATYRCAGNCKSKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137361_101920922 | 3300012362 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRIYTVGGLLFSIYVTVATYRCAGNCKSKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137390_110232552 | 3300012363 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSIVYGAIELFWDPGNEFAMRVYTLGGLLFSVYVTVATYRCAGNCASKFWGRAARVSAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137398_100750932 | 3300012683 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVFWGYGLLGSIAYGAIELFLDPGNDFAMRVYTVGGLLFCVYVTVATYRCAGNCASKFWGRMARISALLSLLLLPVLAYLALTGALSLAMLGEQ*
Ga0137398_101127751 | 3300012683 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSIVYGAIELFLDPDNEVAMRVYTVGGLLFTVYVTVATYRCAGNCASRFWGRMARISAVLSLLLLPVLVYLELTGALSLA
Ga0137397_100802332 | 3300012685 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRMYTVGGLLFSIYVTVATYRCAGNCKSKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137394_101920044 | 3300012922 | Vadose Zone Soil | QGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0137413_100193402 | 3300012924 | Vadose Zone Soil | MSIWPVLSDPLRGKTSLSKVVWGYGLLGSILYGLIELFLDPGNEFAMRMYSVGGLLFTIYVTVATYRCAGNCASTFWGRMARISAVLSLLLLPVLAYLDWSGALSLTMLGEQ*
Ga0137413_100662502 | 3300012924 | Vadose Zone Soil | LKVLSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0137419_104929911 | 3300012925 | Vadose Zone Soil | LSDPLQGKTSLLTVVWGYGLLGSILYGAIELFLDPGNEFAMRMYTVGGLLFSIYVTVATYRCAGNCKSKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0137416_102576492 | 3300012927 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIVYGAIELFLDPGNEFAMRVYSVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0137416_114642452 | 3300012927 | Vadose Zone Soil | DRCRRIKGADLKVWSALSDPLQGKTSLSTVFWGYGLLGSIAYGAIELFLDPGNDFAMRVYTVGGLLFCVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLEEQ*
Ga0137416_120379041 | 3300012927 | Vadose Zone Soil | MSIWPVLSDPLRGKTSLSKVVWGYGLLGSILYGLIELFLDPGNEFAMRMYSVGGLLFTIYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLDWSGALSLTMLGEQ*
Ga0137404_100962222 | 3300012929 | Vadose Zone Soil | LNDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0137404_108220101 | 3300012929 | Vadose Zone Soil | LFRCTLPALQGAGLKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRAYTLGGLLFSVYVTVAMYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLALTGALSLAMLGEQ*
Ga0137407_104074771 | 3300012930 | Vadose Zone Soil | LNDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGE
Ga0137410_1000266715 | 3300012944 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0182015_100174546 | 3300014495 | Palsa | GSAKGGRRTAQAGDDAMSGWSALIDPLKGKTSLSKVVWGYGLLGSILYGAIELALDPDNAFTMRIYSIGGLVFSVYVSVATYRCAANCASKFWGRMAQAGAVLSLLLLPVVAYLEFTGALSLALMGEQ*
Ga0182030_100228875 | 3300014838 | Bog | LSRWPWRCGAAGARTDRAVRKANQFSWPAVTDPLRGKTPLSRVVWVYGVLGSLAYGALELFLDAGSAVEMRLYSIGGLVFSIYVTVATYRCAGNCGSPFWARMARLSAVLTLLLLPLLFYLDWSGALSLALLGEQ*
Ga0167652_10281382 | 3300015164 | Glacier Forefield Soil | MRNLSTESPWAAVNDPLWGRTSLSKVVWVYGLLGSVVYGALALLVNPGNAFATRLYSIVGLLFSIYVTVAIYRCADNCSSKFCARAARVSAVLSLLLLPVIAYLDFTGALSLALIGEQ*
Ga0167668_10531791 | 3300015193 | Glacier Forefield Soil | LSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPSNEFAMRVYTVGGLLFSVYVTIATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ*
Ga0167650_10327872 | 3300015203 | Glacier Forefield Soil | LHCGSDVIRSVLLDPLLGKTSLSKVVWGYGLLGSVVYGAIEFVLDPDNEMAMRLYAVGGLLFSVYVTVATYRCAGNCASAFWARMARISAVLSLLLLPVLVYLTLAGGFTLALPVDQ*
Ga0137418_101037762 | 3300015241 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSMVVWGYGLLGSILYGAIELFLDPGNEFAMRMYTVGGLLFSIYVTVATYRCAGNCKSKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0137412_106256802 | 3300015242 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNDFAMRVYTVGGLLFCVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLALTGALSLAMLGEQ*
Ga0137409_1000204427 | 3300015245 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0137403_102534652 | 3300015264 | Vadose Zone Soil | LSDPLQGKTSLSMVVWSYGLLGSIAYGAIELFLDPGNEFAMRAYTLGGLLFSVYVTVAMYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLALTGALSLAMLGEQ*
Ga0137403_104232621 | 3300015264 | Vadose Zone Soil | LSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ*
Ga0187847_106622271 | 3300017948 | Peatland | MSRWSLVTDPLRGKTSLSRVVWGYGVLGSVLYGALEFMIDPGHEWAMRAYDLGGLLFTVYVIVATYRCAANCSSKFWGRMAQISSVLSLLLLPLIAYLYFTGALSLALLGEQ
Ga0193731_11535891 | 3300020001 | Soil | SALIEPLQGKTSLSRVVWGYGLLGSIVYGAFEFLLDPGNEFAMRAYTVAGLLFSVYVTIATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLDLTGALSLALLGEQ
Ga0193726_12158902 | 3300020021 | Soil | LKVWSALIDPLQGKTSLSRVVWGYGLLGSIVYGAIELFFDPGNEPAMRVYTVGGLLFSIYVTVAIYRCADNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ
Ga0193716_10543172 | 3300020061 | Soil | MSIWPVLSDPLQGKTSLSKVVWGYGLLGSILYGLLELFLDPGNEFAMGMYSVGGLLLTIYVTVATYRCADNCASKFWGRMARISAVLSLLLLPVLAYLAWSGALSLTMLGEQ
Ga0179594_103121192 | 3300020170 | Vadose Zone Soil | VWSALNDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ
Ga0179592_1000034012 | 3300020199 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVFWGYGLLGSIAYGAIELFLDPGNDFAMRVYTVGGLLFCVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLALTGALSLAMLGEQ
Ga0210407_101613623 | 3300020579 | Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNELAMRVYTVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Ga0210403_101638042 | 3300020580 | Soil | FSADESRPDRCRQIEGADLKVWSALSDPLHGKTSLSTVVWGYGLLGSIVYGAIELFVDPGNEFATRVYTVGGLLFSIYVTVAIYRCAGNCASKFWGRMARVSAVLSLLLLPVLAYLELTGALSLAMIGEQ
Ga0179596_103320272 | 3300021086 | Vadose Zone Soil | LLGSIAYGAIELFLDPGNEFAMRVYIVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Ga0179596_106791221 | 3300021086 | Vadose Zone Soil | VWAFLTDPLQGKTSLSKVIWLYGVLGSVLYGAIELFLDPANLGVMRAYVIGGLVFSMYVTVATYQCAMNCRSPFLGRLVRVSAVISLLLLPVIAYLDLTGALTLAALGGEQFP
Ga0210406_102879992 | 3300021168 | Soil | LKGWSALSDPLLGKTSLSTVVWAYGLLGSIVYGAIELLLDPGNEFAMRLYTVGGVLYCVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLVYLDLTGALSLAMLGEQ
Ga0210402_101979143 | 3300021478 | Soil | LKVWPALSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRMYTVGGLIFSIYVTVATYRCAGNCKSKFWGRMARLSAVLSLLLLPVFAYLELTGALSAAMLGEQ
Ga0210402_103849091 | 3300021478 | Soil | GLKVWSALSDPLQGKTSLSTVVWGYGLLGSIVYGAIELLLDPDNEFAMRLYTVGGLLFCVYVTVATYRCAGNCASKFWGRMARISVVLSLLLLPVLAYLYLTGALSLAMLGEQ
Ga0210402_119388061 | 3300021478 | Soil | MSWESFLKEPLQGKTSLSRVFWLYGVLGSVLYSALELFLIAGNEAVMRAYVIGGLLLSLYVTVATYRCAMNCGSAFLGRFVRISALISLLLLPVIAYLSLTGVLTFDLPALDGERMPE
Ga0210410_100431796 | 3300021479 | Soil | LNLWPVLTDPLQGKTSLSTVVWGYGLLGSIVYGAIELFLNPGNEFAMRAYSVGGLLFSIYVTVAMYRCADNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Ga0247670_10102192 | 3300024283 | Soil | LKLWSALTDPLQGKTSLSTVVWGYGLLGSIVYGAIELFLDPGNEFAMRVYSVGGLVYGVYATVAVYRCAGNCASQFWGRMARISAVLSLLLLPVFAYLELTGTLSLAMLGEQ
Ga0208193_10056433 | 3300025463 | Peatland | MRNFLDSSLWAAVADPLRGRTSLSRVVWVYGLLGSLLYGALELFLDPGNALVTGIYSVVGLLFSIYVTVATYRCADNCASKFWGRMARVSAVLSLLLLPVLAYLDFTGALSLALMGEQ
Ga0208193_10082392 | 3300025463 | Peatland | MRNFFESSLWAAVIDPLRGRTSLSKVVWVYGLLGSLVYGALELLLDPGNALVIRMYSVFGLLFSIYVTVATYRCADNCSSKFWARTARVSAVLSLLLLPVLAYLDFTGALSLALMGEQ
Ga0209890_100029942 | 3300026291 | Soil | MSLTTALSDPLQGKTSLSRVVWIYGLLGSVLYGAIELLLDPGNEFAMRVYSVGGLLFSVYVAVATYRCAGNCSSKFWGRLAQISAILSILLLPVIAYLEFTGALSLALMGEQ
Ga0209890_100535312 | 3300026291 | Soil | VKVWSALSDPLQGKTSLSTVVWGYGLLGSIGYGAIELFLDPGNEFAMRMYTMGGLIFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ
Ga0209890_102041751 | 3300026291 | Soil | MDSQRVWSALSDPLLGKTSLSTVVWGYGLLGSIGYGAIELFLDPENEFAMRMYIVGGLIFSVYVTVATYRCAANCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ
Ga0209839_100048245 | 3300026294 | Soil | MSLTTALSDPLQGKTSLSRVVWIYGLLGSVLYGAIELLLDPGNEFAMRVYSVGGLLFSVYVAVATYRCAGNCSSKFWGRLAQISAILSILLLPVIAYLAFTGALSLALMGEQ
Ga0179587_100202521 | 3300026557 | Vadose Zone Soil | SDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRIYTVGGLLFSIYVTVATYRCAGNCKSKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Ga0209011_12084201 | 3300027678 | Forest Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRVYAVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Ga0209166_100152205 | 3300027857 | Surface Soil | VKISPALTDPLKGKTSLSTVVWGYGLLGSMVYGALELFLDPGNEFAMRAYAVGGLLLTVYVTVATYRCAGNCGSKFWGGMARISAVLSLLLLPLLAYLELTGALSLAMMGEQ
Ga0209611_100242603 | 3300027860 | Host-Associated | MRWPGFLSDPLKGKTSLSRVVWWYGLVGSLAYGALELFLDSGNVLVMRLYIVGGLVISVYTAVATYRCAGNCRSKVWTRMAQISAILSLLLLPLIAYLELSGALDLSSLSGVL
Ga0209624_104652822 | 3300027895 | Forest Soil | VYGLLGSLVYGALELLLDSGNALVIRMYSVVGLLFSIYVTVATYRCADNCSSKFWGRMARVSAVLSLLLLPVLAYLDFTGALSLALMGEQ
Ga0209488_101305092 | 3300027903 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNDFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ
Ga0209006_100079756 | 3300027908 | Forest Soil | LNLWPALTDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRAYSVGGLLFSIYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLDWTGALSLAMLGEQ
Ga0209526_102982372 | 3300028047 | Forest Soil | ALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRVYAVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Ga0137415_101124893 | 3300028536 | Vadose Zone Soil | LKVWSALSDPLQGKTSLSAVVWGYGLLGSIVYGAIELFLDPGNEFAMRVYSVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ
Ga0137415_108190902 | 3300028536 | Vadose Zone Soil | MSIWPVLSDPLRGKTSLSKVVWGYGLLGSILYGLIELFLDPGNEFAMRMYSVGGLLFTIYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLDWSGALSLTMLGEQ
Ga0137415_112605041 | 3300028536 | Vadose Zone Soil | LSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRLYAVGGLIFSVYVTVATYRCAGNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMMGEQ
Ga0302279_100843123 | 3300028783 | Bog | MVEARVGLGRRAWSAVMDPLRGKTSLSRVVWGYGVGGSLLYGALELLIDPAEAWQTRAYSVGGLLYSLYAIVATYRCAFNCSSRFWGRMAQLSAILSLLLLPLIAYLDYSGALSLALLGE
Ga0311340_101429173 | 3300029943 | Palsa | MSLESFLKEPMQGKTSLSRVIWLYGALGSLLYGAIELFLDPGNVVVMRLYAVGGLIFSIYVTIATYRCAMNCKSAALGRFVRVSAVISLFLLPVLFYLDLSGALTLSGLAGDELPN
Ga0311334_113017901 | 3300029987 | Fen | LNLWSALKDPLQGKTSLSTVVWGYGLLGSIVYGAVELLLDPGNEFAMRAYTVGGLIFSIYVTIATYRCADNCASKFWARMARLSAVLSLLLLPVFAYLELTGALTLAALGEQ
Ga0311365_112464412 | 3300029989 | Fen | LNLWSALKDPLQGKTSLSTVVWGYGLLGSIVYGAVELLLDPGNEFAMRAYTVGGLIFSIYVTIATYRCADNCASKFWARMARLSAVLSLLLLPVFAYLELTD
Ga0170823_168481592 | 3300031128 | Forest Soil | LKVWSALSDPLLGKTSLSRVVWGYGLLGSIVYGAIELLLDPGNEFAMRLYTVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVLAYLDLTGALSLALLGEQ
Ga0170824_1080588241 | 3300031231 | Forest Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIAYGAIELFLDPGNEFAMRAYSVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPV
Ga0170824_1198477312 | 3300031231 | Forest Soil | LKAWSALSDPLQGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFAMRTYTVGGLLLSIYVTVATYRCAGNCKSKFWARMARISAVLSLLLLPVLAYLDLTGALSLAMLGEQ
Ga0170824_1215625431 | 3300031231 | Forest Soil | LKVWSALSDPLEGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFVMRMYTVGGLLFSIYLSIATYRCAGNCKSKFWARMARISAVLSLLLLPVFAYL
Ga0170824_1220016902 | 3300031231 | Forest Soil | LKLWSALSDPLQGKTSLSTVFWGYGLLGSIAYGAIELFLDPGNEFAMRVYTVGGLLFCVYVTVATYRCAGNCASKFLGRMARISAVLSLLLLPVLAYLALSGALSLAMLGEQ
Ga0170824_1267002322 | 3300031231 | Forest Soil | LKVWSALSDPLQGKTSLSTVVWGYGLLGSIVYGAIELLLDPGNELAMRVYSVGGLLFSVYVTVATYRCAGNCASKFWGRMARISAVLSLLLLPVFAYLELTGALRLAMLGEQ
Ga0302323_1007617052 | 3300031232 | Fen | LNLWSALKDPLQGKTSLSTVVWGYGLLGSIVYGAVELLLDPGNEFAMRAYTVGGLIFSIYVTIATYRCADNCASKFWARMARLSAVLSLLLLPVFAYLELTGALTLATLGEQ
Ga0170820_164129362 | 3300031446 | Forest Soil | YGLLGSILYGAIELFLDPGNEFAMRTYTVGGLLLSIYVTVATYRCAGNCKSKFWARMARISAVLSLLLLPVLAYLDLTGALSLAMLGEQ
Ga0170818_1059568602 | 3300031474 | Forest Soil | LKVWSALSDPLEGKTSLSTVVWGYGLLGSILYGAIELFLDPGNEFVMRMYTAGGVILSVYVIVATYRCAENCASKFWGRMARLSAVLSALLLPVLVYLELTGALSLAMMGEQ
Ga0302320_113838651 | 3300031524 | Bog | VMDPLRGKTSLSRVVWGYGVGGSLLYGALELLIDPAEAWQTRAYSVGGLLYSLYAIVATYRCAFNCSSRFWGRMAQLSAILSLLLLPLIAYLDYSGALSLALLGEQ
Ga0311351_109088941 | 3300031722 | Fen | LLGSIVYGAVELLLDPGNEFAMRAYTVGGLIFSIYVTIATYRCADNCASKFWARMARLSAVLSLLLLPVFAYLELTGALTLAALGEQ
Ga0307516_1000087434 | 3300031730 | Ectomycorrhiza | VKAWSALTDPLQGKTSLSTVVWGYGLLGSIVYGAIELFLDPGNELATRVYILGGLLFSVYVTVATYRCAGNCASKFWARAARVSAVLSLLLLPVFAYLELSGALSLAMLGEQ
Ga0302319_110627991 | 3300031788 | Bog | MVEARVGLGRRAWSAVMDPLRGKTSLSRVVWGYGVGGSLLYGALELLIDPAEAWQTRAYSVGGLLYSLYAIVATYRCAFNCSSRFWGRMAQLSAILSLLLLPLIAYLD
Ga0307478_105091662 | 3300031823 | Hardwood Forest Soil | LNLWPVLTDPLQGKTSLSTVVWGYGLLGSIVYGAIELFLDPGNEFAMRAYSVGGLLFSIYVTVAMYRCADNCASKFWARMARISAVLSLLLLPVLAYLELTGALSLAMLGEQ
Ga0311367_121726811 | 3300031918 | Fen | LSTVVWGYGLLGSIVYGAVELLLDPGNEFAMRAYTVGGLIFSIYVTIATYRCADNCASKFWARMARLSAVLSLLLLPVFAYLELTGALTLAALGEQ

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice