NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F074441

Metagenome / Metatranscriptome Family F074441

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F074441
Family Type Metagenome / Metatranscriptome
Number of Sequences 119
Average Sequence Length 96 residues
Representative Sequence MTNHEVLLRLSVNQHDHMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGALMRRMAGHARLYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Number of Associated Samples 73
Number of Associated Scaffolds 119

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 32.77 %
% of genes near scaffold ends (potentially truncated) 30.25 %
% of genes from short scaffolds (< 2000 bps) 67.23 %
Associated GOLD sequencing projects 66
AlphaFold2 3D model prediction Yes
3D model pTM-score0.82

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (64.706 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(33.613 % of family members)
Environment Ontology (ENVO) Unclassified
(47.059 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.580 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 61.29%    β-sheet: 0.00%    Coil/Unstructured: 38.71%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.82
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
a.177.1.1: Sigma2 domain of RNA polymerase sigma factorsd1or7a21or70.6646
a.177.1.0: automated matchesd4g7hf14g7h0.6437
a.177.1.1: Sigma2 domain of RNA polymerase sigma factorsd1siga11sig0.62649
c.72.1.5: PfkB-like kinased1lhpa_1lhp0.5889
c.72.1.2: Thiamin biosynthesis kinasesd1v8aa_1v8a0.57581


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 119 Family Scaffolds
PF13426PAS_9 5.88
PF04185Phosphoesterase 2.52
PF01182Glucosamine_iso 1.68
PF13407Peripla_BP_4 1.68
PF00881Nitroreductase 1.68
PF00905Transpeptidase 1.68
PF04715Anth_synt_I_N 1.68
PF00990GGDEF 1.68
PF00154RecA 1.68
PF00722Glyco_hydro_16 1.68
PF11528DUF3224 1.68
PF13525YfiO 1.68
PF13432TPR_16 1.68
PF14310Fn3-like 0.84
PF04116FA_hydroxylase 0.84
PF13487HD_5 0.84
PF02518HATPase_c 0.84
PF01872RibD_C 0.84
PF01255Prenyltransf 0.84
PF00691OmpA 0.84
PF17148DUF5117 0.84
PF02310B12-binding 0.84
PF076987TM-7TMR_HD 0.84
PF00563EAL 0.84
PF10127RlaP 0.84
PF13401AAA_22 0.84
PF13714PEP_mutase 0.84
PF02954HTH_8 0.84
PF13174TPR_6 0.84
PF07238PilZ 0.84
PF09907HigB_toxin 0.84
PF01761DHQ_synthase 0.84
PF04055Radical_SAM 0.84
PF00072Response_reg 0.84

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 119 Family Scaffolds
COG0147Anthranilate/para-aminobenzoate synthases component IAmino acid transport and metabolism [E] 3.36
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 2.52
COG03636-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminaseCarbohydrate transport and metabolism [G] 1.68
COG0468RecA/RadA recombinaseReplication, recombination and repair [L] 1.68
COG2273Beta-glucanase, GH16 familyCarbohydrate transport and metabolism [G] 1.68
COG0020Undecaprenyl pyrophosphate synthaseLipid transport and metabolism [I] 0.84
COG0262Dihydrofolate reductaseCoenzyme transport and metabolism [H] 0.84
COG1480Cyclic di-AMP-specific phosphodiesterase PgpH, HD superfamilySignal transduction mechanisms [T] 0.84
COG1985Pyrimidine reductase, riboflavin biosynthesisCoenzyme transport and metabolism [H] 0.84
COG2200EAL domain, c-di-GMP-specific phosphodiesterase class I (or its enzymatically inactive variant)Signal transduction mechanisms [T] 0.84
COG3000Sterol desaturase/sphingolipid hydroxylase, fatty acid hydroxylase superfamilyLipid transport and metabolism [I] 0.84
COG3434c-di-GMP phosphodiesterase YuxH/PdeH, contains EAL and HDOD domainsSignal transduction mechanisms [T] 0.84
COG4943Redox-sensing c-di-GMP phosphodiesterase, contains CSS-motif and EAL domainsSignal transduction mechanisms [T] 0.84
COG5001Cyclic di-GMP metabolism protein, combines GGDEF and EAL domains with a 6TM membrane domainSignal transduction mechanisms [T] 0.84


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms64.71 %
UnclassifiedrootN/A35.29 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001154|JGI12636J13339_1003449All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2626Open in IMG/M
3300002907|JGI25613J43889_10023109All Organisms → cellular organisms → Bacteria → Acidobacteria1756Open in IMG/M
3300002914|JGI25617J43924_10071329All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1270Open in IMG/M
3300004100|Ga0058904_1414188Not Available1309Open in IMG/M
3300004104|Ga0058891_1198661Not Available554Open in IMG/M
3300004479|Ga0062595_101345891Not Available646Open in IMG/M
3300004631|Ga0058899_11997735All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium562Open in IMG/M
3300004631|Ga0058899_12051348Not Available703Open in IMG/M
3300005541|Ga0070733_10502290All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium811Open in IMG/M
3300005542|Ga0070732_10195879All Organisms → cellular organisms → Bacteria1207Open in IMG/M
3300005602|Ga0070762_10015334All Organisms → cellular organisms → Bacteria → Acidobacteria3918Open in IMG/M
3300007258|Ga0099793_10001530All Organisms → cellular organisms → Bacteria7439Open in IMG/M
3300009038|Ga0099829_10384937All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1158Open in IMG/M
3300009038|Ga0099829_10801321All Organisms → cellular organisms → Bacteria → Acidobacteria782Open in IMG/M
3300009088|Ga0099830_10340995All Organisms → cellular organisms → Bacteria → Acidobacteria1203Open in IMG/M
3300009088|Ga0099830_10776429Not Available790Open in IMG/M
3300011120|Ga0150983_10304952Not Available881Open in IMG/M
3300011120|Ga0150983_10424922All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium925Open in IMG/M
3300011120|Ga0150983_16475587All Organisms → cellular organisms → Bacteria → Acidobacteria756Open in IMG/M
3300011269|Ga0137392_10085679All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2454Open in IMG/M
3300011270|Ga0137391_10106618All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2428Open in IMG/M
3300012096|Ga0137389_10384687All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1197Open in IMG/M
3300012189|Ga0137388_11231374Not Available686Open in IMG/M
3300012202|Ga0137363_10007857All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6691Open in IMG/M
3300012202|Ga0137363_10409245All Organisms → cellular organisms → Bacteria1131Open in IMG/M
3300012203|Ga0137399_10101379Not Available2227Open in IMG/M
3300012203|Ga0137399_10707392Not Available848Open in IMG/M
3300012203|Ga0137399_11026090Not Available694Open in IMG/M
3300012203|Ga0137399_11026187Not Available694Open in IMG/M
3300012205|Ga0137362_11316826Not Available608Open in IMG/M
3300012361|Ga0137360_10235047All Organisms → cellular organisms → Bacteria → Acidobacteria1499Open in IMG/M
3300012362|Ga0137361_10294686All Organisms → cellular organisms → Bacteria → Acidobacteria1484Open in IMG/M
3300012582|Ga0137358_10166098Not Available1504Open in IMG/M
3300012927|Ga0137416_10279013All Organisms → cellular organisms → Bacteria1377Open in IMG/M
3300012927|Ga0137416_10289765All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_20CM_2_59_71353Open in IMG/M
3300012927|Ga0137416_10549973Not Available1000Open in IMG/M
3300012930|Ga0137407_10000842All Organisms → cellular organisms → Bacteria20183Open in IMG/M
3300020170|Ga0179594_10031450All Organisms → cellular organisms → Bacteria → Acidobacteria1697Open in IMG/M
3300020170|Ga0179594_10079577Not Available1153Open in IMG/M
3300020199|Ga0179592_10059430All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1744Open in IMG/M
3300020199|Ga0179592_10100232All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1330Open in IMG/M
3300020579|Ga0210407_10000343All Organisms → cellular organisms → Bacteria54950Open in IMG/M
3300020579|Ga0210407_10006610All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae8849Open in IMG/M
3300020579|Ga0210407_10034237All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3778Open in IMG/M
3300020579|Ga0210407_10148908All Organisms → cellular organisms → Bacteria → Acidobacteria1803Open in IMG/M
3300020579|Ga0210407_11079008Not Available609Open in IMG/M
3300020580|Ga0210403_10002697All Organisms → cellular organisms → Bacteria15870Open in IMG/M
3300020580|Ga0210403_10057415All Organisms → cellular organisms → Bacteria → Acidobacteria3117Open in IMG/M
3300020580|Ga0210403_10579526Not Available907Open in IMG/M
3300020580|Ga0210403_10674343Not Available830Open in IMG/M
3300020581|Ga0210399_10631843Not Available884Open in IMG/M
3300020581|Ga0210399_10719773Not Available819Open in IMG/M
3300020582|Ga0210395_10281773All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1250Open in IMG/M
3300020583|Ga0210401_10922093All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium732Open in IMG/M
3300021046|Ga0215015_10170401All Organisms → cellular organisms → Bacteria2731Open in IMG/M
3300021168|Ga0210406_10025436All Organisms → cellular organisms → Bacteria5437Open in IMG/M
3300021168|Ga0210406_10061064All Organisms → cellular organisms → Bacteria → Acidobacteria3268Open in IMG/M
3300021168|Ga0210406_10110487All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2330Open in IMG/M
3300021168|Ga0210406_11361476Not Available508Open in IMG/M
3300021170|Ga0210400_10000013All Organisms → cellular organisms → Bacteria → Acidobacteria372411Open in IMG/M
3300021170|Ga0210400_10139124All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidipila → unclassified Acidipila → Acidipila sp.1949Open in IMG/M
3300021171|Ga0210405_10031779All Organisms → cellular organisms → Bacteria4227Open in IMG/M
3300021180|Ga0210396_11271018Not Available613Open in IMG/M
3300021405|Ga0210387_10049473All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3380Open in IMG/M
3300021405|Ga0210387_11362029Not Available611Open in IMG/M
3300021405|Ga0210387_11490197Not Available579Open in IMG/M
3300021407|Ga0210383_11342336All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium596Open in IMG/M
3300021420|Ga0210394_10634276All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium938Open in IMG/M
3300021478|Ga0210402_10052886All Organisms → cellular organisms → Bacteria3554Open in IMG/M
3300021478|Ga0210402_11687667Not Available560Open in IMG/M
3300021559|Ga0210409_10275237All Organisms → cellular organisms → Bacteria → Acidobacteria1521Open in IMG/M
3300021559|Ga0210409_10292337All Organisms → cellular organisms → Bacteria → Acidobacteria1470Open in IMG/M
3300021559|Ga0210409_11345000All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidipila → unclassified Acidipila → Acidipila sp.590Open in IMG/M
3300022525|Ga0242656_1126195Not Available522Open in IMG/M
3300022530|Ga0242658_1164806All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium581Open in IMG/M
3300024178|Ga0247694_1000001All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia262418Open in IMG/M
3300024182|Ga0247669_1000017All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia161539Open in IMG/M
3300024330|Ga0137417_1004040Not Available502Open in IMG/M
3300024330|Ga0137417_1437019All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1616Open in IMG/M
3300024331|Ga0247668_1111299Not Available555Open in IMG/M
3300026374|Ga0257146_1004118All Organisms → cellular organisms → Bacteria2416Open in IMG/M
3300026377|Ga0257171_1001517All Organisms → cellular organisms → Bacteria3217Open in IMG/M
3300026515|Ga0257158_1020502All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium1107Open in IMG/M
3300026551|Ga0209648_10050497All Organisms → cellular organisms → Bacteria → Acidobacteria3563Open in IMG/M
3300026557|Ga0179587_10140670All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1497Open in IMG/M
3300026557|Ga0179587_10609943Not Available718Open in IMG/M
3300026557|Ga0179587_11163801Not Available507Open in IMG/M
3300027671|Ga0209588_1236071Not Available562Open in IMG/M
3300027674|Ga0209118_1000042All Organisms → cellular organisms → Bacteria → Acidobacteria95085Open in IMG/M
3300027862|Ga0209701_10433115Not Available727Open in IMG/M
3300027862|Ga0209701_10532192All Organisms → cellular organisms → Bacteria634Open in IMG/M
3300027867|Ga0209167_10395578All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium753Open in IMG/M
3300027875|Ga0209283_10035920All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3109Open in IMG/M
3300027903|Ga0209488_10425556Not Available980Open in IMG/M
3300028536|Ga0137415_10213152All Organisms → cellular organisms → Bacteria1747Open in IMG/M
3300028536|Ga0137415_11084582Not Available613Open in IMG/M
3300028906|Ga0308309_10002189All Organisms → cellular organisms → Bacteria11610Open in IMG/M
3300029701|Ga0222748_1127296All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium519Open in IMG/M
3300031057|Ga0170834_109019141Not Available604Open in IMG/M
3300031718|Ga0307474_10068936All Organisms → cellular organisms → Bacteria2625Open in IMG/M
3300031720|Ga0307469_10006607All Organisms → cellular organisms → Bacteria5292Open in IMG/M
3300031720|Ga0307469_11132723Not Available737Open in IMG/M
3300031753|Ga0307477_10000022All Organisms → cellular organisms → Bacteria → Acidobacteria264958Open in IMG/M
3300031753|Ga0307477_10008304All Organisms → cellular organisms → Bacteria7204Open in IMG/M
3300031753|Ga0307477_10985543Not Available553Open in IMG/M
3300031754|Ga0307475_10048509All Organisms → cellular organisms → Bacteria3181Open in IMG/M
3300031754|Ga0307475_10058712All Organisms → cellular organisms → Bacteria → Acidobacteria2912Open in IMG/M
3300031754|Ga0307475_10168249All Organisms → cellular organisms → Bacteria1746Open in IMG/M
3300031820|Ga0307473_10826928Not Available663Open in IMG/M
3300031962|Ga0307479_10020839All Organisms → cellular organisms → Bacteria6200Open in IMG/M
3300031962|Ga0307479_10094285All Organisms → cellular organisms → Bacteria → Acidobacteria2905Open in IMG/M
3300031962|Ga0307479_10148087All Organisms → cellular organisms → Bacteria2302Open in IMG/M
3300031962|Ga0307479_10252247All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1742Open in IMG/M
3300031962|Ga0307479_10425384All Organisms → cellular organisms → Bacteria1312Open in IMG/M
3300031962|Ga0307479_11327622Not Available680Open in IMG/M
3300031962|Ga0307479_11825738Not Available559Open in IMG/M
3300032174|Ga0307470_11807141Not Available517Open in IMG/M
3300032180|Ga0307471_100164304All Organisms → cellular organisms → Bacteria → Acidobacteria2161Open in IMG/M
3300032205|Ga0307472_100468901Not Available1076Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil33.61%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil32.77%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil16.81%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil7.56%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.52%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.52%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil1.68%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.84%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.84%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.84%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001154Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1EnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300004100Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF244 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004104Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF218 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022525Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022530Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024178Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35EnvironmentalOpen in IMG/M
3300024182Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300024331Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09EnvironmentalOpen in IMG/M
3300026374Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-AEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027867Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029701Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12636J13339_100344923300001154Forest SoilMGRMTNREAIQRLSINKHDPIAVTSLHDNNAAIIRAAIARYFGAGPVADKAECALMQRMAAHARLYEHPEDLDAWLTRCANTECDRLRNEAIWEKADKD*
JGI25613J43889_1002310933300002907Grasslands SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLAGCANTECDRMRNEAIHDKANTD*
JGI25617J43924_1007132923300002914Grasslands SoilMTNHEALLRLSVDKNDDLAVTSLHENNAEIIRTTVIRYFGSGTAAGNVEFELMKRMANHARLYEDSEDLDVWXARCANTECDRLRNEAIRDKANRD*
Ga0058904_141418823300004100Forest SoilMTNHEVLLRLSVNKHDNGAVTSLHDNNAEIIRTAVIRYFGTGTVADNVECALMKRMAGQARLYERSEDPDAWLARCANTECDRLRNEAIRDKAEKD*
Ga0058891_119866113300004104Forest SoilMTNQEALLRLSVNKHDLLAVSSLRDNNPTIIRSAMVRYFGTAPVEGNVESALMKRLADHARLYDLSEDPEAWLVRCATVECDRLRNEAIRKKADND*
Ga0062595_10134589113300004479SoilMTNREALMRLSADKNDRMAMTSLHDNNAHIIRTTVGRYFGAGTIADNAEFALMQRLVDHARLYEDSEDPEAWLARCTNTECDRLRNEA
Ga0058899_1199773513300004631Forest SoilMTNREAIQRLSVNKYDRLAVTSLHENNAGIIRAAIDRYFGAGPVADKAESALMQRVADHARLYEHPEDPDAWLARCANTECDRLRNEAIWQKADKD*
Ga0058899_1205134823300004631Forest SoilMTNLEAIQRLAVNKHDPIAVTSLHDNNAAIIRAVIGRYFGSGPVADKAECALMKRMAAHARLYEHPEDLDAWLTRCANTECDRLRNEAIWEKADKD*
Ga0070733_1050229013300005541Surface SoilERLSANKHDHLAAISLHENNAEIIRTTVIRFFGTGIVADKAEAALMRRMADHARLYEHHEDPEVWLARCASTECDRLRNEAIREKANKD*
Ga0070732_1019587923300005542Surface SoilHLAATSLHENNAEIIRTTVIRFFGTGIVADKAEAALMRRMADHARLYEHHEDPEVWLARCASTECDRLRNEAIREKANKD*
Ga0070762_1001533433300005602SoilVARVDSVKAGYVGIMTNHEVLLKLSVDKNDHLAVTSLLENNAEIIRMTVVRYFGTGTFAEGMEFALLERMADHARLYEDSEDPDVWLADCANTECDRLRNEAIREKANRD*
Ga0099793_1000153043300007258Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMANHARLYEDSEDLDVWLACCANTECDRMRNEAIRDKANRD*
Ga0099829_1038493713300009038Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVIRYFGSGTAAGNVEFELMKRMANHARLYEDSEDLDVWLARCANTECDRLRNEAIRDKANRD*
Ga0099829_1080132113300009038Vadose Zone SoilNKHDHLAATSLHHNNAEIIRTTVIRYFGTGMVADKAGSVLMRRVADNARLYEHSEDPDAWLARCANTECNRMRNEAIYEKANRA*
Ga0099830_1034099523300009088Vadose Zone SoilMTNREAIQRLSVNKHDHLAVKSLHDNNAEVIRTAVIRYFGTGAVADKAESALMERMAESARSYAPQESPDEWLARCANTECDRLRNEAIHEKANKD*
Ga0099830_1077642913300009088Vadose Zone SoilMTNLEAIERLSVNKHDHLAATSLHHNNAEIIRTTVIRYFGTGMVADKAGSVLMRRVADNARLYEHSEDPDAWLARCANTECNRMRNEAIYEKANRA*
Ga0150983_1030495213300011120Forest SoilMTNHEVLLRLSVNKHDNGAVTSLHDNNAEIIRTAVIRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLARCANTECDRLRNEAIRDKAEKD*
Ga0150983_1042492213300011120Forest SoilMTNHEVLLRLSVNQHDHSAVTSLHDNNVEIIRTTVIRYFGTGTVPDNVEGALMQRMANHARLYERSEDPDAWLARCANTECDRLRNEAIRDKAEKD*
Ga0150983_1647558723300011120Forest SoilMTNREAIQRLSVNKYDRLAVTSLHENNAGIIRAAIGRYFGAGPVADKAESALMQRVADHARLYEHPEDPDAWLARCANTECDRLRNEAIWQKADKD*
Ga0137392_1008567933300011269Vadose Zone SoilMTNHEVLLRLSVNKHDNSAVTSLHDNNVEIIRTTVVRYFGTGTVADNVECALMQRVAGQARLYERSEDPDAWLARCANMECDRLRNEAIRDKAEKD*
Ga0137391_1010661833300011270Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAGNVEFELMKRMANHARLYEDSEDLDVWLARCANTECDRLRNEAIRDKANRD*
Ga0137389_1038468733300012096Vadose Zone SoilDSVKAGCFGVMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTRVIRYFGSGTAAGNVEFELMKRMANHARLYEDSEDLDVWLARCANTECDRLRNEAIRDKANRD*
Ga0137388_1123137413300012189Vadose Zone SoilVDKNDHLAVASLQDNNAEVIHTTVMRYFRTGTGGDNLEFALMRRMADHARFYKHSEDADAWLARCANTECDRLRNETIHDKANRD*
Ga0137363_1000785713300012202Vadose Zone SoilNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLAGCANTECDRMRNEAIHDKANTD*
Ga0137363_1040924533300012202Vadose Zone SoilMTNHEVLLRLSVNQHDYIAVASLRHNNAEIIRTTVIRYFGTGTVPDNVEGTLMQRMAGHARSYERSEDPDAWLARCANTECDRLRNEAIRDTPTAAASSRPTPRMNCGLE*
Ga0137399_1010137943300012203Vadose Zone SoilMTNHEVLLRLSVNKHDNSAVTSLHDNNVEIIRTTVIRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLARCANTECDRLRNEAIRDKAEKD*
Ga0137399_1070739223300012203Vadose Zone SoilMDSLKAWCFGVMTNHEALLRLSADKNDHLAVASLHDNNAEIIRTTVMRYFRTVTGGDNLEFALMQRMADHARFYKHSEDADAWLARCANTECDRLRNEAIHDKANRD*
Ga0137399_1102609023300012203Vadose Zone SoilSRGICVASVDSVKAGCFGVMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMADHARLYEDSEDLDVWLACCANTECDRMRNEAIRDKANRD*
Ga0137399_1102618723300012203Vadose Zone SoilSRGICVASVDSVKAGCFGVMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMANHARLYEDSEDLDVWLACCANTECDRMRNEAIRDKANRD*
Ga0137362_1131682623300012205Vadose Zone SoilMTNHEALLRLSVDKNDHLAVASLQDNNAEVIHTTVMRYFRTGTGGDNLEFALMRRMADHARFYKHSEDADAWLARCANTECDRLRNEAIHD
Ga0137360_1023504733300012361Vadose Zone SoilMTNHEALLRLSVDKNDDLAVTSLHENNAEIIRTTVIRYFGSGTAAGNVEFELMKRMANHARLYEDSEDLDVWLARCANTECDRLRNEAIRDKANRD*
Ga0137361_1029468623300012362Vadose Zone SoilMTNHEALLRLSADKNDHLTVMSLHDNNAEIIRTTVMHYFGSGTAAENVEFELMKRMAEHARLYEDSEDLAVWLARCANTECDRLRNEAIRDKANRN*
Ga0137358_1016609823300012582Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMADHARLYEDSEDLDVWLACCANTECDRMRNEAIRDKANRD*
Ga0137416_1027901323300012927Vadose Zone SoilMTNREAIQRLSVNKHDHLAVKSLHDNNAEVIRTTVIRYFGTGPVADKAETALMERMAESARSYAPQESPDEWLARCANTECDRLRNEAIHEKANKD*
Ga0137416_1028976523300012927Vadose Zone SoilMDSLKAGCFGVMTNHEALLRLSVDKNDHLAVASLQDNNAEVIHTTVMRYFRTGTGGDNLEFALMRRMADHARFYKHSEDADAWLARCANTECDRLRNEAIHDKANGD*
Ga0137416_1054997323300012927Vadose Zone SoilAVTSLHDNNVEIIRTTVIRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLARCADTECDRLRNEAIRDKAEKD*
Ga0137407_1000084273300012930Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNGEIIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLAGCANTECDRMRNEAIHDKANTD*
Ga0179594_1003145043300020170Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLAGCANTECDRMRNEAIHDKANTD
Ga0179594_1007957723300020170Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMADHARLYEDSEDLDVWLACCANTECDRMRNEAIRDKANRD
Ga0179592_1005943023300020199Vadose Zone SoilMTNREVLLRLSVNKHDRLAVVSLHDNNAEIIHTAVIRYFGTGTVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADRN
Ga0179592_1010023213300020199Vadose Zone SoilMTNHEVLLRLSVNKHDNSAVTSLHDNNVEIIRTTVVRYFGTGTVADNLGCALMQRMASQARLYERSEDPDAWLARCANMECDRLRNEAIRDKAEKD
Ga0210407_10000343473300020579SoilMTNHEVLLRLSVNQHDDMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANKD
Ga0210407_1000661063300020579SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLASCANTECDRMRNEAIHDKANTD
Ga0210407_1003423733300020579SoilMTNHEVLLRLSVNKHDNGAVTSLHDNNAEIIRTAVIRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLARCANTECDRLRNEAIRDKAEKD
Ga0210407_1014890823300020579SoilMTNREAIERLSVNKHDHLAVTSLHENNAEIIRTTVIRYFGTGTVADKAEAALMRRMADHARLYEHHEDPEVWLARCASTECDRLRNEAIREKANKD
Ga0210407_1107900823300020579SoilAVMTNHEALLRLSVDKNDRLAVTSLHENNAEIIRTTVKRYFGSGTATENVESALMKRMADNARSYERSENPDVWLARCANTECDRLRNEAIRDKANRD
Ga0210403_1000269773300020580SoilMTNHEVLLRLSVNQHDHMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARFYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210403_1005741523300020580SoilMTNHEVLLRLSVNQHDHMAVASLHDNNVKIIRTTVIRYFGTGTVPDNVEGALMQRMADHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210403_1057952613300020580SoilMTNHEVLLRLSVNQHDHMAVESLYDNNSIIIRTTVIRYFGTGTVPDNLEGTLMQRMADHARLYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210403_1067434313300020580SoilSVNKHDRLAVTSLHDNNAGIIRTTVMRYFGTGAVADKIERALMQRMADNARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADRN
Ga0210399_1063184323300020581SoilMTNHEALLRLSINKNDHLAVTSLHDNNAETIRIAIMRYFGAGTVAENVEAALMRRVADHARLYEGSENADAWLTRCANTECDRLRNERIRDKADRQ
Ga0210399_1071977313300020581SoilHEVLLRLSVKKHDNGAVTSLHDNNAEIIRTAVIRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLARCANTECDRLRNEAIRDKAEKD
Ga0210395_1028177323300020582SoilMTNHEVLLRLSVNKHDNGAVTSLHDNNAEIIRTAVIRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLTRCANTECDRLRNEAIRDKAEKD
Ga0210401_1092209323300020583SoilMTNHEVLLRLSINQHDDMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANKD
Ga0215015_1017040133300021046SoilMTNREAIQRLAVNKDDRVAVTSLHDNNAEVIRTTVTRYFGTGKVADKAESALMERMAGSARSYQHPEDPDAWLARCANTECDRLRNEAIHEKANKD
Ga0210406_1002543633300021168SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEVIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLASCANTECDRMRNEAIHDKANTD
Ga0210406_1006106443300021168SoilHEVLLRLSVNQHDDMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANKD
Ga0210406_1011048713300021168SoilMTNHEVLLRLSVNQHDHMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGALMRRMAGHARLYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210406_1136147613300021168SoilMTNHEVLLRLSVNQHDHMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMAGHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210400_100000132553300021170SoilMTNREAIERLSVNKHDHLAVTSLHDNNTEIIRTTVIRYFGTGKVADKAEAALMRRMADHARLYEHHEDPEVWLARCASTECDRLRNEAIREKANKD
Ga0210400_1013912423300021170SoilMTNHEALLRLAVNKNDHLAVTSLRENNTEIIHTTVIRYFGTGAIAENLEIALMKRLADHARLYEGSEDPDVWLAHCADSECDRLRNEAIREKANSE
Ga0210405_1003177943300021171SoilMTNHEVLLRLSVNQHDHSAVTSLHDNNVEIIRTTVIRYFGTGTVPDNVEGALMQRMANHARLYERSEDPDAWLARCADTECDRLRNEAIRDKANRD
Ga0210396_1127101813300021180SoilMTNHEVLLRLSVNQHDHSAVISLHDNNVEIIRTTVIRYFGTGTVPDNVEGALMQRMANHARLYERSEDPDAWLARCADTECDRLRNEAIRDKASRD
Ga0210387_1004947343300021405SoilMTNHEVLLRLSVNQHDDMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARSYERSEDPDAWLARCANSECDRLRNEAIRDKANKD
Ga0210387_1136202913300021405SoilMTNHEVLLRLSINQHDDMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLTQRMADHAECDRLRNEAIRDKANKD
Ga0210387_1149019723300021405SoilTRSGAIVTNYEVLLRLSINQHDLMAVASLHDNNAKIIRTTVIRYFGTGTVPDKVEGALMRRMAGNARLYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210383_1134233613300021407SoilMTNHEVLLRLSVNQHDHMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARSYERSEDPDAWPARCANTECDRLRNEAIRDKANKN
Ga0210394_1063427623300021420SoilLSVNQHDHMAVASLHDNNVKIIRTTVIRYFGTGTVPDNVEGALMQRMADHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210402_1005288633300021478SoilVDSVKATRSGAIVTNHEVLLRLSVNQHDLIAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARLYERSEDPDAWLARCANTECDRLRNEAIRDKANKD
Ga0210402_1168766713300021478SoilCGAIMTNHEVLLRLSVNQHDHMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMAGHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0210409_1027523723300021559SoilMTNLEAIQRLAVNKHDPIAVTSLHDNNAAIIRAVIGRYFGSGPVADKAECALMKRMAAHARLYEHPEDLDAWLMRCANTECDRLRNEAIWEKADED
Ga0210409_1029233733300021559SoilMTNHEVLLRLSINQHDDMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLTQRMADHAECDRLRNEAIRDKAEKD
Ga0210409_1134500023300021559SoilNDHLAVTSLRENNTEIIHTTVIRYFGTGAIAENLEIALMKRLADHARLYEGSEDPDVWLAHCADSECDRLRNEAIREKANSE
Ga0242656_112619523300022525SoilMTNHEVLLRLSVNKHDNGAVTSLHDNNAEIIRTAVIRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLARCANTECDRLRNEA
Ga0242658_116480613300022530SoilMTNHEVLLRLSINQHDDMAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARSYERSEDPDAWLARCANTECDRLRNEAIRDKANK
Ga0247694_10000011193300024178SoilMTNREALMRLSADKNDRMAVTSLHDNNAQIIRTTVGRYFGAGTIADNAEFALMQRLADHARLYEDSEDPEAWLARCTNTECDRLRNEAVHDKANRD
Ga0247669_1000017683300024182SoilMTNREALMRLSADKNDRMAMTSLHDNNAHIIRTTVGRYFGAGTIADNAEFALMQRLVDHARLYEDSEDPEAWLARCTNTECDRLRNEAVHDKANRD
Ga0137417_100404013300024330Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMANHARLYEDSEDLDVWLACCANTECDRMRNEAIRDKANRRLGSHQK
Ga0137417_143701923300024330Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMANHARLYEDSEDLDVWLACCANTECDRMRNEAIRDKANRD
Ga0247668_111129913300024331SoilMRLSADKNDRMAVTSLHDNNAQIIRTTVGRYFGAGTIADNAEFALMQRLADHARLYEDSEDPEAWLARCTNTECDRLRNEAVHDKANRD
Ga0257146_100411843300026374SoilMTNHEVLLRLSVNQHDYIAVASLRHNNAEIIRTTVIRYFGTGTVPDNVEGTLMQRMAGHARSYERSEDPDAWLARCAHTECDRLRNEAIRDTPTAAASSRPTPRMNCGLEYSR
Ga0257171_100151753300026377SoilMTNHEVLLRLSVNQHDYIAVASLRHNNAEIIRTTVIRYFGTGTVPDNVEGTLMQRMAGHARSYERSEDPDAWLARCANTECDRLRNEAIRDTPTAAASSRPTPRMNCGLEYSR
Ga0257158_102050223300026515SoilNYNEHEVLLRLSVNKHDNSAVTSLHDNNVEIIRTTMVRYFGTGTVADNLECALMQRMASQARLYERSEDPDAWLARCANMECDRLRNEAIRDKAEKD
Ga0209648_1005049733300026551Grasslands SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVIRYFGSGTAAGNVEFELMKRMANHARLYEDSEDLDVWLARCANTECDRLRNEAIRDKANRD
Ga0179587_1014067013300026557Vadose Zone SoilAIMTNREVLLRLSVNKHDRLAVVSLHDNNAEIIHTAVIRYFGTGTVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADRN
Ga0179587_1060994313300026557Vadose Zone SoilMTNHEVLLRLSVNKHDNSAVTSLHDNNSEIIRTTVVRYFGTGTVADNVECALMQRMAGQARLYERSEDPDAWLARCANMECDRLRNEAIRDKAEKD
Ga0179587_1116380123300026557Vadose Zone SoilMTNREVLLRLSVNKHDRLAVVSLHDNNAEIIHTAVIRYFGTGTVADKVKFALMRRMADHARLYQDPEDPDAWVARCVDTQCDRL
Ga0209588_123607123300027671Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMANHARLYEDSEDLDVWLACCANTECDRMRNEAIHDKANTD
Ga0209118_1000042483300027674Forest SoilMTNREAIQRLSINKHDPIAVTSLHDNNAAIIRAAIARYFGAGPVADKAECALMQRMAAHARLYEHPEDLDAWLTRCANTECDRLRNEAIWEKADKD
Ga0209701_1043311513300027862Vadose Zone SoilTPARGIGQMTNLEAIERLSVNKHDHLAATSLHHNNAEIIRTTVIRYFGTGMVADKAGSVLMRRVADNARLYEHSEDPDAWLARCANTECNRMRNEAIYEKANRA
Ga0209701_1053219223300027862Vadose Zone SoilIQRLSVNKHDHLAVTSLHDNNAGIIRAAITRYFGTGPVADKAESALMQRMADHARSYEHTEDPDVWLARCANTECDRLRNEAIREKANKD
Ga0209167_1039557823300027867Surface SoilIERLSANKHDHLAAISLHENNAEIIRTTVIRFFGTGIVADKAEAALMRRMADHARLYEHHEDPEVWLARCASTECDRLRNEAIREKANKD
Ga0209283_1003592053300027875Vadose Zone SoilMTNLEAIERLSVNKHDHLAATSLHHNNAEIIRTTVIRYFGTGMVADKAGSVLMRRVADNARLYEHSEDPDAWLARCANTECNRMRNEAIYEKANRA
Ga0209488_1042555623300027903Vadose Zone SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAGNVEFELMKRMAGHARLYEDSEDLDVWLAGCANTECDRMRNEAIHDKANTD
Ga0137415_1021315223300028536Vadose Zone SoilMPVRGIGRMTNREAIQRLSVNKHDHLAVKSLHDNNAEVIRTTVIRYFGTGPVADKAETALMERMAESARSYAPQESPDEWLARCANTECDRLRNEAIHEKANKD
Ga0137415_1108458213300028536Vadose Zone SoilMDSLKAGCFGVMTNHEALLRLSVDKNDHLAVASLQDNNAEVIHTTVMRYFRTGTGGDNLEFALMRRMADHARFYKHSEDADAWLARCANTECDRLRNEAIHDKANGD
Ga0308309_10002189143300028906SoilNDHLAVMSLRENNAEIIRITVVRYFGTGTFAEGMEFALLKRMADHARLYEDSEDPDVWLADCANTECDRLRNEAIREKANRD
Ga0222748_112729623300029701SoilMTNHEVLLRLSVNQHDDMAVALLHDNNAKIIRTTVIRYFGTGTVPDNVEGTLMQRMADHARSYERSEDPDAWPARCANTECDRLRNEAIRDK
Ga0170834_10901914123300031057Forest SoilVTNREVLLRLSVNKHDRLAVVSLHDNNAEIIRTTVIRYFGTGTVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADRN
Ga0307474_1006893613300031718Hardwood Forest SoilMTNREVLLRLSVNKHDHLAVTSLHDNNAEIIRTTVVRYFGTGTVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADKN
Ga0307469_1000660753300031720Hardwood Forest SoilMTNHEVLLRLSVNQHDYIAVASLHDNNAEIIRTTVIRYFGTGAVPDNIEGALMQRMAGHARFYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0307469_1113272313300031720Hardwood Forest SoilMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLACCANTECDRMRNEAIHDKANTD
Ga0307477_100000221373300031753Hardwood Forest SoilMTNREAIQRLSVNKDDHAAAMSLHDNNAEVIRTTVMRYFGTGKVADKAESALMERMADSARSYQHPEDPDVWLARCANTECDRLRNEAIHEKANKD
Ga0307477_1000830433300031753Hardwood Forest SoilMTNREAIQRLSVNKYDRLAVTSLHENNAGIIRAAIGRYFGAGPVADKAESALMQRVADHARLYEHPEDPDAWLARCANTECDRLRNEAIWQKADKD
Ga0307477_1098554313300031753Hardwood Forest SoilGVDFVQAGRCGAIMTNREVLLRLSVNKHDHLAVTSLHDNNADIIRATVMRYFGTGMVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADRN
Ga0307475_1004850923300031754Hardwood Forest SoilMTNQEALLRLSVNKHDHLAVASLRDNNSEIIRTTMIRHFGPAPVAGDVESALMKRLADHARLYDHMEDPDAWLARCTNTECDRLHNEAIRDKANRD
Ga0307475_1005871213300031754Hardwood Forest SoilMTNREVLLRLSVNKHDHLAVTSLHDNNAEIIRTAVIRYFGTGTVADKIKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADKN
Ga0307475_1016824923300031754Hardwood Forest SoilMTNREVLLRLSVNKHDHLAVTSLHDNNADIIRATVMRYFGTGMVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADGN
Ga0307473_1082692813300031820Hardwood Forest SoilMTNREVLLRLSVNKHDHLAVTSLHDNNAEIIRTTVIRYFGTGTAADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADKN
Ga0307479_1002083913300031962Hardwood Forest SoilMTNREVLLRLSVNKHDHLAVTSLHDNNADIIRTTVIRYFGTGTVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADKN
Ga0307479_1009428533300031962Hardwood Forest SoilMTNHEALLRLSADQNDHLAVMSLRDNNAEIIRTTVMHYFGSGTAAENVEFELIKRMAEHARLYEDSEDLDVWLARCTNTECDRLRNEAIRDQANRN
Ga0307479_1014808733300031962Hardwood Forest SoilMTNREAIERLSVNKHDHLAVTSLHENNAEIIRTTVIRYFGTGTVADKAEAALMRRMADHARLYEHHEDPEVWLARCASRECDRLRNEAIREKANKD
Ga0307479_1025224723300031962Hardwood Forest SoilMTNREVLLRLSVNKHDRLAVTSLHDNNAGIIRTTVMRYFGTGAVPDKIERALMQRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADKN
Ga0307479_1042538423300031962Hardwood Forest SoilEAIQRLSVNKYDRLAVTSLHENNAGIIRAAIGRYFGAGPVADKAESALMQRVADHARLYEHPEDPDAWLARCANTECDRLRNEAIWQKADKD
Ga0307479_1132762223300031962Hardwood Forest SoilVKTGRCGVIMTNHGVLLRLSVNQHDHIAVASLHDNNAKIIRTTVIRYFGTGTVPDNVEDALMLLMAGHARLYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0307479_1182573813300031962Hardwood Forest SoilVKAGRRGAIMTNREVLLRLSVNKHDHLAVTSLHDNNAEIIRTTVVRYFGTGTVADKVKFALMRRMADHARLYQDPEDPDAWLARCVDTQCDRLRNEAIRDEADKN
Ga0307470_1180714123300032174Hardwood Forest SoilMTNHEVLLRLSVNQHDYIAVASLHDNNAEIIRTTVIRYFGTGAVPDNVEGALMQRMAGHARFYERSEDPDAWLARCANTECDRLRNEAIRDKANRD
Ga0307471_10016430433300032180Hardwood Forest SoilMTNREAIERLSVNKYDRLAVTSLHDNNAEIIRAAIDRYFGAGPVADKAELALMQRMADHARLYEHPEDPDAWLARCANTECDRLRNEAIWQKADKD
Ga0307472_10046890123300032205Hardwood Forest SoilVEAACFGVMTNHEALLRLSVDKNDHLAVTSLHENNAEIIRTTVMRYFGSGTAAENVEFELMKRMAGHARLYEDSEDLDVWLACCANTECDRMRNEAIHDKANTD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.