NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F047201



Overview

Basic Information
Family ID F047201
Family Type Metagenome / Metatranscriptome
Number of Sequences 150
Average Sequence Length 88 residues
Representative Sequence MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Number of Associated Samples 99
Number of Associated Scaffolds 150

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 74.67 %
% of genes near scaffold ends (potentially truncated) 30.00 %
% of genes from short scaffolds (< 2000 bps) 78.00 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66
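
The percentages above can be turned back into approximate gene counts. The following is a minimal Python sketch, assuming each percentage was computed over all 150 family members and rounded to two decimals; the helper name `percent_to_count` is hypothetical and not part of NMPFamsDB.

```python
def percent_to_count(percent: float, total: int) -> int:
    """Convert a rounded percentage back to the nearest whole count."""
    return round(percent / 100 * total)


FAMILY_SIZE = 150  # "Number of Sequences" from Basic Information

# Values copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 74.67,
    "near scaffold end": 30.00,
    "short scaffold (< 2000 bp)": 78.00,
}

counts = {label: percent_to_count(pct, FAMILY_SIZE) for label, pct in quality.items()}
for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Rounding to the nearest integer is a design choice here: the published values carry only two decimals, so the exact fraction cannot be recovered any other way.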


Most Common Taxonomy
Group Bacteria (63.333 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (32.000 % of family members)
Environment Ontology (ENVO) Unclassified (32.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (74.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 34.78%, β-sheet: 16.52%, Coil/Unstructured: 48.70%
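
As a quick sanity check, the three secondary-structure fractions reported above should add up to 100%. A minimal Python sketch, assuming the summary is available as a single text line shaped like the one on this page (the regular expression and variable names are illustrative only):

```python
import re

# Summary line as it appears on the page (whitespace normalized).
line = (
    "Classification: Globular Signal Peptide: No "
    "Secondary Structure distribution: "
    "α-helix: 34.78% β-sheet: 16.52% Coil/Unstructured: 48.70%"
)

# Capture each named state and its percentage.
pattern = r"(α-helix|β-sheet|Coil/Unstructured):\s*([\d.]+)%"
fractions = {name: float(value) for name, value in re.findall(pattern, line)}

total = sum(fractions.values())
print(fractions)
print(f"total = {total:.2f}%")
```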

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
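
The rule stated above (pTM < 0.7 means a low-confidence model) can be expressed as a small check. This is a sketch only: the function name is hypothetical, and treating a score of exactly 0.7 as not low is an assumption drawn from the strict "<" in the note.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when the pTM-score falls strictly below the cutoff."""
    return ptm < threshold


family_ptm = 0.66  # pTM-score reported for family F047201
label = "low" if is_low_confidence(family_ptm) else "high"
print(f"F047201 model confidence: {label}")
```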



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 150 Family Scaffolds
PF02517 | Rce1-like | 14.00
PF04379 | DUF525 | 8.00
PF12680 | SnoaL_2 | 5.33
PF07690 | MFS_1 | 3.33
PF01402 | RHH_1 | 2.00
PF01425 | Amidase | 2.00
PF13432 | TPR_16 | 1.33
PF13302 | Acetyltransf_3 | 1.33
PF08450 | SGL | 1.33
PF13520 | AA_permease_2 | 1.33
PF02738 | MoCoBD_1 | 1.33
PF07963 | N_methyl | 1.33
PF09335 | SNARE_assoc | 0.67
PF02233 | PNTB | 0.67
PF01300 | Sua5_yciO_yrdC | 0.67
PF01894 | UPF0047 | 0.67
PF04909 | Amidohydro_2 | 0.67
PF07676 | PD40 | 0.67
PF01022 | HTH_5 | 0.67
PF08352 | oligo_HPY | 0.67
PF07607 | DUF1570 | 0.67
PF00069 | Pkinase | 0.67
PF07730 | HisKA_3 | 0.67
PF03450 | CO_deh_flav_C | 0.67
PF02583 | Trns_repr_metal | 0.67
PF14559 | TPR_19 | 0.67
PF04191 | PEMT | 0.67
PF07715 | Plug | 0.67
PF08818 | DUF1801 | 0.67
PF03729 | DUF308 | 0.67
PF01966 | HD | 0.67
PF03050 | DDE_Tnp_IS66 | 0.67
PF00578 | AhpC-TSA | 0.67
PF13376 | OmdA | 0.67
PF13847 | Methyltransf_31 | 0.67
PF05402 | PqqD | 0.67
PF02426 | MIase | 0.67

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 150 Family Scaffolds
COG1266 | Membrane protease YdiL, CAAX protease family | Posttranslational modification, protein turnover, chaperones [O] | 14.00
COG4449 | Predicted protease, Abi (CAAX) family | General function prediction only [R] | 14.00
COG2967 | Uncharacterized conserved protein ApaG affecting Mg2+/Co2+ transport | Inorganic ion transport and metabolism [P] | 8.00
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 2.67
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 2.00
COG3386 | Sugar lactone lactonase YvrE | Carbohydrate transport and metabolism [G] | 1.33
COG3391 | DNA-binding beta-propeller fold protein YncE | General function prediction only [R] | 1.33
COG5649 | Uncharacterized conserved protein, DUF1801 domain | Function unknown [S] | 0.67
COG5646 | Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis) | Posttranslational modification, protein turnover, chaperones [O] | 0.67
COG4829 | Muconolactone delta-isomerase | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.67
COG4585 | Signal transduction histidine kinase ComP | Signal transduction mechanisms [T] | 0.67
COG4564 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 0.67
COG4430 | Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 family | Function unknown [S] | 0.67
COG3851 | Signal transduction histidine kinase UhpB, glucose-6-phosphate specific | Signal transduction mechanisms [T] | 0.67
COG3850 | Signal transduction histidine kinase NarQ, nitrate/nitrite-specific | Signal transduction mechanisms [T] | 0.67
COG3436 | Transposase | Mobilome: prophages, transposons [X] | 0.67
COG3247 | Acid resistance membrane protein HdeD, DUF308 family | General function prediction only [R] | 0.67
COG1937 | DNA-binding transcriptional regulator, FrmR family | Transcription [K] | 0.67
COG1282 | NAD/NADP transhydrogenase beta subunit | Energy production and conversion [C] | 0.67
COG1238 | Uncharacterized membrane protein YqaA, VTT domain | Function unknown [S] | 0.67
COG0586 | Membrane integrity protein DedA, putative transporter, DedA/Tvp38 family | Cell wall/membrane/envelope biogenesis [M] | 0.67
COG0432 | Thiamin phosphate synthase YjbQ, UPF0047 family | Coenzyme transport and metabolism [H] | 0.67
COG0398 | Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 family | Function unknown [S] | 0.67



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 63.33 %
Unclassified | root | N/A | 36.67 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001593|JGI12635J15846_10629182 | All Organisms → cellular organisms → Bacteria | 622
3300001867|JGI12627J18819_10451070 | All Organisms → cellular organisms → Bacteria | 526
3300002245|JGIcombinedJ26739_100318481 | All Organisms → cellular organisms → Bacteria | 1438
3300004092|Ga0062389_100135833 | All Organisms → cellular organisms → Bacteria | 2279
3300004103|Ga0058903_1507494 | Not Available | 510
3300004631|Ga0058899_12102060 | Not Available | 660
3300004631|Ga0058899_12140864 | All Organisms → cellular organisms → Bacteria | 912
3300004635|Ga0062388_100184688 | Not Available | 1627
3300005176|Ga0066679_10045082 | All Organisms → cellular organisms → Bacteria | 2511
3300005434|Ga0070709_10203574 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1403
3300005529|Ga0070741_10006254 | All Organisms → cellular organisms → Bacteria | 25735
3300005533|Ga0070734_10655256 | Not Available | 597
3300005534|Ga0070735_10868234 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 530
3300005537|Ga0070730_10074742 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2383
3300005538|Ga0070731_10142618 | All Organisms → cellular organisms → Bacteria | 1589
3300005538|Ga0070731_11153349 | Not Available | 512
3300005541|Ga0070733_10045830 | All Organisms → cellular organisms → Bacteria | 2734
3300005541|Ga0070733_10335672 | All Organisms → cellular organisms → Bacteria | 1001
3300005568|Ga0066703_10170688 | All Organisms → cellular organisms → Bacteria | 1312
3300005575|Ga0066702_10116385 | All Organisms → cellular organisms → Bacteria | 1547
3300005921|Ga0070766_10592667 | Not Available | 744
3300005921|Ga0070766_10973302 | Not Available | 583
3300006176|Ga0070765_100234093 | All Organisms → cellular organisms → Bacteria | 1679
3300006176|Ga0070765_100456980 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1197
3300006176|Ga0070765_100674940 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 976
3300006176|Ga0070765_100924010 | All Organisms → cellular organisms → Bacteria | 825
3300006755|Ga0079222_10275115 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1079
3300006755|Ga0079222_12675109 | Not Available | 502
3300006954|Ga0079219_10124368 | All Organisms → cellular organisms → Bacteria | 1322
3300009088|Ga0099830_10402599 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1107
3300010379|Ga0136449_100074767 | Not Available | 7215
3300010379|Ga0136449_100262782 | Not Available | 3191
3300010379|Ga0136449_103052194 | Not Available | 652
3300011120|Ga0150983_11107567 | Not Available | 561
3300011120|Ga0150983_15612385 | Not Available | 505
3300011120|Ga0150983_15692005 | Not Available | 626
3300011120|Ga0150983_15848644 | Not Available | 692
3300012929|Ga0137404_11173660 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 706
3300014158|Ga0181521_10292304 | Not Available | 840
3300014200|Ga0181526_11058534 | Not Available | 509
3300014501|Ga0182024_10018203 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13107
3300014501|Ga0182024_10121288 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3739
3300014501|Ga0182024_10143959 | All Organisms → cellular organisms → Bacteria | 3364
3300017926|Ga0187807_1206721 | Not Available | 637
3300017927|Ga0187824_10001129 | All Organisms → cellular organisms → Bacteria | 6447
3300017927|Ga0187824_10155807 | Not Available | 760
3300017927|Ga0187824_10374372 | Not Available | 516
3300017930|Ga0187825_10000896 | All Organisms → cellular organisms → Bacteria | 8334
3300017930|Ga0187825_10218185 | Not Available | 691
3300017936|Ga0187821_10041757 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1633
3300017943|Ga0187819_10538403 | Not Available | 664
3300017993|Ga0187823_10000898 | All Organisms → cellular organisms → Bacteria | 6988
3300017994|Ga0187822_10139820 | Not Available | 770
3300018006|Ga0187804_10343339 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 655
3300018058|Ga0187766_10769459 | Not Available | 670
3300018062|Ga0187784_10547278 | Not Available | 931
3300020579|Ga0210407_10838268 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 707
3300020580|Ga0210403_10221328 | All Organisms → cellular organisms → Bacteria | 1553
3300020582|Ga0210395_10091200 | All Organisms → cellular organisms → Bacteria | 2249
3300020583|Ga0210401_10013534 | All Organisms → cellular organisms → Bacteria | 7921
3300020583|Ga0210401_10341615 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1359
3300020583|Ga0210401_10624753 | Not Available | 939
3300020583|Ga0210401_11183702 | Not Available | 623
3300021170|Ga0210400_10780698 | Not Available | 783
3300021170|Ga0210400_10825676 | Not Available | 759
3300021171|Ga0210405_10043231 | All Organisms → cellular organisms → Bacteria | 3579
3300021171|Ga0210405_10366203 | All Organisms → cellular organisms → Bacteria | 1136
3300021171|Ga0210405_10451324 | All Organisms → cellular organisms → Bacteria | 1010
3300021171|Ga0210405_10670658 | All Organisms → cellular organisms → Bacteria | 803
3300021171|Ga0210405_10682062 | All Organisms → cellular organisms → Bacteria | 795
3300021178|Ga0210408_10434789 | All Organisms → cellular organisms → Bacteria | 1045
3300021178|Ga0210408_10949062 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 667
3300021180|Ga0210396_11101811 | Not Available | 668
3300021181|Ga0210388_11209072 | Not Available | 641
3300021181|Ga0210388_11643822 | Not Available | 533
3300021401|Ga0210393_10796728 | Not Available | 769
3300021402|Ga0210385_11019890 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 636
3300021404|Ga0210389_11086453 | Not Available | 619
3300021405|Ga0210387_11353339 | All Organisms → cellular organisms → Bacteria | 614
3300021405|Ga0210387_11466422 | Not Available | 585
3300021406|Ga0210386_10524004 | Not Available | 1024
3300021407|Ga0210383_10615111 | Not Available | 936
3300021420|Ga0210394_10001400 | All Organisms → cellular organisms → Bacteria | 39034
3300021420|Ga0210394_10006348 | All Organisms → cellular organisms → Bacteria | 12822
3300021420|Ga0210394_10435507 | All Organisms → cellular organisms → Bacteria | 1154
3300021420|Ga0210394_10466951 | All Organisms → cellular organisms → Bacteria | 1111
3300021420|Ga0210394_10580860 | All Organisms → cellular organisms → Bacteria | 985
3300021420|Ga0210394_10727587 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 869
3300021432|Ga0210384_10097130 | All Organisms → cellular organisms → Bacteria | 2644
3300021432|Ga0210384_11039496 | Not Available | 721
3300021432|Ga0210384_11060157 | Not Available | 713
3300021433|Ga0210391_10908530 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_68_18 | 687
3300021474|Ga0210390_10032722 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4245
3300021474|Ga0210390_10192675 | All Organisms → cellular organisms → Bacteria | 1724
3300021474|Ga0210390_10502872 | All Organisms → cellular organisms → Bacteria | 1021
3300021474|Ga0210390_11108420 | All Organisms → cellular organisms → Bacteria | 643
3300021475|Ga0210392_10830032 | All Organisms → cellular organisms → Bacteria | 691
3300021476|Ga0187846_10173316 | All Organisms → cellular organisms → Bacteria | 908
3300021477|Ga0210398_10809230 | Not Available | 754
3300021559|Ga0210409_10066369 | All Organisms → cellular organisms → Bacteria | 3362
3300021559|Ga0210409_10143423 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2190
3300021559|Ga0210409_10269488 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1539
3300021559|Ga0210409_10938822 | Not Available | 739
3300022724|Ga0242665_10377478 | Not Available | 513
3300025906|Ga0207699_10750269 | Not Available | 716
3300026515|Ga0257158_1033216 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 913
3300026551|Ga0209648_10419604 | All Organisms → cellular organisms → Bacteria | 859
3300027376|Ga0209004_1035605 | Not Available | 822
3300027565|Ga0209219_1028422 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1386
3300027576|Ga0209003_1067833 | All Organisms → cellular organisms → Bacteria | 652
3300027591|Ga0209733_1148358 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 571
3300027635|Ga0209625_1003470 | All Organisms → cellular organisms → Bacteria | 3448
3300027660|Ga0209736_1059887 | All Organisms → cellular organisms → Bacteria | 1070
3300027706|Ga0209581_1004426 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 11304
3300027842|Ga0209580_10434446 | All Organisms → cellular organisms → Bacteria | 654
3300027854|Ga0209517_10124206 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1693
3300027857|Ga0209166_10316260 | Not Available | 820
3300027867|Ga0209167_10272375 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 912
3300027867|Ga0209167_10352919 | All Organisms → cellular organisms → Bacteria | 799
3300027867|Ga0209167_10755294 | Not Available | 530
3300027884|Ga0209275_10315988 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 870
3300027889|Ga0209380_10354920 | All Organisms → cellular organisms → Bacteria | 861
3300027905|Ga0209415_10843138 | Not Available | 632
3300027986|Ga0209168_10591293 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 531
3300028047|Ga0209526_10220812 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1306
3300028906|Ga0308309_10102730 | All Organisms → cellular organisms → Bacteria | 2215
3300028906|Ga0308309_10542025 | Not Available | 1008
3300028906|Ga0308309_11280773 | All Organisms → cellular organisms → Bacteria | 631
3300030007|Ga0311338_10894299 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 874
3300031231|Ga0170824_114640153 | Not Available | 1505
3300031446|Ga0170820_15549965 | Not Available | 548
3300031474|Ga0170818_104708255 | Not Available | 929
3300031715|Ga0307476_10406193 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1006
3300031718|Ga0307474_10476535 | Not Available | 977
3300031720|Ga0307469_11367747 | All Organisms → cellular organisms → Bacteria | 674
3300031754|Ga0307475_10050546 | All Organisms → cellular organisms → Bacteria | 3120
3300031754|Ga0307475_10742765 | Not Available | 781
3300031823|Ga0307478_10303781 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1307
3300031962|Ga0307479_10002292 | All Organisms → cellular organisms → Bacteria | 17377
3300031962|Ga0307479_10067685 | All Organisms → cellular organisms → Bacteria | 3445
3300031962|Ga0307479_10152170 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2270
3300031962|Ga0307479_10183182 | All Organisms → cellular organisms → Bacteria | 2060
3300032160|Ga0311301_10866905 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1224
3300032174|Ga0307470_10893048 | Not Available | 697
3300032180|Ga0307471_100021551 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 4737
3300032770|Ga0335085_10000221 | All Organisms → cellular organisms → Bacteria | 178374
3300032783|Ga0335079_10017738 | All Organisms → cellular organisms → Bacteria | 8134
3300032783|Ga0335079_10770705 | All Organisms → cellular organisms → Bacteria | 999
3300032805|Ga0335078_10482853 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → Mesorhizobium ciceri | 1598
3300032892|Ga0335081_12570310 | Not Available | 524
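
Each scaffold identifier above has two parts joined by a pipe character. The leading number appears to match a Taxon OID in the Associated Samples table, so scaffolds can be grouped by sample. A minimal Python sketch under that assumption (the function name `split_scaffold_id` is hypothetical, and only a few identifiers from the table are included):

```python
from collections import Counter

# A handful of identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300001593|JGI12635J15846_10629182",
    "3300004631|Ga0058899_12102060",
    "3300004631|Ga0058899_12140864",
    "3300021420|Ga0210394_10001400",
]


def split_scaffold_id(identifier: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' at the first pipe character."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# Count how many family scaffolds come from each sample (taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)
for taxon_oid, n in per_sample.items():
    print(taxon_oid, n)
```

Run over the full table, a count like this would explain why 150 scaffolds map to only 99 associated samples: several samples contribute more than one scaffold.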




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 32.00%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 11.33%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 10.00%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.00%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 7.33%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 7.33%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 4.00%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 3.33%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.00%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 2.00%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 2.00%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.33%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 1.33%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 1.33%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.33%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.33%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.67%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.67%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.67%
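
The habitat paths above are hierarchical, so the distribution can be rolled up to a coarser level by truncating each path. A minimal Python sketch, assuming paths use " → " as the level separator as shown on this page; the function name `aggregate` is hypothetical and only four rows from the table are included for brevity.

```python
from collections import defaultdict

# (GOLD ecosystem path, % of family members), copied from the table above.
rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 32.00),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 11.33),
    ("Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment", 7.33),
    ("Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm", 0.67),
]


def aggregate(rows, depth):
    """Sum percentages over the first `depth` levels of each path."""
    totals = defaultdict(float)
    for path, pct in rows:
        key = " → ".join(path.split(" → ")[:depth])
        totals[key] += pct
    return dict(totals)


by_category = aggregate(rows, depth=2)
for key, pct in by_category.items():
    print(f"{key}: {pct:.2f}%")
```

Changing `depth` to 3 would separate Soil from Cave within the Terrestrial rows.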




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001593 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 | Environmental
3300001867 | Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705) | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004103 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF242 (Metagenome Metatranscriptome) | Environmental
3300004631 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome) | Environmental
3300004635 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005533 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 | Environmental
3300005534 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005538 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300010379 | Sb_50d combined assembly | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300014158 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin02_60_metaG | Environmental
3300014200 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_30_metaG | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300017926 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_2 | Environmental
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300017943 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4 | Environmental
3300017993 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3 | Environmental
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300018006 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4 | Environmental
3300018058 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MG | Environmental
3300018062 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP15_20_MG | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021402 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021476 | Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2) | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022724 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300026515 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027376 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M1 (SPAdes) | Environmental
3300027565 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes) | Environmental
3300027576 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M3 (SPAdes) | Environmental
3300027591 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1 (SPAdes) | Environmental
3300027635 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes) | Environmental
3300027660 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes) | Environmental
3300027706 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027854 | Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027867 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes) | Environmental
3300027884 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes) | Environmental
3300027889 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes) | Environmental
3300027905 | Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes) | Environmental
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300030007I_Palsa_E1 coassemblyEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M
3300032892Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5EnvironmentalOpen in IMG/M

Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12635J15846_106291822 | 3300001593 | Forest Soil | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASAQIFVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK*
JGI12627J18819_104510702 | 3300001867 | Forest Soil | SKAENHKEAIMSAVEEDTESQAIAAAARAAKGISVERPVRAQELPALVEKALEGSGEPFRLTFDDDSSLYLTTKEGILTATVSK*
JGIcombinedJ26739_1003184812 | 3300002245 | Forest Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK*
Ga0062389_1001358331 | 3300004092 | Bog Forest Soil | MKIDGNSETATPKSKAEDYKEVIMSSVEEQTESPAIADAARAARHIYYAEKLVRPEELSTLVEQALDGAGEPFRLTFDDGSSLYLTTSAGILTATVSK*
Ga0058903_15074941 | 3300004103 | Forest Soil | MPKSKAENYKEAIMSSVEEETEAQAIGDAAKASSRIYVERVVRPEELSKLVEKSLDGTGESFRLEFDDDSTLYLSTREGILTATVSK*
Ga0058899_121020601 | 3300004631 | Forest Soil | MPKSKAENHKEAIMSSVEEETEAEAIAAAAKAANEISVERVVRPDELSALVEKALNGPGEPFRLTFDDGSSLYLSTKQVILTATVSK*
Ga0058899_121408643 | 3300004631 | Forest Soil | KEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK*
Ga0062388_1001846881 | 3300004635 | Bog Forest Soil | MKIDGNSETATPKSKAEDYKEVIMSSVEEQTESPAIADAARAARHIYYAEKLVRPEELSTLVEQALDGAGEPFRLTFDDGSSLYLTTSAGI
Ga0066679_100450823 | 3300005176 | Soil | MPKSKAESHKEAIMSSVEDETEKEALAAAAKAARRISVERTVRPEELSALVEKALDGSGEPFRLTFDDDSSLYLTTKEGILTATVSK*
Ga0070709_102035742 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MPKSKAENHKEVIMSAVEDETESEAIAEAAKAAQRIRAGERSVRPEDLSAQVEKALDGAGERFHLKFDDGSELFLATTGGILTATVSK*
Ga0070741_100062544 | 3300005529 | Surface Soil | MSALEDETEDEAIGDAGKAAARIYVERAIRPQELSKLVEQALDGGGERFRLSFDDDSNLYLSTSGGMLTATVSK*
Ga0070734_106552561 | 3300005533 | Surface Soil | LAKSKAESFKEAIMSALEDETEDEAIGDAGKAAARIYVERAIRPQELSKLVEQALDGGGERFRLSFDDDSNLYLS
Ga0070735_108682341 | 3300005534 | Surface Soil | MAAVEEETENEAMADAAKVATQIYVDQRLVRPAELSTLVEKALDGSGESFRLKFDDGSDLYISTSGGILTATVSK*
Ga0070730_100747423 | 3300005537 | Surface Soil | MNVLSDDAAVAENDGNHEATAMPKSKAENYKEAIMSSVEEETEAQAIADAAKASSQIYVERVVRSGELSNLVDKALHGTGESFRLKFDDDSILYLSTREGILTATVSK*
Ga0070731_101426182 | 3300005538 | Surface Soil | MPKSKAENHKEAIMSSVEEETEAEAIAAAAKAANEISVERVVRPDELSALVEKALNGPGEPFRLTFDDGSSLYLSTKQGILTATVSK*
Ga0070731_111533491 | 3300005538 | Surface Soil | MPKSKAENYKELIMSSVQEETETEALTDAAKAAAQIYVERAVRPEEFSKLVEKALDGAGEPFRLRFDDDSNLYLSTTNGILIATVSK*
Ga0070733_100458304 | 3300005541 | Surface Soil | MPKSKAENYKEVIMSSVEEETETQAIADAAKDSSRIYVERVVRPEELGKLVEKSLDGSGETFRLKFDDDSTLYLSTREGILTATVSK*
Ga0070733_103356722 | 3300005541 | Surface Soil | MNVLSDDAAVAENDGNHEAPAMPKSKAENYKEAIMSSVEEETEAQAIADAAKASSQIYVERVVRPGELSNLVDKALHGTGESFRLKFDDDSILYLSTREGILTATVSK*
Ga0066703_101706883 | 3300005568 | Soil | MPKSKAESHKEAIMSALEEDTESQAIAAAARAAKGISIERPVRAQELPALVEKALDDSGEPFRLTFDDDSSLYLTTKEGILTATVSK*
Ga0066702_101163852 | 3300005575 | Soil | MPKSKAESHKEAIMSSVEDETEKEALAAAAKAARRISVERIVRPEELSALVEKALDGSGEPFRLTFDDDSSLYLTTKEGILTATVSK*
Ga0070766_105926671 | 3300005921 | Soil | NLNKCETALSHGAAVTQNHGNHEATAMPKSKAENYKEAIMSSVEEETETQAIADAARASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK*
Ga0070766_109733021 | 3300005921 | Soil | MPKSKAENHKEAIMSSVEEETEAEAIAAAAKAANEISVERVVRPDELSALVEKALNGLGEPFRLTFDDGSSLYLS
Ga0070765_1002340932 | 3300006176 | Soil | MKADGKSEGTKPKSKAEDHKESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSNGMLTATVSK*
Ga0070765_1004569803 | 3300006176 | Soil | MPKSKAESHKEAIMAAVEEDTENEAIAAAARAATEIYVEQRVVPAEELSELVEKALDDGGGPFRLKFGDGSNLYLSTSGGTLTATVSK*
Ga0070765_1006749401 | 3300006176 | Soil | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASSRIYVERVVRPEELSKLVEKSLDGTGESFRLEFDDDSTLYLSTREGILTATVSK*
Ga0070765_1009240103 | 3300006176 | Soil | MSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSN*
Ga0079222_102751152 | 3300006755 | Agricultural Soil | MAAVEEETENEAIADAAKAAKRAYAEQRSVRAGELSALVEKALDDSGESFRLEFDDGSNLYLSTKGGILTATVSK*
Ga0079222_126751092 | 3300006755 | Agricultural Soil | MPKSKAESHKEAIMSSIEDETEKEALAAAAKAARSIAIERDVRPQELPSLVEKALDGSGEPFRLTFDDDSCLYLTTRQGILTATVSK*
Ga0079219_101243682 | 3300006954 | Agricultural Soil | MPKSKAETFKETIMAAVEEETENEAIADAAKAAKRAYAEQRSIRAGELSALVEKALDDSGESFRLEFDDGSNLYLSTKGGILTATVSK*
Ga0099830_104025993 | 3300009088 | Vadose Zone Soil | MHKSKAENIKEVIMAAVEDETETETIADAAKAAKRIYFEERTVRAEELSTLVEKALDDSGESFRLEFDDGSNLYLSTKGGILTATVSK*
Ga0136449_1000747679 | 3300010379 | Peatlands Soil | MPKSKAESYKEEIMSAVEDETETESIADAAKAAKHIHCERLVRPEELSALVEKALDGAGESFRLTFDDDSRLYLTTSNGILTATVSK*
Ga0136449_1002627823 | 3300010379 | Peatlands Soil | MPKSKAESHKEAIMAAVEEETESEAIAAAAKEATQINVEQRVVPAEKLSEMVEKALDDGGGPFRLQFGDGSNLYLSTSGGTLTATVSK*
Ga0136449_1030521941 | 3300010379 | Peatlands Soil | MPKSKAESHKEAIMAAVEEETESEAIAAAARAATQIYVEKRAVPAERLSELVEKALDDSGGPFRLQFGDGSNLYLSTSGGTLTATVSK*
Ga0150983_111075672 | 3300011120 | Forest Soil | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASSRIYVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK*
Ga0150983_156123851 | 3300011120 | Forest Soil | AVTQNHGNHEATAMPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK*
Ga0150983_156920052 | 3300011120 | Forest Soil | MPKSKAENHKEAIMSSVEDETEKEAIAAAAKAARKISVERAVRPEELSALVQKALDSSGEPFRLTFDDDSCLYLTTKEGILTATVS
Ga0150983_158486441 | 3300011120 | Forest Soil | YKEAIMSSVEEETETQAIADAAKASARIYVERLVRPEELSKLVEKALDGSGETFRLTFDDDSTLYLSTREGILTATVSK*
Ga0137404_111736601 | 3300012929 | Vadose Zone Soil | MTAIMRPAMPKSKAENHKEAIMSAVEDETESEAIAAAAKAAQSIRVAERIVRPEDLSTQVEKALDGAGERFHLKFDDGSELYLATTGGILTAPVSK*
Ga0181521_102923042 | 3300014158 | Bog | MPKSKAESHKEAIMAAVEEETESEAIAAAAKEATQIDVEQRGVPAEKLSEMVEKALDDGGGPFRLQFGDGSNLYLSTSGGTLTATVSK*
Ga0181526_110585341 | 3300014200 | Bog | MKADGNSEATKPKSKAEDHKESIMSAVQEQTEGPAIADAAKAATHIYYAKRLVRPEDLSTLVEQALDGAGEPFRLTFDDGSSLHLTTSNGMLTATVSK*
Ga0182024_1001820310 | 3300014501 | Permafrost | MKADGNSEATKPKSKAEDHKESIMSAVQEQTESEAIADAARAATHIYYSKQLVRPEELSTLVGQALDGAGEPFRLTFDDGSSLYLTTSEGMLTATVSK*
Ga0182024_101212882 | 3300014501 | Permafrost | MKAAGKSEATQPKSKAEDHKESIMSAVQEETEGPAIADAARAAKHIYCSKRLVRHEELSALLEQALDGDGEPFRLTFDDGSSLYLTTSEGILTATVSK*
Ga0182024_101439592 | 3300014501 | Permafrost | MKADGNSEATKPKSKAEGHKESIMSSVQEQTESQAIAEAARAATHIYYAKQLVRPEELSTLVGQALDGAGEPFRLTFDDDSSLYLTTSDGILTATVSK*
Ga0187807_12067211 | 3300017926 | Freshwater Sediment | MPKSKAESHKEAIMAAVEEETESEAISAAAKEATQINVEQRVVPAEKLSELVEKALDDGGGPFRLQFGDGSNLYLSTSGGTLTATVSK
Ga0187824_100011296 | 3300017927 | Freshwater Sediment | MPKSRAENYKEVIMSSVEEETETEAIADAAKAAAQIYVERAVRPEELSRLVEKALDSGGESFRLKFDDGSSLYLSTNAGILTATVSK
Ga0187824_101558071 | 3300017927 | Freshwater Sediment | MNVLSDDAAVAENDGNHEATAMPTSKAENHKEAIMSSVEEETEAQAIADAAKASSQIYVERVVRSGELSNLVDKALHGTGESFRLKFDDDSILYLSTREGILTATVSK
Ga0187824_103743721 | 3300017927 | Freshwater Sediment | MRKSKAESHKEAIMAAVEEETEGEAIAAAAKAATQIKLEQRTISAKELSASVEKALHGRGGSFRLQFDDGSNLFLSTSGGILTATVSK
Ga0187825_100008967 | 3300017930 | Freshwater Sediment | MPKSKAENYKEVIMSSVEEETETEAIADAAKAAAQIYVERAVRPEELSRLVEKALDSGGESFRLKFDDGSSLYLSTNAGILTATVSK
Ga0187825_102181851 | 3300017930 | Freshwater Sediment | MRKSQAESHKEAIMAAVEEETEGEAIAAAAKAATQIKLEQRTISAKELSASVEKALHGRGGSFRLQFDDGSNLFLSTSGGILTATVSK
Ga0187821_100417571 | 3300017936 | Freshwater Sediment | MRKSQAESQKEAIMAAVEEETEGEAIAAAAKAATQIKLEQRTISAKELSASVEKALHGPGGSFRLQFDDGSNLFLSTSGGILTATVSK
Ga0187819_105384031 | 3300017943 | Freshwater Sediment | MPKSKAESHKEAIMAAVEEETESEAIAAAAKAATEIYVEKRTVPAEQLSELVEKALDDGGGPFRLQFGDGSNLY
Ga0187823_100008982 | 3300017993 | Freshwater Sediment | MPKSRAENYKEVIMSSVEEETETEAIADAAKAAAQIYVERVIRPEELSRLVEKALDGGGESFRLKFDDGSSLYLSTNAGILTATVSK
Ga0187822_101398202 | 3300017994 | Freshwater Sediment | EETEGEAIAAAAKAATQIKLEQRTISAKELSASVEKALHGPGGSFRLQFDDGSNLFLSTSGGILTATVSK
Ga0187804_103433392 | 3300018006 | Freshwater Sediment | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASSRIYVERAVRPEELSKLVEKALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0187766_107694592 | 3300018058 | Tropical Peatland | MRKSKAESHKEAIMAAVEEETESEAIAAAATAATQINVEQRAVPAKQLSDLVEKALNEGGGPFCLQFEDGSNLYLSTDDGTLTATVSK
Ga0187784_105472782 | 3300018062 | Tropical Peatland | MPKSKAESYKETVMSSLEDETESEALADAAKAATQIYVEGLARPAELSRLVEKALDSSGESFRLVFDDDSILYLSTANGILTATVSK
Ga0210407_108382682 | 3300020579 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERAVRPEELSKLVENALDGAGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210403_102213283 | 3300020580 | Soil | VLSDDAAVAENDGNHEATAMPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERAVRPEELSKLVENALDGAGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210395_100912004 | 3300020582 | Soil | MKADGKSEGTKPKSKAEDHKESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSNGMLTATVSK
Ga0210401_100135348 | 3300020583 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVHPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210401_103416151 | 3300020583 | Soil | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASSRIYVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210401_106247531 | 3300020583 | Soil | KESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSNGMLTATVSK
Ga0210401_111837021 | 3300020583 | Soil | MPKSKAENHKEAIMSSVEEETEAEAIAAAAKAANEISVERVVRPDELSALVEKALNGPGEPFRLTFDDGSSLYLSTKQGILTATVSK
Ga0210400_107806981 | 3300021170 | Soil | TAMPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210400_108256762 | 3300021170 | Soil | SEDAAVAENDGNHEATAMPKSKAESYKEAIMSSVEEETEAQAIADAAKASSRIYVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210405_100432313 | 3300021171 | Soil | MPKSKAENHKEVIMAAVEEETENEAIAEAAKAATNIYAEERLVRAEELSNLVEKALTGTAGSFRLKFEDGSNLYLSTSGGILTATVSN
Ga0210405_103662031 | 3300021171 | Soil | SDGAAVNQNHCNHEATAMPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVHPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210405_104513242 | 3300021171 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLTFDDDSTLYLSTREGILTATVSK
Ga0210405_106706581 | 3300021171 | Soil | ENYKEAIMSSVEEETETQAIADAAKASSRIYVERAVRPEELSKLVENALHGAGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210405_106820622 | 3300021171 | Soil | VLSADAAVAENDGNHEATAMPKSKAENYKEAIMSSVEEETEAQAIADAAKASSRIYVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSN
Ga0210408_104347892 | 3300021178 | Soil | MPKSKAENHKEAIMSSVEDETEKEAIAAAARAARKISVERAVRPEELSALVQKALDSSGEPFRLTFDDDSCLYLTTKEGILTATVSK
Ga0210408_109490622 | 3300021178 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLTFDDDSTLYLSTR
Ga0210396_111018111 | 3300021180 | Soil | MPKSKAENHKEAIMSAVEDETESEAIAEAAKAAQQIRVAERTVRAEDLSTLVEKALDNPGGRFHLKFDDGSELYLA
Ga0210388_112090721 | 3300021181 | Soil | MKADGNNEATKPKSKAEDHKESIMSAVQEQTESEAIADAARAAKHIYYAKQLVRPEELSTLVEQALDGAGEPFRLTFDDGSSLYLTTSEGMLTATVSK
Ga0210388_116438221 | 3300021181 | Soil | MPGKSAEPSQPAGNEATKPKSKAEGYKEGIMSSVQEQIEIPAIADAARAATQIYYAEKLIRPEELSDLVEKALDGAGEPFRLTFDDGSSLYLST
Ga0210393_107967281 | 3300021401 | Soil | MKADGKSEGTKPKSKAEDHKESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSEGMLTATVSK
Ga0210385_110198901 | 3300021402 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTAT
Ga0210389_110864532 | 3300021404 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210387_113533392 | 3300021405 | Soil | AENDGNHEATAMPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERAVRPEELSKLVENALDGAGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210387_114664222 | 3300021405 | Soil | MPKSKAESHKEAIMAAVEEDTENEAIAAAARAATEIYVEQRVVPAEELSELVEKALDDGGGPFRLKFGDGSNLYLSTSGGTLTATVSK
Ga0210386_105240041 | 3300021406 | Soil | MKTDGKSEGTKPKSKAEDHKESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSNGMLTATVSK
Ga0210383_106151111 | 3300021407 | Soil | MKADGNNEATKPKSKAEDHKESIMSAVQEQTESEAIADAARAAKHIYYAKQLVRPEELSTLVEQALDGAGEPFRLTFDDGSSLYLTTSEGMLTATVS
Ga0210394_1000140021 | 3300021420 | Soil | MPKSKAEGYKEAIMSSVEEETEAEAIADAAKASSQIYVERVVRPGELSKLVEKALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210394_1000634814 | 3300021420 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFHLKFDDDSTLYLSTREGILTATVSK
Ga0210394_104355073 | 3300021420 | Soil | AAVEEETESEAIAAAARVATQIYVEQRVVPAEELSELVEKALDDGGGPFRLKFGDGSNLYLSTSGGILTATVSK
Ga0210394_104669512 | 3300021420 | Soil | MPKSKAENHKEAIMSSVEEETEAEAIAAAAKAANEISVERVVRPDELSALVEKALNGLGEPFRLTFDDGSCLYLSTKQGILTATVSK
Ga0210394_105808602 | 3300021420 | Soil | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASSRVYVERVVRPEELSKLVENALDGNGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210394_107275872 | 3300021420 | Soil | ENHKEAIMSGVEDETESEAIAAAAKAAQQIRVAERTIRPEDLSTLVEKALDNPGGRFHLKFDDGSELYLATTGGILTATVSK
Ga0210384_100971303 | 3300021432 | Soil | MPKSKAENHKEAIMSSVEDETEKEAIAAAAKAARKISVERAVRPEELSALVQKALDSSGEPFRLTFDDDSCLYLTTKEGILTATVSK
Ga0210384_110394961 | 3300021432 | Soil | HEATAMPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210384_110601572 | 3300021432 | Soil | MPKSKAENHKEVIMSAVEDETESEAIAEAAKAAQLIRVGERIVRPEDLSTQVEKALDGAGERFQLKFDDGSELYLATTGGILTATVSK
Ga0210391_109085302 | 3300021433 | Soil | HKEAIMAAVEEETESEAIAAAAKEATQINVEQRPVPAEKLSELVEKALDDGGGPFRLQFGDGSNLYLSTSGGTLTATVSK
Ga0210390_100327223 | 3300021474 | Soil | ESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSNGMLTATVSK
Ga0210390_101926751 | 3300021474 | Soil | YKEAIMSSVEEETETQAIADAAKASARIYVERVVHPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0210390_105028722 | 3300021474 | Soil | MPKSKAESHKEAIMAAVEEETESEAIAAAATAATEIYVEQSVVPAEELSELVEKALDDGGGPFRLKFGDGSNLYLSTSGGILTATVSK
Ga0210390_111084202 | 3300021474 | Soil | ATMPKSKAENHKEAIMSSVEEETETEAIAAAAKAAKEISVERVVRPDELSALVEKALNGLGEPFRLTFDDGSSLYLSTKQGILTATVSK
Ga0210392_108300321 | 3300021475 | Soil | VLSDDAAVAENDGNHEATAMPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERAVRPEELSKLVENALHGAGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0187846_101733162 | 3300021476 | Biofilm | MPKSKAESHKEAILSAIEEDTETQAIAAAARAAKAISVERPIRAQELPALVEKALDGSGEPFRLTFDDDSALYLTTKEGILTATVSK
Ga0210398_108092301 | 3300021477 | Soil | MKADGKSEATKPKSKAEGHKESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSNGMLTATVSK
Ga0210409_100663692 | 3300021559 | Soil | MPKSKAENHKEVIMAAVEEETENEAIAEAAKAATNIYAEERLVRAEELSNLVEKALSGTAGSFRLKFEDGSNLYLSTSGGILTATVSN
Ga0210409_101434232 | 3300021559 | Soil | MPKSKAENHKEAIMSAVEDETESEAIAEAAKAAQQIRVAERTVRAEDLSTLVEKALDNPGGRFHLKFDDGSELYLATTGGILTATVSK
Ga0210409_102694881 | 3300021559 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGESFCLKFDDDSTLYLSTREGILTATVSK
Ga0210409_109388222 | 3300021559 | Soil | MPKSKAESHKEAIMAAVEEETESEAIAAAAKEATQINVEQRVLPAEKLSEMVEKALDDGGGPFRLQFGDGSNLYLSTSGGTLTATVSK
Ga0242665_103774781 | 3300022724 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERVVHPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0207699_107502692 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MPKSKAENHKEVIMSAVEDETESEAIAEAAKAAQRIRAGERSVRPEDLSAQVEKALDGAGERFHLKFDDGSELFLATTGGILTATVSK
Ga0257158_10332162 | 3300026515 | Soil | MPKSKAENYKEAIMSSVEEETEAQAIGDAAKASSQIYVERVVRPEELSKLVENALHGAGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0209648_104196042 | 3300026551 | Grasslands Soil | MPKSKAENHKEAIMSSVEDETEREAIAAAAKAARSISVERVVRPEELSALVEKALDGSGEPFRLTFDDDSCLYLSTKQGILTATVSK
Ga0209004_10356052 | 3300027376 | Forest Soil | HRCETALTNGAAATQNHGNHEATAMPKSKAENYKEVIMSSVEEETETQAIVDAAKASARIYVERVVRPEELSNLVEKALDGSGESFRLRFDDDSTLFLSTREGILTATVSK
Ga0209219_10284223 | 3300027565 | Forest Soil | MPKSKAENYKEAIMSSVEEDTEAQAIADAAKASAQIFVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0209003_10678332 | 3300027576 | Forest Soil | MPKSKAENYKEAIMSSVEEETETLAIADAAKASSRIYVERVVRAEELSKLVEKALDGSGESFHLEFDDDSTLYLSTREGILTATVSK
Ga0209733_11483581 | 3300027591 | Forest Soil | VLSEDAAVAENDGNHEATAMPKSKAENYKEAIMSSVEEETEAQAIADAAKASAQIFVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0209625_10034703 | 3300027635 | Forest Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGETFRLTFDDDSTLYLSTREGILTATVSK
Ga0209736_10598872 | 3300027660 | Forest Soil | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASAQIFVERAVRPEELSKLVENALDGTGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0209581_10044269 | 3300027706 | Surface Soil | MSALEDETEDEAIGDAGKAAARIYVERAIRPQELSKLVEQALDGGGERFRLSFDDDSNLYLSTSGGMLTATVSK
Ga0209580_104344462 | 3300027842 | Surface Soil | MPKSKAENHKEAIMSALEEDTESQAIAAAARAAKSISVERPIRAQELPALVEKALDGSGEPFRLTFDDDSALYLATKEGILTATVSK
Ga0209517_101242062 | 3300027854 | Peatlands Soil | MPKSKAESHKEAIMAAVEEETESEAIAAAAKEATQINVEQRVVPAEKLSEMVEKALDDGGGPFRLQFGDGSNLYLSTSGGTLTATVSK
Ga0209166_103162602 | 3300027857 | Surface Soil | PKSKAENYKEAIMSSVEEETEAQAIADAAKASSQIYVERVVRSGELSNLVDKALHGTGESFRLKFDDDSILYLSTREGILTATVSK
Ga0209167_102723752 | 3300027867 | Surface Soil | MPKSKAENYKEVIMSSVEEETETQAIADAAKDSSRIYVERVVRPEELGKLVEKSLDGSGETFRLKFDDDSTLYLSTREGILTATVSK
Ga0209167_103529192 | 3300027867 | Surface Soil | MNVLSDDAAVAENDGNHEAPAMPKSKAENYKEAIMSSVEEETEAQAIADAAKASSQIYVERVVRPGELSNLVDKALHGTGESFRLKFDDDSILYLSTREGILTATVSK
Ga0209167_107552941 | 3300027867 | Surface Soil | MPKSKAENYKELIMSSVQEETETEALTDAAKAAAQIYVERAVRPEEFSKLVEKALDGAGEPFRLRFDDDSNLYLSTTNGILIATVSK
Ga0209275_103159881 | 3300027884 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASSRIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSN
Ga0209380_103549202 | 3300027889 | Soil | MPKSKAENYKEAIMSSVEEETETQAIADAARASARIYVERVVRPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0209415_108431381 | 3300027905 | Peatlands Soil | MPKSKAESHKEAIMAAVEEETESEAIAAAAKEATQINVEQRVVPAEKLSEMVEKALDDGGGPFRLQFGDGSNLYLSTSGGTLTATV
Ga0209168_105912931 | 3300027986 | Surface Soil | MAAVEEETENEAMADAAKVATQIYVDQRLVRPAELSTLVEKALDGSGESFRLKFDDGSDLYISTSGGILTATVSK
Ga0209526_102208122 | 3300028047 | Forest Soil | MPKSKAESYKEAIMSAVEEETDNQVIAEAAKAANHISVGERPVRPEDLSTKVEQALDGAGERFHLKFDDGSDLYLATTGGILTATVSK
Ga0308309_101027302 | 3300028906 | Soil | MPKSKAENYKEAIMSSVEEETEAQAIADAAKASSRIYVERVVRPEELSKLVEKSLDGTGESFRLEFDDDSTLYLSTREGILTATVSN
Ga0308309_105420251 | 3300028906 | Soil | MKADGKSEGTKPKSKAEDHKESIMSSVQEQTEGPAIADAARAATHIYYAKRLVRPEELSTLVEQSLDGAGEPFRLTFDDGSSLYLTTSNGMLTATVS
Ga0308309_112807731 | 3300028906 | Soil | NHKEAIMSSVEEETEAEAIAAAAKAANEISVERVVRPDELSALVEKALNGPGEPFRLTFDDGSSLYLSTKQGILTATVSK
Ga0311338_108942992 | 3300030007 | Palsa | MPEKSAEPRQPAGNPEATKPKSKAEGYKEGIMSSVQEQTEIPAIADAARAATHIYYAEKLIRPEELSDLVEQALDGAGEPFRLTFDDGSSLYLSTSEGILTATVSG
Ga0170824_1146401532 | 3300031231 | Forest Soil | MKADGYSEAMKPKSKAEGHKESIMSAVQEQTDGEAIADAARDATHIHYAKHLVRPEELRTLVGQALDGAGEPFRLTFDDGSTLYLTTSEGMLTATVSK
Ga0170820_155499651 | 3300031446 | Forest Soil | MPKSKAENHKEAIMSAVEDETEGEAITEAAKAAQQIRVADRIVRPEDLSTQVEKALDGPGERFHLKFDDGSELYLATAGGILTATVSK
Ga0170818_1047082551 | 3300031474 | Forest Soil | MKADGYSEAMKPKSKAEGHKESIMSAVQEQTGGEAIADAARDATHIHYAKHLVRPEELRTLVGQALDGAGEPFRLTFDDGSTLYLTTSEGMLTATVSK
Ga0307476_104061931 | 3300031715 | Hardwood Forest Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVECVVHPEELSKLVEKALDGSGESFRLKFDDDSTLYLSTREGIL
Ga0307474_104765351 | 3300031718 | Hardwood Forest Soil | MKADGKSEATRPKSKAEDHKEAIMSSIQEQTESEAIADAARAATHIYYSKQLVQPEELSTLVGQALDGAGEPFVLTFDDDSSLYLTTSEGILTATVSK
Ga0307469_113677472 | 3300031720 | Hardwood Forest Soil | MPKSKAENYKEAIMSSVEEETEAQAIGDAAKASSRIYVERVVRPEELSKLVENALHGAGESFRLKFDDDSILYLSTREGILTATVSK
Ga0307475_100505464 | 3300031754 | Hardwood Forest Soil | MPKSQAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGNGETFRLKFDDDSTLYLSTREGILTATVSK
Ga0307475_107427651 | 3300031754 | Hardwood Forest Soil | SVEEETETQAIADAAKASSRIYVERAVRPEELSKLVEKALDGGGESFRLKFDDDSTLYLSTREGILTATVSK
Ga0307478_103037813 | 3300031823 | Hardwood Forest Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGETFRLQFDDDSTLYLSTREGILTAT
Ga0307479_100022929 | 3300031962 | Hardwood Forest Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGSGETFRLQFDDDSTLYLSTREGILTATVSK
Ga0307479_100676854 | 3300031962 | Hardwood Forest Soil | MPKSQAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGNGETFRLKFDDDSTLYLST
Ga0307479_101521703 | 3300031962 | Hardwood Forest Soil | MPKSKAENHKEVIMAAVEEETESEAITEAAKAATNIYAEERLVRAEELSNLVEKALSGTAGSFRLKFEDGSNLYLSTSGGILTATVSN
Ga0307479_101831824 | 3300031962 | Hardwood Forest Soil | MPKSKAENHKEAIMSSVEDETEKEAIAAAAKAARKISVERAVRPEELSALVQKALDSSGEPFRLTFDDDSCLYLTTKEGILTATVSE
Ga0311301_108669052 | 3300032160 | Peatlands Soil | EDYKEEIMSAVEDETETESIADAAKAAKHIHCERLVRPEELSALVEKALDGAGESFRLTFDDDSRLYLTTSNGILTATVSK
Ga0307470_108930481 | 3300032174 | Hardwood Forest Soil | KPKSKAEGHKEVIMSSLQEQTESEALAAAARAATHIYYDKQPIRPEELSDMVGQALDGAGEPFILRFDDDSSLYLTTSDGILTATVSK
Ga0307471_1000215514 | 3300032180 | Hardwood Forest Soil | MPKSKAENYKEAIMSSVEEETETQAIADAAKASARIYVERVVRPEELSKLVEKALDGGGETFRLKFDDDSTLYLSTREGILAATVSK
Ga0335085_1000022126 | 3300032770 | Soil | MAAVEEETESEAIAAAAKEATQINVEQRVIPAEKLSELVEKALDDGGGPFRLQFGDGSNLYLSTSDGTLTATVSK
Ga0335079_100177386 | 3300032783 | Soil | MPKSKAESHKEAIMAAVEEETESEAIAAAAKEATQINVEQRVIPAEKLSELVEKALDDGGGPFRLQFGDGSNLYLSTSDGTLTATVSK
Ga0335079_107707051 | 3300032783 | Soil | MAAVEEETESEAIAAAAIAATQIKVEQRAVPAKELTDLVEKALDEGGGPFRLQFDDGSNLFLSTNGGMLTATVSK
Ga0335078_104828533 | 3300032805 | Soil | SKAESHKEAIMAAVEEETESEAIAAAAKAAAQIYVEQRLIPANQLSELVEKALDDGGGPFRLKFDDGSQLYLSSSAGTLTATVSK
Ga0335081_125703101 | 3300032892 | Soil | MAKSKAESYKESIMSAVEDETEGEVIAEAAKAAKQIIADRSVRPEELSARVEKALDGAGESFRLRFDDGSVLTLTTSSGSLTATVSK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice