NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F097732


Overview

Basic Information
Family ID F097732
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 121 residues
Representative Sequence MSSIKARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDSEGHNVLLLEAQRKSGEP
Number of Associated Samples 82
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 86.54 %
% of genes near scaffold ends (potentially truncated) 25.96 %
% of genes from short scaffolds (< 2000 bps) 63.46 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.78
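
The percentages above can be converted back to approximate member counts. The sketch below assumes, since the page does not state it, that each percentage is a share of the 104 family sequences; the dictionary labels and the helper name `pct_to_count` are illustrative only.

```python
# Assumption: every percentage is a share of the 104 family sequences.
FAMILY_SIZE = 104


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back to the nearest whole count."""
    return round(pct / 100 * total)


# Values copied from the Quality Assessment table; labels are shorthand.
reported = {
    "valid RBS motif": 86.54,
    "near scaffold end": 25.96,
    "short scaffold (< 2000 bps)": 63.46,
}

counts = {label: pct_to_count(pct) for label, pct in reported.items()}
for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption the three figures correspond to 90, 27 and 66 genes respectively.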


Most Common Taxonomy
Group Bacteria (99.038 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(27.885 % of family members)
Environment Ontology (ENVO) Unclassified
(24.038 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(65.385 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 14.47%, β-sheet: 32.24%, Coil/Unstructured: 53.29%

Predicted 3D Structure

pTM-score: 0.78

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.32.1.3: Extradiol dioxygenases | d3isqa1 | 3isq | 0.72218
d.32.1.0: automated matches | d2ehza1 | 2ehz | 0.71972
d.32.1.3: Extradiol dioxygenases | d1kw3b1 | 1kw3 | 0.71867
d.32.1.0: automated matches | d2zyqa1 | 2zyq | 0.70941
d.32.1.3: Extradiol dioxygenases | d4ghga1 | 4ghg | 0.69629



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF00490 | ALAD | 43.27
PF07992 | Pyr_redox_2 | 12.50
PF14373 | Imm_superinfect | 11.54
PF03099 | BPL_LplA_LipB | 6.73
PF00676 | E1_dh | 5.77
PF13302 | Acetyltransf_3 | 3.85
PF00488 | MutS_V | 3.85
PF12681 | Glyoxalase_2 | 3.85
PF10067 | DUF2306 | 0.96
PF12831 | FAD_oxidored | 0.96
PF13155 | Toprim_2 | 0.96
PF07805 | Obsolete Pfam Family | 0.96
PF01739 | CheR | 0.96
PF00069 | Pkinase | 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG0113 | Delta-aminolevulinic acid dehydratase, porphobilinogen synthase | Coenzyme transport and metabolism [H] | 43.27
COG0095 | Lipoate-protein ligase A | Coenzyme transport and metabolism [H] | 6.73
COG0321 | Lipoate-protein ligase B | Coenzyme transport and metabolism [H] | 6.73
COG0340 | Biotin-protein ligase | Coenzyme transport and metabolism [H] | 6.73
COG0567 | 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes | Energy production and conversion [C] | 5.77
COG1071 | TPP-dependent pyruvate or acetoin dehydrogenase subunit alpha | Energy production and conversion [C] | 5.77
COG0249 | DNA mismatch repair ATPase MutS | Replication, recombination and repair [L] | 3.85
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 3.85
COG1193 | dsDNA-specific endonuclease/ATPase MutS2 | Replication, recombination and repair [L] | 3.85
COG1352 | Methylase of chemotaxis methyl-accepting proteins | Signal transduction mechanisms [T] | 1.92
COG2226 | Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG | Coenzyme transport and metabolism [H] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.04 %
Unclassified | root | N/A | 0.96 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002245|JGIcombinedJ26739_100035670 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 4442
3300003505|JGIcombinedJ51221_10005681 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 3777
3300005178|Ga0066688_10042749 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2583
3300005445|Ga0070708_100006847 | All Organisms → cellular organisms → Bacteria | 9089
3300005445|Ga0070708_100067246 | All Organisms → cellular organisms → Bacteria | 3217
3300005446|Ga0066686_10370872 | All Organisms → cellular organisms → Bacteria | 977
3300005450|Ga0066682_10833989 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Thermales → Thermaceae → Marinithermus → Marinithermus hydrothermalis | 554
3300005467|Ga0070706_100326697 | All Organisms → cellular organisms → Bacteria | 1430
3300005534|Ga0070735_10106713 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 1760
3300005536|Ga0070697_100492122 | All Organisms → cellular organisms → Bacteria | 1072
3300005541|Ga0070733_10450632 | All Organisms → cellular organisms → Bacteria | 859
3300005541|Ga0070733_10767038 | All Organisms → cellular organisms → Bacteria | 647
3300005556|Ga0066707_10066546 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2137
3300005557|Ga0066704_10317650 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1050
3300005559|Ga0066700_10178236 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 1457
3300005602|Ga0070762_10001241 | All Organisms → cellular organisms → Bacteria | 11616
3300005602|Ga0070762_10757613 | All Organisms → cellular organisms → Bacteria | 655
3300005921|Ga0070766_11233903 | All Organisms → cellular organisms → Bacteria | 518
3300006059|Ga0075017_100857234 | All Organisms → cellular organisms → Bacteria | 704
3300006176|Ga0070765_100200720 | All Organisms → cellular organisms → Bacteria | 1809
3300006176|Ga0070765_100200943 | All Organisms → cellular organisms → Bacteria | 1808
3300006796|Ga0066665_10033796 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3386
3300006797|Ga0066659_10049808 | All Organisms → cellular organisms → Bacteria | 2636
3300009038|Ga0099829_10160782 | All Organisms → cellular organisms → Bacteria | 1799
3300009038|Ga0099829_10344581 | All Organisms → cellular organisms → Bacteria | 1226
3300009088|Ga0099830_10049819 | All Organisms → cellular organisms → Bacteria | 2950
3300009088|Ga0099830_10052950 | All Organisms → cellular organisms → Bacteria | 2871
3300009089|Ga0099828_10000278 | All Organisms → cellular organisms → Bacteria | 30403
3300009089|Ga0099828_10808149 | All Organisms → cellular organisms → Bacteria | 840
3300009090|Ga0099827_10453871 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1099
3300009137|Ga0066709_100294419 | All Organisms → cellular organisms → Bacteria | 2200
3300009521|Ga0116222_1016897 | All Organisms → cellular organisms → Bacteria | 3340
3300009698|Ga0116216_10030695 | All Organisms → cellular organisms → Bacteria | 3346
3300010048|Ga0126373_10008183 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 8288
3300010048|Ga0126373_10041572 | All Organisms → cellular organisms → Bacteria | 4046
3300010379|Ga0136449_100078112 | All Organisms → cellular organisms → Bacteria | 7009
3300011269|Ga0137392_10057613 | All Organisms → cellular organisms → Bacteria | 2947
3300011269|Ga0137392_10121245 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2086
3300011271|Ga0137393_11034681 | All Organisms → cellular organisms → Bacteria | 699
3300012206|Ga0137380_10064314 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 3356
3300012354|Ga0137366_11216895 | All Organisms → cellular organisms → Bacteria | 511
3300017924|Ga0187820_1027470 | All Organisms → cellular organisms → Bacteria | 1461
3300017933|Ga0187801_10071894 | All Organisms → cellular organisms → Bacteria | 1282
3300017943|Ga0187819_10018920 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 3986
3300017955|Ga0187817_10193955 | All Organisms → cellular organisms → Bacteria | 1294
3300017995|Ga0187816_10018268 | All Organisms → cellular organisms → Bacteria | 2719
3300018006|Ga0187804_10548745 | All Organisms → cellular organisms → Bacteria | 522
3300018022|Ga0187864_10277301 | All Organisms → cellular organisms → Bacteria | 757
3300018433|Ga0066667_10107115 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1876
3300018433|Ga0066667_12035117 | All Organisms → cellular organisms → Bacteria | 529
3300020580|Ga0210403_10037071 | All Organisms → cellular organisms → Bacteria | 3897
3300020580|Ga0210403_10395243 | All Organisms → cellular organisms → Bacteria | 1129
3300020581|Ga0210399_10171318 | All Organisms → cellular organisms → Bacteria | 1800
3300020582|Ga0210395_10209444 | All Organisms → cellular organisms → Bacteria | 1462
3300020582|Ga0210395_10349941 | All Organisms → cellular organisms → Bacteria | 1112
3300020582|Ga0210395_10605252 | All Organisms → cellular organisms → Bacteria | 822
3300020582|Ga0210395_11117883 | All Organisms → cellular organisms → Bacteria | 581
3300020583|Ga0210401_10094579 | All Organisms → cellular organisms → Bacteria | 2807
3300020583|Ga0210401_10162966 | All Organisms → cellular organisms → Bacteria | 2081
3300020583|Ga0210401_10440930 | All Organisms → cellular organisms → Bacteria | 1165
3300020583|Ga0210401_10853285 | All Organisms → cellular organisms → Bacteria | 770
3300021168|Ga0210406_10572244 | All Organisms → cellular organisms → Bacteria | 885
3300021170|Ga0210400_11266217 | All Organisms → cellular organisms → Bacteria | 592
3300021171|Ga0210405_10950427 | All Organisms → cellular organisms → Bacteria | 650
3300021171|Ga0210405_11230708 | All Organisms → cellular organisms → Bacteria | 553
3300021171|Ga0210405_11417973 | All Organisms → cellular organisms → Bacteria | 505
3300021178|Ga0210408_10359214 | All Organisms → cellular organisms → Bacteria | 1161
3300021180|Ga0210396_10054827 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 3646
3300021403|Ga0210397_10858305 | All Organisms → cellular organisms → Bacteria | 702
3300021404|Ga0210389_10072713 | All Organisms → cellular organisms → Bacteria | 2636
3300021404|Ga0210389_10572054 | All Organisms → cellular organisms → Bacteria | 888
3300021407|Ga0210383_11780960 | All Organisms → cellular organisms → Bacteria | 502
3300021420|Ga0210394_10306621 | All Organisms → cellular organisms → Bacteria | 1393
3300021420|Ga0210394_11054151 | All Organisms → cellular organisms → Bacteria | 703
3300021433|Ga0210391_10130439 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 1979
3300021475|Ga0210392_10483636 | All Organisms → cellular organisms → Bacteria | 910
3300021478|Ga0210402_11601409 | All Organisms → cellular organisms → Bacteria | 578
3300021479|Ga0210410_10181253 | All Organisms → cellular organisms → Bacteria | 1884
3300022523|Ga0242663_1052471 | All Organisms → cellular organisms → Bacteria | 722
3300024271|Ga0224564_1032998 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 977
3300025910|Ga0207684_10028571 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4752
3300025922|Ga0207646_10067440 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 3194
3300025922|Ga0207646_10225361 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1693
3300026334|Ga0209377_1096805 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1231
3300026524|Ga0209690_1009753 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 5178
3300027069|Ga0208859_1009302 | All Organisms → cellular organisms → Bacteria | 1072
3300027545|Ga0209008_1107942 | All Organisms → cellular organisms → Bacteria | 620
3300027604|Ga0208324_1046702 | All Organisms → cellular organisms → Bacteria | 1269
3300027854|Ga0209517_10491251 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 671
3300027855|Ga0209693_10049638 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 2063
3300027862|Ga0209701_10122020 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 1613
3300027875|Ga0209283_10082188 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 2088
3300027882|Ga0209590_10026841 | All Organisms → cellular organisms → Bacteria | 2989
3300027889|Ga0209380_10008428 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 6089
3300028015|Ga0265353_1010033 | All Organisms → cellular organisms → Bacteria | 854
3300028906|Ga0308309_10221510 | Not Available | 1569
3300030862|Ga0265753_1012573 | All Organisms → cellular organisms → Bacteria | 1143
3300031231|Ga0170824_125064367 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 891
3300031708|Ga0310686_118191107 | All Organisms → cellular organisms → Bacteria | 2073
3300031718|Ga0307474_10903935 | All Organisms → cellular organisms → Bacteria | 699
3300032160|Ga0311301_10075148 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 7044
3300032805|Ga0335078_10244827 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2452
3300032892|Ga0335081_11079772 | All Organisms → cellular organisms → Bacteria | 927
3300033134|Ga0335073_11659967 | All Organisms → cellular organisms → Bacteria | 606
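
Each scaffold identifier above joins two parts with a `|`. The part before it appears to be the Taxon OID of the sample the scaffold came from, and the part after it is the scaffold name. A minimal sketch of splitting one identifier follows; `ScaffoldRef` and `split_scaffold_id` are hypothetical names for illustration, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # sample identifier, e.g. "3300002245"
    scaffold_name: str  # e.g. "JGIcombinedJ26739_100035670"


def split_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' at the first '|'."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    # Assumption: the taxon OID is purely numeric, as in every row listed here.
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = split_scaffold_id("3300002245|JGIcombinedJ26739_100035670")
print(ref.taxon_oid, ref.scaffold_name)
```

The resulting `taxon_oid` can then be used as a key to look up the sample name and habitat for a scaffold.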




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 27.88%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 14.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 9.62%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 7.69%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.73%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 5.77%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 5.77%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.85%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.85%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.88%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.88%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 2.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.92%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.96%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.96%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.96%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.96%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005534 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009521 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaG | Environmental
3300009698 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_3_AS metaG | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010379 | Sb_50d combined assembly | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300017924 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_5 | Environmental
3300017933 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1 | Environmental
3300017943 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300017995 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_1 | Environmental
3300018006 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4 | Environmental
3300018022 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_11_40 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300022523 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2) | Environmental
3300024271 | Soil microbial communities from Bohemian Forest, Czech Republic - CSU5 | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental
3300027069 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF002 (SPAdes) | Environmental
3300027545 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O3 (SPAdes) | Environmental
3300027604 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_5_LS metaG (SPAdes) | Environmental
3300027854 | Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes) | Environmental
3300027855 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027889 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes) | Environmental
3300028015 | Soil microbial communities from Maridalen valley, Oslo, Norway - NSE6 | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300030862 | Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300032160 | Sb_50d combined assembly (MetaSPAdes) | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental
3300032892 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5 | Environmental
3300033134 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.2 | Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_1000356702 3300002245 Forest Soil MSSINARLVGVELYFDDLAAAKNFYEGTLGLNIVGEQPGHHAQFNVGRAFLCLEKKGVEDYPSRDKAVIFLEVPSIATAVEAIGQERFVHIARGSEGSGSPWAVLHDTEGHNLLLLEAQPASSER*
JGIcombinedJ51221_100056813 3300003505 Forest Soil MSTINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEHYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTQSPWAILHDTEGHNVLLLETRPGSQG*
Ga0066688_100427492 3300005178 Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNASAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARSAP*
Ga0070708_1000068473 3300005445 Corn, Switchgrass And Miscanthus Rhizosphere VPQINARLVGVELYFDDLVAAKRFYQETLGLAISGERPGHHAQFDAGPSFLCVEKKGVENYPSRDKAVIFLKVPSVQAAVETIGRERIVHLESSAEAGRQAWAVLHDPEGHNVVLLEATKR*
Ga0070708_1000672462 3300005445 Corn, Switchgrass And Miscanthus Rhizosphere MPNINARLVGLELYFEDLTVAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAISAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGAAKAI*
Ga0066686_103708722 3300005446 Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSAEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT*
Ga0066682_108339891 3300005450 Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT*
Ga0070706_1003266971 3300005467 Corn, Switchgrass And Miscanthus Rhizosphere MANINARLVGLELYFEDLTAAKRFYEGTLGLTLSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAISAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGAAKAI*
Ga0070735_101067132 3300005534 Surface Soil MSTINARLVGVELYFDDLVAAKNFYEGTLGLDVFSEQPGHHAQFNVGRAFLCLEKKGVEDYPSRDKAVVFLEVPSVQDAIEAIGRERFVHVESGSESTKPAWAVLHDTEGHNVLLLEPRRKAPG*
Ga0070697_1004921222 3300005536 Corn, Switchgrass And Miscanthus Rhizosphere NINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHVQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAISAIGRDRILRFEPAREDKRPAWAVLQDPEGHSVLLLEAGSAPSESPV*
Ga0070733_104506321 3300005541 Surface Soil MSTINARLVGVELYFDDLVAAKNFYEGTLGLDVFSEQPGHHAQFNLGRAFLCLEKKGVEDYPSRDKAVIFLEVPGVKDAIEAIGRERFVHVESGSESTKAAWAVLHDTEGHNVLLLEARPKAPG*
Ga0070733_107670382 3300005541 Surface Soil MSSINARLVGVELYFDDLVAAKNFYQGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYSSRDKAVIFLEVPNVRVAVESIGRERFVHVENGDESTQPWAVLHDSEGHNVLLLEAQRKSAEP*
Ga0066707_100665461 3300005556 Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLE
Ga0066704_103176502 3300005557 Soil MPNINARLVGLELYFEDLTVAKRFYEGTLGLILSGEQSGHDAQFNAGAAFLCLEKKGVEDYRSHDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARSAP*
Ga0066700_101782362 3300005559 Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNASAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT*
Ga0070762_100012418 3300005602 Soil MSSIHARLVGVELYFDDLVAAKNFYQGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDSEGHNVLLLEAQRKSGEP*
Ga0070762_107576131 3300005602 Soil MSSINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKRGVEDYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTQSPWAILHDTEGHNVLLLETRPGSQG*
Ga0070766_112339032 3300005921 Soil MSSINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKRGVEDYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTQSPWAILHDTEGHNVLLLETRP
Ga0075017_1008572341 3300006059 Watersheds MSSINARLVGVELYFDDLVAAKNFYEGTLGLNIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTPPWAVLHDSEGHNVLLLEAQRKSGEP*
Ga0070765_1002007201 3300006176 Soil MSSIHARLVGVELYFDDLVAAKNFYQGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDS
Ga0070765_1002009431 3300006176 Soil MSSINARLVGVELYFDDLVAAKNFYEGTIGLDIFGEQPGHHAQFKVGQAFLCLEKKGVEDYSSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDS
Ga0066665_100337962 3300006796 Soil MPNINARLVGLELYFEDLTVAKRFYEGTLGLILSGEQSGHDAQFNAGAAFLCLEKKGVEDYPSHDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLETRSAP*
Ga0066659_100498082 3300006797 Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGHILSGEQSGHHAQFNASAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT*
Ga0099829_101607821 3300009038 Vadose Zone Soil MPQIKARLVGVELYFDDLVAAKRFYHETLGLAISGERPRHHAQFGAAPSFLCVEKKGVENYPSCDKAVIFFEVPSVQDAVEAIGRKRIVHFESNPEAGRQAWAVLHDPEGHNVLLLEATKR*
Ga0099829_103445811 3300009038 Vadose Zone Soil MANINARLVGLELYFEDLTAAKRFYEGTLGLALSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAISAIGRDRILRFEPAREDKRPAWAV
Ga0099830_100498194 3300009088 Vadose Zone Soil MPQIKARLVGVELYFDDLVAAKRFYHETLGLAISGERPRHHAQFGAAPSFLCVEKKGVENYPSCDKAVIFFEVPSVQDAVEAIGRERIVHFESNPEAGRQAWAVLHDPEGHNVLLLEATKR*
Ga0099830_100529503 3300009088 Vadose Zone Soil MANINARLVGLELYFEDLTAAKRFYEGTLGLALSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARGAPRTA*
Ga0099828_100002787 3300009089 Vadose Zone Soil MANINARLVGLELYFEDLTAAKRFYEGTLGLALSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGECPAWAVLQDPEGHSVLLLEARGAPRTA*
Ga0099828_108081492 3300009089 Vadose Zone Soil MPNIHARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGAAKAI*
Ga0099827_104538712 3300009090 Vadose Zone Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGAAKAI*
Ga0066709_1002944192 3300009137 Grasslands Soil MPNINARLVGLELYFEDLTVAKRFYEGTLGLILSGEQSGHDAQFNAGAAFLCLEKKGVEDYPSHDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARSAP*
Ga0116222_10168973 3300009521 Peatlands Soil MSKINAQLVGVELYFDDLPAAKRFYQETLGLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDNAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGTRSSWAALHDPEGHNVLLLEKPQSAG*
Ga0116216_100306952 3300009698 Peatlands Soil MSKINARLVGVELYFDDLPAAKRFYQETLGLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDNAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGTRSSWAALHDPEGHNVLLLEKPQSAG*
Ga0126373_100081837 3300010048 Tropical Forest Soil LDARLIGVELYFDDLQAAKRFYQETLGLNLIHEQSGHHVQFGIGGPFLCLERKGVEDYPSRDKAVIFLEVADVRSAVEELGKEKVVHFEAGNADGAPPWAVLHDPEGHNVLLLQASQR*
Ga0126373_100415723 3300010048 Tropical Forest Soil LPDLGARLIGVELYFDDLVAAKRFYQETLGLKVSHEQSGHHVQFEIGGPFLCLEKKGVEDYPSRDKAVIFLEVADVRSALEGLGKANLVHFEEGDRNAASPWAVLHDPEGHNVLLLQANSR*
Ga0136449_1000781123 3300010379 Peatlands Soil VPGINARLVGVELYFDDLAAAKAFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVAAIGQERFAHVETGDESTRPPWAVLHDTEGHNVLLLEARPDSQG*
Ga0137392_100576132 3300011269 Vadose Zone Soil VGVELYFDDLVAAKRFYQETLGLAISGERPRHHAQFGAAPSFLCVEKKGVENYPSCDKAVIFFEVPSVQDAVEAIGRKRIVHFESNPEAGRQAWAVLHDPEGHNVLLLEATKR*
Ga0137392_101212452 3300011269 Vadose Zone Soil LVGLELFFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARGAPRTA*
Ga0137393_110346811 3300011271 Vadose Zone Soil ARLVGLELYFEDLTAAKRFYEGTLGLALSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARGAPRTA
Ga0137380_100643141 3300012206 Vadose Zone Soil MPNINARLVGLELYFEDLTVAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT*
Ga0137366_112168951 3300012354 Vadose Zone Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKVVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT*
Ga0187820_10274702 3300017924 Freshwater Sediment MSEQDLHDSKINARLVGVELYFDDLPAAKHFYQDTLGLSLSGEQLGHHAQFDVGTVFFCAEKKGVEDYPSRDKAVIFLEVDSVRAAVEAIGGDRIVRFDNNSRSPWAVLHDPEGHNVILLEAQQKSRDR
Ga0187801_100718943 3300017933 Freshwater Sediment MSKINARLVGVELYFDDLPAAKRFYQETLSLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDKAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGSRSSWAALHDPEGHNVLLLEKPQSAG
Ga0187819_100189202 3300017943 Freshwater Sediment MPNINARLVGVELYFDDLLGARNFYEQTLGLDVFSEQPGHHAQFNVGRAFLCLEKKGVENYPSRDKAVIFLEVPSVQAAVEVIGRERFVHIERSESPQPPWAILHDTEGHNVLLLEARPESQR
Ga0187817_101939552 3300017955 Freshwater Sediment MLKINARLVGVELYFDDLPAAKRFYQETLGLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDKAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGTRSSWAALHDPEGHNVLLLEKPQSAG
Ga0187816_100182683 3300017995 Freshwater Sediment MSKINARLVGVELYFDDLPAAKRFYQETLSLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDKAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGTRSSWAALHDPEGHNVLLLEKAQSAR
Ga0187804_105487451 3300018006 Freshwater Sediment ARLVGVELYFDDLPTAKHFYQDTLGLSLSGEQLGHHAQFDVGTVFFCAEKKGVEDYPSRDKAVIFLEVDSVKAAVEAIGGDRIVRFDNNSRAPWAVLHDPEGHNVILLEAQQKSRDR
Ga0187864_102773012 3300018022 Peatland MSSINARLVGVELYFDDLVAAKNFYEGTLGLDVFSEQPGHHAQFNVGRAFLCLEKKGVEDYPSRDKAVIFLEVPSVQDAVEAIGQERFVHIERGAESTRPPWAVLQDTEGHNVLLLEARPKSQG
Ga0066667_101071152 3300018433 Grasslands Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNASAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT
Ga0066667_120351171 3300018433 Grasslands Soil YPISTSQVHLGGLDLFVEDLAVVRRFYAGTVGLILYGEPSGHDAQFIAGPAFQCLKKKGVEDYPSHDKAVVFLEVPSVEAAINAISRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARGVSTTT
Ga0210403_100370712 3300020580 Soil MSSINARLVGVELYFDDLVAAKNFYEGILGLDIFSEQPGHHAQFDVGQAFLCLEKKGVVDYPSRDKAVIFLEVPNVQDAVEAIGRERFVHVETGDERTPPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210403_103952432 3300020580 Soil MSSINARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWGVLHDSEGHNVLLLEALRKSGEP
Ga0210399_101713182 3300020581 Soil MSSINARLVGVELYFDDLVAAKNFYEGILGLDIFSEQPGHHAQFDVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQDAVEAIGRERFVHVETGDERTPPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210395_102094441 3300020582 Soil MSTINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEHYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTQSPWAILHDTEGHNVLLLETRPGSQG
Ga0210395_103499412 3300020582 Soil MSSIKTRLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210395_106052521 3300020582 Soil MSSIKARLVGVELYFDDLVAAKNFYAETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGAEDYPSRDKAVIFLEVPKVQAAVEAIGRERFVHVENGDERTPPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210395_111178831 3300020582 Soil MSSIKARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGAEDYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTRPPWAILHDTEGHNVLLLE
Ga0210401_100945793 3300020583 Soil MSKINARLLGVELYFDDLLAAKRFYQETLGLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDKAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGKRSSWAALHDPEGHNVLLLEKPQSAR
Ga0210401_101629661 3300020583 Soil MSSIDARLVGVELYFDDLVAAKNFYQGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYSSRDKAVIFLEVPNVRVAVESIGRERFVHVENGDESTQPWAVLHDSEGHNVLLLEAQRKSAEP
Ga0210401_104409302 3300020583 Soil MSSIHARLVGVELYFDDLVAAKNFYQGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210401_108532852 3300020583 Soil MSSINARLVGIELYFDDLIAAKDFYAGTLGLNIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPSVHDAVAAIGEERFVHIELGSESTQPSWAVLHDTEGHNVLLLEARPKSQG
Ga0210406_105722443 3300021168 Soil VAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWGVLHDSEGHNVLLLEALRKSGEP
Ga0210400_112662171 3300021170 Soil MSSINARLVGVELYFDDLAAAKNFYEGTLGLNIVGEQPGHHAQFNVGRAFLCLEKKGVEDYPSRDKAVIFLEVPSIATAVEAIGQERFVHIARGSEGSGSPWAVLHDTEG
Ga0210405_109504271 3300021171 Soil MSSINARLVGVELYFDDLVAAKNFYEGTLGLDVFGEQPGHHAQFKVGQAFLCLEKKCVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVESGDERTQPWAVLHDSEGHNVLLLEAQRKSGDP
Ga0210405_112307082 3300021171 Soil MSSIHARLVGVELYFDDLVAAKNFYQGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDSEGHNVLLLEAQ
Ga0210405_114179732 3300021171 Soil MSTINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEHYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTQSPWAILHDTEGHNVLL
Ga0210408_103592142 3300021178 Soil MSSIKARLVGVELYFDDLVAAKNFYAETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGAEDYPSRDKAVIFLEVPKVQAAVEAIGRERFVHVENGNERTPPWAVLHDSEGHNVLLLEAHRKSGEP
Ga0210396_100548272 3300021180 Soil MSSIKARLVGVELYFDDLVAAKNFYAETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGAEDYPSRDKAVIFLEVPKVQAAVETIGRERFVHVENGDERTPPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210397_108583051 3300021403 Soil MSSINARLVGVELYFDDLVAAKNFYKGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRDRFVHVENGDERTQPWGVLHDSEGHNVLLLEALRKSGEP
Ga0210389_100727134 3300021404 Soil MSSIKARLVGVELYFDDLVAAKNFYAETLELDIFGEQPGHHAQFNVGQAFLCLEKKGAEDYPSRDKAVIFLEVPKVQAAVETIGRERFVHVENGDERTPPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210389_105720542 3300021404 Soil MSSINARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWGVLHDSEGHNVLLLEAHRKSGEP
Ga0210383_117809601 3300021407 Soil MSSINARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPW
Ga0210394_103066211 3300021420 Soil DDLVAAKNFYAETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGAEDYPSRDKAVIFLEVPKVQAAVETIGRERFVHVENGDERTPPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0210394_110541511 3300021420 Soil MSSINARLVGVELYFDDLVAAKNFYEETLGLDIFGKQPGHHVQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNLQTAVEAIGRERFVHVESGDESTQSPWAILPDTEGHNVLLLETRPGSQG
Ga0210391_101304391 3300021433 Soil MSSIKARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPW
Ga0210392_104836361 3300021475 Soil ARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWGVLHDSEGHNVLLLEALRKSGEP
Ga0210402_116014091 3300021478 Soil MSSINARLVGVELYFDDLVAAKNFYEGTVGLDIFGEQPGHHAQFKVGQAFLCLEKKGVEDYSSRDKAVIFLEVPNVQAAVEAIGRQRFVYVENGDEDTQPWAVLHDSEGHNVLLLEAQR
Ga0210410_101812533 3300021479 Soil MSSINARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQLWAVLHDSEGHNVLLLEAQRKSGEP
Ga0242663_10524711 3300022523 Soil MSSIHARLVGVELYFDDLVAAKNFYQGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVENYSSRDKAVIFLEVPNVRVAVESIGRERFVHVENGDESTQPWAVLHDSEGHNVLLLEAQRKSAEP
Ga0224564_10329982 3300024271 Soil MSSINARLVGVELYFDDLVAAKNFYEEILGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNIQAAVEAIGRERFVHVGSGDESPQSPWAILHDTEGHNVLLLETRTASQG
Ga0207684_100285713 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere MPNINARLVGLELYFEDLTVAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAISAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEGRGAAKAI
Ga0207646_100674403 3300025922 Corn, Switchgrass And Miscanthus Rhizosphere MPNINARLVGLELYFEDLTAAKRFYEGTLGLTLSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGGRPAWAVLHDPEGHNVLLLEAHGAPKTA
Ga0207646_102253611 3300025922 Corn, Switchgrass And Miscanthus Rhizosphere VPQINARLVGVELYFDDLVAAKRFYQETLGLAISGERPGHHAQFDAGPSFLCVEKKGVENYPSRDKAVIFLKVPSVQAAVETIGRERIVHLESSAEAGRQAWAVLHDPEGHNVVLLEATK
Ga0209377_10968052 3300026334 Soil MPNINARLVGLELYFEDLTVAKRFYEGTLGLILSGEQSGHDAQFNAGAAFLCLEKKGVEDYPSHDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGAPKLHESAFF
Ga0209690_10097532 3300026524 Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGVSTTT
Ga0208859_10093022 3300027069 Forest Soil MSTINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTQSPWAILHDTEGHNVLLLE
Ga0209008_11079421 3300027545 Forest Soil MSSINARLVGVELYFDDLAAAKNFYEGTLGLNIVGEQPGHHAQFNVGRAFLCLEKKGVEDYPSRDKAVIFLEVPSIATAVEAIGQERFVHIARGSEGSGSPWAVLHDTEGHNLLL
Ga0208324_10467023 3300027604 Peatlands Soil MSKINAQLVGVELYFDDLPAAKRFYQETLGLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDNAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGTRSSWAALHDPEGHNVLLLEKPQSAG
Ga0209517_104912511 3300027854 Peatlands Soil MSKINAQLVGVELYFDDLPAAKRFYQETLGLSLSGEQLGHHAQFNFGQAFLCLEKKGVEDYPSQDNAVIFLEVPSVQAAVEAIGRERFVHVEPGAEGTRSSWAALHDPEGHNVL
Ga0209693_100496382 3300027855 Soil MSSINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKRGVEDYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVDRGDENTQSPWAILHDTEGHNVLLLETRPGSQG
Ga0209701_101220202 3300027862 Vadose Zone Soil MANINARLVGLELYFEDLTAAKRFYEGTLGLALSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLQDPEGHSVLLLEARGAPRTA
Ga0209283_100821883 3300027875 Vadose Zone Soil MPQIKARLVGVELYFDDLVAAKRFYHETLGLAISGERPRHHAQFGAAPSFLCVEKKGVENYPSCDKAVIFFEVPSVQDAVEAIGRKRIVHFESNPEAGRQAWAVLHDPEGHNVLLLEATK
Ga0209590_100268413 3300027882 Vadose Zone Soil MPNINARLVGLELYFEDLTAAKRFYEGTLGLILSGEQSGHHAQFNAGAAFLCLEKKGVEDYPSRDKAVVFLEVPSVEAAINAIGRDRILRFEPAREGERPAWAVLHDPEGHSVLLLEARGAAKAI
Ga0209380_100084281 3300027889 Soil MSSINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKRGVEDYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVDRGDENTQSPW
Ga0265353_10100332 3300028015 Soil MSSIKARLVGVELYFDDLVAAKNFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQPWAVLHDSEGHNVLLLEAQRKSGEP
Ga0308309_102215103 3300028906 Soil MSSINARLVGVELYFDDLVAAKNFYEGTIGLDIFGEQPGHHAQFKVGQAFLCLEKKGVEDYSSRDKAVIFLEVPNVQAAVEAIGRERFVHVENGDERTQP
Ga0265753_10125731 3300030862 Soil MSSINARLVGVELYFDDLVAAKNFYEGILGLDIFSEQPGHHAQFDVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVEAIGRERFVHVGSGDESTQSPWAILHDTEGHNVLLLETRPGSQG
Ga0170824_1250643672 3300031231 Forest Soil MSTINARLVGVELYFDDLVAAKNFYEETLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQTAVEAIGRERFVHVESGDESTQSPWAILHDTEGHNVLLLETRPGSQG
Ga0310686_1181911072 3300031708 Soil MSSINARLVGVELYFDDLVAAKNFYEEILGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNIQAAVEAIGRERFVHVGSGDESPQSPWAILHDTEGHNVLLLETRTGSQG
Ga0307474_109039352 3300031718 Hardwood Forest Soil MSSINARLVGVELYFDDLVAAKNFYEGTLGLNIFGEQPGHHAQFNVGRAFLCLEKKGVEDYPSRDKAVIFLEVPSVASAVEVIGKERFVHIARGSEGSPWAVLHDTEGHNVLLLEAQRTSGER
Ga0311301_100751484 3300032160 Peatlands Soil VPGINARLVGVELYFDDLAAAKAFYEGTLGLDIFGEQPGHHAQFNVGQAFLCLEKKGVEDYPSRDKAVIFLEVPNVQAAVAAIGQERFAHVETGDESTRPPWAVLHDTEGHNVLLLEARPDSQG
Ga0335078_102448274 3300032805 Soil MSKLKARLVGVELYFDDLVRAKRFYEDTLGLSISGEEDGHHAQFNLGAAFLCVEKKGVEDFPSYDKAVIFLEVPSVEEAVRTLGSRMIVRFEPDAGDTRRPWAVIHDPEGHNVLLLERPSAES
Ga0335081_110797721 3300032892 Soil FDDLVRAKRFYEDTLGLSISGEEDGHHAQFNLGAAFLCVEKKGVEDFPSYDKAVIFLEVPSVEEAVRTLGSRMIVRFEPDAGDTRRPWAVIHDPEGHNVLLLERPSAES
Ga0335073_116599672 3300033134 Soil MPNIHARLVGVELYFADLPAAKRFYHEILGLQLSGEQPGQHAQFDAGPAFLCAEKKGVEDYPSQDKAVIFLEVPSLRAAVEAIGKERIVRFSEDANTPWAVLHDPEGHNVLLLQSSEPKSRS

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice