NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome Family F063965



Overview

Basic Information
Family ID F063965
Family Type Metagenome
Number of Sequences 129
Average Sequence Length 154 residues
Representative Sequence MEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Number of Associated Samples 111
Number of Associated Scaffolds 129

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 66.12 %
% of genes near scaffold ends (potentially truncated) 44.96 %
% of genes from short scaffolds (< 2000 bps) 82.17 %
Associated GOLD sequencing projects 103
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.60


Most Common Taxonomy
Group Unclassified (65.891 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(13.954 % of family members)
Environment Ontology (ENVO) Unclassified
(17.829 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(44.186 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 30.53%, β-sheet: 12.63%, Coil/Unstructured: 56.84%

Predicted 3D Structure

pTM-score: 0.60

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
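The low-confidence rule quoted above (pTM < 0.7) amounts to a simple threshold check. The following is a minimal Python sketch of that rule; the constant and function names are illustrative assumptions and not part of NMPFamsDB. Only the 0.7 cutoff and the 0.60 score come from this page.

```python
# Cutoff taken from the note on this page: pTM < 0.7 is a low confidence model.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a pTM-score falls below the cutoff (hypothetical helper)."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score must lie between 0 and 1")
    return ptm < cutoff


# Family F063965 reports a pTM-score of 0.60.
print(is_low_confidence(0.60))
```

A score exactly at the cutoff is not flagged, because the page states the rule with a strict inequality.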



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name  % Frequency in 129 Family Scaffolds
PF01656  CbiA  0.78
PF02146  SIR2  0.78
PF06325  PrmA  0.78
PF13432  TPR_16  0.78
PF13646  HEAT_2  0.78
PF07676  PD40  0.78
PF06439  3keto-disac_hyd  0.78
PF07394  DUF1501  0.78
PF07638  Sigma70_ECF  0.78

Neighboring Clusters of Orthologous Genes (COGs)

COG ID  Name  Functional Category  % Frequency in 129 Family Scaffolds
COG0846  NAD-dependent protein deacetylase, SIR2 family  Posttranslational modification, protein turnover, chaperones [O]  0.78
COG1595  DNA-directed RNA polymerase specialized sigma subunit, sigma24 family  Transcription [K]  0.78
COG2264  Ribosomal protein L11 methylase PrmA  Translation, ribosomal structure and biogenesis [J]  0.78
COG2890  Methylase of polypeptide chain release factors  Translation, ribosomal structure and biogenesis [J]  0.78
COG3897  Protein N-terminal and lysine N-methylase, NNT1/EFM7 family  Posttranslational modification, protein turnover, chaperones [O]  0.78



Phylogeny

NCBI Taxonomy

Name  Rank  Taxonomy  Distribution
Unclassified  root  N/A  65.89 %
All Organisms  root  All Organisms  34.11 %
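The distribution percentages can be converted back into whole-number member counts. This is a minimal Python sketch, assuming each percentage is taken over the 129 sequences reported in the overview; the page states that denominator explicitly only for the Pfam and COG tables. The helper name is hypothetical.

```python
FAMILY_SIZE = 129  # "Number of Sequences" from the overview


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into the nearest whole number of members."""
    return round(percent / 100 * total)


# Values copied from the NCBI taxonomy distribution.
distribution = {"Unclassified": 65.89, "All Organisms": 34.11}
counts = {name: members_from_percent(pct) for name, pct in distribution.items()}

for name, count in counts.items():
    print(f"{name}: {count} of {FAMILY_SIZE}")

# The two groups should account for every family member.
print("total:", sum(counts.values()))
```

Under the same assumption, the 0.78 % reported for each neighboring Pfam domain and COG corresponds to a single scaffold out of 129.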


Associated Scaffolds


Scaffold  Taxonomy  Length
3300004114|Ga0062593_100423213  All Organisms → cellular organisms → Bacteria  1201
3300004152|Ga0062386_100163162  Not Available  1742
3300004156|Ga0062589_100772227  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  865
3300004480|Ga0062592_101090163  All Organisms → cellular organisms → Bacteria  738
3300005176|Ga0066679_10568455  Not Available  739
3300005177|Ga0066690_10911440  Not Available  560
3300005184|Ga0066671_10685474  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia → Escherichia coli  663
3300005332|Ga0066388_100270732  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  2340
3300005439|Ga0070711_100672126  Not Available  870
3300005529|Ga0070741_10137747  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia  2482
3300005532|Ga0070739_10093175  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1728
3300005533|Ga0070734_10800721  Not Available  535
3300005549|Ga0070704_100896257  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  797
3300005552|Ga0066701_10456639  Not Available  793
3300005555|Ga0066692_10621662  Not Available  676
3300005557|Ga0066704_10387433  Not Available  931
3300005559|Ga0066700_10149063  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1583
3300005602|Ga0070762_11296724  Not Available  505
3300005713|Ga0066905_100069179  Not Available  2263
3300005713|Ga0066905_101205166  Not Available  677
3300005764|Ga0066903_100398065  All Organisms → cellular organisms → Bacteria  2263
3300005764|Ga0066903_100582439  Not Available  1932
3300005764|Ga0066903_102210275  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1061
3300005764|Ga0066903_103298758  Not Available  872
3300005764|Ga0066903_104471296  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  746
3300006028|Ga0070717_11037917  Not Available  747
3300006174|Ga0075014_100036009  Not Available  2035
3300006796|Ga0066665_10124123  Not Available  1928
3300006797|Ga0066659_10307559  Not Available  1212
3300006800|Ga0066660_11087030  Not Available  635
3300006854|Ga0075425_100484111  All Organisms → cellular organisms → Bacteria  1425
3300006871|Ga0075434_101396668  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. OHSU_III  710
3300006893|Ga0073928_10083219  Not Available  2735
3300006893|Ga0073928_10175379  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1703
3300006893|Ga0073928_10216514  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1488
3300006904|Ga0075424_101455588  All Organisms → cellular organisms → Bacteria  727
3300009012|Ga0066710_100454451  Not Available  1921
3300009012|Ga0066710_101356753  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae  1103
3300009012|Ga0066710_103102528  Not Available  642
3300009088|Ga0099830_10562041  All Organisms → cellular organisms → Bacteria → PVC group  934
3300009137|Ga0066709_102919568  Not Available  630
3300009147|Ga0114129_13522290  Not Available  500
3300009444|Ga0114945_10051627  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  2240
3300009448|Ga0114940_10340231  Not Available  657
3300009806|Ga0105081_1024830  Not Available  760
3300009820|Ga0105085_1063561  Not Available  682
3300010048|Ga0126373_12474526  Not Available  578
3300010343|Ga0074044_10330371  Not Available  1001
3300010379|Ga0136449_102724444  Not Available  701
3300010396|Ga0134126_10240092  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  2147
3300011269|Ga0137392_10733382  Not Available  817
3300012180|Ga0153974_1125977  Not Available  592
3300012181|Ga0153922_1012327  Not Available  2017
3300012198|Ga0137364_10380217  Not Available  1055
3300012203|Ga0137399_10836190  Not Available  775
3300012205|Ga0137362_10732679  Not Available  849
3300012206|Ga0137380_10623341  Not Available  942
3300012349|Ga0137387_11026611  Not Available  590
3300012354|Ga0137366_10127089  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1924
3300012362|Ga0137361_10178190  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1915
3300012922|Ga0137394_11182776  Not Available  628
3300012923|Ga0137359_10161992  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1996
3300012923|Ga0137359_11157287  Not Available  660
3300012923|Ga0137359_11554516  Not Available  549
3300012927|Ga0137416_11857005  Not Available  551
3300014053|Ga0119956_1018142  Not Available  536
3300015084|Ga0167654_1010304  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1605
3300015241|Ga0137418_10317442  Not Available  1297
3300016422|Ga0182039_11475147  Not Available  619
3300017970|Ga0187783_10091211  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales  2245
3300017970|Ga0187783_10241795  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1321
3300017972|Ga0187781_10413363  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  964
3300017975|Ga0187782_10605950  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  842
3300018062|Ga0187784_10466338  Not Available  1019
3300018063|Ga0184637_10081088  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → unclassified Planctomycetales → Planctomycetales bacterium  1990
3300018085|Ga0187772_10110655  Not Available  1780
3300018085|Ga0187772_10185822  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1390
3300018468|Ga0066662_10760167  Not Available  934
3300019487|Ga0187893_10576120  Not Available  718
3300019487|Ga0187893_10689047  Not Available  634
3300021171|Ga0210405_10128607  Not Available  2001
3300021372|Ga0213877_10038699  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1347
3300021476|Ga0187846_10039660  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  2106
(restricted) 3300021517|Ga0224723_1037848  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1797
3300022557|Ga0212123_10063729  Not Available  3236
3300022557|Ga0212123_10290823  Not Available  1148
3300022563|Ga0212128_10101849  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1855
3300025916|Ga0207663_10838428  Not Available  733
3300026538|Ga0209056_10145193  Not Available  1829
3300027109|Ga0208603_1062765  Not Available  558
3300027773|Ga0209810_1026894  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  3526
3300027826|Ga0209060_10591145  Not Available  502
3300027884|Ga0209275_10830471  Not Available  533
3300027902|Ga0209048_10601750  Not Available  733
3300028536|Ga0137415_10700093  Not Available  826
3300028792|Ga0307504_10374370  Not Available  554
3300031545|Ga0318541_10060125  Not Available  1972
3300031561|Ga0318528_10611994  Not Available  584
3300031576|Ga0247727_10185464  All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes  1934
3300031716|Ga0310813_10502157  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1060
3300031720|Ga0307469_10501296  Not Available  1064
3300031724|Ga0318500_10028492  Not Available  2214
3300031744|Ga0306918_10249385  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1355
3300031777|Ga0318543_10070789  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1461
3300031781|Ga0318547_10182802  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1246
3300031782|Ga0318552_10297419  Not Available  820
3300031794|Ga0318503_10292486  Not Available  529
3300031796|Ga0318576_10069253  Not Available  1565
3300031821|Ga0318567_10067749  Not Available  1880
3300031890|Ga0306925_11299257  Not Available  723
3300031893|Ga0318536_10299282  Not Available  816
3300031894|Ga0318522_10287043  Not Available  624
3300031910|Ga0306923_10603657  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1232
3300032041|Ga0318549_10121187  Not Available  1152
3300032043|Ga0318556_10579389  Not Available  585
3300032059|Ga0318533_11268768  Not Available  539
3300032068|Ga0318553_10080729  Not Available  1638
3300032160|Ga0311301_11001605  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1105
3300032160|Ga0311301_12743060  Not Available  541
3300032770|Ga0335085_10587463  All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium  1256
3300033004|Ga0335084_11943017  Not Available  574

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
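Each scaffold entry above has the form `<number>|<scaffold name>`, and the number before the `|` matches values in the Taxon OID column of the Associated Samples table. The following is a minimal Python sketch for splitting an entry and looking up its sample. Reading the prefix as a Taxon OID is an inference from this page, and `ScaffoldRef`, `parse_scaffold` and the one-entry `samples` dictionary are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str     # assumed to be the IMG/M Taxon OID of the sample
    scaffold_id: str   # scaffold name within that sample
    restricted: bool   # True when the entry is prefixed with "(restricted)"


def parse_scaffold(entry: str) -> ScaffoldRef:
    """Split a scaffold entry such as '3300004114|Ga0062593_100423213'."""
    text = entry.strip()
    marker = "(restricted)"
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold_id = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unrecognized scaffold entry: {entry!r}")
    return ScaffoldRef(taxon_oid, scaffold_id, restricted)


# One row copied from the Associated Samples table, keyed by Taxon OID.
samples = {
    "3300004114": "Soil microbial communities from Arlington Agricultural "
                  "Research Station in Wisconsin, USA - Nitrogen cycling - "
                  "Combined assembly of AARS Block 5",
}

ref = parse_scaffold("3300004114|Ga0062593_100423213")
print(ref.scaffold_id, "->", samples.get(ref.taxon_oid, "unknown sample"))
```

Keeping the restricted flag on the parsed record makes it easy to filter out datasets that fall under the JGI data usage policy before any further processing.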




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  13.95%
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  12.40%
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  8.53%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil  6.20%
Tropical Peatland  Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland  5.43%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  4.65%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  3.10%
Populus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere  3.10%
Iron-Sulfur Acid Spring  Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring  3.88%
Surface Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil  3.88%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  3.88%
Peatlands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil  2.33%
Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil  2.33%
Soil  Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil  2.33%
Groundwater  Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater  1.55%
Thermal Springs  Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs  1.55%
Watersheds  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds  1.55%
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  1.55%
Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil  1.55%
Groundwater Sand  Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand  1.55%
Microbial Mat On Rocks  Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks  1.55%
Attine Ant Fungus Gardens  Host-Associated → Fungi → Mycelium → Unclassified → Unclassified → Attine Ant Fungus Gardens  1.55%
Freshwater Lake Sediment  Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment  0.78%
Freshwater Sediment  Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment  0.78%
Bog Forest Soil  Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil  0.78%
Contaminated Water  Environmental → Aquatic → Freshwater → Pond → Sediment → Contaminated Water  0.78%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  0.78%
Terrestrial Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil  0.78%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil  0.78%
Bulk Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Bulk Soil  0.78%
Glacier Forefield Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil  0.78%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  0.78%
Bog Forest Soil  Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil  0.78%
Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil  0.78%
Agricultural Soil  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil  0.78%
Biofilm  Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm  0.78%
Biofilm  Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm  0.78%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID  Sample Name  Habitat Type
3300004114  Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5  Environmental
3300004152  Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3  Environmental
3300004156  Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1  Environmental
3300004480  Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4  Environmental
3300005176  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128  Environmental
3300005177  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139  Environmental
3300005184  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120  Environmental
3300005332  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)  Environmental
3300005435  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG  Environmental
3300005439  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG  Environmental
3300005529  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1  Environmental
3300005532  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen14_06102014_R1  Environmental
3300005533  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1  Environmental
3300005549  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG  Environmental
3300005552  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150  Environmental
3300005555  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141  Environmental
3300005557  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153  Environmental
3300005559  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149  Environmental
3300005602  Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2  Environmental
3300005713  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)  Environmental
3300005764  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)  Environmental
3300006028  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG  Environmental
3300006052  Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013  Environmental
3300006174  Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014  Environmental
3300006796  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114  Environmental
3300006797  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108  Environmental
3300006800  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109  Environmental
3300006854  Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4  Host-Associated
3300006871  Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3  Host-Associated
3300006893  Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG  Environmental
3300006904  Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3  Host-Associated
3300009012  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159  Environmental
3300009088  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG  Environmental
3300009137  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158  Environmental
3300009147  Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)  Host-Associated
3300009444  Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3  Environmental
3300009448  Groundwater microbial communities from Cold Creek, Nevada to study Microbial Dark Matter (Phase II) - Cold Creek Source  Environmental
3300009806  Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60  Environmental
3300009820  Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60  Environmental
3300010048  Tropical forest soil microbial communities from Panama - MetaG Plot_11  Environmental
3300010343  Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1  Environmental
3300010379  Sb_50d combined assembly  Environmental
3300010396  Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2  Environmental
3300011269  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG  Environmental
3300012180  Attine ant fungus gardens microbial communities from Georgia, USA - TSGA058 MetaG  Host-Associated
3300012181  Attine ant fungus gardens microbial communities from New Jersey, USA - TSNJ006 MetaG  Host-Associated
3300012198  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG  Environmental
3300012203  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG  Environmental
3300012205  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG  Environmental
3300012206  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG  Environmental
3300012349  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG  Environmental
3300012354  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG  Environmental
3300012362  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG  Environmental
3300012922  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG  Environmental
3300012923  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG  Environmental
3300012927  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)  Environmental
3300014053  Contaminated water microbial communities from Hexachlorocyclohexane (HCH) contaminated sediment in Lucknow, India - Pond Sediment  Environmental
3300015084  Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-5a, rocky medial moraine)  Environmental
3300015241  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)  Environmental
3300016422  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111  Environmental
3300016445  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108  Environmental
3300017970  Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG  Environmental
3300017972  Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP02_20_MG  Environmental
3300017975  Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP15_20_MG  Environmental
3300018062  Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP15_20_MG  Environmental
3300018063  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2  Environmental
3300018085  Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_20_MG  Environmental
3300018468  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111  Environmental
3300019487  White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG  Environmental
3300021171  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M  Environmental
3300021372  Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R01  Environmental
3300021476  Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)  Environmental
3300021517 (restricted)  Freshwater sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Balambano_FR2_MetaG  Environmental
3300022557  Paint Pots_combined assembly  Environmental
3300022563  OV2_combined assembly  Environmental
3300025916  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)  Environmental
3300026538  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)  Environmental
3300027109  Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF008 (SPAdes)  Environmental
3300027773  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen14_06102014_R1 (SPAdes)  Environmental
3300027826  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes)  Environmental
3300027884  Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)  Environmental
3300027902  Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies - CRP12 CR (SPAdes)  Environmental
3300028536  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)  Environmental
3300028792  Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S  Environmental
3300031545  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26  Environmental
3300031561  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26  Environmental
3300031576  Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25  Environmental
3300031716  Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3  Environmental
3300031720  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515  Environmental
3300031724  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20  Environmental
3300031744  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)  Environmental
3300031777  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f24  Environmental
3300031781  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20  Environmental
3300031782  Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f20  Environmental
3300031794Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f23EnvironmentalOpen in IMG/M
3300031796Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f24EnvironmentalOpen in IMG/M
3300031821Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031893Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f28EnvironmentalOpen in IMG/M
3300031894Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f18EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300032041Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f22EnvironmentalOpen in IMG/M
3300032043Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f24EnvironmentalOpen in IMG/M
3300032044Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f20EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032065Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f20EnvironmentalOpen in IMG/M
3300032068Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f21EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any restricted sequence listed below requires a license from the corresponding author(s) of its dataset.

Protein ID Sample Taxon ID Habitat Sequence
Ga0062593_10042321323300004114SoilMEGRSFLRRKDREALRERWLLRAGKAFERMFAEANQEQLVTFDEREDMACLLGKELAAFLLEEHVGADSQVRPSATRPPSCPKCQEPAEPVTKRDAKLPERELRTDAGDIKVGRAKWRCKKCRILFFSVRPQTEVGHGGIQPALGAEGGAPGEQVGVVQRSQ*
Ga0062386_10016316223300004152Bog Forest SoilMEGRRFLHRKDRESLRERWLLRAGRAFERMFAESNQDQLITFTEREDMACLLGQELAAFLLEEHVAVEGAVRLPENRPPCCPQCQQPGQRVTAPDEALPERELTTRAGHVKLRREQWSCAKCRRLFFSAGPQAAVGDRGVQSAGAGEGRAASGQGGVVQGSE*
Ga0062589_10077222723300004156SoilMEGRAFLRRKDREALRERWLLRAGKAFERMFGEANQGQLVTFTEREDMACLLGKEMAAFLLEEHAAAEGQVRASEKRPPCCPKCQQPARRITKPKEQLPERELTTRAGEIKFKREQWRCPKCRVVFFSAGPQAEVGDGALQPAGAGESGAAIKQGGIVQGRQ*
Ga0062592_10109016313300004480SoilMFAEANQEQLVTFDEREDMACLLGKELAAFLLEEHVGADSQVRPSATRPPGCPKCQEPAEPVTKRDAKLPERELRTDAGDIKVGRAKWRCKKCRILFFSVRPQTEVGHGGIQPALGAEGGAPGEQVGVVQRSQ*
Ga0066679_1056845513300005176SoilKANAIGQHHDTILHPTAGNHDRTSPCSESPLAVYREVNQIERIKGEAGVFPFLRRVFPGSMIPGQLRTILRVFGGAAMEGRSFLRRKDREALRERWLLRAGQAFERMFKEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPRCPKCQQPSERVTKPKEKLPERELTTRAGEIKLQREQWRCQKCRVVFFSVRPQAEVGDREIQSAGTGEGGTAGSQSGVVQGSQ*
Ga0066690_1091144013300005177SoilCPWINPFKLYRVKVALSSDKWRQGVFPFLRRVFPGSMIPGQLRTILRVFGGAAMEGRSFLRRKDREALREWWLLRAGQAFERMFREAYQDQRVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPRCPKCQQPSERVTKPKEKLPERELTTRAGEIKLQREQWRCQKCRVVFFSVRPQA
Ga0066671_1068547413300005184SoilFADTNQDQLVTFTQREDLACLLGQELAAFLLQEHAGADSQVRPPKKELPCCPKCKEPAARITKANDKLAERELTTRAGKIELRREQWRCKKCRVVFFSARPQTEIGHGEIQSARTGESGTPGDPGGVVQGG*
Ga0066388_10027073233300005332Tropical Forest SoilMESHLILRRKDREALREKWLQRAGQAFERMFAKANQDQLVTFTQREDMACALGQELATFLLEEHVAADIQVQPTEKRPPCCPRCQQPAQVVNKPKGRKEDLPERVLTSRAGQVRMRRQRWRCQKCRLVFFSVRPKAETGDGGLQSTGIAENGAAGSQGAFLQGSQ*
Ga0070714_10050345823300005435Agricultural SoilQLVTFTQREDMACLLGKELAAFLLEEHTAADHQARPSDKQPPACPKCHQPGVRVSGADGLPQRELTTRAGEIEFQREQWRCPKCRIVFFSAGPQAEVGHRAL*
Ga0070711_10067212613300005439Corn, Switchgrass And Miscanthus RhizosphereMEGRPFLRGKDREALRERWLLRAGQAFERMFGEANQDRLVTFTEREDMACLLGKELTAFLLEEHAGSDSQVRPSEKRPPSCPKCQQPGVRVTKANGPLPERELTTRAGEIKLQREQWRCPKCRVLFFSVRPQAEVGDGEIQSAGFGKGGTTGEQGGVVQGSQ*
Ga0070741_1013774723300005529Surface SoilMEGRSLLRRKDREALRERWLARAGQAFERMFGKAHQDQLVTFTEREDMACLLGQELAAFLLEEHAAADREVCPSEKQPPCCPKCQQPGERRTKAKEKLPERELTTRAGEIKLRRQQWRCQKCRIVFFSAGPQAEIGDGEIQPPALGEGGASGE*
Ga0070739_1009317513300005532Surface SoilMEGRPFLRRKEREALRERWLLCAGQAFERMFAEANQEQLVTFTEREDLACLLGKELATFLLEQHTSADTQVRPSDKQPPACPKCQQPAVRAGEAGEDLVPRELTTRAGEIQLQREQWRCQKCRIVFFSAGPQAEAGHRAL*
Ga0070734_1080072123300005533Surface SoilMEGRPFLRCKDREALRERWLLRAGQAFERMFGEGSQEQLVTFTEREDLACLLGKELAAFLLEQHAAADSQVRPSAKQPPCCPKCQQPGAPVLKADEELPERELTTRAGEIKLQRQQWRCPKCRIVFFSVRPQAEV
Ga0070704_10089625723300005549Corn, Switchgrass And Miscanthus RhizosphereDNQDQLVTFTQREDMACALSKELAAFLLEEHVAVDAQVRPPEKQPPACPRCQQPGQRVTKPKEALPERTLTTSAGVVQVRREQWRCKKCRILFFSVRPQAGVGDGGLQPARVAEGGSASGQGAVVQGSE*
Ga0066701_1045663913300005552SoilMEGRSFLRRKDREALRERWLLRAGTAFERMFGEANQDQLVTFTQREDMACALAKELAAFLVEEHVAVDAQVRPPEKEPPNCPKCQKPGQRVTKRKENLPERALTTGAGEVTLRREQWRCAKCRVLFFSVRPEACLGDGGVQSAGIGEGGTAGRQGAVVQRSE*
Ga0066692_1062166213300005555SoilMEGRSNFPRKDREALRERWLLRAGQAFERMFGKDNQDQLVTFTQREDMACALSRELATFLLEEHVAVDAQVRPSEKQPPCCPRCQQPGQRVTGRKEPLPERVLTTGAGEVNLGREQWRCKKCRIL
Ga0066704_1038743323300005557SoilMEGRSFLRRKDREDLREQFLQHAGKAFERMFGEANQDGLVTFTEREDMACLLGEELAAFLLEKHAAADRQVRPSDKQPPCCRKCQQPGQRVAKSQLQERELRTRAGEVRLRREQWRCKKCRILFFSVRPQAEIGDGALQSTDSGKDRPAREPGVIPGGE*
Ga0066700_1014906323300005559SoilLREQFLQHAGKAFERMFGEANQDGLVTFTEREDMACLLGEELAAFLLEKHAAADRQVRPSEKQPPCCPKCQQPGERVAKSQLPERELTTRAGEVRLRREQWRCKKCRILFFSVRPQAEIGDGALQSTDSGKDRPAREPGVIPGGE*
Ga0070762_1129672413300005602SoilMEGRSFLRRKDREALRERWLSRAGTAFERMFGEASQDQLITFTQREGMACALAKELAAFLVEEHVAADALVRPSEKNPPCCPRCRKPGYKVTKPKEQLPDRMLKTEAGEVTLRRERWRCSKCRVLFFSVRSEVGLGD
Ga0066905_10006917923300005713Tropical Forest SoilMEGRPFLHRKEREALRERWLQRAGRAFERMFGEANQDQLVTFTEREDMACLLGKELAAFLLEEHAAADSQVRPSEKRAPGCPKCQQPAVRVTPADEAFLERELTTRAGEVKLRRQQWRCKKCRIVFFSVRPQIEVGDGTL*
Ga0066905_10120516613300005713Tropical Forest SoilRGAFPFLRGVFPGTMIPGQLITLFQDFGGTAMEGRSFLRRKDREALRERWLLRASKAFERMFAEANQEQLVTFTEREDMACLLGKELAAFLLEEHAAADGQVRPSERRPPCCPKCQKPAERATKPKEKLPARPLTTRAGQITLQREQWRCLKCRILFFSVRPQAEVGDGGIQSACGGAGGTAVEPSGLVQGRQ*
Ga0066903_10039806533300005764Tropical Forest SoilMEGQSFLRRKDREALRERWLLRAGQAFERMFAEANQDQLVTFTQREDLACRLGGELAAFLLEEHAAADRQVRPSEKQPPGCPKCHAPAQRVTNRQEKLPERDLTTDAGEITLRREQWTCKKCRIVFFSVRPQAASGDGALQSPGPAKGDPPSGQGRVVPRGQ*
Ga0066903_10058243923300005764Tropical Forest SoilMEGRSFLRRKDREALRERWLLRAGQAFERMFAEANQGQLVTFTEREDMACLLGKELAAFLLEEHAAADGQVRPSEKQPPCCPKCQKPAERVTKRSEKLPERELTTRAGEINLRRAQWRCQKCRVVFFSVRPQAAIGHGEIQSAGSGNGRTASEQSGIVPGSQ*
Ga0066903_10221027513300005764Tropical Forest SoilMEGRRFLPRKHREALRERWLLRAGQAFERMFAQANQDQLVTFTEREDLACLLGQQLAAFLLEEHVAAEGAVRASEKRPPGCPHCQQPGQRATPPEEELPERELTTRAGKVKLRREKWTCTTCRRVFFSVRPPTPIGDGGIQSAAAAEGRAAGEQGSVVQGGQ*
Ga0066903_10329875813300005764Tropical Forest SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ*
Ga0066903_10447129613300005764Tropical Forest SoilFGEANQDQLVSLTQREDLACLLGEELAGFLLEEHAAADSQVRPSEKQPACCPKCQQAGMPVTSDDGKLPERELTTRAGEIKLKRQQWHCKKCRILFFSVRPQAEAGD*
Ga0070717_1103791723300006028Corn, Switchgrass And Miscanthus RhizosphereMEGRPFLRGKDREALRERWLLRAGSAFERMFAEANQDQLITFTQRENMACALAKELAAFLVEEHVTADAQVRPSEKEPPCCPKCQKPGQRVTKEKAKLPERSLKTGAGEVTLRREQWRCPKCRVLFFSVRPKAEVGNGGVQPADSGEGGPASRQGTVVQGRE*
Ga0075029_10011037433300006052WatershedsFTQREDVACLLSQELAAFLLEEHAAADSQVQPSEKQPPCCPKCQQVAVPVSAEAGKLSERELTTRAGEIKLKRQQWRCKKCRIVFFSIRPQAEIGDGALQSPHPGEGRAAGQQGSLLQGS
Ga0075014_10003600913300006174WatershedsMDGRPSLHRKDREALRERWLQRAGRAFERMFGEANQDQLVTFTEREDMACLLGKELAAFLLEEHAAVDNQVRPSEKRPPGCPKCHQPAIRVTQADEALPERELTTRAGEIKLRRQQWRCKKCRIIFFSVRPQVEVGDGAL*
Ga0066665_1012412323300006796SoilMEGRSFLRRKDREALRERWLLRAGTAFERMFGEANQDQLVTFTQREDMACALAKELAAFLVEEHVAVDAQVRPPEKEPPNCPKCQKPGQRVTKRKEDLPERALTTGAGEVTLRREQWRCAKCRVLFFSVRPEACLGDGGVQSAGIGEGGTAGRQGAVVQRSE*
Ga0066659_1030755923300006797SoilMEGRSNFPRKDREALRERWLLRAGQAFERMFGKDNQDQLVTFTQREDMACALSRELATFLLEEHVAVDAQVRPSEKQPPCCPRCQQPGQRVTGRKEPLPERVLTTGAGEVNLGREQWRCKKCRILFFSVRPQAGVGDGGIQSVACAESGSAGQQGGVVQGRQ*
Ga0066660_1108703023300006800SoilMEGRSFLRRKDREALRERWLLRAGKAFERMFGEANQDQLVTFTEREDLACLLGKELSAFLLEEHAAADSQVRPSQTRPPCCPKCQEPAERITKPKEKLAERELTARAGEIKLRREQWRCKKCR
Ga0075425_10048411113300006854Populus RhizosphereFSGTFGGAAMEGRSFLRRKDREALRERWLLRAGKAFERMFAEANQEQLVTFDEREDMACLLGKELAAFLLEEHVGADSQVRPSATRPPSCPKCQEPAEPVTKRDAKLPERELRTDAGDIKVGRAKWRCKKCRILFFSVRPQTEVGHGGIQPALGAEGGAPGEQVGVVQRSQ*
Ga0075434_10139666813300006871Populus RhizosphereEDWLKDEKSHRGNGGQEREKTLGQSNSTRRPLRRQPSSWHGCIPVPAPGFLITIIPSQLSTFSGTFGGAAMEGRSFLRRKDREALRERWLLRAGKAFERMFAEANQEQLVTFDEREDMACLLGKELAAFLLEEHVGADSQVRPSATRPPSCPKCQEPAEPVTKRDAKLPERELRTDAGDIKVGRAKWRCKKCRILFFSVRPQTEVGHGGIQPALGAEGGAPGEQVGVVQRSQ*
Ga0073928_1008321933300006893Iron-Sulfur Acid SpringMSFGGAAMEGRRFLSRKDREALRERWLLRAGQAFERMFAEANQDQLVTFTEREDMACELSQELAAFLLEEHAAADSQVRPSAKHPPTCPKCHAPGQRVSKRNDKLTERELTTRAGEVKLQREQSRCLKCRILFFSVRPQAEVGDGKIQPTVTGEGGTSGHQVRLV*
Ga0073928_1017537913300006893Iron-Sulfur Acid SpringGAFPFLRGVFSATIVPRQLITFLHGFEGAAMEGRSFLRRKDREALRERWLLRAGQAFERMFAEANQDQLVTLTEREDMACLLGKELATFLLDEHAAADSQVRPSEKHPPHCPKCQQPGIRVTPAKEALPERELTTRAGEITLQREQWRCKKCRVVFFSVRSKAEIGDGAVQSAGAGESRTPGEQGRLVQGSQ*
Ga0073928_1021651413300006893Iron-Sulfur Acid SpringRWLLRAGSAFERMFAEANQDQLITFTQREDMACALAKELAAFLVEEHVTTDAQVRPSEKEPPCCPKCQKPGQRVTKEKAKLPERALRTGAGEITLRREQWRCPKCRVLFFSVRPKAEVGNGGVQPADLGEGGPASRQGAVVQGRE*
Ga0075424_10145558823300006904Populus RhizosphereIPVPAPGFLITIIPSQLSTFSGTFGGAAMEGRSFLRRKDREALRERWLLRAGKAFERMFAEANQEQLVTFDEREDMACLLGKELAAFLLEEHVGADSQVRPSATRPPSCPKCQEPAEPVTKRDAKLPERELRTDAGDIKVGRAKWRCKKCRILFFSVRPQTEVGHGGIQPALGAEGGAPGEQVGVVQRSQ*
Ga0066710_10045445113300009012Grasslands SoilMEGRSFLRRKDREALRERWLLRAGTAFERMFGEANQDQLVTFTQREDMACALAKELAAFLVEEHVAVDAQVRPPEKEPPNCPECQKPGQRVTKRKENLPERALTTGAGEVTLRREQWRCAKCRVLFFSVRPEACLGDGGVQSAGIGEGGTAGRQGAVVQRSE
Ga0066710_10135675323300009012Grasslands SoilMEGRSFLRRKDREGLRERWLLRAGKAFERMFAESNQEQLVTLSEREDMACLLGKELTAFLLEEHVAADGQVRPSEQSPPCCPKCAKAAERTMQRKAKLPERELTTRAGEIKLWREQWRCKTCRIVFFSVRPKVEAGDGAIQSTHPREGSASGEQGGVVQGRQ
Ga0066710_10310252813300009012Grasslands SoilMEGRSNFPRKDREALRERWLLRAGQAFERMFGKDNQDQLVTFTQREDMACALSRELATFLLEEHVAVDAQVRPSEKQPPCCPRCQQPGQRVTGRKEPLPERVLTTGAGEVNLGREQWRCKKCRILFF
Ga0099830_1056204113300009088Vadose Zone SoilMEGRSFLRRKDREALRERWLQRAGKAFERMFAEANQDQLVTFTEREDMACALGEELAVFLLEEHVAADGQVQPSDRQAPCCPKCQQPGQRVSKRNEQLPERELTTRAGEVRLGREQWKCKKCRIVFFSIRP
Ga0066709_10291956813300009137Grasslands SoilMEGRSILRRKDREALRERWLLRAGKAFERMFGEAHQEQLVTFTEREDMACLLGEELAAFLLEEHAAADGQVRPSEKRAPCCPKCQVPATRVTQRDEELPERPLTTRAGDIQLRREQGRCRKWRIVFFSAGPEA
Ga0114129_1352229013300009147Populus RhizosphereGGAAMEGRSFLRRKDREALRERWLLRAGQAFERMFGEANQDQLVTFTEREDLACLLGEELAAFLLEEHAAADGQVRPSEKRPPCCPKCQQPGKRVTKPSEKLPERELTTRAGEIKVQREQWRCQKCRIVFFSVRPQAAVGDGEIQSAAFGEGGGAGEQGGLLQGGQ
Ga0114945_1005162723300009444Thermal SpringsMEGRSILRRQDREALRERWLLRAGQAFERMFGKANQDQLVTFTEREDMACLLGKELAAFLLEEHAAADRQVRPSAKQPPCCPKCQQPGERRTKAKEKLPERELTTRAGEIKLRREQWQCKKCRIVFFFRSTTS*
Ga0114940_1034023123300009448GroundwaterMEGRAFLRRQDREALRERWLLRAGKAFERMFAEANQDQLVTFTEREDMACLLSKELSAFLLEEHAGADRQVRPSDKQALACPRCDKPGVRVTKRNEKLPERELTTRAGEVQLRREKWHCAKCRLFFSH*
Ga0114940_1043435013300009448GroundwaterMEGRPILLRKDREALRERWLLRAGQAFERMFGQANQDQWVTFTEREDMACLLGDELSAFLLEQHAATDNQVRPSEQRPPCCPKCQQPGKRVSRRHDEALTERELTTRAGEIKLRR
Ga0105081_102483023300009806Groundwater SandMEGRSILRRQDREALRARWEQRAGKAFERMFGAANQDQLVTFTEREDMACALAKELAAFLLEEHAAADGQVRPCAKRPPCCPKCQQPAERVSKRNEQLPERALTTRAGEVKLRREQWRCRKCRIVFFSVRPTAEVGDGGLQ
Ga0105085_106356113300009820Groundwater SandMEGRSILRRQDREALRARWEQRAGKAFERMFGAANQDQLVTFTEREDMACALAKELAAFLLEEHAAADGQVRPCAKRPPCCPKCQQPAERVSKRNEQLPERALTTRAGEVKLRREQWRCRKCRIVFFSARPTAEVGDGGLQS
Ga0126373_1247452613300010048Tropical Forest SoilMEGRPFLRRKDREALRERWLLRAGKAFEQMFGEANQDQLITFTEREDMACLLGKELAAFLLEEHAAADSQVRPSEKQPPCCPKCQWPSVPVTKGGEKLPERELTTRAGEIQLKRQQWRCKKCRIVFFSVRPQAEVGDGALQSADSGEERASGQQSGLVPRS*
Ga0074044_1033037113300010343Bog Forest SoilMEGRRFLPRKDREALRERWLLRAGQAFERMFAEAGQDQLVSFTEREDLACLLGKELAAFLLQEHVDVQGAVRLPENRPPCCPQCQQPGQRVTPEDEALPERELTTLAGQIKLRREQWSCAKCRRVFFPAGPQAALGNRGVQSAGVGEGRAAGEQGGVVQGSE*
Ga0136449_10272444413300010379Peatlands SoilMEGRPFLRRKDREALRERWLLRAGQAFERMFGEAGQDQLVTFTEREDMACVLGKELAAFLLEQHAAADSQVRPSEKQPPCCPRCQQPGVPVTQSDEPLAERELTTRAGEIKLQRQQWRCQKCRIVFFSVRPQAEVGDGALQS
Ga0134126_1024009223300010396Terrestrial SoilMEGRSFLRRKDREALRERWLLRSAQAFERMFAEAHQDQLVTFTEREDLACLLGKELATFLLEEHAAADRQVRPSGKQAAACPKCQQPGERVTQPDAKLPERRLTTRAGKVHLRREQWRCPACRILFFSAGPQVEVGDRGVQPTGVGAGGAAGEQGVVV*
Ga0137392_1073338223300011269Vadose Zone SoilMEGRSFLRRKDREALRERWLQRAGKAFERMFAEANQDQLVTFTEREDMACALGEELAVFLLEEHVAADGQVQPSDRQAPCCPKCQQPGQRVTKRNKQLPERELTTRAGEMRLRREQWKCKKCRIIFFSVRPQAEVGDGAIQSAGTGEGGTPGSQGSVVQGSQ*
Ga0153974_112597723300012180Attine Ant Fungus GardensKAFERMFAEANQDQLVTFTEREDMAVLLGKELAAFLLEEHVTVAGQVRASDNQPPCCPKCQKSGERVTKSTEKLPERELTTRAGEVKLRREQWRCPKCRILFFSVGPQAEVGDGEIQSSGSGEGGASSRPSGVVQGSE*
Ga0153922_101232713300012181Attine Ant Fungus GardensMARAGQAFERMFAEANQDQLVTFTEREDMACLLGQDLAAFLLEEHAAADRQVRPSDKQPPCCPKCRQPGVRVTEANEELPERELTTRAGEIKLQREQWRCQKCRIVFFSAGPQAEIGDRTL*
Ga0137364_1038021713300012198Vadose Zone SoilMGAFPFLRTVFAGNMFPGQLITVFGAPGGSAMEGRSNFRRKDRETLRERWLLRAGQAFERMFGKDNQDQLVTFTQREDMACALGKELAIFLLEEHVTVDAQVRPSEKQPPCCPRCQQPGQRVTGRQEPLPERALTTNAGEVNLRREQWRCKKCRILFFSVRPQVGVGDGGIQSTGVGEGRSAGG*
Ga0137399_1083619013300012203Vadose Zone SoilLGAFPFLRPGFSVSIIPGQLLTSLRVFGEAAMEGRSFLRRKDREALRERWLQRAAKAFERMFAEANQDQLVTFTEREDMACALGDGLAAFLLEEHVSADGRVHPSDKQAPCCPKCQQPGQRVSKRNEQLPERELTTRAGDIRLGREQWKCKKCRIVFFSVRPQAAIGDGEIQSAGVGEGGTAGSQSGVVQGSQ*
Ga0137362_1073267913300012205Vadose Zone SoilIPCQLVTFLHAFGGAAMEGRSFLRRKDREALRERWLLRAGRAFERMFAEANQDQLVTFTDREDMACVLGKELAAFLLEEHAAADNQVRPSPKQPPCCPKCKQPATRLAKSDEKLPERELTTRAGEIKLRRQQWQCKKCRILFFSVRPQAEVGNGALQSADFGEGGAAGEQGGVVQGSQ*
Ga0137380_1062334123300012206Vadose Zone SoilMFAEANQDQLITFTQREDMACALAKELAAFLVEEHVTADAQVRPSEKEPPCCPKCQKPSQRVTKGKEKLPERALRTGAGEVTLRREQWRCPKCRVLFFSVRPEAEVGNGGVQPAGLGEGGPAGS*
Ga0137387_1102661113300012349Vadose Zone SoilMEGRSFLRRKDREVLRERWLQRAGKAFERMFGEAHQDQLVTFTEREDLACLLGEELAAFLLEEHAAADRQVQPSEKQPPCCPKCQQPGERVAKPQLSERELTTRAGEIKLQREQWRCQKCRILFFSVRPQAEVGDGALQSADSGEGRPTGEQGGVGQGGQ
Ga0137366_1012708913300012354Vadose Zone SoilKARSPACGGAFPFLRRVFPGIIIPGQLIVVSGNFGGAAMEGRSILRRKDREALRERWLLRAGKAFERMFGEAHQDQLVTFTEREDMACLLGEELAAFLLEEHAAADRQVRPSEKRAPCCPKCQVPATRVTQPNEELPERPLTTRAGDIQLRREQWRCRKCRIVFFSAGPEAAIGDGGIQSTDRAEGGAAGEQGAVVQGRQ*
Ga0137361_1017819033300012362Vadose Zone SoilSVRLITSVKPVEVTSKIRPLAFPSIVMLLPPSIVKLFVGAFPFLRPGFSVSIIPGQLLTSLGVFGGAAMEGRSFLRRKDREALRERWLQRAGKAFERMFAEANQDQLVTFTEREDVACALSEELAAFLLEEHVAADGQVHPSDKHAPCCPRCQEAGQRVTKRNDKLPERELTTRAGEVRLGREQWKCKKCRIVFFSAGSQAEVGDGEIQSAGAGEGGTPGGQGSVVQGCQ*
Ga0137394_1118277613300012922Vadose Zone SoilMISRQLIMFQRRFGGAAMEGRSFLRRKDREALRERWLLRAGQAFERMFAEANQEQLVTFTEREDMACLLGEELAAFLLEEHAAADSQVRPSEKRPPGCPKCQQPGQRVTKPNAKLSERDLTTRAGEIKLQREQWRCSKCRIVFFSARPQAEVGDGAIQSADSGEGG
Ga0137359_1016199223300012923Vadose Zone SoilMEGRSFLRRKDREALRERWLLRAGRDFERMFAEANQDQLVTFTDREDMACVLGKELAAFLLEEHAAADNQVRPSPKQPPCCPKCKQLATRLAKSDEKLPERELTTRAGEIKLRRQQWQCKKCRILFFSVRPQAEVGNGALQSADFGEGGAAGEQGGVVQGSQ*
Ga0137359_1115728713300012923Vadose Zone SoilMIPSQLFIFQRAFGGAAMEGRSFLRRKDREALRERWILRAGKAFERMFAEANQDQLVTFTEREDMACALGEELAAFLLEEHAAADGQVRLSEKRPPCCPKCQKPAERVTKQNETLAERELTTRAGEIKLRREQWRCQKCRVVFFSVGPQAEVGDGEIQSAGFGERGSAGVQGGIVQGGE*
Ga0137359_1155451613300012923Vadose Zone SoilMEGRSFLRRKDREELREQFLQHAGKAFERMFGEANQDGLVTFTEREDMACLLGEELAAFLLEKHAAADRQVRPSEKQPPCCPKCQQPGERVAKSQLPERELTTRAGEVRLRREQWRCKKCRILFFSVRPQ
Ga0137416_1185700513300012927Vadose Zone SoilGLPEDRTPALAGAFPFLRPGFSVSIIAGQLLTSLRVFGGAAMEGRSFLRRKDREALRERWLQRAAKAFERMFAEANQDQLVTFTEREDMACALGDGLAAFLLEEHVSADGRVHPSDKQAPCCPKCQQPGQRVSKRNEQLPERELTTRAGDIRLGREQWKCKKCRIVFFSVRPQAAIGDGEIQS
Ga0119956_101814213300014053Contaminated WaterMEGRSFLRRKDREALRERWLLRAGKAFERMFGEANQDQLVTFTEREDMAVLLSKELAAFLLEEHAAADRQVRPSEKHAPNCPKCSKPGVRVTKASEKLPERELTTRAGEVKLRREQWRCAKCRVNFFSVRPQAQAGDRRLQSADSGESRAAGEQGGIVQRGE*
Ga0167654_101030423300015084Glacier Forefield SoilRWLLRAGKAFERMYAEANQEQLVTFTEREDMACLLGKELAAFLLEEHAAADEQGKPSEQRPPCCPKCQQPGERVTQRKEKLPERELTTRAGEIKLRREQWRCKKCRVLFFSARPQAEAGDGKIQSAHPAEGGSAGEQGGVIQGCE*
Ga0137418_1031744233300015241Vadose Zone SoilMEGRSFLRRKDREELREQFLQHAGKAFERMFGEANQDGLVTFTEREDMACLLGEELAAFLLEKHAAADRQVRPSEKQPPCCPKCQQPGQRVAKSQLPERELTTRAGEVRLRREQWRCKKCRILFFSVRPQAEIGDGALQSTDSGKDRPAREPGVIPGGE*
Ga0182039_1147514713300016422SoilEGRPLLRRKDREALRERWLLRAGQAFERMFGEANQDQLVTLTEREDLACLLGHELAGFLLEEHAAADSQVRPSEKQPPCCPKCQQAGMPVTSGDEKLPERELTTRAGEIKLKRQQWRCKKCRIVFFSVRPQAEAGDRGLQSADRGEGRTAGRQGDLVQGSQ
Ga0182038_1089152913300016445SoilLACLLGHELAGFLLEEHAAADSQVRPSEKQPPCCPKCQQAGMPVTSGDEKLPERELTTRAGEIKLKRQQWRCKKCRIVFFSVRPQAEAGDRGLQSADRGEGRTAGRQGDLVQGSQ
Ga0187783_1009121123300017970Tropical PeatlandMEGRSFLRRKDHEALRERWLQRAGQAFERMFAEANQEQLVTFTEREDMACQLGEELAIFLLEEHAAADRQVRPSDKQPPGCPKCHQPGVRVTAANEELPQRELTTRAGEVKLQREQWRCQKCRIVFFSAGPQAEVGDRAL
Ga0187783_1024179523300017970Tropical PeatlandMEGRPFFRRKDREALRERWLLRAGQAFERMFAEANQEQLVTFTQREDVACLLGKELAAFLLEEHAAADHQVRPSEKLPPACPKCNQPGERVTKANQNLPERELTTRAGEIRLKREQWRCKKCRIVFFSVRPQAETGNGALQSADLGEGSSAGQQGGLVQGSQ
Ga0187781_1041336313300017972Tropical PeatlandMEGRPFFRRKDREALRERWLLRAGQAFERMFAEASQDQLVTFTQREDLACLLGKELAVFLLEEHAAADHQVRPSEKHPPGCPKCNQPGERVTKANQRLPERELTTRAGEIKLKREQWRCKKCRIVFFSVRPQAETGNGALQSADLGEGSSAGQQGGLVQGSQ
Ga0187782_1060595013300017975Tropical PeatlandMEGRPFFRRKDREALRERWLLRAGQAFERMFAEASQDQLVTFTQREDLACLLGKELAVFLLEEHAAADHQVRPSEKHPPGCPKCNQPGERVTKANQRLPERELTTRAGEIKLKREQWRCKKCRIVFFSVRPQAETGNGALQSADLGEGSSAGRQGGLVQGGQ
Ga0187784_1046633813300018062Tropical PeatlandMEGRPFFRRKDREALRERWLLRAGQAFERMFAEASQDQLVTFTQREDLACLLGKELAVFLLEEHAAADHQVRPSDKHPPGCPKCNQPGERVTKANQRLPERELTTRAGEIKLKREQWRCKKCRIVFFSVRPQAETGNGALQSADLGEGSSAGQQGGLVQGSQ
Ga0184637_1008108823300018063Groundwater SedimentMDGGLNLRGKEREALRERWLKRSAEAFERMFGKEHQDRLVTLTQREDMACALGEELRAFLLKEHVAADPQTRPSEKRAPCCPKCQRPAERVTDRKEELPERELSTRAGDITLRREQWQCKKCRVLFFSARPQAGVGDGGV
Ga0187772_1011065533300018085Tropical PeatlandMEGRPFLHRKEREALRERWLLHAGKAFEQMFAEANQDQLVTFTEREDMACLLGKELAAFLLEEHAAADGAVRPSEKRPPACPKCENPGERVAKPNATLPDRELTTRVGEIKIQREQWRCKKCRILFFSVRPQTQVGDGGIQSTAPGESGSTGDPGVVVRGGQ
Ga0187772_1018582223300018085Tropical PeatlandMEGRRFLRVKDREALRERWLLRAGKAFERMFAEANQDQLVTFTEREDMACLLGEELAAFLLEEHAAADSQVRPSEKQPPSCPKCQGSGERVAKPKQKLTERELTERELTTRAGKIKLQREQWHCKKCRILFFSVGPQAEAGDGEIQSTSTGEGGTPGGQGSLIQGGQ
Ga0066662_1076016713300018468Grasslands SoilMEGGIFLHRKDREALRERWLLRAAKAFERMFAETNQDHLVTFTQREDLACLLGKELAAFLLEEHAAADNQVQPSEKGPPCCPKCKQPGERVTKSHGNLWERELTTRAGELKLRREQWRCKQCRILFFSVRSQTEVGDGTLQSADSGEGSTPGE
Ga0187893_1057612013300019487Microbial Mat On RocksGAFPFLRQVFAGTMIPGQLLTFPKGFGGTAMEGRSFLRRKDREALRERWLLRAGKAFERMFAEANQEDLVTFTEREDMACLLGKELAAFLVEEHAAADAQVRPPEQRPPCCPKCQRPAQRVTKRNEKLPERELTTRAGEVKLRREQWRCQKCRIFFFSVRPQAGVGDGEIQSKAAAEGGAAGEQGGLVQGRQ
Ga0187893_1068904723300019487Microbial Mat On RocksEGRSILRRKDREALRERWLLRAGKAFERMFGAANQDQLVTFTEREDMAVLLGQELAAFLLEEHASADRQARPSEKDTLGCPKCSKPGVRVTKRAEKLPERELTTRAGKVTLRREKWWCAKCRVHFFSVRQTAQTGDGGLQSADRGEGRAAGEQGSLVSGRE
Ga0210405_1012860723300021171SoilMEGRPFLRRKDREALRERWLLRAGSAFERMFAEANQDQLITFTQREDMACALAKELAAFLVEEHVTTDAQVRPSEKEPPCCPKCQKPGQRVTKEKAKLPERALRTGAGEITLRREQWRCPKCRVLFFSVRPKAEVGNGGVQPADLGEGGPASRQGAVVQGRE
Ga0213877_1003869923300021372Bulk SoilMEGRPFLRRKDREALREQWLLRAGQAFERMFAEANQDQLVTFTEREDLACLLGKELGAFLLEQHAAADSQVRPSEKQPPCCPKCQQPGLPVTQGNEKLPERELTTRAGEIKLQRQQWRCTKCRIVFFSVRPQAEVGDRAVQPAASREGRAAGQQGSLLQGSP
Ga0187846_1003966013300021476BiofilmMEGRPFLRRKAREALRERWLLRAGQAFERMFGEKRQDQLVTFTEREDMACLLGEELAAFLLEEHAAADSQVRPSEKHPPCCPKCKQLGERVTKSNDELPERELTTRAGEVKVRREQWRCRKCRIGFFFRWTSGCN
(restricted) Ga0224723_103784823300021517Freshwater SedimentMDGVRFLRGKDREALRERWLQRAGRAFEQMFAEAHQDQLVTFTEREDMACLLGKELAAFLLEQHAAADNQVRPSDKKPPCCPKCQQPGKRIAKQTDEPLTERELTTQAGEIKLRREQWRCSKCRILFFSVRPQAEVGNGALQSADSGEGGAAGEQGGFVQRRQ
Ga0212123_1006372923300022557Iron-Sulfur Acid SpringMEGRRFLSRKDREALRERWLLRAGQAFERMFAEANQDQLVTFTEREDMACELSQELAAFLLEEHAAADSQVRPSAKHPPTCPKCHAPGQRVSKRNDKLTERELTTRAGEVKLQREQSRCLKCRILFFSVRPQAEVGDGKIQPTVTGEGGTSGHQVRLV
Ga0212123_1029082313300022557Iron-Sulfur Acid SpringPFLRGVFSATIVPRQLITFLHGFEGAAMEGRSFLRRKDREALRERWLLRAGQAFERMFAEANQDQLVTLTEREDMACLLGKELATFLLDEHAAADSQVRPSEKHPPHCPKCQQPGIRVTPAKEALPERELTTRAGEITLQREQWRCKKCRVVFFSVRSKAEIGDGAVQSAGAGESRTPGEQGRLVQGSQ
Ga0212128_1010184923300022563Thermal SpringsMEGRSILRRQDREALRERWLLRAGQAFERMFGKANQDQLVTFTEREDMACLLGKELAAFLLEEHAAADRQVRPSAKQPPCCPKCQQPGERRTKAKEKLPERELTTRAGEIKLRREQWQCKKCRIVFFFRSTTS
Ga0207663_1083842813300025916Corn, Switchgrass And Miscanthus RhizosphereMEGRPFLRGKDREALRERWLLRAGQAFERMFGEANQDRLVTFTEREDMACLLGKELTAFLLEEHAGSDSQVRPSEKRPPSCPKCQQPGVRVTKANGPLPERELTTRAGEIKLQREQWRCPKCRVLFFSVRPQAEVGDGEIQSAGFGKGGTTGEQGGVVQGSQ
Ga0209056_1014519333300026538SoilMEGRSFLRRKDREALRERWLLRAGTAFERMFGEANQDQLVTFTQREDMACALAKELAAFLVEEHVAVDAQVRPPEKEPPNCPKCQKPGQRVTKRKENLPERALTTGAGEVTLRREQWRCAKCRVLFFSVRPEACLGDGGVQSAGIGEGGTAGRQGAVVQRSE
Ga0208603_106276513300027109Forest SoilMEGRSFLRRKDREALRERWLLRAGTAFERMFGEANQDQLVTFTQREDMACALTKELAAFLVEEHVAADAQVRPSEKEPPCCPKCRKPGQQMTKRKEKLPERALRTEAGEVTVRREQWRCASCRVLFFSVRPEARLGDGGVQSADIGEGGTAGSQ
Ga0209810_102689433300027773Surface SoilMEGRPFLRRKEREALRERWLLCAGQAFERMFAEANQEQLVTFTEREDLACLLGKELATFLLEQHTSADTQVRPSDKQPPACPKCQQPAVRAGEAGEDLVPRELTTRAGEIQLQREQWRCQKCRIVFFSAGPQAEAGHRAL
Ga0209060_1059114513300027826Surface SoilMEGRPFLRCKDREALRERWLLRAGQAFERMFGEGSQEQLVTFTEREDLACLLGKELAAFLLEQHAAADSQVRPSAKQPPCCPKCQQPGAPVLKADEELPERELTTRAGEIKLQRQQWRCPKCRIVFFSVRPQAE
Ga0209275_1083047113300027884SoilMEGRSFLRRKDREALRERWLSRAGTAFERMFGEASQDQLITFTQREGMACALAKELAAFLVEEHVAADALVRPSEKNPPCCPRCRKPGYKVTKPKEQLPDRMLKTEAGEVTLRRERWRCSKCRVLFFSVRSEVGLGDGGVQSAS
Ga0209048_1060175013300027902Freshwater Lake SedimentMEGRSFLSRKDREALRERWLLRAGKAFERMFAENNQHQLVTFTEREDVACLIAKELSAFLLEEHVAADRQVRPSDKQAPHCPRCEKPGTRVTKPDGKLPQRELTSRAGAVKLQREKWRCPKCRIIFFSARPQTEVGDRGLQSADSGEGGAAVEQGGVVQGGE
Ga0137415_1070009313300028536Vadose Zone SoilMEGRSFLRRKDREALRERWLQRAAKAFERMFAEANQDQLVTFTEREDMACALGDGLAAFLLEEHVSADGRVHPSDKQAPCCPKCQQPGQRVSKRNEQLPERELTTRAGDIRLGREQWKCKKCRIVFFSVRPQAASGDGEIQS
Ga0307504_1037437013300028792SoilMEGRSFLRRKDREALRERWIQRAGTAFERMFGEANQDQLVTFTEREDMACALAKELAAFLVEEHTAVDSQVRPSDKRPPCCPKCQQPGKRVTKRNEKLPDRALTTRAGAVTLRREQWECKKCRVVFFSVRPEIAVGDGGVQSAGIGKGSAAGKQGGV
Ga0318541_1006012523300031545SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0318528_1061199423300031561SoilTIISLQLIAFVDPFRRDAMEGRPFLRRKDREALRERWLQRAGQAFERMFAEANQDQLVTLTEREDRACLLGKELTTFLLEEHVAADHEVRPSEKRPPCCPKCKQPGERVAKANTALLERELTTRAGEIIVQREQWRCQKCRIVFFSVRPQAEIGDGALQSADSGEGRAPGQQGGLVQGSQ
Ga0247727_1018546423300031576BiofilmMISGQLFTFPQAFGGTAMEGRSFLRRKDREALRERWFQRAGKAFERMFAEGNQDQLVTFTEREDMACLLGEELAAFLVEEHAAADSQVRPAEQRPPCCPKCQQPAARVTKRKEKLPERDLTSCAGEIKLRREQWRCQKCRVLFFSVRPQAEVGNGEIQSAASAEGSAASEQGGIVQGCQ
Ga0310813_1050215723300031716SoilMEGRSFLRRKDREALRERWLLRAGKAFERMFAEANQEQLVTFDEREDMACLLGKELAAFLLEEHVGADSQVRPSATRPPSCPKCQEPAEPVTKRDAKLPERELRTDAGDIKVGRAKWRCKKCRILFFSVRPQTEVGHGGIQPALGAEGGAPGEQVGVVQRSQ
Ga0307469_1050129613300031720Hardwood Forest SoilTAMEGRAFLRRQDREGLRERWLLRASKAFERMFGEANQEQLVTFTEREDMACLLGKELAAFLLEEHAAADGQVRPSEKRAPCCPKCQKPAMRVTQRHEELPERPLRTRAGDIKLRRQQWRCRKCRIIFFSARPEAAVGDGGIQSVAGAESGSAGQQGGVVQGRQ
Ga0318500_1002849223300031724SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAADDSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0306918_1024938523300031744SoilRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0318543_1007078913300031777SoilGAAMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0318547_1018280223300031781SoilVLAAARGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0318552_1029741923300031782SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKC
Ga0318503_1029248613300031794SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFFR
Ga0318576_1006925323300031796SoilFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0318567_1006774913300031821SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0306925_1129925723300031890SoilMEGRPFLRRKDREALRERWLQRAGQAFERMFAEANQDQLVTLTEREDRACLLGKELTTFLLEEHVAADHEVRPSEKRPPCCPKCKQPGERVAKANTALLERELTTRAGEIIVQREQWRCQKCRIVFFSVRPQAEIGDGALQSADSGEGRAPGQQGGLVQGSQ
Ga0318536_1029928213300031893SoilMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0318522_1028704313300031894SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQ
Ga0306923_1060365723300031910SoilMEGRRFLPRKDREALRERWLLRAGKAFERMFAEANQDQLVTFTEREDMACLLGQELAAFLLEEHVAAEGAVRASEKRPPICPHCQQPGQRVTQPEQELPERELTTRAGKVKLRREKWTCTKCRRVFFSVRPPTAVRDGGVQSADSTEGRTAGGQGSVVQGGE
Ga0318549_1012118713300032041SoilSSGYAGAFPFPRRVFPGSMIPRQLLMVLRVFGGAAMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0318556_1057938913300032043SoilMEGRSFLRRKDREALRECWLLRAGQAFERMFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQG
Ga0318558_1032412813300032044SoilACLLGKELTTFLLEEHVAADHEVRPSEKRPPCCPKCKQPGERVAKANTALLERELTTRAGEIIVQREQWRCQKCRIVFFSVRPQAEIGDGALQSADSGEGRAPGQQGGLVQGSQ
Ga0318533_1126876823300032059SoilRSFLRRKDREALRERWLQRAGQAFERMFAEANQDQLVTLTEREDRACLLGKELTTFLLEEHVAADHEVRPSEKRPPCCPKCKQPGERVAKANTALLERELTTRAGEIIVQREQWRCQKCRIVFFSVRPQAEIGDGALQSADSGEGRAPGQQGGLVQGSQ
Ga0318513_1038175623300032065SoilFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGS
Ga0318553_1008072913300032068SoilFGEANQDQLVTFTEREDMACLLGEELAAFLLQEHAAADSQVRPSEKRPPCCPKCQQPGERVTKPKEKLAERELTTRAGEIKLQREQWRCRKCRVLFFSVRSQAEVGDGEIQSADPGEGRTPSRQGGVVQGSQ
Ga0311301_1100160513300032160Peatlands SoilRSFLRRKDREALRERWLLRAGVAFERMFGETNQSQLVTFTEREDMACALAKELAAFLVEEHVAADSQTRPSVKQPPHCPKCQQPGEQVTKRNDKLPERVLKTRAGDVTMRREQWRCKKCRIVFFSAGPQAGLGDGGLQSAGVGESSAASSQGNVVPRRQC
Ga0311301_1274306013300032160Peatlands SoilMEGRPFLRRKDREALRERWLLRAGQAFERMFGEAGQDQLVTFTEREDMACVLGKELAAFLLEQHAAADSQVRPSEKQPPCCPRCQQPGVPVTQSDEPLAERELTTRAGEIKLQRQQWRCQKCRIVFFSVRPQAEVGDGALQSADSGE
Ga0306920_10092228523300032261SoilANQDQLVTLTEREDRACLLGKELTTFLLEEHVAADHEVRPSEKRPPCCPKCKQPGERVAKANTALLERELTTRAGEIIVQREQWRCQKCRIVFFSVRPQAEIGDGALQSADSGEGRAPGQQGGLVQGSQ
Ga0335085_1058746323300032770SoilMEGRSILRGKEKEALRDRWLLRAGKAFERMFGKANQDQLVTFSEREDMACLLGKEMAAFLLEEHAMAEGQVRASAKRPPCCPKCQQPAKRVTKAKEPLPERELTTRAGEIKVRREQWRCPKCRVVFFSAGPQTEVGDGALQSADFGESGAAVKQGCVVSGSQ
Ga0335084_1189055123300033004SoilTFTEREDMACLLGKELAAFLLEEHATADSQVRPSDKHPPCCPKCHQPAERVTPPKAPLPERELTTRAGEITLQREQWRCKKCRVVFFSVRSKAEIGDGAVQSAGVGEGGTAGEHGRVVQGSQ
Ga0335084_1194301713300033004SoilMEGHLILRRQDREALREKWFQRAGQAFERMFGKANQDQLVTFTQREDMACALGEELAAFLLEEHVAVDAEVQRSEKRPPCCPRCQQPGQQVSKRKERKENLPERALTTRAGQVTMRRQQWRCRKCRILFFFR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice