NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F046963

Metagenome Family F046963

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F046963
Family Type Metagenome
Number of Sequences 150
Average Sequence Length 99 residues
Representative Sequence MVLTVELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Number of Associated Samples 100
Number of Associated Scaffolds 150

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 20.67 %
% of genes near scaffold ends (potentially truncated) 12.00 %
% of genes from short scaffolds (< 2000 bps) 65.33 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (90.000 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand
(15.333 % of family members)
Environment Ontology (ENVO) Unclassified
(29.333 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(47.333 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 77.00%    β-sheet: 0.00%    Coil/Unstructured: 23.00%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 150 Family Scaffolds
PF05402PqqD 20.67
PF01008IF-2B 8.67
PF00155Aminotran_1_2 7.33
PF03807F420_oxidored 2.00
PF00300His_Phos_1 2.00
PF03721UDPG_MGDP_dh_N 1.33
PF13847Methyltransf_31 1.33
PF03720UDPG_MGDP_dh_C 0.67
PF10432bact-PGI_C 0.67
PF02350Epimerase_2 0.67
PF10518TAT_signal 0.67
PF08282Hydrolase_3 0.67

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 150 Family Scaffolds
COG01825-methylthioribose/5-deoxyribulose 1-phosphate isomerase (methionine salvage pathway), a paralog of eIF-2B alpha subunitAmino acid transport and metabolism [E] 8.67
COG1184Translation initiation factor 2B subunit, eIF-2B alpha/beta/delta familyTranslation, ribosomal structure and biogenesis [J] 8.67
COG0240Glycerol-3-phosphate dehydrogenaseEnergy production and conversion [C] 1.33
COG0677UDP-N-acetyl-D-mannosaminuronate dehydrogenaseCell wall/membrane/envelope biogenesis [M] 1.33
COG1004UDP-glucose 6-dehydrogenaseCell wall/membrane/envelope biogenesis [M] 1.33
COG12503-hydroxyacyl-CoA dehydrogenaseLipid transport and metabolism [I] 1.33
COG1893Ketopantoate reductaseCoenzyme transport and metabolism [H] 1.33
COG0381UDP-N-acetylglucosamine 2-epimeraseCell wall/membrane/envelope biogenesis [M] 0.67
COG0560Phosphoserine phosphataseAmino acid transport and metabolism [E] 0.67
COG0561Hydroxymethylpyrimidine pyrophosphatase and other HAD family phosphatasesCoenzyme transport and metabolism [H] 0.67
COG0707UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferaseCell wall/membrane/envelope biogenesis [M] 0.67
COG1877Trehalose-6-phosphate phosphataseCarbohydrate transport and metabolism [G] 0.67
COG2217Cation-transporting P-type ATPaseInorganic ion transport and metabolism [P] 0.67
COG3769Mannosyl-3-phosphoglycerate phosphatase YedP/MpgP, HAD superfamilyCarbohydrate transport and metabolism [G] 0.67


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms90.00 %
UnclassifiedrootN/A10.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000571|JGI1358J11329_10004689All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae9751Open in IMG/M
3300000571|JGI1358J11329_10006614All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosarchaeum → Candidatus Nitrosarchaeum limnium7775Open in IMG/M
3300001199|J055_10292883All Organisms → cellular organisms → Archaea569Open in IMG/M
3300002120|C687J26616_10012833All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae3259Open in IMG/M
3300002120|C687J26616_10019970All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae2543Open in IMG/M
3300002120|C687J26616_10020979All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosarchaeum2473Open in IMG/M
3300002120|C687J26616_10059884All Organisms → cellular organisms → Archaea1297Open in IMG/M
3300002123|C687J26634_10233024All Organisms → cellular organisms → Archaea625Open in IMG/M
3300002149|C687J26657_10136708All Organisms → cellular organisms → Archaea503Open in IMG/M
3300002503|C687J35164_10009646All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae3292Open in IMG/M
3300005167|Ga0066672_10220362All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → unclassified Thaumarchaeota → Thaumarchaeota archaeon1214Open in IMG/M
3300005176|Ga0066679_10598913All Organisms → cellular organisms → Archaea719Open in IMG/M
3300005213|Ga0068998_10139834All Organisms → cellular organisms → Archaea571Open in IMG/M
3300005518|Ga0070699_100269110All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis1524Open in IMG/M
3300005536|Ga0070697_100061671All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis3058Open in IMG/M
3300005536|Ga0070697_100948047All Organisms → cellular organisms → Archaea764Open in IMG/M
3300005542|Ga0070732_10031893All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis3004Open in IMG/M
3300005545|Ga0070695_100107069All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1891Open in IMG/M
3300005557|Ga0066704_10006436All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis6134Open in IMG/M
3300006755|Ga0079222_11276404All Organisms → cellular organisms → Archaea666Open in IMG/M
3300006797|Ga0066659_10488286All Organisms → cellular organisms → Archaea985Open in IMG/M
3300006804|Ga0079221_10626686All Organisms → cellular organisms → Archaea732Open in IMG/M
3300006806|Ga0079220_12126504All Organisms → cellular organisms → Archaea503Open in IMG/M
3300006903|Ga0075426_10317783All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → unclassified Thaumarchaeota → Thaumarchaeota archaeon1141Open in IMG/M
3300006914|Ga0075436_100818921All Organisms → cellular organisms → Archaea694Open in IMG/M
3300006954|Ga0079219_12354768All Organisms → cellular organisms → Archaea516Open in IMG/M
3300009012|Ga0066710_100504749All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea1824Open in IMG/M
3300009137|Ga0066709_100257603All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis2341Open in IMG/M
3300009444|Ga0114945_10045509All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2381Open in IMG/M
3300009691|Ga0114944_1465027All Organisms → cellular organisms → Archaea536Open in IMG/M
3300009691|Ga0114944_1481738All Organisms → cellular organisms → Archaea527Open in IMG/M
3300009777|Ga0105164_10002533All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis11275Open in IMG/M
3300009777|Ga0105164_10055279All Organisms → cellular organisms → Archaea2120Open in IMG/M
3300009798|Ga0105060_103208All Organisms → cellular organisms → Archaea860Open in IMG/M
3300009801|Ga0105056_1014944All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota910Open in IMG/M
3300009805|Ga0105079_1009065All Organisms → cellular organisms → Archaea783Open in IMG/M
3300009807|Ga0105061_1009148All Organisms → cellular organisms → Archaea1220Open in IMG/M
3300009809|Ga0105089_1002336All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1963Open in IMG/M
3300009816|Ga0105076_1046589All Organisms → cellular organisms → Archaea782Open in IMG/M
3300009817|Ga0105062_1011649All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1394Open in IMG/M
3300009817|Ga0105062_1118295All Organisms → cellular organisms → Archaea535Open in IMG/M
3300009819|Ga0105087_1064198All Organisms → cellular organisms → Archaea626Open in IMG/M
3300009823|Ga0105078_1008391All Organisms → cellular organisms → Archaea1076Open in IMG/M
3300009836|Ga0105068_1036936All Organisms → cellular organisms → Archaea864Open in IMG/M
3300010391|Ga0136847_11313933All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis3295Open in IMG/M
3300010391|Ga0136847_11391884All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1149Open in IMG/M
3300010391|Ga0136847_11712813All Organisms → cellular organisms → Archaea580Open in IMG/M
3300011269|Ga0137392_11589068All Organisms → cellular organisms → Archaea512Open in IMG/M
3300012189|Ga0137388_11509696All Organisms → cellular organisms → Archaea609Open in IMG/M
3300012202|Ga0137363_10282497All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → unclassified Thaumarchaeota → Thaumarchaeota archaeon1358Open in IMG/M
3300012203|Ga0137399_10032664All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis3604Open in IMG/M
3300012206|Ga0137380_10436376All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → unclassified Thaumarchaeota → Thaumarchaeota archaeon1159Open in IMG/M
3300012918|Ga0137396_10086472All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis2218Open in IMG/M
3300012922|Ga0137394_10844885All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → unclassified Thaumarchaeota → Thaumarchaeota archaeon766Open in IMG/M
3300013092|Ga0163199_1032867All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis2474Open in IMG/M
3300017927|Ga0187824_10048145All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis1311Open in IMG/M
3300017930|Ga0187825_10164103All Organisms → cellular organisms → Archaea789Open in IMG/M
3300017930|Ga0187825_10275573Not Available622Open in IMG/M
3300017936|Ga0187821_10243502All Organisms → cellular organisms → Archaea701Open in IMG/M
3300017993|Ga0187823_10044071All Organisms → cellular organisms → Archaea1209Open in IMG/M
3300018031|Ga0184634_10015164All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2827Open in IMG/M
3300018031|Ga0184634_10069335All Organisms → cellular organisms → Archaea1496Open in IMG/M
3300018031|Ga0184634_10562280All Organisms → cellular organisms → Archaea504Open in IMG/M
3300018063|Ga0184637_10081240Not Available1989Open in IMG/M
3300018063|Ga0184637_10121497All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1611Open in IMG/M
3300018074|Ga0184640_10035569Not Available2002Open in IMG/M
3300018074|Ga0184640_10161072All Organisms → cellular organisms → Archaea1004Open in IMG/M
3300018074|Ga0184640_10297726All Organisms → cellular organisms → Archaea733Open in IMG/M
3300018077|Ga0184633_10015616All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis3693Open in IMG/M
3300018077|Ga0184633_10159777Not Available1168Open in IMG/M
3300018077|Ga0184633_10349573All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota746Open in IMG/M
3300018077|Ga0184633_10357886All Organisms → cellular organisms → Archaea736Open in IMG/M
3300018077|Ga0184633_10401450All Organisms → cellular organisms → Archaea685Open in IMG/M
3300018077|Ga0184633_10568288All Organisms → cellular organisms → Archaea539Open in IMG/M
3300018079|Ga0184627_10508620All Organisms → cellular organisms → Archaea620Open in IMG/M
3300018082|Ga0184639_10078970All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus1727Open in IMG/M
3300019360|Ga0187894_10155773Not Available1156Open in IMG/M
3300019458|Ga0187892_10003638All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota27665Open in IMG/M
3300019458|Ga0187892_10394981All Organisms → cellular organisms → Archaea660Open in IMG/M
3300019487|Ga0187893_10490923All Organisms → cellular organisms → Archaea804Open in IMG/M
3300021171|Ga0210405_10312372All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1242Open in IMG/M
3300021178|Ga0210408_10125007All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2030Open in IMG/M
3300021178|Ga0210408_10257202All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis1391Open in IMG/M
3300021178|Ga0210408_10456887All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis1016Open in IMG/M
3300022563|Ga0212128_10033583All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra3305Open in IMG/M
3300022563|Ga0212128_10217737All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea1214Open in IMG/M
3300024182|Ga0247669_1034079All Organisms → cellular organisms → Archaea857Open in IMG/M
3300025119|Ga0209126_1048642Not Available1253Open in IMG/M
3300025146|Ga0209322_10022729Not Available3149Open in IMG/M
3300025146|Ga0209322_10123194Not Available1171Open in IMG/M
3300025149|Ga0209827_10028827All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra3336Open in IMG/M
3300025149|Ga0209827_10654848All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra2016Open in IMG/M
3300025149|Ga0209827_10942171All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales1745Open in IMG/M
3300025149|Ga0209827_11066337All Organisms → cellular organisms → Archaea522Open in IMG/M
3300025155|Ga0209320_10005867All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra6031Open in IMG/M
3300025155|Ga0209320_10044556All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2115Open in IMG/M
3300025157|Ga0209399_10048079All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra1761Open in IMG/M
3300025164|Ga0209521_10021315Not Available4623Open in IMG/M
3300025173|Ga0209824_10004096All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra7053Open in IMG/M
3300025173|Ga0209824_10125721All Organisms → cellular organisms → Archaea933Open in IMG/M
3300025173|Ga0209824_10352424All Organisms → cellular organisms → Archaea502Open in IMG/M
3300025289|Ga0209002_10139483All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1555Open in IMG/M
3300025313|Ga0209431_10003723Not Available12083Open in IMG/M
3300025313|Ga0209431_10019380All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota5229Open in IMG/M
3300025313|Ga0209431_10610115All Organisms → cellular organisms → Archaea817Open in IMG/M
3300025318|Ga0209519_10100508All Organisms → cellular organisms → Archaea1698Open in IMG/M
3300025319|Ga0209520_10192204All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1286Open in IMG/M
3300025319|Ga0209520_10291058Not Available1004Open in IMG/M
3300025326|Ga0209342_10193919All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1815Open in IMG/M
3300025327|Ga0209751_10477083All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1023Open in IMG/M
3300026328|Ga0209802_1024826All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra3229Open in IMG/M
3300026552|Ga0209577_10532026All Organisms → cellular organisms → Archaea761Open in IMG/M
3300026856|Ga0209852_1006930All Organisms → cellular organisms → Archaea661Open in IMG/M
3300026888|Ga0209900_1001785Not Available1439Open in IMG/M
3300027006|Ga0209896_1037039All Organisms → cellular organisms → Archaea560Open in IMG/M
3300027013|Ga0209884_1010558All Organisms → cellular organisms → Archaea875Open in IMG/M
3300027027|Ga0209844_1004339All Organisms → cellular organisms → Archaea999Open in IMG/M
3300027163|Ga0209878_1002804All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2539Open in IMG/M
3300027163|Ga0209878_1008198All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1494Open in IMG/M
3300027332|Ga0209861_1004950All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2007Open in IMG/M
3300027379|Ga0209842_1016735All Organisms → cellular organisms → Archaea1432Open in IMG/M
3300027384|Ga0209854_1090255All Organisms → cellular organisms → Archaea546Open in IMG/M
3300027561|Ga0209887_1050797All Organisms → cellular organisms → Archaea895Open in IMG/M
(restricted) 3300027799|Ga0233416_10002096All Organisms → cellular organisms → Archaea6284Open in IMG/M
(restricted) 3300027799|Ga0233416_10158753All Organisms → cellular organisms → Archaea775Open in IMG/M
3300027835|Ga0209515_10001475Not Available39282Open in IMG/M
3300027835|Ga0209515_10002640All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota27152Open in IMG/M
3300027835|Ga0209515_10020969All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota6200Open in IMG/M
3300027835|Ga0209515_10176407All Organisms → cellular organisms → Archaea1305Open in IMG/M
3300027842|Ga0209580_10006220All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea5107Open in IMG/M
3300027862|Ga0209701_10391811All Organisms → cellular organisms → Archaea776Open in IMG/M
3300027952|Ga0209889_1035630Not Available1074Open in IMG/M
(restricted) 3300027995|Ga0233418_10194257All Organisms → cellular organisms → Archaea666Open in IMG/M
(restricted) 3300028043|Ga0233417_10007027All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota4097Open in IMG/M
(restricted) 3300028043|Ga0233417_10009289All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota3627Open in IMG/M
3300028536|Ga0137415_10012182All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea devanaterra8523Open in IMG/M
3300031576|Ga0247727_10001433All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota56625Open in IMG/M
3300031576|Ga0247727_10012498All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea14561Open in IMG/M
3300031576|Ga0247727_10019926All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota10330Open in IMG/M
3300031576|Ga0247727_10073656All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea3831Open in IMG/M
3300031576|Ga0247727_10116221All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea2725Open in IMG/M
3300031576|Ga0247727_10128828All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea2524Open in IMG/M
3300031576|Ga0247727_10186091All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1930Open in IMG/M
3300031576|Ga0247727_10382676All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota1145Open in IMG/M
3300032180|Ga0307471_100160244All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea2182Open in IMG/M
3300032205|Ga0307472_100662352All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota932Open in IMG/M
3300032770|Ga0335085_10927526All Organisms → cellular organisms → Archaea947Open in IMG/M
3300033233|Ga0334722_10000025All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea232625Open in IMG/M
3300033233|Ga0334722_10077531All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2555Open in IMG/M
3300033233|Ga0334722_10144604Not Available1783Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand15.33%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment10.67%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil8.67%
Thermal SpringsEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs6.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil6.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil6.00%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment5.33%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm5.33%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater4.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil4.00%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment3.33%
WastewaterEnvironmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater3.33%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.33%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.67%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.67%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment2.00%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.33%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.33%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.33%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.33%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze1.33%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.33%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater0.67%
LoticEnvironmental → Aquatic → Freshwater → Lotic → Unclassified → Lotic0.67%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands0.67%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.67%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000571Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 mEnvironmentalOpen in IMG/M
3300001199Lotic microbial communities from nuclear landfill site in Hanford, Washington, USA - IFRC combined assemblyEnvironmentalOpen in IMG/M
3300002120Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2EnvironmentalOpen in IMG/M
3300002123Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_3EnvironmentalOpen in IMG/M
3300002149Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2EnvironmentalOpen in IMG/M
3300002503Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005213Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D2EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009444Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3EnvironmentalOpen in IMG/M
3300009691Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2EnvironmentalOpen in IMG/M
3300009777Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking waterEnvironmentalOpen in IMG/M
3300009798Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_40_50EnvironmentalOpen in IMG/M
3300009801Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_20_30EnvironmentalOpen in IMG/M
3300009805Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_0_10EnvironmentalOpen in IMG/M
3300009807Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10EnvironmentalOpen in IMG/M
3300009809Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_30_40EnvironmentalOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300009817Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20EnvironmentalOpen in IMG/M
3300009819Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50EnvironmentalOpen in IMG/M
3300009823Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_40_50EnvironmentalOpen in IMG/M
3300009836Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300013092Freshwater microbial communities from Powell Lake, British Columbia, Canada to study Microbial Dark Matter (Phase II) - PL_2010_150mEnvironmentalOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017993Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3EnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2EnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300022563OV2_combined assemblyEnvironmentalOpen in IMG/M
3300024182Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10EnvironmentalOpen in IMG/M
3300025119Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025146Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 1EnvironmentalOpen in IMG/M
3300025149Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 (SPAdes)EnvironmentalOpen in IMG/M
3300025155Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 4EnvironmentalOpen in IMG/M
3300025157Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 (SPAdes)EnvironmentalOpen in IMG/M
3300025164Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 4EnvironmentalOpen in IMG/M
3300025173Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes)EnvironmentalOpen in IMG/M
3300025289Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 2EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300025318Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1EnvironmentalOpen in IMG/M
3300025319Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1EnvironmentalOpen in IMG/M
3300025326Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025327Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026856Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300026888Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027006Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027013Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027027Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027163Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027332Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027379Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027384Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027561Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027835Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027952Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027995 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_1_MGEnvironmentalOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI1358J11329_1000468993300000571GroundwaterMSLSLPLKDALNSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS*
JGI1358J11329_1000661453300000571GroundwaterMSLSVPLKDALNSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSIEAEEMRKIIFKRALEMKKAIAYVRFQESKS*
J055_1029288323300001199LoticMVLTVQLKDALHSLIDNLMINPQQMRNMASDLKPLAKDDVEVAFGIFVGYVTGGFAELFYESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESSS*
C687J26616_1001283353300002120SoilMVLTVELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS*
C687J26616_1001997043300002120SoilMVLTVELKDALYSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRA*
C687J26616_1002097933300002120SoilMILTKTMVLTIELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS*
C687J26616_1005988433300002120SoilMTLTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS*
C687J26634_1023302423300002123SoilMVLTIELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS*
C687J26657_1013670813300002149SoilMVLTVELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERA
C687J35164_1000964643300002503SoilMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS*
Ga0066672_1022036233300005167SoilMGLSVPLKDAIISLIDNLMANPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVKAQESKR*
Ga0066679_1059891313300005176SoilMSLSLPLKEALVSLIDHLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSIEAEEMRKIIFKR
Ga0068998_1013983413300005213Natural And Restored WetlandsMMPRTIMTLTFELKNALQQLIDNLMINPQQMRNMAHDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRNMTPTELIEVRKILFDRALEMKKAIAYVRAQSQS*
Ga0070699_10026911023300005518Corn, Switchgrass And Miscanthus RhizosphereMSLALPLKEALISLIDNLMVNPHNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERAFEMKKAIAYVRSQESKS*
Ga0070697_10006167143300005536Corn, Switchgrass And Miscanthus RhizosphereMSLSEPLKDTLVSLIDNLMINPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAEIFFESQKRGMTSMEAEEMRKIIFERALEVKKAIAYVRAQESKS*
Ga0070697_10094804713300005536Corn, Switchgrass And Miscanthus RhizosphereIDNLMANPQNMRNMAFDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRGMMPAEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0070732_1003189353300005542Surface SoilMSLSLPLKEALVSLIDNLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSMEAEEMRKIIFERALEMKKAIAYVRSQK*
Ga0070695_10010706923300005545Corn, Switchgrass And Miscanthus RhizosphereMSLPVPLRDALVLLIDNLMANPQNMRNMAFDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRGMMPAEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0066704_10006436103300005557SoilMLICLPMSLSVPLKDALNSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSIEAEEMRKIIFKRALEMKNAIAYVRFQESKS*
Ga0079222_1127640423300006755Agricultural SoilMSLSGPLKEELVSLIDNLMVNPQIMRNMAYVLKQLTKDDTEVPFGFFVGYVTGGFAELFFESQKRGMTSVEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0066659_1048828613300006797SoilMGLSVPLKDAIISLIDNLMANPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSLEAEEMRKIIFERALEMKKAIAYVKAQESKR*
Ga0079221_1062668623300006804Agricultural SoilMSLSEPLKEALVSLIDNLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSMEAEEMRKIIFERAFEMKKAIAYVRAQESKS*
Ga0079220_1212650413300006806Agricultural SoilLKEALVSLIDNLMVNPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTYVEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0075426_1031778333300006903Populus RhizosphereMSLSLPLKEALVSMIDHLMINPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0075436_10081892113300006914Populus RhizosphereEALVSLIDNLIITPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAEIFFESQKRGMTSVEAEEMRKIIFERALEMKKAITYVRAQESKS*
Ga0079219_1235476823300006954Agricultural SoilMSLSEPLKDALVSLIDNLMINPQNMRNMAYDLQPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTYAEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0066710_10050474923300009012Grasslands SoilMSLSLPLKEALVSLIDHLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0066709_10025760343300009137Grasslands SoilMSLSLPLKEALVSLIDHLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0114945_1004550933300009444Thermal SpringsMTLTVELKNALHSMIDHLMTSSQNIRNMAYDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRCMTVAEAKEMRKILFERAFEMKKAIAYARTQESRS*
Ga0114944_146502723300009691Thermal SpringsMHTKIMVLTIELKDILQQLIDNLMLNPQQMRNMAYDLKPLTKDDIEVGFGIFVGYVTGGFAELFFESQKRSMTASELIEVRRILFERALEMKKAIAYVRAQESQS*
Ga0114944_148173813300009691Thermal SpringsMTLTVELKDALHSMIDNLMTSSQNIRNMASDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRGMTSAEAKEMQKILFERAFEMKKAIAYARTQESRS*
Ga0105164_10002533143300009777WastewaterMVLTVQLKDALHSLIDNLMINPQQMRNMASDLKPLAKDDVEVAFGIFVGYVTGGFAELFYESQKRSMTASELIEIRKILFERALEMKKAIAYVRAQESSS*
Ga0105164_1005527933300009777WastewaterMALNQQLKESLLLLIDNLMANPHNLRNIANDLKPLSKDDTEVAFGIFIGYVTGGFAELFFESQKRGMTTSEAKEMKKIILERALEMKKAIAHARIPETRS*
Ga0105060_10320823300009798Groundwater SandMTLTKIMALTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS*
Ga0105056_101494423300009801Groundwater SandMALTVELKDALYTLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFTELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSGS*
Ga0105079_100906523300009805Groundwater SandMTLTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTAPELIQVRKILFERALEMKKAIAYVRAQESQS*
Ga0105061_100914823300009807Groundwater SandMTRTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS*
Ga0105089_100233623300009809Groundwater SandMTLTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESQS*
Ga0105076_104658913300009816Groundwater SandMTRTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESQS*
Ga0105062_101164923300009817Groundwater SandMALTVELKDALYTLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSGS*
Ga0105062_111829523300009817Groundwater SandIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS*
Ga0105087_106419823300009819Groundwater SandTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTAPELIQVRKILFERALEMKKAIAYVRAQESQS*
Ga0105078_100839123300009823Groundwater SandMTRTKIMVLTMELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS*
Ga0105068_103693623300009836Groundwater SandMALTVALKDALYSLIDNLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS*
Ga0136847_1131393323300010391Freshwater SedimentMINPQQMRNMASDLKPLARDDVEVAFGIFVGYVTGGFAELFYESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESSS*
Ga0136847_1139188413300010391Freshwater SedimentMALTVVLKDALYSIIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQQSRS*
Ga0136847_1171281323300010391Freshwater SedimentMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESEKRSITAKEAIEVRKILFERALEMKNAIAYVRAQQSRS*
Ga0137392_1158906813300011269Vadose Zone SoilMGLSVPLKDTLVSLIDNLMVNPENMQNMASDLKPLTRDDTEVAFGIFVGYVTGGFAEIFFESQKRDMTSMEADEMRKIIFERALEMKKAIAYVRAHESKS*
Ga0137388_1150969623300012189Vadose Zone SoilMSLSEPLKNALVSLIDNLIVNPQNMRNIASDLKPLTRDNTEVAFGIFVGYVTGGFAEIFFESQKRGMTSIEAEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0137363_1028249713300012202Vadose Zone SoilMSLSVQLKNALVSLIDNLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSMEAEETRKIIFERALEMKKAIAYVRAQESKS*
Ga0137399_1003266413300012203Vadose Zone SoilMSLSVPLKDALNSLIDNLMVNPQNMRNMASDLKPLTKDDTEVAFGIFAGYVTGGFAELFFESQKRGMTSIEAKEMRKIIFERALEMKKAIAYVRAHESKS*
Ga0137380_1043637623300012206Vadose Zone SoilMSLSTPLKDALVSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRGMTYVEAEEMQKIIFKRALEMKKAIAYVRTQESKS*
Ga0137396_1008647233300012918Vadose Zone SoilMSLSVQLKDALVSLIDNLMLNPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSIEAEEMRKIIFERALEMKKAIAHVRAQESKS*
Ga0137394_1084488523300012922Vadose Zone SoilMSLSVQLKNALVSLIDNLMVNPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSIEAEEMRKIVFERALEMKKAIAYVRAQESKN*
Ga0163199_103286733300013092FreshwaterMSLSVSLKDALVSLIDNLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAEIFFESQKRGMTSMEVEEMRKIIFERALEMKKAIAYVRAQESKS*
Ga0187824_1004814533300017927Freshwater SedimentMSLSVSLKSALVSLIDNLMANPQNMRNMASDLKPLARDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERAFEMKKAIAYVRAQESKS
Ga0187825_1016410323300017930Freshwater SedimentMSLSLSLKQALVSLIDNLMINPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVRSQESKS
Ga0187825_1027557313300017930Freshwater SedimentNLMVNPQNMRNMAFDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERAFEMKKAIAYARSQESKS
Ga0187821_1024350213300017936Freshwater SedimentNLMANPQNMRNMASDLKPLARDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERAFEMKKAIAYVRAQESKS
Ga0187823_1004407133300017993Freshwater SedimentEALVSLIDNLMINPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTYMEAEEMRKIIFDRALEMKKAIAYVKSQESKS
Ga0184634_1001516433300018031Groundwater SedimentMALTVELKDALYSLIDNLMANPQKMRNMASDLKPLTKDNIEVAFGIFVGYVTGGFAELFFESEKRSMTAKEAIEVRKILFERALEMKKAIAYVQSQESRS
Ga0184634_1006933513300018031Groundwater SedimentNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQQSRS
Ga0184634_1056228023300018031Groundwater SedimentMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVRGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVQAQQSRS
Ga0184637_1008124013300018063Groundwater SedimentNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVQAQQSRS
Ga0184637_1012149743300018063Groundwater SedimentMALTVELKDALYSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0184640_1003556943300018074Groundwater SedimentMALTVELKDALYSLIDNLMANPQKMRNMASDLKPLTKDNIEVAFGIFVGYVTGGFAELFFESEKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSRS
Ga0184640_1016107223300018074Groundwater SedimentMALTVELKDALYSLIDNLMANPQKMRNMASDLKPLAKDDIEVAFGIFVGYVAGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0184640_1029772623300018074Groundwater SedimentMALTVELKDALYSIIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0184633_1001561623300018077Groundwater SedimentMILTKIMVLTVELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0184633_1015977733300018077Groundwater SedimentMVLTVELKDALYSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSRS
Ga0184633_1034957333300018077Groundwater SedimentMALTVELKDALCSLIDNLMKNPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAVEVRKILFERA
Ga0184633_1035788613300018077Groundwater SedimentMALTVELKDALYSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSRS
Ga0184633_1040145013300018077Groundwater SedimentMALTVELKDALYSLIDNLMANPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0184633_1056828823300018077Groundwater SedimentMALTVELKDALYSLTDNLMKNPQKMRNMAFDLKPLAKDDIEVAFGIFVAYVTGGFAELFFESQKRSMTAKEAVEVRKILFERALEMKKAIAYVQAQESRS
Ga0184627_1050862013300018079Groundwater SedimentMALTVELKDALYSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVQAQQSRS
Ga0184639_1007897033300018082Groundwater SedimentMALTVELKDALYSLTDNLMKNPQKMRNMAFDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAVEVRKILFERALEMKKAIAYVQAQESRS
Ga0187894_1015577333300019360Microbial Mat On RocksMALTVDLKDALYSLIDNLMANPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0187892_10003638163300019458Bio-OozeMTLTVELKDALHSMIDNLMTSSQNIRNMASDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRCMTAAEAKEMRKILFERAFEMKKVIAYARTQESRS
Ga0187892_1039498123300019458Bio-OozeMALTVALKDALYSLIDGLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEVIEVRKILFERALEMKKAIAYVQAQESRS
Ga0187893_1049092333300019487Microbial Mat On RocksMALTVALKDALYSLIDGLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0210405_1031237213300021171SoilMSLSVPLKDALISLIDNLMVNPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSIEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0210408_1012500713300021178SoilMSLSIPLKDTLISLIDNLMINPQNMRSMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFEKALEMKKAIAYVKAQESKR
Ga0210408_1025720233300021178SoilMSLSEPLKDALVSLIDNLMVNPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0210408_1045688723300021178SoilMSLSEPLKDALVSLIDDLMVNPQNLRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0212128_1003358333300022563Thermal SpringsMTLTVELRNALHSMIDNLMTSSQNIRNMAYDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRCMTVAEAKEMRKILFERAFEMKKAIAYARTQESRS
Ga0212128_1021773713300022563Thermal SpringsMVLTIELKDILQQLIDNLMLNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRRILFERALEMKKAIAYVRAQESQS
Ga0247669_103407913300024182SoilMSLALPLKEALISLIDNLMVNPHNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERAFEMKKAIAYVRSQESKS
Ga0209126_104864233300025119SoilMVLTIELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0209322_1002272923300025146SoilMVLTVELKDALYSLIDNLMTNPQKIRYMALDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSRS
Ga0209322_1012319433300025146SoilMTLTKIMVLTVELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0209827_1002882713300025149Thermal SpringsMTLAIELKNALHSMIDNLMTSSQNIRNMASDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRGMTSAEAKEMQKILFERAFEMKKAIAYARTQESRS
Ga0209827_1065484823300025149Thermal SpringsMALTVELKDALYSLIDNLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0209827_1094217133300025149Thermal SpringsCLVYNIKNLMTLTVELRNALHSMIDNLMTSSQNIRNMAYDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRCMTVAEAKEMRKILFERAFEMKKAIAYARTQESRS
Ga0209827_1106633723300025149Thermal SpringsMVLTIELKDILQQLIDNLMLNPQQMRNMAYDLKPLTKDDIEVGFGIFVGYVTGGFAELFFESQKRSMTASELIEVRRILFERALEMKKAIAYVRAQESQS
Ga0209320_10005867103300025155SoilMTLTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0209320_1004455633300025155SoilMALTVELKDALYSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRA
Ga0209399_1004807933300025157Thermal SpringsMTLTIELKNALYTMIDNLMTSSQNIRNMAYDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRCMTVAEAKEMRKILFERAFEMKKAIAYARTQESRS
Ga0209521_1002131553300025164SoilMALTVELKDALYSLIDNLMTNPQKIRYMALDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSRS
Ga0209824_1000409653300025173WastewaterMSLSVSLKDALVSLIDNLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAEIFFESQKRDMTSMEVEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0209824_1012572123300025173WastewaterMALNQQLKESLLLLIDNLMANPHNLRNIANDLKPLSKDDTEVAFGIFIGYVTGGFAELFFESQKRGMTTSEAKEMKKIILERALEMKKAIAHARIPETRS
Ga0209824_1035242413300025173WastewaterMVLTVQLKDALHSLIDNLMINPQQMRNMASDLKPLAKDDVEVAFGIFVGYVTGGFAELFYESQKRSMTASELIEIRKILFERALEMKKAIAYVRTQESSS
Ga0209002_1013948323300025289SoilMTLTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0209431_1000372363300025313SoilMVLTVELKDALYSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRA
Ga0209431_1001938033300025313SoilMILTKTMVLTIELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0209431_1061011523300025313SoilMTITVELKNALYSLIDNLMENPQKMRNMASDLKPLAKDDIEVAFGIFIGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0209519_1010050833300025318SoilMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0209520_1019220433300025319SoilTLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0209520_1029105833300025319SoilMVNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0209342_1019391933300025326SoilMVLTIELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0209751_1047708313300025327SoilMTLTVELKNALYSLIDNLMKNPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTSKEAIEVRKILFERALEMKKAIAYVQTQQKLNFSRQNSI
Ga0209802_102482633300026328SoilMLICLPMSLSVPLKDALNSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSIEAEEMRKIIFKRALEMKKAIAYVRFQESKS
Ga0209577_1053202613300026552SoilNLMANPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVKAQESKR
Ga0209852_100693023300026856Groundwater SandMALTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0209900_100178523300026888Groundwater SandMTRTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0209896_103703913300027006Groundwater SandMALTVELKDALYTLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSGS
Ga0209884_101055823300027013Groundwater SandMTLTVELKDALYSLIDNLMTNSQKIRNMASDLKPLTKDDIEVAFGIFVGYVAGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSRS
Ga0209844_100433923300027027Groundwater SandMTRTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTAPELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0209878_100280423300027163Groundwater SandMTVELKDALYSLIDNLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0209878_100819833300027163Groundwater SandMILTVEFKDALYSLIDSLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVQAQQSRS
Ga0209861_100495053300027332Groundwater SandMTRTKIMVLTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTAPELIQVRKILFERALEMKKAIAYVRAQESQS
Ga0209842_101673513300027379Groundwater SandMILTVEFKDALYSLIDSLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSGS
Ga0209854_109025513300027384Groundwater SandMALTVALKDALYSLIDNLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0209887_105079713300027561Groundwater SandMALTVELKDALYSLIDNLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVRAQQSGS
(restricted) Ga0233416_1000209623300027799SedimentMHTKIMVLTIELKDTLQKLIDNLMLNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRRILFERALEMKKAIAYVRAQESQR
(restricted) Ga0233416_1015875323300027799SedimentMCHLQTMVLTTQLKETLHQLIDNLMVNPQQMRNMAYDLKPLAKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAHESQN
Ga0209515_10001475373300027835GroundwaterMSLSVPLKDALNSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSIEAEEMRKIIFKRALEMKKAIAYVRFQESKS
Ga0209515_10002640183300027835GroundwaterMSLSLPLKDALNSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSIEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0209515_1002096953300027835GroundwaterMVLTVQLKDALHSLIDNLMINPQQMRNMASDLKPLAKDDVEVAFGIFVGYVTGGFAELFYESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESSS
Ga0209515_1017640723300027835GroundwaterMSLSIPLKNALISLIDNLLINPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0209580_1000622033300027842Surface SoilMSLSLPLKEALVSLIDNLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSMEAEEMRKIIFERALEMKKAIAYVRSQK
Ga0209701_1039181113300027862Vadose Zone SoilMSLSVPLKDALNSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFVELFFESQKRDMTSMEAEEMRKIIFERALEMKKAIAYVRVQQ
Ga0209889_103563023300027952Groundwater SandMTLTKIVALTIELKETLHQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
(restricted) Ga0233418_1019425713300027995SedimentMTHLQTMVLTTQLKETLHQLIDNLMVNPQQMRNMAYDLKPLAKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRANESQN
(restricted) Ga0233417_1000702783300028043SedimentMTHLQTMVLTTQLKETLHQLIDNLMVNPQQMRNMAYDLKPLAKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAHESQN
(restricted) Ga0233417_1000928923300028043SedimentMHTKIMVLTIELKDTLQQLIDNLMLNPQQMRNMAYDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRRILFERALEMKKAIAYVRAQESQR
Ga0137415_10012182103300028536Vadose Zone SoilMSLSVQLKDALVSLIDNLMLNPQNMRNMASDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSIEAEEMRKIIFERALEMKKAIAHVRAQESKS
Ga0247727_10001433473300031576BiofilmMALTVQLKDALYSLIDNLMVNPQQMRNMASDLKPLAKDDIEVAFGIFVGYVTGGFAELFYESQKRSMTASELIELRKILFERALEMKKAIAYVQAQQSSS
Ga0247727_1001249813300031576BiofilmMTRTKIMALTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0247727_1001992613300031576BiofilmMELKGTLHQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRVQESKS
Ga0247727_1007365673300031576BiofilmMILTKIMVLAVELKDTLQQLIDNLMVNPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESQS
Ga0247727_1011622133300031576BiofilmMTLTVELRNALHSMIDNLMTSSQNIRNMASDLKPLSKDDTEVAFGIFVGYVTGGFAELFFDSQKRCMTVAEAKEMRKILFERAFEMKKAIAYARTQESRS
Ga0247727_1012882833300031576BiofilmMAMTVELKDALCSLIDNLMTNPQKIRNMASDLKPLTKDDIEVAFGIFVGYVTGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKNAIAYVQAQQSRS
Ga0247727_1018609143300031576BiofilmMALTVQLKDALYSLIDNLMVNPQQMRNMASDLKPLARDDVEVAFGIFVGYVTGGFAELFFESQKRNMTASELLEVRKILFERALEMKKAIAYVRAQESSS
Ga0247727_1038267633300031576BiofilmMALTVELKDALYSLIDNLMENPQKMRNMASDLKPLAKDDIEVAFGIFVGYVAGGFAELFFESQKRSMTAKEAIEVRKILFERALEMKKAIAYVQAQESRS
Ga0307471_10016024423300032180Hardwood Forest SoilMSLSEPLKEALVSLIDNLMVNPQNMRNMAYDLKPFTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0307472_10066235223300032205Hardwood Forest SoilMSLSLPLKEALVSLIDNLMVNPQNMRNMAYDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRGMTSMEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0335085_1092752623300032770SoilMVTPQNMRNMATDLKPLTKDDTEVAFGIFVGYVTGGFAELFFESQKRSMTSKEAEEMRNIIFKRALEMKKAISYVRTQERNSKF
Ga0334722_10000025633300033233SedimentMSLSEPLKDALVSLIDNLMVNPQNMRNMASDLKPLTRDDTEVAFGIFVGYVTGGFAEIFFESQKRGMTSIEAEEMRKIIFERALEMKKAIAYVRAQESKS
Ga0334722_1007753133300033233SedimentMALTAQLKDALHLLIDNLMVNPQQMRNMASDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIEVRKILFERALEMKKAIAYVRAQESSS
Ga0334722_1014460423300033233SedimentMVLTIELKDTLHQLIDNLMVNPQQMRNMALDLKPLTKDDVEVAFGIFVGFVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESQS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.