NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F096967

Metagenome Family F096967

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F096967
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 304 residues
Representative Sequence MARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Number of Associated Samples 93
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 55.34 %
% of genes near scaffold ends (potentially truncated) 44.23 %
% of genes from short scaffolds (< 2000 bps) 51.92 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.038 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(28.846 % of family members)
Environment Ontology (ENVO) Unclassified
(43.269 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(47.115 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 75.31%    β-sheet: 0.00%    Coil/Unstructured: 24.69%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF13472Lipase_GDSL_2 20.19
PF03169OPT 18.27
PF02567PhzC-PhzF 3.85
PF00034Cytochrom_C 1.92
PF01909NTP_transf_2 1.92
PF08240ADH_N 0.96
PF00581Rhodanese 0.96
PF13772AIG2_2 0.96
PF13860FlgD_ig 0.96
PF00571CBS 0.96
PF07732Cu-oxidase_3 0.96
PF02866Ldh_1_C 0.96
PF00174Oxidored_molyb 0.96
PF01926MMR_HSR1 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 104 Family Scaffolds
COG1297Predicted oligopeptide transporter, OPT familyGeneral function prediction only [R] 18.27
COG0384Predicted epimerase YddE/YHI9, PhzF superfamilyGeneral function prediction only [R] 3.85
COG0039Malate/lactate dehydrogenaseEnergy production and conversion [C] 0.96
COG2041Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductasesEnergy production and conversion [C] 0.96
COG2132Multicopper oxidase with three cupredoxin domains (includes cell division protein FtsP and spore coat protein CotA)Cell cycle control, cell division, chromosome partitioning [D] 0.96
COG3915Uncharacterized conserved proteinFunction unknown [S] 0.96


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.04 %
UnclassifiedrootN/A0.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002560|JGI25383J37093_10001950All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii5884Open in IMG/M
3300002561|JGI25384J37096_10006187All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii4425Open in IMG/M
3300002908|JGI25382J43887_10079001All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1771Open in IMG/M
3300005166|Ga0066674_10175175All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1018Open in IMG/M
3300005171|Ga0066677_10148840All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1281Open in IMG/M
3300005176|Ga0066679_10065770All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2130Open in IMG/M
3300005177|Ga0066690_10521794All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium797Open in IMG/M
3300005186|Ga0066676_10178659All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1352Open in IMG/M
3300005450|Ga0066682_10024204All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium3481Open in IMG/M
3300005467|Ga0070706_100464151All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1178Open in IMG/M
3300005536|Ga0070697_100028774All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii4451Open in IMG/M
3300005540|Ga0066697_10031924All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2921Open in IMG/M
3300005553|Ga0066695_10166898All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1374Open in IMG/M
3300005557|Ga0066704_10037721All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2992Open in IMG/M
3300005561|Ga0066699_10172612All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1491Open in IMG/M
3300005568|Ga0066703_10052329All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2290Open in IMG/M
3300005586|Ga0066691_10180598All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1224Open in IMG/M
3300006797|Ga0066659_10118040All Organisms → cellular organisms → Bacteria1835Open in IMG/M
3300006800|Ga0066660_10009994All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii5049Open in IMG/M
3300007265|Ga0099794_10009593All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii4079Open in IMG/M
3300009012|Ga0066710_100224258All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2698Open in IMG/M
3300009012|Ga0066710_101140569All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1206Open in IMG/M
3300009038|Ga0099829_10146925All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1879Open in IMG/M
3300009088|Ga0099830_10047375All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium3016Open in IMG/M
3300009090|Ga0099827_10004593All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira8126Open in IMG/M
3300009137|Ga0066709_100051573All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira moscoviensis4650Open in IMG/M
3300009777|Ga0105164_10000525All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira26834Open in IMG/M
3300009777|Ga0105164_10042077All Organisms → cellular organisms → Bacteria → Nitrospirae2480Open in IMG/M
3300009777|Ga0105164_10186856All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1053Open in IMG/M
3300010301|Ga0134070_10004800All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii4094Open in IMG/M
3300010304|Ga0134088_10043642All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2048Open in IMG/M
3300010391|Ga0136847_10975737All Organisms → cellular organisms → Bacteria → Nitrospirae4840Open in IMG/M
3300011271|Ga0137393_10165091All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1856Open in IMG/M
3300012189|Ga0137388_10141327All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2120Open in IMG/M
3300012189|Ga0137388_10237225All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1656Open in IMG/M
3300012204|Ga0137374_10016082All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira8601Open in IMG/M
3300012206|Ga0137380_10217910All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1727Open in IMG/M
3300012207|Ga0137381_10201086All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1727Open in IMG/M
3300012209|Ga0137379_10109259All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2661Open in IMG/M
3300012349|Ga0137387_10260053All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1252Open in IMG/M
3300012356|Ga0137371_10085512All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2449Open in IMG/M
3300012359|Ga0137385_10009650All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira8535Open in IMG/M
3300012360|Ga0137375_10146755All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2312Open in IMG/M
3300012918|Ga0137396_10055296All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2733Open in IMG/M
3300012927|Ga0137416_10050412All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2916Open in IMG/M
3300017654|Ga0134069_1063568All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1172Open in IMG/M
3300017973|Ga0187780_10315296All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1101Open in IMG/M
3300018058|Ga0187766_10351533All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium965Open in IMG/M
3300018063|Ga0184637_10270493All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1033Open in IMG/M
3300018431|Ga0066655_10135181All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1440Open in IMG/M
3300018433|Ga0066667_10376945All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1138Open in IMG/M
3300018468|Ga0066662_10161046All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1714Open in IMG/M
3300021357|Ga0213870_1000257All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira28184Open in IMG/M
3300025165|Ga0209108_10077221All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1816Open in IMG/M
3300025173|Ga0209824_10000018All Organisms → cellular organisms → Bacteria156237Open in IMG/M
3300025173|Ga0209824_10124355All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium939Open in IMG/M
3300025289|Ga0209002_10368743All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium825Open in IMG/M
3300025311|Ga0209343_10047459All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira4044Open in IMG/M
3300025311|Ga0209343_10179655All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1614Open in IMG/M
3300025312|Ga0209321_10189871All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1068Open in IMG/M
3300025313|Ga0209431_10408251All Organisms → cellular organisms → Bacteria → Terrabacteria group → Armatimonadetes → unclassified Armatimonadetes → Armatimonadetes bacterium JGI 0000077-K191047Open in IMG/M
3300025318|Ga0209519_10305531All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium922Open in IMG/M
3300025319|Ga0209520_10147593All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1492Open in IMG/M
3300025324|Ga0209640_10056154All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium3413Open in IMG/M
3300025324|Ga0209640_10250251All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1493Open in IMG/M
3300025325|Ga0209341_10342453All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1223Open in IMG/M
3300025327|Ga0209751_10255366All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1488Open in IMG/M
3300026296|Ga0209235_1001343All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira13276Open in IMG/M
3300026297|Ga0209237_1002062All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira11989Open in IMG/M
3300026298|Ga0209236_1058578All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1881Open in IMG/M
3300026317|Ga0209154_1034293All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2318Open in IMG/M
3300026318|Ga0209471_1073520All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1522Open in IMG/M
3300026324|Ga0209470_1035984All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2492Open in IMG/M
3300026325|Ga0209152_10077299All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1217Open in IMG/M
3300026327|Ga0209266_1005702All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira7464Open in IMG/M
3300026327|Ga0209266_1114847All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1153Open in IMG/M
3300026329|Ga0209375_1054447All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1974Open in IMG/M
3300026331|Ga0209267_1149313All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium963Open in IMG/M
3300026332|Ga0209803_1009478All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii5253Open in IMG/M
3300026335|Ga0209804_1033244All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2602Open in IMG/M
3300026532|Ga0209160_1136328All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1168Open in IMG/M
3300026537|Ga0209157_1058117All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2005Open in IMG/M
3300026538|Ga0209056_10135033All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1925Open in IMG/M
3300026538|Ga0209056_10196596All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1481Open in IMG/M
3300026540|Ga0209376_1052192All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2345Open in IMG/M
3300026542|Ga0209805_1086992All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1503Open in IMG/M
3300026552|Ga0209577_10005438All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira11880Open in IMG/M
3300027835|Ga0209515_10006308All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira14863Open in IMG/M
3300027846|Ga0209180_10049156All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2322Open in IMG/M
3300027862|Ga0209701_10017687All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii4653Open in IMG/M
3300027862|Ga0209701_10304949All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium914Open in IMG/M
3300027875|Ga0209283_10087515All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2026Open in IMG/M
3300027882|Ga0209590_10121701All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1587Open in IMG/M
3300028536|Ga0137415_10090312All Organisms → cellular organisms → Bacteria2920Open in IMG/M
3300031820|Ga0307473_10053657All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1926Open in IMG/M
3300031820|Ga0307473_10298766All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1012Open in IMG/M
3300032018|Ga0315272_10162383All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1055Open in IMG/M
3300032180|Ga0307471_100416752All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1475Open in IMG/M
3300032275|Ga0315270_10446448All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium829Open in IMG/M
3300032770|Ga0335085_10069569All Organisms → cellular organisms → Bacteria4662Open in IMG/M
3300033004|Ga0335084_10286302All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium1704Open in IMG/M
3300033803|Ga0314862_0003245All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium2470Open in IMG/M
3300034177|Ga0364932_0003441All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira moscoviensis5978Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil28.85%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.12%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.54%
WastewaterEnvironmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater4.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.81%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil4.81%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.88%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.88%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.92%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater1.92%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.92%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.92%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.92%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater0.96%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment0.96%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.96%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland0.96%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.96%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.96%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009777Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking waterEnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017973Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_20_MGEnvironmentalOpen in IMG/M
3300018058Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021357Freshwater microbial communities from subterranean cave lake in Wind Cave National Park, South Dakota, United States - WICALVC2017EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025173Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes)EnvironmentalOpen in IMG/M
3300025289Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 2EnvironmentalOpen in IMG/M
3300025311Groundwater microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_0.2 (SPAdes)EnvironmentalOpen in IMG/M
3300025312Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 4 - CSP-I_5_4EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300025318Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1EnvironmentalOpen in IMG/M
3300025319Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025325Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025327Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027835Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032018Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_middleEnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032275Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_bottomEnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033803Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_0_10EnvironmentalOpen in IMG/M
3300034177Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1000195033300002560Grasslands SoilMMARAVSASCAWLLCGXLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGXIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
JGI25384J37096_1000618763300002561Grasslands SoilMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
JGI25382J43887_1007900133300002908Grasslands SoilHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGAGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQXEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0066674_1017517513300005166SoilSTERGCAMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0066677_1014884013300005171SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLHWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLVTVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLNQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP*
Ga0066679_1006577023300005176SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWMTPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSVEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLERAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP*
Ga0066690_1052179413300005177SoilLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLNQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYRE
Ga0066676_1017865923300005186SoilTPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0066682_1002420433300005450SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0070706_10046415123300005467Corn, Switchgrass And Miscanthus RhizosphereAYLLGLAHFLLYNPDRALPMFDACLSLLDRDQSAPQQRLDCLYWSARVYALKGALAWYDRTSILDGLFKSRRAITVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARALVARALALERDNLRANYLQARLDLYHNGRRDLARSRFAMVRKLLDRGVGGPEALLLRRWVDFAQAEVAFLDQNFPAAIAYADAYLSQVPDGAEGYALKGACLKFLGQQKAGEMLIDKATELNPFVRRYREP*
Ga0070697_10002877413300005536Corn, Switchgrass And Miscanthus RhizosphereRRALFDRHDPRLALRRLEGRWTTPLTDGETAYLLGLAHFLLYDPDRALPMFDACLSFLDRDQSAPHQRLDCLYWSARAYALKGALAWYDRTSILDGLFKSRRAITVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSLVARALALERDNLRANYLQARLDLYHNGRRDLARSRFAMVRKLLDRGVGGPEALLLRRWVDFAQAEVAFLDQNFPAAIAYADAYLSQVPDGAEGYALKGACLKFLGQQKAGEMLIDKATELNPFVRRYREP*
Ga0066697_1003192433300005540SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0066695_1016689813300005553SoilKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0066704_1003772123300005557SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0066699_1017261223300005561SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAIQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLVTVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP*
Ga0066703_1005232933300005568SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSVEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLERAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLNQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP*
Ga0066691_1018059813300005586SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWVTPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSVEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLERAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQD
Ga0066659_1011804013300006797SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0066660_1000999463300006800SoilMMARAVPASCAWLLCWALGLATPPDARHTKPQQEAIQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGADGYALKGACLKFLGQQEDGEVLIKKARELNPLVRRYREP*
Ga0099794_1000959313300007265Vadose Zone SoilARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSKTPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMLAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP*
Ga0066710_10022425833300009012Grasslands SoilMAARAVCGLCALFLGGGFSLATPADANHTTPHREAVQAARSALFDRKDPGLALRLLEGMWASPLTDSETPYLLGLAHVLLYDPDRALSMFDACLSLEGRDQSVPQRRLDCLYWSARAYSLKAALAWYYRQSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGAMLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALDRDNLRAQYLQARLDLYYNGRRDLARNRFATARQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFRAAIAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLIKKGQELNPLVRRYREP
Ga0066710_10114056923300009012Grasslands SoilMTSTERGCSTAARAVSASVAWLLCATLDPATPADAHHTKPHQEAVQAARSALFDRKDPSLALRHLEGLWANSPTDGETLYLLGLAHFLLYDPDRALSMFDACLSREGLEQPVAHQRLECLYWSARASALKAALEWYHRTSILDGVIKSRRAIMIALDLYEQVLTQAPGHVGALLGQAEYYMAAPYLPPLAYGDVDKARSLVARALSLERDNLRARYLQARLDLYYNGRRDLARSGFATVRLLLDRGAGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGAEGYALKGACLKFLGQEEAGEALINKARELNPHVRRYREP
Ga0099829_1014692523300009038Vadose Zone SoilMARAVSASCAWLLCVALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP*
Ga0099830_1004737543300009088Vadose Zone SoilMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP*
Ga0099827_1000459343300009090Vadose Zone SoilMMARAVSASCAWLLCGALGPAAPVDAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADLPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTDAYLTQVPDGAEGYALKGACLKFLGQQEAGEALIKKARELNPLVRRYREP*
Ga0066709_10005157363300009137Grasslands SoilLRHLEGLWADSPTDSETPYLLGLAHFLLYDPDRALSMFDACLSRESLDKRVAHQRLECLYWSARASSLKAALAWYQRTSILDGVIKSRRAIRIALDLYEQVLAQEPGHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALSLERDNLRARYLQARLELYYNGRRDLARSGFATVRQLLDRGAGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTDAYLSQVPDGAEGYALKGASLKFLGQPEAGEALINKARELNPHVRRYREP*
Ga0105164_10000525203300009777WastewaterMVVLAVALSVWTGPLSAQSSDSAPPYRLGLAAFLLYDPDHALLLFEQCITQEPQHEDSPQGLDCLYWSGRAYAMKAALAWFYRTSIVEGLVKGRWAIGTALDRYEHVLAKAPNHVGALLSQAEYYMAAPYLPPMAYGDVAKARALVARGLALDADNPRAHYLQARLDLYHNNRRDQARTGYARVRYLIEKGIGGVDATLLQRWVEFAQAEVAFLDEDYPAAIAYADAYIRQVPDGADGYALKGASLKFMGRAADGEVQIQHAQALNPHVRRYREP*
Ga0105164_1004207723300009777WastewaterMVALAVALSVWAAPVSAQPSDSAQPYQLGLAAFLLYDPDHALMLFEQCIEETRNGDSPQRLDCLYWSGRAWAMKAALAWFYRTSMVEGLFKSRWAIGAALDRYEQVLAKEPDHAGALLSQAEYYMAAPYLPPMAYGDVAKARALVARALALEADNPRAHYLQARLDLYHNNRRDQARTGYARVRDLLERGIGGVEAVLLQRWVEFAQAEVAFLDQDYPAAIAYADAYIRQVPDGADGYALKGASLKFLGQEADGEAHLQKARAFNPHVRRYREPSAKPVAP*
Ga0105164_1018685613300009777WastewaterQVKRLLEGLWAKQPSDSETPYLLGVAHFLLYDPDKSLAMFEVCLTNETRSGDSPQRLECLYWSGRASALKAALVWHYRESIVDGLVKSRRAIGAALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVAKARALVARALALEADNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLERGIGGAEAALLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGADGYALKGASLKFLGQEAAGEAQIKKAREFNPHVRRYREP*
Ga0134070_1000480033300010301Grasslands SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0134088_1004364233300010304Grasslands SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYRE
Ga0136847_1097573733300010391Freshwater SedimentVLAAVLAAGAVPVAAHHTKPHEEAVLAARIALFDKKDPQQVKRLLEGMWVEQPSDSETPYLLGVAHVLLYDPDKALPLLEGCLANESRNGDTTQHLDCLYWSGRASALKAALVWYYRESIVDGLLKSWRAIRAALDRYEQVLAKAPDHVGTLLSQAEYRMAAPYLPPLAYGDVAKARMLVAKALALEADNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLGRGVGGAETVLLWRWVDFAQAEVAFLDQDYNAAILYADAYLRQVPDGAEGYALKGASLKFLGQEAAGEAQIKKAREFNPHVRRYREP*
Ga0137393_1016509133300011271Vadose Zone SoilMMARAVSASCAWLLCEALGPATPVDAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWANLPTDSETPYLLGLAHFLLYDPDRALSMFDACLALEGRDQSVPPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYRE
Ga0137388_1014132723300012189Vadose Zone SoilMARAVSASCAWLLCVALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP*
Ga0137388_1023722513300012189Vadose Zone SoilMMARAVSASCAWLLCGALGPATPVDAHHTKLHQEAVQAARSALFDRKDPGLALRHLEGMWANLPTDSETPYLLGLAHFLLYDPDRALSMFDACLALEGRDQSVPPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAVLDQDLPAAVA*
Ga0137374_1001608273300012204Vadose Zone SoilMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQGRLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLVNKARELNPLVRRYREP*
Ga0137380_1021791023300012206Vadose Zone SoilMARAVSTSCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP*
Ga0137381_1020108623300012207Vadose Zone SoilMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLVNKARELNPLVRRYREP*
Ga0137379_1010925933300012209Vadose Zone SoilMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0137387_1026005313300012349Vadose Zone SoilMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP*
Ga0137371_1008551223300012356Vadose Zone SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP*
Ga0137385_1000965063300012359Vadose Zone SoilMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP*
Ga0137375_1014675533300012360Vadose Zone SoilMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRWGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQGRLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPSAVAYTDAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVR
Ga0137396_1005529633300012918Vadose Zone SoilMMARAVSASCAWLLCGALGPATPVDAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADLPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVSPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQTPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGAEGYALKGACLKFLGQQEAGEALIKKARELNPLVRRYREP*
Ga0137416_1005041233300012927Vadose Zone SoilMMARAVSASCAWLLCGALGPATPVDAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADLPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVSPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEALIKKARELNPLVRRYREP*
Ga0134069_106356813300017654Grasslands SoilHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0187780_1031529613300017973Tropical PeatlandMTSTERPVAGRRTGWNVAVGLMAVLATGAVPAWGHHAKPHQEAIQAARTALFEKKDPQQAKRLLEGMWSEQPSDSETPYLLGVAHVLLYEPDKALLLFDSCLANESRNGNSSQRLECLYWSGRAFSMKAALAWYYRENILDGVMKSQRATTAARDRYEQVLALAPDHTGALLSQAEYYMAAPYLPPLAYGDVDKARTLVARALELEADSPRAQYLQSRLDLYYNGRRDRARAGFAKVRQLLDGGSGGAEAVLLRRWVDFAQAEVAFLDRDYRAAIAYADAYLRQVPDGAEGYALKGASLKFLGEQAAGDAQIQRARELNPNVRRYREP
Ga0187766_1035153313300018058Tropical PeatlandAVLATGAVPAWGHHAKPHQEAIQAARTALFEKKDPQQAKRLLEGMWSEQPSDSETPYLLGVAHVLLYEPDKALLMFDSCLANESRNGNSSQRLECLYWSGRAFSMKAALAWYYRENILDGVMKSQRATTAARDRYEQVLALAPDHTGALLSQAEYYMAAPYLPPLAYGDVDKARTLVARALELEADSPRAQYLQSRLDLYYNGRRDRARAGFAKVRQLLDGGSGGAEAVLLRRWVDFAQAEVAFLDRDYRAAIAYADAYLRQVPDGAEGYALKGASLKFLGEQAAGDAQIQRARELNPNVRRYREP
Ga0184637_1027049313300018063Groundwater SedimentPHEEAVLAARIALFDKKDPQQVKRLLEGMWVKQPSDSETPYLLGVAHVLLYDPDKALPLLEGCLANESRNGDTTQRLDCLYWSGRASALKAALVWYYRESIVDGLLKSWRAIRAALDRYEQVLAKAPDHVGTLLSQAEYRMAAPYLPPLAYGDVAKARMLVAKALALEADNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLGRGVGGAETVLLWRWVDFAQAEVAFLDQDYNAAISYADAYLRQVPDGAEGYALKGASLKFLGQEAAGEAQIKKAREFNPHVRRYREP
Ga0066655_1013518123300018431Grasslands SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0066667_1037694513300018433Grasslands SoilAWTDASRPMSSTERGCSTAARAASASLAWLLCATLGLATPADAHHTKPHQEAVQAARSALFDRKDPSLALRHLEGLWANSPTDSETPYLLGLAHFLLYDPDRALSMFDACLSREGLDQRVAHQRLECLYWSARASSLKAALAWYHRTSILDGVIKSRRAIRIALDLYEQVLAQAPGHIGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALSLERDNLRARYLQARLELYYNGRRDLARSGFATVRQLLDRGAGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTDAYLSQVPDGAEGYALKGASLKFLGQQEAGEALINTARELNPHVRRYREP
Ga0066662_1016104623300018468Grasslands SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRYLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0213870_1000257103300021357FreshwaterMVRRLLEGPWGEQPSDSDTPYLLGVAHFLLYNPDQSLAWFERCLANETQSGDSPQRLDCLYWSGRAGAMKAAFAWYYRESILEGLVKSRRAIGAALDRYEHVLSKAPDHVGAMLSQAEYYMAAPYLPPLAYGDVVTARALVARALALEVDNPRAQYLQARLDLYANNRRDQARAGYARVRHLLERGIGGLEIVLLQRWVDFAQAEVAFLDQDYPAAIAYADAYIRQVPDGADGYALKGASLKFLGQEADGEAQIKKAREFNPRVRRYREP
Ga0209108_1007722133300025165SoilMMSSWKSIAVMDGSRPSTSTESRAGERRFAILNVTVVLAAMLAAWAVPVAAHHTQPHQEAVQAARVALFEKKDPQQVKRLLEGMWVAQPADSETPYLLGVAYFLLYDPDKALPLLEGCLTNESRNGNSPQHVDCLYWSGRASAMKAAFSWYYRESVVDGLIKSRRAIGAALDRYEQVLAKMPDHVGAMLSQAEYYMAAPYLPPMAYGDVTKARTLVARALALEANNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYALKGASLKFLGQDVAGEAQLKKARELNPHVRRYREP
Ga0209824_100000181573300025173WastewaterMVVLAVALSVWTGPLSAQSSDSAPPYRLGLAAFLLYDPDHALLLFEQCITQEPQHEDSPQGLDCLYWSGRAYAMKAALAWFYRTSIVEGLVKGRWAIGTALDRYEHVLAKAPNHVGALLSQAEYYMAAPYLPPMAYGDVAKARALVARGLALDADNPRAHYLQARLDLYHNNRRDQARTGYARVRYLIEKGIGGVDATLLQRWVEFAQAEVAFLDEDYPAAIAYADAYIRQVPDGADGYALKGASLKFMGRAADGEVQIQHAQALNPHVRRYREP
Ga0209824_1012435513300025173WastewaterLEGLWAKQPSDSETPYLLGVAHFLLYDPDKSLAMFEVCLTNETRSGDSPQRLECLYWSGRASALKAALVWHYRESIVDGLVKSRRAIGAALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVAKARALVARALALEADNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLERGIGGAEAALLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGADGYALKGASLKFLGQEAAGEAQIKKAREFNPHVRRYREP
Ga0209002_1036874313300025289SoilVLVSALAAWAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSPQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYA
Ga0209343_1004745923300025311GroundwaterMDGSRLSTSTESRAGERRSAILNVTAVLAAVLAAWAVPVAAHHTQPHQEAVQAARVALFEKKDPQQVKRLLEGMWVAQPADSETPYLLGVAYFLLYDPDKALPLLEGCLANESRNGNSPQHVDCLYWSGRASAMKAALSWYYRESVVDGLIKSRRAIGAALDRYEQVLAKMPDHVGAMLSQAEYYMAAPYLPPMAYGDVTKARTLVTRALALEANNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLERGIGGVEVVFLRRWVDFAQAEVAFLDQDYAAAIAYSDAYIRQVPDGADGYALKGASLKFLGQAAAGEAQIKKARELNPHVRRYREP
Ga0209343_1017965513300025311GroundwaterMAVLVSALAAWAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSPQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAI
Ga0209321_1018987123300025312SoilMAVLVSALAAWAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSPQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGA
Ga0209431_1040825113300025313SoilEGRRTADPFSYGLGTGDNPIFKGMYRASALYSGASIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSSQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYALKGASLKFLGQDVAGEAQLKKARELNPHVRRYREP
Ga0209519_1030553113300025318SoilAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSSQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYALKGASLKFLGQDVAGEAQLKKARELNPHVRRYREP
Ga0209520_1014759323300025319SoilMMSSWKSIAVMGGSRPSTSTEYQAGGRRPAILNVMAVLVSALAAWAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSPQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYALKGASLKFLGQDVAGEAQLKKARELNPHVRRYREP
Ga0209640_1005615433300025324SoilMWVAQPADSETPYLLGVAYFLLYDPDKALPLLEGCLTNESRNGNSPQHVDCLYWSGRASAMKAAFSWYYRESVVDGLIKSRRAIGAALDRYEQVLAKMPDHVGAMLSQAEYYMAAPYLPPMAYGDVTKARTLVARALALEANNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLERGIGGVEVVFLRRWVDFAQAEVAFLDQDYAAAIAYSDAYIRQVPDGADGYALKGASLKFLGQAAAGEAQIKKARELNPHVRRYREP
Ga0209640_1025025113300025324SoilAGGRRPAILNVMAVLVSALAAWAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSPQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYALKGASLKFLGQDVAGEAQLKKARELNPHVRRYREP
Ga0209341_1034245313300025325SoilAVWAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSPQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYALKGASLKFLGQDVAGEAQLKKARELNPHVRRYREP
Ga0209751_1025536613300025327SoilILNVMAVLVSALAAWAVPAEAHHTKPHQEAIQAARAALVEKKDPQQVKRLLEGLWAEQPSDSETPYLLGMAHFLLYDPGKALALFEVCLANEARSGDSPQRLECLYWSGRASALKAALVWYYRESIVDGLIRSRRAIGVALDRYEQVLAKAPDHVGTMLSQAEYYMAAPYLPPMAYGDVARARALVSRALVLETDNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGAEVVLLRRWVDFAQAEVAFLDQDYPAAIAYSDAYIRQVPDGAEGYALKGASLKFLGQDVAGETQLKKARELNPHVRRYREP
Ga0209235_1001343193300026296Grasslands SoilMMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGAGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209237_1002062103300026297Grasslands SoilMMARAVSASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGAGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQSEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209236_105857813300026298Grasslands SoilPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209154_103429313300026317SoilTKPHQEAIQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLNQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP
Ga0209471_107352023300026318SoilPASCAWLLCWVLCLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAYFLLYDPDRALSMFDACLSVEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLERAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP
Ga0209470_103598433300026324SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209152_1007729913300026325SoilDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAYFLLYDPDRALSMFDACLSVEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLERAPEHVGALLGQAEYYMVAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLNQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP
Ga0209266_100570233300026327SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209266_111484713300026327SoilKSIAVTGASRPMTSTERGCAMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209375_105444713300026329SoilHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209267_114931313300026331SoilMMARAVPASCAWLLCWVLCLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSVEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLERAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAQYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYL
Ga0209803_100947873300026332SoilMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209804_103324423300026335SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAIQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLNQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP
Ga0257176_103860313300026361SoilPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPPRRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTDAYLTQVPDGAEGYALKGACLKFLGQQEAGEGLIKKA
Ga0209160_113632823300026532SoilMVARAVSASCAWLLCGALGLATPVDAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADLPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVSPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQTPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTDAYLSQVPDGAEGYALKGACLKFLGQQEAGEALIKKARELNPLVRRYREP
Ga0209157_105811733300026537SoilDAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVLINKARELNPLVRRYREP
Ga0209056_1013503313300026538SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYALKAALAWYSRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARAFVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYAL
Ga0209056_1019659633300026538SoilLEGLWANSPTDSETPYLLGLAHFLLYDPDRALSMFDACLSRESLDKRVAHQRLECLYWSARASSLKAALAWYHRTSILDGVIKSRRAIRIALDLYEQVLAQEPGHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALSLERDNLRARYLQARLELYYNGRRDLARSGFATVRQLLDRGAGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTDAYLSQVPDGAEGYALKGASLKFLGRPEAGEALINKARELNPHVRRYREP
Ga0209376_105219233300026540SoilMMARAVSASCAWLLCGTLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALHHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRVGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRYSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDHGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTN
Ga0209805_108699213300026542SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLVTVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP
Ga0209577_1000543873300026552SoilMMARAVPASCAWLLCWALGLATPADARHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWMNPPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPQQRLDCLYWSARAYSLKAALAWFYRHSILDGVFKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALALERDNLQAHYLQARLDLYYNGRRDLARSRLATVRQLLDRGVGGADAVLLRRWVDFAQAEVAFLDQDFPAAIAYADAYLSQVPDGADGYALKGACLKFLGQQEAGEVLIKKARELNPLVRRYREP
Ga0209515_10006308213300027835GroundwaterVLAATLAAGAVPVAAHHTKPHEAAVLAARIALFDKKDPQQVKRLLEGMWAEQPSDSETPYLLGVANFLLYDPDKALPLLEGCLANESRNGDTTQRLDCLYWSGRASALKAALVWYYRESVVDGLLKSWRAILAALDRYEQVLAKAPDHVGTLLSQAEYYMAVPFLPPLAYGDVAKARMLVAKALALEADNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLDRGVGGAETALLRRWVDFAQAEVAFLDQDYNAAISYADAYLRQVPDGAEGYALKGASLKFLGQEAAGEAQIKKAREFNPHVRRYREP
Ga0209180_1004915613300027846Vadose Zone SoilMARAVSASCAWLLCVALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP
Ga0209701_1001768773300027862Vadose Zone SoilLGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP
Ga0209701_1030494913300027862Vadose Zone SoilMMARAVSASCAWLLCGALGPATPVDAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWAGLPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVRAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQD
Ga0209283_1008751513300027875Vadose Zone SoilMARAVSASCAWLLCVALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGGGDQSVPQQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEVMINKARELNPLVRRYREP
Ga0209590_1012170113300027882Vadose Zone SoilQEAVQAARSALFDRKDPGLALRHLEGMWADLPTDSETPYLLGLAHFLLYDPDRALSMFDACLSLEGRDQSVPPQRLDCLYWSARAYSLKAALAWYYRHSILDGVIKSRRAIMVALDLYEQVLAQAPEHVGALLGQAEYYMAAPYLPPLAYGDVDKARSFIARALALERDNLRAHYLQARLDLYYNGRRDRARSRFATVRQLLDRGVGGAEAVLLRRWVDFAQAEVAFLDQDFPAAIAYTDAYLTQVPDGAEGYALKGACLKFLGQQEAGEALIKKARELNPLVRRYREP
Ga0137415_1009031223300028536Vadose Zone SoilMKQQAHLHPPTERRSFATLWSSALLVCAVLASCAWLLCGALGPATPADAHHTKPHQEAVQAARSALFDRKDPGLALRHLEGMWADPLTDSETPYLLGLAHFLLYDPDRALSMFDACLSRGAGDQSVPQQRLDCLYWSARAYSLKAALAWYSRHSILDGAIKSRRAIMVALDLYEQVLAQAPEHVGALLGQSEYYMAAPYLPPLAYGDVDKARSFVARALVLERDNLRAHYLQARLDLYYNGRRDLARSRFATVRQLLDRGMGGAEAVLLRRWVDFAQAEVAFLDQDFPAAVAYTNAYLSQVPDGAEGYALKGACLKFLGQQEAGEALIKKARELNPLVRRYREP
Ga0307473_1005365723300031820Hardwood Forest SoilMLVAGAVPVAAHHTQPHQEAVLAARIALFDKKDPQQVTRLLEGMWAAQPSDSETPYLLGVAHFLLYEPDKALPLFDACESKAGGPAQQVECLYWSGRASALKAALAWYYRESIVDGAIKSRRAIMAALERYEQVLARAPDHVGSLLSQAEYYMAAPYLPPLAYGDGEKARMLVARALALEPDNPRANYLQARLNLYYNGRRDLARTGFARVRHLLEQDIGGVEVVLLRRWADFAQAEVAFLDQDYNAAISYADAYLRQVPDGAEGYALKGASLKFLEQEAAGEAQIKKARELNPHVRRYREP
Ga0307473_1029876623300031820Hardwood Forest SoilTLNVAAVLTALLAAGAVPVAAHHTQPHQEAVLAARIALFDKKDPQQVKRLLEGMWAAQPSDNETPYLLGVAHFLLYDPDKALPLLEGCLANESRNGDSPLRLDCLYWSGRASALKAALAWYYRESIVDGVLTSRRAIMAALERYEQVLAKAPDHAGTLLSQAEYYMAAPYLPPLAYGDVAKARLLVAKALALEADNPRAHYLQSRLDLYYNGRRDQARIGYARVRQLLDRGVGGAETVFLRRWVDFAQAEVAFLDQDYHAAVSYADAYLRQVPDGAEGYALKGASLKFLGQEAAGEAQIKKAREFNPRVRRYREP
Ga0315272_1016238323300032018SedimentMISSWKLIAVMDGSRPSTSTERRAGGRRPAILNVTAILASVLAAWALPAEAHHTKPHQEAVQAARAALVEKKDPRQVKRLLEGLWAEQPSDSETPYLLGVAHFLLYDPDKSLAMFEVCLANEARNGDSPQRLECLYWSGRASALKAALVWYYRESIVAGLVKSRRAIGAALDRYEQVLEKAPDHAGTMLSQAEYYMAAPYLPPMAYGDVARARTLVSRALALEADNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGVEAVLLRRWVDFAQAEVAFLDQDYPAAIAYADAYIRQVPDGAEGYALKGASLKFLGQD
Ga0307471_10041675213300032180Hardwood Forest SoilMTSSWKSIAVMDGSRPTTSTERRAAVRRFAILKFTAALAVALVAGAVPVAAHHTQPHQEAVLAARIALFDKKDPQQVKRLLEGLWAEQSSDSETPYLLGVAHFLLYDPDKALLLLEGCLTNESRKADATQRLDCLYWSGRASALKAALAWYYRESIIDGLVKSRRAIMVALERYEQVLAKTPDHVGALLSQAEYYMAAPYLPPLAYGDVAKARMLVVKALALEPDNPRANYLQARLNLYYNGRRDLARTGYARVRHLLEQGVGGVEVVLLRRWVDFAQAEVAFLDQDYNAAI
Ga0315270_1044644813300032275SedimentGSRPSTSTERRAGGRRPAILNVTAILASALAAWAVPAEAHHTKPHQEAVQAARAALVEKKDPRQVKRLLEGLWAEQPSDSETPYLLGVAHFLLYDPDKSLAMFEVCLANEARNGDSPQRLECLYWSGRASALKAALVWYYRESIVAGLVKSRRAIGAALDRYEQVLEKAPDHAGTMLSQAEYYMAAPYLPPMAYGDVARARTLVSRALALEADNPRAHYLQARLDLYYNSRRDQARAGYARVRQLLERGVGGVEAVLLRRWVDFAQAEVAFLDQD
Ga0335085_1006956933300032770SoilMDGSRPTTSTERRTARRRSAILIIVAALAAALAAGIVPVAAHHTKLHQEAVQAARIALFEKKDPQQVTRLLEGLWAEQPSDSETPYLLGVAHFLLYEPDRALTLFDQCLANEPQSGESPQRLDCLYWSGRASALKAALAWYYRESIVAGLVKSRRAIMAALDRYEQVLAKAPEHVGAMLSQAEYYMAAPYLPPIAYGDVDKARALVARALVLEADNPHAWYLQARLDLYHNGRRDRARTGFARVRQLLDRGVGGVNVVLLRRWVDFAQAEVAFLDQDYAATIAYADAYLRQVPDGAEGYALKGASLKFLGQEAAGEAQIKKARELNPRVRRYREP
Ga0335084_1028630233300033004SoilAAALAAGIVPVAAHHTKLHQEAVQAARIALFEKKDPQQVTRLLEGLWAEQPSDSETPYLLGVAHFLLYEPDRALTLFDQCLANEPQSGESPQRLDCLYWSGRASALKAALAWYYRESIVAGLVKSRRAIMAALDRYEQVLAKAPEHVGAMLSQAEYYMAAPYLPPIAYGDVDKARALVARALVLEADNPHAWYLQARLDLYHNGRRDRARTGFARVRQLLDRGVGGVNVVLLRRWVDFAQAEVAFLDQDYAATIAYADAYLRQVPDGAEGYALKGASLKFLGQEAAGEAQIKKARELNPRVRRYREP
Ga0314862_0003245_743_16603300033803PeatlandVLAAGIVPVAAHHTKLHQEAVQAARIALFEKKDPQQVIRLLEGLWAEQPSDSETPYLLGVAHFLLYEPDRALTLFDQCLANEPQSGESPQRLDCLYWSGRASALKAALAWYYRESIVAGLVKSRRAIMAALDRYEQVLAKAPEHVGAMLSQAEYYMAAPYLPPIAYGDIDKARALVARALALEAENPHAWYLQARLDLYYNGRRDRARTEFARVRQLLDRNVGGVNAVLLRRWVDFAQAEVAFLDQDYAAAIAYADAYLRQVPDGAEGYALRGASLKFLGQEAAGEAQIKKARELNPHVRRYREQ
Ga0364932_0003441_3201_41303300034177SedimentVLAAVLAAGAVPVAAHHTKPHEEAVLAARIALFDKKDPQQVKRRLEGMWAEQPSDSETPYLLGVAHFLLYDPDKALPLLEGCLANESRNGDTTQRLDCLYWSGRAYALKAALAWYYRESIIDGLIKSQRAIKTALDRYEQVLAKAPDHVGTLLSQAEYYMAAPFLPPLAYGDVAKARMLVAKALALEADNPRAHYLQARLDLYYNGRRDQARTGYARVRQLLDRGVGGAETVLLRRWVDFAQAEVAFLDQDYNAAISYADAYLRQVPDGAEGYALKGASLKFLGQEAAGEAQIKKAWEFNPHVRRYREP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.