NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F072224

Metagenome / Metatranscriptome Family F072224

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F072224
Family Type Metagenome / Metatranscriptome
Number of Sequences 121
Average Sequence Length 260 residues
Representative Sequence MLGRFLGVLLVATLMPILAIADAPIASSSFVGAEDPLSENGAWAALTSLSPNGTRFQKNNGAYPDRLIYRDHAGARTTAVVPADHYSEIVVGHVGNYDYNNVGPIVRVQPTGPSIDSHYLWWASGPNGVNYLYRIDADGTTYSANGLIPTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYHTGPDTTKYSTGTAGMLAFGSSFMTDAMIASWSSGTAPVSSGTWASSTFAGTENPLDEGDRWYPLP
Number of Associated Samples 89
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 52.89 %
% of genes near scaffold ends (potentially truncated) 96.69 %
% of genes from short scaffolds (< 2000 bps) 84.30 %
Associated GOLD sequencing projects 73
AlphaFold2 3D model prediction Yes
3D model pTM-score0.66

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (97.521 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(39.669 % of family members)
Environment Ontology (ENVO) Unclassified
(43.802 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.893 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 4.01%    β-sheet: 33.58%    Coil/Unstructured: 62.41%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.66
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 121 Family Scaffolds
PF00041fn3 3.31
PF00132Hexapep 2.48
PF04185Phosphoesterase 1.65
PF01370Epimerase 1.65
PF02585PIG-L 0.83
PF08241Methyltransf_11 0.83
PF03808Glyco_tran_WecG 0.83
PF00483NTP_transferase 0.83
PF12708Pectate_lyase_3 0.83
PF02954HTH_8 0.83
PF00990GGDEF 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 121 Family Scaffolds
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 1.65
COG1922UDP-N-acetyl-D-mannosaminuronic acid transferase, WecB/TagA/CpsF familyCell wall/membrane/envelope biogenesis [M] 0.83
COG2120N-acetylglucosaminyl deacetylase, LmbE familyCarbohydrate transport and metabolism [G] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms97.52 %
UnclassifiedrootN/A2.48 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002245|JGIcombinedJ26739_101107566Not Available679Open in IMG/M
3300002560|JGI25383J37093_10003278All Organisms → cellular organisms → Bacteria4891Open in IMG/M
3300002908|JGI25382J43887_10124692All Organisms → cellular organisms → Bacteria1336Open in IMG/M
3300002912|JGI25386J43895_10055891All Organisms → cellular organisms → Bacteria1118Open in IMG/M
3300002914|JGI25617J43924_10124901All Organisms → cellular organisms → Bacteria904Open in IMG/M
3300002917|JGI25616J43925_10130767All Organisms → cellular organisms → Bacteria1015Open in IMG/M
3300005174|Ga0066680_10163942All Organisms → cellular organisms → Bacteria1395Open in IMG/M
3300005174|Ga0066680_10427957All Organisms → cellular organisms → Bacteria838Open in IMG/M
3300005174|Ga0066680_10604921All Organisms → cellular organisms → Bacteria686Open in IMG/M
3300005175|Ga0066673_10048821All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria2125Open in IMG/M
3300005177|Ga0066690_10674121All Organisms → cellular organisms → Bacteria687Open in IMG/M
3300005178|Ga0066688_10437020All Organisms → cellular organisms → Bacteria847Open in IMG/M
3300005178|Ga0066688_10604058All Organisms → cellular organisms → Bacteria705Open in IMG/M
3300005179|Ga0066684_10139120All Organisms → cellular organisms → Bacteria1530Open in IMG/M
3300005181|Ga0066678_10310330All Organisms → cellular organisms → Bacteria1034Open in IMG/M
3300005186|Ga0066676_10740031All Organisms → cellular organisms → Bacteria669Open in IMG/M
3300005187|Ga0066675_10894867All Organisms → cellular organisms → Bacteria671Open in IMG/M
3300005447|Ga0066689_10080019All Organisms → cellular organisms → Bacteria1831Open in IMG/M
3300005447|Ga0066689_10487199All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. ARR65777Open in IMG/M
3300005447|Ga0066689_10527832All Organisms → cellular organisms → Bacteria746Open in IMG/M
3300005451|Ga0066681_10285015All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Corynebacteriales incertae sedis → Tomitella → Tomitella biformata1008Open in IMG/M
3300005555|Ga0066692_10699057All Organisms → cellular organisms → Bacteria629Open in IMG/M
3300005556|Ga0066707_10031113All Organisms → cellular organisms → Bacteria2955Open in IMG/M
3300005557|Ga0066704_10219295All Organisms → cellular organisms → Bacteria1289Open in IMG/M
3300005557|Ga0066704_10226964All Organisms → cellular organisms → Bacteria1266Open in IMG/M
3300005558|Ga0066698_10535784All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Rhodopseudomonas → Rhodopseudomonas palustris795Open in IMG/M
3300005559|Ga0066700_10556103All Organisms → cellular organisms → Bacteria800Open in IMG/M
3300005575|Ga0066702_10596194All Organisms → cellular organisms → Bacteria664Open in IMG/M
3300005576|Ga0066708_10634622All Organisms → cellular organisms → Bacteria682Open in IMG/M
3300005598|Ga0066706_10056230All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria2687Open in IMG/M
3300006755|Ga0079222_10249811All Organisms → cellular organisms → Bacteria1114Open in IMG/M
3300006794|Ga0066658_10134512All Organisms → cellular organisms → Bacteria1228Open in IMG/M
3300006794|Ga0066658_10219124All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1012Open in IMG/M
3300007255|Ga0099791_10205853All Organisms → cellular organisms → Bacteria928Open in IMG/M
3300007258|Ga0099793_10048800All Organisms → cellular organisms → Bacteria1860Open in IMG/M
3300007265|Ga0099794_10107478All Organisms → cellular organisms → Bacteria1396Open in IMG/M
3300007265|Ga0099794_10259768Not Available896Open in IMG/M
3300009012|Ga0066710_101377077All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1094Open in IMG/M
3300009012|Ga0066710_102077567All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium837Open in IMG/M
3300009038|Ga0099829_10010286All Organisms → cellular organisms → Bacteria6041Open in IMG/M
3300009038|Ga0099829_10252579All Organisms → cellular organisms → Bacteria1438Open in IMG/M
3300009088|Ga0099830_10636229All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium876Open in IMG/M
3300009089|Ga0099828_10151753All Organisms → cellular organisms → Bacteria2047Open in IMG/M
3300009089|Ga0099828_10872946All Organisms → cellular organisms → Bacteria804Open in IMG/M
3300009089|Ga0099828_10955697All Organisms → cellular organisms → Bacteria764Open in IMG/M
3300009143|Ga0099792_10060730All Organisms → cellular organisms → Bacteria1874Open in IMG/M
3300010304|Ga0134088_10019579All Organisms → cellular organisms → Bacteria2988Open in IMG/M
3300010323|Ga0134086_10155289All Organisms → cellular organisms → Bacteria837Open in IMG/M
3300010336|Ga0134071_10311541All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium791Open in IMG/M
3300011269|Ga0137392_10787629All Organisms → cellular organisms → Bacteria785Open in IMG/M
3300011271|Ga0137393_10574880All Organisms → cellular organisms → Bacteria966Open in IMG/M
3300011271|Ga0137393_10738380Not Available843Open in IMG/M
3300011271|Ga0137393_10893413All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300011271|Ga0137393_10986831All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium717Open in IMG/M
3300012169|Ga0153990_1095833All Organisms → cellular organisms → Bacteria664Open in IMG/M
3300012189|Ga0137388_10422194All Organisms → cellular organisms → Bacteria1236Open in IMG/M
3300012189|Ga0137388_11185013All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium701Open in IMG/M
3300012189|Ga0137388_11186012All Organisms → cellular organisms → Bacteria701Open in IMG/M
3300012202|Ga0137363_10096603All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2236Open in IMG/M
3300012202|Ga0137363_10682346All Organisms → cellular organisms → Bacteria869Open in IMG/M
3300012202|Ga0137363_11019192All Organisms → cellular organisms → Bacteria703Open in IMG/M
3300012203|Ga0137399_10043762All Organisms → cellular organisms → Bacteria3202Open in IMG/M
3300012203|Ga0137399_10231402All Organisms → cellular organisms → Bacteria1510Open in IMG/M
3300012362|Ga0137361_10891913All Organisms → cellular organisms → Bacteria807Open in IMG/M
3300012363|Ga0137390_10800449All Organisms → cellular organisms → Bacteria900Open in IMG/M
3300012363|Ga0137390_10831939All Organisms → cellular organisms → Bacteria880Open in IMG/M
3300012582|Ga0137358_10155726All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1556Open in IMG/M
3300012685|Ga0137397_10198759All Organisms → cellular organisms → Bacteria1491Open in IMG/M
3300012918|Ga0137396_10147108All Organisms → cellular organisms → Bacteria1713Open in IMG/M
3300012923|Ga0137359_10147179All Organisms → cellular organisms → Bacteria2098Open in IMG/M
3300012923|Ga0137359_10960221All Organisms → cellular organisms → Bacteria735Open in IMG/M
3300012927|Ga0137416_10285912All Organisms → cellular organisms → Bacteria1361Open in IMG/M
3300012927|Ga0137416_10745178All Organisms → cellular organisms → Bacteria863Open in IMG/M
3300012930|Ga0137407_10133257All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium2174Open in IMG/M
3300012944|Ga0137410_11006870All Organisms → cellular organisms → Bacteria710Open in IMG/M
3300012972|Ga0134077_10120293All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1030Open in IMG/M
3300015241|Ga0137418_10737282All Organisms → cellular organisms → Bacteria749Open in IMG/M
3300015245|Ga0137409_10831798All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300018433|Ga0066667_10250596All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1342Open in IMG/M
3300018468|Ga0066662_10102079All Organisms → cellular organisms → Bacteria2034Open in IMG/M
3300021046|Ga0215015_10532217All Organisms → cellular organisms → Bacteria1348Open in IMG/M
3300021080|Ga0210382_10030559All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium2021Open in IMG/M
3300021086|Ga0179596_10351568All Organisms → cellular organisms → Bacteria740Open in IMG/M
3300021559|Ga0210409_10083737All Organisms → cellular organisms → Bacteria → Acidobacteria2952Open in IMG/M
3300022718|Ga0242675_1023311All Organisms → cellular organisms → Bacteria889Open in IMG/M
3300024330|Ga0137417_1257538All Organisms → cellular organisms → Bacteria1974Open in IMG/M
3300026296|Ga0209235_1137415All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium998Open in IMG/M
3300026297|Ga0209237_1127360All Organisms → cellular organisms → Bacteria1054Open in IMG/M
3300026313|Ga0209761_1026091All Organisms → cellular organisms → Bacteria3572Open in IMG/M
3300026325|Ga0209152_10190060All Organisms → cellular organisms → Bacteria771Open in IMG/M
3300026328|Ga0209802_1098626All Organisms → cellular organisms → Bacteria1318Open in IMG/M
3300026328|Ga0209802_1200673All Organisms → cellular organisms → Bacteria768Open in IMG/M
3300026330|Ga0209473_1151430All Organisms → cellular organisms → Bacteria939Open in IMG/M
3300026332|Ga0209803_1163144All Organisms → cellular organisms → Bacteria852Open in IMG/M
3300026333|Ga0209158_1043356All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1878Open in IMG/M
3300026333|Ga0209158_1161906All Organisms → cellular organisms → Bacteria814Open in IMG/M
3300026355|Ga0257149_1037174All Organisms → cellular organisms → Bacteria691Open in IMG/M
3300026528|Ga0209378_1018732All Organisms → cellular organisms → Bacteria3911Open in IMG/M
3300026538|Ga0209056_10256029All Organisms → cellular organisms → Bacteria1241Open in IMG/M
3300026548|Ga0209161_10016579All Organisms → cellular organisms → Bacteria5385Open in IMG/M
3300026548|Ga0209161_10172312All Organisms → cellular organisms → Bacteria1231Open in IMG/M
3300026551|Ga0209648_10128218All Organisms → cellular organisms → Bacteria2035Open in IMG/M
3300027655|Ga0209388_1096941All Organisms → cellular organisms → Bacteria846Open in IMG/M
3300027663|Ga0208990_1100143All Organisms → cellular organisms → Bacteria806Open in IMG/M
3300027787|Ga0209074_10085752All Organisms → cellular organisms → Bacteria1038Open in IMG/M
3300027846|Ga0209180_10034717All Organisms → cellular organisms → Bacteria2732Open in IMG/M
3300027846|Ga0209180_10070041All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1965Open in IMG/M
3300027846|Ga0209180_10228508All Organisms → cellular organisms → Bacteria1072Open in IMG/M
3300027862|Ga0209701_10109444All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1719Open in IMG/M
3300027875|Ga0209283_10139014All Organisms → cellular organisms → Bacteria1607Open in IMG/M
3300027875|Ga0209283_10571794All Organisms → cellular organisms → Bacteria719Open in IMG/M
3300027882|Ga0209590_10595661All Organisms → cellular organisms → Bacteria711Open in IMG/M
3300029636|Ga0222749_10065176All Organisms → cellular organisms → Bacteria1642Open in IMG/M
3300029636|Ga0222749_10462924All Organisms → cellular organisms → Bacteria681Open in IMG/M
3300031753|Ga0307477_10374717All Organisms → cellular organisms → Bacteria977Open in IMG/M
3300031753|Ga0307477_10577235All Organisms → cellular organisms → Bacteria759Open in IMG/M
3300031754|Ga0307475_10485684All Organisms → cellular organisms → Bacteria992Open in IMG/M
3300031754|Ga0307475_10715486All Organisms → cellular organisms → Bacteria798Open in IMG/M
3300031820|Ga0307473_10785118All Organisms → cellular organisms → Bacteria678Open in IMG/M
3300031962|Ga0307479_10230343All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1827Open in IMG/M
3300032180|Ga0307471_101062140All Organisms → cellular organisms → Bacteria976Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil39.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil29.75%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil10.74%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil5.79%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.13%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.31%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.65%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.65%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.83%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.83%
Attine Ant Fungus GardensHost-Associated → Fungi → Mycelium → Unclassified → Unclassified → Attine Ant Fungus Gardens0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012169Attine ant fungus gardens microbial communities from North Carolina, USA - TSNC074 MetaGHost-AssociatedOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022718Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026355Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-AEnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10110756613300002245Forest SoilPIASSGFAGAEDPLSENGAWAALTSLSPGGGRFRKNNGAFPDRTGPDHAGARATAVIPNDHYSGIIVGHVANNGNNNVGPIVRVQSSGSSIDSHYLWWAASPGNGPNNLYRVDANGTSYTASPILPSSPVADGDTLQLIARGQVIYGIKNGVRDFIYNTGPDTTKYSTGTTGILAYTSSPSMTDATIASWFSGAAPVSSGTWDSSTFTGIENPLDEGDRWYPLPT
JGI25383J37093_1000327853300002560Grasslands SoilMSLRAIPSVLIAGLLGGLLIGTLMPAMAIADEPIASSSFVGAENPLSENGAWAAITSLAPSGTRFQKNNGAYPDRLTGTDDHAGARTTAVVPADHYSEIVVGHVGSINDNVGPSVRVQASGPSIDSQYVWYASLTGGFNILYRIDANGTTFTSPSILLTSPVVDGDRLRLIARGLVIYGIKNGVRDFIYNTGYDATKYSTGTTGILAHAAGAVTDAMIASWSTGAAPVASGTWTSSTFAGTENPLDEGDRWYPLPGYT
JGI25382J43887_1012469223300002908Grasslands SoilMSLRAIPSVLIAGLLGGLLIGTLMPAMAIADEPIASSSFVGAENPLSENGAWAAITSLAPSGTRFQKNNGAYPDRLTGTDDHAGARTTAVVPADHYSEIVVGHVGSINDNVGPSVRVQASGPSIDSQYVWYASLTGGFNILYRIDANGTTFTSPSILLTSPVVDGDRLRLIARGLVIYGIKNGVRDFIYNTGYDATKYSTGTTGILAHAAGAVTDAMIASWSTGAAPVASGTWTSSTFAGTENPL
JGI25386J43895_1005589113300002912Grasslands SoilMKLRATHSIMLGRFLGVLLVATLMPVLAIADTPIASSSFVGAEDPLSENGAWAAITSLAPQGTRFQKNDGAYPDLFLXLNHAGARTTAVLPADHYSEIVVGHIGINSGPNDCCNDVGPLVRVQASGPAIDSHYLWWAGANLNHCCNNALYRVDANGTTYSVNQIKKTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYNTGPDSTKYSTGTAGMLAYASSAVTDAKIASWASGAAPVSLGTWTSSTFAGTENPLDEGDRWYPLPGY
JGI25617J43924_1012490113300002914Grasslands SoilMNLVEPHRTLRRLRIAGRLLAGLLAVTLVPVLATADAPIASSSFVGAEDPLYENGAWAALTSLSPFGTRFQKNNGAYPDRVIQHDHAGARTTAAVPADHYSEIVVGHVAIYNNVGPIVRVQPSGLAVDSHYLWWASATDGVNSLYRIDANGITYSATSIIPTSPVVDGDRLRLIARGRVIYGLKNGVRDFIYNTGLNSTQYATGTTGILAYPSSAATDAMIASWSTGPAPISSGTWASSSFAGTENPLDEGDRWYPLPGYAGFKKTGGLA
JGI25616J43925_1013076713300002917Grasslands SoilVNFSGARSALRQSHLTCRLWLLLTALIPGAAIAGTPIASSNFAGVESPLYENGAWAAITSLAPQGTRFQKNNGALPDRFVGPDHNHAGARTTAAIPTDQYSEIVVGHLGNNSNNVGPNVRVQTSGASVDSHYLWWASNGGINSLYRIDANGTSYTGTPLIASSPVADRDALRLIARGQVIYGLKNGVRDFIYNIGTDRTKYPAGTTGMLAYPSTTTLTDAMIASWSSGAAPVSSGAWDSSNFAGTENPLDEGDRWYPLPGYSGFKKAGGLAM
Ga0066680_1016394213300005174SoilMALLMSGLLAAVLVPATAVAAGAIALSSFVGTENPLSENGLWAPITSLSPNGGRFQKNNGAFPTLLAPDHAGARTTAAVPADHYSEIVVGHVGPTVPALPTYNNVGPVVRVQTAGASLDSHYLWWAAQPNGVNGLYRIDATGTTYQPNLLMQTSSVVDGDRLRLIARGNVIYGIKNGVVRDFIYNTGANATRLGGGSTGMLAYSNTNVSDAVIASWSTGAAPTSSGTVASSTFAGVEDPLDEGDRWYPLPGYQGFRKAGGVAVGRESGHNASGVWSI
Ga0066680_1042795713300005174SoilVALRLAVALLPAAAMAGAPIASSSFAGAEDPLFENGAWAALTSLSPNGGRLQKNNAAFPDRLSPDHGGARTTALLPADHYSEIVVAHVGTSFSNVGPMVRVQTSGAAIDSNYLWWASPPNGLNNLYRIDANGTSYTAGAILSTSPIVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPQATKYSGGTAGILAFTASPSLTDAAIASWSSGAAPASSGTHDSSNFIGVENPLDEGDRWYPLPTYSGFRKAGGLAIGLDFGHNASGVWSIAPPATQYSEVT
Ga0066680_1060492113300005174SoilSFVGVEDPLSENGAWVALTSLSPNGTRFQKNNGAYPDKVYQVNDHGHGGARTTAVVPADHYSEIVVGHVGSAIGPAGCCNDVGPIVRVQASGLTIDSHYLWWAGLNMNFCCNNALYRIDANGITYTANFIIPTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPDSTKYSTGTTGMLAVAVSAVTDAMIASWATGPAPVSSGTWASSTFAGTENPLDEGDRWYPL
Ga0066673_1004882123300005175SoilMRLCSQQLLLSRESSMPSQYAPVRPRSTTRDFLVSGDCRESSERHRAGGALVGVLAIALVVTAFIPNALAADPLASSSLVGTEDPLFENGAWEALTSLSPDGTRFMKSNGAFPDRIVGPGQNHAGARTTAIMPSDHYSEIVVGHIGSSRNNVGPIVRVQPSGAAVDSHYLWWGSQVNGLNALYRVDANGTSYTATPLLQTSPVVDGDRMRLIARGLVIYGMKNGVREFIYNTGPDNAQYLTGTTGMLAFGAGPELTDAMIASWSSGGAPASSGTHDSSSFIGVEDPLDEGDRWYPLPGYSGF
Ga0066690_1067412113300005177SoilALRLAVALLPAAAMAGAPIASSSFAGAEDPLFENGAWAALTSLSPNGGRLQKNNAAFPDRLSPDHGGARTTALLPADHYSEIVVAHVGTSFSNVGPMVRVQTSGAAIDSNYLWWASPPNGLNNLYRIDANGTSYTAGAILSTSPIVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPQATKYSGGTAGILAFTASPSLTDAAIASWSSGAAPASSGTHDSSNFIGVENP
Ga0066688_1043702013300005178SoilMRLVHLPRALRQLRMRRRLLGSLLALTLMSGMMIGQGPIASSSFVGAESPLSENGAWAALTSLSPNGTRFQKNNGAYPDRIMSGDHAGARATAVVPADHYSEIVVGHVGSTANNVGAIVRVQASGPSIDSHYLWWASINGGQNSLYRIDANGTSYVPNSLVATSPVVDGDRLRLIARGPVLYGLKNGVRDFIYNTGTNTTKFSTGSAGILAHADTAVTDARIASWSTGAAPVSSG
Ga0066688_1060405813300005178SoilAAAPISSSGFAGAEDPLVENGAWAPLTSLSPNGGRFQKNDGAFPDRFGPDHAGARTTAVVPTDHYSEIVVGHLGTNLSNVGPIVRVQTSGASVDSHYLWWASEPNGLNRLYRIDANGTSYTADPLLDTSPVFQGDTLRLIARGLVIYGIKNGVRDFIYNTGPDATKYAAGTSGMLAFTGGSALTDATIASWSTGAAPASSGTHDSSNFIGVENPLDEGDRWYPLPGYSGFRKAG
Ga0066684_1013912013300005179SoilVTLVERNALCRRRRMRLFLGGFLATLIPATAIAQPITSSDFVGVESPLFENGAWAALTSLSPQGTRFQKNNGAFADRIVGPGQNHAGARTTAVIPADHYSEIVVGHVGNDRNNVGPIVRVQASGPTIDSHYLWWASLTTGVNGLFRIDANGTTFIDTRIVGTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYPTGTTGILAFPTGPALTDAMIASWSSGAAPVSSGTWTSSTFAGTENPLDEGDRWYPLPTYAGFRKAGGLAIGQDAGHNASGVWSIT
Ga0066678_1031033013300005181SoilMTGRVVGALLAAMLISITAIAAGPIASSDFAGAENPLSENGAWAALTSLSPAGGRFQKNNGAFPNQSPPDHAGARTTIALPADHYSEIVVGHVGGNTNNVGPIVRVQTSGAAVDSHYLWWASQASGVNNLYRIDANGTSYRADLILPTSSVVDGDRLRLIARGPVIYGIKNGVREFIYNTGPDATRYSAGTAGILAWAGNAVETDSQIASWSAGAAPASSGTWASSSFIGAENPLDEGDRWYPLPGYSG
Ga0066676_1074003113300005186SoilAPISSSGFAGAEDPLFENGAWAPLTSLSPNGGRFQKNAGAFPDRFGPDHAGARTTAVVPTDHYSEIVVGHLGTNLSNVGPIVRVQTSGASVDSHYLWWASEPNGLNRLYRIDANGTSYTADPLLDTSPVIQGDTLRLIARGLVIYGIKNGVRDFIYNTGPDATKYAAGTSGMLAFTGGSALTDATIASWSTGAAPASSGTHDSSNFIGAENPLDEGDRWYPLP
Ga0066675_1089486713300005187SoilVGTEDPLFENGAWEALTSLSPDGTRFMKSNGAFPDRIVGPGQNHAGARTTAIMPSDHYSEIVVGHIGSSRNNVGPIVRVQPSGAAVDSHYLWWGSQVNGINALYRVDANGTSYTATPLLQTSPVVDGDRMRLIARGLVIYGMKNGVREFIYNTGPDNAQYLTGATGMLAFGAGPELTDAMIASWSSGGAPASSGTHDSSSFIGVEDPLDEGDRWYPLPGYSGF
Ga0066689_1008001933300005447SoilMTNTTLSPMIALRLAVALLPAAAMAGAPIASSSFAGAEDPLFENGAWAALTSLSPNGGRLQKNNAAFPDRLSPDHGGARTTALLPADHYSEIVVAHVGTSFSNVGPMVRVQTSGAAIDSNYLWWASPPNGLNNLYRIDANGTSYTAGAILSTSPIVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPQATKYSGGTAGILAFTASPSLTDAAIASWSSGAAPASSGTHDSSNFIGVENPLDEGDRWYPLPTYSGFRKAGGLAIGLDFG
Ga0066689_1048719913300005447SoilASSSFVGAEDPLSENGSWAALTSLSPNGTRFQKNNGAFADLLIARNHAGARTTAVVPADHYSESVVGHIGVDINTPDCCNNVGPIVRVQASGPAIDSHYLWWAGLNIPNHCCNNALYRVDANGTTYNPVQIFKTSPVVDGDRLRLIARGLVLYGIKNGVRDFIYNTGPDSTKYSTGTTGILAHPSTTVTDAMIASWATGSAPASNGTWASSTFAGTEDPLDEGDRWYPLPGYSGFKKSGGFASARPPLHIGDTLHNAS
Ga0066689_1052783213300005447SoilSRRRLGLTAKLLSGILVASLFPASAIAAGPIASSTFIGTEDPLLENGAWAALTSLSPNGGRFQKNNGAFPDKPGPDHAGARTTAAVPVDQFSEIVVGHVGTTNNNVGPIVRVQASGPSIDSHYLWWTSQPNGVSGLYRIDANGTSYGSARILGSSSVVDGDRLRLIARGPVIYGVKNGAREFIYNTGPDTVKYSTGTTGMLAWAGNGVLTDSKIASWSTDAAPVSLGTWASSTFAGVEDPLDETDRWY
Ga0066681_1028501513300005451SoilMSHVDRPRRRRPLHFASPRLVGLVAVMLVPLAAVADGPIASSTFAGAEDPLSENGAWAALTSLSPNGTRFQKNNGAFPDRFVRPDHAGARSTAVLPADHYSEIVIGHIGSNADNLGPTVRVQAAGGAVDSHYLWWASRTDGVNSLYRIDANGTSYSATPLIPCSAITDGDRLRLIARGPVVYGIKNGVRDFIYNTGPDPVQLGGGAAGMLAWPAGETLSDAMIGAWSGGAAPASSGTWASSSFVG
Ga0066692_1069905713300005555SoilSSSFVGAENPLSENGTWVTLTAFSPFGTRFQKNNGAYPDQLIYHDHAGARTTAVVPADHYSEIVVGHVGNYDYNNVGPIVRVQPTGPSIDSHYLWWASGPNGVNYLYRIDADGTTYSANGLIPTSPVVDGDRLRLIARGRVIYGIKNGVRDFIYHTGPNPTQYSTGTAGMLAYSSGAVRDAMIASWSTGPAPVSSGTWASSAFAGTEDP
Ga0066707_1003111313300005556SoilMNLIERQHALRQLRMIGRLLGSLLVATLLPVLAVADSPIASSSFVGAEDPLSENGAWAALTSFAPQGTRFQKNNGAYADQLFAKNHAGARTTAVVPADHFSEIVVGHLGSDINNPQGGTPDCCNNVGPIVRVQASGPAIDSHYLWWAGLNINQCCNNALYRIDANGTTYNPAQIFKTSPVVDGDRLRLIARGLVLYGIKNGVRDFIYNTGPDSTKYSTGTTGMLAHPSTAVTDAKIASWSSGAAPVSLATWTSSTFAGTEDPLDEGDRWYPLPGYSGFKKSGGFASARPPLHIGDTLHNASGVWSITPPATQYSEVTLG
Ga0066704_1021929513300005557SoilMSAVAAGPRAVRVAFVARQGADPSRRRLGLTAKLLSGILVASLFPATAIAAGPIASSTFIGTEDPLLENGAWAALTSLSPGGGRFQKNNGAFPDKPGPDHAGARTTAAVPVDQFSEIVVGHVGTTNNNVGPIVRVQASGPSIDSHYLWWTSQPNGVSGLYRIDANGTSYGSARILGSSSVVDGDRLRLIARGPVIYGVKNGAREFIYNTGPDTVKYSTGTTGMLAWAGNGVVTDSRIASWSTDAAPVSLGTWASSTFTGVED
Ga0066704_1022696413300005557SoilMNLIERQRALRQLRLLGRLLGGLLVATLMPVLAVADSPIASSSFVGAEDPLSENGSWAALTSLSPNGTRFQKNNGAFADLLIARNHAGARTTAVVPADHYSESVVGHIGVDINTPDCCNNVGPIVRVQASGPAIDSHYLWWAGLNIPNHCCNNALYRVDANGTTYNPVQIFKTSPVVDGDRLRLIARGLVLYGIKNGVRDFIYNTGPDSTKYSTGTTGILAHPSTTVTDAMIASWATGSAPASNGTWASSTFAGTEDPLDEGDRWYPLPGYSGFKKSGGFASARPPLHIGDTLHNASGVWSITPPATQYS
Ga0066698_1053578413300005558SoilEDPLSENGAWAPLTNPGLFPNGGRVQKNNGAFPDKPSPDHAGARTTAAVPPDHYSEIVVGHLGGPQNNVGPIVRVQTSGPSIDSHYLWWASTVNGFNGLVRDDENSTGHHPVGPIVPSSPVTDGDTLRLIARGQVIYGIKNGVRDFIYNTGPDAIKYSGGATGILAWAGNQVVTDSKIASWSTGPAASSSGTWASSTFQGTENPLDEGDRWYPLPGYAGFRKAGGFAVGLNAGFHNASGVWSIVAPQKQYSEVTLGAVASGGGG
Ga0066700_1055610313300005559SoilMKLRATHSIMPGRFLGVLLVATLMPVMAIADAPIASSSFDGTEDPLSENGAWVALTSLSPTGGRFQKNGGVAFPDKLYGSTRDHAGARTTATIPDDHYSEIVIGHITNAPITNVGPIVRVQTSPIAIDSHYLWWATANGGVNNLYRIDANGITYNATALIPTSPVVDGDRLRFIARGPVIYGVKNGVRDFIYTIGQDSTRYLTGTTGMLAVAGSALTDATIDSWSSGPAPVSTGGTWASTTFVGAEN
Ga0066702_1059619413300005575SoilSLSPAGGRFQKNNGAFPNQSPPDHAGARTTIALPADHYSEIVVGHVGGNTNNVGPIVRVQTSGAAVDSHYLWWASQASGVNNLYRIDANGTSYRADLILPTSSVVDGDRLRLIARGPVIYGIKNGVREFIYNTGPDATRYSAGTAGILAWAGNAVETDSQIASWSAGAAPASSGTWASSSFIGAENPLDEGDRWYPLPGYSGFRKAGGLAIGRDWGQNASG
Ga0066708_1063462213300005576SoilPNGGRLQKNNAAFPDRLSPDHGGARTTALLPADHYSEIVVAHVGTSFSNVGPMVRVQTSGAAIDSNYLWWASPPNGLNNLYRIDANGTSYTAGAILSTSPIVDGDRLRLIARGPVIYGIKNGVRDFIYTTGPQATKYSGGTAGILAFTASPSLTDAAIASWSSGAAPASSGTHDSSNFIGVENPLDEGDRWYPLPTYSGFRKAGGLAIGLDFGHNASGVWSIAPPA
Ga0066706_1005623013300005598SoilMNLIERQHALRQLRMIGRLLGSLLVATLLPVLAVADSPIASSSFVGAEDPLSENGAWAALTSFAPQGTRFQKNNGAYADQLFAKNHAGARTTAVVPADHFSEIVVGHLGSDINNPQGGTPDCCNNVGPIVRVQASGPAIDSHYLWWAGLNINQCCNNALYRIDANGTTYNPAQIFKTSPVVDGDRLRLIARGLVLYGIKNGVRDFIYNTGPDSTKYSTGTTGMLAHPSTAVTDAKIASWSSGAAPVSLATWTSSTFAGTEDPLDEGDRWYPLPGYHGFKKTGGLASGIDSNHNASGVWSI
Ga0079222_1024981113300006755Agricultural SoilVKKLGLALALLPIAALAQAPVASSNFAGAEDPLFENGAWAALTSLSPNGGQFQKNNGAFPDRFSPDHAGARTTAVLPADHYSEIVVGNLGTSVSNVGPSVRVQTAGAAIDSHYLWWASLPNGLNNLYRIDANGTSYSADPLMATSPVTNGDRLRLVARGLVIYGIKNGVRDFIYNTGPDATKLAGGTSGILAFTGDSALTDATIASWSTGPAPASSGTQDSSLFVGVENPLDEGDRWYPLPGYSGFRKAGGVAMGRDFNHNAEAAWSISPPAAQYSEVTLGTVASGGGG
Ga0066658_1013451223300006794SoilMTNTTLSPMIALRLAVALLPAAAMAGAPIASSSFAGAEDPLFENGAWAALTSLSPNGGRLQKNNAAFPDRLSPDHGGARTTALLPADHYSEIVVAHVGTSFSNVGPMVRVQTSGAAIDSNYLWWASPPNGLNNLYRIDANGTSYTAGAILSTSPIVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPQATKYSGGTAGILAFTASPSLTDAAIASWSSGAAPASSGTHDSSNFIGVENPLDEGDRWYPLPTYSGFRQAGGLAI
Ga0066658_1021912413300006794SoilMSLRAMPSVLIAGLRGGLLIGTLMPAMAIADEPIASSSFVGAENPLSENGAWAAITSLAPSGTRFQKNNGAYPDRLTGSDDHAGARTTAVVPADHYSEIVVGHVGSINDNVGPSVRVQASGPSIDSQYVWYASLTGGFNILYRIDANGTTFTSPSILLTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGYDATKYSTGTTGILAHAAGAVTDAKIASWSTGAAPVASGTWTSSTFAGTENPLDEGDRWYPLPGYSGFKKAGGAAIGRDSAHNSSGV
Ga0099791_1020585313300007255Vadose Zone SoilMSRSLSARVLGTLFLGLACGRDAAGPGGATAQGSSPVMAPATAAVQAPIASSDFVGFEDPLSENGAWVALTSMAPQGTRFQKNNGAYPDRFVGPDQNHAGARTTAIIPPDHYSEIVVGHVGDNNGNVGPIVRVQTSGSAIDSHYLWWASHSDGHNNLYRIDANGTSFTASPILPTSPVADRDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYVTGTSGMLAYTSGPTLTDAMISWWSSGAAASSSGTWASSTFAGVENPLDEGDRWYPLPTYSGFRKAGGFAIGRDWGHNASGVWS
Ga0099793_1004880013300007258Vadose Zone SoilVKLTGLLGALLIPAVATAAPIASSSFAGVENPLLENGAWAALTSLSPNGGRFQKNNAAFPNRFSPDHAGARTTAALPADHYSEIVVAHLGTNVSNVGPTVRVQTGGASVDSHYLWWASLPNGLNNLYRIDANGTSYTADPRIATSPVTDGDKLRLIARGPVIYGIKNGVRDFIYNTGPDATKYGGGTAGILAFTGDQSLTDASIASWAGGAAPVSSGTRDSSTFTGIENPLDEGDRWYPLPGYSGFRKTGGLAMGRDLGHNASGAWSIT
Ga0099794_1010747823300007265Vadose Zone SoilMDIGERPPRQRRLPGRVLLLAALLPGIAVADSPVASSKFAGAEDPLFENGAWAPLTSLSPGGGRFQKGNGAFPDRFAPEHAGARTTAPIPADHYSEIVVGHVGSQPVNNNVGPIVRVQASGPSIDSHYLWWAGPTNGVNNLYRIDANGTSFTASPILPTSAVTDGDTLRLIARGQVIYGIKNGVREFIHNTGQDFATYSGGTAGMLAFASGPALTDASIASWSAGASPASSNGWTSSNFAGVENPLDEGDRWYPLPGY
Ga0099794_1025976813300007265Vadose Zone SoilMGLFLGVLLATLIPATAMAQPIASSSFVGAESPLSENGAWEAITALAPQGTRFQKNNGAYPDRIVGPANNHAGARTTALIPPDHYSEIVVGHLGNNRNNVGPIARVQASGASIDSHYLWWGSLTNGVNNLYRIDANGTTFTATPILSGTSPVADGDRLRLIARGPVIYGIKNGVRDFIYNTGPDITKYSTGTAGMLAFPTGPLLTDAMIASWSSGAAPVSSGTWASSTFSGTENPLDEGDRWYPLPTYFGFRKAGVLAIGLNSGHNASGVWSIAPPAKQYSEVTLGTAVSG
Ga0066710_10137707723300009012Grasslands SoilMNLIERQRALRQLRLLGRLLGGLLVATLMPVLAVADSPIASSSFVGAEDPLSENGAWAALTSFAPQGTRFQKNNGAYADQLFAKNHAGARTTAVVPADHYSEIVVGHIGSDINNPQGGTPDCCNNVGPMVRVQASGPAIDSHYLWWAGLNINQCCNNALYRIDANGTTYNPAQIFKTSPVVDGDRLRLIARGQVLYGIKNGVRDFIYNTGPDSTKYSTGTTGMLAHPSTAVTDAKIASWSSGAPPVSLATWTSSTFAGTEDPLDEGDRWYPLPGYSGFKKSGGFASARPPLHIGDTLHNASGVWSITP
Ga0066710_10207756713300009012Grasslands SoilMNLVERQRTLRQLRRISRLLGGLLVATLVPVMAMADGPIASSSFVGAEDPLSENGAWAALTSLSPNGTRFQKNNGAYPDQLIYNDHAGARTTAVVPADHYSEIVVGHIESINYNNVGPIVRVQPTGPSIDSHYLWWASGPNGVNALYRIDANGTTFTANSIIRTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYHTGPDATKYSTGTAGMLAYPTGAVTNAMIASWSTGPAPVSSGTWASSAFAGTEDPLDEGDRWYPLPGYLGFKKTGGRAI
Ga0099829_1001028613300009038Vadose Zone SoilVRRRLRRTRGGFLGGFLVATLIPTLASADPPIASSSFVGAEDPLSENGAWAPLTSVAPQGTRFQKNNGAYPDRFVGPDHNHAGARTTAIIPPDHYSEIVVGHVGTNSDNVGPIVRVQTSGSTIDSHYLWWATQTNGHNALYRVDANGTSFTASPILTTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNIGPDTTKYSTGTAGMLAFNSGPTLTDAMITSWSSGAAPGSSGTWASSTFAGIENPLDEGDRWYPLPTYSGFRKAGGL
Ga0099829_1025257913300009038Vadose Zone SoilMNLVDLHRTLCVVGLLAVALLPVGAIADTPIASSNFLGAEDPLSENGAWAALTSLSPNGVRFQKNNGAYPDLFISPNHAGARSTAAIPVDHYSEILVGHVAGNTNNVGPIVRVQPSGSAVDSHYLWWGSGTGGTVNYLYRVDANGISYSASPILPTSPIADGDRLRLIARGLLIYGIHNGAREFIYNTGPDTTKYSTGTAGMLAYTSGPTLTDATIASWSSGAAPASSGGWASSNFSGVENPLDEGDRWYPLPTYSGFRKAGGLAIGRDGGQNVSGVWSITPPAKQYSEVTLGTVVSGGG
Ga0099830_1063622913300009088Vadose Zone SoilMNLVELQRAPRTLRAMGRLPVGLLVVMLMAIMAIANGPIASSSFVGAEDPLSENGAWAALTSLSPNGSRFQKINGAFPDQPGNGVNHAGARTSAVVPTDHYSEIVVGHVGNNITNFNNVGPIVRVQASGPSIDSHYLWWASVVANGVNNLYRIDANGTTYTANRIMSTSPIVDGDILRLIARGPVIYGIKNGLRDFIYNTGPDATKYSTGTTGMLAYAGDGGVTNAKIASWSTAAAPVSSGTRASSTFTGIENPLDEGDRWYPLPGYSGFKKVGGLAIGRDSVHNA
Ga0099828_1015175333300009089Vadose Zone SoilVNLVALRLAVALLPAAALAGAPIASSGFAGAEDPLFENGAWAALTSLSPNGGRFRKNNAAFPDRFSPDHAGARTTALLPADHYSEVVVGHVGTSASNVGPIVRVQTSGAAVDSHYLWWATLANGLNNLYRIDANGTSYSASPILPTSPVVDGDTLRLVARGLVIYGIKNGVRDFIYNTGPDAAKYSAGTAGMLAFTGDSALTDATIASWSGGAAPASSGTHDSSSFLGVEDPLDEGDRWYPL
Ga0099828_1087294613300009089Vadose Zone SoilENGAWAPLTSVAPQGTRFQKNNGAYPDRFVGPDHNHAGARTTAIIPPDHYSEIVVGHVGTNSDNVGPIVRVQTSGSTIDSHYLWWATQTNGHNALYRVDANGTSFTASPILTTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNIGPDTTKYSTGTAGMLAFNSGPTLTDAMITSWSSGAAPGSSGTWASSTFAGIENPLDEGDRWYPLPTYSGVRKAGGLATGRDGGHNGSGVWSIAPPARQYSEVTLGTVASGGGGPIVRIDRSN
Ga0099828_1095569713300009089Vadose Zone SoilLSENGAWVALTSLSPNGTRFQKNNGAYPDQLGDTAIAHNHAGARTTAVVPADHYSEIVVGHLASYNDVGPIVRVQPSGLSVDSHYLWWAALASGGLNFLYRIDANGATYTANGIIPTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPDGTRYSTGTTGMLAFASSAMTDAMIASWSTGPAPVSTSTWASSTFAGTENPLDEGDRWYPLPTYSGFKKTGGAAIGLSSGYNASGVWSIAPPATQYSEVTLGA
Ga0099792_1006073033300009143Vadose Zone SoilVKPVTFWSGLAALVPAAAIAQAPIASSSFAGTEDPLSENGAWAALTSLSPNGGRFQKNNGAFPDRFSPDHAGVRTTAVVPADHYSEIVVGHLGTSLSNVGPTVRVQTAGASIDSHYLWWASLPNGLNNLYRIDANGTSYRADPIVSTSPVTDGDRIRLIARGLVIYGIKNGVRDFIYNTGPDATKYASGASGMLAFTGDSSLTDATIASWSTGPAPASSGTHDSSNFIGVENPLDEGDRWYPLPGYSGFKKAGGLAVGRDFGHNATGVWSIAPPARQYSEVTLG
Ga0134088_1001957913300010304Grasslands SoilMLGRFLGVLLVATLMPVLAIADTPIASSSFVGAEDPLSENGAWAAITSLAPQGTRFQKNDGAYPDLFLNLNHAGARTTAVLPADHYSEIVVGHIGINSGPNDCCNDVGPLVRVQASGPAIDSHYLWWAGANLNHCCNNALYRVDANGTTYSVNQIKKTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYNTGPDSTKYSTGTARMLAYASSAVTDAKIASWASGAAPVSLGTWTSSTFAGTENPLDEGDRWYPLP
Ga0134086_1015528913300010323Grasslands SoilLSITRLIYPRIRTVSSSRGNFDTSQGIAVAAHGDDSDGARRMRMSLRGTHSALVGGLLGGLLIGTLMPAMAMAADPIASSSFVGAESPLSENGAWAAITSLAPSGTRFQKNNGAYPDRLTATDDHAGARTTAVVPADHYSEIVVGHVGSINDNVGPSVRVQASGPSIDSQYVWYASLTGGFNILYRIDANGTTFTSPSILLTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGYDATKYSTGTTGILAHAAGAVTDAKIASWSTGAAPV
Ga0134071_1031154113300010336Grasslands SoilMLGRFLGVLLVATLMPILAIADAPIASSSFVGAEDPLSENGAWAALTSLSPNGTRFQKNNGAYPDRLIYRDHAGARTTAVVPADHYSEIVVGHVGNYDYNNVGPIVRVQPTGPSIDSHYLWWASGPNGVNYLYRIDADGTTYSANGLIPTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYHTGPDTTKYSTGTAGMLAFGSSFMTDAMIASWSSGTAPVSSGTWASSTFAGTENPLDEGDRWYPLP
Ga0137392_1078762913300011269Vadose Zone SoilIPTLASADPPIASSSFVGAEDPLSENGAWAPLTSVAPQGTRFQKNNGAYPDRFVGPDQNHAGARTTAIIPPDHYSEIVVGHVGDNNGNVGPIVRVQTSGSAIDSHYLWWASHSDGHNNLYRIDANGTSFTASPILPTSPVADRDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYVTGTSGMLAYTSGPTLTDAMISWWSSGAAASSSGTWASSTFAGVENPLDEGDRWYPLPTYSGFRKAGGFAIGRDWGHNASGVWSI
Ga0137393_1057488013300011271Vadose Zone SoilVGVVLVVTLVGVVAMAAGPIASSSFVGVEDPLSEGGAWAALTSLSPNGGQFQKLNGAFPDRFGPDHAGARTTAVVPADHYSEIVVGHVGNVVNYINNVGPMVRVQTSGPSIDSHYLWWASGTNGLNYLYRVDANGTSYTASAILPSSPVVDGDRIRLIARGPVIYGLKNGVRDFIYNSGPSTPRYSTGTAGMLAYTAGPSLTDAMIASWSCSSAPVPSGTWTSSNFIGAEDPLDEGDRWYPLPNYAGFRKA
Ga0137393_1073838013300011271Vadose Zone SoilVASSNFVGFEDPLSENGAWAALTSLAPEGTRFEKNNGAFPDRFVGPDNNHAGARTTAAVPTDHYSEIVVGHVGNQYSYVGALVRVQPSGPSIDSNYLWWGSLANGQNNFLYRVDANGTSYHVAAILPHSPFADGDRIRLVARGPVIYGIKNGVREFIYNTGRDTTVYSTGTAGMLAYVPNSTLTDAMIASWSAGAAPVSSGTWASSTFAGVEDPLDEGDRWYPLPGYSGFKKAGGLAIGKDGGHNISGVWSIAPPARQYSEVTLGTAASGGGGPIVRIDR
Ga0137393_1089341313300011271Vadose Zone SoilMRGGFLLATLIPALASADPPIASSSFVGVEDPLYENGAWVALTSLAPNGIRFQKNSGAYPDRFISPNHAGARTTAIIPVDHYSEIVVGHVGDNTDNVGPIVRVQTSGSAIDSHYLWWATQMNGHNNLYRLDANGTSYTASPILPTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYVTGTSGMLAYTSGPTLTDAMISWWSSGAAASSSGTWASSTFAGVENPLDEGDRWYP
Ga0137393_1098683113300011271Vadose Zone SoilGFLVATLIPTLASADPPIASSSFVGAEDPLSENGAWAPLTSVAPQGTRFQKNNGAYPDRFVGPDQNHAGARTTAIIPPDHYSEIVVGHVGDNNGNVGPIVRVQTSGSAIDSHYLWWASHSDGHNNLYRIDANGTSFTASPILPTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYVTGTSGMLAYTSGPTLTDAMISWWSSGAAASSSGTWASSTFAGVENPLDEGDRWYP
Ga0153990_109583313300012169Attine Ant Fungus GardensIADGPIASSSFVGAEDPLSENGAWAALTSLSPNGSRFQKNNGAYPDLAGYPNHAGAHTTAVVPADHYSEIVVGHVGSVNDDVGPIVRVQAAGPSIDSHYLWWASVFNGVNNLYRIDANGTSYTATVIMPTSAVADGDRLRLIARGPVIYGIRNGVRDFIYNTGPDTTKFSTGTAGMLAYAGSGTVTNAMIASWSTGAAPVSSGTWASSNFAGIEDPLDEG
Ga0137388_1042219413300012189Vadose Zone SoilMNLVDLHRALRVVGLLAVALLPVGAIADTPIASSNFLGAEDPLSENGAWAALTSLAPNGVRFQKNNGAYPDLFISHNHAGARTTAAIPADHYSEILVGHVAGNTNNVGPIVRIQPSGSAVDSHYLWWGSGTGGTVNYLYRVDANGISYSASPILPTSPIADGDRLRLIARGLLIYGIHNGAREFIYNTGPDTTKYSTGTAGMLAYTSGPTLTDATIASWSSGAAPASSGGWASSNFSGVENPLDEGDRWYPLPTYSGFRKAGGLAIGRDGGQNV
Ga0137388_1118501313300012189Vadose Zone SoilGFLVATLIPTLASADPPIASSSFVGAEDPLSENGAWAPLTSVAPQGTRFQKNNGAYPDRFVGPDQNHAGARTTAIIPPDHYSEIVVGHVGDNNGNVGPIVRVQTSGSAIDSHYLWWASHSDGHNNLYRIDANGTSFTASPILPTSPVADRDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYVTGTSGMLAYTSGPTLTDAMISWWSSGAAASSSGTWASSTFAGVENPLDE
Ga0137388_1118601213300012189Vadose Zone SoilATLIPALASADPPIASSSFVGVEDPLYENGAWVALTSLAPNGIRFQKNSGAYPDRFISPNHAGARTTAIIPVDHYSEIVVGHVGDNTDNVGPIVRVQTSGSAIDSHYLWWATQMNGHNNLYRLDANGTSYTASPILPTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYVTGTSGMLAYTSGPTLTDAMISWWSSGAAASSSGTWASSTFAGVENPLDEGDRWYP
Ga0137363_1009660323300012202Vadose Zone SoilVLLATLIPATAIAQPIASSSFVGAESPLSENGAWEALTALAPQGTRFQKNNGAYPDRIVGPDNNHAGARTTALIPPDHYSEIVVGHLGNNRNNVGPIVRVQALGASIDSHYLWWASLTNGVNNLYRIDANGTTFTAAVIRPGTSPVADGDALRLIARGPVIYGIKNGVRDFIYNTGLETTRYSTGTAGMLAFPAGPSLTDAMIASWSSGAAPASSGTWASSTFAGTENPLDEGDRWYPLPGIVGSYSYSGFRK
Ga0137363_1068234613300012202Vadose Zone SoilMRLSLGVLLATLIPATAIAQPITSSNFVGVESPLSENGAWVAITSLAPQGTRFQKNNGAFPDRIVGPAQNHAGARTTAVIPADHYSEIVVGHVGNDRNNVGPIVRVQTSGPTIDSHYLWWASLTNGVNNLYRIDANGTTFINTRIIATSPVVDGNRLRLIARGQVIYGIKNGMRDFIYNTGPDTTKYPTGTTGILAFPTGPALTDAMIASWSSSAAPVSSGTWTSSTFAGTENPLDEGDRWYPLP
Ga0137363_1101919213300012202Vadose Zone SoilSSSFAGAEDPLFENGAWAALTSLSPGGGRFRKNNGAFPDRTGPDHAGARTTAVIPNDHYSEIIVGHVANNGNNNVGPIVRVQSSGSSIDSHYLWWAASPGNGPNNLYRVDANGTSYTASPILPSSPVADGDTLQLIARGQVIYGIKNGVRDFTYNTGPDTTKYSTRTTGILAYTPSPSMTDATIASWFSGAAPVSSGTWDSSTFTGIENPLDEGDRWYPLPTYSGFRKAGSLA
Ga0137399_1004376213300012203Vadose Zone SoilVKLAGLLGALLIPAVATAAPVASSTFAGAEDPLFESGAWAALTSLSPNGGRFQKNNAAFPDRFSPDHAGARTTAVLPADHYSEIVVAHLGTNVSNVGPTVRVQTGGASVDSHYLWWASLPNGLNNLYRIDANGTSYTADPRIATSPVTDGDKLRLIARGPLIYGIKNGVRDFIYNTGPDATKYGGGTAGMLAFTGDQSLTDASIASWAGGAAPVSSGTRDSSTFTGVENPLDEGDRWYPLPGYSGFRKT
Ga0137399_1023140233300012203Vadose Zone SoilVRLLACLLAITIAPGMAIADGPIASSSFTGVEDPLYENGAWAPLTSLAPQGIRFQKNNGALPDRFTGPDHNHAGARTTAVIPADHYCEIVVGHLGDNSNNLGPIVRVQPSGSSIDSHYLWWATRSNGLNELYRVDANGSSYNASPILASSPVADGDRLRLIARGQVIYGIRNGVRDFIYNTGPDAIRYSAGTAGMLAYPSGPTLTDDVIASWSSGAAPVSSGTWASTTPGRPDGCCSSTPTTRPGRESSR*
Ga0137361_1089191313300012362Vadose Zone SoilVKPVTFWSGLAALVPAAAIAQAPIASSSFAGTEDPLSENGAWAALTSLSPNGGRFQKNNGAFPDRFSPDHAGVRTTAVVPADHYSEIVVGHLGTSLSNVGPTVRVQTAGASIDSHYLWWASLPNGLNNLYRIDANGTSYRADPIVSTSPVTDGDRIRLIARGLVIYGIKNGVRDFIYNTGPDATKYASGASGMLAFTGDSSLTDATIASWSTGPAPASSGTHDSSNF
Ga0137390_1080044913300012363Vadose Zone SoilMNLVELQRAPRTLRAMGRLPVGLLVVMLMAIMAIADGPIASSSFVGAEDPLSENGAWAALTSLSPHGSRFQKNNGAYPDLPGYPGGNHAGARTTAVIPADHYSEIVVGHVGNNITNFNNVGPIVRVQASGPSIDSHYLWWASVANGVNGLYRIDANGTAYTANGIMPTSPVIDGDRLRLIARGPVIYGIKNGVRDFIYNTGPDATKYSTGTTGMLAYAGDGGVTNAKIASWSTGAAPVSSGTRASSTFTGIENPLDEGDRWYPLPGYSGFKKVGGLAIGRDSVHNASGVWSIAP
Ga0137390_1083193913300012363Vadose Zone SoilMTLVDRTLLRAARRLPVGLLAVTLIQAIAVADGPLASSSFVGSEDPLSENGAWVGLTSLSPNGTRFQKSNGAYPDQFIHPDHAGARTTAVIPTDHYAEIVVGHLGSNTDNVGPIVRVQTSGASVDSHYLWWTSRTNGINELYRIDANGTSYTAEPILATSPVTDGDRLRLIARGPVIYGINNGVRDFIYNTGPETRKYSAGTAGMLAWNGSPTLTDAMIASWSSGAAPVSSGTWASSNFTGIENPLDE
Ga0137358_1015572623300012582Vadose Zone SoilLNLVELQSALGQLRRKGRLLIGLLVLASLPVAAIADAPIASSSFAGAEDPLFENGAWAALTSLSPGGGRFRKNNGAFPDRTGPDHAGARTTAVIPNDHYSEIIVGHVANSGNNNVGPIVRVQSSGSSIDSHYLWWAASPGNGPNNLYRVDANGTSYTASPILPSSPVADGDTLQLIARGQVIYGIKNGVRDFIYNTGPDTTKYSTGTTGILAYTSSPSMTDATIASWFSGAAPVSSGTWDSSTFTGIENPLDEGDRWYPLPTYSGFRKAGSLAIGRDWGHNASGVWSITPPAKQYSEVTLGTVTRGGG
Ga0137397_1019875923300012685Vadose Zone SoilMRLFLGGLLATLIPATAIAQPITSSNFVGVESPLFENGAWVAITSLSPQGTRFQKNNGAFPDRIVGPDQNHAGARTTAVIPADHYSEIVVGHVGNDRNNVGPIVRVQTSGPTIDSHYLWWASLTTGVNGLFRIDANGTTFIDTRIVGTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYPTGTTGILAFPTGPALTDAMIASWSSGAAPVSSGTWTSSTFAGTENPLDEGDRWYPLPTYSGFRKAGSLAIGQDGGHNASGVWSIT
Ga0137396_1014710833300012918Vadose Zone SoilPQGIRFQKNNGALPDRFTGPDHNHAGARTTAVIPADHYCEIVVGHLGDNSNNLGPIVRVQPSGSSIDSHYLWWATRSNGLNELYRVDANGSSYNASPILASSPVADGDRLRLIARGQVIYGIRNGVRDFIYNTGPDAIRYSAGTAGMLAYPSGPTLTDDVIASWSSGAAPVSSGTWASTTPGRPDGCCSSTPTTRPGRESSR*
Ga0137359_1014717913300012923Vadose Zone SoilVKLTGLLGALLIPAVATAAPIASSSFAGVENPLLENGAWAALTSLSPNGGRFQKNNAAFPNRFSPDHAGARTTAALPADHYSEIVVAHLGTNVSNVGPTVRVQTGGASVDSHYLWWASLPNGLNNLYRIDANGTSYTADPRIATSPVTDGDKLRLIARGPVIYGIKNGVRDFIYNTGPDATKYGGGTAGILAFTGDQSLTDASIASWAGGAAPVSSGTRDSSTFTGVENPLDEGDRWYPLPGYSGFRKTGGLAMGRDFGHNASGV
Ga0137359_1096022113300012923Vadose Zone SoilTSLSPGGGRFQKNNAAFPDRISPDHAGARTTVALPADHYSEIVVGQLGTTSSNVGPTVRVQTSGASVDSHYLWWASLPNGLNNLYRIDANGTSYTADPLMPTSPVAEGDRLRLIARGPVIYGIKNGVRDFIYNTGPDTTKYPTGTTGILAFPTGPALTDAMIASWSSSAAPVSSGTWTSSTFAGTENPLDEGDRWYPLPTYSGFSKAGGLAIGLDGGHNASGVWSITPPARQYSEVTLGSVPSG
Ga0137416_1028591243300012927Vadose Zone SoilVRLLACLLAITIAPGMAIADGPIASSSFTGVEDPLYENGAWAPLTSLAPQGIRFQKNNGALPDRFTGPDHNHAGARTTAVIPADHYCEIVVGHLGDNSNNLGPIVRVQPSGSSIDSHYLWWATRSNGLNELYRVDANGSSYNASPILASSPVADGDRLRLIARGQVIYGIKNGVRDFIYNTGPDAIRYSTGTAGMLAFPSGPTLTDDVIASWSSGAAPVSSGTWASTTPGRPDGCCSSTPTTRPGRESSR*
Ga0137416_1074517813300012927Vadose Zone SoilLNLVELQSALGQLRRKGRLLIGLLVLASLPVAAIADAPIASSSFAGAEDPLFENGAWVALTSLSPGGGRFRKNNGAFPDRTGPDHAGARTTAVIPSDHYSEIIVGHVASSGNNNVGPIVRVQSSGSSIDSHYLWWASQGNGPNNLYRIDANGTSYTASPILPSSPVADGDTLQLIARGQVIYGIKNGVRDFIYNTGPDTTKYSTGTTGILAYTSSPSMTDAMIASWFTGSAPVSSGSWASSTFTGIENPLDEGDRWYPLPTYSGFRKAGGL
Ga0137407_1013325723300012930Vadose Zone SoilMGLFLGVLLATLIPATGIAQPIASSSFVGAESPLSENGAWEAITSLAPQGTRFQKNNGAYPDRIVGPDNNHAGARTTALIPPDHYSEIVVGHLGNNRNNVGPIVRVQASGASIDSHYLYWASLTNGVNNLYRVDANGTTFTATPILPGTSPVADGDTLRLIARGQVIYGIKNGMRDFIYNTGPDTTKYSTGTAGMLAFPTGPSLTDAMIASWSSGAAPASSGTWASSTFAGIENPLDEGDRWYPLPTYSGFRKAVGLV
Ga0137410_1100687013300012944Vadose Zone SoilAGPIASSNFAGAENPLSENGAWAAITSLSPHGTRFQKSNGAQPNQVLYPDHAGARSTAVVPADQYSEIVVGHVGSAQYHNVGPIVRVQASGPSVDSHYLWWAAATNGVNNLYRIDANGTSYTASLLTASSPIADGDTLRLIARGPVLYGIKNGVRDFIYNTGRDATTYSAGAPGMLAYSSSTVANATIASWSTGAAPVSSGTWASSNFAGIENPLDEGDRWYPLPGYTGFRKAGGL
Ga0134077_1012029313300012972Grasslands SoilMMGRLLGGLLVATMPVLAIADAPIASSSFVGVEDPLSENGAWAALTSLSPNGTRFQKNNGAYPDQVGGIEHAGARTTAVIPVDHYSEIVVGHVGSNYNNVGPIVRVQASGSSIDSHYLWWASLINGHNYLYRIDANGTTYTANAIIPTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYHTGPDTTKYSTGTAGMLAYPTGAVTNAMIASWSTGPAPVSSGTWASSAFAGTEDPLDEGDRWYPLPGYLGFKKTGGLAIGRDLGHNTSG
Ga0137418_1073728213300015241Vadose Zone SoilVGVESPLFENGAWAALTSLSPQGTRFQKNNGAFPDRIVGPGQNHAGARTTAVIPADHYSEIVVGHVGNDRNNVGPIVRVQASGPTIDSHYLWWASLTTGVNGLFRVDANGTTFTAARILATSPAVDGDRLRLIARGQMIYGIKNGVRDFIYNTGPDTTKYPTGTTGILAFPTGPALTDAMIASWSSSAAPVSSGTWTSSTFAGTENPLDEGDRWYPLPTYAGFRKAGGLAIGQVGGHNASGVWSITPPA
Ga0137409_1083179813300015245Vadose Zone SoilAGPIASSNFAGAENPLSENGAWAAITSLSPHGTRFQKSNGAQPNQVLYPDHAGARSTAVVPADQYSEIVVGHVGSAQYHNVGPIVRVQASGPSVDSHYLWWAAATNGVNNLYRIDANGTSYTASLLTASSPIADGDTLRLIARGPVLYGIKNGVRDFIYNTGRDATTYSAGAPGMLAYSSSTVANATIASWSTGAAPVSSGTWASSNFAGIENPLDEGDRWYPLPGYTGFRKAGGLASGLDWNHNVAGVWSI
Ga0066667_1025059623300018433Grasslands SoilMSLRGTHSALVGGLLGGLLIGTLMPAMADEPIASSSFVGAENPLSENGAWAAITSLAPSGTRFQKNNGAYPDRLTGTDDHAGARTTAVVPADHYSEIVVGHVGSINDNVGPSVRVQASGPSIDSQYVWYASLTGGFNILYRIDANGTTFTSPSILLTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGYDATNYSTGTTGILAHAAGAVTDAMIASWSTGAAPVASGTWTSS
Ga0066662_1010207913300018468Grasslands SoilMTNTTLSPMIALRLAVALLPAAAMAGAPIASSSFAGAEDPLFENGAWAALTSLSPNGGRLQKNNAAFPDRLSPDHGGARTTALLPADHYSEIVVAHVGTSFSNVGPMVRVQTSGAAIDSNYLWWASPPNGLNNLYRIDAHGTSYTAGAILSTSPIVDGDRLRLIARGPVVYGIKNGVRDFIYNTGPQTTKYSGGTAGILAFTASPSLTDAAIASWSSGAAPASSGTHDSSNFIGVENPLDEGDRWYPLPTYSGFRKAGGLAIGLDFGHNASGVW
Ga0215015_1053221713300021046SoilMNLFESQRTSCQVCRKGRLPVSLFVVMAMSTMAIADGPIASSSFVGVEDPLSENGAWAALTSLAPAGTRFQKNNGAYPDRLISANHAGARTTAVLPADHYSEIVVGHVGNIINHVNNVGPTVRVQASGASVDSHYLWWASGANGVNNLYRIDANGITYTANPIIPTSPVVDGDRLRLIARGPLIYGIKNGVRDFIYNTGPNATKYSTGTAGMLAYADPSGAVTEAEIVSWSAGAAPVSSGTWASSTFTGIENPLDEGDRWYPLPGYAGFKKAGGQVIGCLLYTSDAADDM
Ga0210382_1003055923300021080Groundwater SedimentMSGQSPRAVGIVSALGGLLLAALLPITAIAAGPIASSTFAGMEDPLSESGAWAALTSLSPNGGRFQKNNGAFPNRPGPDHAGARTTAAVPSDHYSEIVVGHIGSLNINNVGPTVRIQSSGTAVDSHYLWWASGPNGLNYLYRIDATGTSYVADPLIPTSSVIDGDRLRLIARGPVIYGIKNGVRDFIYNYGANAKKYSGGTTGILAYTNGNVADATIASWSTGAAPASSGTAAASNFVGAENPLDEGDRWYPLPGYSGFRKMGGLATGRDSGHNASGAWSIAPPSRQFSEVVLG
Ga0179596_1035156813300021086Vadose Zone SoilFPGVLLATLIPATAIAQPIASSSFVGAESPLSENGAWEALTALAPQGTRFQKNNGAYPDRIVGPDNNHAGARTTALIPPDHYSEIVVGHLGNNRNNVGPIVRVQALGASIDSHYLWWASLTNGVNNLYRIDANGTTFTAAVIRPGTSPVADGDALRLIARGPVIYGIKNGVRDFIYNTGLETTRYSTGTAGMLAFPAGPSLTDAMIASWSSGAAPASSGTWASSTFAGTENPLDEGDRWYPLPGIV
Ga0210409_1008373743300021559SoilMNLVELQHAPRLLRTMGRLPVGLLVVMLMSAMAIADGPIASSSFVGAEDPLSENGAWAPIMSMAPNGTQFQKNNGAYPDRLLDGNHAGARTTAVLPTDHYSEIVVGHVGNIINHANNVGPNVRVQASGPLMDSHYVWWASGTNGDNHLYRIDANGTTYNYTAIIPSSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPDTIKYSAGTAGMLAYADPGGGGTGGAVTEAEIASWSAGAAPVSSGTWASSTFTGVENPLDEGDRWYPLPGYSGFEKTGGLAIGRDSGHNA
Ga0242675_102331113300022718SoilMAAGPIASSSFVGVEDPLSEGGAWAALTSLSPNGGRFQKLNGAFPDRFGPDHAGARTTAAVPADHYSEIVVGHVGNVVNYINNVGPMVRVQASGPSIDSHYLWWASGTNGLNYLYRVDANGTSYTASAILPSSPVVDGDRIRLIARGPVIYGLKNGVRDFIYNSGPSTPRYATGTAGMLAYTAGPSLTDAMIASWSCDSAPVSSGTWTSSNFVGAEDPLDEGDRWYPLPNYSGFKKAGGLAIGKDSAHNASGVWSIMP
Ga0137417_125753823300024330Vadose Zone SoilVRLLACLLAITIAPGMAIADGPIASSSFTGVEDPLYENGAWAPLTSLAPQGIRFQKNNGALPDRFTGPDHNHAGARTTAVIPADHYCEIVVGHLGDNSNNLGPIVRVQPSGSSIDSHYLWWATRSNGLNELYRVDANGSSYNASPILASSPVADGDRLRLIARGQVIYGIRNGVRDFIYNTGPDAIRYSAGTAGMLAYPSGPTLTDDVIASWSSGAAPVSSGTWASTTPGRPDGCCSSTPTTRPGRESSR
Ga0209235_113741513300026296Grasslands SoilMKLRATHSIILGRFLGGLLVATLMPVLATADAPIAASSFVGVEDPLSENGAWVALTSLSPNGTRFQKNNGAYPDKVYQVNDHGHGGARTTAVVPADHYSEIVVGHVGSAIGPAGCCNDVGPIVRVQASGLTIDSHYLWWAGLNMNFCCNNALYRIDANGITYTANFIIPTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPDSTKYSTGTTGMLAVAVSAVTDAMIASWATGPAPVSSGTWASSTFAGTENPLDEGDRWYPLPGYSGF
Ga0209237_112736013300026297Grasslands SoilMNLVASQRTLRQPRMMGRLLGSLLVVTLMPVMAIAGVPIASSSFDGAEDPLSENGAWAALTSLSPSGGRFQKSGGVAYPDKLYGATHDHAGARTTVTIPDDHYSEIVVGHVGSTGNHNVGPIVRVQASGLSIDSHYLWWASANGGVNAFYRIDADGITYTANGIMLTSPVVDGDRLRLIARGPVIYGVKNGVRDFIYPIGQDATHYLTGTTGILAVATSALTDATIGSWESGAAPVSTGGNWASTTFVGAENPLDENDSWYPLQGYLGFKKSGGQAIGLLASHNATGNWGITPPAKQYSE
Ga0209761_102609133300026313Grasslands SoilMKQRATHSVMLGRLLGVLLVATLMPILAIADAPIASSSFVGAEDPLFENGAWAALTSLSPNGARFQKNNGAYPDKLYQANDHEHAGARTTAVIPADHYSEIVVGHVGSNMTINNCCNNVGPIVRVQSSGPAIDSHYLWWASRSDGVTSVNILYRIDANGTTYTAHELTQTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYHTGPDTTKYSTGTTGMLAVAVSTVTDAMIASWATGPAPVSSGNWASSTFAGTENPLDEGDRWYPLPGYSGFKKTG
Ga0209152_1019006013300026325SoilMPAMAIADEPIASSSFVGAENPLSENGAWAAITSLAPSGTRFQKNNGAYPDRLTGSDDHAGARTTAVVPADHYSEIVVGHVGSINDNVGPSVRVQASGPSIDSQYVWYASLTGGFNILYRIDANGTTFTSPSILLTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGYDATKYSTGTTGILAHAAGAVTDAKIASWSTGAAPVASGTWTSSTFAGTENPLDEGDRWYPLPGYSGFKKAGGAAIGRDSAHNSSGV
Ga0209802_109862623300026328SoilMNLIERQHALRQLRMIGRLLGSLLVATLLPVLAVADSPIASSSFVGAEDPLSENGAWAALTSFAPQGTRFQKNNGAYADQLFAKNHAGARTTAVVPADHYSEIVVGHIGSDINNPQGGTPDCCNNVGPIVRVQASGQAIDSHYLWWAGLNINQCCNNALYRIDANGTTYNPAQIFKTSPVVDGDRLRLIARGLVLYGIKNGVRDFIYNTGPDSTKYSTGTTGMLAHPSTAVTDAKIASWSSGAAPVSLATWTSSTFAGTEDPLDEGDRWYPLPGYSGFKKSGGFASARPPL
Ga0209802_120067313300026328SoilLSENGAWAAITSLAPSGTRFQKNNGAYPDRLTGTDDHAGARTTAVVPADHYSEIVVGHVGSINDNVGPSVRVQASGPSIDSQYVWYASLTGGFNILYRIDANGTTFTSPSILLTSPVVDGDRLRLIARGLVIYGIKNGVRDFIYNTGYDATKYSTGTTGILAHAAGAVTDAMIASWSTGAAPVASGTWTSSTFAGTENPLDEGDRWYPLPGYTGFKKAGGAVIGRDSAHNASGVWSIAPPAKQYSEVTLGAVASG
Ga0209473_115143013300026330SoilVTLVERNALCRRRRMRLFLGGFLATLIPATAIAQPITSSDFVGVESPLFENGAWAALTSLSPQGTRFQKNNGAFADRIVGPGQNHAGARTTAVIPADHYSEIVVGHVGNDRNNVGPIVRVQASGPTIDSHYLWWASLTTGVNGLFRIDANGTTFIDTRIVGTSPVVDGDRLRLIARGQVIYGIKNGVRDFVYNTSPDTTQYSTGTAGMLAYAFGGAVSDAMIASWSAGAAPVSLGAWD
Ga0209803_116314413300026332SoilLLGRLLGGLLVATLMPVLAVADSPIASSSFVGAEDPLSENGSWAALTSLSPNGTRFQKNNGAFADLLIARNHAGARTTAVVPADHYSESVVGHIGVDINTPDCCNNVGPIVRVQASGPAIDSHYLWWAGLNIPNHCCNNALYRVDANGTTYNPVQIFKTSPVVDGDRLRLIARGLVLYGIKNGVRDFIYNTGPDSTKYSTGTTGILAHPSTTVTDAMIASWATGSAPASNGTWASSTFAGTEDPLDEGDRWYPLPGYSGFKKSGGFASARPPLHIGDTLHNAS
Ga0209158_104335623300026333SoilMTGRVVGALLAAMLISITAIAAGPIASSDFAGAENPLSENGAWAALTSLSPAGGRFQKNNGAFPNQSPPDHAGARSTIALPADHYSEIVVGHVGGNTNNVGPIVRVQTSGAAVDSHYLWWASQASGVNNLYRIDANGTSYRADLILPTSSVVDGDRLRLIARGPVIYGIKNGVREFIYNTGPDATRYSAGTAGILAWAGNAVETDSQIASWSAGAAPASSGTWASSSFIGAENPLDEGDRWYP
Ga0209158_116190613300026333SoilRMMGWFLGGLLVATLMPVRVAAGGPIASSRFVGAEDPLSENGAWAALTALSPNGTRFQKNNGAYPDQLIHLDHAGARTTVAVPADHYSEIVVGHVGSMSSNVGPIVRVQTSGPSIDSHYLWWASLTNGINALYRLDANGATYTGNKIVASSPIVDGDILRLIARGQVIYGIKNGVRDFVYNTSPDTTQYSTGTAGMLAYAFGGAVSDAMIASWSAGAAPVSLGAWDYATFAGTENPLDERDRWYPLPTYTGFKKAGGAAIGLDAGHNASG
Ga0257149_103717413300026355SoilLLVITLTAVVAIADGPIVSSSFVGMEDPLSENGAWAALTSLAPTGTRFQKNNGAYPDRLISANHAGARTTAVLPADHYSEIVVGHVGNIINHVNNVGPTVRVQASGSSIDSHYLWWASGTNGVNNLYRIDANGATYTANSIMPTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGLDATKYSTGAAGMLAYADPGGAVTEAEIISWSAGAAPVSSGTWASSVFTGIE
Ga0209378_101873213300026528SoilMNLIERQHALRQLRMIGRLLGSLLVATLLPVLAVADSPIASSSFVGAEDPLSENGAWAALTSFAPQGTRFQKNNGAYADQLFAKNHAGARTTAVVPADHFSEIVVGHLGSDINNPQGGTPDCCNNVGPIVRVQASGPAIDSHYLWWAGLNINQCCNNALYRIDANGTTYNPAQIFKTSPVVDGDRLRLIARGLVLYGIKNGVRDFIYNTGPDSTKYSTGTTGMLAHPSTAVTDAKIASWSSGAAPVSLATWTSSTFAGTEDPLDEGDRWYPLPGYSGFKKSGGFASARPPLHIGDTLHNASGVWSIT
Ga0209056_1025602913300026538SoilMALLMSGLLAAVLVPATAVAAGAIALSSFVGTENPLSENGLWAPITSLSPNGGRFQKNNGAFPTLLAPDHAGARTTAAVPADHYSEIVVGHVGPTVPALTTYNNVGPVVRVQTAGASLDSHYLWWAAQPNGVNGLYRIDATGTTYQPNLLMQTSSVVDGDRLRLIARGNVIYGIKNGVVRDFIYNTGANATRLGGGSTGMLAYTNTNVSDAVIASWSTGAAPTSSGTVASSTFAGVEDPLDEGDRWYPLPGYQGFRKAGGVAVGRESGHNASGVWSI
Ga0209161_1001657923300026548SoilMNTHNYAFRMMGWLLGGLLVTTLIPVMVAAGEPIASSRFVGAEDPLSENGAWAALTALSPNGTRFQKNNGAYPDQLIHLDHAGARTTVAVPADHYSEIVVGHVGSAGNNVGPIVRVQTSGPSMDSHYLWWASLTNGINALYRIDANGATYTANWIVASSPIVDGDVLRLVARGQVIYGIKNGVRDFVYNTSPDTTQYSTGTAGMLAYAFGGAVSDAMIASWSAGAAPVSLGAWDYATFAGTE
Ga0209161_1017231223300026548SoilMKGEVKPHALRMIARVLGGLLVAMLMPVTAPADGPIASSSFVGVEDPLFENGAWAPLTSLAPHGSRFQKTNGAYPDQGFYPNHGGARTTAAIPADHYSEIVVGHVGSKISNVGPIVRVQTSGPSVDSHYLWWGSTPDGVNGLYRIDANGATYTANLLAPTSSVVDGDKLRLVARGQAIYGIKNGVREFFYRTSTDATRYSGGTAGILAYAGDGVLTNAVIASWSAGAAVSSGAWASSTFAGTENPLDEGDRWYPLPSYSGFKKTGGAAF
Ga0209648_1012821813300026551Grasslands SoilMRGGFLLATLIPALASADPPIASSSFVGVEDPLYENGAWVALTSLAPNGIRFQKNSGAYPDRFISPNHAGARTTAIIPVDHYSEIVVGHVGDNTDNVGPIVRVQTSGSAIDSHYLWWATQMNGHNNLYRLDANGTSYTASPILPTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYVTGTSGMLAYTSGPTLTDAMISWWSSGAAASSSGTWASSTFAGVEN
Ga0209388_109694113300027655Vadose Zone SoilFVGVESPLFENGAWVAITSLSPQGTRFQKNNGAFPDRIVGPDQNHAGARTTAVIPADHYSEIVVGHVGNDRNNVGPIVRVQTSGPTIDSHYLWWASLTTGVNGLFRIDANGTTFIHPRIVGTSPVVDGDRLRLIARGQVIYGIKNGVRDFIYNTGPDTTKYPTGTTGILAFPTGPALTDAMIASWSSGAAPVSSGTWTSSTFAGTENPLDEGDRWYPLPTYSGFRKAGSLAIGQDGGHNASGVWSITPPARQYSEVTLGSVPSGGGGPIVRIDRSGPGQTG
Ga0208990_110014313300027663Forest SoilMKLFDRDALCRLRRMGLFLGVLLATLIPATAIAQPIASSNFVGAESPLSENGAWDALTSLAPQGTRFQKNNGAYPDRIVGPDNNHAGARTTALIPPDHYSEIVVGHLGNNRNNVGPIVRVQASGASIDSHYLWWASLTNGVNNLYRIDANGTTFTAPVIRPGTSPVADGDTLRLIARGPVIYGIKNGVRDFIYNTGPDATKYSTGTAGMLAFVTGPSLTDAMIASWSSGAAPASSGTWASSTFA
Ga0209074_1008575213300027787Agricultural SoilVKKLGLALALLPIAALAQAPVASSNFAGAEDPLFENGAWAALTSLSPNGGQFQKNNGAFPDRFSPDHAGARTTAVLPADHYSEIVVGNLGTSVSNVGPSVRVQTAGAAIDSHYLWWASLPNGLNNLYRIDANGTSYSADPLMATSPVTNGDRLRLVARGLVIYGIKNGVRDFIYNTGPDVTKLAGGTSGILAFTGDSALTDATIASWSTGPAPASSGTQDSSLFLGVENPLDEGDRWYPLPGYSGFRKAGGVAMGRDFDHNAEAAWSISPPAAQ
Ga0209180_1003471743300027846Vadose Zone SoilVTLVGVRRRLRRTRGGFLGGFLVATLIPTLASADPPIASSSFVGAEDPLSENGAWAPLTSVAPQGTRFQKNNGAYPDRFVGPDHNHAGARTTAIIPPDHYSEIVVGHVGTNSDNVGPIVRVQTSGSTIDSHYLWWATQTNGHNALYRVDANGTSFTASPILTTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNIGPDTTKYSTGTAGMLAFNSGPTLTDAMITSWSSGAAPGSSGTWASSTFAGIENPLDEGDRWYPLPTY
Ga0209180_1007004113300027846Vadose Zone SoilMNLVELQRAPRTLRAMGRLPVGLLVVMLMAIMAIADGPIASSSFVGAEDPLSENGAWAALTSLSPHGSRFQKNNGAYPDLPGYPGGNHAGARTTAVIPADHYSEIVVGHVGNNITNFNNVGPIVRVQASGPSIDSHYLWWASVANGVNGLYRIDANGTAYTANGIMPTSPVIDGDRLRLIARGPVIYGIKNGVRDFIYNTGPDATKYSTGTTGMLAYAGDGGVTNAKIASWSTGAAPVSSGTWASSTFTGIENPLDEGDRWYPLPGYSGFKKVGGLAIGRDSVH
Ga0209180_1022850823300027846Vadose Zone SoilMNLVDLHRTLCVVGLLAVALLPVGAIADTPIASSNFLGAEDPLSENGAWAALTSLSPNGVRFQKNNGAYPDLFISPNHAGARSTAAIPVDHYSEILVGHVAGNTNNVGPIVRVQPSGSAVDSHYLWWGSGTGGTVNYLYRVDANGISYSASPILPTSPIADGDRLRLIARGLLIYGIHNGAREFIYNTGPDTTKYSTGSAGMLAYTSSPTLTDATIASWSSGAAPASSGEPGFPVLDYIGFHYDAGSAHSVIRCSSVHHSPDDGSS
Ga0209701_1010944423300027862Vadose Zone SoilMNLVELQRAPRTLRAMGRLPVGLLVVMLMAIMAIADGPIASSSFVGAEDPLSENGAWAALTSLSPHGSRFQKNNGAYPDLPGYPGGNHAGARTTAVIPADHYSEIVVGHVGNNITNFNNVGPIVRVQASGPSIDSHYLWWASVVANGVNNLYRIDANGTTYTANRIMSTSPIVDGDILRLIARGPVIYGIKNGLRDFIYNTGPDATKYSTGTTGMLAYAGDGGVTNAKIASWSTAAAPVSSGTRASSTFTGIENPLDEGDRW
Ga0209283_1013901413300027875Vadose Zone SoilVKLAGLVGVMVVSAAATAAPIASSSFAGAEDPLFENGAWVGLTSLSPNGGRFQKNNGAFPDRFSPDHAGARTTVVLPADHYSEIVVGHVGTTTSNVGPTVRVQTSGAAIDSHYLWWATLPNGLNNLYRIDANGTSYTADPLIATSPVTDGDDLRLIARGPVIYGIKNGVRDFIYNTGPDATKVGGGTAGMLAFTGDQSLNDASIASWAGGAAPVSSGTRDSSNF
Ga0209283_1057179413300027875Vadose Zone SoilIPTLASADPPIASSSFVGAEDPLSENGAWAPLTSLAPQGTRFQKNNGAYPDRFVGPDHNHAGARTTAIIPPDHYSEIVVGHVGTNSDNVGPIVRVQTSGSTIDSHYLWWATQTNGHNALYRVDANGTSFTASPILTTSPVADGDRLRLIARGQVIYGIKNGVRDFIYNIGPDTTKYSTGTAGMLAFNSGPTLTDAMITSWSSGAAPGSSGTWASSTFAGIENPLDEGDRWYPLPTYSGF
Ga0209590_1059566113300027882Vadose Zone SoilITAIGAGPIASSDFAGAENPLSENGAWAALTSLSPAGGRFQKNNGAFPNQSPPDHAGARTTIALPADHYSEIVVGHVGGNTNNVGPIVRVQTSGAAVDSHYLWWASQASGVNNLYRIDANGTSYRADLILPTSSVVDGDRLRLIARGPVIYGIKNGVREFIYNTGPDATRYSAGTAGILAWAGNAVETDSQIASWSAGAAPASSGTWASSSFIGAENPLDEGDRWYPLPGYSGFRK
Ga0222749_1006517613300029636SoilMNLVEPHRSLRRVARLPVSLLVAMLMSIMAIADGPIATSSFVGAEDPLSENGAWAPLTSLAPNGTRFEKNNGAYPDRLISANHAGARTTAVLPPDHYSEIVVGHVGNIMNHVNNVGPTVRVQTSGPSIDSHYLWWASGTNGVNNLYRIDANGTTYTAIAIVPTSPVIDGDRLRLIARGSVLYGIKNGVRDFIYNTGPDGTKYSAGTAGMLAYADPGGAVTDAEIVSWSAGAAPVSSGTWASTTFTGIENPLDEGDRWYPLPGYSGFKKAGGFAIGRDPGHNASGVWSITPPAKQYSEVTLGTATSGGG
Ga0222749_1046292413300029636SoilFVGAEDPLSENGAWAPLTSLSPNGSRFQKNNGAYPDQPNYPNHAGARTTAAVPADQYSEIVVGHVGSTSSNVGPIVRVQASGTSIDSHYLWWASQSNGVNNLYRIDANGTTYTANSIMPSSPVADGDRLRLIARGPVIYGIKNGVRDFIYNTGPDTTKYPTGSAGILAYAGDGVVTNSTIASWSAGTAPASSGTWASSTFLGIENPLDAGDRWYKMQGYCGFQKVF
Ga0307477_1037471713300031753Hardwood Forest SoilMILVQLRRALRLLRALGRVHVALFVLLLLTVMTIANGPIASSSFVGVEDPPSENGAWAALTSLSPNGSRFQKNNGAYPDQPGNNVNHAGARATAVVPADHYSEIVVGHLGNNVNNHNNVGPIVRVQASGPSIDSHYLWWASVATNGVNNLYRIDANGTSYTANSIVPTSPVVDGDRLRLMARGPVIYGIKNGVRDFIYNTGPDATKYSTGTTGMLAYAGDGGVTNAMIASWSNWCCSRLVWDLGLLDLYWE
Ga0307477_1057723513300031753Hardwood Forest SoilVLMSAIAIAVGPITSSSFVGTEDPLSENGAWAALTSLSPNGSRFQKNNGAYPDQPGNNVNHAGARTTAVVPADHYSEVVVGHLGNNVSNHNNLGPIVRVQASGPSIDSHYLWWESVAANGVYGLYRIDANGTSYTANLIMPTSPAVDGDRLRLIARGPVIYGIKNGVRDFIYNTGPDVTKYSTGTTGMLAYAGDGGLTNAMIASWSTGAAPLSSGTWASSTFTGTENPLDEGDRWYPLPGYLGFRKAGGQAI
Ga0307475_1048568413300031754Hardwood Forest SoilMNLVEPHRSLRRVARLPVSLLVAMLMSIMAIADGPIATSSFVGAEDPLSENGAWAPLTSLSPNGTRFQKNNGAYPDRLISANHAGARTTAVLPPDHYSEIVVGHVGNIMNHVNNVGPTVRVQASGPSIDSHYLWWASGTNGVNNLYRIDANGTTYTAIAIMPTSPVIDGDRLRLIARGSVLYGIKNGVRDFIYNTGPDGTKYSAGTAGMLAYADPGGAVTDAEIVSWSAGAAPVSSGTWASTTFTGIENPLDEGDRWYPLPGYSGFKKAGGFAIGRDPGHNASGVWSITPPAKQYSEVTLGT
Ga0307475_1071548613300031754Hardwood Forest SoilMNLIELQRVLRTLRRLPVSLLVVMLMSTMAIADGPIASSSFVGAEDPLSENGAWAPLASLAPNGTRFQKNNGAYPDQLISANHAGARTTAVVPADHYSEIVVGHVGNIINHVNNVGPIVRVQASGSSIDSHYLWWAGGTNGVNNLYRIDANGTTYTANPIMPTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYSTGPDATKYSTGTVGMLAYADPGGTATEAEIASWSAGAAPVSSGTWDSSTFTGIENPLD
Ga0307473_1078511813300031820Hardwood Forest SoilMAIADGPIASSSFVGVEDPLSENGAWAPLTSLAPAGTRFQKNNGAYPDQLISANHAGARTTAVLPADHYSEIVVGHVGNIINHVNNVGPTVRVQASGPSIDSHYLWWASGANGVNNLYRIDANGTTYTANPIMPTSPVVDGDRLRLIARGPVIYGTKNGVRDFIYNTGPDATKYSTGTLGMLAYADPGGTATEAEIASWSAGAAPVSSGTWDSSTFTGIENPLDE
Ga0307479_1023034323300031962Hardwood Forest SoilMILVQLHRAPRLLRALSRLPIALFVVLLLSVMTIADGPIASSSFVGVEDPLSENGAWAALTSLSPNGSRFQKNNGAYPGQPGNNVNNAGAHTTAVVPADHYSEIVVGHVGNNVNNHNNLGPIVRVQASGPSIDSHYLWWASVATNGVNNLYRIDANGASYTANLIMPTSPVVDGDRLRLIARGPVIYGIKNGVRDFIYNTGRDATKYSTGTTGMLAYAGDGGVTNAMIASWSTGAAPVSTGTWASSTFTGAENPLDEGDRWYPLPGYPGFKKAGGQAIGLSSGHDASGVWSIAPPANQYSQVTLGTVAS
Ga0307471_10106214013300032180Hardwood Forest SoilMNLIELQRVLRTLRRLPVSLLVVMLMSTMAIADGPIASSSFVGAEDPLSENGAWAPLTSLAPNGTRFQKNNGAYPDQLISANHAGARTTAVVPADHYSEIVVGHVGNIINHVNNVGPIVRVQASGSSIDSHYLWWASGTNGVNNLYRIDANGTTYTANPIMPTSPVVDGDRLRLIARGPVIYGTKNGVRDFIYNTGPDATKYSTGTLGMLAYADPGGTATEAEIASWSAGAAPVSSGTWGSSTFTGIENPLDE


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.