NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F034511

Metagenome Family F034511

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F034511
Family Type Metagenome
Number of Sequences 174
Average Sequence Length 207 residues
Representative Sequence LSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGSSLDRRGFPEAKTKKRSLHLPSFNQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYFLNYNAVLVLAGLLGVLIFQLLPRKLQSRNLQD
Number of Associated Samples 84
Number of Associated Scaffolds 174

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 64.53 %
% of genes near scaffold ends (potentially truncated) 41.95 %
% of genes from short scaffolds (< 2000 bps) 55.75 %
Associated GOLD sequencing projects 66
AlphaFold2 3D model prediction Yes
3D model pTM-score0.33

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (87.931 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(51.149 % of family members)
Environment Ontology (ENVO) Unclassified
(46.552 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(54.598 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 56.13%    β-sheet: 0.00%    Coil/Unstructured: 43.87%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.33
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 174 Family Scaffolds
PF13614AAA_31 28.16
PF01656CbiA 8.62
PF12773DZR 6.32
PF08192Peptidase_S64 3.45
PF03176MMPL 3.45
PF01596Methyltransf_3 1.15
PF13649Methyltransf_25 1.15
PF12847Methyltransf_18 0.57
PF10996Beta-Casp 0.57
PF00291PALP 0.57
PF07995GSDH 0.57
PF01070FMN_dh 0.57
PF09754PAC2 0.57
PF13489Methyltransf_23 0.57
PF08327AHSA1 0.57
PF08713DNA_alkylation 0.57

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 174 Family Scaffolds
COG1033Predicted exporter protein, RND superfamilyGeneral function prediction only [R] 3.45
COG2409Predicted lipid transporter YdfJ, MMPL/SSD domain, RND superfamilyGeneral function prediction only [R] 3.45
COG2518Protein-L-isoaspartate O-methyltransferasePosttranslational modification, protein turnover, chaperones [O] 1.15
COG4122tRNA 5-hydroxyU34 O-methylase TrmR/YrrMTranslation, ribosomal structure and biogenesis [J] 1.15
COG4123tRNA1(Val) A37 N6-methylase TrmN6Translation, ribosomal structure and biogenesis [J] 1.15
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 0.57
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 0.57
COG2133Glucose/arabinose dehydrogenase, beta-propeller foldCarbohydrate transport and metabolism [G] 0.57
COG49123-methyladenine DNA glycosylase AlkDReplication, recombination and repair [L] 0.57


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms98.85 %
UnclassifiedrootN/A1.15 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2007427000|2007478740All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon808Open in IMG/M
3300002558|JGI25385J37094_10004921All Organisms → cellular organisms → Archaea4689Open in IMG/M
3300002558|JGI25385J37094_10005434All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria4477Open in IMG/M
3300002558|JGI25385J37094_10099563All Organisms → cellular organisms → Archaea864Open in IMG/M
3300002558|JGI25385J37094_10159714All Organisms → cellular organisms → Archaea603Open in IMG/M
3300002560|JGI25383J37093_10001608All Organisms → cellular organisms → Archaea6351Open in IMG/M
3300002561|JGI25384J37096_10001385All Organisms → cellular organisms → Archaea7949Open in IMG/M
3300002561|JGI25384J37096_10003311All Organisms → cellular organisms → Archaea5707Open in IMG/M
3300002561|JGI25384J37096_10018806All Organisms → cellular organisms → Archaea2689Open in IMG/M
3300002562|JGI25382J37095_10004006All Organisms → cellular organisms → Archaea5050Open in IMG/M
3300002562|JGI25382J37095_10099631All Organisms → cellular organisms → Archaea1030Open in IMG/M
3300002562|JGI25382J37095_10239112All Organisms → cellular organisms → Archaea548Open in IMG/M
3300002908|JGI25382J43887_10000039All Organisms → cellular organisms → Archaea24384Open in IMG/M
3300002908|JGI25382J43887_10003130All Organisms → cellular organisms → Bacteria7463Open in IMG/M
3300002908|JGI25382J43887_10003975All Organisms → cellular organisms → Archaea6829Open in IMG/M
3300002908|JGI25382J43887_10021146All Organisms → cellular organisms → Archaea3465Open in IMG/M
3300002908|JGI25382J43887_10024084All Organisms → cellular organisms → Archaea3263Open in IMG/M
3300002908|JGI25382J43887_10037127All Organisms → cellular organisms → Archaea2644Open in IMG/M
3300002908|JGI25382J43887_10074885All Organisms → cellular organisms → Archaea1827Open in IMG/M
3300002908|JGI25382J43887_10095142All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1581Open in IMG/M
3300002912|JGI25386J43895_10001199All Organisms → cellular organisms → Archaea6040Open in IMG/M
3300002912|JGI25386J43895_10002585All Organisms → cellular organisms → Archaea4616Open in IMG/M
3300002912|JGI25386J43895_10035385All Organisms → cellular organisms → Archaea1465Open in IMG/M
3300002912|JGI25386J43895_10066277All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon997Open in IMG/M
3300005167|Ga0066672_10282313All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1075Open in IMG/M
3300005174|Ga0066680_10064406All Organisms → cellular organisms → Archaea2182Open in IMG/M
3300005174|Ga0066680_10098366All Organisms → cellular organisms → Archaea1787Open in IMG/M
3300005174|Ga0066680_10103415All Organisms → cellular organisms → Archaea1745Open in IMG/M
3300005174|Ga0066680_10240158All Organisms → cellular organisms → Archaea1150Open in IMG/M
3300005180|Ga0066685_10027080All Organisms → cellular organisms → Archaea3514Open in IMG/M
3300005181|Ga0066678_10196679All Organisms → cellular organisms → Archaea1285Open in IMG/M
3300005446|Ga0066686_10479138All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon848Open in IMG/M
3300005447|Ga0066689_10157811All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1353Open in IMG/M
3300005468|Ga0070707_102273182All Organisms → cellular organisms → Archaea510Open in IMG/M
3300005518|Ga0070699_100048507All Organisms → cellular organisms → Archaea3675Open in IMG/M
3300005518|Ga0070699_100953429All Organisms → cellular organisms → Archaea786Open in IMG/M
3300005536|Ga0070697_100016995All Organisms → cellular organisms → Archaea5720Open in IMG/M
3300005536|Ga0070697_100331279All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1312Open in IMG/M
3300005552|Ga0066701_10342559All Organisms → cellular organisms → Archaea926Open in IMG/M
3300005554|Ga0066661_10248169All Organisms → cellular organisms → Archaea1102Open in IMG/M
3300005555|Ga0066692_10159925All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1388Open in IMG/M
3300005555|Ga0066692_10953604All Organisms → cellular organisms → Archaea525Open in IMG/M
3300005556|Ga0066707_10023640All Organisms → cellular organisms → Bacteria3305Open in IMG/M
3300005558|Ga0066698_10001137All Organisms → cellular organisms → Archaea11268Open in IMG/M
3300005559|Ga0066700_10108784All Organisms → cellular organisms → Archaea1828Open in IMG/M
3300005559|Ga0066700_10352466All Organisms → cellular organisms → Bacteria1039Open in IMG/M
3300005561|Ga0066699_10290274All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1163Open in IMG/M
3300005568|Ga0066703_10011390All Organisms → cellular organisms → Bacteria4302Open in IMG/M
3300005568|Ga0066703_10452794All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon769Open in IMG/M
3300005568|Ga0066703_10798545All Organisms → cellular organisms → Archaea540Open in IMG/M
3300005568|Ga0066703_10860730All Organisms → cellular organisms → Archaea517Open in IMG/M
3300005586|Ga0066691_10380312All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon837Open in IMG/M
3300005598|Ga0066706_10012726All Organisms → cellular organisms → Bacteria4759Open in IMG/M
3300006797|Ga0066659_10002951All Organisms → cellular organisms → Archaea8006Open in IMG/M
3300007255|Ga0099791_10023192All Organisms → cellular organisms → Bacteria2682Open in IMG/M
3300007255|Ga0099791_10069926All Organisms → cellular organisms → Archaea1591Open in IMG/M
3300007258|Ga0099793_10019970All Organisms → cellular organisms → Archaea2739Open in IMG/M
3300007258|Ga0099793_10133871All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1167Open in IMG/M
3300007258|Ga0099793_10145532All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1120Open in IMG/M
3300007258|Ga0099793_10339531All Organisms → cellular organisms → Archaea733Open in IMG/M
3300007265|Ga0099794_10575059All Organisms → cellular organisms → Archaea596Open in IMG/M
3300009012|Ga0066710_100036670All Organisms → cellular organisms → Bacteria5834Open in IMG/M
3300009038|Ga0099829_10009497All Organisms → cellular organisms → Archaea6236Open in IMG/M
3300009038|Ga0099829_10012822All Organisms → cellular organisms → Bacteria5506Open in IMG/M
3300009088|Ga0099830_10208715All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1531Open in IMG/M
3300009088|Ga0099830_10426244All Organisms → cellular organisms → Archaea1075Open in IMG/M
3300009088|Ga0099830_10539478All Organisms → cellular organisms → Archaea954Open in IMG/M
3300009089|Ga0099828_10191472All Organisms → cellular organisms → Archaea1823Open in IMG/M
3300009089|Ga0099828_10619966All Organisms → cellular organisms → Archaea973Open in IMG/M
3300009089|Ga0099828_10704614All Organisms → cellular organisms → Archaea906Open in IMG/M
3300009089|Ga0099828_10857296All Organisms → cellular organisms → Archaea812Open in IMG/M
3300009089|Ga0099828_11927786All Organisms → cellular organisms → Archaea518Open in IMG/M
3300009089|Ga0099828_11960767All Organisms → cellular organisms → Archaea513Open in IMG/M
3300009090|Ga0099827_10017379All Organisms → cellular organisms → Bacteria4893Open in IMG/M
3300009090|Ga0099827_10023823All Organisms → cellular organisms → Archaea4299Open in IMG/M
3300009090|Ga0099827_10254417All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1477Open in IMG/M
3300009090|Ga0099827_10981321All Organisms → cellular organisms → Archaea733Open in IMG/M
3300009090|Ga0099827_11446323All Organisms → cellular organisms → Archaea598Open in IMG/M
3300010304|Ga0134088_10003294All Organisms → cellular organisms → Bacteria6529Open in IMG/M
3300011269|Ga0137392_10264069All Organisms → cellular organisms → Archaea1418Open in IMG/M
3300011270|Ga0137391_10678687All Organisms → cellular organisms → Archaea858Open in IMG/M
3300011271|Ga0137393_10061888All Organisms → cellular organisms → Bacteria2958Open in IMG/M
3300012189|Ga0137388_10090978All Organisms → cellular organisms → Archaea2598Open in IMG/M
3300012189|Ga0137388_10442154All Organisms → cellular organisms → Archaea1206Open in IMG/M
3300012189|Ga0137388_11465020All Organisms → cellular organisms → Archaea621Open in IMG/M
3300012198|Ga0137364_10322289All Organisms → cellular organisms → Archaea1150Open in IMG/M
3300012199|Ga0137383_10055949All Organisms → cellular organisms → Archaea2825Open in IMG/M
3300012203|Ga0137399_10022731All Organisms → cellular organisms → Bacteria4150Open in IMG/M
3300012203|Ga0137399_10043291All Organisms → cellular organisms → Archaea3216Open in IMG/M
3300012203|Ga0137399_10047791All Organisms → cellular organisms → Archaea3087Open in IMG/M
3300012203|Ga0137399_10071307All Organisms → cellular organisms → Bacteria2607Open in IMG/M
3300012203|Ga0137399_10112119All Organisms → cellular organisms → Archaea2130Open in IMG/M
3300012203|Ga0137399_11606072All Organisms → cellular organisms → Archaea538Open in IMG/M
3300012206|Ga0137380_10083287All Organisms → cellular organisms → Archaea2927Open in IMG/M
3300012206|Ga0137380_10326578All Organisms → cellular organisms → Archaea1371Open in IMG/M
3300012206|Ga0137380_10334551All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1352Open in IMG/M
3300012206|Ga0137380_11272127All Organisms → cellular organisms → Archaea621Open in IMG/M
3300012206|Ga0137380_11444037All Organisms → cellular organisms → Archaea573Open in IMG/M
3300012207|Ga0137381_10345058All Organisms → cellular organisms → Archaea1298Open in IMG/M
3300012207|Ga0137381_10454372All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1118Open in IMG/M
3300012207|Ga0137381_10531083All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1026Open in IMG/M
3300012207|Ga0137381_10621890All Organisms → cellular organisms → Archaea940Open in IMG/M
3300012209|Ga0137379_10016311All Organisms → cellular organisms → Bacteria7122Open in IMG/M
3300012209|Ga0137379_10243290All Organisms → cellular organisms → Archaea1712Open in IMG/M
3300012209|Ga0137379_10274547All Organisms → cellular organisms → Archaea1597Open in IMG/M
3300012209|Ga0137379_10340847All Organisms → cellular organisms → Archaea1409Open in IMG/M
3300012209|Ga0137379_11608153All Organisms → cellular organisms → Archaea548Open in IMG/M
3300012210|Ga0137378_10136638All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Thermoflexia → Thermoflexales → Thermoflexaceae → Thermoflexus → Thermoflexus hugenholtzii2268Open in IMG/M
3300012210|Ga0137378_10163750All Organisms → cellular organisms → Archaea2064Open in IMG/M
3300012210|Ga0137378_11489466All Organisms → cellular organisms → Archaea588Open in IMG/M
3300012349|Ga0137387_10047717All Organisms → cellular organisms → Archaea2845Open in IMG/M
3300012349|Ga0137387_10119933All Organisms → cellular organisms → Archaea1853Open in IMG/M
3300012349|Ga0137387_10271880All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1223Open in IMG/M
3300012351|Ga0137386_10041892All Organisms → cellular organisms → Bacteria3144Open in IMG/M
3300012351|Ga0137386_10177492All Organisms → cellular organisms → Archaea1528Open in IMG/M
3300012351|Ga0137386_10483118All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon892Open in IMG/M
3300012357|Ga0137384_10121003All Organisms → cellular organisms → Archaea2189Open in IMG/M
3300012359|Ga0137385_10176200All Organisms → cellular organisms → Archaea1874Open in IMG/M
3300012359|Ga0137385_10284863All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1424Open in IMG/M
3300012359|Ga0137385_10311999All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1352Open in IMG/M
3300012359|Ga0137385_10554879All Organisms → cellular organisms → Archaea969Open in IMG/M
3300012359|Ga0137385_11537401All Organisms → cellular organisms → Archaea529Open in IMG/M
3300012918|Ga0137396_10005792All Organisms → cellular organisms → Archaea7213Open in IMG/M
3300012918|Ga0137396_10021923All Organisms → cellular organisms → Archaea4114Open in IMG/M
3300012918|Ga0137396_10094834All Organisms → cellular organisms → Archaea2121Open in IMG/M
3300012918|Ga0137396_10281259All Organisms → cellular organisms → Archaea1229Open in IMG/M
3300012918|Ga0137396_10544377All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon860Open in IMG/M
3300012925|Ga0137419_10067772All Organisms → cellular organisms → Archaea2366Open in IMG/M
3300012927|Ga0137416_10002780All Organisms → cellular organisms → Archaea9430Open in IMG/M
3300012927|Ga0137416_10072340All Organisms → cellular organisms → Archaea2505Open in IMG/M
3300012927|Ga0137416_10165164All Organisms → cellular organisms → Archaea1747Open in IMG/M
3300012927|Ga0137416_11834427All Organisms → cellular organisms → Archaea554Open in IMG/M
3300012944|Ga0137410_10958789All Organisms → cellular organisms → Archaea726Open in IMG/M
3300012972|Ga0134077_10010885All Organisms → cellular organisms → Archaea2967Open in IMG/M
3300012976|Ga0134076_10013805All Organisms → cellular organisms → Archaea2792Open in IMG/M
3300015358|Ga0134089_10010007All Organisms → cellular organisms → Archaea3034Open in IMG/M
3300017659|Ga0134083_10000890All Organisms → cellular organisms → Archaea8238Open in IMG/M
3300018468|Ga0066662_10035560All Organisms → cellular organisms → Archaea3017Open in IMG/M
3300018468|Ga0066662_10071364All Organisms → cellular organisms → Archaea2327Open in IMG/M
3300021046|Ga0215015_10806985All Organisms → cellular organisms → Archaea1169Open in IMG/M
3300021046|Ga0215015_11079142All Organisms → cellular organisms → Archaea6452Open in IMG/M
3300021088|Ga0210404_10216063All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1033Open in IMG/M
3300024330|Ga0137417_1144353All Organisms → cellular organisms → Archaea2214Open in IMG/M
3300024330|Ga0137417_1319598All Organisms → cellular organisms → Archaea1904Open in IMG/M
3300024330|Ga0137417_1371297All Organisms → cellular organisms → Archaea2735Open in IMG/M
3300025922|Ga0207646_10161396All Organisms → cellular organisms → Archaea2023Open in IMG/M
3300026295|Ga0209234_1158222All Organisms → cellular organisms → Archaea797Open in IMG/M
3300026296|Ga0209235_1000437All Organisms → cellular organisms → Archaea20704Open in IMG/M
3300026297|Ga0209237_1000170All Organisms → cellular organisms → Archaea33567Open in IMG/M
3300026298|Ga0209236_1051764All Organisms → cellular organisms → Archaea2043Open in IMG/M
3300026298|Ga0209236_1094445All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1357Open in IMG/M
3300026313|Ga0209761_1002054All Organisms → cellular organisms → Archaea13152Open in IMG/M
3300026313|Ga0209761_1004038All Organisms → cellular organisms → Archaea9724Open in IMG/M
3300026313|Ga0209761_1009209All Organisms → cellular organisms → Archaea6435Open in IMG/M
3300026317|Ga0209154_1137573All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1024Open in IMG/M
3300026325|Ga0209152_10001548All Organisms → cellular organisms → Bacteria9074Open in IMG/M
3300026328|Ga0209802_1123957All Organisms → cellular organisms → Archaea1130Open in IMG/M
3300026328|Ga0209802_1219110All Organisms → cellular organisms → Archaea707Open in IMG/M
3300026333|Ga0209158_1150942All Organisms → cellular organisms → Archaea850Open in IMG/M
3300026529|Ga0209806_1000202All Organisms → cellular organisms → Archaea37933Open in IMG/M
3300026536|Ga0209058_1018377All Organisms → cellular organisms → Archaea4666Open in IMG/M
3300026538|Ga0209056_10301289All Organisms → cellular organisms → Archaea1103Open in IMG/M
3300027655|Ga0209388_1000358All Organisms → cellular organisms → Archaea9669Open in IMG/M
3300027655|Ga0209388_1139624All Organisms → cellular organisms → Archaea687Open in IMG/M
3300027671|Ga0209588_1134432All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon788Open in IMG/M
3300027875|Ga0209283_10267460All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1132Open in IMG/M
3300027875|Ga0209283_10936518All Organisms → cellular organisms → Archaea521Open in IMG/M
3300027882|Ga0209590_10032515All Organisms → cellular organisms → Bacteria2771Open in IMG/M
3300027882|Ga0209590_10807353All Organisms → cellular organisms → Archaea596Open in IMG/M
3300028536|Ga0137415_10009687All Organisms → cellular organisms → Archaea9556Open in IMG/M
3300028536|Ga0137415_10438066All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1112Open in IMG/M
3300032180|Ga0307471_104330921All Organisms → cellular organisms → Archaea501Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil51.15%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil20.11%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil19.54%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.45%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.87%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.15%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.57%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.57%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.57%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2007427000Uranium contaminated groundwater from Oak Ridge Integrated Field Research Center, TennesseeEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
20074957302007427000GroundwaterLADGVNGVKRRRRPAQKGVQAVFTSDSSWKGRAQVALGLITIGVGTVAWIYSTRGLGGLGLRNVIPWNASLAAVLLPISIPLLIGGVGLCIYHLAMRRTRGATSRIESAFYELEALAGEKDAAPGSSLDPGTVLGAKAARKPRFGLVSKALAVALVEAVILIAIYSGLV
JGI25385J37094_1000492113300002558Grasslands SoilLSTGVKRRRKSAQKGVQAVFTPSNNWKGRAQASLGLIAIGGGTAAWTYTTRALGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLPGVKAGKSRFGLVSKALSMALVEAVILILIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSSSKTSSPKGA*
JGI25385J37094_1000543413300002558Grasslands SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGTPLGGGVIPEAKKGGKRSFHLPSFNVGSKALAIALVEAVLLIIIYGG
JGI25385J37094_1009956313300002558Grasslands SoilLANGVKTRRRTAQKGVXAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSRRLPL
JGI25385J37094_1015971413300002558Grasslands SoilKGVQAVFTPSNNWKGKAQAILGLAAIGAGTAAWTYTTRGLGGFGLGNLIPYDASASAILLPISIPLLISGAGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDPRALPDVKGTRKSRFGLVSKALSIALVEAVTLILIYGGLVREYASNVNMQNWVQANFAPGSYFLNYNGVXALAGLLGVLIFQLLPRKS
JGI25383J37093_1000160883300002560Grasslands SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGXPLGGXVIPEAXKGGKRSFHLPSFNXGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
JGI25384J37096_1000138593300002561Grasslands SoilLANGVKTRRRTAQKGVHAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSPKASPLRGA*
JGI25384J37096_1000331133300002561Grasslands SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGTPLGGGVIPEAKKGGKRSFHLPSFNVGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
JGI25384J37096_1001880633300002561Grasslands SoilLSTGVKRRRKSAQKGVQAVFTPSNNWKGRAQASLGLIAIGGGTAAWTYTTRXLGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLLGVKAGKSRFGLVSKALSMALVEAVILSLIYGGLVREYASNVNMQNWVRANFALGSY
JGI25382J37095_1000400633300002562Grasslands SoilLSTGVKRRRKSAQKGVQAVFTPSXNWKGRAQASLGLIAIGGGTAAWTYTTRALGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLLGVKAGKSRFGLVSKALSMALVEAVILSLIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSSSKTSSPKGA*
JGI25382J37095_1009963123300002562Grasslands SoilLDLLRTDSPLANGVKRRRKTAQKGVQAVFTADTGWKGRAQVAFGLIAIGIGTAAWEYSTRALGGFGLGNLIPWDSSISAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNAALGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLQSSKL*
JGI25382J37095_1023911213300002562Grasslands SoilVNGVKRKRRQAQKGVQAVFTPNNNWKGRAQASLGLAAIGAGAAAWTYTTRGLGGFGLGNLIPYDASASAILLPISIPILIGGVGLCTYYLAMRRTWRASSWIESALYELETLVGQKNGAPGSSPDPGTLLGVKVTGKSRFGLVSKALSIALVEAVILILIYGGLVREYASNVNMQNWVQTNF
JGI25382J43887_10000039193300002908Grasslands SoilMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALCELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN*
JGI25382J43887_10003130133300002908Grasslands SoilVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGTPLGGGVIPEAKKGGKRSFHLPSFNVGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
JGI25382J43887_1000397553300002908Grasslands SoilLANGVKTRRRTAQKGVXAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSPKASPLRGA*
JGI25382J43887_1002114673300002908Grasslands SoilVNGVKRKRRQAQKGVQAVFTPNNNWKGRAQASLGLAAIGAGAAAWTYTTRGLGGFGLGNLIPYDASASAILLPISIPILIGGVGLCTYYLAMRRTWRASSWIESALYELETLVGQKNGAPGSSPDPGTLLGVKVTGKSRFGLVSKALSIALVEAVILILIYGGLVREYASNVNMQNWVQTNFALGSYLLSYNAVLALAGLLGVLIFQLLPRKIRSKSPRDSSSSKTSSHKGA*
JGI25382J43887_1002408423300002908Grasslands SoilLANGVKRRRKTAQKGVQAVFTADTGWKGRAQVAFGLIAIGIGTAAWEYSTRALGGFGLGNLIPWDSSISAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNAALGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLQSSKL*
JGI25382J43887_1003712743300002908Grasslands SoilLANSVNGVKRKRRPAQKGVQAVFTPSNNWKGRAQASLGLGAIGAGAAAWTYTTRGLGGFGLGNLTPYDASASAILLPISVPLLIGGAGLCTYYLAMRRTWRASSRIESALYELEALVGQKNGAPGSSPDLGTLPGVKVTGKSRFGLVSKALSITLVEAVILIMIYGGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGIMIFHLLPRKIR*
JGI25382J43887_1007488523300002908Grasslands SoilLANSVNGVKRKRRQAQKGVQAVFTPNNNWKGRAQASLGLAAIGAGAAAWTYTTRGLGGFGLGNLIPYDASASAILLPISIPILIGGVGLCTYYLAMRRTWRASSWIESALYELETLVGQKNGAPGSSPDPGTLLGVKVTGKSRFGLVSKALSIALVEAVILILIYGGLVREYASNVNMQNWVQTNFALGSYLLSYNAVLALAGLLGVLIFQLLPRKIRSKSPRDSSSSKTSSHKGA*
JGI25382J43887_1009514223300002908Grasslands SoilLANGVKRRRRATQKGVQAVFTADTTWKGRVQVAVGLIAIGIGTAAWAYSTRALGGFGLGSLIPSDSSISAILLPLSIPLLIGGVSVCTYYLAMRRTWRARNRIESALYELEALVGQKNAASGSSAEAAVARETKTATKPSFHLLSKALAVALVEGVLLIAIYGGLVQEYVSNVNMQTWVKANFSPGSYFLNYNGVLALAGLLGVLIFQLLPRKLRSGKLQG*
JGI25386J43895_1000119983300002912Grasslands SoilLANGVKTRRRTAQKGVXAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNXILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSPKASPLRGA*
JGI25386J43895_1000258563300002912Grasslands SoilLSTGVKRRRKSAQKGVQAVFTPSNNWKGRAQASLGLIAIGGGTAAWTYTTRALGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLLGVKAGKSRFGLVSKALSMALVEAVILILIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSSSKTSSPKGA*
JGI25386J43895_1003538523300002912Grasslands SoilLANGVKRRRKATQKGVQAVFTADTSWKGRVQVALGLIAIGIGTAAWAYSTRALSGFGLGSLIPSDSSTSAILLPLSIPLLIGGVSVCTYYLVMRRTWRARNRIESALYELEALVGQKNASSGSSAEAGVARETKTATKPQFLLLSKALPVALIEGVLLIAIYGGLVQEYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLRSSKL*
JGI25386J43895_1006627713300002912Grasslands SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGTPLGGGVIPEAKKGGKRSFHLPSFNXGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGSYLLNYNAVLVLA
Ga0066672_1028231323300005167SoilLSTGVKRRRKPAQKGPQKGVQAVFTPSNNWQGLAQLTLGLAAIATGTAAWTYTTRVLGGFGLGNPILYDTSASAILLPISIPLLIGGVGLCTYYLAMRRTRRASSRIESALYELGALVGQKNGAPGSSPDQGTLPGVKATGKSRFGLASKALSIALVEAVILIMIYGGLVRGYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVLIFQLLPRKIRSRSPRNSSSPKTSPLRGA*
Ga0066680_1006440623300005174SoilLANGVKRRRKATQKGVQAVFTADTSWKGRVQVALGLIAIGIGTAAWAYSTRALSGFGLGSLIPSDSSTSAILLPLSIPLLIGGVSVCTYYLVMRRTWRARNRIESALYELEALVGQKNASSGSSAEAGVARETKTATKPQFLLLSKALAVALIEGVLLVAIYGGLVQEYASNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLRSSKL*
Ga0066680_1009836623300005174SoilLANGVKTRRRTAQKGVQAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSPKASPLRGA*
Ga0066680_1010341523300005174SoilLANSVNGVKRKRRPAQKGVQAVFTPSTNWKGRAQASLGLAAIGAGTAAWTYTTRGLGGFGLGNLIPYDASASAMLLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQKNGAPGSPPDPGTVAGVKVTGKSRFGLVSKALSIALVEAVILILIYGGLVREYVSNVNMQNWVQTNFALGGYLLSYNAVLVLTGLLGVLIFQLLPRKIRSKSPRNSSS*
Ga0066680_1024015813300005174SoilYTTRGLGGFGLGNLIPYDASASAILLPISIPILIGGVGLCTYYLAMRRTWRASSWIESALYELETLVGQKNGAPGSSPDPGTLPGVKVTGKSRFGLVSKALSIALVEAVILILIYGGLVREYASNVNMQNWVQTNFALGSYLLSYNAVLALAGLLGVLIFQLLPRKIRSKSPRDSSSSKTSSHKGA*
Ga0066685_1002708023300005180SoilMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALCELEALVGQKNVAPGTSLDGRGFPEVKTRKRSFHLPSLSVVSMSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRTLQPKKLQN*
Ga0066678_1019667923300005181SoilLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0066686_1047913823300005446SoilLSTGVKRKRRPAQKGVQAVFTPSTNWRGRAQLTLGLIAMGAGTASWIYTTRALGGFGLGNLIPWNALVSALLLPISIPLLIGGVGLCIYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLN
Ga0066689_1015781113300005447SoilLANGVKTRRRTAQKGVQAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSS
Ga0070707_10227318213300005468Corn, Switchgrass And Miscanthus RhizosphereRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLFPWNETASAILLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELETLVGQKNGALGSSPDQGRLPDMGVARKSRFGFVSKALSVALVEAVLLILIYSGLVQEYASNVNMQN
Ga0070699_10004850723300005518Corn, Switchgrass And Miscanthus RhizosphereLSTGAKRKRRPAQKGVQAVFTPSTNWKGRAQLALGLVALGAGTASWIYTTRELRGFGLGNLIPWNESVSAILLPISIPLLIGGVGLCTYFLAMRRTWRASSRIESALYELEALVGQRNGVSGASSEARVVTEANSRKRSFHLPSFNIVSKALAIALVEAVVLIVIYGGLVQEYASNVNMQNWIRANFAPGTYLLNYNAMLVLAGLLGVLIFQLLPRKLQSRKLQS*
Ga0070699_10095342923300005518Corn, Switchgrass And Miscanthus RhizosphereTASWVYTTRALGGFGLGNLIPWNETVSALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGVGVIPEAKNGGKRSFHFPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQVNFAPGSYLLNYNAVLVLAGLLGLLIFQLLPRKLESKKLKAGPGL*
Ga0070697_10001699513300005536Corn, Switchgrass And Miscanthus RhizosphereLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLISWNETASAILLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELETLVGQKNGALGSSPDQGRLPDMGVARKSRFGFVSKALSVALVEAVILILIYSGLVQEYASNVNMQNWVQANFAPGSYLLNYNAVLVLAGLLGVLIFQLLPRKLQSKKLKAGPGL*
Ga0070697_10033127913300005536Corn, Switchgrass And Miscanthus RhizosphereLSTGAKRKRRPAQKGVQAVFTPSTNWKGRAQLAFGLVALGAGTASWIYTTRELGGFGLGNLIPWNESVSAILLPISIPLLIGGVGLCTYFLAMRRTWRASSRIESALYELEALVGQRNGVSGASSEARVVTEANSRKRSFHLPSFNIVSKALAIALVEAVLLIVIYGGLVQEYASNVNMQNWIRANFAPGTYLLNYNAMLVLAGLLGVLIFQLLPRKLQSRKLQS*
Ga0066701_1034255923300005552SoilLIAIGIGTAAWEYSTRALGGFGLGNLIPWDSSISAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNAALGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLQSSKL*
Ga0066661_1024816913300005554SoilITISAGTASWIYTNRALGGFGLGNLIPWNASASALLLPISIPLLIGGVGVCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFRLPSLSVVSKSVAIALVEAVLLIVIYGGLVREYASNVNMQNWVQANFALGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN*
Ga0066692_1015992523300005555SoilLANSVNGVKRKRRPAQKGVQAVFTPSTNWKGRAQASLGLAAIGAGTAAWTYTTRGLGGFGLGNLIPYDASASAMLLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLPGVKAGKSRFGLVSKALSMALVEAVILILIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSS
Ga0066692_1095360413300005555SoilLRTDSPLANGVKRRRKATQKGVQAVFTADTSWKGRVQVALGLIAIGIGTAAWAYSTRALSGFGLGSLIPSDSSTSAILLPLSIPLLIGGVSVCTYYLVMRRTWRARNRIESALYELEALVGQKNASSGSSAEAGVARETKTATKPQFLLLSKALPVALIEGVLLIAIYGGLVREY
Ga0066707_1002364033300005556SoilLPTRYLPSLGPPRTDPILSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVPKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0066704_1003874053300005557SoilLGGFGLGNLIPYDASASAILLPISIPLLISGAGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDPRALPDVKGTRKSRFGLVSKALSIALVEAVILILIYGGLVREYVSNVNMQNWVQTNFALGGYLLSYNAVLVLTGLLGVLIFQLLPRKIRSKSPRNSSS*
Ga0066698_1000113723300005558SoilMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKV*
Ga0066700_1010878433300005559SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0066700_1035246623300005559SoilLSGVKGRRKAAKKDVQAVFTPGSNWKERSQAIIGLAAIGAGAASWTYTTRVFGGSGLGNLVAWNAAASAILLPVSIPLLIGGVGLCTYYFAMRRTWRASNRIESALLELEALVSQKNATSSPWLDQGTVPDAKTARRLRFDFLPRTLAIALVEAVILVIIYSGLVQEYVSNVNMRNWVQANFAPGIYLLNYYVVFILAGLLGMLIFRLLPRKPQPKEVQKTSSTIESKRSYG*
Ga0066699_1029027423300005561SoilLSGVKGRRKAAKKDVQAVFTPGSNWKERSQAIIGLAAIGAGAASWTYTTRVFGGSGLGNLVAWNAAASAILLPVSIPLLIGGVGLCTYYFAMRRTWRASNRIESALLELEALVSQKNATSSPWLDQRTVPDAKTAIRLRFDFLPRTLAIALVEAVLLVIIYSGLVQEYVSNVNMRNWVQANFAPGIYLLNYYVVFILAGLLGMLIFRLLPRKPQPKEVQKTSSTIESKRSYG*
Ga0066703_1001139023300005568SoilVQKGVQAVFTPSTNWKGRAQLTLGLITISAGTASWIYTNRALGGFGLGNLIPWNASASALLLPISIPLLIGGVGVCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFRLPSLSVVSKSVAIALVEAVLLIVIYGGLVREYASNVNMQNWVQANFALGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN*
Ga0066703_1045279423300005568SoilLANGVKTRRRTAQKGVQAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVR
Ga0066703_1079854513300005568SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVH
Ga0066703_1086073013300005568SoilAIGAGAASWTYTTRVFGGSGLGNLVAWNASASAILLPVSIPLLIGGVGLCTYYFAMRRTWRASNRIESALLELEALVNQKNATSSPWLDQGTVPDAKTARRLRFDFLPRTLAIALVEAVILVIIYSGLVQEYVSNVNMRNWVQANFAPGIYLLNYYVVFILAGLLGMLIFRL
Ga0066691_1038031223300005586SoilLANGVKTRRRTAQKGVQAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEA
Ga0066706_1001272643300005598SoilLPTRYLPSLGPPRTDPILSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0066659_10002951113300006797SoilLPTRYLPSLGPPRTDPVLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVPKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0099791_1002319213300007255Vadose Zone SoilLSTSIKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLIQWNASASVLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGNRSIHLPSFNLGSKALGVALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAV
Ga0099791_1006992623300007255Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSSNWRGRAQLALGLAAIGAGTAAWTYTTRALGGFGPGNLIPYDASVSAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVSQKNGASGSSLEPRTVPDAKVARKPRFGLVPKALTIALVEAVLLILIYGGLVQEYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVVIFQLLPRKLQAEKLQP*
Ga0099793_1001997063300007258Vadose Zone SoilRPAQKGIQAVFTPSTNWKGRAQLTLGLIAMGAGTASWAYTTRALGGFGPGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQRNAVPGSSLYAGAVNEAKTKKSSLHLPSFNLVSKALAIALVKAVLLIIIYGGLVQEYASNVNMQNWVRANFAPGGYLLNYNAVLVLAGLLGVLIFQLLPRKLQARKLQG*
Ga0099793_1013387113300007258Vadose Zone SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRKTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEAKKGGKRSFHLPSFNLGSKALAIALAEAVLLIIIYGGLVREYVANVNMQNWIRSNFAPGSYLLNYNAVLVIA
Ga0099793_1014553223300007258Vadose Zone SoilLSNGVKRRRKPAQKGLQAVFTPSNNWTGRAQIILGVIAIGAGIVAWTYTTRAFGGAGLGSLIPWDASASAVLLPLSIPLLIGGVGLCTYYLAMRRTWWASNRIESALLELEMLVGQKNPNPALGLDPGTVPVVKRPRRFRLQLPRTLAVALIEAVILVTIYSGLVREYMLNVNMQNWVQTNFIPGIYLLNYYVVLILAGLLGMLIFRLLPRKLQ
Ga0099793_1033953113300007258Vadose Zone SoilWIYTTRALGGFGLGNLIPWNASASSLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNVASGSSSDRGAFSEAKTRKRSFHLPSLSVVSKSVAIALVEAVLLIVIYGGLVREYASNVNMQNWVQANFALGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN
Ga0099794_1057505913300007265Vadose Zone SoilLSTSIKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLIQWNASASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGNRSIHLPSFNLGSKALGVALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGTYLLNYN
Ga0066710_10003667073300009012Grasslands SoilMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN
Ga0099829_1000949723300009038Vadose Zone SoilLSTSVKKRRRPTQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTWSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEAKKGGKRSFHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFALGSYLLNYTAVLVLAGLLGILIFQLVPRKLHSRKLQG*
Ga0099829_1001282223300009038Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGSSLDRRGFPEAKTKKRSLHLPSFNQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYFLNYNAVLVLAGLLGVLIFQLLPRKLQSRNLQD*
Ga0099830_1020871523300009088Vadose Zone SoilMYTTRALGGFGPGNLIPYDASASAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVSQKNGASGSSLEPRTVADAKVARKPRFGLVPKALTVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQAEKLQP*
Ga0099830_1042624413300009088Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGSSLDRRGFPEAKTKKRSLHLPSFNQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFALGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
Ga0099830_1053947823300009088Vadose Zone SoilAAIATGTAAWTYTTRALGGFGLGSLIPWDASASAILLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQRNGAPGSSPDTGTLPGVKVAGKSRFGLVSKSLSIALVEGVILILIYGGLVREYASNVNMQNWVQVNFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKIQEKKFQP*
Ga0099828_1019147223300009089Vadose Zone SoilMSTGVKRRRKPAQKGVQAVFTPSNNWKGRAQLTLGLAAIATGTAAWTYTTRALGGFGLGSLIPWDASASAILLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQRNGAPGSSPDTGTLPGVKVAGKSRFGLVSKSLSIALVEGVILILIYGGLVREYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKIQEKKFQP*
Ga0099828_1061996623300009089Vadose Zone SoilMYTTRALGGFGLGSLIPWNAAASSILLPISIPLLIGGVGLCTYFLAMRRTWRASSRIESALYELEALVGQKSSTPVSMLDGGVVPGAKKARKRSFHLPSFNIVSKALAIALVEAVVLIIIYGGLVREYGSNVNMQNWVQANFALGSYFLNYNAVLVLAGLLGVLIFQLLPRKFQAKKLQP
Ga0099828_1070461413300009089Vadose Zone SoilLSTSVKRKRRPAQKGVQAVFTPSTNWRGRVQLTIGLAAIGAGTVSWIYTTRTLGGFGLGDLIPWNASASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPQLGVGVIPEAKRGGKRSIHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNALLALAGLLGVLIFQLLPRKLQAKKVKAGPNN*
Ga0099828_1085729613300009089Vadose Zone SoilKPAQKGVQAVFTPSSNWRGRAQLALGLAAIGAGTAAWTYTTRALGGFGPGNLIPYDASVSAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVSQKNGASGSSLEPRTVADAKVARKPRFGLVPKALTVALVEAVLLILIYGGLVQEYASNVNMQNWVRANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQAEKLQP*
Ga0099828_1192778613300009089Vadose Zone SoilTGVKRRRKPAQKGVQAVFTPSSNWNGRAQLALGLVAIGAGTAAWTYTTRALGGFGLGNLIPYDASASAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGSSSEPRTVPDAKASRKPRFGLVPKALTVALVEAVLLILIYGGLVQEYASNVN
Ga0099828_1196076713300009089Vadose Zone SoilKGRVQLTLGLVATGAGTASWTYTTRALGGFGLGNLIPWNASASAILLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGYSLDGGSVHETKVRKRSFHLPSFDLVSKSVAIALAEAVLLIIIYGGLVREYSSNVNMQNWVQANFAPGTYLLNYN
Ga0099827_1001737913300009090Vadose Zone SoilLSTGVKRKRRPSQKGVQAVFTPPTNWKGRAQLTIGLAAIGAGTVSWIYTTRTLGGIGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGKRSIHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKIKAGPSI*
Ga0099827_1002382333300009090Vadose Zone SoilLANGVKRRRRTAQKGVQAVFTADTGWKGRAQVAFGLIAIGIGTAAWAYSTRALGGFGLGNLIPWDSSTSAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNGAPGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGLLALAGLLGVLMFQLLPRKLRSSKL*
Ga0099827_1025441713300009090Vadose Zone SoilLSTSVKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNASASALLLPISIPLLIGGVGLCTYFLAMRRIWRASNQIESALYELEALVGQKSGVAGPLLGVGVIPETKRGGKRSIHLPSFNLGSKALAIALVEAVLLIII
Ga0099827_1098132113300009090Vadose Zone SoilQLSLGLLAIGAGTWSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEAKKGGKRSFHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQVNFALGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
Ga0099827_1144632313300009090Vadose Zone SoilLGLAAIGAGATAWTYTTRALGGFGFGNLIPWNASASAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVVQKNGAPGSSPDPAFLADVRGARKLRFGLVSKALSIAVVEAVLLILIYGGLVREYASNVNMQNWVQTNFAPGSYFLNYNAVLVLAGLLCFLIFQLLPRKLQSRKLKG*
Ga0134088_1000329453300010304Grasslands SoilMGAGTASWIYTTRALGGFGLGNLIPWNALVSALLLPISIPLLIGGVGLCIYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSMSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN*
Ga0137392_1026406923300011269Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGSSLDRRGFPEAKTKKRSLHLPSFNQVSKALAIALAEAVLLI
Ga0137391_1067868713300011270Vadose Zone SoilLGLAAIATGTAAWTYTTRALGGFGLGSLIPWDASASAILLPISIPLFIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQRNGAPGSSPDTGTLPGVKVAGKSRFGLVSKSLSIALVEGVILILIYGGLVREYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKIQEKKFQP*
Ga0137393_1006188823300011271Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNESISAFLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGSSLDRRGFPEAKTKKRSLHLPSFNQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYFLNYNAVLVLAGLLGVLIFQLLPRKLQSRNLQD*
Ga0137388_1009097853300012189Vadose Zone SoilLSTSVKKRRRPTQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTWSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEAKKGGKRSFHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFALGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
Ga0137388_1044215423300012189Vadose Zone SoilMSTGVKRKRRPAQKGVQAVFTPSTNWKGRVQLTLGLVATGAGTASWTYTTRALGGFGLGNLIPWNASASAILLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGYSLDGGSVHETKVRKRSFHLPSFDLVSKSVAIALVEAVLLIIIYGGLVREYSSNVNMQNWVQANFAPGSYFLNYNALLVLAGLLGVLIFQLLPRKLQPRKPQLAPEERLGA*
Ga0137388_1146502013300012189Vadose Zone SoilGAGTVSWIYTTRTLGGFGLGDLIPWNASASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPQLGVGVIPEAKRGGKRSIHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKVKAGPNN*
Ga0137364_1032228923300012198Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEELVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0137383_1005594923300012199Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGAPGSSPGPGSLPDMKGARKSRFGLVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0137399_1002273123300012203Vadose Zone SoilVQAVFTPPTNWKGRAQLTIGLAAIGAGTVSWIYTNRTLGGFGLGNLIAWNESASTLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGKRSIHLPSFNLGSKALPIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAAGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKVKAGPSI*
Ga0137399_1004329163300012203Vadose Zone SoilVQAVFTPSTNWRGRAQLALGLVAITAGTVSWIYTTRALGGFGLGNLIPWNESVSGVLLPISIPLLIAGVGLCTYFLAMRRTWRASNRIESALCELEALVGQKNGVSGSALGVGVIPEAKKGGKRSFHLPSFNLGSKALAIALAEAVLLIIIYGGLVREYVANVNMQNWIRSNFAPGSYLLNYNAVLVIAGLLGVLIFQLLPRKLQSRKLEA*
Ga0137399_1004779123300012203Vadose Zone SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEARKGGKRSFHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFSPGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
Ga0137399_1007130723300012203Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTRSKNWRGQAQLALGLAAIGAGTVAWTYTTRALGGFGLGNLIPYDASASTILLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGSSSHEPRILPDAKAARKPRFGLVLKTITVALVEAVLLILIYGGLVQEYTSNVNMQNWVQANFVPGSYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKNLQP*
Ga0137399_1011211923300012203Vadose Zone SoilLSNGVKRRRKPAQKGLQAVFTPSNNWTGRAQIILGVIAIGAGIVAWTYTTRAFGGAGLGSLIPWDASASAVLLPLSIPLLIGGVGLCTYYLAMRRTWWASNRIESALLELEMLVGQKNPNPALGLDPGTVPVVKRPRRFRLQLPRTLAVALIEAVILVTIYSGLVREYMLNVNMQNWVQTNFIPGIYLLNYYVVLILAGLLGMLIFRLFPRKLQPEEIRKMS*
Ga0137399_1160607213300012203Vadose Zone SoilGVKGKRRPAQKGVQAVFTPSTNWRGRAQLTLGLIAMGAGTALWIYTTRALGGFGLGNPVPWSASASALLLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELETLVGQKNAGQGFALDGGAVPEAKTKKRSSHLPSLSVVSNSVAIALVEAVLLIIIYGGLVREYASNVNMQNW
Ga0137380_1008328723300012206Vadose Zone SoilLSTSVKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLTPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSAVLGSSLDGRGFPEAKTKKRSLHLPSLNQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKVKAGPSI*
Ga0137380_1032657823300012206Vadose Zone SoilKRKRRPSQKGVQAVFTPPTNWKGRAQLTIGLAAIGAGTVSWTYTTRTLGGIGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGKRSIHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWIQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKIKASPSI*
Ga0137380_1033455123300012206Vadose Zone SoilLANGVKRRRRTAQKGVQAVFTADTGWKGRVQVALGLIAIGVGTAAWAYSTRALGGFGLGNLILWDSSISAILLPLSIPLLIGGVSVCTYYLAMRRTWRARNRIESALYELEGLVGQKNAAPGSSAEEGVARETKAEAKPQFRLRSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLRSSKL*
Ga0137380_1127212713300012206Vadose Zone SoilNWKGRAQLTLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNETASAILLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPGPGSLSAMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVREYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL*
Ga0137380_1144403713300012206Vadose Zone SoilNWKGRAQLTLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDPGSLLDMKGARKSRFGLVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGLLGVLIFQLLPRKL
Ga0137381_1034505813300012207Vadose Zone SoilLSTSVKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLTPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSAVLGSSLDGRGFPEAKTKKRSLHLPSLNQVSKALAIALAEAVLLITIYGGL
Ga0137381_1045437223300012207Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGL
Ga0137381_1053108323300012207Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGAPGSSPGPGSLPDMKGARKSRFGLVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWV
Ga0137381_1062189013300012207Vadose Zone SoilVQAVFTPPTNWKGRAQLTIGLAAIGAGTVSWTYTTRTLGGIGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIQKAKRGGKRSIHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWIQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKVKAGPSI*
Ga0137379_10016311103300012209Vadose Zone SoilLSTSVKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLTPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSAVLGSSLDGRGFPEAKTKKRSPHLPSFSQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKVKAGPSI*
Ga0137379_1024329023300012209Vadose Zone SoilLSTSVRRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDPGSLLDMKGARKSRFGLVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGLLGVLIFQLLPRKLESKKLKAGPGL*
Ga0137379_1027454723300012209Vadose Zone SoilLANGVKRRRKTAQKGVQAVFTADTGWKGRAQVAFGLIAIGIGTAAWEYSTRALGGFGLGNLIPWDLSISAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNAALGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLQSSKL*
Ga0137379_1034084723300012209Vadose Zone SoilVQAVFTPPTNWKGRAQLTIGLAAIGAGTVSWTYTTRTLGGIGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGKRSIHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWIQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKIKASPSI*
Ga0137379_1160815313300012209Vadose Zone SoilGTAAWAYSTRALGGFGLGNLILWDSSISAILLPLSIPLLIGGVSVCTYYLAMRRTWRARNRIESALYELEGLVGQKNAAPGSSAEEGVAREKKAEAKPQFRLRSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLRSSKL*
Ga0137378_1013663833300012210Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGAPGSSPGPGSLPDMKGARKSRFGLVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGLLGVLIF
Ga0137378_1016375033300012210Vadose Zone SoilMGAGTASWVYTTRALGGFGLGNLIPWNETASAILLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPGPGSLSAMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVREYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGLLGVLIF
Ga0137378_1148946613300012210Vadose Zone SoilGRAQLTLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDPGSLLDMKGARKSRFGLVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGLLGVLIFQLLPRKLESKKLKAGP
Ga0137387_1004771743300012349Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNAVPGSSLDRRGFPEAKTKKRSLHLPSFNQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYFLNYNAVLVLAGLLGVLIFQLLPRKLQSRNLQD*
Ga0137387_1011993323300012349Vadose Zone SoilLSTSVKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLTPWNESASTLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSAVLGSSLDGRGVPEAKTKKRSPHLPSFSQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKVKAGPSI*
Ga0137387_1027188023300012349Vadose Zone SoilLANGVKRRRRAAQKGVQAVFTADTGWKGRVQVALGLIAIGVGTAAWAYSTRALGGFGLGNLILWDSSISAILLPLSIPLLIGPVSVCTYYLAMRRTWRARNRIESALYELEGLVGQKNAAPGSSAEEGVAREKKAEAKPQFRLRSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQ
Ga0137386_1004189223300012351Vadose Zone SoilVQAVFTPPTNWKGRAQLTIGLAAIGAGTVSWTYTTRTLGGIGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSAVLGSSLDGRGFPEAKTKKRSPHLPSFSQVSKALAIALAEAVLLITIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKLQAKKVKAGPSI*
Ga0137386_1017749223300012351Vadose Zone SoilLSTSVRRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDPGSLLDMKGARNSRFGLVSKTLAVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGLLGVLIFQLMPRKLESKKLKAGPGL*
Ga0137386_1048311823300012351Vadose Zone SoilLANGVKRRRKTAQKGVQAVFTADTGWKGRAQVAFGLIAIGIGTAAWEYSTRALGGFGLGNLIPWDSSISAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNAALGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNW
Ga0137384_1012100323300012357Vadose Zone SoilLANGVKRRRRAAQKGVQAVFTADTGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSISAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEGLVAQKNAAPGSSAEEGVARETKAEAKPQFRLRSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLQSSKL*
Ga0137385_1017620013300012359Vadose Zone SoilLSTSVKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLTPWNESASTLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSAVLGSSLDGRGFPEAKTKKRSPHLPSFSQVSKALAIALAEA
Ga0137385_1028486323300012359Vadose Zone SoilMGAGTASWVYTTRALGGFGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPGPGSLSAMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLVREYASNVNMQNWVQANFAPGSYLLNDNAVLVLAGLLGVLIFQLLPRKLE
Ga0137385_1031199923300012359Vadose Zone SoilLANGVKRRRRTAQKGVQAVFTADTGWKGRVQVALGLIAIGVGTAAWAYSTRALGGFGLGNLILWDSSISAILLPLSIPLLIGGVSVCTYYLAMRRTWRARNRIESALYELEGLVGQKNAAPGSSAEEGVAREKKAEAKPQFRLRSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLRSSKL*
Ga0137385_1055487923300012359Vadose Zone SoilVQAVFTPPTNWKGRAQLTIGLAAIGAGTVSWTYTTRTLGGIGLGNLIPWNETASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGKRSIHLPSFNLGSKALAIALIEAVLLIIIYGGLVREYASNVNMQNWIQANFAPGTYLLNYNAVLVLAGLLGDLIFQLLPRKLQAKKVKAGPSI*
Ga0137385_1153740113300012359Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGAPGSSPGPGSLPDMKGARKSRFGLVSKTLAVA
Ga0137396_1000579243300012918Vadose Zone SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEAKKGGKRSFHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFALGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG*
Ga0137396_1002192343300012918Vadose Zone SoilVKRRRRSAQKGVQAVFTPTNDWKGRAQIILGLAAIVAGAASWTYTTRALNGVGFDGLIPWNKSAPAILLPISIPLLIGGVGLCTYYLAMRRTWRARNRIESALLELEALVGQTNPNLGTSVDARPAREMKIARKLGFDLGSRTLAIALVEAVILVIIYSGLVQEYVSNINMQNWVQANFAPGSYLLNYYMVLILAGLLGMLIFRLLPRRLQAERVQN*
Ga0137396_1009483423300012918Vadose Zone SoilVQAVFTPSTNWRGRAQLALGLVAIIAGTVSWIYTTRALGGFGLGNLIPWNESVSGVLLPISIPLLIAGVGLCTYFLAMRRTWRASNRIESALCELEALVGQKNGVSGSALGVGVIPEAKKGGKRSFHLPSFNLGSKALAIALAEAVLLIIIYGGLVREYVANVNMQNWIRSNFAPGSYLLNYNAVLVIAGLLGVLIFQLLPRKLQSRKLEA*
Ga0137396_1028125913300012918Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSSNWNGRAQLILGLAAIGAGTAAWTYTTQTLGGFGPGSLIPYDASASAVFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVSQKNGASGSSLEPRTVADAKVARKPRFGLVPKALTVALVEGVLLVLIYGGLVQEYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQAEKLQP*
Ga0137396_1054437723300012918Vadose Zone SoilLSTGVKRKRRPAQKGVQAVFTPSTNWRGRAQLTLGLIAMGAGTALWIYTTRALGGFGLGNPVPWSASASALLLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELETLVGQKNAGQGFALDGGAVPEAKTKKRSSHLPSLSVVSNSVAIALVEAVLLITIYGGLVREYASNVNMQNWVQVNFALGSYFLN
Ga0137419_1006777233300012925Vadose Zone SoilLSNGVKRRRKPAQKGLQAVFTPSNNWTGRAQIILGVIAIGAGIVAWTYTTRAFGGAGLGSLIPWDASASAVLLPLSIPLLIGGVGLCTYYLAMRRTWWASNRIESALLELEMLVGQKNPNPALGLDPGTVPVVKRPRRFRLQLPRTLAVALIEAVILVTIYSGLVREYMLNVNMQNWVQTNFIPGIYLLNYYVVLILAGLLGMLIFRLLPRKLQPEEIRKMS*
Ga0137416_10002780113300012927Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSSNWRGRAQLALGLAAIGAGTAAWTYTTRALGGFGPGNLIPYDASVSAIFLPISIPLLIGGVGLCTYYLAMRRTWWASNRIESALYELEALVSQKNGASGSSLEPRTVADAKVARKPRFGLVPKALTVALVEGVLLVLIYGGLVQEYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQAEKLQP*
Ga0137416_1007234023300012927Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSYNWKGRAQLALGLAAIAAGAAAWTYTTRAIGGFGLGSLIPWNSSASAILLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGLSPEPRTVPDAQSAGKTRLGLVPKALPVALVETVLLILIYGGLVQEYASNVNMQKWILDNFVPGSHFLNYNAVLVLAGLLGVMIFQLLPRKFQAKNLQP*
Ga0137416_1016516423300012927Vadose Zone SoilLSTSIKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLIQWNASASVLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGNRSIHLPSFNLGSKALGVALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKFQAKKVKAGPSI*
Ga0137416_1183442713300012927Vadose Zone SoilAQLALGLAAIAAGTAAWTYTTQALGGFGLGNLIPYDASASAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPEARNVSDPKAARKPRLGLVPKALTVALVEAGLLILIYGGLVQEYASNVNMQKWVLDNFVPGSDFLNYNAVLVLAGLLGVMIFQLLPRK
Ga0137410_1095878923300012944Vadose Zone SoilLSTSIKRKRRPAQKGVQAVFTPSTNWKGRAQLSLGLIAMGAGTASWVYTTRALGGFGLGNLIQWNASASVLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGNRSIHLPSFNLGSKALGVALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAP
Ga0134077_1001088523300012972Grasslands SoilLSTGVKRKRRPAQKGVQAVFTPSTNWRGRAQLTLGLIAMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRTLQPKKLQN*
Ga0134076_1001380523300012976Grasslands SoilMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCIYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKV*
Ga0134089_1001000723300015358Grasslands SoilMGAGTASWIYTTRALGGFGLGNLIPWNALVSALLLPISIPLLIGGVGLCIYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSMSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKV*
Ga0134083_1000089023300017659Grasslands SoilMGAGTASWIYTTRALGGFGLGNLIPWNALVSALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN
Ga0066662_1003556023300018468Grasslands SoilLANGVKTRRRTAQKGVQAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSPKASPLRGA
Ga0066662_1007136433300018468Grasslands SoilLANSVNGVKRKRRPAQKGVQAVFTPSTNWKGRAQASLGLAAIGAGTAAWTYTTRGLGGFGLGNLIPYDASASAMLLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQKNGAPGSPPDPGTVAGVKVTGKSRFGLVSKALSIALVEAVILILIYGGLVREYVSNVNMQNWVQTNFALGGYLLSYNAVLVLTGLLGVLIFQLLPRKIRSKSPRNSSS
Ga0215015_1080698523300021046SoilLSTGVKRKRRPAQKGVQAVFTPSSNWKGRAQLTLGLIAMGAGTASWVYTTRALGGFGLGNLIPWNESASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALAGQKNGVAGSALGVGVIPEAKKGGKWSFHLPSFNLVSKALAMALVEAVLLIIIYGGLVQEYTSNVNMQNWIHANFSPGSYFLNHNAVLVLAGLLGVLIFQLLPRKLQSKKLKASPSL
Ga0215015_1107914273300021046SoilLANGVNGVKRRRRPAQKGVQAVFTEDASWKGRAQVALGLITIGVGTVTWAYSTKGLGGFGLGNLIPWDSSVSAILLPISIPLLIGGVGLCTYYLVMRRTWRARNRIESALYELEALFSQKNGAPGPSLDTGALPDVKAARKSRFGFLSKALSIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGSYLLNYNAVLVLTGLLGVLIFQLLPRKLQSKKLQP
Ga0210404_1021606313300021088SoilLSTGVKRRRRPAQKGVQAVFTASTSWKGRAELALGLVGLGAGTASWIYTTRVLGGFGLGNLVPWNESASNILLPISIPLLIGGVALCTYVLVMRKTWRASNRIESALYELEALVSQRNGAPGASSTERPVLDAKISKGSFHLPSFNLVSKALAIALVEAVLLIVMYGGIVQE
Ga0137417_114435323300024330Vadose Zone SoilVQAVFTPSYNWKGRAQLALGLAAIAAGAAAWTYTTRAIGGFGLGSLIPWNSSASAILLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGLSPEPRTVPDAQSAGKTRLGLVPKALPVALVETVLLILIYGGLVQEYASNVNMQKWILDNFVPGSHFLNYNAVLVLAGLLGVMIFQLLPRKFQAKNLQP
Ga0137417_131959833300024330Vadose Zone SoilLSTGVKRRRKSAQKGVQAVFTPSNNWKGRAQASLGLIAIGGGTAAWTYTTRELGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLPGVKAGKSRFGLVSKALSMALVEAVILILIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSSSKTSSPKGA
Ga0137417_137129713300024330Vadose Zone SoilLSTGVKRRRKSAQKGVQAVFTPSNNWKGRAQASLGLIAIGGGTAAWTYTTRALGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLPGVKAGKSRFGLVSKALSMALVEAVILILIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSSSKTSSPKGA
Ga0207646_1016139633300025922Corn, Switchgrass And Miscanthus RhizosphereLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEAKKGGKRSFRLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFALGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG
Ga0209234_115822213300026295Grasslands SoilRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEARKGGKRSFHLPSFNLGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFSPGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG
Ga0209235_100043753300026296Grasslands SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGTPLGGGVIPEAKKGGKRSFHLPSFNVGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG
Ga0209237_100017023300026297Grasslands SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGGFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGVAGTPLGGGVIPEAKKGGKRSFHLPSFNIGSKALAIALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGSYLLNYNAVLVLAGLLGILIFQLVPRKLHSRKLQG
Ga0209236_105176423300026298Grasslands SoilLANGVKTRRRTAQKGVHAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSRRLPLSEEHDSRRIVS
Ga0209236_109444513300026298Grasslands SoilLSTGVKRRRKPAQKGVQAVFTPSNNWKGKAQAILGLAAIGAGTAAWTYTTRGLGGFGLGNLIPYDASASAILLPISIPLLISGAGLCTYYLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDPRALPDVKGTRKSRFGLVSKALSIALVEAVTLILIYGGLVREYASNVNMQNWVQANFAPGSYFLNYNGVMALAGLLGVLIFQLLPRKSRSSKLQA
Ga0209761_1002054103300026313Grasslands SoilLANGVKTRRRTAQKGVHAVFTADAGWKGRVQVALGLIAIGIGTAAWAYSTRALGGFGLGNLILWDSSVSAVLLPLSIPMLIGGVSLCTYYFAMRRTWRARNRIESALYELEALVGQKNAAPGSSPDLGTLPGVRAVGKSRYGPVSKALSIALVEAVILIMIYSGLVREYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVVIFQLLPRKIRSRSPRDSSSPKASPLRGA
Ga0209761_100403883300026313Grasslands SoilLSTGVKRRRKSAQKGVQAVFTPSNNWKGRAQASLGLIAIGGGTAAWTYTTRALGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLLGVKAGKSRFGLVSKALSMALVEAVILSLIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSSSKTSSPKGA
Ga0209761_100920923300026313Grasslands SoilMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALCELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN
Ga0209154_113757323300026317SoilLSTGVKRRRKPAQKGPQKGVQAVFTPSNNWQGLAQLTLGLAAIATGTAAWTYTTRVLGGFGLGNPILYDTSASAILLPISIPLLIGGVGLCTYYLAMRRTRRASSRIESALYELGALVGQKNGAPGSSPDQGTLPGVKATGKSRFGLASKALSIALVEAVILIMIYGGLVRGYASNVNMQNWIQANFALGSYLLSYNAVLVLAGLLGVLIFQLLPRKIRSRSPRNSSSPKISPLRG
Ga0209152_10001548133300026325SoilLSGVKGRRKAAKKDVQAVFTPGSNWKERSQAIIGLAAIGAGAASWTYTTRVFGGSGLGNLVAWNAAASAILLPVSIPLLIGGVGLCTYYFAMRRTWRASNRIESALLELEALVSQKNATSSPWLDQGTVPDAKTARRLRFDFLPRTLAIALVEAVILVIIYSGLVQEYVSNVNMRNWVQANFAPGIYLLNYYVVFILAGLLGMLIFRLLPRKPQPKEVQKTSSTIESKRSYG
Ga0209802_112395713300026328SoilLANGVKRRRKATQKGVQAVFTADTSWKGRVQVALGLIAIGIGTAAWAYSTRALSGFGLGSLIPSDSSTSAILLPLSIPLLIGGVSVCTYYLVMRRTWRARNRIESALYELEALVGQKNASSGSSAEAGVARETKTATKPQFLLLSKALAVALIEGVLLVAIYGGLVQEYASNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLRSSKL
Ga0209802_121911013300026328SoilTPSNNWKGRAQASLGLIAIGGGTAAWTYTTRALGGFGLGSLVPYDTSASAILLPISIPLLMGGVGLCTYYLAVRRTWRASSRIESALYELESLVGQKNGAPGSLPDPGTLPGVKAGKSRFGLVSKALSMALVEAVILILIYGGLVREYASNVNMQNWVRANFALGSYLLNYNAVLVLAGLLGVLIFQLLPRKIRSKSPRNSSSSKTSSPKGA
Ga0209158_115094223300026333SoilLANSVNGVKRKRRPAQKGVQAVFTPSTNWKGRAQASLGLAAIGAGTAAWTYTTRGLGGFGLGNLIPYDASASAMLLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQKNGAPGSPPDPGTVAGVKVTGKSRFGLVSKALSIALVEAVILILIYGGLVREYVSNVNMQNWVQTNFALGGYLLSYNAVLVLTGLLGVLILQLLPRKIRSKSPRNS
Ga0209690_114339113300026524SoilSTRALGGFGLGNLIPWDSSISAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNAALGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGVLALAGLLGVLMFQLLPRKLQSSKL
Ga0209806_1000202253300026529SoilLSTGVKRKRRPVQKGVQAVFTPSTNWKGRAQLTLGLITISAGTASWIYTNRALGGFGLGNLIPWNASASALLLPISIPLLIGGVGVCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFRLPSLSVVSKSVAIALVEAVLLIVIYGGLVREYASNVNMQNWVQANFALGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKLQN
Ga0209058_101837743300026536SoilMGAGTASWIYTTRALGGFGLGNLIPWNALASALLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNVAPGSSLDGRGFPEVKTRKRSFHLPSLSVVSKSVAIALVEAVLLIIIYGGLVREYASNVNMQNWVQTNFAPGIYFLNYNAVLVLAGLLGVLIFQLLPRKLQPKKV
Ga0209056_1030128923300026538SoilQRNIGGSRRSCSRDPLPTRYLPSLGPPRTDPILSTGVKRKRRPAQKGVQAVFTPSTNWKGRAQLTLGLIAIGAGTASWTYTTRALGGFGLGNLIPWNEKASAIILPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKNGAPGSSPDQGSLPDMKGARKSRFGFVSKTLAVALVEAVLLILIYGGLQEYASNVNMQNWVHANFAPGSYLLNYDAVLVLAGLLGVLIFQLLPRKFESKKLKAGPGL
Ga0209388_100035823300027655Vadose Zone SoilMGAGTASWVYTTRALGGFGLGNLIQWNASASVLLLPISIPLLIGGVGLCTYFLAMRRTWRASNRIESALYELEALVGQKSGVAGPLLGVGVIPEAKRGGNRSIHLPSFNLGSKALGVALVEAVLLIIIYGGLVREYASNVNMQNWVQANFAPGTYLLNYNAVLVLAGLLGVLIFQLLPRKFQAKKVKAGPSI
Ga0209388_113962413300027655Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSSNWRGRAQLALGLAAIGAGTAAWTYTTRALGGFGPGNLIPYDASVSAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVSQKNGASGSSLEPRTVADAKVARKPRFGLVPKALTIALVEAVLLILIYGGLVQEYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVV
Ga0209588_113443223300027671Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSGNWNGRAQLILGLAAIGAGTAAWTYTTRALGGFGPGNLIPYDASVSAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVSQKNGASGSSLEPRTVPDAKAARKPRFGLVPKALTIALVEAVLLILIYGGLVQEYASN
Ga0209283_1026746023300027875Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSTNWNGRAQLILGLAAIAVGTAAWTYTTRALGGFGPGNLIPYDASVSAIFLPISIPLLIGGVGLCTYYLAMRRTWRASNRIESALYELEALVSQKNGASGSSLEPRTVADAKAARKPRFGLVPKALTVALVEAVLLILIYGGLVQEYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGL
Ga0209283_1093651813300027875Vadose Zone SoilAAIATGTAAWTYTTRALGGFGLGSLIPWDASASAILLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQRNGAPGSSPDTGTLPGVKVAGKSRFGLVSKSLSIALVEGVILILIYGGLVREYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQL
Ga0209590_1003251533300027882Vadose Zone SoilMSTGVKRRRKPAQKGVQAVFTPSNNWKGRAQLTLGLAAIATGTAAWTYTTRALGGFGLGSLIPWDASASAILLPISIPLLIGGVGLCTYYLAMRRTWRASSRIESALYELEALVGQRNGAPGSSPDTGTLPGVKVAGKSRFGLVSKSLSIALVEGVILILIYGGLVREYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKIQEKKFQP
Ga0209590_1080735313300027882Vadose Zone SoilGVKRRRRTAQKGVQAVFTADTGWKGRAQVAFGLIAIGIGTAAWAYSTRALGGFGLGNLIPWDSSTSAILLPLSIPLLIGGVSLCTYYLAMRRTWRARNRIESALYELEALVGQKNAALGSSAEAGVARETKAEAKPQFRLLSKALAVALIEGVLLIAIYGGLVREYVSNVNMQNWVQANFAPGSYFLNYNGLLALAGL
Ga0137415_10009687113300028536Vadose Zone SoilLSTGVKRRRKPAQKGVQAVFTPSSNWRGRAQLALGLAAIGAGTAAWTYTTRALGGFGPGNLIPYDASVSAIFLPISIPLLIGGVGLCTYYLAMRRTWWASNRIESALYELEALVSQKNGASGSSLEPRTVADAKVARKPRFGLVPKALTVALVEGVLLVLIYGGLVQEYASNVNMQNWVQANFVPGSYFLNYNAVLVLAGLLGVLIFQLLPRKLQAEKLQP
Ga0137415_1043806613300028536Vadose Zone SoilLSTSVKKRRRPAQKGVQAVFTPSTNWKGRAQLSLGLLAIGAGTGSWIYTTRALGEFGLGNLIPWNESAAAIFLPVSIPLLIGGVGLWTYFLAMRKTWRASNRIESALYELEALVGQKNGVAGSPLGGGVIPEAKRGGKRSFHLPSFNLGSKALAIALVE
Ga0307471_10433092113300032180Hardwood Forest SoilGAGTASWIYTTRELGGFGLGNLIPWNESVSAILLPISIPLLIGGVGLCTYFLAMRRTWRASSRIESALYELEALVGQRNGVSGTSSEARVVTEANSRKRSFHLPSFNIVSKALAIALVEAVLLIVIYGGLVQEYASNVNMQNWVRSNFAPGSYLLSYNAVLVLAGL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.