NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F073824

Metagenome / Metatranscriptome Family F073824

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F073824
Family Type Metagenome / Metatranscriptome
Number of Sequences 120
Average Sequence Length 284 residues
Representative Sequence MSWHRRAFLFSFLIANIFAPRADSQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKILLEARNLAPNIAELAGGNPLRLSSTGGAENFARFDLVGRKSGSFLISIRLMPSMGHPTE
Number of Associated Samples 86
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 13.45 %
% of genes near scaffold ends (potentially truncated) 33.33 %
% of genes from short scaffolds (< 2000 bps) 44.17 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score0.78

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.167 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(40.833 % of family members)
Environment Ontology (ENVO) Unclassified
(40.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.667 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 8.23%    β-sheet: 42.72%    Coil/Unstructured: 49.05%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.78
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
b.1.18.0: automated matchesd4jcma34jcm0.8317
b.1.18.2: E-set domains of sugar-utilizing enzymesd3bmva13bmv0.81992
b.1.18.1: NF-kappa-B/REL/DORSAL transcription factors, C-terminal domaind2cxka12cxk0.73039
b.1.18.14: Quinohemoprotein amine dehydrogenase A chain, domains 4 and 5d1pbya31pby0.70895
b.1.18.18: Other IPT/TIG domainsd1uadc_1uad0.67122


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 120 Family Scaffolds
PF02646RmuC 30.00
PF13672PP2C_2 11.67
PF13487HD_5 8.33
PF00498FHA 5.83
PF07238PilZ 2.50
PF08388GIIM 1.67
PF08308PEGA 1.67
PF00481PP2C 0.83
PF04041Glyco_hydro_130 0.83
PF13354Beta-lactamase2 0.83
PF01144CoA_trans 0.83
PF13620CarboxypepD_reg 0.83
PF00873ACR_tran 0.83
PF05226CHASE2 0.83
PF14279HNH_5 0.83
PF01230HIT 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 120 Family Scaffolds
COG1322DNA anti-recombination protein (rearrangement mutator) RmuCReplication, recombination and repair [L] 30.00
COG0631Serine/threonine protein phosphatase PrpCSignal transduction mechanisms [T] 0.83
COG1788Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunitLipid transport and metabolism [I] 0.83
COG2057Acyl-CoA:acetate/3-ketoacid CoA transferase, beta subunitLipid transport and metabolism [I] 0.83
COG2152Predicted glycosyl hydrolase, GH43/DUF377 familyCarbohydrate transport and metabolism [G] 0.83
COG4252Extracytoplasmic sensor domain CHASE2 (specificity unknown)Signal transduction mechanisms [T] 0.83
COG4670Acyl CoA:acetate/3-ketoacid CoA transferaseLipid transport and metabolism [I] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.17 %
UnclassifiedrootN/A0.83 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001593|JGI12635J15846_10422495All Organisms → cellular organisms → Bacteria → Acidobacteria800Open in IMG/M
3300002908|JGI25382J43887_10002830All Organisms → cellular organisms → Bacteria7711Open in IMG/M
3300002914|JGI25617J43924_10016441All Organisms → cellular organisms → Bacteria2510Open in IMG/M
3300002914|JGI25617J43924_10064214All Organisms → cellular organisms → Bacteria → Acidobacteria1341Open in IMG/M
3300005172|Ga0066683_10046688All Organisms → cellular organisms → Bacteria2546Open in IMG/M
3300005174|Ga0066680_10045528All Organisms → cellular organisms → Bacteria2556Open in IMG/M
3300005332|Ga0066388_100277729All Organisms → cellular organisms → Bacteria → Acidobacteria2318Open in IMG/M
3300005446|Ga0066686_10020541All Organisms → cellular organisms → Bacteria3693Open in IMG/M
3300005558|Ga0066698_10348068All Organisms → cellular organisms → Bacteria → Acidobacteria1024Open in IMG/M
3300005566|Ga0066693_10023944All Organisms → cellular organisms → Bacteria → Acidobacteria1877Open in IMG/M
3300006034|Ga0066656_10530623All Organisms → cellular organisms → Bacteria → Acidobacteria766Open in IMG/M
3300006176|Ga0070765_100027391All Organisms → cellular organisms → Bacteria4375Open in IMG/M
3300006903|Ga0075426_10013968All Organisms → cellular organisms → Bacteria5732Open in IMG/M
3300007265|Ga0099794_10013759All Organisms → cellular organisms → Bacteria3528Open in IMG/M
3300007788|Ga0099795_10026878All Organisms → cellular organisms → Bacteria → Acidobacteria1943Open in IMG/M
3300009012|Ga0066710_100010069All Organisms → cellular organisms → Bacteria → Acidobacteria9440Open in IMG/M
3300009012|Ga0066710_100054880All Organisms → cellular organisms → Bacteria4973Open in IMG/M
3300009038|Ga0099829_10035273All Organisms → cellular organisms → Bacteria3603Open in IMG/M
3300009038|Ga0099829_10100351All Organisms → cellular organisms → Bacteria2251Open in IMG/M
3300009038|Ga0099829_10146422All Organisms → cellular organisms → Bacteria → Acidobacteria1882Open in IMG/M
3300009088|Ga0099830_10017110All Organisms → cellular organisms → Bacteria4628Open in IMG/M
3300009088|Ga0099830_10053381All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria2860Open in IMG/M
3300009089|Ga0099828_10631521All Organisms → cellular organisms → Bacteria → Acidobacteria963Open in IMG/M
3300009090|Ga0099827_10294918All Organisms → cellular organisms → Bacteria → Acidobacteria1371Open in IMG/M
3300010048|Ga0126373_10060463All Organisms → cellular organisms → Bacteria → Acidobacteria3394Open in IMG/M
3300010304|Ga0134088_10127944All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1204Open in IMG/M
3300010358|Ga0126370_10286333All Organisms → cellular organisms → Bacteria → Acidobacteria1299Open in IMG/M
3300010359|Ga0126376_10239664All Organisms → cellular organisms → Bacteria → Acidobacteria1534Open in IMG/M
3300010361|Ga0126378_10000318All Organisms → cellular organisms → Bacteria36378Open in IMG/M
3300010361|Ga0126378_10034770All Organisms → cellular organisms → Bacteria → Acidobacteria4555Open in IMG/M
3300010376|Ga0126381_100132307All Organisms → cellular organisms → Bacteria → Acidobacteria3248Open in IMG/M
3300010379|Ga0136449_100085721All Organisms → cellular organisms → Bacteria → Acidobacteria6601Open in IMG/M
3300011120|Ga0150983_11181777All Organisms → cellular organisms → Bacteria → Acidobacteria777Open in IMG/M
3300011269|Ga0137392_10007565All Organisms → cellular organisms → Bacteria7015Open in IMG/M
3300011269|Ga0137392_10266460All Organisms → cellular organisms → Bacteria → Acidobacteria1412Open in IMG/M
3300011271|Ga0137393_10025866All Organisms → cellular organisms → Bacteria → Proteobacteria4369Open in IMG/M
3300012096|Ga0137389_10121015All Organisms → cellular organisms → Bacteria → Acidobacteria2109Open in IMG/M
3300012199|Ga0137383_10074098All Organisms → cellular organisms → Bacteria → Acidobacteria2456Open in IMG/M
3300012202|Ga0137363_10000761All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia17957Open in IMG/M
3300012202|Ga0137363_10451818All Organisms → cellular organisms → Bacteria → Acidobacteria1075Open in IMG/M
3300012202|Ga0137363_10469802All Organisms → cellular organisms → Bacteria → Acidobacteria1054Open in IMG/M
3300012205|Ga0137362_10117441All Organisms → cellular organisms → Bacteria → Acidobacteria2253Open in IMG/M
3300012205|Ga0137362_10265860All Organisms → cellular organisms → Bacteria → Acidobacteria1483Open in IMG/M
3300012205|Ga0137362_10269835All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1471Open in IMG/M
3300012205|Ga0137362_10342676All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1294Open in IMG/M
3300012207|Ga0137381_10004532All Organisms → cellular organisms → Bacteria → Acidobacteria10064Open in IMG/M
3300012208|Ga0137376_10025646All Organisms → cellular organisms → Bacteria → Acidobacteria4603Open in IMG/M
3300012211|Ga0137377_10171507All Organisms → cellular organisms → Bacteria → Acidobacteria2081Open in IMG/M
3300012357|Ga0137384_10608679All Organisms → cellular organisms → Bacteria → Acidobacteria891Open in IMG/M
3300012361|Ga0137360_10722222All Organisms → cellular organisms → Bacteria → Acidobacteria856Open in IMG/M
3300012362|Ga0137361_10329174All Organisms → cellular organisms → Bacteria → Acidobacteria1400Open in IMG/M
3300012363|Ga0137390_10819458All Organisms → cellular organisms → Bacteria → Acidobacteria888Open in IMG/M
3300012582|Ga0137358_10169001All Organisms → cellular organisms → Bacteria → Acidobacteria1491Open in IMG/M
3300012683|Ga0137398_10000180All Organisms → cellular organisms → Bacteria19786Open in IMG/M
3300012683|Ga0137398_10000899All Organisms → cellular organisms → Bacteria → Acidobacteria11427Open in IMG/M
3300012923|Ga0137359_10035089All Organisms → cellular organisms → Bacteria → Proteobacteria4321Open in IMG/M
3300012924|Ga0137413_10016744All Organisms → cellular organisms → Bacteria → Proteobacteria3759Open in IMG/M
3300012925|Ga0137419_10417703All Organisms → cellular organisms → Bacteria → Acidobacteria1051Open in IMG/M
3300012931|Ga0153915_10004853All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria12452Open in IMG/M
3300014154|Ga0134075_10075466All Organisms → cellular organisms → Bacteria → Acidobacteria1411Open in IMG/M
3300015051|Ga0137414_1045703All Organisms → cellular organisms → Bacteria → Acidobacteria1817Open in IMG/M
3300015193|Ga0167668_1012744All Organisms → cellular organisms → Bacteria → Acidobacteria1895Open in IMG/M
3300020170|Ga0179594_10059240All Organisms → cellular organisms → Bacteria → Acidobacteria1311Open in IMG/M
3300020199|Ga0179592_10116460All Organisms → cellular organisms → Bacteria → Acidobacteria1226Open in IMG/M
3300020579|Ga0210407_10000395All Organisms → cellular organisms → Bacteria50305Open in IMG/M
3300020580|Ga0210403_10291796All Organisms → cellular organisms → Bacteria → Acidobacteria1336Open in IMG/M
3300020580|Ga0210403_10698511All Organisms → cellular organisms → Bacteria → Acidobacteria813Open in IMG/M
3300020581|Ga0210399_10104889All Organisms → cellular organisms → Bacteria → Acidobacteria2315Open in IMG/M
3300020583|Ga0210401_10006448All Organisms → cellular organisms → Bacteria → Acidobacteria11895Open in IMG/M
3300020583|Ga0210401_10143266All Organisms → cellular organisms → Bacteria → Acidobacteria2235Open in IMG/M
3300021088|Ga0210404_10001408All Organisms → cellular organisms → Bacteria10354Open in IMG/M
3300021168|Ga0210406_10244297All Organisms → cellular organisms → Bacteria → Acidobacteria1475Open in IMG/M
3300021170|Ga0210400_10004449All Organisms → cellular organisms → Bacteria12135Open in IMG/M
3300021170|Ga0210400_10075776All Organisms → cellular organisms → Bacteria2639Open in IMG/M
3300021171|Ga0210405_10000203All Organisms → cellular organisms → Bacteria84512Open in IMG/M
3300021171|Ga0210405_10204014All Organisms → cellular organisms → Bacteria → Acidobacteria1567Open in IMG/M
3300021180|Ga0210396_10026674All Organisms → cellular organisms → Bacteria → Acidobacteria5348Open in IMG/M
3300021403|Ga0210397_10245897All Organisms → cellular organisms → Bacteria → Acidobacteria1296Open in IMG/M
3300021420|Ga0210394_10032217All Organisms → cellular organisms → Bacteria → Acidobacteria4700Open in IMG/M
3300021432|Ga0210384_10001495All Organisms → cellular organisms → Bacteria → Acidobacteria31627Open in IMG/M
3300021432|Ga0210384_10031566All Organisms → cellular organisms → Bacteria → Acidobacteria4946Open in IMG/M
3300021432|Ga0210384_10161246All Organisms → cellular organisms → Bacteria → Acidobacteria2013Open in IMG/M
3300021432|Ga0210384_10537618All Organisms → cellular organisms → Bacteria → Acidobacteria1051Open in IMG/M
3300021476|Ga0187846_10075587All Organisms → cellular organisms → Bacteria → Acidobacteria1466Open in IMG/M
3300021478|Ga0210402_10115805All Organisms → cellular organisms → Bacteria → Acidobacteria2419Open in IMG/M
3300021479|Ga0210410_10001901All Organisms → cellular organisms → Bacteria19125Open in IMG/M
3300021479|Ga0210410_10138683All Organisms → cellular organisms → Bacteria2164Open in IMG/M
3300026538|Ga0209056_10018146All Organisms → cellular organisms → Bacteria6924Open in IMG/M
3300026551|Ga0209648_10003216All Organisms → cellular organisms → Bacteria13702Open in IMG/M
3300026551|Ga0209648_10019471All Organisms → cellular organisms → Bacteria5931Open in IMG/M
3300026551|Ga0209648_10068082All Organisms → cellular organisms → Bacteria → Acidobacteria3003Open in IMG/M
3300026551|Ga0209648_10114006All Organisms → cellular organisms → Bacteria2193Open in IMG/M
3300026557|Ga0179587_10138671All Organisms → cellular organisms → Bacteria → Acidobacteria1507Open in IMG/M
3300027587|Ga0209220_1052429All Organisms → cellular organisms → Bacteria → Acidobacteria1088Open in IMG/M
3300027645|Ga0209117_1005553All Organisms → cellular organisms → Bacteria4328Open in IMG/M
3300027846|Ga0209180_10000077All Organisms → cellular organisms → Bacteria44098Open in IMG/M
3300027846|Ga0209180_10102203All Organisms → cellular organisms → Bacteria → Acidobacteria1634Open in IMG/M
3300027862|Ga0209701_10001631All Organisms → cellular organisms → Bacteria14492Open in IMG/M
3300027862|Ga0209701_10005770All Organisms → cellular organisms → Bacteria8117Open in IMG/M
3300027862|Ga0209701_10164800All Organisms → cellular organisms → Bacteria → Acidobacteria1344Open in IMG/M
3300027875|Ga0209283_10333154All Organisms → cellular organisms → Bacteria → Acidobacteria998Open in IMG/M
3300027882|Ga0209590_10130212All Organisms → cellular organisms → Bacteria → Acidobacteria1539Open in IMG/M
3300027882|Ga0209590_10218722All Organisms → cellular organisms → Bacteria → Acidobacteria1209Open in IMG/M
3300027882|Ga0209590_10310813All Organisms → cellular organisms → Bacteria → Acidobacteria1013Open in IMG/M
3300027903|Ga0209488_10029574All Organisms → cellular organisms → Bacteria3990Open in IMG/M
3300028536|Ga0137415_10234998All Organisms → cellular organisms → Bacteria → Acidobacteria1644Open in IMG/M
3300029636|Ga0222749_10215262All Organisms → cellular organisms → Bacteria → Acidobacteria965Open in IMG/M
3300031753|Ga0307477_10000547All Organisms → cellular organisms → Bacteria40803Open in IMG/M
3300031754|Ga0307475_10146592All Organisms → cellular organisms → Bacteria → Acidobacteria1872Open in IMG/M
3300031820|Ga0307473_10449135All Organisms → cellular organisms → Bacteria → Acidobacteria858Open in IMG/M
3300031823|Ga0307478_10057535All Organisms → cellular organisms → Bacteria2902Open in IMG/M
3300031962|Ga0307479_10041352All Organisms → cellular organisms → Bacteria → Acidobacteria4425Open in IMG/M
3300031962|Ga0307479_10080260All Organisms → cellular organisms → Bacteria → Acidobacteria3157Open in IMG/M
3300031962|Ga0307479_10108699All Organisms → cellular organisms → Bacteria2697Open in IMG/M
3300031962|Ga0307479_10333406All Organisms → cellular organisms → Bacteria → Acidobacteria1500Open in IMG/M
3300031962|Ga0307479_10617763All Organisms → cellular organisms → Bacteria → Acidobacteria1065Open in IMG/M
3300031962|Ga0307479_10619782All Organisms → cellular organisms → Bacteria → Acidobacteria1063Open in IMG/M
3300032160|Ga0311301_10179748All Organisms → cellular organisms → Bacteria → Acidobacteria3703Open in IMG/M
3300032205|Ga0307472_100731321All Organisms → cellular organisms → Bacteria → Acidobacteria894Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil40.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil19.17%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil9.17%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil7.50%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil6.67%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil5.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.33%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil1.67%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil1.67%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.83%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil0.83%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.83%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil0.83%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.83%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015051Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015193Arctic soil microbial communities from a glacier forefield, Rabots glacier, Tarfala, Sweden (Sample Rb6, proglacial stream)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1042249513300001593Forest SoilRIAFICMRFSTTIPRMPLHRRGISYFLTCLIASLFAPWANAQQSPPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDATGRGLFVAPLNPGVIFASIAGREGRVATAILSPGEIASSSIEISSAPRVASLSDRFELFGRGFCGNADANQVTIAGQPAIVLASSPSSLVVLPPLELGPGPAVVEVSCAKRQAASFSITLAKLELEADSSPLAPGEHRRLTVHISGTTAKIPLEARNLAPDIAELAGGNPV
JGI25382J43887_1000283033300002908Grasslands SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASRARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLELGRATAEVACAKREAPPFSLTLVGLELETDSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
JGI25617J43924_1001644123300002914Grasslands SoilLPDGVRITFIRMSFSTTIPRMCLPRXGAALLLVCLIPGVFAAVGNAQQSAPAPSGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSLAGRAGRVATAVLSPSEAASTSIEVSSAPRVASLTDRFEIFGKSFCGDADANQVTIAGQRGIVLASSPTTLVVLPPPELQSGSAAVEVSCAKRQLSPAFTVTFVGLELEADSSALKPDEHRALTVRVHGTASKLALEARNLAPDIAELAGGNPVRSSSSGGAENLARFEVVGRKTGSFLISIRLVPSLGRPQP*
JGI25617J43924_1006421423300002914Grasslands SoilMSWHRRAFLFSFLIASIFAPWADGQESAPAASRARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNSGVIFASIAGRPGRVAAAVLVSSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGAENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0066683_1004668823300005172SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAVPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0066680_1004552833300005174SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAVPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELA
Ga0066388_10027772913300005332Tropical Forest SoilLPFAPIVSGSYGQDCAGVFAGGAADTREELAFAISLSNHRLLASSFPRQTFSTTIQHMPARRRLLFVAFLLVGFLTLGLYAQESPPAPSAARILLLPRRIVSGERATLAVLDVKGRLTPGVTVTFSNGDQVKTNATGRAFFVAPLDPGVIFGSIDGRPGRVPTTILAPAEAAASSLEVSSTPRFASLADRFEILGKGFCGDADANRVTTSSRPALVLASSPASLLVLPPADLDPGPASVSIACSSRKGSEFFITFVELALKADSSPLKRGEHRTLSIRVRGTTEKIALEARNLAPDIAVLSGGNPLQLQSSGGDQNVALFDVVGQKTGNFLISIRLVPSLGRPQ*
Ga0066686_1002054133300005446SoilLLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAVPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSSASLVVLPPPDLEPGRATAEVTCAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0066698_1034806813300005558SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAVPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNSLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0066693_1002394423300005566SoilAFIGTWFSTTIPRMFVHGRGPALFFACLLAGVPAHRVSAQQLQPPASGARMLLLPRRIVSGERATLAVLDVHGRLTPGVTVNFSNGDRLTTDASGRALFVAPLNPGVIFGSIAGRTERVATAILWPGEAAAAEIQVSSAPQVASLTDRFEFFGRSFCGDADANRVTIAGQPAIVLASSPTSLVVLPPQDLRPGPATVEVSCAKRQSPPFSITLVGLELEASSLPLKTGERRALSVRVRGTTAKIALEARNLAPEIAELSGGNPLRASSSGGAENFARFEVVGQKNGSFRISIRLVPSLGRPQ*
Ga0066656_1053062313300006034SoilTIPRMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAVPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNSLRL
Ga0070765_10002739133300006176SoilMLTARDNLFRVRLPLACIRHVPLRCVFFLVTFLAAISLETADAQRPPSASGARTLLLPRQIVSGERATLAVLDGNGRLTPGVKVQFSNGDRFMTDVTGRALFVAPLDPGVLFASIAGRTDRVPTAVVSPSEAVTSSMEIAGSPSVASLTDRFELLGRGFCGDADANQVTIADQPALVLASSPASLVILPPMDLEAGTAKVEVACAKRQAPAFSMTLVELELQADSSPLKPGERRALSVRVHGSTAKVSLEARNLAPRVAELTGGDLVRGSSTGGTENVARFHLTGRKNGSFAISIRLVPSLTHPQ*
Ga0075426_1001396843300006903Populus RhizosphereMSFRRCSLFLLFACFAAPFFAIWAAAQQQAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNLGIVFGSIEGRTGRVATAILSPGEAASASIEVSSAPRVASLTDRFEVFGKGFCGDADSNQVTIAGQRAIVLSSSPASLIVLPPQDLQPGAAEVEISCAKRQSPPFSLTFVGLDLEADSSPLKPGEHRVLTVRVHGTAAKITLEARNLAAEIAELTGGNPMRTSSSGGPENLAKFEVVGRKNGSFLISMRLVSSLGRPQTGSPQQ*
Ga0099794_1001375933300007265Vadose Zone SoilMSLHRRGLSLIFGCLIASALAHGASAQQTAPPASGARRLLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRMGRVATAILSPSEAASTSIEVSSAPRVASLTDRFELFGKSFCGDADANQVTIAGQAAIVLASSPTALVVLPPPELQPGGASVKISCAKRQAPPFSITLVELELKANSSPLKPDEHRTLTVRVRGTTGKIAVEARNLAPEITELSGGNPVGATSSGGGENFARFEVVGRKNGSFLISIRLVPSPGRPLP*
Ga0099795_1002687823300007788Vadose Zone SoilVLGTAIFSAAFAQQPPPASAARILLLPKRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIYASIAQRPGRVHTTVLTAAESAASGIEVASAPRFASLTDRFELSGLGFCGDADANQVRLGGQPALVLASSPLSLLVLPPADLEPGREVLDITCAKRTSSSFSTTFVALSLEADSTPLAPGEQRTVTVHVRGTTSKVGLEARNLAPDVADLTAGNPARISSTGGADNVGRFLLVGRQHGTFLISMRLLPSVAPP
Ga0066710_10001006913300009012Grasslands SoilRRAFLFSFLIANIFAPWADGQESAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASISGRPGRMATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE
Ga0066710_10005488013300009012Grasslands SoilGPPRFARRRFSTTIPRMARHRPLLFLACSIASFFVLRSAAQQPPPAPSGARILLLPRRIVSCQRASLAVLYVNGRLTPGGVVNFSNGDRVTTDSTGRALFVAPLTPGVILGLLTGRAGRVATTMLRPTEAVASSMEISSAPRVASLTDRFEIFGKGFCGDADANQVRIAGQTSLVLASSSVSLVILPPSDLRPGGATVEIVCAKGEARPFSLTLVGLDLVADSSPLKPGEHRGLTVRVRGITARISLEARNLAPEVAELAGGNPARALSSGGTENWARFEVTGRKSGNFLISIRLVPWLGRPQ
Ga0099829_1003527333300009038Vadose Zone SoilMLRRPALPLFCIQVAFLLSLFAFGQQPPPPTAARMLLLPKRIVSGEHATLAVLDVNGRLTPGVTVTFSNGDQLKTDATGRGLFVAPLTLGVLFASIEGRPGRVPTAVVSAAENSSSAIEIASAPRFASLSDRFQITGRGFCGDADANQVTIRGKSALALAASPDSLVVLPPAELEPGPAEVEIACAKRTARVFSLILVALQLEADSSPLGPGEHRTLTVRVKGTALKVGLGAQNLAPDIAELAGGNPVRLSSTGGSENLGRFEVVGRTRGSFVISIHLLSSGALVRP*
Ga0099829_1010035123300009038Vadose Zone SoilMSWHRRAFLFSFLIANIFAPRADSQSGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISAAPRVASLTDRFELFGRGFCGDADSNQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKILLEARNLAPNIAELAGGNPLRLSSTGGAENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0099829_1014642213300009038Vadose Zone SoilMSLHCRGSAIFLTFLAVFFFAREATAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRTGRVATAILSPSEAVSASIEVSSTPRVASLTDRFEIFGKSFCGDADANQVTIAGQPAIVLASSPAALVVLPPSELQPGSAAVEVSCAKRHSSEFSATFVGLELEADSSPLKPGEQRTLTVRVRGTLAKVALEARNLAPEIAGLTGGNPVRASSSGGTENSANFEVVGRNNGGFLISIRLVPSLGRPAP*
Ga0099830_1001711043300009088Vadose Zone SoilMFLHRRGIALLFGCLITGVFPAVGNAQQSPPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVMVNFSNGDRLTTDATGRTLFVAPLNPGVIFGSIAGRAGQVATAILSSSEAASTSIEVSSAPRVVSLTDRFEISGKSFCGDADANQVTIAGQRAIVLASSPTALVVLPPPDLQPGSAAMEVSCAKRQSSPTFSVTFVKLELEADSSPLKPDEHRALTVRVHGTASKLALEARNLAPEIAELAGGNPVRASSSGGAENLGRFEVVGRKNGSFLISIRLVPSLGR
Ga0099830_1005338133300009088Vadose Zone SoilMLSRRFWFAVALPCAAAFLLLDGVSAQQAPPASGARTLLLPRRIVSGERATLAVLDANGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTAVLSPAEGAASSIEVSSAPRAASISDRIELTGHGFCGDADANQVRIGGKAALVLASSPSSLVVLPPIDLEPGPAALKVACAKRSAPIFNITFVALQLEANSSPLSPGEHRVLTVHARGTTAKVGLEARNLAPDIAELGGGNPLRLSTSGGAENSVRFEITGRKRGSFLISIRLLSSIGPPR
Ga0099828_1063152113300009089Vadose Zone SoilMSWHRRAFLFSFLIANIFAPRADSQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISAAPRVASLTDRFELFGRGFCGDADSNQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKILLEARNLAPNIAELAGGNPLRLSSTGGAENFARFDLVGRKSGSFLISIRLMPSMG
Ga0099827_1029491813300009090Vadose Zone SoilMSLRRRCPGLLSACLIVGVFAPGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNADRLTTDATGRVLFVAPLNPGVIFGSIAGRAGRVATTILSPSEAASTAIEISSVPRVASLTDRFELFGKSFCGDADANQVTIAGQPAIVLASSPIALVVLPPPDLHPGSAAVEVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPFLRSPLP*
Ga0126373_1006046333300010048Tropical Forest SoilMPSRLRLLFLAFALAYFFAPVLNAQEPPPAPSAARILLLPRRIVSGERATLAVLDLKGRLTPGVTVTFSNGDQVKTNTTGRAVFVAPLDPGVIFGSIAGRPGRVSTVILSPAEAAASSLEISSAPRFASLADRFEIFGKGFCGDADANQVTVGGQPALVLASSPASLLVLPPVALDPGAAPVSIACSGRKAPESSITFVELSLKADSSPLKRGEHRMLSVYVQGTTEKITLEARNLAPDIAVLSDGNPVQHQSSGGDQNVAQFEVVGQKTGNFLISIRLIPSVGHSQ*
Ga0134088_1012794413300010304Grasslands SoilIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAVPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSSASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0126370_1028633313300010358Tropical Forest SoilADTREELAFAISLSNHRLLASSFPRQTFSTTIQHMPARRRLLFVAFLLVGFLTLGLYAQESPPAPSAARILLLPRRIVSGERATLAVLDVKGRLTPGVTVTFSNGDQVKTNATGRAFFVAPLDPGVIFGSIDGRPGRVPTTILAPAEAAASSLEVSSTPRFASLADRFEILGKGFCGDADANRVTTSSRPALVLASSPASLLVLPPADLDPGPASVSIACSSRKGSEFFITFVELALKADSSPLKRGEHRTLSIRVRGTTEKIALEARNLAPDIAVLSGGNPLQLQSSGGDQNVALFDVVGQKTGNFLISIRLVPSLGRPQ*
Ga0126376_1023966423300010359Tropical Forest SoilLFLAFALACFFAPVLNAQEPPPVPSAARILLLPRRIVSGERATLAVLDLKGRLTPGVTVTFSNGDQVKTNTTGRAVFVAPLDPGVIFGSIAGRPGRVSTVILSPAEAAASSLEISSAPRFASLADRFEIFGKGFCGDADANQVTVGGQPALVLASSPASLLVLPPVALDPGAAPVSIACSGRKVPESSITFVELSLKADSSPLKRGEHRTLSVRVQGTTEKITLEARNLAPDIAVLSGGNPVQHQSSGG
Ga0126378_1000031893300010361Tropical Forest SoilMATRQMLPLPACLFAAVLSFAPLLAAQQAPPVPAAARTLLLPRRIVSGERATLAVLDANGRLTPGVAVTFSNGDRVKTDETGRALFVAPLDPGIIFGSIGGRPGRVSTVILTPAEAAGTSLEISDVPRVASFTDRFEILGKGFCGDADSNKVTIAGRPALVLASSPASLIVLPSDDLSPGSAAVAVTCAQHKAPEFSVAFVELALKADSSPLKRGEHRTLSVLIHGTTEKATLEARNLAPEIAALSGGNSAQRQSSGGEQNVAQFEVVGQKSGNFLISIRLVPSLTHPQ*
Ga0126378_1003477053300010361Tropical Forest SoilMPSRLRLLFLAFALAYFFAPVLNAQEPPPAPSAARILLLPRRIVSGERATLAVLDLKGRLTPGVTVTFSNGDQVKTNTTGRAVFVAPLDPGVIFGSIAGRPGRVSTVILSPAEAAASSLEISSAPRFASLADRFEIFGKGFCGDADANQVTVGGQPALVLASSPASLLVLPPVALDPGAAPVSIACSGRKVPESSITFVELSLKADSSPLKRGEHRTLSVRVQGTTEKITLEARNLAPDIAVLSGGNPVQHQSSGGDQNVAQFEVVGQKTGNFLISIRLIPSVAHSQ*
Ga0126381_10013230713300010376Tropical Forest SoilMPSRQRLLFLAFALAYFFAPVLNAQEPPPTPSAARILLLPRRIVSGERATLAVLDLKGRLTPGVTVTFSNGDQVKTNTTGRAVFVAPLDPGVIFGSIAGRPGRVSTVILSPAEAAASSLEISSAPRFASLADRFEIFGKGFCGDADANQVTVGGQPALVLASSPASLLVLPPVALDPGAAPVSIACSGRKVPESSITFVELSLKADSSPLKRGEHRTLSVRVQGTTEKITLEARNLAPDIAVLSGGNPVQHQSSGGDQNVAQFEVVGQKTGNFLIS
Ga0136449_10008572133300010379Peatlands SoilMLSACDKLFRVKLPLNYIRHMPRRAVFVLMAFFAAMSLGTADAQRPLSASGARTLLLPRQIVSGERATLAVLDANGRLTPGVTVKFSNGDQLTTDVTGRALFVAPLDPGVIFASIAGRSERLTTVVLSAAEAATSSMEISGAPTAASLTDRFELLGRGFCGDADSNQVTIGGQPALVLASSPAALVILPPMEIEAGTAKLEVTCAKWQAPAFSIALVELELQADSSPLKPDEHRVLTVRVRGSTAKVSLEARNLAPTIAELAGGNPVRISSTGGSENFARFHLVGRKNGSFVISIRLVPLLTQPQ*
Ga0150983_1118177713300011120Forest SoilCFPAAILCAAGCLFLGDVLAQQAPPASGARILLLPRRIVSGERATLAVLDANGRLTPGVKVNFSNGDRFTTDITGRSLFVAPLTLGVIFASIEGRPGRVPAAVLSAEEGASPSIEVSSAPRVASVSDRFELAGHGFCGDADANQVMIGGKAALVLASSPSALIVLPSGDLGPGPAVVEISCAKRSAPAFTITLVSLQLEANSSPLAPAERRVLTVHVHGTTSKVALEARNLAPDVAELTGGNPVRLSTSGGIENFVKF
Ga0137392_1000756533300011269Vadose Zone SoilMLSRRFWFAVALLCAVAFLLLDVVSAQQAPPASGARTLLLPRRIVSGERATLAVLDANGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTAVLSPAEGAASSIEVSSAPRAASISDRIELTGHGFCGDADANQVRIGGKAALVLASSPSSLVVLPPIDLEPGPAALKVACAKRSAPIFNITFVALQLEANSSPLSPGEHRVLTVHARGTTAKVGLEARNLAPDIAELGGGNPLRLSTSGGAENSVRFEITGRKRGTFLISIRLLSPVGPPRP*
Ga0137392_1026646023300011269Vadose Zone SoilMSLHCRGSAIFLTFLAVFFFARDATAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRTGRVATAILSPSEAVSASIEVSSTPRVASLTDRFEIFGKSFCGDADANQVTIAGQPAIVLASSPAALVVLPPSELQPGSAAVEVSCAKRHSSEFSATFVGLELEADSSPLKPGEQRTLTVRVRGTLAKVALEARNLAPEIAGLTGGNPVRASSSGGTENSANFEVVGRNNGGFLISIRLVPSLGRPAP*
Ga0137393_1002586613300011271Vadose Zone SoilMLSRRFWFAVALLCAVAFLLLDVVSAQQAPPASGARTLLLPRRIVSGERATLAVLDANGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTAVLSPAEGAASSIEVSSAPRAASISDRIELTGHGFCGDADANQVRIGGKAALVLASSPSSLVVLPPIDLEPGPAALKVACAKRSAPIFNITFVALQLEANSSPLSPGEHRVLTVHARGTTAKVGLEARNLAPDIAELGGGNPLRLSTSGGAENSVRFEITGRKRGSFLISIRLLSSIGPPRP*
Ga0137389_1012101523300012096Vadose Zone SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGGRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISAAPRVASLTDRFELFGRGFCGDADSNQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGAENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0137383_1007409833300012199Vadose Zone SoilMFVHGRGPALFFACLIAGVPAHRVSAQQPLPPASGARMLLLPRRIVSGERATLAVLDVHGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVIFGSIAGRTERVATAILSPGEAAAAEIQVSSAPQVASLTDRFEFFGRSFCGDADANRVTIAGQPAIVLASSPTSLVVLPPQDLRPGPATVEVSCAKRRSPTFSITLVGLELEASSLPLKTGERRALSVRVRGTTAKIALEARNLATEIAELSGGNLLRVSSSGGAENFARFEVVGRKNGGFRISIRLVPSQGRPQ*
Ga0137363_1000076163300012202Vadose Zone SoilMFVHGRGPALLFACLIAGVLAHWVSAQQLQPPASAARMLLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRTERVATAILSPGEAAAAEIQVSSVPQVASLTDRFEFFGRSFCGDADANRVTIARQPAIVLASSPTSLVVLPPQDLRPGPATVEVSCAKRQSPPFSITLVGLELEASSLPLKTGERRALSVRVRGTTAKVALEARNLAPEIAELSGSNPLRASSSGGAENFARFEVVGRKNGSFRISIRLVPSQGRPQ*
Ga0137363_1045181813300012202Vadose Zone SoilMLSRRFWFALALLYAVAFLLLDVVSAQQAPPASGARTLLLPRRIVSGERATLAVLDANGRLTPGVAVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTAVLSPAEGATSSIEVNSAPRAASISDRIELTGHGFCGDADANQVRIGGKAALVLASSPSSLVVLPPIDLEPGPAALEVACAKRSAPIFNITFVALQLEANSSPLAPGEHRVLAVHVRGTSAKVRLEARNLAPDIAELSGGNPLHLSTSGGAENSVRFEITGRKRGSFLISIRLLSPVGPPRP*
Ga0137363_1046980213300012202Vadose Zone SoilMIGVFALGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPRVTVNFSNGDRLTSDATGRALFVAPLNPGMIFGSIAGRAGRVATTILLPSEAASTSIEVSSLPRVASLTDRFELFGKGFCGDADANQVMIAGQPAIVLASSPIALVVLPPPDLHPGSAAVQVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPSLRSPLP*
Ga0137362_1011744113300012205Vadose Zone SoilMLSRRFWFAVALLCAVAFLLLDVVSAQQAPPASGARTLLLPRRIVSGERATLAVLDANGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTAVLSPAEGATSSIEVSSAPRAASISDRIELTGHGFCGDADANQVRIGGKAALVLASSPSSLVVLPPIDLEPGPAAVEVACTKRSAPIFNITFVALRLETDSSPLAPGEHRVLAVHVRGTTAKVGLEARNLAPDVAELSGGNPLRLSTSGGAENSVRFEITGRKRGSFLVSIRLLSPVGPPRP*
Ga0137362_1026586013300012205Vadose Zone SoilMSLRRRCPGLLSACLIVGVFAPGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPRVTVNFSNGDRLTSDATGRALFVAPLNPGMIFGSIAGRAGRVATTILSPSEAASTSIEVSSVPRVASLTDRFELFGKGFCGDADANQVMIAGQPAIVLASSPIALVVLPPPDLHPGSAAVQVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPSRRSPLP*
Ga0137362_1026983513300012205Vadose Zone SoilMSWHRRAFLFSFLIANIFAPWADGQESVPAASRARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNSGVIFASIAGRPGRVAAAVLAPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKSGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0137362_1034267613300012205Vadose Zone SoilTLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNSGVIFASIAGRPGRVAAAVLSPSEADSTSIEISAAPRVASLTDRFELFGRGFCGDADGNQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGAENLARFDLVGRKSGSFLISIRLMPSMGHPTE*
Ga0137381_1000453253300012207Vadose Zone SoilMFVHGRGPTLLFACLIAGVLAHWVSAQQLQPPASGARMLLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVIFGSIAGRTERVATAILSPGEAAAAEIQVSSAPQVASLTDRFEFFGRSFCGDADANRVTIAGQPAIVLASSPTSLVVLPPQDLRPGPATVEVSCAKRRSPTFSITLVGLELEASSLPLKTGERRALSVRVRGTTAKIALEARNLATEIAELSGGNLLRVSSSGGAENFARFEVVGRKNGGFRISIRLVPSQGRPQ*
Ga0137376_1002564653300012208Vadose Zone SoilMLLLPRRIVSGERATLAVLDMNGRLTPGVTVNFSNGDRLTADETGRALFVAPLNPGVIYGSIADRTGRVATAILAPGEAASTSIEVSSAPQFASLIDRFELFGKSFCGDADANEVTIAKQPAIVLAASPTALVVLPPPDVHPGKAAVEVSCAKRRSSPFSITLVGLALEANSSPLKLGEHRALTVRVRGTRAKIALEARNLAPEIAELFGGNPLRASSSGGAENLARFEIVGRKNGSFLISIRLVPFLGRP*
Ga0137377_1017150713300012211Vadose Zone SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGAENLARFDLIGRKSGSFLISIRLMPSMGHPTE*
Ga0137384_1060867923300012357Vadose Zone SoilMFVHGRGPTLLFACLIAGVLAHWVSAQQLQPPASGARMLLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVIFGSIAGRTERVATAILSPGEAAAAEIQVSSAPQVASLTDRFEFFGRSFCGDADANRVTIAGQPAIVLASSPTSLVVLPPQDLRPGPATVEVSCAKRRSPTFSITLVGLELEASSLPLKTGERRALSVRVRGTTAKIALEARNLATEIAELSGGNLLRVSSSGGAENFARF
Ga0137360_1072222213300012361Vadose Zone SoilVKFSTTIPRLSWHRRAFLFSFLIANIFAPWADGQQPAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSVSMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSSASLVVLPPPDLEPGRATAEIACAKREAPPFSLTLVGLELETDSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELA
Ga0137361_1032917413300012362Vadose Zone SoilMFVHGRGPALLFACLIAGVLAHWVSAQQLQPPASAARMLLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRTERVATAILSPGEAAAAEIQVSSVPQVASLTDRFEFFGRSFCGDADANRVTIARQPAIVLASSPTSLVVLPPQDLRPGPATVEVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPSRRSPLP*
Ga0137390_1081945813300012363Vadose Zone SoilMSLHCRGSAIFLTFLAVFFFARDATAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRTGRVATAILSPSEAVSASIEVSSTPRVASLTDRFEIFGKSFCGDADANQVTIAGQPAIVLASSPAALVVLPPSELQPGSAAVEVSCAKRHSSEFSATFVGLELEADSSPLKPGEQRTLTVRVRGTLAKVALEARNLAPEIAGLTGGNPVRASSSGGTENSANFEVVGRNNGGFLI
Ga0137358_1016900123300012582Vadose Zone SoilMSLRRRCPGLLSACIIVGVFAPGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPRVTVNFSNGDRLTSDATGRALFVAPLNPGMIFGSIAGRAGRVATTILSPSEAASTSIEVSSGPRVASLTDRFELFGKGFCGDADANQVMIAGQPAIVLASSPIALVVLPPPDLHPGSAAVQVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGR
Ga0137398_10000180153300012683Vadose Zone SoilMKRWRCGLKSQDNGDFRYNSSMLGRRSVAACAVLGTAIFSAAFAQQPPPASAARILLLPKRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIYASIAQRPGRVHTTVLTAAESAASGIEVASAPRFASLTDRFELSGLGFCGDADANQVRLGGQPALVLASSPLSLLVLPPADLEPGREVLDITCAKRTSSSFSTTFVALSLEADSTPLAPGEQRTVTVHVRGTTSKVGLEARNLAPDVADLTAGNPARISSTGGADNVGRFLLVGRQHGTFLISMRLLPSGAPPRP*
Ga0137398_1000089923300012683Vadose Zone SoilMVGVFALGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVHGRLTPRVTVNFSNGDRLTSDATGRALFVAPLNPGMIFGSIAGRAGRVATTILSPSEAASTSIEVSSVPRVASLTDRFELFGKGFCGDADANQVLIAGQPAIVLASSPIALVVLPPPDLHPGSAAVQVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPSLRSPLP*
Ga0137359_1003508943300012923Vadose Zone SoilMLSRRFWFALALLYAVAFLLLDVVSAQQAPPASGARTLLLPRRIVSGDRATLAVLDANGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTAVLSPAEGATSSIEVNSAPRAASISDRIELTGHGFCGDADANQVTIGGKAALVLASSPSSLVVLPPIDLEPGPAALEVACAKRSAPIFNITFVALRLEANSSPLAPGEHRVLAVHVRGTTAKVGLEARNLAPDIAELSGGNPLHLSTSGGAENSVRFEITGRKRGSFLISIRLLSSIGPPRP*
Ga0137413_1001674443300012924Vadose Zone SoilMVGVFALGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVHGRLTPRVTVNFSNGDRLTSDATGRALFVAPLNPGMIFGSIAGRAGRVATTILSPSEAASTSIEVSSVPRVAWLTDRFELFGKGFCGDADANQVLIAGQPAIVLASSPIALVVLPPPDLHPGSAAVQVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPSLRSPLP*
Ga0137419_1041770313300012925Vadose Zone SoilMSLRRRGSAIFLALLTVFLLARDADAQRPAPAASGARILLLPRRIVSGERATLAVLDLNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPTAILLPGESASASIEISSAPRVASLTDRFEIFGKSFCGNADANQVTIAGQPAIVLASSRAALVVLPAPDLQPGSAVVEVSCAKRQAAPFSVTFAGLELEADSSPLKPGEHRVLTVRVRGTTAKIALEARNLAPEIAELSGGNPARASSCGGEENVAKFEVAGRQNGSFLISIRLVPSLRGPLPCMAASTPCFL
Ga0153915_1000485363300012931Freshwater WetlandsMLALRFFLISLILGIFCADAQVFSFAQQAPSASGARMLILPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRFTTDSTGRALFVAPLNPGVIFASIAGRPGRVATAILSPQEVASTAIKISSVPQVALVTDRFEISGRGFCGDADANQVTVGGQPALVLASSPASIVVLPPFETKPGAAAVEVSCAKNAAPAFLTTFVALELEADSSPLKPGEHRSLTVRVRGTNSKIALEARNLAPDIAELTGGTAVRQSSNGGMNNTAQFELVGRKQGSFLISIRLVPVVSRPHPNQ*
Ga0134075_1007546623300014154Grasslands SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRMATTVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSSASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENLARFDLIGRKSGSFLISIRLMPSMGHPTE*
Ga0137414_104570313300015051Vadose Zone SoilLLLPRRIVSGERATLAVLDVHGRLTPRVTVNFSNGDRLTSDATGRALFVAPLNPGMIFGSIAGRAGRVATTILSPSEAASTSIEVSSVPRVAWLTDRFELFGKGFCGDADANQVLIAGQPAIVLASSPIALVVLPPPDLHPGSAAVQVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPSLRSPLP*
Ga0167668_101274423300015193Glacier Forefield SoilMILHRRGMAFFFTCLIASIFVPWANAQQSPPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDATGRGLFVAPLNPGVIFASIAGREGRVATTVLSLGEPAFSSIEISSAPRVASLSDRFELFGRGFCGNADANQVTIAGQRAIVLASSPTSLVVLPPLELGPGAAAVEVSCAKRQAASFSITLAGLELEADSSPLAPGEHRGLTVRVSGTTAKIPLEARNLAPDIAELAGGNPVRLSSSGGAENFARFDLVGRKKGSFLISIRLVAVSVRPQ*
Ga0179594_1005924013300020170Vadose Zone SoilMRFSTTIPRMSLRRRCPGLLSACLIVGVFAPGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPRVTVNFSNGDRLTSDATGRALFVAPLNPGMIFGSIAGRAGRVATTILSPSEAASTSIEVSSVPRVASLTDRFELFGKGFCGDADANQVMIAGQPAIVLASSPIALVVLPPPDLHPGSAAVQVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPSLRSPLP
Ga0179592_1011646013300020199Vadose Zone SoilMSLRRRGSAIFLALLTVFLLARDADAQRPAPAASGARILLLPRRIVSGERATLAVLDLNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPTAILLPGESASASIEISSAPRVASLTDRFEIFGKSFCGNADANQVTIAGQPAIVLASSPAALVVLPPPDLQPGSAAVEVSCAKRQAAPFSVTFVGLELEADASPLKPGEHRALTVRVRGTTAKIALEARNLAPEIAELSGGNPARASSSGGEQNVAKFEVAGRQNGSFLISIRLALSAGRPLPANVPQ
Ga0210407_10000395453300020579SoilMFWRGLVRIGVWLVILCAVAQVFSSPRPAQQAPLASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVMFASIEGRPDRVKTVIFSPSEVTASSVEITSAPRVASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPMDLGAGPASVEVSCAKRQAPHLAITLVALELEADSSPLKAGEHRALTVHIRGTASKVALEARNLSPDIAELTGGNPVRLSSTGGAENLAHFDLTGRKNGSFLISIRMVPALAHSPHSQ
Ga0210403_1029179613300020580SoilMLWRGLIRTGVWPVVLCTVAQVCSFPRPPQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVMFASIEGRPDRVKTVIFSPSEADSSSLEISSAPRVASLTDRFELVGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPLDLGAGSASVEISCAKRQAHPFSITLVALELEADASPLKAAEHRILTVHVRGTAAKIALEARNLSPDVAELAGGNPVRLSSSGGTENLAHFDLTGRKNGSFLISIRLVPALAHSQ
Ga0210403_1069851113300020580SoilIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLDLGVMFASIEGRPDRVKTVIFSPSEATASSVEITSAPRVASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPMDVGAGPASVEISCAKHQASPLAITLVALELEADSSTLKAGEHRALTVRIRGTAAKIALEARNLSPDIAELTGGNPVRLSSTGGAENLAHFDLTARKNGSFLISIRLVPALAHSQ
Ga0210399_1010488923300020581SoilMTFAQPRFSTTISRMLWRGLIRIGVWLVVLRVAAQVFPLPQFSQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVSVAFSNGDKLITNATGRALFVAPLNLGVMFASIDGRPDRVKTVIFSPSETVSSSLEITSSPRVASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSLLVLPPMDLGPGSASVEISCAKHQAPPFSITLVALELEADSSPLKAGEHRALTVRVRGTTGKIALEARNLSPDVAELAGGNPVRVSSSGGAENLAHFDLTGRKNGSFLISIRLVPALAHSQKRN
Ga0210401_1000644833300020583SoilMSVLVVVLCAVEQFFLVPAPAQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVAFASIQGRPDKVSTAIIAPDEAAATAIEIASAPRFASLTDRFELLGRGLCGDADANRVMVGGQPAIVLAASPVSLVVLPPMDREPGPANVEISCAKRQAAPLTITLVSLEMEADSSPLKSGEHRALTIRVRGTPAKIALEARNLAPEIATLSGGNPVKLSSSGGAENFVRFELIGRKNGSFLISIRLVPTLGRPQ
Ga0210401_1014326633300020583SoilRRIVSGERATLAVLDVNGRLTPGVNVAFSNGDELTTNATGRALFVAPLHLGVMFATIVGRAGRVATAILSPSEAAASVVEVSSAPRVASLTDRFEILGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPMDLGPGPASVEISCLKRRAPSFAITFVELELDADSSPLKAGEHRALTVRVRGTSAKIMLEARNLSPDIAELAGSNPARISSSGGAENLAHFDLTGRKNGSFMISIRLVPNLAHSQ
Ga0210404_1000140863300021088SoilMLWRSLIRTGVWLVVLCTVAQVRSFPRPAQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVMFASIEGRPDRVKTVIFSPSEADSSSLEISSAPRVASLTDRFELVGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPLDLGAGSASVEISCAKRQAPPFSITLVALELEADASPLKAAEHRILTVHVRGTAAKIALEARNLSPDVAELAGGNPVRLSSSGGAENLAHFDLTGRKNGSFLISIRLVPALAHSQ
Ga0210406_1024429713300021168SoilMLWRGLIRTGVWPVVLCTVAQVCSFPRPPQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVMFASIEGRPDRVKTVIFSPSEADSSSLEISSAPRVASLTDRFELVGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPLDLGAGSASVEISCAKRQAPPFSITLVALELEADASPLKAAEHRILTVHVRGTAAKIALEARNLSPDVAELAGGNPVRLSSSGGTENLAHFDLTGRKNGSFLISIRLVPALAHSQ
Ga0210400_1000444933300021170SoilMLWRGLIRTGVWLVVLCTVAQVRSFPRPGQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVMFASIEGRPDRVKTVIFSPSEADSSSLEISSAPRVASLTDRFELVGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPLDLGAGSASVEISCAKRQAPPFSITLVALELEADASPLKAAEHRILTVHVRGTAAKIALEARNLSPDVAELAGGNPVRLSSSGGAENLAHFDLTGRKNGSFLISIRLVPALAHSQ
Ga0210400_1007577623300021170SoilVTVLSGFGEAAIARIAFIGMRFSTTIPRMSIHRHGPALLFACLIAGIFAAGASAQRSAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLRTDVTGRALFVAPLNPGVIFGSIAGRTGRVATAIVSPNEAASTSIEVSSAPRVASLTDRFELFGRGFCGDADANQVTVAGQPAIVLASSPTALVVLPSPDLQPGSAVVTVSCAKRQSPPFSITLVGLELEADSSPLKQGEHRALTVRVRGTTAKIALEARNLAPEIAELSGGNPVRASSSGGEENFAKFEVVGRKNGNFLISIRLAPSLGRPLP
Ga0210405_10000203333300021171SoilMSVLVAVLCVVVEFFLVPAPAQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVAFASIQGRPDKVSTAIIAPDEAAPTAIEIASAPRFASLTDRFELLGRGLCGDADANRVMVGGQPAIVLAASPVSLVVLPPMDREPGPANVEISCAKRQAAPLTITLVSLEMEADSSPLKSGEHRALTIRVRGTPAKIALEARNLAPEIATLSGGNPVKLSSSGGAENFVRFELIGRKNGSFLISIRLVPTLGRPQ
Ga0210405_1020401423300021171SoilMSLRRRGSLIFLAIVTAFLCAREADAQRPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPAAILLPGEAASASIEVSSAPRVASLIDRFEIFGKSFCGEADSNQVTIAGQLAIVLASSPAVLVVLPPPDLQPGGATVEVSCAKRQAALFSVTFVGLELEADSSPLKPGEHRTLTVRVRGTTAKIALEARNLAPEIAELSGGNPVRASSSGGEENVAKFEVVGRQNGSFLISIRLALSAGRPLPASVPQ
Ga0210396_1002667433300021180SoilMFRMNSPLHYNRQMLLRGVLLCTAFLAATSFESANPQRPPSASGARTLLLPRQIVSGERATLAVLDGSGRLTPDVTIQFSNGDRFTTDVTGRALFVAPLDPGVLFASIAGRTDRVPAAIVSPLEAVTSSMEIVGTPNVASLTDRFELLGRGFCGDADANQVTIGGQSALVLASSPASLVILPPMDLEAGTAKVEVACAKRQAPAFSMTLVELELQADSSPLKPGERRTLSVHLHGSAAKVSLEARNLAPRIAQLAGGNVVRASSTGGTENVAHFHLTGRKNGSFAISIRLIPSLARPQ
Ga0210397_1024589723300021403SoilMVLRGLIRTGVGLVVLCAVAQVFSSPPPAQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLDLGVMFASIEGRPDRVKTVIFSPSEATASSVEITSAPRVASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPMDVGAGPASVEISCAKHQASPLAITLVALELEADSSTLKAGEHRALTVRIRGTAAKIALEARNLSPDIAELTGGNPVRLSSTGGAENLAHFDLTARKNGSFLISIRLVPALAHSQ
Ga0210394_1003221733300021420SoilFDTMFRMNSPLHYNRQMLLRGVLLCTAFLAATSFESANPQRPPSASGARTLLLPRQIVSGERATLAVLDGSGRLTPDVTIQFSNGDRFTTDVTGRALFVAPLDPGVLFASIAGRTDRVPAAIVSPLEAVTSSMEIVGTPNVGSLTDRFELLGRGFCGDADANQVTIGGQSALVLASSPASLVILPPMDLEAGTAKVEVACAKRQAPAFSMTLVELELQADSSPLKPGERRTLSVHVHGSAAKVSLEARNLAPRITELAGGNVVRASSTGGTENVAHFHLTGRKNGSFAISIRLIPSLARP
Ga0210384_10001495123300021432SoilMFRMNSPLHYNRQMLLRGVLLCTAFLAATSFESANPQRPPSASGARTLLLPRQIVSGERATLAVLDGSGRLTPDVTIQFSNGDRFTTDVTGRALFVAPLDPGVLFASIAGRTDRVPAAIVSPLEAVTSSMEIVGTPNVASLTDRFELLGRGFCGDADANQVTIAGQSALVLASSPASLVILPPMDLEAGTAKVEVACAKRQAPAFSMTLVELELQADSSPLKPGERRTLSVHVHGSAAKVSLEARNLAPRITELAGGNVVRASSTGGTENVAHFHLTGRKNGSFAISIRLIPSLARPQ
Ga0210384_1003156633300021432SoilMAFIDPFFSTTISRMLWRGIIRAGIWLIVFYAVAQIFASPRPAQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVNVAFSNGDKLTTNATGRALFVAPLNLGVMFASIVGRAGRVTTAILSPSEAAASAVEVSSAPRVASLTDRFEILGRGFCGEADANQVSVAGQPAIVLAASPVSLVVLPPMDLGPGPASVEISCLKRRAPSFAITFVALDLDADSSPLKAGEHRALTVRVHGTSAKIVLEARNLSPDIAELAGGNPVRVSSSGGAENLAHFDLTGRKNGSFIISIRLVPNLAHSQ
Ga0210384_1016124623300021432SoilMLWRGLIRTGIWLVVLCGMAQVFSLPHPAQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTIEFSNGDKLTTNATGRALFVAPLSLGVMFASIEGRPDRVKTVIFSPSEAASFSLEITSAPRLASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSILVLPPMDLGPGPASVEISCAKRQAPPISITLVALELEADASPLKAAEHRILTVRVRGTAAKIGLEARNLSPDIAELAGGNPVRLSSSGGAENVAHFDLTGRKNGSFLISIRLVPSLARPQ
Ga0210384_1053761813300021432SoilASGARMLLLPRRIVSGERATLAVLDVNGRLTPGVTIEFSNGDKLTTNATGRALFVAPLNLGVMFASIEGRPDRVKTVIFSPSEAASSSLEITSSPRLASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSILVLPPMDLGPGLASVEISCAKRQAPPFSITLVALELEADASPLKAAEHRILTVRVRGTAAKIALEARNLSPDIAELAGGNPVRLSSSGGAENLAHFDLTGRKNGSFLISIRLVPSLAHPQ
Ga0187846_1007558723300021476BiofilmSGERATLAVLDFNGRLTPGVTVTFSNGDRLTTDTTGRALFVAPLNPGTILGSIAGREGKIPTTILADSGAASTPIEITSVPALASLADRFELAGRGFCGDADANQVAISGAKALVLASSPTSLIVLPPADLEPGRASVDISCAKRTGPPIEIVFVSLALEADSSPLKPGEHRALTVRVRGTTGKVALEARNLAPDIAELAGGNPVKASSSGGAENLAHFEVVGRKRGNFLISIRLATPFRRTLRN
Ga0210402_1011580533300021478SoilMFSRGFVRIGVWIVVLCAVAQVFSSPRPAQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDKLTTNATGRALFVAPLNLGVMFASIEGRPDRVKTVIFSPSEATASSVEITSAPRVASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPMDLGAGPASVEISCAKHQASPLAITLVALELEADSSPLKAGEHRALTVRIRGTVAKIALEARNLSPDIAELTGGNPVRLSSSGGAENLAHFDLTGRKNGSFLISIRLVPALAH
Ga0210410_1000190163300021479SoilMLTARDNLFRVRLPLACIRHVPLRCVFFLVTFLAAISLETADAQRPPSASGARTLLLPRQIVSGERATLAVLDGNGRLTPGVKVQFSNGDRFMTDVTGRALFVAPLDPGVLFASIAGRTDRVPTAVVSPSEAVTSSMEIAGSPSVASLTDRFELLGRGFCGDADANQVTIADQPALVLASSPASLVILPPMDLEAGTAKVEVACAKRQAPAFSMTLVELELQADSSPLKPGERRALSVRVHGSTAKVSLEARNLAPRVAELTGGDLVRGSSTGGTENVARFHLTGRKNGSFAISIRLVPSLTHPQ
Ga0210410_1013868323300021479SoilMLWRGLFRTGTLPVILCAVAQVFSFPLPEQQAPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVNVAFSNGDELTTNATGRALFVAPLHLGVMFATIVGRAGRVTTAILSPSEAAASVVEVSSAPRVASLTDRFEILGRGFCGEADANQVTVAGQPAIVLAASPVSLVVLPPMDLGPGPASVEISCLKRRAPSFAITFVELELDADSSPLKAGEHRALTVRVRGTSAKIMLEARNLSPDIAELAGSKPARISSSGGAENLAHFDLTGRKNGSFMISIRLVPNLAHSQ
Ga0209377_115158813300026334SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRMATTVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSSASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEA
Ga0209056_1001814633300026538SoilMSWHRRAFLFSFLIANIFAPWADGQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAVPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNSLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE
Ga0209648_10003216103300026551Grasslands SoilVRRSAAGACIAFISMRFSTTIPGMCWHRRVLAFIFTCVIASVFAPGAAGAQQSAPAASGARMLLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTYATGRALFVAPLNPGVVFGSIAGRTGRVVTAVLSPSEDVSASLEVSSAPRVASLTDRFEIFGKSFCGDADANQVTIAGQPAIVLASSPTALVVLPPADLQPGSAAVEVSCAKCQSPPFSITFLGFELEADSAPLRPGEHRELTVRVRGTPAKIALEARNLAPEIAELSGGNPVRASTGGGAENFARFEVVGRSNGSFLISIRLVPSLRSPLP
Ga0209648_1001947123300026551Grasslands SoilLPDGVRITFIRMSFSTTIPRMCLPRCGAALLLVCLIPGVFAAVGNAQQSAPAPSGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSLAGRAGRVATAVLSPSEAASTSIEVSSAPRVASLTDRFEIFGKSFCGDADANQVTIAGQRGIVLASSPTTLVVLPPPELQSGSAAVEVSCAKRQLSPAFTVTFVGLELEADSSALKPDEHRALTVRVHGTASKLALEARNLAPDIAELAGGNPVRSSSSGGAENLARFEVVGRKTGSFLISIRLVPSLGRPQP
Ga0209648_1006808233300026551Grasslands SoilMSMHRRSRAFIFACLIVGVFAPAINAQQSAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLRTDVTGRALFVAPLNPGVIFGSIAGHAGRVVTAILAPGEASSTSIEVSSAPRVASLTDRFELFGRSFCGDADANQVTIAGQPAIVLASSPAALVVLPPPDLQPGSAKVEISCAKRQSPPFSVTFVGLELEADPSPLKPGEHRELTVRVRGTTAKIALEARNLAPEIAELTGGSPVRGSSSGGEENFAKFEVVGRKNGNFLISIRLVPSLGRPWP
Ga0209648_1011400623300026551Grasslands SoilMSWHRRAFLFSFLIASIFAPWADGQESAPAASRARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNSGVIFASIAGRPGRVAAAVLVSSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGNPLRLSSTGGTENFARFDLVGRKSGSFLISIRLMPSMGHPTE
Ga0179587_1013867123300026557Vadose Zone SoilMSLRRRGSAIFLALLTVFLLARDADAQRPAPAASGARILLLPRRIVSGERATLAVLDLNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPTAILLPGESASASIEISSAPRVASLTDRFEIFGKSFCGNADANQVTIAGQPAIVLASSPAALVVLPPPDLQPGSAVVEVSCAKRQAAPFSVTFVGLELEADASPLKPGEHRALTVRVRGTTAKIALEARNLAPEIAELSGGNPARASSSGGEQNVAKFEVAGRQNGSFLISIRLALSAGRPLPANVPQ
Ga0209220_105242923300027587Forest SoilMPLHRRGISYFLTCLIASLFAPWANAQQSPPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDATGRGLFVAPLNPGVIFASIAGREGRVATAILSPGEIASSSIEISSAPRVASLSDRFELFGRGFCGNADANQVTIAGQPAIVLASSPSSLVVLPPLELGPGPAVVEVSCAKRQAASFSITLARLELEADSSPLAPGEHRRLTVHISGTTAKIPLEARNLAPDIAELAGGNPVRLSTSGGAENFARFELVGRKKGSFLISIRL
Ga0209117_100555323300027645Forest SoilMSLHRRKPALVFACLIAVAFAPGANPQRFPPPASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTNATGRALFVAPLNPGVIYGSIAGRTVRVATAILSPSEAASTSIEVSSAPRVASLTDRAELLGRGFCGDADSNQVTIAGQPAIVLASSPTALVVLPSPDLQPGSAIVEVSCAKRQSPPFSITFVGLNLEADSSPLKPGEHRAVTVRVGGTTAKIALEARNLSPEIAELSGDNPVRASSSGGAENIAKFEVVGRKNGSFLISIRLVPSLDRPLP
Ga0209180_1000007783300027846Vadose Zone SoilMSWHRRAFLFSFLIANIFAPRADSQSGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISAAPRVASLTDRFELFGRGFCGDADSNQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKILLEARNLAPNIAELAGGNPLRLSSTGGAENFARFDLVGRKSGSFLISIRLMPSMGHPTE
Ga0209180_1010220313300027846Vadose Zone SoilMRFSTTIPRMSLHCRGSAIFLTFLAVFFFAREATAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRTGRVATAILSPSEAVSASIEVSSTPRVASLTDRFEIFGKSFCGDADANQVTIAGQPAIVLASSPAALVVLPPSELQPGSAAVEVSCAKRHSSEFSATFVGLELEADSSPLKPGEQRTLTVRVRGTLAKVALEARNLAPEIAGLTGGNPVRASSSGGTENSANFEVVGRNNGGFLISIRLVPSLGRPAP
Ga0209701_1000163143300027862Vadose Zone SoilLPDSARIAFIPTMFSTTIPRMFLHRRGIALLFGCLITGVFPAVGNAQQSPPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVMVNFSNGDRLTTDATGRTLFVAPLNPGVIFGSIAGRAGQVATAILSSSEAASTSIEVSSAPRVVSLTDRFEISGKSFCGDADANQVTIAGQRAIVLASSPTALVVLPPPDLPPGSAAMEVSCAKRQLSPFSITLVGLELEADSSTLKPGEHRALTVRVRGTEARVALEARNLAPEIAELARGNPVRSSSSGGAENFARFEVVGRRNGSFLISIRLVPSLRGPLP
Ga0209701_1000577063300027862Vadose Zone SoilMSWHRRAFLFSFLIANIFAPRADSQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNPGVIFASIAGRPGRVATAVLSPSEADSASMEISSAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKILLEARNLAPNIAELAGGNPLRLSSTGGAENFARFDLVGRKSGSFLISIRLMPSMGHPTE
Ga0209701_1016480023300027862Vadose Zone SoilARTLLLPRRIVSGERATLAVLDANGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTAVLSPAEGAASSIEVSSAPRAASISDRIELTGHGFCGDADANQVRIGGKAALVLASSPSSLVVLPPIDLEPGPAALKVACAKRSAPIFNITFVALQLEANSSPLSPGEHRVLTVHARGTTAKVGLEARNLAPDIAELGGGNPLRLSTSGGAENSVRFEITGRKRGTFLISIRLLSPVGPPRP
Ga0209283_1033315413300027875Vadose Zone SoilVFFFAREATAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRTGRVATAILSPSEAVSASIEVSSTPRVASLTDRFEIFGKSFCGDADANQVTIAGQPAIVLASSPAALVVLPPSELQPGSAAVEVSCAKRHSSEFSATFVGLELEADSSPLKPGEQRTLTVRVRGTLAKVALEARNLAPEIAGLTGGNPVRASSSGGTENSANFEVVGRNNGGFLISIRLVPSLGRPAP
Ga0209590_1013021213300027882Vadose Zone SoilMLLLPRRIVSGERATLAVLDMSGRLTPGVTVNFSNRDRLTSDATGRALFVAPLNPGVIYGSIAGRTGRVATAILAPGEAASTSIEVSSAPQFASLIGRFELFGKSFCGDADANEVTIAKQPAIVLAASPTALVVLPPPDLHPGKAAVEVPCAKRQSSPFSITLVGLALEANSSPLKLGEHRALTVRVRGTRAKIALEARNLAPEIAELFGGHPLRASSSGGAENLARFEIVGRKNGSFLISIRLVPSLGRP
Ga0209590_1021872223300027882Vadose Zone SoilRFSTTIPRMSLRRRCPGLLSACLIVGVFAPGASAQQPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNADRLTTDATGRVLFVAPLNPGVIFGSIAGRAGRVATTILSPSEAASTAIEISSVPRVASLTDRFELFGKSFCGDADANQVTIAGQPAIVLASSPIALVVLPPPDLHPGSAAVEVSCAKRQSPPFSITLVGLALEANSSPLKPEEHRTLTVRVRGTMAKIALEARNLAPEIAELSGVNPVRASSSGGAENFARFEVVGRKNGNFLISIRLVPFLRSPLP
Ga0209590_1031081313300027882Vadose Zone SoilMSWHRRAFLFSFLIANIFAPRADSQQSAPAASGARILLLPRRIVSGERATLAVLDISGRLTPGVTVNFSNGDRLTTDETGRALFAAPLNTGVIFASIAGRPGRVATAVLSPSEADSASMEISAAPRVASLTDRFELFGRGFCGDADANQVTIGGQQAIVLASSPASLVVLPPPDLEPGRATAEVACAKREAPPFSLTLVGLELEADSSPLKPGEHRALTVRVRGTTTKIPLEARNLAPNIAELAGGN
Ga0209488_1002957423300027903Vadose Zone SoilMKRWRCGLKSQDNGDFRYNSSMLGRRSVAACAVLGTAIFSAAFAQQPPPASAARILLLPKRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIYASIAQRPGRVHTTVLTAAESAASGIEVASAPRFASLTDRFELSGLGFCGDADANQVRLGGQPALVLASSPLSLLVLPPADLEPGREVLDITCAKRTSSSFSTTFVALSLEADSTPLAPGEQRTVTVHVRGTTSKVGLEARNLAPDVADLTAGNPARISSTGGADNVGRFLLVGRQHGTFLISMRLLPSGAPPRP
Ga0137415_1023499813300028536Vadose Zone SoilMSLHRRGLSLIFGCLIASALAHGASAQQTAPPASGARRLLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRAGRVATTILSPSEAASTSIEVSSVPRVASLTDRFELFGKSFCGDADANQVTIAGQPAIVLASSPIALVVLPPPDLHPGSAAVEVSCAKRRSPPFSITLVGLALEANSSPLKPGEHRELTVRVRGTPAKIALEARNLAPEIAELSGGNPVRASTGGGAENSARFEVVGRSNGSFLISIRLVPSLRSPLP
Ga0222749_1021526223300029636SoilVLDVNGRLTPGVTIEFSNGDKLTTNATGRALFVAPLSLGVMFASIEGRPDRVKTVIFSPSEAASSSLEITSSPRLASLTDRFELLGRGFCGEADANQVTVAGQPAIVLAASPVSILVLPPMDLGPGPASVEISCATRQAPPISITLVALELEADASPLKAAEHRILTVRVRGTAAKIALEARNLSPDIAELAGGNPVRLSSSGGAENLAHFDLTGRKNGSFLISIRLVPSLAHPQ
Ga0307477_10000547163300031753Hardwood Forest SoilLAVTILSVAAFFLSVAGLAQQVPPPSGARVLLLPRRIVSGERATLAVLDTNGRLTPGVKVSFSNGDLLTTDTTGRSLFVAPLTPGMIFASIEGRPVRVPIAVLSAAEGASSSIKVSSAPHFATLGDRFELAGKGFCGDADANRVAIGAKPALVLASSPSALIVLPSGDLGPGPAVVEISCAKRSAPAFTITLVSLQLEANSSPLAPAERRVLTVHVHGTTSKVALEARDLAPDVAELTGGNPVRLSTSGGIENFVKFEISGRKRGSFLVSIRLLSPAGPPRP
Ga0307475_1014659223300031754Hardwood Forest SoilMSLRHRGYAIFLAIATVFLFAREADAQRPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPTAILLPGEAASASIEISSVPRVALLTDRFEIFGKSFCGNADANQVTIAGQPAIVLASSPAALVVLPPPDLQPGGAAVEVSCAKRQSLPFSVTFAGLELEADSSPLKPGEHRALTVRVRGTTAEILLEARNLAPEIAELSGGNPARATSSGGEKNVAKFEVTGRQNGSFLISIRLALSAGRPLPANVPQ
Ga0307473_1044913513300031820Hardwood Forest SoilNMPLRWPLVFFALLSVLAVLVASSVSAQQAPPAASAARILLLPRHIVSGERATLAVLDVNGRLTPGVGVNFSNGDHLTTDTTGRALFVAPLNPGVISAVISGRPGRVYTTILSPTDPASPSLEISFAPRIASLSDRFELSGTGFCGDADANRVTVGGQSALVLASSPTSLEVLPPSGLEPGAASVEVGCAKRHAPAFSIKFVALTLEADSSPLAPGDHRALTVRVRGTTWKVPLEARNLAPDIADLSGGNPVRVTSSGGADNVAKFHLVGRQRGSFVVSIRLLPT
Ga0307478_1005753513300031823Hardwood Forest SoilGARTLLLPRQIVSGERATLAVLDVNGRLTPGVSVKFSNGDQLTTDNTGRSLFVAPLDPGVIFASIAGVSDRTPSVVLAPSEAATSSMEISEVPSVASLTDRFELMGRGFCGDADANQVTVGGQPALVLASSPAALVILPPVDLEAGVAKVEVACAKRQAPPFSITLVELELQADSSPLKPGERRTLSVRVRGSAAKVSLEAHNLAPKVAELAGGNLVRASSSGGTENVTRFHLTGRKNGSFSISIRLVQSPIHPQ
Ga0307479_1004135233300031962Hardwood Forest SoilMSLRRRGYAIFLAIVTVFLFAREADAQRPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPAAILLPGEAASVSIEISSAPRVASLTDRFEILGKSFCGEADSNQVTIAGQPAIVLASSQAALVVLPPPDLQPGSGAVEVSCAKRQASPFSVTFVGLELEADSSPLKPGEHRALTVRVRGTTAKIPLEARNLAPEIAELSGGNPARATSSGGEENVAKFEVTGRQNGSFLISIRLAASAGHPLPANVPQ
Ga0307479_1008026023300031962Hardwood Forest SoilMSLRCRGPAIFLAIVTVCLFAREVTAQRPAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPAAILLPGEAASASIEISSAPRVASLTDPFEIFGKSFCGDADSNQVTIAGQPAIVLASSPAALVVLPPPDLQPGRATVEVSCAKRQAAPFSVTFAGLELEADSSPLKPGEHRALTVRVRGTTAKIALGARNLAPEIAELSGGNPARVSSSGGEENVAKFEVTGRRNGSFLISIRLVPSAGRPLPANVPQ
Ga0307479_1010869923300031962Hardwood Forest SoilMLSRCSWFAVVLLCAAAFLLLDGVSAQQAPPASGARTLLLPRRIVSAERATLAVLDANGRLTPGVTVNFSNGDRLTTDTTGRALFVAPLNPGVVFASIEGRPGRVPTMVLSPAEGATSSIEVSSAPRAASLSDRIELTGRGFCGDADANQVRIGGTAALVLASSPRFLVVLPPSDLEPGPAAVEVACAKRSAPIFNIAFVALQLEADSSPLAPGEHRVLTVRVRGATEKVGLEARNLAPDIAELGGGNPLRLSTSGGAENSVRFEITGRKRGSFLISIRLLSPAGPPRP
Ga0307479_1033340623300031962Hardwood Forest SoilTTIPRMHLYHRSFALLLLLSNFGAFERETNAQQPAAAPAASGARILLLPRRIVSGERATLAVLDVNGRLTPGVTVSFSNGDRFTTNATGRALFVAPLNPGVIFGSIAGRTGKVATAVLTPSEAASTAIEISSAPRVASLVDRFEILGRAFCGDADSNQVTIAGQPAIVLASSPTALVVLPPPDLQPRSAPVEVSCAKRQAPQFSITLVGLELQADASPLKPGEHRALAVRVRGTSTKIELEARNLAPEIAELTGGNPVRASSSGGTENFAKFEVVGRKNGGFLVSIRLVPYLGRPQP
Ga0307479_1061776323300031962Hardwood Forest SoilTILCAAAFLFLRAVLAQQAPPASGARVLLLPRRIVSGERATLAVLDANGRLTPGAKVHFSNGDRLTTDTTGRSLFVAPLTPGVIFASIEGRPGRVPTAVLSAEEGASPSIAVSSAPRFASLSDRIELAGHGFCGDADSNQVTIGGKAALVLASSPSSLIVLPYGDLEPGRAAVKASCAKRNAPAFTITLVVLQLDANSSPMAPAERRVLTVHVRGTTSRVELEAHNLAPDVAELTGGNPVRRSTSGGTENHAQFDITGRKRGSFLISIRLISPAGPPRP
Ga0307479_1061978213300031962Hardwood Forest SoilMSLRRRGYAIFLAIATVFLFAREADAQRPAPAASMARILLLPRRIVSGERATLAVLDVNGRLTPGVTVNFSNGDRLTTDVTGRALFVAPLNPGVIFGSIAGRSGRVPAAILLPGEAASVSIEISSAPRVASLTDRFEILGKSFCGEADSNQVTIAGQPAIVLASSPAALVVLPPPDLQPGGAAVEVSCAKRQSLPFSVTFAGLELEADSSPLKPGEHRALTVRVRGTTAKIPLETRNLAPEIAELSGGNPARAISSGGEENVAKF
Ga0311301_1017974823300032160Peatlands SoilMLSACDKLFCVKLPLNYIRHMPRRAVFVLMAFFAAMSLGTADAQRPLSASGARTLLLPRQIVSGERATLAVLDANGRLTPGVTVKFSNGDQLTTDVTGRALFVAPLDPGVIFASIAGRSERLTTVVLSAAEAATSSMEISGAPTAASLTDRFELLGRGFCGDADSNQVTIGGQPALVLASSPAALVILPPMEIEAGTAKLEVTCAKWQAPAFSIALVELELQADSSPLKPDEHRVLTVRVRGSTAKVSLEARNLAPTIAELAGGNPVRISSTGGSENFARFHLVGRKNGSFVISIRLVPLLTQPQ
Ga0307472_10073132113300032205Hardwood Forest SoilMSLRFCVRVRVVIFRYNSNMPLRWPLVFFALLSVLAVLVASSVSAQQAPPAASAARILLLPRHIVSGERATLAVLDVNGRLTPGVGVNFSNGDHLTTDTTGRALFVAPLNPGVISAVISGRPGRVYTTILSPTDPASPSLEISFAPRIASLSDRFELSGTGFCGDADANRVTVGGQSALVLASSPTSLEVLPPSGLEPGAASVEVGCAKRHAPAFSIKFVALTLEADSSPLAPGDHRALTVRVRGTTWKVPLEARNLAPDIADLSGGNPVRVTSS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.