NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F082191


Overview

Basic Information
Family ID F082191
Family Type Metagenome / Metatranscriptome
Number of Sequences 113
Average Sequence Length 187 residues
Representative Sequence MRTHSDKWISGLHLVVALLFALATAAADAQSPSNPKHFFWAPGQPNTPSPSSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGSVYTSQQLQNYVTSFLNNLGGTSWAAIQDEYCNNVPVGTTSCAAVGAGNYVTNPRKQLKGVWTDPTPVPSDIVTLGLAENLADDPLATEAI
Number of Associated Samples 89
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 70.09 %
% of genes near scaffold ends (potentially truncated) 94.69 %
% of genes from short scaffolds (< 2000 bps) 89.38 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.36
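
As an illustration of how a figure such as the short-scaffold percentage could be derived, here is a minimal Python sketch that computes the share of scaffolds shorter than a cutoff. The function name and the six example lengths (copied from scaffold rows listed on this page) are illustrative assumptions. This is not the database's own pipeline, and the subset does not reproduce the 89.38 % value, which covers all 113 scaffolds.

```python
def percent_short(lengths, cutoff=2000):
    """Return the percentage of scaffold lengths strictly below cutoff (bp)."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    short = sum(1 for n in lengths if n < cutoff)
    return round(100 * short / len(lengths), 2)


# Small subset of scaffold lengths from this page (illustrative only).
example_lengths = [510, 604, 699, 883, 2155, 4729]
print(percent_short(example_lengths))
```

The strict `<` comparison mirrors the "< 2000 bps" wording of the quality metric.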

Most Common Taxonomy
Group Bacteria (93.805 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (38.938 % of family members)
Environment Ontology (ENVO) Unclassified (36.283 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (43.363 % of family members)
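
The percentages in this overview can be turned back into approximate member counts, given the 113 sequences reported for the family. The sketch below assumes that each percentage is a simple fraction of those 113 members; the function name and labels are hypothetical.

```python
FAMILY_SIZE = 113  # "Number of Sequences" reported for family F082191


def members_from_percent(percent, family_size=FAMILY_SIZE):
    """Convert a reported percentage of family members to a whole-member count."""
    return round(percent / 100 * family_size)


# Percentages as listed in the overview (labels are shorthand, not database fields).
reported = [
    ("Bacteria", 93.805),
    ("GOLD: Vadose Zone Soil", 38.938),
    ("ENVO: Unclassified", 36.283),
    ("EMPO: Soil (non-saline)", 43.363),
]
for label, pct in reported:
    print(label, members_from_percent(pct))
```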





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 34.72%, β-sheet: 4.63%, Coil/Unstructured: 60.65%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.36

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
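
The low-confidence rule stated above (pTM < 0.7) can be sketched as a simple threshold check. The constant and function names are hypothetical; only the 0.7 cutoff and the 0.36 score come from this page.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted on this page for low-confidence models


def is_low_confidence(ptm, threshold=LOW_CONFIDENCE_PTM):
    """Return True when a model's pTM score falls below the threshold."""
    return ptm < threshold


# pTM-score reported for family F082191.
print(is_low_confidence(0.36))
```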



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 113 Family Scaffolds
PF00440 | TetR_N | 11.50
PF13502 | AsmA_2 | 1.77
PF00593 | TonB_dep_Rec | 1.77
PF00378 | ECH_1 | 1.77
PF00072 | Response_reg | 0.88
PF13545 | HTH_Crp_2 | 0.88
PF01638 | HxlR | 0.88
PF13620 | CarboxypepD_reg | 0.88
PF05690 | ThiG | 0.88
PF01523 | PmbA_TldD | 0.88
PF00082 | Peptidase_S8 | 0.88
PF04542 | Sigma70_r2 | 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 113 Family Scaffolds
COG0214 | Pyridoxal 5'-phosphate synthase subunit PdxS | Coenzyme transport and metabolism [H] | 0.88
COG0312 | Zn-dependent protease PmbA/TldA or its inactivated homolog | General function prediction only [R] | 0.88
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 0.88
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 0.88
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.88
COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 0.88
COG2022 | Thiazole synthase ThiGH, ThiG subunit (thiamin biosynthesis) | Coenzyme transport and metabolism [H] | 0.88
COG2070 | NAD(P)H-dependent flavin oxidoreductase YrpB, nitropropane dioxygenase family | General function prediction only [R] | 0.88
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 0.88



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 93.81 %
Unclassified | root | N/A | 6.19 %


Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
2189573001|GZR05M102GWA2VAll Organisms → cellular organisms → Bacteria510Open in IMG/M
3300000546|LJNas_1027944All Organisms → cellular organisms → Bacteria604Open in IMG/M
3300001137|JGI12637J13337_1013642All Organisms → cellular organisms → Bacteria699Open in IMG/M
3300001162|JGI12714J13572_1002676All Organisms → cellular organisms → Bacteria883Open in IMG/M
3300001167|JGI12673J13574_1011107All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300001182|JGI12668J13544_1003274All Organisms → cellular organisms → Bacteria895Open in IMG/M
3300001545|JGI12630J15595_10015353All Organisms → cellular organisms → Bacteria1640Open in IMG/M
3300001545|JGI12630J15595_10042245All Organisms → cellular organisms → Bacteria922Open in IMG/M
3300001867|JGI12627J18819_10379027All Organisms → cellular organisms → Bacteria574Open in IMG/M
3300002907|JGI25613J43889_10055831All Organisms → cellular organisms → Bacteria1095Open in IMG/M
3300002910|JGI25615J43890_1081830All Organisms → cellular organisms → Bacteria565Open in IMG/M
3300005167|Ga0066672_10637247All Organisms → cellular organisms → Bacteria689Open in IMG/M
3300005176|Ga0066679_10526426All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → unclassified Chryseobacterium → Chryseobacterium sp. OV715770Open in IMG/M
3300005181|Ga0066678_10778917All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → unclassified Chryseobacterium → Chryseobacterium sp. OV715634Open in IMG/M
3300005434|Ga0070709_11480577All Organisms → cellular organisms → Bacteria551Open in IMG/M
3300005435|Ga0070714_101119082All Organisms → cellular organisms → Bacteria768Open in IMG/M
3300005542|Ga0070732_10436229All Organisms → cellular organisms → Bacteria791Open in IMG/M
3300005586|Ga0066691_10409338All Organisms → cellular organisms → Bacteria807Open in IMG/M
3300005610|Ga0070763_10911540All Organisms → cellular organisms → Bacteria523Open in IMG/M
3300006059|Ga0075017_100087370All Organisms → cellular organisms → Bacteria2155Open in IMG/M
3300007258|Ga0099793_10051564All Organisms → cellular organisms → Bacteria → Acidobacteria1816Open in IMG/M
3300009137|Ga0066709_103692805All Organisms → cellular organisms → Bacteria555Open in IMG/M
3300011269|Ga0137392_10155133Not Available1851Open in IMG/M
3300011269|Ga0137392_10399179All Organisms → cellular organisms → Bacteria1141Open in IMG/M
3300011271|Ga0137393_10330538All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1300Open in IMG/M
3300012096|Ga0137389_10891597All Organisms → cellular organisms → Bacteria763Open in IMG/M
3300012198|Ga0137364_10349796All Organisms → cellular organisms → Bacteria1102Open in IMG/M
3300012198|Ga0137364_11361903All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. GP55527Open in IMG/M
3300012200|Ga0137382_10011614All Organisms → cellular organisms → Bacteria → Proteobacteria4729Open in IMG/M
3300012202|Ga0137363_10421077All Organisms → cellular organisms → Bacteria1114Open in IMG/M
3300012203|Ga0137399_11121883All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300012205|Ga0137362_10033244All Organisms → cellular organisms → Bacteria4105Open in IMG/M
3300012205|Ga0137362_10728293All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae → Oleiagrimonas → Oleiagrimonas soli852Open in IMG/M
3300012208|Ga0137376_11305899All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. GP55616Open in IMG/M
3300012211|Ga0137377_10295450All Organisms → cellular organisms → Bacteria1554Open in IMG/M
3300012361|Ga0137360_10347438All Organisms → cellular organisms → Bacteria1241Open in IMG/M
3300012582|Ga0137358_10008250All Organisms → cellular organisms → Bacteria6401Open in IMG/M
3300012582|Ga0137358_10135670All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium1674Open in IMG/M
3300012685|Ga0137397_10590009All Organisms → cellular organisms → Bacteria827Open in IMG/M
3300012918|Ga0137396_10159736All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1645Open in IMG/M
3300012922|Ga0137394_11216833All Organisms → cellular organisms → Bacteria617Open in IMG/M
3300012923|Ga0137359_10229017All Organisms → cellular organisms → Bacteria1659Open in IMG/M
3300012925|Ga0137419_10283360All Organisms → cellular organisms → Bacteria1261Open in IMG/M
3300012929|Ga0137404_10545992All Organisms → cellular organisms → Bacteria1038Open in IMG/M
3300012944|Ga0137410_10187483All Organisms → cellular organisms → Bacteria → Acidobacteria1595Open in IMG/M
3300012944|Ga0137410_10251257All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → unclassified Sinobacteraceae → Sinobacteraceae bacterium1386Open in IMG/M
3300012944|Ga0137410_10660845All Organisms → cellular organisms → Bacteria868Open in IMG/M
3300015053|Ga0137405_1230021All Organisms → cellular organisms → Bacteria1098Open in IMG/M
3300015054|Ga0137420_1108470All Organisms → cellular organisms → Bacteria1025Open in IMG/M
3300015054|Ga0137420_1134879All Organisms → cellular organisms → Bacteria → Acidobacteria1100Open in IMG/M
3300015054|Ga0137420_1206174All Organisms → cellular organisms → Bacteria1034Open in IMG/M
3300015241|Ga0137418_11257426All Organisms → cellular organisms → Bacteria518Open in IMG/M
3300015245|Ga0137409_10273202All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → unclassified Sinobacteraceae → Sinobacteraceae bacterium1494Open in IMG/M
3300015264|Ga0137403_10669355All Organisms → cellular organisms → Bacteria899Open in IMG/M
3300020170|Ga0179594_10302895All Organisms → cellular organisms → Bacteria605Open in IMG/M
3300020579|Ga0210407_10817811All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus718Open in IMG/M
3300020581|Ga0210399_10004718All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis10574Open in IMG/M
3300020581|Ga0210399_10460625All Organisms → cellular organisms → Bacteria1058Open in IMG/M
3300020581|Ga0210399_11134195All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus624Open in IMG/M
3300021088|Ga0210404_10010663All Organisms → cellular organisms → Bacteria3764Open in IMG/M
3300021168|Ga0210406_10388359All Organisms → cellular organisms → Bacteria1120Open in IMG/M
3300021171|Ga0210405_10311695All Organisms → cellular organisms → Bacteria1243Open in IMG/M
3300021178|Ga0210408_10313987All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus1250Open in IMG/M
3300021404|Ga0210389_10654656All Organisms → cellular organisms → Bacteria824Open in IMG/M
3300021420|Ga0210394_10532063All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus1034Open in IMG/M
3300021420|Ga0210394_10575935All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae990Open in IMG/M
3300021559|Ga0210409_11227063All Organisms → cellular organisms → Bacteria625Open in IMG/M
3300022529|Ga0242668_1055050All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus719Open in IMG/M
3300024178|Ga0247694_1030186All Organisms → cellular organisms → Bacteria608Open in IMG/M
3300024330|Ga0137417_1092504All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus1337Open in IMG/M
3300024330|Ga0137417_1393742All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus558Open in IMG/M
3300026308|Ga0209265_1209483All Organisms → cellular organisms → Bacteria528Open in IMG/M
3300026333|Ga0209158_1206189All Organisms → cellular organisms → Bacteria684Open in IMG/M
3300026356|Ga0257150_1058737All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus575Open in IMG/M
3300026490|Ga0257153_1073381All Organisms → cellular organisms → Bacteria690Open in IMG/M
3300026496|Ga0257157_1026377All Organisms → cellular organisms → Bacteria953Open in IMG/M
3300026548|Ga0209161_10555904All Organisms → cellular organisms → Bacteria512Open in IMG/M
3300026551|Ga0209648_10683205All Organisms → cellular organisms → Bacteria561Open in IMG/M
3300026552|Ga0209577_10386460All Organisms → cellular organisms → Bacteria1014Open in IMG/M
3300026557|Ga0179587_10145675All Organisms → cellular organisms → Bacteria1472Open in IMG/M
3300027512|Ga0209179_1028801All Organisms → cellular organisms → Bacteria1127Open in IMG/M
3300027583|Ga0209527_1094543All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus671Open in IMG/M
3300027603|Ga0209331_1131164All Organisms → cellular organisms → Bacteria602Open in IMG/M
3300027605|Ga0209329_1059195All Organisms → cellular organisms → Bacteria822Open in IMG/M
3300027643|Ga0209076_1046379All Organisms → cellular organisms → Bacteria1227Open in IMG/M
3300027643|Ga0209076_1163483All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus620Open in IMG/M
3300027674|Ga0209118_1052148All Organisms → cellular organisms → Bacteria1207Open in IMG/M
3300027681|Ga0208991_1105502All Organisms → cellular organisms → Bacteria844Open in IMG/M
3300027684|Ga0209626_1113680All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus706Open in IMG/M
3300027884|Ga0209275_10068162All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1755Open in IMG/M
3300028047|Ga0209526_10740768All Organisms → cellular organisms → Bacteria615Open in IMG/M
3300028047|Ga0209526_10801851All Organisms → cellular organisms → Bacteria583Open in IMG/M
3300030730|Ga0307482_1264667All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300030991|Ga0073994_10042029All Organisms → cellular organisms → Bacteria796Open in IMG/M
3300031718|Ga0307474_11075015All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus637Open in IMG/M
3300031754|Ga0307475_10223432All Organisms → cellular organisms → Bacteria1508Open in IMG/M
3300031754|Ga0307475_11150703All Organisms → cellular organisms → Bacteria606Open in IMG/M
3300031754|Ga0307475_11385638All Organisms → cellular organisms → Bacteria542Open in IMG/M
3300031820|Ga0307473_10874132All Organisms → cellular organisms → Bacteria647Open in IMG/M
3300031823|Ga0307478_10706848All Organisms → cellular organisms → Bacteria844Open in IMG/M
3300031962|Ga0307479_10661938All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptacidiphilus → Streptacidiphilus albus1024Open in IMG/M
3300031962|Ga0307479_11762954All Organisms → cellular organisms → Bacteria572Open in IMG/M
3300031962|Ga0307479_11995762All Organisms → cellular organisms → Bacteria529Open in IMG/M
3300032174|Ga0307470_10091421All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1713Open in IMG/M
3300032180|Ga0307471_102172290All Organisms → cellular organisms → Bacteria699Open in IMG/M
3300032180|Ga0307471_102311839All Organisms → cellular organisms → Bacteria678Open in IMG/M
3300032180|Ga0307471_102368895All Organisms → cellular organisms → Bacteria670Open in IMG/M
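
The scaffold identifiers in this table appear to join a sample Taxon OID and a scaffold name with a "|" character (the leading numbers match the Taxon OIDs in the Associated Samples table). Under that assumption, a minimal Python sketch for splitting an identifier could look like this; the function name is hypothetical and not part of any NMPFamsDB or IMG/M tooling.

```python
def split_scaffold_id(scaffold_id):
    """Split '<taxon_oid>|<scaffold_name>' into its two parts.

    Assumes a single '|' separator, as seen in the rows on this page.
    """
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    if not sep or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


print(split_scaffold_id("3300000546|LJNas_1027944"))
```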




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 38.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 16.81%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 13.27%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 12.39%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 7.08%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.54%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.77%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.77%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.89%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 0.89%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.89%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.89%
Quercus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Quercus Rhizosphere | 0.89%




Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
2189573001Grass soil microbial communities from Rothamsted Park, UK - FD2 (NaCl 300g/L 5ml)EnvironmentalOpen in IMG/M
3300000546Quercus rhizosphere microbial communities from Sierra Nevada National Park, Granada, Spain - LJN_Illumina_AssembledHost-AssociatedOpen in IMG/M
3300001137Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3EnvironmentalOpen in IMG/M
3300001162Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2EnvironmentalOpen in IMG/M
3300001167Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2EnvironmentalOpen in IMG/M
3300001182Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2EnvironmentalOpen in IMG/M
3300001545Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1EnvironmentalOpen in IMG/M
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300006059Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022529Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024178Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026308Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026356Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-AEnvironmentalOpen in IMG/M
3300026490Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-AEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027512 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 (SPAdes) Environmental
3300027583 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes) Environmental
3300027603 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes) Environmental
3300027605 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M3 (SPAdes) Environmental
3300027643 Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) Environmental
3300027674 Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) Environmental
3300027681 Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes) Environmental
3300027684 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes) Environmental
3300027884 Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes) Environmental
3300028047 Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) Environmental
3300030730 Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome) Environmental
3300030991 Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) Environmental
3300031718 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 Environmental
3300031754 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 Environmental
3300031820 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 Environmental
3300031823 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 Environmental
3300031962 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 Environmental
3300032174 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 Environmental
3300032180 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 Environmental

Geographical Distribution
[Interactive map not shown. Map data: OpenStreetMap]

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
FD2_06056100 2189573001 Grass Soil IGTACVLIAVSAWAQTNPKHFFWAPGQPNTPNPSTLTSDLIYHGGNAGTGAIGVETVPATYLIFWGPDWQNGFTTTDLNGVAYSSQQLQAYVTSFLTNLGGTSWAAINDEYCNNVAAGTTSCAAAGGGNYVTNPRNQLKGVWTDKTAVPAEIVTLGLEENLVDDPLAME
LJNas_10279442 3300000546 Quercus Rhizosphere MHRSCVAVQASVMLLVLTFASIAQAQTNPKHFFWAPGQPNTPNPSSLANDLIYHGGNAGQGAIGVENKPATYLIFWGPDWANGFTTTDARGQAFTSQQLQNYVTSFLTNLGGTSWAAIQTEYCNNVPAGSTTCTNQAAAKYVSNPFKQLKGVWTDSSAVPSDIL
JGI12637J13337_10136421 3300001137 Forest Soil MFTTTLEHLSALGLRVLALLSLSLAVITTQASGATNEKHFFWAAGQAPNPSSVSNDLIYHGGNAGQGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYVTSFLTNLGGTSWAAIQMEYCNNVPAGTTSCASASGGGYVTNPRKQLKGVWTDTTPVPSDIIALGLAENVADDPLAMEAMRASAHFNYNPQATYIILTPPSSIATGQ
JGI12714J13572_10026761 3300001162 Forest Soil VTKKTAFTCDTIPCRRENFSSFGNTFGNLPKETVISTKTLERLPALGLRVLALLSILLAVITAQASGTTDEKHFFWAPGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPXWANGFTTTDVNGKQYTSQQLQGYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANVSGGGYVTNPRKQLKGVWTDSTAVPSDIIALGLAENVADDPLAMEAIRASAHFGYD
JGI12673J13574_10111071 3300001167 Forest Soil LLSMLLAVITAQASGTTDEKHFFWAPGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQGYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANVSGGGYVTNPRKQLKGVWTDSTAVPSDIIALGLAENVADDPLAREAIRASAHFGYD
JGI12668J13544_10032741 3300001182 Forest Soil MRAHSDKWISGLHLVVALLFALATSAADAQSLSNPKHFFWAPGQPNTPSASSLTNDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTADANGSAYTSQQLQNYVTSFLNNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGDYVTNPRKQLKGVWTDPTPVPSDIVTLGLAEN
JGI12630J15595_100153531 3300001545 Forest Soil MSAKTVERLPALGLRVLALVAISLAVITQASATTNQKHFFWAPKQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQNYITSFLSNLGGTSWAAIQTEYCNNVPAGTTSCANVSTGGYVTNPRKQLKGVWTDATPVPSDIVALGLAEN
JGI12630J15595_100422451 3300001545 Forest Soil MRTHSDKWISGLHLVVALLFALATAAADAQSPSNPKHFFWAPGQPNTPSPSSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGSVYTSQQLQNYVTSFLNNLGGTSWAAIQDEYCNNVPVGTTSCAAVGAGNYVTNPRKQLKGVWTDPTPVPSDIVTLGLAENLADDPLATEAI
JGI12627J18819_103790271 3300001867 Forest Soil MRTRSDRRISGLHLVAALVLALATFAAAAQSPSNPKHFFWAPGQPNTPSPSSAANDLIYHGGNAGAGAIGVETVPATYLIFWGPDWANGFTTTDANGSVYTSKQLQTYVTSFLTNLAGTSWAAIQNEYCNNVPAGTTSCAAVGGGNYVTDPRKQLAGVWTDPTPVPSDIVTLGLAQNLVDDPI
JGI25613J43889_100558311 3300002907 Grasslands Soil MTIQKVFQRLSALGLLLLAFVCTLVLICSPAWAATNAKHIFWAPNQPNTPSPGSLANDLIYHGGNAGPGAIGVETKPATYLIFWGPAWANGFTTADANGRVYTSQQLQKYVTAFLSNLGGTSWAAIQNEYCRNVPVGTTSCADVAGADYVTNPRNQLK
JGI25615J43890_10818301 3300002910 Grasslands Soil GNFDNTFGEFPVETTMFTTTPKHLPALGLRMLAFLTISLAVIAAQASGATNEKHFFWAAGQAPNPSSVANDLIYHGGNVGSGAIGVETTPAIYLIFWGPDWANGFTTTDANGKQYTSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANANGTGYVTNPRKQFKGVWTDSTPVPSDIIALG
Ga0066672_106372471 3300005167 Soil MDLFPSLFESQSANCANSRTGKWGLSGIALLMLVSVSIPAIGQAQTNAKHFFWAPGQPNTPNPSSLANDLIYHGGNAGPGAIGVENRPATYLIFWGPDWANGFTTTDANGQVFTSQQLQTYVTSFLTNLGGTSWDAIPTEYCNNISAGNTSCANITGANYITDPRKQLKGIWTDSTAVPSDIVALGLAENLADDPIA
Ga0066679_105264261 3300005176 Soil MSLSQAPSTARPFLLGILLLTVTTITTTTFAGSSDPKHIFWAPGQAPSPASVSNDLIYHGGSTGSGAIGVEVKPATYLIFWGPDWANGFTTADANGRVYTSAQLQSYITSFLSNLGGTSWAGIQTEYCRNVPAGTTSCASVPGATYITNPRKQLKGVWTDPTAVPADIVATGLAENLADDPIAMEAMRAS
Ga0066678_107789171 3300005181 Soil AKPISGLHLIVALLFALASSAAAAQPPSNPKHFFWAPGQTNAPSPSALANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTSDANGSVYTSQQLQNYVTSFLGNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGNYVTNPRKQLKGVWTDPAPVPSDIVTLGLAQNLVDDPLATEAIRASMHFGYDPRATYIILTPPTTIGTGQ
Ga0070709_114805771 3300005434 Corn, Switchgrass And Miscanthus Rhizosphere RLRGLIALGFVLVALTATAQTNPKHFFWAPNQPNTPGPSALTNDLIYHGGNAGTGAIGVETVPATYLIFWGPDWANGFTTTDAHGTVYTSQQLQTYVTSFLSNLGGTSWAAIQNEYCNNVPAGTTSCAAVGGGNYVSNPRKQLKGVWTDASAVPADIVALGLEENLVDDPLAMEAVRASAHFS
Ga0070714_1011190822 3300005435 Agricultural Soil MHRSCIPVQALLMLLVLTFAAVAQSQTNPKHFFWAPGQPNTPNPSSLQNDLIYHGGNAGQGAIGVENKPAIYLIFWGPDWANGFTTTDARGQSFTSQQLQNYVTSFLTNLGGTSWAAIQTEYCNNIPAGSTTCLNQAGAKYVTNPFKQLKGVWTDATAVPSDIVALGLAENLADDPIAQE
Ga0070730_100611453 3300005537 Surface Soil MKDRISKSQPAVRGRVIVALAFVYFAFSAAAAVQTNPKHFFWAPGQPNTPSPSSLSNDLIYHGGNAGSGAIGVETKPATYLIFWGPDWQSGFTTTDASGVSYTSQQLQNYVTSFLTNLGGTSWAAIQNEYCRNVPAGTTSCANIAGADYVTNPRKQLKGVWTDPTPVPADIVTLG
Ga0070732_104362291 3300005542 Surface Soil MCLSSFLCSRLDLSHGNLPTAKTPGKMIRHILSSLALLAVAYTLATATATAQSNPKHFFWASNQPSTPNPSSLANDLIYHGGNAGQGAIGVENKPATYLIFWGPDWINRFTTTDANGQVFSSQQLQNYVISFLANLGGTSWAAIPTEYCNDIPAGDTSCANVAGARYVTDPRKQLKGVWADGTPVPSNIIALGLAENLQDDPIAQEAIRAAAHFNYDPQATYIILTPPSTIA
Ga0066691_104093381 3300005586 Soil MRAHSDRRISGWHVIAALAFSLAASAAGAQAPTNPKHFFWAPGQPNTPSPSSLANDLIYHGGNTGAGAIGVETVPATYLIFWGPDWANGFTTTDANGSAYTSQQLQTYVTSFLTNLAGTSWAAIQDEYCNNVPAGTTSCAAVGGGNYVTDPRKQLKGVWTDPTPVPSDIVTSGLAQNLVDDPIATEAIRASAHFGYD
Ga0070763_109115401 3300005610 Soil LALLAISLAVIVARSSGATNDKHIFWAAGQSPNPNSALNDLVYHGGNAGPGAIGVETSPAIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYVTSFLSNLGGTSWAAIQTEYCNNVPAGTTSCASVGGGGGGYVTNPRKQLKGVWIDTTPVPSDIIALGLAENVADDPLAMEAI
Ga0075017_1000873702 3300006059 Watersheds MYLSLSRPDQSHRNTKDVNLTGKILKPLVGLRALVALACVLAAISATAQTNPKHFFWAPGQPNTPSGASLANDLIYHGGNAGPGAIGVETTPATYLIFWGPDWANGFTTADAKGSVYTSQQLQSYVTSFLTNLGGTSWAAIQDEYCNNVSAGTTSCAAVGGGNYVANPRKQLKGVWTDTTPVPTDIITLG
Ga0099793_100515641 3300007258 Vadose Zone Soil MSTTTLKHLPALGLRVLAFLTISMAVIAAQTSGATNEKHFFWAAGQAPNPSSVANDLIYHGGNVGSGAIGVETTPAIYLIFWGPDWANGFTTTDVNGKQYSSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCASANGTGYVTNPRKQLKGVWTDPTPVPSGI
Ga0099794_100208053 3300007265 Vadose Zone Soil MTNPNRLPAQGLRVIAFLTISLAVIASQVSGATNDKHFFWAPGQAPTPSSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYVTSFLTNLGGTSWAAIQMEYCNVAPAGTTSCANANGSGYVTNPRKQLKGVWTDSTPV
Ga0066709_1036928051 3300009137 Grasslands Soil TLEGLPGLGLRVLALVAISLAVITTQASATTNQKHFFWAPRQAPNPSSVSNDLIYHGGHAGPGAIGVETTPGTYLIFWGHDWANGFTTTDGNGKQYTSQQLQNYVTSFLSNLGGTSWASIQTEYCNNVPAGTTSCANVSAGRYVTNPRKQLKGVWTDATPVPSDIVALGLAENVADDPLAVEEI
Ga0137392_101551332 3300011269 Vadose Zone Soil MPTKTLERLPALGLRVLALLSILLAVITAQASGTTSEKHFFWAPGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQAYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANVSGGGYVTNPRKQLKGVWTDSTAVPSDIIASGLAENVADDPLATEAIRASAHFRYDPQATYIILTPPT
Ga0137392_103991791 3300011269 Vadose Zone Soil MLSVLMIASVAQAQTNPKHFFWAPGQPNTPNPSSLANDLIYHGGNAGQGAIGVENKPATYLIFWGPDWANGFTTTDAKGQVFTSQQLQNYVTSFLTNLGGTSWAAIQTEYCNNVPVGTTSCASASGATYITNPRKQLKGVWTDSTAVPSDIVASGLAQNLADDPIAQEAIRASAHFNYDPQAT
Ga0137393_103305381 3300011271 Vadose Zone Soil MSMTNPNRLPALGLRVLALLTISLAVIASQVSGATNDKHFFWAPGQAPTPSSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDANGSQYTSQQLQTYVTSFLTNLGGTSWAAIQMEYCNVAPAGTTSCANANGSGYVTNPRKQLKGVWTDSTPVPSDIIALGLAENVANDPLAMEAIRASGHFGYNPQATYIILTPPT
Ga0137389_108915971 3300012096 Vadose Zone Soil MTMTSRNLERLPALSLRMLALLTISLALITTQASGTTPEKHFFWAPGQAPNASSVANDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDVNGRQYTSQQLQTYVTSFLTNLAGTPWAALQTEYCNNVPAGTTSCAAVSGGGFVTNPRKQLKGVWTDASPVPS
Ga0137364_103497961 3300012198 Vadose Zone Soil VLALVAISLAVITTQASATTNQKHFFWAPGQAPNPSAVSNDLIYHGGHAGPGAIGVETTPGTYLIFWGPDWANGFTTTDVNGKQYTSKQLQNYVTSFLSNLGGTSWASIQTEYCNNVPAGTTSCANVSAGRYVTNPRKQLKGVWTDATPVPSDIVALGLAENVADDPLAVEAIRASTHFKYDPQATYIILTPPTSIATG
Ga0137364_113619031 3300012198 Vadose Zone Soil AADAQSPSSPKHFLWAPGQPNTPSASSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTVTDANGSPYTSQQLQSYVTSFLSNLGGTSWAAIQDEYCNNVPAGTTSCAAAGGGNYVTDPRKQLKGVWTDTTPVPSDIVTLGLAENLADDPLATEAIRASAHFGYNPQA
Ga0137382_100116145 3300012200 Vadose Zone Soil MRTHSDKWISGFHLVAALLVAVATSAADAQSPSSPKHFFWAPGQPNTPSASSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTVTDANGSPYTSQQLQSYVTSFLSNLGGTSWAANQDEYCNNVPAGTTSCAAAGGGNYVTDPRKQLKGVWTDTTPVPSDIVTLGLAE
Ga0137363_104210771 3300012202 Vadose Zone Soil MFTTTFKHLPALGLRVLAFLTISLAVIAAQTSGATNEKHFFWAAGQAPNPSSVSNDLIYHGGNVGSGAIGVETTPAIYLIFWGPDWANGFTTTDANGKQYSSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCASANGTGYVTNPRKQLKGVWTDPTPVPSDIIALGLAENVADDPLAMEAIRASAH
Ga0137399_111218831 3300012203 Vadose Zone Soil MTMFTTTLKHLPAPGLRVLAFLTISLAVIAAQASGATNEKHFFWAAGQAPNPSSVANDLIYHGGNVGSGAIGVETTPAIYLIFWGPDWANGFTTTDANGKQYTSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCASANGTGYVTNPRKQLKGVWTDPTPVPS
Ga0137362_100332441 3300012205 Vadose Zone Soil MTNQNVFPHKSGFGLQVLASVCALALITAPAFGGTNPKHFFWAPNQPNTPSSNSLANDLIYHGGNAGSGAIGVETKPATYLIFWGPAWSGGFTTTDANGSVYTSQQLQNYITSFLTNLGGTSWAAIQNEYCRNVPAGTTSCADIAG
Ga0137362_107282931 3300012205 Vadose Zone Soil MIPRIAQAQTNPKHFFWAPGQPQTPNPSSLANDLIYHGGNAGQGAIGVENKPATYLIFWGPDWANGFTTTDVNGQVFTSQQLQNYVTSFLTNLGGTSWAAIQTEYCNNVSAGTTSCASVASANYITNPRKQLKGVWTDSTAVPSDIVASGLAQNLADDPIAQE
Ga0137376_113058991 3300012208 Vadose Zone Soil MPLTSGSRTTTVLRTNNTNAGPREDREVTMRTHSDKWISGFHLVAALLVAVATSAADAQSPSSPKHFFWAPGQPNTPSASSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTVTDANGSPYTSQQLQSYVTSFLSNLGGTSWAAIQDEYCNNVPAGTTSCAAAGGGNYVTDPPKQLKGGWTD
Ga0137377_102954502 3300012211 Vadose Zone Soil MLVLLTIPIIGHAQTNAKHFFWAPGQPNTPNPSSLTNDLIYHGGNAGPGAIGVEDRPTTYLIFWGPDWANGFTTSDASGQVFSSQQLQAYVTSFLTNLGGTSWDAIPTEYCNNIAAGNTSCANITGANYVTDPRKQLKGVWTDPTPVPSDIVALGLAE
Ga0137360_103474381 3300012361 Vadose Zone Soil MTNQNVFPHKSGFGLQVLASVCALALITAPAFGGTNPKHFFWAPNQPNTPSSNSLANDLIYHGGNAGSGAIGVETKPATYLIFWGPAWSGGFTTTDANGSVYTSQQLQNYITSFLTNLGGTSWAAIQNEYCRNVPAGTTSCADIAGADYVTNPRKQLKGVWT
Ga0137358_100082501 3300012582 Vadose Zone Soil MTNQNVFPHKSGFGLQVLASVCALALITAPAFGGTNPKHFFWAPNQPNTPSSNSLANDLIYHGGNAGSGAIGVETKPATYLIFWGPAWSGGFTTTDANGSVYTSQQLQNYITSFLTNLGGTSWAAIQNEYCRNVPAGTTSCADIAGADYVTNPRKQLKGVWTDTTPVPNDIVTLGLAENLADDPLATEAMRASAHFGYDPQATYIILTPPT
Ga0137358_101356703 3300012582 Vadose Zone Soil MRTHSDKWISGLHLVVALLFALVTSAAAAQSPSNPKHFFWAPGQPNTPSASALANDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGSVYTSQQLQNYVTSFLSNLGGTSWAAIQNEYCNNVPVGTTSCAAVGGGNYVTNPRKQLKGVWTDPTPVPSDIVTLGLAENLVDDPLATEAIRASAHFGYDPQATYIIL
Ga0137398_106614941 3300012683 Vadose Zone Soil MRTHSDKRISNLHLVVALLFALATAAADAQSPSNPKHYFWAPSQPNTPGPSSLANDLIYHGGNAGPGAIGVETVPAIYLIFWGPDWTNGFTTTDANGSAYTSQQLRNYVTSFLSNLGGTSWAAIQNEYCDNVPVGTTSCAAVGGGNYITNPRKQ
Ga0137397_105900091 3300012685 Vadose Zone Soil MTEQKVLPRLSAFGLQILAFACVLSLAAGPALGATNPKHFFWAPNQPNTPSPNSLANDLIYHGGNAGLGAIGVETKPATYLIFWGPAWSSGFTTTDVNGSVCTSQQLKNYVTSFLTNLGGTSWAAIQNEYCRNVPAGTTSCADVAGADYVTNPRKQLKGVWTDTTPVPNDIVTLGLAEHL
Ga0137396_101597362 3300012918 Vadose Zone Soil MTEKVSRRLLALGLQVLAFVCALTLITGTASGATNPKHFFWAPNQPNTPSPGSLVSDLIYHGGNAWPGAIGVETKPATYLIFWGPAWAKGFTTTDANGRLYTSQQLQNYITSFLTSLGGTSWAAIQSEYCRNVPAGTTTCADVADADYVTNPRTQLKGVWTDTTP
Ga0137396_110410761 3300012918 Vadose Zone Soil MFTTTPKHLPALGLRVLAFLTISLAVIAAQTSGATNEKHFFWAAGQAPNPSSVANDLIYHGGNVGSGAIGVETTPAIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANANGTGYVTNPRKQFKGVWTD
Ga0137394_112168331 3300012922 Vadose Zone Soil MTNQNVFPHKSGFGLQVLASVCALALITAPAFGGTNPKHFFWAQNQPNTPSSNSLANDLIYHGGNAGSGAIGVETKPATYLIFWGPAWSGGFTTTDANGSVYTSQELQNYITSFLTNLGGTSWAAIQNEYCRNVPAGTTSCADIAGADYVTNPRKQLKGVWTDTTPVP
Ga0137359_102290172 3300012923 Vadose Zone Soil MTNQNVFPHKSGFGLQVLASVCALALITAPAFGGTNPKHFFWAPNQPNTPSSNSLANDLIYHGGNAGSGAIGVETKPATYLIFWGPAWSGGFTTTDANGSVYTSQQLQNYITSFLTNLGGTSWAAIQNEYCRHVPAGTTSCADIAGADYVTNPRKQLKGVWTDTTPVPNDIVTLGLAENLADDPLATEAMRASAHFGYDPQATYIILTPPTTIGTGQPVYCGY
Ga0137419_102833603 3300012925 Vadose Zone Soil MTIQKVFQRLSALGLLLLAFVCTLVLICSPAWAATNAKHIFWAPNQPNTPSPGSLANDLIYHGGNAGPGAIGVETKPATYLIFWGPAWANGFTTADANGRVYTSQQLQKYVTAFLSNLGGTSWAAIQNEYCRNVPVGTTSCADVAGADYVTNPRNQLKGVWTD
Ga0137404_105459921 3300012929 Vadose Zone Soil MRTHSDRWISGLHLVVALLFALATSAADAQSPTNPKHFFWAPGQPNTPSPGALTNDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGNYVTNPRRQLKGVWTDATPVPSHIVTLGLAENVADDPLATEAIRASAHFGYNPQAT
Ga0137410_101874831 3300012944 Vadose Zone Soil MTIQKVFQRLSALGLLLLAFVCTLVLICSPAWAATNAKHIFWAPNQPNTPSPGSLANDLIYHGGNAGPGAIGVVTKPATYLIFWGPAWANGFTTADANGRVYTSQQLQKYVTAFLSNLGGTSWAAIQNEYCRNVPVGTTSCADVAGADYVTNP
Ga0137410_102512571 3300012944 Vadose Zone Soil MRTHSDKRISSWHLIVSLLFALATAAADAQSPSNPKHFFWAPGQPKTPGPSSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGSEYTSQQLQSYVTSFLSNLGGTSWAAIQDEYCDNVPVGTTSCAAVGGGNFVTNPRKQLKGVWSDPTPVPSDIVTLGLAENLANDPLAAEAMRASVHF
Ga0137410_106608451 3300012944 Vadose Zone Soil MRAHSDGWISGWPVVIALLLVLAAPAPAAQSLSNPKHFFWAPGQPNTPSASALANDLIYHGGNVGSGAIGVETVPATYLIFWGPDWANGFTTPDANGSTYTSQQLQNYVSSFLGNLGGTSWAAIQDEYCNNVPAGTTSCAVTGGGSYVTNPRKQLKGVWTDPTPVPSDIVTLGLAENL
Ga0137405_12300211 3300015053 Vadose Zone Soil MRTHSDRWISGLHLVVALLFALATSAADAQSPTNPKHFFWAPGQPNTPSPGALTNDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGNYVTNPRRQLKGVWTDATPVPSDIVTLGLAENVADDPLATEAIRASAHFGYNPQATDVHHPYAAHDYRHRAARLLRLSHADYQRRRLR
Ga0137420_11084701 3300015054 Vadose Zone Soil MDYTLLVRARTAGWFAFTFSKVFAMAALVAVFTFALAPISASAGNTNPKHFFWAPGQPNTPSPGALASDIIYHGGNAGAGAIGVETKPATYLIFWGPDWANGFTTTDANGRVFNSQQLQSYMTSFLSNLGGSSWAAIQNEYCRNVDAGTTNCADVSGADFITNPRNQLKGVWTDSTPVPADIVVTLGLAENLVDDPLAAEAMRASA
Ga0137420_11348792 3300015054 Vadose Zone Soil MSNAIFKGLPASGLRVLAFLAISLAVLASHASGATNDKHFFWAAGQAPTPSSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDANGRQFTSQQLQTYIASFLTNLGGTSWAAIQMEYCNVAPAGTTSCANANGSGYVTNPRKQLKGVWTDSTAVPSDIIAL
Ga0137420_12061742 3300015054 Vadose Zone Soil MDYTLLVRARTAGWFAFTFSKVFAMAALVAVFTFALAPISASAGNTNPKHFFWAPGQPNTPSPGALASDIIYHGGNAGAGAIGVETKPATYLIFWGPDWANGFTTTDANGRVFNSQQLQSYMTSFLSNLGGSSWAAIQNEYCRNVDAGTTNCADVSGADFITNPRNQLKGVWTDSTPVPADIVTLGLAENLVDDPLAAEAMRASAH
Ga0137418_112574261 3300015241 Vadose Zone Soil AQSPGNPKHFFWAPGQPNTPGPSSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTADANGSVYTSQQLQNYMTSFLSNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGHYVTNPRKQLKGVWADPTPVPSDIVTLGLAENLANDPLAAEAIRASAHFGYNPKA
Ga0137409_102732021 3300015245 Vadose Zone Soil MRTHSDKRISSWHLIVSLLFALATAAADAQSPSNPKHFFWAPGQPKTPGPSSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGSVYTSQQLQNYVTSFLSNLGGTSCTAIQNEYCNNVPVGTTSCAAVSGGNYVTNPRKQLKGVWTDPTPVPSDIVTLGLAENLANDPLATEAIRASAHFGYDPQATYIILTPPTTIGTGQPVYC
Ga0137403_106693551 3300015264 Vadose Zone Soil MRTHSDKWISGLHLVVALLFALASSAAAAQSPSNPKHFFWAPGQPNTPSASSLANDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGSAYTSQQLQSYVTSFLSNLGGTSWAAIQDEYCNNVAAGTTSCAAVGGGNYVTNPRKQLKGVWTD
Ga0179594_103028951 3300020170 Vadose Zone Soil MRTHSDRWISGLYLIALLFALATSAADAQPPTNPKHFFWAPGQPNTPSPGSLTNDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQNEYCNNVPLGTTSCAAVGGGNYVSNPRKQLKGVWTDPTPVPSDIVTL
Ga0210407_108178111 3300020579 Soil VTKKTAFTSDTIPCRRETLSSFGNTFGNLPEETAMSTKTLERLPALGLRVLALLSILLAVITAQASGTTDEKHFFWAPGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQAYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANVSGGGYVTNPRKQLKGVWT
Ga0210399_100047181 3300020581 Soil MSMTVFKSLPDLGRRALALLTISLAVIASQASGATNDKHFFWAAGQTPSPSSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDANGRQYTSQQLQNYVTSFLTNLGGTSWAAIQMEYCNSAPAGTTSCAAANGSGYVTNPRKQLKGVWTDPTPVPSDIIALGLAENVANDPLAAEAIRASGHFGYNPQATYIILTPPTSIA
Ga0210399_104606251 3300020581 Soil MFSTTLGRLPALGLRVIALLYISLAVITTQASGATNQKHFFWAAGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDANGKQYTSQQLQTYVTSFLTNLGGTSWAAIQMEYCNNVPAGTTSCASVGGGGYVTNPRKQLRGVWTDTTPVPSDIIALGLAENVADDPLAMEAIRASAHFNYNPQATYI
Ga0210399_111341951 3300020581 Soil HETFSSFGNTFGNLPKETVMSTKTLERLPALGLRVLALLSILLAVITAQASGTTDEKHFFWAPGQAPSPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQAYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTNCANVSGGGYVTNPRKQLKGVWTDSTAVPSDIIALGLAENVADDPLAMEVI
Ga0210404_100106634 3300021088 Soil MSTKTIERLPALGIRVLALVAISLAVITQAFATTNQKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDASGKQYTSQQLQNYVTSFLGNLGGTSWAAIQTEYCNNVPAGTTSCASVSGGGYVTDPRKQLKGVWTDTTAVPSDIVALGLAENVADDPLAMEAIRAS
Ga0210406_103883592 3300021168 Soil MSSTTLGRLPALGLRVLAFLTILLAVIASQASGATNQKHFFWAAGQAPNPSSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYITSFLTNLGGTSWAAIQMEYCNIAPAGTTSCASVNGSGYVTNPRKQLKGVWTDTTAVPSDIIALGLAENVADDPLAAEAIRASAHFGYN
Ga0210405_103116952 3300021171 Soil MRVPFPQRSLIERSLQSLPRLCLRVLVVMVFSLAAITAVAQSAPTNPKHFFWAPGQPNTPSGASLANDLIYHGGNAGAGAIGVETTPATYLIFWGPDWSSGFTTADVNGSTYTSQELQNYVTSFLTNLGGTSWAAIQNEYCNNVPAGTTSCAAVGGGNYVTNPRKQLKGVWTDPTPVPADIITLGLAEHLADDPLALEA
Ga0210408_103139871 3300021178 Soil VTKKTAFTSDTIPCRRETFSSFGNTFGNLPEETAMSTKTLERLPALGLRVLALLSILLAVITAQASGTTDEKHFFWAPGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQAYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANVSGGGYVTNPRKQLKGVWTDSTAVPSDIIALGLAEN
Ga0210389_106546561 3300021404 Soil MRVPFPQRSLIERSLQSLPRLCLRVLVVMVFSLAAITAVAQSAPTNPKHFFWAPGQPNTPSGASLANDLIYHGGNAGAGAIGVETTPATYLIFWGPDWSSGFTTADVNGSTYTSQELQNYVTSFLTNLGGTSWAAIQNEYCNNVPAGTTSCAAVGGGNYVTNPRKQLKGVWTDPTPVPADIVTLGLAENLADDPLATEAIRASAHFNYDPQATYIILTPPTTIGTGQPVY
Ga0210394_105320631 3300021420 Soil VTKKTAFTSDTIPCRRETFSSFGNTFGNLPEETAMSTKTLERLPALGLRVLALLSILLAVITAQASGTTDEKHFFWAPGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYATSFLSNLGGTSWAAIQTEYCNNVPAGTTSCASVGGGGYVTNPRKQLKGVWTD
Ga0210394_105759352 3300021420 Soil MPTTAFERPLSLGLRVLALLSLSLAAVVTQASGQTSDKHFFWAPGQTPNANSVSNDLIYHGGNAGAGAIGVETKPGVYLIFWGPDWANGFTTTDAKGVQFTSQQLQSYVTSFFTNLGGTPWAAIQTEYCNNVPAGTTSCASVSGGGYVTDPRKQLKGVWTDATPVPSDIIALGLAENVADDPLAT
Ga0210409_112270631 3300021559 Soil MRTHSYRWISGLHLVVALLFALATSAADAQAPSNPKHFFWAPGQPNTPGPSSLTNDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGSAYASQQLQNYVTSFLNNLGGTSWAAIQDEYCNNVAVGTTSCAAVDGGNYVTNPRKQLKGVWTDPTPVPSDIVTLGLAENLVDDPLA
Ga0242668_10550501 3300022529 Soil MPSRNLERLSALSLRMLALLTISLAVITTQASGTTPEKHFFWAPGQAPNASSVANDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDVNGRQYTSQQLQTYVTSFLANLGGTPWAAIQTEYCNNVPAGTTSCAAVSGGGFVTNPRKQLKGVWTDASPVPSDIVALGL
Ga0247694_10301861 3300024178 Soil LRALAFVTFSLAAIASQASGATNDKHFFWAAGQAPTPSSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDANGRQFTSQQLQTYITSFLSNLGGTSWAAIQMEYCNVAPAGTTSCANANGSGYITNPRKQLKGVWTDSTPVPSDIIALGLAENVANDPLAAEAIRASGHFGYNSQATYIILTPPTSIATGQ
Ga0137417_10925041 3300024330 Vadose Zone Soil MSTKTIERLPALGIRVLALVAISLAVVTQAFATTNQKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDASGKQYTSQQLQNYVTSFLSNLGGTSWGAIQTEYCNNVPAGTTSCTSVSGGGYVTNPRKQLKGVWTDTTPVPSDIVALGLAE
Ga0137417_13937421 3300024330 Vadose Zone Soil LKETAMSTKTIERLPALGIRVLALVAISLAVITQALATTNQKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDASGKQYTSQQLQNYVNSFLSNLGGTSWGAIQTEYCNNVPAGTTSCTSVSGGGYVTNPRKQLKGVWTDTTPVPSDIVALGLAENVADDP
Ga0209265_12094831 3300026308 Soil CATLKTLTMAALVAVFIFATATLPLAAQSKTNPKHFFWAKGQPNTPNPNSLANDLIYHGGNAGSSAIGIEKKPATYLIFWGPDWANGFTTTDNNGVVFTSQQLQSYITSFLSNLGGTSWAGIQTEFCRNVAAGTTNCASVSGADFITNPRNQLKGVWTDSTPVPDDIVTLGLAENL
Ga0209158_12061891 3300026333 Soil MRTHSDRWISGLHLIVALLFALATSAADAQSPTNPKHFFWAPGQPNTPSPGSLTNDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQNEYCNNVPLGTTSCAAVGGGNYVSNPRRQLKGVWTDPTPVPSDIVTLGLAENLVDDPLAMEAIRAAAHFGYNPQATYII
Ga0257150_10587371 3300026356 Soil SDTIARRCETLSSFGNAFGDFLKETAMSTKTIERLPALGIRVLALVAISLAVVTQAFATTNQKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDASGKQYTSQQLQNYVTSFLSNLGGTSWGAIQTEYCNNVPAGTTSCASVSGGGYVTNPRKQLKGVWTDTAPVP
Ga0257153_10733812 3300026490 Soil MFTTTLKRLPGLGLRVLALVAVSLAVIAIQASGATNQKHFFWAAGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPAIYLIFWGPDWANGFTTTDVNGKQYSSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANANGSGYVTNPRKQLKGVWTDPTAVPSDIIALGLA
Ga0257157_10263772 3300026496 Soil MSTKTIERLPALGIRVLALVAISLAVVTQALATTNQKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDASGKQYTSQQLQNYVTSFLSNLGGTSWGAIQTEYCNNVPAGTTSCASVSGGGYVTNPRKQLKGVWTDTTPVPSDIVALGLAENVADDPLAMEAIRA
Ga0209161_105559041 3300026548 Soil MRTHSDRWISGLHLIVALLFASATSAADAQSPTNPKHFFWAPGQPNTPSPGSLTNDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQNEYCNNVPLGTTSCAAVGGGNYVSNPRRQLKGVWT
Ga0209648_106832051 3300026551 Grasslands Soil MRTHSDRWISGLHLVVALLFALATSAADAQSPTNPKHFFWAPGQPNTPSPSSLTNDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGNYVTNPRTAGRPHGYVVAVLVLNGGPG
Ga0209577_103864601 3300026552 Soil LLLAAITAITTTTFAGSTDPKHIFWAPGQAPSPASVSNDLIYHGGSTGSGAIGVEVKPATYLIFWGPDWANGFTTTDANGRVYTSAQLQSYITSFLSNLGATSWAGIQTEYCRNVPAGTTSCASVPGATYITNPRKQLKGVWTDSTAVPADIVATGLAENLADDPIAMEAMRASAHFNYDPQATYIKPVAWMVSEIPTACNMRSFPSSTPVGRFWAIRAAE
Ga0179587_101456751 3300026557 Vadose Zone Soil MRTHSDRWISGLHLVVALLFALATSAADAQSPTNPKHFFWAPGQPNTPSPSSLTNDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGNYVTNPRRQLKGVWTDPTPVPSDIVTLGLAENLVDDPIATEA
Ga0179587_104387172 3300026557 Vadose Zone Soil MSTTTLKHLPALGLRVLAFLTISLAVIAAQTSGATNEKHFFWAAGQAPNPSSVANDLIYHGGNVGSGAIGVETTPAIYLIFWGPDWANGFTTTDANGKQYTSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANANGTGYVTNPRKQFKGVWTDSTPV
Ga0209179_10288011 3300027512 Vadose Zone Soil MRTHSDRWISGLHLVVALLFALATSAADAQSSTNPKHFFWAPGQPNTPSPGSLTNDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQNEYCNNVPVGTTSCAAVGGGNYVTNPRKQLKGVWTDATPVPSDIVTLGLAENVVDDPL
Ga0209527_10945431 3300027583 Forest Soil LVGMLLPYLAVIPSQPTQACARRQAPFSFDNNFGEFPTEMAMFTTIQRPLLALGLRVLALLTISLAVIATQASGATTQKHFFWAAGQSPNPSSVSNDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTITDVNGKQYTSQELQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCATLSGGGYVTNPRKQLKGVWTDTTPVPSDIVALGLAENVADDP
Ga0209331_11311641 3300027603 Forest Soil MRTHSDKWISGLHLVVALLFALATAAADAQSPSNPKHFFWAPGQPNTPSPSSLANDLIYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGNVYTSQQLQNYVTSFLNNLGGTSWAAIQDEYCNNVPVGTTSCAVVGGGNYVTNPRKQLKGVWTDPTPVPSNIVT
Ga0209329_10591951 3300027605 Forest Soil MFKTTIGYPPALGLRVLALLTISLALITTQASGATNQKHFFWAAGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQNYITSFLSNLGGTSWAAIQTEYCNNVPAGTTSCANVSGGGYVTNPRKQLKGVWTDSTAVPSDIIALGLAENVADDPLAMEAIRASAHFNYNPQATYIILTPPTSIATGQP
Ga0209076_10463791 3300027643 Vadose Zone Soil MKISRKVLPRLSAFGLQIIAFVCVLALAAAPALGVTNPKHFFWAPNQPNTPSPNSLTNDLIYHGGNAGSGAIGVETKPATYLIFWGPAWSSGFTTTDVNGSVYTSRQLQNYVTSFLTNLGGTSWAAIQTEYCKNVPAGTTACADVAGANYVSNPRKQLKGVWTDTTAVPDDIVALGLVENLANDPLAMEAMRASAHFNYDPQATYIILTPPTTIATGQPVY
Ga0209076_11634831 3300027643 Vadose Zone Soil IRVLALVAISLAVITQAFATTNQKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDASGKQYTSQQLQNYVNSFLSNLGGTSWGAIQTEYCNNVPAGTTSCASVSGGGYVTNPRKQLKGVWTDTTPVPSDIVALGLAENVADDPLAVEAIRASAHFKYDPQATYIILTPPTTIATGQPVYCG
Ga0209118_10521481 3300027674 Forest Soil MFTTTLEHLSALGLRVLALLSLSLAVITTQASGATNEKHFFWAAGQAPNPSSVSNDLIYHGGNAGQGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYVTSFLTNLGGTSWAAIQMEYCNNVPAGTTSCASASGGGYVTNPRKQLKGVWTDTTPVPSDIIALGLAENVADDPLAMEAMRASAHFNYNPQ
Ga0208991_11055021 3300027681 Forest Soil MRTHSDRWISGLHLVVALLFALATSAADAQSPTNPKHFFWAPGQPNTPSPGSLTNDLIYHGGNAGSGAIGVETVPAIYLIFWGPDWANGFTTTDANGGAYTSQVLQSYVTSFLSNLGGTSWAAIQDEYCNNVPVGTTSCAAVGGGNYVTNPRRQLKGVWTDATPV
Ga0209626_11136801 3300027684 Forest Soil GSTVTKKTAFTSDTIPCRRENSSFGNTFGNLPKETVISTKSTKTLERLPALGLRVLALLSMLLAVITAQASGTTDEKHFFWAPGQAPNPSSVANDLIYHGGNAGAGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLQGYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANVSGGGYVTNPRKQLKGVWTDSTAVPSDIIALGLAENVADDPLAMEAIRASAHFR
Ga0209275_100681622 3300027884 Soil MLKTIPNHLSALDLRVLALLAISLAVIVARSSGATNDKHIFWAAGQSPNPNSALNDLVYHGGNAGPGAIGVETSPAIYLIFWGPDWANGFTTTDVNGKQYTSQQLQTYVTSFLSNLGGTSWAAIQTEYCNNVPAGTTSCASVGGGGYVTNPRKQLKGVWTDTTPVPSDIIALGLAENVADDPLAMEAIRASAHFNYNPQATYIILTPPTSI
Ga0209526_107407681 3300028047 Forest Soil VHAQTNAKHFFWAPEQPNTPNPSSLANDLIYHGGNAGPGAIGVENRPATYLIFWGPDWANGFTTSDAHGQVFSSQQLQAYVTSFLTNLGGTSWDAIPTEYCNDIAPGNTSCANIAGANYVTDPRRQLKGVWTDPAAVPSDIVALGLAENLADDPIAQEAVRASAHFTYDPQATYIILTPPTTIATGQPVYCGYHSQTSSVDGVG
Ga0209526_108018511 3300028047 Forest Soil MTKQNVSSRGAVFGLQVLGFVWVLALAAAPALGATNPKHFFWAPGQPNTPNPNSLANDLIYHGGNAGSGAIGVETKPATYLIFWGPAWSSGFTTTDAKGSVYTSQQLQNYITSFLTNLGGTSWAAIQNEYCRNVPAGTTNCADVAGADYVTNPRRQLKGVWTDTTAVPDDIVTLGLAENLVDDPLAI
Ga0307482_12646671 3300030730 Hardwood Forest Soil NRALNNKQLTSEPRAGVSMHTHAVMRIRRVQLLAAWLLVLVTGSAAAQSPLNPKHFFWAPGQPATPNPSSLANDLVYHGGNAGAGAIGVETVPATYLIFWGPDWGNGFTTTDVNGAAYTSQLLQTYVTSFLTNLAGTSWAAIQDEYCNHVPAGTISCATAGGGNYVTDPRKQLKGVWTDP
Ga0073994_100420291 3300030991 Soil MRTHSNKWISGLHLVVALLFALATSAADAQSPSNPKHFFWAPGQPNTPSASSLANDLVYHGGNAGSGAIGVETVPATYLIFWGPDWANGFTTTDANGSVYTSQQLQNYVTSFLSNLGGTSWAAIQNEYCNSVPVGTTSCAVVGGGNYVTNPRKQLKGVWTDPTAVPSNIVTLGLA
Ga0073994_124100541 3300030991 Soil VRQKTAFSSDTIARRCETLSSFGNTLGDFLKETAMSTKTIERLPALGIRVLALVAISLAVITQAFATTNQKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDASGKQYTSQQLQSYVTSFLSNLGGTSWGAIQTEYCNNVPAGTTSCASV
Ga0307474_110750151 3300031718 Hardwood Forest Soil MFITTLKHLPALGLRVFVLLTISLAVIAARTSGATNEKHFFWAAGQAPSPSSVANDLIYHGGNVGSGAIGVETTPAIYLIFWGPDWANGFTTTDANGKQYTSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCASANGTGYVTNPRRQLKGVWTDPTPVPSDII
Ga0307475_102234322 3300031754 Hardwood Forest Soil MPTTAFERPLSLCLRVLALLSLALAAVVTQASGQTSDKHFFWAPGQTPNANSVSNDLIYHGGNAGAGAIGVETKPGVYLIFWGPDWANGFTTTDVNGVQFTSQQLQNYVTSFFTNLGGTPWAAIQTEYCNNVPAGTTSCASVSGGGYVTDPRKQLKGVWTDATPVPSDIIA
Ga0307475_111507031 3300031754 Hardwood Forest Soil GLRVLAFLTISLAVIAAQTVGATNEKHFFWAAGQAPNPSSVANDLIYHGGNAGSGAIGVETTPAIYLIFWGPDWANSFTTTDANGKQYTSQQLQTYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCANANGTGYVTNPRKQLKGVWTDATPVPSDIIALGLAENVADDPLAMEAIRASAHFNYNPQATYIILTPPTSIAT
Ga0307475_113856381 3300031754 Hardwood Forest Soil MHTHAVMRIRRVQLLAAWLLVLVTGSAAAQSPLNPKHFFWAPGQPATPNPSSLANDLVYHGGNAGAGAIGVETVPATYLIFWGPDWGNGFTTTDVNGAAYTSQLLQTYVTSFLTNLAGTSWAAIQDEYCNHVPAGTTSCATAGGGNYVTDPRKQLKGVWTDPTPVPSDIVTLG
Ga0307473_108741321 3300031820 Hardwood Forest Soil MHLHSDQKTTSWCLAAGCLLLLATSLAHAQSPTNPKHFFWAPGQPNTPSPSFLTNDLIYHGGNAGPGAIGVETTPATYLIFWGPDWANGFTTTDANGSVYTSQQLQTYATSFLTNLGGTSWAAIQDEYCNNVAIGTTSCAAVGGGNYVTNPRNQLKGVWTDPTPVPSDIVTLGLAENLAD
Ga0307478_107068481 3300031823 Hardwood Forest Soil MLTTTFRHLPALGLRVLAVITLSLAVIASRAAGATNDKHFFWAAGQAPNPSSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDANGRQYTSQQLQTYVTSFLTNLGGTSWAAIQMEYCNIVPVGTTSCANANGSGYVTNPRKQLKGVWTDTTAVPSDIIALGLAENVADDPLAMEAIRASGHFGYN
Ga0307479_106619381 3300031962 Hardwood Forest Soil MTSRNLGLRVLTLLTISLAALTTQTSGATDDKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVENKPGIYLIFWGPSWANGFKTTDVNGAQYTSQQLQTYVTSFLTNLGGTSWAAIQTEYCNHVPAGTTSCASVSGGGYVTDPRKQLKGVWTDATPVPSDIVALGLTENVANDPLAVEAMRASAHFNYDP
Ga0307479_117629541 3300031962 Hardwood Forest Soil YAMVVAAVVIFATVAQAQTNPKHFFWAPGQPNTPNPSSLTNDLIYHGGNAGAGAIGVENRPATYLIFWGPDWSNGFSTTDARGQVFTSEQLQSYVTSFFTNLGGTSWAAIQTEYCNNIPAGNTSCANIAGANYVTNPRKQLKGVWTDASAVPADIIALGLAENLADDPLAQEAIRASAHFNYDPQATYII
Ga0307479_119957621 3300031962 Hardwood Forest Soil MFFTISLAVIIAQASGATNEKHFFWAKGASPNPSSVSNDLIYHGGNAGPGAIGVETTPAIYLIFWGPDWANGFTTADAQGRQYTSQQLQDYVTSFLANLGGTSWAAIQMEYCKNVPVGTTSCASANGSGFVTNPRKQLKGVWNDPASVPSDIVALGLAENVADDPLAMEAVRASAH
Ga0307470_100914212 3300032174 Hardwood Forest Soil MSTAVFKRLPALGLRVLAFLTITLASIANPASGATSDKHFFWAAGQAPTPGSVSNDLIYHGGNAGAGAIGVETTPAIYLIFWGPDWANGFTTTDANGRQFTSQQLQNYITSFLTNLGGTSWAAIQMEYCNVAPAGTTSCTNANGSGYVTNPRKQLKGVWTDSTPVPSDIIALGLAENVANDPLAAEAIRASGHFGYNPQATYIILT
Ga0307471_1021722901 3300032180 Hardwood Forest Soil MHLSARRVQGPRDSALLIGTARNLKRLSILGLQVFGGLLACAFIAFSAHADGTTNPKHFFWAPGQAPNANSISNDLIYHGGDAGPGAIGVEKTPAIYLIFWGPDWANGFTTTDAKGVQFTSQQLQTYVTSFLTNVGGTPWAAIQKEYCRNVPAGTTSCVGIAGADFITNPQKQLK
Ga0307471_1023118391 3300032180 Hardwood Forest Soil MTSRNLECLPALALRMLALLTISLAVSTTQASGTTAEKHFFWAPGQAPNASSVANDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDVNGRQYTSKQLQAYVTSFLTNLGGTPWAAIQMEYCNNVPAGTTSCAAVSGGGFVTNPRKQLKGVWTDATPVP
Ga0307471_1023688951 3300032180 Hardwood Forest Soil APFIFGNKFGEFPTEVAMLTTTLKHLPALGLRMLVLLTLSLAVITTQASGTTDEKHFFWAPGQAPNPSSVSNDLIYHGGNAGPGAIGVETTPGIYLIFWGPDWANGFTTTDVNGKQYTSQQLRSYVTSFLTNLGGTSWAAIQTEYCNNVPAGTTSCASVIGGGYVTNPRKQLKGVWTDTTPVPSDIVALGLAENVADDPLATEAMRASVHFNYDPQATYIILT


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice