NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F037152

Metagenome / Metatranscriptome Family F037152

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F037152
Family Type Metagenome / Metatranscriptome
Number of Sequences 168
Average Sequence Length 146 residues
Representative Sequence MSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKKLK
Number of Associated Samples 107
Number of Associated Scaffolds 168

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Archaea
% of genes with valid RBS motifs 25.00 %
% of genes near scaffold ends (potentially truncated) 39.88 %
% of genes from short scaffolds (< 2000 bps) 61.31 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (92.857 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(30.357 % of family members)
Environment Ontology (ENVO) Unclassified
(61.310 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(69.048 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 30.77%    β-sheet: 20.98%    Coil/Unstructured: 48.25%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 168 Family Scaffolds
PF07883Cupin_2 14.29
PF00982Glyco_transf_20 7.74
PF00230MIP 5.36
PF02358Trehalose_PPase 4.17
PF08753NikR_C 4.17
PF00483NTP_transferase 4.17
PF00730HhH-GPD 2.98
PF06745ATPase 2.38
PF01555N6_N4_Mtase 1.79
PF02933CDC48_2 1.79
PF01918Alba 1.79
PF00296Bac_luciferase 1.19
PF02359CDC48_N 1.19
PF11838ERAP1_C 1.19
PF13412HTH_24 0.60
PF01738DLH 0.60
PF01040UbiA 0.60
PF01435Peptidase_M48 0.60
PF00641zf-RanBP 0.60
PF00005ABC_tran 0.60
PF00723Glyco_hydro_15 0.60
PF01370Epimerase 0.60
PF00950ABC-3 0.60
PF13419HAD_2 0.60

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 168 Family Scaffolds
COG0380Trehalose-6-phosphate synthase, GT20 familyCarbohydrate transport and metabolism [G] 7.74
COG0580Glycerol uptake facilitator or related aquaporin (Major Intrinsic protein Family)Carbohydrate transport and metabolism [G] 5.36
COG0864Metal-responsive transcriptional regulator, contains CopG/Arc/MetJ DNA-binding domainTranscription [K] 4.17
COG1877Trehalose-6-phosphate phosphataseCarbohydrate transport and metabolism [G] 4.17
COG0177Endonuclease IIIReplication, recombination and repair [L] 2.98
COG01223-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylaseReplication, recombination and repair [L] 2.98
COG1059Thermostable 8-oxoguanine DNA glycosylaseReplication, recombination and repair [L] 2.98
COG1194Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairsReplication, recombination and repair [L] 2.98
COG22313-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamilyReplication, recombination and repair [L] 2.98
COG0863DNA modification methylaseReplication, recombination and repair [L] 1.79
COG1041tRNA G10 N-methylase Trm11Translation, ribosomal structure and biogenesis [J] 1.79
COG1581DNA/RNA-binding protein AlbA/Ssh10bTranscription [K] 1.79
COG2189Adenine specific DNA methylase ModReplication, recombination and repair [L] 1.79
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 1.19
COG4606ABC-type enterochelin transport system, permease componentInorganic ion transport and metabolism [P] 0.60
COG3387Glucoamylase (glucan-1,4-alpha-glucosidase), GH15 familyCarbohydrate transport and metabolism [G] 0.60
COG1108ABC-type Mn2+/Zn2+ transport system, permease componentInorganic ion transport and metabolism [P] 0.60
COG0609ABC-type Fe3+-siderophore transport system, permease componentInorganic ion transport and metabolism [P] 0.60


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms97.62 %
UnclassifiedrootN/A2.38 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002123|C687J26634_10046998All Organisms → cellular organisms → Archaea1638Open in IMG/M
3300002407|C687J29651_10008239All Organisms → cellular organisms → Archaea4306Open in IMG/M
3300002485|C687J35088_10107218All Organisms → cellular organisms → Archaea835Open in IMG/M
3300002558|JGI25385J37094_10004381All Organisms → cellular organisms → Archaea4922Open in IMG/M
3300002558|JGI25385J37094_10012983All Organisms → cellular organisms → Archaea2987Open in IMG/M
3300002558|JGI25385J37094_10029146All Organisms → cellular organisms → Archaea1951Open in IMG/M
3300002558|JGI25385J37094_10134881All Organisms → cellular organisms → Archaea679Open in IMG/M
3300002558|JGI25385J37094_10174542All Organisms → cellular organisms → Archaea576Open in IMG/M
3300002558|JGI25385J37094_10175919All Organisms → cellular organisms → Archaea573Open in IMG/M
3300002560|JGI25383J37093_10002223All Organisms → cellular organisms → Bacteria5621Open in IMG/M
3300002560|JGI25383J37093_10013702All Organisms → cellular organisms → Archaea2682Open in IMG/M
3300002560|JGI25383J37093_10032131All Organisms → cellular organisms → Archaea1751Open in IMG/M
3300002560|JGI25383J37093_10152859All Organisms → cellular organisms → Archaea611Open in IMG/M
3300002561|JGI25384J37096_10007722All Organisms → cellular organisms → Bacteria4012Open in IMG/M
3300002561|JGI25384J37096_10010633All Organisms → cellular organisms → Archaea3499Open in IMG/M
3300002561|JGI25384J37096_10013132All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon3179Open in IMG/M
3300002562|JGI25382J37095_10025005Not Available2334Open in IMG/M
3300002562|JGI25382J37095_10035939All Organisms → cellular organisms → Archaea1940Open in IMG/M
3300002562|JGI25382J37095_10133912All Organisms → cellular organisms → Archaea828Open in IMG/M
3300002908|JGI25382J43887_10067896All Organisms → cellular organisms → Archaea1928Open in IMG/M
3300002908|JGI25382J43887_10397634All Organisms → cellular organisms → Archaea582Open in IMG/M
3300002916|JGI25389J43894_1038401All Organisms → cellular organisms → Archaea824Open in IMG/M
3300002916|JGI25389J43894_1052219All Organisms → cellular organisms → Archaea691Open in IMG/M
3300005175|Ga0066673_10016732All Organisms → cellular organisms → Archaea3291Open in IMG/M
3300005176|Ga0066679_10101875All Organisms → cellular organisms → Bacteria1746Open in IMG/M
3300005176|Ga0066679_10483747All Organisms → cellular organisms → Archaea808Open in IMG/M
3300005177|Ga0066690_10056548All Organisms → cellular organisms → Archaea2410Open in IMG/M
3300005178|Ga0066688_10019290All Organisms → cellular organisms → Archaea3580Open in IMG/M
3300005179|Ga0066684_10057109All Organisms → cellular organisms → Archaea2261Open in IMG/M
3300005180|Ga0066685_10002887All Organisms → cellular organisms → Bacteria8177Open in IMG/M
3300005180|Ga0066685_10013580All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon4681Open in IMG/M
3300005180|Ga0066685_10037547All Organisms → cellular organisms → Bacteria3047Open in IMG/M
3300005180|Ga0066685_10064421All Organisms → cellular organisms → Archaea2388Open in IMG/M
3300005180|Ga0066685_10264677All Organisms → cellular organisms → Bacteria1187Open in IMG/M
3300005186|Ga0066676_10189894All Organisms → cellular organisms → Archaea1314Open in IMG/M
3300005447|Ga0066689_10023864All Organisms → cellular organisms → Archaea3032Open in IMG/M
3300005468|Ga0070707_100008335All Organisms → cellular organisms → Bacteria9621Open in IMG/M
3300005536|Ga0070697_100004740All Organisms → cellular organisms → Archaea10420Open in IMG/M
3300005536|Ga0070697_100252459All Organisms → cellular organisms → Archaea1508Open in IMG/M
3300005554|Ga0066661_10331734All Organisms → cellular organisms → Archaea934Open in IMG/M
3300005555|Ga0066692_10677122All Organisms → cellular organisms → Archaea641Open in IMG/M
3300005556|Ga0066707_10008995Not Available4797Open in IMG/M
3300005557|Ga0066704_10008924All Organisms → cellular organisms → Archaea5427Open in IMG/M
3300005557|Ga0066704_10801393All Organisms → cellular organisms → Archaea586Open in IMG/M
3300005559|Ga0066700_11020539All Organisms → cellular organisms → Archaea543Open in IMG/M
3300005561|Ga0066699_10015832All Organisms → cellular organisms → Archaea3971Open in IMG/M
3300005574|Ga0066694_10422399All Organisms → cellular organisms → Archaea625Open in IMG/M
3300005576|Ga0066708_10058129All Organisms → cellular organisms → Archaea2186Open in IMG/M
3300005586|Ga0066691_10333362All Organisms → cellular organisms → Archaea897Open in IMG/M
3300005598|Ga0066706_10004607All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon6639Open in IMG/M
3300005598|Ga0066706_10166672All Organisms → cellular organisms → Archaea1670Open in IMG/M
3300006031|Ga0066651_10020055All Organisms → cellular organisms → Archaea2864Open in IMG/M
3300006032|Ga0066696_10904401All Organisms → cellular organisms → Archaea562Open in IMG/M
3300006034|Ga0066656_10235929All Organisms → cellular organisms → Archaea1173Open in IMG/M
3300006791|Ga0066653_10042681All Organisms → cellular organisms → Archaea1852Open in IMG/M
3300006797|Ga0066659_10267546All Organisms → cellular organisms → Archaea1288Open in IMG/M
3300006797|Ga0066659_10681488All Organisms → cellular organisms → Archaea839Open in IMG/M
3300007255|Ga0099791_10299460All Organisms → cellular organisms → Archaea766Open in IMG/M
3300009012|Ga0066710_100432571All Organisms → cellular organisms → Archaea1969Open in IMG/M
3300009012|Ga0066710_100616452All Organisms → cellular organisms → Archaea1647Open in IMG/M
3300009012|Ga0066710_101146996All Organisms → cellular organisms → Archaea1203Open in IMG/M
3300009038|Ga0099829_10372345All Organisms → cellular organisms → Archaea1178Open in IMG/M
3300009038|Ga0099829_10561673All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon948Open in IMG/M
3300009089|Ga0099828_10458297All Organisms → cellular organisms → Archaea1150Open in IMG/M
3300009089|Ga0099828_10625324All Organisms → cellular organisms → Archaea969Open in IMG/M
3300009090|Ga0099827_10286954All Organisms → cellular organisms → Archaea1390Open in IMG/M
3300009804|Ga0105063_1006595All Organisms → cellular organisms → Archaea1148Open in IMG/M
3300010335|Ga0134063_10172501All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1010Open in IMG/M
3300010366|Ga0126379_12214801All Organisms → cellular organisms → Archaea651Open in IMG/M
3300011120|Ga0150983_14788403All Organisms → cellular organisms → Archaea541Open in IMG/M
3300011271|Ga0137393_10458215All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Myxococcaceae1093Open in IMG/M
3300012096|Ga0137389_10075988All Organisms → cellular organisms → Archaea2612Open in IMG/M
3300012189|Ga0137388_10187712All Organisms → cellular organisms → Archaea1856Open in IMG/M
3300012189|Ga0137388_10364153All Organisms → cellular organisms → Archaea1335Open in IMG/M
3300012189|Ga0137388_10463399All Organisms → cellular organisms → Archaea1176Open in IMG/M
3300012189|Ga0137388_11173738All Organisms → cellular organisms → Archaea705Open in IMG/M
3300012203|Ga0137399_10195063All Organisms → cellular organisms → Archaea1641Open in IMG/M
3300012203|Ga0137399_10587606All Organisms → cellular organisms → Archaea936Open in IMG/M
3300012205|Ga0137362_10146311All Organisms → cellular organisms → Archaea → Euryarchaeota2018Open in IMG/M
3300012206|Ga0137380_10163585All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon2028Open in IMG/M
3300012206|Ga0137380_11706275All Organisms → cellular organisms → Archaea514Open in IMG/M
3300012207|Ga0137381_10183671All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1809Open in IMG/M
3300012354|Ga0137366_10160933All Organisms → cellular organisms → Archaea1687Open in IMG/M
3300012356|Ga0137371_10099151All Organisms → cellular organisms → Archaea → Euryarchaeota2269Open in IMG/M
3300012360|Ga0137375_10564688All Organisms → cellular organisms → Archaea955Open in IMG/M
3300012363|Ga0137390_11562197All Organisms → cellular organisms → Archaea599Open in IMG/M
3300012390|Ga0134054_1236245All Organisms → cellular organisms → Archaea571Open in IMG/M
3300012918|Ga0137396_10445160All Organisms → cellular organisms → Archaea960Open in IMG/M
3300012918|Ga0137396_11224118All Organisms → cellular organisms → Archaea527Open in IMG/M
3300012925|Ga0137419_10511513All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon954Open in IMG/M
3300012944|Ga0137410_10130472All Organisms → cellular organisms → Archaea1895Open in IMG/M
3300012948|Ga0126375_10587421All Organisms → cellular organisms → Archaea848Open in IMG/M
3300012971|Ga0126369_13561502All Organisms → cellular organisms → Archaea510Open in IMG/M
3300012972|Ga0134077_10474274All Organisms → cellular organisms → Archaea551Open in IMG/M
3300012976|Ga0134076_10156106All Organisms → cellular organisms → Archaea937Open in IMG/M
3300014157|Ga0134078_10349025All Organisms → cellular organisms → Archaea650Open in IMG/M
3300015245|Ga0137409_10346856All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae → Nitrososphaera → Candidatus Nitrososphaera evergladensis1295Open in IMG/M
3300017656|Ga0134112_10009883All Organisms → cellular organisms → Archaea3144Open in IMG/M
3300017659|Ga0134083_10026893All Organisms → cellular organisms → Archaea2073Open in IMG/M
3300018431|Ga0066655_10011562All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon3847Open in IMG/M
3300018431|Ga0066655_10030149All Organisms → cellular organisms → Archaea2634Open in IMG/M
3300018433|Ga0066667_10036207All Organisms → cellular organisms → Archaea2837Open in IMG/M
3300018433|Ga0066667_10061275Not Available2328Open in IMG/M
3300018433|Ga0066667_10300125All Organisms → cellular organisms → Archaea1249Open in IMG/M
3300018433|Ga0066667_10504323All Organisms → cellular organisms → Archaea997Open in IMG/M
3300018468|Ga0066662_10075071All Organisms → cellular organisms → Archaea2284Open in IMG/M
3300018468|Ga0066662_10985780All Organisms → cellular organisms → Archaea833Open in IMG/M
3300021046|Ga0215015_10345919All Organisms → cellular organisms → Archaea6422Open in IMG/M
3300021046|Ga0215015_10456071All Organisms → cellular organisms → Archaea1441Open in IMG/M
3300025160|Ga0209109_10083432All Organisms → cellular organisms → Archaea1662Open in IMG/M
3300025324|Ga0209640_10571828All Organisms → cellular organisms → Archaea914Open in IMG/M
3300026296|Ga0209235_1008993All Organisms → cellular organisms → Archaea5557Open in IMG/M
3300026296|Ga0209235_1024159All Organisms → cellular organisms → Archaea3215Open in IMG/M
3300026296|Ga0209235_1056898Not Available1848Open in IMG/M
3300026297|Ga0209237_1001248All Organisms → cellular organisms → Archaea15029Open in IMG/M
3300026297|Ga0209237_1006688All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon6947Open in IMG/M
3300026297|Ga0209237_1008403All Organisms → cellular organisms → Archaea6182Open in IMG/M
3300026297|Ga0209237_1012775All Organisms → cellular organisms → Archaea4959Open in IMG/M
3300026297|Ga0209237_1045590All Organisms → cellular organisms → Archaea2248Open in IMG/M
3300026298|Ga0209236_1000222All Organisms → cellular organisms → Archaea30677Open in IMG/M
3300026298|Ga0209236_1001953All Organisms → cellular organisms → Archaea12571Open in IMG/M
3300026301|Ga0209238_1163212All Organisms → cellular organisms → Archaea664Open in IMG/M
3300026306|Ga0209468_1042283All Organisms → cellular organisms → Archaea → Euryarchaeota1564Open in IMG/M
3300026309|Ga0209055_1016963All Organisms → cellular organisms → Archaea3595Open in IMG/M
3300026313|Ga0209761_1000286All Organisms → cellular organisms → Archaea29190Open in IMG/M
3300026313|Ga0209761_1007209All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota7284Open in IMG/M
3300026313|Ga0209761_1191661All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon912Open in IMG/M
3300026316|Ga0209155_1004943All Organisms → cellular organisms → Archaea → Euryarchaeota5976Open in IMG/M
3300026318|Ga0209471_1134117All Organisms → cellular organisms → Archaea1041Open in IMG/M
3300026332|Ga0209803_1004745All Organisms → cellular organisms → Archaea7880Open in IMG/M
3300026332|Ga0209803_1327650All Organisms → cellular organisms → Archaea528Open in IMG/M
3300026334|Ga0209377_1002040All Organisms → cellular organisms → Archaea13313Open in IMG/M
3300026335|Ga0209804_1045055All Organisms → cellular organisms → Archaea → Euryarchaeota2176Open in IMG/M
3300026342|Ga0209057_1226545All Organisms → cellular organisms → Archaea531Open in IMG/M
3300026371|Ga0257179_1035965All Organisms → cellular organisms → Archaea618Open in IMG/M
3300026480|Ga0257177_1012499All Organisms → cellular organisms → Archaea1142Open in IMG/M
3300026480|Ga0257177_1091509All Organisms → cellular organisms → Archaea500Open in IMG/M
3300026499|Ga0257181_1049333All Organisms → cellular organisms → Archaea697Open in IMG/M
3300026514|Ga0257168_1053504All Organisms → cellular organisms → Archaea886Open in IMG/M
3300026524|Ga0209690_1100602All Organisms → cellular organisms → Archaea1183Open in IMG/M
3300026524|Ga0209690_1190519All Organisms → cellular organisms → Archaea664Open in IMG/M
3300026529|Ga0209806_1002088All Organisms → cellular organisms → Archaea12602Open in IMG/M
3300026530|Ga0209807_1000226All Organisms → cellular organisms → Archaea25171Open in IMG/M
3300026536|Ga0209058_1150925All Organisms → cellular organisms → Archaea1090Open in IMG/M
3300026537|Ga0209157_1071129All Organisms → cellular organisms → Archaea1744Open in IMG/M
3300026538|Ga0209056_10047006All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon3893Open in IMG/M
3300026538|Ga0209056_10715519All Organisms → cellular organisms → Archaea507Open in IMG/M
3300026540|Ga0209376_1030669All Organisms → cellular organisms → Archaea3373Open in IMG/M
3300026540|Ga0209376_1160735All Organisms → cellular organisms → Archaea1066Open in IMG/M
3300026548|Ga0209161_10000863All Organisms → cellular organisms → Archaea24861Open in IMG/M
3300026552|Ga0209577_10121858All Organisms → cellular organisms → Archaea2087Open in IMG/M
3300027643|Ga0209076_1051942All Organisms → cellular organisms → Archaea1160Open in IMG/M
3300027748|Ga0209689_1009683All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon6459Open in IMG/M
3300027846|Ga0209180_10145118All Organisms → cellular organisms → Archaea1366Open in IMG/M
3300027846|Ga0209180_10475329All Organisms → cellular organisms → Archaea702Open in IMG/M
3300027862|Ga0209701_10052400All Organisms → cellular organisms → Archaea2615Open in IMG/M
3300027862|Ga0209701_10267326All Organisms → cellular organisms → Archaea994Open in IMG/M
3300027862|Ga0209701_10271103All Organisms → cellular organisms → Archaea985Open in IMG/M
3300027875|Ga0209283_10010464All Organisms → cellular organisms → Archaea5489Open in IMG/M
3300027875|Ga0209283_10059720All Organisms → cellular organisms → Archaea2443Open in IMG/M
3300027875|Ga0209283_10107786All Organisms → cellular organisms → Archaea1827Open in IMG/M
3300027875|Ga0209283_10109313All Organisms → cellular organisms → Archaea1814Open in IMG/M
3300027875|Ga0209283_10420512All Organisms → cellular organisms → Archaea869Open in IMG/M
3300027882|Ga0209590_10283912All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1061Open in IMG/M
3300027882|Ga0209590_10537010All Organisms → cellular organisms → Archaea754Open in IMG/M
3300028536|Ga0137415_10145371All Organisms → cellular organisms → Archaea2209Open in IMG/M
3300031820|Ga0307473_11058901All Organisms → cellular organisms → Archaea595Open in IMG/M
3300032180|Ga0307471_100456339All Organisms → cellular organisms → Archaea1418Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil30.36%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil26.79%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil24.40%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil4.17%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil2.38%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.79%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.79%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.19%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.19%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.19%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.60%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.60%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand0.60%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002123Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_3EnvironmentalOpen in IMG/M
3300002407Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1EnvironmentalOpen in IMG/M
3300002485Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1EnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009804Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012390Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
C687J26634_1004699823300002123SoilMSSFDNWVKSYENQLRDWLSKCPACMLSPQVALHAQRGMAMVQVGQSFEFDQRLRGQNNPPNHLVAKRLVPFNGKIKNPIIFWLNTSFPRDSVYQLHEQPVVHHVLRLGAGYTFASVLEGLKTALKAEKIDIDSLSDWDLKKWK*
C687J29651_1000823923300002407SoilMSSFDNWVKSYENQLRDWLSKCPACMLSPQVALHAQRGTAMVQVGQSFEFDQRLRGQNNPPNHLVAKRLVPFNGKXKNPIIFWLNTSFPRDSVYQLHEQPVVHHVLRLGAGYTFASVLEGLKTALKAEKIDIDSLSDWDLKKWK*
C687J35088_1010721813300002485SoilMSSFDNWVKSYENQLRDWLSKCPACMLSPQVALHAQRGMAMVQVGQSFEFDQRLRGQNNPPNHLVAKRLVPFNGKVKNPIIFWLNTSFPRDSVYQLHEQPVVHHVLRLGAGYTFASVLEGLKTALKAEKIDIDSLSDWDLKKWK*
JGI25385J37094_1000438133300002558Grasslands SoilMYLGMSSFDDAVESYEKQLREWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK*
JGI25385J37094_1001298313300002558Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKSK*
JGI25385J37094_1002914623300002558Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGRIKNPIIFWLNSSFPSDPIYQLHKQPVIHHVLRLGPGYTFTNVLGGLRTALKAEKINGDSIDWDPKKLK*
JGI25385J37094_1013488123300002558Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINSDSIEWDPKKLK*
JGI25385J37094_1017454213300002558Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLG
JGI25385J37094_1017591913300002558Grasslands SoilRDWLSKCPNCLLSPQVSLHARRGTAMVQVGQSFEFDQRLGNRDNPPNRLVANRLVPYNGKLKNPIIFWLNTLFPKDQVYQLHEKPVVHHVLRLGAGYTFAMVREGLKAALKAEKIDIDTLSDWDPKKWQEK*
JGI25383J37093_1000222383300002560Grasslands SoilMSSYDNQVKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
JGI25383J37093_1001370233300002560Grasslands SoilMSSFDNWVKSYEIQLRDWLSKCPNCLLSPQVSLHARRGTAMVQVGQSFEFDQRLGNRDNPPNRLVANRLVPYNGKLKNPIIFWLNTLFPKDQVYQLHEKPVVHHVLRLGAGYTFAMVREGLKAALKAEKIDIDTLSDWDPKKWQEK*
JGI25383J37093_1003213113300002560Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHXVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKLK*
JGI25383J37093_1015285913300002560Grasslands SoilMSSYDNQVKALEIQVRDWMAKCPNCMLSPQVALHARRGTAMVQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKVDIDSLGEWLPKKKKEKK*
JGI25384J37096_1000772263300002561Grasslands SoilKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
JGI25384J37096_1001063323300002561Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKXK*
JGI25384J37096_1001313253300002561Grasslands SoilRSTTLLSRDAYLPGMSSFDNAVESYEKQLRDWLSKXSXTLLSRDAYLPGMSSFDNAVESYXKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINSDSIEWDPKKLK*
JGI25382J37095_1002500543300002562Grasslands SoilMSSYDNQVKALEIQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
JGI25382J37095_1003593923300002562Grasslands SoilMSSFDNAVESYEKQLRDWLSRCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTNVLGGLKTALKAEKINADSLDWDPKKLK*
JGI25382J37095_1013391213300002562Grasslands SoilVSRGDKKTAT*XHGSTTLL*SDAYLLGMSSFDNAVESYEKQLRDWLSKCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKIN
JGI25382J43887_1006789633300002908Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINTDSLDWDPKRLK*
JGI25382J43887_1039763413300002908Grasslands SoilRMSSYDNQVKALEIQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
JGI25389J43894_103840113300002916Grasslands SoilFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKSK*
JGI25389J43894_105221913300002916Grasslands SoilMLS*TEEAVALEARSTTLLSRDAYLR*MSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIEWDPKKLK*
Ga0066673_1001673233300005175SoilMTDGTLRGPPRQADCLMARSTTLLSSRAILWGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK*
Ga0066679_1010187523300005176SoilMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLIGLKTALKAEKINGDSLEWDPKKLK*
Ga0066679_1048374713300005176SoilMSSFTSAVQSYENQLRDWLSKCPACVLSPQVSLNARRGTAMVQVGQSFEFDERLHARDNPPNRLVADRLVPYNGKLKNPIIFWLNSSFPTDSVYQLHKQPVTHHVLRLGAGYTFASVLAALKTALKAEKINIDSLSDWDPKKWK*
Ga0066690_1005654823300005177SoilMSSFGVAMETYEKQLKDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKELK*
Ga0066688_1001929023300005178SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIEWDPKKLK*
Ga0066684_1005710943300005179SoilMTDGTLRGPPRQADCLMARSTTLLSSRAILWGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINADSIDWDPKKLK*
Ga0066685_1000288773300005180SoilMTSRKDSVYHATWSTTLLSRGTMYLGMSSFDDAVESYEKQLREWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK*
Ga0066685_1001358033300005180SoilMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK*
Ga0066685_1003754723300005180SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKLK*
Ga0066685_1006442133300005180SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKRLK*
Ga0066685_1026467723300005180SoilMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINADSIDWDPKKLK*
Ga0066676_1018989413300005186SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKKLK*
Ga0066689_1002386433300005447SoilMSSFDSAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIEWDPKKLK*
Ga0070707_10000833573300005468Corn, Switchgrass And Miscanthus RhizosphereMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVIHHVLRLGAGYTFTSVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0070697_10000474073300005536Corn, Switchgrass And Miscanthus RhizosphereMTRHKDYVSMMTLSTTLLSRGAFYLGMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0070697_10025245913300005536Corn, Switchgrass And Miscanthus RhizosphereLGMSSFDNAVESYEKQLRDWLAKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHSRENPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0066661_1033173413300005554SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKKLK*
Ga0066692_1067712213300005555SoilSYDNQVKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
Ga0066707_1000899543300005556SoilMTDGTLRGPPRQADCLMARSTTLLSSRAILWGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFT
Ga0066704_1000892473300005557SoilMTGSTTLLSSDAILVAMSSFDNALESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDPVYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSLDWDPKKLK*
Ga0066704_1080139313300005557SoilSYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKSK*
Ga0066700_1102053913300005559SoilNQVKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
Ga0066699_1001583233300005561SoilMVSFDSAVESYEKQLRDWLSKCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKKLK*
Ga0066694_1042239913300005574SoilYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK*
Ga0066708_1005812923300005576SoilMLWGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINADSIDWDPKKLK*
Ga0066691_1033336213300005586SoilLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK*
Ga0066706_1000460783300005598SoilMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSLEWDPKKLK*
Ga0066706_1016667213300005598SoilAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIEWDPKKLK*
Ga0066651_1002005513300006031SoilILWGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK*
Ga0066696_1090440123300006032SoilGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINADSIDWDPKKLK*
Ga0066656_1023592923300006034SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKMNADSIDWDPKRLQ*
Ga0066653_1004268113300006791SoilMSSLGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK*
Ga0066659_1026754613300006797SoilDRSKAQGSGPCPVGVRGFKSHPPHQSGSVTIPRFLHFANWRMTSRKDSVYHATLSTTLLSRGTIYLGMSSFDDAVESYEKQLREWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK*
Ga0066659_1068148813300006797SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIDWDPKKLK
Ga0099791_1029946013300007255Vadose Zone SoilMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHSRENPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINTDSIDWDPKKLK*
Ga0066710_10043257123300009012Grasslands SoilMYLGMSSFDDAVESYEKQLREWLSKCPACVLSPQVMLHARRGTTIIQLGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK
Ga0066710_10061645213300009012Grasslands SoilMNSFNERVKTDEVQLRDWLSRCPVCQLSPQVSLHARKGMVMVQVGQSFEFDERLRAHDNPPNRLVAEHLVSYNSKVENPIIFWLNSSFPRDSVYNYHDQPVAHHVLRLGPGYTFASLLEGLKTALRAETIDVDSLQDWLPKNWK
Ga0066710_10114699613300009012Grasslands SoilLWMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKSK
Ga0099829_1037234513300009038Vadose Zone SoilMARSTTLLSSAAVLERMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGVKTALKAEKINADSLEWDPKKLK*
Ga0099829_1056167313300009038Vadose Zone SoilMLSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKIALKAEKVDIDSLGEWLPKKKKEKK*
Ga0099828_1045829713300009089Vadose Zone SoilNAVESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHSRENPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINTDSIDWDPKKLK*
Ga0099828_1062532413300009089Vadose Zone SoilMMTLSTTLLSRGTSYLGMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLIGVKT
Ga0099827_1028695423300009090Vadose Zone SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARNNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPVYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKKLK*
Ga0105063_100659523300009804Groundwater SandMSSFDNWVKSYENQLRDWLSKCPACMLSPQVALHAQRGTAMVQVGQSFEFDQRLRRQDNPPNHLVAKRLVPFNGKIKNPIIFWLNTSFPRDSVYQLHEQPVVHHVLRLGAGYTFASVLEGLKTALKAEKIDIDSLSDWDLKKWK*
Ga0134063_1017250123300010335Grasslands SoilMSSYDNQVKALEVQVRDWMAKCPNCILSPQVALHARRGTAMVQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
Ga0126379_1221480113300010366Tropical Forest SoilMSSFNDSVKAYEDKLRDGLSKCPECMLSPQVALHARRGSAMVQVGQSFEFDERLRAHDNPPNRLVASRLVPFNGKVKNPIIFWLNSSFPRDSVYQLHKQSFVHHVLRLGAGYTFASLFEGLKTALKAEGIDVESLRDWDLKNWK*
Ga0150983_1478840323300011120Forest SoilAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHSRDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0137393_1045821523300011271Vadose Zone SoilMARSTTLLSSAAVLEGMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLIGVKTALKAEKINADSIEWDPKKLK*
Ga0137389_1007598813300012096Vadose Zone SoilMTSHKDNVYHDDLSTTLLSRGTTYLGMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0137388_1018771213300012189Vadose Zone SoilYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKIALKAEKINSDSIEWDPKKLK*
Ga0137388_1036415323300012189Vadose Zone SoilMARSTTLLSSAAVLEGMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARNNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0137388_1046339913300012189Vadose Zone SoilNLLGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGVKTALKAEKINGDSIDWDPKKLK*
Ga0137388_1117373813300012189Vadose Zone SoilMLSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKIALKAEKVDIDSLGEWLPKKKKEHNKN*
Ga0137399_1019506323300012203Vadose Zone SoilMSSFDDAVESFEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0137399_1058760613300012203Vadose Zone SoilTLLSRDATYPGMSSFGDAVESYEKQLRDYQSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSLEWDPKKLK*
Ga0137362_1014631133300012205Vadose Zone SoilLGMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINVDSLEWDPKKLK*
Ga0137380_1016358513300012206Vadose Zone SoilGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKLK*
Ga0137380_1170627513300012206Vadose Zone SoilMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGVKTALKAEKINADSLEWDPKKLK*
Ga0137381_1018367113300012207Vadose Zone SoilKKAANLVTRSTTLLSRGTIYLGMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGVKTALKAEKINADSLEWDPKKLK*
Ga0137366_1016093323300012354Vadose Zone SoilMVNNSIMKGRLLGGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINGDSIEWDPKKLK*
Ga0137371_1009915133300012356Vadose Zone SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINGDSIDWDPKKLK*
Ga0137375_1056468813300012360Vadose Zone SoilKSYESQLRDWLSKCPACMLSPQVALHAQRGTAMVQVGQSFEFDQRLRGQNNPPNHLVAKRLVPFNGKVKNPIIFWLNTSFPRDSVYQLHEQPVVHHVLRLGAGYTFASILEGLKTALKAEKIDIDSLSDWDLKKWK*
Ga0137390_1156219713300012363Vadose Zone SoilKMTLSTTLLSRATIYVGMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLIQYNGKIKNPIIFWLNSSFPSDSIYQFHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSIEWDPKKLK*
Ga0134054_123624513300012390Grasslands SoilVKTDEVQLRDWLSRCPVCQLSPQVSLHARKGMVMVQVGQSFEFDERLRAHDNPPNRLVAEHLVSYNSKVENPIIFWLNSSFPRDSVYNYHDQPVAHHVLRLGPGYTFASLLEGLKTALRAETIDVDSLQDWLSKNWK*
Ga0137396_1044516023300012918Vadose Zone SoilGTSYLGMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINGDSLEWDPKKLK*
Ga0137396_1122411813300012918Vadose Zone SoilMSSYDNQVKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKTALKAEKVDIDSLGEWLPKK
Ga0137419_1051151313300012925Vadose Zone SoilMSSYDNQVKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVIHHVIRLGAGYTFAAVYEGLKTALKAEKVDIDSLGEWLPKKKKEKK*
Ga0137410_1013047223300012944Vadose Zone SoilMSSYDNQVKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKTALKAEKVDIDSLGEWLPKKKKEKK*
Ga0126375_1058742113300012948Tropical Forest SoilDGVRLYEAQLRDWMSKCPLCVLSPQVSLHARKGTAMVQVGQSFEFDERLRDRNNPPNRIISNLLLPYNGKIKNPIIFWLNSSYPRDSAYELHKEPVVHHVLRIGAGYTFTSLLQGLKTAFKAEKIDIDTLSDWNPKNWK*
Ga0126369_1356150213300012971Tropical Forest SoilEDKLRDGLSKCPECILSPQVALHARRGSAMVQVGQSFEFDERLRARDNPPNRLVASRLVPFNGKVKNPIIFWLNTSFPREGIYQLHKQPFVHHVLRLGAGYTFPSLLEGLKTALKAEKIDVDTLRDWDLKNWK*
Ga0134077_1047427413300012972Grasslands SoilMSSYDNQVKALEIQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKID
Ga0134076_1015610613300012976Grasslands SoilMSSYDNQVKALEIQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIGLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK*
Ga0134078_1034902513300014157Grasslands SoilMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKT
Ga0137409_1034685623300015245Vadose Zone SoilMSSYDNQVKALEVQVRDWMAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKTALKAEKVDIDSLGEWLPKKKKEKK*
Ga0134112_1000988343300017656Grasslands SoilMNSFNERVKTDEVQLRDWLSRCPTCQLSPQVSLHARKGMVMVQVGQSFEFDERLRAHDNPPNRLVAEHLVSYNSKVENPIIFWLNSSFPRDSVYNYHDQPVAHHVLRLGPGYTFASLLEGLKTALRAETIDVDSLQDWLPKNWK
Ga0134083_1002689333300017659Grasslands SoilMSAFDNWVKSYEVQVREWMSKCPECNLSPQVSLHARKGMVMVQVGQSFEFDERLRAHDNPPNRLVAEHLVSYNSKVENPIIFWLNSSFPRDSVYNYHDQPVAHHVLRLGPGYTFASLLEGLKTALRAETIDVDSLQDWLPKNWK
Ga0066655_1001156213300018431Grasslands SoilMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAE
Ga0066655_1003014923300018431Grasslands SoilMSSFDNWVKSYEIQLRDWLSKCPNCLLSPQVSLHARRGTAMVQVGQSFEFDQRLGNRDNPPNRLVANRLVPYNGKLKNPIIFWLNTLFPKDQVYQLHEKPVVHHVLRLGAGYTFAMVREGLKAALKAEKIDIDTLSDWDPKKWQDK
Ga0066667_1003620733300018433Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIEWDPKKLK
Ga0066667_1006127533300018433Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKLK
Ga0066667_1030012523300018433Grasslands SoilMLWGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINADSIDWDPKKLK
Ga0066667_1050432323300018433Grasslands SoilMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK
Ga0066662_1007507123300018468Grasslands SoilMTGSTTLLSSDAILVAMSSFDNALESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDPVYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSLDWDPKKLK
Ga0066662_1098578013300018468Grasslands SoilAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK
Ga0215015_1034591933300021046SoilMSSFGNAVESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLGGLKTALKAEKINADSLEWDPKKLK
Ga0215015_1045607133300021046SoilMSSFDNAVESYEKQVRDWLAKCPLCVLSPQVMLHARRGPAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIEWDPKKLK
Ga0209109_1008343223300025160SoilMSSFDNWVKSYENQLRDWLSKCPACMLSPQVALHAQRGTAMVQVGQSFEFDQRLRGQNNPPNHLVAKRLVPFNGKVKNPIIFWLNTSFPRDSVYQLHEQPVVHHVLRLGAGYTFASVLEGLKTALKAEKIDIDSLSDWDLKKWK
Ga0209640_1057182813300025324SoilMSSFDNWVKSYENQLRDWLSKCPACMLSPQVALHAQRGMAMVQVGQSFEFDQRLRGQNNPPNHLVAKRLVPFNGKVKNPIIFWLNTSFPRDSVYQLHEQPVVHHVLRLGAGYTFASVLEGLKTALKAEKIDIDSLSDWD
Ga0209235_100899343300026296Grasslands SoilMSSFDNAVESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPPDPVYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSLDWDPKKLK
Ga0209235_102415923300026296Grasslands SoilMYLGMSSFDDAVESYEKQLREWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK
Ga0209235_105689833300026296Grasslands SoilMSSYDNQVKALEIQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK
Ga0209237_100124823300026297Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSLDWDPKKLK
Ga0209237_100668863300026297Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINSDSIEWDPKKLK
Ga0209237_100840353300026297Grasslands SoilLLSSDAILVAMSSFDNAVESYEKQLRDWLSRCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTNVLGGLKTALKAEKINADSLDWDPKKLK
Ga0209237_101277533300026297Grasslands SoilMSSYDNQVKALEVQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK
Ga0209237_104559013300026297Grasslands SoilSYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKSK
Ga0209236_1000222383300026298Grasslands SoilMSSFDNWVKSYEIQLRDWLSKCPNCLLSPQVSLHARRGTAMVQVGQSFEFDQRLGNRDNPPNRLVANRLVPYNGKLKNPIIFWLNTLFPKDQVYQLHEKPVVHHVLRLGAGYTFAMVREGLKAALKAEKIDIDTLSDWDPKKWQEK
Ga0209236_100195393300026298Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKSK
Ga0209238_116321223300026301Grasslands SoilNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKSK
Ga0209468_104228313300026306SoilSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK
Ga0209055_101696333300026309SoilMTSRKDSVYHATLSTTLLSRGTIYLGMSSFDDAVESYEKQLREWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK
Ga0209761_1000286273300026313Grasslands SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGRIKNPIIFWLNSSFPSDPIYQLHKQPVIHHVLRLGPGYTFTNVLGGLRTALKAEKINGDSIDWDPKKLK
Ga0209761_100720943300026313Grasslands SoilMSSFDDAVESFEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSIEWDPKKLK
Ga0209761_119166123300026313Grasslands SoilMSSYDNQVKALEIQVRDWLAKCPNCILSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKTALKAEKIDIDSLGEWLPKKKKEKK
Ga0209155_100494333300026316SoilMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINADSIDWDPKKLK
Ga0209471_113411723300026318SoilMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLIGLKTALKAEKINGDSLEWDPKKLK
Ga0209803_100474553300026332SoilEARSTTLLSRDAYIRGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINGDSIEWDPKKLK
Ga0209803_132765013300026332SoilRCLMAGSTTLLRRDAYRLWMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKLK
Ga0209377_1002040113300026334SoilLLGMSSFDNAVESYEKQLRDWLSKCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSLDWDPKKLK
Ga0209804_104505533300026335SoilSSFTSAVQSYENQLRDWLSKCPACVLSPQVSLNARRGTAMVQVGQSFEFDERLHARDNPPNRLVADRLVPYNGKLKNPIIFWLNSSFPTDSVYQLHKQPVTHHVLRLGAGYTFASVLAALKTALKAEKINIDSLSDWDPKKWK
Ga0209057_122654513300026342SoilRDAYLRGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINSDSIEWDPKKLK
Ga0257179_103596513300026371SoilLPNCRVNNSIMKGCLLWGMSSFGSAVESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHSRNNPPNHLLAVQLVQYNGKIKNPIIFWLNSSFPSDPVYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKKLK
Ga0257177_101249913300026480SoilKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSLEWDPKKLK
Ga0257177_109150913300026480SoilKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLRGRDNPPNRLVANRLVPFNGMIKNPIILWLNSSFPRDSIYQLHEQPVVHHVLRLGAGYTFASLLEGLKTALKAEKIDVDTLRDWDVKNWK
Ga0257181_104933323300026499SoilMSSFDNAVESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHSRENPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINTDSIDWDPKKLK
Ga0257168_105350413300026514SoilKSHPPHQSGSVARPRFIHFVNFRGWPATKITSPKMTLSTTLLSRGTIYVGMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARNNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSIEWDPKKLK
Ga0209690_110060213300026524SoilMSSYDNQVKALEVQVRDWMAKCPNCMLSPQVALHARRGTAMVQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHLIRLGAGYTFAAVYEG
Ga0209690_119051913300026524SoilMSSFDNAVESYEKQLRDWLSKCPVCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINTDSLDWDPKRLK
Ga0209806_1002088113300026529SoilNWRMTSRKDSVYHATLSTTLLSRGTIYLGMSSFDDAVESYEKQLREWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTNVLVGLKTALKAEKINADSLEWDPKKLK
Ga0209807_100022653300026530SoilMTDGTLRGPPRQADCLMARSTTLLSSRAILWGMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINADSIDWDPKKLK
Ga0209058_115092523300026536SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTS
Ga0209157_107112923300026537SoilVKTDEVQLRDWLSRCPVCQLSPQVSLHARKGMVMVQVGQSFEFDERLRAHDNPPNRLVAEHLVSYNSKVENPIIFWLNSSFPRDSVYNYHDQPVAHHVLRLGPGYTFASLLEGLKTALRAETIDVDSLQDWLPKNWK
Ga0209056_1004700613300026538SoilDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWDPKKLK
Ga0209056_1071551913300026538SoilYERQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSLEWDPKKLK
Ga0209376_103066953300026540SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLRTALKAEKINADSID
Ga0209376_116073513300026540SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKRLK
Ga0209161_1000086323300026548SoilMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSLEWDPKKLK
Ga0209577_1012185823300026552SoilMSSFGLAVETYEKQLRDWLSKCPVCLLSPQVMLHARKGMAMIQVGQSFEFDERLHARDNPPNHLLARQLIQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTGVLGALKTALKAEKINTDSIDWAHKKLK
Ga0209076_105194223300027643Vadose Zone SoilMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINGDSIEWDPKKLK
Ga0209689_100968343300027748SoilMSSFTSAVQSYENQLRDWLSKCPACVLSPQVSLNARRGTAMVQVGQSFEFDERLHARDNPPNRLVADRLVPYNGKLKNPIIFWLNSSFPTDSVYQLHKQPVTHHVLRLGAGYTFASVLAALKTALKAEKINIDSLSDWDPKKWK
Ga0209180_1014511823300027846Vadose Zone SoilMAKCPNCMLSPQVALHARRGTAMIQVGQSFEFDQRLKAYDDPPNYLVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKIALKAEKVDIDSLGEWLPKKKKEKK
Ga0209180_1047532923300027846Vadose Zone SoilMGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGVKTALKAEK
Ga0209701_1005240013300027862Vadose Zone SoilPTLSFLHDATTLLSSASILEGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINSDSIEWDPKKLK
Ga0209701_1026732623300027862Vadose Zone SoilTYTVVLTPRTNQVQPSRQRFSDFAGSYPVTRHDKYPCVSAQSTTLLSRDTNLLGMSSFDNAVESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHSRENPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPVYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINADSIDWDPKKLK
Ga0209701_1027110313300027862Vadose Zone SoilMGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINTDSIDWDPKKLK
Ga0209283_1001046453300027875Vadose Zone SoilMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINSDSIEWDPKKLK
Ga0209283_1005972043300027875Vadose Zone SoilVGMSSFDNAVESYEKQLRDWLSRCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHSRENPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINTDSIDWDPKKLK
Ga0209283_1010778623300027875Vadose Zone SoilMSSFDDSVKSYENKLRDWLSKCPDCVLSPQVALHARRGSAMVQVGQSFEFDERLRARDNPPNRLVANRLVPFNGKVKNPIIFWLNSSFPRDSIYQLHKQPVVHHVLRLGAGYTFVSLLEGLKTALKAEKIDIESLRDWDLKSWK
Ga0209283_1010931323300027875Vadose Zone SoilLLGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIRNPIIFWLNSSFPSDQIYQLHKQPVVHHVLRLGPGYTFTSVLGGLKTALKAEKINSDSIEWDPKKLK
Ga0209283_1042051213300027875Vadose Zone SoilWPATKITSTKMTLSTTLLSRGAIYLGMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGVKTALKAEKINADSLEWDPKKLK
Ga0209590_1028391213300027882Vadose Zone SoilTTLLSSAAVLERMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGVKTALKAEKINADSLEWDPKKLK
Ga0209590_1053701013300027882Vadose Zone SoilMGMSSFDNAVESYEKQLRDWLSKCPLCVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHLLARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGPGYTFTSVLGGVKTALKAEKINGDSIDWDPK
Ga0137415_1014537153300028536Vadose Zone SoilMSSFGDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMVQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDPIYQLHKQPVVHHVLRLGAGYTFTSVLVGLKTALKAEKINADSLEWDPKKLK
Ga0307473_1105890113300031820Hardwood Forest SoilPNCMLSPQVALHARRGTAMVQVGQSFEFDQRLKAYDDPPNYMVANRLVPFNGKVKNPIIFWLNTSFPRENVYQLHEQPVVHHVIRLGAGYTFAAVYEGLKTALKAEKVDIDSLGEWLPKKKKEKK
Ga0307471_10045633913300032180Hardwood Forest SoilMYLGMSSFDDAVESYEKQLRDWLSKCPACVLSPQVMLHARRGTAMIQVGQSFEFDERLHARDNPPNHILARQLVQYNGKIKNPIIFWLNSSFPSDSIYQLHKQPVVHHVLRLGAGYTFTSVLVGVKTALKAEKINADSLEWDPKKLK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.