NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F041671

Metagenome Family F041671

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F041671
Family Type Metagenome
Number of Sequences 159
Average Sequence Length 128 residues
Representative Sequence MTSVMPERRLALIHRVIVPNKEYPKEYAILVTDRRSVFIRHKKTRSSFVLRGEMRYGTALVTDVMPKTLEDYEQTSLESLTADSANFTVPHEALVSLVMRKEEPKFRLRDFFVWLTMR
Number of Associated Samples 92
Number of Associated Scaffolds 159

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 84.28 %
% of genes near scaffold ends (potentially truncated) 93.08 %
% of genes from short scaffolds (< 2000 bps) 86.16 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (56.604 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(36.478 % of family members)
Environment Ontology (ENVO) Unclassified
(52.830 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(58.491 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 22.60%    β-sheet: 17.12%    Coil/Unstructured: 60.27%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 159 Family Scaffolds
PF12773DZR 10.06
PF06197DUF998 3.77
PF02547Queuosine_synth 2.52
PF13793Pribosyltran_N 1.89
PF01384PHO4 1.89
PF01648ACPS 1.26
PF01638HxlR 1.26
PF13614AAA_31 1.26
PF00118Cpn60_TCP1 1.26
PF06545DUF1116 0.63
PF13337BrxL_ATPase 0.63
PF01070FMN_dh 0.63
PF06778Chlor_dismutase 0.63
PF01022HTH_5 0.63
PF00005ABC_tran 0.63
PF12840HTH_20 0.63
PF13147Obsolete Pfam Family 0.63

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 159 Family Scaffolds
COG3371Uncharacterized membrane proteinFunction unknown [S] 3.77
COG0809S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase)Translation, ribosomal structure and biogenesis [J] 2.52
COG0306Phosphate/sulfate permeaseInorganic ion transport and metabolism [P] 1.89
COG0459Chaperonin GroEL (HSP60 family)Posttranslational modification, protein turnover, chaperones [O] 1.26
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 1.26
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 0.63
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 0.63
COG3253Coproheme decarboxylase/chlorite dismutaseCoenzyme transport and metabolism [H] 0.63


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A56.60 %
All OrganismsrootAll Organisms43.40 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002557|JGI25381J37097_1023344All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1078Open in IMG/M
3300002560|JGI25383J37093_10182054All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon553Open in IMG/M
3300002561|JGI25384J37096_10006439All Organisms → cellular organisms → Bacteria4354Open in IMG/M
3300002561|JGI25384J37096_10246634Not Available521Open in IMG/M
3300002911|JGI25390J43892_10148384Not Available546Open in IMG/M
3300005167|Ga0066672_10106873Not Available1714Open in IMG/M
3300005167|Ga0066672_10671669All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon667Open in IMG/M
3300005178|Ga0066688_10011703All Organisms → cellular organisms → Bacteria → Proteobacteria4353Open in IMG/M
3300005178|Ga0066688_10567581Not Available729Open in IMG/M
3300005178|Ga0066688_10980108Not Available516Open in IMG/M
3300005180|Ga0066685_10469077All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon873Open in IMG/M
3300005181|Ga0066678_10012301All Organisms → cellular organisms → Bacteria4228Open in IMG/M
3300005446|Ga0066686_10554202All Organisms → cellular organisms → Archaea781Open in IMG/M
3300005468|Ga0070707_101038193Not Available785Open in IMG/M
3300005552|Ga0066701_10680431Not Available620Open in IMG/M
3300005555|Ga0066692_10015522All Organisms → cellular organisms → Archaea3738Open in IMG/M
3300005555|Ga0066692_10400822All Organisms → cellular organisms → Archaea870Open in IMG/M
3300005557|Ga0066704_10395861Not Available918Open in IMG/M
3300005558|Ga0066698_10237584All Organisms → cellular organisms → Archaea1254Open in IMG/M
3300005558|Ga0066698_10586003All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon753Open in IMG/M
3300005558|Ga0066698_10910298All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon562Open in IMG/M
3300005559|Ga0066700_10296775Not Available1138Open in IMG/M
3300005568|Ga0066703_10016699All Organisms → cellular organisms → Bacteria3695Open in IMG/M
3300005568|Ga0066703_10410159All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon813Open in IMG/M
3300005568|Ga0066703_10690301All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon588Open in IMG/M
3300005569|Ga0066705_10531424Not Available732Open in IMG/M
3300005586|Ga0066691_10137822Not Available1396Open in IMG/M
3300005598|Ga0066706_10453685Not Available1021Open in IMG/M
3300005598|Ga0066706_10622633Not Available856Open in IMG/M
3300005598|Ga0066706_10719993Not Available792Open in IMG/M
3300006032|Ga0066696_10199213Not Available1277Open in IMG/M
3300006794|Ga0066658_10675959Not Available568Open in IMG/M
3300006796|Ga0066665_10479091Not Available1023Open in IMG/M
3300006796|Ga0066665_10630625All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon856Open in IMG/M
3300006800|Ga0066660_10112260All Organisms → cellular organisms → Bacteria1954Open in IMG/M
3300006804|Ga0079221_10586776Not Available748Open in IMG/M
3300006804|Ga0079221_11547256All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon534Open in IMG/M
3300007258|Ga0099793_10484485Not Available614Open in IMG/M
3300009012|Ga0066710_100129099All Organisms → cellular organisms → Archaea3466Open in IMG/M
3300009012|Ga0066710_101965718All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon870Open in IMG/M
3300009038|Ga0099829_11191980Not Available631Open in IMG/M
3300009088|Ga0099830_10603922All Organisms → cellular organisms → Archaea900Open in IMG/M
3300009088|Ga0099830_11398872All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon582Open in IMG/M
3300009089|Ga0099828_10037689All Organisms → cellular organisms → Archaea3941Open in IMG/M
3300009089|Ga0099828_10150830Not Available2053Open in IMG/M
3300009089|Ga0099828_10213095All Organisms → cellular organisms → Archaea1725Open in IMG/M
3300009090|Ga0099827_10160389All Organisms → cellular organisms → Archaea1847Open in IMG/M
3300009090|Ga0099827_10690757Not Available882Open in IMG/M
3300009090|Ga0099827_10765160Not Available835Open in IMG/M
3300009090|Ga0099827_11070725Not Available700Open in IMG/M
3300009804|Ga0105063_1013678All Organisms → cellular organisms → Archaea895Open in IMG/M
3300009820|Ga0105085_1035711All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon883Open in IMG/M
3300010303|Ga0134082_10266780Not Available712Open in IMG/M
3300010304|Ga0134088_10593821Not Available550Open in IMG/M
3300010320|Ga0134109_10260952Not Available656Open in IMG/M
3300010333|Ga0134080_10675039Not Available510Open in IMG/M
3300010335|Ga0134063_10485839Not Available616Open in IMG/M
3300010336|Ga0134071_10016512All Organisms → cellular organisms → Archaea3062Open in IMG/M
3300010336|Ga0134071_10369236All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon727Open in IMG/M
3300010336|Ga0134071_10798103Not Available505Open in IMG/M
3300010359|Ga0126376_10979499Not Available844Open in IMG/M
3300011269|Ga0137392_11454706Not Available544Open in IMG/M
3300012096|Ga0137389_10638141Not Available915Open in IMG/M
3300012199|Ga0137383_10009501All Organisms → cellular organisms → Archaea6456Open in IMG/M
3300012199|Ga0137383_10300672All Organisms → cellular organisms → Archaea1175Open in IMG/M
3300012199|Ga0137383_10994012Not Available611Open in IMG/M
3300012199|Ga0137383_11336915Not Available509Open in IMG/M
3300012203|Ga0137399_10154606All Organisms → cellular organisms → Archaea1835Open in IMG/M
3300012203|Ga0137399_10938845All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon728Open in IMG/M
3300012206|Ga0137380_10083287All Organisms → cellular organisms → Archaea2927Open in IMG/M
3300012206|Ga0137380_10099249All Organisms → cellular organisms → Archaea2660Open in IMG/M
3300012206|Ga0137380_10114453Not Available2464Open in IMG/M
3300012206|Ga0137380_10323032All Organisms → cellular organisms → Archaea → TACK group → Crenarchaeota → unclassified Thermoproteota → Crenarchaeota archaeon 13_1_20CM_2_51_81380Open in IMG/M
3300012206|Ga0137380_10424492All Organisms → cellular organisms → Archaea1178Open in IMG/M
3300012206|Ga0137380_10468157All Organisms → cellular organisms → Archaea1113Open in IMG/M
3300012206|Ga0137380_10525658All Organisms → cellular organisms → Archaea1040Open in IMG/M
3300012206|Ga0137380_10527011All Organisms → cellular organisms → Archaea1039Open in IMG/M
3300012206|Ga0137380_11392964Not Available586Open in IMG/M
3300012207|Ga0137381_10170440Not Available1879Open in IMG/M
3300012207|Ga0137381_10247180All Organisms → cellular organisms → Bacteria1550Open in IMG/M
3300012207|Ga0137381_10673110Not Available900Open in IMG/M
3300012207|Ga0137381_11799220Not Available502Open in IMG/M
3300012209|Ga0137379_10316708Not Available1471Open in IMG/M
3300012209|Ga0137379_10485124All Organisms → cellular organisms → Archaea1144Open in IMG/M
3300012209|Ga0137379_10534843All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1079Open in IMG/M
3300012209|Ga0137379_11200832Not Available665Open in IMG/M
3300012209|Ga0137379_11622413All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon545Open in IMG/M
3300012210|Ga0137378_10076650Not Available3042Open in IMG/M
3300012210|Ga0137378_10475819All Organisms → cellular organisms → Archaea1155Open in IMG/M
3300012211|Ga0137377_11277868Not Available664Open in IMG/M
3300012349|Ga0137387_10132133All Organisms → cellular organisms → Bacteria1766Open in IMG/M
3300012349|Ga0137387_10569369Not Available822Open in IMG/M
3300012351|Ga0137386_10446281Not Available932Open in IMG/M
3300012351|Ga0137386_11220306Not Available525Open in IMG/M
3300012357|Ga0137384_10100190All Organisms → cellular organisms → Archaea2418Open in IMG/M
3300012357|Ga0137384_10750916Not Available791Open in IMG/M
3300012357|Ga0137384_11207988Not Available601Open in IMG/M
3300012362|Ga0137361_10512531All Organisms → cellular organisms → Archaea1101Open in IMG/M
3300012363|Ga0137390_10485054Not Available1211Open in IMG/M
3300012363|Ga0137390_10784900Not Available911Open in IMG/M
3300012918|Ga0137396_10491133Not Available910Open in IMG/M
3300012918|Ga0137396_10778611Not Available704Open in IMG/M
3300012918|Ga0137396_10960742Not Available622Open in IMG/M
3300012918|Ga0137396_11034124Not Available592Open in IMG/M
3300012944|Ga0137410_10898232Not Available749Open in IMG/M
3300012972|Ga0134077_10193714Not Available826Open in IMG/M
3300012972|Ga0134077_10354265Not Available626Open in IMG/M
3300012976|Ga0134076_10564889Not Available527Open in IMG/M
3300015358|Ga0134089_10033657All Organisms → cellular organisms → Archaea1816Open in IMG/M
3300015359|Ga0134085_10461874All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon577Open in IMG/M
3300017656|Ga0134112_10019835All Organisms → cellular organisms → Archaea2305Open in IMG/M
3300017656|Ga0134112_10176560Not Available829Open in IMG/M
3300017656|Ga0134112_10473756Not Available527Open in IMG/M
3300017659|Ga0134083_10032270Not Available1916Open in IMG/M
3300017659|Ga0134083_10228470Not Available774Open in IMG/M
3300017659|Ga0134083_10552216Not Available521Open in IMG/M
3300017934|Ga0187803_10162281All Organisms → cellular organisms → Bacteria → Atribacterota → unclassified Atribacterota → Candidatus Atribacteria bacterium HGW-Atribacteria-1880Open in IMG/M
3300018433|Ga0066667_11136858Not Available677Open in IMG/M
3300018468|Ga0066662_10123384All Organisms → cellular organisms → Bacteria1896Open in IMG/M
3300018468|Ga0066662_11419880Not Available719Open in IMG/M
3300018468|Ga0066662_12041320Not Available600Open in IMG/M
3300018482|Ga0066669_11505528Not Available611Open in IMG/M
3300021046|Ga0215015_10350359Not Available676Open in IMG/M
3300021046|Ga0215015_10681091Not Available599Open in IMG/M
3300021088|Ga0210404_10028592Not Available2472Open in IMG/M
3300021088|Ga0210404_10067528All Organisms → cellular organisms → Archaea1729Open in IMG/M
3300021476|Ga0187846_10108493Not Available1191Open in IMG/M
3300026277|Ga0209350_1022336All Organisms → cellular organisms → Archaea1944Open in IMG/M
3300026295|Ga0209234_1246285Not Available583Open in IMG/M
3300026297|Ga0209237_1065864Not Available1728Open in IMG/M
3300026298|Ga0209236_1054933All Organisms → cellular organisms → Archaea1965Open in IMG/M
3300026309|Ga0209055_1043571All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1970Open in IMG/M
3300026309|Ga0209055_1093100All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1222Open in IMG/M
3300026310|Ga0209239_1351884Not Available515Open in IMG/M
3300026313|Ga0209761_1094532Not Available1510Open in IMG/M
3300026313|Ga0209761_1306931Not Available549Open in IMG/M
3300026315|Ga0209686_1226770Not Available507Open in IMG/M
3300026325|Ga0209152_10020795Not Available2242Open in IMG/M
3300026325|Ga0209152_10410441All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon534Open in IMG/M
3300026328|Ga0209802_1113009All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1204Open in IMG/M
3300026328|Ga0209802_1319963Not Available513Open in IMG/M
3300026331|Ga0209267_1043717All Organisms → cellular organisms → Bacteria2056Open in IMG/M
3300026333|Ga0209158_1108489Not Available1050Open in IMG/M
3300026334|Ga0209377_1247183Not Available586Open in IMG/M
3300026529|Ga0209806_1226612All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon635Open in IMG/M
3300026530|Ga0209807_1013153All Organisms → cellular organisms → Archaea4074Open in IMG/M
3300026536|Ga0209058_1272528All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon585Open in IMG/M
3300026538|Ga0209056_10493424All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon647Open in IMG/M
3300026538|Ga0209056_10600347Not Available555Open in IMG/M
3300026540|Ga0209376_1363911All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon544Open in IMG/M
3300027748|Ga0209689_1046276Not Available2467Open in IMG/M
3300027748|Ga0209689_1103810Not Available1460Open in IMG/M
3300027846|Ga0209180_10013109All Organisms → cellular organisms → Bacteria4316Open in IMG/M
3300027875|Ga0209283_10205930All Organisms → Viruses → Predicted Viral1306Open in IMG/M
3300027875|Ga0209283_10262240Not Available1144Open in IMG/M
3300031753|Ga0307477_10957320Not Available563Open in IMG/M
3300032160|Ga0311301_11655585Not Available774Open in IMG/M
3300032180|Ga0307471_103357803Not Available567Open in IMG/M
3300032180|Ga0307471_103728246Not Available539Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil36.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil28.93%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.95%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.95%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.89%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.26%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.26%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.26%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.26%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.63%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.63%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil0.63%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.63%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.63%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.63%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009804Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40EnvironmentalOpen in IMG/M
3300009820Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017934Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_102334423300002557Grasslands SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDLIPKTLEDYQQTSVESLTADSTNIMVHHEAVISLAVKKEEQKPRKYDFFVRLTMRMRREEF
JGI25383J37093_1018205413300002560Grasslands SoilMSNVIQEKRLALLHRVMVPDEEYPKEYAVLVTDSKSIFILQKKTRGNFWLRREMRYGTALITDVVPKTLEDYQRTSLESLTADGANITIPHEAVISLALKKEVQK
JGI25384J37096_1000643943300002561Grasslands SoilMTSVMPERRXALIHRVIVPNKEYPKEYAILVTDRRSVFIRHKKTRSSFVLRGEMRYGTALVTDVMPKTLEDYEQTSLESLTADSANFTVPHEALVSLVMRKEEPKFRARDFFVWLTMRRQGEIFQVYDFE
JGI25384J37096_1024663413300002561Grasslands SoilVPNKEYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVIPKTLEDYEQTSLESLTADSANLTVPHEALVSLVMRKEEPKFRLRDFFVWLTMRRQGE
JGI25390J43892_1014838413300002911Grasslands SoilMLSQAAKGSDLVPERRLALIHRVIIPDEEYPKEYSILVTDQRSIFIRQEKTRSSFVLRQEMRIGTALVTDVIPKTLEDYEETSLQTLTADSRNLTVPHDSVTSLVM
Ga0066672_1010687313300005167SoilMIYETGEGSYAVSERKLALVHRVIVPDKDYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEQTNLESLAADASNIAVAHSAVISLVMSKGEPEFRLR
Ga0066672_1067166913300005167SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDLIPKTLEDYQQTSVESLTADSTNIMVHHEAVISLAMKKEEQQPRKYDFFVRLTMRMQREEFQVYDFEMNYRESPNSETMIKFYM
Ga0066688_1001170313300005178SoilLRNMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKNTRGNFWLRREISYGTALITDLIPKTLEDYQQTSVESLTADSTNIMVHHEAVISLAVKKEEQKPRKYDFF
Ga0066688_1056758113300005178SoilLNLAVSYETGAGSYVELERKLALLHRVIVPDKDYPKEYATLVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEKTSLKSLATDASNIAIPHDTVISLVMTKGE
Ga0066688_1098010813300005178SoilMISQVRQGPERVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDAIPKTLDDFEQTSPESLMADDSNLTVPHEAVISLAMRTEEHE
Ga0066685_1046907723300005180SoilMASVTPETRLALIHRVIVPGEEYPKEYAILVTDHRSIFVRQKSTRGNFWLRREMSYGTALVTDVVPKTLEDYEETSLDSLTADTANLAVPHEAAISLTLKKEEQKPRKYDFFVRLTMRMQKEVFQVYDFELVYR
Ga0066678_1001230113300005181SoilMTSVMPERRLALIHRVIVPNKEYPKEYAILVTDRRSVFIRHKKTRSSFVLRGEMRYGTALVTDVMPKTLEDYEQTSLESLTADSANFTVPHEALVSLVMRKEEPKFRLRDFFVWLTMR
Ga0066686_1055420223300005446SoilMRLALIHRVIVPDEEYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVIPKTLEDYAQSTLESLTADSANLTVPHEALISLVMRREEPKFRARDFLIWLTMRRQGEIFQVYDFEMNYR
Ga0070707_10103819323300005468Corn, Switchgrass And Miscanthus RhizosphereMTGSVPEKTLALIHRVIAPDKEYPKEYAVLVTDRRSIFIRLEKTRSSYWLRGEMKFGTALMTDVIPKTLEDYEQTNVESLIADNANLTIPHEKVTSLVLRKEEPEFGAREFFVWLTMRRQ
Ga0066701_1068043113300005552SoilMPERRLALIHRVIVPNKEYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVIPKTLEDYEQTSLESLTADSANLTVSHEALVSLVMRKEEPKFRLRDFFVWLTMRRQGEIFQVYDFEM
Ga0066692_1001552213300005555SoilMMYETGEESYAAHERRLALVHRVIVADKDYPKEYVILVTDRRSIFIPQRKTRSSFVLRGEMRYGTALVTDVEPRTLEDYEQTSLEWLAADASNIAIPHDAVISLFMTKGEPKFRLRDFFVWLTMRRQGHKFHVYD
Ga0066692_1040082213300005555SoilMVPERRLALIHRVIVPDKEYPKEYAILVTDRRSIFIRQSRTSGNFWLRREMSYGTALITNVVPKTLEDYEQTSLESLTADSANLSVPHEEVISLAMQKEEQKPHAYDFFVRLTMRMQGEEFRVYYFEMNYRQIQNPETKIMFY
Ga0066704_1039586113300005557SoilLEMMYETGEESYAAHERRLALVHRVIVADKDYPKEYVILVTDRRSIFIPQRKTRSSFVLRGEMRYGTALVTDVEPRTLEDYEQTSLEWLAADASNIAIPHDAVISLFMTKGEPKFRLRDFFVWLTMRRQGHKFHVYD
Ga0066698_1023758423300005558SoilMLSQAAKGSDLVPERRLALIHRVIIPDEEYPKEYSILVTDQRSIFIRQEKTRSSFVLRQEMRIGTALVTDVIPKSLEDYEETSLQTLTADSRNLTVPHEAVT
Ga0066698_1058600313300005558SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDVVPKTLEDYEQTGVESLTADSTNIMVPHEAVISLAMKKEEQKPRKYDFFVRLTMRMQQEEFQVYDFEMNYRESRNSETMIKFYMVPLG
Ga0066698_1091029823300005558SoilMTSVTTETRLALIHRVIVPDEEYPKEYAILLTDRSSIFVRQNSTRGNFWLRREMSYGTALVTDVLPKTLEDYEQTSLDSLTADTANLTVPHEAVISLTLKKEEQKPRKYD
Ga0066700_1029677523300005559SoilLEMMYETGEGSYTVPERKLALVHRVIMPDKDYPKEYALLVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEHTSLVSLAADASNIAVPHDAVVSLVMTKEGRISVYGTCSSG*
Ga0066703_1001669913300005568SoilVTTETRLALIHRVIVPDEEYPKEYAILVTDRQSIFIRQKSTRGNFWLRREMSYGTALVTDVAPKTLEDYEQTSLDSLTADPANLVVPHQLAISLRLKKEEQKLHKYDF
Ga0066703_1041015913300005568SoilMTGTVTEKRLALIHRVIIPDEEYPKEYAILVTDHRSIFIRQKKSRSNFWLRREISYGTALITDIAPKTLEDYEQSSLESLTDDSSNITVPHEWVVSLALKKDMQKPRRYDFFVRLTMRLQREEFQVY
Ga0066703_1069030113300005568SoilVIQEKRLALLHRVMIPDEEYPKEYAVLVTDSKSIFILQKKTRGNFWLRREMRYGTALITDVVPKTLEDYQRTSLESLTADGANITIPHEAVISLALKKEVQKPRAYDFFVRLTMRMQGEE
Ga0066705_1053142423300005569SoilLNLAVSYETGAGSYVELERKLALVHRVIVPDKAYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEKTSLKSLATDASNIAIPHDTVISLVMTKGEPKFR
Ga0066691_1013782233300005586SoilLNLAVSYETGAGSYVVLERKLALVHRVIVPDKDYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEKTSLKSLATDASNIAIPHDTVISLVMTKGEPKFRLRDFFVW
Ga0066706_1045368513300005598SoilMLPERRLALIHRVILPDKDYPKEYAILVTDRRSIFIRQKKTRRSFVLRYEMRIGTALVTDVIPKTLEDYEQTSLESLTADDANLTVPHEAVISLAMRAEEHEHRKRDFFLWLTMRRQEEIFQVYNFEMKYRPGPDQ
Ga0066706_1062263323300005598SoilLEMMYEAAEGPYTAPERKVGLVHRVIVPDKDYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVQPKTLEDYDQTSLESLATDASNIAIPHDTVISLVMTKGEPKFR
Ga0066706_1071999323300005598SoilMISQVRQGPETVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDAIPKTLDDFEQTSPESLMADDSNLTVPHEAVISLAMRTEEHERRK
Ga0066696_1019921313300006032SoilLNLAVSYETGAGSYVELERKLALVHRVIVPDKDYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEKTSLKSLATDASNIAIPHDTVISLVMTKGEPKFRLRDFFVWLTMRRQGHKFHVYDFEMNY
Ga0066658_1067595923300006794SoilLEMMYEAAEGPYTAPERKVGLVHRVIVPDKDYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVQPKTLEDYDQTSLESLATDASNIAIPHDTVISLVMTKGEPKFRLQDLFIWLTMRRQGHKFHVYDFEMNYRDNANLETRIRFYM
Ga0066665_1047909113300006796SoilVPERKLALVHRVIMPDKDYPKEYALLVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEHTSLVSLAADASNIAVPHDAVVSLVMTKEGRISVYGTCSSG*
Ga0066665_1063062513300006796SoilMASVTPETRLALIHRVIVPGEEYPKEYAILVTDRRSIFVRQKSTRGNFWLRREMSYGTALVTDVVPKTLEDYEETSLDSLTADTANLAVPHEAAISLTLKKEEQKPRKYDLFVRLTMRMQKEVFQVYDFEL
Ga0066660_1011226023300006800SoilMMYEAAEGPYTAPERKVGLVHRVIVPDKDYPKEFAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEQTNLESLAAEASNIAVAHSAVISLVMSKGEPEFR*
Ga0079221_1058677613300006804Agricultural SoilLNSATLYETGAGSFTVPERRLALIHRVNVPDKDYPKEHAVLVTDRRSIFIRQNRTRSSFVLRGEMRYGTALVTDVQPKTLEDYEGTSVESLAADASNVTVPHTSVTSLVMTEGEPKFRLQDLWVWLTMKRQGHEFRVYDFEMSYADSANQKT
Ga0079221_1154725613300006804Agricultural SoilMIPEASVALIHRVIVPDGESPREYAILVSDRRSIFIRLNKTRGDFFLRREMSYGTALITDVVPKTLGDYEKTSLESLSADTANITVPHDAVISIALKKEVKKPSRLDLLVRLTMRMQK
Ga0099793_1048448513300007258Vadose Zone SoilMTGSVPEKTLALIHRVIAPDKEYPKEYAVLVTDRRSIFIRLEKTRSSYWLPGEMKFGTALMTDVIPKTLEDYEQTSVESLIADNANLTIPHEKVTSLVLRKEEPEFRAREFFVWLTMRR
Ga0066710_10012909913300009012Grasslands SoilVSGSVPEKRLALIHRVIVPDKEYPKEYALLVTDSRSIFIRLEKTRSSYWLRGEMKFGTALVTDVIPKTLEDYERIGLESLTGDKTNLTVPHEDVTSLALKKEEPQFRAREFFVW
Ga0066710_10196571823300009012Grasslands SoilMASVTPETRLALIHRVIVPGEEYPKEYAILVTDRRSIFVRQKSTRGNFWLRREMSYGTALVTDVVPKTLEDYEETSLDSLTADTANLAVPHEAAISLTLKKEEQKPRKYDFFVRLTMRMQKEVFQVYDFELVYRQT
Ga0099829_1119198023300009038Vadose Zone SoilMPESRLALIHRVIVPDKEYPREYAVLVTDSRSVFIRQKKTRSSFVLRGEIRFGTALVTDVIPKTVEDYEQTSLESLMADSANLTVPHGMVISLVMRKEEQKFHLPDLFIWLTMRRQGHKFHVYDFEMNYRDSASLEAGIRFYMVPLGFVSSREDKPRPEKRFSVNTLRTHSR
Ga0099830_1060392213300009088Vadose Zone SoilMTSETTEKRLALIHRVIVPDEESPKEYAVLVTDRRCIFIRQKSTRGNFWLRREMSYGTAIITDVVPKTLEDYERTSPDSLTLDASNLTVSHEEIVSLALKKEEQKHRKYDFFVRLTMRMQKEVFQVYDFEMVYRPSPNSETTTKFYMVPL
Ga0099830_1139887213300009088Vadose Zone SoilMTSSIPERRLALIHRVMVPDKEYPKEYAVLLTDRRSIFIRQEKTRGNFWLRREMRYGTALITDVTPKTLEGYEQTTLESLAADSSNLVVPHEAVISLALRKEEQKPRKYDF
Ga0099828_1003768933300009089Vadose Zone SoilMTSETTEKRLALIHRVIVPDEESPKEYAVLVTDRRSIFIRQKSTRGNFWLRGEMSYGTALITDVVPKTLEDYEGNSLDSLTLDAANLTVSNEEIISLALKKEEQRPRIRDFFVRLTMRMQKGSVPSIRL*
Ga0099828_1015083013300009089Vadose Zone SoilMESPERRVALIHRVMVPYEASPKEYAILVTDRRSIFIRQERTRSSFVLRGEMRYGTALVTDVVPKTLDDYEQTTLDSLVADTENLTVSHEAVVTLELKKEVPQFRVRDFFIWLTMRRQGE
Ga0099828_1021309513300009089Vadose Zone SoilMIVRAVFEKGMTGKMPERRLALIHRVIVPDKEYPKEYAVLVTDSRSVFIRQKKTRSSFVLRGEMRFGTALVTDVIPKTLEDYEQTSLESLTADSANLTVPHEMVISLVMRKEEQKFHLRDLFIWLTMRRQGHKFQVYDFEMNYRQSPKGETGIRFYMVP
Ga0099827_1016038953300009090Vadose Zone SoilMLEKPMIDSVPERRLALIHRVIVPDEEYPKEYAILVTDRRSIFIRQDKTRGNFWLRHEMRYGTALITDAAPKTLEDYVQTTLELLVADSSNLVVPHEAVISLGLKKEVQKPRAYDFFVRLTMRMQR
Ga0099827_1069075723300009090Vadose Zone SoilMETLEKRIAIIHRVIIPDEDSPEEYAVLVTDKRSVLIRQKKTRSAFVLRGEMRYGTALVTDAVPKTLEDYEETTLESLTYDSKNLTIPHGSVLSLVLKKEVSKFRLRDAFSWLTMRRQGEIF*
Ga0099827_1076516013300009090Vadose Zone SoilVLEKEMSSDLTERRLALIHRVIVRDQKYLAEYAILVTDTRSIFIRQKKTRSSYWLRGEMKFGTALVTDVIPKTLEDYEQTSLESLTADPANITAPHETVVSLVMEKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKYQQSQKLEPLIKFYMVPLG
Ga0099827_1107072513300009090Vadose Zone SoilMLIIENSARHAMSFPWDEKTKERVTSLIPERRLALIHRVIVPDKEYPREYAILLTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVTPKILEDYEPNSLESLTADSANLTVPHESLISLVMRKEPPKFRLRDFFVCLTMRRQREVFQVYDFEMN
Ga0105063_101367823300009804Groundwater SandMTSVIPERRLALIHRVIVPDKDYPKEYAILVTEGRSIFIRQRKTRSSFVLRGEMRFGTALVTDVIPKTLEDYEQTSLESLTGDSANLTIPHEAVISLVMRSEEQKFRGY
Ga0105085_103571123300009820Groundwater SandMNKIGEAVLGKGMTTVIPERRLALIHRVIVPDKEYPKEYAILVTEGRSIFIRQKKTRSSFVLRGEMRFGTALVTDVIPKTLEDYEQTSLESLTGDSANLTIPHEAVISLVMRSEEQKFRGYDFFIWL
Ga0134082_1026678013300010303Grasslands SoilMISQVKQGSETVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVTPKTLEDYAQNSLESLIADDVNLTVPHDAVISIAMRTEEHERRRRDFFLWLVMR
Ga0134088_1059382123300010304Grasslands SoilMNTSVSVKAGRLMETPEKRVAIIHRVIIPDEDFPKEYAVLVTDRRSVLIRQRKTRSTFVLRGEMRYGTALVTDVVPKTLEDYEKTTLESLASDSENLTIPHSSELSLV
Ga0134109_1026095213300010320Grasslands SoilMPDKDYPKEYALLVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEQTNLESLAADASNIAVAHSAVISLVMTKGEPNFRLQDLFIWLTMRRQGHKFHVYDF
Ga0134080_1067503923300010333Grasslands SoilMETEEKRVALIHRVMIPDEDFPKEYAILVTNRQSIFISQKKSRSAFVLRGQMRYGTALVTDVVPKTLDDYEKTDLESLASERENLTIPHSSVVSLVLKREVPKFRLRDLFVWLTM
Ga0134063_1048583913300010335Grasslands SoilMMYETGEGSYAVPERKLALVHRVVMSDKDYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYDQTSLESLATDASNIAIPHDTVISLVMTKGEPKFRLRDFFVWLTMRRQGHKFHVYDFEMNYRDNTNLETGIRFF
Ga0134071_1001651213300010336Grasslands SoilMISQVKQGSETVPERRLALVHRVIVPDKEYPKEYAILVTDCRSVFIRQRKTRRSFVLRYEMRIGTALVTDVTPKTLEEYAQTSLESLIADDVNLTVPHDAVISIAMRTEEHERRRRDFFLWLVMR
Ga0134071_1036923613300010336Grasslands SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDVVPKTLEDYEQTSVESLTADSTNIMVPHEAVISLAMKKEKQKPRK
Ga0134071_1079810313300010336Grasslands SoilALTGAMNTSVSVKAGRLMETPEKRVAIIHRVIIPDEDSPKEYTVLVTDRRSVLIRQKKTRSTFVLRGEMRYGTALVTDVVPKTLEDYEKTTLESLASDSETLTIPHSSVLSLVLKKEVPKFRLRDTFIWLTMRRQGEIFQVYSFEITYLKN*
Ga0126376_1097949923300010359Tropical Forest SoilMKTQESRIALVHRVIVPDEKLPREYAILVTDKRSIFIHQEKTRSSFVLRGEMRYGTALVTDNVPKTLEDYKETSLESLMLDKENESIPHSSVTSLVLKKEVPRFRLRDAFVWLTM
Ga0137392_1145470613300011269Vadose Zone SoilMTPETKLALIHRVIVPDKEYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVIPKTLEDYTQSSLESLTAVSANLTVPHESLVSLVMRREEPKFRARDIFIWLTMRRQGE
Ga0137389_1063814113300012096Vadose Zone SoilMIREKGPGSDQASERRLALVHRVIVQDEEYPREYAILVTDNRSIFIRQARTRRSFVLRGEMRYGTALVTDVEPKTLEDYEQTSLESLAADAANLTVPHEAVISLAMRKGEPKFRLRDSFIWLTMRRQGHKFHVYDFEMNYWQSSNH
Ga0137383_1000950113300012199Vadose Zone SoilMNSLPPETRLALIHRVIVPDEKYPTEYAILVTDTRSIFIRQKKTRSSYWLRGEMKFGTALVTDVIPKTLEDYEQTSLGSLTTDPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKYRQ
Ga0137383_1030067223300012199Vadose Zone SoilVLEKEMSSHLTERRLALIHRVIVPDEKYPTEYAILVTDSRSVFIRQKKTRSSYWLRGEMKFGTALVTDVIPKILEDYEQTSLESLTADPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKYRQSPN
Ga0137383_1099401213300012199Vadose Zone SoilMLSQAAKGSDVVPERRLALIHRVIIPDEQYPKEYSILVTDQRSIFIHQEKTRSSFVLRQEIRIGTALVTDVIPKSLEDYEETSLQTLTADSRNLTVPHEAVTSLVMKADK
Ga0137383_1133691513300012199Vadose Zone SoilMEMLEKMVAIIHRVIVPDKDFPKEYAVLVTDRRSVLIRQKKTRSSFVLRGEMRYGTALVTDVVPKTLEDYEKTTLESLASDSENLTIPHTSVLSLVLKKDFPKSRLRDTFIWLTMRRQGEIFQVYYLEMTYLKNQNEQRRVQM
Ga0137399_1015460623300012203Vadose Zone SoilMYETGNGSYAGPERRLVLIHRVIVPDKEYPREHAILVTDRRSIFIGQKKSRSGFVLRSETRYGTALVTDTQPKTLEDYENMSLGSLVADASNIIIPHSSAISLVMTKGEPEFRLRDLFIWLTMRRQGHKFHVYDFEMNYW
Ga0137399_1093884523300012203Vadose Zone SoilLLEKEMSSLSPEARLALIHRVIVPDKKYPTEYSVLVTDSRSVFIRQTKTRSSYWLRGEMKFGTALVTDVMPKTLKDYEQTSLGSLTADPANITVPHEAVVSLVMGKEEPEFRAREFFVWL
Ga0137380_1008328733300012206Vadose Zone SoilVLEKEMSSHLAERRLALIHRVIVPDQKYPAEYAILVTDTRSIFMRQEKTRSSYWLRGEMKFGTALVTDVIPKTLEDFEQTSLESLTADSANITVPHETVVSLVMGKEEPEFRAREFFCLANDEKTRAQIPSVQL*
Ga0137380_1009924943300012206Vadose Zone SoilMPESRLALIHRVIVPDKEYPREYAVLVTDSRSVFIRQKKTRSSFVLRGEIRFGTALVTDVIPKTVEDYEQTSLESLMADSANLTVPHGMVISLVMRKEEQKFHLPDLFIWLTMRRQGHKFQVYDFEMNYRQSPKG
Ga0137380_1011445313300012206Vadose Zone SoilLFETGERSYTVPERRLALVHRVIVPDADYPKEYAVLVTDRRSILIRQKKTRARFVLRGEMRYGTALVTDVQLKTLEDYENASLESLATEVSNIAVPHDAV
Ga0137380_1032303213300012206Vadose Zone SoilMMSQTGEMSDMGSETRLALIHRVIVPDQDYPREYAILVTDRRSIFIRQRKTRSSFVLRYEMRVGTALVTDVTPKTLEDYEQTSLEALTADDGNLTVPHEAVISLVLRADEPEHRRRDFFLWLTMKRQGEVFQVFNFEMNYRLSSNQDAKVK
Ga0137380_1042449213300012206Vadose Zone SoilMTSVTTETRLALIHRVIVPDEEYPKEYAILLTDRSSIFVRQNSTRGNFWLRREMSYGTALVTDIVPKTLEDYEQTSLDSLTADTANLTVPHEAVISLTLKKEEQKSRKYDFFVRLTMRMQKEVFQVYDFELVYRQSPNSETMIKFYMVPL
Ga0137380_1046815723300012206Vadose Zone SoilVLEKEMSSHLTERRLALIHRVIVPDQKYPTEYAILVTDSRSVFIRQKKTRSSYWLRGEMKFGTALVTDVIPKTLEDYEQTSLGSLSADPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRR
Ga0137380_1052565833300012206Vadose Zone SoilMNSLPPETRLALIHRIIVPDEKYPTEYAILVTDTRSIFIRQKKTRSSYWLRGEMKFGTALVTDVIPKTLEDYEQTSLGSLTTDPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQ
Ga0137380_1052701113300012206Vadose Zone SoilMSNVIQEKRLALIHRVIVPDEEYPKEYAILVTESRSVFIRQKKTRGNFWLRREMSYGTALITDVVPKSLEDYEQTSLESLTADSANLTIPHEAVISLALKKEVQKLRAYDFFVRLTMRMQREEFQVYDFEMNYRQAPNSQVMIK
Ga0137380_1139296413300012206Vadose Zone SoilMMYEAAEGPYTAPERKVGLVHRVIVADKDYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYENASLESLATEVSNIAVPHDAVISLLMTKGEPNFRLQDLFIWLTMRRQGHKFHV
Ga0137381_1017044033300012207Vadose Zone SoilMMYETGGSHTAPERKLALVHRVIMPDKDYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEQTNLESLAADASNIAVAHSAVISLVMSKGEPE
Ga0137381_1024718013300012207Vadose Zone SoilLFEKEMNSLPPETRLALIHRVIVPDEKYPTEYAILVTDTRSIFIRQKKTRSSYWLRGEMKFGTALVTDVIPKTLEDYEQTSLGSLTTDPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKYRQ
Ga0137381_1067311023300012207Vadose Zone SoilMPDKDYPKEYALLVTDRRSIFIRQEKTRNSFVLRGEMRYGTAMVTDVQPKTLEDYEQTNLESLAADASNIAVAHSAVISLVMSKGEPEFRLRDFFIWLTHEKAGAQVPRVRL*
Ga0137381_1179922023300012207Vadose Zone SoilVLEKEMSSDLTERRLALIHRVIVSDQKYPAEYAILVTDTRSIFIRQKKTRSSYWLRGEMKFGTALVTDVIPKSLQDYEQTSLESLTGDPTNVTVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQ
Ga0137379_1031670813300012209Vadose Zone SoilMMYETGGSHTAPERKLALVHRVIMPDKDYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEQTNLESLAADASNIAVAHSAVISLVMSKGEPEFRLRDFFIWLTMRRQGHKFHVYDFEMNY
Ga0137379_1048512413300012209Vadose Zone SoilVLEKEMSSHLTERRLALIHRVIVPDQKYPTEYAILVTDSRSVFIRQKKTRSSYWLRGEMKFGTALVTDVIPKTLEDYEQTSLGSLSADPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKY
Ga0137379_1053484313300012209Vadose Zone SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREINYGTALITDVVPKTLEDYEQTSVESLTADSTNIMVPHEAVISLAMKKEEQKPRKHDFFVRLTMRMQREEFQVYDFEMNYRESPNSETM
Ga0137379_1120083213300012209Vadose Zone SoilVLEKEMSSDLTERRLALIHRVIVSDQKYPAEYAILVTDTRSIFIRQKKTRSSYWLRGEMKFGTALVTDVIPKSLQDYEQTSLESLTCDPTNVTVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNF
Ga0137379_1162241313300012209Vadose Zone SoilMTSVTTETGLALIHRVIVPDEGYPKEYAILLTDRSSIFVRQNSTRGNFWLRREMSYGTALVTDVVPKTLEDYEQTSLDSLTADTANLTVPHEAVISLTLKKEEQKPRKYDFFVRLTMRMQKEVFQVYDFELVYRQSPNSET
Ga0137378_1007665033300012210Vadose Zone SoilMETLEKRIAIIHRVIIPDEGSPKEYAVLVTDKRSVFIRQKKTRSAFVLRGEMRYGTALVTDAVPKTLEDYEETTLESLAYDSKNLTVPHGSVLSLVLKKEVSKFRLRDAFIWLTMRRQGEIFQVYYLEITYLKN*
Ga0137378_1047581923300012210Vadose Zone SoilVLEKEMSSHLTERRLALIHRVIVPDEKYPTEYAILVTDSRSVFIRQKKTRSSYWLRGEMKFGTALVTDVIPKILEDYEQTSLESLTADPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKYRQSPNAEPSI
Ga0137377_1127786813300012211Vadose Zone SoilVLEKEMSSHLTERRLALIHRVIVPDEKYPTEYAILVTDSRSVFIRQKKTRSSYWLRGEMKFGTALVTDVIPKILEDYEQTSLESLTADPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKYRQSP
Ga0137387_1013213313300012349Vadose Zone SoilVLEKEMSSHLAERRLALIHRVIVPDQKYPAEYPILVTDTRSIFMRQEKTRSSYWLRGEMKFGTALVTDVIPKTLEDFEQTSLESLTADSANITVPHETVVSLVMGKEEPEFRAREFF
Ga0137387_1056936913300012349Vadose Zone SoilMPESRLALIHRVIVPDKEYPREYAVLVTDSRSVFIRQKKTRSSFVLRGEIRFGTALVTDVIPKTVEDYEQTSPESLMADSANLTVPHGMVISLVMRKEAQKFHLPHLLIWLRMRRQGHKFQVYDFEMNYRQSPKGETGIRFYMVP
Ga0137386_1044628113300012351Vadose Zone SoilMEMLEKMVAIIHRVIVPDKDFPKEYAVLVTDRRSVLIRQKKTRSSFVLRGEMRYGTALVTDVVPKTLEDYEKTTLESLASDSENLTIPHTSVLSLVLKKDFPKSRLRDTFIWLTMRRQGEIFQVYYLE
Ga0137386_1122030613300012351Vadose Zone SoilMSILVAPWEEIASLIPEKRLALIHRVIEPHEDYPREYAILVTDRRSIFIRQRKTRAGFVLRGEMRYGTALVTDVIPKNLEDYEQNSEESLTADNTNLTVPHEEVVSLVMRNGEPKFRMRDLFIWLT
Ga0137384_1010019013300012357Vadose Zone SoilMISQVRQGPERVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVTPKTLEDYAQNSLESLIADDVNLTVPHDAVI
Ga0137384_1075091623300012357Vadose Zone SoilVLEKEMSSHLTERRLALIHRVIVPDQKYPTEYAILVTDSRSVFIRQKKTRSSYWLRGEMKFGTALVTDVIPKTLEDYEQTSLGSLSADPANITVPHEAVVSLVMGKEEPEFRAREFFVWLTMRRQGHRFQVYNFEMKYRQSPN
Ga0137384_1120798813300012357Vadose Zone SoilMPESRLALIHRVIVPDKEYPREYAVLVTDSRSVFIRQKKTRSSFVLRGEIRFGTALVTDVIPKTVEDYEQTSLESLMADSANLTVPHGMVISLVMRKEEQKFHLPALFIWLTMRRQGHKFQVYDFEMNY
Ga0137361_1051253113300012362Vadose Zone SoilVLEKGMARAMLESRLALIHRVLVQDKDYPKEYAILVTDSRSIFIRQEKTRSSYWLRGEMKFGTALMTDVIPKTLEDYEQTSLESLTTDTANLTVPHETVTSLVMRKEEPEFRAREFFVWL
Ga0137390_1048505413300012363Vadose Zone SoilMPESKLALIHRVIVQDKEYPREYAVLVTDSRSVFIRQKKTRSSFVLRGEMRFGTALVTDVIPKTVEDYEQTSLESLMADSANLTVPHGMVISLVMRKEEQKFHLR
Ga0137390_1078490013300012363Vadose Zone SoilMETLERRVAIIHRIIVPDQDFPKEYALLVTDSRSVLIRQKKTRSSFVLRGEMRYGTALVTDAIPKTLEDYEETTLESLAYDSKNLTIPHGSVLSLVLKKEV
Ga0137396_1049113313300012918Vadose Zone SoilVTLKKRLALIHRVIIPDKEYPKEYAILVTDTRSIFIHQRKTRSSFILRGETRFGTALVTDVKPKSLHDYEQTTIESLTADKENITIPHETVISLVMKTEEQKFHARDLFVWL
Ga0137396_1077861123300012918Vadose Zone SoilMTNVAIENRLALIHRVITPDEEDPREYAILITNTKSIFIRQKKTRRGFVLRGEMRFGTALVTDVKPKTIEDYDKTSHESLTTDTANISVPHEAVSSLALRTEEQKFRARDFFVWLTMRRQGHKFQVYDFEMRYQNTPNHE
Ga0137396_1096074213300012918Vadose Zone SoilVPDKEYPKEYAILVTDARSIFIRQKKTRSSFVLRGEMRFGTALVTDVAPKSLEDYEQTSLESLTADTANITVPHEAVISLALRTEEQKFRARDFFIWLTMRRQGHK
Ga0137396_1103412413300012918Vadose Zone SoilMTSAVLERRLAMIHRVTVPDEECPREYAILLTGRRSIFVRQQRTRSSFVLRGEMRYGTALVTDTIPKTLEDYDKTSLETLMADSANINVPHEDVISLMMKKEELRFR
Ga0137410_1089823213300012944Vadose Zone SoilVTSEKRLALIHRVLVPDKEYPKEYAILVTDARSIFIRQKKTRSSFVLRGEMRFGTALVTDVAPKSLEDYEQTSLESLTADPANITVPHEAVISLALRTEEQKFRARDFFIWLTM
Ga0134077_1019371413300012972Grasslands SoilMETPEKRVAIIHRVIIPDEDFPKEYAVLVTDRRSVLIRQRKNRSTFVLRGEMRYGTALVTDVVPKTLEDYETKPLESLATDSENLTIPHSSVLSLVLKKEVPKFRLRDTFIWLTMRCQGEIFQVYYLEITYLKNQNE
Ga0134077_1035426513300012972Grasslands SoilMMCETGEGSYVVHERRMALVHRVIVPDKGYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVQPKTLEDYENISLESLAVDASNIAVPHEAVVSLVMTKGEPNFRLRDLFVWLTMRRQGHKFHVY
Ga0134076_1056488923300012976Grasslands SoilMETPEKRVAIIHRVIIPDEDFPKEYAVLVTDRRSVLIRQRKTRSTFVLRGEMRYGTALVTDVVPKTLEDYEKTTLESLASDPENLTIPHSSVLSLVLKKDIPKFRLRDTLIW
Ga0134089_1003365723300015358Grasslands SoilMISQVKQGSETVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVTPKTLEDYAQTSLESLIADDVNLTVPHDAVISIAMRTEEHERRRRDFFLWLVMRRQG
Ga0134085_1046187413300015359Grasslands SoilMNPLGNGQAVISNSELVPEKRLTLIHRVIVPDEDYPKEYAILVTDRRLVFIHQNKTRSTFWLRREMSYETALVTDVVPKTLEDYEQTSLDSLTADTANLTVPHEAVISLTLKKEEQKPRKYDFLVRLTMRMQKEVFQVYDF
Ga0134112_1001983513300017656Grasslands SoilMISQVKQGSETVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVTPKTLEDYAQNSLESLIADDVNLTVPHDAVISIAMRTEEHERRRRDFFLWLVMRRQG
Ga0134112_1017656023300017656Grasslands SoilMETLERRVAIIHRVIIPDADFPKEYAVLVTDRRSVLIRQKKTRSNFVLRGEMRYGTALVTDVVPKTLEDYETKPLESLATDSENLTIPHSSVLSLVLKKEVPKFRLRDTF
Ga0134112_1047375623300017656Grasslands SoilLNSATLYETGEGSYTVSERRLAIIHRVIVTDKDYPKEYGVLVTDGRSVFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEGTNMESLVSDASNLAIPHDSVTSIVMTEGEPKFRLQDLWVWLTMRRQGHKF
Ga0134083_1003227013300017659Grasslands SoilMETPEKRVAIIHRVIIPDEDFPKEYAVLVTDRRSIFIRQTKARSTVVLRGEMRYGTALVTDVVPKTLEDYENTNLESLASDSENLTIPHSSVLSLVLKKEVPKFRLRDTFIWLTMRRQGEIFQVYSFEITYLKN
Ga0134083_1022847013300017659Grasslands SoilLEMMYETGEESYAAHERRLALVHRVIVADKDYPKEYVILVTDRRSIFIPQRKTRSSFVLRGEMRYRTALVTDVEPRTLEDYEQTSLEWLAADASNIAIPHDAVISLFMTKGEPKFRLRDFFIWLTMRRQGHKFHVYDFEMDYRDSANQETKLR
Ga0134083_1055221613300017659Grasslands SoilLNSATLYETGEGSYTVSERRLAIIHRVIVTDKDYPKEYAVLVTDRRSVFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEGTTIESLVSDASSLAIPHDSVTSIVMTEGEPKFRLQDLWVWLTMRRQGHKFHVYD
Ga0187803_1016228113300017934Freshwater SedimentMSQSAKGSDVVLERRLALVHRVIVPDKEYPKEYAILVTDRRSIFIHQKKTRSSFVLRGEIRYGTALVTDVQPKTLEDYESTSLESLASDASNIAVSHDSVISLVMTKGEPEFRLQDLFVWLTMKRQ
Ga0066667_1113685813300018433Grasslands SoilMISQVSRGPEPVPERRLARVHQAMVHDKDIHKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVTPKTLEEYAQTSLESLIADDVNLTVPHDAVISIAMRTEEHER
Ga0066662_1012338413300018468Grasslands SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFSLRREISYGTALITNVVPKTLEDYEQTSVESLTAAITNIMVPHEAVISLAMKKEEQKPRKHDFFVRLTMRMQREEFQVYDFEMS
Ga0066662_1141988013300018468Grasslands SoilMTSVMPERRLALIHRVIVPNKEYPKEYAILVTDRRSVFIRHKKTRSSFVLRGEMRYGTALVTDVMPKTLEDYEQTSLESLTADSANFTVPHEALVSLVMRKEEPKFR
Ga0066662_1204132013300018468Grasslands SoilLEMMYETGEESYAAHERRLALVHRVIVPDKDYPKEYVILVTDRRSIFIPQRKTRSSFVLRGEMRYGTALVTDVEPRTLEDYEQTSLEWLAADASNIAIPHDAVISLFMTKGEPKFRLRDFFVWLTMRRQGHKFHVYD
Ga0066669_1150552813300018482Grasslands SoilMISQVKQGSETVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVTPKTLEDYAQNSLESLIADDVNLTVPHDAVISIAMRTEEHERRRRDFFLWLVMRRQGEVFQVYDFEMKYRLGPHQDATVKFYMVP
Ga0215015_1035035913300021046SoilMAILYETGKGPCTVPERRLALIHRVIVPDKDYPKEYAILVTDRRSVFIRQKKTRSGFVLRGEMTYGTALVTDVQPKTLEDYEGTNIESLAADTSNIAIPNDSVTSLVMTEGEPKFRLQDLFVWLTMRRQGHKFHAVSYTHLTLPTICSV
Ga0215015_1068109113300021046SoilMPERRLALIHRAIVPDTDYPKEYAVLVSDSRSIFISQKKTRSGFVLRGEMRYGTALVTDVQPKTLEDYEGTDLESLAADASNIAIPHDSVTSLVMTKGEPKFRLQDLFVWLTMRRQEHKFHVYD
Ga0210404_1002859213300021088SoilLNSATLYETGEGSYVVPERRLALIHRVIVPDTDYPKEYAVLVTDRRSIFIRQKKTRSSFLLRGEMRYGTALVTDVQPKTLEDYEGTSTESLAADASNVTIPHMSVTSLVM
Ga0210404_1006752853300021088SoilMTGSGTEKRIALIHRVILPHKESPKEYAVLVTDIRSIFIRMEKTRSSYWLRGEMKFGTALLTDVMPKTLEDYEQIGLEALAADNANFTVPHENVTSLILRKEEPEFRAREFFVWLTM
Ga0187846_1010849313300021476BiofilmMIQDSRLALIHRVIVPDKEYPKEYAILVTDRRSIFIRQDKTRSSFVLRGEMRYGTALVTDVAPKNLEDYGRTSLESLEVGGENFTVPHEAVVSLAMGKKEPEFRFRDLFIWLTIWRQGEIFQVYNFEMNYRQSSGRDVRT
Ga0209350_102233613300026277Grasslands SoilMISQVRQGPERVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVIPKTLDDFEQTSPESLMADDSNLTVPHEAVISLAMRTEEHERRKRDLFLWLTMRRQGEVFQVYN
Ga0209234_124628513300026295Grasslands SoilMTGSVPEKRLALIHRVIVPDKEYPKEYAVLVTDSRSIFIRLEKTRSSYWLRGEMKFGTALMTDVMPKTLEDYEQTSVESLIADNANLTIPHEKVTSLVLRKEEPEFRAREFFVWLTMRRQGHRFQVYDFEMGYRQSSND
Ga0209237_106586433300026297Grasslands SoilMMYETGEESYAAHERRLALVHRVIVADKDYPKEYVILVTDRRSIFIPQRKTRSSFVLRGEMRYGTALVTDVEPRTLEDYEQTSLEWLAADASNIAIPHDAVISLFMTKGEPKFRLRDFFVWLTMRRQGHKFHVYDFEMNYRDNANLEARIRFY
Ga0209236_105493333300026298Grasslands SoilMISQVRQGPERVPERRLALVHRVIVPDKEYPKEYAILVTDRRSVFIRQRKTRRSFVLRYEMRIGTALVTDVIPKTLDDFEQTSPESLMADDSNLTVPHEAVISLAMRTEEHE
Ga0209055_104357113300026309SoilVTTETRLALIHRVIVPDEEYPKEYAILVTDRQSIFIRQKSTRGNFWLRREMSYGTALVTDVAPKTLEDYEQTSLDSLTADPANLVVPHQLAISLRLKKEEQKLHKYDFFVRLTMRMQKEVFQVYDFD
Ga0209055_109310023300026309SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDLIPKTLEDYQQTSVESLTADSTNIMVHHEAVISLAMKKEEQQ
Ga0209239_135188413300026310Grasslands SoilVHERRMALVHRVIVPDKGYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVQPKTLEDYENISLESLAVDASNIAVPHEAVVSLVMTKGEPNFRLRDLFVWLTMRRQGHKFHVYG
Ga0209761_109453233300026313Grasslands SoilMMYETGKGSYLAPERKLALVHRVIMPDKDYPKEYSVLVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVQPKTLEDYENISPGTLAADASNIAVPHDAVISLAMTKGEPNFRLQDLFI
Ga0209761_130693123300026313Grasslands SoilLEMMYETGEGSYTVPERKLALVHRVIMPDKDYPKEYALLVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEHTSLVSLAADASNIAVPHDAVVSLVMTKGEPNFRLRDLFIWLTMRRQGHKFHVYDFEMNYRDNANL
Ga0209686_122677013300026315SoilLEMMYEAAEGPYTAPERKVGLVHRVIVPDKDYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVQPKTLEDYDRTSLESLATDASNIAIPHDTVISLVMTKGEPKFRLQDLFIW
Ga0209152_1002079533300026325SoilMMYEAAEGPYTAPERKVGLVHRVIVPDKDYPKEYAILVTDRRSIFIRQEKTRSSFVLRGEMRYGTALVTDVQPKTLEDYDRTSLESLATDASNIAIPHDTVISLVMTKGEPKFRLQDLFI
Ga0209152_1041044113300026325SoilMTSTVPERRLALIHRVIVPDQEHLKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDLIPKTLEDYQQTSVESLTADSTNIMVHHEAVISLAMKKEEQQPRKYDFFV
Ga0209802_111300933300026328SoilMSNVMQERRLALIHRVIVPDEEYPKEYAILVTDRRSIFIRQKKTRGNFWLRREMSYGTALITDVVPKTLEDYEQTSMESLTADSANLTVPHDALISLAMKKEEQKPRAYDFFVRLTMRMQREEFQVYDFEM
Ga0209802_131996313300026328SoilVTMVGLVASVPIVSGSYRKNLGPDGYGGPNDLEVMYETAEGPYTAPERKVGLVHRVIMPDKDYPKEYAILLTDRRSIFIRQGKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEQTNLESLAADASNIAIPHVAVISLFMTKGEPKFRLRDFFIWLTMRRQGHKFHVYDFE
Ga0209267_104371733300026331SoilLRNMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKNTRGNFWLRREISYGTALITDLIPKTLEDYQQTSVESLTADSTNIMVHHEAVISLAVKKEEQKPRKY
Ga0209158_110848923300026333SoilLNLAVSYETGAGSYVVLERKLALVHRVIVPDKGYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEKTSLKSLATDASNIAIPHDTVISLVMTKGEPKFRLRDFFVWLTMRRQGHKFHVYD
Ga0209377_124718313300026334SoilMTSVMPERRLALIHRVIVPNKEYPKEYAILVTDRRSVFIRHKKTRSSFVLRGEMRYGTALVTDVMPKTLEDYEQTSLESLTADSANFTVPHEALVSLVMRKEEPKFRARDFFVWLTMRRQGEIFQV
Ga0209806_122661213300026529SoilVTTETRLALIHRVIVPDEEYPKEYAILVTDRQSIFIRQKSTRGNFWLRREMSYGTALVTDVAPKTLEDYEQTSLDSLTADPANLVVPHQLAISLRLKKEEQKLHKYDFFVR
Ga0209807_101315373300026530SoilLNLAVSYETGAESYVELERKLALVHRVIVPDKAYPKEYAILVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLKDYEKTSLKSLATDASNIAIPHDTVISLVMTKGEPKFRLRDFFVWLTM
Ga0209058_127252813300026536SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDVVPKTLEDYEQTGVESLTADSTNIMVPHEAVISLAMKKEEQKPRKYDF
Ga0209056_1049342423300026538SoilMASVTPETRLALIHRVIVPGEEYPKEYAILVTDRRSIFVRQKSTRGNFWLRREMSYGTALVTDVVPKTLEDYEETSLDSLTADTANLAVPHEAAISLTLKKEEQKPRKYDFFVRLTMRMQKEVFQVYDFELVYRQ
Ga0209056_1060034713300026538SoilGLEMMYETGEGSYTVPERKLALVHRVIMPDKDYPKEYALLVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEHTSLVSLAADASNIAVPHDAVVSLVMTKEGRISVYGTCSSG
Ga0209376_136391113300026540SoilMTSTVPERRLALIHRVIVPDQEHPKEYAILVTDRRSIFIRQKKTRGNFWLRREISYGTALITDVVPKTLEDYEQTGVESLTADSTNIMVPHEAVISLAMKKEEQKPRKYDFFVRLTMRMQ
Ga0209689_104627643300027748SoilLNLAVSYETGAGSYVELERKLALLHRVIVPDKDYPKEYATLVTDRRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKTLEDYEKTSLKSLATDASNIAIPHDTVISLVMTKGEPNFRLQDLFIWLTMRRQGHKFNVYDFEMNYRDSGNQETKLRFYM
Ga0209689_110381033300027748SoilMMYETGEGSYTVPERKLALVHRVIMPDKDYPKEYALLVTDRRSIFIRQEKTRSSFVLRGEMRYGTAMVTDVQPKTLEDYEHTSLVSLAADASNIAVPHDAVVSLVMTKEGRISVYGTCSS
Ga0209180_1001310913300027846Vadose Zone SoilMTGSVPEKTLALIHRVIAPDKEYPKEYAVLVTDRRSIFIRLEKTRSSYWLRGEMKFGTALMTDVIPKTLEDYEQTNVESLIADNAILTIPHEKVTSLVLRKEEPEFRAREFFVWLTMRRQGHRF
Ga0209283_1020593033300027875Vadose Zone SoilMVYETGEGSYTAPERKLALVHRVIMPDKDYPKEYAILVTDRRSIFIHQNKTRISFVLRGEMRYGTALVTDVQPKTLEDYANISLESLAADASNIAVPHDGVISLVMTKGEPSFRLQDLFI
Ga0209283_1026224043300027875Vadose Zone SoilMIVRAVFEKGMTGKMPERRLALIHRVIVPDKEYPKEYAVLVTDSRSVFIRQKKTRSSFVLRGEMRFGTALVTDVIPKTLEDYEQTSLESLTADSANLTVPHEMVISLVMRKEEQKFHLRDLFIWLTMRRQGHKFQVYDFEMNYRQSPKGETGIRFYLVPLG
Ga0307477_1095732023300031753Hardwood Forest SoilMHPGADSRRESPVGQHAGAERRLALIHRVLVPGEDHPKEYSILVTDKRSVFIRQAKTRSAFVLRGEMRYGTALVTDVVPKTLEDYERETLDSLTAEDGNLAVPHDSVTSLAMKEDALEFRLRDLFV
Ga0311301_1165558513300032160Peatlands SoilMFQTKSGRESTAGQPADSERRLALIHRVIVPGEDYPKEYAIMVTDRRSVFIRQAKTRNAFVLRGEMRYGTALVTDVVPKTLEDYKLTSLDSLAGEDGDVTVSHDSVTSFAMGKEEPEFRLRDLFVWLTMKRQGETFQVYNFQMDW
Ga0307471_10335780313300032180Hardwood Forest SoilLAVLYETGEGSYTLPERRLALIHRVIVSDTDYPREYAVLVTDRRSIFIRQKKTRSGFVLRGEMRYGTALVTDVQPKTLEDYERTKIEFLAADASNIVVPHDTVTSLVMTEGEPKFRLQDLWVWLTMKRQGH
Ga0307471_10372824613300032180Hardwood Forest SoilLNSATLYETGEGSYTVPERRLALIHRVIVPDRDYPKEYAVLVTDSRSIFIRQKKTRSSFVLRGEMRYGTALVTDVQPKSLEDYAETSLESLATEASSISIPHRSVTSLVMTEGEPKFRLQDLWVWLT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.