NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F069893

Metagenome / Metatranscriptome Family F069893

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F069893
Family Type Metagenome / Metatranscriptome
Number of Sequences 123
Average Sequence Length 197 residues
Representative Sequence NGQSQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSSETSEETSIEIVEPGRKSELSEPNKKSSVEELTIQPKPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR
Number of Associated Samples 87
Number of Associated Scaffolds 123

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Archaea
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 96.75 %
% of genes from short scaffolds (< 2000 bps) 85.37 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score0.24

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (93.496 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(45.529 % of family members)
Environment Ontology (ENVO) Unclassified
(49.593 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.033 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 51.32%    β-sheet: 0.00%    Coil/Unstructured: 48.68%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.24
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 123 Family Scaffolds
PF14947HTH_45 1.63
PF01493GXGXG 1.63
PF05048NosD 1.63
PF04326AlbA_2 0.81
PF01956EMC3_TMCO1 0.81
PF07690MFS_1 0.81
PF06224HTH_42 0.81
PF01555N6_N4_Mtase 0.81
PF01503PRA-PH 0.81
PF01645Glu_synthase 0.81
PF03237Terminase_6N 0.81
PF12367PFO_beta_C 0.81
PF01209Ubie_methyltran 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 123 Family Scaffolds
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 0.81
COG0863DNA modification methylaseReplication, recombination and repair [L] 0.81
COG1041tRNA G10 N-methylase Trm11Translation, ribosomal structure and biogenesis [J] 0.81
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 0.81
COG1422Archaeal YidC/Oxa1-related membrane protein, DUF106 familyCell wall/membrane/envelope biogenesis [M] 0.81
COG2189Adenine specific DNA methylase ModReplication, recombination and repair [L] 0.81
COG2226Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenGCoenzyme transport and metabolism [H] 0.81
COG22272-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol methylaseCoenzyme transport and metabolism [H] 0.81
COG2865Predicted transcriptional regulator, contains HTH domainTranscription [K] 0.81
COG3214DNA glycosylase YcaQ, repair of DNA interstrand crosslinksReplication, recombination and repair [L] 0.81


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms97.56 %
UnclassifiedrootN/A2.44 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10124200All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Brachycera → Muscomorpha → Eremoneura → Cyclorrhapha → Schizophora → Acalyptratae → Ephydroidea → Drosophilidae → Drosophilinae → Drosophilini → Drosophila → Hawaiian Drosophila → picture wing clade → grimshawi clade → grimshawi group → grimshawi subgroup → Drosophila grimshawi728Open in IMG/M
3300002562|JGI25382J37095_10014770All Organisms → cellular organisms → Bacteria → Proteobacteria2976Open in IMG/M
3300002909|JGI25388J43891_1035296All Organisms → cellular organisms → Archaea798Open in IMG/M
3300002912|JGI25386J43895_10161412All Organisms → cellular organisms → Archaea561Open in IMG/M
3300002916|JGI25389J43894_1049329All Organisms → cellular organisms → Archaea713Open in IMG/M
3300005167|Ga0066672_10033428All Organisms → cellular organisms → Archaea2834Open in IMG/M
3300005172|Ga0066683_10249782All Organisms → cellular organisms → Archaea1099Open in IMG/M
3300005174|Ga0066680_10927687All Organisms → cellular organisms → Archaea515Open in IMG/M
3300005177|Ga0066690_10135807All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1608Open in IMG/M
3300005180|Ga0066685_10003912All Organisms → cellular organisms → Bacteria7360Open in IMG/M
3300005180|Ga0066685_10159749All Organisms → cellular organisms → Archaea1536Open in IMG/M
3300005180|Ga0066685_10538888All Organisms → cellular organisms → Archaea806Open in IMG/M
3300005518|Ga0070699_101968553All Organisms → cellular organisms → Archaea534Open in IMG/M
3300005540|Ga0066697_10082473All Organisms → cellular organisms → Archaea → TACK group → Crenarchaeota → unclassified Thermoproteota → Crenarchaeota archaeon 13_1_20CM_2_51_81857Open in IMG/M
3300005553|Ga0066695_10363842All Organisms → cellular organisms → Archaea904Open in IMG/M
3300005555|Ga0066692_10477399All Organisms → cellular organisms → Archaea795Open in IMG/M
3300005556|Ga0066707_10238846All Organisms → cellular organisms → Archaea → TACK group1183Open in IMG/M
3300005557|Ga0066704_10106091All Organisms → cellular organisms → Archaea1849Open in IMG/M
3300005558|Ga0066698_10312132All Organisms → cellular organisms → Archaea1089Open in IMG/M
3300005558|Ga0066698_10633884All Organisms → cellular organisms → Archaea715Open in IMG/M
3300005559|Ga0066700_10978242All Organisms → cellular organisms → Archaea558Open in IMG/M
3300005569|Ga0066705_10009566All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon4535Open in IMG/M
3300005574|Ga0066694_10044134All Organisms → cellular organisms → Archaea2017Open in IMG/M
3300005575|Ga0066702_10405202All Organisms → cellular organisms → Archaea833Open in IMG/M
3300005576|Ga0066708_10911454All Organisms → cellular organisms → Archaea548Open in IMG/M
3300005586|Ga0066691_10792020All Organisms → cellular organisms → Archaea559Open in IMG/M
3300005598|Ga0066706_11043694All Organisms → cellular organisms → Archaea628Open in IMG/M
3300006034|Ga0066656_10475601All Organisms → cellular organisms → Archaea814Open in IMG/M
3300006796|Ga0066665_11172073All Organisms → cellular organisms → Archaea586Open in IMG/M
3300006804|Ga0079221_10944645All Organisms → cellular organisms → Archaea639Open in IMG/M
3300007255|Ga0099791_10178061All Organisms → cellular organisms → Archaea999Open in IMG/M
3300007258|Ga0099793_10512469All Organisms → cellular organisms → Archaea597Open in IMG/M
3300007265|Ga0099794_10297799All Organisms → cellular organisms → Archaea835Open in IMG/M
3300007265|Ga0099794_10446992All Organisms → cellular organisms → Archaea678Open in IMG/M
3300009012|Ga0066710_101268783All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1142Open in IMG/M
3300009012|Ga0066710_102961157All Organisms → cellular organisms → Archaea663Open in IMG/M
3300009038|Ga0099829_10272071All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1385Open in IMG/M
3300009038|Ga0099829_10635430All Organisms → cellular organisms → Archaea887Open in IMG/M
3300009088|Ga0099830_11353916All Organisms → cellular organisms → Archaea592Open in IMG/M
3300009089|Ga0099828_11008340All Organisms → cellular organisms → Archaea742Open in IMG/M
3300009090|Ga0099827_11126884All Organisms → cellular organisms → Archaea681Open in IMG/M
3300009090|Ga0099827_11949693All Organisms → cellular organisms → Archaea511Open in IMG/M
3300009137|Ga0066709_100619963All Organisms → Viruses → Predicted Viral1544Open in IMG/M
3300009137|Ga0066709_101801443All Organisms → cellular organisms → Archaea859Open in IMG/M
3300010301|Ga0134070_10068865All Organisms → cellular organisms → Archaea1206Open in IMG/M
3300010301|Ga0134070_10318808All Organisms → cellular organisms → Archaea596Open in IMG/M
3300010303|Ga0134082_10073983All Organisms → cellular organisms → Archaea1328Open in IMG/M
3300010337|Ga0134062_10004422All Organisms → cellular organisms → Archaea4989Open in IMG/M
3300010895|Ga0138113_179680All Organisms → cellular organisms → Archaea587Open in IMG/M
3300011270|Ga0137391_11055716All Organisms → cellular organisms → Archaea659Open in IMG/M
3300011271|Ga0137393_10795298All Organisms → cellular organisms → Archaea808Open in IMG/M
3300012199|Ga0137383_10328854All Organisms → cellular organisms → Archaea1119Open in IMG/M
3300012201|Ga0137365_10048118All Organisms → cellular organisms → Archaea → TACK group → Crenarchaeota → unclassified Thermoproteota → Crenarchaeota archaeon 13_1_20CM_2_51_83235Open in IMG/M
3300012201|Ga0137365_10499717All Organisms → cellular organisms → Archaea894Open in IMG/M
3300012201|Ga0137365_11333332All Organisms → cellular organisms → Archaea508Open in IMG/M
3300012202|Ga0137363_11811159All Organisms → cellular organisms → Archaea503Open in IMG/M
3300012203|Ga0137399_10187009All Organisms → cellular organisms → Archaea1675Open in IMG/M
3300012204|Ga0137374_10076463All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon3266Open in IMG/M
3300012206|Ga0137380_10009111All Organisms → cellular organisms → Bacteria8985Open in IMG/M
3300012206|Ga0137380_10509120All Organisms → cellular organisms → Archaea1060Open in IMG/M
3300012206|Ga0137380_10876877All Organisms → cellular organisms → Archaea772Open in IMG/M
3300012206|Ga0137380_11241957All Organisms → cellular organisms → Archaea630Open in IMG/M
3300012206|Ga0137380_11259647All Organisms → cellular organisms → Archaea624Open in IMG/M
3300012206|Ga0137380_11485609All Organisms → cellular organisms → Archaea562Open in IMG/M
3300012207|Ga0137381_10017042All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_1_20CM_2_51_125686Open in IMG/M
3300012207|Ga0137381_10599999All Organisms → cellular organisms → Archaea959Open in IMG/M
3300012207|Ga0137381_10623871All Organisms → cellular organisms → Archaea939Open in IMG/M
3300012207|Ga0137381_10826988All Organisms → cellular organisms → Archaea802Open in IMG/M
3300012207|Ga0137381_11737181All Organisms → cellular organisms → Archaea514Open in IMG/M
3300012209|Ga0137379_10023546All Organisms → cellular organisms → Archaea5916Open in IMG/M
3300012209|Ga0137379_10651938All Organisms → cellular organisms → Archaea959Open in IMG/M
3300012209|Ga0137379_11279075All Organisms → cellular organisms → Archaea640Open in IMG/M
3300012211|Ga0137377_11802210All Organisms → cellular organisms → Archaea531Open in IMG/M
3300012349|Ga0137387_10328428All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1107Open in IMG/M
3300012349|Ga0137387_11265620All Organisms → cellular organisms → Archaea518Open in IMG/M
3300012351|Ga0137386_10383481All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1012Open in IMG/M
3300012356|Ga0137371_10459040All Organisms → cellular organisms → Archaea985Open in IMG/M
3300012356|Ga0137371_11205548All Organisms → cellular organisms → Archaea565Open in IMG/M
3300012359|Ga0137385_10048075Not Available3818Open in IMG/M
3300012359|Ga0137385_10407555All Organisms → cellular organisms → Archaea1158Open in IMG/M
3300012359|Ga0137385_10623122All Organisms → cellular organisms → Archaea905Open in IMG/M
3300012359|Ga0137385_10686294All Organisms → cellular organisms → Archaea856Open in IMG/M
3300012359|Ga0137385_10713492All Organisms → cellular organisms → Archaea → TACK group837Open in IMG/M
3300012360|Ga0137375_10632311All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_1_40CM_4_53_4885Open in IMG/M
3300012361|Ga0137360_11494931All Organisms → cellular organisms → Archaea580Open in IMG/M
3300012362|Ga0137361_11854603All Organisms → cellular organisms → Archaea520Open in IMG/M
3300012390|Ga0134054_1260790All Organisms → cellular organisms → Archaea560Open in IMG/M
3300012392|Ga0134043_1137683All Organisms → cellular organisms → Archaea722Open in IMG/M
3300012398|Ga0134051_1308872All Organisms → cellular organisms → Archaea808Open in IMG/M
3300012400|Ga0134048_1011387All Organisms → cellular organisms → Archaea593Open in IMG/M
3300012409|Ga0134045_1173883All Organisms → cellular organisms → Archaea521Open in IMG/M
3300012918|Ga0137396_10000812All Organisms → cellular organisms → Archaea15550Open in IMG/M
3300012918|Ga0137396_10097668All Organisms → cellular organisms → Archaea2092Open in IMG/M
3300012918|Ga0137396_10172408All Organisms → cellular organisms → Archaea1583Open in IMG/M
3300012918|Ga0137396_11156774All Organisms → cellular organisms → Archaea549Open in IMG/M
3300012927|Ga0137416_10172506All Organisms → cellular organisms → Archaea1714Open in IMG/M
3300012972|Ga0134077_10092996All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1160Open in IMG/M
3300014150|Ga0134081_10035107All Organisms → cellular organisms → Archaea1464Open in IMG/M
3300014150|Ga0134081_10199521All Organisms → cellular organisms → Archaea679Open in IMG/M
3300014154|Ga0134075_10540672All Organisms → cellular organisms → Archaea525Open in IMG/M
3300015241|Ga0137418_10902917All Organisms → cellular organisms → Archaea649Open in IMG/M
3300017654|Ga0134069_1263482All Organisms → cellular organisms → Archaea602Open in IMG/M
3300017657|Ga0134074_1198823All Organisms → cellular organisms → Archaea711Open in IMG/M
3300017659|Ga0134083_10287316All Organisms → cellular organisms → Archaea695Open in IMG/M
3300017961|Ga0187778_10563482All Organisms → cellular organisms → Archaea761Open in IMG/M
3300018431|Ga0066655_10011588All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon3845Open in IMG/M
3300018433|Ga0066667_10209296All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1442Open in IMG/M
3300018433|Ga0066667_10459032All Organisms → cellular organisms → Archaea1042Open in IMG/M
3300021046|Ga0215015_10445982All Organisms → cellular organisms → Archaea533Open in IMG/M
3300025922|Ga0207646_10315890All Organisms → cellular organisms → Archaea1412Open in IMG/M
3300026297|Ga0209237_1254440All Organisms → cellular organisms → Archaea539Open in IMG/M
3300026298|Ga0209236_1130756All Organisms → cellular organisms → Archaea1094Open in IMG/M
3300026298|Ga0209236_1231502All Organisms → cellular organisms → Archaea628Open in IMG/M
3300026328|Ga0209802_1059165All Organisms → cellular organisms → Archaea1858Open in IMG/M
3300026343|Ga0209159_1135503All Organisms → cellular organisms → Archaea1004Open in IMG/M
3300026523|Ga0209808_1258080All Organisms → cellular organisms → Archaea556Open in IMG/M
3300026538|Ga0209056_10590295All Organisms → cellular organisms → Archaea561Open in IMG/M
3300026540|Ga0209376_1001777All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon21186Open in IMG/M
3300026547|Ga0209156_10373585All Organisms → cellular organisms → Archaea605Open in IMG/M
3300027875|Ga0209283_10661391All Organisms → cellular organisms → Archaea656Open in IMG/M
3300031820|Ga0307473_11307723All Organisms → cellular organisms → Archaea543Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil45.53%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil23.58%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil13.82%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil12.20%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.63%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.81%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.81%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.81%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002909Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010895Grasslands soil microbial communities from Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012390Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012392Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012398Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012400Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012409Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1012420013300002558Grasslands SoilLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTSKRLLQNVGSSETVEETAVEVVEPAKKTVAEPSKKSVVEGKLANQPKPGRGKWGRRFIMASGVLAILYLAYYYIWPLLPK*
JGI25382J37095_1001477033300002562Grasslands SoilWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRXHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGASESSEEAAVELVEPGKKSELSEPSKRSAIEENLAIQSKPHRSRWGRRFIMATGVLAILYLAWEYIWPLLR*
JGI25388J43891_103529613300002909Grasslands SoilRKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVSALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSALQEKITVQARPHRSKKGSRVSMVVVAAGVLTILFLAWQYIWPLLR*
JGI25386J43895_1016141213300002912Grasslands SoilLKKLSGTVSVKDSNGQLQKMRRVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPPVSKKWIMRNVGLRETSEETAVELVEPDKSEPSQPNRKSAVEEKLAIQTRPHRGK
JGI25389J43894_104932913300002916Grasslands SoilFKRGMRQGLVVMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPPVTKRWIMRNVGAIETSQETAFEIVEPGGKSELRDPEKKSAVEEKLTIQTRPHRGKWGRRFLMATGVLAILYLAWAYLRPLLR*
Ga0066672_1003342823300005167SoilNGQLQKMRKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWLLRNVGSSETSEETSGELVEPSGKSEPSQPGNRSAVEEELAIQSAPHRGKWGRRFLMATGALAILYLAWEYILPLLR*
Ga0066683_1024978223300005172SoilSGTVAVKDSNGRLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDIRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSAFQEKITVQARPDRSKKGRRVSMVVVAAGVLTILFLAWQYIWPLLR*
Ga0066680_1092768713300005174SoilDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSIETSEETAVELVEPVKKSELSEPSKKSAVEQELTIQPKPHRGKWGRRFIM
Ga0066690_1013580713300005177SoilQKMRKIDYWLDKWYDIRGELFKRGMKQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGASETSEETAVEIVEPGRKSELSEPSKRSMVEEKLAIQPKPHGGRWGRRFVVATGVLVILYLAWEYLWPLFR*
Ga0066685_1000391213300005180SoilADDMTKALKKLSGTVAVKDSNGQLQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSREASEETAGELVEPDKKSEQSEPSKKSAVEQKRTIQTQPHRGKWGRRFVMATGVLAILYLAWEYIRPLLR*
Ga0066685_1015974913300005180SoilDSNGQLQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRIPRPPVSKKWLMRNVGSSETSEETSVELVDPIKKSELSEPSKKSVAEERLTIQSKPHRGKWGRRFVIGTGVLAIAYLAWEYVLPLLSQWIHL*
Ga0066685_1053888813300005180SoilYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSSETSEETSIEIVEPGRKSELSEPNKKSSVEELTIQPKPHRGRWGRRFIIAAGVLVILYLAWEYIWPLLR*
Ga0070699_10196855313300005518Corn, Switchgrass And Miscanthus RhizosphereGQVQKMRTVDYWLDKWYDIRGELFKRGMRQGLVVMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKIEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRTPVTSKRFLQNVGSSETMEEKVVEIVEPARKSAVEAGKKSVVEGKLAIQSKPGRGKWGRRFI
Ga0066697_1008247323300005540SoilNGQSQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSSETSEETSIEIVEPGRKSELSEPNKKSSVEELTIQPKPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR*
Ga0066695_1036384213300005553SoilWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDIRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSAFQEKITVQARPDRSKKGRRVSMVVVAAGVLTILFLAWQYIWPLLR*
Ga0066692_1047739913300005555SoilKPPKAVLLFADDMTKALKKLSGTVAVKDSNGQLQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPAVSKKWLLRNVGSSETSEETTIEIVEPSRKSELREPEKKSAVEEKITIQTRPHRGKWGRRFLMATGVLAILYLAWAYVWPLLR*
Ga0066707_1023884613300005556SoilLFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSSETSEETSIEIVEPGRKSELSEPNKKSSVEELTIQPKPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR*
Ga0066704_1010609113300005557SoilNGQVQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPPISKKWIMRNVGSSETAEETTVELVEPIKKSELSEPDKKSAVEEEPAIQPTPHRGRWGRRFIMATGVLAILYLAWEYVLPLISLWVHL*
Ga0066698_1031213213300005558SoilLFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPLVSKKWILRNVGSNEISEESAVEIVEPNRKSEPSELGKKSLVEEKLTIRTKPHRGKWVRRILMATGVLTILFLAWEYIWPLLK*
Ga0066698_1063388413300005558SoilEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDASKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDSNGHKKKSALQEKITVQPMPHQSKKGQRVSIVIMAAGVLTVLFFAWKYIWPLLR*
Ga0066700_1097824213300005559SoilYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETVEETAVEMVEPLKKSELSQPGKKSAVEPNKKSLVEEKIAIQPRPRRGKWGRRFIMATGVLAML
Ga0066705_1000956663300005569SoilLLFADDMTKALKKLSGTVSVKDSNGQLQKMRRVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSSETSEETSIEIVEPGRKSELSEPNKKSSVEELTIQPKPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR*
Ga0066694_1004413413300005574SoilWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDASKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVETSEELVVEDDEPKKSVVDPEGSKKRSVLQEKIAVHARPHRSKRGRRASIVIIAAGVLMIAFLAWEYIWPLLR*
Ga0066702_1040520213300005575SoilAVLLFADDMTKALKKLSGTVAVKDSNGQVQKMKKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPSVSKKWIMRNVGPKESSEVTAVELVEPSGKSEPKEPGKRSRVEEELTIQTRPHRGKWGRRFVMATGVLSILYLAWKYILPLILQ*
Ga0066708_1091145413300005576SoilGTVAVKDSNGQSQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRTPVSKKWIMRNVGSSETSEDTSIEIVEPGRKSELSEPNKKSSVEELTIQPKPHRGRW
Ga0066691_1079202013300005586SoilDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPSVSKKWIMRNVGPKESSEETAVELVEPSGKSEPKEPGKRSRVEEELTIQTRPHRGKWGRRFVMATGVLAILYLAWKYILPLI
Ga0066706_1104369413300005598SoilDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPLVSKKWILRNVGSNEISEESAVEIVEPNRKSEPSELGKKSLVEEKLTIRTKPHRGKWVRRILMATGVLTILFLAWEYIWPLLK*
Ga0066656_1047560123300006034SoilKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRSVGSSETSRETAVELVEPNKKSELSESSKKSAVEGELTIQPRPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR*
Ga0066665_1117207313300006796SoilLFADDMTKALRKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSALQEKITVQARPHR
Ga0079221_1094464513300006804Agricultural SoilRPPKAVVLFADDMTKALKKLSGTVAVKDSNGQLQKMKTVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGTVVFYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPITKRWIMKNVGSMETREETALELVEPDKKSELRVPSRRSELSEPRKRSVVEEKPIVRSKPHQSR
Ga0099791_1017806113300007255Vadose Zone SoilDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETMEETAVEIVEPLKKSELSQPGKKSAVEPNKKSLVEEKIAIQPKPRLGKWGRRFIMATGVLAILYLAYYYIWPLIPK*
Ga0099793_1051246913300007258Vadose Zone SoilVLFADDMTKALKKLSGSVAVKDSNGQVQKMKKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPSVSKKWIMRNVGPKESSEETAVEVVEPGKRTELSETSKKSAVEEKLTIQTRPHRGKW
Ga0099794_1029779913300007265Vadose Zone SoilGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLENVGSSETVEEKVVEIVEPARKSAVESGKKSVVEEKLAIQSKPGRGKWGRRFIMASGVLAILYLAYYYIWPLIPK*
Ga0099794_1044699213300007265Vadose Zone SoilNGQVQKMKKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPSVSKKWIMRNVGPIETSQETAVELVEPDKTSELSQPRKSAVEEKLTIQTRPHRGKWGRRFLMGTGVLAILYLAWAYLWPLLK*
Ga0066710_10126878333300009012Grasslands SoilKPPRAVLLFADDMTKALKKLSGTVAVKDSNGQLQKMRRVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWLLQNVGSRETSEETAVEIVEPIKKSELNEPSKKSVSEEKLTIQPKPHRGKWGRRFIIGTGVLAILYLAWEYILPLLSQWIHL
Ga0066710_10296115713300009012Grasslands SoilLFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPPVSKKWLMRNVGSSETFQETAVEIVEPTKKSEQSEPGKKSAIEEKLTIQPKPHRGKWGRRFIIGTAVLALLYLA
Ga0099829_1027207113300009038Vadose Zone SoilNMDKLFKRFRKPPKAVVLFADDMTKALKKLSGTVAVKDSNGHVQKMKKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWLMRNVGSTSEGSEESAVALVEPDKRSEVSEPSKRSVAKERLTVQPEPHRGKWARRVFMATGVLAILYLAWAYLWPLLR*
Ga0099829_1063543023300009038Vadose Zone SoilKKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSENVEETAVEIVEPAKKSAVEPNKKSLVEEKIAIQPKPRRGKWGRRFIMATGVLAMLYLAWNYLWPLIPK*
Ga0099830_1135391613300009088Vadose Zone SoilTKALKKLSGTVSVKDSNGQVQKMRRVDYWLDKWYDIRGELLKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWLLRNVGSTETSEEAAVELVEPGKKSELSEPNKKSAAEEKLTIQPKPHQSRWGRRFII
Ga0099828_1100834023300009089Vadose Zone SoilYWLDKWYDIRGELFKGGMRQGLVIMVSALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPPVSKKWLMRNVGPIETSQETAVEIVEPSRKSELSEPGKKSSVEEELTDQSKSHGGKWARRVLMAVGVLAILFLTWEYLLPLLSQWIHL*
Ga0099827_1112688413300009090Vadose Zone SoilLLFADDMTKALKKLSGMVGVKDSNGQLQKMRKVDYWLDKWYDIRGELYKRGMRQGIVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPLATSKHLLQNVGSTETVEETTVEIVEPLKKSELRQPGKKSAAEEKLVIQAKPHRSKWGRRFVIATGVLAVLYLAWNYLWPLLLAFA
Ga0099827_1194969313300009090Vadose Zone SoilDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSENVEETAVEIVEPAKKSAVEPNKKSLVEEKIAIQP
Ga0066709_10061996323300009137Grasslands SoilDSNGQLQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTKKWIMRNVGSSETSEETSIEIVEPNRKSELSEPGKKSSVEEELTIQSKPHRGKWGRRFMIGTGVLAILYLAWEYILPLLSQWIHL*
Ga0066709_10180144313300009137Grasslands SoilGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEELAVEDDEPKKSAVELDGSKKKSLLQEKISVQPMPHRRKKDRRVSIVIMAAGVLTVLFFAWKYIWPLLR*
Ga0134070_1006886523300010301Grasslands SoilVKDSNGRLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDIRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSAFQEKITVQARPDRSKKGRRVSMVVVAAGVLTILFLAWQYIWPLLR*
Ga0134070_1031880813300010301Grasslands SoilKDSNGRLEKMRKIDYWLDKWYDIRGELFKRGMRQGIVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEALAVEDDEPKKSAVELDGPKKRSLLQGKISVRPMPHRSKKGRRVSIVIMAAGVLTVLFF
Ga0134082_1007398323300010303Grasslands SoilGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSALQEKITVQARPHRSKKGRRVSTVVVAACVLTILFLAWQYIWPLLR*
Ga0134062_10004422123300010337Grasslands SoilLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEELAVEDDEPKKSAVELDSPKKKSLLQEKISVQTRPRRSKKGRRVSIVIMAAGVLTVLFFAWKYIWPLLR*
Ga0138113_17968013300010895Grasslands SoilELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYSMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSALQEKITVQARPDRSKKGRRVSMVVVAAGVLTILFLAWQYIWPLLR*
Ga0137391_1105571613300011270Vadose Zone SoilNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTKKWIMRNVGSSEASEETTVELVEPGRKSELSEANRKSVVEEKLTVQPKPHRGKWGRRFVMATGVLAILYLAWEYILPFLR*
Ga0137393_1079529813300011271Vadose Zone SoilLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETMEETAVEIVEPAKKSQQSQPGKKSAVEPNKKSLVEEKIAIQPKPRRGKWGRRFIMATGVLAILYLAYYYIWPLIPK*
Ga0137383_1032885413300012199Vadose Zone SoilKWYDIRGELFKRGMRQGLVIMVASLHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVLRVPRPLVSKKWIMRDVGARETSEETAVELVEPSGKSEPKAPSKRSGIEEELTIQLKPRRGKWGRRFIIGTVVLVILYLAWEYILPLLALWVHI*
Ga0137365_1004811833300012201Vadose Zone SoilVAVKDSNGQVQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKEWIMRNVGSSESSEETAVELVEPNKKSELSGSSKKSAVEEELTVQPGPHRGRWGRRFIIATGVLAILYLAWEYIRPLLR*
Ga0137365_1049971713300012201Vadose Zone SoilKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPTVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPSVSTKWLLRNVGPKESSEAPAVETTTPERKPTIELDERKRKSVVAEKPNTQPKVHRGRWGRRVSVALMATGVLAILFLAWTYVWPLLK*
Ga0137365_1133333213300012201Vadose Zone SoilKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPTVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPSVSTKWLLRNVGPKESSEAPTVETTTPEKKPTIELDEPKRKSVVAEKPNTQPKVHRGRWGHRVSIALMATGVLAIL
Ga0137363_1181115913300012202Vadose Zone SoilSNGQLEKMRRLDYWLDKWYDIRGELFKRGMHQGLVVMVAGLHRFYGGDIDLRADADLLFVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPSVSTKWLMRNVGSSETSQELSVEVDGPRKKSTDEAGERRMRSVVEEKP
Ga0137399_1018700913300012203Vadose Zone SoilKRFRKPPKAVLLFADDMTKALKKLSGTVAVKDSNGQVQKMKKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPPVSKKWIMRNVGPRDTSEETAVEIIEPARKSELREPSKKSATEKGLTIQSKPQRGKWGRRFILGTGVLVILYLAWEYILPFLSRWIHF*
Ga0137374_1007646313300012204Vadose Zone SoilQSQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSVEASEETSVEIVEPNRKSELKEPEKRSAVDEELTIQTRPHRGKWGRRFVMATGVLAILYLAWEYILPLLR*
Ga0137380_10005897133300012206Vadose Zone SoilGMRQGLVIMVASLHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPRVSKKWIMRNVGSSETSEETTIEIVEPNKKSELSEPSKKSLAEEKLAIQAKPHQSRWRRRFIIGVGVLAMLFSAWEYILPLLSLWVHL*
Ga0137380_1000911113300012206Vadose Zone SoilKAVLLFADDMTKALKKLSGTVAVKDSNGQVQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPAVSKKWLLRNVGSIETTEETAVELVEPGKKSELSEPSKRSAIEEKLAIQPKPHRSRWGRRFIMATGVLAILYLAWEYIWPLLR*
Ga0137380_1050912023300012206Vadose Zone SoilLKKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRFLQNVGSSETMEETAVEIVEPAKKSAVEPNKKSLVEEKMAIRPKPRRGKWGRRFILSTGVLAMLYLAYYYVWPLIPK*
Ga0137380_1087687713300012206Vadose Zone SoilMTKALKKLSGTVPVRDANGGEQKMRKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPTVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPSVSTRWLLRNVGPKESSEASTVETTTPEKKPTIELDEPKRKSVVAEKPNIQPKVHRGRWGRRVSIALMATGVLAILFLAWTYVWPLLK*
Ga0137380_1124195713300012206Vadose Zone SoilMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSNKRLLQSVGSSETMEETAVEIVEPAKKSAVEPNKRSLVEEKIAIQPKPQRGKRGRRFIMATGVLAILYLAYYYIWPLIPK*
Ga0137380_1125964713300012206Vadose Zone SoilRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDASKVERMLGPVVYYAMRQHEVDALRNRKELAWTAFVAKTGSGVFRVPRPPLSTQWLIRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSALQEKITVQARPHRSKKGRRVSMVVVAAGVLTILFLAWQYIWPLLR*
Ga0137380_1148560913300012206Vadose Zone SoilALKKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETVEETAVEIVEPTKKSAVEPNKKSLVEEKIAIQTKPQRGK
Ga0137381_1001704283300012207Vadose Zone SoilLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETVEETAVEIVEPAKKSAVEPNKKSLVEEKIAIQPKPRRGKRGRRFIMATGVLAILYLAYYYIWPLIPK*
Ga0137381_1059999923300012207Vadose Zone SoilKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRFLQNVGSSETMEETAVEIVEPAKKSAVEPNKKSLVEEKMAIRPKPRRGKWGRRFILSTGVLAMLYLAYYYVWPLIPK*
Ga0137381_1062387113300012207Vadose Zone SoilLKKLSGTVAVKDSNGQSQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWMMRNVGSSESSEETGIELVEPIKKSELNEPSKKSVAEEKLSIQHKPHRGKWGRRFIMATGVLAILYLAWEYIWPLLK*
Ga0137381_1082698813300012207Vadose Zone SoilLKKLSGTVAVKDSNGQSQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPISKKWIMRNVGASETREETAVELVEPERKSELSETSRRSTVEEKLTIQPKPHRGKWGRRFVMATGVLAILYLAWEYIWPLLK*
Ga0137381_1173718113300012207Vadose Zone SoilIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVTSKRLLQDVGSSETVEEKVVEIVESARKSAVESDKKSVVEEKLTIQPKPGRGKRGRRFIMATGVLAILYL
Ga0137379_1002354633300012209Vadose Zone SoilMTKALKKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELYKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETMEETAVEIVEPAKKSAAEPNKKSLVEEKIAIQPKPRRGKWSRRFIMATGVLAILYLAWNYLWPLIPK*
Ga0137379_1065193813300012209Vadose Zone SoilKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETVEETAVEIVEPTKKSAVEPNKKSLVEEKIAIQTKPQRGKRGRRFIVATGVLAMLYLAYYYIWPLIPK*
Ga0137379_1127907513300012209Vadose Zone SoilKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGFVAKTGSGVFRVSRPPVSTKWLMRNVGSVEASEELVDEDDEPKKSVVDLDGPRKKSALHEKIVVQARTNRSKRGGRASVVIMAASALAIAFLAWKYIWPLLR*
Ga0137379_1159079513300012209Vadose Zone SoilDMTKALKKLSGTVAVKDSNGQLQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWMMRNVGSSESSEETGIELVEPIKKSELNEPSKKSVAEEKLSI
Ga0137377_1180221013300012211Vadose Zone SoilTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGSVVYYAMRQHEVDALRDRKELTWTAFVAKTGSGVFRVSRPPVSTKWLMRNVGSVETSEELAVDVDEPKKSVVDLDGPKKKSVFEEKTAVQA
Ga0137387_1032842813300012349Vadose Zone SoilYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTKKWIMRNVGARETSEETAVELVEPSGKSEPKAPSKRSGIEEELTTQLKPRRGKWGRRFIIGTGVLVILYLAWEYILPLLALWVHI*
Ga0137387_1126562013300012349Vadose Zone SoilQGLVIMVASLDRFYGGDIDLRAYADLPLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVSRPPVSTKWLMRNVGSLETSEELAVDDDEPKKSAVELDGPKKKSLLQEKIVVQARPNRSKRGRGVSVVIMAVSALAIAFLAWKYIWP
Ga0137386_1038348113300012351Vadose Zone SoilDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTKKWIMRNVGARETSEETAVELVEPSGKSEPKATSKRSGIEEELTTQLKPRRGKWGRRFIIGTGVLVILYLAWEYILPLLALWVHI*
Ga0137371_1045904013300012356Vadose Zone SoilIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPPVTKKWLMQNVGSRETSEETAVEIVEPIKKSELNEPGKKSVAEEKLTIQPKPHRGKWGRRFIIATGVLAILYLAWEYIWPLIR*
Ga0137371_1120554813300012356Vadose Zone SoilKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGTTVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPSVSTKWLLRNVGPKESSEAPTVETTTPEKKPTIELDEPKRKSVVAEKPNTQPKVHRGRWGRRVSIALMATGVLAILFLAWTYVWPLLK*
Ga0137385_1004807513300012359Vadose Zone SoilRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETMEETAVEIVEPAKKSAAEPNKKSLVEEKIAIQPKPRRGKWSRRFIMATGVLAILYLAWNYLWPLIPK*
Ga0137385_1040755523300012359Vadose Zone SoilYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETVEETAVEIVEPAKKSAVEPNKKSLVEEKIAIQPKPRRGKRGRRFIMATGVLAILYLAYYYIWPLIPK*
Ga0137385_1062312213300012359Vadose Zone SoilVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSETVEETAVEIVEPTKKSAVEPNKKSLVEEKIAIQTKPQRGKRGRRFIVATGVLAMLYLAYYYIWPLIPK*
Ga0137385_1068629413300012359Vadose Zone SoilVLLFADDMTKALKKLSGTVAVKDSNGQVQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPAVSKKWLLRNVGSIETTEETAVELVEPGKKSELSEPSKRSAIEEKLAIQPKPHRSRWGRRFIMATGVLAILYLAWEYIWPLLR*
Ga0137385_1071349213300012359Vadose Zone SoilYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPPVSKKWLMQNVGSSETSEETSVELVEPIRKSEQSVPSKKSSVDEELAIQPKPHRGKWGRRFIIATGVLVVLYLAWEYVLPLLSMWIHL*
Ga0137375_1063231113300012360Vadose Zone SoilTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPSLSSKWLLRNVGHSEATETSTVEVMATEKVTAPDKKSAVELDEPNRRSAVEAKPVVRRKERRNKWARGVSVAVMMMGVLAILFLAWTYVWPLLR*
Ga0137360_1149493113300012361Vadose Zone SoilNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPSVSSKRLLQNVGSSENVEETAVEIVEPARKSAVEPNKKSLVEEKIAIQPKPRRGKRGRRFIMATGVLAILYLAYY
Ga0137361_1185460313300012362Vadose Zone SoilKRRMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPSVSKKWIMRNVGPKESSEETAVELVEPSGKSEPKEPGKRSRVEEELTIQTRPHRGKWGRRFVMATGVLAILYLAWKYILP
Ga0134054_126079013300012390Grasslands SoilKMRKIDYWLDKWYDIRGELFKRGMHQGLVVMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRLPISTKWLMRNVGSLETSEELAVEDDEPKKSAVELDGPKKKSLLQEKISVQPMPHRSKKRSACFHSDYGRGCLD
Ga0134043_113768313300012392Grasslands SoilALRKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEELAVEDDEPKKSAVELDGPKKKSLLQEKISVQARPRGSKKGRRVSIVIIAAGVLTVLFFAWKYIWPLLR*
Ga0134051_130887213300012398Grasslands SoilLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDIRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSAFQEKITVQARPDRSKKGRRVSMVVVAAGVLTILFLAWQYIWPLLR*
Ga0134048_101138713300012400Grasslands SoilLFADDMTKALRKLSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMHQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGIFRVPRPTVSSKWLMRNVGSVETSEELVVEDDEPKKSVVDPEGSKKRSALQEKIAVHARPHRSK
Ga0134045_117388313300012409Grasslands SoilFKRGMHQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDASKVERMLGPVVYYAMRQHEVDALRDRKDLAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEELAVEDDEPKKSAVELDGPKKKSLLQEKISVQTRPRRSKKGRRVSIVIMAAGVLTVLFFAW
Ga0137396_10000812143300012918Vadose Zone SoilFADDMTKALKKLSGTVAVKDSNGQVQKMKKVDYWLDKWYDIGGELLKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTKKWLLQNVGARETAEETAVEIVEPTRKSDPTEPRKRSSVEEELTIQTRPRRGKWGRRFVMATGVLAILYLAWEYILPLILR*
Ga0137396_1009766813300012918Vadose Zone SoilGQVQKMKKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPSVTKKWLLQNVGPKETAEETAVEIVEHSGRSALREPSKKSAVEEELTIQPKPQRGKWGRRFIMATGILAILYLAWAYILPLLSQWIHL*
Ga0137396_1017240833300012918Vadose Zone SoilDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPPVSKKWIMRNVGPRDTSEETAVEIIEPARKSELREPSKKSATEKGLTIQSKPQRGKWGRRFILGTGVLVILYLAWEYILPFLSRWIHF*
Ga0137396_1115677413300012918Vadose Zone SoilNGQVQKKKKFEYWLDKWYDIRGELLKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYIAKTGSGVFRAPRPPVSKKWLLQNVGPRETSEETAVEIVEPSRKSDLTEPRKKLSVEEELTVQSRPHRGKWGRRFVMAT
Ga0137416_1017250623300012927Vadose Zone SoilSGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVVMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTSKRLLQNVGSSETVEETAVEVVEPAKKTVAEPSKKSVVEGKLANQPKPSRGKWGRRFIMATGVLAILYLAYYYIWPLLPK*
Ga0134077_1009299623300012972Grasslands SoilFADDMTKALKKLSGTVAVKDSNGQLQKMRKIDYWLDKWYDIRGELFKRGMRHGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGPRETSEETAVELVEPIKKSDLNGPSKKSVAEEKPIVQLKPHRGKWGRRFVIGTGVLVILYLAWEYILPLLALWVHI*
Ga0134081_1003510713300014150Grasslands SoilKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSALQEKITVQARPHRSKKGRRVSTVVVAAGVLTILFLAWQYIWPLLR*
Ga0134081_1019952113300014150Grasslands SoilDKLFKRFRKPPRAVLLFADDMTNALKKLSGTVSVKDSNGQLQKMRRVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTKKWIMRNVGAIETSQETAFEIVEPGGKSELREPEKKSAVEEKLTIQTRPHRGKWGRRFLMATGVLAI
Ga0134075_1054067213300014154Grasslands SoilKWYDIRGELFKRGMRQGLVVMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWLLQNVGSRETSEETAVEIVEPIKKSEINEPSKKSVSEEKLTIQPKPHRGKWGRRFIIGTGVLAIL
Ga0137418_1090291713300015241Vadose Zone SoilIRGELFKRGMKQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFFVPRPPVTSKRFLQNVGSSETMEEKVVEIVEPARKPAVESGKKSLAEERLAIQSKPSRGKWGRRFIMATGVLAILYLAYYYVWPLIPK*
Ga0134069_126348213300017654Grasslands SoilKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEELAVEDDEPKKSAVELDSPKKKSLLQEKISVQARPRGSKKGRRVSIVMIAAGVLTVLFLAWKYIWPLLR
Ga0134074_119882313300017657Grasslands SoilGTVAVKDSNGQLEKMRKIDYWLDKWYDIRGELFKRGMHQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYSMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEELAVEDDEPKKSAVELDGPKKKSLLQEKISVQARPRGSKKGRRVSIVMIAAGVLTVLFLAWKYIWPLLR
Ga0134083_1028731613300017659Grasslands SoilIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKRWIVRNVGSSEGSEETTIELVEPGKKSELSEPSKRSAVEEKLTVQPKLHRSRWARRFVVATGVLAILYLAWKYLLPILSLWVRL
Ga0187778_1056348213300017961Tropical PeatlandDRMRKVDYWLDKWYDIRGELFKHGMRQGLVVMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPTVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPAVNAKWLLRNVGPVESSEDIALEEVDTTKSGVEEKPATGSRRPPRKIARYVSSAVMGIGVLAIIFLVWTYILPLLR
Ga0066655_1001158813300018431Grasslands SoilVKDSNGQVQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRSVGSSETSRETAVELVEPNKKSELSESSKKSAVEGELTIQPRPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR
Ga0066667_1020929623300018433Grasslands SoilDSNGQLQKMRKVDYWLDKWYDIRGELFKRGMGQGLVIMVAALHRFYGRDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWLMQNVGSRETSEETSVELVESIRKSEQSVPSKKSSVDEELAIQPKPHRGKWGRRFIIATGVLVVLYLAWEYVLPLLSMWIHL
Ga0066667_1045903213300018433Grasslands SoilAVLLFADDMTKALKKLSGTVAVKDSNGQSQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSSETSEETSIEIVEPGRKSELSEPNKKSSVEELTIQPKPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR
Ga0215015_1044598213300021046SoilLDKWYDIRGELYKHGMRQGLVIMVSALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYSMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSSKRLLQNVGSSETSGETVVEIVEPIKRSAVEPGRKSMLDLSLIHI
Ga0207646_1031589013300025922Corn, Switchgrass And Miscanthus RhizosphereLQKMRKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKMGSGVFRVPRPPVTSKRLLQNVGSSETVEETAVDVVEPAKKTVAEPSKKSVVERKLANRPKPGRGKWGRRFIMATGILAILYLAYYYVWPLLPK
Ga0209237_125444013300026297Grasslands SoilWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPPVSKKWIMRNVGLRETSEETAVELVEPDKSEPSQPNRKSAVEEKLAIQTRPHRGKWGRHFLMATGVLAILYLAWAYI
Ga0209236_113075613300026298Grasslands SoilFRKPPKAVVLFADDMTKALKKLSGTVAVKDSNGQVQKMKKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKQLAWTAYVAKTGSGVFRVPRPPVSKKWLLRNVGSSETSEETAVELVEPSGKSELREPGKRSTVEEKPAIQTKPHRSRWGRRFVMATGVLAILYLAWAYIWPLLR
Ga0209236_123150213300026298Grasslands SoilLFADDMTKALKKLSGTVSVKDSNGQLQKMRRVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPPVSKKWIMRNVGLRETSEETAVELVEPDKSEPSQPNRKSAVEEKLAIQTRPHRGKWGRRFLMATGVLAI
Ga0209802_105916513300026328SoilNGQVQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTGYVAKTGSGVFRVPRPPISKKWIMRNVGSSETAEETTVELVEPIKKSELSEPDKKSAVEEEPAIQPTPHRGRWGRRFIMATGVLAILYLAWEYVLPLISLWVHL
Ga0209159_113550313300026343SoilKLSGTVAVKDSNGRLEKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEALAVEDDEPKKSAVELDGPKKRSLLQGKISVRPMPHRSKKGRRVSIVIMAAGVLTVLFFAWKYIWPLLR
Ga0209808_125808013300026523SoilKALKKLSGTVAVKDSNGQLQKMRKVDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRNVGSSETSEETSIEIVEPGRKSELSEPNKKSSVEELTIQPKPH
Ga0209056_1059029513300026538SoilDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAFVAKTGSGVFRVPRPPLSTQWLMRNVGSVDASEELAVEVDEPTKSVVDPNGHKKKSALQEKITVQARPHRSKKGRRVSTVVVAAGVLTILFLAWQYI
Ga0209376_100177713300026540SoilLFADDMTKALKKLSGTVAVKDSNGQLQKMRKIDYWLDKWYDIRGELFKRGMRQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVSKKWIMRSVGSSETSRETAVELVEPNKKSELSESSKKSAVEGELTIQPRPHRGRWGRRFIIATGVLVILYLAWEYIWPLLR
Ga0209156_1037358513300026547SoilKIDYWLDKWYDIRGELFKRGMHQGLVIMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDREELAWTAFVAKTGSGVFRVPRPPVSTKWLMRNVGSLETSEELAVEDDEPKKSAVELDGPKKKSLLQEKISVQARPRGSKKGRRVSIVIMAAGVLTVLFLAWKYIWPLLR
Ga0209283_1066139113300027875Vadose Zone SoilRGELFKRGMRQGLVIMVAAFHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVERMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRAPRPPVTKKWIMRNVGSSERSEETAVELVEPNRKSELSEPSKKSAVEEKLTIQAKPHRGKWGRRFIMGTGVLAILYLAWEYILPLVSQWVHI
Ga0307473_1130772313300031820Hardwood Forest SoilDKWYDIRGELFKRGMRQGLVVMVAALHRFYGGDIDLRADADLLLVRSTGTPGTFDANKVEKMLGPVVYYAMRQHEVDALRDRKELAWTAYVAKTGSGVFRVPRPPVTSKRFLQNVGSSETLEEKVVEIVEPARKSAVEIGKKSVMEEKLAVQPRPGRGRWGRRFIMATGVLAIVYLAYYY


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.