NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F045047

Metagenome / Metatranscriptome Family F045047

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F045047
Family Type Metagenome / Metatranscriptome
Number of Sequences 153
Average Sequence Length 150 residues
Representative Sequence MNIGGMGSVGWNFKPWLQLVADSSYSVVTISGTKNVLYGNHFGPRYFHRRLSRWGATPFVEALAGGSRADTTITGVGGYTTSTNCISYRVGGGLDLHPSRHWEIRLFDVDYYRTAFGTNVHQNNYWASAGIVLRLFGGSSDY
Number of Associated Samples 110
Number of Associated Scaffolds 153

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.96 %
% of genes from short scaffolds (< 2000 bps) 1.31 %
Associated GOLD sequencing projects 103
AlphaFold2 3D model prediction Yes
3D model pTM-score0.62

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (96.732 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(37.909 % of family members)
Environment Ontology (ENVO) Unclassified
(36.601 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(41.176 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 62.35%    Coil/Unstructured: 37.65%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.62
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 153 Family Scaffolds
PF01979Amidohydro_1 11.76
PF01740STAS 1.96
PF13437HlyD_3 1.96
PF00578AhpC-TSA 1.31
PF06718DUF1203 1.31
PF13442Cytochrome_CBB3 1.31
PF07927HicA_toxin 0.65
PF12844HTH_19 0.65
PF00881Nitroreductase 0.65
PF01548DEDD_Tnp_IS110 0.65
PF08241Methyltransf_11 0.65
PF12838Fer4_7 0.65
PF13641Glyco_tranf_2_3 0.65
PF12900Pyridox_ox_2 0.65
PF01381HTH_3 0.65
PF01638HxlR 0.65
PF10067DUF2306 0.65
PF01850PIN 0.65
PF16656Pur_ac_phosph_N 0.65
PF02954HTH_8 0.65
PF12686DUF3800 0.65
PF14384BrnA_antitoxin 0.65
PF13620CarboxypepD_reg 0.65
PF00486Trans_reg_C 0.65
PF02077SURF4 0.65
PF00795CN_hydrolase 0.65
PF13561adh_short_C2 0.65

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 153 Family Scaffolds
COG1724Predicted RNA binding protein YcfA, dsRBD-like fold, HicA-like mRNA interferase familyGeneral function prediction only [R] 0.65
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 0.65
COG2259Uncharacterized membrane protein YphA, DoxX/SURF4 familyFunction unknown [S] 0.65
COG3547TransposaseMobilome: prophages, transposons [X] 0.65


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A96.73 %
All OrganismsrootAll Organisms3.27 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300011270|Ga0137391_10040868All Organisms → cellular organisms → Bacteria → Acidobacteria3931Open in IMG/M
3300021171|Ga0210405_10004681All Organisms → cellular organisms → Bacteria12752Open in IMG/M
3300021478|Ga0210402_11001815All Organisms → cellular organisms → Bacteria → Acidobacteria762Open in IMG/M
3300027591|Ga0209733_1000741All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6658Open in IMG/M
3300032180|Ga0307471_100249411All Organisms → cellular organisms → Bacteria → Acidobacteria1826Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil37.91%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil12.42%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil11.76%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil11.11%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment6.54%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil4.58%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.58%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.61%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.31%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.31%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.31%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil1.31%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring0.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.65%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil0.65%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.65%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere0.65%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002909Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cmEnvironmentalOpen in IMG/M
3300004081Grasslands soil microbial communities from Hopland, California, USA - 2 (version 2)EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009700Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_4_PS metaGEnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017823Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3EnvironmentalOpen in IMG/M
3300017933Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1EnvironmentalOpen in IMG/M
3300017942Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_3EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300017955Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2EnvironmentalOpen in IMG/M
3300017995Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_1EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300022523Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022531Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022717Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-11-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027591Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027853Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300030991Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031247Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-CB2-25 metaGHost-AssociatedOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25388J43891_100539923300002909Grasslands SoilVRSTWERRNPGGVRLTPLIEVFGGYAFARLDGGGGYWTNMTIGGMGSVGWNFKPWLQIVADSSYSTVTISGTKNVLYGNHFGPRYFYHSRNRWGITPFVEGLVGGSRADTTVPGVGGYTASANCISYRVGGGVDLHPSRRWEIRLLDVDYYRTSFGTNLHQNNYWASAGIVLRLFGGSSDY*
Ga0063454_10122279513300004081SoilKSWLQVVGDSSYNVITVSGAKNVLYGNHYGPRLFRRGRNRWGITPFVEALAGGSRLDVTVSGVGGYKTSESCFSFKVGGGIDIKPSRHFEIRLFDIDYYRSTFGTNLHQNNYWASTGIVLRLFGGASE*
Ga0062387_10016811033300004091Bog Forest SoilTQNVLFGNHYGPRFFYRIRNRWGITPFAEALVGGSDLRTTVSGAGGYTAASGSSLSYKVGGGIDIHPSHRWQIRLLDVDYYGTSFVANTHQSNYWITTGVVLRLFGGGAE*
Ga0062389_10225331423300004092Bog Forest SoilNVLYGNHYGPRFFYRMRNRWGITPFGEALVGGSDLKTTISGSGGYTASTGSLLSYKVGGGVDIHPSKRWEIRLIDVDYYRTGFGTNAHQTNYWVSTGIVLRLFGGDPQ*
Ga0063455_10078581013300004153SoilPGYWTNVHGAMGSFGWNVKSWLQVVGDSSYNVITVSGAKNVLYGNHYGPRLFRRGRNRWGITPFVEALAGGSRLDVTVSGVGGYKTSESCFSFKVGGGIDIKPSRHLEIRLFDIDYYRSTFGTNLHQNNYWASTGIVLRLFGGASE*
Ga0066674_1000792223300005166SoilLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY*
Ga0066678_1069022313300005181SoilAAKTGLERSIPSTRKQPLVELFGGYAFVRLDNGGGYRSNLNGALGAFGWNVKPWLQIVADTSYNFVTVSGTKTVLYGNHFGPRYTHRGRNKWGVTPFVEVLFGGTRADITVSGTGGYNTSDNSLSIKAGGGLNINPSRHFEIRLFDFDYYRTSFGLNSHQNNYWASTGIVVRLFGGRSE*
Ga0066689_1019251933300005447SoilDGGGGYWTNMNVGGMGSVGWNFKPWLQIVADSSYSTVTISGTKNVLYGNHFGPRYFYHSRNRWGITPFVEGLVGGSRADTTVPGVGGYTASANCISYRVGGGVDLHPSRRWEIRLLDVDYYRTSFGTNVHQKNYWASAGIVLRLYGGSSDY*
Ga0066661_1008883433300005554SoilDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY*
Ga0066704_1035717423300005557SoilTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGHSEE*
Ga0066700_1014483733300005559SoilSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0066703_1050194113300005568SoilLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGG
Ga0066691_1035690913300005586SoilGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY*
Ga0070761_1053481823300005591SoilSGTKTVLYGNHYGPRYYYRGLGRLHITPFAEAFIGGSRADVTASGSTTSQNCISYKIGGGIDYRASRRWEIRLFDFDYYRTSFGTNAHQTNYWASTGVVLRLFGGSE*
Ga0066706_1076519513300005598SoilVRSTWERRNPGGVRLTPLIEVFGGYAFARLDGGGGYWTNMTIGGMGSVGWNFKPWLQIVADSSYSTVTISGTKNVLYGNHFGPRYFYHSRNRWGITPFVEGLVGGSRADTTVPGVGGYTASANCISYRVGGGVDLHPSRRWEIRLLDVDYYRTSFGTNLHQNNYWAS
Ga0079222_1034011313300006755Agricultural SoilIRPVETRSTWERRNPGGVRRTPLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGVKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY*
Ga0079221_1036834323300006804Agricultural SoilMGSVGWNFKPWLQLVGDSSYSTVTISGVKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY*
Ga0073928_1093533313300006893Iron-Sulfur Acid SpringMPLIELFGGYQFARLDGGGGTGTNLHGALGSFGWNLKPWLQIVADTSYNVVTISGTKNVLYGNHWGPRFFRHTRNRWGATPFVEALVGGSRADTTVTGTGGYSTSTNCLSYKVGGGVDIHPYRHFEIRLFDIDYYRTAFGVNLHQNNYSASAGIVLRLFGGGSE*
Ga0099793_1059669013300007258Vadose Zone SoilVPLVELFGGYAFERFVSAGTATNFNGGLGSVGWNVKPWLQLVADSSYSVVAAANTENVIYGNHYGPRLFRRGRNRWGLTPFAEALVGGSRADTTVSGVGGYKASQNCFSIKVGGGVDIHSSRRFDIRLFDVDYYRTSFGTNLHQNNYWVSTGIVLRLFGGGSE*
Ga0099829_1025390513300009038Vadose Zone SoilLLELYGGYTFARLVAGAGTASNLNGAIGAFGWNIKPWLQIVGDSSYNVVTVSGTKNVLYGNHFGPRYVHRSRNRWGLTPFVEALVGGSRADTSVAGSSAYNTSANCISYKAGGGIDVHPSRHIDIRLFDVDYYRTAFGTNLHQNNYWASAGIVLRLFGGGSE*
Ga0099829_1126702713300009038Vadose Zone SoilSWEKPNRAARKPTLFEFYGGYAFARLGGSGGGTYSSLNGAMGSIGLTLRPWLQIVADSTYNYVTATGTKNVLYGNHFGARYFYRSHNRWGATPFFEGLVGGSRADTTITGTGGYTTSVNCLSYKAGGGMDIHPTRHWEIRVIDVDYYRTAFGTGLHQNNYWVTTGVVLHLFGGGNQ*
Ga0099830_1005562723300009088Vadose Zone SoilMGSFGWNMTPWLQVLGDSSYNVVTVTGTKYVLYGNHFGPRYFHRSRNRWGLTPFVEALVGGSREDTTVSGTGGYKTSINCLSYKAGGGLDVHPSRHIDIRLFDFDYYRTAFGTNLHQNNYWASAGIVIRLFGGGSE*
Ga0099830_1035578223300009088Vadose Zone SoilLLELYGGYTFARLVAGAGTASNLNGAMGAFGWNIKPWLQIVGDSSYNVVTVSGTKNVLYGNHFGPRYVHRSRNRWGLTPFVEALVGGSRADTSVAGSSAYNTSANCISYKAGGGIDVHPSRHIDIRLFDVDYYRTAFGTNLHQNNYWASAGIVLRLFGGGSE*
Ga0099830_1104378113300009088Vadose Zone SoilSSYNFVTVNGAKTVLYGNHFGPRYFYRRHNRFGATPFVEALIGGSRADVTVTGTGGYTTSNNCMSYKVGGGLDLHPSRRWEIRVFDFDYYRTSFGTNVHQNNYSASAGIVLRLFGGAE*
Ga0099830_1142326813300009088Vadose Zone SoilMEAYGGYAFARLVSGGTGTNLNGLMGSFGYNIRPWFQLVADSSYSFVTTNGTKNVLYGNHFGPRFFRRGRNRWGATPFVEALVGGSRADTTVSGVGGYKTSQNCFSIKAGGGIEIHPSRHVDIRLFDVDY
Ga0099830_1158522123300009088Vadose Zone SoilYSVVTIAGTKNILYGNHFGPRYFHRGRNRWGLTPFVEALVGGSREDATVTGTGGYTISANCLSYKVGGGLDLHPSRRWEIRLFDVDYYRTSFGTGVHQNNYWATTGIVLRLFGGAE*
Ga0099828_1084555613300009089Vadose Zone SoilMGSFGWNVKPWLQIVADSSYSVVTVSGVKNVLYGNHFGPRYFHRGRNRWGMTPFVEALVGGSRADATVTGVGGYTTSANCLSYKVGGGLDLRPSRHWEIRVFDVDYYRTAFGTNMHQNNY
Ga0099827_1103123113300009090Vadose Zone SoilLLELYGGYTFARLVGGAGTATNLNGAMGAFGWNVKPWLQIVGDSSYNVVTVSGTKNVLYGNHFGPRYVHRGRNRWGLTPFVEALVGGSRADTSVAGSSAYNTSANCISYKAGGGVDIHPSRHIDIRLFDVDYYR
Ga0066709_10283866213300009137Grasslands SoilGSNLNGALGAFGWNVKPWLQIVADTSYNFVTVSGTKTVLYGNHFGPRYTHRGRNKWGVTPFVEVLFGGTRADITVSGTGGYNTSDNSLSIKAGGGLNINPSRHFEIRLFDFDYYRTSFGVNSHQNNYWASTGIVVRLFGGRSE*
Ga0116217_1044205223300009700Peatlands SoilLFEFYGGYAFARLGASGGGTSSNLNGAMGSVGLTLRPWLQIVADSTYNYVTATGVKNVLYGNHFGGRYFYRSRNRWGATPFFEGLVGGSRSDTTITGTGGYTYSVNCLSYKAGGGLDLHPTRHWEFRVIDVDYYRTAFGTGLHQNNYWVTTGVVLHLFGGGAR*
Ga0099796_1007020213300010159Vadose Zone SoilMIELFGGYGFTRLDSGGGTMTNLNGALGSFGWNFRPWLQLVADSSYSVATISGTKNVLYGNHWGPRLFHHGRYPLGAVPFVEALVGGSRADTTVSGVGGYKTSSNCLSFKVGGGVDIHPSRHFKIRLFDFDYYRTAFGANLHQNNYSASVGIVLRLFGGGAE*
Ga0134088_1014847013300010304Grasslands SoilLERSIPNVRKQPLVELFGGYAFVRLDNGGGYGSNLNGALGAFGWNVKPWLQIVADTSYNFVTVSGTKTVLYGNHFGPRYTHRGRNKWGVTPFVEVLFGGTRADITVSGTGGYNTSDNSLSIKAGGGLNINPSRHFEIRLFDFDYYRTSFGANTTQNYYTASAGIILRLFGSSSAE*
Ga0134063_10000790113300010335Grasslands SoilLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFYRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY*
Ga0134063_1025521513300010335Grasslands SoilISGTKNVLYGNHFGPRYFYHSRNRWGITPFVEGLVGGSRADTTVPGVGGYTASANCISYRVGGGVDLHPSRRWEIRLLDVDYYRTSFGTNLHQNNYWASAGIVLRLFGGSSDY*
Ga0150983_1238720913300011120Forest SoilTKNVLYGNHFGPRYFHRSRNRWGATPFVEALVGGSRADTTVTGTGGYTTSVNCLSYKVGGGLDLRPSRHWEIRVFNVDYYRTAFGTNLHQNNYWASAGIVLRFFGGSSDY*
Ga0137392_1000060513300011269Vadose Zone SoilNLNGAMGSFGWNMTPWLQVLGDSSYNVVTVTGTKYVLYGNHFGPRYFHRSRNRWGLTPFVEALVGGSREDTTVSGTGGYKTSINCLSYKAGGGLDVHPSRHIDIRLFDFDYYRTAFGTNLHQNNYWASAGIVIRLFGGGSE*
Ga0137392_1140264613300011269Vadose Zone SoilMEAYGGYAFARLVSGGTGTNLNGLMGSFGYNIRPWFQLVADSSYSFVTTNGTKNVLYGNHFGPRFFRRGRNRWGATPFVEALAGGSRADTTVSGVGGYKTSENCFSIKAGGGIEIHPSRHVDIRLFDVDYYRTSFG
Ga0137391_1004086823300011270Vadose Zone SoilVELFGGYAFARLAGSGGTSSNLNGALGSFGWNFKPWLQIVADSSYSVVTISGTKNVLYGNHFGPRYFHRGRNRWGLTPFVEALVGGSRADTTITGVGGYTTSNNCLSYKAGGGLDIHPSRRWEIRLFDIDYYRTSFGANVHQNNYWASTGIVLHFFGGSSDY*
Ga0137393_1012692713300011271Vadose Zone SoilMEAYGGYAFARLVSGGTGTNLNGLMGSFGYNIRPWFQLVADSSYSFVTTNGTKNVLYGNHFGPRFFRRGRNRWGATPFVEALAGGSRADTTVSGVGGYKTSQNCFSIKAGGGIEIHPSRHVDIRLFDVDYYRTSFGTNLHQNNYWASTGIVLRLFGGGSE*
Ga0137393_1065132713300011271Vadose Zone SoilLLELYGGYAFAHTNDGAGTTTNLNGAMGSFGWNFKSWLQILADSSYSVVTVSGTKTVLYGNHFGPRYFHRGHNRWGLTPFAEALVGGSRADTTVTGTGGYKISQNCLSYKVGGGVDIHPSRRIDIRLFDADYYRPAFGTNLHQNNYWISTGIVFRLLGGRASD*
Ga0137388_1100830123300012189Vadose Zone SoilVGDSSYNFVTVNGAKTVLYGNHFGPRYFYRRHNRFGATPFVEALIGGSRADVTVTGTGGYTTSNNCMSYKVGGGLDLHPSRRWEIRVFDFDYYRTSFGTNVHQNNYSASAGIVLRLFGGAE*
Ga0137399_1004314733300012203Vadose Zone SoilLLEFYGGYAFARLVGSAGTATNLNGAMGSFGWNVKPWLQIVADSSYSLVTVGTTKNVLYGNHFGPRYFHRSRNRWGLTPFVEALVGGSRADTTVSGTGGYKTSDNCLSYKAGGGLDVHPSRHIDIRLFDVDYYRTAFGANLHQNNYWASAGIVLRLFGGGSE*
Ga0137399_1033365913300012203Vadose Zone SoilLQIAADSSYSFVTTNGTKNVLYGNHFGPRYFHRARNRWGLAPFVEGLVGASRADTTVSGVTTSDNCLSFKVGGGVDIHPSRRFDIRLFDVDYYRTSFGTNVHQNNYWASAGIVVRLFGGGSE*
Ga0137399_1059390223300012203Vadose Zone SoilMEVYGGYAFARLVSGGTGTNLNGALGSFGYNIRPWLQLVADSSYSFVTTNGTKNVLYGNHFGPRFFRRGRNRWGATPFFEGLVGGSRADATVSGVGGYKTSQNCFSIKVGGGIEIHPSRHVDIRLFDVDYYRTSFGTNLHQNNYWVSAGIVLRLLGGGSE*
Ga0137399_1087822923300012203Vadose Zone SoilSVAWNFKPWLQLVADSSYSVVTISGVKNVLYGNHFGPRYFHRRLSRWGATPFVEALAGGSRADTTVTGVGGYTTSTNCISYRVGGGLDLHPSRHWEIRLFDVDYYRTAFGTNVHQNNYWASAGIVLRLFGGSSDY*
Ga0137399_1121579913300012203Vadose Zone SoilLVADSSYSVVTISGTKNVLYGNHFGPRYFHRRLSRWGATPFVEGLVGGSRADTTIPGVGGYTTSANCISYKVGGGVDLHPSRHWEIRLFDVDYYRTSFGTNVHQNNYWASAGIVLRLFGGSSDY*
Ga0137362_1006390423300012205Vadose Zone SoilMPRLPLIELFGGYGYARLNNGAGYVTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE*
Ga0137362_1041113923300012205Vadose Zone SoilLIELFGGYAFARLDGGAGTWTNMNLGGMGSVAWNFKPWLQLVADSSYSVVTISGVKNVLYGNHFGPRYFHRRLSRWGATPFVEALAGGSRADTTVTGVGGYTTSTNCISYRVGGGLDLHPSRHWEIRLFDVDYYRTAFGTNVHQNNYWASAGIVLRLFGGSSDY*
Ga0137380_1104240623300012206Vadose Zone SoilQLVADSSYSVVTVGTTKNVLYGNHFGPRYFHRGRNRWGATPFVEGLIGGSRADTTITGVGGYTTSVNCLSYKVGGGLDLHPSRHWEIRLFDVDYYRTSFGTNVQQNNYWVSTGIILHLFGGRAY*
Ga0137380_1119119213300012206Vadose Zone SoilLIEVFGGYAFARLDGGGGTATNLNGALGSFGWNFKPWLQLVADSSYSVVTVGTTKNVLYGNHFGPRYFHHGRNRWGATPFVEGLIGGSRADTTISGVGGYTISANCLSYKVGGGLDLHPSRHWEIRLFDVDYYR
Ga0137380_1166747613300012206Vadose Zone SoilALGAFGWNVKPWLQIVADTSYNFVTVSGTKTVLYGNHFGPRYTHRGRNKWGVTPFVEVLFGGTRADITVSGTGGYNTSDNSLSIKAGGGLNINPSRHFEIRLFDFDYYRTSFGVNSHQNNYWASTGIVVRLFGGRSE*
Ga0137379_1140227913300012209Vadose Zone SoilSYNFVTVSGTKTVLYGNHFGPRYTHRGRNKWGVTPFVEVLFGGTRADITVSGTGGYNTSDNSLSIKAGGGLNIDPSRHFEIRLFDFDYYRTSFGLNSHQNNYWASTGIVVRLFGGRSE*
Ga0137377_1051019023300012211Vadose Zone SoilLQLVADSSYSVVTVGTTKNVLYGNHFGPRYFHRGRNRWGATPFVEGLIGGSRADTTISGVGGYTISANCLSYKVGGGLDLHPSRHWEIRLFDVDYYRTSFGTNVQQNNYWVSTGIVLHLFGGRSE*
Ga0137387_1007516413300012349Vadose Zone SoilLERSIPSTRKQPLVELFGGYAFVRLDNGGGYGSNLNGALGAFGWNVKPWLQIVADTSYNFVTVSGTKTVLYGNHFGPRYTHRGRNKWGVTPFVEVLFGGTRADITVSGTGGYNTSDNSLSIKAGGGLNINPSRHFEIRLFDFDYYRTSFGVNSHQNNYWASTGIVVRLFGGRSE*
Ga0137387_1074326923300012349Vadose Zone SoilKPWLQLVADSSYSVVTVGTTKNVLYGNHFGPRYFHRGRNRWGATPFVEGLIGGSRADTTITGVGGYTTSVNCLSYKVGGGLDLHPSRHWEIRLFDVDYYRTSFGTNVQQNNYWVSTGIILHLFGGRAY*
Ga0137384_1035731113300012357Vadose Zone SoilWNFKPWLQLVADSSYSVVTVGTTKNVLYGNHFGPRYFHRGRNRWGATPFVEGLIGGSRADTTISGVGGYTISANCLSYKVGGGLDLHPSRHWEIRLFDVDYYRTSFGTNVQQNNYWVSTGIVLHLFGGRSE*
Ga0137360_1014693023300012361Vadose Zone SoilMPLIELFGGYQFARLDGGGGTGTNLNGALGSFGWNLKPWLQIVADTSYNVVTISGTKNVLYGNHWGPRFFRHTRNRWGATPFVEALVGGSRADTTVTGTGGYTTSNNCLSYKVGGGVDIHPYRHFEIRLFDVDYYRTAFG
Ga0137361_1020314123300012362Vadose Zone SoilLIELYGGYAFARLGGAGTWTNFNGALGSFGWNVKPWLQIVADTSYSYVTANGVKNVLYGNHYGPRFFRHGRNRWGATPFVEALFGGSRADTTITGTGGYTSSENSLSFKVGGGLDIHPSPRLKIRLFDVDYYRTSFGPNLHQNNYWASAGIVVRLFGGSSE*
Ga0137361_1020658323300012362Vadose Zone SoilGGMGSVAWNFKPWLQLVADSSYSVVTISGVKNVLYGNHFGPRYFHRRLSRWGATPFVEALAGGSRADTTVTGVGGYTTSTNCISYRVGGGLDLHPSRHWEIRLFDVDYYRTAFGTNVHQNNYWASAGIVLRLFGGSSDY*
Ga0137361_1053554513300012362Vadose Zone SoilLIELFGGYAFARLDGGAGTWTNLNGALGSFGWNVKPWLQIVADTSYDVVTVSGTKTVLWGNHYGPRLFFGRVRNRWGITPFVEGLVGGSRADVTVSGTGGYATSVNSISYKVGGGLDIKPSPHFEIRLLDVDYYRT
Ga0137390_1172065713300012363Vadose Zone SoilKYVLYGNHFGPRYFHRSRNRWGLTPFVEALVGGSREDTTVSGTGGYKTSINCLSYKAGGGLDVHPSRHIDIRLFDFDYYRTAFGTNLHQNNYWASAGIVIRLFGGGSE*
Ga0137398_1012319823300012683Vadose Zone SoilMIELFGGYGFTRLDSGGGTMTNLNGALGSFGWNFRPWLQLVADSSYSVATISGTKNVLYGNHWGPRLFHHGRYPLGAVPFVEALVGGSRADTTVSGVGGYKTSSNCLSFKVGGGVDIHPSRHFKIRLFDFDYYRTAFG
Ga0137398_1078698713300012683Vadose Zone SoilGPRYFYHNRNRWGVTPFVEGLVGGSRADTTISGVGGYTTSANCISYKVGGGVDLHPSRHWEIRLFDVDYYRTSFGTNVHQNNYWASAGIVLRLFGGSSDY*
Ga0137395_1000921613300012917Vadose Zone SoilMNGGLGSFGLNLKPWLQIVADSSYSFVTAGTTKNVLYGNHYGPRYFHRSRNRWGATPFVEALFGASRADTTITGPGGYTISSNSFSYKVGGGIDLHPSSHWEIRLFDVDYYRTSFGPNIHQNNY
Ga0137395_1015448523300012917Vadose Zone SoilPPAAAEAQPAPAEPPAPAVRAASARESWEKPNRGTRRAAMIELFGGYGFTRLDSGGGTMTNLNGALGSFGWNFRPWLQLVADSSYSVATISGTKNVLYGNHWGPRLFHHGRYPLGAVPFVEALVGGSRADTTVSGVGGYKTSSNCLSFKVGGGVDIHPSRHFKIRLFDFDYYRTAFGANLHQNNYSASVGVVLRLFGGGAE*
Ga0137396_1025118513300012918Vadose Zone SoilLIELYGGYAFARLDGGGGTASNLNGAMGSFGYNMKPWLQIAADSSYSFVTTNGTKNVLYGNHFGPRYFHRARNRWGLAPFVEGLVGASRADTTVSGVTTSDNCLSFKVGGGVDIHPSRRFDIRLFDVDYYRTSFGTNVHQNNYWASAGIVVRLFGGGSE*
Ga0137396_1045586613300012918Vadose Zone SoilMEAYGGYAFARLVSGGTGTNLNGLMGSFGYNIRPWLQLVADSSYSFVTTNGTKNVLYGNHFGPRFFRRGRNRWGATPFFEGLVGGSRADATVSGVGGYKTSQNCFSIKVGGGIEIHPSRHVDIRLFDVDYYRTSFGTNLHQNNYWASTGIVLRLFGGGSE*
Ga0137359_1148021013300012923Vadose Zone SoilGVRNVGMPRLPLIELFGGYGYARLNNGAGYVTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE*
Ga0137419_1049318613300012925Vadose Zone SoilTESWEKPNPGVRTLGVRNVGMPRLPLIELFGGYGYARLNNGAGYVTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE*
Ga0137416_1004722863300012927Vadose Zone SoilMPLIELFGGYGFARLDGGAGTWTNLNGVMGSFGWNVKPWLQLVADSSYSVVTVANTKNVIYGNHYGPRLFRRGRNRWGLTPFAEALVGGSRADTTVSGTGGYSASENSFSVKLGGGLDFKPTRHFEIRLIDIDYYRTSFGPNVHQNNYWASAGIVIRLFGRSE*
Ga0137416_1045964523300012927Vadose Zone SoilVQPANATESWEKPNPGVRTLGVRNVGMPRLPLIELFGGYGYARLNNGAGYVTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE*
Ga0137416_1051471713300012927Vadose Zone SoilPASASTPAEPPAPSIRPVNDRTSWEKPNPGLRRRAPLLEFYGGYAFARLVGSAGTATNLNGAMGSFGWNVKPWLQIVADSSYSLVTVGTTKNVLYGNHFGPRYFHRSRNRWGLTPFVEALVGGSRADTTVSGTGGYKTSDNCLSYKAGGGLDVHPSRHIDIRLFDVDYYRTAFGANLHQNNYWASAGIVLRLFGGGSE*
Ga0137416_1059329923300012927Vadose Zone SoilLIEVFGGYAFARLDGGAGTWTNMNIGGMGSVGWNFKPWLQLVADSSYSVVTISGTKNVLYGNHFGPRYFHRRLSRWGATPFVEALAGGSRADTTITGVGGYTTSTNCISYRVGGGLDLHPSRHWEIRLFDVDYYRTAFGTNVHQNNYWASAGIVLRLFGGSSDY*
Ga0137407_1222198813300012930Vadose Zone SoilLQVVADSSYSVVTISGTKNVLYGNHFGPRYFHRGRNRWGATPFVEALIGGSRADTTVTGAGGYTTSVNCMSYKVGGGLDLRPSRHWEIRVFNVDYYRTAFGTNAHQNNYWASAGIVLRLFGGSSDY*
Ga0137410_1043668523300012944Vadose Zone SoilPTAQPEPASPEPVRPAVQPANATESWEKPNPGVRTLGVRNVGMPRLPLIELFGGYGYARLNNGAGYVTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE*
Ga0134076_1046729013300012976Grasslands SoilKNVLYGNHFGPRYFYHSRNRWGITPFVEGLAGGSRADTTVPGAGGYTASANCISYKVGGGVDLHPSRRWEIRLLDVDYYRTSFGTNLHQNNYWASACIVLRLFGGSSDY*
Ga0134081_1001549213300014150Grasslands SoilLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDH*
Ga0137418_1038226923300015241Vadose Zone SoilMRRTPLIEVFGGYAFARLDGGGGYWTNMNVGGMGSVGWNFKPWLQLVADSSYSTVTISGTKNVLYGNHFGPRYFYHSRNRWGVTPFVEGLVGGSRADTPISGVGRYTTSANFTSGKVGGGGDRQPPRHWEIRLFDVDYYRTSFGTNAHQNNYWASAGIVLRLFGGSSDY*
Ga0137418_1042931513300015241Vadose Zone SoilRNASMKESWEKPKPVARTVGVRNVGMPRLPLIELFGGYGYARLNNGAGYVTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE*
Ga0134089_1012941723300015358Grasslands SoilFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY*
Ga0134085_1041632813300015359Grasslands SoilGWNVKPWLQIVADTSYNFVTVSGTKTVLYGNHFGPRYTHRGRNKWGVTPFVEVLFGGTRADITVSGTGGYNTSDNSLSIKAGGGLNINPSRHFEIRLFDFDYYRTSFGLNSHQNNYWASTGIVVRLFGGRSE*
Ga0187818_1022647413300017823Freshwater SedimentGAGAYNNMNGGLGSFGWNWKPWLQLTGDTSYNFVTIAGTKYVLYGNHYGGRFFYRGHSRWRWSATPFAEALVGGSREDTTVSGASGYSTSQNCITYKVGGGVDLHPSRRWEIRLFDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187801_1045876613300017933Freshwater SedimentSFGWNWKPWLQLTGDTSYNFITTAGTKYVLYGNHYGARFFHRVHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187808_1024233413300017942Freshwater SedimentSNMNGALGSFGWNWKPWLQLTGDTSYNFITTAGTKYVLYGNHYGARFFHRVHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187819_1025206213300017943Freshwater SedimentYAFARMDGGAGAYSNMNGALGSFGWNWKPWLQLTGDTSYNFITTAGTKYVLYGNHYGARFFHRTHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187819_1069507413300017943Freshwater SedimentWLQLTGDTSYNFITTAGTKYVLYGNHYGARFFHRVHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187817_1003705333300017955Freshwater SedimentVLYGNHYGARFFHRVHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187817_1012796733300017955Freshwater SedimentRFFYRGHSRWRWSATPFAEALVGGSREDTTVSGASGYSTSQNCITYKVGGGVDLHPSRRWEIRLFDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187817_1049547913300017955Freshwater SedimentVLYGNHYGARFFHRVHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRTAFGTNLHQNNYWVSTGVVLRLYGGRGYE
Ga0187817_1062292813300017955Freshwater SedimentAFARMDGVAGAYSNMNGALGAFGWNWKPWLQLTGDTSYNFITTAGTKYVLYGNHYGARFFHRVHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRTAFGTNLHQNNYWVSTGVVLRLFGGRGYE
Ga0187816_1043978113300017995Freshwater SedimentQPAPARSVEPAVQPVDPNTRWEKPLPSVRQAPLIELFGGYAFARMDGGAGAYSNMNGALGSFGWNWKPWLQLTGDTSYNFITTAGTKYVLYGNHYGARFFHRVHNRWAATPFAEALIGGSREDTTITGPGGYTYSQNCISYKVGGGVDLHPSRRWEIRLVDFDYYRIAFGTNLHQNNYWVSTGVVLRLFGGRGY
Ga0066667_1006849213300018433Grasslands SoilTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0066662_1005030243300018468Grasslands SoilLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0179592_1020335113300020199Vadose Zone SoilNHFGPRYFHRGRNRWGATPFVEALIGGSRADTTVTGAGGYTTSVNCMSYKVGGGLDLRPSRHWEIRVFNVDYYRTAFGTNAHQNNYWASAGIVLRLFGGSSDY
Ga0210404_1042423323300021088SoilDTSYNFVTVGTTKNVLYGNHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKFGGGLDIRPARYARHFKIRLFDFDYYRTSFGTNLHQNNYSVSTGIVLRLFGGGSE
Ga0210406_1134436613300021168SoilNLNGALGSFGWNVRPWLELVADTSYNFVTVGTTKNVLYGNHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKFGGGLDIRPARYARHFKIRLFDFDYYQTSFGTNLHQNNYSISTGIVLRLFGGGSE
Ga0210405_1000468163300021171SoilMGSFGWNMTPWLQVLGDSSYNVVTVTGTKYVLYGNHFGPRYFHRSHNRWGLTPFVEALVGGSREDTTVSGTGGYKTSINCLSYKAGGGLDVHPSRHIDIRLFDFDYYRTAFGTNLHQNNYWASAGIVIRLFGGGSE
Ga0210405_1014202543300021171SoilHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKAGGGLDIRPSRYSRYFKIRLFDFDYYRTSFGTNLHQNNYSVSTGIVLRLFGGGSE
Ga0210405_1017005433300021171SoilGDSSYNFVTVGTTKNVLYGNHYGPRYFYRGLNRWKVTPFAEALVGGSRADVTVSGAGGYSTSQNSLSFKVGGGVDYRPSRRWEIRLFDFDYYRTSFGTNAHQNNYSASAGIVLRLFGGRS
Ga0210405_1030188623300021171SoilMEVYGGYAFARLVSGGTGTNLNGVLGSFGYNIKPWLQLMADSSYNVVTTNGVKNVLYGNHFGPRFFRRGRNRWSATPFVEALVGGSRADTTVSGAGGYKTSQNCFSIKAGGGVDIHPSRRIDIRLFDVDYYRTSFGTNVHQNNYWVSTGIVVRLFGGGSE
Ga0210397_1071656113300021403SoilARPAIRAVKEQERWEKPNRGTHRAAVLEFFGGYEFARLNDGAGTFTNLNGALGSFGWNVRPWLELVADTSYNFVTVGTTKNVLYGNHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKFGGGLDIRPARYARHFKIRLFDFDYYRTSFGTNLHQNNYSISTGIVLRLFGGGSE
Ga0210386_1045914523300021406SoilMRTGSGSSATNFNGALGSFGWNFKPWLQIVGDSSYNFVTVGTTKNVLYGNHYGPRYFYRGLNRWKVTPFAEALVGGSRADTTVSGAGGYSISQNSISYKVGGGVDYRPSRRWEIRLFDFDYYRTSFGTNAHQNNYSASAGIVLRLFGGRTE
Ga0210402_1100181523300021478SoilYNTVTYSGTKNVLYGNHYGPRYFYRTQNRWHLTPFVEGLVGASRADSTITGTGGYTTSVNGMSFKFGGGVDYRPSRRLEIRLLDVDYYRTSFGTNAYQTNYWISSGIVIRLFGGNSD
Ga0210410_1031154913300021479SoilSASARESWEKPNAGLRRMPLIELFGGYQFARLDGGGGTGTNLHGALGSFGWNLKPWLQIVADTSYNVVTISGTKNVLYGNHWGPRFFRHTRNRWGATPFVEALVGGSRADTTVTGTGGYTTSTNCLSYKVGGGVDIHPYRHFEIRLFDVDYYRTAFGVNLHQNNYSASAGIVLRLLGGGS
Ga0242663_105016413300022523SoilAPAQPAAPAIRPADTHDTWERPNASVRVAPLLELFGGYAFMRTGSGSSATNFNGALGSFGWNFKPWLQIVGDSSYNFVTVGTTKNVLYGNHYGPRYFYRGLNRWKVTPFAEALVGGSRADTTVSGAGGYSISQNSISYKVGGGVDYRPSRRWEIRLFDFDYYRTSFGTNAHQNNYSASAGIVLRLFGGRAEYRPEARATRAGERPQGRPPCKASSRDGSSAVRRAGRTTPKMF
Ga0242660_115245513300022531SoilWLQIVGDSSYNFVTVGTTKNVLYGNHYGPRYFYRGLNRWKVTPFAEALVGGSRADTTVSGAGGYSISQNSISYKVGGGVDYRPSRRWEIRLFDFDYYRTSFGTNAHQNNYSASAGIVLRLFGGRAE
Ga0242661_104293113300022717SoilTAPAPAEAPAPAVRSANDRTSWEKPNPGLRRRAPLLEFFGGYAFARMAGTAGTATNLSGGMGSVGWNIKPWLQILGDSSYSVVTASGAKNVLYGNHFGPRYFHRSHNRWGLTPFVEALVGGSREDTTVSGTGGYKTSINCLSYKAGGGLDVHPSRHIDIRLFDFDYYRTAFGTNLHQNNYWASAGIVIRLFGGGSE
Ga0242665_1015292013300022724SoilEATTATAPETAAAAQPEPAAAEPAPAIRNVSMKESWEKPNPVARTVGVLNVGMPRLPLIELFGGYGFARLDNGAGSTTNVNGVLGSFGWNVKPWLQLIADSSYSRTTISGTKNVLYGNHWGVRYFRRLRHSWGAAPFVEGLIGGSRADVTVSGASGYSTSTNCVSYKVGGGLDFHPLRHFDIRVVDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE
Ga0242654_1010060313300022726SoilMPLIELFGGYSFARFDNGTGYSASNLNGAMGSFGYNFKPWLQIVGDTSYNFVTTNGVKTVIYGNHYGARYFYHKQNRWHITPFVEGLVGGSRADATVSGTGGYKTSSNCISYKAGGGLDFHPSRRWEIRLLNVDYYRTSFGTNLHQNNYWASAGVVLRLFGGAAAE
Ga0209237_126524613300026297Grasslands SoilETRSTWERRNPGGVRRTPLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRL
Ga0209238_103068423300026301Grasslands SoilVRSTWERRNPGGVRLTPLIEVFGGYAFARLDGGGGYWTNMTIGGMGSVGWNFKPWLQIVADSSYSTVTISGTKNVLYGNHFGPRYFYHSRNRWGITPFVEGLVGGSRADTTVPGVGGYTASANCISYRVGGGVDLHPSRRWEIRLLDVDYYRTSFGTNLHQNNYWASAGIVLRLFGGSSD
Ga0209469_113526513300026307SoilPLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0209239_110194523300026310Grasslands SoilPPAIRPVETRSTWERRNPGGVRRTPLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0209472_122216813300026323SoilRNPGGVRRTPLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0209804_103749133300026335SoilLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRFA
Ga0257179_102829613300026371SoilAAQPQPTWGQPAPEQPAPAEAAAPAIRPADSRSGWERPRPSARTAPLMEAYGGYAFARLVSGGTGTNLNGLMGSFGYNIRPWFQLVADSSYSFVTTNGIKNVLYGNHFGPRFFRRGRNRWGATPFFEGLVGGSRADTTVSGVGGYKTSQNCFSIKVGGGIEIHPSRHVDIRLFDVDYYRTSFGTNLHQNNYWASTGIVLRLFGGGSE
Ga0257181_105686913300026499SoilGQPAPAQPAPEQPAPAEAAAPAIRPADSRSGWERPRPSARTAPLMEAYGGYAFARLVSGGTGTNLNGLMGSFGYNIRPWFQLVADSSFSFVTTNGTKNVLYGNHFGPRFFRRGRNRWGATPFFEGLVGGSRADTTVSGVGGYKTSQNCFSIKVGGGIEIHPSRHVDIRLFDVDYYRTSFGTNLHQNNYWASTGIVLRLFGGGSE
Ga0209808_117207813300026523SoilVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0209056_1039297723300026538SoilVRSTWERRNPGGVRLTPLIEVFGGYAFARLDGGAGYWTNMTIGGMGSVGWNFKPWLQIVADSSYSTVTISGTKNVLYGNHFGPRYFYHSRNRWGITPFVEGLVGGSRADTTVPGVGGYTASANCISYRVGGGVDLHPSRRWEIRLLDVDYYRTSFGT
Ga0209474_1055471013300026550SoilVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0209577_1031910913300026552SoilVETRSTWERRNPGGVRRTPLIEVFGGYAFARIDSGAGYWTNMNIGGMGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0179593_101687423300026555Vadose Zone SoilLLEFYGGYAFARLVGSAGTATNLNGAMGSFGWNVKPWLQIVGDSSYSLVTVGTTKNVLYGNHFGPRYFHRSRNRWGLTPFVEALVGGSRADTTVSGTGGYKTSDNCLSYKAGGGLDVHPSRHIDIRLFDVDYYRTAFGANLHQNNYWASAGIVLRLFGGGSE
Ga0179587_1040978813300026557Vadose Zone SoilKPNPVARTVGVRNVGMPRLPLIELFGGYGYARLNNGAGYVTNSNGVLGSFGWNIKPWLQVIADSSYNRVTVSGTKNVLYGNHWGVRYFHRLRHSWGAAPFVEGLIGGSRADTTVSGAGGYSTSNIGMSYKVGGGVDIHPLRHFEIRVIDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE
Ga0179587_1105506513300026557Vadose Zone SoilMTTTVVVQPERSASKQPEPAPAEPPAPAIRPANDRTSWEKPNPGLRRRAPLLEFYGGYAFARLAGSGGTATNLNGAMASFGWNLKPWLQLVADSSYSVVTVTGTKNVLYGNHFGPRYFHRSRNRWGLTPFVEALVGGSRADTTVSGTGGYKTSDNCLSYKAGGGLDVHPSRHIDIRLF
Ga0209733_100074113300027591Forest SoilRWEKPNRGTHRAAVLEFFGGYEFARLNNGAGTFTNLNGALGSFGWNVRPWLELVADTSYNIVTVSGTKNVLYGNHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKFGGGLDIRPARYARHFKIRLFDFDYYRTSFGTNLHQNNYSVSTGIVLRLFGGGSE
Ga0209689_110965833300027748SoilGSVGWNFKPWLQLVGDSSYSTVTISGTKNVLYGNHFGARYFHRRLSRWGATPFVEALAGGSRADTIVSGVGGYTASANCVSYKVGGGVDLHPSRHWEIRLFDVDYYRTAFGTNGHENNYWASAGIVLRLFGGSSDY
Ga0209274_1027881513300027853SoilNHYGPRYYYRGLGRLHITPFAEAFIGGSRADVTASGSTTSQNCISYKIGGGIDYRASRRWEIRLFDFDYYRTSFGTNAHQTNYWASTGVVLRLFGGSE
Ga0209526_1022342923300028047Forest SoilPEAAPPVRPAIRAVKEKERWEKPNRGTHRAAVLEFFGGYEFARLNDGAGTFTNLNGALGSFGWNVRPWLELVADTSYNFVTVGTTKNVLYGNHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKFGGGLDIRPARYLRHFKIRLFDFDYYRTSFGTNLHQNNYSISTGIVLRLFGAGSE
Ga0209526_1048782913300028047Forest SoilFGWNFRPWLQLVADSSYSMVTISGTKNVLYGNHWGPRLFLHGRYPLGAVPFVEGLVGGSRADTTVSGVGGYKTSSNCLSFKVGGGVDIHPSRHFKIRLFDFDYYRTAFGTNLHQNSYSASAGIVLRLFGGGAE
Ga0137415_1034436923300028536Vadose Zone SoilMNIGGMGSVGWNFKPWLQLVADSSYSVVTISGTKNVLYGNHFGPRYFHRRLSRWGATPFVEALAGGSRADTTITGVGGYTTSTNCISYRVGGGLDLHPSRHWEIRLFDVDYYRTAFGTNVHQNNYWASAGIVLRLFGGSSDY
Ga0257175_107075113300028673SoilMEAYGGYAFARLVSGGTGTNLNGLMGSFGYNIRPWFQLVADSSYSFVTTNGTKNVLYGNHFGPRFFRRGRNRWGATPFVEALVGGSRADTTVSGVGGYKISENCFSIKVGGGIEIHPSRHVDIRLFDVDYYRT
Ga0073994_1003716213300030991SoilLELYGGYAFARLVSGGAASNLNGALGSFGWNVKPWLQVVGDSSYSAVTSAGTKNVLYGNHYGPRFFRRVRNRWGATPFVEALVGMSRSDTTITGSGGYTTSQNALSYKAGGGVDIHPSRHIEIRLFDFDYYRTSFGVNLHQNNYWVSTGIVIRLFGGGSE
Ga0170834_10710042413300031057Forest SoilWLQIVGDSSYNFVTISGTKNILYGNHFGARYFYRKHSRWGITPFAEALVGGSRADATVSGAGGYTASVNCLSYKVGGGFDIHPSRHWEIRVLDFDYYRTSFGTNVHQNNYWASTGIVLRLFGGAE
Ga0265340_1031455613300031247RhizosphereGASGYSGTNFNGALGSFGVNVRPWLQLVGDSSYNFVTVSGTKNVLYANHYGPRFYYRGLHRLNITPFAEAFVGGSRSDVTVSAYTTSQNCISFKVGGGIDYRASRRWEVRAIDFDYYRTSFGTNTQQTNYSISAGVVLRLFGGSNE
Ga0310686_10444069833300031708SoilMELYGGFAFARLVGGGSSTNFNGALGSFGWNFKPWLQIVGDTSYSFVTVSGTKNVLYGNRYGPRYYYRSRNRWNVTPFAEAFVGGSRSDTTVSGSSGYNTSQNSISYKVGGGIDFRPSRRWEIRLFDVDYYRTSFGTNAHQNNYWVSTGIVLRLFGGRSE
Ga0307476_1124004313300031715Hardwood Forest SoilRAQPQPAPAQPALPAIRPADTRETWEKPLPSVRKFPLIELYGGFAFARLVSGGSSTNFNGALGSFGWNFKPWLQIVGDTSYSFVTVSGTKNVLYGNHYGPRYYYRSRNRWNVTPFAEAFVGGSRSDTTVSGIGGYTTSENCISFKVGGGIDFRPSRRWEIRLFDVDYYRTAFGTNAHQNN
Ga0307469_1001323153300031720Hardwood Forest SoilMAGGGSGSNLIGALGSFGWNIKPWLQIVGDSSYNTVTYSGTKNVLYGNHYGPRYFYRTQNRWHLTPFVEGLVGASRADSTITGPGGYTTSVNGLSFKFGGGVDYRPSRRLEIRLFDVDYYRTSFGTNVYQTNYWISSGIVIRLFGGNSD
Ga0307469_1146532023300031720Hardwood Forest SoilGSVGWNFKPWLQVVADSSYSVVTISGTKNVLYGNHFGPRYFYRTRNRWGAIPFVEALIGGSRADTTFPAGGGTFSQNCMSYKVGGGLDLHPSRHWEIRVFDVDYYRTAFGTNMHQNNYWASAGIVLRLFGGSSDY
Ga0307469_1177344013300031720Hardwood Forest SoilRAASARESWEKPNAGVRTIGPRRLPLIELFGGYAFARLDGGGGTWSNLNGVLGSFGWNVKPWLQIVGDSSYDFVTVSGTKTVIWGNHYGPRLFGRMRNRWGITPFVEGFVGGSRADVTVSGTGGYATSVNSISYKVGGGFDLKPSRRFEIRLLDVDYYRTSFGTNLHQNNYWASAGIVIRLFGGKSE
Ga0307477_1016258623300031753Hardwood Forest SoilFGPRYFHRGRSRWGATPFVEALVGGSRADITITGAGGYTTSVNCLSYKVGGGLDLRPSRHWEIRVFDVDYYRTAFGTNMHQNNYWASAGVVLRLFGGSSDY
Ga0307475_1017388633300031754Hardwood Forest SoilNRGTHRAAVLEFFGGYEFARLNDGAGTFTNLNGALGSFGWNVRPWLELVADTSYNFITVGTTKNVLYGNHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKFGGGLDIRPARYLRHFKIRLFDFDYYRTSFGTNLHQNNYSVSTGIVLRLFGGGSE
Ga0307475_1056699923300031754Hardwood Forest SoilELFGGYAFARLDGGAGTWTNMNLGGMGSVAWNFKPWLQLVADSSYSVVTISGVKNVLYGNHFGPRYFHRRLSRWGATPFVEALAGGSRADTTVTGVGGYTTSTNCISYKVGGGLDLHPSRHWEIRLFDVDYYRTAFGTNVHQNNYWASAGIVLRLFGGSSDY
Ga0307473_1091235713300031820Hardwood Forest SoilNGFMGSIGWNLKPWLQIVADSSYNTTTISGTKNILYGNHFGPRYFHRGHNRWGLTPFAEALIGGSRADTKVSGSSIYNTSQNCMSFKVGGGVDIHPSRHIDIRLFDANYYRTSFGTNANQNNYWISTGIVIRLFSGGAE
Ga0307478_1069257213300031823Hardwood Forest SoilLELVADTSYNFITVGTTKNVLYGNHWGPRFFYHSRFPWGATPFVEALVGGSRADTTISGTGGYKTSDIGISYKFGGGLDIRPARYLRHFKIRLFDFDYYRTSFGTNLHQNNYSVSTGIVLRLFGGGSE
Ga0307478_1132140813300031823Hardwood Forest SoilAEPAAPAIRPVNDHTTWEKPNPGARGRAPLIELYGGYAFTRLDGGAGTGTNLNGVMGSFGWNVKPWLQIVADTSYSTMTVGTTKNILYGNHFGPRYFHRSRNRWGATPFVEGLIGGSRADTTVSGTGGYTTSQNCLSYKIGGGVDIHPSRRLDVRLFDVDYYRTAFGTNLHQNNYWASAGIVLRLFGGANSE
Ga0307478_1169065813300031823Hardwood Forest SoilQLIADSSYSRTTISGTKNVLYGNHWGVRYFRRLRHSWGAAPFVEGLIGGSRADVTVSGASGYSTSTNCVSYKVGGGLDFHPLRHFDIRVVDFDYYRTSFGTNLHQNNYFISTGIVMRLFGRSEE
Ga0307479_1002472333300031962Hardwood Forest SoilLLELYGGYSFARLAGTAGAASNLNGAMGSFGWNIKPWLQIVADTSYNVVTASATKNVLYGNHYGFRYLHRSRNRWGLTPFVEGLVGGSRADTTVSGTGGYTTSTNCLSYKAGGGVDVHPSRRLDIRLFDVDYYRTAFSANLHQNNYWASAGIVLRLFGGGSE
Ga0307479_1028616223300031962Hardwood Forest SoilAAAVRPANDRTSWEKPNPGLRRRAPLLEFFGGYAFARMAGAAGTATNLSGGMGSFGWNIKPWLQILGDSSYSVVTASGAKNVLYGNHFGPRYFHRGHNRWGLTPFVEALVGGSRSDTTVSGTSGYKYSDNCMSYKAGGGLDVHPSRHIDIRLFDVDYYRTAFGTNAHQNNYWASAGIVIRLFGGGSE
Ga0307479_1138477913300031962Hardwood Forest SoilVVADSSYNVVTVGNTKNVLYGNHFGPRYFHRGRSRWGATPFVEALVGGSRADITITGAGGYTTSVNCLSYKVGGGLDLRPSRHWEIRVFDVDYYRTAFGTNMHQNNYWASAGVVLRLFGGSSDY
Ga0307471_10008263623300032180Hardwood Forest SoilMPLIELFGGYSFARLDGGGGAYSNLNGFLGSFGWNVKPWLQIVADTSYNVVTISGTKNVLYGNHYGPRIFSRVRNRWGIIPFVEGLVGGSRADTTVSGAGGYTTSVNTISYKVGGGLDMKPSRRVEIRLINFDYYRTSFGTNLQQNNYWASAGIVIRLFTGSE
Ga0307471_10024941113300032180Hardwood Forest SoilPAPAQPASAEPPAPAIRPADTRTSWEKPRPGGVRRVPLLQLFGGYAFARLDGGGGTGTNVNGFMGSIGWNLKPWLQIVADTSYNTITISGTKNILYGNHFGPRYFHRGRNRWGLTPFAEALVGGSRADTKVSGSPTYNTSQNCMSFKVGGGVDIHPSRHIDIRLFDANYYRTSFGTNANQNNYWISTGIVIRLFSGGAE
Ga0307472_10042979913300032205Hardwood Forest SoilVPLLQLFGGYAFARLDGGGGTGTNVNGFMGSIGWNLKPWLQIVADSSYNTVTISGTKNILYGNHFGPRYFHRGHNRWGLTPFAEALIGGSRADTKVSGSSIYNTSQNCMSFKVGGGVDIHPSRHIDIRLFDANYYRTSFGTNANQNNYWISTGIVIRLFSGGAE
Ga0307472_10245051023300032205Hardwood Forest SoilVVADSSYSVVTISGTKNVLYGNHFGPRYFHRSRNRWGATPFVEALVGGSRADTTVTGAGGYTTSVNCLSYKVGGGLDLRPSRHWEIRVFNVDYYRTAFGTNLHQNNYWASAGIVLRLFGGSSDY


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.