NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F100852



Overview

Basic Information
Family ID F100852
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 191 residues
Representative Sequence APAELRRETDVAPISTLRGQGFIGAVSGTGRRVAYWVTTDGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSFDRTGILAVVESSGRSGTAEPPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVSRLVGACVYGADGMAIAWAVVGEDALSARVAMDGGIPAITIRASGKDVLGVLK
Number of Associated Samples 79
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.98 %
% of genes from short scaffolds (< 2000 bps) 0.98 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.74


Most Common Taxonomy
Group Unclassified (99.020 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (27.451 % of family members)
Environment Ontology (ENVO) Unclassified (49.020 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (62.745 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 1.40%, β-sheet: 49.53%, Coil/Unstructured: 49.07%

Predicted 3D Structure

pTM-score: 0.74

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
b.68.7.1: Tricorn protease N-terminal domain | d1k32a2 | 1k32 | 0.74916
b.69.8.0: automated matches | d4wjka_ | 4wjk | 0.7453
b.67.2.2: Levansucrase | d1oyga_ | 1oyg | 0.73857
b.68.4.1: TolB, C-terminal domain | d2hqsa1 | 2hqs | 0.73628
b.69.4.0: automated matches | d4ggca_ | 4ggc | 0.72977



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF00929 | RNase_T | 3.92
PF06733 | DEAD_2 | 0.98
PF01906 | YbjQ_1 | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG1199 | Rad3-related DNA helicase DinG | Replication, recombination and repair [L] | 1.96
COG0393 | Uncharacterized pentameric protein YbjQ, UPF0145 family | Function unknown [S] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.02 %
All Organisms | root | All Organisms | 0.98 %


Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
3300005556|Ga0066707_10824229 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae → Rhodospirillum → Rhodospirillum rubrum | 572 | Open in IMG/M




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 27.45%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.47%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 12.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.76%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 8.82%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.94%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.96%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.96%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.96%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.98%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.98%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.98%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.98%
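
The habitat percentages above can be read back as whole numbers of family members. The following is a minimal sketch, assuming the denominator is the 102 sequences reported under Basic Information; the names `FAMILY_SIZE` and `percent_to_count` are illustrative and not part of NMPFamsDB.

```python
# Family size taken from "Number of Sequences 102" on this page (assumed denominator).
FAMILY_SIZE = 102

# Distribution values of the habitat table, in page order.
habitat_percentages = [
    27.45, 26.47, 12.75, 11.76, 8.82, 2.94,
    1.96, 1.96, 1.96, 0.98, 0.98, 0.98, 0.98,
]


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole number of members whose share rounds to `percent`."""
    return round(percent * total / 100)


counts = [percent_to_count(p) for p in habitat_percentages]
print(counts)       # member count per habitat row
print(sum(counts))  # should equal FAMILY_SIZE if the rows cover the whole family
```

Under this assumption the smallest reported share, 0.98 %, corresponds to a single family member, which is also how the 0.98 % values in the Quality Assessment and Gene Neighborhood sections can be read.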




Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental | Open in IMG/M
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental | Open in IMG/M
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental | Open in IMG/M
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental | Open in IMG/M
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental | Open in IMG/M
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental | Open in IMG/M
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental | Open in IMG/M
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental | Open in IMG/M
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental | Open in IMG/M
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental | Open in IMG/M
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated | Open in IMG/M
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental | Open in IMG/M
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental | Open in IMG/M
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental | Open in IMG/M
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental | Open in IMG/M
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental | Open in IMG/M
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental | Open in IMG/M
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental | Open in IMG/M
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental | Open in IMG/M
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental | Open in IMG/M
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental | Open in IMG/M
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental | Open in IMG/M
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental | Open in IMG/M
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental | Open in IMG/M
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental | Open in IMG/M
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental | Open in IMG/M
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental | Open in IMG/M
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental | Open in IMG/M
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental | Open in IMG/M
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental | Open in IMG/M
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental | Open in IMG/M
3300023058 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2m1 | Environmental | Open in IMG/M
3300026295 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes) | Environmental | Open in IMG/M
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental | Open in IMG/M
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental | Open in IMG/M
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental | Open in IMG/M
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental | Open in IMG/M
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental | Open in IMG/M
3300027277 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes) | Environmental | Open in IMG/M
3300027587 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes) | Environmental | Open in IMG/M
3300027633 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M1 (SPAdes) | Environmental | Open in IMG/M
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental | Open in IMG/M
3300028716 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198 | Environmental | Open in IMG/M
3300028722 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_368 | Environmental | Open in IMG/M
3300028791 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144 | Environmental | Open in IMG/M
3300028793 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159 | Environmental | Open in IMG/M
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental | Open in IMG/M
3300028824 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197 | Environmental | Open in IMG/M
3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental | Open in IMG/M
3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental | Open in IMG/M
3300028885 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185 | Environmental | Open in IMG/M
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M





Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066680_1088696313300005174SoilPRRSPTPPPQREGLLLSETRGFIALEGETTPARIRRETDATAVAPLRGQGFIGAVSGTGRRVAYWVTSNGATRELRVFDVAAPDQDTSIATVLETERGAGAVWSTDRTGLLVAIGSSGRAGTGEAPGQFSALRVVDTPTRTIHEIARLSDGTNFWPVGWDRDARLTGACVASADGD
Ga0066678_1091252413300005181SoilDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGPDATGIAWVVVGEDALSSRVPMEAGIPAITIRASGNDVLGVLNASVLRVWTLASYNDHREFGAASGERIGFARWKPGADEIVVLVADRVEIWPKAGGARRVV
Ga0066676_1086829713300005186SoilAPISALRGEGFIGAVSGTGRRVAYWVSRSGSSTADGATRELRVFDVTAPDQDTTLATVLDTERGTAAVWSSDRTGIVAVVESSAPAGTADTPGSFSALRVVDTPTRSIHEISRLTDGSQYWPVAWDRVTRLVGACVYGGADAMGIAWVVDGEDGLSSRVPMEGGIPAITIRANGNDVLGILNGSVIRVWTIASYNEHREFGA
Ga0066686_1094982213300005446SoilLASFALGSAKGPEVQVAAAPTVRPSPPAILRPAGPVLSESRGFIGLGAPDAAATVRRETDAAPLGSLRGQGFIGAVSGSGRRVAYWVALNGVTQELRVFDVTAPDQQTPLTTVLTAERGAAVVWSADRTGLLLVVESSARAGGGDEAGPFSALRVLDAPTRVLHEIARLSDGSQFWPVAWDRDSRV
Ga0070707_10197066213300005468Corn, Switchgrass And Miscanthus RhizosphereTAKAAPAQLRRETDGAPISELRGQGFIGAVSGTGRRVAYWVTSDAATRELRVFDVTAPDQDTSIAAIPDTERGAAAVWSSDRAGILAVVESSGLPGSAEAPGPFSALRVVDTPTRSIHEVSRLTDGSQFWPGGWDRVSRLIGACVYGADGMGIAWAVVGEDAVSARVPMDEGIPALTILASGS
Ga0070707_10225823513300005468Corn, Switchgrass And Miscanthus RhizosphereTPPTPATGSLPVLQSGPLLSDSRGFIAVPAKNAPAQLRRETDAIATSELRGQGFIGAVSGTGKRVAYWVTSDGATRELRVFDVTAPDQDTSLATVLDTERGAAAVWSSDRTGILAVVESSGRVGTAEAPGPFSALRVVDTPTRSIHEVSRLTDGSQYWPAGWDRVSRLVG
Ga0066697_1012436513300005540SoilMPAELRRETDAGAISQLRGQGFIGAVSGTGRRVAYWVTAEAATRELRVFDVTAPDQDTSLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYGGDGMGIAWAVVGEDALSARVPMDGGIPARTILVNGNDVLGVLNATVI
Ga0066697_1053250113300005540SoilRRVAYWVTSDGATRELRVFDVTAPDQDTSLATVADTERGAAAVWSADRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGTGIAWAVVGEDAVGARVPMDDGIPALTIVANGNDVLGVMKSTVIRVWTLASYTQHLEFGAVSGERISFARWRPSADDIVVLVADRLELWPKGGGDRRVMTRGLP
Ga0066697_1070974313300005540SoilATPAGPLLSDSRGFIALPGKAAAAELRRETHAASITSLRGQGFVGAVSGTGRRVAYWVSMTADGAASELRVFDVTAPDQDTTLAAVLDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGPDATGIAWVVVGEDA
Ga0066707_1062997813300005556SoilLPGKAAAAELRRETDAAPVSSLRGQGFVGAVSGTGRRVAYWVSRTADGAASELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGPDATGIAWVVVGEDALSSRVPMEAGIPAITIRASGNDVLGVLNGSVLRVWTLASYNDHREFGAASGERIGFARWK
Ga0066707_1082422913300005556SoilSEAPPTPTPATRPSASAPAIQPGPLLSDSRGFIALPAKGAPAQLRRESDPAPFSELRGQGFIGAVSGTGRRVAYWLTSEGATRELRVFDVTAPDQDTSLATVADTERGAAAVWSADRTGVLAVVESSGRAGSAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRASRLAGACVYGADGMGIAW
Ga0066704_1046252213300005557SoilVAYWVTSEGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSVDRTGIVVVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPVGWDRVSRLVGACVYGPDGMGIGWAVVGEDALSARVPMDGGIPALTIFAAGNDVLGVQNASVIRVW
Ga0066704_1092404013300005557SoilAAATVRRETDAAPAGSLSGEGFVGAVTGSGRRVAYWVSLKNGATQELRVLDTNAPDQQTPLTTVLAAERGAAAVWSADRTGLLLVVESSGRSGGGEDPGTFSALRVLDAPTRVIHEIARLSDGSQFWPVAWDRDSRLTGACITAADGSAVAYAVIGEDALSARIPMEAGIPARTVQSSG
Ga0066698_1007402223300005558SoilVPTVTATTPATPLGPVLSDSRGFVALPGSAAPAELRRETDVTPISGLRGQGFIGAVSGTGRRVAYWVTTDGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSSDRTGILAVVESSGRSGTAEPPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRVSRLVGACVYGADGMGIAWAVVGEDALSARIAMDAGIPAITIRASGNDVLGVLKE
Ga0066699_1082128613300005561SoilSVALRSEAPPTPTPSTRPSGSAPAIQPGPVLSDSRGFIALPAKAAPAQLRRETDPAPFSELRGQGFIGAVSGTGRRVAYWVTSDGATRELRVFDVTAPDQDTSLATVLDTERGAAAVWSADRTGVLAVVESSGRAGSAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGMGISWAVVGEDAVGARVPMDDGIPALT
Ga0066699_1106788913300005561SoilTGRRVAYWVTSNGGTQELRVFDVAAPDQDTSIATVLETERGAGAVWSTDRTGLLLAIGSSGRAGTGEAPGQFSALRVVDTPTRSIHEIARLTDGTSFWPVGWDRDARLTGACVASADGNAVAYAVIGEDALSARVPMDSGIPARTVESSGSAVLGIMKESVIRVWSIASYNEHRELGAPSGERIA
Ga0066691_1049212913300005586SoilLLSDSRGFIATPGKTSPAELRRETDAAPVSELRGQGFIGAVSGTGRRVAYWVTSDGATRELRVLDVTAPDQDTSLATVLETERGAAAVWSADRTGIVVVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPVGWDRVSRLVGACVYGPDGVGIGWAVVGEDALSARVPMDGGIPALTIFAAGNDVLGVQNASVIRVWTIASYTQHLEFGAAAGERIAFARWRPGTDDIIVLIA
Ga0066656_1008968123300006034SoilVPTVTATAPATPLGPVLSDSRGFVALPGGAAPAELRRETDVAPISGLRGQGFIGAVSGTGRRVAYWVTTDGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSSDRTGILAVVESSGRSGTAEPPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRVSRLVGACVYGADGIGIAWAVVGEDALSARIAMDAGIPAITIRASGNDVLGVLK
Ga0066656_1092481813300006034SoilAPAELRRETDVAPISTLRGQGFIGAVSGTGRRVAYWVTTDGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSFDRTGILAVVESSGRSGTAEPPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVSRLVGACVYGADGMAIAWAVVGEDALSARVAMDGGIPAITIRASGKDVLGVLK
Ga0066652_10190714313300006046SoilLRRETDPAPISTLRGQGFIGAVSGTGRRVAYWVSTTADGATRELRVLDVTAPDQDTTLATLLDTERGAAAVWSSDRTGVVAVVESSGQPGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACAYGGADATGIAWVIVGEDALSSRVPMENGIPAVTIRANG
Ga0066653_1064225913300006791SoilLRGLDVTAPDQDTTLATLLDTERGAAAVWSSDRTGVVAVVESSGQPGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGADATGIAWVVVGEDALSSRVPMEAGIPAITIRASGNDVLGVLNGSVLRVWTLASYNDHREFGAASGERIGFARWKPGTDEI
Ga0066665_1130987113300006796SoilSGPSLPAELRRETDPVAIDQLRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYGADGMGIAWAVVGEDALSARVPMDGGIPARTIVASGNDV
Ga0066659_1193788813300006797SoilSGPLLSESRGFIALSGPNLPAELRRETDPVAIDQLRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYGADGMG
Ga0079221_1160229413300006804Agricultural SoilASVRRETDAAPIASLRGQGFIGTISGTGRRVAYWVATASGTRELRVFDATAPDQDTAMLSVLPGERGASAVWSVDRTGLLLVVEAAGQADQSGPFSALRVLDAPTRSVHEIARLVDGSQFWPVGWDRDSRLTGSCVVGPDGGAIEYAVIGEDAISARTPMDPGIPAKTVRSGGTA
Ga0075425_10275146413300006854Populus RhizosphereFIGAVSGTGRRVAYWASGSSARDGAARELRVFDVTAADQDTLLATLPEAERGALVVWSSDRSGILVVVESSGKPDSTGAPGPFSALRVVDVPTRSVREIARVSDGSQLWPVGWDRVARVAGACVYRSDGMAIAWSVVGEDTLSARVPMDEGIPAKTVRGNGTGVLGVQNESVIRVWTLAS
Ga0066710_10409276113300009012Grasslands SoilREGALLSDTRGFIALEGETSPAKIRRETEATAVAPLRGQGFVGAVSGTGRRVAYWVTSNGGTQELRVFDVAAPDQDTSITTVLETERGAGAVWSTDRTGLLVAIGASGRAGTGEAPGQFSALRVVDTPTRSIHEIARLTDGTSFWPVGWDRDARLTGACVASADGDAIAYTVIGEDALSA
Ga0099828_1204256213300009089Vadose Zone SoilIALTAKAAPAQLRRETDGAPISALRGQGFIGTVSGTGRRVAYWVTSDAATRELRVFDVTAPDQDTSIAAIPDTERGAAAVWSSDRTGLLAVVESSGLPGSAEAPGPFSALRVVDTPTRSIHEVSRLTDGSQFWPGGWDRVSRLIGACVYGADGMGIAWTVVGEDAV
Ga0099827_1128364713300009090Vadose Zone SoilVGALLSDSRGFVALPASTAPAELRRETDVAPISTLRGQGFLGAVSGTGRRVAYWVSSTTDGGTRELRVFDVTAPDQDTTLATVLETERGAAAVWSSDRTGILAVVESSGKAGTAEAPGPFSALRVVDTPTRSVHEISRITDGSQYWPAGWDRVSRLVGACVYGADGFGIAWAVVGEDALSARVPMDGGIPAITI
Ga0066709_10282275813300009137Grasslands SoilGAVSGTGRRVAYWVTSDGATRELRVFDVTAPDQDTSLATVADTERGAAAVWSADRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGTGIAWAVVGEDAVGARVPMDDGIPALTIVANGNDVLGVMKSSVIRVWTLASYTQHLEFGAVSGERISFARWRPSADDIVVLVADRLELWPK
Ga0066709_10444099413300009137Grasslands SoilAPDQDTSLATVQETERGVRAVWSTDRTGLLVAIGSSGRAGTGETPGQFSALRVVDTPTRSIHEIARLTDGTIYWPVGWDRDARLTAACVASADGEAITYAVVGEDALSARVPMDPGIPARTIESSGNAVLGVMKESVIRIWSIASYNEHRELAAPSGERIALARWKPGGS
Ga0114129_1295602113300009147Populus RhizosphereLRVFDVTAPDQDTSLATVFDTERGAAAVWSSDRTGILAVVESSGRVGTAEAPGPFSALRVVDTPTRSIHEVSRLTDGSQYWPAGWDRASRLVGACVYGADGMGIAWAVVGEDAVAARDPMDEGIPALSIVASGNDVLGVQNGSVIRVWTLASYTQHLEFGAASGERIAAARWRPGTDEIAVSVADR
Ga0075423_1250338513300009162Populus RhizosphereRELRVFDVTAPDQDTSLATVFDTERGAAAVWSSDRTGILAVVESSGRVGTAEAPGPFSALRVVDTPTRSIHEVARLTDGSQYWPAGWDRASRLVGACVYGADGMGIAWAVVGEDAVAARDPMDEGIPALSIVASGNDVLGVQNGSVIRVWTLASYTQHLEFGAASGERIAAARWRPGTDEIAVSVAD
Ga0134088_1040607813300010304Grasslands SoilGESVATPTPSASSPTTVAPAQAAPFLSDARGFIALATETAPAVLRREVDAEPIGTLRGRGFIGAVSGTGRRVAYWLTVGDATRELRVFDVTAPDQDTSLTTLGETERGAAVAWSADRTGLLVVVESNGAGTAEAPGPFSALRVVDNPTRSIREIARVTDSSQFLPIGWDRTSRVAVACVYLNDGRAIAWAIFGENGISARVPMDEGIPVSTVRASGGDVLG
Ga0134067_1025792513300010321Grasslands SoilAELRRETDPVAIDQLRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYGADGMGIAWAVVGEDALSARVPMDGGIPARTIVASGNDVLGVLNATVIRVWTLASYTQHLEFGAVSGERISFARWRPTT
Ga0134067_1037789413300010321Grasslands SoilERGAAAVWSADRTGVLAVVESSGRAGSAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGMGIAWAVVGEDAVGARVPMDDGIPALTIVANGNDVLGVIKSSVVRVWTLASYTQHLEFGAVSGERISFARWRPSADEIVVLVADRLELWPKGGGDRRVMTRGLPQAK
Ga0134086_1037855413300010323Grasslands SoilLRRETDSGAINQLRGQGFIGAVSGTGRRVAYWVTADAATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYGADGMGIVWAVVGEDALSARVPMDGGIPARTIVASGNDVLGVLNATVIRV
Ga0134086_1043959413300010323Grasslands SoilSVATPTPSASSPTTVAPAQAAPFLSDARGFIALATETAPAVLRREVDAEPIGTLRGRGFIGAVSGTGRRVAYWLTVGDATRELRVFDVTAPDQDTSLTTLGETERGAAVAWSADRTGLLVVVESNGAGTAEAPGPFSALRVVDNPTRSIREIARVTDSSQFLPIGWDRTSRVAVACV
Ga0134071_1078482413300010336Grasslands SoilVAPLRGQGFIGAVSGTGRRVAYWVTSNGATRELRVFDVAAPDQDTSIATLLETERGAGAVWSTDRTGLLVAIGSSGRAGTGEAPGQFSALRVVDTPTRTIHEIARLSDGTNFWPVGWDRDARLTGACVASADGDAIEYAVIGEDALSARVPMDQGILARTVESSGSAVL
Ga0134062_1042911613300010337Grasslands SoilPSLPAELRRETDPVAIDQLRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYGADGMGIAWAVVGEDALSARVPMDGGIPARTIVASGNDVLGVLNATVIRVWTLASYTQHLEFGAVSGERISFA
Ga0137391_1111734313300011270Vadose Zone SoilESAPTPTPLATATIPATAPGPLLSDSRGFIAVPGSAAPAELRRETDPAPISVLRGQGFIGAVSSTGRRVAYWISSTTDGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGVVAVVESSAPGTGEAPGPFSALRVVDTPTRSIHEIARVTDGSHYSPAGWDRVAHRVGACVYGGADATAIAWVVVGEDGLSSRVPMESGIPASTI
Ga0137389_1107282613300012096Vadose Zone SoilSALRGEGFIGAVSGTGRRVAYWVSRFGSSPADGATRELRVFDVTAPDQDTTLATVLDTERDAAAVWSSDRTGIVAVVESSAPAGTAETPGSFSALRVVDTSTRSIHEISRLTDGSQYWRVGWDRVTRLVGACVYGGADAMGIAWVVVGEDGLSSRVPMEGGIPAITIRANGNDVLGILSGSVIRVWTLASYNEHREFGAASGERISFARWKPGADEIVALVTDRLEIWP
Ga0137389_1154097213300012096Vadose Zone SoilSGPLLSDSRGLIALTAKAAPAQLRRETDGAPISELRGQGFIGAVSGTGRRVAYWVTSDGATRELRVFDVTAPDQDTSIAAIPDTERGAAAVWSSDRTGLLAVVESSGLPGSAEAPGPFSALRVVDTPTRSIHEVSRLTDGSQFWPGGWDRVSRLIGACVYEADGMGIAWTVVGEDAVSARVPMDEG
Ga0137388_1114806413300012189Vadose Zone SoilVPGKAAPAELRHETDAAPISALRGEGFIGAVSGTGRRVAYWVSRSGSSTVDRATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSAPAGTAETPGSFSALRVVDTSTRSIHEISRLTDGSQYW
Ga0137364_1008853613300012198Vadose Zone SoilVSQSGPLLSDSRGFIAVPAKSAPAQLRRETDPAPLAELRGQGFIGAVSGTGRRVAYWVTSDGATRELRVFDVTAPDQDTTLATVVETERGAAAVWSSDRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLIGACVYTSDGMGITWAVVGEDAVSARVAMDGG
Ga0137383_1091698413300012199Vadose Zone SoilRRVAYWVTSDGATRELRVFDVTAPDQDTSLAIVLDTERGAAAVWSSDRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGMGIAWAVVGEDAVGARVPMDDGIPALTIVANGNDVLGVMKSSVVRVWTLASYTQHLEFGAVSGERISFARWRPSADEIVVLVADRLELWPKGGGDRR
Ga0137376_1129129613300012208Vadose Zone SoilGATRELRVFDVTAPDQDTSLATVADTERGAAAVWSSDRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGTGIAWAVVGEDAVGARVPMDDGIPALTIVANGNDVLGVMKSKVIRVWTLASYTQHLEFGAASGERISFARWRPSADDIVVLVADRLELWPKGGGDRRVMT
Ga0137376_1140471213300012208Vadose Zone SoilLRGQGFIGAVSGTGRRVAYWVSGSSTADSATRELRVFDVSAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVERAGTGDAAGPFSSLRLVDTPTRSIREISRLTDGSQYWPVGWDRVSHVVGACAYGGADATGIAWVVVGEDALSSRVPMEGGIPAISIRASGNDVLGVLNRSVVRVWTLASYATHGEFGAVPG
Ga0137376_1141060713300012208Vadose Zone SoilVALRGESAPTPTPLPLPTVTAPATPAGPLLSDSRGFIALPGKAAAAELRRETDAAPISSLRGQGFVGAVSGTGRRVAYWVSRTADGAASELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGADATGI
Ga0137370_1002790823300012285Vadose Zone SoilVSQPGPLLSDSRGFIAVPAKSAPAQLRRETDPAPLAELRGQGFIGAVSGTGRRVAYWVTSDGATRELRVFDVTAPDQDTTLATVVETERGAAAVWSSDRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLVGACVYTSDGMGITWAVVGEDAVSARVAMDGGIPALTIVAQGDDVLGVQNGSVIRVWTLASYTQH
Ga0137370_1094523913300012285Vadose Zone SoilGLVTVVESSGQTEGRVAPSPFSSLRVVDTPTRSIREISRLTDGSQYWPVGWDRVTRLVGACAYGGADATGIAWVVVGEDALSSRVPMESGIPAVTIRASGNDVLGILNGSVIRVWTLASYGEHREFGAAPGERISFARWKPGADEIVVLVADRLEIWPKAGGDRRIIAQGLPVASD
Ga0137387_1108195013300012349Vadose Zone SoilLRGESAPTPTPLPLPTVTTPATPAGPLLSDSRGFIALPGKAAAADLRRETDAAPISSLRGQGFVGAVSGTGRRVAYWVSRTADGAASELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGPDA
Ga0137386_1114731513300012351Vadose Zone SoilFIALPGKAAAANLRRETDAAPIGSLRGQGFVGAVSGTGRRVAYWVSRTADGAASELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGPDATGIAWVVVGEDALSSRVPMEAGI
Ga0137360_1123479813300012361Vadose Zone SoilPISALRGEGFIGAVSGTGRRVAYWVSRSGSSTVDRATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRSGIVAVVESSAPAGTAETPGSFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGADAMGIAWVVVGEDGLSSRVPMEGGIPAITIRANGNDVLGILSGSVIRVWTLASYNEHREFGAASGERISFARWKPG
Ga0137361_1132552513300012362Vadose Zone SoilSSTVDRATRELRVFDVTAPDQDTTLATVLDSERGAAAVWSSDRTGIVAVVESSAPAGTAETPGSFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGADAMGIAWVVVGEDGLSSRVPMEGGIPAITIRANGNDVLGILNASVIRVWTLASYNTHREFGAASGERISFARWKPGADEIVALVADRLEIWPKAGGERRIVAQG
Ga0137390_1161486913300012363Vadose Zone SoilQSGPLLSDSRGFIALTAKAAPAQLRRETDGAPISELRGQGFIGAVSGTGRRVAYWVTSDGATRELRVFDVTAPDQDTSIAAIPDTERGAAAVWSSDRTGLLAVVESSGLPGSAEAPGPFSALRVVDTPTRSIHEVSRLTDGSQFWPGGWDRVSRLIGACVYEADGMGIAWTVVGEDAVSARVPMDEGIPVLTILA
Ga0137396_109149961 3300012918 Vadose Zone Soil SSTADSATRELRVFDVSAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVERAGPGDAAGPLSSLRIVDTPTRSIREISRLTDGSQYWPVGWDRVSHMVGACAYGAADATAIGWVVVGEDALSSRVPMEGGIPAISIRASGNDVLGVLNQSVVRVWTLASYATHGEFGATPGERIAFARWRPGADEIVVSVADRLEIWPKAGGDRRIVARGLPS
Ga0137396_109907441 3300012918 Vadose Zone Soil SRGCSAGPANTAPAELRRETDVAPISTLRGQGFIGAVSGTGRRVAYRVSSTTDGGTRELRVFDFTAPDQDTTLATVLETERGAAAVWSSDRTGILAVVESSGKAGTAEAPGPFSALRVVDTPTRSVHEISRITDGSQYWPAGWDRVSRLVGACVYGADGFGIA*
Ga0137394_114886881 3300012922 Vadose Zone Soil PIGTLRGQGFIGAVSGTGRRVAYWVTTDGATRELRVLDVTAPDQDTSLATVLETERGAAAVWSSDRTGIVAVVESSGRPGTAEAPGTFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRGSRLVGACVYGADGLGIAWAVVGEDALSARVPMEHGIPAVTIRASGSDVLGVLNESVI
Ga0137419_112288941 3300012925 Vadose Zone Soil VPGNSIAAELRRETDAAPIGELRGRGFIGAVSGTGRRVAYWLTSEGATRELRVFDVTAADQDTSLATVLDIERGAGAVWSSDRTGILVVVESSGRAGTAEAPGPFSALRVVDTPTRPIHEISRVTDGSQYWPVGWDRVSRLVGACVYGADGMGIAWAVVGE
Ga0137419_113836301 3300012925 Vadose Zone Soil LRGESAPTPTPSALVSAVAPTTPGPLLSDSRGFIALPGSAAPAELRRETDPAPISTLRGQGFIGAVSGTGRRVAYWVTAEGATRELRVFDATAPDQDTSLATVQDTERGAAVVWSSDRTGLVTVVESTGRGATGEAPSPFSSLRIVDTPTRSIREISRLTDGSQYLPVGWDRITRLVGACAYGGADATGIAWVVVGE
Ga0137416_119347491 3300012927 Vadose Zone Soil APDQDTTLATVLDTERGAAAVWSSDRTGIVAVVERAGSGDAGGPLSSLRIVDTPTRSIREISRLTDGSQYWPVGWDRVSHMVGACAYGGADATAIGWVVVGEDALSSRVPMEGGIPAISIRASGNDVLGVLNQSVVRVWTLASYTTHGEFGAMPGERIAFARWKPGADEIVVSVADRLE
Ga0134087_104618411 3300012977 Grasslands Soil VLAVLLAVLANGSIALPGESVATPTPSASSPTTVAPAQAAPFLSDARGFIALATETAPAVLRREVDAEPIGTLRGRGFIGAVSGTGRRVAYWLTVGDATRELRVFDVTAPDQDTSLTTLGETERGAAVAWSADRTGLLVVVESNGAGTAEAPGPFSALRVVDNPTRSIREIARVTDSSQFLPIGWDRASRVAVACVYLNDGRAIAWAIFG
Ga0134075_105314881 3300014154 Grasslands Soil LLSDSRGFVALAASAAPAELRRETDVAPISTLRGQGFIGAVSGTGRRVAYWVTTDGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSFDRTGVLAVVESSGRSGTAEPPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVAWDRVSRLVGACVYGADGMGIAWAVVGEDALSA
Ga0134078_106070161 3300014157 Grasslands Soil GFIGAVSGTGRRVAYWVTSDGATRELRVFDVTAPDQDTSLATVADTERGAAAVWSSDRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGTGIAWAVVGEDAVGARVPMDDGIPALTIVANGNDVLGVMKGTLDSPLT
Ga0137418_101426021 3300015241 Vadose Zone Soil VAYWVTADGATRELRVLDVTAPDQDTSLATVLETERGAAAVWSSDRTGIVAVVESSGRPGTAEAPGTFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRGSRLVGACVYGADGLGIAWAVVGEDALSARVPMEHGIPAVTIRASGSDVLGVLNESVIRVWTLASYTEHREFGAPAGERIAFARWRPGSDAIVVLVADRLELWPKAGGDRRVVAQGL
Ga0137418_106372411 3300015241 Vadose Zone Soil VPGNSIAAELRRETDAAPIGELRGRGFIGAVSGTGRRVAYWLTSEGATRELRVFDVTAADQDTSLATVLDIERGAGAVWSSDRTGILVVVESSGRAGTAEAPGPFSALRVVDTPTRPIHEISRVTDGSQYWPVGWDRVSRLVGACVYGADGMGIAWAVVGEDALSARVPMDDGIPALTIRASGNDVLGVQNASVIRVWTLASYTQHLEFGAAAGERIAFARWRPGTDDI
Ga0134069_13736381 3300017654 Grasslands Soil TTDGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGADATGIAWVVVGEDALSSRVPMEAGIPAITIRASGNDVLGVLNGSVLRVWTLASYNDHREFGAA
Ga0134074_11191002 3300017657 Grasslands Soil VRGKDAHPVLFWIGTVLLLIVLLAVLAGGSVALRGESAPTPTPLPLPTVTAPATPAGPLLSDSRGFIALAGKDAPASLRRETDAAPITSLRGQGFIGAVSGTGRRVAYWVANASDTRELRVFDATAPDQDAAMLSLLQGERGASAVWSVDRTGLLLVVEANGTADQGSPFSALRVLDTPTRGVHEIARLVDGSQFWPVGWDRDSRLTGSCVVGADGAAIAYAVIGEDAISARTPMDEGIPAA
Ga0134074_14055851 3300017657 Grasslands Soil VTAPATPAGPLLSDSRGFIALPGKAAAAELRRETDAAPVSSLRGQGFVGAVSGTGRRVAYWVSRTADGAASELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIHEISRLSDGSQYWPVGWDRVTRLVGACV
Ga0184610_13035801 3300017997 Groundwater Sediment GAVSGTGRRVAYWVTVDGATRELRVFDVSAPDQDTSLATVLATERGAGAVWSSDRTGLVTVVESSARAGAGEAPAPFSALRVVDTPTRGIREISRLTDGSQYWAVGWDRVTRLVGACAYGGSDAMGIAWVVVGEDTLSSRVSMESGIPALTIRASGNDVLGVVNGSVLRVWTLAS
Ga0184604_102434151 3300018000 Groundwater Sediment GFIGAVSGTGRRVAYWVTVNGATRELRVFDVTAPDQETSLATVLDTERGAAAVWSADRTGVLAVVESSGRTGAAEAPGPFSALRVVDTPTRSVHELSRVTDGSQYWPAGWDRVSRLVGACVYGSDGMGIAWAVVGEDALSARVPMESGIPAITVRASGRDVIGVLNGSVVRVWTLASYTEHREFGAQAGERISFARWKPGADVVVVLVA
Ga0066655_110214861 3300018431 Grasslands Soil RGQWSVGAGSGTGRRVAYWVTSHGATRELRVFDVTAPDQDTSLATVVDTERGAAAVWSSDRTGVLAVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLAGACVYGSDGMGIAWAVVGEDAVGARVPMDDGIPALTIVANGNDVLGVIKSSVVRVWTLASYTQHLEFGAVS
Ga0066655_112116641 3300018431 Grasslands Soil TAPDQDTTLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYGADGMGIVWAVVGEDALSARVPMDGGIPARTIVASGNDVLGVLNATVIRVWTLASYTQHLEFGAVSGERISFAWWRPTTDEIAVLVA
Ga0066662_125513201 3300018468 Grasslands Soil ESRGFIALSGPNLPAELRRETDPVAIDQLRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGVVAVVESSAPGSGEAPGPFSSLRLVDAPTRSIHEIARVTDGSQYWPAGWDRVAHLVGACVYGGADAVGVAWVVVGEDALSSRAPMES
Ga0210382_105039071 3300021080 Groundwater Sediment LRGQSFIGAVSGTGRRVAYWVTVDGATRELRVFDVSAPDQDTSLATVLATERGAGAVWSSDRTGLVTVVESSARAGAGEAPAPFSALRVVDTPTRGIREISRLTDGSQYWAVGWDRVTRLVGACAYGGSDAMGIAWVVVGEDTLSSRVSMESGIPALTIRASGNDVLGVVNGSVLRVW
Ga0193714_10357801 3300023058 Soil TSVGPFLSDSRGFIAVPRVGAAAELRRETDAAPISALRGQGFVGGVSGTGRRVAYWVAGAEGATRELRVFDVTAPDQDTSLVTLPETERGAATVWSADRTGVVAVVESSGRAGTGDPPSPFSALRVVDTPTRSVHEISRLTDGSQYWPVGWDRVTRLVGACAYNGADAMGIAWVVVGEDTLSSRVPMESGIPAATIRASGNDVLGLRNAGVVRVWTLASYNDHREFGAAAGERIAFARWRPG
Ga0209234_12922821 3300026295 Grasslands Soil QLRRETDATPISVLRGQGFIGAVSGTGRRVAYWVTSDAATRELRVFDVTAPDQDTSLATLPETERGAAAVWSSDRTGVLAVVESSGRAGTAEGTGPFSALRVVDTPTRSIHEISRLTDGSQFWPAGWDRTSRLVGACVYGADGMGIAWAVVGEDAVSGRVPMEGGIPALSILA
Ga0209237_12374231 3300026297 Grasslands Soil AAPDQDTSIATVLETERGAGAVWSTDRTGLLVAIGSSGRAGTGEAPGQFSALRVVDTPTRSIHEIARLSDGTNFWPVGWDRDARLTGACVASADGDAIEYAVIGEDALSARVLMDQGIPARTVESSGSAVLGIMKGSVIRVWSIASYNEHRELGAASGERIAFARWRPGGAEILVSIADRLEIWPA
Ga0209761_11450331 3300026313 Grasslands Soil VAPLRGQGFVGAVSGTGRRVAYWVTSNGGTQELRVFDVAAPDQDTSITTVLETERGAGAVWSTDRTGLLVAIGASGRAGTGEAPGQFSALRVVDTPTRSIHEIARLTDGTSFWPVGWDRDARLTGACVASADGDAIAYTVIGEDALSARVPMDPGIPARTVASSGG
Ga0209471_12261711 3300026318 Soil LRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSVDRTGIVVVVESSGRAGTAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPVGWDRVSRLVGACVYGPDGMGIGWAVVGEDALSARVPMDGGIPALTIFAAGNDVLGVQNASVIRVWTIASYTQHLEFGAAAGERIAFARWRPGTDDIIVLIADRLELWPKGGGAR
Ga0209471_13270451 3300026318 Soil RGFIALAGKDAPASLRRETDAAPITSLHGQGFIGAVSGTGRRVAYWVANASDTRELRVFDATAPDQDAAMLSLLQGERGASAVWSVDRTGLLLVVEANGTADQGSPFSALRVLDTPTRGVHEIARLVDGSQFWPVGWDRDSRLTGSCVVGADGAAIAYAVIGEDAISA
Ga0209059_13051161 3300026527 Soil SLATVFDTDRGAAAVWSSDRTGILTVVESSAHPGTGEVPAPYSTLRVVDTPTRSIREVARVTDGTHYWPVGWDRAARLVDACVYGPDGMAIEWTVIGEDTVSKRLPMEGGIPAITVRASGDDVLGVLNKSVIRVWTLASYSEHREFGATSGERIAFARWKPGADEIVVLVADRLELWPK
Ga0209059_13124961 3300026527 Soil SPTPSGRPSPTVAPSPSGPLLSESRGFIALSGPNLPAELRRETDAVAIDQLRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGILVVVESSGRSGSAEAPGPFSALRVVDTPTRSIHEISRITDGSQYWPAGWDRVSRLVGACVYG
Ga0209058_13086731 3300026536 Soil RGQGFIGAVSGTGRRVAYWVTTDGATRELRVFDVTAPDQDTSLATVLETERGAAAVWSFDRTGVLAVIESSGRSGTAEPPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVGWDRVSRLVGACVYGADGMAIAWAVVGEDALSARVAMDGGIPAITIRASGKDVLGVLKENVIRVWTLAS
Ga0209157_12951361 3300026537 Soil TSLATVLETERGAAAVWSFDRTGILAVVESSGRSGTAEPPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRVSRLVGACVYGADGMAIAWAVVGEDALSARVAMDGGIPAITIRASGKDVLGVLKENVIRVWTLASYTEHHEFGAVSGERISFARWRPGTDDIVVLVADRLELWPKAGGDRRIVAQGLHAANDL
Ga0209156_105041791 3300026547 Soil ESSGQPGTGEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWAVGWDRVTRLVGACVYGGTDAMGIAWVVVGEDALSARVPMENGIPAVTIRANGNDVLGILNGTVIRVWTLASYNTHREFGAASGERIAFARWRPGADEIVVLVADRLEIWPNVGGDRRIVARGLP
Ga0209846_10597561 3300027277 Groundwater Sand ADRPQQVFGQGGQSEGYIKEDQAATAPATPAGPLLSDSRGFIALPGSDKSAEVRRETDATPISTLRGQGFIGAVSGTGRRVAYWVMAGQATLELRVFDVTAPDQETSLAAVLETERGAAAVWSSDRTGILAVVESSGLAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPAGWDRVSRLVGACVYGS
Ga0209220_11659621 3300027587 Forest Soil EVAPTPTPLPLPTATVPATPRGPLLSDSRGFIALPGDAAAPEIRRETDAAPMSTLRGQGFIGAVSGTGRRVAYWVTADGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGLVAVVESSGRVGTGEAPGPFSALRVVDTPTRSIHEISRVTDGTQYWPAGWDRVTHLVGACVYGGADHMGIA
Ga0208988_11632721 3300027633 Forest Soil TVLDTERGAAAVWSSDRTGIVAVVERAGTGDAAGPFSSLRIVDTPTRSIREIARLTDGSQYWPVGWDRVSHVVGACAYGADATGIAWVVVGEDALSSRVPMEGGVPAISIRASGNDVLGVLNQSVVRVWTLASYATHGEFGAVPGERIAFARWKPGADEIIVLVADRLEIWPKAGG
Ga0209180_106860871 3300027846 Vadose Zone Soil TATIPATPPGPLLSDSRGFIAVPGKAAPAELRHETDAAPISALRGEGFIGAVSGTGRRVAYWVSRFGSSPADGATRELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSAPAGTAETPGSFSALRVVDTSTRSIHEISRLTDGSQYWPVGWDRVTRLVGACVYGGADAMGIAWV
Ga0307311_101825441 3300028716 Soil STLRGKGFIGAVSGTGRRVAYWVSGPSTSDGTARELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGGADTPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRAARLVGACVYGGADANGIAWVVVGEDALGSRVPMEAGVPAISIRASGNDVIGVLNETVVRVWTLASYATHREFGAAPGERI
Ga0307319_101838291 3300028722 Soil TPSALLNTTVPSTLPGPLLSDARGFIALPGSGAPAELRRETSAAPSSALRGQSFIGAVSGTGRRVAYWVTVDGATRELRVFDVSAPDQDTSLATVLATERGAGAVWSSDRTGLVTVVESSARAGAGEAPAPFSALRVVDTPTRGIREISRLTDGSQYWAVGWDRVTRLVGACAYGGSDAMGIAWVVVGEDTLSSRVSMESGIPALTIRASGNDVLGVVNGSVLRVW
Ga0307290_103028251 3300028791 Soil ATASATPSGPLLSDSRGFIALPGKGTAAELRREPDPAPLATLRGQGFIGAVSGTGRRVAYWVTTTEGARELRVFDVTAPDQDTTLATILDTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIREISRVTDGSQYWPVGWDRVARLIGACVYGGADATGIAWVVVGEDALSSRVPMESG
Ga0307299_104172991 3300028793 Soil RAGGADTPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRAARLVGACVYGGADANGIAWVVVGEDALGSRVPMEAGVPAISIRASGNDVIGVLNETVVRVWTLASYATHREFGAAPGERIAFARWKPGADEIVVLVADRLEIWPKTGGDRRILARGLPAASEL
Ga0307305_105321761 3300028807 Soil VTAPDQDTTLATLLDTERGAAAVWSSDRTGIVAVVESGARTRTGEAPAPFSALRVVDTPTRSIHEIARLTDGTQYWPVAWDRITRLVGACVYGGADAMGIAWVVVGEDALSSRTSMEAGIPAITVRASGNDVLGVLNGSVIRVWTIASYSTHREFGAASGERIAFARWKPCSDD
Ga0307310_105423811 3300028824 Soil EGATRELRVFDVTAPDQDTSLLTVPETERGAAAVWSGDRTGVVAVVESSGRAATGAPPSPFSALRVVDTPTRSVHEISRLTDGSQYWPVGWDRITRLVGACAYSGADAMGIAWVVVGEDTLSSRVPMESGIPAMTIRASGNDVLGILNGSVVRVWTLASYNDHREFGAAAGERIAFARWRPGAEEIVVLVADRLEV
Ga0307310_106644681 3300028824 Soil DTERGAAAVWSSDRTGIVAVVESSGRAGTGEAPGPFSALRVVDTPTRSIREISRVTDGSQYWPVGWDRVARLIGACVYGGADATGIAWVVVGEDALSSRVPMESGIPAITIRASGNDVLGVLNGAVLRVWTLASYNEHREFGAAPGERIAFARWKPGADEIVVLVADRLEVWPKAGG
Ga0307310_106691121 3300028824 Soil ATVPPTPSGPLLSDSRGFISLPANTASAELRRETDAVPLSTLRGKGFIGAVSGTGRRVAYWVSGPSTSDGTARELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGGADTPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRAARLVGACVYGGADA
Ga0307278_103257391 3300028878 Soil SDSRGFIALPGNAAPAELRRETDPAPTSTLRGQGFIGAVSGTGRRVAYWVTSEGATRELRVFDVTAPDQDTSLATVLDTERGAAAVWSSDRTGLVAVVESSGRAGTGEAPGPFSALRVLDTPTRSIHEISRLTDGSQYWPVGWDRVTRLVGACAYGGPDAIGIAWVVVGEDALSSRVPMETGIPAVTIRANGNDVLGIVNGSVIRVWTLTSYNTHSEFGAAPGER
Ga0307308_104162841 3300028884 Soil ELRRETDAVPLSTLRGKGFIGAVSGTGRRVAYWVSGPSTSDGTARELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGGADTPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRAARLVGACVYGGADATAIAWVVVGEDALGSRVPMEAGVPAISIRASGNDVIGVLNETVVRVWTLASYATHREFGAAPGERI
Ga0307304_104572301 3300028885 Soil FIGAVSGTGRRVAYWVSGPSTSDGTARELRVFDVTAPDQDTTLATVLDTERGAAAVWSSDRTGIVAVVESSGRAGGADTPGPFSALRVVDTPTRSIHEISRVTDGSQYWPVGWDRAARLVGACVYGGADANGIAWVVVGEDALGSRVPMEAGVPAISIRASGNDVIGVLNETVVRVWTLASYATHREFGAAPG
Ga0307471_1029833551 3300032180 Hardwood Forest Soil PSSSTADAATRELRVFDVTAPDQDTTLATVLETERGAAAVWSSDRTGIVAVVESSGLAGTAEAPGPFSALRVVDTPTRSIHEISRLTDGSQYWPVAWDRVTRLVGACVYGGAAATGIAWVVVGEDALSSRVPMEAGIPAITIRANGNDVLGVLNSSVVRVWTLESYNDHREFGAAAGERIGFARWKPGADEIVVLVADR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice