NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F095068

Metagenome Family F095068

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F095068
Family Type Metagenome
Number of Sequences 105
Average Sequence Length 193 residues
Representative Sequence RVSFQKPAYQVETRELTAAEESDVRVELRRGDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPLSLVGVAVPSRTVTLVLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWSAFTTDGKIRLSGPVRRIENVAPGRYTLEIEGGVRRDVDLREGMPSTVSLP
Number of Associated Samples 83
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score0.73

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil
(23.809 % of family members)
Environment Ontology (ENVO) Unclassified
(42.857 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified
(44.762 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 1.77%    β-sheet: 41.59%    Coil/Unstructured: 56.64%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.73
Powered by PDBe Molstar

Structural matches with PDB biological assemblies

PDB IDStructure NameBiol. AssemblyTM-score
2xicPILUS-PRESENTED ADHESIN, SPY0125 (CPA), P212121 FORM (ESRF DATA)10.63005
2xicPILUS-PRESENTED ADHESIN, SPY0125 (CPA), P212121 FORM (ESRF DATA)20.60937
4bugPILUS-PRESENTED ADHESIN, SPY0125 (CPA), CYS426ALA MUTANT20.60104
2xidPILUS-PRESENTED ADHESIN, SPY0125 (CPA), P212121 FORM (DLS)10.59294


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 105 Family Scaffolds
PF13620CarboxypepD_reg 20.00
PF00795CN_hydrolase 3.81
PF13378MR_MLE_C 3.81
PF01595CNNM 2.86
PF01966HD 1.90
PF13489Methyltransf_23 1.90
PF04235DUF418 1.90
PF13298LigD_N 1.90
PF05951Peptidase_M15_2 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 105 Family Scaffolds
COG2311Uncharacterized membrane protein YeiBFunction unknown [S] 1.90
COG3108Uncharacterized conserved protein YcbK, DUF882 familyFunction unknown [S] 0.95


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil23.81%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment11.43%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil9.52%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment6.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil6.67%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment3.81%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment3.81%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands2.86%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands2.86%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere2.86%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.90%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil1.90%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.90%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.90%
WetlandEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Wetland0.95%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.95%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater0.95%
FreshwaterEnvironmental → Aquatic → Freshwater → Pond → Sediment → Freshwater0.95%
SedimentEnvironmental → Aquatic → Marine → Sediment → Unclassified → Sediment0.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.95%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.95%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland0.95%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.95%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.95%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.95%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300003995Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2EnvironmentalOpen in IMG/M
3300004061Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushMan_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004155Freshwater pond sediment microbial communities from the University of Edinburgh, under environmental carbon perturbations - Low cellulose week 11EnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300009037Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 1-3cm March2015EnvironmentalOpen in IMG/M
3300009053Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009078Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015EnvironmentalOpen in IMG/M
3300009146Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015EnvironmentalOpen in IMG/M
3300009165Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 1-3cm September2015EnvironmentalOpen in IMG/M
3300009179Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Plant_0915_D1EnvironmentalOpen in IMG/M
3300009610Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700EnvironmentalOpen in IMG/M
3300012157Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT760_2EnvironmentalOpen in IMG/M
3300012160Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT630_2EnvironmentalOpen in IMG/M
3300012896Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2EnvironmentalOpen in IMG/M
3300012964Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 4 metaGEnvironmentalOpen in IMG/M
3300014303Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - WestPond_TuleA_D1EnvironmentalOpen in IMG/M
3300014304Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushMan_ThreeSqA_D1EnvironmentalOpen in IMG/M
3300014320Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqA_D1EnvironmentalOpen in IMG/M
3300014868Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT830_16_10DEnvironmentalOpen in IMG/M
3300014874Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT660_2_16_10DEnvironmentalOpen in IMG/M
3300014875Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT660_1_16_10DEnvironmentalOpen in IMG/M
3300014881Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1DaEnvironmentalOpen in IMG/M
3300015258Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_1DaEnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300018055Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_coexEnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300019361Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2)EnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300022213Sediment microbial communities from San Francisco Bay, California, United States - SF_Oct11_sed_USGS_4_1EnvironmentalOpen in IMG/M
3300023072Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S151-409C-6EnvironmentalOpen in IMG/M
3300023102Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S184-509B-5EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025311Groundwater microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_0.2 (SPAdes)EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300025325Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025946Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushMan_ThreeSqC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300027731Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015 (SPAdes)EnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031184Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 13_SEnvironmentalOpen in IMG/M
3300031226Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_SEnvironmentalOpen in IMG/M
3300031640Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23EnvironmentalOpen in IMG/M
3300031873Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G15_0EnvironmentalOpen in IMG/M
3300031892Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D2EnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300032143Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G13_0EnvironmentalOpen in IMG/M
3300032164Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_0EnvironmentalOpen in IMG/M
3300032173Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_topEnvironmentalOpen in IMG/M
3300032177Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G05_0EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032256Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_topEnvironmentalOpen in IMG/M
3300032397Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_0EnvironmentalOpen in IMG/M
3300032401Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G03_0EnvironmentalOpen in IMG/M
3300032516Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_0EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033408Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day20_noCTEnvironmentalOpen in IMG/M
3300033413Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day10_noCTEnvironmentalOpen in IMG/M
3300033416Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_OW2_C1_D5_CEnvironmentalOpen in IMG/M
3300033418Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_T1_C1_D1_AEnvironmentalOpen in IMG/M
3300033419Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day5_noCTEnvironmentalOpen in IMG/M
3300033480Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_BEnvironmentalOpen in IMG/M
3300033481Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day5_CTEnvironmentalOpen in IMG/M
3300033482Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D1_CEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033487Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_May_M1_C1_D6_AEnvironmentalOpen in IMG/M
3300033489Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT95D214EnvironmentalOpen in IMG/M
3300033760Tropical peat soil microbial communities from peatlands in Loreto, Peru - SR_CEnvironmentalOpen in IMG/M
3300034115Sediment microbial communities from East River floodplain, Colorado, United States - 29_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
F24TB_1229179113300000550SoilSFGFEDLEPNRYRVSFSKAAYQVESRELTASEDAADVRVELKRGEGITLEAKDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGHGEVPSLKPGVYDLRAESSGYAPVRLPTIQVPSSTLSLLLTPGGSLEIQAGPATLALPQPTGRLIGPDQRIYMFSAFTTDGKIRLSVPVRRLENVAPGSYTLEVERGVRRDVAITEG
Ga0055438_1001343713300003995Natural And Restored WetlandsRVTFQKPAYQVETRELTAAEDSDLRVELRRGDGIAIEARDGIFATPLRGLFVRVADGSGASVFAGSVSLDSDGRGEVPSLRPGVYELRAESSGYAPVSLAGVAVPSRTLTLLLTPGGSLEIQAGPATLALPQATGRLIGADGRVYMWSAFTPDGAIRLSGPVRRLENVTPGRYSFAVAGGESRDVTIAEGGRAVITLP*
Ga0055487_1001707923300004061Natural And Restored WetlandsDMEPKSYRVTFQKPAYQVETRELQAAEDSDLRIELRRGDGIVIEAKDGIFATPLRGLFLRVADGSGAAVFTGSVSLDSGGRGEGPSLKPGVYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQVGPETLALPQPSGRLIGIDGRVYMWSAFTPDGVIRLGGPVRRVDNVAPSRYTLEIEGGVRRDVTITEGGRALVSLP*
Ga0066600_1025847913300004155FreshwaterEDLEPKRYRVSFQKPAYQVETREITAAEESDLRVELRRGDGMAIEAHDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPVSLPGVSVPSRTVTLVLTPGGSLEVRVGEQTLALPQPTARLLAGDGRVYMWNAFTTDGKIRLNGPVRRFENVAPGRYTLEVEGGVRRDVDIREGMPSTVSLP*
Ga0062592_10199670213300004480SoilYRVSFEKPAYQIESRELTAAEDAQDVRVELKRGEGLTLEARDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYELRAESSGYAPVRLPSIQVPSRTISVQLTPGGSLEIQAGPATLALPQATGRLIGADQRVYMWSSFTSDGAIRLTSPVRRFENVAPGAYTLQVDGGVQREVAIT
Ga0062594_10340682613300005093SoilEPSRYRVSFQKAAYQVETRELTAAEDSDVRVELKRGEGIELEAKDGIFATPLRGLFVRVVDGAGSPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLHLLLTPGGSLEIQAGPATLALPQPTGRLIGPDQRIYMWSAFTSDGKIRLSGP
Ga0075420_10100302913300006853Populus RhizosphereVELKRGEGITLEAKDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGHGEVPSLKPGVYELRAESSGYAPVRLPSVQVPSRTISLLLTPGGSLEIQAGPTTLALPQAAGRLIGPDQRVYMWSSFTSDGKIRLTSPVRRLENVAPGSYSFEVEGGVRRDVAITEGGRAIVSLP*
Ga0075429_10132439413300006880Populus RhizosphereSEDAADVRVELKRGEGITLEAKDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGHGEVPSLKPGVYELRAESSGYAPVRLPSVQVPSRTISLLLTPGGSLEIQAGPTTLALPQAAGRLIGPDQRVYMWSSFTSDGKIRLTSPVRRLENVAPGSYSFEVEGGVRRDVAITEGGRAIVSLP*
Ga0105093_1085047313300009037Freshwater SedimentSFQKPAYQVETRELTAAEESDVRVEMKRGEGIALEARDGIFATPLRGLFVRALDAAGQAAFAGGVSLDSEGRGEVPSLKPGVYELRAESSGYAPARRPGVMVPSSTITLVLTPGGSLEIQAGPQTLARPEASGRLISADGRVYMWNVFTNDGKIRLNGPVRRIENVVPGRYSFEVEG
Ga0105095_1023469813300009053Freshwater SedimentERHRVSFQKPAYQVETRELVAAEESDVRVEMKRGEGIALEARDGIFATPLRGLFVRAVDGAGQTAFAGGLSLDSEGRGEVPSLKPGAYELRAESSGYAPVVRPGVAVPSSTISLVLTPGGSLEIQAGPQTLALAEASGRLLGADGRVYMWNVFTTDGKIRLTNPLRRIENVVPGRYTLEVEGGVRREVTVTEGGRSVVTLP*
Ga0105106_1067041913300009078Freshwater SedimentFQKPAYEAETRQVTAAEETEVRVEMRRGEGIALVARDGLFATPLRGLMARVVDGAGATVFTGSVPLDSDGRGEVPALKPGNYELRAESSGYAPVARPVGVPTSELTLTLTPGGPLEIRVGPQTQALPQPTARLLGADGRVYLPFIFSNDGKIRLNGPVRRLENVVPGRYVLEVEGGVRRDVDVREGVPSSVSLP*
Ga0105106_1076788213300009078Freshwater SedimentTRELVAAEESDVRVEMKRGEGIALEARDGIFATPLRGLFVRAVDGAGQTAFAGGLSLDSEGRGEVPSLKPGAYELRAESSGYAPVVRPGVAVPSSTISLVLTPGGSLEIQAGPQTLALAEASGRLLGADGRVYMWNVFTTDGKIRLTNPLRRIENVVPGRYTLEVEGGVRREVTVTEGGRSVVTRP*
Ga0105091_1001861913300009146Freshwater SedimentRELAAAEESDVRVEMRRGEGIALEARDGIFATPLRGLFVRAVDASGQAVFAAGLALDGEGRGEVPSLKPGVYELRAESSGYAPVHRPGVTVPASTISLVLTPGGSLEIQAGPQTLALTEASGRLVGADGRVYLWNAFTADGKIRLTGPLRRIENVVPGRYAFEVEGGVRREVTITEGGRSVVAVP*
Ga0105102_1012382413300009165Freshwater SedimentSSGRFAFEDLEPGSYRVSFQKPAYQVETRELTAAEESDLRVEMRRGEGIALEARDGIFATPLRGLFVRAVDGSGQVAYAGSVALDSEGRGEVPSLKPGVYEVRAESSGYAPVVLPGVAVPSSRALSLVLTPGGSLEIRVGPQTLALPQPTARLLDADGRVYMWSAFTTDGKIRLGGPVRRLENVVPGRYVLEVEGGTRQEVEVREGMPSTVSLP*
Ga0115028_1073052413300009179WetlandGVTLEARDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEMRAESSGYAPVSLPAVSVPSRTVTLTLTPGGSLEIRAGEQTLALPQPTARLLGADGRVYMWNVFTTDGKIRLNGPVRRFENVAPGRYVLEVEGGARRDVDIREGMPSPVSLP*
Ga0105340_107513413300009610SoilSTDSSGRFAFEDMEPKRYRVTFQKPAYQVETRELTAAEESDVKIELKRGEGLQVEARDGIFATPLRGLMVRVSDAAGASVFQGSLSLDSDGRGEVPSLKPGTYEVRAESSGYAPVSLPGVAVPSRTLSLVLTPGGTLEIQAGPVTLALPQATGRLIGRDGRPYMWNAFTADGKIRLGSPVRRLENVAPGRYSFEVEGGERRDVTIAEGGRAVVSLP*
Ga0137353_102274623300012157SoilGDGLALEARDGIFATPLRGLFVRAIDGSGQSAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPLSLLGVAVPSRTVTLALTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWNAFTTDGKIRLSGPVRRIDNVAPGRYTLEIEGGVRRDVDLREGMPSTVSLP*
Ga0137349_102572623300012160SoilVETRELTAAEESDVRVELRRGDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPLSLVAVAVPSRTVTLVLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWSAFTTDGKIRLSGPVRRIENVAPGRYTLEIEGGVRRDVDLREGMPSTVSLP*
Ga0157303_1014291613300012896SoilRELTAAEDAQDVRVELKRGEGLTLEARDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYELRAESSGYAPVRLPSIQVPSRTISVQLTPGGSLEIQAGPATLALPQATGRLIGADQRVYMWSSFTSDGAIRLTSPVRRFENVAPGAYTLQVDGGVQREVAITEGGRAVVSLP*
Ga0153916_1180524413300012964Freshwater WetlandsVRVEMRRGEGIGLEARDGIFATPLRGLFVRALDGAGQAAFAGSVSLDSEGRGEVPSLKPGVYELRAESSGYAPVVRPGVAVPSSAISLVLTPGGSLDIQAGPQTLALPDASGRLVGADGRVYMWNVFTSDGKIRLTNPLRRIENVVPGRYVFEVEGGVSREVAVTEGGRSVVTLP*
Ga0075358_111124013300014303Natural And Restored WetlandsGRFVFEDLDPKSYRVSFQKPSYQVETRELTAAEESELRVEMRRGEGIALEAHDGIFATPLRGLLVRAIDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPVSLPGVGVPSSRPLTLALTPGGSLEIRVGEQTLALPQPTARLLGADGRVYLWNAFTSDGKIRLSSPVRRLENVAP
Ga0075340_109257913300014304Natural And Restored WetlandsPAYQLETRELQAAEDSDLRVELRRGDGIAIEARAGIFATPLRGLFVRVGDGSGAAVFTGSVSLDSGGRGEVPSLKPGVYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQVGPETLALPQPSGRLIGIDGRVYMWSAFTPDGVIRLGGPVRRVDNVAPSRYTLEIEGGVRRDVTITEGGRALVSLP*
Ga0075342_125742313300014320Natural And Restored WetlandsSDLRIELRRGDGIAIEAKDAVFATPLRGLFVRVADGSGAAVFTGSVSLDSDGRGEVPSLQPGVYEVRAESSGYAPVSLSGVAVPSRTLSLLLTPGGSLEIQSGPTTLALPQPSGRLIGIDGRVYMWSAFTPDGVIRLGGPVRRLDNVAPGRYSFEVEGGVRRDVTVTEG
Ga0180088_107596813300014868SoilAFEDLEPKRYRVSFQKPAYQVETRELTAAEESDVRVELRRGDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPLSLVGVAVPSRTVTLVLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWNAFTTDGKIRLSGPVRRIENVAPGRYTLEIEGGVRRDVDLREG
Ga0180084_100268113300014874SoilRVSFQKPAYQVETRELTAAEESDVRVELRRGDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPLSLVGVAVPSRTVTLVLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWSAFTTDGKIRLSGPVRRIENVAPGRYTLEIEGGVRRDVDLREGMPSTVSLP*
Ga0180083_110413813300014875SoilVETRELSAAEESDVRVELRRGDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPLSLVGVAVPSRTVTLVLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWSAFTTDGKIRLNGPVRRIENVAPGRYTLEIEGGVRRDVDLREGMPSTVSLP*
Ga0180094_112844813300014881SoilAFEDLEPNRYRVSFQKPAYQVETRELQAAEESDVRVELRRGEGVALEARDGIFGTPLRGLFVRVLDGSGKAAFAGGVSLDSEGRGEVPSLKAGVYEVRAESSGYAPARLPTVSVPASTVPLVLTPGGSLEVQVGPQTLALPEPTARLLAADGRVYMWNAFTDDGKIRLSGPLRRFENVVPDRYVLEVEGGVRREVD
Ga0180094_115404013300014881SoilTRELSAAEESDVRVELRRGDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPLSLLGVAVPSRTVTLALTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWNAFTTDGKIRLSGPVRRIDNVAPGRYTLEIEGGVRRDVDLREGM
Ga0180093_115977813300015258SoilDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSCYAPLSLLGVAVPSRTVTLALTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWNAFTTDGKIRLSGPVRRIENVVPGRYTLEIEGGVRRDVDLREGMPSTVSLP*
Ga0180085_110937213300015259SoilRGDGLALEARDGIFATPLRGLFVRAIDGSGQSAFAGSVSLDSDGRGEVPSLKSGTYELRAESSGYAPLSLVAVAVPSRTVTLVLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWNAFTTDGKIRLSGPVRRIENVAPGRYTLEIEGGVRRDVDLREGMPSTVSLP*
Ga0132258_1153172813300015371Arabidopsis RhizosphereNVNTTDGSGRFEFEDMEPKAYRVSFQKAAYQVETRELTAAEDSDVRVELKRGEGIQLEAKDGIFATPLRGLFVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLNLLLTPGGSLEIQAGPATLALPQPTGRLIGGDQRTYMWSAFSSDGKIRLSGVRRLENVAPGSYTLEVEGGVRRDVAITEGGRAVVSLP*
Ga0132256_10107693823300015372Arabidopsis RhizosphereGDVEVRIEEEQTGGGRFMNVATTDSSGRFGFEDLEPKQYRVSFQKQAYQIESRELGASEDAPEVRVELKRGEGLTLEAKDGIFATPLRGLMVRVVDGSGNPAFSGSVALDSDGRGEVPSLKAGSYELRAESSGYAPVRLAGVQVPSRTISLLLTPGGSLEIQAGPTTLALPQAAGRLIGADQRVYMWSSFTSDGKIRLTGPVRRLENVTPGAYTFEVDGGLRRDVAITEGGRAVIALP*
Ga0132256_10248649613300015372Arabidopsis RhizosphereSSGRFEFEDMEPKAYRVSFQKAAYQVETRELTAAEDSDVRVELKRGEGIQLEAKDGIFATPLRGLFVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLNLQLTPGGSLEIQAGPATLALPQPTGRLIGGDQRTYMWSAFSSDGKIRLSGVRRLENVAPGSYTLEVEGGVRRDVAITEGGR
Ga0132257_10261227313300015373Arabidopsis RhizosphereKAAYQVETRELTAAEDSDVRVELKRGEGIQLEAKDGIFATPLRGLFVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLNLLLTPGGSLEIQAGPATLALPQPTGRLIGGDQRTYMWSAFSSDGKIRLSGVRRLENVAPGSYPLEVEGGVRRDVAITEGGRAVVSLP*
Ga0132255_10297736013300015374Arabidopsis RhizosphereKAAYQVETRELTAAEDSDVRVELKRGEGIQLEAKDGIFATPLRGLFVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLNLLLTPGGSLEIQAGPATLALPQPTGRLIGGDQRTYMWSAFSSDGKIRLSGVRRLENVAPGSYTLEVEGGVRRDVAITEGGRAVVSLP*
Ga0187779_1109906613300017959Tropical PeatlandVRVELRRGEGIALEAHDGLFGTPLRGLFVRVLDGSGKAAFAGSVSLDSDGRGEVPALKPGVYAVRAESSGYAPASLPSVSVPSSTVPLVLTPGGSLEVQIGPQTLALPQPTARLLGTDGRVYLWNALTDDGKIGLFGPMRRLENVVPGRYVLAVEGGVSRDVDVREGMPSVAVLP
Ga0184616_1020584213300018055Groundwater SedimentTTDSSGRFVFEDLEPKGYRVSFQKSAYQVETRELTAAEGSELRVEMRRGDGIGLEARDGIYATPLRGLFVRALDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYELRAESSGYAPVSLPGVAVPSRTLTLVLTPGGSLEIRVGPQTLALPQPTGRLLGASGQVYMWSAFTTDGKIRLAGPVRRLENVVPGRYVLEVEGGARQDVDIREGMPSTVSLP
Ga0184616_1041311613300018055Groundwater SedimentEESDVRVELRRGDGLALEVRDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGTYELRAESSGYAPVSLLGVAVPSRTVTLVLTPGGSLEIQVGEQTLALPQPTARLLGADGRVYMWNAFTTDGKIRLSGPVRRIENVAPGRYTLEIEGGVRRDVD
Ga0184615_1007884323300018059Groundwater SedimentTASEDSPEARVELRRAEGIALEARDGIFATPLRGLFVRVVDGTGQAAFSGSVLLDSEGRGEISAVKPGIYEVRAQSSGYAAVSLPGIPVPSRAIVLTLTPGGSLEIQAGPQTLALPKPRARLLGADGRPCVWNVFSSDGVVLLGPAVRPLDNVAPGRYTLAVEGGVTRDVTITEGGRATVTLP
Ga0184615_1045318113300018059Groundwater SedimentTDSSGRFLFEDLEPRRYRLSFQKPAYQVETRELTATEESDVRVELRRGDGLALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKAGTYELRAESSGYAPASLPGVAVPSRTVTLVLTPGGSLEIQVGEQTLALPQPTARLLGADGRVYMWSAFTTDGKIRLNGPVRRIENVAPGRYTLEIEGGVRRDVDLREGMPSTVSLP
Ga0173482_1038676013300019361SoilREEEQGGARFMTMATTDSSGRFAFEDLEPSRYRVSFEKPAYQIESRELTAAEDAPDVRVELKRGEGITLEARDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYELRAESSGYAPVRLPSIQVPSRTISVQLTPGGSLEIQAGPATLALPQATGRLIGADQRVYMWSSFTSDGKIRLTSPVRRLENVAPGAYSFEVDGGV
Ga0210379_1024669313300021081Groundwater SedimentADVKVELKRGEGIELEAKDGIFATPLRGMMVRVVDGAGNPAFSGSVPLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSTTLNLLLTPGGSLEIQAGPATLALPQPTGRLIGPDQRIYMWSAFTTDGKIRLSGPVRRLENVAPGSYTLEVEGGVRRDVAITEGGRAVVSLP
Ga0210380_1009358413300021082Groundwater SedimentGIALEAHDGIFATPLRGLLVRAIDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPVSLPGVAVPSRTLTLTLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWNAFTTDGKIRLSSSVRRFENVAPGRYVLEVEGGVRREVQVTEGMPSTVSLP
Ga0210377_1031751433300021090Groundwater SedimentEMRRGEGIGIEARDGIFATPLRGLFVRVLDSSGNAAFAGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPASLPGVAVPSRTLTLVLTPGGALEIQAGPQTLALPQPTARLLGADGRVYIWSAFTTDGKIRLAGPVRRLENVVPGRYTLEVEGGVRRDVTITEGGRAVVSLP
Ga0210377_1073210413300021090Groundwater SedimentPREYRLSFQKPAYQAETRPVTASEESEVRVELRRGEGIALLARDGLFATPLRGLMVRVLDGTGAAVFTGSVPLDSDGRGEVPALKPGSYELRAESSGYAPVTRPVGVPSSELTLALTPGGSLEIQIGPQTQALPQPTGRLIAADGRVYLPFIFSNDGKIRLGGPVRRLENIVPGRYTFEV
Ga0224500_1037484113300022213SedimentYRVSFQKPAYRVETRDLAAAEESDVRVEMRRGEGIVLEARDGIYAVPLRGLFVRALDATGQAAFAGSVSLDGEGRGEVPSLKPGVYELRAESSGYAPARLAGVAVPSQTIALVLTPGGSLEIRVGPQTLALPQATARLLAADGRVYMWSAFTTDGTLRLASPVRRIENVVP
Ga0247799_104822313300023072SoilRVELKRGEGITLEARDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYELRAESSGYAPVRLPSIQVPSRTISVQLTPGGSLEIQAGPATLALPQAAGRLIGPDQRVYMWSSFTSDGKIRLTSPVRRLENVSPGAYSFEVDGGVRREVAITEGGRAVVNLP
Ga0247754_111819613300023102SoilSGRFAFEDLEPSRYRVSFEKPAYQIESRELTAAEDAPDVRVELKRGEGITLEARDGIFATPLRGLMVRVMDGAGNPAFSGSVSLDSDGHGEVPSLKPGTYELRAESSGYAPVRLPSVQVPSRTISLLLTPGGSLEIQAGPATLALPQAAGRLIGPDQRVYMWSSFTSDGKIRLTSPVRKLENVSPGAYSFEVDGGVRREVAITEGGRAVVNLP
Ga0209109_1002036343300025160SoilIEDDGGGMRFASMASTDSSGRFAFEDMEPKRYRVTFQKPAYQVETKELTAAEESDLRVELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPATLALPQAAGRLTGTDGRPYMWSAFTPDGKIRLNGPVRRLENVAPGRYTFEVEGGERRYLTISEGGRAVVALP
Ga0209108_1001796453300025165SoilDSSGRFAFEDMEPKRYRVTFQKPAYQVETKELTAAEESDLRVELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPATLALPQAAGRLTGTDGRPYMWSAFTPDGKIRLNGPVRRLENVAPGRYTFEVEGGERRDLTISEGGRAVVALP
Ga0209343_1074999813300025311GroundwaterMEPKRYRVTFQKPAYQVETKELTAAEESDLRVELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPATLALPQAAGRLTGTDGRPYMWSAFTPDGKIRLNGPVRRLENVAPGRYTFEV
Ga0209431_1034547723300025313SoilRFAFEDMEPKRYRVTFQKPAYQVETKELTAAEESDLRVELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPATLALPQAAGRLTGTDGRPYMWSAFTPDGKIRLNGPVRRLENVAPGRYTFEVEGGERRDLTISEGGRAVVALP
Ga0209341_1026327523300025325SoilEDGGGMRFASMASTDSSGRFAFEDMEPKRYRVTFQKPAYQVETKELTAAEESDLRVELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPATLALPQAAGRLTGTDGRPYMWSAFTPDGKIRLNGPVRRLENVAPGRYTFEVEGGERRDLTISEGGRAVVALP
Ga0210126_10660113300025946Natural And Restored WetlandsDMEPKSYRVTFQKPAYQVETRELQAAEDSDLRIELRRGDGIVIEAKDGIFATPLRGLFLRVADGSGAAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLSGVAVPSRTLSLLLTPGGSLEIQVGPETLALPQPSGRLIGIDGRVYMWSAFTPDGVIRLGGPVRRVDNVAPSRYTLEIEGGVRRDVTITEGGRALVSLP
Ga0209592_116908823300027731Freshwater SedimentAPLFDRVGRRVLLNGHGARLVALADRLLPELVAAEESDVRVEMKRGEGIALEARDGIFATPLRGLFVRAVDGAGQTAFAGGLSLDSEGRGEVPSLKPGAYELRAESSGYAPVVRPGVAVPSSTISLVLTPGGSLEIQAGPQTLALAEASGRLLGADGRVYMWNVFTTDGKIRLTNPLRRIENVVPGRYTLEVEGGVRREVTVTEGGRSVVTLP
Ga0307281_1033837013300028803SoilQKAAYQVETRELTAAEDSDLRVELKRGEGIELEARDGIFATPLRGLVVRVVDAAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPAATLNLLLTPGGSLEIQAGPATLALPQPTGRLTGPDQRIYMWSAFTTDGKIRLSGPVRRLENVAPGSYTLEVEGGVRRDVAITEGGRA
Ga0302046_1079006813300030620SoilDVGVRIEEDGGGMRFASMASTDSSGRFAFEDMEPKRYRVTFQKPAFQLETRELHAAEESEMRVELKRGEGIAIVAKDGIFATPLRGLLVRVADSSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPVSLPGVAVPSRTLNLLLTPGGSLEVQAGPATLALPQAAGRLIGADGRPYMWSALTPDGKIRLSGPVRRLENVAPGRYTFEVEGGGERRDVTISEGGRAVVALP
Ga0307499_1031964513300031184SoilQVETRELTAAEDSDVRVELKRGEGIQLEAKDGIFATPLRGLFVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLNLLLTPGGSLEIQAGPATLALPQPTGRLIGGDQRTYMWSAFSSDGKIRLSGVRRLENVAPGSYTLEVEGGVR
Ga0307497_1013419023300031226SoilRIEEEQGGARFMNMATTDSSGRFGFEDLEPSRYRVSFQKAAYQVETRELTAAEDSDVRVELKRGEGIQLEAKDGIFATPLRGLFVRVVDGAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLNLLLTPGGSLEIQAGPATLALPQPTGRLIGGDQRTYMWSAFSSDGKIRLSGVRRLENVAPGSYTLEVEGGVRRDVAITEGGRAVVSLP
Ga0318555_1082794713300031640SoilYQVDTRELTAAEESDVRVELRRGDGIALEAHDGIFQTPLRGLFLRVTDGSGAAVFAGSVSLDSDGHGEVPSLKAGVYSLQAESSGYAPVSLPSVTVPSRTLSLLLTPGGSLEIQAGPTTLALPQPTGRLLGPDGRPYMWSAFTSDGVLRLNGPVRRLENVAPGAYT
Ga0315297_1159186013300031873SedimentSFQKPAYQVETRELTAAEESDLRVEMRRGEGIALEARDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPISLPGVAVPSRALTLVLTPGGSLEIRVGEQTLALAQPTARLLGADGRVYLWNAFTTDGKIRLGSPVRRFENVAPGRYTLEV
Ga0310893_1026873913300031892SoilFAFEDLEPSRYRVSFEKSAYQIESRELTAAEDAPDVRVELKRGEGITLEARDGIFATPLRGLMVRVVDGAGNPAFSGSVSLDSDGHGEVPSLKPGTYELRAESSGYAPVRLPSVQVPSRTISLLLTPGGSLEIQAGPATLALPQAAGRLIGPDQRVYMWSSFTSDGKIRLTSPVRRLENVSPGAYSFEVDGGVRREVAITEGGRAVVNLP
Ga0310900_1088329713300031908SoilDVRVELKRGEGIELEAKDGIFATPLRGLFVRVVDGAGYPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSATLHLLLTPGGSLEIQAGPATLALPQPTGRLIGPDQRIYMWSAFTSDGKIRLSGPVRRLENVAPGSYTLEVEGGVRRDVAIMEGGRAVVSLP
Ga0214473_1044937913300031949SoilTRPVTASEESEVRVELRRGEGIALLARDGLFATPLRGLMVRVLDGAGAAVFTGSVPLDSDGRGEVPALKPGSYELRAESSGYAPVTRPVGVPSSELTLVLTPGGSLEIQIGPQTQALPQPTGRLIAADGRVYLPFIFSNDGKIRLGGPVRRLENVVPGRYTFEVEGGVRRDVDIREGMPTAMSLP
Ga0326597_1192131613300031965SoilRFTFEDMEPKRYRVTFQKPAYQVETKELTAAEESDLRVELRRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFSGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVSVPSRTLNLLLTPGGSLEIQAGPATLALPQAAGHLIGAGGRPYMWSAFTPDGKIRLSAPVRRLENVAPG
Ga0315292_1065879713300032143SedimentRFAFEDLEPKRYRVSFQKPAYQVETRELTAAEESDLRVEMRRGEGIALEARDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPISLPGVAVPSRALTLVLTPGGSLEIRVGEQTLALAQPTARLLGADGRVYLWNAFTSDGKIRLGSPVRRFENVAPGRYTLEVEGGVRRDVDIREGMPSTVLLP
Ga0315292_1080271313300032143SedimentLEPKRYRVSFQKPAYQVETRELAAAEESDLRVELRRGDGLAIEAHDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEVRVGEQTLALPQPTARLLGADGRVYMWNVFTTDGKIRLNTPVRRFENVAPGRYTLEVEGGVRRDVDIREGMPSTVALP
Ga0315283_1168490813300032164SedimentESDLRVEMRRGEGMAIEAHDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEIRVGEQTLALPQSTARLLGADGRVYMWNVFTTDGKIRLNGPVRRFDNVAPGRYTLEVEGGVRRDVDIREGMPSTAALP
Ga0315268_1220263913300032173SedimentMATTDSSGRFAFEDLEPKRYRVSFQKPAYQVETRELTAAEESDLRVEMRRGDGLAIEAHDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEIRVGEQTLALPQSTARLLGADGRVYMWNVFTTDGKIRLNGPVRRFDNV
Ga0315276_1110164023300032177SedimentSSGRFAFEDLEPKRYRVSFQKPAYQVETRELAAAEESDLRVELRRGDGLAIEAHDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEIRVGEQTLALAQPTARLLGADGRVYLWNAFTSDGKIRLGSPVRRFENVAPGRYTLEVDGGVRRDVDIREGMPSTVSLP
Ga0307472_10196890513300032205Hardwood Forest SoilEGDGARFVNVNTTDGSGRFEFEDMEPKAYRVSFQKSAYQVESRQLQAAEESDVRVELRRGEGIALEAKDGIFATPLRGLFVRVTDASGAAVFAGSVSLDSDGRGEVPSLKPGVYELRAESSGYAPVRLPSVQVPSRTLSLLLTPGGSLEIQSGPATLALPDATGRLIGGDGRVYLWNTFSTDGKVRLNGPVRRL
Ga0315271_1079390613300032256SedimentSDLRVEMRRGEGMAIEAHDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEIRVGEQTLALPQSTARLLGADGRVYMWNVFTTDGKIRLNGPVRRFDNVAPGRYTLEVEGGVRRDVDIREGMPSTAALP
Ga0315287_1115334813300032397SedimentDGLAIEAHDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEIRVGEQTLALPQPTARLLGADGRVYMWNVFTTDGKIRLGSPVRRFDNVAPGRYTLEVEGGVRRDADVREGMPSTVSLP
Ga0315287_1173037023300032397SedimentEMRRGEGMAIEAHDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEIRVGEQTLALPQSTARLLGADGRVYMWNVFTTDGKIRLNGPVRRFDNVAPGRYTLEVEGGVRRDVDIREGMPSTAALP
Ga0315275_1166014713300032401SedimentGGGMRFMNTATTDSSGRFAFVDLEPKRYRVSFQKPAYQVETRELAADEESDFRVEMRRGEGIAIEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEMRAESSGYAPVSLPGVAVPSRTVALVLTPGGSLEIRIGAQTLARPQPTARLLEADGRVYTWNVFTTDGKILLSGPVRRLENVAPGRYVFEVEDAPRQDVDIREGVPTIVLLP
Ga0315273_1044627333300032516SedimentSSGRFAFEDLEPKRYRVSFQKPAYQVETRELAAAEESDLRVELRRGDGLAIEAHDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPVSLPAVSVPSRTVTLVLTPGGSLEVRVGEQTLALPQPTARLLGADGRVYMWNVFTTDGKIRLGSPVRRFENVAPGRYTLEVEGGVRRDVDIREGMPSTVSLP
Ga0315273_1234277513300032516SedimentETRELTAAEESDLRVEMRRGEGIALEARDGIFATPLRGLFVRALDGSGQAAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPISLPGVAVPSRALTLVLTPGGSLEIRVGEQTLALAQPTARLLGADGRVYLWNAFTTDGKIRLGSPVRRFENVAPGRYTLEVDGGVRRDVDIREGMPSTVSLP
Ga0335085_1153925913300032770SoilPKPYRVSFQKAAYQVETRELTAADESDVRVELRRGEGIGLEARDGLFGTPLRGLFVRVLDGTGKAAFAGRVSLDSEGRGEVPALKAGVYEVRAESSGYAPTSLPAVSVPAGTVALLLTPGGSLEIQVGPQTLALSEPAARLLSPDGRVYMWSVFSDDGKIRLGGPLRRLDNVSPGRYVLEVLGGVRRDVEIREGMPSTATLP
Ga0335084_1097074713300033004SoilEESELRIELRRGDGIALEAHDGIFATPLRGLFVRVTDGSGAAAFAGSVSLDSDGHGEVPSLKPGVYSLQAESSGYAPVSLPSVMVPSRTLSLLLTPGGSLEIQAGPTTLALPQPTGRLIGADGRVYLWNAFTSDGAIRLTGPVRRLENVAPGAYTFQVDGGVRRDVAIAEGGRALVTLP
Ga0316605_1097244313300033408SoilEGIALEARDGIFATPLRGLFVRALDAAGQAAFAGSVSLDSEGRGEVPSLKPGAYELRAESSGYAPVSRPGVAVPSPTVTLVLTPGGSLEIRVGPQTLALADPSARLLRDDGRVYMWNAFTADGKIRLVGPVRRLDHVAPGRYVLEVQGGVRQDVDIREGIPSSVSLP
Ga0316605_1119400513300033408SoilFVNMASTDSSGRFAFEDVEPKRYRVTFQKPAYQVETKELTAAEESDLRVELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGAAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPTTLALPQAAGRLIGTDGRPYMWSAFTPDGKIRLSGPVRRLENVAPGRYALEVEGGERRDVTVTEGGRAVVVLP
Ga0316603_1164070013300033413SoilEEEDAGRRFASMASSDSSGRFAFEDLEPKPYRVSFQKPAYQVETRELAAAEESDVRVEMRRGEGIALEARDGLFGTPLRGLFVRAADGSGQTAFSGSVALDSEGRGEVPSLKAGVYEVRAESSGYAPVTLPGVAVPSRTVTLTLTPGGALEIRVGPLTLALPQPAARLVGADGRVYMWNAFTPDGKIRLGSPVRTLENVAPG
Ga0316622_10162428623300033416SoilAFEDLEPRRYRVSFQKPAYQVETRELTAADESDVRVEMRRGEGIALEARDGIFATPLRGLFVRALDGAGHAAFTGGVSLDSEGRGEVPSLKAGAYELRAESSGYAPVIRPGVSVPSSTIALVLTPGGSLEIQAGPQTLALPEASGRLIGVDGRVYLWNVFTNDGKIRLTSPVRRLENVVPGRYTFEVEGGVRREVDIREGLPSVVALP
Ga0316622_10279418713300033416SoilMRFVNMATSDSAGRFAFEDLEPRRYRISFQKPAYQVETRELVAAEESEVRVEMRRGEGIGLEARDGIFATPLRGLFVRAIDGAGQAAFAGSVSLDSEGRGEVPSLKPGVYELRAESSGYAPARRPGVMVPSSAITLVLTPGGSLEIQVGPQTLALAEPAARLVGADGLVYMWSAFTSDGKIR
Ga0316622_10298608513300033416SoilDLEPMSYRVSFQKPAYQVETREIAAAEESDVRVEMRRGEGIALEARDGIFATPLRGLFVRAVDQSGQSAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPARLPGVAVPSPTVRLVLTPGGSLEIRVGPHTLALAEPTARLFGADGRVYMWNAFTTDGKIRLNGPVRRLENVVPGR
Ga0316625_10032073123300033418SoilDLEPKRYRVSFQKAAYQVESRELAASEESDLRVELRRGEGLSLEARDGMFATPLRGLFVRVLDASGNTAFSGSVTLDSDGRGEVPALKAGSYELRAESSGYAVAVVRGVAVPSRAVSLLLTPGGALEIQAGPQTLALPQPEAVLAGADGLPCIWNPFTTDGRIRLSGPARTLENVPPGRYTLRVGGASREVTITEGGRATVALP
Ga0316601_10021602923300033419SoilFAFEDLEARRYRVSFQKPAYQVETRELTAAEESDVRVEMRRGEGIGIEARDGIFATPLRGLFVRALDGAGQAAFAGGVSLDSEGRGEVPSLKPGVYELRAESSGYAPARLPAVSVPASTVPLVLTPGGSLEVQVGPQTLALPQPTARLLAADGRAYMWSVFTDDGKIRLSGPMRRIENVAPGRYVLEVEGGVRRELDIREGQPSVVALP
Ga0316601_10225797113300033419SoilDMEPKRYRVTFQKPAFQVETRELQAAEESDLRIELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPATLALPQAAGRLIGADGRPYMWSAFTPDGKIRLSGPLRRLENVAPGRYTF
Ga0316620_1004415613300033480SoilADVRVEMKRGEGIGLEARDGIFATPLRGLFVRAVDGAGQTAFAGGLSLDSEGRGEVPSLKSGAYELRAESSGYAPVARPGVTVPSSTILLMLTPGGSLDIQAGPQTLALADATGRLVGEDGRVYMWNVFTNDGKIRLSSPLRRIDNVVPGRYVFQVEGGVTREVAVTEGGRSVVTLP
Ga0316620_1047291313300033480SoilDLEPMSYRVSFQKPAYQVETREIAAAEESDVRVEMRRGEGIALEARDGIFATPLRGLFVRAVDQSGQSAFAGSVSLDSDGRGEVPSLKPGVYEVRAESSGYAPARLPGVAVPSPTVRLVLTPGGSLEIRVGPQTLALPEPTARLFGADGRVYMWNAFTTDGKIRLNGPVRRLENVVPGRYTLDVEGGSPQDVEIREGMPTTVSLP
Ga0316620_1072211513300033480SoilGRFALEDLEAKSYPVSFQKPAYQVENRDLAAAEDSDVRVEMRRGEGIGLEAHDGIFATPLRGLFVRAVDGAGQTAFSGGLSLDSDGRGEIPSLKPGAYELRAESSGYAPVVRPGVTVPASTISLTLTPGGALNIQAGPQTLALPDASGRLVGADGRVYMWNVFTTDGRIRLTSPLRRIENVVPGSYVFQVEGGASREVTVTEGGRAVVTLP
Ga0316620_1114913813300033480SoilLRVELRRGEGIALEARDGIFATPLRGLFVRVVDASGAAAFAGSVSLDSAGRGEVPAVRPGVYELRAESSGYAPIALPGVAVPSPSITLVLTPGGALQIRAGTQTLALPKPEGRLLTADGRVYMWNAFTSDGVIRLDNAIRRLDNVAPGRYTLQVTGGVRREVTISEGGQAAVELP
Ga0316620_1197602613300033480SoilDGDGGMRFVNVATTDSSGRFAFEDIEPKRYRVSFQKPAYQVETRELTAAEESDVRVEMRRGEGIALEARDGIFATPLRGLFVRALDGRGQDAFTGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPTSLPGVAVPSPTVTLVLTPGGSLEIRVGAQTLALPQSSARLLRPDGRVYMWNAFTSDGKIRLASPV
Ga0316620_1207898013300033480SoilGMRLVNMATSDSSGRFAFEDLEPKRYRVSFQKPAYQVETRELAAAEEIDVRVEMRRGEGIGLEARDGIFATPLRGLVVRALDGAGQAAFAGSVSLDSEGRGEVPSLKPGVYELRAESSGYAPARLPAVSVPASTVPLVLTPGGSLEVQVGPQTLALPQPTARLLAADGRAYMWSVFTDDGKIRLGGPM
Ga0316600_1081637313300033481SoilFEDLEPRRYRVSFQKPAYQVETRELTAAEEGDLRVEMRRGEGIGLEARDGIFATPLRGLFLRVLDGSGQAAFAGSVSLDSDGRGEVPSLKPGIYEVRAESSGYAPVSLPGVAVPSRTVTLVLTPGGSLEIRVGPQTLALAEPTARLLGAEGRVYAWNVFTTDGKIRLGAPVRRLENVAPGRYVFVVEHGARQDVEIREGMPSTVSLP
Ga0316627_10129593713300033482SoilNMATTDSSGRFAFEDLEPRRYRVSFQKPAYQVETRELTAAEEGDLRVEMRRGEGIGLEARDGIFATPLRGLFLRVLDGSGQAAFAGSVSLDSDGRGEVPSLKPGIYEVRAESSGYAPVSLPGVAVPSRTVTLVLTPGGSLEIRVGPQTLALAEPTARLLGAEGRVYAWNVFTTDGKIRLGGPVRRLENVAPGRYVFVVEHGARQDVEIREGMPSTVSLP
Ga0316627_10162944623300033482SoilVEMRRGEGIALEARDGIFATPLRGLFVRALDGAGQAAFAGSVSLDSEGRGEVPSLKPGAYELRAESSGYAPARRPGVTVPSSMISLVLTPGGSLEIQAGPQTLALPEASARLLGVDGQVYMWSAFTTDGKIRLAGPVRRIENVAPGRYTLEVEGGVRREVDVREGVPSVVALP
Ga0316627_10200613113300033482SoilYQVSFQKPAYQVETRELGAAEESDLRIEMRRGEGIALEARDGIFATPLRGLFVRAIDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPASLPAVAVPSRTVTLVLTPGGSLEIRVGAQTLALPQPTARLLGADGRVYMWSVFTTDGKIHLSGPVRRLENVAPGRYVLEVEGGVRRDVQVSEGMPSTVTLP
Ga0316624_1012193013300033486SoilGRFAFEDLEARSYQVSFQKPAYQVENRELAAAEDSDVRVEMRRGEGVGLEAHDGIFATPLRGLFVRAVDGAGQTAFAGGLSLDGDGRGEIPSLEPGVYELRAESSGYAPVIRPGVSVPASTISLALTPGGSLDIQAGPQTLALPNASGRLVGADGRVYMWNVFTADGKIRLTNPLRRIDNVVPGAYVFQVEGGVSREATVTEGGRAVVTLP
Ga0316624_1043933123300033486SoilVETRELSAGEDSDARVELRRGEGLSLEARDGIFATPLRGLFVRVVDASGAAAFAGSVSLDSAGRGEVPAVRPGVYELRAESSGYAPIALPGVAVPSPSITLVLTPGGALQIRAGTQTLALPKPEGRLLTADGRVYMWNAFTSDGVIRLDNAIRRLDNVAPGRYTLQVTGGVRREVTISEGGQAAVELP
Ga0316624_1081294423300033486SoilSDVRVEMRRGEGIALEARDGIFATPLRGLFVRALDGRGQDAFTGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPTSLPGVAVPSPTVTLVLTPGGSLEIRVGAQTLALPQSSARLLRPDGRVYMWNAFTSDGKIRLASPVRRIENVVPGRYTLEVEGGSRQDVEIREGMPSTVSLP
Ga0316630_1106338813300033487SoilSGRFAFEDLEPKRYRASFQKPAYQVETRELVAAEESDLRVEMRRGEGIALEARDGIFATPLRGLFVRVLDGSGQAAFAGSVSLDSEGRGEVPSLKPGVYEVRAESSGYAPASLPAVAVPSRTVTLVLTPGGSLEIRVGAQTLALPQPTARLLGADGRVYMWSVFTTDGKIHLSGPVRRLENVAPGRYVLEVEGGVRRDVQVSEGMPSTVTLP
Ga0299912_1021643413300033489SoilMRFVNMASTDSTGRFAFEDVEPKRYRVTFQKPAYQVETKELTAAEESDLRVELKRGEGIAIEAKDGIFATPLRGLFVRVADGSGTAVFTGSVSLDSDGRGEVPSLRPGTYEVRAESSGYAPISLPGVAVPSRTLSLLLTPGGSLEIQAGPATLALPQAAGRLTGTDGRPYMWSAFTPDGKIRLNGPVRRLENVAPGRYTFEVEGGERRDLTISEGGRAVVALP
Ga0314870_024521_1_5433300033760PeatlandVENRELAAAEDADVRVEMRRGEGIGLEAHDGIFATPLRGLFVRAVDASGQTAFSGGLSLDGEGHGEVPSLKPGAYELRAESSGYAPVVRPGVTVPAPTISLLLTPGGSLDIQAGPQTLALPDASGRLVGADGRVYMWSIFTTDGKIRLASALRRIENVVPGAYVFQVEGGASREVTVTEG
Ga0364945_0158613_59_6523300034115SedimentVSFQKAAYQVETRELTAAEDSDLRVELKRGEGIELEARDGIFATPLRGLVVRVVDAAGNPAFSGSVSLDSDGRGEVPSLKPGVYDLRAESSGYAPVRLPSIQVPSSTLNLLLTPGGSLEIQAGPTTLALPQPTGRLIGPDQRIYMWSAFTTDGKIRLSGPVRRLENVAPGSYTLEVEGGVRRDVAITEGGRAVVGLP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.