NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F086948

Metagenome Family F086948

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F086948
Family Type Metagenome
Number of Sequences 110
Average Sequence Length 113 residues
Representative Sequence RIVEGRVAHLPYHFSVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK
Number of Associated Samples 93
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 33.33 %
% of genes near scaffold ends (potentially truncated) 2.73 %
% of genes from short scaffolds (< 2000 bps) 3.64 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (94.545 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(30.000 % of family members)
Environment Ontology (ENVO) Unclassified
(33.636 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(60.909 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 12.69%    β-sheet: 20.15%    Coil/Unstructured: 67.16%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.47
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 110 Family Scaffolds
PF00892EamA 22.73
PF13676TIR_2 0.91
PF13539Peptidase_M15_4 0.91
PF01764Lipase_3 0.91
PF01370Epimerase 0.91
PF01039Carboxyl_trans 0.91
PF13476AAA_23 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 110 Family Scaffolds
COG0777Acetyl-CoA carboxylase beta subunitLipid transport and metabolism [I] 0.91
COG0825Acetyl-CoA carboxylase alpha subunitLipid transport and metabolism [I] 0.91
COG4799Acetyl-CoA carboxylase, carboxyltransferase componentLipid transport and metabolism [I] 0.91


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A94.55 %
All OrganismsrootAll Organisms5.45 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005179|Ga0066684_10888152All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium583Open in IMG/M
3300005543|Ga0070672_101981020All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium524Open in IMG/M
3300006791|Ga0066653_10747664All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium510Open in IMG/M
3300012350|Ga0137372_10017951All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia6674Open in IMG/M
3300026538|Ga0209056_10002636All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium19191Open in IMG/M
3300031720|Ga0307469_10651161All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium949Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil30.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.45%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil9.09%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil6.36%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.55%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.55%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil3.64%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.64%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.64%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.73%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere2.73%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.82%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil1.82%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.82%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.82%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.82%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands0.91%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.91%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.91%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.91%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.91%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000837Forest soil microbial communities from Amazon forest - Pasture72 2010 replicate I A100EnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300004022Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005543Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaGHost-AssociatedOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300019361Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2)EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019877Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m1EnvironmentalOpen in IMG/M
3300019887Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021411Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3c2EnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025919Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025932Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025940Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300027876Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031724Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10126330923300000364SoilMSKDLATRIVKGRVADLPYRFSIKISRGNDSEPGTLEVNVLNSSAKSVAGFPQVMQNPLSRTGDTSRREFEIPISRALKKNIERRLLVHDQFVTHVDLIVGMDEDFLSGDFPN*
INPhiseqgaiiFebDRAFT_10145251823300000364SoilPMSHVAFSDGVSERIVKGRVAHLPYEFLVKITRRSDSETGTLEVNVLDSLGKPLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK*
F24TB_1139760023300000550SoilVAHLPYRFSIKISRGNDSEPGTLEVNVLTSSGKSVAGFPQVIQNPLSRTSDTSRREFEIPISRALKKNIEKRLLVHEQFVTHVDLIVGKDEDFLSGDFPK*
F14TC_10117315633300000559SoilVNVLTSSGKSVAGFPQVIQNPLSRTSDTSRREFEIPISRALKKNIEKRLLVHEQFVTHVDLIVGKDEDFLSGDFPK*
F14TC_10307487123300000559SoilLPQSRVTISDGVAERIVEGRVAHLPYQFSVKISQRSDSETGTLEVNVLDSSGKHLTGFPQAMPNPLTKTGDSSRKEFEIQIDQALKKKIETTLLAKEQFLTHVDLIIGMDDDXXXXXXXK
F14TC_10570403723300000559SoilGEKSVSAPSFEREGLPMSRIAILDGVAERTVEGRVRHLPFQFLVKISQRDGTKAGTLEVNVLDTSGKPLTGFPQVMPNPLTNTVDSIRKDFELPVGSALKKKIEKSVLAKNQFLTYVDLIVGMDDDFLSAHFPK*
JGI1027J11758_1283267913300000789SoilISQRDGTKAGTLEVNVLDTSGKPLTGFPQVMPNPLTNTVDSSRKDFELPVGSALKKKIEKSVLAKNQFLTYVDLIIGMDNDFLSAHFPK*
AP72_2010_repI_A100DRAFT_106510223300000837Forest SoilREGLPKLRVADLNGVPEQIVEGRVGHLPYKFLVKISRANGRSEAETLEVNVLDSSGRPLPGFPQVMPNPLTKGGDISRKEFEMPIDQPLKEKIEKSFLAKEQFITHVDLVVGMDDDFLSQDFPK*
JGI1027J12803_10464383013300000955SoilMSRIAILDGVAERIVEGRVRHLPFQFLVKISQRDGTKAGTLEVNVLDTSGKPLTGFPQVMPNPLTNTVDSSRKDFELPVGSALKKKIEKSVLAKNQFL
JGI1027J12803_10954846113300000955SoilLLMSKDLATRIVKGRVADLPYRFSIKISRGNDSEPGTLEVNVLNSSAKSVAGFPQVMQNPLSRTGDTSRREFEIPISRALKKNIERRLLVHDQFVTHVDLIVGMDEDFLSGDFPN*
JGI10216J12902_10953903923300000956SoilEREGLPISRVEISQGIAKRVLVGRVAHLPYLFLVKVRPEIENEAGILEVNVLDSSRRPLIGFPKSMPNPLTKAGESSRKAFELPISKTLKREIRETLLLPEQFLTHVDLIIGMDEDFLSQDFPK*
Ga0055432_1016814813300004022Natural And Restored WetlandsISQRVDSETGTLEVNILDSSGKSLTGFPQVMPDPLTKTGDSSRTEFELPVGSALKKKIEKSLLAKNQFITYVDLIVGMDDDFLSAYFPK*
Ga0066674_1001197653300005166SoilLPTSHVAISDGVSERIVEGRVAHLPYHFSVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK
Ga0066688_1025053413300005178SoilITRRSDSETGTLEVNVLDSSGKPLVGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK*
Ga0066684_1061183313300005179SoilETGALSVNVLDSSGKPLSGFPQVMQNPLSRTGDTSRKEFEVPISRALKKKIETRVLVQDQFITHVDLIVGMDEDFRSGNFPK*
Ga0066684_1088815213300005179SoilVRYHFREHFRHAKPGEKTVSAPSFKREGLPKLRVADLNGVPERMVEGRVGHLPYQFLVKISRANAGSEAETLEVNVLASSGRPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEKIEKSLLAQEQFITHVDLIVGMDVDFLSQDFHK*
Ga0066685_1099800113300005180SoilLDSSGKPLSGFPQVMQNPLSRTGDTSRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFRSGNFPK*
Ga0066676_1007473443300005186SoilISRSSDSKAGTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK*
Ga0066388_10311280623300005332Tropical Forest SoilVLDSSGRPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEKIERSFLAKEQFITHVDLIVGMDDDFLSQDFPE*
Ga0070660_10055684813300005339Corn RhizosphereENGTLEVNVLNSSGKPAAGFPQVMANPLTKAGDSSRKEFDLPIGKSRKQKIKKALLTQDEFLTHVDLIVGMDDDFLSADFPK*
Ga0066682_1015533043300005450SoilMPSRGRKVSAPSFEREGLPASHVAISYGLATRIVEGRVAHLPYQFSVKITRGSDSETATLEVNVLDSSGKPVMGFPQIMRNPLTRTGDTSRKEFEIPIDQALKKKIERPLLAKEQFLTHVDLIIGMDDDFLSADFPK*
Ga0066681_1054136113300005451SoilTRGSDSETATLEVNVLDSSGKPVMGFPQIMRNPLTRTGDTSRKEFEIPIDQALKKKIERPLLAKEQFLTHVDLIIGMDDDFLSADFPK*
Ga0066697_1006098613300005540SoilSVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK*
Ga0070672_10198102013300005543Miscanthus RhizosphereCEVRYYFREHFRHAKPGEKSVSASAFEREALPLSRVTISNGVPERTVEGRVAHLPYQFLVKIRGGTDSENGTLEVNVLNSSGKPAAGFPQVMPNPLTKAGDSSRKEFDLPIGKSRKQKIKKALLTQDEFLTHVDLIVGMDDDFLSADFPK*
Ga0066695_1049805223300005553SoilSVNVLDSSGKPLSGFPQVMQNPLSRTGDTSRKEFEVPINRALERKIEKRVLVQDQFITHVDLIVGMDEDFLSGDFPK*
Ga0066707_1014550123300005556SoilVEGRVAHLPYQFLVKIRRGSDSETGTPEVNVLDNLGKPLAGFPQVMPNPLTKTGDPSRKEFELPVEKDLNKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFP
Ga0066704_1029027123300005557SoilRIVKGRVAHLPYEFLVKITRRSDSETGTLEVNVLDSSGKPLVGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK*
Ga0066699_1033214313300005561SoilDSETGTLEVNVLDSSGKPLVGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK*
Ga0066706_1010234523300005598SoilVEGRVAHLPYQFLVKIRRGSDSETGTPEVNVLDNLGKPLAGFPQVMPNPLTKTGDPSRKEFELPVEKDLNKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK*
Ga0066903_10757678513300005764Tropical Forest SoilEVNVLDSSGKPLAGFPQVMPNPLKKSGDISRKEFEMPIDQALKEQIEKSLLAEEQFITHVDLIVGMDDDFLSQDFP*
Ga0066651_1056101913300006031SoilLISRGLATRIVKGRVAHLPIQFSVKINRGNDSETGTLSVNVLDSSGKPLSGFPQVMQNPLSRTGDTSRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFRSGNFPK*
Ga0066656_1025344213300006034SoilRIVEGRVAHLPYHFSVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK*
Ga0066653_1074766413300006791SoilMSRLIRLRLLRARFVITFVSIFAMPSRGRKVSAPSFEREGLPASHVAISYGLATRIVEGRVAHLPYQFSVKITRGSDSETATLEVNVLDSSGKPVMGFPQIMRNPLTRTGDTSRKEFEIPIDQALKKKIERPLLAKEQFLTHVDLIIGMDDDFLSADFPK*
Ga0066665_1013784013300006796SoilVRYHFREQFNNAKPGQKRFSAPSYDREPLPMSRMAVSDGLASRIVEGSVAQLPYQFSVNISRSSDSKAGTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK*
Ga0066710_10238231723300009012Grasslands SoilMSHVALLDGVSERIVKGRVAHLPYEFLVKITRRSDSETGTLEVNVLDSLGKPLAGFPQVMPNPLTKTGDSSRKEFELPVEQDLKKKIRETRLAQDQFLTHVDLIVGMDDDFLSADFPK
Ga0066710_10339551713300009012Grasslands SoilVKGRVAHLPYEFLVKITRRSDSETGTLEVNVLDSSGKPLVGFPQVMPNPLTNTGDSSRKEFELPVEKDLKKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK
Ga0075418_1061685223300009100Populus RhizosphereISRANGGSEAETLEVNVLDSSGKPLAGFPQVMPNPLKKGGDISRKEFEMPIDQALKEQIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPE*
Ga0066709_10111652923300009137Grasslands SoilLPISRITISNGLETRIVEGRVAHLPYQFLVKIRRGSDSETGTPEVNVLDNLGKPLAGFPQVMPNPLTKTGDPSRKEFELPVEKDLNKKIRETLLTQEQ
Ga0066709_10129929523300009137Grasslands SoilSDGLASRIVEGSVAQLPYQFSVNISRSSDSKAGTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK*
Ga0066709_10313644413300009137Grasslands SoilGVSERIVEGSVEHLPYHFSVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK*
Ga0066709_10452312613300009137Grasslands SoilVEGRVAHLPYHFLVKISRGADSENGTLEVKVLDSSGKPVAGFPQVMPNRLTKTGDSSRKEFELPVGSSRKQTIKKALLTPDQFLTHVDLIVGMDEDFLSADFPK*
Ga0075423_1292127013300009162Populus RhizosphereHFREHFRHAKPGEKTVSAPSFQREGLPKLRVADLNGVPKRIVEGRVGHLPYQFSVKISRANGGSEAETLEVNVLDSSGKPLAGFPQVMPNLFTKGGDISRKEFEMPIDQALKEKIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPE*
Ga0126380_1226874113300010043Tropical Forest SoilHFREHFLHAKPGEKTVSAPSFKREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKISRANGGSEAEKLEVNILDSSGRPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEKIEKSLLAKEQFITHVELIVGIDDDFLSQDFPK*
Ga0126373_1132820113300010048Tropical Forest SoilPGEKTVSAPSFKREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKISRANGGSEAEKLEVNILDSSGRPLAGFPQVMPNPLTKGGDISRKEFEMPIDQTLKEKLEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK*
Ga0126370_1168967413300010358Tropical Forest SoilSFKREGLPKLRVADLNGVPERIVKGRVGHLPYQFLVKISRANGGSEAETLEVKVLDSSGRPLAGFPQVMPNPLTKGGDISRKEFEIPIDQALKEKIEKSFLAKEQFITHVDLIVGIDDDFLSQDFPK*
Ga0126376_1253098013300010359Tropical Forest SoilHFREHFRHAKPGEKTVSAPSFKREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKISRTNGGSEGETLEVNVLDSSGRPLAGFPQVMPNPLTKGADNSRKEFEIPIDQALKEKIEKSLLAKEQFITHVDLIVGMDDDFLSQAFPK*
Ga0126372_1305148113300010360Tropical Forest SoilKPGEKTVSSPAFKREGLPELHVADLNGVPERIVEGRVGHLPYQFLVKINRANGGSEDETLEVNVLDSSGRPLAGFPQVMPNPLTKGGGISRKEFEMPIDQALKEKIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK*
Ga0126381_10132773813300010376Tropical Forest SoilIVEGRVGHLPYQFLVKISRANGGSETETLEVNVLDSSGKPLAGFPQVMPNPLAKGGDISRKEFEIPIDQALKEKIEKSFLAKEQFITHVDLIVGIDDDFLSQDFPK*
Ga0126383_1247306423300010398Tropical Forest SoilGRVGHLPYQFLVKISRANGGSEAETLEVNVLDSSGRALAGFPQVMPNPLTKGGDISRKEFEMPIDQPLKEKIEKSFLAQEQFITHVDLVVGMDDDFLSQDFPK*
Ga0126383_1271790723300010398Tropical Forest SoilKPGEKTVSAPSFKREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKISRANGGSEAETLEVNVLDSSGRPLAGFPQVMPNPLTKSGDISRKEFEMPIDRPLKEKIEKSFLAKEQFITHVDLIVGMDDDFLSQDFPK*
Ga0137364_1092780423300012198Vadose Zone SoilRVAHLPYQFLVRVSRSDKSQAEDLEVNVLDSSGKPLAGFPQVMPNPLTKTGDSSRKECEIPVSEAVKKNIEKTLLEKDQFLTHVDLIVGMDDDFLSAHFSNVR*
Ga0137364_1106620513300012198Vadose Zone SoilMPNRGEKSVSTPSFEREALPTSRLLISRDLATRIVKGRVAHLPYQFSVKINRGNDTETGTLSVNVLDSSGKPLSGFPQVVQNPLSKTGDTSRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMNEDFLSGDFPK*
Ga0137364_1110983723300012198Vadose Zone SoilRVAHLPYQFLVRVSRSDKSQAEDLEVNVLDSSGKPLAGFPRVMPNPLTKTEDSSRKEFEIPVSEVVKKNIEKTLLEKDQFLTHVDLIVGMDDDFLSAHFSNVR*
Ga0137383_1093855513300012199Vadose Zone SoilLRNAKPGEKKFSAPSYEREALPMSRVAVSNGLATRIVEGRVAHLPYQFLVKIGRNSEGESGTLEVNVLDSSGKSLAGFPQTMPNPLTKTGDTSRREFEIPIDETLKKNIEKALLAKDQFLTHA
Ga0137365_1004422613300012201Vadose Zone SoilDAKPGQKNVSAPSYEREALPMSRIAISDGSAARIVEGRVTHLPYQFLVRISRSDKSQAEDLEVNVLDSSGKPLAGFPQVMPNPLTKTGDSSRKEFEIPVGEAVKKNIEKTLLEKDQFLTHVDLIVGMDDDFLSAHFSNVR*
Ga0137399_1102039613300012203Vadose Zone SoilAISDGSAARIVQGRVAHLPYQFLVRISRSDKSRAEDLEVNVLDNSGKPLAGFPQVMPNPLTKTGDSSRKEFEIPVSEAVKKNIEKTLLEKDQFLTHVDLIVGMDDDFLSAHFSNVR*
Ga0137376_1131025413300012208Vadose Zone SoilGTLSVNVLDSSGKPLSGFPQVMQNPLSRTGDTSRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGNFPK*
Ga0137377_1073476313300012211Vadose Zone SoilLRNAKPGEKKFSAPSYEREALPMSRVAVSNGLATRIVEGRVAHLPYQFLVKIGRNSEGESGTLEVNVLDSSGKSLAGFPQTMPNPLTKTGDTSRREFEIPIDETLKKNIEKALLAKDQFLTHADLIIGIDDDFLSADFPK*
Ga0137370_1005866423300012285Vadose Zone SoilVNVLDSSGKPLSGFPQVVQNPLSKTGDTSRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGNFPK*
Ga0137372_1001795163300012350Vadose Zone SoilLREAKPGEKKFSAPSYEREALPMSRVAVSNGLATRIVEGRVAHLPYQFLVKIGRNSESESATLEVNVLDSSGKSLTGFPQTMPNPLTKTGDTSRREFEIPIDDTLKKNIEKALLAKDQFLTHADLIIGMDDDFLSADFPK*
Ga0137371_1012448333300012356Vadose Zone SoilFSAPSYDREPLPMSRMAVSDSLASRIVEGSVAQLPYQFSVSISRSSDSKAGTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK*
Ga0137361_1006728023300012362Vadose Zone SoilSDGSAARIVEGRVAHLPYQFLVRISRSDKSQAEDLEVNVLDSSGKPLAGFPQAMPNPLTKTGDSSRKEFEIPVSEAVKKKIEKTFLEKDQFLTHVDLIVGMDDDFLSAHFSNVR*
Ga0137419_1055465213300012925Vadose Zone SoilKPGEKSVSAPSFEREGLPTSRLLISRGLATRIVKGRVAHLPYQFSVKISRGNDSETGTLSVNVLDSSGKPLSGFPHVMQNPLSRTGDTSRKEFEVPISRALKKKIEKSVLVQDQFITHVDLIVGMDEDFLSGDFPK*
Ga0137404_1072783113300012929Vadose Zone SoilHFPDAKLGEKNVSAPSYEREALPMSRIAVSDGSAARIVEGRVAHLPYQFLVRVSRSDKSQAEDLEVNVLDSSGKPLAGFPQVMPNPLTKTGDSSRKEFEISVSEAVKKNIEKTLLEKDQFLTHVDLIVGMDDDFLSANFSNVR*
Ga0137407_1112658113300012930Vadose Zone SoilVNVLDSSGKSLAGFPQTMPNPLTKTGDTSRREFEIPIDDTLKKNIEKALLGKDQFLTHVDLVIGMDDDFLSADFPK*
Ga0137407_1223457713300012930Vadose Zone SoilTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK*
Ga0126369_1206422913300012971Tropical Forest SoilRANGGSETETLEVNVLDSSGKPLAGFPQVMPNPLAKGGDISRKEFEIPIDQALKEKIEKSFLAKEQFITHVDLIVGIDDDFLSQDFPK*
Ga0134081_1027678023300014150Grasslands SoilHAKPGEKSVSAPSFEREALPTSHVAISDGVSERIVEGRVAHLPYHFSVKISRGSDSETGTLEVDVLDSSGNSLAGFPQVMPNPLIKTGDSSRKEFELPVEQDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK*
Ga0134075_1053746313300014154Grasslands SoilFKREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKISRANGGSEAETLEVKVLDSSGRPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEKIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK*
Ga0182037_1173485423300016404SoilADLNGVPERIVEGRVEHLPYQFLVKISRANGGSEAETLEVNVLDSSGKPLAGFPQVMPNPLTKGGDISRKEFEIPIDQALKEKIEKSFLAKEQFITHVDLIVGIDDDFLSQDFPK
Ga0184605_1040786723300018027Groundwater SedimentRVVNTVSPTTQVVEGRVAHLPYHFLVKVAQCNGSETGTLEVNIADNSGKSLAGFPRVIPNPLTKTGDTSRKEFEIPIDQSLKKKIEKALLAKGQFLTHVDLIIGMDEDFLSADFPK
Ga0184605_1045675713300018027Groundwater SedimentKNVSAPSYEREALPMSRIAVSDGSAARIVEGRVAHLPYQFLVRISRSDKSQAEDLEVNVLDSSGKPLAGFPQVMPNPLTKTGDSSRKEFEIPVSEAVKKNIEKTLLEKDQFLTHVDLIVGMDDDFLSAHFSNVR
Ga0066667_1039936013300018433Grasslands SoilSRGSDSESGTLEVDVLDSSGNSRAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK
Ga0173482_1039026913300019361SoilEHFRHVKPWGIAVSASSFKREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKSSRANGGSEAETLEVNVLDSSGKPLAGFPQVMPNPLTKGGDISRKEFEIPIDQALKEKIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK
Ga0137408_112597533300019789Vadose Zone SoilLPMSRIAVSDGSAARIVEGRVAHLPYQFLVRVSRSDKSQAEDLEVNVLDSSGKPLAGFPQVMPNPLTKTGDSSRKEFEIPVSEAVKKNIEKTLLEKDQFLTHVDLIVGMDDDFLSANFSNVR
Ga0193722_101407213300019877SoilYQFSVKISRGNDSETGTLSVNVLDSSGKPLSAFPQVMQNPLSRTGDTSRKEFEVPIGRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGDFPK
Ga0193729_1000600123300019887SoilVNVLDSSGKPLSAFPQVMQNPLSRTGDASRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGDFPK
Ga0193728_135513613300019890SoilISRRNDSETGTLSVNVLDSSGKPLSAFPQVMQNPLSRTGDASRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGDFPK
Ga0210405_1071959013300021171SoilLPLSKVAISDGSVARIIEGRVAHLPYRFFVKITRNSDADKGTLEVNVLDTSGKPLKGFPRVIANPVTKTGDSSRKDFEIPVSKTLKKEIEKRLLAKDQFLTYVDLIVG
Ga0210383_1078534323300021407SoilADKGTLEVNILDNSGKPLKGFPQVIANPLTKTGDSSRKDFEIPVSKTLTKEIEETLLAKDQFLTYIDLMIGMDDDFLSLDFPK
Ga0193709_100266953300021411SoilLPYQFSVKISRRNDSETGTLSVNVLDSSGKPLSAFPQVMQNPLSRTGDTSRKEFEVPIGRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGDFPK
Ga0126371_1384155813300021560Tropical Forest SoilREGLPESRVADLNGVPERIVEGRVGHLPYQFLVKISRANGGSETETLEVNVLDSSGKPLAGFPQVMPNSLTKGGDISRKEFEIPIDQALKEKIEKSLLAKEQFITHVDLIVGIDDDFLSQDFPK
Ga0207699_1100234513300025906Corn, Switchgrass And Miscanthus RhizosphereVKPWGIAVSAASFKREGLPKLRIADLNGVPERIVEGRVGHLPYQFLVKISRANGGSEAETLEVEVLDSSGRPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEKIEKSLLTKEQFITHVDLIVGMDNDFLSQEFP
Ga0207657_1036148323300025919Corn RhizospherePYQFLVKIRGGTDSENGTLEVNVLNSSGKPAAGFPQVMANPLTKAGDSSRKEFDLPIGKSRKQKIKKALLTQDEFLTHVDLIVGMDDDFLSADFPK
Ga0207690_1047319423300025932Corn RhizosphereSAFEREALPLSRVTISNGVPERTVEGRVAHLPYQFLVKIRGGTDSENGTLEVNVLNSSGKPAAGFPQVMANPLTKAGDSSRKEFDLPIGKSRKQKIKKALLTQDEFLTHVDLIVGMDDDFLSADFPK
Ga0207691_1144713613300025940Miscanthus RhizosphereGEKSVSASAFEREALPLSRVTISNGVPERTVEGRVAHLPYQFLVKIRGGTDSENGTLEVNVLNSSGKPAAGFPQVMPNPLTKAGDSSRKEFDLPIGKSRKQKIKKALLTQDEFLTHVDLIVGMDDDFLSADFPK
Ga0209470_105840233300026324SoilVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK
Ga0209801_126870923300026326SoilGEKSVSAPAFEREALPMSHVALSDGVSERIVKGRVAHLPYEFLVKITRRSDSETGTLEVNVLDSSGKPLVGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK
Ga0209801_130624413300026326SoilADREPLPMSRMAVSDSLASRIVEGSVAQLPYQFSVNISRSSDSKAGTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK
Ga0209266_107579813300026327SoilVAHLQYHFSVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK
Ga0209803_129579723300026332SoilPPSYDREPLPMSRMAVSDSLASRIVEGSVAQLPYQFSVSISRSSDSKAGTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK
Ga0257176_106769013300026361SoilPYQFSVKISRGNDSETGTLSVNVLDRSGKPLSGFPQVMQNPLSRTGDTSRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGDFPK
Ga0209160_132550613300026532SoilSQRIVKGRVAHLPYEFLVKITRRSDSETGTLEVNVLDSSGKPLVGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK
Ga0209056_10002636243300026538SoilVEGRVAHLPYQFLVKIRRGSDSETGTPEVNVLDNLGKPLAGFPQVMPNPLTKTGDPSRKEFELPVEKDLNKKIRETLLTQEQFLTHVDLIVGMDDDFLSADFPK
Ga0209056_1049133213300026538SoilVRYHFREQFNNAKPGQKRFSAPSYDREPLPMSRMAVSDGLASRIVEGSVAQLPYQFSVNISRSSDSKAGTLEVNVLDSSGKSLAGFPQVMTNPLTKAGDTSRKEFEIPIDQTLKKKIEKALLAKDQFLTHVDLIIGMDDDFLSADFPK
Ga0209156_1023323113300026547SoilYHFREHFRHAKPGEKSVSAPSFEREALPTSHVAISDGVSERIVEGRVAHLPYHFSVKISRGSDSESGTLEVDVLDSSGNSLAGFPQVMPNPLTKTGDSSRKEFELPVEKDLKKKIKKMLLAQDQFLTHVDLIVGMDDDFLSADFPK
Ga0209974_1043777013300027876Arabidopsis Thaliana RhizosphereFREHFHHAKPGEKTVSAPSFEREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKISRANGGSGAETLEVNVLDSSGRPLAGFPQLMPNPLAKGGDISRKEFEIPIDQALKEKIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK
Ga0209526_1080210313300028047Forest SoilPTSRLLISRGLATRIVKGRVAHLPYQFSVKISRGNDTEPGTLSVNVLDSSGKSLTGFPQVMQNPLSRTGDTSRKEFEVPISRALKKKIEKRVLVQDQFITHVDLIVGMDEDFLSGDFPK
Ga0268265_1140462413300028380Switchgrass RhizosphereDSENGTLEVNVLNSSGKPAAGFPQVMANPLTKAGDSSRKEFDLTIGKSRKQKIKKALLTQDEFLTHVDLIVGMDDDFLSADFPK
Ga0170824_10840793033300031231Forest SoilMSRIAVSDGSAARIVEGRVANLPYQFLVRISRSDKSQAEDLEVNVLDSSGKPLAGFPQVMPNPLTKTGDSSRKEFEIPVSEAVKKNIEKTLLKKDQFFTHVDLIVGMDDDFVSAHFSNVR
Ga0170824_11231059913300031231Forest SoilRIAVSVGLAARIVEGRVAHLPYRFSIKISRSNDSEPGTLEVNVLTSSGKSVAGFPQVMQNPISRTGDTSRREFEIPISGALKKNIEKRLLVHDQFVTHVDLIVGMDEEFLSGDFPN
Ga0170820_1711152513300031446Forest SoilAEDLEVNVLDSSGKPLAGFPQVMPNPLTKTGDSSRKEFEIPVSEAVKKNIEKTLLKKDQFFTHVDLIVGMDDDFVSAHFSNVR
Ga0307469_1065116113300031720Hardwood Forest SoilACEVRYHFREHFRDAKPGAKRVSAPSYEREPLPLSRVEVSDDLATRIVEGHVTHLPYEFLVKIGRNSESEVATLEINVLDSSGKSLAGFPQVMPNPLTRSGDTSRKEFEIPIDEALKKKIEKALLAKEQFLTHVDLIIGMDDDFLSADFPK
Ga0318500_1051377213300031724SoilKREGLPKLRVADLNGVPERIVEGRVGHLPYQFLVKISRANGGSEAETLEVNVLDSSDKPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEQIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK
Ga0306918_1047375323300031744SoilLPYQFLVKISRANGGSEAETLEVNVLDSSGKPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEQIEKSLLAEEQFITHVDLIVGMDDDFLSQDFPK
Ga0307477_1084641423300031753Hardwood Forest SoilAISDGFVARIIEGRVAHLPYRFFVKITRNSDADKGTLEVNVLDTSGKPLKGFPRVIANPVTKTGDSSRKDFEIPVSKTLKKEIEKRLLAKDQFLTYVDLIVGIPQSSLIYRVGPFHSPRKKTI
Ga0307473_1129389313300031820Hardwood Forest SoilYHFREHFRDAKPGAKGVSAPSYEREPLPLSRVEVSDGLATRIVEGHVTHLPYEFLVKIGRNSESEVATLEINVLDSSGKSLAGFPQVMPNPLTRSGDTSRKEFEFPIDEALKKKIEKALLAKEQFLTHVDLIIGMDDDFLSADFPK
Ga0306925_1018796253300031890SoilVPERIVEGRVGHLPYQFLVKISRANGGSEAETLEVNVLDSSDKPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEQIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK
Ga0310909_1114074923300031947SoilYQFLVKISRANGGSEAETLEVNVLDSSGKPLAGFPQVMPNPLTKGGDISRKEFEMPIDQALKEQIEKSLLAKEQFITHVDLIVGMDDDFLSQDFPK
Ga0307472_10045158023300032205Hardwood Forest SoilNSDADKGTLEVNVLDTSGKPLKGFPQVIANPLTKTGDSSRKDFEIPVSKTLKKEIEKTLLAKDQFLTYVDLIVGMDEDFLSEWLSALLGTFCPRQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.