NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F040027

Overview

Basic Information
Family ID F040027
Family Type Metagenome / Metatranscriptome
Number of Sequences 162
Average Sequence Length 86 residues
Representative Sequence GFTSCGQRTSGAFTSNNVARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP
Number of Associated Samples 89
Number of Associated Scaffolds 161

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.62 %
% of genes from short scaffolds (< 2000 bps) 0.62 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.20

Most Common Taxonomy
Group Unclassified (99.383 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (45.062 % of family members)
Environment Ontology (ENVO) Unclassified (85.802 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (88.889 % of family members)
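
The percentages in this overview can be converted back to approximate member counts. The sketch below assumes the denominator is the 162 sequences listed under Basic Information; the function name `members_from_percent` is hypothetical and not part of NMPFamsDB.

```python
def members_from_percent(percent: float, total: int = 162) -> int:
    """Convert a '% of family members' value to a rounded member count.

    Assumes `total` is the family's Number of Sequences (162 on this page).
    """
    return round(percent / 100 * total)


# Percentages copied from the overview above.
for label, pct in [
    ("Unclassified taxonomy", 99.383),
    ("GOLD grasslands soil", 45.062),
    ("ENVO unclassified", 85.802),
    ("EMPO soil (non-saline)", 88.889),
]:
    print(f"{label}: ~{members_from_percent(pct)} of 162")
```

If a percentage on the page uses a different denominator (the gene neighborhood tables, for example, are stated per 161 family scaffolds), pass that value as `total` instead.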




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 16.82%, Coil/Unstructured: 83.18%

Predicted 3D Structure

pTM-score: 0.20

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name             % Frequency in 161 Family Scaffolds
PF01391  Collagen         2.48
PF00857  Isochorismatase  1.86
PF01883  FeS_assembly_P   1.86
PF13432  TPR_16           1.24
PF13414  TPR_11           0.62
PF01796  OB_aCoA_assoc    0.62
PF00254  FKBP_C           0.62
PF14833  NAD_binding_11   0.62
PF00510  COX3             0.62
PF02628  COX15-CtaA       0.62
PF00378  ECH_1            0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name  Functional Category  % Frequency in 161 Family Scaffolds
COG1335  Nicotinamidase-related amidase  Coenzyme transport and metabolism [H]  1.86
COG1535  Isochorismate hydrolase  Secondary metabolites biosynthesis, transport and catabolism [Q]  1.86
COG1545  Uncharacterized OB-fold protein, contains Zn-ribbon domain  General function prediction only [R]  0.62
COG1612  Heme A synthase  Coenzyme transport and metabolism [H]  0.62
COG1845  Heme/copper-type cytochrome/quinol oxidase, subunit 3  Energy production and conversion [C]  0.62



Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
Unclassified   root  N/A            99.38 %
All Organisms  root  All Organisms  0.62 %

Associated Scaffolds


Scaffold  Taxonomy  Length
3300026314|Ga0209268_1076473  All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium  991




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  45.06%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  32.72%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  8.02%
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  6.79%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  3.70%
Soil  Environmental → Terrestrial → Soil → Loam → Grasslands → Soil  3.09%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  0.62%



Associated Samples

Taxon OID  Sample Name  Habitat Type
3300005171  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126  Environmental
3300005175  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122  Environmental
3300005177  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139  Environmental
3300005178  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137  Environmental
3300005179  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133  Environmental
3300005180  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134  Environmental
3300005181  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127  Environmental
3300005186  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125  Environmental
3300005187  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124  Environmental
3300005445  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG  Environmental
3300005446  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135  Environmental
3300005447  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138  Environmental
3300005450  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131  Environmental
3300005536  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG  Environmental
3300005540  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146  Environmental
3300005555  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141  Environmental
3300005556  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156  Environmental
3300005557  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153  Environmental
3300005558  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147  Environmental
3300005560  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119  Environmental
3300005569  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154  Environmental
3300005576  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157  Environmental
3300005586  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140  Environmental
3300005587  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103  Environmental
3300005598  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155  Environmental
3300006031  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100  Environmental
3300006032  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145  Environmental
3300006034  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105  Environmental
3300006046  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101  Environmental
3300006794  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107  Environmental
3300006796  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114  Environmental
3300006797  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108  Environmental
3300009012  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159  Environmental
3300009089  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG  Environmental
3300009090  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG  Environmental
3300009137  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158  Environmental
3300010301  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015  Environmental
3300010303  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015  Environmental
3300010304  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015  Environmental
3300010321  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015  Environmental
3300010322  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG  Environmental
3300010325  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG  Environmental
3300010326  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG  Environmental
3300010329  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015  Environmental
3300010333  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG  Environmental
3300010335  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015  Environmental
3300010337  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015  Environmental
3300012198  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG  Environmental
3300012200  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG  Environmental
3300012224  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_4_2 metaT (Metagenome Metatranscriptome)  Environmental
3300012378  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_16_1 metaT (Metagenome Metatranscriptome)  Environmental
3300012917  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG  Environmental
3300012929  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)  Environmental
3300012930  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)  Environmental
3300012972  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG  Environmental
3300012975  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015  Environmental
3300012976  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG  Environmental
3300012977  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG  Environmental
3300014150  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG  Environmental
3300014154  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015  Environmental
3300014157  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG  Environmental
3300015356  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG  Environmental
3300015357  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG  Environmental
3300015358  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG  Environmental
3300017656  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015  Environmental
3300018431  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104  Environmental
3300018433  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116  Environmental
3300018482  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118  Environmental
3300025910  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)  Environmental
3300025922  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)  Environmental
3300026298  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)  Environmental
3300026307  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)  Environmental
3300026314  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)  Environmental
3300026316  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)  Environmental
3300026324  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)  Environmental
3300026325  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)  Environmental
3300026331  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)  Environmental
3300026333  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)  Environmental
3300026523  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)  Environmental
3300026528  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)  Environmental
3300026529  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)  Environmental
3300026530  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)  Environmental
3300026532  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)  Environmental
3300026538  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)  Environmental
3300026540  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)  Environmental
3300026547  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)  Environmental
3300026548  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)  Environmental
3300026550  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)  Environmental
3300032180  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515  Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066677_1011783113300005171SoilPIACTSNADCSTVSGFVSCGQRTSGAFTALDVARTISETGAAAGALTTGGAAKPAKLVSIFCIPLTFNSLVDSAADLPGPGAVALQGSAQNLP*
Ga0066677_1082357813300005171SoilVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066673_1009487423300005175SoilPGFTSCGQRTSGAFTSSNIARTIVETGAVSGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0066673_1045468613300005175SoilLPIPCTSNADCSGQTQGLGFPSCGQRTSGAFTASNVARTIVETGSPATALTTGGAAQPAKLVSIFCIPLTFSTLVDSAGDLPGPGAVALPVTMQNQ*
Ga0066690_1044167723300005177SoilVAAPCLPIPCTANADCSGQIQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGAGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIAGTAQALP*
Ga0066690_1044971623300005177SoilQTQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGVTQSLP*
Ga0066688_1023591123300005178SoilFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066684_1013974123300005179SoilADCAGQTQSPGFTSCGQRTSGAFTSSNIARTIVETGAVSGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0066685_1072378623300005180SoilCSGQTQSPGFTSCGQRTSGAFTATNVARTIVENGSPAGPLTTGGPAKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAI*
Ga0066678_1018194023300005181SoilFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLASIFCIPLTFSSLVDSAADLPGPGAVALQGTTQAFP*
Ga0066678_1051571223300005181SoilANTDCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066678_1067177723300005181SoilSCGQRTAGAFTAMDVARTIVEMGSPSGSLATGGPAKPATLVSIFCIPLTFSTLVDSAADLPGPGAVAITGVAQALP*
Ga0066676_1048755613300005186SoilCSGAPCLPVPCTANANCASVTGFTSCGQRTSGAFCSGDLTSEVNPCSDLARTIVEKGVPAGALTTGGAAKPATLVSIFCMPLTFSALADSADDLPGPAAVALPIQVQLQP*
Ga0066676_1114362413300005186SoilAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLASIFCIPLTFSSLVDSAADLPGPGAVALQGTTQAFP*
Ga0066675_1006351223300005187SoilAAAPCLPIPCTSNADCSGQIQSPGFTSCGQRTSGAFTASNVARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP*
Ga0070708_10065490913300005445Corn, Switchgrass And Miscanthus RhizosphereSGNANCAAATGFTSCGQRTSGAFTANDVARTIVETGVAVGPIVTGGPAQPQTLVSIFCIPPSFSPAVDTAADLPGPGAVALQGTTQLQ*
Ga0066686_1063683013300005446SoilCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066689_1006863413300005447SoilACGGDPCLPVPCTSNTDCSTLGAFNSCGQRTSGAFTAVDVARTIVETGTAAGALTTGGLPQPGDLVSIFCIPLTFNSLVDSAGDLPGPGAVALPVTMQIQ*
Ga0066689_1072113723300005447SoilARTIYETGLPAGPLTTGGPAVHQILVSIFCIPPTFSPAVDAAADLPGPGAVAFDGSVVMTP*
Ga0066682_1020936723300005450SoilSVALPCLPIPCASNTDCSGQTQSPGFTSCGQRTSGAFTGSNVARTIVESGAAAGAMTTGGPAKPAKLASIFCIPLTFSALVDTAADLPGPGAVALSGTAQNLP*
Ga0070697_10059395813300005536Corn, Switchgrass And Miscanthus RhizosphereTIHETGSPAGALTTNGPAKPATLVSIFCIPPSFTAVVDSAADLPGPGAVSLPGTTQNLP*
Ga0070697_10179289713300005536Corn, Switchgrass And Miscanthus RhizosphereQRTAGAFCGIGSPSNPCDDVTRTIVETGSPAGALTTGGAAKPAKLVSIFCIPPSFNALVDSAADLPGPGAVSIPGVTQAMP*
Ga0066697_1000626563300005540SoilAVSCASNADCTPLATRGFGSCGQRTPGAFTANDLARTIVEQGAAAGPLTTGGAAAPETLVSIFCIQPTFNALVDAGYDLPGPGAVALSGTTQLQ*
Ga0066697_1034664523300005540SoilAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066692_1027193913300005555SoilAAPCLPIPCTANSTCAGQTQAPGFTSCGQRTSGAFTASNVARTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFNSLVDTAANLPGPGAVAIQGETQALP*
Ga0066707_1034450413300005556SoilGFTSCGQRTSGAFTGSNVARTIVESGAAAGAMTTGGPAKPAKLASIFCIPLTFSALVDTAADLPGPGAVALSGTAQNLP*
Ga0066707_1046524623300005556SoilSVAAPCLPIPCTSNANCSGQTQSSGFTSCGQHTAGAFTSSNGARTIVETGSPAGALTTGGAAKAGKLVSIFCIPLTFTMLVDSAADLPGPGAVALPVTMQTQ*
Ga0066707_1063719813300005556SoilCTSNTDCSTLGAFNSCGQRTSGAFTAVDVARTIVETGTTAGALTTGGLPQPGNLVSIFCIPLTFNSLVDSAGDLPGPGAVALPVTMQIQ*
Ga0066704_1096519923300005557SoilNADCSGQIQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAISGTAQALP*
Ga0066698_1007433423300005558SoilFTNTDTARTIVEQGSPSGPLTTGGPAKPATLVSIFCIPLTFSSLVDGAADLPGPGAVALPGMARALP*
Ga0066698_1048415413300005558SoilCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066698_1073129523300005558SoilGFTSCGQRTSGAFTGLNTARTIVETGSPSGPLTVGGPGAPSTLVSAFCIPLTFSSLVDTAADLPGPGAVAITGTAQALP*
Ga0066670_1087581413300005560SoilGFTSCGQRTSGAFTASNVARTIVETGSPATALTTGGAAKPAKLVSIFCIPLTFSTLVDSAGDLPGPGAVALPVTMQNQ*
Ga0066705_1041756423300005569SoilQSPGFTSCGQRTSGAFTSANVARTIVETGTPSGPLTVGGAGAPSTLVSIFCIPLTFTSLVDTAADLPGPGAVAITGTAQALP*
Ga0066708_1015907813300005576SoilACSGAPCLPVACTANTDCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066708_1082913523300005576SoilGQHTAGAFTSSNGARTIVETGSPAGALTTGGAAKAGQLVSIFCIPLTFTMLVDSAADLPGPGAVALPVTMQTQ*
Ga0066691_1014859333300005586SoilTGFTSCGQRTSGAFSSADIARTIYETGLPAGPLTTGGPAVHQILVSIFCIPPTFSPAVDAAADLPGPGAVAFDGSVTLSP*
Ga0066691_1070407813300005586SoilARTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFNSLVDTAANLPGPGAVAIQGVTQALP*
Ga0066654_1014945413300005587SoilAVSCASNADCTPLATRGFGSCGQRTPGAFTANDLARTIVEQGAAAGPLMTGGAAAPETLVSIFCIQPTFNALVDAGYDLPGPGAVALSGTTQLQ*
Ga0066706_1098588823300005598SoilSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0066651_1005425823300006031SoilSNADCSGQIQSPGFTSCGQRTSGAFTANNIARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0066651_1056895423300006031SoilVARTIVENGSPAGPLTTGGPAKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGVTQSLP*
Ga0066696_1035629213300006032SoilDCSGQTQSPGFLSCGQRTSGAFTANNIARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0066696_1043929613300006032SoilGNTGTIGAPCSAAAPCLPIPCTSNTDCAGQTQSPGFVSCGQRTSGAFTGANVARTIVETGSPSGPLTVGGAGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIAGTAQALP*
Ga0066656_1007015813300006034SoilARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP*
Ga0066656_1031180813300006034SoilGKNFASCGQRTAGAFTAMDVARTIVEMGSPSGSLATGGPAKPATLVSIFCIPLTFSALVDSAADLPGPGAVAITGVAQALP*
Ga0066656_1051780123300006034SoilPCAACSGAPCLPVPCTANANCASVTGFTSCGQRTSGAFCSGDLTSEVNPCSDLARTIVEKGVPAGALTTGGAAKPATLVSIFCMPLTFSALADSADDLPGPAAVALPIQVQLQP*
Ga0066656_1058750113300006034SoilGFTSCGQRTSGAFCSGDTTSAVNPCGDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVAITGVTQALP*
Ga0066652_10190819623300006046SoilIPCTSNADCSGQIQSPGFTSCGQRTSGAFTANNIARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0066658_1069365813300006794SoilPCTSNTDCSTLGAFNSCGQRTSGAFTAVDVARTIVETGTAAGALTTGGLPQPGDLVSIFCIPLTFNSLVDSAGDLPGPGAVALPVTMQTQ*
Ga0066658_1087152113300006794SoilPCLPIPCTGNATCSGQTQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDTVADLPGPGAVAIQGVTQSLP*
Ga0066665_1034479613300006796SoilCSVALPCLPIPCTGNADCSGQTQSPGFLSCGQRTSGAFTSSNVARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0066665_1038828113300006796SoilAPCLPIPCTSNANCSGQTQSSGFTSCGQHTAGAFTSSNGARTIVETGSPAGALTTGGAAKAGKLVSIFCIPLTFTMLVDSAADLPGPGAVALPVTMQTQ*
Ga0066659_1025841923300006797SoilTASNIARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP*
Ga0066659_1111075723300006797SoilEPGSPAGPLTTGGPPTPQTLVSIFCVPPTFSTVVDSAADLPGPGAVALEGTTQLQ*
Ga0066710_10192737013300009012Grasslands SoilIPCTSNATCSGQTQSPGFTSCGQRTSGAFTATNVARTIVENGSPAGPLTTGGPAKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGVTQSLP
Ga0066710_10193507923300009012Grasslands SoilQTQSPGFTSCGQRTSGAFTSSNIARTIIETGSPSGALTTGGPGAPATLISIFCMPLTFSSLVDTAYDLPGPGAVSLPVQAQLLP
Ga0066710_10247186923300009012Grasslands SoilSGFTSCGQHTAGAFTSSNGARTIVETGSPAGALTTGGAAKPGKLVSIFCIPLTFTMLVDSAADLPGPGAVALPVTMQTQ
Ga0066710_10404318023300009012Grasslands SoilADCADQTKPPGFTSCGQRTSGAFTSYNVARTIVETGSPAGLLAAGGPAQPATLVSIFCMPLTFSSLVDSADDLPGPGAVAITGTAQALP
Ga0099828_1054170313300009089Vadose Zone SoilPANGNADCAAATGFTSCGQRTSGAFTGTNVARTIVETGSPAGPQTTGGAAQPQTLVSIFCIPPTFSPAVDTAADLPGPGAVALQGTVVLTP*
Ga0099827_1074593423300009090Vadose Zone SoilVTGFTSCGQHTPGAFTNTDTARTIVETGSPSGPLTTGGPAKPATLVSIFCIPLTFSALVDGAADLPGPGAVALPGMAKALP*
Ga0099827_1117346313300009090Vadose Zone SoilNADCSAQTQSPGFTSCGQRTSGAFTALNTARTIVESGAASGALTTGGAAKPAKLASIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0099827_1164346213300009090Vadose Zone SoilTSDTDCAGQTQTQGGGPFISCGQRTGGAFTSSGVARTIVETGSPSGPLTVGGAGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAITGTAQALP*
Ga0066709_10011901513300009137Grasslands SoilNTGTTGAPCNSAAPCLPIPCTANSTCAGQTQAPGFTSCGQRTSGAFTASNVARTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFNSLVDTAANLPGPGAVAIQGETQALP*
Ga0066709_10044770933300009137Grasslands SoilVSGFTSCGQRTGGAFTPNGVARTIVETGSPSGTLTTGGPAKPATLVSIFCIPQTFSALIDGAADLPGPGAVAIEGMAQALP*
Ga0134070_1018156013300010301Grasslands SoilQRTSGAFTSSNIARTIVETGAVSGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0134082_1001495313300010303Grasslands SoilVACTANTDCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134082_1016361713300010303Grasslands SoilCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLASIFCIPLTFSSLVDSAADLPGPGAVALQGTTQAFP*
Ga0134082_1031107213300010303Grasslands SoilNADCSGQTQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGPGAPSKLVRIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP*
Ga0134088_1024266113300010304Grasslands SoilVPCTANADCTAVTGFTSCGQRTSGAFCSGDPTSAVNACSDLARTIVETGAPAGALTTGGASKPATLVSIFCIPPTFNTLVDSAADLPGPGAVSLVGTTQLQ*
Ga0134067_1049519213300010321Grasslands SoilQTLGGGPFISCGQRTGGAFTSSNTARTIVETGSPSGPLTVGGPGAPSTLVSIFCIPLTFSSLVDSAADLPGPGAVAIAGTAQALP*
Ga0134084_1010254613300010322Grasslands SoilACSGAPCLAVSCASNADCTPLATRGFGSCGQRTPGAFTANDLARTIVEQGAAAGPLMTGGAAAPETLVSIFCIQPTFNALVDAGYDLPGPGAVALSGTTQLQ*
Ga0134084_1026208013300010322Grasslands SoilQRTSGAFTASNVARTIVETGSPATALTTGGAAKPGKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134064_1012969713300010325Grasslands SoilALDSDCTSVTGSTSCGQRTAGAFTANDVSRTIVETGMPAGALTTGGPAQPGTLVSIFCIPPSFTQTVDAAADLPGPGAVAIPGMAQALP*
Ga0134065_1005551033300010326Grasslands SoilVPCTSNTDCSTLGAFNSCGQRTSGAFTAVDVARTIVETGTTAGALTTGGLPQPGNLVSIFCIPLTFNSLVDSAGDLPGPGAVALPVTMQIQ*
Ga0134065_1028049823300010326Grasslands SoilIPCTSNADCSGQTQGLGFPSCGQRTSGAFTASNVARTIVETGSPATALTTGGAAQPAKLVSIFCIPLTFSTLVDSAGDLPGPGAVALPVTMQNQ*
Ga0134065_1032875413300010326Grasslands SoilGFTSCGQRTSGAFTASNIARPIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP*
Ga0134065_1046422613300010326Grasslands SoilTANADCSGQTQSPGFTSCGQRTSGAFTSNNVARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0134111_1048907013300010329Grasslands SoilATGFTSCGQRTSGAFTATNVARTIVETGSPAGPLTTGGPPTPQTLVSIFCVPPTFNTVFDSAADLPGPGAVALEGTTQLQ*
Ga0134080_1030654113300010333Grasslands SoilTGFTSCGQRTSGAFTSNDVARTIVETGSPAGSQTTGGAAQPQTLVSIFCIPPTFNTLVDSAADLPGPGAAVLQGTVQLQ*
Ga0134063_10000086283300010335Grasslands SoilVTNFTSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134063_1000140913300010335Grasslands SoilDCTPLATRGFGSCGQRTPGAFTANDLARTIVEQGAAAGPLMTGGAAAPETLVSIFCIQPTFNALVDAGYDLPGPGAVALSGTTQLQ*
Ga0134062_1004728313300010337Grasslands SoilPCLPIPCTGNATCSGQTQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGVTQSLP*
Ga0134062_1015624823300010337Grasslands SoilADCSAQTQSPGFTSCGQRTSGAFTSNNVARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0134062_1025871113300010337Grasslands SoilVPCTSNASCTGATGFTSCGQRTSGAFCSGDTTSAVNPCGDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVAITGVTQALP*
Ga0134062_1046306413300010337Grasslands SoilTASTDCAAVSGFTSCGQRTGGAFTPNGVARTIVETGSPSGTLTTGGPAKPATLVSIFCIPGTFNSLVDSAANIPGPGAVALQGTAQNLP*
Ga0134062_1058475123300010337Grasslands SoilVARTIVETGSPSGPLTVGGPGAPSTLVSIFCIPLTFSSLVDSAADLPGPGAVAITGTAQALP*
Ga0137364_1020399413300012198Vadose Zone SoilSNIARTIVETGAVSGALTTGGAAQPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0137382_1032588023300012200Vadose Zone SoilLSCGQRTSGAFTSSNIARTIVETGAVSGALTTGGAAQPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0137382_1111260423300012200Vadose Zone SoilPCTSNATCSGQTQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGVTQSLP*
Ga0134028_110246413300012224Grasslands SoilGQRTSGAFTASNIARTIVETGSPSGPLTVGGAGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIAGTAQALP*
Ga0134025_111273813300012378Grasslands SoilIPCTANADCSGQTQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGAGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIAGTAQALP*
Ga0134025_111273823300012378Grasslands SoilVARTIVETGSPAGALTVGGAPAPSTLVSIFCIPLTFSSLVDTAADLPGPGAVAINGQAQ
Ga0137395_1116856613300012917Vadose Zone SoilTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFNSLVDTAANLPGPGAVAIQGVTQALP*
Ga0137404_1178998223300012929Vadose Zone SoilSGAFTANNVARTIVETGSPAGLLTTGGPAAPSTLVSIFCMPLTYSSLVDTASDLPGPGAVAITGTMQALP*
Ga0137404_1204404713300012929Vadose Zone SoilFTSANVARTIVETGSPSGPLTVGGTGKPSTLVSIFCIPLTFTNLVDTAADLPGPGSVAITGTAQALP*
Ga0137407_1047268623300012930Vadose Zone SoilAPCLPIPCTSNADCSGQIQSPGFTSCGQRTSGAFTANNIARTIVETGAVSGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0134077_1013702723300012972Grasslands SoilGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134110_1000857613300012975Grasslands SoilTANTDCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134110_1002587713300012975Grasslands SoilCGQRTSGAFCSGDTTSAVNPCGDLARTSVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVAITGVTQALP*
Ga0134110_1009309823300012975Grasslands SoilVARTIVETGSPSGPLTIGGPGAPSTLVSIFCIPLTFSSLVDSAADLPGPGAVAITGTAQALP*
Ga0134110_1014533723300012975Grasslands SoilGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134110_102110222 3300012975 Grasslands Soil GFTSCGQRTSGAFTSNNVARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0134110_102390242 3300012975 Grasslands Soil CTANTDCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134076_102305411 3300012976 Grasslands Soil SCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134076_102345461 3300012976 Grasslands Soil SCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134076_105055432 3300012976 Grasslands Soil GQRTSGAFTSSNIARTIVETGAVTGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0134087_100680531 3300012977 Grasslands Soil TSNADCAGQTQSPGFTSCGQRTSGAFTSSNIARTIVETGAVSGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP*
Ga0134087_102761311 3300012977 Grasslands Soil TANAGCSGQTQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAISGTAQALP*
Ga0134087_104582562 3300012977 Grasslands Soil GFTSCGQRTSGAFTASNVARTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFNSLVDTAANLPGPGAVAIQGETQALP*
Ga0134081_100540551 3300014150 Grasslands Soil SNATCSGQTQSPGFVSCGQRTSGAFTGANVARTIVETGSPSGPLTVGGAGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAITGTAQALP*
Ga0134081_102599172 3300014150 Grasslands Soil VGGPSPGALCCGGANCLGGYVSCAQRDPGAFTAIDVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGVTQSLP*
Ga0134075_100536162 3300014154 Grasslands Soil APCLPVPCTSDTDCSTVSGFISCGQRTAGAFTSLDTARTISETGAAAGALTTGGAAKPAKLVSIFCIPLTFNSLVDSAADLPGPGAVALQGSAQNLP*
Ga0134075_103754202 3300014154 Grasslands Soil TGGAFTPNGVARTIVETGSPSGALTTGGTAKPATLVSIFCIPLTFSALVDSAADLPGPGAVAITGVAQALP*
Ga0134075_104168822 3300014154 Grasslands Soil LDSDCTSVTGSTSCGQRTAGAFTANDVSRTIVETGMPAGALTTGGPAQPGTLVSIFCIPPSFTQTVDAAADLPGPGAVAIPGMAQALP*
Ga0134075_105394271 3300014154 Grasslands Soil TSNATCSGQTQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDSAADLPGPGAVAITGVTQALP*
Ga0134078_106076281 3300014157 Grasslands Soil TSNADCSGQIQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP*
Ga0134073_101288812 3300015356 Grasslands Soil VTGFTSCGQRTPGAFTNTDTARTIVEQGSPSGPLTTGGPAKPATLVSIFCIPLTFSSLVDGAADLPGPGAVALPGMAKALP*
Ga0134073_102440171 3300015356 Grasslands Soil CGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQAFP*
Ga0134072_100819522 3300015357 Grasslands Soil NADCTAVTGFTSCGQRTSGAFCSGDPTSAVNACSDLARTIVETGAPAGALTTGGASKPATLVSIFCIPPTFNTLVDSAADLPGPGAVSLVGTTQLQ*
Ga0134072_102726431 3300015357 Grasslands Soil AVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP*
Ga0134089_103371991 3300015358 Grasslands Soil QSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGAGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAITGTAQALP*
Ga0134089_105329201 3300015358 Grasslands Soil AAAPCLPIPCTANADCSGQTQSPGFTSCGQRTSGAFTANNIARTIVETGAASGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALSGTAQNLP*
Ga0134112_101336702 3300017656 Grasslands Soil CGGAPCLPVPCTSNASCTGATGFTSCGQRTSGAFCSGDTTSAVNPCGDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVAITGVTQALP
Ga0134112_104986102 3300017656 Grasslands Soil TGFTSCGQRTSGAFTSADVARTIIETGSPAGAQTTGGAAQPQTLVSIFCIPPTFNALVDSAADLPGPGAAALQGTTQLQ
Ga0066655_100686592 3300018431 Grasslands Soil GQIQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP
Ga0066655_101776552 3300018431 Grasslands Soil LPIPCASNADCSGQTQSPGFTSCGQRTSGAFTGSNVARTIVESGAAAGAMTTGGPAKPAKLASIFCIPLTFSALVDTAADLPGPGAVALSGTAQNLP
Ga0066655_104399732 3300018431 Grasslands Soil VTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0066667_121841231 3300018433 Grasslands Soil ITESGSPAGPLTTGGAAEPETLVSIFCVPPSFSQIVDPSADLPGPGAVALPYATQLQ
Ga0066669_108988841 3300018482 Grasslands Soil PVACTANTDCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0066669_124324002 3300018482 Grasslands Soil RTSGAFTATNVARTIVENGSPAGPLTTGGPAKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGVTQSLP
Ga0207684_102753371 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere PCTFDADCATVTGFVSCQQRTSGAFTAANVARTIVETGAAAGALTTGGLPKPGTLVSIFCIPPSFTAVVDSAADLPGPGAVALPVAMQTQ
Ga0207684_107548052 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere ADTGCASVTGFKSCGQRTGGAFTALDVARTIVETGVPSGALTTGGPAKPGTLVSIFCIPPSFNPTVDAAAALPGPGAVAIPGMAQAFP
Ga0207646_106220563 3300025922 Corn, Switchgrass And Miscanthus Rhizosphere SNTGTTTAPCNAAAPCLPIPCTSNTDCAGQTQSPGFLSCGQHTAGAFTASNVARTIVETGAAAGALTTGGAARPAKLVSIFCIPNTFNTIVDNSADLPGPGAVALQGTAQNLP
Ga0209236_12091351 3300026298 Grasslands Soil GAFTAMDVARTIVEMGSPSGSLATGGPAKPATLVSIFCIPLTFSTLVDSAADLPGPGAVAITGVAQALP
Ga0209469_10827491 3300026307 Soil IPCTSNADCAGQTQSPGFTSCGQRTSGAFTSSNIARTIVETGAVSGALTTGGAAKPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP
Ga0209268_10764732 3300026314 Soil SCSVGVPCLQCGGAPCLPVPCTSNASCTGATGFTSCGQRTSGAFCSGDTTSAVNPCGDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVAITGVTQALP
Ga0209268_11285821 3300026314 Soil DCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGAAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0209155_11276362 3300026316 Soil CSGQTQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGMTQSLP
Ga0209470_10010121 3300026324 Soil SGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0209470_10032686 3300026324 Soil FVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0209152_102099692 3300026325 Soil TANDVSRTIVETGMPAGALTTGGPAQPGTLVSIFCIPPSFTQTVDAAADLPGPGAVAIPGMAQALP
Ga0209152_103338062 3300026325 Soil TSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0209267_11070892 3300026331 Soil VTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0209267_11403751 3300026331 Soil VSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQALP
Ga0209267_11776821 3300026331 Soil TSGAFTSSNIARTIVETGAVSGALTTGGAAQPGKLVSIFCIPLTFSSLVDTAADLPGPGAVALQGNAQALP
Ga0209158_11882571 3300026333 Soil ARTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFNSLVDTAANLPGPGAVAIQGVTQAL
Ga0209808_10571312 3300026523 Soil VTGSTSCGQRTAGAFTANDVSRTIVETGMPAGALTTGGPAQPGTLVSIFCIPPSFTQTVDAAADLPGPGAVAIPGMAQALP
Ga0209378_12880201 3300026528 Soil GFTSCGQRTSGAFTGSNVARTIVESGAAAGAMTTGGPAKPAKLASIFCIPLTFSALVDTAADLPGPGAVALSGTAQNLP
Ga0209806_10878021 3300026529 Soil SFTGFTSCGQRTSGAFSSADIARTIYETGLPAGPLTTGGPATHQILVSIFCIPPTFSPAVDAAADLPGPGAVAFDGSVTLSP
Ga0209807_12580771 3300026530 Soil NADCSGQIQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAISGTAQALP
Ga0209160_12309611 3300026532 Soil SNVARTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFNALVDTAANLPGPGAVAIQGETQALP
Ga0209056_104305582 3300026538 Soil GLGFTSCGQRTSGAFTATNVARTIVETGSPATALTTGGAAKPAKLVSIFCIPLTFSTLVDSAGDLPGPGAVALPVTMQNQ
Ga0209376_10996112 3300026540 Soil DCAAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLASIFCIPLTFSSLVDSAADLPGPGAVALQGTTQAFP
Ga0209376_13687171 3300026540 Soil PIPCTGNATCSGQTQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPAKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAI
Ga0209156_100681503 3300026547 Soil TQSPGFTSCGQRTSGAFTATNVARTIVEIGSPAGPLTTGGPEKPATLVSIFCIPLTFSSLVDTAADLPGPGAVAIQGMTQSLP
Ga0209156_101266901 3300026547 Soil VSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLVSIFCIPLTFSSLVDSAADLPGPGAVALQGTTQAFP
Ga0209161_102880152 3300026548 Soil TIVETGSPSGPLTVGGPGAPSTLVSAFCIPLTFSSLVDTAADLPGPGAVAITGTAQALP
Ga0209474_101915791 3300026550 Soil AAVTGFVSCGQRTSGAFCSGDPTSAVNPCSDLARTIVETGAPAGALTTGGPAKPAKLASIFCIPLTFSSLVDSAADLPGPGAVALQGTTQAFP
Ga0209474_104855221 3300026550 Soil PCSAAAPCLPIPCTSNADCSGQIQSPGFTSCGQRTSGAFTASNIARTIVETGSPSGPLTVGGPGAPSKLVSIFCIPLTFSSLVDTAADLPGPGAVAIPGTAQALP
Ga0209474_105526891 3300026550 Soil SNVARTIVETGSPAGALTVGGAPAPSTLVSIFCIPLTFSSLVDTAADLPGPGAVAIAGTAQALP
Ga0307471_1041712022 3300032180 Hardwood Forest Soil VSCGQHTSGAFTSADVTRTIVETGAPAGPLTTGGPARPAKLVSIFCIPLTFNSLVDASADLPGPGAVALTGATQALP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice