NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101756

Metagenome / Metatranscriptome Family F101756

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101756
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 125 residues
Representative Sequence MAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVDTWLHQGEVSRDESRRRFEEFKSNTRRRGEELGRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKGAGATPSSSGPAATRERPQP
Number of Associated Samples 80
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 64
AlphaFold2 3D model prediction Yes
3D model pTM-score0.38

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.020 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil
(31.372 % of family members)
Environment Ontology (ENVO) Unclassified
(28.431 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(57.843 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Mixed Signal Peptide: No Secondary Structure distribution: α-helix: 59.49%    β-sheet: 0.00%    Coil/Unstructured: 40.51%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.38
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF09723Zn-ribbon_8 39.22
PF03952Enolase_N 18.63
PF00857Isochorismatase 13.73
PF00905Transpeptidase 4.90
PF00912Transgly 2.94
PF02749QRPTase_N 1.96
PF03699UPF0182 1.96
PF13424TPR_12 1.96
PF00781DAGK_cat 0.98
PF01039Carboxyl_trans 0.98
PF01645Glu_synthase 0.98
PF08545ACP_syn_III 0.98
PF08541ACP_syn_III_C 0.98
PF01729QRPTase_C 0.98
PF13683rve_3 0.98
PF02609Exonuc_VII_S 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG0148EnolaseCarbohydrate transport and metabolism [G] 18.63
COG1335Nicotinamidase-related amidaseCoenzyme transport and metabolism [H] 13.73
COG1535Isochorismate hydrolaseSecondary metabolites biosynthesis, transport and catabolism [Q] 13.73
COG0157Nicotinate-nucleotide pyrophosphorylaseCoenzyme transport and metabolism [H] 2.94
COG0744Penicillin-binding protein 1B/1F, peptidoglycan transglycosylase/transpeptidaseCell wall/membrane/envelope biogenesis [M] 2.94
COG1488Nicotinic acid phosphoribosyltransferaseCoenzyme transport and metabolism [H] 2.94
COG4953Membrane carboxypeptidase/penicillin-binding protein PbpCCell wall/membrane/envelope biogenesis [M] 2.94
COG5009Membrane carboxypeptidase/penicillin-binding proteinCell wall/membrane/envelope biogenesis [M] 2.94
COG1597Phosphatidylglycerol kinase, diacylglycerol kinase familyLipid transport and metabolism [I] 1.96
COG1615Uncharacterized membrane protein, UPF0182 familyFunction unknown [S] 1.96
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 0.98
COG0777Acetyl-CoA carboxylase beta subunitLipid transport and metabolism [I] 0.98
COG0825Acetyl-CoA carboxylase alpha subunitLipid transport and metabolism [I] 0.98
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 0.98
COG1722Exonuclease VII small subunitReplication, recombination and repair [L] 0.98
COG4799Acetyl-CoA carboxylase, carboxyltransferase componentLipid transport and metabolism [I] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.02 %
All OrganismsrootAll Organisms0.98 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300027667|Ga0209009_1000007All Organisms → cellular organisms → Bacteria133902Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil31.37%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil28.43%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil13.73%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.88%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.96%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.96%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil1.96%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring0.98%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil0.98%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.98%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost0.98%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil0.98%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000878Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A20-5 cm-9A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300001086Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3EnvironmentalOpen in IMG/M
3300001089Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3EnvironmentalOpen in IMG/M
3300001145Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2EnvironmentalOpen in IMG/M
3300001160Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2EnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007982Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaGEnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010860Boreal forest soil eukaryotic communities from Alaska, USA - C5-2 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015193Arctic soil microbial communities from a glacier forefield, Rabots glacier, Tarfala, Sweden (Sample Rb6, proglacial stream)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027537Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027546Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027591Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027603Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027681Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027684Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027727Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AL9A1W_108937823300000878PermafrostMAEQRPSDGTERIREVVRDGFTIMVGAASWAFEQGDRLVGTWMDQGHVSREEGRRRFDEFAGKTKQAGEDLSRRVSESVKSARTSMPVATRDQVASLERRVEELS
JGI12709J13192_100523123300001086Forest SoilMAERHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLHQGQMSREESRRRFEEFKSNTRRRGEDLSRRVSEGVRSSMPVATRDQIASLERQIADLKRQLESMQSAGATASSSGPTVTRERPQP*
JGI12683J13190_101104233300001089Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIE
JGI12682J13319_100412523300001145Forest SoilMAEHHDPNEHQHSEASDRVREVVRDSFTIALGAASWAFEQGDRLIDTWLHQGHVSREEGRRRFEEFASNTRRRGEDLSRRMRSSMPVATRDEIAKLERQVAELKSEIESLKAGNTVTQ*
JGI12654J13325_100458923300001160Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDXWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIESMKSAGTTPSSSTPATRDRPSQS*
JGI12635J15846_1003023053300001593Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIESMKSAGTTPSSSTPATRDRPSQS*
JGI12635J15846_1019553033300001593Forest SoilMAEQHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLRQGEMSREESRRRFEEFKSNTRSRGEDLSRRVQERVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSSSGTTATRERPQP*
JGI12053J15887_1001574263300001661Forest SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGHVSREESRRRFDELASSTRRRGEDLSRRVQQSVRSSMPIATRDQIASLERQVADLKQQLESMKSAGATTSSSGPTATRERPPQG*
JGIcombinedJ26739_10004302733300002245Forest SoilMAEQHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVETWLHQGEMSREESRRRFEEFKSNTRSRGEDLSRRVQERVRSSMPIATRDQIASLERQVADLKRQLESMQSAGATPSSSGTTATRERPQP*
JGIcombinedJ26739_10007817443300002245Forest SoilMAEQHQHSEGTERVREAIRDSFTIMLGAASWAFEMGDQMVDTWMHRGQVSREESRRRFDEFASSTRRRGEDLGRRVQQGVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSTSKPD*
JGIcombinedJ26739_10026658333300002245Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDSWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIESMKSAGTTPSSSTPATRDRPSQS*
JGI25382J43887_1023324533300002908Grasslands SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEQGDRLVETWLHQGQISREEGRRRFEEFASSSRRRGEDLSRRVSEGMRSSMPVATRDQVASLERQVAELKQ
Ga0066672_1063297323300005167SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELDSLKGTGGTPSSPGQTATRERPPLS*
Ga0066677_1038253123300005171SoilMAEHHPPDGGDRTTQHQPSDGGERVREVIRDGFTIMLGAASWAFDQGDRLVDTWMQQGQVSREQGRRRFEEFASRTRQAGEDLGRRVQDSVKTARSSMPLATRDQVADLERQVA
Ga0066679_1047355723300005176SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTGATPSSPGQTATRERPPLS*
Ga0066690_1070394313300005177SoilRDGFTIMLGAASWAFEQGDRLVDTWMHQGEISREEGRKRFEEFTSRTRRAGEDLGRKVQDSVRTARSSMATREQVANLERQVAELTRQVESLKGTSASPSGQAVTRERPQP*
Ga0070714_10001743653300005435Agricultural SoilMAEQEHSAGGDPEHLAGGDHGHSPGTERVREAVRDGFTIMLGAASWAFEQGDRMVDTWLHQGELSRAESRKRFDEFASRTRRAGEDFGRKVQDSMRSARSTVATREQVANLEKQVAELTRQVESLKGSGSPPGVSVTRERPQP*
Ga0070708_10227671823300005445Corn, Switchgrass And Miscanthus RhizosphereMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVDTWLHQGEVSRDESRRRFEEFKSNTRRRGEELGRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKGA
Ga0066682_1006555433300005450SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEQGDRLVETWLHQGHISREEGRRRFEEFASSSRRRGEDLTRRVSEGMRSSMPVATRDQVASLERQVAELK*
Ga0070699_10079274523300005518Corn, Switchgrass And Miscanthus RhizosphereMAEQHSHSEGTERVREAIRDGFTIMLGAASWAFEQGDRLVDTWFEQGQISREAGRRRFEEFASTTRRRGEDLTRRVSDSMRSSMPVATRDQVASLERQVAELKQQIESMKSAGATPPSPGPAGTRERSQT*
Ga0070699_10138526013300005518Corn, Switchgrass And Miscanthus RhizosphereMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVDTWLHQGEVSRDESRRRFEEFKSNTRRRGEELGRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKGAGATPSSSGPAATRERPQP*
Ga0070697_10000202643300005536Corn, Switchgrass And Miscanthus RhizosphereMAEQHSHSEGTERVREAIRDGFTIMLGAASWAFEQGDRLVDTWLQQGQISREQGRRRFEEFASTTRRRGEDLTRRVSDSMRSSMPVATRDQVASLERQVAELKQQIESMKSASATPPSPGPAATRERSQP*
Ga0066701_1080942713300005552SoilMAEEHQHSEGSERVREAVRDGFTIMLGAASWAFEQGDRLIDTWLHQGQISREEGRRRFEEFASNTRRRGEDLSRRVSDSMRSSMPVATRDQVANLERQVAELTRQIE
Ga0066700_1006609033300005559SoilMAEQHQRSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGQVSREESRRRFDEFASSTRRRGEDLSRRVQQSMRSSMPIATRDQIASLERQVADLKQQLESMKGAGATTSSSGSTATRERPPQS*
Ga0066670_1037601723300005560SoilHEPYEGSERVREAIRDGFTIMLGAASWAFEQGDRLVDTWLREGRVSREEGRRRFEDFTTRTRRAGEDLGRRVQDSMRNARSSMPVASREQVANLERQVAELTRQSESMKSGGGPTTSTQRGRTDS*
Ga0066706_1001786243300005598SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTGATPSAPGQTATRERPPLS*
Ga0066696_1002896433300006032SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTSATPSAPGQTATRERPPLS*
Ga0070716_10043259313300006173Corn, Switchgrass And Miscanthus RhizosphereMAEQEHSAGGDPEHLAGGDHGHSPGTERVREAVRDGFTIMLGAASWAFEQGDRMVDTWLHQGELSRAESRKRFDEFASRTRRAGEDFGRKVQDSMRSARSTVATREQVANLEKQVAELTRQVESLKGSGSPPGVSV
Ga0066660_1001326813300006800SoilMAEHHPPDGGDRTTQHQPSDSGERVREVIRDGFTIMLGAASWAFDQGDRLVDTWMQQGQVSREQGRRRFEEFASRTRQAGEDLGRRVQDSVKTARSSMPLATRDQVADLERQVAELTRQI
Ga0099793_1001291743300007258Vadose Zone SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELDSLKGTGGTPSSPGQTATRDRPPLS*
Ga0099793_1031074123300007258Vadose Zone SoilMAEQHPHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLTRRVSDSMRSSMPVATRDQVASLERQVAELKQQIESMKSAGATPPPAGPTGTRERPQP*
Ga0102924_105131123300007982Iron-Sulfur Acid SpringMEEQHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDSWLHYGETSREESRRRFEEFKSNTRRRGEDLSRRVSHSMRSSMPVATRDQVASLERQVAELKQQIESMKSAGATPPSAGPTLTRERHQP*
Ga0099829_1064894413300009038Vadose Zone SoilMAEQHPHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLHQGEMSREESRRRFEEFKSNTRRRGEDLSRRVQEGVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSPSGSTATRERTQP*
Ga0099830_1044899823300009088Vadose Zone SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVDTWLHQGEVSRDESRRRFEEFKSNTRRRGEELGRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKSGATPSSSGPAATRERPQS*
Ga0099828_1003880543300009089Vadose Zone SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVDTWLHQGEVSRDESRRRFEEFKSNTRRRGEELGRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKGAGVNPSSPGPTATRERPQS*
Ga0099828_1139920413300009089Vadose Zone SoilMAEQHPHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLHQGEMSREESRRRFEEFKSNTRRRGEDLSRRVQEGVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSPSGATATRERTQP*
Ga0099792_1006078113300009143Vadose Zone SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVETWLHQGEVSREESRRRFEEFKSDTRRRGQDLSRRVSESVRSSMPVASREHVANLERQVAELTRQIESMKGPGATPSSPGPAASRERPQP*
Ga0134067_1023298513300010321Grasslands SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGDASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTGATPSAPGQT
Ga0126351_106544813300010860Boreal Forest SoilHSEGSERVRETIRDGFTIMLGAASWAFEMGDQMVDTWLHRGQVSREESRRRFDEFASSTRRRGEDLGRRVQQSVRSSMPVATRDQIASLERQVADLKRQLEAMQAGASPSGATPTRERPQP*
Ga0137389_1006221343300012096Vadose Zone SoilMAEQHPHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLHQGEMSREESRRRFEEFKSNTRRRGEDLSRRVQEGVRSSMPVATRDQIASLERQVADLKRQLESMQGAGATPSPSGATATRERTQP*
Ga0137388_1019426933300012189Vadose Zone SoilMAEQHPHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLHQGEMSREESRRRFEEFKSNTRRRGEDLSRRVQEGVRSSMPVATRDQIASLERQVADLKR
Ga0137399_1003406143300012203Vadose Zone SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELDSLKGTGATPSSPGQTATRDRPPLS*
Ga0137376_1063073513300012208Vadose Zone SoilMAEHEQSPGSEHVHSPGSERLREALRDGFTIMLGAASWAFERGDRMVDTWLHQGEVSREQGRRRFDEFASNARRRGEDLSRRVSSSMRSSVPVATRDQVANLERQVAELTRQ
Ga0137376_1158021013300012208Vadose Zone SoilPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSSRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTGATPSSPGQTATRERPPLS*
Ga0137390_1139547713300012363Vadose Zone SoilEAIRDGFTIMLGAASWAFELGDRMVETWLHQGEVSREESRRRFEEFKSDTRRRGQDLSRRVSESVRSSMPVASREHVANLERQVAELTRQIESMKGAGVNPSSPGPTATRERPPS*
Ga0137390_1169787913300012363Vadose Zone SoilIMLGAASWAFELGDRMVETWLHQGEVSREESRRRFEEFKSDTRRRGQDLGRRVSESVRSSMPVASREHVANLERQVAELTRQIESMKGPGATPSSPGPTASRERPQP*
Ga0137397_1067603023300012685Vadose Zone SoilMAEQHPHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLTRRVSDSMRSSMPVATRDQVASLERQVAELKQQIESMKSAGATPSSSGSAATRERTPQS*
Ga0137395_1066197323300012917Vadose Zone SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVETWLHQGEASREESRRRFEEFKSDTRRRGQDLSRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKGVGATPSSPGPAATRERPQP*
Ga0137396_1002303153300012918Vadose Zone SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGHVSREESRRRFDEFASTTRRRGEDLSRRVQQGVRSSMPVATRDQIASLERQVADLKQQLESMKSAGATTSSSGPTATRERPPQG*
Ga0137396_1015244913300012918Vadose Zone SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIARLERQVAELKSEVESLKGTGATPSSAG
Ga0137396_1030244523300012918Vadose Zone SoilDGFTIMLGAASWAFEQGDRMVDTWLHQGHVSREQGRRRFEEFASNARRRGEDLGRKVSDSMRSSVPVATRDQVANLERQVAELTREVESLKGGGSSSPPGQP*
Ga0137396_1086146813300012918Vadose Zone SoilMAEQHPHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLRKGQFTRDESRRRFDEFASTTRRRGEDLTRRVSDSMRSSMPVATRDQVASLERQVAELKQQIESMKSAGATPSSSGPAATRERTPQS*
Ga0137394_1047677213300012922Vadose Zone SoilMAEQHPHSEGTERVREAIRDGFTIMLGAASWAFEMGDQMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLTRRVSDSMRSSMPVATRDQVASLERQVAELKQQIESMKSAGATPSSSGPAATRERTPQS*
Ga0137416_1018370133300012927Vadose Zone SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGHVSREESRRRFDEFASTTRRHGEDLSRRVQQGVRSSMPVATRDQIASLERQVADLKQQLESMKSAGATTSSSGPTATRERPPQG*
Ga0167668_102511323300015193Glacier Forefield SoilMAEQHQRSEGSERVRETIRDGFTIMLGAASWAFEMGDQMVDTWLHRGQVSREESRRRFDEFASSTRRRGEDLGRRVQESVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATASSPPQRSHTET*
Ga0137418_1019467623300015241Vadose Zone SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELDSLKGTGTTPSSPGQTATRDRPPLS*
Ga0193751_101984643300019888SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEQGDRMVDTWLHHGEMSREESRRRFDEFTSNARRRGEDLGRRVSNSMRSSMPVATRDQVANLERQVAELTREVESLKAGGAASPSEPTASERR
Ga0215015_1094498933300021046SoilMAEEHPHSEGTERVREAIRDGFTIMLGAASWVFELGDRMVDTWLHQGEVSREESRRRFEEFKSNTRRRGQDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIEAMKGGGATPSSQGPTATRERPQR
Ga0210409_1119776423300021559SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEMSDQMVDTWLHQGQVSREESRRRFEEFKSNTRRRGEDLSRRVSEGVRSSIPVATRDQITSLERQVADLKRQLESMQG
Ga0137417_110346623300024330Vadose Zone SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGHVSREESRRRFDEFASTTRRHGEDLSRRVQQGVRSSMPVATRDQIASLERQVADLKQQLESMKSAGATTSSSGPTATRERPPQG
Ga0207664_1004104723300025929Agricultural SoilMAEQEHSAGGDPEHLAGGDHGHSPGTERVREAVRDGFTIMLGAASWAFEQGDRMVDTWLHQGELSRAESRKRFDEFASRTRRAGEDFGRKVQDSMRSARSTVATREQVANLEKQVAELTRQVESLKGSGSPPGVSVTRERPQP
Ga0207665_1055258623300025939Corn, Switchgrass And Miscanthus RhizosphereMAEQEHSPGGDHEHSPGTERVREAVRDGFTIMLGAASWAFEQGDRMVDTWLHQGELSRAESRKRFDEFASRTRRAGEDFGRKVQDSMRSARSTVATREQVANLERQVAELTRQVESLKGSGSPPGVSVPRERPQP
Ga0209027_100342143300026300Grasslands SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTGATPSSPGQTATRERPPLS
Ga0209686_121008613300026315SoilMTEQHQHSEGTERVREAIRDGFTIMLGAASWAFEQGDRLVETWLHQGQVSREEGRRRFEEFAASSRRRGEDLSRRVSEGMRSSMPVATRDQVASLERQVAELKQQ
Ga0257177_107910923300026480SoilMAEQHPHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLHQGEMSREESRRRFEEFKSNTRRRGEDLSRRVQEGVRSSMPVATRDQIASLERQVADLKRQLESM
Ga0257181_104119913300026499SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVETWLHQGEVSREESRRRFEEFKSDTRRRGQDLSRRVSESVRSSMPVASREHVANLERQVAELTRQIESMKGAGATPSSPGPTASRERP
Ga0209807_100789743300026530SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTSATPSAPGQTATRERPPLS
Ga0209161_1002763243300026548SoilMAEHNDPHEHQHAEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTGATPSAPGQTATRERPPLS
Ga0209474_1006556923300026550SoilVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELESLKGTSATPSAPGQTATRERPPLS
Ga0209648_1049567623300026551Grasslands SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVETWLHQGEVSREESRRRFEEFKSDTRRRGQDLGRRVSESVRSSMPVASREHVANLERQVAELTRQIESMKGAGATPSSPGPTASRERPQP
Ga0209648_1052168013300026551Grasslands SoilMAEEHQHSEGGERVREVVRDGFTIMLGAASWAFEQADHMVDTWLHQGHISREEGRRRFDDFTSTARMKGEEVSRRVQETMRSARMPLATREQVANLERQVEELTRQIESLKSGDAERPE
Ga0209419_100287323300027537Forest SoilMAEHHDPNEHQQSESSERVREVVRDSFTIALGAASWAFEQGDRLIDTWLHQGHVSREEGRRRFEEFASNTRRRGEEFGHRMRSSMPVATRDEVARLERQVAELKSEIESLKAGSSVTQ
Ga0208984_102491823300027546Forest SoilMAEHNDPHEHQHAAGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSRQEGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIARLERQIADLKSQLESMKSAGTTSSSSSPTATRERPQP
Ga0208984_102687633300027546Forest SoilREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGHVSREESRRRFDELASSTRRRGEDLSRRVQQSVRSSMPIATRDQIASLERQVADLKQQLESMKSAGATTSSSGPTATRERPPQG
Ga0209220_100098793300027587Forest SoilMAEQHQHSEGTERVRETIRDGFTIMLGAASWAFELGDRMIDTWMRQGQVTREESRRRFDEFASTTRRRGEDLGRRVSESVRSSMPVATRDHVASLERQVAELTRQIESMKSSGATASSSGPTATRERPQP
Ga0209220_100819443300027587Forest SoilMAERHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLHQGQMSREESRRRFEEFKSNTRRRGEDLSRRVSEGVRSSMPVATRDQIASLERQIADLKRQLESMQSAGATASSSGPTVTRERPQP
Ga0209220_109409023300027587Forest SoilMAEQHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLRQGEMSREESRRRFEEFKSNTRSRGEDLSRRVQERVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSSSGTTATRERPQP
Ga0209733_100054533300027591Forest SoilMAEHHDPNEHQHSEGSERVREVVRDSFTIALGAASWAFEQGDRLIDTWLHQGHVSREEGRRRFEEFATNTRRRGEEFGHRMRSSMPVVTRDEVARLERQVAELKSEIESLKAGSTVTQ
Ga0209733_102760713300027591Forest SoilMAEQHQRSEGSERVRETIRDGFTIMLGAASWAFEMGDQMVDTWLHRGQVSREESRRRFDEFASSTRRRGEDLGRRVQQSVRSSMPVATRDQIASLERQVADLKRQLESMQAGASPSGATATRERPQP
Ga0209331_103856123300027603Forest SoilMAEQHQHSEGTERVREAIRDSFTIMLGAASWAFEMGDQMVDTWMHRGQVSREESRRRFDEFASSTRRRGEDLGRRVQQGVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSTTKPD
Ga0209076_108283813300027643Vadose Zone SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSREQGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIAKLERQIAELKSELDSLK
Ga0209117_105503423300027645Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDSWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVTELTRQIESMKGAGTTPSSSTPATRDRPSQS
Ga0209217_100303133300027651Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIESMKSAGTTPSSSTPATRDRPSQS
Ga0209217_101863433300027651Forest SoilMAEQHQHSEGTERVREAIRDSFTIMLGAASWAFEMGDQMVDTWMHRGQVSREESRRRFDEFASSTRRRGEDLGRRVQQGVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSTSKPD
Ga0209217_103469633300027651Forest SoilMAEQHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVETWLHQGEMSREESRRRFEEFKSNTRSRGEDLSRRVQERVRSSMPIATRDQIASLERQVADLKRQLESMQSAGATPSSSGTTATRERPQP
Ga0209009_1000007623300027667Forest SoilMAEHHDPNEHQHSEASDRVREVVRDSFTIALGAASWAFEQGDRLIDTWLHQGHVSREEGRRRFEEFASNTRRRGEDLSRRMRSSMPVATRDEIAKLERQVAELKSEIESLKAGNTVTQ
Ga0209118_100177763300027674Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDSWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIESMKSAGTTPSSSTPATRDRPTQS
Ga0209118_101332333300027674Forest SoilMAEQHQHSEGTERVREAVRDGFTIMLGAASWAFEMGDRMVDTWLRQGEMSREESRRRFEEFKSNTRSRGEDLRRRVQERVRSSMPVATRDQIASLERQVADLKRQLESMQSAGATPSSSGTTATRERPQP
Ga0209118_110792413300027674Forest SoilMAEQHQHSEGTERVREAIRDSFTIMLGAASWAFEMGDQMVDTWMHRGQVSREESRRRFDEFASSTRRRGEDLGRRVQQGVRSSMPVATRDQIASLERQVADLKRQ
Ga0208991_101601923300027681Forest SoilMAEQHQHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGHVSREESRRRFDELASSTRRRGEDLSRRVQQSVRSSMPIATRDQIASLERQVADLKQQLESMKSAGATTSSSGPTATRERPPQG
Ga0208991_102089823300027681Forest SoilMAEHNDPHEHHHSEGSERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSRQEGRRRFEEFASNTRRRGEDLSRRVSESMRSSMPVATRDEIARLERQIADLKSQLESMKSAGTTSSSSSPTATRERPQP
Ga0209626_112158823300027684Forest SoilMAEQHRHSEGTERLRETVRDGFTIMLGAASWAFELGDRMVDSWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIESMKSAGTTPSSSTPATRDRPSQS
Ga0209626_116275023300027684Forest SoilRVREVVRDSFTIALGAASWAFEQGDRLIDTWLHQGHVSREEGRRRFEEFASNTRRRGEEFGHRMRSSMPVATRDEVARLERQVAELKSEIESLKGPGATPSSPGQTATRERPQP
Ga0209328_1000241463300027727Forest SoilMAEQHPHSEGTERVRETIRDGFTIMLGAASWAFELGDRMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESVRSSMPVATRDHVASLERQVAELTRQIETMKSAGTTPSSSTPATRDRPSQS
Ga0209689_104096543300027748SoilMAEQHQRSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLHQGQVSREESRRRFDEFASSTRRRGEDLSRRVQQSMRSSMPIATRDQIASLERQVADLKQQLESMKGAGATTSSSGSTATRERPPQS
Ga0209701_1056358023300027862Vadose Zone SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVDTWLHQGEVSRDESRRRFEEFKSNTRRRGEELGRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKSGATPSSSGPAATRERPQS
Ga0209283_1004439323300027875Vadose Zone SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVDTWLHQGEVSRDESRRRFEEFKSNTRRRGEELGRRVSESVRSSMPVATRDHVAQLERQVAELTRQIESMKGAGVNPSSPGPTATRERPQS
Ga0209488_1006184133300027903Vadose Zone SoilMAEEHQHSEGTERVREAIRDGFTIMLGAASWAFELGDRMVETWLHQGEVSREESRRRFEEFKSDTRRRGQDLGRRVSESVRSSMPVASREHVANLERQVAELTRQIESMKGPGATPSSPGPTASRERPQP
Ga0137415_1080906113300028536Vadose Zone SoilMAEQHPHSEGTERVREAIRDGFTIMLGAASWAFEMGDRMVDTWLRQGQVTREESRRRFDEFASTTRRRGEDLSRRVSESMKSSMPVATRDQVASLERQVAELKQQIESMKSAGATPS
Ga0307477_1001055833300031753Hardwood Forest SoilMAEHHDPNEHQHSEGGERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSRDEGRRRFEEFASNTRRRGEEFGQRMRSSMPVATRDEVARLEREVAELKSEIESLKAGSSVTS
Ga0307478_10002344143300031823Hardwood Forest SoilMAEHHDPNEHQHSEGGERVREVVRDGFTIMLGAASWAFEQGDRLVDTWLHQGHVSRDEGRRRFEEFASNTRRRGEEFGQRMRSSMPVATRDEVARLERQVAELKSEIESLKAGSSVTS
Ga0307471_10002296743300032180Hardwood Forest SoilMAEQHSHSEGTERVREAIRDGFTIMLGAASWAFEQGDRLVDTWFEQGQISREQGRRRFEEFASTTRRRGEDLTRRVSDSMRSSMPVATRDQVASLERQVAELKQQIESMKSAGATPPSPGPAGTRDRSQT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.