NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F094191

Metagenome Family F094191

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F094191
Family Type Metagenome
Number of Sequences 106
Average Sequence Length 165 residues
Representative Sequence MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRP
Number of Associated Samples 99
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(17.924 % of family members)
Environment Ontology (ENVO) Unclassified
(26.415 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(42.453 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 68.15%    β-sheet: 0.00%    Coil/Unstructured: 31.85%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 106 Family Scaffolds
PF03949Malic_M 25.47
PF05494MlaC 5.66
PF04909Amidohydro_2 4.72
PF02518HATPase_c 3.77
PF12697Abhydrolase_6 1.89
PF13291ACT_4 0.94
PF01609DDE_Tnp_1 0.94
PF02129Peptidase_S15 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 106 Family Scaffolds
COG0281Malic enzymeEnergy production and conversion [C] 25.47
COG0686Alanine dehydrogenase (includes sporulation protein SpoVN)Amino acid transport and metabolism [E] 25.47
COG2854Periplasmic subunit MlaC of the ABC-type intermembrane phospholipid transporter MlaCell wall/membrane/envelope biogenesis [M] 5.66
COG3039Transposase and inactivated derivatives, IS5 familyMobilome: prophages, transposons [X] 0.94
COG3293TransposaseMobilome: prophages, transposons [X] 0.94
COG3385IS4 transposase InsGMobilome: prophages, transposons [X] 0.94
COG5421TransposaseMobilome: prophages, transposons [X] 0.94
COG5433Predicted transposase YbfD/YdcC associated with H repeatsMobilome: prophages, transposons [X] 0.94
COG5659SRSO17 transposaseMobilome: prophages, transposons [X] 0.94


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil17.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil16.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil10.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil9.43%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment7.55%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil7.55%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.72%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.72%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds3.77%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.83%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.89%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil1.89%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.89%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.89%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil1.89%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.94%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.94%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.94%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.94%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil0.94%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002886Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cmEnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2EnvironmentalOpen in IMG/M
3300019889Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2c2EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2EnvironmentalOpen in IMG/M
3300020060Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2c2EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300026498Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-AEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028796Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033502Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fractionEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI25612J43240_101502313300002886Grasslands SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSLRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLAR
Ga0066680_1036450423300005174SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPR
Ga0066685_1112169113300005180SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHA
Ga0066388_10034968413300005332Tropical Forest SoilMKRALVALLVVLVVLGATLYFFVVRPLLRPSEMTAVAENALATDDLLLLGGINVNQAVFLERWFFGAPAATATPASLPAAADRSLIEHLRAAGVDPRHDVDYVLYALYPAAEATRHAVVLVGRFNPNAVNGYLTRELAATPRAPAGPASYAVVRTDQSTCQPAATWIVTANSRWIVLADPATHAVLLPRLTSPPPESGDRLTWWRPLARADVAGLGITG
Ga0066681_1047999313300005451SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKHAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRAGAGPTSFEVARTDPRRRSSRSSSPWT
Ga0070698_10050396813300005471Corn, Switchgrass And Miscanthus RhizosphereMKKVLAAAIAVLVVLGAGLYFFVARPLLRPSEVTAVAERALAADDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADASVSRHAAILVGRFNPAAVNTYLARELAATPRPGPGPASFDVIRIDPATCQPGASWVVTVTREWIVMA
Ga0070695_10057986613300005545Corn, Switchgrass And Miscanthus RhizosphereMKKALAALAAALVVLVAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRPGAAWVVTVSPEWIVLADPAS
Ga0066704_1058935413300005557SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPAPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAGPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVTRIDPATCQPGASWVVTVAREWIVMADPISQPPLVARLAG
Ga0066708_1011528213300005576SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHD
Ga0075024_10082875613300006047WatershedsRTIVDMKRALAALLGVLIVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFFGTPRISTVQAVPTPAVADRTLIDHLRVAGVDARHDVDYALYAVYPAAETTRHAVVLLGRFNPPAINAYLTRELQATPRAGAGPASYEVVRTNPTTCQPDATW
Ga0070712_10151262423300006175Corn, Switchgrass And Miscanthus RhizosphereMKKALAALAAALVVLVAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTREL
Ga0066653_1010851023300006791SoilMIVMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYAL
Ga0075425_10204249113300006854Populus RhizosphereMKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRP
Ga0075434_10074592323300006871Populus RhizosphereMIGMKKVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAEGALVTEDLLLLGGINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLTRELKATPRAGSGP
Ga0075424_10088775713300006904Populus RhizosphereMIGMKKVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAEGALVTEDLLLLGGINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLTRELKATPRAGSGPASFEVARTDPATCQPGATWVVTATAEWIVL
Ga0075424_10149265723300006904Populus RhizosphereMKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHIRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRP
Ga0079219_1162289413300006954Agricultural SoilMKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGPRHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDP
Ga0079219_1201528213300006954Agricultural SoilRPLLRPSEVTAVMERVLATDDLLLMGAVNVKQAVFLEKWFLGVPRATPVVDPPAPAPSVADRTLLDHLRAAGVDPRHDVDGALYALYPTDGPAARHAAVLVGRFNPATVNAYLTRELGATPRPGPGPASYQVTRIDPATCQPGAAWLVTVSREWIVMADPVSHPMLLARVASPPAGSPERLAW
Ga0066710_10137352613300009012Grasslands SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRTGAGPTSFEVARTDPATCQPGATW
Ga0099829_1023264523300009038Vadose Zone SoilMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAIAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQVVPTPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRPDSTTC
Ga0099829_1031869423300009038Vadose Zone SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRLGPGPASFDVARIDPATCQPGASWVVTVAREWIVMADPISQPTLVAR
Ga0099829_1089721923300009038Vadose Zone SoilMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRPDSTTC
Ga0099830_1009744323300009088Vadose Zone SoilMKKVLAVVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDP
Ga0099827_1012735533300009090Vadose Zone SoilMIGMKRVVAAVLCVLVALTIALYLLVVRPLLRPPDVTAVAEGALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHALYALYPASAEGARHAVILVGRFNPATINAYLTRELKATLRAGAGPASFEV
Ga0114129_1061490913300009147Populus RhizosphereMKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHIRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRPGAAWVVTVSPEWIVLA
Ga0126373_1126243523300010048Tropical Forest SoilMKRALVALLVVLVVLGATLYFFVVRPLLRPSEMTAVAENALATDDLLLLGGIKVNQAVFLERWFFGAPAAAATPASLPAAADRSLIEHLRAAGVDPRHDVDYVLYALYPAAEATRHAVVLVGRFNPNAVNGYLTRELAATPRAPAGPASYAVVRT
Ga0126376_1226111013300010359Tropical Forest SoilEDVAVAVDDHGGTLPYPAEITSGKPRLSLTIEAMKRALVALLVVLVVLGATLYFFVVRPLLRPSEMTAVAENALATDDLLLLGGINVNQAVFLERWFFGAPAATATPVSLPAAADRSLIEHLRAAGVDPRHDVDYVLYALYPAAEATRHAVVLVGRFNPNAVNGYLTRELAATPRAPAGPASYAVVRTDPSTCQP
Ga0137389_1012685933300012096Vadose Zone SoilMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLQAKPRAGAGSASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADP
Ga0137380_1072650823300012206Vadose Zone SoilMKKVLAAVVAVLVVLGAGLYFFVARLLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPAPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAGPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASF
Ga0137378_1137564713300012210Vadose Zone SoilMIGMKRVVAAVLCALVALTIALYLFVVRPLLRSSDVTAVAEGALVTEDLLLLGSINVKQAAFLEKWLLGAPQGTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHALYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRAGAGPASFEVARTDPSTCQPGATWVVTATAEWIVLADPSSHPALLARFA
Ga0137375_1047727913300012360Vadose Zone SoilMKRVLAALLCGLIVLGAALYLFVVRPLLRPSELTAIAESALATEDLLVLGGINVRQAVFLERWFQGTPRVPPAQTVSTPTAVADRTFLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTALNAYLTRDLQAKPRAGAGPASYEVVRTDSTTCQPAAPWIVTVAPEWIVLADPASHTPLLSR
Ga0137361_1076743813300012362Vadose Zone SoilMKKVLAAVLAVLVMLGAGVYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRALLDHLRAAGVDPRHDVDHVLYALYPADAAGSRHAAILVGRFNPAAVNAYLARELAATPRPGPGPASFDVIRIDPATCQPGASWVVTATREWVVMADPISQPTLIARLIGAPAATPE
Ga0137390_1121692113300012363Vadose Zone SoilMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGSASYE
Ga0137358_1012907223300012582Vadose Zone SoilMKKVLAAAIALLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDLDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLARELAATPR
Ga0137404_1060691613300012929Vadose Zone SoilMKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFMGTPRVPTVPAVSAPAMADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAEPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHPTLLPRLASPPAENKEQLGWWR
Ga0137403_10006220123300015264Vadose Zone SoilMKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFMGTPRVPTAPAVSAPAMADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAEPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHPTLLPRLASPPA*
Ga0134085_1009334913300015359Grasslands SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHA
Ga0134069_129848013300017654Grasslands SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLT
Ga0187821_1015280313300017936Freshwater SedimentMRKALVGLVAALVVLVAALYFFVARPLLRPSEVTAVMERALATDDLLLMAAVNVKQAVFLEKWFLGAPRATPVADPPAPASSVADRTLLDHLRAAGVDPRHDVDGALYALYPTDGPAARHAAVLVGRFNPATVNAYLTRELGATPRPGPGPASYQVTRIDPATCQPGAA
Ga0184608_1052924513300018028Groundwater SedimentGSIHRNRTTNHFGLNSVSRTIVGMKRALAALLCVLVVLGVALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPTAVADRTLLEHLRVAGVDARQDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATP
Ga0184621_1026918913300018054Groundwater SedimentMKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVVENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVATPAVADRSLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLRGRFNPTAINTYLTRDLHATPRAGVGPASYEVVRTDSTTCQPGAPWIVTVASEWIVLADPASHI
Ga0184619_1037467713300018061Groundwater SedimentMKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVAENAVATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPTAVADRSLFEHLRGAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGA
Ga0184632_1005127613300018075Groundwater SedimentLVVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFLGTPRGPTGHAVPTPAVVDRTLIEHLRVAGVDARHDVDYALYAVYPAAAEATRHALVLLGGFNPTAINAYLTRDLQATPRAGAGPASYPASYEVVRTDPTTCQPGAPSTTMAGPWRWLGPISPLPPGRSNG
Ga0184632_1046817013300018075Groundwater SedimentTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAIYPAAAEATRHAVVLLGRFSPTAINAYLTRDLHATPRAGAGPASYEVVRPDSTTCQPGATWVVTVAPEWIVLADPASHTTLLPRLASPLPENKEQLGW
Ga0184609_1013856313300018076Groundwater SedimentMKRALAALLCVLVVLGASLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPAGQAVPTPAVADRTLFEHLRVAGVDARHDVDHALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVIRTDSTTCRPGATWIVTVAPEWIVLADPASHTALLPRLASP
Ga0184609_1043308013300018076Groundwater SedimentMKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVVENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAASTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPEWIALADPASHP
Ga0184612_1040050813300018078Groundwater SedimentMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAVPPPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVLLLGRFNPTAINAYLTRDLHATPRAGAGPASFEVLRTDSTT
Ga0066667_1064805423300018433Grasslands SoilMIGMKRVVAAVLCALVALTIALYLLVIRPLLRPSDVTAVAEGALVTEDLLLLGSVNVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPR
Ga0193727_102633723300019886SoilMRKVLAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLSRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHTTLLPRLASPPPENREELGWWRSLARADVAS
Ga0193743_105300213300019889SoilMKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWLLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYPASYEVIRTDSTTCQPGAPWIVTVAPEWIVLADPASHTTLLPRLASPPPENKEQLGWWRPLARAD
Ga0193731_104015823300020001SoilMKKVLAAVLAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYGLYPADAAGSRHAAILVGRFNPAAVNAYLTRELAATPRPGPGPASFDVTRIDPATCQPGASWVV
Ga0193717_118243013300020060SoilMKRALAALLGVLVVGGAALYLFVVRPLLRPADMTAAAESALATEDLLLLGGINVKQAVFLERWFLGTPPISTVPTATTSAVADRTLFDHLRAAGVSTRHDVDHALYAVYPAAAESTRHAVVLLGRFNPAAINAYLTRELQAAPRAGAGPASYEVVRTDPTTCR
Ga0179594_1007181513300020170Vadose Zone SoilMKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFMGTPRVPTVQAVSAPAMADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAEPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHPTLLPRLASPPAENKEQLGWWRALARADVASVGIMAPDRLETG
Ga0210407_1061094723300020579SoilMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGVNVKQAAFLERWLLGRPPATTAAAESAPGAADRTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRTDPTTCRP
Ga0210403_1082603823300020580SoilMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRIDPTTCRPSAAWLVTAAPEWIVLADPASHAILLPRFAGAS
Ga0210401_1068753513300020583SoilMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLMRELQATPHEGSGPASFEVTR
Ga0210379_1038871623300021081Groundwater SedimentMKRALAALLCVLVMLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPAGQAMPTPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTALNAYLTRELRATPRAGVGPASYEVVR
Ga0210404_1023221013300021088SoilMKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSGPASFDVTRVDPTTCRP
Ga0210400_1020137313300021170SoilMKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTAAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTHELGAVPRPGSGPASFDVTRVDPTTCRPGAAWVVTVSPEWIVLADPA
Ga0210408_1119855113300021178SoilMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGVNVKQAAFLERWLLGRPPATTAAAESAPGAADRTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPH
Ga0193719_1039291513300021344SoilMTKVLAAAIAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLTRELAATPRPGPGPASFDVTRIDPATCQPGASWVV
Ga0210389_1091159913300021404SoilMKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLKRWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSGP
Ga0210394_1059526813300021420SoilMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHERS
Ga0210409_1110625613300021559SoilMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTSTDPT
Ga0207684_1147162513300025910Corn, Switchgrass And Miscanthus RhizosphereMKRALAALLGVLVVLGATLYLFVVRPLLRPSDVTAAAESALATEDLLLLGGINVKQAVFLERWFLGTPRVSTVQAVPTPAVADRTLVDHLRAAGVDARQDVDYALYAVYPAAAETTRHAVVLLGRFNPTAINAYLTRELRAPPRAGA
Ga0207663_1070345413300025916Corn, Switchgrass And Miscanthus RhizosphereMKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLT
Ga0209266_112140323300026327SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRTGAGPTSFE
Ga0257179_104960013300026371SoilLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDPATCQPGASWVVTVAREWIVMADPISQPTLVAR
Ga0257167_108182913300026376SoilMKRALAALLCVLVVLGAVLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLQAKPRAGAGSASY
Ga0257177_100516713300026480SoilMKRALAALLCVLVVLGATLYLFVVRPLLWPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPEWIVLADPASHPTLLLRLASPPAENKDQLAWWRA
Ga0257159_100253413300026494SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDPATCQPGASWVVTVAREWIVMADPISQPTLV
Ga0257164_102722013300026497SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLARELAAAPRPGP
Ga0257156_100650413300026498SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLARELAAA
Ga0257181_106063413300026499SoilMKKVLAAAIALLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFIGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDPATCQP
Ga0257165_100475813300026507SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLARELAAAPRPGPGPASFDVIRIDPATCQPGASWVVTVTRE
Ga0257168_101311123300026514SoilMKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLARELAATPRPGPGPASFDVIRIDPATCQPGASWVVTV
Ga0209376_135939713300026540SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVG
Ga0209805_104991723300026542SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRTGAGPTSFEVARTDPATCQPGATWVVTATAEWIVLADPSSHPALLARFA
Ga0209577_1024497413300026552SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQ
Ga0209217_102556313300027651Forest SoilMKKALAVLAAALVVLGAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPAGAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRP
Ga0209689_120461223300027748SoilMIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQVLYALYPTSAEAARHAVILVGRFN
Ga0209180_1038044023300027846Vadose Zone SoilMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAIAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQVVPTPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRP
Ga0209701_1020821723300027862Vadose Zone SoilMKRVLAALLGVLVVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVPTPAVADRTLIEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPMAINAYLTRELQATPRAGAGPASYEVVRTDPTTCQPGATWIVTVAPEWI
Ga0209283_1092890513300027875Vadose Zone SoilMIGMKRVVAAVLSALVALTIALYLFVVRPLLRPSDVTVVAESALATEDLLLLGGINVKQAAFLEKWFLGAPQVAAVRGEPAPPAADRTLFDHLRAAGVDARRDVDHALYALYPASGEAARHAVIL
Ga0209068_1000591813300027894WatershedsMKKVLAAVVAALVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAAINVKQAVFLEKWFLGSPRATPVAATPPPSVADRSLLDHLRAAGVNPRHDVDHALYALYPAEAPVSRHAAILVGRFNPAAINAYLTRELAATPR
Ga0209488_1045931613300027903Vadose Zone SoilMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRP
Ga0209583_1027511223300027910WatershedsMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGVNVKQAAFLERWLLGRPPATTGVAAPAPGAADRTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLT
Ga0209069_1073610413300027915WatershedsRTIVDMKRALAALLGVLIVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFFGTPRISTVQAVPTPAVADRTLIDHLRVAGVDARHDVDYALYAVYPAAETTRHAVVLLGRFNPPAINAYLTRELQATPRAGAGPASYEVVRTNPTTCQPDATWIVTVAPEWIVLADPASHTALL
Ga0307504_1003096923300028792SoilMKRALAALLCVLAVLGATLYLFVVRPLLRPSDMTAVAESALATEDLLLLGGINVKQAVFLERWFFGTPRISTVQAVPTPAVADRTLIDHLRVAGVDARHDVDYALYAVYPAAETTRHAVVLLGRFNPPALNAYLTRELRATPRAGTGPASY
Ga0307287_1014887313300028796SoilMKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAASTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINTYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASH
Ga0307302_1067507213300028814SoilMKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWLLGTPRVPTVQAVATPAVADRSLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAG
Ga0307277_1032615223300028881SoilMTKVLAAAIAVLVVLGAGLYFFVAGPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRATPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLTRELAATPRPGPGPASFDVTRIDP
Ga0308309_1158905113300028906SoilPRTIGSMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRTDPTTCRPSAAWLVTAAPEWIVL
(restricted) Ga0255310_1022519413300031197Sandy SoilLETGRQETDAGRGTNPFGLMTPSRTMIGMKRVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAAFLEKWFLGAPQVAAVRGEPAPPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLTRELKATPRAGS
(restricted) Ga0255312_111959713300031248Sandy SoilMIDMKRVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAAFLEKWFLGAPQVAAVRGEPAPPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLARELK
Ga0310813_1054063213300031716SoilMKKVLVAVVAALVVLVAALYFFVARPLLRPSEVTAVMERALATDDLLLVAAVNVKQAVFLEKWLIGTPRATPVADTPAPAVADRTVLDHLRAAGVDPRHDVDGALYALYPADGPAARHAAVLVGRFNPAAVNAYLTRELAATPRPGPGPA
Ga0307469_1131731523300031720Hardwood Forest SoilMKRALAALLGVLVLLGAALYLFVVRPLLRPVELTAAAESALATEDLLLLGGINVKQAVFLERWFLGSPRVSTGTAPPPAVADRALLDHLRAAGVDPRHDVDQALYAVYPAAESTRHAVVLIGRFNPTAIDAYLRREL
Ga0307468_10102128313300031740Hardwood Forest SoilMKKALAVLAAAVVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGTRHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRPGAAWVVTVS
Ga0307473_1008589823300031820Hardwood Forest SoilMKKVLAAAIAVLVVLGAGLYFFVARPLLRPSEVSAVAERALAADDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADASVSRHAAILVGRFNPAAVNTYLAR
Ga0307479_1041935813300031962Hardwood Forest SoilMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRTDPTTCRPSAAWLVTAAPEWIVLADSASHAILLPRFAGASTESPEKLAWWRGLARADVASLGIPGLD
Ga0307470_1171143813300032174Hardwood Forest SoilMKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDV
Ga0307471_10066647623300032180Hardwood Forest SoilVKRALAALLGVLVVLGATLYLFVVRPLLRPSDVTAAAESALATEDLLLLGGINVKQAVFLERWFLGTPRVSTVQAVPTPAVADRTLVDHLRAAGVDARQDVDYALYAVYPAGAETTRHAVVLLGRFNPTAINAYLTRELRATPRAGAGPASYEVVRTDPTTCQPGATWVVTVAPAWIVLADPASHTALLPRLASPPSE
Ga0307471_10379524613300032180Hardwood Forest SoilQQIASVPRSRSRTIGGMKKALAVLAAALVVLVAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTC
Ga0307472_10032583413300032205Hardwood Forest SoilMKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASF
Ga0326726_1133627113300033433Peat SoilMKKVLVAVVATLMVLVAALYFFVARPLLRPSEVTAVMERVLATDDLLLMAAVNVKQAVFLEKWFFGAPRATPVADTPAPAVADRTFLDHLRAAGVDPRHDVDGALYALYPTDGAAARHATILVGRFNPATVNAYLTRELGATPRPGPGRASYQVTRTDPATCQPGATWLVTVSREWIVMADPVSHPMLLARVASPPV
Ga0326731_100605113300033502Peat SoilMKKVLVAVVATLVVLVAALYFFVARPLLRPSEVTAVMERVLATDDLLLMAAVNVKQAVFLEKWFFGAPRATPVADTPAPAVADRTFLDHLRAAGVDPRHDVDGALYALYPTDGPAARHATVLVGRFNPATVKAYLTRELG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.