NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F068450

Metagenome / Metatranscriptome Family F068450

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F068450
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 150 residues
Representative Sequence MARPASNLHLQPKNGGWRARVLVPVELQARLGKKLFHTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGEGYCEAVEVEADGPFKPLVFPSVRGVVTAGTENSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAF
Number of Associated Samples 95
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(34.677 % of family members)
Environment Ontology (ENVO) Unclassified
(36.290 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.613 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 36.46%    β-sheet: 8.84%    Coil/Unstructured: 54.70%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 124 Family Scaffolds
PF00589Phage_integrase 3.23
PF13472Lipase_GDSL_2 3.23
PF04392ABC_sub_bind 1.61
PF13191AAA_16 0.81
PF07883Cupin_2 0.81
PF04986Y2_Tnp 0.81
PF13515FUSC_2 0.81
PF14534DUF4440 0.81
PF13751DDE_Tnp_1_6 0.81
PF05598DUF772 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 124 Family Scaffolds
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 1.61


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil34.68%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil16.13%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil11.29%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil10.48%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil4.84%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.84%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil4.03%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.23%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil3.23%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil1.61%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere1.61%
SoilEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil0.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.81%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.81%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2035918004Soil microbial communities from sample at FACE Site 2 North Carolina CO2-EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010154Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300016422Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111EnvironmentalOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027174Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF040 (SPAdes)EnvironmentalOpen in IMG/M
3300027605Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031128Oak Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031469Fir Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031545Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031679Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23EnvironmentalOpen in IMG/M
3300031682Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22EnvironmentalOpen in IMG/M
3300031736Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f21EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031768Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22EnvironmentalOpen in IMG/M
3300031793Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f21EnvironmentalOpen in IMG/M
3300031819Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21EnvironmentalOpen in IMG/M
3300031821Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20EnvironmentalOpen in IMG/M
3300031833Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031896Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19EnvironmentalOpen in IMG/M
3300031897Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f16EnvironmentalOpen in IMG/M
3300031942Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032025Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f20EnvironmentalOpen in IMG/M
3300032055Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f23EnvironmentalOpen in IMG/M
3300032060Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18EnvironmentalOpen in IMG/M
3300032065Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f20EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032094Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f25EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M
3300033290Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
FACENCA_52157002035918004SoilKKLFHTPVWRVTKAEAAALAYPEVRKFEALIDRARSGGGYCEAVEVEAEEPLRPLVFPTIAGCTTIGTATTFTALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRGTEFFETVVIEAEGELYPAIGDAALGDE
JGIcombinedJ26739_10179493813300002245Forest SoilLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARTGGGYCKAVEVEVEGPLDHLAFPSVRGAVSAGTESRFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHD
Ga0070709_1170756113300005434Corn, Switchgrass And Miscanthus RhizosphereCHSVRCVWGQNRGQYAPLLNGGREMARPASNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKYCSAIEVETQASLRPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGRAEMHFRRLAEFLGHDNGADVTS
Ga0070714_10137240613300005435Agricultural SoilMARPPRSFHLQPKSGGFRARVLVPDELQGKIGKKVLCTPVWQVSEFEAAKLAWPEVQKFEAMIENARTGKFVPVVEREAPGPLQPLVPTFKTHGIRNDASETTFTKLIAEWARKKRIDNPRTRRRAETHFEALAEFLGHDNGAEVTSRDIVRFEKHLETTPDPRTGKL
Ga0070714_10150347723300005435Agricultural SoilMARPATNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKYCSAIEVETQASLRPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGRAVTHFQRLAEFLGHDNGTDVTSHDIVRFE
Ga0070713_10231327913300005436Corn, Switchgrass And Miscanthus RhizosphereLNGGREMARPATNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKYCSAIEVETQASLRPLVPTFHVRGTRTDPSKATFSALIVEWARKKRITNPQSKGKKKSHFDALAEFLGHEDGGRVTPQDIVAFEKYLETTPDPRTGK
Ga0066687_1062818613300005454SoilMARPPRSFHLQPKSGGYRARVLVPVELQGKIGKKVLYTSVWQASETEAANLAWPQVQKFEALLERAKSGKFYPAREMEAEGPLRPLVPTFAVCGTRTDPSETTFTALIAEWARKKRIDNPRTKQHRATHFRALAEFLGHDDGGRVTSHDIVRFEKHLETTPDPRTKKPRHP
Ga0070762_1008722233300005602SoilMARPASNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVETQASLQPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGRAVTHFEALADFLGHDDGADVTSRDIVR
Ga0070763_1035093723300005610SoilLVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCRSAIEVEAQAPLRPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGRAVTHFEALADFLGHDDGADV
Ga0070766_1005629933300005921SoilMARPASNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVETQASLQPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGRAVTHFEALADFLGHDDGADVTSRDIVRFEKHLETTPDPRTGKL
Ga0099795_1030354313300007788Vadose Zone SoilMARPATNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRRRAETHFQRLAEFLGHDNGADVTSPDIV
Ga0066709_10192422813300009137Grasslands SoilLVPVELQGILGKRLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYREAVEVEAEGPLKPLVFPTIAGCTTIGTATTFMALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRG
Ga0126374_1122570113300009792Tropical Forest SoilLHLQPKNGGWRARVLVPVEVQGILGKKLFTTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGEGYCEAVEVEVQGPLKPLVFPSVRGVVTADSEASFTALIAEWARKKRINNPQTKQQRETHFKPCRFPRSR*
Ga0126373_1166620813300010048Tropical Forest SoilVLVPVEVQGILGKKLFTTPVWQVTKSAAAVLAYPEVRKFEAQIEQARSGGGCYAVDVEAEGPLKPLVFPSVRGVVSADNETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSKNIVAFEKMLET
Ga0127503_1115406513300010154SoilPASNLHLQPKNGGWRARVLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYCEAVEVEAEGPLNPLVFPTITGCTTIETATTFTALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPRTGKHPEVVKRLIPYMEKQ
Ga0099796_1052704513300010159Vadose Zone SoilLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYCEAVEVEAEGPLNPLVFPTITGCTTIETATTFTALIEEWARKKRIDNPQTKQKVETHFKSLADFLGHDDGAKVTSQN
Ga0126370_1214910113300010358Tropical Forest SoilMGRPASNLHLQPKNGGWRARILVPVELQDRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFQLRGIRNDATETTFTKLIEEWARKKRINNPQTKQQRETHFKSLADFLGH
Ga0126370_1242141923300010358Tropical Forest SoilLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGEGYCEAVEVEVQGPLKPLVFPSIRGFVTVGTETSFTALIAEWARKKRINNPQT
Ga0126378_1139326823300010361Tropical Forest SoilLVPVELQGILGKKLFQTPVWRVSKSEAAVLAYPEVQKFEALIEQAKSGGCFCEAVEVKAEGPLKPLVFPPVRGVVTVGTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAF
Ga0126378_1141990323300010361Tropical Forest SoilMGVGERGILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIDVEVQGPPNHLAFPSVRGVVTAGTETSFTALIAEWARKKRIDNPQTKQQRETHFMSLADFLGDDDGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVIGYLSSFRGVFTIAVKQSLSTQTP*
Ga0126379_1226189513300010366Tropical Forest SoilLVPVELQGILGKKLFQTPVWRVSKSEAAVLAYPEVQKFEALIERAKSGGYFCKAVEVEVQGPLKPLVFPSVRGVVSADNETSFTALIAEWARKKRINNPQTKQQ
Ga0126379_1254602123300010366Tropical Forest SoilVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEVQGPPNHLAFPSVRGVVTAGTETSFTALIAEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFERMLETTPDPRTGKLRH
Ga0126381_10301243923300010376Tropical Forest SoilLVPVELQGILGKKLFRTPVWRVSKSEAAMLAYPEVKKFEALIERAKSGGCFCEVVEMEAQGPLKPLVFPSVRGVVSVDTETSFTALIAEWARKKRINN
Ga0126381_10307669713300010376Tropical Forest SoilMGRPASNLHLQPKNGGWRARILVPVELQDRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAVEVEAQAPLRALVPSFDFRGIRNDASETTFTKLIAEWARKKRIDNPQTKQQRETHFRALAEFLGYDNGADVTSRDIVRFEKYLETTPDPRTGKP
Ga0126383_1136871013300010398Tropical Forest SoilMARPATNLHLQPKNGGWRARILVPVELQDRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQASLRPLVPSFQLRGIRNDATETTFTKLIEEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEK
Ga0150983_1100189023300011120Forest SoilMARPASNLHLQPKNGGWRARVLVPVELQTILGKKLFHTPVWRVTKAEAAALAYPEVRKFEAQIEQARSGGGCYAVEVEAHGIPDHLVFPSIRGIGSTGSETSFTALIAEWARKKRINNPQTRQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLEATPTLALASCATRTRSSAICRASGASSPSRCNKF*
Ga0150983_1163113713300011120Forest SoilMARPATNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFQLRGVRNDASETTFTKLIEEWARKKRINNPQTRGRAVTHFEALADFLGHDDGADVTSRDIVRFEKHLETTPDPRTGKP
Ga0137363_1162663913300012202Vadose Zone SoilMARPATNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRRRAETHFQR
Ga0137363_1169218613300012202Vadose Zone SoilLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYREAVEVEAEGPLKPLVFPTIAGCTTTETATTFMALIEEWARKKRIDNPQTKQQRE
Ga0137399_1057509213300012203Vadose Zone SoilMARPASNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFHLRGVRNDARETTFTKLIEEWARKKRINNPQTRRRAETHFQRLAEFLGHDNGADVTSHDIVRFEKHLETTPDPRTGKLRHPNTPLTY
Ga0137362_1096741813300012205Vadose Zone SoilMARPASNLHLQPKNGGWRARVLVPVELQASLGKKLFRTPVWRVTKSEAAVLAYPEVQKFEAQIEQARFGGGCFAVDVEAQGPLNHLAFPSIRGINTVATATTFTALIEEWARKKRIDNPQTKKKNETHFKSLADFLGHDDAKRVTSRDIVAFEKRLSTTPDPRTGKLRHPNTILSYLSS
Ga0150985_11451879813300012212Avena Fatua RhizospherePINGGLRARVLIPTELQGKLGKKVFYSPVFRVPETEAAKLAWPEVQKFEAMIEGARTGTFVQAVEMEADGALRPVVPSFQIRGIRNAATETTFPKLIAEWARKKRIDNPLTIAQRHTHFEALADFLGHDNGADVTAADIVGFERYLETTPDPRTGKLRSPNTIIGYLSSSKGVFTVAVQQILLDASP
Ga0137384_1065063423300012357Vadose Zone SoilMKSGGYRARVLVPVELQPKLGRKVFSTPVWQVEKDEAAALAWHHVQKFEAMIEGARTGKFVPIVEIEAPGPLQPLVPTFKTHGIRNDASETTFTKLIAEWARKKRIDNPRTRRRAETHFEALADFLGHDNGADVTSRDIVRFEKHLETTTDPRTGKLRHPN
Ga0150984_10869113913300012469Avena Fatua RhizosphereREMARPASNLHLQPKNGGWRARVLVPVELQAILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYCKAVEVEAEGPLKPLVFPTIMGCTTIETATTFTALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVISQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRGVFTIAV
Ga0150984_12055492713300012469Avena Fatua RhizosphereLRARVSVPVEVQDKIGKKVLYSPVWRVPETEAAKLAWPHVQKFEAMIDGARTGKFVPVVERELPGPLQPLVPSFKVRGIRNAATETTFPKLIAEWARKKRIDNPQTRQQRKTHFEALAEFLGHDNGAEVTSRDIVRFEKHLETTPDPRTGKLRSANTIISYLSSFKG
Ga0137395_1040621313300012917Vadose Zone SoilMARPASNLHLQPKNGGWRARVLVPVELQASLGKKLFRTPVWRVTKSEAAVLAYPEVQKFEAQIEQARSGGGCFAVDVEAQGPLNHLAFPSIRGISTVATATTFTALIEEWARKKRIDNPQTKQKVETHFNRLAAFLGHDNGTKVTSQNIVAFEKLLATTPDPRTG
Ga0137395_1055152413300012917Vadose Zone SoilMARPATNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFQLRGVRNDASETTFTKLIEEWARKKRINNPQTRGRAVTHFEALADFLGHDDGAD
Ga0137396_1123188413300012918Vadose Zone SoilQIAKIQAILPFFLFALGDKIGDKTHYRNRGREMARPASNLHLQPKNGGWRARVLVPVELQASLGKKLFSAPVWRVTKSEAAVLAYPEVQKFEAQIEQARSGGGCYAADVEAQGPLNHLAFPSIRGISTVPNATPFTALIEEWARKKRIDNPQTKQKVETHFKSLADFLGHDDGAK
Ga0137416_1118510513300012927Vadose Zone SoilLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYCEAVEVEAEGPLKPLVFPTIAGCTTIETATTFMALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPRTGKLR
Ga0137404_1141880823300012929Vadose Zone SoilLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALINRARSGGGYCEAVEVEAEGPLKPLVFPTIAGCTTIETATTFTALIEEWARKKRIDNPQTKQQRETHFKSLAAFLGHDNGTKV
Ga0137407_1053620623300012930Vadose Zone SoilLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYCEAVEVEAEGPLKPLVLPTIAGCTTIETATTFMALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVISQ
Ga0126369_1245819713300012971Tropical Forest SoilLVPVELQAKLGKKLFHTPVWRVTKSEAAVLAYPEVRKFEALIERAKSGESCCKTIEVEVQGPLKPLVFPSVRGVVSADTETSFTALIAEWARKKRINNPQTKQQ
Ga0182036_1062022113300016270SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALI
Ga0182036_1149864913300016270SoilEGQVANNPAILPFRPLYLGTKQETIPPLSNGGREMGRPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFYTPVWRVSKSEAAVLAYPEVQKFEALIENARSGKCCSAIEVEAQGMLLPQPAFAVRGVVTAGTETSFTALIAEWARKKRINNPHTKQQRETHFRSLADFLGHDDGAKVTSQNIVAFEK
Ga0182041_1171017613300016294SoilLVPVELQGILGKKLFQTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGEGYCEVVEMEAQRPLKPLVFPSVRGVVTAGTETSFTALIAEWARKKRINNPQTKQQRKTHFKRVADFLGH
Ga0182033_1103828313300016319SoilMGRPASNLHLQPKNGGWRARVLVPVELQGILGKKLFQTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGEGYCEVVEMEAQRPLKPLVFPSVRGVVTAGTETSFTALIAEWARKKRINNPQTKQQRKTHFKRVADFLGHDDGAKV
Ga0182035_1073067923300016341SoilMARPASNLHLQPKNGGWRARVLVPVELQARLGKKLFHTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGEGYCEAVEVEADGPFKPLVFPSVRGVVTAGTENSFTALIAEWARKKRINNPQTKQQ
Ga0182035_1087226513300016341SoilLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPHRQAAPPEYGPWLSVELQGRLHRRGAT
Ga0182032_1165642513300016357SoilLVPVELQGMIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIDVEVQGRPNHLAFPSVQGIVTAGTETSFTALIAEWARKKRIDNPQTKQQREAHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPRTGK
Ga0182040_1053518913300016387SoilMGRPASNLHLQPKNGGWRARVLVPVELQGILGKKLFQTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGEGYCEVVEMEAQRPLKPLVFPSVRGVVTAGTETSFTALIAEWARKKRINNPQTKQQRKTHFKRVADFLGHDDGAKVTSKNIVAFEKMLETTPDPRTGKL
Ga0182040_1183822413300016387SoilLVPVELQSKVGQKVFYTPVWRVSKSEAAVLAYPEVQKFEALIENARSGKCCSVIEVEAQGRILPQPVFAVRGVSTVRTKTETSFTALIAEWARKKRLNNPQTKQQRATHFKSLADFLGHDNGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRGVFTIAVQQFL
Ga0182037_1036301213300016404SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLQPH
Ga0182039_1123436313300016422SoilLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPR
Ga0182038_1081227313300016445SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVQKFEALIDRARSGGSYCKVVEMEAEGPLKRRVFLSVRGVVNADTETSFTALIAEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPRTGKLRHPNT
Ga0182038_1152924113300016445SoilLNGGREMARPATNLHLQPKNGGWRARILVPSELQDRIGKKVLSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIDVEVQGRPNHLAFPSVQGIVTAGTETSFTALIAEWARKKRLNNPQTKQQRATHFKSLADFLGHDNGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRGVFTIAVQQFLI
Ga0210407_1006086523300020579SoilMARPATNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSATEVEAQGPILPQPAFAVRGVSTVRTKTETSFTALIAEMGEEETDR
Ga0210403_1039200013300020580SoilLVPVELQAKLGKKLFHTPVWRVSKSEAAVLAYPEVQKFEALIENARSGKYCSAIEVETQASARPLVPSFQLRGVRNDARETTFTKLVEEWARKKRINNPQTRGRAVTHFQRLAEFLGHDDGDTDLAEKLMVVENED
Ga0210403_1051782523300020580SoilMARPASNLHLQPKNGGWRARVLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEAQIEQARSGEGYCKAVQVEAEGPLNPLVFPSIRGISTVATPTTFTALIEEWARKKRIDNPQTKKKNA
Ga0210399_1063041823300020581SoilMARPASNLHLQPKNGGWRARVLVPVELQAILGKKLFHTPVWRVTKAEAAALAYPEVRKFEAQIEQARSGGGCYAVEVEAQGIPDHLAFPSIRGISTTGTETSFTALIAEWAR
Ga0210399_1127282613300020581SoilLVPVELQAKLGKKLFHTPVWRVSKSEAAVLAYPEVQKFEALIENARSGKYCSAIEVETQASARPLVPSFQLRGVRNDARETTFTKLVEEWARKKRINNPQTRGRAVTHFQRLAEFLGHDDGDTDLAEK
Ga0210404_1007567733300021088SoilMARPASNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGR
Ga0210404_1040679713300021088SoilMARPPRSFHLQPKSGGFRARVLVPDELQGKIGKKVLCTPVWQVSEFEAAKLAWPEVQKFEAMIENARTGKFVPVVEREAPGPLQPLVPTFKTHGIRNDASETTFTKLIAEWARKKRIDNPRTRRRAETHFEALAEFLGHDNGAEVTSRDIVRFEKHLETTPDPRTGKLRHPNTILSYLSSFKG
Ga0210400_1124470213300021170SoilMARPAANLHLQPKNGGWRARVLVPVDLQAKIGKRQFYTPVWRVPRDEAAALAYPEVQKFEALIERARSGMSYVEAVEMEARGPLKPVVFPRIRGISTVASESTRFPALIEEWARKKRIDNPQTKQKVEGHFKSLADFLGHDNGANVTSHDIVRFEKHLETTPDPRTGKLRHPN
Ga0210408_1094406713300021178SoilMARPASNLHLQPKNGGWRARVLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYRKAVEVEAEGPLNPLVFPTITGCTTIGTATTFTALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVT
Ga0210387_1101962313300021405SoilLVPVELQTILGKKLFHTPVWRVTKAEAAALAYPEVRKFEAQIEQARSGGGCYAVEVEAHGIPDHLVFPSIRGIGSTGSETSFTALIAEWARKKRINNPQTRQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLEATPEP
Ga0210386_1011961533300021406SoilMARPASNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKYCSAIEVETQASARPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRR
Ga0210386_1109531313300021406SoilMARPASNLHLQPKSGGWRARVLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYCEAVEVEAEGPLNPLVFPTIAGCTTIETATTFMALIEEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRGVFTVA
Ga0210384_1148413213300021432SoilMARPASNLHLQPKNGGWRARVLVPVELQGILGQKLFYTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGSCEAVEVEAEGPPKPLVFPRIAGCTTIETATTFTALIEEWVRKKHIDNPQTKQKVETHFNR
Ga0210384_1187689313300021432SoilAGLNRNRLSETPKFAPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGSYCEAVEVEAEGPFKPLVFPSIRGISTVATATTFTALIEEWARKKRIDNPQTKKKNAGHFKSLVDFLGHDEAKRVTSRDIVAFEKHLSTTPDPRTGKLRHPNTILSYLSSFTGVLTVAVQS
Ga0210392_1062026813300021475SoilMARPASNLHLQPKNGGWRARILVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKYCSAIEVETQASPRPLVPTFHVRGTRTDPSKATFSALIVEWARKKRITNPQSKG
Ga0210402_1065237323300021478SoilMARPASNLHLQPKNGGWRARVLVPVELQGILGKKLFHTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGGCFCEAVEVEVQGIPDHLAFPSIRGISTTGTETSFTALIAEWARKKRINN
Ga0210409_1090509513300021559SoilLVPVELQTILGKKLFHTPVWRVTKAEAAALAYPEVRKFEAQIEQARSGGGCYAVEVEAHGIPDHLVFPSIRGIGSTGSETSFTALIAEWARKKRINNPQTRQQRETHFKSLADFLGHDDGAKVTSQNIVA
Ga0126371_1253926523300021560Tropical Forest SoilPLINGGREMARPASNLHLQPKNGGWRARVLVPVDLQGILGKKLFQTPVWRVTKSEAAVLAYPEVRKFEALIERAKSGESCCKTIEVEVQGPLKPLVFPSVRGVVSADSETSFTALIAE
Ga0207693_1009264213300025915Corn, Switchgrass And Miscanthus RhizosphereWGQNRGQYAPLLNGGREMARPASNLHLQPKNGDWRARVLVPVELQAKLGKKLFHTPVWRVSKSEAAVLAYPEVQKFESLIEQARSGRGYCEAVEIEAEGLLKPLAFPSVRGLVTADTETSFTPRFALPALS
Ga0207693_1102416813300025915Corn, Switchgrass And Miscanthus RhizosphereLVPVELQAKLGKKLFHTPVWRVSKSEAAVLAYPEVQKFEALIDRARSGGSYCEAVEAEAQAPPRPLVPSFQLRGIHNDASETTFTALIAEWARKKRIDNPQTKQQRETHFRSLADFLGHDDGAKVTSQNVVAFEKMLETTPDPRTGKLRHPNTVLG
Ga0207663_1156158813300025916Corn, Switchgrass And Miscanthus RhizosphereEAAVLAYPEVQKFEAQIEQARSGEGYCKAVEVEAEGPLNPLVFPSIRGISTVATATNFTALIEEWARKKRIDNPQTKKKNAGHFKSLADFLGHDEAKRVTSRDIVAFEKHLSTTPDPRTGKLHHPNTILSYLSSFTGVLTVAVQSFRCCQVNLSRSEKPEAGCAITLPPLHSAIRSS
Ga0207700_1095209923300025928Corn, Switchgrass And Miscanthus RhizosphereMARPPRSFHLQPKSGGFRARVLVPDELQGKIGKKVLCTPVWQVSEFEAAKLAWPEVQKFEAMIENARTGKFVPVVEREAPGPLQPLVPTFKTLGIRNDASETTFTKLIAEWARKKRIDNPRTRRRAETHFEALAEFLGHDNGAEVTSRDIVRFEKHLETTPDPRTGKLRHPNTILSYT
Ga0207948_101116923300027174Forest SoilMARPASNLHLQPKNGGWRARVLVPVELQGILGKKLLHTPVWRVTKSEAAVLAYPEVQKFEALIENARSGKCCSAIEVETQASLQPLVPSFQLRGVRNDARETTFLKLIEEWARKKRINNPQTRGRAVTHFQRLPSFLDMVTEPT
Ga0209329_114537213300027605Forest SoilLVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVETQASLQPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGRAVTHFEALADFLGHDD
Ga0209693_1039232613300027855SoilMARPPRSFHLQPKSGGFRARVLVPDELQGKIGKKVLCTPVWQVSEFEAAKLAWPEVQKFEAMIENARTGKFVPVVEREAPGPLQPLVPTFKTHGIRNDASETTFTKLIAEWARKKRIDNPRTRRRAETHFEALAEFLGHDNGAEVTSRDIVRFEKHLETTPDPRTGKLRHPNTILSYLS
Ga0209488_1120113313300027903Vadose Zone SoilPVELQASLGKKLFSTPVWRVTKSEAAVLAYPEVQKFEAQIEQARSGGGCYAVEVEAQGILDHLAFPSIRGISSTGTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFVGHDDGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRGVFTIAVQQILI
Ga0222749_1031639213300029636SoilMARPASNLHLQPKNGGWRARVLVPVELQGILGKKLFHTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGGYCEAVEVEAEGPLNPLVFPTIAGCTTIGTATTFTALIEEWARKKRIDNPQTRQQRETHFKSLADFLGNDDGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVLGYLSSFRGVFTIAVQQFL
Ga0170823_1763492413300031128Forest SoilMARPASNLHLQPKNGGWRARVLVPVELQATLGKKLFSTPVWRVTKSEAAVLAYPEVQKFEAQIEQARSGGGCCEAVEVEAEGPLRPLVFPTIAGCTTIETATTFTALIEEWARKKRIDNPQTKQKVETHFKSLADFLGHDAGAKVTSQNIVAFEKMLETTPDPRTGKLRHPN
Ga0170824_11786419713300031231Forest SoilLVPVELQAILGKKLFRTPVWRVTKAEAAALAYPEVRKFEAQIEQARSGGGCYAVEVEAQGIPDHLAFPSIRGIGSTGTETTFTALTAEWARKKRIDNPQTKQQRGTHFKSLADFLGHDDG
Ga0170820_1742728543300031446Forest SoilMARPASNLHLQPKNGGWRARVLVPIELQGILGKKLFHTPVWRVSKSEAAVLAYPEVQKFESLIEQARSGRGYCEAVEVDAEGPLKPLVFPTIAGCTTIETATTFIRGMGEEKSD
Ga0170819_1513233913300031469Forest SoilMARPASNLHLQPKNGGWRARVLVPVELQATLGKKLFRTPVWRVTKSEAAVLAYPEVQKFEAQIEQARSGGGCYAADVEAQGPLNHLAFPLIRGIGTVAAATTFTALIEEWARKKRIDNPQTKKKNETHFNSLADF
Ga0170818_10268574613300031474Forest SoilMARPASNLHLQPKNGGWRARVLVPVELQATLGKKLFRTPVWRVTKSEAAVLAYPEVQKFEAQIEQARSGGGCYAADVEAQGPLNHLAFPSIRGIGTVATATTFTALIEEWARKKRIDNPQTKKKNETHFNSLADF
Ga0170818_10534574113300031474Forest SoilPPEIQPFCHFVRCIWGQNRGQYAPLLNGGREMARPASNLHLQPKNGGWRARVLVPVELQAILGKKLFHTPVWRVTKAEAAALAYPEVRKFEAQIEQARSGGGCYAVEVEAQGIPDHLAFPSIRGIGSTGTETTFTALTAEWARKKRIDNPQTKQQRGTHFKSLADFLGHDDGSKVTSQNI
Ga0318541_1033911713300031545SoilLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVRKFEALIERAKSEESYCVTVEVEMQGSFKPVIFPRVRGVVNLDTGTSFTALIAEWARKKRIDNRQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKMLETTPD
Ga0310915_1025666713300031573SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPHRQAAPP
Ga0310915_1123438113300031573SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFHTPVWRVSKSEAAMLAYPEVQKFEALIGQAKSGGYFCKAVEVEVQGSLKPLVFPSVRGVVSANTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSKNIVAFEMMLETTP
Ga0318561_1014235623300031679SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPRTGKLRHPNTVLGY
Ga0318560_1062051213300031682SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGESYCVTVEVEMQGSFKPVIFPRVRGVVNLDTGTSFTVLIAKWARKK
Ga0318501_1015431423300031736SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPHRQAAPPEYGPWLSVELQGRLHRRRGAIPDRHESHG
Ga0307477_1025442713300031753Hardwood Forest SoilARPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFHTPVWRVAKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQGPILPQPAFAARGVSTVRTKTETSFTALIAEWARKKRIDNPQTKQQRETHFRSLADFLGHDDGAKVTSQNVVAFETMLETTPDPRTGKLRHPNTVIGYLSSFRASSQSRCNNS
Ga0307475_1139756013300031754Hardwood Forest SoilLVPVELQSKLGKKLFHTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGGSYCETFEVEAEGPLKPLVFPSVRGVVTANSETSFTALIAEWARKKRINNPQTRQQRETHFKSLADFLGHDDGAKVTSQ
Ga0307475_1145277513300031754Hardwood Forest SoilMPRPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFHTPVWRVAKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQGPILPQPAFAARGVSTVRTKTETSFTALIAEWAR
Ga0318509_1046263813300031768SoilLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVQKFEALIDRARSGGSYCKVVEMEAEGPLKRRVFLSVRGVVNADTETSFTALIAEWARKKRIDNPQTKQQRETHFK
Ga0318548_1055496913300031793SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFHTPVWRVSKSEAAMLAYPEVQKFEALIGQAKSGGYFCKAVEVEVQGSLKPLVFPSVRGVVSANTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFERMLETTPDPRTGKLRHPN
Ga0318568_1042882723300031819SoilLLNGGREMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPHRQAAPPEYGPWLSVELQGRLHRRGATNSDRHEPHG
Ga0318567_1058257013300031821SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLQPHPTRAPASCATRIRSLVICRASGASSPSRCNKF
Ga0310917_1001268893300031833SoilMPRPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVQKFEALIDRARSGGSYCKVVEMEAEGPLKRRVFLSVRGVVNADTETSFTALIAEWARKKRIDNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLAT
Ga0310917_1114529313300031833SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPELRKFEALIERAKSGESYCVTVEVEMEGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTPQNIVKFEKHLATTPDP
Ga0306919_1001085383300031879SoilMARPASNLHLQPKNGGWRARVLVPVELQARLGKKLFHTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGEGYCEAVEVEADGPFKPLVFPSVRGVVTAGTENSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTPQNIVKFE
Ga0306919_1023303713300031879SoilNPVILPFRPLYLGTKPGTIRPLLNGGREMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPHRQAAPPEYGPWLSVELQGRLHRRRGAIPDRHESHG
Ga0306919_1096790323300031879SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVRKFEALIERAKSEESYCVTVEVEMQGSFKPVIFPRVRGVVNLDTGTSFTALIAEWARKKRIDNRQTKQ
Ga0306925_1063104613300031890SoilMARPASNLHLQPKNGGWRARVLVPVELQARLGKKLFHTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGEGYCEAVEVEADGPFKPLVFPSVRGVVTAGTENSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAF
Ga0306925_1142687023300031890SoilMGRPASNLHLQPKNGGWRARVLVPVELQGILGKKLFQTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGEGYCEVVEMEAQRPLKPLVFPSVRGVVTAGTETSFTALIAEWARKKRINNP
Ga0318551_1027121913300031896SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGESYCVTVEVEMEGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTPQNIVK
Ga0318551_1043101623300031896SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFHTPVWRVSKSEAAMLAYPEVQKFEALIGQAKSGGYFCKAVEVEVQGSLKPLVFPSVRGVVRANTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSKNIVAFEKMLETTPDLRTGKLRHPN
Ga0318551_1081605513300031896SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPHRQA
Ga0318520_1022322833300031897SoilMPRPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVRKFEALIERAKSEESYCVTVEVEMQGSFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNP
Ga0310916_1013326443300031942SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFHTPVWRVSKSEAAMLAYPEVQKFEALIGQAKSGGYFCKAVEVEVQGSLKPLVFPSVRGVVSANTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSKNIVAFEKMLE
Ga0306926_1224579113300031954SoilPLYLGTKQETIPPLSNGGREMGRPASNLHLQPKNGGWRARILVPVELQGMIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQGMLLPQPAFAVRGVVTAGTETSFTALIAEWARKKRINNPHTKQQRETHFRSLADFLGHDDGAKVTSQNIVAFEKMLETTPDPRTGKLRHPNTVIGYLSSFR
Ga0318507_1015472333300032025SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVQKFEALIDRARSGGSYCKVVEMEAEGPLKRRVFLSVRGVVNADTETSFTALIAEWARKKRIDNP
Ga0318575_1056197013300032055SoilLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVQKFEALIDRARSGGSYCKVVEMEAEGPLKRRVFLSVRGVVNADTETSFTALIAEWARKKRIDNPQTKQQRETHFKSLADFLG
Ga0318505_1060708413300032060SoilLVPVELQGILGKKLFQTPVWRVTKSEAAVLAYPEVQKFEALIDRARSGGSYCKVVEMEAEGPLKRRVFLSVRGVVNADTETSFTALIAEWARKKRIDNPQTKQQRETHFK
Ga0318513_1050496413300032065SoilMARPASNLHLQPKNGGWRARVLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVRKFEALIERAKSEESYCVTVEVEMQGSFKPVIFPRVRGVVNLDTGTSFTVLIAKWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTPQNIVKFEKHLSTTPDPRT
Ga0306924_1110373413300032076SoilMGRPASNLHLQPKNGGWRARVLVPVELQAKLGKKLFYTPVWRVSKSEAAVLAYPEVQKFEALIENARSGKCCSAIEVEAQGMLLPQPAFAVRGVVTAGTETSFTALIAEWARKKRINNPHTKQQRETHFRSLADFLGHDDGAK
Ga0318540_1039666613300032094SoilLVPVELQAKLGQKIFYTPVWRVSKSEAAVLAYPEVQKFEALIDRARSGGSYCKVVEMEAEGPLKRRVFLSVRGVVNADTETSFTALIAEWARKKRIDNPQTKQQRET
Ga0318540_1054870113300032094SoilMARPASNLHLQPKNGGWRARVLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQNIVAFEKHLATTPDPRTGKLRHPNTVLG
Ga0307472_10043026023300032205Hardwood Forest SoilLVPVELQGRIGKKVFSTPVWKVTKSEAAALAWPEVQKFEALIENARSGKCCSAIEVEAQAPLRPLVPSFQLRGVRNDARETTFTKLIEEWARKKRINNPQTRGRAETHFQRLAEFLGHDNGADVTSHDIVRFEKHLETTPDPRTGKLRHPNTVLGYLSSFR
Ga0310914_1075585823300033289SoilMARPASNLHLQPKNGGWRARVLVPVELQARLGKKLFHTPVWRVSKSEAAVLAYPEVRKFEALIERAKSGEGYCEAVEVEADGPFKPLVFPSVRGVVTAGTENSFTALIAEWARKKRINNPQTKQQR
Ga0318519_1052848013300033290SoilLVPVELQSKVGQKVFYTPVWRVSKSEAAALAYPEVQKFEALIERAKSGDSYCVGVEVEMQGPFKPVIFPRVRGVVSADTETSFTALIAEWARKKRINNPQTKQQRETHFKSLADFLGHDDGAKVTSQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.