NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F094254


Overview

Basic Information
Family ID F094254
Family Type Metagenome / Metatranscriptome
Number of Sequences 106
Average Sequence Length 87 residues
Representative Sequence MTTFLQPDDAVDFAESGRRRGQDRVACFLHLRDLVYLHPEAASASTIAFVQTLEAAVIAEKTTREKRLADARERARQKA
Number of Associated Samples 71
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.94 %
% of genes from short scaffolds (< 2000 bps) 0.94 %
Associated GOLD sequencing projects 61
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.54


Most Common Taxonomy
Group Unclassified (98.113 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (36.792 % of family members)
Environment Ontology (ENVO) Unclassified (38.679 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (68.868 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 64.49%, β-sheet: 0.00%, Coil/Unstructured: 35.51%

Predicted 3D Structure

pTM-score: 0.54

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
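The rule stated in the note above can be sketched in a few lines of Python. This is a minimal illustration only: the 0.7 cutoff is taken from the note, while the function and constant names are hypothetical and are not part of NMPFamsDB.

```python
# Hypothetical helper: the 0.7 cutoff is the one quoted in the note above;
# the names used here are illustrative and not NMPFamsDB code.
LOW_CONFIDENCE_PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float,
                      cutoff: float = LOW_CONFIDENCE_PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# Family F094254 reports a pTM-score of 0.54.
print(is_low_confidence(0.54))
```

With the reported pTM-score of 0.54 the check returns True, which matches the "Low Quality Model" label on this family.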



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 106 Family Scaffolds
PF13676 | TIR_2 | 3.77
PF02796 | HTH_7 | 1.89
PF01738 | DLH | 1.89
PF00572 | Ribosomal_L13 | 0.94
PF13817 | DDE_Tnp_IS66_C | 0.94
PF13517 | FG-GAP_3 | 0.94
PF13683 | rve_3 | 0.94
PF00440 | TetR_N | 0.94
PF01710 | HTH_Tnp_IS630 | 0.94
PF13403 | Hint_2 | 0.94
PF02518 | HATPase_c | 0.94
PF05598 | DUF772 | 0.94
PF02371 | Transposase_20 | 0.94
PF07883 | Cupin_2 | 0.94
PF00313 | CSD | 0.94
PF13586 | DDE_Tnp_1_2 | 0.94
PF13191 | AAA_16 | 0.94
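
The frequencies listed above are percentages of the 106 family scaffolds, so each value should correspond to a whole number of scaffolds. The short Python sketch below recovers those counts by rounding. It assumes the page rounds percentages to two decimals; the function name is hypothetical.

```python
# Total number of scaffolds for family F094254, as stated on the page.
FAMILY_SCAFFOLDS = 106


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count."""
    return round(percent / 100 * total)


# The three distinct frequency values that appear in the Pfam table.
for pct in (3.77, 1.89, 0.94):
    print(f"{pct:.2f} % -> {scaffold_count(pct)} scaffold(s)")
```

Under that assumption, 3.77 % corresponds to 4 scaffolds, 1.89 % to 2, and 0.94 % to a single scaffold.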

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 106 Family Scaffolds
COG0102 | Ribosomal protein L13 | Translation, ribosomal structure and biogenesis [J] | 0.94
COG3415 | CRISPR-associated protein Csa3, CARF domain | Defense mechanisms [V] | 0.94
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.94



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 98.11 %
All Organisms | root | All Organisms | 1.89 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300001546|JGI12659J15293_10002614 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 5512
3300002245|JGIcombinedJ26739_101090421 | Not Available | 685
3300027895|Ga0209624_10002976 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 12285




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 36.79%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 14.15%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 10.38%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.49%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.60%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.66%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 5.66%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 4.72%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.89%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 1.89%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.94%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.94%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 0.94%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.94%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2170459023 | Grass soil microbial communities from Rothamsted Park, UK - FA3 (control condition) | Environmental
3300001471 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 | Environmental
3300001546 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005712 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 | Environmental
3300006057 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012 | Environmental
3300006086 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300010379 | Sb_50d combined assembly | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300016387 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300027109 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF008 (SPAdes) | Environmental
3300027619 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O3 (SPAdes) | Environmental
3300027884 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes) | Environmental
3300027895 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300030763 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI5 (Metagenome Metatranscriptome) | Environmental
3300030815 | Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSU2 (Metagenome Metatranscriptome) | Environmental
3300030862 | Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome) | Environmental
3300031090 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031640 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031799 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032001 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2) | Environmental
3300032009 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f19 | Environmental
3300032059 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27 | Environmental
3300032160 | Sb_50d combined assembly (MetaSPAdes) | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental
3300033134 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.2 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
FA3_01861210 | 2170459023 | Grass Soil | MTTFLGPDDAANVDLAESRQKRDQGRVACFLHLRDLVYLHREAASASAVAFVQKLEAAVIAEKTARERRLAEAR
JGI12712J15308_101162811 | 3300001471 | Forest Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKVTREKQLAEARERAERAEGRREAD
JGI12659J15293_100021742 | 3300001546 | Forest Soil | MTRFLPSEEAVDLAESGRKRGQDRVACFLHLRDLVYLHPEMASASSISHVKTIEAAVIAEKAAREKQLAAMRERAERRREAERQKAATEQAKAG*
JGI12659J15293_100026142 | 3300001546 | Forest Soil | MTIFLGTDETGHVDLAESRHTRDKDRVACFLHLRDLVYMHTQAASATAIAFVQTLEAAVIAHKAARDRRLAEARARDAGRRQQEANRQKAAPDVPDDTTEDDPWGH*
JGI12659J15293_100088331 | 3300001546 | Forest Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLGDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERGERAEGRRE
JGIcombinedJ26739_1003075483 | 3300002245 | Forest Soil | MTRFLQPDEAAIDAVAESRTKRGRDRVACFLHLRDLVYLHPKAASTNAIAFVQSLEAAVIAQKGTREKRLAEARARAEGWLEAAR*
JGIcombinedJ26739_1010904211 | 3300002245 | Forest Soil | MTTFLGPDEATNVDLADSRHKRDQDRVACFLHLRDLVYLHPAAASASAVSFVQTLEAKVIAEKAAREKRLAEARAREASRRYEADRQ
JGIcombinedJ51221_102306541 | 3300003505 | Forest Soil | MTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTXAIAFVQSLEAAVIAQKGTREKRLAXARATS*
Ga0062389_1025526711 | 3300004092 | Bog Forest Soil | MTTFLGPDEATNVDLADSRHRRDQDRVACFLHLRDLVYLHPEAASASAVSFVQTLEAKVIAEKAAREKRLAEVRAREASRRSTVRGGSTESGGRASEGGARED*
Ga0070710_113272562 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTFFDPDEATDVDFAETKRGQSRLACFLHLRDLCYLHPEAAPAGAIAHVIRLEAAVVAEKVAREKRLANVRERARQKAAVEQAKAESQLRK*
Ga0070711_1005344731 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTFLQPDDAVDFAESGRRRGQDRVACFLHLRDLVYLHPEAASASTIAFVQTLEAAVIAEKTTREKRLADARERARQKA
Ga0070711_1007581502 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTVLQPDEAGIESRTKRGQNKRAQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKAAREKRLVEARGRADAHG*
Ga0070762_101816884 | 3300005602 | Soil | MTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTNAIAFVQSLEAAVIAQKGTREKRLAEARATS*
Ga0070763_103884461 | 3300005610 | Soil | MTTILQLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHREAASANTIAFVRMLEAAVIAEKTARDKRLADVRERARREADRQKEA*
Ga0070764_110979781 | 3300005712 | Soil | MTTILQLDEAGNVDFAEPRQKRGRDRVACFLHLRDLVYLHREAASANTIAFVRTLEAAVIAEKAAREKRLADARERARREADRQKEA*
Ga0075026_1008801491 | 3300006057 | Watersheds | MTTFLQTDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERAERAE
Ga0075019_106559892 | 3300006086 | Watersheds | MTTFLQPDEAGIESRTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARKREAERRQEADLQ
Ga0070716_1017834562 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | DFAETKRGQSRLACFLHLRDLCYLHPEAAPAGAIAQVIRLEAAVVAEKVAREKRLANVRERARQKAAVEQAKAESQLRK*
Ga0070712_1009269132 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTFLQPDDAVDFAESGRRRGQDRVACFLHLRDLVYLHPEAASASTIAFVQTLEAAVTAEKAAREKRLAAGRERAERRREADL*
Ga0070765_1002179002 | 3300006176 | Soil | MTRFPPSGEAVDFAESGRKRGQDRVACFLHLRDLIYLHPEAASASAVSFVQTFEAKVIAEKAAREKRLADVRERARW*
Ga0136449_1000245723 | 3300010379 | Peatlands Soil | MTRFLQPDVAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTDAIAFVQSLEAAVIAQKGTREKRLAEARARTEGWLEAAR*
Ga0137413_109810502 | 3300012924 | Vadose Zone Soil | MTTFLQPDEATNVDFAEPRQKRDRDRVTCFLHLRDLVYLHPEAALASAVAFVQKLEAAVIAEKAARERRPAEARKRQAEQ
Ga0182036_107746421 | 3300016270 | Soil | MTTFLQPDEGINVDFAESRQKRDRDRVACFLHLRDLVYLHREAASVSTITFVQTLEAAVIAEKAAREKRLAAARERDAERRREADRQKAAAERAKVEKEDRASDGG
Ga0182041_117294152 | 3300016294 | Soil | MTTFLQPDEGINVDFAESRQKRDRDRVACFLHLRDLVYLHREAASVSTITFVQTLEAAVIAEKAARERRLAAARERDAERRREADRQKAAAERAETSLGGQK
Ga0182035_110679191 | 3300016341 | Soil | MSLQPDDDTDNVDFVESRHKRDPRVACFLHLRDLVYLHPEAASANAVAFVQTLEAKVIAEKAAREKR
Ga0182034_106500032 | 3300016371 | Soil | MTTFLQPDEGINVDFAESRQKRDRDRVACFLHLRDLVYLHREAASVSTITFVQTLEAAVIAEKAAREKRLAAARERGAERRREAVRKKAAPARAETEKEKT
Ga0182040_113672431 | 3300016387 | Soil | MTTFLQPDEGINVDFAESRQKRDRDRVACFLHLRDLVYLHREAASVSTITFVQTLEAAVIAEKAAREKRLAAARERDATRRREADRQKAAAERAKIEKEDRASDGG
Ga0182039_120417152 | 3300016422 | Soil | MTTFLQPDEGINADFAESRQKDRDRVACFLHLRDLVYMHRDAASVSTVAFVQTLEAAVIAEKAAREKRLAAVRKRDAERRREADRQKAAAERAETSLGGQK
Ga0182038_105311071 | 3300016445 | Soil | MSLQPDDDTDNVDFVESRHKRDPRVACFLHLRDLVYLHPEAASASAVAFVQTLEAKVIAEKAAREKRLAEARNREAERQKAAAERVTPLHVPVPRGTL
Ga0182038_106637012 | 3300016445 | Soil | MTTFLQPDEGINVDFAESRQKRDRDRVACFLHLRDLVYLHREAASVSTITFVQTLEAAVIAEKAAREKRLAAARERDAERRREADRQKAAAECAETSVGGQK
Ga0210407_100041066 | 3300020579 | Soil | MTTFLQPDEATNVDFAEPRQKRDRDRVACFLHLRDLVYLHPKAASADAIAFVQQLEAAVIAEKAAREKRLAEVCKREAERRQEADRQKAAVERAKAESQLRE
Ga0210407_106368431 | 3300020579 | Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERAERAE
Ga0210403_104010461 | 3300020580 | Soil | MTRFLPSEEAVDLAESGRKRGQDRVACFLHLRDLVYLHPEMASASSISHVKTIEAAVIAEKAAREKQ
Ga0210395_102165154 | 3300020582 | Soil | MTRFPPSGEAVDFAESGRKRGQDRVACFLHLRDLIYLHPEAASASAVSFVQTFEAKVIAEKAAREKRLADVRERARW
Ga0210395_102698552 | 3300020582 | Soil | RMTTILQLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHRKAASASTIAFVQTLEAAVIAEKAAREKRLADARERARREADRQKGA
Ga0210395_103663112 | 3300020582 | Soil | MTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTNAIAFVQSLEAAVIAQKGTREKRLAEARATS
Ga0210395_106588872 | 3300020582 | Soil | SIRAAGRIMTTPFDPDEAIEVDFTKSRTKRDRDRIACFLHLRDLVYLHPEAASADAIALVQTLEAKVIAEKAAREKRLAEGRERKAAWREVDRVKH
Ga0210401_102390124 | 3300020583 | Soil | MTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTDAIAFVQSLEAAVIAQKGTREKRLAEARATS
Ga0210401_102921722 | 3300020583 | Soil | MTTILQLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHREAASANTIAFVRTLEAAVIAEKAAREKRLADARERARREADRQKEA
Ga0210406_103787872 | 3300021168 | Soil | MTTFLQPDDTGIESRQKLDQDRVACFLHLRDLVYLHPQAAPAAAISHVKTLEAAVIAQKAAREKRLAEVRKREAERRQEADRQKAAAEQPP
Ga0210406_106970162 | 3300021168 | Soil | MTRFPPSGEAVDFAESGRKRGQDRVACFLHLRDLIYLHPEAASASAVSFVQTFEAKVIAEKAAREKRLADVRERARQKAAAEQAKADDGDGKLRRPAGAAREQGCIP
Ga0210400_103375073 | 3300021170 | Soil | MTTFLDPDDATNVDFAESRQNRGQDRVACFLHLRDLVYLHPEMASASSISHVKTIEAAVIAEK
Ga0210400_114493401 | 3300021170 | Soil | MTTILQLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHREAASANTIAFVRTLEAAVIAEKAAREKRLADVRE
Ga0210405_101144661 | 3300021171 | Soil | MTTILQLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHREVASANTIAFVRMLEAAVIAEKTAREKRLADARERARREADRQKEA
Ga0210405_101790183 | 3300021171 | Soil | MRAAGQMTTFLQPDEATNVDFAEPRQKRDRDRVACFLHLRDLVYLHPKAASADAIAFVQQLEAAVIAEKAAREKRLAEVCKREAERRQEADRQKAAVERAKAESQLRE
Ga0210405_102243183 | 3300021171 | Soil | MTTFLQPDEAGIESRTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIVFVQSLEAAVIAEKATREKQLAEARERTERAEGRRE
Ga0210405_107073172 | 3300021171 | Soil | MTTSFDPDEAIEVDFTKSRTKRDRDRIACFLHLRDLVYLHPEAASADAIALVQTLEAKVIAEKAAREKRLAEGRERKAAWREVDRVKH
Ga0210405_110990892 | 3300021171 | Soil | TRFPPSGEAVDFAESGRKRGQDRVACFLHLRDLIYLHPEAASASAVSFVQTFEAKVIAEKAAREKRLADVRERARW
Ga0210396_113463551 | 3300021180 | Soil | MTTPFDPDEAIEVDFTKSRTKRDRDRIACFLHLRDLVYLHPEAASADAIALVQTLEAKVIAEKAAREKRLAEGRERKAAWREVDRVKH
Ga0210388_109252462 | 3300021181 | Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFLHLRDLVYLHPGAASASAVVFVQTLEAKVIAEKTTRDVELCGKLGDGVRKAA
Ga0210393_100434393 | 3300021401 | Soil | MTTILQLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHRKAASASTIAFVQTLEAAVIAEKAAREKRLADARERARREADRQKGA
Ga0210393_102911562 | 3300021401 | Soil | MTTFLQPDEAGIDVDFAESRTKRGRDRVACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAQKGTREKRLAEARATS
Ga0210393_107725471 | 3300021401 | Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFLHLRDLVYLHPGAASASAVVFVQTLEAKVIAEKATREKRLADARERARREADRQKAAAEQARAESQLRE
Ga0210397_106427941 | 3300021403 | Soil | MTTFLGPAEATNVDFAEPRHKRNQAHVACFLHLRDLVYLHPEAASASVVSFVQMFEAKVIAEKAAREKRLAEVRAREASRREADRQ
Ga0210397_106674533 | 3300021403 | Soil | GEAVDFAESGRKRGQDRVACFLHLRDLIYLHPEAASASAVSFVQTFEAKVIAEKAAREKRLADVRERARW
Ga0210387_105336062 | 3300021405 | Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERTERAEGRREADRVKAAAE
Ga0210387_109895491 | 3300021405 | Soil | MTTFLGPDEATNVDFAEPRHKRNQAHVACFLHLRDLVYLHPEAAPASVVSFVQMFEAKVIAEKAAREKRLAEVRAREASRREADRQKATVEQ
Ga0210387_114731532 | 3300021405 | Soil | MTTILWPDETGNVDFAESRHKRDQDRVACFLHLRDLVYLHPEGASASAIAFVQKLEAAVIAEKAAREKRLAEARERDERRREADRQKAAEQAK
Ga0210387_117132441 | 3300021405 | Soil | MTTFLGPDEATNVDLADSRHKRDQDRVACFLHLRDLVYLHPAAASASAVSFVQTLEAKVIAEKAAREKRLAE
Ga0210383_116634142 | 3300021407 | Soil | MMSFQESDEAVNVDESRNQRARARVACFLHLRDLVYLHPEAASASAVSVVRTLEAQVIAEKAAREKGRADARAKGVRFG
Ga0210383_116910831 | 3300021407 | Soil | YMTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTNAIAFVQSLEAAVIAQKGTREKRLAEARATS
Ga0210384_115396732 | 3300021432 | Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERTERAEGRREADRV
Ga0210390_103631911 | 3300021474 | Soil | MTTSFDPDEAIEVDFTKSRTKRDRDRIACFLHLRDLVYLHLEAASADAIALVQTLEAKVIAEKAAREKRLAEGRERKAAWREVDRVKH
Ga0210392_103816691 | 3300021475 | Soil | QLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHREVASANTIAFVRMLEAAVIAEKTAREKRLADARERARREADRQKEA
Ga0210410_101838244 | 3300021479 | Soil | DEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHRKAASASTIAFVQTLEAAVIAEKAAREKRLADARERARREADRQKGA
Ga0207693_102207172 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MSRLTPGVKTLSYLARARDRARQGGQMTTFLQPDDAVDFAESGRRRGQDRVACFLHLRDLVYLHPEAASASTIAFVQTLEAAVTAEKAAREKRLAAGRERAERRREADL
Ga0207693_107045602 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTFLWPDDAINVDVAEFRQKREKDRVACFLHLRDLVYLHPKAASADAIALVQSLEAAVIAEKAAREKRLVEARGRADAHA
Ga0207663_102473272 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTFLQPDDAVDFAESGRRRGQDRVACFLHLRDLVYLHPEAASASTIAFVQTLEAAMIAEKAACEKRLAAGRERAERRREADLQKSGGRASKGG
Ga0207663_107493711 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MTTFVQPDEATNVDFAESRQRDQDRVACFLHLRDLVYMHTQGASATAIAFVKKLESAVLAEKAAREKRLAEAREREAER
Ga0208603_10664281 | 3300027109 | Forest Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERA
Ga0209330_11207801 | 3300027619 | Forest Soil | MTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPGAASASAVAFVQTLEAKVIAEKTTREKRLADMRERAERQK
Ga0209275_105771432 | 3300027884 | Soil | MTRFLEPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTNAIAFVQSLEAAVIAQKGTREKRLAEARATS
Ga0209624_100029765 | 3300027895 | Forest Soil | MTIFLGTDETGHVDLAESRHTRDKDRVACFLHLRDLVYMHTQAASATAIAFVQTLEAAVIAHKAARDRRLAEARARDAGRRQQEANRQKAAPDVPDDTTEDDPWGH
Ga0209624_100092152 | 3300027895 | Forest Soil | MTRFLPSEEAVDLAESGRKRGQDRVACFLHLRDLVYLHPEMASASSISHVKTIEAAVIAEKAAREKQLAAMRERAERRREAERQKAATEQAKAG
Ga0209006_104599202 | 3300027908 | Forest Soil | MTRFLPSDEAVDFAESGRKRGQDRVACFLHLRDLVYLHPEAASASAVSFVQTLEAKVIAEKAAREKRLA
Ga0209006_108341791 | 3300027908 | Forest Soil | YWREGHEKRAPQGGRMTTILQLDEAGNVDFAEPRQKRDRDRVACFLHLRDLVYLHRKAASASTIAFVQTLEAAVIAEKAAREKRLADARERARREADRQKGA
Ga0209006_108608241 | 3300027908 | Forest Soil | MTRFLPSEQAVDFAESERKRGQDRVACFLHLRDLCYLHPEAAPAVAIAHMKTLEATVIAKKATREKQLADWR
Ga0209006_112367031 | 3300027908 | Forest Soil | MTTFLGPDEATNVDFAEPRHKRNQAHVACFLHLRDLVYLHPEAASASVVSFVQMFEAKVIAEKAAREKRLAEVRAREASRREGDRQKATVEQATAGNGDGYRT
Ga0308309_117615041 | 3300028906 | Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKVTREKQLAEARERAERAEGRR
Ga0265763_10127611 | 3300030763 | Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFPHLRDLVYLHPGAASASAVAFVQTLEAKVIAEKTTREKRLADMRERAERQNAADRQKAAAETREGGELA
Ga0265746_10312392 | 3300030815 | Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFPHLRDLVYLHPGAASASAVAFVQTLEAKVIAEK
Ga0265753_10474631 | 3300030862 | Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFLHLRDLVYLHPGAASASAVVFVQTLEAKVIAEKATREKRLADARERARREADRQKAAAEQAKAESQLTG
Ga0265753_11254781 | 3300030862 | Soil | MTTFLQPDEAGIESGTRRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERAEGPREADRVKAAAEK
Ga0265760_103070882 | 3300031090 | Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFLHLRDLVYLHPGAASASAVVFVQTLEAKVIAEKATREKRLADARERARREADRQKAAAEQAKAESQL
Ga0170824_1059764071 | 3300031231 | Forest Soil | MTTFLQPDEAGIESGTKRGQNKRGQNRAACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAEKATREKQLAEARERTER
Ga0170824_1073836771 | 3300031231 | Forest Soil | MTTFLGPDDAANVDLAESRQKRDQGRVACFLHLRDLVYLHREAASASAVAFVQKLEAAVIAEKTARERRLAEARKREADRQ
Ga0170824_1080054501 | 3300031231 | Forest Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFLHLRDLVYLHPGAASASAVVFVQTLEAKVIAEKATREKRLADARERARREADRQKAAAEQAKAESQLRE
Ga0170824_1199460874 | 3300031231 | Forest Soil | MTTILWPDEATNVDFAESRRKRDQDRVACFLHLRDLVYMHREAASASIIAFVQTLEAAVIAQKAAREKRLADVREREASRRYEADRQKAAAEQAKAGNGDG
Ga0170818_1004420062 | 3300031474 | Forest Soil | MTRFPPSEEAVDFAESGRKRGQDRVACFLHLRDLVYLHPKAASTDAIAFVQSLEAAVIAQKGTREKRLAEARTG
Ga0318555_104598092 | 3300031640 | Soil | DDADNVDFVESRHKRDHRVSCFLHLRDLVYLHPEAASASAVAFVQTLEAKVISEKAAREKRLAEVRKREAERQKAAAERAKADDGDG
Ga0310686_1045932074 | 3300031708 | Soil | MTTILGTDETGNVDLAESGRKRSQDRVACFLHLRDLVYLNPGAASASAVVFVQTLEAKVIAEKATREKRLADARERARREADRQKAAAEQAKAESQLREQYL
Ga0310686_1071268142 | 3300031708 | Soil | MTRFLQPDEAGIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTDAIAFVQSLEAAVIAQKGAREKRLAEARARAEGWL
Ga0307476_101053253 | 3300031715 | Hardwood Forest Soil | MTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTDAIAFVQSLEAAVIAQKGTREKRLAEARARAEGWLEAAR
Ga0307474_101626621 | 3300031718 | Hardwood Forest Soil | MTTFLGPDEATNVDLAESRHTRDQGRVACFLHLRDLVYLHPEAASAGAVSHVKILEAEVIAKKAAREKRLADARERARQKAAAECAKTDAGDG
Ga0307475_103245211 | 3300031754 | Hardwood Forest Soil | MTTFLQPDEATNVDFAESRHTSDKDRVACFLHLRDLVYMHTQAASATAIAFVKKLEAAVLAQKAAREKRLGPVFS
Ga0318565_105625751 | 3300031799 | Soil | MTTFLQPDEGINVDFAESRQKRDRDRVACFLHLRDLVYLHREAASVSTITFVQTLEAAVIAEKAAREKRLAVARERDAERRREADRQKAAAERAKVEKE
Ga0307478_112441841 | 3300031823 | Hardwood Forest Soil | QGQMTTFLQPDEATNVDFAEPRQKRDRDRVACFLHLRDLVYLHPKAASADAIAFVQKLEAAVIAEKAAREKRLAEVRKREAERRQEADRQKAAVERAKAESQLRE
Ga0307478_116766101 | 3300031823 | Hardwood Forest Soil | MTRFLQPDEAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTDAIAFVQSLEAAVIAQKGTREKRLVALHSDYDSLTGVG
Ga0306921_114009741 | 3300031912 | Soil | MTMFLQPDDDADNVDFVESRHKRDSRVACFLHLRDLVYLHPQAASASAVAFVQTLEAKVIAEKAAREKRLAEARKREAERQKAAAEPVTPLHVPVPFQSGGTL
Ga0307479_120494991 | 3300031962 | Hardwood Forest Soil | GGQMTTFLQPDDAVDFAESGRRRGQDRVACFLHLRDLVYLHPKAASADAIAFVQSLEAAVIAQKGTREKRLAEARARAEGWLEAAR
Ga0306922_117903411 | 3300032001 | Soil | MMTLEPDEAINVDFAEPRQKRDKDRVACFLHLRDLVYLHPEAASANAVAFVQTLEAKVIAEKAAREK
Ga0318563_102460021 | 3300032009 | Soil | MTMFLQPDDDADNVDFVESRHKRDSRVGCFLHLRDLVYLHPEAASASAVAFVQTLEAKVISEKAAREKRLAEVR
Ga0318533_107995202 | 3300032059 | Soil | MTMFLQPDDDADNVDFVESRPKRDSRVACFLHLRDLVYLHPQAASASAVAFVQTLEAKVIAEKAAREKRLAEARKREAERQKAAAEPVTPLHVPVPFQSGGTL
Ga0311301_100327453 | 3300032160 | Peatlands Soil | MTRFLQPDVAAIDAAAESRTKRGRDRVACFLHLRDLVYLHPKAASTDAIAFVQSLEAAVIAQKGTREKRLAEARARTEGWLEAAR
Ga0306920_1040542992 | 3300032261 | Soil | MTTFLAPDEAIDFADSRRKRGQDQRVACFLHLRDLVYLHPEAAPASAVSFVRTIEARVIAAKFARERKD
Ga0335073_100866064 | 3300033134 | Soil | MTTFLQPDEANNVDFAEHRQKRDRDRAACFLHLRDLVYLHPGAASASAIAFVQKLEVAVIAERAAREKRLAEARKREAERRARSGSTESGGQTGRRLG


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice