NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F082943

Metagenome Family F082943

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F082943
Family Type Metagenome
Number of Sequences 113
Average Sequence Length 55 residues
Representative Sequence MNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDT
Number of Associated Samples 85
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 2.65 %
% of genes from short scaffolds (< 2000 bps) 1.77 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score0.31

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (97.345 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(41.593 % of family members)
Environment Ontology (ENVO) Unclassified
(37.168 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(48.673 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 61.73%    β-sheet: 0.00%    Coil/Unstructured: 38.27%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.31
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 113 Family Scaffolds
PF01066CDP-OH_P_transf 3.54
PF02518HATPase_c 3.54
PF01638HxlR 3.54
PF08818DUF1801 1.77
PF11154DUF2934 1.77
PF04397LytTR 0.88
PF04185Phosphoesterase 0.88
PF00448SRP54 0.88
PF04366Ysc84 0.88
PF07730HisKA_3 0.88
PF12704MacB_PCD 0.88
PF01553Acyltransferase 0.88
PF00512HisKA 0.88
PF13751DDE_Tnp_1_6 0.88
PF07676PD40 0.88
PF02321OEP 0.88
PF00313CSD 0.88
PF07929PRiA4_ORF3 0.88
PF12779WXXGXW 0.88
PF04191PEMT 0.88
PF12732YtxH 0.88
PF02567PhzC-PhzF 0.88
PF01541GIY-YIG 0.88
PF00072Response_reg 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 113 Family Scaffolds
COG0558Phosphatidylglycerophosphate synthaseLipid transport and metabolism [I] 3.54
COG1183Phosphatidylserine synthaseLipid transport and metabolism [I] 3.54
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 3.54
COG5050sn-1,2-diacylglycerol ethanolamine- and cholinephosphotranferasesLipid transport and metabolism [I] 3.54
COG1538Outer membrane protein TolCCell wall/membrane/envelope biogenesis [M] 1.77
COG4430Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 familyFunction unknown [S] 1.77
COG5646Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis)Posttranslational modification, protein turnover, chaperones [O] 1.77
COG5649Uncharacterized conserved protein, DUF1801 domainFunction unknown [S] 1.77
COG0384Predicted epimerase YddE/YHI9, PhzF superfamilyGeneral function prediction only [R] 0.88
COG2930Lipid-binding SYLF domain, Ysc84/FYVE familyLipid transport and metabolism [I] 0.88
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 0.88
COG3850Signal transduction histidine kinase NarQ, nitrate/nitrite-specificSignal transduction mechanisms [T] 0.88
COG3851Signal transduction histidine kinase UhpB, glucose-6-phosphate specificSignal transduction mechanisms [T] 0.88
COG4564Signal transduction histidine kinaseSignal transduction mechanisms [T] 0.88
COG4585Signal transduction histidine kinase ComPSignal transduction mechanisms [T] 0.88


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A97.35 %
All OrganismsrootAll Organisms2.65 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002245|JGIcombinedJ26739_100508392All Organisms → cellular organisms → Bacteria1082Open in IMG/M
3300002907|JGI25613J43889_10003662All Organisms → cellular organisms → Bacteria → Acidobacteria4160Open in IMG/M
3300012917|Ga0137395_10103720All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1893Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil41.59%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil10.62%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil9.73%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil7.08%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.19%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil6.19%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.65%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.65%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.65%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.65%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.77%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.89%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.89%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.89%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil0.89%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.89%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000580Forest soil microbial communities from Amazon forest - 2010 replicate II A01EnvironmentalOpen in IMG/M
3300000597Forest soil microbial communities from Amazon forest - 2010 replicate II A1EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002909Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cmEnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020140Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026359Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-AEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AF_2010_repII_A01DRAFT_107264123300000580Forest SoilMNKNKVAIVAIGLTALLAFSPAASFAQATTFSGEAVGLKANVVGVSLSLADT
AF_2010_repII_A1DRAFT_1011659113300000597Forest SoilMNKNKVAIVAIGLTALLAFSPAASFAQATTFSGEAVGLKANVVGVSLSLADTGALPSSGGNLSNSLASVNVAGI
JGIcombinedJ26739_10005833513300002245Forest SoilMNANKNSTIVIFLTAILVVSPVASFAQAATTFSGEAVALRRASAVGISLAVSDTGPLPASGGNLKTSVGSV
JGIcombinedJ26739_10050839213300002245Forest SoilMNANKKSTIVIFLTAILVVSPVASFAQAATTFSGEAVALRASAPGISLAVSDTGPLPASGGNLKTSVGSV
JGI25613J43889_1000366253300002907Grasslands SoilMNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDTGPLAA
JGI25388J43891_100431743300002909Grasslands SoilMNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVALRASAVGISLALADTGA
JGI25615J43890_100751523300002910Grasslands SoilMNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDT
JGI25616J43925_1001288253300002917Grasslands SoilMNTKKNSTIAICLTAXLALSPVASFAQATTTFSGEAVALRANALGISLALSDTGPLAA
JGI25616J43925_1025537213300002917Grasslands SoilMNTKKNSMMAACLTAMLAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPASGGNLSTSLASVNVLGLAS
JGI25616J43925_1028655613300002917Grasslands SoilMNTRKNTTIAIFLTAILAFSPLAGFAQAATTFSGEAVALRANALGISASISDTGPLPSS
Ga0066673_1001785043300005175SoilMKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKALGISLDLSDT
Ga0066679_1041959123300005176SoilMQMNAKNTPMIAICLAAVLALGPVASFAQATTTFSGRAVALRA
Ga0070713_10151769713300005436Corn, Switchgrass And Miscanthus RhizosphereLNTKKNATLAICLTALLVFSPLAGYAQATTTFSGEGVALKANALGISLSAADTGALPSSGGNLSTSLASVNVLGL
Ga0066681_1030209013300005451SoilLNAKKNATIALCLTALLAFSPLAGFAQAATTYSGDATALQASAVGISLALSHAG
Ga0070706_10141530113300005467Corn, Switchgrass And Miscanthus RhizosphereMATKKPSTIAICLIGALAFGPVTSFAQATATFSGQAVALRASAVGLALA
Ga0066701_1011723033300005552SoilMKTKKKSTMAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASGGNLST
Ga0066707_1004834513300005556SoilMKTKKKSTMAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGLLAASGGNLSTSLASVNVLGLASADALKSTTSGSG
Ga0066704_1012357553300005557SoilMQMSTKKNSTIAICLTAILALSPVASFAQATATFSGEAVALRANALGISLALSDTGALQSSGGNLSRSLASVNV
Ga0066702_1021472233300005575SoilMNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVAL
Ga0066651_1083693213300006031SoilMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALR
Ga0070765_10212615333300006176SoilMNTKKNSTMAICLTAILVLSPIASFAQATTTFSGEAVALRASVVGISLSLADT
Ga0079222_1004405923300006755Agricultural SoilMNKKRTSMLAICLTALLAFSPLITFAQSATTFSGEAVALKANALGVSLALRCTSPGQVST
Ga0079222_1054287913300006755Agricultural SoilMNTNMHANKNSIIATCLVAILTLGPLRGFAQSATTFSGEAVALRANAVGISLALSDTGALPSSGGSL
Ga0066658_1002576513300006794SoilMNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVALRASAVGISLALADTGALPSSGGSLSTS
Ga0066665_1059420723300006796SoilMNMQKNSTIAICLTAILALSPIASLAQATTTFSGEAVALRA
Ga0066665_1140370323300006796SoilMNTKKNSAITICLVAILTLSPAFSFAQAATTFSGE
Ga0079220_1008915413300006806Agricultural SoilMLAMCLTALLAFSPLITFAQSATTFSGEAVALKANALGVSLALR
Ga0075431_10015592133300006847Populus RhizosphereVHMNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGISL
Ga0075424_10230069813300006904Populus RhizosphereVHMNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGIS
Ga0099791_1019031613300007255Vadose Zone SoilMNAKKNSTIAICLMVLVTFSPVASFAQATITFSGQAVALRVSAVGL
Ga0099791_1051077013300007255Vadose Zone SoilMQMSTKKNSTIAICLTAILALSPVASFAQATTTFNGEAVALRANALGISLALSDTGALQSSGGNLSRSLASVNVLGLASADA
Ga0099794_1065931813300007265Vadose Zone SoilMQMSTKKNSTIAICLTAILALSPVASFAQATATFSGEAVALRAN
Ga0099830_1107791313300009088Vadose Zone SoilMNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRASALGISLALSD
Ga0099828_1031383313300009089Vadose Zone SoilMQMNANKNSTITIAICLIAILVFSPVAGFAQAAITFSGEAVALRASAAGISLAVSDTGPLPA
Ga0075418_1111237023300009100Populus RhizosphereMNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGISLSLSDT*
Ga0126382_1036034913300010047Tropical Forest SoilMNMKNNAAIAVCLTAILAFSPLAGFAQAATTFSGEAVALRANALGISLSLSDAG
Ga0126382_1043694713300010047Tropical Forest SoilMNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGISLSLSDAGPLP
Ga0134082_1005510713300010303Grasslands SoilLEEENVQMKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKAL
Ga0126370_1029229913300010358Tropical Forest SoilMITMKNPSIAILLAAILAFSQFAGLAQATTFSGEAVALKANALGVSLTASDTGPLP
Ga0126378_1145581413300010361Tropical Forest SoilMNTKKNAMIAILLTAILAFSPLAGFAQAATTFSGQAVALKANALGISLDVSDTGALPSSGGNLST
Ga0134066_1006119413300010364Grasslands SoilMITKKNSTMAICLMTILVFSPVATFAQATITFSGEA
Ga0126381_10475293113300010376Tropical Forest SoilMNEKKAIIAICLAAVLVLGLSPGVSLAQGTTTFSGEAVALKASVAGISLDLGDTGALPSSGGNLSTSLAS
Ga0137391_1152136613300011270Vadose Zone SoilMQMNPKKNSTIAICLMAILAFSPVATFAQASTTFSGQAVALRASAVGLALALSDTGPLPA
Ga0137388_1197660613300012189Vadose Zone SoilMNAKKNSTIAICLMVLVAFGPVASFAQAPNTFRDQAVALLVSAVGLGLALSDTGPIPAS
Ga0137364_1007872743300012198Vadose Zone SoilMQMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDL
Ga0137364_1010060633300012198Vadose Zone SoilMNTKKNSTIAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137364_1054659623300012198Vadose Zone SoilMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDL
Ga0137383_1032921623300012199Vadose Zone SoilMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137382_1025029513300012200Vadose Zone SoilMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKA
Ga0137382_1093863923300012200Vadose Zone SoilMNAKNSTIAICLIAILVFSPVASFAQAPVTFSGEAVALPARALGLSLDLSDTGPLAASG
Ga0137363_1073356113300012202Vadose Zone SoilMNTKKNAGIAVFLTALLLYSPLAGFAQTATSFSGEGIALK
Ga0137399_1009160213300012203Vadose Zone SoilMNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANAL
Ga0137399_1020285413300012203Vadose Zone SoilMNAKKNSTIAICLMAILAFGSVTSFAQARTTYSGQAVALRASAT
Ga0137399_1069290313300012203Vadose Zone SoilMNTKKNSTMAICLTAILALSPIASLAQAATTFSGEAVALKANALGISLSLADTEALPSS
Ga0137399_1088800213300012203Vadose Zone SoilMNAKKNSTIAICLMAILAFGPVVSFAQARTTYSGQAVALRASA
Ga0137399_1106529613300012203Vadose Zone SoilMITKKNSTIAICLMTILVFGPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASGGN
Ga0137399_1114905213300012203Vadose Zone SoilMNTRKNSMMAACLTAILAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGAVPARGGNLSTSLATFRRKNACCR*
Ga0137377_1128001813300012211Vadose Zone SoilMNTKKNSAIAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137370_1004717243300012285Vadose Zone SoilMQMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137370_1021439323300012285Vadose Zone SoilMNAKNSTIAICLIAILVFSPVASFAQTTITFSGEAVALRAKALGISLDLSDTGPLAARGGNLSTSLA
Ga0137387_1075435813300012349Vadose Zone SoilMNTRKNSTIAICLTAVLGLSPVASFAQATTTFSGEAVALKANALGISLALSD
Ga0137361_1073257023300012362Vadose Zone SoilMQMNANKNSTITIAICLIAILVFSPVAGFAQAAITFSGEAVALRASAAGISLALSDTGPLPASDGNLKTSVGSVSVLGL
Ga0137361_1103375923300012362Vadose Zone SoilMQMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASGGSLSTSLA
Ga0137398_1002208243300012683Vadose Zone SoilMNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISL
Ga0137398_1021810623300012683Vadose Zone SoilMNMQKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDTGALPSNGGN
Ga0137398_1027254013300012683Vadose Zone SoilMNTQKNSTIAICLTAILALSPIVSFAQATTTFSGEAVALRANALGISLA
Ga0137397_1108471013300012685Vadose Zone SoilMKNKKLSTIAICLIAVLAFGPVTSFAQATTTFSGQAVALRAS
Ga0137395_1006000013300012917Vadose Zone SoilMNMQKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDTGA
Ga0137395_1010372013300012917Vadose Zone SoilMNTKKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISLALSDTGA
Ga0137396_1019427123300012918Vadose Zone SoilMNAKKNSTIAICLMAILAFGSVTSFAQARTTYSGQAVALHASATGLALALSDT
Ga0137396_1024437913300012918Vadose Zone SoilMNTRKNSMMAACLTAILAFSPVASFSQATITFSGEAVALKANALG
Ga0137394_1066709913300012922Vadose Zone SoilMNTKNSTVAIGLVAALAFNPVSTFGQANTFSGQAVALRASAVGIALAL
Ga0137419_1015709833300012925Vadose Zone SoilMNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISLA
Ga0137416_1055600913300012927Vadose Zone SoilMNTKKNSTIAICLTAILALSPVVGFAQATITFSGEAVALRASALGISLALSDTGP
Ga0137410_1085187223300012944Vadose Zone SoilMSAKKNSTIAICLMVLTTFSPVAAFAQATNTFSGQAVALRV
Ga0137410_1140882913300012944Vadose Zone SoilMQMSTKKNSTMAICLTAILALSPVASFAQATTTFSGEAVALRAS
Ga0126375_1052288413300012948Tropical Forest SoilMNTSKNAMIAICLTAILAFAPLAGFAQTATAFSGEAVALRANA
Ga0126369_1182787813300012971Tropical Forest SoilMNTKKNAVIAIFLTAILAFSSLAGFAQTATSFSGEAVALKANALG
Ga0134079_1048617213300014166Grasslands SoilLEEENVQMKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKALGISLDLSDT
Ga0137418_1022808423300015241Vadose Zone SoilMNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALR
Ga0137418_1080779013300015241Vadose Zone SoilMNTRKNSMMAACLTAILAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPARGGNLSTSVASVNVL
Ga0132258_1153884623300015371Arabidopsis RhizosphereMKTHKPSTLAICLMAALTFGPLDVFAQTTTFSGQAVALRASAVGLALALSDTGALPAAGGNLATSLASV
Ga0190272_1011214713300018429SoilMPAKHNSTIAICLTAILVLGPVAASAQTNTFSGQAVALRASVIGVALALSDTGPLPATGGDLKTS
Ga0066669_1234821013300018482Grasslands SoilMNTTKNSTIVICLIAILAFSPVATFAQASATFSGRAVALRASAVGLALALS
Ga0179590_122928423300020140Vadose Zone SoilVHTNTKENSTIAICLMAILVFSPVASFAQATITFSGEAVALRAKALGISLD
Ga0210407_1022915813300020579SoilMNTKKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRASVVGISLSLADTGALPSSGGN
Ga0210401_1099338813300020583SoilMNTKKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRASVVGISLSLADTGALPSS
Ga0210388_1139920913300021181SoilMNTKKNSTMAICLTAILVLSPIASFAQATTTFSGEAVALRASVVGISLSLADTGALPSRGGNLS
Ga0210402_1097380813300021478SoilMNTKKNSTMAICLTAILVLSPIASFAQATTTFSGEAVAL
Ga0207700_1152419413300025928Corn, Switchgrass And Miscanthus RhizosphereLNTKKNATLAICLTALLVFSPLAGYAQATTTFSGEGVALKANALGISLSAADTGAL
Ga0209238_116676123300026301Grasslands SoilMKTKKKSTMAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASG
Ga0209240_102581413300026304Grasslands SoilMNTKKNTTIAIFLTAILAFSPLAGFAQAATTFSGEAVALRVNALGISA
Ga0209240_126932213300026304Grasslands SoilMNTKKNSTVAICLTAILALSPIASFAQATTTFSGEA
Ga0209152_1008668733300026325SoilMNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVALRASAVGISLALADTGALPSSGGSLSTSL
Ga0257163_107999713300026359SoilMNTKKNSMMAACLTAMLAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPASGGNLSTSLASVNVLGL
Ga0257172_111010413300026482SoilMNTKKNSTIAICLTAILALSPVASFAQATTTFSGQAVA
Ga0257158_110788813300026515SoilMNTKKNSMMAACLTAMLAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPASGGNLSTSLAS
Ga0209474_1068276413300026550SoilMKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKALGISL
Ga0209648_1065738123300026551Grasslands SoilMNTKKNSTIAICLTAILALSPVASLAQATTTFSGEAVALRANALGISL
Ga0179587_1041796623300026557Vadose Zone SoilMNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISLALSDTGALPSSGGNLSMSLAN
Ga0209076_107819113300027643Vadose Zone SoilMNTRKNSMMAACLTAILAFSPVASFSQATITFSGEAVALK
Ga0209076_109092413300027643Vadose Zone SoilMNTRKNSMMAACLTAILAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPARGGNLS
Ga0137415_1013921333300028536Vadose Zone SoilMNTKKNSTIALCLTAILAFSPVASFAQATTTFSGEAGALRGNALGISLALSDTGPLASGGNLNTSLASVNVLGLASA
Ga0137415_1038963113300028536Vadose Zone SoilMKMNAKKNSTIAICLTAILAFSPVASFAQATTTFTGEAVTLRASAVGISLALSDTGALLSSGGN
Ga0307476_1031518213300031715Hardwood Forest SoilMNTKKNATIASCLMAVLVFSPVATFAQATTTFTGEAV
Ga0307477_1109980423300031753Hardwood Forest SoilMNEKRTSMMTICLRVVLALSPSLSLAQGTTTFSGEAVALKANVLGISLTLADTGQLPSTG
Ga0307475_1039667313300031754Hardwood Forest SoilMNEKRTSMMTICLLVGLALSPALGFAQGTTTFSGEAVALKANALGISLSIADTG
Ga0307478_1098909913300031823Hardwood Forest SoilMNAKRTSTIAICLTAIVALSPVAGFAQATTTFSGEAVALRANVLGTSLALSDTGALPS
Ga0307478_1151528413300031823Hardwood Forest SoilMNTKKNSTMAIGLTAILALSPVASFAQATTTFSGEAVALRASALGISLTLADTGA
Ga0307479_1019890613300031962Hardwood Forest SoilVQINAKTNSTIAICLTAILALIPIASFAQATITFSGEAVALRANAAGIALALSDTGALPSSGGNLSTSLASVN
Ga0307479_1035232113300031962Hardwood Forest SoilMNSKKSSTIAICFVAILAFGPVASFAQATTTFSGEAVALRANALGISLDLSDTGPLAAS
Ga0307471_10011767133300032180Hardwood Forest SoilMNTKTNSTIAICLTAILALSPVASFARGTTTFSGEAVALRASAVGISLALSDTGAFRLAAGI
Ga0306920_10122298913300032261SoilMNEKKAIIAVCLAAVLAFSPVVSLAQGTTTFSGEAVALKATAAGISLALG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.