NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F099329


Overview

Basic Information
Family ID F099329
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 94 residues
Representative Sequence VEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDQTATKYGLKLTEEEVSTLAAIAGSQELSGEDLAAVSGGLTFFDNNCGCSA
Number of Associated Samples 73
Number of Associated Scaffolds 99

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 72
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.53

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(13.592 % of family members)
Environment Ontology (ENVO) Unclassified
(25.243 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(43.689 % of family members)
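
The three percentages above are each given as a share of family members, so with the 103 sequences listed under Basic Information they imply whole-number member counts. The sketch below backs those counts out and re-derives the percentages as a consistency check. The counts are inferred from the rounded figures and are not reported on this page; the dictionary keys and the function name are illustrative.

```python
# Back out member counts from the reported "% of family members" values.
# Assumption: each percentage is relative to the 103 sequences in the family.
FAMILY_SIZE = 103  # "Number of Sequences" from Basic Information

REPORTED_PERCENT = {
    "GOLD Ecosystem (Forest Soil)": 13.592,
    "ENVO (Unclassified)": 25.243,
    "EMPO (Plant rhizosphere)": 43.689,
}


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Nearest whole number of members that matches a reported percentage."""
    return round(percent / 100 * total)


for label, percent in REPORTED_PERCENT.items():
    count = implied_count(percent)
    recomputed = 100 * count / FAMILY_SIZE
    print(f"{label}: {count} of {FAMILY_SIZE} members ({recomputed:.3f} %)")
```

Each recomputed value agrees with the reported one to three decimal places, which supports reading the figures as counts out of 103.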





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 48.74%, β-sheet: 0.00%, Coil/Unstructured: 51.26%

Predicted 3D Structure

Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.53

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
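
The rule in the note above can be written as a simple threshold check. This is a minimal sketch: the 0.7 cutoff and the 0.53 score come from this page, while the constant and function names are hypothetical and do not reflect how NMPFamsDB implements the check.

```python
# Threshold check for the "Low Quality Model" note.
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the note (pTM < 0.7)


def is_low_confidence(ptm: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


# pTM-score reported for family F099329
print(is_low_confidence(0.53))
```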



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 99 Family Scaffolds
PF04055 | Radical_SAM | 1.01
PF00756 | Esterase | 1.01




Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 13.59%
Host-Associated | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated | 10.68%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 10.68%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.74%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 8.74%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 5.83%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 4.85%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 4.85%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 3.88%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 3.88%
Soil | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil | 2.91%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.94%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.94%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.94%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.94%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.94%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.94%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.97%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.97%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.97%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.97%
Bog | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Bog | 0.97%
Ectomycorrhiza | Host-Associated → Plants → Roots → Unclassified → Unclassified → Ectomycorrhiza | 0.97%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.97%
Enhanced Biological Phosphorus Removal Bioreactor | Engineered → Wastewater → Nutrient Removal → Biological Phosphorus Removal → Activated Sludge → Enhanced Biological Phosphorus Removal Bioreactor | 0.97%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002864 | Avena fatua rhizosphere microbial communities - H2_Rhizo_Litter_7 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003161 | Avena fatua rhizosphere microbial communities - H2_Rhizo_Litter_8 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003163 | Avena fatua rhizosphere microbial communities - H1_Rhizo_Litter_2 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003305 | Avena fatua rhizosphere microbial communities - H3_Rhizo_Litter_13 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003321 | Sugarcane bulk soil Sample H1 | Environmental
3300003544 | Grassland soil microbial communities from Hopland, California, USA - Sample H2_Rhizo_33 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003569 | Grassland soil microbial communities from Hopland, California, USA - Sample H2_Bulk_36 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003572 | Grassland soil microbial communities from Hopland, California, USA - Sample H3_Bulk_40 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003574 | Grassland soil microbial communities from Hopland, California, USA - Sample H1_Rhizo_26 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003577 | Grassland soil microbial communities from Hopland, California, USA - Sample H2_Rhizo_32 (Metagenome Metatranscriptome, Counting Only) | Host-Associated
3300003840 | Avena fatua rhizosphere microbial communities from Hopland, California, USA - H2_Bulk_Litter_50 (Metagenome Metatranscriptome) | Host-Associated
3300004785 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - roots SR-1 (Metagenome Metatranscriptome) | Host-Associated
3300004798 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - roots SR-2 (Metagenome Metatranscriptome) | Host-Associated
3300004799 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-3 (Metagenome Metatranscriptome) | Host-Associated
3300004800 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-1 (Metagenome Metatranscriptome) | Host-Associated
3300004801 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - roots SR-3 (Metagenome Metatranscriptome) | Host-Associated
3300005466 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3L metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental
3300005841 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 | Host-Associated
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006417 | Combined Assembly of Gp0110018, Gp0110022, Gp0110020 | Engineered
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009545 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG | Host-Associated
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011332 | Soil microbial communities from California, USA to study soil gas exchange rates - SR-CA-SC2 metaT (Metagenome Metatranscriptome) | Environmental
3300011333 | Cornfield soil microbial communities from Stanford, California, USA - CI-CA-CRN metaT (Metagenome Metatranscriptome) | Environmental
3300011431 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT157_2 | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300019229 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT660_1_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019244 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT293_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019249 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome) | Environmental
3300019263 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5 (Metagenome Metatranscriptome) | Environmental
3300019880 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a1 | Environmental
3300020065 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT499_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300020070 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2) | Environmental
3300022503 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-2-O (Metagenome Metatranscriptome) (v2) | Environmental
3300022530 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-O (Metagenome Metatranscriptome) (v2) | Environmental
3300022721 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-O (Metagenome Metatranscriptome) (v2) | Environmental
3300025936 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes) | Environmental
3300026035 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes) | Host-Associated
3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300028869 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Fen_N1_4 | Environmental
3300029984 | I_Fen_E1 coassembly | Environmental
3300029990 | I_Fen_N2 coassembly | Environmental
3300030114 | I_Fen_E2 coassembly | Environmental
3300030339 | III_Bog_N1 coassembly | Environmental
3300030776 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300030917 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FB5 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300030967 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA11 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300030981 | Forest soil microbial communities from USA, for metatranscriptomics studies - Jemez Pines PO 4C (Eukaryote Community Metatranscriptome) | Environmental
3300030988 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_157 (Metagenome Metatranscriptome) | Environmental
3300030990 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome) | Environmental
3300031094 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome) | Environmental
3300031096 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_194 (Metagenome Metatranscriptome) | Environmental
3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental
3300031123 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_196 (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031232 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_3 | Environmental
3300031469 | Fir Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031507 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 10_EM | Host-Associated
3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental
3300034659 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R1 (Metagenome Metatranscriptome) | Environmental
3300034662 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R4 (Metagenome Metatranscriptome) | Environmental
3300034665 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R4 (Metagenome Metatranscriptome) | Environmental
3300034666 | Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R2 (Metagenome Metatranscriptome) | Environmental
3300034671 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24R1 (Metagenome Metatranscriptome) | Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0006764J43180_11224213300002864Avena Fatua RhizosphereGVNDTQTTDKENNVEDKKDGHRAYTAERKAKLIQLTEDLRSSDTLLTEFVGKPDPTAAKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNTNCGCNDV*
Ga0006765J45826_10891413300003161Avena Fatua RhizosphereVEDKKDGHRAYTAERKAKLIQLTEDLRSSDTLLTEFVGKPDPTAAKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNTNCGCNDV*
Ga0006759J45824_102463213300003163Avena Fatua RhizosphereDTQTTDKENNVEDKKDGHRAYTAERKAKLIQLTEDLRSSDTLLTEFVGKPDPTAAKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNTNCGCNDV*
Ga0006759J45824_103237313300003163Avena Fatua RhizosphereVEEKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSALAAIAGDQELSEAGLATVSGGFFDNNCGCGGGGA*
Ga0006770J48903_100969313300003305Avena Fatua RhizosphereVEDNKDGHRAYTAERKAKLIRLTADLRNSDLLLTEFVGKPGTTATKYGLQLTEEEVSTLAAIAGNQELTGDDLSAVSGGLMMFDNNCGCGST*
soilH1_1033122413300003321Sugarcane Root And Bulk SoilVEEKKDGHRAYTAERKAKLLRLAEALRNSDQLLAEFAGKPAQTAAKYDLQLTEEEVSALTAIARSQELDEEALAAVAGGSLNPDATGSNGNCNC*
soilH1_1033122423300003321Sugarcane Root And Bulk SoilVEDKKAGHRAYAAERKAKLLRLTEDLRNSDPLLAEFVGKPDQTAAKYGLTLTEEEVSTLAAIAGNQELSGDDLAAVSGGGGIATFFDNNCGCSA*
Ga0007417J51691_100709713300003544Avena Fatua RhizospherePPSGVNDTQTTDKENNVEDKKDGHRAYTAERKAKLIQLTEDLRSSDTLLTEFVGKPDPTAAKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNTNCGCNDV*
Ga0007420J51693_100133623300003569Avena Fatua RhizosphereVEENKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDRTATKYGLQLTEEEVSTLAAIAGNQELSEEGLSAVSGGTAMIFDNNCNCAGSV*
Ga0007424J51698_104821613300003572Avena Fatua RhizosphereQTTDKENNVEDKKDGHRAYTAERKAKLIQLTEDLRSSDTLLTEFVGKPDPTAAKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNTNCGCNDV*
Ga0007410J51695_100400613300003574Avena Fatua RhizosphereVEEKKDGHRAYTAERKAKIIRLTADLRNSDSLLTEFVGKPDQTASKYGLQLTTEEVSTLAAIAGNQELTGEALEAVSGGADKPLFFDNNCGCSKV*
Ga0007416J51690_101593413300003577Avena Fatua RhizosphereVEEKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSTLAAIAGDQELSEAGLATVSGGFFDNNCGCGGGGA*
Ga0032355_103368913300003840Avena Fatua RhizosphereVNDTQTTDKENNVEDKKDGHRAYTAERKAKLIQLTEDLRSSDTLLTEFVGKPDPTAAKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNTNCGCNDV*
Ga0058858_100340923300004785Host-AssociatedVEERKDGHRAYAAERKAKLIRLTADLRNSGSLLTEFVGKPDATATKYGLQLTQDEVSALAAIAGNEELSGEELSAVSGGVFDNNCNCVNDTI*
Ga0058859_1000755533300004798Host-AssociatedQRHATTRNDTQRHTRRTNVEEKKDGHRAYTTERKGKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSALAAIAGDHELTEEGLASVSGGIFDNNCNCSGGGA*
Ga0058859_1012857913300004798Host-AssociatedAKLIRLTADLRNSDSLLTEFVGKPDATATKYGLQLTQDEVTALAAIAGNQELSGEELSAVSGGVFDNNCNCVNDTI*
Ga0058859_1012916213300004798Host-AssociatedAKLIRLTADLRNSDSLLTEFVGKPDATATKYGLQLTQDEVSALAAIAGNQELTGEELSAVSGGVFDNNCNCVNDTI*
Ga0058863_1004548813300004799Host-AssociatedVEEKKDGHRAYSAERKAKLIRLTADLRNSDSLLTEFVGKPDQTATKYGLTLTEEEVSTLAAIAGNGELTEEGLAAVSGGVPKAFFDNNCGCSA*
Ga0058863_1005801523300004799Host-AssociatedVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDQTATKYGLKLTEEEVSTLAAIAGSQELTEEGLAAVSGGNPEEILGFFDNNCGCSH*
Ga0058863_1127070423300004799Host-AssociatedVEDKNDGHRAYTAERKASIIRLTEELRSSDSLLTEFVGNPDKLATKYGLKLTEEEISALAAIAGSQELDEAALNAVAGGSTDTTINGNCNC*
Ga0058861_1203905723300004800Host-AssociatedVEDKNDGHRAYTAERKASIIRLTEELRSSDSLLTEFVGNPNKLATKYGLKLTEEEISALAAIAGSQELDEAALNAVAGGSTDTTINGNCNC*
Ga0058860_1003088513300004801Host-AssociatedVEEKKDGHRAYTTERKGKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSALAAIAGDHELTEEGLASVSGGIFDNNCNCSGGGA*
Ga0058860_1010999023300004801Host-AssociatedVEEKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSSLAAIAGDQELSEAGLASVSGGIFDNNCGCSNSGA*
Ga0058860_1016340513300004801Host-AssociatedVEERKDGHRAYAAERKAKLIRLTADLRNSGSLLTEFVGKPDATATKYGLQLTQDEVSALAAIAGNEELSGEELSA
Ga0070685_1018611113300005466Switchgrass RhizosphereVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSALAAIAGDHELTEEGLASVSGGIFDNNCNCSGGGA*
Ga0070741_1027979323300005529Surface SoilMEDKKDGHRAYTAERKARIIRLTADLRNSDSLLTEFVGKPDHTANKYGLQLTQEEVATLAAIAGTGELNGDELAAISGGSSRTVFDNNCNCAGTT*
Ga0070741_1033078723300005529Surface SoilVDNKNDGHRAYTAERKAKIAQLTEDLRSSDSLLTELLSKPDQTAAKYGLQLTQEEVSTIAAIAGGQELSGEELAAVAGGDNGNCNCSNQK*
Ga0070702_10037882323300005615Corn, Switchgrass And Miscanthus RhizosphereVEEKKDGHRAYVAERRAKLVRLAADLRNSDSALTEFVGKPTETATKYGLQLTDDEVSTLAAIAGSQELSGEDLAAVSGGLMIFDNNCGCSGGGPAAW*
Ga0068863_10016252223300005841Switchgrass RhizosphereVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDATATKYGLQLTQDEVSALAAIAGNQELTGEELSAVSGGVFDNNCNCVNDTI*
Ga0070717_1090015713300006028Corn, Switchgrass And Miscanthus RhizosphereRSDTQRRTNVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTDFVGKPDSTASKYGLQLTDEEISALAAIAGDHELSEAGLASVSGGLFDNNCGCGGGKA*
Ga0069787_1316447923300006417Enhanced Biological Phosphorus Removal BioreactorVDNKNEGHRAYTAERKAKIIRFTADLRNSETLLTEFVGKPDMTADKYGLKLTEEEVSTLAAIAGNGELNGEELAAVSGGAAAMFFDNNCGCANAGCDNDAV*
Ga0075425_10097037333300006854Populus RhizosphereMEQKKDGHGAYAAERKAKLIRLAADLRNSDSLLTDFVGKPDETATKYGLQLTDEEVSTLALIAQNQELSGNDLAAVSGGIATLFDVNCMCPKG*
Ga0079219_1216774623300006954Agricultural SoilVEEKKDGHRAYNAERKAKLLRLTTDLRSSDSLLTEFVGKPDSTASKYGLQLTDDEISALAAIAGDQELSEAGLAAVSGGLFDNNCGCGGV
Ga0075435_10100462013300007076Populus RhizosphereRQTNDTQRRTNVEEKKDGHRAYVAERRAKLVRLAADLRNSDSALTEFVGKPTETATKYGLQLTDDEVSTLAAIAGSQELSGEDLAAVSGGLMIFDNNCGCSGGGPAAW*
Ga0105237_1252228723300009545Corn RhizosphereVEDKKDGHRAFTAERKAKLIQLTEDLRKSDTLLTEFVGKPDQTATKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNT
Ga0126372_1187376013300010360Tropical Forest SoilVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDQTATKYGLKLTEEEVSTLAAIAGSQELTEEGLAAVSGGLPMSAFFNNNCSCG*
Ga0134121_1174404123300010401Terrestrial SoilVEEKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLSYEEVSSLAAIAGDQELSEAGLASVSGGIFDNNCGCSNSGA*
Ga0134123_1017134023300010403Terrestrial SoilVEEKNEGHRAYTAERKVKLIRLTADLRNSDSLMTEFVGKPDQTANKYGLKLTEEEVSTLAALAGTGELSGDDLAAVSGGFFDNNCGCIKVQTE*
Ga0126317_1050189713300011332SoilVEEKKDGHRAYTAERKAKLIRLTADLRNSGSLLTEFVGKPDHTANKYGLKLTEEEVSTLAAIAGSQELTDEGLAAVSGGSSPIETFFDNNCGCSA*
Ga0126317_1106499323300011332SoilVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLMEFVGKPDQTATKYGLKLTEEEVSTLAAIAGSQELSGEDLAAVSGGLTFFDNNCGCSA*
Ga0127502_1004188233300011333SoilVEEKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSALAMIAGDQELSEAGLASVSGGLFDNNCGCGVIIDKTM*
Ga0137438_100125143300011431SoilVNDTQTTHQGEQDVEDKKDGHRAYTAERKAKIIRLTADLRNSDSLLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGTGELTGEALDAVSGGADKPLFFDNNCGCGPIDQA*
Ga0150985_10048497323300012212Avena Fatua RhizosphereAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSTLAAIAGDQELSEAGLATVSGGFFDNNCGCGGGGA*
Ga0150985_10489892023300012212Avena Fatua RhizosphereVEEKKDGHRAYTSERKAKLIRLTADLRNSDSLLTEFVGKPDRTATKYGLQLTEEEVTTLAAIAGNQELSEEGLSAVSGGTAMIFDNNCNCAGSV*
Ga0150985_10856800813300012212Avena Fatua RhizosphereVEEKKDGHRAYAAERKAKLLRLTADLRNSDSLLTEFVGKPDSTAKKYELQLTADEVSTLAAIAGDQELSDAGLAAVSGGFFDNNCGCSNKDV*
Ga0150985_11093703413300012212Avena Fatua RhizosphereVEDKKDGHRAYTAERKAKLIRLTADLRNSDSLLTDFVGKPDQTASKYGLQLTEEEVSTLAAIAGNQELTGEALAAVSGGVMTFDNNCNCGAV*
Ga0150985_11342777423300012212Avena Fatua RhizosphereVEDNKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPGTTATKYGLQLTEEEVSTLAAIAGNQELTGDDLSAVSGGLMMFDNNCGCGST*
Ga0150985_12070933813300012212Avena Fatua RhizosphereVEDKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGTPDSTATKYGLQLTEEEVSTLAAIAGNQELSEAGLDAVSGGAAAGFFDNNCNCGNST*
Ga0150984_10076827933300012469Avena Fatua RhizosphereVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDQTATKYGLKLTEEEVSTLAAIAGSQELSGEDLAAVSGGLTFFDNNCGCSA*
Ga0150984_11451993923300012469Avena Fatua RhizosphereAYTAERKAKIIRLTADLRNSDSLLTEFVGKPDQTASKYGLQLTTEEVSTLAAIAGNQELTGEALEAVSGGADKPLFFDNNCGCSKV*
Ga0150984_11958922313300012469Avena Fatua RhizosphereVEENKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDRTATKYGLQLTEEEVSTLAAIAGNQELSEEGLAAVSGGTAMIFDNNCGCSGSV*
Ga0180116_113462523300019229Groundwater SedimentVEEKKDGHRAYAAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSTLAAIAGDQELSDAGLAAVSGGFFDNNCGCGGGKIDV
Ga0180111_136514013300019244Groundwater SedimentVEKKETGHRAYTAERKANIIRLTEDLRSSDSLLTEFVGKPDQMATKYGLKLTEEEVSALAAIAGSQELNEDALAAVAGGGDININCPCSNEA
Ga0184648_121721413300019249Groundwater SedimentVEEKQDGHRAYSAERKVNLIRLTADLRNSGALLTEFVGKPDQTASKYNLKLTEEEVSTLAAIAGDGELSGEDLAAVSGGFFDNNCGCVKSIDV
Ga0184647_125482913300019263Groundwater SedimentLAIERHTNKEPRRTNVEEKKDGHRAYVAERKAKLIQLNTDLRKSDSLLAEFVSKPDQTATKYGLKLTEEEASVLAAIAGSQELTDDALAAVAGGSLADNGNCNC
Ga0193712_102170523300019880SoilVEDKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTEEEVSTLAAIAGSQELSEEGLAAVSGGNAVAFFDNNCNCAG
Ga0180113_140387013300020065Groundwater SedimentSNDTQATDKENDVEEKEDGHRAYTAERKANIIRLTEDLRSSDSLLTEFVGNPDQMATKYGLRLTEEEVSALAAIAGSQELNDEALAAVAGGDSLDVNIGCPVTNNSGCG
Ga0206356_1022777013300020070Corn, Switchgrass And Miscanthus RhizosphereVEDKNDGHRAYTAERKASIIRLTEELRSSDSLLTEFVGNPNKLATKYGLKLTEEEISALAAIAGSQELDEAALNAVAGGSTDTTINGNCNC
Ga0242650_101615113300022503SoilVEDKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDTTATKYGLQLTDEEVSTLAAIAGDQELSEAGLATVSGGFFDNNCGCGNGKA
Ga0242650_102276923300022503SoilVEDNKDSHRAYTSERKARIIRLTADLRNSDSLLTEFVGKPDQTAGKYGLQLTEEEVTTLAAIAGTQELTGEALDAVSGGTNQPHFFDNNCSCRPQQA
Ga0242650_102332513300022503SoilNVEDKKDGHRAYAADRKAKLTKLTEDLRNSDSLLSEFVGNPTPTATKYGLQLTQEEVSALAAIAGGQELSGEDLAAVAGGRAPDNGNCNC
Ga0242658_119289013300022530SoilMADNKDGHRAYTAERNAKLSQLTDDLRSSDTLLIEFVGKPDQTATKYGLQLTEEEVSTLAAIAGSQELSGDALAAVSGGKTDININCGC
Ga0242658_122649513300022530SoilMADNKDGHRTYTAERKAKLSQLTEDLRNSESLLAEFVANPEQTATKYSLQLTEEEIVAISGNQELSGEDLAAVAGGNGNCNCGDQK
Ga0242666_101692323300022721SoilVEDKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDQTSAKYGLSLTQEEVSTLAAIAGDGELTGESLAAISGGSSKSAFFDNNCSCAAGTT
Ga0242666_101692333300022721SoilVEDKKDGHRAYSAERKARILRLTADLRNSDSLLTEFVGKPAQTAAKYGLSLTEEEVTTLAAIAGSQELTGEDLAAVAGGTNSPHFFDNNCSCRPSTE
Ga0242666_115285213300022721SoilVADKKDGHREYAAERKANLTRLTEDLRGSDSLLKEFVDNPDQTSTKYNLRLTEEEVTALAAIAGSQELDEAALAAVAGGETNVNCHGACSSA
Ga0207670_1020748723300025936Switchgrass RhizosphereVEERKDGHRAYAAERKAKLIRLTADLRNSGSLLTEFVGKPDATATKYGLQLTQDEVSALAAIAGNEELSGEELSAVSGGVFDNNCNCVNDTI
Ga0207670_1028430013300025936Switchgrass RhizosphereVEEKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSSLAAIAGDQELSEAGLASVSGGIFDNNCGCSNSGA
Ga0207670_1040767323300025936Switchgrass RhizosphereVEEKKDGHRAYTTERKGKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSALAAIAGDHELTEEGLASVSGGIFDNNCNCSGGGA
Ga0207703_1150278923300026035Switchgrass RhizosphereVEEKKDGHRAYVAERRAKLVRLAADLRNSDSALTEFVGKPTETATKYGLQLTDDEVSTLAAIAGSQELSGEDLAAVSGGLMIFDNNCGCSGGGPAAW
Ga0207641_1016810923300026088Switchgrass RhizosphereVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDATATKYGLQLTQDEVSALAAIAGNQELTGEELSAVSGGVFDNNCNCVNDTI
Ga0268264_1262210823300028381Switchgrass RhizosphereVEEKKDGHRAYVAERRAKLVRLAADLRNSDSALTEFVGKPTETATKYGLQLTDDEVSTLAAIAGSQELSGEDLAAVSGGLMIFDNNCGCSGGGPAA
Ga0302263_1014980223300028869FenMEQKKDGHREYVAERKAKLVRMAADLRNSDSLLTEFVGKPGQTATKYDLQLTEEEISALAAIAGSQELSGEDLAAVSGGLITAFDNNCGCGGGGGTNSW
Ga0311332_1082442723300029984FenVEEKKDGHRAYLTERKAKIIRLTADLRNSDSLLTEFVGKPGQTATKYGLQLTEEEVSTLAMIAGNQELNGEDLAAVSGGVVTFFDNNCKCG
Ga0311336_1177104823300029990FenTNMEQKKDGHREYVAERKAKLVRMAADLRNSDSLLTEFVGKPGQTATKYDLQLTEEEISALAAIAGSQELSGEDLAAVSGGLITAFDNNCGCGGGGGTNSW
Ga0311333_1179618213300030114FenEYVAERKAKLVRMAADLRNSDSLLTEFVGKPGQTATKYDLQLTEEEISALAAIAGSQELSGEDLAAVSGGLITAFDNNCGCGGGGGTNSW
Ga0311360_1068695813300030339BogDGHREYVAERKAKLVRMAADLRNSDSLLTEFVGKPGQTATKYDLQLTEEEISALAAIAGSQELSGEDLAAVSGGLITAFDNNCGCGGGGGTNSW
Ga0075396_147263723300030776SoilVEDKKDGHRVYTAERKAKLIRLTADLRNSDALLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGNQELTGEALAAISGGTAMQFDNNCGCGGST
Ga0075382_1167343923300030917SoilVEEKQDGHRAYTAERKVNLIRLTADLRNSDALLTEFVGKPDQTAMKYNLKLTEEEVSTLAAIAGSGELSGDDLAAVSGGFFDNNCGCIKTT
Ga0075382_1168017013300030917SoilNDTQRRMNVEDKKDGHRVYTAERKAKLIRLTADLRNSDALLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGNQELTGEALAAISGGTAMQFDNNCGCGGST
Ga0075399_1130343113300030967SoilDTQRRMNVEDKKDGHRVYTAERKAKLIRLTADLRNSDALLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGNQELTGEALAAISGGTAMQFDNNCGCGGST
Ga0102770_1122857513300030981SoilVEDKKDGHRAYTAERKAKIIRLTADLRNSDALLTEFVGKPDQTAAKYGLQLTEEEVTTLAAIAGTQELTGEALEAV
Ga0102770_1122857523300030981SoilEDKKDGHRAYAADRKAKLVKLTEDLQGSDSLLTEFVKKPDQTATKYGLQLTEEEVATLAAIGGQELSGEALAAVSGGRAPDNGNCNC
Ga0308183_103771513300030988SoilVADKKDGHREYAAERKANIIRLTEDLRSSESQLSEFVGNPDQTATKYNLRLTTEEVAALAALAGSQELDEAALAAVAGGTKESTDININCGC
Ga0308178_116630013300030990SoilQRRANVEEKKDGHRAYTAERKGKIIRMTADLRNSDSLLTEFVGKPDQTATKYGLTLTQDEISALAAIAGSGELSGEDLAAVSGGFFDNNCGCGKVEIDAG
Ga0308199_107886113300031094SoilVEDKKDGHRAYTSERKAKIIRLTADLRNSDSLLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGTGELTGEALDAVSGGADKPLFFDNNCNCNQR
Ga0308199_112086723300031094SoilVEDKKDGHRVYTAERKAKLIRLTADLRNSDALLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGNQELTGEALAAISGGTAMQFDNNCGCGSGT
Ga0308193_102667713300031096SoilAIERHTNDTPRRTNVEDKKDGHRAYTAERKAKIIRLTADLRNSDSLLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGTQELTGEALEAVAGGADKPLFFDNNCGCGKNTN
Ga0308187_1017187723300031114SoilVEEKQDGHRAYSAERKVNLIRLTADLRNSGALLTEFVGKPDQTASKYNLKLTEEEVSTLAAIAGTGELTGEALDAVSGGADKPLFFDNNCGCVKSIDV
Ga0308195_101379113300031123SoilMDEKDGHRAYITERKAKIIRLAADLRNSDSLLTEFVGKPGPTATKYGLQLTEEEVSTLAMIAGNQELSGQDLAAVSGGVATFFDNNCKCSG
Ga0308195_101379123300031123SoilMEEKKDGHRAYVAERKAKLVRMAADLRNSDSLLTEFVGKPGQTATKYGLQLTEEEISTLAAIAGSQELSGDDLAAVSGGLITVFDNNCGCGGGGGSSSW
Ga0170824_10845806113300031231Forest SoilVEEKQDGHRAYSAERKVNLIRLTADLRNSGALLTEFVGKPDQTASKYGLKLTEEEVSTLAAIAGDGELSGDDLAAVSGGFFDNNCGCIKTT
Ga0302323_10033902513300031232FenGHRAYLTERKAKIIRLTADLRNSDSLLTEFVGKPGQTATKYGLQLTEEEVSTLAMIAGNQELNGEDLAAVSGGVVTFFDNNCKCG
Ga0170819_1734518813300031469Forest SoilERKAKLIRLTADLRNSDALLTEFVGKPDQTATKYGLQLTEEEVSTLAAIAGNQELTGEALAAISGGTAMQFDNNCGCGGST
Ga0307509_1062778323300031507EctomycorrhizaVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTEFVGKPDQTATKYGLNLTQEEVSTLAAIAGNQELTDEGLSAVSGGSSPIETFFDNNCGCSA
Ga0214473_1065333413300031949SoilVEEKKDGHRAYTADRKVKLLRLTADLRSSDSLLTEFVGKPDSTATKYGLQLTDDEVSALAAIAGDQELSEASLASVSGGFFDNNCGCSGSSNG
Ga0214473_1121774913300031949SoilAERKVKLLRLTADLRNSDSLLTEFVGKPDSTAGKYGLQLTDDEVSTLAAIAGDQELSEASLASVSGGFFDNNCGCSGGGKDIDA
Ga0314780_050254_453_7403300034659SoilVEEKKDGHRAYSAERKAKLIRLTADLRNSDSLLTEFVGKPDHTANKYGLQLTEEEVSTLAAIAGNQELTDEGLAAVSGGSSPISTFFDNNCGCSA
Ga0314780_181751_237_5123300034659SoilVEDKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDSTATKYGLQLTDEEVSALAVIAGDQELSDAGLAAVSGGVFDNNCGCGVKS
Ga0314783_178014_97_3873300034662SoilVEEKKDGHRAYSAERKAKLIRLTADLRNSDSLLTEFVGKPDHTANKYGLQLTEEEVSTLAAIAGNQELSDEGLAAVSGGAMQMIFDNNCNCGNGTV
Ga0314787_083812_114_3953300034665SoilVEEKKDGHRAYTAERKAKLLRLTADLRNSDSLLTEFVGKPDQTATKYGLTLTEEEVSTLAALAGTGELSGDDLAAVSGGFFDNNCGCIKVQTE
Ga0314788_049720_98_3733300034666SoilVEEKKDGHRAYTAERKVKLIQLTADLRGSDSLLTEFVGKPDHTASKYGLKLTEEEVSTLAAIAGGQELSGDDLAAVSGGLFDNNCGCIRDV
Ga0314796_113632_216_5003300034671SoilVEEKKDGHRAYTAERKAKLIRLTADLRNSDSLLTDFVGKPDSTASKYGLQLTDEEISALAAIAGDQELSEAGLASVSGGLFDNNCGCGGGVKTS
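
In this text rendering the four columns of each row run together without separators: the protein ID, the sample taxon ID, the habitat and the sequence. The sketch below splits a row back into its fields. It rests on assumptions drawn from the rows on this page rather than on any documented NMPFamsDB format: the taxon ID is always 10 digits, the habitat is a title-case label ending in a lowercase letter, and the sequence is upper-case residues with an optional trailing `*` (conventionally a translated stop codon). The class, pattern and function names are hypothetical.

```python
import re
from typing import NamedTuple


class FamilyMember(NamedTuple):
    protein_id: str
    taxon_oid: str
    habitat: str
    sequence: str
    has_stop: bool  # True when the row ends with '*'


# Assumed layout: <protein id><10-digit taxon OID><Title Case habitat><SEQUENCE>[*]
ROW_PATTERN = re.compile(
    r"(?P<protein_id>.+?)"                   # shortest prefix before the taxon OID
    r"(?P<taxon_oid>\d{10})"                 # the 10 digits directly before the habitat
    r"(?P<habitat>[A-Z][A-Za-z ,\-]*[a-z])"  # label ending in a lowercase letter
    r"(?P<sequence>[A-Z]+)"                  # residues are upper case only
    r"(?P<stop>\*?)"                         # optional trailing asterisk
)


def parse_row(line: str) -> FamilyMember:
    """Split one fused Family Sequences row into its four fields."""
    match = ROW_PATTERN.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    return FamilyMember(
        protein_id=match["protein_id"],
        taxon_oid=match["taxon_oid"],
        habitat=match["habitat"],
        sequence=match["sequence"],
        has_stop=match["stop"] == "*",
    )


# One row copied from the table above
EXAMPLE_ROW = (
    "Ga0006765J45826_10891413300003161Avena Fatua Rhizosphere"
    "VEDKKDGHRAYTAERKAKLIQLTEDLRSSDTLLTEFVGKPDPTAAKYGLKLTEEEVTALAALAAGQGELTGEALTAVSGGSNTNCGCNDV*"
)

member = parse_row(EXAMPLE_ROW)
print(member.protein_id, member.taxon_oid, member.habitat, len(member.sequence))
```

The split is unambiguous under these assumptions because the taxon ID must sit directly before the first habitat letter, and the habitat/sequence boundary is the last lowercase letter in the row.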




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice