NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F055589


Overview

Basic Information
Family ID F055589
Family Type Metagenome / Metatranscriptome
Number of Sequences 138
Average Sequence Length 68 residues
Representative Sequence GETIFFCQDEDTKAALVEAGAEPWSVYTRDELRVLVAQNRVAPLSHAELRKVHDIKRTFNARIAE
Number of Associated Samples 82
Number of Associated Scaffolds 138

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 5.07 %
% of genes from short scaffolds (< 2000 bps) 5.07 %
Associated GOLD sequencing projects 66
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.52

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (sequence logo, rendered with Skylign)

Most Common Taxonomy
Group Unclassified (97.101 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(28.985 % of family members)
Environment Ontology (ENVO) Unclassified
(39.130 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(73.913 % of family members)
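
The percentages above are fractions of the 138 family members, so they can be converted back into approximate member counts. A minimal sketch, assuming each percentage was computed as count / 138 (the page does not state this explicitly); the function name is hypothetical and not part of NMPFamsDB:

```python
def members_from_percent(percent: float, family_size: int = 138) -> int:
    """Estimate how many family members a reported percentage corresponds to.

    Assumes the percentage was derived as count / family_size * 100.
    """
    return round(percent / 100 * family_size)


# Percentages quoted in the overview of family F055589.
for label, pct in [
    ("Unclassified taxonomy", 97.101),
    ("GOLD: Forest Soil -> Soil", 28.985),
    ("ENVO: Unclassified", 39.130),
    ("EMPO: Soil (non-saline)", 73.913),
]:
    print(label, members_from_percent(pct))
```

Under that assumption, 97.101 % corresponds to 134 of the 138 members.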





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 39.78%, β-sheet: 6.45%, Coil/Unstructured: 53.76%

Predicted 3D Structure

pTM-score: 0.52

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
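
The low-quality flag follows directly from the threshold quoted above. A minimal sketch of that check, assuming a strict comparison against 0.7 as the note implies; the function and constant names are hypothetical and not taken from NMPFamsDB:

```python
# Threshold quoted on this page: models with pTM < 0.7 are low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the threshold."""
    return ptm < threshold


print(is_low_confidence(0.52))  # pTM-score reported for family F055589
```

For this family (pTM-score 0.52) the check returns True, which matches the page's statement that the model was not screened against SCOPe or PDB.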



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 138 Family Scaffolds
PF00216 | Bac_DNA_binding | 1.45
PF13884 | Peptidase_S74 | 1.45
PF00226 | DnaJ | 0.72
PF01351 | RNase_HII | 0.72
PF00326 | Peptidase_S9 | 0.72
PF08241 | Methyltransf_11 | 0.72
PF01839 | FG-GAP | 0.72
PF12704 | MacB_PCD | 0.72
PF13520 | AA_permease_2 | 0.72
PF13371 | TPR_9 | 0.72
PF13181 | TPR_8 | 0.72
PF08279 | HTH_11 | 0.72
PF13668 | Ferritin_2 | 0.72
PF00589 | Phage_integrase | 0.72
PF05656 | DUF805 | 0.72
PF12770 | CHAT | 0.72
PF03544 | TonB_C | 0.72
PF12307 | DUF3631 | 0.72
PF04226 | Transgly_assoc | 0.72
PF00041 | fn3 | 0.72
PF01402 | RHH_1 | 0.72
PF07878 | RHH_5 | 0.72
PF13847 | Methyltransf_31 | 0.72
PF13481 | AAA_25 | 0.72

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 138 Family Scaffolds
COG0776 | Bacterial nucleoid DNA-binding protein IHF-alpha | Replication, recombination and repair [L] | 1.45
COG0164 | Ribonuclease HII | Replication, recombination and repair [L] | 0.72
COG0810 | Periplasmic protein TonB, links inner and outer membranes | Cell wall/membrane/envelope biogenesis [M] | 0.72
COG1039 | Ribonuclease HIII | Replication, recombination and repair [L] | 0.72
COG2261 | Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein family | General function prediction only [R] | 0.72
COG3152 | Uncharacterized membrane protein YhaH, DUF805 family | Function unknown [S] | 0.72



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 97.10 %
All Organisms | root | All Organisms | 2.90 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300005713|Ga0066905_100283162 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1292
3300005764|Ga0066903_100010078 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 8975
3300005764|Ga0066903_107593225 | Not Available | 559
3300006796|Ga0066665_10915977 | Not Available | 678
3300012915|Ga0157302_10277414 | Not Available | 640
3300018071|Ga0184618_10390849 | Not Available | 590
3300020012|Ga0193732_1048005 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 730
3300032001|Ga0306922_10840561 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 957




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 28.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 11.59%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 10.87%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 8.70%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 8.70%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 7.97%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.35%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 4.35%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 2.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.17%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.17%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.45%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 1.45%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.45%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.72%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.72%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.72%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.72%



Associated Samples

Taxon OID | Sample Name | Habitat Type
2088090014 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
2170459002 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 direct MP BIO 1O1 lysis 0-21 cm | Environmental
2170459004 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cm (2) | Environmental
2170459005 | Grass soil microbial communities from Rothamsted Park, UK - July 2009 direct MP BIO1O1 lysis 0-21cm | Environmental
2170459007 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect in plug lysis (for fosmid construction) 10-21cm | Environmental
2170459010 | Grass soil microbial communities from Rothamsted Park, UK - December 2009 direct MP BIO1O1 lysis 0-9cm (no DNA from 10 to 21cm!!!) | Environmental
2170459013 | Grass soil microbial communities from Rothamsted Park, UK - July 2010 direct MP BIO 1O1 lysis soil at the rocks surface 0-21cm | Environmental
2170459023 | Grass soil microbial communities from Rothamsted Park, UK - FA3 (control condition) | Environmental
2189573001 | Grass soil microbial communities from Rothamsted Park, UK - FD2 (NaCl 300g/L 5ml) | Environmental
2189573004 | Grass soil microbial communities from Rothamsted Park, UK - FG2 (Nitrogen) | Environmental
2228664022 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000363 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000787 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000789 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300002899 | Soil microbial communities from Manhattan, Kansas, USA - Combined assembly of Kansas soil 100-500um Nextera (ASSEMBLY_DATE=20140607) | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012915 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S103-311B-2 | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300016387 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019361 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2) | Environmental
3300019866 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1m1 | Environmental
3300020012 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s1 | Environmental
3300027018 | Grasslands soil microbial communities from Kansas, USA, that are Nitrogen fertilized - NN575 (SPAdes) | Environmental
3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated
3300030916 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental
3300031719 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2) | Environmental
3300031744 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2) | Environmental
3300031879 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2) | Environmental
3300031890 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2) | Environmental
3300031910 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2) | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental
3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental
3300031945 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082 | Environmental
3300031946 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172 | Environmental
3300031947 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000H | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300032001 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2) | Environmental
3300032051 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f26 | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental
3300032421 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3 | Environmental
3300033289 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108 | Environmental
3300033290 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPIPI_02914700 | 2088090014 | Soil | VLFFCQDEATKAALIEAGADEWSIYTRAELQALCVQNGIKPFTQAELRKLQEIKRTLGARIAK
GPIPI_02521700 | 2088090014 | Soil | MVYSEILEETIFFAEDESSKAALVEAGAEPWSIYTRDELRVLVAQNRVAPLTAAELRKVYEIKRTFHGRITK
E1_00807510 | 2170459002 | Grass Soil | STIFFCQDEATKAALIEAGADAWSIYTRSELQVLCEQNRVAPLSPDELRKVHEIKRTFNGRIAK
E4B_01076950 | 2170459004 | Grass Soil | ATKPALVEAGASEWSIYTRDELRILCEQNRIAPLSPAELRKVHEIKRTFSGRITS
E41_12221180 | 2170459005 | Grass Soil | LLELPFVMVYSKALEETVFFCQDEATKAALIEAGADQWSIYTKDELRVLVAQNRIAPLSQAELRKVHEVNRTFRGRIAR
L02_01734830 | 2170459007 | Grass Soil | TKAALIEAGADSFAIYTRAELTILREANRTAPLTQAELAKLNAIKRTFRGTIY
F62_06760570 | 2170459010 | Grass Soil | EATKAALVQAGAEEWSIYTKDELRTLREQNRVAPLSPAELRKVHEIKRTFSARIASQKRC
N57_08925170 | 2170459013 | Grass Soil | LRDHKPQLIALLGLPFVMVFSETLKETVFFCADEATKAALVQAGASEWRIYTRAELQILCEANRVAPISADELRKLHEITRTFHGRIAK
FA3_00253070 | 2170459023 | Grass Soil | MVFSESLGETIFFCDDEHTKAALVQAGADEWSIYTRAELQILCEHNRIAPFTPAELRKIHEIKRTLNARISPP
FD2_08708060 | 2189573001 | Grass Soil | SETLEETIFFCQDEDTKAALIEAGAEEWSIYTKDELRALVAQNRVAPLSPDELRKVHEIKGTFNGRIVE
FG2_08077880 | 2189573004 | Grass Soil | SEALEETVFFCADEATHTALVQAGAEEWSIYTKDELRTLCEQNRVAPLSPAELRKVHEIKRTFSARIASQKRC
FG2_08476280 | 2189573004 | Grass Soil | VRASAKPSSSAADEATRNVLVEAGALEWSIYTKDELRVLVAQNRVAHLSPDELRKVHEIKRTFNGRIAK
INPgaii200_08616311 | 2228664022 | Soil | MVYSKAFEETIFSCEDEDTKAALVGAGAEPFSIYTRAELEVLVKANRVAPLTIAELTKLNEIKRTFDTKIADRK
INPgaii200_10092431 | 2228664022 | Soil | FCEDEDTKAGLQEAGASEWSIYTRDKLQILVAQNRIAPLGATQLRKLYQIKRTFNARFT
ICChiseqgaiiFebDRAFT_109690361 | 3300000363 | Soil | DEDTKAGLQEAGASEWSIYTRDKLQILVAQNRIAPLGATQLRKLHQIKRAFNARFT*
INPhiseqgaiiFebDRAFT_1014894392 | 3300000364 | Soil | MVFSQIIGETIFFCQDEHTKGALGKGGAEPWSIYTRVELHTLCEQNRVAPLSPAELQKIHEIRKTFDGRIPPQ*
INPhiseqgaiiFebDRAFT_1018527441 | 3300000364 | Soil | DDQTRDALVEAGADSFSIYTKDELRILCEQNRVAPISAAELRKIHDLKRTFNARITSEDFK*
INPhiseqgaiiFebDRAFT_1052045942 | 3300000364 | Soil | AALIEAGADSFSIYTKRELPVLIAQNRVAPFTDDELRKVHEIKRTFNGRIAE*
JGI11643J11755_117316532 | 3300000787 | Soil | FFCEDEDTKAGLQEAGASEWSIYTRDKLQILVAQNRIAPLGATQLRKLHQIKRAFNARFT
JGI11643J11755_117680812 | 3300000787 | Soil | MVYSEILEETIFFAEDESSKAALVEAGAEPWSIYTRDELRVLVAQNRVAPLTAAXLRKVYEIKRTFHXRITK*
JGI1027J11758_128272052 | 3300000789 | Soil | FFCQDEGTKEILIEAGAEPWSIYTRDELQILVAQNRIAPLSEDDLKKVHTLKRTFNARVAE*
JGI1027J12803_1002149083 | 3300000955 | Soil | MVYSKAFEETIFSCEDEDTKAALVGAGAEPFSIYTRAELEVLVKANRVAPLTIAELTKLNEIKRTFDTKIADRK*
JGI1027J12803_1091256934 | 3300000955 | Soil | LCEDEVTKAALVEVGASPWSIYTKDELQTLCEQNRIAPLTTAELRKLHEIKRTFHGRITK
JGI10216J12902_1023284621 | 3300000956 | Soil | PFVMVFSQILEETLFFCQDQRTKAALIDAGASEWSIYTKDELRILVAQNRIAPLSDAELRKIHDIKRTFNARYSP*
JGIcombinedJ43975_100669831 | 3300002899 | Soil | DEDTKGALVEAGASEWSIYTRDELQTLCEQNRVAPLSPAELRKVHEIKRTVSGRVTK*
JGIcombinedJ43975_100823191 | 3300002899 | Soil | SKTLEETIFFCQDDDTRDALVRAGADPFAIYTKAELRVLVEQNRIAPFTPAELRKVHEIKRTVSGRITS*
Ga0062593_1027182722 | 3300004114 | Soil | MRTPKGTLVEAGADSWSIYTRAELQTLCVQNRVAPLTAAELRKIHEIKRAFDGRITK*
Ga0062593_1030882731 | 3300004114 | Soil | FVMLFSESLGTPIFFCPDDDTKAALCEGGAEPFSIYTRDELRVLVEQNRVARISIAELRKVHDIKRTFSGRITSYKT*
Ga0062590_1018928732 | 3300004157 | Soil | GLPFVMVFSETLKETIFFCADEGTKAALIEAGAEEWNIYTKAELRTLCEQNRVAPLTADELRKVHEIKRTFDGRIAT*
Ga0066395_108490501 | 3300004633 | Tropical Forest Soil | CDDEHTKAALVEAGASEWSIYTRAELRTLVTQNRVRPFTDDELRKVHEIKRTFHGRITS*
Ga0066388_1005885552 | 3300005332 | Tropical Forest Soil | MLSWPFVMVYSQALGETVFFCEDEDTKASLVEAGASEWSIYTKAELRILVAQNRIAPLTDQELRTLHSIKGAFNARITEY*
Ga0066388_1019710821 | 3300005332 | Tropical Forest Soil | DTKAALAEAGAEPWSIYTLDELQVLVAQNRIKPLSPHELQKIHELKTAFNAPITPK*
Ga0066706_114148932 | 3300005598 | Soil | LHLLRLPFVMVYSERVGETLIFCEDDATRGALVEAGVSEWSIYTKAELRTLCAQNRIAPLSDAELRKLHDIRKTFHARIARGNGG*
Ga0066905_1002831621 | 3300005713 | Tropical Forest Soil | EREWPRKRHLLTMARWPFVMVYSKALGEIIVFCKDEDTKAALVEAGASPWSIYTKAELRQLVQQNRIAPLSQAELRKLHEIKRTFDARINTE*
Ga0066905_1006184921 | 3300005713 | Tropical Forest Soil | EREWPRKRHLLTMARWPFVMVYSKALGEIIVFCKDEDTKEILIEAGAEPWSVYTRAELQILVAQNRVGPLSPDDLKNVHALKRTFNARIVE*
Ga0066905_1018083572 | 3300005713 | Tropical Forest Soil | MALLALLRLPFVMVFSEALGEKIFFCEDEHTKEALVEAGASEWSVYTKDELRTLVAQNRARPFTDDELGKVHEIKRIFQGRISR*
Ga0066903_1000100784 | 3300005764 | Tropical Forest Soil | MVYSQALQETIFFCQDEATKEALTEAGASEWSIYTKEELRILVAQNRVAPLADAELRRLSELKKTFSATINTE*
Ga0066903_1011861363 | 3300005764 | Tropical Forest Soil | LLSLLRLPFCMAYSETLEETIFLCQGEDTKTTLIEAGAEPWSVYTRAELQILVARNRIMPLTQAELRKVHEIKRTFGARIAE*
Ga0066903_1020483433 | 3300005764 | Tropical Forest Soil | DEHTKAALVEAGASEWSIYTRAELRTLVTQNRVRPFTDDELRKVHEIKRTFHGRITS*
Ga0066903_1065109553 | 3300005764 | Tropical Forest Soil | TWIEVFSERLGETVFFCEDEHTKAALVEAGASEWSIYTKAELRALVAHNRVKPFTDDELRKVHEIKRTFHGRITS*
Ga0066903_1075932251 | 3300005764 | Tropical Forest Soil | FLMVDSKAIGDLLFFSEDEDTKAALVQAGASRCSIYIKEEMRILVAQNRIAPLTHAELRKVHDIKRSFNARINTE*
Ga0066903_1084133771 | 3300005764 | Tropical Forest Soil | GETIFFCEDEPTKEALVEAGAEEWRIYTRDELRFLVAQNRVAPLSERDLKKAHTLKRTFNARIRE*
Ga0068862_1014258811 | 3300005844 | Switchgrass Rhizosphere | AALIEAGADEWSIYTRAELQILCEQNRVASLSPDELRKVHEIKRTFGSRIATDHGS*
Ga0066665_109159773 | 3300006796 | Soil | FFCEDEDTKAALIEAGASEWSIYTRDELRILCEQNRIKPFTQAELRKLQEIKRTLGARIAK*
Ga0126374_117522021 | 3300009792 | Tropical Forest Soil | SESLGETIFLCEDEATKEALVGVGASAWSIYTRPELRILMEQNRIAPLTLFELNLLHAFKRTFNARIAE*
Ga0126380_100316932 | 3300010043 | Tropical Forest Soil | MLLTLPFVIVYSKALGEIIFFCHDDDTKAALVEAGASEWSIYTKNELRTLIAQNRAKPFTDDELRKAHEIKRTFRGRIAS*
Ga0126373_104326761 | 3300010048 | Tropical Forest Soil | ALLQTNSVTWIEVYSERLGETIFFCQDEDTKEILIEAGAERWSVYTRDELQILVAQNRIAPLSEDDLKKVHTLKRTFGGRIAE*
Ga0126373_129073931 | 3300010048 | Tropical Forest Soil | FCEDEVTKEALVEDGAEPWSVYTRDELQILVAQNRVAPLSDAELRKVHDIKRTFSAKITE
Ga0126370_110922921 | 3300010358 | Tropical Forest Soil | LEALGETIVFCEGEDTKTTLIEAGAEPWSVYTRAELQVLVAQNRIKPLKQAELRKLHDIKRTFGARIAE*
Ga0126376_121663262 | 3300010359 | Tropical Forest Soil | MALLALLRLPFVMVFSEALGEKIFFCEDEHTKEALVEAGASEWSVYTKDELRTLVAQNRARPFTDDELGKVHEIKR
Ga0126377_100180876 | 3300010362 | Tropical Forest Soil | MLLMLPFVIVYSKALGEIIFFCHDDDTKAALVEAGASEWSIYTKNELRTLIAQNRAKPFTDDELRKAHEIKRTFRGRIAS*
Ga0126379_111168013 | 3300010366 | Tropical Forest Soil | WIELYSERLGETVFFCQDEETRDALIEAGASEWSIYTKSELRTLCVQNRVVPFSDAELRKLHEIKRTFQGRITT*
Ga0126383_116761831 | 3300010398 | Tropical Forest Soil | GRTWTKFYSERLGATVFFCEDEKTRDALVEAGASQWSIYTKAELRTLVAQNRVKPFTDAELSKVHELKRTFNARINTSFNIAL*
Ga0126383_127808092 | 3300010398 | Tropical Forest Soil | VMVFSEALGETIFFCQDEDTKVALVEAGAEPWSVYTRDELRILVAQNRIKPLTQAELRKVHDIKRTFGARIAE*
Ga0137378_101587155 | 3300012210 | Vadose Zone Soil | CENEDTKGALVEVGAEPWSIHTKDELQTLCEQNRIAPLTTAELRNLHEIKRTFHGRITK*
Ga0137371_102673251 | 3300012356 | Vadose Zone Soil | LLGLCKFPFTPEFRPILQETLFSAENHGTRAALVEAGAEPWRIYTKDELRVLVAQNRVAPLTAAELRKVHEIKRTFDGRITK*
Ga0157302_102168311 | 3300012915 | Soil | LGETIFFCENEATRAALVEAGAEPWRIYTKDELRVLVAQNRVAPLSPAELRKVHEIKRTFDGRITK*
Ga0157302_102774143 | 3300012915 | Soil | TIFFAEDDNTKGALVQAGAEPWSIYTRDELRILCEQNRIKPFTQAELRKLQEIKRTLGARIAK*
Ga0126375_120426412 | 3300012948 | Tropical Forest Soil | DEDTKAALVEAGASEWSVYTKQELRVLVGQNRIKPFLPDELRKVHEIRRTFHARITR*
Ga0126369_130868922 | 3300012971 | Tropical Forest Soil | MVFSERLGETILFCRDKDTKAALAEAGAEPWSIYTLDELQVLVAQNRIKPLSPHELQKIHELKTAFNAPITPK*
Ga0157374_107678462 | 3300013296 | Miscanthus Rhizosphere | CEDEATNAALIEAGADEWSIYTRAELQILCEQNRVASLSPDELRKVHEIKRTFGSRIATDHGS*
Ga0132255_1043386551 | 3300015374 | Arabidopsis Rhizosphere | LLALLRLPFVMVFSETLGETIFFSDDEDTKAALLEAGASEWSIYTKAELRTLIEQNRIAPLSSTELRRLHEIRKTFEGRITK*
Ga0182036_115535731 | 3300016270 | Soil | LDSGAVGELLFFCQDEETKEILIGNGAEPWSVYTRDELQILVAQNRVAPLSDDELRKVHEIKRTFGARITE
Ga0182036_116485832 | 3300016270 | Soil | YSERLGEDLLFCRDEDTKATLIEAGAEPWSVYTRAELRILVAQNRAAPLSQAELRKIHDIKRTFGARIAE
Ga0182041_102943021 | 3300016294 | Soil | LLPLLQTKGITWIEVYSERLGEDLLFCRDEDTKATLIEAGAEPWSVYTRAELQVLVAQNRVAPLSQAELKKVHQIKRTFGARIVE
Ga0182033_104404222 | 3300016319 | Soil | VGELLFFCQDEETKEILIGNGAEPWSVYTRDELQILVAQNRVAPLSDDELRKVHEIKRTFGARIT
Ga0182033_109848343 | 3300016319 | Soil | YSKRLNETIFFCHDEDTKEILIEAGAEPWSIYTRDELQILVTQNRIAPLSEDDLKKVHTLKRTFGARIAE
Ga0182035_102079332 | 3300016341 | Soil | VWYWVRTIFFCRHEDTKEILIEAGAEYWSTYTRNELLILVAQNRLKPLTQAELRKVHEIKRTFGAKIAE
Ga0182035_104244793 | 3300016341 | Soil | CRDEDTKAMLTEAGAEAWSIYTRAELQILVEQNRIEPLSRADLKKLHDIKRTFGARITK
Ga0182035_105009541 | 3300016341 | Soil | HLLPLLQLPFVMVFSEALGETIFFCQDENTKEALVEAGASGWSIYTKEELRILVVQNRIAPLTQAELRKIHDIKRTFGARIAE
Ga0182035_117494832 | 3300016341 | Soil | EDTKATLIEAGVEPWSIYTRSELQILLAQNRIEPLTLAELRKVHQIKRTFGARTAE
Ga0182032_105385411 | 3300016357 | Soil | DEDTKAALVEAGAEPWSIYTRAELAILVAQNRTKPFTQAELRKVHEIKRTFGGRITE
Ga0182034_103392651 | 3300016371 | Soil | HAVGELLFFCQDEDTKEALVEAGAEPWSVYTRAELRILVAQNRIKPLTQAELRKVHEIKRTFKARIAE
Ga0182034_105362803 | 3300016371 | Soil | LFCRDEDTKAALVEAAAEPWSIYTRAELQVLVAQNRAAPLSDDELRKLHQIKRTFGARIA
Ga0182034_110345341 | 3300016371 | Soil | MKTALTQAGASEWRIYTKEELRILVAQNRIKPLTQAELRKVHDIKRTFGARIAE
Ga0182034_118099092 | 3300016371 | Soil | FCRDEDTKATLIEAGAEPWSVYTRAELQVLVAQNRVAPLSQAELKKVHQIKRTFGARIVE
Ga0182040_108898751 | 3300016387 | Soil | TIFFCEDEDTKSVLIEAGAEPWSVYTRNELRTLVLQNRIKPLTQAELRKVHDIKRTFGARITE
Ga0182039_103010363 | 3300016422 | Soil | FFCQDEDTKAALVEAGASGWSIYTKEELRILVVQNRIAPLTQAELRKIHDIKRTFGARIA
Ga0182039_108440871 | 3300016422 | Soil | KQPLLALLGLPFVMVYSEALGERIFFCQDEDTKAALVEAGAEPWSIYTRAELAILVAQNRTKPFTQAELRKVREIKRTFGGRITE
Ga0182039_115345241 | 3300016422 | Soil | MVFSEALGETVFFCEDERTKEALVEAGAEPWSVYTRDELRILVAQNRIKPLTQAELKKVHQIKRTFGARITE
Ga0182038_112011232 | 3300016445 | Soil | VHSERLGEDLLFCRDEDTKATLIEAGAEPWSVYTRAELQVLVAQNRVAPLSQAELKKVHQIKRTFGARIVE
Ga0184618_103908491 | 3300018071 | Groundwater Sediment | TKAALIEAGASEWSIYTRDELRILCEQNRIKPFTQAELRKLQEIKRTLGARIAK
Ga0066655_102959911 | 3300018431 | Grasslands Soil | LPFVMVFSQILSETIFFCEDEDTKAALIEAGASEWSIYTRDELRILCEQNRIKPFTQAELRKLQEIKRTLGARIAK
Ga0066669_109808602 | 3300018482 | Grasslands Soil | IEAGASEWSIYTRDELRILCEQNRIKPFTQAELRKLQEIKRTLGARIAK
Ga0066669_110091002 | 3300018482 | Grasslands Soil | EDAAAALKEAGTSPWNLYTRNELRILMAQNRIAPLTVSELNLLHAFQRTFNARIAE
Ga0173482_104852811 | 3300019361 | Soil | KALEETVFFAADEDTKAALIEAGASEWSIYTKAELRVLVAQNRVAPLSPAELRKVHEIKRTFDGRITK
Ga0193756_10515012 | 3300019866 | Soil | IEAGASEWSIYTRNELRILCEQNRIKPFTQAELRKQEIKRTLGARIAK
Ga0193732_10480051 | 3300020012 | Soil | ALIEAGASEWSIYTRDELRILCEQNRIKPFTQAELRKQEIKRTLGARIAK
Ga0208475_10309911 | 3300027018 | Soil | QVASSRVEAGADEWSIYTRDELRILVTENRIAPFSDAELRKVHQMKRTFGGTIVE
Ga0268265_126671932 | 3300028380 | Switchgrass Rhizosphere | YSQAVEEMLFFCEDEATNAALIEAGADEWSIYTRAELQILCEQNRVASLSPDELRKVHEIKRTFGSRIATDHGS
Ga0075386_116390012 | 3300030916 | Soil | MVFSQILEEPIFFCEDEATKAALVEAGADSFSVYTKDELRILVAQNRVAPLSPDELRKVHEIKRTFNGRIAK
Ga0170824_1102532242 | 3300031231 | Forest Soil | AEPKSLRQPRRQSLPFVMVFSETLEETIFFCANDNTREWLIDAGADPFSIYTRDELRVLCEQNRVAPLSPDELRKVHEIKKTFDGRIAK
Ga0170824_1144317592 | 3300031231 | Forest Soil | LDLLRLPFVMVFSQILEEPIFFCEDEATKAALVEAGADSFSVYTKDELRILVAQNRVAPLSPDELRKVHEIKRTFNGRIAK
Ga0170824_1190705401 | 3300031231 | Forest Soil | IALLGLPFVMVFSETLKETIFFCDDEDTRNVLVEAGAEEWSIYTRDELRILCEQNRIAPLTQADLAKLYEIKRTFRGTITEP
Ga0170824_1245832552 | 3300031231 | Forest Soil | FVMVYSKALEQTVFFCEDEATKDALVNAGADEWSIYTKRELRQLIAQNRIAPISADELRKLHEIKRTFHARITPQ
Ga0170824_1248163591 | 3300031231 | Forest Soil | HIQALLQLGWVIVYSQTLNETVFFAEDEDTKAALIEAGASEWSIYTKDELRTLSEHNRIAPLSPTELHKVHEIKRTVSGRISS
Ga0170824_1263189181 | 3300031231 | Forest Soil | LVKAGVESWSIYTRDELQILVTQNRIVPFSDAELRKVHQMKRTFGGWITENDFE
Ga0170824_1285330742 | 3300031231 | Forest Soil | VETVFFCEDEDTKAALVEAGAYEWSIYTKDELRGLVVQNRIAPLSIAELRKVHEIKRTFHGTITE
Ga0170820_110223783 | 3300031446 | Forest Soil | IALLGLPFVMVFSETLKETIFFCDDEDTRNVLVEAGAEEWSIYTRDELRILCEQNRIAPLTQAELAKLYEIKRTFRGTITEP
Ga0170820_120200712 | 3300031446 | Forest Soil | VFFCADDDTRIALIEAGASEFAIYTRDELRILCEQNRVAPLSPAELRKVHEIKRTFSGRIAS
Ga0170820_170345712 | 3300031446 | Forest Soil | LPFVMVYSQILEETVFFCENEDTKAALIEAGAEEWSIYTRAELRILCEANRVAPLSATELKQLHQIKRTFSARIE
Ga0170818_1153329531 | 3300031474 | Forest Soil | LKLGWVMVFSQALQETVFFCDDEDTKAALIEAGADSFAIYTKDELRILVAENRVAPLSATELKQMHQIKRTFNARIK
Ga0310915_106817041 | 3300031573 | Soil | TLIEAGAEPWSVYTRDELQVLVAQNRVAPLSDDELRKIHEIKRTFGARITE
Ga0310915_111148431 | 3300031573 | Soil | GETIFFCQDEDTKAALVEAGAEPWSVYTRDELRVLVAQNRVAPLSHAELRKVHDIKRTFNARIAE
Ga0306917_106920601 | 3300031719 | Soil | AVLPLLQTKGITWIEVHSERLGEDLLFCRDEDTKAMLTEAGAEAWSIYTRAELQILVEQNRIEPLSRADLKKLHDIKRTFGARITK
Ga0306918_103892342 | 3300031744 | Soil | KSHLLPLLTLPFVMVFSEALEETIFFCEDEETKEALVEAGAEPWSIYTRNELRILVAQNRIAPLTQAELRKVHTLKRTFGARIAE
Ga0306919_106957921 | 3300031879 | Soil | DFVLVDSHAVGELLFFCQDEYTKEALVEAGAEPWSVYTRAELRILVAQNRIKPLTQAELRKVHEIKRTFKARIAE
Ga0306919_113744221 | 3300031879 | Soil | LIEAGAEPWSIYTRNELRILVAQNRVAPLTDDELRKVQTLKRTFGARIAEDR
Ga0306925_101562581 | 3300031890 | Soil | ETLEETIFFCEDEDTKSVLIEAGAEPWSVYTRNELRTLVLQNRIKPLTQAELRKVHDIKRTFGARITE
Ga0306925_102206973 | 3300031890 | Soil | SERIGETIFFCEDEDTKEALVEAGAEPWSIYTKAELRTLCAQNRIAPLSSSELRKLYEIKHTLNARIQD
Ga0306925_102515331 | 3300031890 | Soil | LGEDLLFCGDEDTKATLIVAGAEEWGIYTLAELQILVAQNRVAPLSDAELRKVHDIKRTFDARIV
Ga0306925_102525855 | 3300031890 | Soil | SKGRTWTEFYSERLGDTVFFCEDEKTRDALVEAGASQWSIYTKAELGTLVVQNRVKPFTDAELKKVHELKRTFNARINTSFNIAL
Ga0306925_103697651 | 3300031890 | Soil | MVFSEALGETVFFCEDERTKEALVEAGAEPWNIYTCNELRTLVLQNRIKHLTQAELRKVHNIKRTFGARITE
Ga0306923_113793352 | 3300031910 | Soil | KVYSEALGETIFFCEDEETKTALIEAGAEPWGIYTRSELQILVAQNRIKPLTQDELRKVQNIKRTFGARIVE
Ga0306921_1000525014 | 3300031912 | Soil | LLQTKGITWIEVHSERLGEDLLFCRDEDTKATLIEAGAEPWSVYTLDELQILVAQNRIKPLTQAELRKVHDIKRTFGARIVE
Ga0306921_100700962 | 3300031912 | Soil | MLTEAGAEAWSIYTRAELQILVEQNRIEPLSRADLKKLHDIKRTFGARITK
Ga0306921_115156271 | 3300031912 | Soil | HKWRLLSSLRLPFVMVFSEALGEMIFFCENEPTKTALVEAGASEWSVYTRAELQILVAQNRVAPLSDDELRKVHQIKRTFGARIAE
Ga0310912_109459961 | 3300031941 | Soil | VLVDSHAVGELLFFCQDEDTKEALVEAGAEPWSVYTRAELRVLVAQNRVAPLSHAELRKVHEIKRTFGARIAE
Ga0310916_102607453 | 3300031942 | Soil | IEVHSERLGEDLLFCGDEDTKATLIVAGAEEWGIYTLAELQILVAQNRVAPLSDAELRKVHDIKRTFDARIV
Ga0310916_116796261 | 3300031942 | Soil | DEDTKEALVEAGAEPWSVYTRDELRVLVAQNRVAPLSHAELRKVHDIKRTFGGRITE
Ga0310916_117090511 | 3300031942 | Soil | QEYKPALVWLLVSKGRTWTEFYSERLGDTVFFCEDEKTRDALVEAGASQWSIYTKAELQTLVAHNRVKPFTDAELSKVHELKRTFNARIGE
Ga0310913_101081071 | 3300031945 | Soil | DEDTKEILIEAGAEPWSIYTRDELQILVTQNRIAPLSEDDLKKVHTLKRTFGARIAE
Ga0310913_105781751 | 3300031945 | Soil | EVYSERLGETIFFCQDEDTKAALVEAGASGWSIYTKEELRILVVQNRIAPLTQAELRKIHDIKRTFGARIAE
Ga0310910_103444172 | 3300031946 | Soil | MVFSEALGETVFFCQDEDTKAALVEAGAEPWSVYTRDELRVLVAQNRVAPLSHAELRKVHDIKRTFGGRITE
Ga0310909_112269552 | 3300031947 | Soil | IFFCEDEETKEALVEAGAEPWSIYTRNELRILVAQNRIAPLTQAELRKVHTLKRTFGARIAE
Ga0306926_124524671 | 3300031954 | Soil | LGKTIFFCEDDATKAALVKAGAEPWSVYTRDELRILVAQNRIKPLTQAELRKVHQIKRTFGARIVE
Ga0306922_103636744 | 3300032001 | Soil | TKEILIEAGAEPWSIYTRDELQILVTQNRIAPLSEDDLKKVHTLKRTFGARIAE
Ga0306922_108098202 | 3300032001 | Soil | LGEDLLFCRDEDTKATLIEAGAEPWSVYTRAELQVLVAQNRVAPLSQAELKKVHQIKRTFGARIVE
Ga0306922_108405611 | 3300032001 | Soil | ALIEAGAEPWSVYTRAELQILTAQNRVEPLTQAELRKVHEIKRTFGARIAE
Ga0306922_115778781 | 3300032001 | Soil | TIFFCEDDATKAALVKAGAEPWSVYTRDELRILVAQNRIKPLTQAELRKVHDIKRTFGARITE
Ga0306922_119306641 | 3300032001 | Soil | MRKSTVWYWVRTIFFCRHEDTKEILIEAGAEYWSTYTRNELLILVAQNRLKPLTQAELRKVHEIKRTFGAKIAE
Ga0306922_123546751 | 3300032001 | Soil | TWIEVYSERIGESLFFCEDEETRDALIEAGASEWSIYTKAELRTLIAQNRIAPFSDAELRKVHEIKQTFSAKIAE
Ga0318532_102446161 | 3300032051 | Soil | TEAGAEAWSIYTRAELQILVEQNRIEPLSRADLKKLHDIKRTFGARITK
Ga0306920_1028170411 | 3300032261 | Soil | ATKGITWIEVHSERLGKTIFFCEDDATKAALVKAGAEPWSVYTRDELRILVAQNRIKPLTQAELRKVHQIKRTFGARIVE
Ga0310812_104252413 | 3300032421 | Soil | LVEAGADSWSIYTRDELQTLCVQNRVAPLTAAELRKVHEIKRAFDGRITK
Ga0310914_106400722 | 3300033289 | Soil | ETIFFCQHEDTKAALVEAGAEPWSVYTRDELRVLVAQNRVAPLSHAELRKVHDIKRTFGGRITE
Ga0310914_107553841 | 3300033289 | Soil | DSHAVGELLFFCQDEDTKEALVEAGAEPWSVYTRAELRILVAQNRIKPLTQAELRKVHEIKRTFKARIAE
Ga0318519_109808542 | 3300033290 | Soil | DERTKEALVEAGAEPWSVYTRDELRILVAQNRIKPLTQAELKKVHQIKRTFGARITE
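
The overview reports an average sequence length of 68 residues for this family. The page does not say how that figure is computed, so the following is only a sketch of one plausible calculation. It assumes the trailing `*` seen on some sequences is a stop-codon marker that should not be counted as a residue, and the function name is hypothetical:

```python
from statistics import mean


def mean_sequence_length(sequences):
    """Mean residue count, ignoring a trailing '*' (assumed stop marker)."""
    return mean(len(seq.rstrip("*")) for seq in sequences)


# Two sequences copied from the family table; the full family has 138.
example = [
    "IEAGASEWSIYTRDELRILCEQNRIKPFTQAELRKLQEIKRTLGARIAK",
    "DEDTKAGLQEAGASEWSIYTRDKLQILVAQNRIAPLGATQLRKLHQIKRAFNARFT*",
]
print(mean_sequence_length(example))
```

Running the function over all 138 sequences would show whether this convention reproduces the reported average.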




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice