NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F086970



Overview

Basic Information
Family ID F086970
Family Type Metagenome
Number of Sequences 110
Average Sequence Length 50 residues
Representative Sequence LKRAAVLAPGYGGTAEQPILRKLAAALDGFGIASRAITFRTRGSRPSKDYV
Number of Associated Samples 99
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 33.33 %
% of genes near scaffold ends (potentially truncated) 10.91 %
% of genes from short scaffolds (< 2000 bps) 10.00 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.62


Most Common Taxonomy
Group Unclassified (89.091 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (20.000 % of family members)
Environment Ontology (ENVO) Unclassified (35.455 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (54.545 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 15.19%, β-sheet: 10.13%, Coil/Unstructured: 74.68%

Predicted 3D Structure

pTM-score: 0.62

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
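
The cutoff quoted in the note can be expressed as a one-line check. This is only a minimal sketch: the function name `is_low_confidence` and the constant `PTM_CUTOFF` are illustrative assumptions, not part of NMPFamsDB, and only the 0.7 threshold and the 0.62 score come from this page.

```python
# Threshold taken from the note above ("pTM < 0.7"); the names are hypothetical.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score, cutoff=PTM_CUTOFF):
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F086970 on this page.
f086970_is_low = is_low_confidence(0.62)
```

With the reported score of 0.62 the check evaluates to true, which matches the "Low Quality Model" label.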



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 110 Family Scaffolds
PF01896 | DNA_primase_S | 25.45
PF04679 | DNA_ligase_A_C | 12.73
PF13298 | LigD_N | 7.27
PF05960 | DUF885 | 0.91
PF13407 | Peripla_BP_4 | 0.91
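
The frequencies in the table above are percentages of the 110 family scaffolds, so they can be converted back to approximate scaffold counts. The sketch below assumes each percentage is a whole-number count divided by 110 and rounded to two decimals. The page does not state this, and the helper name `percent_to_count` is hypothetical.

```python
# Assumption: each percentage is (scaffold count / 110) * 100, rounded to 2 decimals.
TOTAL_SCAFFOLDS = 110  # "Number of Associated Scaffolds" reported for this family

pfam_percent = {
    "PF01896": 25.45,
    "PF04679": 12.73,
    "PF13298": 7.27,
    "PF05960": 0.91,
    "PF13407": 0.91,
}


def percent_to_count(percent, total=TOTAL_SCAFFOLDS):
    """Nearest whole number of scaffolds implied by a rounded percentage."""
    return round(percent * total / 100)


pfam_counts = {pfam_id: percent_to_count(p) for pfam_id, p in pfam_percent.items()}

# Round-trip check: recompute the percentage from each inferred count.
round_trip = {
    pfam_id: round(100 * n / TOTAL_SCAFFOLDS, 2) for pfam_id, n in pfam_counts.items()
}
```

Under this assumption the inferred counts reproduce the listed percentages when converted back, which suggests the most frequent neighbor, DNA_primase_S, occurs on about 28 of the 110 scaffolds.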

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 110 Family Scaffolds
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 12.73
COG4805 | Uncharacterized conserved protein, DUF885 family | Function unknown [S] | 0.91



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 89.09 %
All Organisms | root | All Organisms | 10.91 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300005552|Ga0066701_10038446 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2552
3300006854|Ga0075425_101379759 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 798
3300007076|Ga0075435_100157005 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1914
3300009147|Ga0114129_12410200 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 630
3300010304|Ga0134088_10426108 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 649
3300010323|Ga0134086_10506491 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 501
3300010403|Ga0134123_10585912 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1069
3300012532|Ga0137373_10269372 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1367
3300012925|Ga0137419_11508580 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 569
3300018476|Ga0190274_13164277 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium GWC2_70_10 | 554
3300026296|Ga0209235_1104143 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1218
3300027907|Ga0207428_11182692 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 532




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 20.00%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 19.09%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.82%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 9.09%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.18%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.36%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.45%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.73%
Polar Desert Sand | Environmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand | 1.82%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.82%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.82%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 1.82%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.82%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.82%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.91%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.91%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 0.91%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.91%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.91%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.91%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.91%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001536 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A15-65cm-8A)- 1 week illumina | Environmental
3300002915 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental
3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005530 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006876 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 | Environmental
3300006894 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control | Environmental
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009807 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10 | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012186 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ416 (21.06) | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012530 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ85 (21.06) | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300014058 | Permafrost microbial communities from Nunavut, Canada - A3_65cm_0.25M | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300015193 | Arctic soil microbial communities from a glacier forefield, Rabots glacier, Tarfala, Sweden (Sample Rb6, proglacial stream) | Environmental
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018476 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 T | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026306 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes) | Environmental
3300026316 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes) | Environmental
3300026322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300027637 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 (SPAdes) | Environmental
3300027880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes) | Host-Associated
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated
3300028717 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158 | Environmental
3300028787 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381 | Environmental
3300028791 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144 | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental
3300028811 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300028876 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140 | Environmental
3300028885 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
A1565W1_111486432 | 3300001536 | Permafrost | VKRAAVLAPGYGGTAEQPILRKLKAALDGFGIASRAVTFRTS
JGI25387J43893_10667272 | 3300002915 | Grasslands Soil | LKRAAVLAPGYGGTAEQPILRKLASALGGFGIASQAVTFTTRGSRPSKDYV
Ga0066674_104277962 | 3300005166 | Soil | VIKRAAVLAPGYGGTADQPIVKKLASALGSFGIESRAVTFRTRGSRPSR
Ga0066676_102821732 | 3300005186 | Soil | LKRAAVLAPGYGGTAEQPILQKLGAALGGFGIASQAVTFRTRGSRPSKDY
Ga0070690_1007246561 | 3300005330 | Switchgrass Rhizosphere | VSRAAVLAPGYGGTATQPILRHLVKALAGHGIEALPITFRTRGKRPSREYESELDD
Ga0068869_1008788221 | 3300005334 | Miscanthus Rhizosphere | LTRAAVLAPGYGGTAEQGILRKLVRALAKFDIASSAITFRTSG
Ga0068868_1003274502 | 3300005338 | Miscanthus Rhizosphere | LTRAAVLAPGYGGTAEQGILRALVRALARFDIASSAITFRTSGKRPSRGYVSELEDL
Ga0070691_102439701 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | LTRAAVLAPGYGGTAEQGIVRALARALARFDIATGAITFRTSGKRPSRGYVS
Ga0070691_108477191 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | VSRAAVLAPGYGGTATQPILRHLVKALAGHGIEALPITFRTRGKRPSR
Ga0070694_1002986952 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | LKRAAVLAPGYGGTAEQPILRALASALTTFDIASRAITFRTRGSRPSKEYASEIEDLRTARD
Ga0070694_1005508581 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | LTRAAVLAPGYGGTAAQPILRALGIRLASYDIAIRAITFRTRGTRPSREYVSELEDLRS
Ga0066686_107118721 | 3300005446 | Soil | VLAPGYGGGERQPILRALAARLARDGIVSRPIAFSTRGARPSRAYERELA
Ga0066689_100084674 | 3300005447 | Soil | VVLAPGYGGDDRQPILRALAARLAKDGIAARAVTFSTRGSRP
Ga0070698_1005421481 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | LTRAAVLAPGYGGTAAQPILRALGIRLASYDIAIRAITFRTRGTRPSR
Ga0070679_1005219021 | 3300005530 | Corn Rhizosphere | LTRAAVLAPGYGGTAEQGILRALVRALARFDIASSAITFRT
Ga0070697_1000910372 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LTRAAVLAPGYGGTAEQGILRALVRALARFDIASSAITFR
Ga0070697_1017588602 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VTRAAVLAPGYGGTAEQGILRALARALGRFDIDSRAITFRTSGKRPSRGYVSEL
Ga0066697_104636691 | 3300005540 | Soil | MSARGAVLAPGYGGTERQPILAALRDALAPFEIATRAVTFSTRGSRPSP
Ga0070696_1000944992 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | LTRAAVLAPGYGGTAEQGIVRALARALARFDIATSAITFRTSGKR
Ga0066701_100384461 | 3300005552 | Soil | LKRAAVLAPGYGGTAEQPILRALASALKTFEIAAQAITFRTRGSRPSKDYASEIEDLRAARDAL
Ga0066695_106345371 | 3300005553 | Soil | LKRAAVLAPGYGGTAEQPILRKLGAALDGFGIASQAVTFGTRGSRPSKDY
Ga0066692_101182552 | 3300005555 | Soil | LTRAAVLAPGYGGTAEQPILRTLVKRLASYGIASHAVTFRTRGKRPSREY
Ga0066707_100446541 | 3300005556 | Soil | LKRAAVLAPGYGGTAEQPILRALASALKTFEIAGQAITFRTRGSRPSKDYASEIEDLRAA
Ga0066670_104826001 | 3300005560 | Soil | MTRGAVLAPGYGGGATQPILRSLAKALAGYGIESLPIEFATRGKRPSR
Ga0066693_101415942 | 3300005566 | Soil | VSGRAAVLAPGYGGTERQPILRKLATALAECGIVSRAVTFSTRGSRPSPRY
Ga0066705_105006441 | 3300005569 | Soil | LKRAAVLAPGYGGTAEQPILRALASALKTFEIAAHAI
Ga0070702_1011284952 | 3300005615 | Corn, Switchgrass And Miscanthus Rhizosphere | VSRAAVLAPGYGGTATQPILRHLVKALAGHGIEALPITFRTRGKRPSREYESELDDL
Ga0066656_100246783 | 3300006034 | Soil | VTRAAVLAPGYGGTAEQGILRALAPALRRFDIDSRAITFRT
Ga0066653_100913562 | 3300006791 | Soil | LTRAAVLAPGYGGTAEQPILRTLVKRLASYGIASHA
Ga0066658_105095292 | 3300006794 | Soil | LKRAAVLAPGYGGTAEQPILRALASALETFEIAAQAITFRTRGSRPSKDYASEIEDL
Ga0066659_118934941 | 3300006797 | Soil | LKRAAVLAPGYGGTAEQPILRALASALKTFDIAAQAITFRTRGSRPSKDYASEIEDLR
Ga0075425_1013797591 | 3300006854 | Populus Rhizosphere | VLAPGYGGTAEQPILRSLARALDAFGIASRAVTFRTRGSRPSRDYGSEIEDLRAARDALR
Ga0079217_107869981 | 3300006876 | Agricultural Soil | LTRAAVLAPGYGGTAEQPILRALARRLTEFDIASHAITFRTRGKRPSREYASEL
Ga0079215_116340352 | 3300006894 | Agricultural Soil | LTRAAVLAPGYGGTAAQPILRALARRLREFDIESQGITFRTRGRRPSREYALEL
Ga0075435_1001570052 | 3300007076 | Populus Rhizosphere | MTRAAVLAPGYGGTATQAILRSLTKALAGYNIDALPITFRTRGSRPSRDYADELADLRA
Ga0099827_118866742 | 3300009090 | Vadose Zone Soil | VTRRAAVLAPGYGGSERQPILRALGDALARYDIGSRAVRFSTRGARPSPKYA
Ga0114129_124102001 | 3300009147 | Populus Rhizosphere | LTRAAVLAPGYGGTAAQPILRALARRLRDFDIASHAITFRTRGKRPSREYASELEDLRAARDA
Ga0075423_123567521 | 3300009162 | Populus Rhizosphere | MTRRAAVLAPGYGGTERQPILRALGDALARYEIGSRAVRFSTRGSRPS
Ga0105061_11028891 | 3300009807 | Groundwater Sand | LTRAAVLAPGYGGTAEQPILRALARRLNEFDIASRAITFRTRGKRPSREY
Ga0134082_101019752 | 3300010303 | Grasslands Soil | LKRAAVLAPGYGGTAEQPILRALASALKTVEIAAHAITFRTRGSRPSKDYTSEI
Ga0134088_104261082 | 3300010304 | Grasslands Soil | VIKRAAVLAPGYGGTADQPIVKKLASALGSFGIESRAVTFRTRGSRPSREYALELDDLRAARDAL
Ga0134088_106904991 | 3300010304 | Grasslands Soil | LRRAAVLAPGYGGTAEQPILRKLAQALDSFGVASRA
Ga0134086_105064911 | 3300010323 | Grasslands Soil | MSGRAAVLAPGYGGTARQPILRALADALATYAIASRAVTFATRGSRPSAGYERELADLRVARDAL
Ga0134071_104802791 | 3300010336 | Grasslands Soil | VKRAAVLAPGYGGTAQQPILRALALALEQFGIASRAVTFGTRGKRPSKDYASEIEDL
Ga0134124_107471722 | 3300010397 | Terrestrial Soil | LKRAAVLAPGYGGTAEQPILRKLGAAIDEYGIASRAVTFRTRGVR
Ga0134123_105859122 | 3300010403 | Terrestrial Soil | LKRAAVLAPGYGGTAEQPILRKLGAALDEYGIASRAVTFRTRGLRPSKDYVSELDDLRAARDLFRS
Ga0137389_117120071 | 3300012096 | Vadose Zone Soil | VKRAAVLAPGYGRTAEQPILRKVGSALDAFGIATRAVTFT
Ga0136620_104563091 | 3300012186 | Polar Desert Sand | VTRAAVLAPGYGGTADQPILVALGEALGGYGIAARPITFTTRGRPSTAYASEIADVR
Ga0137364_100395191 | 3300012198 | Vadose Zone Soil | LTRAAVLAPGYGGTAEQPILRTLVKRLASYGIASH
Ga0137363_103277211 | 3300012202 | Vadose Zone Soil | LTRAAVLAPGYGGTAEQPILRTLAKRLASYGIASHAVTFRTRGKRPSREYATELDDLR
Ga0137399_107948122 | 3300012203 | Vadose Zone Soil | MRRAAVLAPGYGGTAEQPILRALASALKTFGIASRSVTFRTRGSRSSKGYAS
Ga0137374_107312341 | 3300012204 | Vadose Zone Soil | LTRAAVLAPGYGGTSTQPVLRALARRLASYDIASR
Ga0137376_104851922 | 3300012208 | Vadose Zone Soil | LKRAAVLAPGYGGTAEQPILRALTSALESFGIEPRAITFRTRGSRPSKDYVS
Ga0137376_110733072 | 3300012208 | Vadose Zone Soil | VTRAAVLAPGYGGGATQPILRALAKALAGYGIESLPMEFAT
Ga0137378_106700251 | 3300012210 | Vadose Zone Soil | LKRAAVLAPGYGGTAEQPILLALASALDRFGITSSAVTFGTRGKRPSKDYASEIEDLRVARD
Ga0137377_102808982 | 3300012211 | Vadose Zone Soil | LKRAAVLAPGYGGTAEQPILQKLGAALGGFGIASQAVTFRT
Ga0137386_104686501 | 3300012351 | Vadose Zone Soil | LTRAAVLAPGYGGTAEQRILRALARALKRFDIDSRAITFRTSGKRPSR
Ga0137385_101877341 | 3300012359 | Vadose Zone Soil | LKRAAVLAPGYGGTAEQPILQKLGAALGGFGIASHAVALRTR
Ga0137385_110811152 | 3300012359 | Vadose Zone Soil | LKRAAVLAPGYGGTAEQPILRKLGAALDGFGIASQAVTFRTRGTRPSRDY
Ga0137390_106996672 | 3300012363 | Vadose Zone Soil | LKRAAVLAPGYGGTADQPILRKLGAALDGFGIVSRAVTFRTRGTRPSKDYASEIEDLRAARD
Ga0136635_103365372 | 3300012530 | Polar Desert Sand | VRAAVLAPGYGGGAEQPILRAVASALEAEGIVARAMEFSTRGRRPSGGYTVEIAELRAARDVLRA
Ga0137373_102693721 | 3300012532 | Vadose Zone Soil | VTRAAVLAPGYGGTAEQGILRALARALGRCDIDSRAITFRTSGKRPRRGYVSELEDLRAARDA
Ga0137397_101461762 | 3300012685 | Vadose Zone Soil | VTRAAVLAPGYGGSAEQPILRALARRLTSYEIASRAVTFRTRGKRPSLEYATELEDLRAA
Ga0137396_101059741 | 3300012918 | Vadose Zone Soil | MRRAAVLAPGYGGTAEQPILRALASALTTFGIASRSVTFRTRGSRPSKDYAS
Ga0137419_115085801 | 3300012925 | Vadose Zone Soil | MRRAAVLAPGYGGTAEQPILRALASALKTFGIASRSVTFRTRGSRPSKDYASELEDLRVARDALG
Ga0137416_101334252 | 3300012927 | Vadose Zone Soil | LKRAAVLAPGYGGTAEQPILRRLGLALEGFGIASQAITFKTRGSRPSKDYVSEIEDL
Ga0137410_100692291 | 3300012944 | Vadose Zone Soil | LKRAAVLAPGYGGTAEQPILRKLAAALDGFGIASR
Ga0164303_112161182 | 3300012957 | Soil | LKLAAVLAPGYGGTAEQPILRKLGAAIDEYGIASRAVTFRTRGVRP
Ga0134076_104611522 | 3300012976 | Grasslands Soil | VIKRAAVLAPGYGGTAEQPILKKLASALASFGIEARAVTFRTHGSRPSKEYVLELDDLRAGR
Ga0134076_105334792 | 3300012976 | Grasslands Soil | MTRRAAVLAPGYGGTDRQPILRALGDALAAYGIGSRAIRFSTRGSRPSR
Ga0120149_11251742 | 3300014058 | Permafrost | VKRAAVLAPGYGGTAEQPILRKVEAALSAFGIASRAVTFR
Ga0134075_100474552 | 3300014154 | Grasslands Soil | VTRAAVLAPGYGGTAEQGILRALARALRRFDIDSRAITFRTRGTRPSRGYVSEL
Ga0167668_10347701 | 3300015193 | Glacier Forefield Soil | VKRAAVLAPGYGGTAEQPILRKVEAALDGFGIASRAVTFRTRGT
Ga0132257_1019857692 | 3300015373 | Arabidopsis Rhizosphere | VTRGAVLAPGYGGTATQPILRSLVKALAGYGIDALPITF
Ga0134069_11297862 | 3300017654 | Grasslands Soil | LKRAAVLAPGDGGTAEQPILQTLGTALDGFGIASQAVTFGTRGSRPSKDYVSEIEDLR
Ga0134083_102128912 | 3300017659 | Grasslands Soil | LKRAAVLAPGYGGTAEQPILRALASALKTFDIAAQAITFRTRGSRPSKDYAS
Ga0184619_101684572 | 3300018061 | Groundwater Sediment | VKRAAVLAPGYGGTAEQSILRALVRALRTFDIDSCAITFRTSGKRPSRGYVS
Ga0066655_110580202 | 3300018431 | Grasslands Soil | MSARGAVLAPGYGGTERQPILAALRDALAPFEIATRAVTFSTRGSRPSPDY
Ga0066667_121760522 | 3300018433 | Grasslands Soil | LKRAAVLAPGYRGTAEQPILRALASALKTFEIAAQAITFRTRGSRPSKDYASAGD
Ga0066662_105518142 | 3300018468 | Grasslands Soil | LKRAAVLAPGYGGTAEQPILQKLGAALGGFAIASQAVTFR
Ga0066662_114815771 | 3300018468 | Grasslands Soil | MSGRAAVLAPGYGGTARQPILRALADALATYAIASRAVTFATRGSRPSAGYEREL
Ga0190274_131642771 | 3300018476 | Soil | LTRAAVLAPGYGSTAAQPILRALGIRLQSYGIEANAITFSTRGKRPSRDYAAELSDLRSARD
Ga0193755_11413931 | 3300020004 | Soil | VKRAAVLAPGYGGTAQQPILRALASALDRFGITTRAVTFGTRGKRPSKEYASEIEDLR
Ga0210382_100196861 | 3300021080 | Groundwater Sediment | LTRAAVLAPGYGGTAAQPILRAVGMRLAQYDIAILAISFRTRGQRPSRDYFSELE
Ga0222622_107437641 | 3300022756 | Groundwater Sediment | LKRAAVLAPGYGGTAQQPILRRLASALDGFGIASRAVTFRTSGSRPSKDY
Ga0222622_113686421 | 3300022756 | Groundwater Sediment | LKRAAVLAPGYGGTAEQPILRKLAAALDGFGIASRAITFRTRGSRPSKDYV
Ga0207707_105995062 | 3300025912 | Corn Rhizosphere | VSRAAVLAPGYGGTATQPILRHLVKALAGHGIEALPITFRTRGKRPSREYESEL
Ga0209235_10515901 | 3300026296 | Grasslands Soil | LKRAAVPAPGYGDTAEQPILRDLASALERFGIAARAVTFRTRGSWPSKDYLSEIEDL
Ga0209235_11041432 | 3300026296 | Grasslands Soil | VIKRAAVLAPGYGGTADQPIVKKLASALGSFGIESRAVTFRTRGSRPSRDYALELDDLRAARDALG
Ga0209468_11864351 | 3300026306 | Soil | LKRAAVLAPGYGGTAEQPILRALASALKTFDIAAQAIT
Ga0209155_10854631 | 3300026316 | Soil | LRRAAVLAPGYGGTAEQPILRKLGAALEGFGITSKA
Ga0209687_12904451 | 3300026322 | Soil | MSARGAVLAPGYGGTERQPILAALRDALAPFEIATRAVTFSTRGSRPSPDYARELDDLR
Ga0209158_10301972 | 3300026333 | Soil | LKRAAVLAPGYGGTAEQPILRALASALKTFEIAARAITFRTRGSR
Ga0209377_10887012 | 3300026334 | Soil | VIKRAAVLAPGYGGTADQPIVKKLASALGSFGIESRAVTFRTRGSRPSREYALELE
Ga0209057_11209542 | 3300026342 | Soil | MSARGAVLAPGYGGTERQPILAALRDALAPFEIATRAVTFSTRGSRPSPGYA
Ga0209376_11733172 | 3300026540 | Soil | LKRAAVLAPGYGGTAEQPILRALTSALESFGIEPRAITFRTRGSRPSKDY
Ga0209818_11984821 | 3300027637 | Agricultural Soil | LTRAAVLAPGYGGTAEQPILRALARRLTEFDIASHAITFRTR
Ga0209481_103372761 | 3300027880 | Populus Rhizosphere | LTRAAVLAPGYGGTAAQPILRALARRLRDFDIASHAITFRTRGK
Ga0209590_104395431 | 3300027882 | Vadose Zone Soil | MTRRAAVLAPGYGGTDRQPILRALGDALAAYGIGSRAIRFSTRGSRP
Ga0207428_111826922 | 3300027907 | Populus Rhizosphere | LTRAAVLAPGYGGTAEQGILRKLVRALAKFDIASSAITFRTSGKRPSRGYVSELEDLRAARVA
Ga0307298_102769461 | 3300028717 | Soil | LTRAAVLAPGYGGTAEQGILRALVRALAKFDIASSAITFRTSGKRP
Ga0307323_103227732 | 3300028787 | Soil | VSRAAVLAPGYGGTADQGILRALARALRTFDIDSRAIT
Ga0307290_103060162 | 3300028791 | Soil | VTKRGAVLAPGYGGTADQPILKKLAAALASFGIES
Ga0307504_102553622 | 3300028792 | Soil | VKRAAVLAPGYGGTAEQPILRKLGAALDGFGIASRAVTFGTRG
Ga0307504_103533922 | 3300028792 | Soil | VKRAAVLAPGYGGSSTQPILRNLTKALMVYGIDAL
Ga0307284_102729601 | 3300028799 | Soil | VSRAAVLAPGYGGTATQPILRALTKALAGHGIEALPITFRTRGKR
Ga0307292_102569652 | 3300028811 | Soil | LKRAAVLAPGYGGTAEQPILRRLGSALEAFGITSQAITFRTSGS
Ga0307312_111683272 | 3300028828 | Soil | VKRAAVLAPGYGGSADQPILRKLGAALDGFAIASRAVTFTTRGSRPSKDYASEIDDLRAA
Ga0307286_104002452 | 3300028876 | Soil | LTRAAVLAPGYGGTAEQGILRALVRALAKFDIASSAITFRSSGKRPSRGYVSELEDLRAA
Ga0307304_105518281 | 3300028885 | Soil | VKRAAVLAPGYGGSADQPILRKLGAALDGFAIASRAVTFTTRGSR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice