NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F090161

Metagenome Family F090161

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F090161
Family Type Metagenome
Number of Sequences 108
Average Sequence Length 69 residues
Representative Sequence MADVQDEIELLQIEIEKANAAIAKALRDRRALHKDSPLVADFNAAVTAARQALIAVEIKLRTLYESKGD
Number of Associated Samples 65
Number of Associated Scaffolds 108

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 69.57 %
% of genes near scaffold ends (potentially truncated) 22.22 %
% of genes from short scaffolds (< 2000 bps) 78.70 %
Associated GOLD sequencing projects 61
AlphaFold2 3D model prediction Yes
3D model pTM-score0.67

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (81.481 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere
(12.963 % of family members)
Environment Ontology (ENVO) Unclassified
(37.963 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(49.074 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 60.82%    β-sheet: 0.00%    Coil/Unstructured: 39.18%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.67
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 108 Family Scaffolds
PF00313CSD 3.70
PF12704MacB_PCD 1.85
PF06182ABC2_membrane_6 0.93
PF07690MFS_1 0.93
PF01839FG-GAP 0.93
PF02321OEP 0.93
PF13304AAA_21 0.93
PF08448PAS_4 0.93
PF13414TPR_11 0.93
PF07592DDE_Tnp_ISAZ013 0.93
PF13517FG-GAP_3 0.93
PF02371Transposase_20 0.93
PF05050Methyltransf_21 0.93
PF12796Ank_2 0.93
PF01966HD 0.93
PF01019G_glu_transpept 0.93
PF13432TPR_16 0.93
PF00106adh_short 0.93
PF00072Response_reg 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 108 Family Scaffolds
COG1538Outer membrane protein TolCCell wall/membrane/envelope biogenesis [M] 1.85
COG0405Gamma-glutamyltranspeptidaseAmino acid transport and metabolism [E] 0.93
COG3547TransposaseMobilome: prophages, transposons [X] 0.93
COG3694ABC-type uncharacterized transport system, permease componentGeneral function prediction only [R] 0.93


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A81.48 %
All OrganismsrootAll Organisms18.52 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000956|JGI10216J12902_110504906All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales1272Open in IMG/M
3300004114|Ga0062593_102286637Not Available608Open in IMG/M
3300004114|Ga0062593_103302641Not Available517Open in IMG/M
3300004157|Ga0062590_100718114Not Available903Open in IMG/M
3300004643|Ga0062591_100691457Not Available920Open in IMG/M
3300004643|Ga0062591_101379226Not Available698Open in IMG/M
3300004643|Ga0062591_101783842Not Available627Open in IMG/M
3300004643|Ga0062591_102545725All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → Pseudomonas putida group → Pseudomonas putida538Open in IMG/M
3300005093|Ga0062594_103051667Not Available523Open in IMG/M
3300005295|Ga0065707_10320323Not Available931Open in IMG/M
3300005332|Ga0066388_101045722Not Available1374Open in IMG/M
3300005332|Ga0066388_101930841Not Available1055Open in IMG/M
3300005332|Ga0066388_102567651Not Available927Open in IMG/M
3300005434|Ga0070709_10660479Not Available810Open in IMG/M
3300005436|Ga0070713_100495963Not Available1152Open in IMG/M
3300005444|Ga0070694_101730970Not Available532Open in IMG/M
3300005458|Ga0070681_10860671All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Pirellulaceae → Anatilimnocola824Open in IMG/M
3300005458|Ga0070681_10946859Not Available780Open in IMG/M
3300005529|Ga0070741_10130198All Organisms → cellular organisms → Bacteria → PVC group2574Open in IMG/M
3300005553|Ga0066695_10907632Not Available503Open in IMG/M
3300005576|Ga0066708_10897349All Organisms → cellular organisms → Bacteria553Open in IMG/M
3300006032|Ga0066696_10081952All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes1897Open in IMG/M
3300006794|Ga0066658_10600052Not Available600Open in IMG/M
3300006797|Ga0066659_11255550Not Available618Open in IMG/M
3300006904|Ga0075424_101253760Not Available789Open in IMG/M
3300006904|Ga0075424_102391338Not Available554Open in IMG/M
3300006914|Ga0075436_100204468Not Available1399Open in IMG/M
3300009012|Ga0066710_100304628All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → unclassified Pirellulales → Pirellulales bacterium2336Open in IMG/M
3300009012|Ga0066710_104303647Not Available532Open in IMG/M
3300009094|Ga0111539_10508088Not Available1404Open in IMG/M
3300009137|Ga0066709_101186206All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Pirellulaceae → Anatilimnocola1124Open in IMG/M
3300009137|Ga0066709_103270623Not Available590Open in IMG/M
3300009162|Ga0075423_12711004Not Available542Open in IMG/M
3300009551|Ga0105238_10640157All Organisms → cellular organisms → Bacteria1073Open in IMG/M
3300009609|Ga0105347_1186120Not Available828Open in IMG/M
3300009609|Ga0105347_1192968All Organisms → cellular organisms → Bacteria816Open in IMG/M
3300009678|Ga0105252_10246784Not Available780Open in IMG/M
3300010360|Ga0126372_10581506Not Available1070Open in IMG/M
3300010375|Ga0105239_13240311Not Available530Open in IMG/M
3300010397|Ga0134124_10490475Not Available1186Open in IMG/M
3300010401|Ga0134121_12150588Not Available594Open in IMG/M
3300011415|Ga0137325_1069902Not Available769Open in IMG/M
3300011439|Ga0137432_1086782Not Available969Open in IMG/M
3300011445|Ga0137427_10013333All Organisms → cellular organisms → Bacteria → Proteobacteria3255Open in IMG/M
3300012206|Ga0137380_11636144Not Available528Open in IMG/M
3300012212|Ga0150985_101988147Not Available1160Open in IMG/M
3300012212|Ga0150985_104578225Not Available802Open in IMG/M
3300012212|Ga0150985_107238256Not Available1177Open in IMG/M
3300012212|Ga0150985_108962803Not Available518Open in IMG/M
3300012212|Ga0150985_109414794Not Available788Open in IMG/M
3300012212|Ga0150985_111529847Not Available648Open in IMG/M
3300012212|Ga0150985_113505610Not Available2184Open in IMG/M
3300012212|Ga0150985_114162485All Organisms → cellular organisms → Bacteria1690Open in IMG/M
3300012212|Ga0150985_114216279Not Available556Open in IMG/M
3300012212|Ga0150985_115539012All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes941Open in IMG/M
3300012212|Ga0150985_119659423Not Available1239Open in IMG/M
3300012212|Ga0150985_119905272Not Available593Open in IMG/M
3300012212|Ga0150985_121237068Not Available1240Open in IMG/M
3300012469|Ga0150984_100690029Not Available613Open in IMG/M
3300012469|Ga0150984_102758343Not Available662Open in IMG/M
3300012469|Ga0150984_102912498Not Available573Open in IMG/M
3300012469|Ga0150984_105759732Not Available1932Open in IMG/M
3300012469|Ga0150984_110672344All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Pirellulaceae2540Open in IMG/M
3300012469|Ga0150984_117474903Not Available886Open in IMG/M
3300012469|Ga0150984_122369432Not Available503Open in IMG/M
3300012986|Ga0164304_10492894Not Available893Open in IMG/M
3300012987|Ga0164307_11916648Not Available502Open in IMG/M
3300012988|Ga0164306_10261841All Organisms → cellular organisms → Bacteria1245Open in IMG/M
3300013296|Ga0157374_11099301Not Available815Open in IMG/M
3300013308|Ga0157375_11806417Not Available725Open in IMG/M
3300015053|Ga0137405_1346682All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes1783Open in IMG/M
3300015371|Ga0132258_10492560All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae3063Open in IMG/M
3300015371|Ga0132258_11863811Not Available1515Open in IMG/M
3300015371|Ga0132258_12595061Not Available1266Open in IMG/M
3300015371|Ga0132258_13161144Not Available1137Open in IMG/M
3300015371|Ga0132258_13269395Not Available1116Open in IMG/M
3300015374|Ga0132255_104944751Not Available564Open in IMG/M
3300018083|Ga0184628_10382839Not Available736Open in IMG/M
3300018084|Ga0184629_10685631Not Available517Open in IMG/M
3300018468|Ga0066662_11051914All Organisms → cellular organisms → Bacteria810Open in IMG/M
3300021082|Ga0210380_10052767Not Available1757Open in IMG/M
3300025928|Ga0207700_10444626Not Available1142Open in IMG/M
3300026041|Ga0207639_11667993All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Pirellulaceae → Anatilimnocola597Open in IMG/M
3300031716|Ga0310813_10076176All Organisms → cellular organisms → Bacteria2546Open in IMG/M
3300031716|Ga0310813_10317020Not Available1320Open in IMG/M
3300031716|Ga0310813_10456226Not Available1110Open in IMG/M
3300031716|Ga0310813_11037585Not Available749Open in IMG/M
3300031716|Ga0310813_11543736Not Available619Open in IMG/M
3300031938|Ga0308175_103286356Not Available501Open in IMG/M
3300033412|Ga0310810_11051496Not Available687Open in IMG/M
3300034147|Ga0364925_0274026Not Available629Open in IMG/M
3300034147|Ga0364925_0414109Not Available512Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere12.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil10.19%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil9.26%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil8.33%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere6.48%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere6.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil5.56%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.63%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.63%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.63%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.78%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.78%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil1.85%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.85%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere1.85%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.85%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.85%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.85%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.85%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.85%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.93%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.93%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.93%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.93%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.93%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.93%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300009609Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890EnvironmentalOpen in IMG/M
3300009610Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700EnvironmentalOpen in IMG/M
3300009678Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011398Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT600_2EnvironmentalOpen in IMG/M
3300011415Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT469_2EnvironmentalOpen in IMG/M
3300011439Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT820_2EnvironmentalOpen in IMG/M
3300011445Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2EnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018083Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026041Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027533Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700 (SPAdes)EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300034147Sediment microbial communities from East River floodplain, Colorado, United States - 44_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10216J12902_11050490623300000956SoilVADVQSEIDLLQVEIQRANSAITEALRKRKELAKDSPLLPDFNAAVVAARLALTEVEVKIRKLYESKDDE*
Ga0062593_10228663713300004114SoilMDDIQHEIELLQVEIKNANAVIDKAIRERRATAKDSPLMADVNAAVVAAREALAAVEIKLRALYERKGDD*
Ga0062593_10330264113300004114SoilMTDVQAEIGLLQIEIQKANDVITKAISNRRAIGKDSPLAADAQAAVVAARDALAAIEIKLRMLYESKPD*
Ga0062590_10071811423300004157SoilMADVQDEIELLQIEIAKANAAIAKALRDRRAVNKDSPLVADMNSAVVAAKEALAAVEVKLRALYESKSED*
Ga0062590_10214355413300004157SoilMNYPQNELVAQALVPRMNDEQQEIESLQIEIKNANAAIDKAIRDRRGISKDSPLLVDSNAAVAAAREALAAAELRLRSVYERKGDD*
Ga0062595_10115306313300004479SoilMNDEQQEIELLQVEMRSASAAIDKAIRDRRGIAKDSPLLADVNAAVATARAALAAVELRLRNVYERKGDD*
Ga0062595_10157904623300004479SoilMSDQQEIESLQVEIKNASAAIDKAIRDRRGISKDSPLLADSNAAVAAAREALAAAELRLRGVYERKSDD*
Ga0062591_10069145733300004643SoilMADVQDEIELLQIDIEKANAAISKALRDRRAINKDSPLVADFNAAVTAARQALAGVEAKL
Ga0062591_10137922623300004643SoilMTDVQTEIGLLQIEIQKANDVIAKAINNRRAIGKESPLAADAQAAVVAARNALAAVEIKLRMLYESK
Ga0062591_10178384213300004643SoilMADIQAEIELLQIEIQKANDVISKAISNRRAIGKDSPLAADANAAVAAAREALATVEIKLRML
Ga0062591_10254572513300004643SoilRGSELRRANKNKREPVADIQAEIELLQIEMQKANEIITKAIANRRAIGKDSPLAADANAAVTAARYALAAVEIKLRMLYESKND*
Ga0062594_10305166713300005093SoilVYFEVKGSPAPRGQRMADVQDEIELLQIEIAKANAAIAKALRDRRAVNKDSPLVADMNSAVVAAKEALAAVEVKLRALYESKSED*
Ga0065707_1032032313300005295Switchgrass RhizosphereMADIQAEVELLQIEIQKANDVISKAISNRRAIGKDSPLAADANAAVAAAREALATVEIKLRMLYESKPD*
Ga0066388_10104572233300005332Tropical Forest SoilMADIQDDIELLQIEIAKANAAIAKAVRERRAVNKDSPLVADLNAAVVAAREALTAVEVKLRKLYESKGDD*
Ga0066388_10193084123300005332Tropical Forest SoilMDDIQHEIELLQVEIKNANAVIEKAIRERRATAKDSPLMADLNAAVVAAREALAAVEIRLRALYERKGDNHDNSLGNT*
Ga0066388_10256765133300005332Tropical Forest SoilVTLSESDHTRKAIMADVQDEIALLQIESLKANAEIAKAIRERRAMNKDSPLIADCNAAVSAARQALADVEVKLRKLYESKSDD*
Ga0070709_1066047933300005434Corn, Switchgrass And Miscanthus RhizosphereMPDVQDEIEALQAEIEKANAVIAKALRDRRALSKDSPLVADCNAAVAAARQALTVVEMKLRALYE
Ga0070713_10049596323300005436Corn, Switchgrass And Miscanthus RhizosphereMPDVQDEIEALQVEIEKANAVIAKALRERRALSKDSPLVADCNAAVAAARQALAVVEIKLRALYESKDD*
Ga0070710_1029943113300005437Corn, Switchgrass And Miscanthus RhizosphereKANAVIAKALRERRALSKDSPLVADCNAAVAAARQALAVVEIKLRALYESKDD*
Ga0070694_10173097013300005444Corn, Switchgrass And Miscanthus RhizosphereMDDIQHEIELLQVEIASANAVIEKAIRERRATAKDSPLLADVNAAVVAARESLAAVEMKLRALYERKGDD*
Ga0070681_1086067113300005458Corn RhizosphereDNGMGDVQDEIELLQTEIGKANAAIAKALRERRALNKDSPLVDDFNAAVVTARQALVSVEIKLRQLYESKGD*
Ga0070681_1094685923300005458Corn RhizosphereMDDIQHEIELLQVEIASANAVIEKAIRERRATAKDSPLMADVNAAVVAAREALAAVEIKLRALYERKGDD*
Ga0070741_1013019853300005529Surface SoilMDVQDEIELLQIEIQKANAVIAEALRRRKAVAKDSTLAGECNAAVVAAREALAKVEIKLRALYESKED*
Ga0066695_1090763223300005553SoilMDVQDEIELLQVEIDKANAAIAKALKDRRAVNKDSPLVADCNAAVVAARQALTAVEVKLRKLYESKED*
Ga0066708_1089734913300005576SoilMDDIQHEIELLQVEIKSANAIIEKAIRERRATAKDSPLIADLNAAVAAARAALAAVEIKLRALYERKSDD*
Ga0066696_1008195243300006032SoilMANVQDEIELLQVEIEKANAAIAKALRDRRALHKDSPLVADFNAAVTAARQALIAVEIKLRTLYESKGD*
Ga0066658_1060005223300006794SoilQSEIELLQIEIQKANVVISKAISNRRAIGKDSPLAADANAAVAAAREALAAVEIKLRTLYESKPD*
Ga0066659_1125555023300006797SoilMADVQDEIELLQIEIEKANAAIAKALRDRRALHKDSPLVADFNAAVTAARQALIAVEIKLRTLYESKGD*
Ga0075424_10125376023300006904Populus RhizosphereMDNIQDEIELLQIEIQQANAAIDKALRDRRALNKDSPLIADFNAAVASARQALASVEVKLRRLYESKGDD*
Ga0075424_10239133823300006904Populus RhizosphereMDVQEEIELLQLEIDKANAAIAKALRDRRAVNKDSPLVADCNAAVVAARQALTVVEVKLRKLYESKED*
Ga0075436_10020446823300006914Populus RhizosphereMDVQDEIELLQVEIDKANAAITKALKDRRAVNKDSPLVADCNAAVVAARQALTAVEVKLRKLYESKED*
Ga0066710_10030462823300009012Grasslands SoilMADVQDEIELLQIEIQKANAVIVKAIRDRRAVNKDSPLVADFNAAVAAARQALNAVEVKLRKLYESKED
Ga0066710_10430364723300009012Grasslands SoilMADVQDEIELLQVEIQRANAAIAKALRDRRALNKDSPLIADFNAAVVSARQALASVEVKLRKLYESKGDD
Ga0111539_1050808833300009094Populus RhizosphereMDDIQDEIELLQIEIANANAAIAKAVRDRRAVNKDSPLAADCTAAVLAAREALTAVEVKLRKLYESKDDE*
Ga0105245_1296160113300009098Miscanthus RhizosphereQQELNVQQEIELLQVEIRNANAAIDKAIRNRRGISKDSPLLADSNATVVAARAALAAVELRLRNAYERKGDE*
Ga0066709_10118620623300009137Grasslands SoilEIRNANAALDKAVRDRRAINKDSPLIADFNAAVSAARQALIAVEVKLRTLYESKDDG*
Ga0066709_10327062323300009137Grasslands SoilMADVQDEIELLQVEIQRANAAIAKALRDRRALNKDSPLIADFNAAVVSARQALASVEVKLRKLYESKGDD*
Ga0075423_1271100423300009162Populus RhizosphereVQDQIELLQIEITRANAAITKAVSDRRAVSKDSPLVADLNAAIVAAREALTAVEVKLRKLYESKSDD*
Ga0105238_1064015733300009551Corn RhizosphereMTDVQAEIGLLQIEIQKANGVISKAISDRRAIGKDSPLAAEANAAVAAAREALAAVEIKLRMLYESKPD*
Ga0105347_118612013300009609SoilMDDVQDQIALLQVEIEKANAAIAKAVRERRAINKDSPLIADFQAAVATAKQALHNAEVKLRALYESKED*
Ga0105347_119296823300009609SoilMRDATAPQGNMMADVQDEIALLQIDIEKANAAIAKALRDRRATNKDSPLVADFNAAVTAARQALAGVEAKLRILYESKSGD*
Ga0105340_107796513300009610SoilMADLQDEMSLVEQEIEQANAALAKAVRERRSAKGDSPLVAALEAAVVAARNALTDAEVKLRKLYESKDDGG*
Ga0105252_1024678413300009678SoilMDDVQDQIALLQVEIEKANAAITKAVRERRAINKDSPLIADFQAAVATAKQALHNAEVKLRALYESK
Ga0126372_1058150623300010360Tropical Forest SoilMADVQDEIALLQIESLKANAEIAKAIRERRAMNKDSPLIADCNAAVSAARQALADVEVKLRKLYESKSDD*
Ga0105239_1324031123300010375Corn RhizosphereMGDVQDEIELLQTEIGKANAAIAKALRERRALNKDSPLVDDFNAAVVTARQALVSVEIKLRQLYESKGD*
Ga0134124_1049047533300010397Terrestrial SoilMADIQDEIELLKIEIQKANVVITKAISNRRTIGKDSPLASDANTAVTAARHALAAVEIKLRMLYERKDS*
Ga0134121_1215058813300010401Terrestrial SoilMDVQDEIELLRIEIQKANAAIVQAQRERRAVQKDSPLVADLNAAVVAARQALNAVEVKLRKLYESKED*
Ga0137348_100553343300011398SoilMADLQDEMSLVEQEIEQANAALAKAVRERRSAKGDSPLVAALEAAVVAARNALTDAEVKLRKLY
Ga0137325_106990233300011415SoilQIALLQVEIEKANAAITKAVRERRAINKDSPLIADFQAAVATAKQALHNAEVKLRALYESKED*
Ga0137432_108678223300011439SoilMDDVQDQIALLQVEIEKANAAITKAVRERRAINKDSPLIADFQAAVATAKQALHNAEVKLRALYESKED*
Ga0137427_1001333343300011445SoilMRDPTTHKGNIMADVQDEIQLLQIDIEKANAAIAKALRDRRAINKDSPLVADFNAAVTAARQALASVEAKLRILYESKSGGD*
Ga0137380_1163614423300012206Vadose Zone SoilMADVQDEIELLQVEIEKANAAIAKALRDRRALHKDSPLVADFNAAVTAARQALIAVEIKL
Ga0150985_10198814733300012212Avena Fatua RhizosphereELLQNEIQKANAVISKAINNRRAIGKDHPLAAEANAAVAMAREALAAIEIKLRTLYETKPD*
Ga0150985_10457822523300012212Avena Fatua RhizosphereLAASTGGNAMVDVQHEIELLQIEIAKANAEIAKALRERRAVAKDSPLVTDLNAAVVAARQSLVSVEVKLRKLYESKDED*
Ga0150985_10723825623300012212Avena Fatua RhizosphereMADVQDEIELLRLEIEKANATIAKALRDRRALQKDSPLVADCNAAVTAARQALTVVEMKLRALYESKGD*
Ga0150985_10896280313300012212Avena Fatua RhizosphereEIELLQLEIGKANAAIAQALRNRRAIDKNSPLVADCNAAVTSARQALSLVEIKLRTLYESKGD*
Ga0150985_10941479423300012212Avena Fatua RhizosphereMVDVQHEIELLQIEIAKANAEIAKALRERRAVAKDSPLVTDLNAAVVAARQSLVSVEVKLRKLYESKDED*
Ga0150985_11152984723300012212Avena Fatua RhizosphereVADVQSEIELLQIEIKQANDVITKALKDRRAVAKDSPYVAELNAAVTAARQALNSVEVKLRKLYESKEA*
Ga0150985_11350561053300012212Avena Fatua RhizosphereMADVQAEIELLQIEIASANAAIAKAMRDRRAASKDSPLVADLNAAVVAARQALHAVEVKLRNLYESKGED*
Ga0150985_11416248543300012212Avena Fatua RhizosphereMADIEAEIELLQIEIQKANDAINKAISNRRAIGKDSPLTADANTAVAAAREALAAVEIKLRVLYESKPE*
Ga0150985_11421627923300012212Avena Fatua RhizosphereMAQVQDEIELLQAEILKANEVIAASIKNRRAIAKDSPLAADAEAAVVKARQALSSVELKIRALYESKDD*
Ga0150985_11553901213300012212Avena Fatua RhizosphereMENVQSEIELLQDEIKRVNAVIDKAIRERRASGKESPLVPDLNATIATAREALAVVELKLRKLYESRGDD*
Ga0150985_11857533323300012212Avena Fatua RhizosphereMDHDQQAIESLQAEIKSVSATIDKAIRDRRSISKDSPLLADSNAAVAAARQALAAVELRLRSVYERKSDD*
Ga0150985_11965942323300012212Avena Fatua RhizosphereMADVQDEIELLQIEIAKANAAIAKAVRDRRAVNKDSPLVTDFNAAVVAARQALVAVEVKLRKLYESKADD*
Ga0150985_11990527213300012212Avena Fatua RhizosphereMAHVQDEIELLQAEILKANEVIAASIKNRRAIAKDSPLAADAEAAVVKARQALVSVELKVRALYESKGD*
Ga0150985_12123706833300012212Avena Fatua RhizosphereMADIQDEIELLHAEIEKANAVIAKALRDRRALSKDSPLVADCNAAVTAARQALMAVEIKLRTLYESKGD*
Ga0150984_10069002923300012469Avena Fatua RhizosphereVERMADIQSEIELLQIEIQKANEVISKAISNRRAIGKDSPLAAEANAAVATAREALAAVEIKLRMLYESKPD*
Ga0150984_10275834313300012469Avena Fatua RhizosphereMELQDEIELLHGEIKKANEVIAEALRRRKAVAKDSPLVADCNAAVAAARDALAKVEIKLRALYESK
Ga0150984_10291249813300012469Avena Fatua RhizosphereMADIQAEIELLQIEIQKANAVISKAISNRRAIGKDSPLAADANAAVAAAREALATVEIKLRTLYESKPD*
Ga0150984_10575973253300012469Avena Fatua RhizosphereMADVQAEIELLQIEIASANAAIAKAMRDRRAASKDSPLVADLNAAVVAARQALHAVEVKLRNLYESKGDD*
Ga0150984_11067234463300012469Avena Fatua RhizosphereMEVEQMADIQAEIELLQNEIQKANAVISKAINNRRAIGKDHPLAAEANAAVAMAREALAAIEIKLRTLYESKPD*
Ga0150984_11747490313300012469Avena Fatua RhizosphereMADIQAEIELLQTEIQKATVVINKAISNRRAIGKDSPLAADANAAVAAAREALAAVEIKLRTLYESKPD*
Ga0150984_12236943223300012469Avena Fatua RhizosphereMADVQDEIELLHIEIEKANAVIAKALRDRRALHKDSPLVADFNAAVTAARQALIVVEIKLRTLYESKGD*
Ga0164304_1049289423300012986SoilMDNIQDEIELLQIEIQQANAAIDKALRDRRALNKGSPLIADFNAAVASARQALASVEVKLRRLYESKGDD*
Ga0164307_1191664813300012987SoilMDVQDEIELLQIEIDKANAAIVKALRERRAVNKDSPLVADLNAAVVAAREALTAVEVKLRKLYESKED*
Ga0164306_1026184133300012988SoilVADTQDEIELLQIEIQKANDAIAKAVRDRRAIGKDSPLLADLNATIVTARESLAAVEIKLRKLYES
Ga0164306_1044963113300012988SoilNAAIAKATTNRRALGKDSPLVADANAAVTAARQALAAVEIKMRALYESKDD*
Ga0157374_1109930123300013296Miscanthus RhizosphereVADTQDEIELLQIEIQKANDAIAKAVRDRRAIGKDSPLLADLNATIVTARESLAAVEIKLRKLYESKGD*
Ga0163162_1237318023300013306Switchgrass RhizosphereMNDEQQEIESLQIEIKNANAAIDKAIRDRRGISKDSPLLVDSNAAVAAAREALAAAELRLRSVYERKGDD*
Ga0157375_1180641723300013308Miscanthus RhizosphereMQDVQIEIELLHVEIQAANKTLEKALRDRKAANKDSPLVADLNLAVQAARQALNTVEVKLRALYESKDDG*
Ga0137405_134668233300015053Vadose Zone SoilMADIQAEIELLQIEIQKANAVISKAISNRRAIGKDSPLAVEANAAVAAAREALAAIEIKLRMLYESKPD*
Ga0132258_1011049953300015371Arabidopsis RhizosphereMNQQQEINTQQEIALLQVEIQNANAAIAKALRDRRGISKDSPLFADANATITTARENLAAVELRLRNAYERKGDD*
Ga0132258_1049256053300015371Arabidopsis RhizosphereMDDIEDEIESLQVEINKAREVLDKAVRDRRALHKESPLRDDCVAAVVAAREALAAVEIRLRTLYESKNRD*
Ga0132258_1186381133300015371Arabidopsis RhizosphereMADIQSEIELLQIEITKANAAIAKALRDRRAVNKDSPLIADLNAAVVAARQSLTDVEVKLRKLYESKSSDD*
Ga0132258_1259506113300015371Arabidopsis RhizosphereMADLQDEIELLQNEIQKANELIVKTVRERRAANKNSPLAAELNAAVAAARKALNDVEVKLRKLYVSKDD*
Ga0132258_1316114433300015371Arabidopsis RhizosphereMEDIQAEIELLQIEIKQAQAAITKALSDRKAINKDSPFVADLNAAVAAARESLAAVEIKLRKLYESKG
Ga0132258_1326939523300015371Arabidopsis RhizosphereMADVQDEIDLLHVEIEKANAVVAQALRDRRALNKDSPLVAECNAAVTAARQALIVVEIKLRALYESKDD*
Ga0132255_10494475123300015374Arabidopsis RhizosphereMGDIQSEIELLQIEITKANAAIAKALRDRRAVNKDSPLIADLNAAVVAARQSLTDVEVKLRKLYESKSSDD*
Ga0184638_101908413300018052Groundwater SedimentMADLQDEMSLVEQEIEQANAALAKAVRERRAAKGDGPLVAALEAAVVAARNALTDAEVKLRKLYESKDDGG
Ga0184628_1038283913300018083Groundwater SedimentQIALLQVEIEKANAAIAKAVRERRAINKDSPLIADFQAAVATAKQALHNAEVKLRALYESKED
Ga0184629_1068563113300018084Groundwater SedimentMGDVQDQIELLQIDIKNANDAVVKAVKQRREINKDSPLAADYGAAVTAAKKALVDAEVKLRALYESKED
Ga0066662_1105191423300018468Grasslands SoilMDDIQHEIELLQVEIKSANAIIEKAIRERRATAKDSPLIADLNAAVAAARAALAAVEIKLRALYERKSDD
Ga0210380_1005276733300021082Groundwater SedimentMDDVQDQIALLQVEIEKANAAIAKAVRERRAINKDSPLIADFQAAVATAKQALHNAEVKLRALYESKED
Ga0207687_1174466823300025927Miscanthus RhizosphereNAAIDKAIRNRRGISKDSPLLADSNATVVAARAALAAVELRLRNAYERKGDE
Ga0207700_1044462623300025928Corn, Switchgrass And Miscanthus RhizosphereMPDVQDEIEALQVEIEKANAVIAKALRERRALSKDSPLVADCNAAVAAARQALAVVEIKLRALYESKDD
Ga0207703_1231621623300026035Switchgrass RhizosphereMNDEQQEIELLQVEMRSASAAIDKAIRDRRGIAKDSPLLADVNAAVATARAALAAVELRLRNVYERKGDD
Ga0207639_1166799313300026041Corn RhizosphereTEIGKANAAIAKALRERRALNKDSPLVDDFNAAVVTARQALVSVEIKLRQLYESKGD
Ga0207641_1244926313300026088Switchgrass RhizosphereESLQIEIKNANAAIDKAIRDRRGISKDSPLLVDSNAAVAAAREALAAAELRLRSVYERKGDD
Ga0208185_103496913300027533SoilRTNRQRQGMADLQDEMSLVEQEIEQANAALAKAVRERRSAKGDSPLVAALEAAVVAARNALTDAEVKLRKLYESKDDGG
Ga0310813_1007617633300031716SoilMDDIQHEIELLQVEIASANAVIEKAIRERRATAKDSPLLADVNAAVVAARESLAAVEMKLRALYERKGDD
Ga0310813_1031702023300031716SoilMADIQAEIELLQIEIQKANGVISKAISDRRAIGKDSPLAAEANAAVAAAREALAAVEIKLRMLYESKPD
Ga0310813_1045622623300031716SoilMTDVQTEIGLLQIEIQKANDVIAKAINNRRAIGKESPLAADAQAAVVAARNALAAVEIKLRMLYESKPD
Ga0310813_1103758513300031716SoilMTNVQAEIELLQIEIQKANDVITKAISNRRAIGKDSPLAAEAQAAVVAARDALAAVEIKLRMLYESKPD
Ga0310813_1154373633300031716SoilMADIQAEIELLQIEIAKANAAIAKALRERRAANKDSPLVAELNAAVSAARQSLNLVE
Ga0308175_10328635623300031938SoilMGDVQDEIELLQTEIGKANAAIAKALRERRALNKDSPLVDDFNAAVVTARQALVSVEIKLRQLYESKGD
Ga0310810_1105149623300033412SoilCKRNSMDDIQHEIELLQVEIASANAVIEKAIRERRATAKDSPLLADVNAAVVAARESLAAVEMKLRALYERKGDD
Ga0364925_0274026_37_2463300034147SedimentMDDVQDQIALLQVEIEKANAAITKAVRERRAINKDSPLIADFQAAVATAKQALHNAEVKLRALYESKED
Ga0364925_0414109_139_3873300034147SedimentMRDPTTHKGNIMADVQDEIQLLQIDIEKANAAIAKALRDRRAINKDSPLVADFNAAVTAARQALASVEAKLRILYESKSGGD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.