NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300025983

3300025983: Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Tolay_CordC_D2 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300025983
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111376 | Gp0101393 | Ga0210106
Sample Name: Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Tolay_CordC_D2 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 145181152
Sequencing Scaffolds: 38
Novel Protein Genes: 41
Associated Families: 37

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Phycisphaerae → unclassified Phycisphaerae → Phycisphaerae bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 6
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cellvibrionales → Halieaceae → Pseudohalioglobus → Pseudohalioglobus sediminis | 2
Not Available | 19
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Aneurinibacillus group → Aneurinibacillus → Aneurinibacillus tyrosinisolvens | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1
All Organisms → cellular organisms → Archaea | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Natural And Restored Wetland Microbial Communities From The San Francisco Bay, California, Usa, That Impact Long-Term Carbon Sequestration
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands → Natural And Restored Wetland Microbial Communities From The San Francisco Bay, California, Usa, That Impact Long-Term Carbon Sequestration

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: USA: San Francisco Bay, California
Coordinates: Lat. (°) 38.15012 | Long. (°) -122.438774 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F005055 | Metagenome / Metatranscriptome | 413 | N
F005727 | Metagenome / Metatranscriptome | 392 | Y
F014614 | Metagenome / Metatranscriptome | 261 | Y
F021303 | Metagenome / Metatranscriptome | 219 | Y
F025653 | Metagenome / Metatranscriptome | 200 | Y
F027171 | Metagenome / Metatranscriptome | 195 | N
F029111 | Metagenome / Metatranscriptome | 189 | Y
F036767 | Metagenome / Metatranscriptome | 169 | Y
F039643 | Metagenome | 163 | Y
F041201 | Metagenome / Metatranscriptome | 160 | Y
F042907 | Metagenome / Metatranscriptome | 157 | Y
F042909 | Metagenome | 157 | Y
F048676 | Metagenome / Metatranscriptome | 148 | Y
F050381 | Metagenome / Metatranscriptome | 145 | Y
F051215 | Metagenome / Metatranscriptome | 144 | Y
F052508 | Metagenome / Metatranscriptome | 142 | N
F053305 | Metagenome / Metatranscriptome | 141 | Y
F054044 | Metagenome / Metatranscriptome | 140 | Y
F054047 | Metagenome / Metatranscriptome | 140 | Y
F057384 | Metagenome / Metatranscriptome | 136 | Y
F058175 | Metagenome / Metatranscriptome | 135 | Y
F058721 | Metagenome / Metatranscriptome | 134 | Y
F066735 | Metagenome / Metatranscriptome | 126 | N
F070131 | Metagenome / Metatranscriptome | 123 | Y
F076109 | Metagenome | 118 | Y
F077317 | Metagenome / Metatranscriptome | 117 | Y
F077987 | Metagenome / Metatranscriptome | 117 | Y
F081352 | Metagenome / Metatranscriptome | 114 | Y
F084254 | Metagenome / Metatranscriptome | 112 | Y
F087210 | Metagenome / Metatranscriptome | 110 | Y
F087950 | Metagenome / Metatranscriptome | 110 | Y
F090406 | Metagenome | 108 | N
F097951 | Metagenome / Metatranscriptome | 104 | Y
F098573 | Metagenome / Metatranscriptome | 103 | Y
F099322 | Metagenome / Metatranscriptome | 103 | Y
F099325 | Metagenome / Metatranscriptome | 103 | Y
F102142 | Metagenome / Metatranscriptome | 102 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0210106_1001210 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Phycisphaerae → unclassified Phycisphaerae → Phycisphaerae bacterium | 3384
Ga0210106_1004793 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1956
Ga0210106_1009816 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1442
Ga0210106_1009994 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1431
Ga0210106_1010418 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cellvibrionales → Halieaceae → Pseudohalioglobus → Pseudohalioglobus sediminis | 1408
Ga0210106_1012239 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1314
Ga0210106_1015323 | Not Available | 1186
Ga0210106_1017190 | Not Available | 1124
Ga0210106_1018369 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1090
Ga0210106_1019888 | All Organisms → cellular organisms → Bacteria | 1050
Ga0210106_1022643 | Not Available | 990
Ga0210106_1024240 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales | 960
Ga0210106_1025202 | All Organisms → cellular organisms → Bacteria | 942
Ga0210106_1030301 | Not Available | 864
Ga0210106_1033638 | Not Available | 823
Ga0210106_1038066 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 776
Ga0210106_1038634 | Not Available | 770
Ga0210106_1039670 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 760
Ga0210106_1044168 | Not Available | 722
Ga0210106_1046836 | Not Available | 701
Ga0210106_1047249 | Not Available | 698
Ga0210106_1052418 | Not Available | 664
Ga0210106_1053059 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 660
Ga0210106_1053242 | Not Available | 659
Ga0210106_1059343 | Not Available | 625
Ga0210106_1061586 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cellvibrionales → Halieaceae → Pseudohalioglobus → Pseudohalioglobus sediminis | 613
Ga0210106_1066134 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Aneurinibacillus group → Aneurinibacillus → Aneurinibacillus tyrosinisolvens | 592
Ga0210106_1070267 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 575
Ga0210106_1073261 | Not Available | 564
Ga0210106_1076639 | Not Available | 551
Ga0210106_1076879 | Not Available | 550
Ga0210106_1077641 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 548
Ga0210106_1080727 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 537
Ga0210106_1080759 | Not Available | 537
Ga0210106_1081224 | Not Available | 536
Ga0210106_1082981 | Not Available | 530
Ga0210106_1084435 | All Organisms → cellular organisms → Archaea | 525
Ga0210106_1086016 | Not Available | 519

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0210106_1001210 | Ga0210106_10012106 | F052508 | MSYLYQKKSYKKIILNTNDAIASVGKANAGTGNSEFVFYNFNTITIKEPSYLKVIGASADISTNSVWTIKLDNVSYNETSYYNSDADGSPTILTRCFNGKSSLMLDNIALELPPQDMSDIKLIILDEAGNGLNGAKMMFE
Ga0210106_1004793 | Ga0210106_10047931 | F099322 | MVRRASISDDDSMRIVLDGVSLIRNSDGAWCCVAPPSAHDLKDNWYRASPSLLIRWLDAVGQVTPVN
Ga0210106_1009816 | Ga0210106_10098161 | F077317 | MEESLINPVDETVWYENDQPIKEFLESFRKPAVLEADPQSEEPSWLKIIEKSIQI
Ga0210106_1009994 | Ga0210106_10099942 | F054044 | MNYRNSDRLNHLPADRLGMSNTVGILVLVFALTALILLFLDFSFRSAADQEECLQLVRALGLNSLSLAPSGRPLRNPGAIDPSIDLRFDPKLWRTQK
Ga0210106_1010418 | Ga0210106_10104182 | F090406 | QMKCERNVLWLIVVLAGLAACSTTLTSVDKEGGNVYALPPTVVDQMLKDAMSTEISTGTLRRGSKPYPSYLGEVTWGSLDKDTITASARPAKGRKLDGTIIDGYVFEVTRKGTAPATGEPTVKRIFAKLQNDAELTGTGVAFVEFTD
Ga0210106_1012239 | Ga0210106_10122392 | F087950 | MQCWINGPATGGIDDKIKMVNILLKTNIPSFHPSIIPFPGKIRKPQKTSIL
Ga0210106_1015323 | Ga0210106_10153233 | F039643 | MRKFWKKQVEDLGGDPSVLIKGRTQLAKEMVMALIAAHKEAITMRNAGYVIGRAACIVKNPCVEF
Ga0210106_1017190 | Ga0210106_10171901 | F084254 | SMQARRIPVFISESIVIEILAEVIRQSKRHPRADQVASALAKRGLPIAEKDVMSVFDHYDIEKKTADSHYFDT
Ga0210106_1018369 | Ga0210106_10183691 | F053305 | LMALFVFSFIAVATVSADETQKITGTVMSVNVETGEVIVKDDAGEMKSLMADPKAGVDLKMLKEGDPVNVESDSNGVIKSLKAAE
Ga0210106_1019888 | Ga0210106_10198882 | F021303 | VQADVLHHAAEFVELVYRQAPAEYHRLLAADRFHSDLERGGAFNHGKLYAYAEFSVFRREFLDRFPEASPAACINAFSGLFLKNDISIHNLVEFVEGADDRTQLRLNSPNP
Ga0210106_1022643 | Ga0210106_10226432 | F048676 | SRLCGKASDTVDAVKLEIPKGFKLLKFKMSEYLKLILKLTISKI
Ga0210106_1024240 | Ga0210106_10242401 | F048676 | ASDAVDAVKLEIPKGFKLLKFKMSEYLKNIVKLTISKI
Ga0210106_1025202 | Ga0210106_10252021 | F054047 | GSDIGHGLFVGWVEHLDIFRWVSFLNPTLPAIFVVSAKPNKMASIF
Ga0210106_1030301 | Ga0210106_10303012 | F014614 | MTDWYADYYRQSRGYNWNDLRELSYQKPKVELTVPDVFKHRFATRAEYDAWVEERRKLYF
Ga0210106_1030301 | Ga0210106_10303014 | F027171 | IDSLEFDPDGLIRVTAVVDQVVLTHHATQWDPEEYGSALCRGSFYLSDEDLIPATDAELCKLIAERVDDWAPVDPDE
Ga0210106_1033638 | Ga0210106_10336381 | F099325 | MRDALLRFWRWSQKSPAERRAAKARARFWLELREGQREAEVHSRP
Ga0210106_1038066 | Ga0210106_10380662 | F048676 | ASDAVDAVKLEIPKGFKLLKFKMSEYLKNIVKLPISKI
Ga0210106_1038634 | Ga0210106_10386342 | F042909 | MVELLTFIGSLNLWTFIVIWTSTIFLVIFISLLLISVSQMNKETIKISEKVKILIEDLSKKKLKLGNHDNKFF
Ga0210106_1038770 | Ga0210106_10387701 | F087210 | MRFLMGVLALAGAETALAHTPDTGFLSNFGHQLTSLHHSPAAMLLGVLVLALLLVA
Ga0210106_1039670 | Ga0210106_10396701 | F036767 | MRKNSCGKVDILKTDYDAKISKVPELIGKHPGVIAPVTICKPKPLEFIKPASASRVSGSMEVEVKIREGNEELERNLKKIVLTIDGNRFEFDKPPYKVYFDTSHARYRVITLTAEALGKDEEQEDAVLSSFYTNVIAENGECDKTKPLLLFAGVFEPKIENRRETWTPEMYSKAYTFSEKMMSHLMHYGF
Ga0210106_1044168 | Ga0210106_10441681 | F070131 | FSSTPMLNEYTVLNIEIRALIDTPNTLIEIEIPRDGFEIISGSTQFNEDLSSGSTATYQLEVLPTAQGQYTIAASARSEETDYIFGKREELYVNIGEGFSELSKTSFTPEIANNRSGAIKIGNLSGPPTQVLPDHKPIEDQAVSYFAAPGSGQIVVRGYWFYQDKNEVNQPLREAMVEIWDSGSGSSGDTLLDTTYTNDSGYYASGNISNSDDEGVGQDIYVKVFSTDNRSVRVTDFSTP
Ga0210106_1046836 | Ga0210106_10468361 | F102142 | QFIAQLVGISYDEFHGGMPTQYLFPLFLIYTAIALVLAKMKKNLNVSKRGAFFIIFAFHYFIVSFLPELEGKIYLPDFPFLSTLISTFILALAVVSLIFYLWKQEDHPEAKTGQQIRSYFSSRSILSWAWRFFLVWILFYVVTMIIGIVALPFNGEYLDDATNTLGMVVPSMGALFAITQFRSLFYILVTLPFIIFWSSSKRSLLLYLALILIIQYPLLGDGLAYFWPGMYRL
Ga0210106_1047249 | Ga0210106_10472492 | F081352 | MIRLIRKISTITLACLLVLNLTAGAAVVVEHCPPLLGSSSPMDIDHCDGMLNFAFPMQGCCGECNDIFCDLIKNPLQDANAVNASPFQGSYYFFILGTVDSIAESGAWIAASAPRYLFCAAPAWSQIPLYIENLALII
Ga0210106_1052418 | Ga0210106_10524181 | F041201 | KNRILVQDRDGAEFKTAGILLYVEDLKRGTNKDIGPKDIFAIGL
Ga0210106_1053059 | Ga0210106_10530592 | F058721 | MTMLRATLPCADPKCKEKMRLVFQNERFIGYRCLLKPNTHNFRYDRASKKWEKIIIKTKPIIGYKECPHHIAYKDELVVESI
Ga0210106_1053242 | Ga0210106_10532421 | F005727 | TRSRSADFHRATMLKLIVAAGLLYLLWEPIKPVRLVTADLLHTTGDLVAR
Ga0210106_1059343 | Ga0210106_10593431 | F042907 | MTDEMRVEAEEKLFNDLRDEPCIVCGDPYLHVLGSYTPEEPKRSKNREDRAPVLYYTLCRTCFNNGRIPTARIESAYKSKFGKMAA
Ga0210106_1061586 | Ga0210106_10615861 | F090406 | MKFRLNVWWWIIVFAGLTSCSTTLTSVDGKGGNVYALPPTVVDQMLQDAMSTEISSGTLRRGSTPYPSYLGEVTWGSLDKDTITASARPAKGRKLDGTVVDGYVFEVSRKGTAPATGEPTVKRIFAKLQKDAELTGTGAAFVEFPD
Ga0210106_1066134 | Ga0210106_10661341 | F077987 | MRIEYKCSGGFGGLRLAYRGETDDLPSEQAKELSDLVEAAGIFELTPKQLSKKSPTIPDDFSCLLTVLKAGKRKTLSFNELGAPANLRRLSVHLRKLAIQQKGR
Ga0210106_1070267 | Ga0210106_10702671 | F066735 | MNDVDKFVAAIVEQDSLAGSNPILSEDDIQQMQLMNSIDDVQMYAIEKGIDAGKVNNALISTGLDTASDPMADLTGLFGFGEAEKSIIGGLPSDYSPRNAGATDFYREGDEYNIFANLPTEDLYSLQARLIQGGLLARGAFT
Ga0210106_1073261 | Ga0210106_10732612 | F050381 | MKKILVIIAIILIPTIAFANFSIQLDNNTGKKMFYLLYWVDHTYDWPHPFNLAGGELKSSESIDLRSDYRNGKYFVVWSDNGDWQNKVMMN
Ga0210106_1075701 | Ga0210106_10757011 | F098573 | ALIIRYKRYETDAVEKLMTLLFITEDSYRMAGVSQALGLDTQQAGTWLALEGENSRAIQPMALWGLVQWLGQLSPAQLNLALGVEPPHHIIEDEKHTKECGQKAYIPMVYAPKEALIWWIDYIHSVSEVELTASLERFKTISDRLKDITGATVDGWEAAQNALQAAFEGIILAECHFHAMLKLGQ
Ga0210106_1076639 | Ga0210106_10766391 | F058175 | QTGADGTALTAAAGDVVLGYAREAGVDGQIIEIEMIQGGNVVPA
Ga0210106_1076879 | Ga0210106_10768791 | F029111 | GGNRTAGSSPDSDDSRTVSRTGVDGNTSKEHEQNALPDILMTMSQIPDVNRELVEVGAMFAVFRAVNNDPPRS
Ga0210106_1077641 | Ga0210106_10776411 | F057384 | MTRTRMDEATINASLDACLHGAKEGDTDQARRLHEMLDLMLTERETENGKLWLTEHGKLLLAEMHRALSHCEGSGHRLEETVLDAVQLKPRQGHWRDTCEYLHDLRVAIAVANELCEQRAAGDKPSVSQAAKAVAGRGEFDMGAPRIREVYDEIASTLGGF
Ga0210106_1080727 | Ga0210106_10807271 | F051215 | VAVDRTQGSGSYGTSRTVLVGSLVGLVLAAIGVTVLIDKEGGQKLLASIYDMLGNAAGAEALRNGQGDQFFAKLLLAAIAMLVGVGGIWLLFTGVS
Ga0210106_1080759 | Ga0210106_10807591 | F042909 | MVELMTFIGNLNLGTFIVLWTSTILFVICVSLLLFYIRQMNKEAITISKKVK
Ga0210106_1081224 | Ga0210106_10812241 | F097951 | AARAALHTLNVPGRQRHSERAQKARPKFRHATATKTTVTGKPEVRVFAPGFY
Ga0210106_1082981 | Ga0210106_10829811 | F005055 | QTILMTASTRGFTFKPATINDVHELTSQMLDRGLLDFERVGQHPVLSLAMYIHEDDSYLIYGPDGSLYGAYGVSEDNAVWIQMTKQVKKNPRTTVRFGKALMEHINRPYLWTTIDIKNTDLINLARYLGFKVLRVFPDGPDNVYSIEIVRL
Ga0210106_1084435 | Ga0210106_10844352 | F025653 | MEAIKYFETKLKTMSLAELQDYKKRLDESITEKISKTVPNEQIAPLILYRGILEHEIKKRTNLR
Ga0210106_1086016 | Ga0210106_10860161 | F076109 | MEVLFKMLFVIALIATIFCLCVFIGGLFNAFQKEE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"