NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300031476

3300031476: Metatranscriptome of soil surface biofilm microbial communities from soil inoculated with nitrogen-fixing consortium DG1, State College, Pennsylvania, United States - MICR_R6 (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID: 3300031476
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0132857 | Gp0330689 | Ga0314827
Sample Name: Metatranscriptome of soil surface biofilm microbial communities from soil inoculated with nitrogen-fixing consortium DG1, State College, Pennsylvania, United States - MICR_R6 (Metagenome Metatranscriptome)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 37721669
Sequencing Scaffolds: 30
Novel Protein Genes: 33
Associated Families: 28

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1
Not Available | 16
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Thermoguttaceae → Thermogutta → Thermogutta terrifontis | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 3
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Surface Biofilm Microbial Communities From Soil Inoculated With Nitrogen-fixing Consortium Dg1, State College, Pennsylvania, United States
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil → Soil Surface Biofilm Microbial Communities From Soil Inoculated With Nitrogen-fixing Consortium Dg1, State College, Pennsylvania, United States

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | land | biofilm material
Earth Microbiome Project Ontology (EMPO): Unclassified

Location Information
Location: USA: Pennsylvania
Coordinates: Lat. (°) 40.7997 | Long. (°) -77.8629 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000203 | Metagenome / Metatranscriptome | 1619 | Y
F000240 | Metagenome / Metatranscriptome | 1481 | Y
F000344 | Metagenome / Metatranscriptome | 1257 | Y
F001633 | Metagenome / Metatranscriptome | 660 | Y
F010686 | Metagenome / Metatranscriptome | 300 | Y
F011252 | Metagenome / Metatranscriptome | 293 | Y
F016001 | Metagenome / Metatranscriptome | 250 | Y
F017060 | Metagenome / Metatranscriptome | 243 | Y
F017262 | Metagenome / Metatranscriptome | 241 | Y
F023530 | Metagenome / Metatranscriptome | 209 | Y
F025923 | Metagenome / Metatranscriptome | 199 | Y
F027649 | Metagenome / Metatranscriptome | 194 | Y
F030637 | Metagenome / Metatranscriptome | 184 | Y
F033437 | Metagenome / Metatranscriptome | 177 | Y
F042753 | Metagenome / Metatranscriptome | 157 | Y
F043340 | Metagenome / Metatranscriptome | 156 | Y
F045595 | Metagenome / Metatranscriptome | 152 | Y
F053640 | Metagenome / Metatranscriptome | 141 | Y
F057461 | Metagenome / Metatranscriptome | 136 | Y
F070535 | Metagenome / Metatranscriptome | 123 | Y
F072453 | Metagenome / Metatranscriptome | 121 | Y
F075814 | Metagenome / Metatranscriptome | 118 | Y
F081897 | Metagenome / Metatranscriptome | 114 | Y
F082185 | Metagenome / Metatranscriptome | 113 | Y
F087976 | Metagenome / Metatranscriptome | 109 | Y
F090047 | Metagenome / Metatranscriptome | 108 | Y
F102642 | Metagenome / Metatranscriptome | 101 | Y
F105419 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0314827_105475 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1004
Ga0314827_105834 | Not Available | 973
Ga0314827_106033 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Thermoguttaceae → Thermogutta → Thermogutta terrifontis | 957
Ga0314827_106294 | Not Available | 939
Ga0314827_106326 | Not Available | 938
Ga0314827_106439 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae | 928
Ga0314827_106803 | Not Available | 904
Ga0314827_108362 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 814
Ga0314827_108448 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 811
Ga0314827_108485 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 810
Ga0314827_108564 | Not Available | 806
Ga0314827_108773 | Not Available | 797
Ga0314827_109311 | Not Available | 774
Ga0314827_109612 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 763
Ga0314827_109940 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 750
Ga0314827_110541 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 729
Ga0314827_110830 | Not Available | 719
Ga0314827_112073 | Not Available | 680
Ga0314827_112363 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 673
Ga0314827_112538 | Not Available | 669
Ga0314827_113015 | Not Available | 657
Ga0314827_114290 | Not Available | 629
Ga0314827_115782 | Not Available | 600
Ga0314827_116271 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 591
Ga0314827_116551 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 586
Ga0314827_116584 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 586
Ga0314827_117060 | Not Available | 578
Ga0314827_119866 | Not Available | 539
Ga0314827_122124 | Not Available | 511
Ga0314827_123139 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 500

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0314827_105475 | Ga0314827_1054751 | F105419 | MRKFIALSALLVAAAASNGCISTTMYGCEITETLAPGASEGEVVMKHGAPDNIVYLGGQYFNPQTGERGEVDKYLYEYRIGGGTTLLGKVFASDEFHNIAYLIEGGRVMGGGYVGEGKGSIILGNDFGVLNTPLGTMDLRFGGFLHPKA
Ga0314827_105834 | Ga0314827_1058341 | F072453 | VEISGWTLIALACVVLVNTLFIVGLAVALFMLNKKIDEALDKAAPLLQKATETLNQVEETTSQLQQRVDRVLDKTTRLVDQVSERVDTTTAIAEEAVTEPLIGAASIMAGINRGLRVYSERTSEKGNGK
Ga0314827_106033 | Ga0314827_1060332 | F010686 | LRGLDLCGQATKGTWGMSWRQKAMKGVEDCEKPGGIVKQVMIPGFPN
Ga0314827_106294 | Ga0314827_1062941 | F016001 | VFSRRLDPIPSWACLLQVFALDAVGTPSRSLTLMILMATLSSHCRHRPSAFRHRAWLASLEAAYLLEVLDLPATPSCPEISDEVRRSASPNPLRDPVP
Ga0314827_106326 | Ga0314827_1063261 | F023530 | RTILLLVAAAGVLARVHHHHAKGTAAPPVPNFPLDWTANEEDYMVVYQGQYSVNNGLYCCGDTSCEVQTQYQSGTNYFDFTHNRTRFDDPVNGDIVSLFYPVYKEMAVDNTNTCTAYCPIQEDISPYGIAPNSTYQGQKVINGQKYDDWQYVDKEFGIVFETDDIFVVPSTQLPYQEVDQLTPFGQAIGQSTSTYHTFTPGTPDPSKFDVKGVENCPQSQNCGNSQRQFVRRRWNQWKTWMQAYQQNGLEKAEQISRRMARH
Ga0314827_106439 | Ga0314827_1064391 | F001633 | VGVGVRHTLFPGGTRLDTLAFAGVVLPDATLCGMQMFRSHGGTVLTVAGRDLSSEASAPGSDAPCRERRAGRGADTPATFVVSRRHPYHGDGTGFWPIVGPALRV
Ga0314827_106803 | Ga0314827_1068031 | F030637 | PVPSSPLRRFEDSMRCPVSCRQATGTSREVGCSPEFYEHGPCRLVGLPAATSTLRFPARPGGSTFRVVPRLLAKTGSSSPELGLLFRVRTASNLPLARMRGAPSLGSRSQSRCQPRRSTCERGSQPRPTFRPRRFSRPRRFSLSTTSWACFIPLPRPGFTFQGFVPATWPGRLVDGPCPPVVDHPLLSSRCRGDSRSGDLAFRALIQLAIRSNRRSV
Ga0314827_108362 | Ga0314827_1083622 | F043340 | MIERIRKAGWLTVEAALLLIVLCVLLNIILGSESGPFISAVAGNATQLLQAIPPGTLLGVLLIVGLYWLVRGRLSGQS
Ga0314827_108448 | Ga0314827_1084481 | F000344 | MRPKHPHAVESGVGKHITRESERVQACAAGKERVTNAH
Ga0314827_108485 | Ga0314827_1084851 | F001633 | FVTRSFPAAHAWTRSLFAGGVLPDATLRGMRMFRSHGGTVLTVAGRELSSEASAPGSDAPCRERRAGRGVDTPATFIFRVGTFYRGVGISLWLIVGPALRV
Ga0314827_108564 | Ga0314827_1085641 | F000344 | MRPKHPRAAESGVGKHTARESESAQACAAGKERVAN
Ga0314827_108709 | Ga0314827_1087091 | F000203 | GVRHALFPVPALGAALAGAAGFPTLFSTASGVFGLVAGPSNALLR
Ga0314827_108773 | Ga0314827_1087731 | F081897 | WEFAKLSFPCRHLAPYLGMAAGFPTLFSTASGVCGLVAGPSSTLRMLNFE
Ga0314827_108806 | Ga0314827_1088061 | F000240 | AAGALAAQDAWPYSAGITGYVPTSTPFDNPKTLTERTPTIPNSPDCSVRPQTVVEIQRMRESASRIAQMIEAEVQIMNKRKTFVEQMTTYLNDRIRELNKVKGELAEETRWIEVSTNRIQELAEREKLVKMQDILACLNGDKKTLADESSAQASTIEALKSQSEAVSKRIAEIKAKIEAAAEGKDAGGKGGDAGGEDKE
Ga0314827_109311 | Ga0314827_1093111 | F082185 | MGVRASGLLANPTPVNTGVMSKVFHMYKGQTLYAQPWRMYNPGLRMLKAVKRDRYFQHYYPYSLDAKRGVKLDDTRWHPFVMRRWFRKKRRQGWVKTHRRRILPDYKQALAWLSQEQWDERLQRKGRYSDFNTKGTCLPAEMIDHYHTTGWYYTLETWPLQWEAARRRKEKMEYARGEFRDIWGNREYPVEVDNGSIAWFDEQTIEAAAAAKK
Ga0314827_109612 | Ga0314827_1096121 | F001633 | FAGGVLPDATLRGMRMSRSHGGTVLTVTGWDLSSEASAPGSDAPCRERRAGRGADTPATFTFSRRHLYRGDGIGFWLIVGPALRV
Ga0314827_109940 | Ga0314827_1099401 | F033437 | GGVLPDATLRGMRMFRSHGGTVLTVTGRDLLSEASSPSSVAPCRERRAGRGADTSAILLRVGTLTTGTAPAFGWPLALRYGSGLLTLRLRSLLQCGGVVFHAAP
Ga0314827_110541 | Ga0314827_1105411 | F090047 | VARTPIEFTCQSCRKTFQSTDWGTVFPCPHCNEGLGRIQQLDQLLDQWFYPRRWRADLHEPNPFYLLEKLWTANGQGETLYNGIAPAHANYDVFRHLVTRLVAQGVDEGWVELEFPDDPLDENPVYQLKFNDPDRFAKGVERLFPEVDWDEQIEVPATEADDAMP
Ga0314827_110830 | Ga0314827_1108301 | F045595 | MRKLLSIAGAMMISTTMFATPASAQAMAGGSCGMSSGDYGITDVNSAKVNNAGHTVPALFYQVNYAGVPAPTTVGVIVRYNGELETQLAVGSASAANSGGNIEGVLRANISPDNQGFANSINQARRGQADGPPDGNRGTRTQNSPTQGGLMPGEYVFYIYTGSVGDVWNVKDGTVARNAFIADEKGYLGTFSCGVSTDQGSGPG
Ga0314827_112073 | Ga0314827_1120731 | F057461 | ARCRARRESYEMPIDLRSIIRFTAAVGTVTVLIGGCDFVQKEPQRSESGGEVDETFEPATITSIKGVAAGEVRTAIDQRLDRGRPKPITEAQWNHTKKLYAEFDGNPIWLDKDGIRERRTKTLMSALLASDADALALDAYPLEELNRVLSALLKES
Ga0314827_112363 | Ga0314827_1123631 | F025923 | KRSGLKDDAESTAGIIPGDRGKAGDNWCRPPLPMAKAR
Ga0314827_112538 | Ga0314827_1125382 | F070535 | MYQAPKLERLGTFREVTLAGGDFNPGDGGNPYHRYAPLPS
Ga0314827_113015 | Ga0314827_1130151 | F017060 | MSRSAESHLQFASSNRDDLRTLFALSFSEPARNDDDLRHAVLDYVRAAKAERRTPEMVIVSLKRAIIDAAAARISYRAANELTDRVVRWFIDGYYEADGSTDRGLELRMAPAPRAS
Ga0314827_114290 | Ga0314827_1142901 | F017262 | VAPKANCAPLCEETSCSWSCAKPTTCPRPKCELQCSKPACDVKDKQKCCKCGAKGVKRALAAAPRFEEVHGDAEMMPSFMEMVATFKHATENGVEECCPCAKKF
Ga0314827_115782 | Ga0314827_1157821 | F087976 | ARDAQLKSEADAFKYKRELARVKRAQADRLREERRKCAGKCDREFPILVGPKNVPAKETRSVVVIGDRAANRKRWSAVSTHKRMFWHPKGLKKTARHITKAGRRIAAAHAKTKRISEAPLVEKPLF
Ga0314827_116271 | Ga0314827_1162711 | F001633 | VLPDATLRGMRMFRSHGGTVLTVAGRDLSSEASAPGSDAPCRERRAGRGADTPATFTVARRHPYHGDGINFWLIVGPALRV
Ga0314827_116551 | Ga0314827_1165511 | F011252 | VSEYTFIEPNAAFPLPATFNFDEYIETERELWALFPETDGRRVNFVQIGLTAMIYAEVPDTGAELGFDETYILMPCQDFAMRPIRMRACRRLTLSEYRMGHLTAIKLVGKVNEAGEVMREIGCQILGDEVLSEAQARLSEEQE
Ga0314827_116584 | Ga0314827_1165841 | F053640 | VAAQRSEMVAESTTGIIPGDRGKAGGNWCRPPLMPAKAVMRHISPVPLAGVVSGQSTHELGTEPQAAIRNRVEWSQATQGVSTCASTQLPQRLRLLLRRPERHRVSRRDDPAKRPHSPHEWGAQGTYGGGERTDLGKVREPPHRGGVKHTSPSCKRQRSLRGKRSDR
Ga0314827_117060 | Ga0314827_1170602 | F072453 | FGRPNHASAVSEDLFVEISNAAIWALTVRVLLNTLFIGGICVVLFLLNKKLNDALAKAEPVLIRATETLGRVEETTVRLQHKVDEVLDKATELVEQVSERVDTTTAIAEEAVTEPLIGAASLMAGINRGLRAYAERSHEKGDGRS
Ga0314827_119866 | Ga0314827_1198661 | F075814 | RTHRRYEVSRRNESVNRPTKCFDAREVGVVQLGTGSYDITVQVMPHSSFNQTTRLTLSSVSPSSAPAVSGGTLVDSVVFQLRAQSSCDGADINPLPNLANMGIVYNVPNAVDKSKLQIVMWNGSSWTNIDTVPDPVPGNPYVSATVNMAGTYALIQKP
Ga0314827_122124 | Ga0314827_1221241 | F042753 | PLRRMSTEFDHESIEKERLAHEQERLTRHVVAPGARTEFEKSDNLTQRTQAVEDAKVRAMMEKKASLPRFDPSNKDAPGAFKTLDAFEVQTLVTLRNNEFTQAFNSGNIAGLHDFFTRTGGVVGVGGKEFSGKADVTAYFQRLRDSGVKNLKHTPGNYKAESDRDVQEW
Ga0314827_122696 | Ga0314827_1226962 | F102642 | MTLAVFFNEEPLSERRRKAGQREHLKREGNNVLAGRGNTFRNVGDVEKGTAGL
Ga0314827_123139 | Ga0314827_1231391 | F027649 | KMGSSPQPPEPAAAKSEEVNMAYGLDPRLFLENGPANQAAAAHARSSSDSYRAWAGLDAEHRFELVERALRNAANDRIGLPIAL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice