| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300031476 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0132857 | Gp0330689 | Ga0314827 |
| Sample Name | Metatranscriptome of soil surface biofilm microbial communities from soil inoculated with nitrogen-fixing consortium DG1, State College, Pennsylvania, United States - MICR_R6 (Metagenome Metatranscriptome) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 37721669 |
| Sequencing Scaffolds | 30 |
| Novel Protein Genes | 33 |
| Associated Families | 28 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1 |
| Not Available | 16 |
| All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Thermoguttaceae → Thermogutta → Thermogutta terrifontis | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 3 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 3 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Surface Biofilm Microbial Communities From Soil Inoculated With Nitrogen-fixing Consortium Dg1, State College, Pennsylvania, United States |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil → Soil Surface Biofilm Microbial Communities From Soil Inoculated With Nitrogen-fixing Consortium Dg1, State College, Pennsylvania, United States |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → land → biofilm material |
| Earth Microbiome Project Ontology (EMPO) | Unclassified |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Pennsylvania | |||||||
| Coordinates | Lat. (o) | 40.7997 | Long. (o) | -77.8629 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000203 | Metagenome / Metatranscriptome | 1619 | Y |
| F000240 | Metagenome / Metatranscriptome | 1481 | Y |
| F000344 | Metagenome / Metatranscriptome | 1257 | Y |
| F001633 | Metagenome / Metatranscriptome | 660 | Y |
| F010686 | Metagenome / Metatranscriptome | 300 | Y |
| F011252 | Metagenome / Metatranscriptome | 293 | Y |
| F016001 | Metagenome / Metatranscriptome | 250 | Y |
| F017060 | Metagenome / Metatranscriptome | 243 | Y |
| F017262 | Metagenome / Metatranscriptome | 241 | Y |
| F023530 | Metagenome / Metatranscriptome | 209 | Y |
| F025923 | Metagenome / Metatranscriptome | 199 | Y |
| F027649 | Metagenome / Metatranscriptome | 194 | Y |
| F030637 | Metagenome / Metatranscriptome | 184 | Y |
| F033437 | Metagenome / Metatranscriptome | 177 | Y |
| F042753 | Metagenome / Metatranscriptome | 157 | Y |
| F043340 | Metagenome / Metatranscriptome | 156 | Y |
| F045595 | Metagenome / Metatranscriptome | 152 | Y |
| F053640 | Metagenome / Metatranscriptome | 141 | Y |
| F057461 | Metagenome / Metatranscriptome | 136 | Y |
| F070535 | Metagenome / Metatranscriptome | 123 | Y |
| F072453 | Metagenome / Metatranscriptome | 121 | Y |
| F075814 | Metagenome / Metatranscriptome | 118 | Y |
| F081897 | Metagenome / Metatranscriptome | 114 | Y |
| F082185 | Metagenome / Metatranscriptome | 113 | Y |
| F087976 | Metagenome / Metatranscriptome | 109 | Y |
| F090047 | Metagenome / Metatranscriptome | 108 | Y |
| F102642 | Metagenome / Metatranscriptome | 101 | Y |
| F105419 | Metagenome / Metatranscriptome | 100 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0314827_105475 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1004 | Open in IMG/M |
| Ga0314827_105834 | Not Available | 973 | Open in IMG/M |
| Ga0314827_106033 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Thermoguttaceae → Thermogutta → Thermogutta terrifontis | 957 | Open in IMG/M |
| Ga0314827_106294 | Not Available | 939 | Open in IMG/M |
| Ga0314827_106326 | Not Available | 938 | Open in IMG/M |
| Ga0314827_106439 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae | 928 | Open in IMG/M |
| Ga0314827_106803 | Not Available | 904 | Open in IMG/M |
| Ga0314827_108362 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 814 | Open in IMG/M |
| Ga0314827_108448 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 811 | Open in IMG/M |
| Ga0314827_108485 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 810 | Open in IMG/M |
| Ga0314827_108564 | Not Available | 806 | Open in IMG/M |
| Ga0314827_108773 | Not Available | 797 | Open in IMG/M |
| Ga0314827_109311 | Not Available | 774 | Open in IMG/M |
| Ga0314827_109612 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 763 | Open in IMG/M |
| Ga0314827_109940 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 750 | Open in IMG/M |
| Ga0314827_110541 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 729 | Open in IMG/M |
| Ga0314827_110830 | Not Available | 719 | Open in IMG/M |
| Ga0314827_112073 | Not Available | 680 | Open in IMG/M |
| Ga0314827_112363 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Enterobacter → Enterobacter cloacae complex → Enterobacter hormaechei | 673 | Open in IMG/M |
| Ga0314827_112538 | Not Available | 669 | Open in IMG/M |
| Ga0314827_113015 | Not Available | 657 | Open in IMG/M |
| Ga0314827_114290 | Not Available | 629 | Open in IMG/M |
| Ga0314827_115782 | Not Available | 600 | Open in IMG/M |
| Ga0314827_116271 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 591 | Open in IMG/M |
| Ga0314827_116551 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 586 | Open in IMG/M |
| Ga0314827_116584 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia | 586 | Open in IMG/M |
| Ga0314827_117060 | Not Available | 578 | Open in IMG/M |
| Ga0314827_119866 | Not Available | 539 | Open in IMG/M |
| Ga0314827_122124 | Not Available | 511 | Open in IMG/M |
| Ga0314827_123139 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 500 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0314827_105475 | Ga0314827_1054751 | F105419 | MRKFIALSALLVAAAASNGCISTTMYGCEITETLAPGASEGEVVMKHGAPDNIVYLGGQYFNPQTGERGEVDKYLYEYRIGGGTTLLGKVFASDEFHNIAYLIEGGRVMGGGYVGEGKGSIILGNDFGVLNTPLGTMDLRFGGFLHPKA |
| Ga0314827_105834 | Ga0314827_1058341 | F072453 | VEISGWTLIALACVVLVNTLFIVGLAVALFMLNKKIDEALDKAAPLLQKATETLNQVEETTSQLQQRVDRVLDKTTRLVDQVSERVDTTTAIAEEAVTEPLIGAASIMAGINRGLRVYSERTSEKGNGK |
| Ga0314827_106033 | Ga0314827_1060332 | F010686 | LRGLDLCGQATKGTWGMSWRQKAMKGVEDCEKPGGIVKQVMIPGFPN |
| Ga0314827_106294 | Ga0314827_1062941 | F016001 | VFSRRLDPIPSWACLLQVFALDAVGTPSRSLTLMILMATLSSHCRHRPSAFRHRAWLASLEAAYLLEVLDLPATPSCPEISDEVRRSASPNPLRDPVP |
| Ga0314827_106326 | Ga0314827_1063261 | F023530 | RTILLLVAAAGVLARVHHHHAKGTAAPPVPNFPLDWTANEEDYMVVYQGQYSVNNGLYCCGDTSCEVQTQYQSGTNYFDFTHNRTRFDDPVNGDIVSLFYPVYKEMAVDNTNTCTAYCPIQEDISPYGIAPNSTYQGQKVINGQKYDDWQYVDKEFGIVFETDDIFVVPSTQLPYQEVDQLTPFGQAIGQSTSTYHTFTPGTPDPSKFDVKGVENCPQSQNCGNSQRQFVRRRWNQWKTWMQAYQQNGLEKAEQISRRMARH |
| Ga0314827_106439 | Ga0314827_1064391 | F001633 | VGVGVRHTLFPGGTRLDTLAFAGVVLPDATLCGMQMFRSHGGTVLTVAGRDLSSEASAPGSDAPCRERRAGRGADTPATFVVSRRHPYHGDGTGFWPIVGPALRV |
| Ga0314827_106803 | Ga0314827_1068031 | F030637 | PVPSSPLRRFEDSMRCPVSCRQATGTSREVGCSPEFYEHGPCRLVGLPAATSTLRFPARPGGSTFRVVPRLLAKTGSSSPELGLLFRVRTASNLPLARMRGAPSLGSRSQSRCQPRRSTCERGSQPRPTFRPRRFSRPRRFSLSTTSWACFIPLPRPGFTFQGFVPATWPGRLVDGPCPPVVDHPLLSSRCRGDSRSGDLAFRALIQLAIRSNRRSV |
| Ga0314827_108362 | Ga0314827_1083622 | F043340 | MIERIRKAGWLTVEAALLLIVLCVLLNIILGSESGPFISAVAGNATQLLQAIPPGTLLGVLLIVGLYWLVRGRLSGQS |
| Ga0314827_108448 | Ga0314827_1084481 | F000344 | MRPKHPHAVESGVGKHITRESERVQACAAGKERVTNAH |
| Ga0314827_108485 | Ga0314827_1084851 | F001633 | FVTRSFPAAHAWTRSLFAGGVLPDATLRGMRMFRSHGGTVLTVAGRELSSEASAPGSDAPCRERRAGRGVDTPATFIFRVGTFYRGVGISLWLIVGPALRV |
| Ga0314827_108564 | Ga0314827_1085641 | F000344 | MRPKHPRAAESGVGKHTARESESAQACAAGKERVAN |
| Ga0314827_108709 | Ga0314827_1087091 | F000203 | GVRHALFPVPALGAALAGAAGFPTLFSTASGVFGLVAGPSNALLR |
| Ga0314827_108773 | Ga0314827_1087731 | F081897 | WEFAKLSFPCRHLAPYLGMAAGFPTLFSTASGVCGLVAGPSSTLRMLNFE |
| Ga0314827_108806 | Ga0314827_1088061 | F000240 | AAGALAAQDAWPYSAGITGYVPTSTPFDNPKTLTERTPTIPNSPDCSVRPQTVVEIQRMRESASRIAQMIEAEVQIMNKRKTFVEQMTTYLNDRIRELNKVKGELAEETRWIEVSTNRIQELAEREKLVKMQDILACLNGDKKTLADESSAQASTIEALKSQSEAVSKRIAEIKAKIEAAAEGKDAGGKGGDAGGEDKE |
| Ga0314827_109311 | Ga0314827_1093111 | F082185 | MGVRASGLLANPTPVNTGVMSKVFHMYKGQTLYAQPWRMYNPGLRMLKAVKRDRYFQHYYPYSLDAKRGVKLDDTRWHPFVMRRWFRKKRRQGWVKTHRRRILPDYKQALAWLSQEQWDERLQRKGRYSDFNTKGTCLPAEMIDHYHTTGWYYTLETWPLQWEAARRRKEKMEYARGEFRDIWGNREYPVEVDNGSIAWFDEQTIEAAAAAKK |
| Ga0314827_109612 | Ga0314827_1096121 | F001633 | FAGGVLPDATLRGMRMSRSHGGTVLTVTGWDLSSEASAPGSDAPCRERRAGRGADTPATFTFSRRHLYRGDGIGFWLIVGPALRV |
| Ga0314827_109940 | Ga0314827_1099401 | F033437 | GGVLPDATLRGMRMFRSHGGTVLTVTGRDLLSEASSPSSVAPCRERRAGRGADTSAILLRVGTLTTGTAPAFGWPLALRYGSGLLTLRLRSLLQCGGVVFHAAP |
| Ga0314827_110541 | Ga0314827_1105411 | F090047 | VARTPIEFTCQSCRKTFQSTDWGTVFPCPHCNEGLGRIQQLDQLLDQWFYPRRWRADLHEPNPFYLLEKLWTANGQGETLYNGIAPAHANYDVFRHLVTRLVAQGVDEGWVELEFPDDPLDENPVYQLKFNDPDRFAKGVERLFPEVDWDEQIEVPATEADDAMP |
| Ga0314827_110830 | Ga0314827_1108301 | F045595 | MRKLLSIAGAMMISTTMFATPASAQAMAGGSCGMSSGDYGITDVNSAKVNNAGHTVPALFYQVNYAGVPAPTTVGVIVRYNGELETQLAVGSASAANSGGNIEGVLRANISPDNQGFANSINQARRGQADGPPDGNRGTRTQNSPTQGGLMPGEYVFYIYTGSVGDVWNVKDGTVARNAFIADEKGYLGTFSCGVSTDQGSGPG |
| Ga0314827_112073 | Ga0314827_1120731 | F057461 | ARCRARRESYEMPIDLRSIIRFTAAVGTVTVLIGGCDFVQKEPQRSESGGEVDETFEPATITSIKGVAAGEVRTAIDQRLDRGRPKPITEAQWNHTKKLYAEFDGNPIWLDKDGIRERRTKTLMSALLASDADALALDAYPLEELNRVLSALLKES |
| Ga0314827_112363 | Ga0314827_1123631 | F025923 | KRSGLKDDAESTAGIIPGDRGKAGDNWCRPPLPMAKAR |
| Ga0314827_112538 | Ga0314827_1125382 | F070535 | MYQAPKLERLGTFREVTLAGGDFNPGDGGNPYHRYAPLPS |
| Ga0314827_113015 | Ga0314827_1130151 | F017060 | MSRSAESHLQFASSNRDDLRTLFALSFSEPARNDDDLRHAVLDYVRAAKAERRTPEMVIVSLKRAIIDAAAARISYRAANELTDRVVRWFIDGYYEADGSTDRGLELRMAPAPRAS |
| Ga0314827_114290 | Ga0314827_1142901 | F017262 | VAPKANCAPLCEETSCSWSCAKPTTCPRPKCELQCSKPACDVKDKQKCCKCGAKGVKRALAAAPRFEEVHGDAEMMPSFMEMVATFKHATENGVEECCPCAKKF |
| Ga0314827_115782 | Ga0314827_1157821 | F087976 | ARDAQLKSEADAFKYKRELARVKRAQADRLREERRKCAGKCDREFPILVGPKNVPAKETRSVVVIGDRAANRKRWSAVSTHKRMFWHPKGLKKTARHITKAGRRIAAAHAKTKRISEAPLVEKPLF |
| Ga0314827_116271 | Ga0314827_1162711 | F001633 | VLPDATLRGMRMFRSHGGTVLTVAGRDLSSEASAPGSDAPCRERRAGRGADTPATFTVARRHPYHGDGINFWLIVGPALRV |
| Ga0314827_116551 | Ga0314827_1165511 | F011252 | VSEYTFIEPNAAFPLPATFNFDEYIETERELWALFPETDGRRVNFVQIGLTAMIYAEVPDTGAELGFDETYILMPCQDFAMRPIRMRACRRLTLSEYRMGHLTAIKLVGKVNEAGEVMREIGCQILGDEVLSEAQARLSEEQE |
| Ga0314827_116584 | Ga0314827_1165841 | F053640 | VAAQRSEMVAESTTGIIPGDRGKAGGNWCRPPLMPAKAVMRHISPVPLAGVVSGQSTHELGTEPQAAIRNRVEWSQATQGVSTCASTQLPQRLRLLLRRPERHRVSRRDDPAKRPHSPHEWGAQGTYGGGERTDLGKVREPPHRGGVKHTSPSCKRQRSLRGKRSDR |
| Ga0314827_117060 | Ga0314827_1170602 | F072453 | FGRPNHASAVSEDLFVEISNAAIWALTVRVLLNTLFIGGICVVLFLLNKKLNDALAKAEPVLIRATETLGRVEETTVRLQHKVDEVLDKATELVEQVSERVDTTTAIAEEAVTEPLIGAASLMAGINRGLRAYAERSHEKGDGRS |
| Ga0314827_119866 | Ga0314827_1198661 | F075814 | RTHRRYEVSRRNESVNRPTKCFDAREVGVVQLGTGSYDITVQVMPHSSFNQTTRLTLSSVSPSSAPAVSGGTLVDSVVFQLRAQSSCDGADINPLPNLANMGIVYNVPNAVDKSKLQIVMWNGSSWTNIDTVPDPVPGNPYVSATVNMAGTYALIQKP |
| Ga0314827_122124 | Ga0314827_1221241 | F042753 | PLRRMSTEFDHESIEKERLAHEQERLTRHVVAPGARTEFEKSDNLTQRTQAVEDAKVRAMMEKKASLPRFDPSNKDAPGAFKTLDAFEVQTLVTLRNNEFTQAFNSGNIAGLHDFFTRTGGVVGVGGKEFSGKADVTAYFQRLRDSGVKNLKHTPGNYKAESDRDVQEW |
| Ga0314827_122696 | Ga0314827_1226962 | F102642 | MTLAVFFNEEPLSERRRKAGQREHLKREGNNVLAGRGNTFRNVGDVEKGTAGL |
| Ga0314827_123139 | Ga0314827_1231391 | F027649 | KMGSSPQPPEPAAAKSEEVNMAYGLDPRLFLENGPANQAAAAHARSSSDSYRAWAGLDAEHRFELVERALRNAANDRIGLPIAL |
| ⦗Top⦘ |