| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300010230 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0121427 | Gp0154116 | Ga0136236 |
| Sample Name | Filterable freshwater microbial communities from Conwy River, North Wales, UK. Not filtered control. After WGA |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Fidelity Systems Inc |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 114345546 |
| Sequencing Scaffolds | 27 |
| Novel Protein Genes | 40 |
| Associated Families | 40 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 12 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1 |
| Not Available | 9 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_4_56_7 | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Filterable Freshwater Microbial Communities From Conwy River, North Wales, Uk |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater → Filterable Freshwater Microbial Communities From Conwy River, North Wales, Uk |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater river biome → river → river water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Water (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Conwy River, Conwy, North Wales, UK | |||||||
| Coordinates | Lat. (o) | 53.2 | Long. (o) | -3.82 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000506 | Metagenome / Metatranscriptome | 1070 | Y |
| F000857 | Metagenome / Metatranscriptome | 858 | Y |
| F001629 | Metagenome | 660 | Y |
| F003392 | Metagenome / Metatranscriptome | 489 | Y |
| F003640 | Metagenome / Metatranscriptome | 475 | Y |
| F004204 | Metagenome | 448 | Y |
| F004517 | Metagenome / Metatranscriptome | 434 | Y |
| F005200 | Metagenome | 408 | Y |
| F007522 | Metagenome / Metatranscriptome | 349 | Y |
| F009320 | Metagenome / Metatranscriptome | 319 | Y |
| F010521 | Metagenome | 302 | Y |
| F010743 | Metagenome / Metatranscriptome | 299 | Y |
| F014456 | Metagenome / Metatranscriptome | 263 | Y |
| F016373 | Metagenome | 247 | Y |
| F018874 | Metagenome / Metatranscriptome | 232 | Y |
| F018879 | Metagenome / Metatranscriptome | 232 | N |
| F021938 | Metagenome / Metatranscriptome | 216 | Y |
| F022833 | Metagenome | 212 | Y |
| F023572 | Metagenome | 209 | Y |
| F024733 | Metagenome | 204 | Y |
| F028495 | Metagenome / Metatranscriptome | 191 | N |
| F036460 | Metagenome / Metatranscriptome | 170 | Y |
| F037627 | Metagenome | 167 | N |
| F037702 | Metagenome | 167 | Y |
| F039531 | Metagenome / Metatranscriptome | 163 | Y |
| F043287 | Metagenome | 156 | Y |
| F044911 | Metagenome | 153 | Y |
| F047492 | Metagenome | 149 | Y |
| F048157 | Metagenome | 148 | N |
| F048985 | Metagenome / Metatranscriptome | 147 | Y |
| F049485 | Metagenome | 146 | N |
| F061649 | Metagenome | 131 | Y |
| F065407 | Metagenome | 127 | Y |
| F076368 | Metagenome | 118 | Y |
| F078663 | Metagenome | 116 | Y |
| F083895 | Metagenome / Metatranscriptome | 112 | N |
| F091218 | Metagenome | 107 | Y |
| F096599 | Metagenome | 104 | Y |
| F101053 | Metagenome | 102 | Y |
| F102396 | Metagenome | 101 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0136236_1000016 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 31991 | Open in IMG/M |
| Ga0136236_1000035 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 20223 | Open in IMG/M |
| Ga0136236_1000042 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 19318 | Open in IMG/M |
| Ga0136236_1000114 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 11741 | Open in IMG/M |
| Ga0136236_1000129 | Not Available | 11161 | Open in IMG/M |
| Ga0136236_1000136 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 10978 | Open in IMG/M |
| Ga0136236_1000170 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 9946 | Open in IMG/M |
| Ga0136236_1000218 | All Organisms → cellular organisms → Bacteria | 8820 | Open in IMG/M |
| Ga0136236_1000309 | Not Available | 7311 | Open in IMG/M |
| Ga0136236_1000325 | Not Available | 7162 | Open in IMG/M |
| Ga0136236_1000366 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 6707 | Open in IMG/M |
| Ga0136236_1000635 | Not Available | 4891 | Open in IMG/M |
| Ga0136236_1003136 | All Organisms → Viruses → Predicted Viral | 2028 | Open in IMG/M |
| Ga0136236_1003313 | Not Available | 1962 | Open in IMG/M |
| Ga0136236_1003700 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1846 | Open in IMG/M |
| Ga0136236_1003794 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1818 | Open in IMG/M |
| Ga0136236_1009111 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1118 | Open in IMG/M |
| Ga0136236_1009128 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1116 | Open in IMG/M |
| Ga0136236_1020348 | Not Available | 812 | Open in IMG/M |
| Ga0136236_1021206 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 803 | Open in IMG/M |
| Ga0136236_1023940 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 774 | Open in IMG/M |
| Ga0136236_1024321 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 770 | Open in IMG/M |
| Ga0136236_1024575 | Not Available | 768 | Open in IMG/M |
| Ga0136236_1031510 | Not Available | 696 | Open in IMG/M |
| Ga0136236_1035495 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_4_56_7 | 602 | Open in IMG/M |
| Ga0136236_1036187 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 566 | Open in IMG/M |
| Ga0136236_1036468 | Not Available | 553 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0136236_1000016 | Ga0136236_100001621 | F048157 | MATTAEVGTPLKPKEAQPDIGVLDAIKKTRGEDEGKSMNALQVVQASMANQLPPGVTMDTLLRKLASALQDPNNKLVQIGNSAFLVTLVGDGVVEFHTFSAEQPQKLLKNYIGLANTLKKQGIKKATTYSDRPEFVDLAKKSGLPVKVGQSQKMMGNEMKPVYTFELDL* |
| Ga0136236_1000016 | Ga0136236_100001639 | F049485 | MAHSGLQPISNILGSKDIQSLQLWLKRVGASMPDISTVERKIMENQKKPPFVPQEMKGRMTKNTYKKQGSSEPDWKGTFMYQGQVITFGAWENDAGFGPYYNIKLNDPNWNKQQQQYPKEVTDKPARSYPKDSEIPF* |
| Ga0136236_1000016 | Ga0136236_100001642 | F102396 | MLLFKTKRLERLEQEVVMLEDLFAQALQRIDKLEEARWGLKVDGTPKAKPGRKVKDERIS |
| Ga0136236_1000016 | Ga0136236_100001644 | F022833 | MATKLQDSLVRKIQENRELKDTIKDLHTQAEKDKEFLREVQDESNNLELALKKCIHSKAELNDKIEQLTDDLEKYTELYARATLVVTALGEAVFFLTKESNHGK* |
| Ga0136236_1000016 | Ga0136236_100001645 | F037627 | MKPLICVDCKWHLASKSSSLIANYDRCKASENINLVTGESTYKYCESMRLADGECGMDAKLFELNQAEETPNGN* |
| Ga0136236_1000016 | Ga0136236_100001646 | F061649 | MRDQTDCHYPHTRLCLQDCEDGCRARKSVWRKRTIRELEDQELEDEAFARIPQHPYECKHLGICNDRPTHCLDCPSNNA* |
| Ga0136236_1000035 | Ga0136236_100003526 | F078663 | MITWFAIQFTDNSTGYQKMEDGNCIGVYRADGTVISSEEAVEYTCTDMNATAPSWA* |
| Ga0136236_1000035 | Ga0136236_10000358 | F001629 | MILNQGKLAGGLVDELLEVVHKYDETLYMATVIGALELVKQQLIQDSIEDEDE* |
| Ga0136236_1000042 | Ga0136236_100004211 | F043287 | MIIQTQAQELSPEVMDVATKVEILCPACNRDVDTAELAAQKCNECGADLSDPKQHVAVAVTSVPVIGITW* |
| Ga0136236_1000114 | Ga0136236_10001144 | F028495 | MSVQMTETESVVGFPALESNYAVLINNAIGEHRNSDVNYIVRSYNMEKNNITRKSVIERLRDAF* |
| Ga0136236_1000129 | Ga0136236_100012911 | F047492 | MNFDKNIFAQGQSLFTQDEFNKALTEAKAEIMAIAIQTTKQAIFMERKACAQLLMNMAEADDEGAVCTALRNAAEEVMNRIPAQCQ* |
| Ga0136236_1000129 | Ga0136236_100012917 | F018874 | MSEYDQAMLDRMQMLEEALERAETGIATRDDWDVIRFECGIPRRYKTTLETVSIRSEQWL |
| Ga0136236_1000129 | Ga0136236_100012925 | F037702 | MLSEETVKQIFFYCDVYGPDALIADEVDICQFADKIIQYVTPLITQKEHQRCVAIVDDMNKEVGRALLNQRP* |
| Ga0136236_1000129 | Ga0136236_10001295 | F044911 | MTPAELLHKDAARYATNRKIAYIEAFHKGEAEHMSEEALNGRWLAHYEGYREGYWVATGDTKFSTDPIKFNAEGKATL* |
| Ga0136236_1000136 | Ga0136236_10001367 | F021938 | MLCPKCGYSEGNHIETKKTDEEFFLEWWTPTIGEEAAKASWQDKVAMKSRIAPMVMPDIEGHISMADGTWVSSRSKHRENLKRNNCIELGNDVPVSQKTHEFSRKEQEARKRQIAEITYSKLNYR* |
| Ga0136236_1000170 | Ga0136236_10001702 | F048985 | MPNTSQKTVTTSATLLVTANRADQLVYLHSSSGTIYLGNSDVTTSTGYRMDNGDKLTLQLSDNEALYGITNTGTATMMVMATVS* |
| Ga0136236_1000170 | Ga0136236_10001708 | F000506 | MAKLKITRANGEVSEHKITPGVEYAFEIKYGAGISKVLREHERQSEIFWLAWECLRRANVTVTTFGLEFIETLDTVEVLDDAKK* |
| Ga0136236_1000218 | Ga0136236_10002184 | F003640 | MLAVVYDPRYHTFDSWASLMCEAYAGQQLSIPNESTDWQEWAAGLKAIDVFMNEGIPGPYNFNKWQDWAQALVGAVNQPTVE* |
| Ga0136236_1000309 | Ga0136236_100030911 | F023572 | MKTPKSKRGLYYNINKRRKAGLPAKKPGQKGYPTAQAFKRSKRTAKR* |
| Ga0136236_1000325 | Ga0136236_100032512 | F065407 | MFTFDEQYKKYEELTDRTKQAYEFWYNCVLFTWKDLYKFGK* |
| Ga0136236_1000325 | Ga0136236_10003255 | F004517 | MVKMIPTTPMSRKYKKEDAMLRPHTETTLEKQQRERLERRAALANKLKDLDKEVK* |
| Ga0136236_1000366 | Ga0136236_10003662 | F024733 | MQYKKFDQRLHDACDPPARNAVAEWLKNLWLVDALPNPDKYAVDLVLSRKGEHIGYAEVEVRDWGMDFCPYNTIHIAKRKEKLFNHPRTTMYVVTRDLTHAYWIRADKIKDCPVIEVPNTAVAKEEYFYDVPKNLWKYVDLRELF* |
| Ga0136236_1000635 | Ga0136236_10006357 | F096599 | MEINTIIHTEEQVRVSVDQYDEGVWLSLQARGASMYAVLTRAEAEQMLAGLQAILNKEVTA* |
| Ga0136236_1003136 | Ga0136236_10031361 | F018879 | NYTYQIDDSWVMDQTGADNTSALFLNPDVIQWGSLRELGPNNEVFSSADASLDQYIMEGTLIVRNPAGVAVLAAISPTGAAVTTPRPTXXXQVKRYLA* |
| Ga0136236_1003136 | Ga0136236_10031362 | F000857 | MDDDIKVNEEYYSKGILEAGVDGVFRHNDKLFNEVKSGTWSQTFKTPNIEYKVGAVDGVRYVQYDQKNVEEVKQFCKERREFHKVHGTDNPFFAGTAHMMQLPKCFAHEISSKWFANRPWELIKQDREDKILFYAIVNEYYSDFVCHPSGKIPIPYNPSIPTK* |
| Ga0136236_1003313 | Ga0136236_10033133 | F014456 | MASRAYVVDIAGGTATATVQLQAKQTLKQFLVSWVNAAAGKIELSLSATSQIGTAQPDSNVLARVSASAGANTATARFDISQPVVAFQSIYVHCTGAGNLGTAVLS* |
| Ga0136236_1003700 | Ga0136236_10037002 | F004204 | MVIYDPRGLTWDHWCSRMADLFAANQLGTVXXXVTEDKWKDWADGIQGIGYFVNSAVPDPRGFNEWYQWAESLVGIMNVDTRQL* |
| Ga0136236_1003794 | Ga0136236_10037944 | F016373 | MKTTQNQNHSTHLSFSDDGWILIHQGSPLCDYKTTYADVMKVVAYYKITLPEVTWNGNRMEWVSTSTIEEAIAA* |
| Ga0136236_1009111 | Ga0136236_10091111 | F005200 | VDPITIFAACKAAHAGIRECIDLYQDFKKDGKDVGDIVGDIGKNLGAFFTHQESFKEAEKEAKKNPLPKNISINEEAMNRILRQQQLEQMETDLREMIIYQIGMPGLWSKFVEMREVVRKEREKVEREQKKPWSWLLSKDGNSLTNGKFVHRYLLAALSS* |
| Ga0136236_1009128 | Ga0136236_10091282 | F091218 | MDSKSKNRQLMPETAKVVDYFRTVFPKIKVIYAEEGDYRIGKKPNPSKYVVPCFSEPVKPKKGKKK* |
| Ga0136236_1012624 | Ga0136236_10126242 | F010521 | MKLTPSIIKNLYSAIYCMKPFKRWPMPLPEQIKFVIDQDDAMGTYLYDDGEKYEHIITISEKKCGHLSTVIRVLAHEAIHMSRWKTPKWSHHDAEFRRRTKVVSEELGFDPLEL* |
| Ga0136236_1020348 | Ga0136236_10203481 | F010743 | XVAHVNTVLKHLGAGVYAEVADLINLLHGQAKPQIEASNQATPVEPVTTEPPTE* |
| Ga0136236_1021206 | Ga0136236_10212061 | F003392 | MNEYDFGSRDKSQKRKAHRNGDDQIQYLLSAEEQLLQSISARAPLPEVLNGICSALDCQIGNVISLICLPGDDASGLAAIAMNAALFGLYTFCSEGVVAENDELLGSLEVYCCVPRSPSLEEFQLIERAKCLAAIAIKRHNEAGHHDNWCIHGDRPVRGSVLEWPVSMN* |
| Ga0136236_1023940 | Ga0136236_10239401 | F101053 | XXXXAKPSVFSPQGYSSQAEGLKWWDSFFAYIANDTKLAQGFETKDRTWRPDLVWIVNATNFAKIIDGKYQK* |
| Ga0136236_1024321 | Ga0136236_10243211 | F009320 | XXXXREVFEDNYGLNCISGAILNPVNDQGKVTNHKSYGINIMRVGMERGTFKINESCVEFLDEARNYAIDDAGRFSDPDDHIDSARIGILALIQGHGESLVSRANTFQFRRVEAPEGKVQRI* |
| Ga0136236_1024575 | Ga0136236_10245752 | F083895 | HGVTKGLFLCQLTVLRALAKNLARAATKTGLLSSLISINMKDVSIVSPFFGASSSRLALLKSIIAVAVDNNV* |
| Ga0136236_1031510 | Ga0136236_10315102 | F039531 | MTLDLTINEINIILQALGNAPYASVFELIENIRTQAQAQVQ |
| Ga0136236_1035495 | Ga0136236_10354951 | F036460 | VAQTPRFDEIIMQDIRPTDGWLYNVTTGTTEMGTPVEVTQDRFRSVFPNTTKPWARKVANGPGCVGNPCDMIEHQIGWGADRLTWYAEQQFWNTPLFCYDQDMHITAAMEHISQIISKILKPATTAISSNFLRRRHLLWSFVKNVANKNCGVQNSDGVFTFQWTNDANGDEAFFDCSVSPTRVFKLVPQNVDWLKVPQSA |
| Ga0136236_1036187 | Ga0136236_10361872 | F007522 | MSLKEAGLWWVASMVTIIVAYGMFENAKQTHYWRGRKDGWDMHRRMIDNKTNADNN* |
| Ga0136236_1036468 | Ga0136236_10364681 | F076368 | MTEEEEAAQKPARSGKVHVEGQYEYKNLEIAREAAEVACTAAARLKPSRMKTIDDDDWTEHPDRSD* |
| ⦗Top⦘ |