Basic Information | |
---|---|
Family ID | F103270 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 101 |
Average Sequence Length | 38 residues |
Representative Sequence | MTEKIKPIFNSFQEYIEAENMIFQTLHERVTQGTNNKE |
Number of Associated Samples | 76 |
Number of Associated Scaffolds | 101 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Unclassified |
% of genes with valid RBS motifs | 0.00 % |
% of genes near scaffold ends (potentially truncated) | 12.87 % |
% of genes from short scaffolds (< 2000 bps) | 70.30 % |
Associated GOLD sequencing projects | 64 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.50 |
Hidden Markov Model |
---|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Unclassified (52.475 % of family members) |
NCBI Taxonomy ID | N/A |
Taxonomy | N/A |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine (43.564 % of family members) |
Environment Ontology (ENVO) | Unclassified (71.287 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) (88.119 % of family members) |
⦗Top⦘ |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 51.52% β-sheet: 0.00% Coil/Unstructured: 48.48% | Feature Viewer |
|
|||||
Powered by Feature Viewer |
Structure Viewer | |
---|---|
| |
Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.50 |
Powered by PDBe Molstar |
⦗Top⦘ |
Pfam ID | Name | % Frequency in 101 Family Scaffolds |
---|---|---|
PF02796 | HTH_7 | 15.84 |
PF04404 | ERF | 9.90 |
PF04542 | Sigma70_r2 | 7.92 |
PF08279 | HTH_11 | 6.93 |
PF14743 | DNA_ligase_OB_2 | 6.93 |
PF00722 | Glyco_hydro_16 | 0.99 |
PF01068 | DNA_ligase_A_M | 0.99 |
PF03819 | MazG | 0.99 |
PF12684 | DUF3799 | 0.99 |
PF00303 | Thymidylat_synt | 0.99 |
COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds |
---|---|---|---|
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 7.92 |
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 7.92 |
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 7.92 |
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 7.92 |
COG0207 | Thymidylate synthase | Nucleotide transport and metabolism [F] | 0.99 |
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 0.99 |
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 0.99 |
COG2273 | Beta-glucanase, GH16 family | Carbohydrate transport and metabolism [G] | 0.99 |
⦗Top⦘ |
Name | Rank | Taxonomy | Distribution |
Unclassified | root | N/A | 52.48 % |
All Organisms | root | All Organisms | 47.52 % |
Visualization |
---|
Powered by ApexCharts |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300000115|DelMOSum2011_c10027114 | All Organisms → Viruses → Predicted Viral | 2598 | Open in IMG/M |
3300000116|DelMOSpr2010_c10031996 | All Organisms → Viruses → Predicted Viral | 2448 | Open in IMG/M |
3300001349|JGI20160J14292_10023490 | Not Available | 3392 | Open in IMG/M |
3300001450|JGI24006J15134_10027276 | All Organisms → Viruses → Predicted Viral | 2547 | Open in IMG/M |
3300001450|JGI24006J15134_10030332 | All Organisms → Viruses → Predicted Viral | 2378 | Open in IMG/M |
3300001450|JGI24006J15134_10030823 | All Organisms → Viruses → Predicted Viral | 2354 | Open in IMG/M |
3300001450|JGI24006J15134_10030980 | Not Available | 2347 | Open in IMG/M |
3300001450|JGI24006J15134_10047866 | All Organisms → Viruses → Predicted Viral | 1768 | Open in IMG/M |
3300001460|JGI24003J15210_10022078 | All Organisms → Viruses → Predicted Viral | 2409 | Open in IMG/M |
3300001460|JGI24003J15210_10022980 | Not Available | 2351 | Open in IMG/M |
3300001460|JGI24003J15210_10042691 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1561 | Open in IMG/M |
3300001460|JGI24003J15210_10161843 | Not Available | 561 | Open in IMG/M |
3300001472|JGI24004J15324_10069215 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 988 | Open in IMG/M |
3300001589|JGI24005J15628_10175379 | Not Available | 624 | Open in IMG/M |
3300001718|JGI24523J20078_1017914 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 890 | Open in IMG/M |
3300001941|GOS2219_1002120 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1888 | Open in IMG/M |
3300001947|GOS2218_1029840 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1483 | Open in IMG/M |
3300005941|Ga0070743_10283793 | Not Available | 535 | Open in IMG/M |
3300006468|Ga0082251_10062719 | All Organisms → Viruses → Predicted Viral | 1404 | Open in IMG/M |
3300006735|Ga0098038_1013794 | All Organisms → Viruses → Predicted Viral | 3125 | Open in IMG/M |
3300006735|Ga0098038_1119184 | Not Available | 898 | Open in IMG/M |
3300006750|Ga0098058_1049435 | Not Available | 1189 | Open in IMG/M |
3300006752|Ga0098048_1093220 | Not Available | 914 | Open in IMG/M |
3300006921|Ga0098060_1047943 | Not Available | 1268 | Open in IMG/M |
3300006922|Ga0098045_1027290 | Not Available | 1488 | Open in IMG/M |
3300006929|Ga0098036_1217183 | Not Available | 580 | Open in IMG/M |
3300007543|Ga0102853_1076518 | Not Available | 609 | Open in IMG/M |
3300007647|Ga0102855_1064267 | Not Available | 990 | Open in IMG/M |
3300007647|Ga0102855_1072407 | Not Available | 928 | Open in IMG/M |
3300007692|Ga0102823_1138457 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 646 | Open in IMG/M |
3300007862|Ga0105737_1015613 | Not Available | 1724 | Open in IMG/M |
3300007954|Ga0105739_1168813 | Not Available | 524 | Open in IMG/M |
3300008999|Ga0102816_1142115 | Not Available | 742 | Open in IMG/M |
3300009026|Ga0102829_1167192 | Not Available | 707 | Open in IMG/M |
3300009026|Ga0102829_1253638 | Not Available | 579 | Open in IMG/M |
3300009056|Ga0102860_1083218 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 881 | Open in IMG/M |
3300009086|Ga0102812_10151615 | Not Available | 1268 | Open in IMG/M |
3300009507|Ga0115572_10073105 | All Organisms → Viruses → Predicted Viral | 2106 | Open in IMG/M |
3300009543|Ga0115099_10988746 | Not Available | 548 | Open in IMG/M |
3300009593|Ga0115011_10225202 | Not Available | 1396 | Open in IMG/M |
3300010149|Ga0098049_1147281 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 728 | Open in IMG/M |
3300010150|Ga0098056_1129070 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 857 | Open in IMG/M |
3300010392|Ga0118731_103653840 | Not Available | 1380 | Open in IMG/M |
3300010430|Ga0118733_107978596 | Not Available | 548 | Open in IMG/M |
3300017708|Ga0181369_1027443 | All Organisms → Viruses → Predicted Viral | 1356 | Open in IMG/M |
3300017713|Ga0181391_1118637 | Not Available | 593 | Open in IMG/M |
3300017714|Ga0181412_1006893 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia → Escherichia coli | 3625 | Open in IMG/M |
3300017714|Ga0181412_1040093 | Not Available | 1223 | Open in IMG/M |
3300017727|Ga0181401_1013856 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Enterobacteriaceae → Escherichia → Escherichia coli | 2491 | Open in IMG/M |
3300017756|Ga0181382_1162916 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 576 | Open in IMG/M |
3300017758|Ga0181409_1052489 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1254 | Open in IMG/M |
3300017758|Ga0181409_1052766 | Not Available | 1250 | Open in IMG/M |
3300017782|Ga0181380_1187402 | Not Available | 697 | Open in IMG/M |
3300020347|Ga0211504_1015151 | Not Available | 2177 | Open in IMG/M |
3300020438|Ga0211576_10176516 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1146 | Open in IMG/M |
3300021957|Ga0222717_10038097 | Not Available | 3155 | Open in IMG/M |
3300021957|Ga0222717_10052564 | All Organisms → Viruses | 2637 | Open in IMG/M |
(restricted) 3300022920|Ga0233426_10044870 | All Organisms → Viruses → Predicted Viral | 2149 | Open in IMG/M |
(restricted) 3300022920|Ga0233426_10088722 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1385 | Open in IMG/M |
(restricted) 3300024062|Ga0255039_10376394 | Not Available | 612 | Open in IMG/M |
3300024228|Ga0228633_1041331 | All Organisms → Viruses → Predicted Viral | 1191 | Open in IMG/M |
3300024228|Ga0228633_1142188 | Not Available | 536 | Open in IMG/M |
3300024297|Ga0228658_1044044 | Not Available | 1141 | Open in IMG/M |
3300024332|Ga0228659_1023648 | All Organisms → Viruses | 1427 | Open in IMG/M |
3300024343|Ga0244777_10397240 | Not Available | 859 | Open in IMG/M |
3300024346|Ga0244775_10056697 | All Organisms → Viruses | 3395 | Open in IMG/M |
3300024346|Ga0244775_11216290 | Not Available | 586 | Open in IMG/M |
3300024348|Ga0244776_10404910 | Not Available | 904 | Open in IMG/M |
3300025026|Ga0207879_102251 | All Organisms → Viruses → Predicted Viral | 1197 | Open in IMG/M |
3300025048|Ga0207905_1014475 | Not Available | 1343 | Open in IMG/M |
3300025048|Ga0207905_1069415 | Not Available | 516 | Open in IMG/M |
3300025070|Ga0208667_1040727 | Not Available | 785 | Open in IMG/M |
3300025071|Ga0207896_1003388 | Not Available | 2995 | Open in IMG/M |
3300025071|Ga0207896_1004401 | Not Available | 2601 | Open in IMG/M |
3300025071|Ga0207896_1007511 | Not Available | 1970 | Open in IMG/M |
3300025071|Ga0207896_1008291 | Not Available | 1874 | Open in IMG/M |
3300025072|Ga0208920_1017331 | All Organisms → Viruses → Predicted Viral | 1572 | Open in IMG/M |
3300025084|Ga0208298_1055490 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 767 | Open in IMG/M |
3300025086|Ga0208157_1007105 | Not Available | 3911 | Open in IMG/M |
3300025098|Ga0208434_1031238 | All Organisms → Viruses | 1251 | Open in IMG/M |
3300025099|Ga0208669_1001596 | All Organisms → Viruses | 8208 | Open in IMG/M |
3300025120|Ga0209535_1015136 | All Organisms → Viruses → Predicted Viral | 4116 | Open in IMG/M |
3300025120|Ga0209535_1030650 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 2547 | Open in IMG/M |
3300025128|Ga0208919_1089608 | Not Available | 1000 | Open in IMG/M |
3300025137|Ga0209336_10096075 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 844 | Open in IMG/M |
3300025138|Ga0209634_1030796 | All Organisms → Viruses → Predicted Viral | 2843 | Open in IMG/M |
3300025141|Ga0209756_1314681 | Not Available | 546 | Open in IMG/M |
3300025626|Ga0209716_1001049 | Not Available | 22311 | Open in IMG/M |
3300025626|Ga0209716_1040980 | Not Available | 1609 | Open in IMG/M |
3300025849|Ga0209603_1062564 | Not Available | 1861 | Open in IMG/M |
3300027204|Ga0208924_116805 | Not Available | 591 | Open in IMG/M |
3300027506|Ga0208973_1001853 | All Organisms → Viruses | 8959 | Open in IMG/M |
3300027788|Ga0209711_10174128 | All Organisms → Viruses → Predicted Viral | 1013 | Open in IMG/M |
(restricted) 3300027861|Ga0233415_10008720 | All Organisms → Viruses → Predicted Viral | 4011 | Open in IMG/M |
(restricted) 3300027861|Ga0233415_10019707 | All Organisms → Viruses | 2679 | Open in IMG/M |
(restricted) 3300027861|Ga0233415_10023842 | All Organisms → Viruses | 2449 | Open in IMG/M |
(restricted) 3300027861|Ga0233415_10040013 | All Organisms → Viruses → Predicted Viral | 1928 | Open in IMG/M |
3300027906|Ga0209404_10126495 | All Organisms → Viruses | 1535 | Open in IMG/M |
3300028125|Ga0256368_1042437 | Not Available | 808 | Open in IMG/M |
3300028194|Ga0257106_1230579 | Not Available | 626 | Open in IMG/M |
3300031774|Ga0315331_10516166 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 864 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
⦗Top⦘ |
Habitat | Taxonomy | Distribution |
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 43.56% |
Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine | 11.88% |
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 8.91% |
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 5.94% |
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 3.96% |
Pelagic Marine | Environmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine | 3.96% |
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 2.97% |
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 2.97% |
Estuarine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine | 2.97% |
Estuary Water | Environmental → Aquatic → Marine → Coastal → Unclassified → Estuary Water | 1.98% |
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 1.98% |
Marine | Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Marine | 1.98% |
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 0.99% |
Sediment | Environmental → Aquatic → Marine → Oceanic → Sediment → Sediment | 0.99% |
Marine | Environmental → Aquatic → Marine → Coastal → Sediment → Marine | 0.99% |
Marine Sediment | Environmental → Aquatic → Marine → Coastal → Sediment → Marine Sediment | 0.99% |
Sea-Ice Brine | Environmental → Aquatic → Marine → Coastal → Unclassified → Sea-Ice Brine | 0.99% |
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 0.99% |
Pelagic Marine | Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Pelagic Marine | 0.99% |
Visualization |
---|
Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
3300000115 | Marine microbial communities from Delaware Coast, sample from Delaware MO Summer July 2011 | Environmental | Open in IMG/M |
3300000116 | Marine microbial communities from Delaware Coast, sample from Delaware MO Spring March 2010 | Environmental | Open in IMG/M |
3300001349 | Pelagic Microbial community sample from North Sea - COGITO 998_met_10 | Environmental | Open in IMG/M |
3300001450 | Marine viral communities from the Pacific Ocean - LP-53 | Environmental | Open in IMG/M |
3300001460 | Marine viral communities from the Pacific Ocean - LP-28 | Environmental | Open in IMG/M |
3300001472 | Marine viral communities from the Pacific Ocean - LP-32 | Environmental | Open in IMG/M |
3300001589 | Marine viral communities from the Pacific Ocean - LP-40 | Environmental | Open in IMG/M |
3300001718 | Marine viral communities from the Pacific Ocean - LP-48 | Environmental | Open in IMG/M |
3300001941 | Marine microbial communities from Browns Bank, Gulf of Maine - GS003 | Environmental | Open in IMG/M |
3300001947 | Marine microbial communities from the Gulf of Maine, Canada - GS002 | Environmental | Open in IMG/M |
3300005941 | Estuarine microbial communities from the Columbia River estuary, USA - metaG S.697 | Environmental | Open in IMG/M |
3300006468 | Deep-sea sediment bacterial and archaeal communities from Fram Strait - Combined Assembly of Gp0119454, Gp0119453, Gp0119452, Gp0119451 | Environmental | Open in IMG/M |
3300006735 | Marine viral communities from the Subarctic Pacific Ocean - 5B_ETSP_OMZ_AT15132_CsCl metaG | Environmental | Open in IMG/M |
3300006750 | Marine viral communities from the Subarctic Pacific Ocean - 19_ETSP_OMZ_AT15317 metaG | Environmental | Open in IMG/M |
3300006752 | Marine viral communities from the Subarctic Pacific Ocean - 13_ETSP_OMZ_AT15268 metaG | Environmental | Open in IMG/M |
3300006921 | Marine viral communities from the Subarctic Pacific Ocean - 21_ETSP_OMZ_AT15319 metaG | Environmental | Open in IMG/M |
3300006922 | Marine viral communities from the Subarctic Pacific Ocean - 11_ETSP_OMZ_AT15265 metaG | Environmental | Open in IMG/M |
3300006929 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG | Environmental | Open in IMG/M |
3300007543 | Estuarine microbial communities from the Columbia River estuary - metaG 1370B-3 | Environmental | Open in IMG/M |
3300007647 | Estuarine microbial communities from the Columbia River estuary - metaG 1370B-02 | Environmental | Open in IMG/M |
3300007692 | Estuarine microbial communities from the Columbia River estuary - Ebb tide non-ETM metaG S.743 | Environmental | Open in IMG/M |
3300007862 | Coastal water column microbial communities from Columbia River Estuary, Oregon, USA - CMOP_DNA_1373A_0.2um | Environmental | Open in IMG/M |
3300007954 | Coastal water column microbial communities from Columbia River Estuary, Oregon, USA - CMOP_DNA_1373B_0.2um | Environmental | Open in IMG/M |
3300008999 | Estuarine microbial communities from the Columbia River estuary - Flood tide non-ETM metaG S.545 | Environmental | Open in IMG/M |
3300009026 | Estuarine microbial communities from the Columbia River estuary - Freshwater metaG S.575 | Environmental | Open in IMG/M |
3300009056 | Estuarine microbial communities from the Columbia River estuary - metaG 1449A-3 | Environmental | Open in IMG/M |
3300009086 | Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.713 | Environmental | Open in IMG/M |
3300009507 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_120607 | Environmental | Open in IMG/M |
3300009543 | Marine eukaryotic communities from Pacific Ocean to study complex ecological interactions - MBTS_20Mar14_M2_3um Metatranscriptome (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300009593 | Marine eukaryotic phytoplankton communities from Atlantic Ocean - Tropical Atlantic ANT8 Metagenome | Environmental | Open in IMG/M |
3300010149 | Marine viral communities from the Subarctic Pacific Ocean - 13B_ETSP_OMZ_AT15268_CsCl metaG | Environmental | Open in IMG/M |
3300010150 | Marine viral communities from the Subarctic Pacific Ocean - 17B_ETSP_OMZ_AT15314_CsCl metaG | Environmental | Open in IMG/M |
3300010392 | Coastal sediment microbial communities from Rhode Island, USA. Combined Assembly of Gp0121717, Gp0123912, Gp0123935, Gp0139423, Gp0139424, Gp0139388, Gp0139387, Gp0139386, Gp0139385 | Environmental | Open in IMG/M |
3300010430 | Marine sediment microbial communities from Gulf of Thailand under amendment with organic carbon and nitrate - JGI co-assembly of 8 samples | Environmental | Open in IMG/M |
3300017708 | Marine viral communities from the Subarctic Pacific Ocean - Lowphox_04 viral metaG | Environmental | Open in IMG/M |
3300017713 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 14 SPOT_SRF_2010-08-11 | Environmental | Open in IMG/M |
3300017714 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 35 SPOT_SRF_2012-08-15 | Environmental | Open in IMG/M |
3300017727 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 24 SPOT_SRF_2011-07-20 | Environmental | Open in IMG/M |
3300017756 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 5 SPOT_SRF_2009-10-22 | Environmental | Open in IMG/M |
3300017758 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 32 SPOT_SRF_2012-05-30 | Environmental | Open in IMG/M |
3300017782 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 3 SPOT_SRF_2009-08-19 | Environmental | Open in IMG/M |
3300020347 | Marine microbial communities from Tara Oceans - TARA_B100000497 (ERX556109-ERR598994) | Environmental | Open in IMG/M |
3300020438 | Marine microbial communities from Tara Oceans - TARA_B100001094 (ERX555907-ERR598942) | Environmental | Open in IMG/M |
3300021957 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_18D | Environmental | Open in IMG/M |
3300022920 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_10_MG | Environmental | Open in IMG/M |
3300024062 (restricted) | Seawater microbial communities from Strait of Georgia, British Columbia, Canada - BC1_12_1 | Environmental | Open in IMG/M |
3300024228 | Seawater microbial communities from Monterey Bay, California, United States - 41D | Environmental | Open in IMG/M |
3300024297 | Seawater microbial communities from Monterey Bay, California, United States - 71D | Environmental | Open in IMG/M |
3300024332 | Seawater microbial communities from Monterey Bay, California, United States - 73D | Environmental | Open in IMG/M |
3300024343 | Combined assembly of estuarine microbial communities from Columbia River, Washington, USA >3um size fraction | Environmental | Open in IMG/M |
3300024346 | Whole water sample coassembly | Environmental | Open in IMG/M |
3300024348 | 0.2um to 3um size fraction coassembly | Environmental | Open in IMG/M |
3300025026 | Marine viral communities from the Pacific Ocean - LP-24 (SPAdes) | Environmental | Open in IMG/M |
3300025048 | Marine viral communities from the Subarctic Pacific Ocean - LP-49 (SPAdes) | Environmental | Open in IMG/M |
3300025070 | Marine viral communities from the Subarctic Pacific Ocean - 11B_ETSP_OMZ_AT15265_CsCl metaG (SPAdes) | Environmental | Open in IMG/M |
3300025071 | Marine viral communities from the Pacific Ocean - LP-36 (SPAdes) | Environmental | Open in IMG/M |
3300025072 | Marine viral communities from the Subarctic Pacific Ocean - 19_ETSP_OMZ_AT15317 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025084 | Marine viral communities from the Subarctic Pacific Ocean - 14B_ETSP_OMZ_AT15311_CsCl metaG (SPAdes) | Environmental | Open in IMG/M |
3300025086 | Marine viral communities from the Subarctic Pacific Ocean - 5_ETSP_OMZ_AT15132 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025098 | Marine viral communities from the Subarctic Pacific Ocean - 13_ETSP_OMZ_AT15268 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025099 | Marine viral communities from the Subarctic Pacific Ocean - 21_ETSP_OMZ_AT15319 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025120 | Marine viral communities from the Pacific Ocean - LP-28 (SPAdes) | Environmental | Open in IMG/M |
3300025128 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025137 | Marine viral communities from the Pacific Ocean - LP-32 (SPAdes) | Environmental | Open in IMG/M |
3300025138 | Marine viral communities from the Pacific Ocean - LP-40 (SPAdes) | Environmental | Open in IMG/M |
3300025141 | Marine viral communities from the Pacific Ocean - ETNP_6_85 (SPAdes) | Environmental | Open in IMG/M |
3300025626 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_120531 (SPAdes) | Environmental | Open in IMG/M |
3300025849 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_120607 (SPAdes) | Environmental | Open in IMG/M |
3300027204 | Estuarine microbial communities from the Columbia River estuary - metaG 1370B-3 (SPAdes) | Environmental | Open in IMG/M |
3300027506 | Ammonia-oxidizing marine microbial communities from Monterey Bay, California, USA - CAN11_66_BLW_10 (SPAdes) | Environmental | Open in IMG/M |
3300027788 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_88 (SPAdes) | Environmental | Open in IMG/M |
3300027861 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_12_MG | Environmental | Open in IMG/M |
3300027906 | Marine eukaryotic phytoplankton communities from Atlantic Ocean - Tropical Atlantic ANT8 Metagenome (SPAdes) | Environmental | Open in IMG/M |
3300028125 | Sea-ice brine viral communities from Beaufort Sea near Barrow, Alaska, United States - SB | Environmental | Open in IMG/M |
3300028194 | Marine microbial communities from Northeast Subartic Pacific Ocean, Canada - LP_J_2011_P26_10m | Environmental | Open in IMG/M |
3300031774 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 60m 34915 | Environmental | Open in IMG/M |
Geographical Distribution | |
---|---|
Zoom: | Powered by OpenStreetMap |
⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
DelMOSum2011_100271144 | 3300000115 | Marine | MTDKLKPIFNSFQEYIESENMIFTTLHENMTIANKDKE* |
DelMOSpr2010_100319965 | 3300000116 | Marine | MTKKLEPIFNSFQEYIEAENMIFETLEDRVTHGTKVIK* |
JGI20160J14292_100234907 | 3300001349 | Pelagic Marine | MTKKLKPIFDSFQEYIQAEDMIFETLHASMTIADKDK* |
JGI24006J15134_100272764 | 3300001450 | Marine | MTKKLEPIFNSFQEYIEAENMIFETLEARVTHGNKDIK* |
JGI24006J15134_100303326 | 3300001450 | Marine | MTEKIKPIFNSFQEYIEAENMIFETLQDRVTNGTKDIK* |
JGI24006J15134_100308236 | 3300001450 | Marine | MTEKIKPIFNSFQEYIEAENMIFQTLHERVTQGINNKE* |
JGI24006J15134_100309804 | 3300001450 | Marine | MTEKIKPIFDSFQEYIQAEDMIFQTLHERVTNSTKDIE* |
JGI24006J15134_100478664 | 3300001450 | Marine | MTRKLEPIFNSFQEYIEAENMIFETLEDRVTHSTKVIK* |
JGI24003J15210_100220784 | 3300001460 | Marine | MTEKIKPIFDSFQEYIQAEDMIFQTLHERVTHSTKDXX* |
JGI24003J15210_100229804 | 3300001460 | Marine | MTDKIKPIFNSFQEYIQAEDMVFQTLHERVTHSTKDKE* |
JGI24003J15210_100426913 | 3300001460 | Marine | MTEKIKPIFDSFQEYIQAENMIFQTLHERVTQGTNNKE* |
JGI24003J15210_101618432 | 3300001460 | Marine | MTKKLEPIFNSFQEYIEAENMIFETLEDRVTHSTKVIK* |
JGI24004J15324_100692152 | 3300001472 | Marine | MTRKLEPIFNSFQEYIEAENMIFETLEDRVTHSTKVVK* |
JGI24005J15628_101753793 | 3300001589 | Marine | MTDKLKPIYDSFQEYIEAENMIFTTLHESVTTASKDK* |
JGI24523J20078_10179143 | 3300001718 | Marine | MTKKLEPIFNSFQEXIEAENMIFETLEXRVTHGNKDIK* |
GOS2219_10021202 | 3300001941 | Marine | MTEKIKPIFDSFQEYIQAEDMIFQTLHDRVTHSTKDKE* |
GOS2218_10298403 | 3300001947 | Marine | MKKKLEPIFNSFQEYIEAEDMIFKTLEARVTHSIKDIE* |
Ga0070743_102837932 | 3300005941 | Estuarine | MTEKIKPIFNSFQEYIEAENMVFQTLHARVTQGTNNKE* |
Ga0082251_100627192 | 3300006468 | Sediment | MKQKLKPIFDSFQEYIEAENMIFETLEDRVTHGTKVIK* |
Ga0098038_10137946 | 3300006735 | Marine | MTEKIKPIFNSFQEYIQAEDMIFQTLHDRVTHSTKDKE* |
Ga0098038_11191844 | 3300006735 | Marine | MTEKIKPIFNSFQEYIEAEDMVFKTLHARVTQGTNNKE* |
Ga0098058_10494355 | 3300006750 | Marine | MTEKIKPIFNSFQEYIESEDMVFQTLHARVTQGTNNKE* |
Ga0098048_10932203 | 3300006752 | Marine | MTEKIKPIFNSFKEYIQAEDMIFKTLQDRVTHSTKDKE* |
Ga0098060_10479433 | 3300006921 | Marine | MTEKIKPIFDSFQEYIQAEDMIFQTLHERVTQGTNNKE* |
Ga0098045_10272904 | 3300006922 | Marine | MTEKIKPIFDSFQEYIKAEDMIFQTLHERVTHSTKDKE* |
Ga0098036_12171831 | 3300006929 | Marine | MTEKIKPIFNSFQEYIESEDMVFQTLHAGVTQGTNNKE* |
Ga0102853_10765181 | 3300007543 | Estuarine | MTAKLKPIFDSFQEYIKAEDMIFTTLHENMTIASKDKE* |
Ga0102855_10642674 | 3300007647 | Estuarine | MTEKIKPIFDSFQEYIQAEDMIFQALHERVTHSTKDKE* |
Ga0102855_10724073 | 3300007647 | Estuarine | MTEKIKPIFNSFQEYIEAENMVFKTLHARVTQGTNNKE* |
Ga0102823_11384572 | 3300007692 | Estuarine | MTEKIKPIFNSFQEYIEAENMIFQTLHERVTQGTNNKE* |
Ga0105737_10156136 | 3300007862 | Estuary Water | MTKKLEPIFNSFQEYIEAENMIFETLEARVTHGIKGIE* |
Ga0105739_11688133 | 3300007954 | Estuary Water | MTAKLKPIFDSFQEYIKAEDMIFTSLHENMTIASKDKE* |
Ga0102816_11421152 | 3300008999 | Estuarine | MTEKIKPIFNSFQEYIQAEDMIFQTLHERVTHSTKDKE* |
Ga0102829_11671922 | 3300009026 | Estuarine | MTEKIKPIFDSFQEYIQAEDMIFQTLHERVTHSTKDKE* |
Ga0102829_12536381 | 3300009026 | Estuarine | MTEKIKPIFNSFQEYIESENMIFTTLHENMTIANKDKE* |
Ga0102860_10832183 | 3300009056 | Estuarine | MTEKIKPIFDSFQEYIEAENMVFKTLHARVTQGTNNKE* |
Ga0102812_101516151 | 3300009086 | Estuarine | FKTPMTEKIKPIFNSFQEYIEAENMVFQTLHARVTQGTNNKE* |
Ga0115572_100731054 | 3300009507 | Pelagic Marine | MTKKLEPIFNSFQEYIEAENMIFETLEPRVTHGNKDIK* |
Ga0115099_109887463 | 3300009543 | Marine | MTEKIKPIFNSFQEYIETENMIFQTLHERVTQGTNNKE* |
Ga0115011_102252022 | 3300009593 | Marine | MTDKIKPIFDSFQEYIEAEDMIFETLHERVTQACSTKE* |
Ga0098049_11472811 | 3300010149 | Marine | EKIKPIFNSFQEYIESEDMVFQTLHARVTQGTNNKE* |
Ga0098056_11290703 | 3300010150 | Marine | PMTEKIKPIFNSFQEYIQAEDMIFQTLHERVTQGTNNKE* |
Ga0118731_1036538403 | 3300010392 | Marine | MTEKIKPIFNSFQEYIEAEDMIFETLQDRVTNGTKDIK* |
Ga0118733_1079785963 | 3300010430 | Marine Sediment | MTKKLEPIFNSFQEYIEAEDMIFETLQDRVTNGTK |
Ga0181369_10274431 | 3300017708 | Marine | FKKFMTEKIKPIFNSFQEYIQAEDMIFQTLHDRVTHSTKDKE |
Ga0181391_11186372 | 3300017713 | Seawater | MTEKIKPIFNSFQEYIQAENMIFQTLHERVTQGTNNKE |
Ga0181412_10068934 | 3300017714 | Seawater | MTEKIKPIFNSFQEYIESENMIFTTLHENMTLASKDKE |
Ga0181412_10400932 | 3300017714 | Seawater | MTEKIKPIFDSFQEYIKAEDMIFTTLHENMTIASKDKE |
Ga0181401_10138565 | 3300017727 | Seawater | MTAKIQPIFNSFQEYIEAEDMIFKTLHEGVTQGTNNKE |
Ga0181382_11629161 | 3300017756 | Seawater | EKIKPIFNSFQEYIEAENMIFQTLHERVTQGTNNKE |
Ga0181409_10524893 | 3300017758 | Seawater | EKIKPIFDSFQEYIQAENMIFQTLHERVTQGTNNKE |
Ga0181409_10527664 | 3300017758 | Seawater | MTEKIKPIFNSFQEYIETENMIFQTLHERVTKGTNNKE |
Ga0181380_11874022 | 3300017782 | Seawater | MTEKIKPIFDSFQEYIQAEDMIFKTLHDRVTHSTKDKE |
Ga0211504_10151513 | 3300020347 | Marine | MTEKIKPIFDSFQEYIKAEDMIFTTLHENMTTASKDKE |
Ga0211576_101765163 | 3300020438 | Marine | MTEKIKPIFNSFQEYIETENMIFQTLHERVTQGTNNKE |
Ga0222717_100380974 | 3300021957 | Estuarine Water | MTEKIKPIFNSFQEYIQAEDMIFQTLHERVTHSTKDKE |
Ga0222717_100525643 | 3300021957 | Estuarine Water | MTEKIKPIFDSFQEYIQAENMIFQTLHERVTQGTNNKE |
(restricted) Ga0233426_100448704 | 3300022920 | Seawater | MTEKIKPIFNSFQEYIEAENMVFKTLHARVTQGTNNKE |
(restricted) Ga0233426_100887224 | 3300022920 | Seawater | MTEKIKPIFNSFQEYIETENMIFQTLHERVTRGTNNKE |
(restricted) Ga0255039_103763942 | 3300024062 | Seawater | MKKKLEPIFNSFQEYIEAEDMIFKTLEARVTHSIKDIE |
Ga0228633_10413312 | 3300024228 | Seawater | MTEKIKPIFNSFKEYIQAEDMIFKTLQDTVTHSTKDKE |
Ga0228633_11421882 | 3300024228 | Seawater | MTEKIKPIFDSFQEYIKAEDMIFQTLHDRVTHSTKDKE |
Ga0228658_10440444 | 3300024297 | Seawater | MTEKIKPIFNSFKEYIQAEDMIFKTLQDTVTHNTKDKE |
Ga0228659_10236483 | 3300024332 | Seawater | KKLMTEKIKPIFNSFKEYIQAEDMIFKTLQDTVTHSTKDKE |
Ga0244777_103972402 | 3300024343 | Estuarine | MTEKIKPIFDSFQEYIQAEDMIFQTLHERVTHSTKDKE |
Ga0244775_100566973 | 3300024346 | Estuarine | MTEKIKPIFNSFQEYIEAENMVFQTLHARVTQGTNNKE |
Ga0244775_112162902 | 3300024346 | Estuarine | MTEKIKPIFNSFQEYIESENMIFTTLHENMTIANKDKE |
Ga0244776_104049103 | 3300024348 | Estuarine | TPMTEKIKPIFNSFQEYIEAENMVFKTLHARVTQGTNNKE |
Ga0207879_1022514 | 3300025026 | Marine | MTKKLEPIFNSFQEYIEAENMIFETLEDRVTHGTKVIK |
Ga0207905_10144752 | 3300025048 | Marine | MTEKIKPIFNSFQEYIEAENMIFQTLHERVTQGINNKE |
Ga0207905_10694152 | 3300025048 | Marine | MTEKIKPIFDSFQEYIQAEDMIFQTLHERVTNSTKDIE |
Ga0208667_10407272 | 3300025070 | Marine | MTEKIKPIFNSFQEYIEAEDMVFKTLHARVTQGTNNKE |
Ga0207896_10033883 | 3300025071 | Marine | MTKKLEPIFNSFQEYIEAENMIFETLEARVTHGNKDIK |
Ga0207896_10044013 | 3300025071 | Marine | MTKKLEPIFNSFQEYIEAENMIFETLEARVTHGIKGIE |
Ga0207896_10075114 | 3300025071 | Marine | MTEKIKPIFNSFQEYIEAENMIFETLQDRVTNGTKDIK |
Ga0207896_10082913 | 3300025071 | Marine | MTDKLKPIYDSFQEYIEAENMIFTTLHESVTTASKDK |
Ga0208920_10173311 | 3300025072 | Marine | MTEKIKPIFNSFQEYIESEDMVFQTLHARVTQGTNNKE |
Ga0208298_10554901 | 3300025084 | Marine | SFKKPMTEKIKPIFNSFQEYIESEDMVFQTLHARVTQGTNNKE |
Ga0208157_10071057 | 3300025086 | Marine | MTEKIKPIFNSFQEYIQAEDMIFQTLHDRVTHSTKDKE |
Ga0208434_10312383 | 3300025098 | Marine | KFMTNKIKPIFDSFQEYIQAEDMIFQTLHERVTQGTNNKE |
Ga0208669_100159614 | 3300025099 | Marine | MTEKIKPIFDSFQEYIQAEDMIFQTLHERVTQGTNNKE |
Ga0209535_10151366 | 3300025120 | Marine | MTEKIKPIFNSFQEYIEAENMIFQTLHERVTQGTNNKE |
Ga0209535_10306504 | 3300025120 | Marine | MTDKIKPIFNSFQEYIQAEDMVFQTLHERVTHSTKDKE |
Ga0208919_10896082 | 3300025128 | Marine | MTEKIKPIFNSFQEYIESEDMVFQTLHAGVTQGTNNKE |
Ga0209336_100960752 | 3300025137 | Marine | MTRKLEPIFNSFQEYIEAENMIFETLEDRVTHSTKVVK |
Ga0209634_10307963 | 3300025138 | Marine | MTRKLEPIFNSFQEYIEAENMIFETLEDRVTHSTKVIK |
Ga0209756_13146812 | 3300025141 | Marine | MTEKIKPIFNSFQEYIEAEDMIFETLHDRVTQASSTKE |
Ga0209716_100104921 | 3300025626 | Pelagic Marine | MTKKLKPIFDSFQEYIQAEDMIFETLHASMTIADKDK |
Ga0209716_10409803 | 3300025626 | Pelagic Marine | MTDKLKPIFNSFQEYIESEDMIFTTLHENMTIASKDKE |
Ga0209603_10625643 | 3300025849 | Pelagic Marine | MTKKLEPIFNSFQEYIEAENMIFETLEPRVTHGNKDIK |
Ga0208924_1168052 | 3300027204 | Estuarine | MTAKLKPIFDSFQEYIQAEDMIFQTLHERVTHSTKDKE |
Ga0208973_100185315 | 3300027506 | Marine | MTAKLKPIFDSFQEYIKAEDMIFTTLHENMTIASKDKE |
Ga0209711_101741282 | 3300027788 | Marine | MTKKLEPIFNSFQEYIEAENMIFETLEPRVTHGSKDIK |
(restricted) Ga0233415_100087206 | 3300027861 | Seawater | MTRKLEPIFNSFQEYIEAENMIFETLEDRVTHGTKVIK |
(restricted) Ga0233415_100197073 | 3300027861 | Seawater | MTNKIKPIFNSFQEYIEAEDMIFKTLQERVTHSTKDKE |
(restricted) Ga0233415_100238423 | 3300027861 | Seawater | MTEKIKPIFNSFQEYIETENMIFQTLHEMVTRGANNKE |
(restricted) Ga0233415_100400133 | 3300027861 | Seawater | MTKKLEPIFNSFQEYIEAENMIFETLEPRVTHGSKYIE |
Ga0209404_101264953 | 3300027906 | Marine | MTDKIKPIFDSFQEYIEAEDMIFETLHERVTQACSTKE |
Ga0256368_10424372 | 3300028125 | Sea-Ice Brine | MTKKLEPIFNSFQEYIEAENMIFETLEPRVTHGIKDIE |
Ga0257106_12305791 | 3300028194 | Marine | KKLMTDKLKPIYDSFQEYIEAENMIFTTLHESVTTASKDK |
Ga0315331_105161662 | 3300031774 | Seawater | MTDKLQPIFNSFKEYIESENMIFTTLHENMTIASKDKE |
⦗Top⦘ |