Basic Information | |
---|---|
Family ID | F090331 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 108 |
Average Sequence Length | 41 residues |
Representative Sequence | MSIALAIAIYTALVAFVSSLMVYYFKVMYPREEAQLKEKSK |
Number of Associated Samples | 56 |
Number of Associated Scaffolds | 108 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Unclassified |
% of genes with valid RBS motifs | 24.07 % |
% of genes near scaffold ends (potentially truncated) | 17.59 % |
% of genes from short scaffolds (< 2000 bps) | 61.11 % |
Associated GOLD sequencing projects | 52 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.49 |
Most Common Taxonomy | |
---|---|
Group | Unclassified (43.519 % of family members) |
NCBI Taxonomy ID | N/A |
Taxonomy | N/A |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake (22.222 % of family members) |
Environment Ontology (ENVO) | Unclassified (37.037 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) (44.444 % of family members) |
Predicted Topology & Secondary Structure | |
---|---|
Classification | Transmembrane (alpha-helical) |
Signal Peptide | No |
Secondary Structure distribution | α-helix: 56.52%, β-sheet: 0.00%, Coil/Unstructured: 43.48% |
Pfam ID | Name | % Frequency in 108 Family Scaffolds |
---|---|---|
PF02086 | MethyltransfD12 | 7.41 |
PF04851 | ResIII | 6.48 |
PF01555 | N6_N4_Mtase | 3.70 |
PF07460 | NUMOD3 | 1.85 |
PF07669 | Eco57I | 0.93 |
PF04965 | GPW_gp25 | 0.93 |
PF01541 | GIY-YIG | 0.93 |
PF14328 | DUF4385 | 0.93 |
PF13392 | HNH_3 | 0.93 |
PF13640 | 2OG-FeII_Oxy_3 | 0.93 |
PF02672 | CP12 | 0.93 |
PF13759 | 2OG-FeII_Oxy_5 | 0.93 |
PF00856 | SET | 0.93 |
PF13661 | 2OG-FeII_Oxy_4 | 0.93 |
PF02945 | Endonuclease_7 | 0.93 |
PF05433 | Rick_17kDa_Anti | 0.93 |
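
The frequencies in the Pfam table are percentages of the 108 family scaffolds, so each value can be converted back to an approximate scaffold count. The following is a minimal Python sketch, assuming the percentages were rounded to two decimals from whole-number counts; the helper name `scaffold_count` is hypothetical and not part of the source database.

```python
# Total taken from "Number of Associated Scaffolds" in the Basic Information table.
TOTAL_SCAFFOLDS = 108


def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count.

    Assumption: the percentage was derived from an integer count out of `total`.
    """
    return round(percent / 100 * total)


# A few Pfam rows copied from the table above (ID -> % frequency).
pfam_percent = {
    "PF02086": 7.41,
    "PF04851": 6.48,
    "PF01555": 3.70,
    "PF07460": 1.85,
    "PF07669": 0.93,
}

pfam_counts = {pfam_id: scaffold_count(pct) for pfam_id, pct in pfam_percent.items()}
print(pfam_counts)
```

The same conversion applies to the other percentage columns that are explicitly stated over 108 scaffolds, such as the COG table.
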
COG ID | Name | Functional Category | % Frequency in 108 Family Scaffolds |
---|---|---|---|
COG0338 | DNA-adenine methylase | Replication, recombination and repair [L] | 7.41 |
COG3392 | Adenine-specific DNA methylase | Replication, recombination and repair [L] | 7.41 |
COG0863 | DNA modification methylase | Replication, recombination and repair [L] | 3.70 |
COG1041 | tRNA G10 N-methylase Trm11 | Translation, ribosomal structure and biogenesis [J] | 3.70 |
COG2189 | Adenine specific DNA methylase Mod | Replication, recombination and repair [L] | 3.70 |
Name | Rank | Taxonomy | Distribution |
---|---|---|---|
All Organisms | root | All Organisms | 56.48 % |
Unclassified | root | N/A | 43.52 % |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300001213|JGIcombinedJ13530_106933571 | Not Available | 537 | Open in IMG/M |
3300002835|B570J40625_100761444 | Not Available | 857 | Open in IMG/M |
3300005512|Ga0074648_1026126 | All Organisms → Viruses → Predicted Viral | 3081 | Open in IMG/M |
3300005527|Ga0068876_10008710 | Not Available | 6795 | Open in IMG/M |
3300005527|Ga0068876_10021807 | All Organisms → Viruses → Predicted Viral | 4048 | Open in IMG/M |
3300005662|Ga0078894_10049853 | All Organisms → Viruses → Predicted Viral | 3544 | Open in IMG/M |
3300005805|Ga0079957_1004100 | Not Available | 12201 | Open in IMG/M |
3300005805|Ga0079957_1004709 | Not Available | 11279 | Open in IMG/M |
3300005805|Ga0079957_1064873 | All Organisms → Viruses → Predicted Viral | 2142 | Open in IMG/M |
3300006637|Ga0075461_10129380 | All Organisms → Viruses | 782 | Open in IMG/M |
3300006802|Ga0070749_10053686 | All Organisms → Viruses → Predicted Viral | 2449 | Open in IMG/M |
3300006802|Ga0070749_10110920 | All Organisms → Viruses | 1616 | Open in IMG/M |
3300006802|Ga0070749_10675901 | Not Available | 553 | Open in IMG/M |
3300007538|Ga0099851_1228950 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CBM2 | 670 | Open in IMG/M |
3300007541|Ga0099848_1100059 | Not Available | 1113 | Open in IMG/M |
3300007541|Ga0099848_1113027 | All Organisms → Viruses → Predicted Viral | 1032 | Open in IMG/M |
3300007541|Ga0099848_1253588 | Not Available | 614 | Open in IMG/M |
3300007542|Ga0099846_1284261 | Not Available | 569 | Open in IMG/M |
3300007545|Ga0102873_1219129 | Not Available | 571 | Open in IMG/M |
3300007600|Ga0102920_1153359 | Not Available | 738 | Open in IMG/M |
3300007644|Ga0102902_1055691 | All Organisms → Viruses → Predicted Viral | 1184 | Open in IMG/M |
3300008448|Ga0114876_1078407 | All Organisms → Viruses → Predicted Viral | 1380 | Open in IMG/M |
3300010293|Ga0116204_1036187 | All Organisms → Viruses → Predicted Viral | 1905 | Open in IMG/M |
3300010354|Ga0129333_10002313 | Not Available | 17699 | Open in IMG/M |
3300010354|Ga0129333_10009562 | Not Available | 9162 | Open in IMG/M |
3300010354|Ga0129333_10009823 | Not Available | 9037 | Open in IMG/M |
3300010354|Ga0129333_10032632 | All Organisms → Viruses → Predicted Viral | 4916 | Open in IMG/M |
3300010354|Ga0129333_10077606 | All Organisms → Viruses → Predicted Viral | 3083 | Open in IMG/M |
3300010354|Ga0129333_10096377 | All Organisms → Viruses → Predicted Viral | 2734 | Open in IMG/M |
3300010354|Ga0129333_10118829 | All Organisms → Viruses → Predicted Viral | 2436 | Open in IMG/M |
3300010354|Ga0129333_10140225 | All Organisms → Viruses → Predicted Viral | 2221 | Open in IMG/M |
3300010354|Ga0129333_10151302 | All Organisms → Viruses → Predicted Viral | 2129 | Open in IMG/M |
3300010354|Ga0129333_10176765 | All Organisms → Viruses → Predicted Viral | 1950 | Open in IMG/M |
3300010354|Ga0129333_10547958 | All Organisms → Viruses → Predicted Viral | 1009 | Open in IMG/M |
3300010354|Ga0129333_10579062 | Not Available | 976 | Open in IMG/M |
3300010354|Ga0129333_10644816 | Not Available | 915 | Open in IMG/M |
3300010354|Ga0129333_11217571 | Not Available | 625 | Open in IMG/M |
3300010354|Ga0129333_11380655 | Not Available | 580 | Open in IMG/M |
3300010354|Ga0129333_11513111 | Not Available | 549 | Open in IMG/M |
3300010354|Ga0129333_11746069 | Not Available | 505 | Open in IMG/M |
3300010370|Ga0129336_10162418 | All Organisms → Viruses → Predicted Viral | 1286 | Open in IMG/M |
3300010370|Ga0129336_10506451 | Not Available | 650 | Open in IMG/M |
3300010389|Ga0136549_10002666 | Not Available | 13372 | Open in IMG/M |
3300011984|Ga0119931_1010744 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Tamkungvirus → Tamkungvirus ST4 | 1006 | Open in IMG/M |
3300013087|Ga0163212_1118058 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → unclassified Kyanoviridae → Synechococcus phage S-H38 | 847 | Open in IMG/M |
(restricted) 3300013126|Ga0172367_10025263 | Not Available | 5396 | Open in IMG/M |
3300020074|Ga0194113_10004152 | Not Available | 19063 | Open in IMG/M |
3300020074|Ga0194113_10099282 | All Organisms → Viruses → Predicted Viral | 2558 | Open in IMG/M |
3300020074|Ga0194113_10408597 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 995 | Open in IMG/M |
3300020074|Ga0194113_10433063 | Not Available | 958 | Open in IMG/M |
3300020074|Ga0194113_10803639 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → Bellamyvirus → unclassified Bellamyvirus → Synechococcus phage S-SM2 | 643 | Open in IMG/M |
3300020083|Ga0194111_10043213 | All Organisms → Viruses → Predicted Viral | 4076 | Open in IMG/M |
3300020083|Ga0194111_10091890 | All Organisms → Viruses → Predicted Viral | 2466 | Open in IMG/M |
3300020109|Ga0194112_10816618 | All Organisms → Viruses | 608 | Open in IMG/M |
3300020151|Ga0211736_10070449 | All Organisms → Viruses → Predicted Viral | 4110 | Open in IMG/M |
3300020151|Ga0211736_10392355 | All Organisms → Viruses → Predicted Viral | 1685 | Open in IMG/M |
3300020151|Ga0211736_10691616 | Not Available | 966 | Open in IMG/M |
3300020161|Ga0211726_10093911 | Not Available | 6310 | Open in IMG/M |
3300020172|Ga0211729_10398929 | All Organisms → Viruses → Predicted Viral | 1139 | Open in IMG/M |
3300020179|Ga0194134_10020883 | All Organisms → Viruses → Predicted Viral | 4340 | Open in IMG/M |
3300020183|Ga0194115_10028695 | All Organisms → Viruses → Predicted Viral | 4008 | Open in IMG/M |
3300020183|Ga0194115_10227914 | All Organisms → Viruses | 897 | Open in IMG/M |
3300020183|Ga0194115_10331699 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → Bellamyvirus → unclassified Bellamyvirus → Synechococcus phage S-SM2 | 680 | Open in IMG/M |
3300020190|Ga0194118_10015258 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 6228 | Open in IMG/M |
3300020190|Ga0194118_10030928 | All Organisms → Viruses → Predicted Viral | 3801 | Open in IMG/M |
3300020190|Ga0194118_10521455 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → Bellamyvirus → unclassified Bellamyvirus → Synechococcus phage S-SM2 | 564 | Open in IMG/M |
3300020190|Ga0194118_10524872 | All Organisms → Viruses | 561 | Open in IMG/M |
3300020204|Ga0194116_10032514 | All Organisms → Viruses → Predicted Viral | 4014 | Open in IMG/M |
3300020204|Ga0194116_10378084 | Not Available | 709 | Open in IMG/M |
3300020204|Ga0194116_10521399 | Not Available | 553 | Open in IMG/M |
3300020214|Ga0194132_10161622 | All Organisms → Viruses → Predicted Viral | 1314 | Open in IMG/M |
3300020214|Ga0194132_10261373 | Not Available | 938 | Open in IMG/M |
3300020221|Ga0194127_10831364 | Not Available | 567 | Open in IMG/M |
3300021323|Ga0210295_1107539 | All Organisms → Viruses → Predicted Viral | 2200 | Open in IMG/M |
3300021376|Ga0194130_10308188 | Not Available | 876 | Open in IMG/M |
3300021959|Ga0222716_10240649 | All Organisms → Viruses → Predicted Viral | 1122 | Open in IMG/M |
3300021960|Ga0222715_10006453 | Not Available | 9913 | Open in IMG/M |
3300021960|Ga0222715_10010331 | Not Available | 7495 | Open in IMG/M |
3300021961|Ga0222714_10003411 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 16209 | Open in IMG/M |
3300021961|Ga0222714_10007828 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 9700 | Open in IMG/M |
3300021961|Ga0222714_10026802 | All Organisms → Viruses → Predicted Viral | 4367 | Open in IMG/M |
3300021961|Ga0222714_10081323 | All Organisms → Viruses → Predicted Viral | 2111 | Open in IMG/M |
3300021961|Ga0222714_10102375 | All Organisms → Viruses → Predicted Viral | 1808 | Open in IMG/M |
3300021962|Ga0222713_10103362 | All Organisms → Viruses → Predicted Viral | 2038 | Open in IMG/M |
3300021962|Ga0222713_10577890 | Not Available | 659 | Open in IMG/M |
3300021963|Ga0222712_10000386 | All Organisms → Viruses | 60081 | Open in IMG/M |
3300022198|Ga0196905_1171688 | Not Available | 551 | Open in IMG/M |
3300022198|Ga0196905_1188456 | Not Available | 520 | Open in IMG/M |
3300022200|Ga0196901_1070261 | All Organisms → Viruses → Predicted Viral | 1267 | Open in IMG/M |
3300024289|Ga0255147_1000012 | All Organisms → Viruses | 162364 | Open in IMG/M |
3300024487|Ga0255222_1064122 | Not Available | 556 | Open in IMG/M |
3300025283|Ga0208048_1078487 | Not Available | 726 | Open in IMG/M |
3300025647|Ga0208160_1167691 | Not Available | 520 | Open in IMG/M |
3300025828|Ga0208547_1163694 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CBM2 | 625 | Open in IMG/M |
(restricted) 3300028043|Ga0233417_10214758 | Not Available | 849 | Open in IMG/M |
3300029930|Ga0119944_1001480 | All Organisms → Viruses → Predicted Viral | 4147 | Open in IMG/M |
3300029930|Ga0119944_1005718 | All Organisms → Viruses → Predicted Viral | 1994 | Open in IMG/M |
3300029930|Ga0119944_1033213 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae | 657 | Open in IMG/M |
3300029930|Ga0119944_1040247 | Not Available | 580 | Open in IMG/M |
3300029930|Ga0119944_1045687 | Not Available | 533 | Open in IMG/M |
3300029933|Ga0119945_1006668 | All Organisms → Viruses → Predicted Viral | 1575 | Open in IMG/M |
3300031857|Ga0315909_10213623 | All Organisms → Viruses → Predicted Viral | 1516 | Open in IMG/M |
3300031857|Ga0315909_10290437 | All Organisms → Viruses → Predicted Viral | 1229 | Open in IMG/M |
3300031857|Ga0315909_10811211 | Not Available | 590 | Open in IMG/M |
3300033418|Ga0316625_101320965 | Not Available | 670 | Open in IMG/M |
3300033521|Ga0316616_102973458 | Not Available | 639 | Open in IMG/M |
3300034066|Ga0335019_0000126 | Not Available | 45778 | Open in IMG/M |
3300034103|Ga0335030_0208277 | All Organisms → Viruses → Predicted Viral | 1357 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Habitat | Taxonomy | Distribution |
---|---|---|
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 22.22% |
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 17.59% |
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 12.96% |
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 10.19% |
Aquatic | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Aquatic | 5.56% |
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 4.63% |
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 4.63% |
Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine | 3.70% |
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 2.78% |
Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake | 2.78% |
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 2.78% |
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 1.85% |
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.85% |
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Epilimnion → Freshwater | 0.93% |
Anoxic Lake Water | Environmental → Aquatic → Freshwater → Lake → Unclassified → Anoxic Lake Water | 0.93% |
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.93% |
Drinking Water Treatment Plant | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Drinking Water Treatment Plant | 0.93% |
Wetland | Environmental → Aquatic → Marine → Wetlands → Sediment → Wetland | 0.93% |
Saline Water And Sediment | Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Epilimnion → Saline Water And Sediment | 0.93% |
Marine Methane Seep Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Marine Methane Seep Sediment | 0.93% |
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
3300001213 | Combined assembly of wetland microbial communities from Twitchell Island in the Sacramento Delta (Jan 2013 JGI Velvet Assembly) | Environmental | Open in IMG/M |
3300002835 | Freshwater microbial communities from Lake Mendota, WI - (Lake Mendota Combined Ray assembly, ASSEMBLY_DATE=20140605) | Environmental | Open in IMG/M |
3300005512 | Saline surface water microbial communities from Etoliko Lagoon, Greece - halocline_water | Environmental | Open in IMG/M |
3300005527 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel5S_2200h metaG | Environmental | Open in IMG/M |
3300005662 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MLB.SD (version 4) | Environmental | Open in IMG/M |
3300005805 | Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USA | Environmental | Open in IMG/M |
3300006637 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_15_>0.8_DNA | Environmental | Open in IMG/M |
3300006802 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 | Environmental | Open in IMG/M |
3300007538 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG | Environmental | Open in IMG/M |
3300007541 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG | Environmental | Open in IMG/M |
3300007542 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG | Environmental | Open in IMG/M |
3300007545 | Estuarine microbial communities from the Columbia River estuary - metaG 1547B-3 | Environmental | Open in IMG/M |
3300007600 | Estuarine microbial communities from the Columbia River estuary - metaG 1568A-3 | Environmental | Open in IMG/M |
3300007644 | Estuarine microbial communities from the Columbia River estuary - metaG 1555B-02 | Environmental | Open in IMG/M |
3300008448 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - August 4, 2014 all contigs | Environmental | Open in IMG/M |
3300010293 | Anoxic lake water microbial communities from Lake Kivu, Rwanda to study Microbial Dark Matter (Phase II) - Lake Kivu water 52m metaG | Environmental | Open in IMG/M |
3300010354 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.8_DNA | Environmental | Open in IMG/M |
3300010370 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_DNA | Environmental | Open in IMG/M |
3300010389 | Marine sediment microbial communities from methane seeps within Baltimore Canyon, US Atlantic Margin - Baltimore Canyon MUC-11 12-14 cmbsf | Environmental | Open in IMG/M |
3300011984 | Freshwater microbial communities from drinking water treatment plant - The University of Hong Kong - Raw_water_201107 | Environmental | Open in IMG/M |
3300013087 | Freshwater microbial communities from Lake Malawi, Central Region, Malawi to study Microbial Dark Matter (Phase II) - Malawi_45m_30L | Environmental | Open in IMG/M |
3300013126 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m | Environmental | Open in IMG/M |
3300020074 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015017 Mahale Deep Cast 200m | Environmental | Open in IMG/M |
3300020083 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015033 Kigoma Deep Cast 300m | Environmental | Open in IMG/M |
3300020109 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015016 Mahale Deep Cast 400m | Environmental | Open in IMG/M |
3300020151 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_202 megahit1 | Environmental | Open in IMG/M |
3300020161 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_101 megahit1 | Environmental | Open in IMG/M |
3300020172 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_102 megahit1 | Environmental | Open in IMG/M |
3300020179 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015056 Kigoma Offshore 0m | Environmental | Open in IMG/M |
3300020183 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015002 Mahale S4 surface | Environmental | Open in IMG/M |
3300020190 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015013 Mahale N5 surface | Environmental | Open in IMG/M |
3300020204 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015008 Mahale S9 surface | Environmental | Open in IMG/M |
3300020214 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015054 Kigoma Offshore 80m | Environmental | Open in IMG/M |
3300020221 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015036 Kigoma Deep Cast 100m | Environmental | Open in IMG/M |
3300021323 | Metatranscriptome of estuarine water microbial communities from the Columbia River estuary, Oregon, United States - R9.63AS (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300021376 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015050 Kigoma 12 surface | Environmental | Open in IMG/M |
3300021959 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_13D | Environmental | Open in IMG/M |
3300021960 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_9D | Environmental | Open in IMG/M |
3300021961 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_3D | Environmental | Open in IMG/M |
3300021962 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_649D | Environmental | Open in IMG/M |
3300021963 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_657D | Environmental | Open in IMG/M |
3300022198 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v3) | Environmental | Open in IMG/M |
3300022200 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v3) | Environmental | Open in IMG/M |
3300024289 | Freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Miss_RepA_8h | Environmental | Open in IMG/M |
3300024487 | Metatranscriptome of freshwater microbial communities from Columbia River, Oregon, United States - Colum_Cont_RepB_0h (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300025283 | Freshwater microbial communities from Lake Malawi, Central Region, Malawi to study Microbial Dark Matter (Phase II) - Malawi_45m_30L (SPAdes) | Environmental | Open in IMG/M |
3300025647 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (SPAdes) | Environmental | Open in IMG/M |
3300025828 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_22_N_<0.8_DNA (SPAdes) | Environmental | Open in IMG/M |
3300028043 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MG | Environmental | Open in IMG/M |
3300029930 | Aquatic microbial communities from drinking water treatment plant in Pearl River Delta area, China - influent_20120727 | Environmental | Open in IMG/M |
3300029933 | Aquatic microbial communities from drinking water treatment plant in Pearl River Delta area, China - influent_20120727_2 | Environmental | Open in IMG/M |
3300031857 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125 | Environmental | Open in IMG/M |
3300033418 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_T1_C1_D1_A | Environmental | Open in IMG/M |
3300033521 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D1_B | Environmental | Open in IMG/M |
3300034066 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME11Jul2017-rr0087 | Environmental | Open in IMG/M |
3300034103 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME27Sep2002-rr0119 | Environmental | Open in IMG/M |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
---|---|---|---|
JGIcombinedJ13530_1069335713 | 3300001213 | Wetland | MSIGFAIAIYTALVAFVSSTVLYYFKVMYPREEQQLKEKLK* |
B570J40625_1007614443 | 3300002835 | Freshwater | MSLGIAFAIYTTLVAFVSSIMIYYFRVMYPREEHQLKENSK* |
Ga0074648_10261263 | 3300005512 | Saline Water And Sediment | MSIALAIAIYTALIAFVSSLMVYYFKVMYPREEAQLREKSK* |
Ga0068876_1000871015 | 3300005527 | Freshwater Lake | MSIGLAFTIYTALVAFVSSIMIYYYKVMYPREENQLKEKSK* |
Ga0068876_100218073 | 3300005527 | Freshwater Lake | MSIGLGIAIYAALVAFVSSIMIYYYKVMYPTEEAQLKEKSK* |
Ga0078894_1004985311 | 3300005662 | Freshwater Lake | MSIALAFAIYSALVAVVSSLMIYYWKVMYPQQEAKLKERSK* |
Ga0079957_100410018 | 3300005805 | Lake | MTIGLGVVIYISLVAFVSSIMVYYFKVIYPREEAQIKENSK* |
Ga0079957_100470913 | 3300005805 | Lake | MSIGIGIAIYTALVAFVSSIMVYYFKVMYPREEAQLKEKSK* |
Ga0079957_10648738 | 3300005805 | Lake | MSLGIAFAIYTTLVAFVSSIMVYYFGVMYPREEHQLREKSK* |
Ga0075461_101293804 | 3300006637 | Aqueous | MSIALATLIYTALVAFVSSVMVYYFKVMYPREEAQLREKSK* |
Ga0070749_100536863 | 3300006802 | Aqueous | MSIALATLIYTALVAFVSSLMVYYFKVMYPREEAQLREKSK* |
Ga0070749_101109204 | 3300006802 | Aqueous | MSIALAIAIYTALVAFVSSVMVYYFKVMYPREEAQLKEKSK* |
Ga0070749_106759012 | 3300006802 | Aqueous | MSIALAIAIYSAIVAFVSSLMVYYFKVMYPREEAQLREKS |
Ga0099851_12289503 | 3300007538 | Aqueous | MSLFLGLAIYSALVAFISSLMIYYYKVMYPREEAKLKEKSK* |
Ga0099848_11000594 | 3300007541 | Aqueous | MTIALALSIYTALVAVVSSLMVYYFRVMYPQEEKQ |
Ga0099848_11130272 | 3300007541 | Aqueous | MSIALGIAIYTALVAFVSSLMVYYFRVMYPREEQQLRKNPND* |
Ga0099848_12535881 | 3300007541 | Aqueous | MSVGLGIAIYTALVAVVSSMMVYYFRVMYPREEQQLREKSK* |
Ga0099846_12842613 | 3300007542 | Aqueous | MSIALGLAIYSALVAFISSLMIYYYKVMYPREEAKLKEKSK* |
Ga0102873_12191292 | 3300007545 | Estuarine | MSLGIAFAIYTALIAFVSSIMIYYYKVMYPLEEAQLNEKSK* |
Ga0102920_11533594 | 3300007600 | Estuarine | PRESNSSNEMSLGIAFAIYTALIAFVSSIMIYYYKVMYPLEEAQLNEKSK* |
Ga0102902_10556915 | 3300007644 | Estuarine | MSLGIAFAIYTALIAFVSSIMIYYYKVMYPLEAAQLNEKSK* |
Ga0114876_10784072 | 3300008448 | Freshwater Lake | MSLALAFAIYTSLVAFVSSIMIYYLKVMYPREEAQLKEKSK* |
Ga0116204_10361875 | 3300010293 | Anoxic Lake Water | MSIGVGIAIYTALVAVVSSLMIYYFRVMYPREEAQLKEKSND* |
Ga0129333_1000231340 | 3300010354 | Freshwater To Marine Saline Gradient | MSIGLGVAIYAALVAFVSSTMIYYFKVMYPAEEAQLKEKSK* |
Ga0129333_100095628 | 3300010354 | Freshwater To Marine Saline Gradient | MSIAVALAIYTSLIAFVSSIMVYYYKMMYPREEAKLMEKSK* |
Ga0129333_1000982321 | 3300010354 | Freshwater To Marine Saline Gradient | MSIGLGAAIYAALVAFVSSIMVYYFRVMYPTEEAQLKEKSK* |
Ga0129333_100326322 | 3300010354 | Freshwater To Marine Saline Gradient | MSVGLGIAIYTALVAVVSSLMVYYFRVMYPREEKQLKEKSK* |
Ga0129333_100776065 | 3300010354 | Freshwater To Marine Saline Gradient | MSIALAIAIYTAIVAFVSSLMVYYIKVMYPREEAQLREKSK* |
Ga0129333_100963776 | 3300010354 | Freshwater To Marine Saline Gradient | MSIALGIAIYSALVAFISSIMVYYYRVMYPQEEAKLREKSK* |
Ga0129333_101188292 | 3300010354 | Freshwater To Marine Saline Gradient | MSVGLGIAIYTALVAVVSSLMVYYFRVMYPREEQQLKEKSK* |
Ga0129333_101402253 | 3300010354 | Freshwater To Marine Saline Gradient | MSIALGIAIYTTLVAVVASLMVYYFRVMYPREEQQLKEKSK* |
Ga0129333_101513026 | 3300010354 | Freshwater To Marine Saline Gradient | MSIGLGIAIYTALVAVISSLMVYYFRVMYPREEQQLKEKSK* |
Ga0129333_101767655 | 3300010354 | Freshwater To Marine Saline Gradient | MSIALGLAIYSALVAFISSLMIYYYKVMYPREEAKLKEKTK* |
Ga0129333_105479582 | 3300010354 | Freshwater To Marine Saline Gradient | MSVGLGIAIYTALVAVISSLMVYYFRVMYPREENQLKEKSK* |
Ga0129333_105790622 | 3300010354 | Freshwater To Marine Saline Gradient | MTIALAIAIYTALIAVVSSLMVYYFRVMYPQEEKQLKEKSK* |
Ga0129333_106448161 | 3300010354 | Freshwater To Marine Saline Gradient | LGIAIYSTLIAFISSIMVYYYRVMYPQEEAKLKEKSK* |
Ga0129333_112175714 | 3300010354 | Freshwater To Marine Saline Gradient | MSALVGLAIYSALVAFISSLMIYYYKVMYPREEAELKEKSK* |
Ga0129333_113806552 | 3300010354 | Freshwater To Marine Saline Gradient | MSIGLAIAIYTALVAVVSSLMVYYFRVMYPQEEKQLREKSK* |
Ga0129333_115131111 | 3300010354 | Freshwater To Marine Saline Gradient | MSLALGIAIYAALVASISSLMLYYYKVRYPQEEEKLKEKTK* |
Ga0129333_117460692 | 3300010354 | Freshwater To Marine Saline Gradient | MSLGIVFAIYTTLVAFVSSIMVYYFGVMYPREEHQLREKSK* |
Ga0129336_101624183 | 3300010370 | Freshwater To Marine Saline Gradient | MSIALAIAIYTAIVAFVSSLMVYYFKVMYPREEAQLREKSK* |
Ga0129336_105064512 | 3300010370 | Freshwater To Marine Saline Gradient | MSIGLGVAIYAALVAFVSSIMVYYFRVMCPTEEAQLKEKSK* |
Ga0136549_1000266625 | 3300010389 | Marine Methane Seep Sediment | MTITVALLIYSGLVALVSSLMVYYFKVIRPKEEAQYK* |
Ga0119931_10107442 | 3300011984 | Drinking Water Treatment Plant | MSIALGITIYTSLVAVVASLMVYYFRVMYPREEQQLKEKSK* |
Ga0163212_11180581 | 3300013087 | Freshwater | MSIGLGIAIYTALVAVISSLMVYYFGVIYPREEQELRSKK* |
(restricted) Ga0172367_100252635 | 3300013126 | Freshwater | MSIGVGIAIYTALVAVISSLMVYYFRVIYPREEQQLKEKSND* |
Ga0194113_100041529 | 3300020074 | Freshwater Lake | VSIGLGIAIYAALVAFVSSIMAYYLKVMYPREEQQLKEKSK |
Ga0194113_100992822 | 3300020074 | Freshwater Lake | MSIGLGIAIYTAMVAVVSSLMVYYFGVMYPREEQELKSKK |
Ga0194113_104085973 | 3300020074 | Freshwater Lake | MSIGVGIAIYTALVAVISSLMVYYFRVIYPREEQQLKEKSNDRLQ |
Ga0194113_104330633 | 3300020074 | Freshwater Lake | MTIELAIGIYVALVAFVSSIMVYYFKVMYSREEAQLKENSK |
Ga0194113_108036393 | 3300020074 | Freshwater Lake | MSIGVGIAIYTALVAVISSLMVYYFRVIYPREEQQLKEKSND |
Ga0194111_100432132 | 3300020083 | Freshwater Lake | MSIGLGIAIYTAMVAVVSSLMVYYFGVMYPREEQELRSKK |
Ga0194111_100918902 | 3300020083 | Freshwater Lake | MSIGLGIAIYTAILAFVSSLMGYYFKVMYPREEQELKSKK |
Ga0194112_108166181 | 3300020109 | Freshwater Lake | MSIGLGITIYAALVAVISSLMVYYFRVMYPRDEQQLKEKSND |
Ga0211736_100704492 | 3300020151 | Freshwater | MSIGLGVAIYATLVAFVSSIMVYYFKVIYPTEEAQLKEKSK |
Ga0211736_103923552 | 3300020151 | Freshwater | MSIGLGVAIYAALVAFVSSIMVYYFKVMYPTEEAQLEEKSK |
Ga0211736_106916162 | 3300020151 | Freshwater | MSIGVAFAIYTSLVAFVSSIIIYYFKVMYPTEEAQLKEKSK |
Ga0211726_100939119 | 3300020161 | Freshwater | MTVGLAFAIYTALVAFVSSIMLYYLKVMYPREENQLKEKSK |
Ga0211729_103989294 | 3300020172 | Freshwater | MSIGVAFAIYTSLVAFVSSIMIYYFKVMYPTEEAQLKEKSK |
Ga0194134_100208834 | 3300020179 | Freshwater Lake | MSIGLGVVIYAALVAVVSSLMVYYFRIMYPRDEQQLKEKSNDRLQ |
Ga0194115_100286955 | 3300020183 | Freshwater Lake | MSIGVGIAIYAALVAGVSSLMVYYFRVIYPRDEQQLKEKSNDRLQ |
Ga0194115_102279141 | 3300020183 | Freshwater Lake | MSIGVGIAIYAALVAGVSSLMVYYFRVIYPRDEQQLKEKS |
Ga0194115_103316993 | 3300020183 | Freshwater Lake | IGVGIAIYTALVAVISSLMVYYFRVIYPREEQQLKEKSNDQLQ |
Ga0194118_1001525820 | 3300020190 | Freshwater Lake | MSIGVGIAIYAALVAGVSSLMVYYFRVIYPRDEQQLK |
Ga0194118_100309282 | 3300020190 | Freshwater Lake | MSIGLGITIYAALVAVISSLMVYYFRVMYPRDEQQLKEKSNDRLQ |
Ga0194118_105214551 | 3300020190 | Freshwater Lake | IGVGIAIYTALVAVISSLMVYYFRVIYPREEQQLKEKSNDRLQ |
Ga0194118_105248723 | 3300020190 | Freshwater Lake | MSIGVGIAIYTALVAVISSLMVYYFRVIYPREEQQLK |
Ga0194116_100325145 | 3300020204 | Freshwater Lake | VSIGLGIAIYSALVAFVSSIMVYYLKVMYPREEQQLKENSK |
Ga0194116_103780842 | 3300020204 | Freshwater Lake | MSIGVGIAIYTALVAVISSLMIYYFRVIYPREEQQLKEKSND |
Ga0194116_105213991 | 3300020204 | Freshwater Lake | MSIGLGIAIYTALVAVISSLMVYYFGVMYPREEQELRSKK |
Ga0194132_101616221 | 3300020214 | Freshwater Lake | VSIGLGIAIYSALVAFVSSIMVYYLKVMYPREEQQ |
Ga0194132_102613734 | 3300020214 | Freshwater Lake | MSIGVGIAIYAALVAGVSSLMVYYFRVIYPREEQQLKEKSNDRLQ |
Ga0194127_108313643 | 3300020221 | Freshwater Lake | VSIGLGIAIYAALVAFVSSIMAYYLKVMYPREEQQL |
Ga0210295_11075394 | 3300021323 | Estuarine | MSLGIAFAIYTALIAFVSSIMIYYYKVMYPLEEAQLNEKSK |
Ga0194130_103081881 | 3300021376 | Freshwater Lake | MSIELGIAIYTAIVVVISSLMVYYFRVMYPREEAQLKEKSK |
Ga0222716_102406494 | 3300021959 | Estuarine Water | MSIALATLIYTALVAFVSSLMVYYFKVMYPREEAQLREKSK |
Ga0222715_1000645313 | 3300021960 | Estuarine Water | MSIALAIAIYTALVAFVSSLMVYYFKVMYPREEAQLKEKSK |
Ga0222715_1001033124 | 3300021960 | Estuarine Water | MSIALAIAIYTALVAFVSSLMVYYFKVMYPREEAQLREKSK |
Ga0222714_1000341145 | 3300021961 | Estuarine Water | MTLGLAIAIYTALVAFVSSIMLYYLKVMYPREEAQLKEKSK |
Ga0222714_1000782822 | 3300021961 | Estuarine Water | MSIGFAIAIYTALVAFVSSTVLYYFKVMYPREEQQLKEKLK |
Ga0222714_1002680211 | 3300021961 | Estuarine Water | MSVGLGIAIYTSLVAVVSSLMVYYFRVMYPREEKQLKEKSK |
Ga0222714_1008132310 | 3300021961 | Estuarine Water | MSIGFAITIYTTLVAFVSSIMVYYLKVMYPREEHQLKENSK |
Ga0222714_101023751 | 3300021961 | Estuarine Water | MSITLGIAIYSAIVAFVSSLMVYYFKVMYPREEAQLKEKSK |
Ga0222713_101033627 | 3300021962 | Estuarine Water | MSLGIAFAIYSSLVAFVSSIMVYYLKVMYPREEQQLKEKLK |
Ga0222713_105778901 | 3300021962 | Estuarine Water | LFLGLAIYSALVAFISSLMIYYYKVMYPREEAKLKEKSK |
Ga0222712_1000038667 | 3300021963 | Estuarine Water | MSIGLAFAIYTTLVAFVSTIMLYYLKVIYPREEAQLKEKSK |
Ga0196905_11716882 | 3300022198 | Aqueous | MSIALAIAIYTAIVAFVSSLMVYYIKVMYPREEAQLREKSK |
Ga0196905_11884563 | 3300022198 | Aqueous | MSIGLGIAIYSALVAFISSLMIYYYKVMYPREEAKLREKSK |
Ga0196901_10702611 | 3300022200 | Aqueous | MSLFLGLAIYSALVAFISSLMIYYYKVMYPREEAKLKEKSK |
Ga0255147_100001239 | 3300024289 | Freshwater | MSIVLGVAIYAALVAFVSSIMLYYLKVMYSREEAKLKEKSK |
Ga0255222_10641222 | 3300024487 | Freshwater | MSLVIAFVIYTTLVAFVSSIMLYYLKVMYPREEAQLKEKSK |
Ga0208048_10784872 | 3300025283 | Freshwater | MSIGLGIAIYTALVAVISSLMVYYFGVIYPREEQELRSKK |
Ga0208160_11676914 | 3300025647 | Aqueous | IGLGVAIYAALVAFVSSTMIYYFKVMYPAEEAQLKEKSK |
Ga0208547_11636943 | 3300025828 | Aqueous | MSIALAIAIYTALVAFVSSVMVYYFKVMYPREEAQLKEKSK |
(restricted) Ga0233417_102147582 | 3300028043 | Sediment | MSIGLAIAIYTALVAVVSSLMIYYFCVMRPNEEKQSND |
Ga0119944_100148012 | 3300029930 | Aquatic | MSIGIGIVIYTALVAFVSSIMVYYFKVMYPREEAQLKEKSK |
Ga0119944_10057184 | 3300029930 | Aquatic | MSIALGIAIYTAIVAVISSLMVYYFRVMYPREEAQLKEKSK |
Ga0119944_10332132 | 3300029930 | Aquatic | MSIGLAIATYTALVAVISSLMIYYFRVMYPREEAQLKKEQSND |
Ga0119944_10402472 | 3300029930 | Aquatic | MSIGIGIAIYTALVAFVSSIMVYYFKVMYPREEAQLKENSK |
Ga0119944_10456872 | 3300029930 | Aquatic | MSIALGVAIYSVLVAFVSSVIVYYFKVMYPHEETQLKENSK |
Ga0119945_10066684 | 3300029933 | Aquatic | MSIALGIAIYTVIVAVISSLMVYYFRVMYPREEAQLKEKSK |
Ga0315909_102136231 | 3300031857 | Freshwater | FLGLAIYSALVAFISSLMIYYYKVMYPREEAKLKEKSK |
Ga0315909_102904372 | 3300031857 | Freshwater | MSLALAFAIYTSLVAFVSSIMIYYLKVMYPREEAQLKEKSK |
Ga0315909_108112112 | 3300031857 | Freshwater | MSIAVALAIYTSLIAFVSSIMVYYYKMMYPREEAKLMEKSK |
Ga0316625_1013209652 | 3300033418 | Soil | MSIAFAFAIYYALVAVVSSLMIYYWKVMYPQQEAKLKERSK |
Ga0316616_1029734581 | 3300033521 | Soil | MSIALAFAIYSALVAVVSSLMIYYWKVMYPQQEAKLRERSK |
Ga0335019_0000126_1194_1319 | 3300034066 | Freshwater | MSLGIAFAIYTTLVAFVSSIMIYYFKVMYPTEEAQLKEKSK |
Ga0335030_0208277_1244_1357 | 3300034103 | Freshwater | MSIGLGVAIYAALVAFVSSIMVYYFKVIYPTEEAQLKE |
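
The rows of the sequence table are pipe-delimited, and some sequences carry a trailing `*`. The sketch below shows one way to split a row into fields and compute residue lengths in Python. It assumes the trailing `*` marks the translated stop codon rather than a residue, which the page itself does not state; the function name `parse_row` is hypothetical.

```python
def parse_row(row: str) -> dict:
    """Split one 'Protein ID | Sample Taxon ID | Habitat | Sequence' row into fields."""
    protein_id, taxon_id, habitat, sequence = [field.strip() for field in row.split("|")]
    return {
        "protein_id": protein_id,
        "taxon_id": taxon_id,
        "habitat": habitat,
        # Assumption: a trailing '*' is a stop-codon marker, so it is not counted as a residue.
        "sequence": sequence.rstrip("*"),
    }


# Three rows copied verbatim from the table above.
rows = [
    "Ga0222715_1000645313 | 3300021960 | Estuarine Water | MSIALAIAIYTALVAFVSSLMVYYFKVMYPREEAQLKEKSK",
    "Ga0070749_101109204 | 3300006802 | Aqueous | MSIALAIAIYTALVAFVSSVMVYYFKVMYPREEAQLKEKSK*",
    "Ga0099848_11000594 | 3300007541 | Aqueous | MTIALALSIYTALVAVVSSLMVYYFRVMYPQEEKQ",
]

records = [parse_row(row) for row in rows]
lengths = [len(record["sequence"]) for record in records]
print(lengths)
```

Rows flagged `(restricted)` keep that prefix in the `protein_id` field under this parsing; a caller that needs the bare identifier would have to strip it separately.
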