Basic Information | |
---|---|
Family ID | F084884 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 112 |
Average Sequence Length | 46 residues |
Representative Sequence | VHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGRFLA |
Number of Associated Samples | 74 |
Number of Associated Scaffolds | 112 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Bacteria |
% of genes with valid RBS motifs | 10.38 % |
% of genes near scaffold ends (potentially truncated) | 85.71 % |
% of genes from short scaffolds (< 2000 bps) | 85.71 % |
Associated GOLD sequencing projects | 73 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.42 |
Hidden Markov Model |
---|
|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Bacteria (88.393 % of family members) |
NCBI Taxonomy ID | 2 |
Taxonomy | All Organisms → cellular organisms → Bacteria |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Aquatic → Freshwater → Lake → Sediment → Microbial Mat (16.071 % of family members) |
Environment Ontology (ENVO) | Unclassified (19.643 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Unclassified (30.357 % of family members) |
⦗Top⦘ |
Full Alignment |
---|
Alignment of all the sequences in the family. |
IDLabel .2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78 |
Powered by MSAViewer |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 28.17% β-sheet: 0.00% Coil/Unstructured: 71.83% |
Feature Viewer | |||||
Position : 0 Zoom : x 1 Enter the variants Position Original Variant |
|||||
Powered by Feature Viewer |
Structure Viewer | |
---|---|
| |
Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.42 |
Powered by PDBe Molstar |
⦗Top⦘ |
⦗Top⦘ |
Visualization |
---|
All Organisms Unclassified |
Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
⦗Top⦘ |
Visualization |
---|
Freshwater Sediment Wetland Groundwater Sediment Freshwater Freshwater Lake Sediment Freshwater Lake Sediment Microbial Mat Wetland Sediment Freshwater Wetlands Groundwater Sinkhole Freshwater Groundwater Sinkhole Polar Desert Sand Freshwater Freshwater Marine Sediment Estuarine Salt Marsh Sediment Marine Estuarine Natural And Restored Wetlands Wetland Sediment Saline Water Soil Sediment (Intertidal) Groundwater Sediment Mangrove Sediment Soil Untreated Peat Soil Natural And Restored Wetlands Rice Paddy Soil Fen Soil Fen Activated Sludge Active Sludge Wastewater Effluent Anaerobic Enrichment Culture |
Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Geographical Distribution | |
---|---|
|
|
Zoom: | Powered by OpenStreetMap |
⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
TB_GS10_10DRAFT_100560662 | 3300000230 | Groundwater | VHPTLGILARFQAFFYASAFFQLDGVPPPTPARVTQTVGTLLDVIGIQ |
TB_FS06_10DRAFT_100330714 | 3300000233 | Groundwater | MHPTLGSLARFQAFFYASAFFQSDGVPPPDPARVTQTVGRLNT* |
TB_FS06_10DRAFT_10059657 | 3300000233 | Groundwater | QSVHPTLGILARFQAFFYALSFSQSDGVPPPAPARVTQTVGTPLA* |
TB_FS06_10DRAFT_10438591 | 3300000233 | Groundwater | QSVHPTLGILARFQAFFYASAFFQSDGVPPPAPAQVTQTVGRFRAKHTK* |
TB_FS08_3DRAFT_10313981 | 3300000236 | Groundwater | QSVHPTLGILARFQAFFYASAFSQSDGVPPPDPARVTQTIGTTRA* |
TB_FS08_3DRAFT_10619722 | 3300000236 | Groundwater | PALGILARFKAFFYASAFFQSDGVPPPAPARVTQTVGTTLAKLKFQ* |
YBMDRAFT_101502361 | 3300001373 | Marine Estuarine | QNAQQSVHPTLGILARFQAFFYASAFFQSDGVPPPDPARVTQTVGR* |
MIS_10579301 | 3300002024 | Sinkhole Freshwater | RFQAFFYASAFFQSDGVPPPAPARVTQTVGTPLAQQGK* |
MIS_11436591 | 3300002024 | Sinkhole Freshwater | SVHPTLGILARFQAFSYASAFSQSDGVPPPDPARVTQTVGTPLA* |
MIS_11520271 | 3300002024 | Sinkhole Freshwater | TLGILARFQAFFYASAFSQLDGVPPPNPARVTQTVGTPLA* |
MIS_11663291 | 3300002024 | Sinkhole Freshwater | FQAFFYASAFFQSDGVPPPAPARVTQTVSHPLPYQG* |
MIS_100945111 | 3300002027 | Sinkhole Freshwater | QQSVHPTLGIRRHFQAFSYASTFFQSDGVPPPAPARVTQTVSRFV* |
MIS_101607191 | 3300002027 | Sinkhole Freshwater | QSVHPTLGILARFQAFFYTSAFSQSDGVPPPDPARVTQTVGRWRMQQNLFENL* |
MIS_101729251 | 3300002027 | Sinkhole Freshwater | QSVHPTLGIRRHFQAFFYASAFSQSDGVPPPAPARVTQTVGRLVSKG* |
Ga0055440_101540462 | 3300004020 | Natural And Restored Wetlands | LGILRKSQAVFYAFSFFQSDGFAVPAPAQVTQTGRHLRVTQIP* |
Ga0066649_102328371 | 3300004209 | Groundwater | PTLGILARFQAFFYASAFSQSDGDPPPAPARVTQTVIGA* |
Ga0062380_104973602 | 3300004779 | Wetland Sediment | PTLGILARFQAFFYASAFFQSDGVPPPAPARVTQTVSPILSVDA* |
Ga0071116_10370953 | 3300005077 | Sinkhole | MRRKINFFTTASQQSVHLTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGWLIVIQ |
Ga0071116_11198282 | 3300005077 | Sinkhole | VHPTLGILARFQAFFYALSFFQSDGVPPPAPARVTQTVGRWRYEGQKLK* |
Ga0071116_12655372 | 3300005077 | Sinkhole | KACQQSVHPTLGSLARFQAFFYASAFFQLDGVPPPAPARVTQTVGRFLAQPK* |
Ga0074473_103569542 | 3300005830 | Sediment (Intertidal) | LIAKVAAQHSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVR |
Ga0074471_106558695 | 3300005831 | Sediment (Intertidal) | VHPTLGILARFQAFFYASAFFQSDGVPPPTPARVTQTVGRYVRNE* |
Ga0074471_106559882 | 3300005831 | Sediment (Intertidal) | VHPTLGILARFQAFFYASAFFQSDGVPPPTPARVTQTVGQFLANNEV* |
Ga0075277_10049952 | 3300005895 | Rice Paddy Soil | VHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTV |
Ga0075272_10643642 | 3300005900 | Rice Paddy Soil | AQQSVHPTLGSLATFQAVFYAFSFSQSDGFAVPAPARVTQTVGR* |
Ga0075280_100050981 | 3300005904 | Rice Paddy Soil | VHPTLGSLRDLQAFFYASAFSQSDGVPPPAPARVTQTVGRWR |
Ga0075156_104433771 | 3300005982 | Wastewater Effluent | HPTLGILARFQAFFYASAFFQSDGVPSPAPARVTQTVSRLS* |
Ga0105105_102075871 | 3300009009 | Freshwater Sediment | SVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVSPPFEKQVSDKKW* |
Ga0105105_102750571 | 3300009009 | Freshwater Sediment | LARFQAFFYTSAFFQSDGVPPPAPARVTQTVGWLVINREK* |
Ga0105105_102763411 | 3300009009 | Freshwater Sediment | QSVHPTLGILARFQAFFYASAFFQSDGVPPPAPARVTQSVGLLAQI* |
Ga0105105_105811463 | 3300009009 | Freshwater Sediment | MLGSLARFQAFFYALSFFQSDGVPPPAPARVTQTVSLPLA* |
Ga0105048_113083521 | 3300009032 | Freshwater | LRSKKPTQVTTAQQSVHPTLGILRGLQAFFYASVFFRSDGAPPPAPARVTQTVGQF |
Ga0105091_101259091 | 3300009146 | Freshwater Sediment | MFSAAQQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTV |
Ga0113563_110072303 | 3300009167 | Freshwater Wetlands | ILARFQAFFYASAFSQSDGVPPPAPAQVTQTVGRFTAKHNHDS* |
Ga0115028_102789362 | 3300009179 | Wetland | NKTAAQQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVSRLSKIEMK* |
Ga0123573_121713862 | 3300009509 | Mangrove Sediment | MAAQHRVHPTLGIRSVLQALSNALAFSNADGVPPPAPAQVTQTV |
Ga0137331_10413721 | 3300011391 | Soil | VHPTLGSLARFQAFFYASAFSQSDGVPPPDPARVTQTVGWQVKE |
Ga0137460_10273913 | 3300011408 | Soil | PTLGILARFQAFFYASAFFQSDGVPPPAPARVTQTVGKNLA* |
Ga0137448_11488221 | 3300011427 | Soil | SYPKTATQQSVHPTLGILRQSQAFFHASAFSQSDGVPPPDPARVTPTVSHLAEN* |
Ga0136636_102467491 | 3300012044 | Polar Desert Sand | LGSLATSQAFFYALSFFRSDGVPPPAPARVTQTVGQFIGE* |
Ga0138256_104869931 | 3300012533 | Active Sludge | VHPTLGILAKSQAFFYALSFFQSDGVPPPDPARVTQTVGRLEKEEDYH |
Ga0138256_106475822 | 3300012533 | Active Sludge | VHPTLGILARFQAFFYASAFFQSDGIPPPAPARVTQTVSHLPCKNNYE* |
Ga0138256_108847301 | 3300012533 | Active Sludge | ILARFQAFFYASAFFQSDGVPPPAPARVTQTVGRLSDRIMK* |
Ga0138256_109407551 | 3300012533 | Active Sludge | QSMHLTLGILRQSQAVFYASSFFQLDGFAVPTPARVTQTVGRLTQLD* |
Ga0163199_14046831 | 3300013092 | Freshwater | QQSMHPTLGSLARFQAFFYASAFFQSDGVPPPDPARVTQTVGQFLEK* |
(restricted) Ga0172367_1000329619 | 3300013126 | Freshwater | MTLHTAEQSAHPTLGILAQFQAFFQSDGVPPPAPARVMQAISQPATIETI* |
(restricted) Ga0172373_103174651 | 3300013131 | Freshwater | MSLLLPQKSGWQKRAQQSVHPTLGILARFQAFFYASAFSQSDGVPPPDPARVTQTV |
Ga0075315_10018502 | 3300014258 | Natural And Restored Wetlands | QQSVHPTLGSLARFQAFFYASAFFQSDGVPPPAPARVTQTVGRVS* |
Ga0075360_11132382 | 3300014261 | Natural And Restored Wetlands | VHPTLGSLARFQAVFYALSFFQLDGFAVPAPARVTQTVG |
Ga0182021_112465261 | 3300014502 | Fen | MHPTLGILARFQAFSYASAFFQSDGVPPPAPARVTQTV |
Ga0180074_10267631 | 3300014877 | Soil | MKAAQQSVQWICGILPHFQAFFYASAFSQSDGVPPPAPARVTQTVGQIDR* |
Ga0180085_10999442 | 3300015259 | Soil | VHLTPGSLRRFQAFFYASAFFQLDGFAVPAPAQVRQTVGRL* |
Ga0184615_104776391 | 3300018059 | Groundwater Sediment | EYTPQNAAQQSVHPTLGILARFQAFLYASAFFQLGGVPPPNPARVTQTVGQFQECK |
Ga0207193_14203443 | 3300020048 | Freshwater Lake Sediment | PTLGILARFQAFFYALSFSQSDGVPPPAPARVTQTVGTPLAQ |
Ga0194120_100427413 | 3300020198 | Freshwater Lake | VHPTLGILARFQAFFYASAFFQSDGVPPPAPAQVTQTVGTPLAQQGKKH |
Ga0210379_102782872 | 3300021081 | Groundwater Sediment | TTAAQQSVHPTLGILARFQAFFYASEFFQSDGVPPPTPARVTQTVSRLLHN |
Ga0210377_101745003 | 3300021090 | Groundwater Sediment | QQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVSWQIHSL |
Ga0210377_103926801 | 3300021090 | Groundwater Sediment | LARFQAFFYASAFSQLDGVPPPAPARVTQTVGQFKTITAIKV |
Ga0210377_105140531 | 3300021090 | Groundwater Sediment | LASLVVIGFAQQSVHPTLGILARFQAFFYASAFFQLDGVPPPAPARVTQTVGL |
Ga0210365_106761852 | 3300021351 | Estuarine | VHPTLGILARFQAFFYALSFSQSDGGTPSAPARLTHTVSPQPRRLLW |
Ga0224495_100500254 | 3300022208 | Sediment | ACQQSVHPTLGILARFQAFFYASAFFQSDSVPPPTPARVTQTVRRVRQIVLL |
Ga0209751_108438432 | 3300025327 | Soil | QSVHPTLGILRHFQAFSYTSAFFQSDGVPPPTPARVTQTVGQLEKL |
Ga0210129_10356801 | 3300026026 | Natural And Restored Wetlands | VKQKASQQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGRLSSVHSA |
Ga0209285_101400391 | 3300027726 | Freshwater Sediment | LGILARFQAFFYASAFSQLDGFAVPAPARVTQTVRRVMVQ |
Ga0209575_100320252 | 3300027739 | Freshwater | MHPTLGILARFQAFFYASAFFQSDGVPPPAPARVTQAVGLLS |
Ga0209288_100634251 | 3300027762 | Freshwater Sediment | ILARFQAFFYASAFSQSDGVPPPAPARVTQTVGQQTFE |
Ga0208980_103205482 | 3300027887 | Wetland | VHPTLGILARFQAFFYASAFSQSDGFAVPAPARVTQTVSPFFANIEIEGKKSYAE |
Ga0209536_1003033781 | 3300027917 | Marine Sediment | GILRRFQAFFYASAFSQLDGFAVPAPAQVTQAVRQLIWMRYG |
Ga0265593_10620211 | 3300028178 | Saline Water | QSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGRLSSME |
Ga0268283_11246512 | 3300028283 | Saline Water | QNRLAKTAQQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVSRLS |
(restricted) Ga0247844_13333511 | 3300028571 | Freshwater | AQQSVHPTLGILARFQAFFYASAFFQSDGVPPPAPARVTQTVRWLSQ |
Ga0302159_100662421 | 3300028646 | Fen | PTLGSLARFQAFFHAFSFSQSDGVPPPAPARVTQTVGTPLAK |
Ga0272412_11562881 | 3300028647 | Activated Sludge | QQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGKNLR |
Ga0268298_101218541 | 3300028804 | Activated Sludge | VKNNAAQQSVHPTLGILARFQAFFYASAFFQSDGVPPPAPARVTQTV |
Ga0268298_103160583 | 3300028804 | Activated Sludge | LGILRKPQAVSHAFSFFWLDGFAVPAPAQVTQTVRRLSSRVL |
Ga0310376_10355641 | 3300028916 | Anaerobic Enrichment Culture | AQQSVHPTLGILARFQAFFYTSAFSQSDGVPPPAPARVTQTVSPLFA |
(restricted) Ga0247842_106651372 | 3300029268 | Freshwater | VHPTLGILARFQAFFYALSFSQSDGIPPPAPARVTQTVGWLSKNINKGLFDE |
Ga0302293_102796582 | 3300029981 | Fen | TPMRININAAEQSVHPTLGILARFQAFFYALSFSQSDGVPPPAPARVTQTVGRLS |
Ga0311348_103525173 | 3300030019 | Fen | LFKPAAQQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGQSNQ |
Ga0311348_106419882 | 3300030019 | Fen | LGILARFQAFFYASAFFQLDGVPPPAPARVTQTVERTPVLIAN |
Ga0302285_102789091 | 3300030685 | Fen | LARFQAFFYTSAFFQSDGVPPPAPARVTQTVGQFLAKQ |
Ga0311335_108004941 | 3300030838 | Fen | QNCAQQSVHPTLGILCKSQAVFYASAFFQLDGVLPPNPARVTQTVETVE |
Ga0302323_1004827463 | 3300031232 | Fen | QSVHPTLGILRHFQAVSYALSFFKSDGVPPSAPARVTQTVGWQFKSNQVPK |
Ga0315556_10738461 | 3300031256 | Salt Marsh Sediment | MIKTAQQSVHPTLGILARFQAFFHASAFSQSDGVPPPAPARVTQTVGRTAE |
Ga0302321_1000463471 | 3300031726 | Fen | QQSVHPTLGILARFQAFFYASSFFQSDGVPPPAPARVTQTVSPRVYKRITNK |
Ga0302321_1024571732 | 3300031726 | Fen | AAQQSVHPTLGILARFQAFFYASAFFQSDGFAVPAPARVTQTVRRFLVN |
Ga0302321_1027438681 | 3300031726 | Fen | QQSVHWTLGILPHFQAFFYASAFSQSDGVPPPAPAPVTQTVSPLQQ |
Ga0302322_1008189791 | 3300031902 | Fen | HPTLGILARFQAVFYASAFSRSDGVPPPAPARVTQTVGTPLAK |
Ga0315268_10000066110 | 3300032173 | Sediment | MMNLSSEAQQSVHPTLGILARFQAFFYASAFFQSDGVPPPAPARVTQTVGR |
Ga0316608_10442501 | 3300033420 | Microbial Mat | WQKASQQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGLPLMKYL |
Ga0316608_10742321 | 3300033420 | Microbial Mat | LGILARFQAFFYALSFSQSDGGTPSNPARVTQTFSPPLA |
Ga0316608_10746243 | 3300033420 | Microbial Mat | LGILARFQAFFYALSFFQSDGVPPPAPARVTQTVRQIFGEIDL |
Ga0316608_10966973 | 3300033420 | Microbial Mat | PTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGRFLA |
Ga0316611_10204071 | 3300033446 | Microbial Mat | PTLGILARFQAFFYASAFFQSDGVPPPAPARVTQTVGTPLAKLVKSKRD |
Ga0316611_10623992 | 3300033446 | Microbial Mat | TLGILARFQAFFYALSFSQSDGVPPPAPARVTQTVVPLL |
Ga0316611_10631873 | 3300033446 | Microbial Mat | VHPTLGILARFQAFFYASAFFQSDGVPPPAPARVTQAVGHFLAE |
Ga0316611_10673143 | 3300033446 | Microbial Mat | LGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGRFLAQPK |
Ga0316611_10853731 | 3300033446 | Microbial Mat | LGILARFQAFFYALSFSQSDGVPPPDPARVTQTVGQFLAKHH |
Ga0316611_10865861 | 3300033446 | Microbial Mat | GILARFQAFFYASAFFQSDGVPPPAPARVTQTVSP |
Ga0316611_11115542 | 3300033446 | Microbial Mat | VHPTLGILARFQAFFYASTFSQSDGVPPPTPARVTQTVRQIFGEIDL |
Ga0316621_107757252 | 3300033488 | Soil | KTAQQSVHPTLGIRRHFQAVSYASAFFQSDGVPPPAPARVTPTVGRTNAFA |
Ga0316621_113205681 | 3300033488 | Soil | AAQQSVHPTLVILARFQAFFYASAFSQADSVPPPAPAQVTQTVGQPPTQILN |
Ga0316612_10918621 | 3300033494 | Microbial Mat | QSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGQLLAK |
Ga0316610_10240561 | 3300033498 | Microbial Mat | LGILARFQAFFYALAFFQSDGVPPPAPARVTQTVRPPLAKLVFYNK |
Ga0316610_10323571 | 3300033498 | Microbial Mat | VHPTLGILARFQAFFYASAFPQSDGVPPPDPARVTQTV |
Ga0316610_10508431 | 3300033498 | Microbial Mat | LARFQAFFYALSFFQLDGVPPPAPARVTQTVRQIFGEIDL |
Ga0316610_10749951 | 3300033498 | Microbial Mat | TLGILARFQAFFYASAFFQSDGVPPPAPARVTQTVSP |
Ga0316610_10752523 | 3300033498 | Microbial Mat | VHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGRFLA |
Ga0316610_11149061 | 3300033498 | Microbial Mat | LARFQAFFYALAFFQSDGVPPPDPARVTQTVSPPLAKGGIK |
Ga0316616_1014716781 | 3300033521 | Soil | LGILARFQAFFYASAFFQSDGVPPPAPARVTQTVSRLSKIEMK |
Ga0370502_0059669_282_449 | 3300034156 | Untreated Peat Soil | MQNAAQQSVHPTLGILARFQAFFYASAFSQSDGVPPPAPARVTQTVGWLLAKLGK |
Ga0370499_0120787_510_665 | 3300034194 | Untreated Peat Soil | MKEYAAQHSVHPTLGILARFQAFFYASAFFQFDGVPPPAPARVTQTVGQFM |
⦗Top⦘ |