Basic Information | |
---|---|
Family ID | F101565 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 102 |
Average Sequence Length | 38 residues |
Representative Sequence | GMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Number of Associated Samples | 85 |
Number of Associated Scaffolds | 102 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Bacteria |
% of genes with valid RBS motifs | 4.90 % |
% of genes near scaffold ends (potentially truncated) | 94.12 % |
% of genes from short scaffolds (< 2000 bps) | 83.33 % |
Associated GOLD sequencing projects | 83 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.56 |
Hidden Markov Model |
---|
|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Bacteria (71.569 % of family members) |
NCBI Taxonomy ID | 2 |
Taxonomy | All Organisms → cellular organisms → Bacteria |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (21.569 % of family members) |
Environment Ontology (ENVO) | Unclassified (22.549 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (47.059 % of family members) |
⦗Top⦘ |
Full Alignment |
---|
Alignment of all the sequences in the family. |
IDLabel .2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52 |
Powered by MSAViewer |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 29.23% β-sheet: 0.00% Coil/Unstructured: 70.77% |
Feature Viewer | |||||
Position : 0 Zoom : x 1 Enter the variants Position Original Variant |
|||||
Powered by Feature Viewer |
Structure Viewer | |
---|---|
| |
Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.56 |
Powered by PDBe Molstar |
⦗Top⦘ |
⦗Top⦘ |
Visualization |
---|
All Organisms Unclassified |
Powered by ApexCharts |
⦗Top⦘ |
Visualization |
---|
Freshwater Sediment Soil Watersheds Soil Vadose Zone Soil Terrestrial Soil Tropical Forest Soil Bulk Soil Grasslands Soil Surface Soil Agricultural Soil Soil Grasslands Soil Soil Soil Tropical Forest Soil Forest Soil Soil Corn, Switchgrass And Miscanthus Rhizosphere Biofilm Avena Fatua Rhizosphere Miscanthus Rhizosphere Switchgrass Rhizosphere Miscanthus Rhizosphere Populus Rhizosphere Switchgrass Rhizosphere Miscanthus Rhizosphere Rhizosphere Miscanthus Rhizosphere Avena Fatua Rhizosphere |
Powered by ApexCharts |
⦗Top⦘ |
Protein ID | Sample Taxon ID | Habitat | Sequence |
JGI11643J12802_101549004 | 3300000890 | Soil | KGLNQVIRVMAEAGQLHAPLPAAERFVDLQYLRAAGLMK* |
Ga0070698_1001549281 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | LKGLSQVIQFMGETGDLKPPLASAERFVDLQYLRAAGLQ* |
Ga0070698_1016803841 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | RAVARCVIPLLPRETGDLKPPLPPAERFVDLQYLRAAGIQ* |
Ga0070734_108073481 | 3300005533 | Surface Soil | DFAGLKQVIAFMAEARLIAPPLPLPERFVDLQYLRAAAIE* |
Ga0066703_103273561 | 3300005568 | Soil | CMIPLLPRETGDLKPPLPPAERFVDLQYLRAAGIQ* |
Ga0066903_1004574531 | 3300005764 | Tropical Forest Soil | DLKGFKTAIEFMGEAGVLKAPLPPPERFVDLQYLRAAGVQ* |
Ga0066903_1018404882 | 3300005764 | Tropical Forest Soil | TAIEFMGEAEVLKAPLPPPERFVDLQYLRAAGLQ* |
Ga0066903_1077216683 | 3300005764 | Tropical Forest Soil | GFKTAIEFMGEAGVLKAPLPPPERFVDLQYLRAAGLQ* |
Ga0075026_1006617411 | 3300006057 | Watersheds | EVIALMAEAGMLKAPLPVAERFVDLQYLRAAGLQ* |
Ga0070765_1006014881 | 3300006176 | Soil | AVIALLGRTGELKAPLPAAERFVDLQYLKAAGLQ* |
Ga0075422_105510441 | 3300006196 | Populus Rhizosphere | EQVIAMMGEGGILAPPLPPADRFVDLQFLQAAGIQ* |
Ga0079220_118246291 | 3300006806 | Agricultural Soil | QAIALMGEGGILKGPLPRAERFVDLQYLQAAGVQ* |
Ga0075431_1013482821 | 3300006847 | Populus Rhizosphere | PKGLAQVIAFMADAGHLQPPLPDAERFVDLGYLARAGVR* |
Ga0075429_1017913522 | 3300006880 | Populus Rhizosphere | PKGVAQVIAFMADAGQIQPPLPHAERFVDLSYLARAGVR* |
Ga0075436_1000971121 | 3300006914 | Populus Rhizosphere | GMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ* |
Ga0075419_103005801 | 3300006969 | Populus Rhizosphere | GFNRVIQIMAEAGEVKPPLPSAERFVDLQYLHAAGLMK* |
Ga0099792_111441012 | 3300009143 | Vadose Zone Soil | QVIAMLGEAGALKPPLPKPEQFVDLQYLRAGGLQ* |
Ga0075423_106173721 | 3300009162 | Populus Rhizosphere | LKGFRTAIEFLGEASVLKAPLPPVERFVDLQYLRAAGLQ* |
Ga0105242_125328781 | 3300009176 | Miscanthus Rhizosphere | VAQVIAFMADAGQIQPPLPDAERFVDLRFLARAGVAPR* |
Ga0126374_103693691 | 3300009792 | Tropical Forest Soil | AGEINIKGMEQVIAMMGEGGVLAPPLPAADRFVDLQFLQAAGIQ* |
Ga0126384_117603251 | 3300010046 | Tropical Forest Soil | LRGMEQVIAMMGEGGVLAPPLPPADRFVDLQFLQAAGIQ* |
Ga0126372_105268743 | 3300010360 | Tropical Forest Soil | AGEINIKGMEQVIAMMGEGGVLAPPLPAADRFVDLQFLQAAGSQ* |
Ga0126378_130893791 | 3300010361 | Tropical Forest Soil | VSAVIALLGRTGELTASLPAAERFVDLQYLEAAGLQ* |
Ga0126379_103894181 | 3300010366 | Tropical Forest Soil | LKGLAQAITLMGEAGTLKGPLPPAERFVDLQYLRLAGFQ* |
Ga0126381_1024377311 | 3300010376 | Tropical Forest Soil | LKQVITLMSEAGNLKPPLPAAERFVDAQYLKDAGVE* |
Ga0126381_1035491221 | 3300010376 | Tropical Forest Soil | ELNLNGLTQVSAFMGEAGTIESPLGAAGRFVDLQYLEAAGVR* |
Ga0126381_1038531101 | 3300010376 | Tropical Forest Soil | GMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAEVQ* |
Ga0126381_1042479892 | 3300010376 | Tropical Forest Soil | GAPARVVAMMAQAGLLKPPLPSAQRFVDLQYLRAAGLQ* |
Ga0136847_105614102 | 3300010391 | Freshwater Sediment | GQVIQFMGEAGELKPPLPAPERFVDLQYLQAAGVY* |
Ga0126383_101452594 | 3300010398 | Tropical Forest Soil | QVVAMMAETGALSPPLPAPETFVDRQYLQAAGVN* |
Ga0126383_113844252 | 3300010398 | Tropical Forest Soil | QAIALMGEAGTLKGPLPPAERFVDLQYLRLAGFQ* |
Ga0126383_117137131 | 3300010398 | Tropical Forest Soil | TAIEFMGEAGVLKEPLPPPERFVDLQYLHAAGVQ* |
Ga0134122_130820801 | 3300010400 | Terrestrial Soil | SFAQVIAFMVEAGQLKPPLPAAERFVDLQYLEAAGVR* |
Ga0137425_10107593 | 3300011422 | Soil | QVIAFMAEAGQLKAPLPLPERFVDLQYLRLAGVR* |
Ga0137388_100136034 | 3300012189 | Vadose Zone Soil | LELVIAMMGEGGALKAPLPQAERFVDLQYLRAAGVQ* |
Ga0137365_100318083 | 3300012201 | Vadose Zone Soil | MEQVIALMGESETIKAPLPAVERFVDLQYLHAAGVQ* |
Ga0150985_1175447971 | 3300012212 | Avena Fatua Rhizosphere | ADLAQVIAFMGDGGMLAQPLPPPERFVDPQYLKAAGAE* |
Ga0150984_1218103203 | 3300012469 | Avena Fatua Rhizosphere | LAQVIAMMADAGAIKTPLPSPEQFVDLQYLRTAGVP* |
Ga0137397_103322013 | 3300012685 | Vadose Zone Soil | RAVSRCMIPLLPRETGDLKPPLPPAERFVDLQYLRAAGIQ* |
Ga0137407_101151084 | 3300012930 | Vadose Zone Soil | ATAIEFMGEAGVLKAPLPPVERFVDLQYLRAAGLQ* |
Ga0137407_104398082 | 3300012930 | Vadose Zone Soil | KALDQVIAMLGEAGALKAPLPSAERFVDLQYLRAAGVQ* |
Ga0137407_111648752 | 3300012930 | Vadose Zone Soil | GQVIQFMAEAGELKPPLPQPERFVDLQYLQAAGLQ* |
Ga0164298_105248782 | 3300012955 | Soil | GMEQAIALMGEGGILKAPLPRAERFVDLQYLQAAGVQ* |
Ga0164303_109984491 | 3300012957 | Soil | MTKVIELLGQTGELKGPLPAAERFVDLQYLEAAGMR* |
Ga0126369_113727171 | 3300012971 | Tropical Forest Soil | GEINLRGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAEVQ* |
Ga0126369_131619401 | 3300012971 | Tropical Forest Soil | QVIAMMGDAGTLGSPLPSPQQFVDLQYLHAAGIQ* |
Ga0134076_106382241 | 3300012976 | Grasslands Soil | KGLGQVIQFMGETGDLKPPLPSAERFVDLQYLRAVGIQ* |
Ga0164309_113115963 | 3300012984 | Soil | ELAGIAQVITFMAEAGQLKPPLPAPERFVDLRYQRR* |
Ga0164306_105604491 | 3300012988 | Soil | LAQVIAMMSDAGAIKTPLPSPEQFVDLQYLRRAGVP* |
Ga0182008_105020311 | 3300014497 | Rhizosphere | EQVIALMGEGGVLEPPLPPAERFVDLQFLAAAGVQ* |
Ga0180082_10038484 | 3300014880 | Soil | AQVIAFMAEAGQLKAPLPLPERFVDLQYLRLAGVR* |
Ga0137412_102992822 | 3300015242 | Vadose Zone Soil | MEQAITLMGESGVIKAPLPAAERFVDLQYLRAAGVQ* |
Ga0182036_117192572 | 3300016270 | Soil | NGLEQVIAMMAEARTLNPPLPSADRFVDLQYLRATGVQ |
Ga0182037_103405071 | 3300016404 | Soil | KLGEINLKGMEQVIALMAEGETINAPVPAAERFVDLQYLHAAGVQ |
Ga0182037_106619323 | 3300016404 | Soil | EQVIAMMAKARTLNPPLPSAERFVDLQYLRSAGVQ |
Ga0182039_104319412 | 3300016422 | Soil | LGEINLKGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0182039_111904451 | 3300016422 | Soil | GILPKLGEINLKGMEQVIALMAEGETIKPPLPAAERFGDLQYLHAAGVQ |
Ga0163161_114317102 | 3300017792 | Switchgrass Rhizosphere | DPKGVAQVIAFMADAGQIQPPLPDAERFVDLRFLARAGVAPR |
Ga0163161_117932242 | 3300017792 | Switchgrass Rhizosphere | AQVIAFMGEGGMLAQPLPPPERFVDPQYLKAAGAE |
Ga0066667_101739681 | 3300018433 | Grasslands Soil | DIKGLGQVIQFMGETGDLKPPLPSAERFVDLQYLRAVGIQ |
Ga0193743_11234123 | 3300019889 | Soil | FVREGIIVEMISIEAETGDLKPPLPSAERFVDLQYLRAAGIQ |
Ga0210405_113275262 | 3300021171 | Soil | AQVIAFMTDGGAIKPPLPAPEQFVDLQYLRAAGAQ |
Ga0210408_101998821 | 3300021178 | Soil | GMEQVIALMGESATIKAPLPAAERFVDLQYLHAAGVQ |
Ga0210408_104817611 | 3300021178 | Soil | NLKGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0210397_107001452 | 3300021403 | Soil | PKLGEISLKGMEQAIALMAEGGMIKPPLPAAEQFADLQYLRAVGVP |
Ga0210394_118553691 | 3300021420 | Soil | ELEALEQVIALMGEAGNLKAPLPSAERFVDTQYLRAAGAQ |
Ga0213878_104688252 | 3300021444 | Bulk Soil | LKGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0210392_113951831 | 3300021475 | Soil | VGNVIELLGRSGELKAPLPAAERFTDLQYLEAAGLQ |
Ga0187846_102715871 | 3300021476 | Biofilm | GLQQVIAMMGEEGLIGKPPPQASRFVDLQYLHAAGVQ |
Ga0210398_100456675 | 3300021477 | Soil | GQVIAFMGEGGTIKPPLPAPEQFVDLQYLRAAGAQ |
Ga0207693_100143131 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | RGMEQVIALMGESATIKAPLPAAERFVDLQYLHAAGVQ |
Ga0207686_114547592 | 3300025934 | Miscanthus Rhizosphere | EQVIAFMGEAGNLTPPLPAAERFVDLQYLHAAGIK |
Ga0207669_109260461 | 3300025937 | Miscanthus Rhizosphere | QVIAFMADAGQIQPPLPDAERFVDLRFLARAGVAPR |
Ga0207712_117427202 | 3300025961 | Switchgrass Rhizosphere | LKGMEQVIAFMREAGTLNEPVPTAERFTDLQYLRLAGIK |
Ga0207648_108318741 | 3300026089 | Miscanthus Rhizosphere | AQVIAFMGEGGMLPQPLPPPERFVDPQYLKAAGAE |
Ga0207683_117565121 | 3300026121 | Miscanthus Rhizosphere | MAQVIAFMAEAGTVKAPLPAPERFFDLRYLQSALPK |
Ga0209234_12737543 | 3300026295 | Grasslands Soil | MEQVIALMGESETIKAPLPAVERFVDLQYLHAAGVQ |
Ga0209056_101481751 | 3300026538 | Soil | EINLKGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0209701_100676063 | 3300027862 | Vadose Zone Soil | LELVIAMMGEGGALKAPLPQAERFVDLQYLRAAGVQ |
Ga0209486_101126051 | 3300027886 | Agricultural Soil | KGLAQVIAFMAEAGQIQPPLPDVERFVDLSYLARAGVR |
Ga0209488_1000446012 | 3300027903 | Vadose Zone Soil | DPKGFSAAIEFMGEAGVLKPPFPKPDQFIDLQYLQAAGIQ |
Ga0209526_102768101 | 3300028047 | Forest Soil | AGLAKVIELLAETGQIGAPPPPAERFVDLQYLQAAGLQ |
Ga0307302_102648293 | 3300028814 | Soil | QQAIALMEEGGLLAQPLPAAERFIDLQYLRAAGIQ |
Ga0308198_10896561 | 3300030904 | Soil | KGMEQAITLMGESGVIKAPLPAAERFVDLQYLRAAGVQ |
Ga0307497_100125784 | 3300031226 | Soil | AQVIAMMRDAGAIKTPLPSPEQFVDLQYLRAAGVP |
Ga0318573_100416422 | 3300031564 | Soil | INLKGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0318496_101442982 | 3300031713 | Soil | LNGMEQVIALMAEGETINAPLPAAERFVDLQYLHAAGVQ |
Ga0318500_100928152 | 3300031724 | Soil | INLRGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0318500_103302962 | 3300031724 | Soil | TGMEQVIALMGESETIKAPLPVAERFVDLQYLHAAGVQ |
Ga0306918_102454231 | 3300031744 | Soil | LAGLKEVIALMGEAGNLKPPLPSAERFVDTRYLQAAGLQ |
Ga0318554_106501261 | 3300031765 | Soil | MQQAIAMLGESGVIKPPLPGAERFVDLQYLQAAGIQ |
Ga0318498_102306272 | 3300031778 | Soil | LKEVIALMGEAGNLKPPLPSAERFVDTQYLRAAGLQ |
Ga0318566_100908512 | 3300031779 | Soil | GMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0318566_103037831 | 3300031779 | Soil | LRGMEQVIALKAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0318567_100515121 | 3300031821 | Soil | GEINLKGMEQVIALMAEGETIKPPLPAAERFVDLQYLHAAGVQ |
Ga0318517_100099613 | 3300031835 | Soil | MEQVIALMGESETIKAPLPVAERFVDLQYLHAAGVQ |
Ga0306919_106562831 | 3300031879 | Soil | NGLEQVIAMMAEARTINPPLPSAERFVDLQYLRAAGVQ |
Ga0310916_106437422 | 3300031942 | Soil | GLEQVIAFMGEAGNLKPPLPPAERFVELQYLHAAGVK |
Ga0310910_107798671 | 3300031946 | Soil | LKGMEQVIALMGESETIKAPLPAAERFVDLQYLHAAGVQ |
Ga0318518_104566862 | 3300032090 | Soil | EQVIALMGESETIKAPLPVAERFVDLQYLHAAGVQ |
Ga0318519_100142153 | 3300033290 | Soil | NLKGMEQVIALMGESETIKAPLPVAERFVDLQYLHAAGVQ |
Ga0372943_0113994_33_143 | 3300034268 | Soil | LDQVIAFMGEGGALPAPLPPAARFVDLQYLRAAGVE |
⦗Top⦘ |