Basic Information | |
---|---|
Family ID | F097971 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 104 |
Average Sequence Length | 52 residues |
Representative Sequence | PARMSEMVQAGLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRYK |
Number of Associated Samples | 85 |
Number of Associated Scaffolds | 104 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Bacteria |
% of genes with valid RBS motifs | 1.92 % |
% of genes near scaffold ends (potentially truncated) | 96.15 % |
% of genes from short scaffolds (< 2000 bps) | 83.65 % |
Associated GOLD sequencing projects | 77 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.41 |
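
The percentages in the Quality Assessment table can be read back as whole-gene counts against the 104 sequences in the family. The sketch below is illustrative only: it assumes each percentage was computed over all 104 family members and rounded to two decimals, which the page does not state. The helper names `implied_count` and `matches_reported` are hypothetical.

```python
FAMILY_SIZE = 104  # "Number of Sequences" from the Basic Information table


def implied_count(percent, total=FAMILY_SIZE):
    """Return the whole-number gene count closest to `percent` of `total`."""
    return round(percent * total / 100)


def matches_reported(count, percent, total=FAMILY_SIZE):
    """Check that `count` out of `total` rounds to the reported percentage."""
    return round(100 * count / total, 2) == percent


# Percentages copied from the Quality Assessment table.
reported = {
    "valid RBS motif": 1.92,
    "near scaffold ends": 96.15,
    "short scaffolds (< 2000 bp)": 83.65,
}

for label, percent in reported.items():
    count = implied_count(percent)
    consistent = matches_reported(count, percent)
    print(f"{label}: {count} of {FAMILY_SIZE} genes (consistent: {consistent})")
```

Under that assumption, the three figures correspond to 2, 100, and 87 genes respectively.
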
Most Common Taxonomy | |
---|---|
Group | Bacteria (98.077 % of family members) |
NCBI Taxonomy ID | 2 |
Taxonomy | All Organisms → cellular organisms → Bacteria |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (41.346 % of family members) |
Environment Ontology (ENVO) | Unclassified (39.423 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (43.269 % of family members) |
Full Alignment |
---|
Alignment of all the sequences in the family. |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 15.00%, β-sheet: 10.00%, Coil/Unstructured: 75.00% |
Structure Viewer | |
---|---|
Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100 | pTM-score: 0.41 |
Visualization |
---|
Habitat distribution chart (labels only): Bog Forest Soil; Freshwater Sediment; Watersheds; Vadose Zone Soil; Grasslands Soil; Surface Soil; Soil; Grasslands Soil; Soil; Hardwood Forest Soil; Peatland; Tropical Peatland; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere |
Protein ID | Sample Taxon ID | Habitat | Sequence |
---|---|---|---|
JGI12709J13192_10093052 | 3300001086 | Forest Soil | RPARMSEMVQAGLLRGIPGDPLGFAYIFGDDGKAELNLDSPLLEQQLLLDRYK* |
JGI12636J13339_10107901 | 3300001154 | Forest Soil | RGIPGDPLGFAYIFGDDGKAELNLDSPLLEQQLLLDRYK* |
JGI25381J37097_10384481 | 3300002557 | Grasslands Soil | LLRGIPGDPLGFAYILGENGKAELNLDSPLLEQQLLLDRFK* |
JGI25384J37096_101084882 | 3300002561 | Grasslands Soil | SELVWAGLLRGIPGDPLGFAYIFSEEGKAELNLDSPLLEQQLLIDRLSSQHR* |
JGI25617J43924_100188582 | 3300002914 | Grasslands Soil | NELVWAGMLPGIPGDPLGFAYIFSEEGKAELNLGSPLLEQQLLIDRLSSQHR* |
JGI25617J43924_101226312 | 3300002914 | Grasslands Soil | NELVWAGMLPGIPGDPLGFAYIFSEEGKAELNLDSPLLEQQLLIDRLSSQHR* |
JGI26344J46810_10219921 | 3300003224 | Bog Forest Soil | RRPTRMGELVQAGLLPRVPADPMGYAYVFGPDGKAALNLDSPLLEQQLLHQDSK* |
JGI26337J50220_10182651 | 3300003370 | Bog Forest Soil | DEFQKRTGRRPTRMGELVQAGLLPRVPADPMGYAYVFGPDGKAALNLDSPLLEQQLLHQDSK* |
Ga0066680_101393031 | 3300005174 | Soil | RRPARMSDLVQAGLLRGVPADPEGFAYVFGEDGKAEINLDSPLLEQQLLLERLNKAVPR* |
Ga0070711_1017301921 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | QAGLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLIDRLSSQHR* |
Ga0066692_100345212 | 3300005555 | Soil | AGLIRGIPRDPLGFAYVFGEGGKAELNLESPLLEKELMFDRFKKALP* |
Ga0066692_106146691 | 3300005555 | Soil | PTRMSELVWAGLVRGIPGDPLGFAYIFSEEGKAELNLDSPLLEQQLLIDRLSSQHR* |
Ga0066670_107063781 | 3300005560 | Soil | RGIPGDPLGFAYIFGQDGKAQLNLDSPLLEQQLLLDRFK* |
Ga0066691_100702031 | 3300005586 | Soil | KRPARMSEMVQAGLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRFK* |
Ga0075028_1007428072 | 3300006050 | Watersheds | SEMVQAGLLRGIPGDPLGFAYIFGEDGKADLNLDSPLLEQQLLLDRFK* |
Ga0075018_104511551 | 3300006172 | Watersheds | VQAGLIRGIPGDPLGFAYVFGEDGKVELNLDSPLLEQQVLLERYK* |
Ga0070716_1014253421 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | LLRGVPGDPLGFAYIFGQDGKAQLNLDSPLLEQQLLLDRFK* |
Ga0075014_1008765662 | 3300006174 | Watersheds | AKRFGKRPGRMSEMVQAGLLRGIPGDPLGFAYIFGEDGKTELNLDSPLLEQQLLLERFK* |
Ga0066653_104605282 | 3300006791 | Soil | RPSRMSDLVQAGLVRGIPRDPLGFPYVFGASGKTELNLDSPLLEKQLMIERFRKYVP* |
Ga0066665_100043278 | 3300006796 | Soil | YGKRPARISQMVQAGLLRGIPGDPLGFAYIFGQDGKAQLNLDSPLLEQQLLLDRFK* |
Ga0099793_100417511 | 3300007258 | Vadose Zone Soil | RMNELVWAGLLPGIPADPLGFAYIFGDEGKAELNLDSPLLEQQLLLERFSRQQRR* |
Ga0099794_102643432 | 3300007265 | Vadose Zone Soil | LRGVPADPEGFAYVFGEDGKAELNLDSPLLEQQVLLERLNKAVPR* |
Ga0099830_100672122 | 3300009088 | Vadose Zone Soil | MVQAGLLRGVPGDPLGFAYLFTEDGKAELNPDSPLLEQQLLLDRFK* |
Ga0137392_105442302 | 3300011269 | Vadose Zone Soil | ARADEYEKRYGRRPKHVSELVQAGLLRGLPADPLGYAYIFGEDGKAELNLQSPLLEERLLLEHRK* |
Ga0137391_110937942 | 3300011270 | Vadose Zone Soil | GHRPARMSEMVQAGLLRGIPGDPLGYAYVFGEDGKAELNLKSPLLEQQLLLERFQKAVPRN* |
Ga0137383_101998421 | 3300012199 | Vadose Zone Soil | EMVQAGLLRGIPGDPLGYAYVFGEDGKAELNLDSPLLEQQLLLERFK* |
Ga0137363_113143811 | 3300012202 | Vadose Zone Soil | RPARMSEMVQAGLLRGIPGDPLGFAYILGEDGKAELNLDSPLLEQQLLFDRFK* |
Ga0137399_100322441 | 3300012203 | Vadose Zone Soil | AKRNGRRPARMSDLVQAGLLPGLPEDPAGFVYVFGEEGKAELNLDSPLLEQQLLLDRLNKAVPR* |
Ga0137399_106114282 | 3300012203 | Vadose Zone Soil | GLLPGIPRDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRFK* |
Ga0137380_104466411 | 3300012206 | Vadose Zone Soil | HPTRMSELVWAGLLRGIPGDPLGFAYIFSEEGKAELNLDSPLLEQQLLIDRLSSQHR* |
Ga0137380_106029001 | 3300012206 | Vadose Zone Soil | RMNEMVQAGLLHGISGDPMGYAYVFGEDGKAELNLDSPLLEQKLLLERFK* |
Ga0137381_101099301 | 3300012207 | Vadose Zone Soil | RPARMSEMVQAGLLRRIPGDPLGFAYIFGQDGKAELNLDSPLLQQQLLLDRYKLGP* |
Ga0137381_108266492 | 3300012207 | Vadose Zone Soil | MSELVQAGLIGGLPGDPLGFAYVFGEDGKAALNLNSPLLEQQQLLDRYRRAVP* |
Ga0137378_117578882 | 3300012210 | Vadose Zone Soil | PGDPLGYAYVFGEDGKAELNLDSPLLEQQLLLERFK* |
Ga0137370_103636001 | 3300012285 | Vadose Zone Soil | LLRGIPSDPLGFAYIFGENGKAELNLDSPLLEQQLLLDRFK* |
Ga0137387_105165102 | 3300012349 | Vadose Zone Soil | LLRGIPGDPLGFAYIFGEDGKAELNLDSPLLQQQLLLDRYKLGP* |
Ga0137360_112864521 | 3300012361 | Vadose Zone Soil | QAGLIRGIPGNPLGFAYVFGEDGKAELNLDSPLLEQQLLLDRYKQAEPRN* |
Ga0137360_116416452 | 3300012361 | Vadose Zone Soil | YGKRPAHMSEIVQAGLLRGIPGDPLGFAYILGEDGKAELNLDSPLLEQQLLFDRFK* |
Ga0137390_119290301 | 3300012363 | Vadose Zone Soil | LRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRHK* |
Ga0137396_101591902 | 3300012918 | Vadose Zone Soil | KRNGRRPARMTDLVQAGLLCGVPADPEGFAYVFGEDGKAELNLDSPLLEQQVLLERLNKVVPR* |
Ga0137396_104451121 | 3300012918 | Vadose Zone Soil | GRRPARMMDLVQAGLLRAAPVDPEGFGYVFGEGGKAELNLDSPLLEQQLLIERFNKALPR |
Ga0137396_104694481 | 3300012918 | Vadose Zone Soil | MSEMVQAGLLPGIPRDPLGFAYIFGEDGKAELNLGSPLLEQQLLLDRFK* |
Ga0137396_105116951 | 3300012918 | Vadose Zone Soil | AKRNGRRPARMSDLVQAGLLRGMPADPEGFAYVFGEEGKAELNLDSPLLEQQLLLEHFNKIIPR* |
Ga0137396_111776221 | 3300012918 | Vadose Zone Soil | PARMSEMVQAGLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRYK* |
Ga0137396_112797401 | 3300012918 | Vadose Zone Soil | LVQAGLLRAAPVDPEGFGYVFGEGGKAELNLDSPLLEQQLLIEHFNKALPR* |
Ga0137394_104938661 | 3300012922 | Vadose Zone Soil | AKRNGRRPARMSDLVQAGLLRGVPADPEGFAYVFGEDGKAELNLDSPLLEQQLLLERLNKAVPR* |
Ga0137394_115564522 | 3300012922 | Vadose Zone Soil | AKRNGRRPARMSDLVQAGLLRGVPADPEGFAYVFGEDGKAEINLDSPLLEQQLLLERLNKAVPR* |
Ga0137413_109010292 | 3300012924 | Vadose Zone Soil | VQAGLIRGIPGDPLGFAYVFGEDGKAELNLNSPLLEQQLLFERFKQAEPRY* |
Ga0137419_100188033 | 3300012925 | Vadose Zone Soil | EYAKRYGKRPARISQMVQAGLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLNRFK* |
Ga0137416_113422571 | 3300012927 | Vadose Zone Soil | AKRYDKRPTRMSEMVQVGLLRGIPGDPLGFAYIFGEGGKAELNLGSPLLERQLLFDRLKQASP* |
Ga0137416_113981951 | 3300012927 | Vadose Zone Soil | EYAKRYGKRPARMSEMVQAGLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLFDRFK* |
Ga0137416_114219342 | 3300012927 | Vadose Zone Soil | LIRGIPRDPLGYAYVVGKDGKVELDLDSPLLEQQLMQERFRKAVP* |
Ga0137407_115978412 | 3300012930 | Vadose Zone Soil | GIPEDPLGFAYVFGEGGKAELNLESPLLEKQLMQERFRRAVH* |
Ga0134110_101260402 | 3300012975 | Grasslands Soil | YGKRPARMSEMVQAGLLRRIPGDPLGFAYIFGQDGKAELNLDSPLLQQQLLLDRYKLGP* |
Ga0137414_12138725 | 3300015051 | Vadose Zone Soil | MSEIVQAGLLRGIPGDPLGFAYILGEDGKAELNLDSPLLEQQLLFDRFK* |
Ga0137411_12819237 | 3300015052 | Vadose Zone Soil | MNMRSVFGKRPARMGDLVQAGLLRGIPGDPLGFPYIFGEDEKAELNLDSPLLEQQLLLERFK* |
Ga0187820_10574711 | 3300017924 | Freshwater Sediment | EYAKRFGHRPTRMSELVQAGLLRGIPGDPLGYAYVFDPDGKAALNLDSPLLEQELLMQRF |
Ga0187808_101359792 | 3300017942 | Freshwater Sediment | KRFGHRPTRMSELVQAGLLRGIPGDPLGYAYVFDPDGKAALNLDSPLLEQELLMQRFK |
Ga0187778_100150971 | 3300017961 | Tropical Peatland | QAGLLRGIPGDPKGYAYVFDEDGKAALNLNSPLLEQQLLLENLK |
Ga0187766_109509441 | 3300018058 | Tropical Peatland | RFGHRPKRMTELVQAGLLRGIPGDPKGFAYVFDEDGKVALNLDSPLLEEQLKLENLK |
Ga0187769_100086106 | 3300018086 | Tropical Peatland | AQAGLLRGVPVDPRGYAYVFDENGKAALNLDSPLLEQQLLLEHFK |
Ga0066655_109204191 | 3300018431 | Grasslands Soil | TFQAGLLRPIPGDPLGFAYIFGQDGKAELNLDSPLLQQQLLLDRYKLGP |
Ga0066662_106537462 | 3300018468 | Grasslands Soil | GRRPSRMSEMVQTGLLHGIPGDPLGHAYVFGEDGKAELNLDSPLLEQQLLLERLK |
Ga0066669_122544021 | 3300018482 | Grasslands Soil | SDGKRPDHISQMVQAGLLRGIPGDPLGFAYIFGEAGKAELNLDSPLLEQQLLLDRFK |
Ga0187800_10825872 | 3300019278 | Peatland | VQAGLIRGLPADPKGFAYEFDENGKAALNLDSPLLEQQLTLENFK |
Ga0179590_10130501 | 3300020140 | Vadose Zone Soil | RPARMSEIVQAGLLRGIPGDPLGFAYILGEDGKAELNLDSPLLEQQLLFDRFK |
Ga0179594_101599472 | 3300020170 | Vadose Zone Soil | LIRGIPRDPKGFAYVLGQDGKAELNLDSPLLERELLFDRYKKVTP |
Ga0179594_102862642 | 3300020170 | Vadose Zone Soil | RPARMSEMIQAGLLRGTPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRYK |
Ga0210407_100667501 | 3300020579 | Soil | QAGLIRGIPGDPLGYAYVFGEDGKAELNLDSPLLEQQLLLDRYKQAEPRN |
Ga0210399_101100872 | 3300020581 | Soil | ELVQAGLLQPQFLKDQEGYAYVFGEKGKAELNVDSPLLEQQLLFEHFKAN |
Ga0210399_115958322 | 3300020581 | Soil | PMRMGELVQAGLLPKLLKDPEGFAYVFGEKGKAELNVDSPLLEQQLLFEHFK |
Ga0210395_105060422 | 3300020582 | Soil | KIIELVQAGLVRGLPIDPLGYQYVFGTDGKAELNLDSPLLEEQVMNPDIK |
Ga0210401_114120081 | 3300020583 | Soil | PTHMAELIQAGLLRGVPKDPAGFAYVFGENGKAELNVESPLLEQQLLMERYKK |
Ga0210405_104143671 | 3300021171 | Soil | ELVQAGLLRGIAVDPDGFAYEFSEEGKAELNLDSPLLEQQLLLEKFK |
Ga0210388_100308621 | 3300021181 | Soil | DEYQKRNGKRPTRMAELIQAGLLRGVPKDPAGFVYVFGEYGKAELNVDSPLLEQQLLMERYKK |
Ga0210397_115976001 | 3300021403 | Soil | QAGYLRGLPVDPDGFAYVFSEEGKAELNLDSPLLEQQLLLEKFK |
Ga0210398_107913911 | 3300021477 | Soil | YEKRFGRRPAKMMELVQAGLVRGLPVDPLGYQYVFGADGKAELNLDSPLLEEQVMNPDIK |
Ga0179589_100313364 | 3300024288 | Vadose Zone Soil | ADEYAKQYGKRPARMSEIVQAGLLRGIPGDPLGFAYILGEDGKAELNLDSPLLEQQLLFDRFK |
Ga0209055_12741171 | 3300026309 | Soil | QAGLLRRIPGDPLGFAYIFGQDGKAELNLDSPLLQQQLLLDRYKLGP |
Ga0209802_11830452 | 3300026328 | Soil | MSDLVQAGLLRGVPADPEGFAYVFGEDGKAEINLDSPLLEQQLLLERLNKAVPR |
Ga0209375_12462542 | 3300026329 | Soil | DEYAKRYGKRPARISQMVQAGLLRGIPGDPLGFAYIFGQDGKAQLNLDSPLLEQQLLLDRFK |
Ga0209158_12365412 | 3300026333 | Soil | GIPRDPLGYAYVVGKDGKVELDLDSPLLEQQLMQERFRKAVP |
Ga0209377_10252572 | 3300026334 | Soil | MNDLAQAGLIRGIPRDPLGFAYVFGEGGKAELNLESPLLEKELMFDRFKKALP |
Ga0209804_10482741 | 3300026335 | Soil | LLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRFK |
Ga0257172_10134091 | 3300026482 | Soil | YGKRPARMSEMVQAGLLPGIPRDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRFK |
Ga0209648_104780551 | 3300026551 | Grasslands Soil | EMVHTGLLPGIPRDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRFK |
Ga0208603_10240312 | 3300027109 | Forest Soil | PTRMSEMVQAGLLRAIPGDPLGFAYIFGENGKAELNLDSPLLEQQLILEHFK |
Ga0209733_10753912 | 3300027591 | Forest Soil | GLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRYK |
Ga0209009_11738492 | 3300027667 | Forest Soil | LRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLDRYK |
Ga0209701_101512062 | 3300027862 | Vadose Zone Soil | RYGKRPTRMSEMVQAGLLRGVPGDPLGFAYLFTEDGKAELNPDSPLLEQQLLLDRFK |
Ga0209579_104312212 | 3300027869 | Surface Soil | GLLPGMPVDPAGYAYELGENGKAELNLNSPLLEQQLLFDKSK |
Ga0209590_103372321 | 3300027882 | Vadose Zone Soil | GIPGDPLGYAYVFGEDGKAELNLDSPLLEQQLLLERFK |
Ga0209068_101088292 | 3300027894 | Watersheds | PGDPKGYAYVFGPDGKADLNLDSPLLEQQLLMERFR |
Ga0209488_105385351 | 3300027903 | Vadose Zone Soil | GIPGDPLGFAYILGEDGKAELNLDSPLLEQQLLFDRFK |
Ga0137415_104963871 | 3300028536 | Vadose Zone Soil | IPGDPLGFPYIFGEDEKAELNLDSPLLEQQLLLERFK |
Ga0307469_109316452 | 3300031720 | Hardwood Forest Soil | YAKRYGKRPARMSELVQAGLIRGIPGDPLGFAYVFGEDGKAELNLNSPLLEQQLLFDRFKQAEPRY |
Ga0307475_103569022 | 3300031754 | Hardwood Forest Soil | GLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLNRYK |
Ga0307473_107184242 | 3300031820 | Hardwood Forest Soil | GLLRGVPADPEGFAYVFGEDGKAELNLDSPLLEQQLLLERFDKAVPR |
Ga0307473_112131341 | 3300031820 | Hardwood Forest Soil | LRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLLNRYK |
Ga0307479_100664403 | 3300031962 | Hardwood Forest Soil | KRYGRRPARMSELVQAGLLRGIPGDPLGFAYIFSEDGKAELNLDSPLLEQQLLLDRFKQAVPRN |
Ga0307479_120293951 | 3300031962 | Hardwood Forest Soil | PARMSEIVQAGMLQGIPGDPLGFAYIFGEDGKAELNLNSPLLEQQLLLERFK |
Ga0307471_1031537332 | 3300032180 | Hardwood Forest Soil | ARVSDLAEAGLIRGIPRDPKGFAYVFGEDGKAELDLDSPLLEKELLFDRYKKVVP |
Ga0307472_1001718252 | 3300032205 | Hardwood Forest Soil | TRMSELVQAGLLPGLPKDPEGFAYVFGEDGKAELNLDSPLLEQQLLIERYNKAIPR |
Ga0307472_1025164462 | 3300032205 | Hardwood Forest Soil | RNGRRPARMTDLVQAGLLRGVPADPEGFAYVFGEEGKAELNLDSPLLEQQLLLERLNKAVPR |
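
The rows of the member table are pipe-delimited, so they can be parsed with a few lines of code. The following is a minimal sketch rather than an official parser: the function name `parse_row` and the output field names are hypothetical, and it assumes that a trailing `*` on a sequence marks a translation stop, as is conventional for translated gene calls (the page itself does not say so).

```python
from collections import Counter

# Three rows copied verbatim from the table above.
SAMPLE_ROWS = [
    "JGI12636J13339_10107901 | 3300001154 | Forest Soil | RGIPGDPLGFAYIFGDDGKAELNLDSPLLEQQLLLDRYK* |",
    "Ga0070711_1017301921 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | QAGLLRGIPGDPLGFAYIFGEDGKAELNLDSPLLEQQLLIDRLSSQHR* |",
    "Ga0187778_100150971 | 3300017961 | Tropical Peatland | QAGLLRGIPGDPKGYAYVFDEDGKAALNLNSPLLEQQLLLENLK |",
]


def parse_row(line):
    """Split one table row into its four cells and normalise the sequence."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    if len(cells) != 4:
        raise ValueError(f"expected 4 cells, got {len(cells)}: {line!r}")
    protein_id, taxon_id, habitat, sequence = cells
    has_stop = sequence.endswith("*")  # assumed stop marker, see lead-in
    return {
        "protein_id": protein_id,
        "sample_taxon_id": taxon_id,
        "habitat": habitat,
        "sequence": sequence.rstrip("*"),
        "has_stop": has_stop,
    }


records = [parse_row(row) for row in SAMPLE_ROWS]
habitat_counts = Counter(record["habitat"] for record in records)

for record in records:
    print(record["protein_id"], len(record["sequence"]), record["has_stop"])
print(habitat_counts.most_common())
```

Splitting on `|` rather than on commas keeps habitat names such as "Corn, Switchgrass And Miscanthus Rhizosphere" intact.
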