MotiMul: A significant discriminative sequence motif discovery algorithm with multiple testing correction

Koichi Mori, Haruka Ozaki, Tsukasa Fukunaga

研究成果: Conference contribution

抄録

Sequence motifs play essential roles in intermolecular interactions such as DNA-protein interactions. The discovery of novel sequence motifs is therefore crucial for revealing gene functions. Various bioinformatics tools have been developed for finding sequence motifs, but until now there has been no software based on statistical hypothesis testing with statistically sound multiple testing correction. Existing software therefore could not control for the type-l error rates. This is because, in the sequence motif discovery problem, conventional multiple testing correction methods produce very low statistical power due to overly-strict correction. We developed MotiMul, which comprehensively finds significant sequence motifs using statistically sound multiple testing correction. Our key idea is the application of Tarone's correction, which improves the statistical power of the hypothesis test by ignoring hypotheses that never become statistically significant. For the efficient enumeration of the significant sequence motifs, we integrated a variant of the PrefixSpan algorithm with Tarone's correction. Simulation and empirical dataset analysis showed that MotiMul is a powerful method for finding biologically meaningful sequence motifs. The source code of MotiMul is freely available at https://github.com/ko-ichimo-ri/MotiMul.

本文言語English
ホスト出版物のタイトルProceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020
編集者Taesung Park, Young-Rae Cho, Xiaohua Tony Hu, Illhoi Yoo, Hyun Goo Woo, Jianxin Wang, Julio Facelli, Seungyoon Nam, Mingon Kang
出版社Institute of Electrical and Electronics Engineers Inc.
ページ186-193
ページ数8
ISBN(電子版)9781728162157
DOI
出版ステータスPublished - 2020 12月 16
外部発表はい
イベント2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 - Virtual, Seoul, Korea, Republic of
継続期間: 2020 12月 162020 12月 19

出版物シリーズ

名前Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020

Conference

Conference2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020
国/地域Korea, Republic of
CityVirtual, Seoul
Period20/12/1620/12/19

ASJC Scopus subject areas

  • コンピュータ サイエンスの応用
  • 情報システムおよび情報管理
  • 医学(その他)
  • 健康情報学

フィンガープリント

「MotiMul: A significant discriminative sequence motif discovery algorithm with multiple testing correction」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル