A Fast and Exact Randomisation Test for Comparing Two Systems with Paired Data

Rikiya Suzuki, Tetsuya Sakai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The randomisation test with B trials has been used in the information retrieval (IR) field for comparing two systems with paired data (i.e., a common set of topics). It approximates the exact randomisation test whose time complexity is O(2n) for n topics. In this paper, we first show that the counting operation for obtaining the exact p-value in a randomisation test can be reduced to a subsequence sum enumeration problem that can be solved by dynamic programming. By taking advantage of this observation along with the fact that we only require a small number of significant digits in IR evaluation measure scores, we demonstrate that the time complexity of the exact randomisation test can be reduced to O(mn), where m is the maximum subsequence sum of the paired score differences. Hence, researchers can utilise our test if they want to avoid random sampling and/or normality assumptions and want a fast and reliable test. In addition, we utilise Vandermonde's convolution in order to theoretically explain a known fact, namely, that the sign test is a special case of the exact randomisation test.

Original languageEnglish
Title of host publicationICTIR 2021 - Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval
PublisherAssociation for Computing Machinery, Inc
Pages239-243
Number of pages5
ISBN (Electronic)9781450386111
DOIs
Publication statusPublished - 2021 Jul 11
Event11th ACM SIGIR International Conference on Theory of Information Retrieval, ICTIR 2021 - Virtual, Online, Canada
Duration: 2021 Jul 11 → …

Publication series

NameICTIR 2021 - Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval

Conference

Conference11th ACM SIGIR International Conference on Theory of Information Retrieval, ICTIR 2021
Country/TerritoryCanada
CityVirtual, Online
Period21/7/11 → …

Keywords

  • permutation test
  • randomization test
  • sign test
  • statistical significance

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Information Systems

Fingerprint

Dive into the research topics of 'A Fast and Exact Randomisation Test for Comparing Two Systems with Paired Data'. Together they form a unique fingerprint.

Cite this