TY - GEN
T1 - Conducting laboratory experiments properly with statistical tools
T2 - 12th ACM International Conference on Web Search and Data Mining, WSDM 2019
AU - Sakai, Tetsuya
PY - 2019/1/30
Y1 - 2019/1/30
N2 - This hands-on half-day tutorial consists of two sessions. Part I covers the following topics: Preliminaries; Paired and two-sample t-tests, confidence intervals; One-way ANOVA and two-way ANOVA without replication; Familiwise error rate. Part II covers the following topics: Tukey's HSD test, simultaneous confidence intervals; Randomisation test and randomised Tukey HSD test; What's wrong with statistical significance tests?; Effect sizes, statistical power; Topic set size design and power analysis; Summary: how to report your results. Participants should have some prior knowledge about the very basics of statistical significance testing and are strongly encouraged to bring a laptop with R already installed. They will learn how to design and conduct statistical significance tests for comparing the mean effectiveness scores of two or more systems appropriately, and to report on the test results in an informative manner.
AB - This hands-on half-day tutorial consists of two sessions. Part I covers the following topics: Preliminaries; Paired and two-sample t-tests, confidence intervals; One-way ANOVA and two-way ANOVA without replication; Familiwise error rate. Part II covers the following topics: Tukey's HSD test, simultaneous confidence intervals; Randomisation test and randomised Tukey HSD test; What's wrong with statistical significance tests?; Effect sizes, statistical power; Topic set size design and power analysis; Summary: how to report your results. Participants should have some prior knowledge about the very basics of statistical significance testing and are strongly encouraged to bring a laptop with R already installed. They will learn how to design and conduct statistical significance tests for comparing the mean effectiveness scores of two or more systems appropriately, and to report on the test results in an informative manner.
KW - Confidence intervals
KW - Effect sizes
KW - Multiple comparison procedures
KW - Randomisation test
KW - Sample sizes
KW - Statistical power
KW - Statistical significance
KW - T-test
KW - Tukey's honestly significant difference test
UR - http://www.scopus.com/inward/record.url?scp=85061708391&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061708391&partnerID=8YFLogxK
U2 - 10.1145/3289600.3291378
DO - 10.1145/3289600.3291378
M3 - Conference contribution
AN - SCOPUS:85061708391
T3 - WSDM 2019 - Proceedings of the 12th ACM International Conference on Web Search and Data Mining
SP - 830
EP - 831
BT - WSDM 2019 - Proceedings of the 12th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery, Inc
Y2 - 11 February 2019 through 15 February 2019
ER -