Multiple Hypothesis Testing for Data Mining

Multiple Hypothesis Testing for Data Mining

Sach Mukherjee (University of Oxford, UK)
Copyright: © 2005 |Pages: 6
DOI: 10.4018/978-1-59140-557-3.ch161
OnDemand PDF Download:
No Current Special Offers


A number of important problems in data mining can be usefully addressed within the framework of statistical hypothesis testing. However, while the conventional treatment of statistical significance deals with error probabilities at the level of a single variable, practical data mining tasks tend to involve thousands, if not millions, of variables. This Chapter looks at some of the issues that arise in the application of hypothesis tests to multi-variable data mining problems, and describes two computationally efficient procedures by which these issues can be addressed.

Complete Chapter List

Search this Book: