Drawing Representative Samples from Large Databases
Wen-Chi Hou (Southern Illinois University, USA), Hong Guo (Southern Illinois University, USA), Feng Yan (Williams Power, USA) and Qiang Zhu (University of Michigan, USA)
Copyright: © 2005
Sampling has been used in areas like selectivity estimation (Hou & Ozsoyoglu, 1991; Haas & Swami, 1992, Jermaine, 2003; Lipton, Naughton & Schnerder, 1990; Wu, Agrawal, & Abbadi, 2001), OLAP (Acharya, Gibbons, & Poosala, 2000), clustering (Agrawal, Gehrke, Gunopulos, & Raghavan, 1998; Palmer & Faloutsos, 2000), and spatial data mining (Xu, Ester, Kriegel, & Sander, 1998). Due to its importance, sampling has been incorporated into modern database systems.