Order Statistics in Simulation

Order Statistics in Simulation

E Jack Chen
Copyright: © 2014 |Pages: 12
DOI: 10.4018/978-1-4666-5202-6.ch156
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Order statistics refer to the collection of sample observations sorted in ascending order and are among the most fundamental tools in non-parametric statistics and inference. Statistical inference established based on order statistics assumes nothing stronger than continuity of the cumulative distribution function of the population and is simple and broadly applicable. We discuss how order statistics are applied in computer simulation, e.g., tests of independence, tests of goodness of fit, hypothesis tests of equivalence of means, ranking and selection, and quantiles estimation. These order-statistics techniques are key components of many simulation studies.
Chapter Preview
Top

Background

Let 978-1-4666-5202-6.ch156.m01 denote a sequence of mutually independent random samples from a common distribution of the continuous type having a probability density function (pdf) 978-1-4666-5202-6.ch156.m02 and a cumulative distribution function (cdf) F. Let 978-1-4666-5202-6.ch156.m03 be the uth smallest of these 978-1-4666-5202-6.ch156.m04 such that 978-1-4666-5202-6.ch156.m05. Then 978-1-4666-5202-6.ch156.m06 is called the uth order statistic of the random sample 978-1-4666-5202-6.ch156.m07 Note that even though the samples 978-1-4666-5202-6.ch156.m08 are independently and identically distributed (iid), the order statistics 978-1-4666-5202-6.ch156.m09 are not independent because of the order restriction. The difference 978-1-4666-5202-6.ch156.m10 is called the sample range. It is a measure of the dispersion in the sample and should reflect the dispersion in the population.

Suppose 978-1-4666-5202-6.ch156.m11, where “978-1-4666-5202-6.ch156.m12” denotes “is distributed as” and “978-1-4666-5202-6.ch156.m13” denotes a uniform distribution with range [a,b]. We are interested in the distribution of 978-1-4666-5202-6.ch156.m14, which can be viewed as the second order statistic. The cdf of 978-1-4666-5202-6.ch156.m15 is

Key Terms in this Chapter

Indifference Zone: We don’t distinguish values that are deviated less than a significant amount (the indifference amount), i.e., they are within in the indifference zone.

Critical Constant: In ranking and selection, the critical constant is a required parameter for computing the required sample sizes and is the P * quantile of the difference of two specific random values.

Histogram: A graphical representation showing the distribution of data.

Range Statistics: The statistics of the difference between the maximum and minimum observations. Range statistics give an estimate of the spread of the data.

Q-Q Plot: A probability plot for comparing two probability distributions by plotting their quantiles against each other.

Empirical Distribution Function: The cumulative distribution function (cdf) associated with the empirical observations of the sample. The empirical distribution function estimates the true underlying cdf of the sample.

Order Statistics: The collection of sample observations sorted in ascending order.

Tests of Independence: Statistical null hypothesis tests to determine whether sequences of observations appear to be independent.

Quantile Function (Inverse Distribution Function): The quantile function (given a probability p ) returns the value at or below which 100 p percent of the population lies.

Complete Chapter List

Search this Book:
Reset