A Review of Bootstrap Methods in Ranked Set Sampling

A Review of Bootstrap Methods in Ranked Set Sampling

Arpita Chatterjee, Santu Ghosh
Copyright: © 2022 |Pages: 19
DOI: 10.4018/978-1-7998-7556-7.ch008
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

This chapter provides a brief review of the existing resampling methods for RSS and its implementation to construct a bootstrap confidence interval for the mean parameter. The authors present a brief comparison of these existing methods in terms of their flexibility and consistency. To construct the bootstrap confidence interval, three methods are adopted, namely, bootstrap percentile method, bias-corrected and accelerated method, and method based on monotone transformation along with normal approximation. Usually, for the second method, the accelerated constant is computed by employing the jackknife method. The authors discuss an analytical expression for the accelerated constant, which results in reducing the computational burden of this bias-corrected and accelerated bootstrap method. The usefulness of the proposed methods is further illustrated by analyzing real-life data on shrubs.
Chapter Preview
Top

Introduction

The sampling design is a fundamental part of any statistical inference. A well-developed sampling design plays a critical role in ensuring that data can capture the distinct characteristics of the population and are sufficient to draw the conclusions needed. Choice of a sampling plan depends on many aspects, including but not limited to the purpose of the analysis, the rarity of the characteristics under study, the nature of the total population, the size of the area to be studied, and most importantly, the cost of the study. One of the most commonly cited sampling plans is the simple random sampling (SRS). In practice, a more structured sampling mechanism, such as stratified sampling or systematic sampling, may be obtained to achieve a representative sample of the population of interest. Ranked-set sampling (RSS) is an alternative data collection method and has been known as a cost-efficient sampling procedure for many years. This approach to data collection was first proposed by McIntyre (1952) to improve the precision of estimated pasture yield. Later, Takahasi and Wakimoto (1968) established a rigorous statistical foundation for the theory of RSS. The ranked set sampling (RSS) utilizes the basic intuitive properties associated with simple random sampling (SRS). However, it involves the extra structure induced through the judgment ranking and the independence of the resulting order statistics. As a result, the procedures based on RSS lead to more efficient estimators of population parameters than those based on an SRS with the same sample size. The existing literature also includes works on hypothesis testing and point and interval estimation under both parametric and nonparametric settings. The most basic version of RSS is the balanced RSS. The process of generating an RSS involves drawing k2 units at random from the target population. These items are then randomly divided into k sets of k units each. Within each set, the units are then ranked by some means other than a direct measurement. For example, visually or using a concomitant measurement that is comparatively cheaper to measure and easy to obtain than the measurement of interest. Finally, one item from each set is chosen for actual quantification. To be more specific, from the first set, we select the item with the smallest judgment-rank for measurement, from the second set, we choose the item with the second smallest judgment-rank, and so on, until the unit ranked largest is selected from the kth set. This complete procedure, called a cycle, is repeated independently m times to obtain a ranked set sample of size mk. Therefore, a balanced RSS of size mk requires a total of mk2 units to be selected, but only mk of them are measured. Hence, a more comprehensive range of the population can be covered while significantly reducing the sampling cost. According to Takahasi and Wakimoto (1968), for easy implementation of RSS, the set size k is usually kept as four or less. However, we can obtain a large sample by increasing the cycle size m. Another option is that of unbalanced RSS. In an unbalanced RSS, n×k units are selected at random from the target population. These items are then randomly divided into n sets of k units each. Units in each set are judgment ranked without measuring the actual units. In this setting, let mr denote the number of sets allocated to measure units having the rth judgment-rank such that 978-1-7998-7556-7.ch008.m01. The measured observations then constitute an unbalanced RSS of size n.

Key Terms in this Chapter

Coverage Error: The absolute difference between the true Coverage Probability and nominal overage Probability of confidence interval.

Big-Oh: a n =O ( b n ) if there exists a real constant c <0 and there an integer constant n 0 3 1 such that a n £ cb n for every integer n 3 n 0 .

Coverage Probability: This term refers to the probability that a confidence region contains the true value of a parameter.

Pivot: It is function of samples and unknown parameters whose probability distribution does not depends on the population characteristics.

Nominal Coverage Probability: Desired coverage probability of confidence region.

Asymptotic Pivot: It is pivot for diverging sample size.

Complete Chapter List

Search this Book:
Reset