Algorithms for Approximate Bayesian Computation

Tom Burr, Alexei Skurikhin
DOI: 10.4018/978-1-4666-5888-2.ch149

Abstract

Computer models have many applications in business, medicine, engineering, and science. A typical model has known inputs such as a task schedule, unknown inputs such as the average time to complete a task, and outputs such as product production quality and rates. Our focus is stochastic computer models, which output different results each time the model is run with the same inputs. Such models are calibrated by comparing real data to model predictions. A calibrated computer model can provide a cost-effective option to examine “what-if” questions such as “what if we could reduce the average time to complete a specific task.” This chapter describes an attractive option to calibrate a stochastic computer model that relies not on the raw data but on summary statistics derived from the real data.

Introduction

Computer models can be broadly categorized as deterministic or stochastic. Deterministic models output the same predictions for the same inputs. Stochastic models output different predictions for the same inputs. Our focus is stochastic computer models (SM), and in particular, on a new option to calibrate SMs using approximate Bayesian computation (ABC).

An example used later in this chapter is a SM to model neuronal loss in a region of the human brain that is associated with Parkinson’s disease. Deletion mutations in the mitochondrial DNA (mtDNA) in that brain region are observed to accumulate with age. A deletion mutation converts a healthy copy of mtDNA to the mutant (unhealthy) variant. The number of mutant copies in cases with Parkinson’s disease tends to be higher than in controls without Parkinson’s disease. The role that mtDNA deletions play in neuronal loss is not yet fully understood, so better understanding of how mtDNA deletions accumulate is an area of active research. Henderson et al. (2009) use a simple SM that allows for any of five reactions, occurring at rates to be estimated. The five reactions are mutation, synthesis, degradation, mutant synthesis, and mutant degradation.
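A stochastic model of this kind can be simulated with a Gillespie-style algorithm. The sketch below is illustrative only: the rate constants, initial copy numbers, and time horizon are invented for the example and are not taken from Henderson et al. (2009); only the five reaction types come from the text above.

```python
import random

random.seed(1)

# Hedged sketch: a Gillespie-style simulation of the five reactions acting on
# healthy (h) and mutant (m) mtDNA copy numbers. All numeric values below are
# illustrative assumptions, not estimates from Henderson et al. (2009).
def simulate(h0=1000, m0=0, t_end=10.0,
             r_mut=0.001, r_syn=0.02, r_deg=0.02,
             r_msyn=0.02, r_mdeg=0.02):
    h, m, t = h0, m0, 0.0
    while t < t_end:
        # Propensities of: mutation, synthesis, degradation,
        # mutant synthesis, mutant degradation.
        props = [r_mut * h, r_syn * h, r_deg * h, r_msyn * m, r_mdeg * m]
        total = sum(props)
        if total == 0:
            break                          # no reaction can fire
        t += random.expovariate(total)     # waiting time to the next reaction
        u = random.uniform(0, total)       # choose which reaction fires
        if u < props[0]:
            h -= 1; m += 1                 # mutation: healthy -> mutant
        elif u < props[0] + props[1]:
            h += 1                         # synthesis of a healthy copy
        elif u < props[0] + props[1] + props[2]:
            h -= 1                         # degradation of a healthy copy
        elif u < props[0] + props[1] + props[2] + props[3]:
            m += 1                         # mutant synthesis
        else:
            m -= 1                         # mutant degradation
    return h, m

h, m = simulate()
```

Because the reaction times and choices are random, repeated runs with the same inputs give different copy-number trajectories, which is exactly what makes this a stochastic computer model.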

Approximate Bayesian computation (ABC) is an approach for using data to calibrate a SM and is especially useful when the likelihood for the data is unknown or intractable. ABC requires a set of summary statistics computed from the real data. The same set of summary statistics is then computed from the SM output for each of many candidate model parameter values. In a parameter acceptance/rejection loop, the candidate SM parameter values that are accepted provide an approximation to the posterior distribution of model parameters given the summary statistics computed from the real data. In a nutshell, ABC favors model parameters for which simulated summary statistics roughly agree with the corresponding summary statistics computed from the observed data. Because ABC relies on user-chosen summary statistics rather than on the full data, it remains computationally feasible even when likelihood-based inference is not; ABC is therefore appealing when the data dimension and/or parameter dimension is large.
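The acceptance/rejection loop described above can be sketched in a few lines. The example below uses an invented toy setup (not from the chapter): the "real" data are draws from a Normal(theta, 1) model, the stochastic model is simply sampling from that distribution, and the summary statistics, distance measure, and threshold are the three user choices the chapter discusses.

```python
import random
import statistics

random.seed(0)

# Toy setup (illustrative assumption): "real" data from Normal(theta=2, sd=1).
observed = [random.gauss(2.0, 1.0) for _ in range(100)]

def summary(xs):
    # User choice (1): summary statistics -- here, sample mean and std. dev.
    return (statistics.mean(xs), statistics.stdev(xs))

def distance(a, b):
    # User choice (2): distance measure -- Euclidean distance between
    # the two summary-statistic vectors.
    return sum((u - v) ** 2 for u, v in zip(a, b)) ** 0.5

s_obs = summary(observed)
epsilon = 0.2      # user choice (3): the acceptance threshold
accepted = []

for _ in range(20000):
    theta = random.uniform(-5.0, 5.0)                 # candidate from the prior
    simulated = [random.gauss(theta, 1.0) for _ in range(100)]  # run the SM
    if distance(summary(simulated), s_obs) < epsilon:
        accepted.append(theta)   # keep parameters whose summaries match

# `accepted` approximates the posterior of theta given the summary statistics.
```

Shrinking `epsilon` sharpens the approximation but accepts fewer candidates, so in practice the threshold trades off accuracy against computational cost.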

This article describes applications of ABC and illustrates the challenges with ABC related to the quality of the approximation to the posterior distribution of model parameters. The challenges involve the fact that the user must choose (1) summary statistics, (2) a distance measure to calculate the distance between summary statistics in the real data and in the model-simulated data, and (3) the acceptance threshold used to accept or reject candidate parameter values in the acceptance/rejection sampling loop.

For a model with parameters $\theta$ and data $D$, a key quantity in Bayesian inference is the posterior distribution of the model parameters, given by Bayes theorem as

$$p(\theta \mid D) = \frac{p(D \mid \theta)\,p(\theta)}{p(D)},$$

where $p(\theta)$ is the probability distribution for $\theta$ before observing data $D$, $p(D \mid \theta)$ is the likelihood, and $p(D) = \int p(D \mid \theta)\,p(\theta)\,d\theta$ is the marginal probability of the data, which normalizes the posterior $p(\theta \mid D)$ to integrate to 1 (Aitken, 2010). The likelihood $p(D \mid \theta)$ can be regarded as the “data model” for a given value of $\theta$. Alternatively, when the data $D$ are considered fixed, $p(D \mid \theta)$ is regarded as a function of $\theta$, and non-Bayesian methods such as maximum likelihood find the value of $\theta$ that maximizes $p(D \mid \theta)$ (Aitken, 2010).
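A small numeric illustration (invented for this sketch, not from the chapter) makes the normalization concrete: with a discrete prior over three candidate values of $\theta$ and a binomial likelihood, the marginal probability of the data is just the sum of prior-weighted likelihoods, and dividing by it makes the posterior sum to 1.

```python
from math import comb

# Illustrative assumption: theta is the success probability of a Bernoulli
# trial, restricted to three candidate values with a uniform prior.
prior = {0.2: 1 / 3, 0.5: 1 / 3, 0.8: 1 / 3}

def likelihood(theta, k=7, n=10):
    # Binomial likelihood p(D | theta) for observing k successes in n trials.
    return comb(n, k) * theta**k * (1 - theta) ** (n - k)

unnorm = {t: likelihood(t) * p for t, p in prior.items()}   # p(D|theta) p(theta)
p_D = sum(unnorm.values())                                  # marginal p(D)
posterior = {t: u / p_D for t, u in unnorm.items()}         # p(theta | D)
```

Here 7 successes in 10 trials shift the posterior mass toward the largest candidate value, 0.8, exactly as maximizing the likelihood over the three candidates would suggest.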

Key Terms in this Chapter

Approximate Bayesian Computation (ABC): A method that compares summary statistics generated from real data to the same summary statistics generated from the model in order to provide estimates and uncertainty for model parameters.

Partial Posterior Distribution: In Bayesian statistics, a key quantity to estimate is the posterior probability density $p(\theta \mid D)$. In ABC, the partial posterior distribution $p(\theta \mid S(D))$, conditioned on summary statistics $S(D)$ rather than on the full data, replaces $p(\theta \mid D)$.

Bayesian Calibration of a Computer Model: A method to choose effective model inputs that regards uncertain model inputs as unknown parameters to be estimated from data.

Stochastic Computer Model: A model with known inputs such as a task schedule, unknown inputs such as the average time to complete a task, and stochastic (random) outputs that differ each time the model is run with the same (or different) inputs.

Distance Measure: A measure of closeness (or distance) between two vectors.

Bayes Theorem: A simple theorem that converts one probability distribution (the distribution of the data given the model parameters) to another (the distribution of the model parameters given the data): $p(\theta \mid D) = p(D \mid \theta)\,p(\theta)/p(D)$.

Parameter Acceptance Loop: ABC requires repeated evaluation of the model at many trial input values. During the simulation loop, the trial input values whose simulated summary statistics lie close enough to the observed summary statistics are accepted.

Summary Statistics: In the context of ABC, the simulated model outputs are reduced to summary statistics such as means and variances. The vector of summary statistics computed from the real data is compared to the vector of summary statistics computed from the simulated data to evaluate candidate model parameter values.
