Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Statistical Machine Learning Approaches for Sports Video Mining Using Hidden Markov Models

Guoliang Fan, Yi Ding

Source Title: Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques

DOI: 10.4018/978-1-60566-766-9.ch022

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

This chapter summarizes the authors’ recent research on the hidden Markov model (HMM)-based machine learning approaches to sports video mining. They will advocate the concept of semantic space that provides explicit semantic modeling at three levels, high-level semantics, mid-level semantic structures, and low-level visual features. Sports video mining is formulated as two related statistical inference problems. One is from low-level features to mid-level semantic structures, and the other is from midlevel semantic structures to high-level semantics. The authors assume that a sport video is composed of a series of consecutive play shots each of which contains variable-length frames and can be labelled with certain mid-level semantic structures. In this chapter, the authors present several HMM–based approaches to the first inference problem where the hidden states are directly associated with mid-level semantic structures at the shot level and observations are the visual features extracted from frames in a shot. Specifically, they will address three technical issues about HMMs: (1) how to enhance the observation model to handle variable-length frame-wise observations; (2) how to capture the interaction between multiple semantic structures in order to improve the overall mining performance; (3) how to optimize the model structure and to learn model parameters simultaneously. This work is the first step toward the authors’ long-term goal that is to develop a general sports video mining framework with explicit semantic modeling and direct semantic computing.

Chapter Preview

Top

Introduction

Video mining is to discover knowledge, structures, patterns and events of interest in the video data, and its benefits range from efficient browsing and summarization of video content to facilitating video access and retrieval in a large database or online multimedia repository. There are various types of video data. According to different production and edition styles, videos can be classified into two main categories: scripted and non-scripted videos (Xiong, Zhou, Tian, Rui, & Huang, 2006). Scripted videos are produced according to a certain script or plan that are later edited, compiled, and distributed to users for consumption. News and movies are the examples of scripted videos. Most research efforts on scripted videos focus on the development of a Table-of-Content (TOC) that provides an overview of the video’s naturally organized content to support efficient browsing and indexing. On the other hand, the events in non-scripted videos happen spontaneously and usually in a relatively fixed setting, such as meeting videos, sports videos, and surveillance videos. The research on non-scripted videos involves detecting high-lights and events-of-interest. In this chapter sports video mining is studied using American football as the case study.

Although sports videos are non-scripted content, there are still definite or repetitive structures and patterns. Using these structures and patterns, we can develop some flexible and effective tools for video browsing/indexing. Currently, there are two kinds of approaches for sports video mining, structure-based (Kokaram, et al., 2006; Xie, Chang, Divakaran, & Sun, 2003) and event-based (Assfalg, Bertini, Colombo, Bimbo, & Nunziati, 2003; T. Wang, et al., 2006). The former uses either supervised or unsupervised learning to recognize some basic semantic structures (such as canonical view in a baseball game or play/break in a soccer game) that can serve as an intermediate representation supporting semantics-oriented video retrieval, but usually cannot deliver high-level semantics directly. The latter one provides a better understanding of the video content by detecting and extracting the events-of-interest or the highlights, which could be very specific and task-dependent and requires sufficient and diverse training examples. Because these two approaches are complementary in nature, researchers have investigated how to integrate both of them in one computational framework. For example, a mid-level representation framework was proposed for semantic sports video analysis involving both temporal structures and events hierarchy (L. Y. Duan, Xu, Chua, Q. Tian, & Xu, 2003) and a mosaic-based generic scene representation was developed from video shots and used to mine both events and structures (Mei, Ma, Zhou, Ma, & Zhang, 2005).

Machine learning is one of the most feasible approaches for semantic video analysis (Kokaram, et al., 2006). Hidden Markov models (HMMs) have been the most popular tool in this area (Xie, Chang, Divakaran, & Sun, 2003). Our goal is try to establish a HMM-based machine learning framework that supports both structure analysis and event analysis and delivers the rudimentary building blocks for high-level semantic analysis. There are two major distinctions in our research. One is the concept of semantic space which supports explicit semantic modeling and specifies both mid-level and high-level semantic structures as well as relevant visual features, so that the task is formulated as two inference problems. The other is a new HMM that offers multi-faceted advantages for semantic video analysis.

Key Terms in this Chapter

Video Mining: the process of discovering knowledge, structures, patterns and events of interest in the video data.

Sports Video Analysis: a process to discover the meaningful structure or events of interest in the sports video.

Model Learning: given the data, to learn a probabilistic model with optimized structures and parameters.

Hidden Markov Models: a kind of statistical model in which the system being modeled is assumed to be a Markov process with unknown parameters.

Semantic Structures: an organization or a pattern that represents specific meaning.

EM Algorithm: a statistical estimation algorithm that can find maximum likelihood estimates of parameters in probabilistic models, where the model depends on unobserved latent variables.

Segmental HMMs: A special type of HMM with segmental observation model that can handle variable length observations.

Entropic Prior: a prior model that minimizes a type of mutual information between the data and the parameters.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Statistical Machine Learning Approaches for Sports Video Mining Using Hidden Markov Models

Abstract

Introduction

Key Terms in this Chapter

Complete Chapter List