The Automated Generation and Further Application of Tree-Structure Outline for Lecture Videos with Synchronized Slides

Xiaoyin Che, Haojin Yang, Christoph Meinel

Source Title: International Journal of Technology and Educational Marketing (IJTEM) 4(1)

DOI: 10.4018/ijtem.2014010103

Abstract

In this paper, the authors describe their motivation and method for automatically generating a tree-structure outline for lecture videos with supplementary synchronized slides, and then propose a further application based on that outline: lecture video segmentation by slide-group-change events. Starting from OCR (Optical Character Recognition) results with an accuracy of approximately 90%, the authors reconstruct the text of each slide into a content tree of up to 3 levels, and then explore the logical relations between slides in order to arrange them hierarchically. After all redundancy is removed, a final outline of up to 6 levels is obtained. The slide hierarchy saved in the outline largely simplifies the subsequent segmentation process. The evaluation shows that the final outline generated from the test dataset retains only about 1/4 of the original text from all slides and is well organized, with an accuracy of 85% at the slide-title level. The majority of the resulting segments are logically reasonable, and their average length is about 5 to 15 minutes.

Solution Framework

Figure 1 depicts the framework of our solution. Preprocessing is the first part and consists of logo removal and text modification. Outline generation is the most important part: both intra-slide reconstruction and inter-slides analysis involve a number of detailed procedures, and an independent step after them produces the final outline. The segmentation process is comparatively simple, with only two steps: logical segmentation and default time segmentation.

Figure 1. Diagram of proposed solution framework

In general, our research faces two main challenges: analyzing the slide layout and the logic of the slides, and remaining robust to OCR accuracy problems. Every step has to take both challenges into consideration.

Preprocessing

The whole solution begins by preprocessing the raw data derived directly from the OCR results, in order to exclude potential interference from useless slide content and to correct recognition errors as far as possible.

Logo and Foot Line

Some lecture or presentation slides, especially those built on a university or company template, carry a logo. When present, the logo appears in the same position on almost every slide, commonly in a corner. Because of its size and location, the logo may be recognized as a major part of a slide, such as the title, during outline generation, which can drastically damage the real content structure of the slide.

To solve this problem, we employ a position-based detection scheme that finds the logo if one exists. Text-lines that share exactly the same position and have the same or very similar text content, but are located on different slides, are treated as logo candidates, and their appearance frequency is counted. When detection is finished, logo candidates whose appearance frequency exceeds a threshold determined by the total number of slides are removed permanently. To avoid removing logo-like content that is not a logo, for example a title shared by several consecutive slides, the detection scheme is applied only in the edge areas of the slides.

Besides the logo, other kinds of template-based slide content, such as the foot line, are also eliminated during this step. In practice they are perhaps not as harmful as the logo, but we definitely cannot benefit from them.
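
The position-based detection described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the `TextLine` fields, the 15% edge margin, the 0.8 text-similarity ratio (computed with Python's `difflib`), and the rule that a candidate must appear on at least half of the slides are all hypothetical choices, since the text does not give concrete values.

```python
from collections import defaultdict
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class TextLine:
    """One OCR text-line; the field names are assumptions for this sketch."""
    slide: int  # index of the slide the line was recognized on
    x: int      # left edge of the bounding box, in pixels
    y: int      # top edge of the bounding box, in pixels
    w: int      # box width
    h: int      # box height
    text: str


def in_edge_area(line, slide_w, slide_h, margin=0.15):
    """True if the box lies entirely inside the outer margin band (assumed 15%)."""
    mx, my = slide_w * margin, slide_h * margin
    return (line.x + line.w <= mx or line.x >= slide_w - mx
            or line.y + line.h <= my or line.y >= slide_h - my)


def similar(a, b, min_ratio=0.8):
    """Same or very similar text, tolerating a few OCR errors (assumed ratio)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= min_ratio


def remove_logos(lines, n_slides, slide_w, slide_h, min_share=0.5):
    """Drop edge-area lines that repeat at one position on many slides."""
    # Only edge-area lines can become logo candidates; group them by position.
    by_position = defaultdict(list)
    for line in lines:
        if in_edge_area(line, slide_w, slide_h):
            by_position[(line.x, line.y, line.w, line.h)].append(line)

    to_drop = set()
    for group in by_position.values():
        # Simplification: compare every line with the first line at this position.
        matches = [ln for ln in group if similar(ln.text, group[0].text)]
        slides_seen = {ln.slide for ln in matches}
        # The threshold scales with the total number of slides.
        if len(slides_seen) >= min_share * n_slides:
            to_drop.update(matches)
    return [ln for ln in lines if ln not in to_drop]
```

Because candidates are collected only from the margin band, a title repeated at the same position on consecutive slides is never considered, which mirrors the safeguard described above. Foot lines in the bottom margin would be removed by the same rule.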

OCR Error Modification

The accuracy of the OCR program we used is approximately 90%, and it would be irresponsible to offer the 10% of ill-recognized text to the e-learning portal users. In our research, each text-line is checked by splitting it into words. If the average word length is shorter than 2 characters, the text-line is discarded entirely.

Otherwise, a text-line can be shortened by eliminating ill-recognized words, which include runs of consecutive short words, words of abnormal length, and words containing too many symbols. In addition, a dictionary of frequently used short words and professional initials, such as ‘a’, ‘is’ or ‘OS’, keeps these meaningful short words from being deleted.
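
A minimal sketch of these filtering rules is given below. Only the average-length limit of 2 characters and the dictionary entries ‘a’, ‘is’ and ‘OS’ come from the text. The other dictionary entries, the definition of a short word (at most 2 characters), the maximum word length of 25, and the 50% symbol share are assumptions made for illustration, as are all function names.

```python
# 'a', 'is' and 'OS' are the examples given in the text; the rest are assumed.
KEEP_SHORT = {"a", "is", "os", "of", "in", "to"}


def is_short_noise(word, max_short=2):
    """A short word that the dictionary does not protect (assumed limit: 2)."""
    return len(word) <= max_short and word.lower() not in KEEP_SHORT


def is_bad_word(word, max_len=25, max_symbol_share=0.5):
    """Abnormally long, or made up mostly of non-alphanumeric symbols."""
    symbols = sum(1 for ch in word if not ch.isalnum())
    return len(word) > max_len or symbols / len(word) > max_symbol_share


def clean_text_line(line):
    """Return the cleaned text-line, or None if it should be discarded."""
    words = line.split()
    if not words:
        return None
    # Rule from the text: discard the line if the average word length is below 2.
    if sum(len(w) for w in words) / len(words) < 2:
        return None

    noise = [is_short_noise(w) for w in words]
    kept = []
    for i, word in enumerate(words):
        # A short word is dropped only when it sits next to another short word.
        in_run = noise[i] and (
            (i > 0 and noise[i - 1]) or (i + 1 < len(words) and noise[i + 1])
        )
        if in_run or is_bad_word(word):
            continue
        kept.append(word)
    return " ".join(kept) if kept else None
```

With these assumed thresholds, `clean_text_line("Memory rn lI management ##@!x")` returns `"Memory management"`, while `clean_text_line("Linux is a free OS")` is returned unchanged because its short words are in the dictionary. One known limitation of the symbol rule as sketched here is that legitimate tokens such as `C++` would also be removed.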

Tree-Structure Outline Generation

As mentioned in the Solution Framework section, the tree-structure outline generation process contains two major parts, intra-slide reconstruction and inter-slides analysis, followed by an independent step that generates the final outline. In this section, we explain the two major parts in detail.
