Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Image Analysis for Historical Japanese Book Archives

Chulapong Panichkriangkrai, Liang Li, Ross Walker, Kozaburo Hachimura

Source Title: International Journal of Asian Business and Information Management (IJABIM) 5(2)

DOI: 10.4018/ijabim.2014040101

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

This paper describes methods of image analysis for historical Japanese book archives with a dominant focus on character segmentation. The segmentation methodology includes stain and smear removal, binarization, character line extraction, and character extraction by region labeling with integration and separation techniques. The experimental results show that the proposed method can segment all text lines correctly and can extract more than 79% of the characters from 16 pages of Chinsetsu Yumiharizuki, containing 176 text lines and a total of 5181 quite complicated characters.

Article Preview

Top

Introduction

Paper free, easy access and distribution, mobile availability, and interactive multimedia functions make digital books a rapidly growing market. With internet distribution, the digital book market is already a global business. On the other hand, nowadays, there is increasing research on the strategies of developing and introducing national cultural heritages worldwide (Leong, 2010). A large number of historical books were digitized and made available not only to researchers but to the public as digital media in many digital archives (Art Research Center, Ritsumeikan University, 2010; National Institute of Japanese Literature, 2013). However, only limited items have been transcribed into text based digital books, whereas most of the rest were collected as digital images without text information. It is a challenging work to develop learning/reading support system that analyses document images to discriminate figures, titles, and text area from digital archives. This article focused on these topics. Although our method and results are still in a preliminary stage, the approach will be significant when expanding the target of digital book business so as to include historical books in the future.

The analyses of digital archived historical book images in this paper mainly focuses on page segmentation and identification. For example, text region extraction means identifying the text part of a page image, while text-line extraction means identifying the text-line from the text region. Furthermore, character extraction refers to segmenting each character from the text-line. In this paper we propose techniques for both text-line and character segmentation.

The historical Japanese books that are focused on in this paper are books printed in the Edo period (1603-1867). Figure 1 shows examples of Japanese woodblock printed books from the Edo period. The Edo period was a period of calm that provided an ideal environment for developing commercial art. During that period, while Europeans used moveable type printing processes, the Japanese developed and used a woodblock printing process. This process uses wooden blocks, upon which are engraved reversed images of both the text and illustrations, as relief printing. For printing, two consecutive pages were carved on one side of the woodblock. During the Edo period, Japan published over 110,000 titles of books with more than 10 million copies in the markets (Hioki, 2009).

Figure 1.

Example of Japanese woodblock printed historical books

Currently, a large number of the books published in the Edo period have been scanned and made available to the public as digital images. However, experts have transcribed only a small number of book titles printed in the Edo period into modern book productions. Furthermore, only a small number of people can recognize and read the characters used in Edo period books. Old style characters and running scripts are different to modern ones. Characters of this type of historical book are difficult to segment because they have ligature-like characters that join two or more characters.

In this paper, we propose a character segmentation system for character shape comparison, character image retrieval, and to make a statistical analysis of the usage of characters in single or multiple books. The proposed concept can be applied for other Asian historical digital archives to offer cultural and social support.

Complete Article List

Search this Journal:

Reset

Volume 15: 1 Issue (2024)

Volume 14: 1 Issue (2023)

Volume 13: 2 Issues (2022)

Volume 12: 4 Issues (2021)

Volume 11: 4 Issues (2020)

Volume 10: 4 Issues (2019)

Volume 9: 4 Issues (2018)

Volume 8: 4 Issues (2017)

Volume 7: 4 Issues (2016)

Volume 6: 4 Issues (2015)

Volume 5: 4 Issues (2014)

Volume 4: 4 Issues (2013)

Volume 3: 4 Issues (2012)

Volume 2: 4 Issues (2011)

Volume 1: 4 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Image Analysis for Historical Japanese Book Archives

Abstract

Introduction

Complete Article List