Looking Back to 1850 in 2025: Historascan to Digitize Historical Journals

Looking Back to 1850 in 2025: Historascan to Digitize Historical Journals

Bruno Frutuoso Costa (ISCTE, University Institute of Lisbon, Portugal), Bruno Contreiras Mateus (IADE, European University of Lisbon, Portugal & ISCTE, University Institute of Lisbon, Portugal), Hugo José Pinto (Inovaworks Command and Control S.A., Portugal), and Mohammad Reza Tabrizi (Inovaworks Command and Control S.A., Portugal)
Copyright: © 2025 |Pages: 28
DOI: 10.4018/979-8-3693-3579-6.ch015
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

This chapter analyses current technologies and the challenges involved in extracting and classifying articles and news headlines from historical journals, as well as converting images to text format. The work to develop a tool focused on digitising historical journals was carried out by a multidisciplinary team of experts in media studies, artificial intelligence, image processing, and cultural heritage preservation. The data used derives from two historic Portuguese journals, Diário de Notícias and Jornal de Notícias, which were created in the mid-19th century. This project is based on a mixture of heuristics, computer vision, pattern recognition, and other artificial intelligence and machine learning techniques. The main challenges included the variability in the design of historical journals, preserving the quality of images over time, and continuously improving image processing and OCR techniques to adapt to different styles and periods of newspapers.
Chapter Preview

Complete Chapter List

Search this Book:
Reset