VuFind: Solr Power in the Library

VuFind: Solr Power in the Library

Demian Katz (Villanova University, USA) and Andrew Nagy (Serials Solutions, USA)
DOI: 10.4018/978-1-4666-1912-8.ch004
OnDemand PDF Download:


Apache Solr, an open source Java-based search engine, forms the core of many Library 2.0 products. The use of an index in place of a relational database allows faster data retrieval along with key features like faceting and similarity analysis that are not practical in the previous generation of library software. The popular VuFind discovery tool was built to provide a library-friendly front-end for Solr’s powerful searching capabilities, and its development provides an informative case study on the use of Solr in a library setting. VuFind is just one of many library packages using Solr, and examples like Blacklight, Summon, and the eXtensible Catalog project show other possible approaches to its use.
Chapter Preview

Indexes Vs. Databases

At the core of your typical Integrated Library System, you are likely to find a relational database. Relational databases are extremely useful tools, which efficiently store and retrieve information by breaking data down into tables of granular data and keeping track of how these tables relate to one another. With such a database, it is quite simple to model many of the relationships that can be found in libraries. As a simplistic example, one database table might represent books, other authors, and other patrons. Additional tables could then track relationships—which authors wrote which titles, or which patrons are currently borrowing which books. The beauty of the relational model is that each key piece of data (book, author, patron) is stored only once. When you need to look up information, you join these pieces together using the known relationships in order to get the answers you seek. If you need to change a piece of data, you edit it in just one place, and thanks to all of the relationships, the update automatically affects every context in which the data might be accessed.

Although relational databases remain an important part of the library software landscape, they do have one significant weakness: they are not ideal for search applications, particularly the sort of search engines that users of the Web have become accustomed to—very fast and very accurate. While database systems usually have built-in indexing and can do certain types of well-defined searches very quickly, their performance suffers when it comes to complex, multi-field, or full-text searching. Their data model, with everything spread out across multiple tables, becomes a disadvantage when you want to search through everything quickly—data needs to be reconstituted before it can be searched, and the result is slower performance.

The answer to this problem is an index. An index turns the whole thing upside-down. Rather than concerning itself with efficient storage of unique values, an index instead focuses on fast retrieval of information stored in a heavily pre-processed format. Data is stored redundantly to ease fast lookup, and text from indexed records and user search queries may be analyzed in a variety of ways to increase the probability of finding matches. Obviously, an index is likely to use more memory and disk space than an equivalent relational database, but the benefits include faster, more flexible lookup and powerful results analysis (such as faceting) which are impractical using a relational database.

Complete Chapter List

Search this Book:
Table of Contents
Jesús Tramullas
Chapter 1
Filipe Manuel dos Santos Bento, Lídia de Jesus Oliveira L. da Silva
This chapter exposes some core concepts for an innovative bibliographic information search system model, where not only the document is the point of... Sample PDF
Going Beyond the Bibliographic Catalog: The Basis for a New Participatory Scientific Information Discovery and Sharing Model
Chapter 2
Jesse Prabawa Gozali, Min-Yen Kan
The authors redesign the user interface of an online library catalog, leveraging current Web technologies that allow dynamic and fine-grained user... Sample PDF
Rich and Dynamic Library Catalogs: A Case Study of Online Search Interfaces
Chapter 3
Myung-Ja Han, Christine Cho
Libraries have been trying hard to produce and aggregate resources in both print and digital formats, which need to be integrated into the library... Sample PDF
XML in Library Cataloging Workflows: Working with Diverse Sources and Metadata Standards
Chapter 4
Demian Katz, Andrew Nagy
Apache Solr, an open source Java-based search engine, forms the core of many Library 2.0 products. The use of an index in place of a relational... Sample PDF
VuFind: Solr Power in the Library
Chapter 5
Birong Ho
With new Web tools, information can be released to flow in every direction (library to user, user to library, library to library, and user to user).... Sample PDF
Does VuFind Meet the Needs of Web 2.0 Users?: A Year After
Chapter 6
Elsa Barber, Silvia Pisano, Sandra Romagnoli, Verónica Parsiale, Gabriela de Pedro, Carolina Gregui, Nancy Blanco
This chapter studies user interfaces of Web Online Public Access Catalogs (OPACs) and presents their principal difficulties in facing the... Sample PDF
Quantitative Approach Applied to User Interface of Latin American Web OPACs
Chapter 7
Jesús Tramullas, Piedad Garrido
The use of OPACs has been one of the problems that users of libraries have faced. The application of Web 2.0 techniques in OPACs has favored the... Sample PDF
Intelligent Personal Agents in Library 2.0 Environments: An Assistant Prototype
Chapter 8
Jorge Serrano-Cobos, Alicia Sellés, Nuria Lloret
Therefore, the authors study some useful clues to understand the need to integrate these tools beyond visual consistency by offering each other the... Sample PDF
Library Analytics on the Web 2.0 Era: Technology Integration Needs and Indicators to Monitor “User Awareness” with Web Analytics Techniques
Chapter 9
Elena Corradini, Mario Pérez-Montoro
In this chapter, the authors review current literature on personal information management to analyze how it is understood with the aim of reviewing... Sample PDF
Library 2.0 and Personal Information Management: A Way Forward using Social Networks
Chapter 10
Lara Skelly, Jen Eidelman, Peter Underwood
In response to two different pedagogical challenges, namely delivering varied library instruction to a big first year class and delivering a focused... Sample PDF
Web 2.0 Technology as a Teaching Tool
About the Contributors