InfoScipedia: A Free Service of IGI Global Publishing House

What Are Token-based Similarity Metrics?

Handbook of Research on Digital Libraries: Design, Development, and Impact
Similarity metrics based on statistics of the words (tokens) that two strings have in common; they are useful when word order is not important.
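To make the definition concrete, here is a minimal sketch of one common token-based metric, Jaccard similarity over word sets. It is an illustration only: the source does not say which token-based metrics the chapter uses, and the function names `tokenize` and `jaccard_similarity` are hypothetical.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard_similarity(a, b):
    """Share of distinct tokens that the two strings have in common."""
    tokens_a, tokens_b = tokenize(a), tokenize(b)
    if not tokens_a and not tokens_b:
        return 1.0  # assumption: two empty strings count as identical
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


# Word order does not affect the score: only the shared tokens matter.
print(jaccard_similarity("Journal of Information Science",
                         "Information Science Journal"))  # 0.75
```

Because the strings are compared as sets, reordering the words leaves the score unchanged; the 0.75 above comes only from the extra token "of" in the first title.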
Published in Chapter:
Duplicate Journal Title Detection in References
Ana Kovacevic (University of Belgrade, Serbia) and Vladan Devedzic (University of Belgrade, Serbia)
DOI: 10.4018/978-1-59904-879-6.ch023
Abstract
Our research applies text mining techniques to help librarians make better-informed decisions when selecting learning resources for the library’s offer. The proper selection of these resources is one of the key factors determining the overall usefulness of the library. Our task was to match abbreviated journal titles from citations with journals in existing digital libraries. The main problem is that a single journal often appears under a number of different abbreviated forms in the citation report; hence the matching depends on detecting duplicate records. We used character-based and token-based metrics together with a generated thesaurus to detect duplicate records.
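The approach described in the abstract can be pictured roughly as follows. This is a sketch under stated assumptions, not the authors' implementation: the thesaurus entries, the 0.6 threshold, the averaging of the two scores, and all function names are hypothetical, and Python's `difflib.SequenceMatcher` stands in for whichever character-based metric the chapter actually uses.

```python
import re
from difflib import SequenceMatcher

# Hypothetical thesaurus: abbreviation -> full word (illustrative entries only).
THESAURUS = {"j": "journal", "inf": "information", "sci": "science"}


def normalize(title):
    """Tokenize a title and expand known abbreviations via the thesaurus."""
    tokens = re.findall(r"[a-z0-9]+", title.lower())
    return [THESAURUS.get(token, token) for token in tokens]


def token_score(a, b):
    """Token-based metric: Jaccard similarity of the two token sets."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)


def char_score(a, b):
    """Character-based metric: SequenceMatcher ratio of the joined tokens."""
    return SequenceMatcher(None, " ".join(a), " ".join(b)).ratio()


def best_match(abbreviated, catalog, threshold=0.6):
    """Return (title, score) for the best catalog match, or (None, score)."""
    query = normalize(abbreviated)
    best_title, best_score = None, 0.0
    for title in catalog:
        candidate = normalize(title)
        # Assumption: combine the two metrics with a simple average.
        score = (token_score(query, candidate) + char_score(query, candidate)) / 2
        if score > best_score:
            best_title, best_score = title, score
    if best_score < threshold:
        return None, best_score
    return best_title, best_score


catalog = ["Journal of Information Science", "Journal of Chemical Physics"]
print(best_match("J. Inf. Sci.", catalog))
```

With these sample entries, "J. Inf. Sci." expands to "journal information science" and matches the first catalog title; an abbreviation with no thesaurus entry and little character overlap falls below the threshold and returns `None`.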