Using Query Expansion Techniques and Content-Based Filtering for Personalizing Analysis in Big Data

Using Query Expansion Techniques and Content-Based Filtering for Personalizing Analysis in Big Data

Sadek Menaceur, Makhlouf Derdour, Abdelkrim Bouramoul
DOI: 10.4018/IJITWE.2020040104
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

The recent debates on personalizing analyses in a Big Data context are one of the most solicited challenges for business intelligence (BI) administrators. The high-volume, the high-variety, and the high-velocity of Big Data have produced difficulty in storing, processing, and analyzing data in traditional systems. These 3Vs (volume, velocity, and variety) created many new challenges and make them difficult to extract the specific needs of the users. In addition, the user may be faced with the problem of disorientation; he does not know what information really corresponds to his needs. The information personalization systems aim to overcome these problems of disorientation by using a user profile. The effectiveness of the personalization system in a Big Data context is to demonstrate by the relevance and accuracy of the content of the results obtained, according to the needs of the user and the context of the research. Nevertheless, most of the recent research focused on the relational data warehouse personalizing and ignored the integration of the user context into the analysis of OLAP cubes, which is the first concerned to execute the user's multidimensional queries. To deal with this, the authors propose in this article a dynamic personalizing approach in Big Data context using OLAP cubes, based on the Content-Based Filtering, and the Query Expansion techniques. The first step in the proposal consists of processing the user queries by an enrichment technique in order to integrate the user profile and his searching context to reduce the searching space in the OLAP cube, and use the expansion technique to extend the scope of the analysis in the OLAP cube. The retrieved results are: “as relevant as possible” compared to the user's initial request. Afterward, they use information filtering techniques such as content-based filtering to personalize the analysis in the reduced data cube according to the term frequency and cosine similarity. Finally, they present a case study and experiences results to evaluate and validate their approach.

Complete Article List

Search this Journal:
Reset
Volume 19: 1 Issue (2024)
Volume 18: 1 Issue (2023)
Volume 17: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 16: 4 Issues (2021)
Volume 15: 4 Issues (2020)
Volume 14: 4 Issues (2019)
Volume 13: 4 Issues (2018)
Volume 12: 4 Issues (2017)
Volume 11: 4 Issues (2016)
Volume 10: 4 Issues (2015)
Volume 9: 4 Issues (2014)
Volume 8: 4 Issues (2013)
Volume 7: 4 Issues (2012)
Volume 6: 4 Issues (2011)
Volume 5: 4 Issues (2010)
Volume 4: 4 Issues (2009)
Volume 3: 4 Issues (2008)
Volume 2: 4 Issues (2007)
Volume 1: 4 Issues (2006)
View Complete Journal Contents Listing