Clustering and Ranked Search for Enterprise Content Management

Clustering and Ranked Search for Enterprise Content Management

Juris Rats, Gints Ernestsons
Copyright: © 2013 |Pages: 12
DOI: 10.4018/ijeei.2013100102
(Individual Articles)
No Current Special Offers


The aim of this work is to understand more closely where the border lies between relational and Not Only Structured Query Language (NoSQL) platform as concerns Enterprise Content Management (ECM) area. Another objective (closely related to the first one) is to specify the conceptual architecture of the distributed ECM system. The authors specify the model of the prototype ECM system and compare two platforms for this model – MS SQL based for the relational platform and Clusterpoint for the NoSQL platform. The results of performance measurements of SQL and NoSQL technologies for Enterprise Content Management specific tasks are presented and analyzed. The viability of NoSQL Document-oriented database solution based on clustering and ranked search is demonstrated. The ways to leverage the improved performance and scalability of the software to better serve the business needs of the Enterprise are discussed. The conceptual architecture of the prototype system is outlined.
Article Preview

Enterprise Content Management

Concept of Enterprise Content Management (ECM) has emerged during the last 20 years and is still evolving (see one of the most cited definitions in Blair, 2004). The term nowadays is much wider in scope than its origins. We will follow the more recent and comprehensive definition of Grahlmann, Lehnard and Slatter (2011, p. 5):

Enterprise content management comprises the strategies, processes, methods, systems, and technologies that are necessary for capturing, creating, managing, using, publishing, storing, preserving, and disposing content within and between organizations.

The most important enhancement of the latter is acknowledging the significance of the communication between the organizations. This is the important area for improvements because a lot of information that should be shared is duplicated between enterprises. It is an illusion that a correspondence saved in repositories of two organizations is safer in comparison with one copy in a decently protected shared storage. The shared solution has obvious advantages though – allowing for search in context of two enterprises (if the searcher has necessary authority).

ECM should be regarded as integrative middleware that uses internet technologies for in-house information provision (Kampffmeyer, 2007). It covers a wide area of functionality (Bantin, 2008; Nilsen, 2012; Korb & Strodl, 2010; Kampffmeyer, 2007) that surrounds five functional areas - Document Management, Collaboration of supporting systems (or group-ware), Web Content Management (this includes portals), Records Management (archive and file management systems) and Workflow and Business Process Management.

Complete Article List

Search this Journal:
Volume 14: 1 Issue (2024)
Volume 13: 1 Issue (2023)
Volume 12: 2 Issues (2022)
Volume 11: 2 Issues (2021)
Volume 10: 2 Issues (2020)
Volume 9: 2 Issues (2019)
Volume 8: 2 Issues (2018)
Volume 7: 2 Issues (2017)
Volume 6: 2 Issues (2016)
Volume 5: 2 Issues (2015)
Volume 4: 4 Issues (2013)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing