Weighted Indication-Based Similar Drug Sensing

Weighted Indication-Based Similar Drug Sensing

Guangli Zhu (Anhui University of Science and Technology, Huainan, China), Congna He (Anhui University of Science and Technology, Huainan, China), Zhang Shunxiang (Anhui University of Science and Technology, Huainan, China), Yanyong Du (Anhui University of Science and Technology, Huainan, China) and Zheng Xu (The Third Research Institute of Ministry of Public Security, Shanghai, China)
DOI: 10.4018/IJSSCI.2015010105


Rapidly finding some drugs with the same or similar treatment effect has great application value, it can be convenient to find alternative medicines for users such as medical staffs, drugs salesman and patients. To solve this problem, this paper presents a Drug Similarity Computation (DSC) algorithm which is a new technology about drug data sensing. Firstly, combining the user dictionary and waste word lib, the authors effectively extract drug terms from the indication text of drugs by the technology of Chinese word segmentation. Secondly, weighted indication knowledge database is built based on some terms of the similar indications to facilitate computing the similarity among drugs. Thirdly, according to the given input, a drug sub-set with similar treatment effect is gotten by searching weighted indication knowledge database. The presented algorithm sorts all records in this drug sub-set and does some filtering, and recommends some reasonable drugs according to the user's requirements. Experimental analysis shows that the presented algorithm is valid.
Article Preview

1. Introduction

With the rapid development of Internet technology, various forms of drug data are also developed rapidly on the Internet. Vast amounts of drug data stored in their specific organization way, users find the resource in the large amounts of drug data according to their requirements. The occurrence of the search engine greatly reduce the difficulty in finding information for users. But usually, it doesn't make users get the most satisfactory retrieval results. For example, when a drug is lack, users input indication terms query by using the generic search tool, the number of query results are a lot, so users hardly select the most appropriate drug(s) through simple judgment at this time. Accordingly, non-appropriate selected drug(s) not only has no good treatment effect, but also lead to more serious consequences and negative effects. Therefore, how to rapidly and accurately find a alternative drug has extensive application prospect in the field of drug information retrieval.

To solve this problem, this paper presents a Drug Similarity Computation (DSC) algorithm based on weighted indication. Our contributions mainly includes three parts. First, we should find a good data structure website to collected drug data resources, then drug data are collected and organized to establish the user dictionary and drug lib based on Chinese word segmentation technology (Abudoulikemu, 2010; Wu, 2011; Zhang, 2014; Ni, 2014). It is important that how to determine keywords of drug indications. Wang (Wang, 2011; Wang, et. al. 2011) puts forward synergy of cognitive informatics, which is helpful for us to extract this kind of indication terms. Second, some weights are assigned to the drug indication terms and establish the weighted indication knowledge database. Also it can prepare for calculation of drug similarity. Finally, the drug similarity is computed to get a drug sub-set with similar treatment effect. When users enter a drug name, the presented algorithm will sorts all records in this drug sub-set and does some filtering, and recommends some reasonable drugs. So users can choose the best alternative drug(s) by similarity computation among some drugs.

The primary task of the drug data sensing is extracting medical terms from the indication texts of drugs, inference as the basic mechanism of thought is abilities gifted to human beings according to Inference algebra (IA) (Wang, 2012), and IA are explored in three categories: a) logical inferences; b) analytic inferences; and c) hybrid inferences. The extracting process of medical terms can utilize Chinese word segmentation technology and our prior window-split idea (Zhang, 2014). In addition, drug data sensing must select a similarity calculation method. Concept algebra (CA) is a denotational mathematical structure for formal knowledge representation and manipulation in machine learning and cognitive computing. CA provides a rigorous knowledge modeling and processing tool (Wang, et. al. 2011). Currently, similarity calculation method can be roughly divided into two kinds: one is statistics with large-scale corpus. This method is based on the probability distribution of vocabulary context information to calculate. Another is usually based on the hierarchy relation of complete semantic dictionary, such as Liu (Liu, 2002) etc. He is put forward similarity calculation based on “Hownet” word (Guan, 2002; Li, 2012). The method based on semantic dictionary is simple and effective, more intuitive, users can quickly complete the calculation by building the related database, so the method based on dictionary is also the main method.

Complete Article List

Search this Journal:
Open Access Articles: Forthcoming
Volume 13: 4 Issues (2021): Forthcoming, Available for Pre-Order
Volume 12: 4 Issues (2020): 2 Released, 2 Forthcoming
Volume 11: 4 Issues (2019)
Volume 10: 4 Issues (2018)
Volume 9: 4 Issues (2017)
Volume 8: 4 Issues (2016)
Volume 7: 4 Issues (2015)
Volume 6: 4 Issues (2014)
Volume 5: 4 Issues (2013)
Volume 4: 4 Issues (2012)
Volume 3: 4 Issues (2011)
Volume 2: 4 Issues (2010)
Volume 1: 4 Issues (2009)
View Complete Journal Contents Listing