Design and Implementation of a Web Editing and Publishing System Based on a Semantic Network Generation Algorithm

Jing Wang
Copyright: © 2022 | Pages: 11
DOI: 10.4018/IJDST.308001

Abstract

To address data mining for web editing effectively, this paper proposes a semantic network generation algorithm. First, after the variant short texts are preprocessed, a dictionary is used to expand the semantics of Chinese words, and the maximum matching distance between short texts is calculated and used as an index of the formal distance between them. Finally, a weighted method combines the formal distance and the unit semantic distance into a text distance, which is applied to the clustering analysis of online comments. The length of the word list is used to penalize the distance. The results show that the most popular query topics on the Internet are shopping (10%), entertainment (10%), pornography (12%), computer (9%), research (9%), healthy life (5%), travel (5%), games (5%), family medical (5%), sports (3%), personal economic plan (3%), holiday (1%), and others. The results indicate that the improved algorithm proposed in this paper outperforms other methods and significantly improves clustering performance.
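The abstract does not give the formulas, so the following is only a minimal Python sketch of how the weighted combination might look, not the author's implementation. Everything in it is an assumption made for illustration: the function names (`expand`, `formal_distance`, `text_distance`), the synonym-dictionary format, the greedy matching used in place of an exact maximum matching, the length penalty (dividing by the longer word list), and the weight `alpha`.

```python
from typing import Dict, List, Set


def expand(word: str, synonyms: Dict[str, Set[str]]) -> Set[str]:
    """Expand a word with its dictionary synonyms (hypothetical dictionary format)."""
    return {word} | synonyms.get(word, set())


def formal_distance(a: List[str], b: List[str],
                    synonyms: Dict[str, Set[str]]) -> float:
    """Assumed formal distance: 1 - matched words / length of the longer word list.

    Words are matched greedily when their expanded sets overlap. Dividing by
    the longer list is one possible way to penalize by word-list length.
    """
    if not a and not b:
        return 0.0
    unmatched = list(b)
    matches = 0
    for word in a:
        expanded = expand(word, synonyms)
        for i, candidate in enumerate(unmatched):
            if expanded & expand(candidate, synonyms):
                matches += 1
                del unmatched[i]  # each word in b may be matched only once
                break
    return 1.0 - matches / max(len(a), len(b))


def text_distance(formal: float, semantic: float, alpha: float = 0.5) -> float:
    """Weighted combination of formal and unit semantic distance (alpha is assumed)."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * formal + (1.0 - alpha) * semantic


# Illustrative, made-up inputs: two tokenized short texts and a tiny dictionary.
toy_synonyms = {"cheap": {"inexpensive"}}
d_formal = formal_distance(["cheap", "phone"],
                           ["inexpensive", "phone", "case"],
                           toy_synonyms)
d_text = text_distance(d_formal, semantic=0.5, alpha=0.5)
```

A pairwise matrix of such text distances could then be passed to any distance-based clustering routine. The unit semantic distance is taken here as a given number because the abstract does not say how it is computed.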

Introduction

With the development of the Internet, human communication has become more dynamic and open. The predecessor of the Internet was ARPANET, the experimental research network of the U.S. Defense Advanced Research Projects Agency. As a global network information system, it has greatly broadened the applicability of the Internet and the dissemination of information over it (Li, Z., 2019). Tim Berners-Lee of the European Laboratory for Particle Physics (CERN) was influenced by Nelson's concept of “hypertext” and proposed the concept of the “Web” for the first time (Xia, D., 2020). More than ten years later, the Web occupies a dominant position on the Internet, and developments in Web technology largely guide the development of the Internet itself. However, along with the success of the Web, the exponentially increasing amount of information has made it more and more difficult for users in all fields to find, access, present, and maintain information. The “rich data and poor knowledge” problem is becoming more and more prominent, mainly because the current Web's representation of information is primarily “presentable”: a large amount of information is presented as natural language, pictures, and similar forms, which leaves people submerged in complex labor such as knowledge discrimination and extraction (Zhu, Y., 2021). Computers can process and validate today's Web information only at the level of format, not at the level of knowledge.

As things stand, realizing the full power of the Internet depends not only on faster processors and more bandwidth, but also on a mechanism for better communication and dialogue that eliminates platform and language differences and provides a new, high-quality information service for all people, based on the principles of freedom, equality, and openness and on a consistent understanding of the real world (Liang, Y. J., 2019). One of the most important factors in the Internet's success has been the establishment of a broad set of standards that ensure interoperability at different levels (Du, W. Y., 2021).
