Social Media Content Analysis and Classification Using Data Mining and ML

Social Media Content Analysis and Classification Using Data Mining and ML

Sambhaji D. Rane
Copyright: © 2021 |Pages: 10
DOI: 10.4018/IJDA.2021070105
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Students' natural conversations on social media such as Twitter and Whatsapp are useful to understand their learning experiences feelings. Collecting and analyzing data from such media can be a difficult task. However, the large scale of data is required for automatic data analysis techniques to classify Twitter data. The proposed new system is a combination of qualitative analysis and large-scale data mining and ML techniques. This system focuses on engineering students' Twitter posts, which are collected from engineering colleges, to understand issues and problems in their learning. The authors first conduct a qualitative analysis using ML studio on tweets collected from engineering colleges using term #DStudentsproblems, engineeringProblem, Aluminisuggestions, and ladyEngineer. Collected tweets are related to engineering students' college lives. In the proposed system, a multi-label classification algorithm to classify tweets reflecting students' problems such as soft skill issues, heavy study load, lack of social engagement, and sleep problems is used.
Article Preview
Top

2. Proposed System

A proposed system is focuses on engineering students’ Twitter posts to understand issues and problems in their educational experiences. The proposed scheme is made up of Twitter data extraction, tweets data cleaning. Classification of tweet data and web module .The proposed scheme performs various operations on tweets as shown in Figure 1

Figure 1.

Architecture of Proposed System for Mining Twitter Data using ML.

IJDA.2021070105.f01

In the first phase user extract tweets from twitter using twitter standard API . Tweet processing operation performed in second phase. Then, tweet classification is perform using Naïve Bayes algorithm, tweets are classified into heavy study load, lack of social engagement, negative emotions, sleep problems, soft-skill issues and other. In data cleaning phase perform various operation on tweet to remove noise from it.

Complete Article List

Search this Journal:
Reset
Volume 5: 1 Issue (2024)
Volume 4: 1 Issue (2023)
Volume 3: 2 Issues (2022): 1 Released, 1 Forthcoming
Volume 2: 2 Issues (2021)
Volume 1: 2 Issues (2020)
View Complete Journal Contents Listing