Predictive Analytics in Mental Health Leveraging LLM Embeddings and Machine Learning Models for Social Media Analysis

Predictive Analytics in Mental Health Leveraging LLM Embeddings and Machine Learning Models for Social Media Analysis

Ahmad Radwan, Mohannad Amarneh, Hussam Alawneh, Huthaifa I. Ashqar, Anas AlSobeh, Aws Abed Al Raheem Magableh
Copyright: © 2024 |Pages: 22
DOI: 10.4018/IJWSR.338222
Article PDF Download
Open access articles are freely available for download

Abstract

The prevalence of stress-related disorders has increased significantly in recent years, necessitating scalable methods to identify affected individuals. This paper proposes a novel approach utilizing large language models (LLMs), with a focus on OpenAI's generative pre-trained transformer (GPT-3) embeddings and machine learning (ML) algorithms to classify social media posts as indicative or not of stress disorders. The aim is to create a preliminary screening tool leveraging online textual data. GPT-3 embeddings transformed posts into vector representations capturing semantic meaning and linguistic nuances. Various models, including support vector machines, random forests, XGBoost, KNN, and neural networks, were trained on a dataset of >10,000 labeled social media posts. The top model, a support vector machine, achieved 83% accuracy in classifying posts displaying signs of stress.
Article Preview
Top

Mental Health And Machine Learning Models: Social Media Analysis

Mental health describes a person's emotional, psychological, and social well-being, encompassing their overall mental and emotional state. It is a dynamic and complex aspect of human health that influences how individuals think, behave, feel, act, and relate to others or objects (World Health Organization [WHO], 2022). In the last decades, mental health illnesses have become widely prevalent. The World Health Organization (WHO) underscores the increasing prevalence of health disorders globally (AlSobeh et al., 20191a), with estimates suggesting that over a billion individuals are affected by a range of mental health conditions (Organizacao Pan-Americana da Saude [OPAS], 2022). From a technology aspect, social media platforms such as Facebook, Instagram, and others have become increasingly popular for individuals to express their thoughts, emotions, and daily experiences. Users often share personal stories, express their frustrations, and seek support from their online communities. This wealth of textual data provides an opportunity to monitor, analyze, and estimate individuals' mental well-being at scale (Chafery, 2024). This escalating trend underscores the imperative for robust, scalable detection and intervention mechanisms. Social media often serves as a digital diary or outlet where users share their innermost feelings, thoughts, and experiences. These expressions can range from joy at personal achievements to stress, anxiety, and other mental health concerns. The language and tone used in these posts reveal significant insights into the user's emotional state and mental well-being. For instance, frequent posts about feelings of sadness, hopelessness, or anxiety could be indicative of underlying mental health issues such as depression or anxiety disorders. Changes in the frequency, content, and nature of social media posts can signal shifts in a person's mental state. A sudden increase in posting, especially if the content is erratic or distressing, or a sudden decrease or absence of activity, can be telling. Analyzing these patterns over time can provide clues about fluctuations in mental health, potentially signaling the onset of a mental health condition or changes in an existing condition. Social media allows users to connect with others, seek support, and engage in communities. How individuals interact with others online, the type of content they share, and the communities they engage with can offer insights into their mental state. Seeking support or discussing mental health challenges in online communities could indicate a need for help or an attempt to cope with personal issues. The advancement in natural language processing (NLP) and ML enables the analysis of vast amounts of unstructured social media data to identify patterns and indicators of mental health conditions. LLMs, like those used in this study, can analyze text data for semantic meaning, emotional tone, and other linguistic markers that are often associated with mental health states.

In the contemporary digital landscape, social media platforms have emerged as pivotal spaces for self-expression and social interaction. Platforms such as Facebook, Instagram, and Twitter serve as conduits for individuals to articulate their thoughts, emotions, and daily experiences. This digital discourse offers an expansive, yet underutilized, dataset that mirrors the collective mental health psyche of its users. The extraction and analysis of this textual data presents an unprecedented opportunity for large-scale mental health monitoring and analysis (Mahlous & Okkali, 2022; Ul Haq et al., 2020).

The field of NLP has seen transformative advancements with the introduction of LLMs such as OpenAI's GPT-3 and GPT-4 (Bubeck et al., 2023). These models represent a new era of NLP, where pre-trained models have begun to take precedence over traditional NLP tasks. OpenAI's models, particularly GPT-4, are notable examples in this domain. They have demonstrated remarkable versatility across various fields including programming and mathematics. These LLMs possess the capability to generate vector-based representations from textual data, effectively capturing sentiment and semantic meanings.

Complete Article List

Search this Journal:
Reset
Volume 21: 1 Issue (2024)
Volume 20: 1 Issue (2023)
Volume 19: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 18: 4 Issues (2021)
Volume 17: 4 Issues (2020)
Volume 16: 4 Issues (2019)
Volume 15: 4 Issues (2018)
Volume 14: 4 Issues (2017)
Volume 13: 4 Issues (2016)
Volume 12: 4 Issues (2015)
Volume 11: 4 Issues (2014)
Volume 10: 4 Issues (2013)
Volume 9: 4 Issues (2012)
Volume 8: 4 Issues (2011)
Volume 7: 4 Issues (2010)
Volume 6: 4 Issues (2009)
Volume 5: 4 Issues (2008)
Volume 4: 4 Issues (2007)
Volume 3: 4 Issues (2006)
Volume 2: 4 Issues (2005)
Volume 1: 4 Issues (2004)
View Complete Journal Contents Listing