Abstract
This paper presents a methodological approach for establishing control corpora in the context of depression detection in the Modern Greek language. We discuss various methods used to create control corpora, focusing on the challenge of selecting representative samples from the general population when the target reference is the depressed population. Our approach includes traditional random selection among Twitter users, as well as an innovative method for creating topic-oriented control corpora. Through this study, we provide insights into the development of control corpora, offering valuable considerations for researchers working on similar projects in linguistic analysis and mental health studies. In addition, we identify several dominant topics in the depressed population such as religion, sentiments, health, sleep and digestion, which seem to align with findings consistently reported in the literature.
| Original language | English |
|---|---|
| Title of host publication | 5th RaPID Workshop |
| Subtitle of host publication | Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 at LREC-COLING 2024 - Workshop Proceedings |
| Editors | Dimitrios Kokkinakis, Kathleen C. Fraser, Charalambos K. Themistocleous, Kristina Lundholm Fors, Athanasios Tsanas, Fredrik Ohman |
| Publisher | European Language Resources Association (ELRA) |
| Pages | 68-76 |
| Number of pages | 9 |
| ISBN (Electronic) | 9782493814111 |
| Publication status | Published - 21 May 2024 |
| Event | 5th RaPID Workshop on Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 - Torino, Italy Duration: 21 May 2024 → … |
Publication series
| Name | 5th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 at LREC-COLING 2024 - Workshop Proceedings |
|---|---|
Conference
| Conference | 5th RaPID Workshop on Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 |
|---|---|
| Country/Territory | Italy |
| City | Torino |
| Period | 21/05/24 → … |
Keywords
- control corpora
- depression detection
- topic modeling