Understanding Depressive Symptoms and Psychosocial Stressors on Twitter: A Corpus-Based Study.
Danielle L MoweryHilary SmithTyler CheneyGregory J StoddardGlen CoppersmithAnnaBelle O BryanMike ConwayPublished in: Journal of medical Internet research (2017)
We successfully developed an annotation scheme and an annotated corpus, the SAD corpus, consisting of 9300 tweets randomly-selected from the Twitter application programming interface using depression-related keywords. Our analyses suggest that keyword queries alone might not be suitable for public health monitoring because context can change the meaning of keyword in a statement. However, postprocessing approaches could be useful for reducing the noise and improving the signal needed to detect depression symptoms using social media.