Abstract
This paper investigates whether infectious intestinal diseases (IIDs) can be detected and quantified using social media content. Experiments are conducted on user-generated data from the microblogging service, Twitter. Evaluation is based on the comparison with the number of IID cases reported by traditional health surveillance methods. We employ a deep learning approach for creating a topical vocabulary, and then apply a regularised linear (Elastic Net) as well as a nonlinear (Gaussian Process) regression function for inference. We show that like previous text regression tasks, the nonlinear approach performs better. In general, our experimental results, both in terms of predictive performance and semantic interpretation, indicate that Twitter data contain a signal that could be strong enough to complement conventional methods for IID surveillance.
Original language | English |
---|---|
Title of host publication | DH '16 : Proceedings of the 2016 Digital Health Conference |
Number of pages | 5 |
Publisher | Association for Computing Machinery |
Publication date | 2016 |
Pages | 157-161 |
ISBN (Electronic) | 978-1-4503-4224-7 |
DOIs | |
Publication status | Published - 2016 |
Event | 6th International Conference on Digital Health - Montreal, Canada Duration: 11 Apr 2016 → 13 Apr 2016 Conference number: 6 |
Conference
Conference | 6th International Conference on Digital Health |
---|---|
Number | 6 |
Country/Territory | Canada |
City | Montreal |
Period | 11/04/2016 → 13/04/2016 |
Keywords
- Disease surveillance
- IID
- Infectious intestinal disease
- Social media
- User-generated content
- Word embeddings