The enemy in your own camp: how well can we detect statistically-generated fake reviews - an adversarial study

Dirk Hovy

The enemy in your own camp: how well can we detect statistically-generated fake reviews - an adversarial study

Dirk Hovy

Department of Nordic Studies and Linguistics

16 Citations (Scopus)

Abstract

Online reviews are a growing market, but it is struggling with fake reviews. They undermine both the value of reviews to the user, and their trust in the review sites. However, fake positive reviews can boost a business, and so a small industry producing fake reviews has developed. The two sides are facing an arms race that involves more and more natural language processing (NLP). So far, NLP has been used mostly for detection, and works well on human-generated reviews. But what happens if NLP techniques are used to generate fake reviews as well? We investigate the question in an adversarial setup, by assessing the detectability of different fake-review generation strategies. We use generative models to produce reviews based on meta-information, and evaluate their effectiveness against deceptiondetection models and human judges. We find that meta-information helps detection, but that NLP-generated reviews conditioned on such information are also much harder to detect than conventional ones.

Original language	English
Title of host publication	Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics
Number of pages	6
Volume	2
Publisher	Association for Computational Linguistics
Publication date	2016
Pages	351-356
ISBN (Electronic)	978-1-945626-01-2
Publication status	Published - 2016
Event	ACL 2016 - Duration: 7 Aug 2016 → 12 Aug 2016

Conference

Conference	ACL 2016
Period	07/08/2016 → 12/08/2016

Access to Document

http://www.aclweb.org/anthology/P/P16/P16-2057.pdf

Cite this

The enemy in your own camp : how well can we detect statistically-generated fake reviews - an adversarial study. / Hovy, Dirk.

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Vol. 2 Association for Computational Linguistics, 2016. p. 351-356.

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

@inproceedings{322373c6b560453fbb753196746e47fc,

title = "The enemy in your own camp: how well can we detect statistically-generated fake reviews - an adversarial study",

abstract = "Online reviews are a growing market, but it is struggling with fake reviews. They undermine both the value of reviews to the user, and their trust in the review sites. However, fake positive reviews can boost a business, and so a small industry producing fake reviews has developed. The two sides are facing an arms race that involves more and more natural language processing (NLP). So far, NLP has been used mostly for detection, and works well on human-generated reviews. But what happens if NLP techniques are used to generate fake reviews as well? We investigate the question in an adversarial setup, by assessing the detectability of different fake-review generation strategies. We use generative models to produce reviews based on meta-information, and evaluate their effectiveness against deceptiondetection models and human judges. We find that meta-information helps detection, but that NLP-generated reviews conditioned on such information are also much harder to detect than conventional ones.",

author = "Dirk Hovy",

year = "2016",

language = "English",

volume = "2",

pages = "351--356",

booktitle = "Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics",

publisher = "Association for Computational Linguistics",

note = "ACL 2016 ; Conference date: 07-08-2016 Through 12-08-2016",

}

TY - GEN

T1 - The enemy in your own camp

T2 - ACL 2016

AU - Hovy, Dirk

PY - 2016

Y1 - 2016

N2 - Online reviews are a growing market, but it is struggling with fake reviews. They undermine both the value of reviews to the user, and their trust in the review sites. However, fake positive reviews can boost a business, and so a small industry producing fake reviews has developed. The two sides are facing an arms race that involves more and more natural language processing (NLP). So far, NLP has been used mostly for detection, and works well on human-generated reviews. But what happens if NLP techniques are used to generate fake reviews as well? We investigate the question in an adversarial setup, by assessing the detectability of different fake-review generation strategies. We use generative models to produce reviews based on meta-information, and evaluate their effectiveness against deceptiondetection models and human judges. We find that meta-information helps detection, but that NLP-generated reviews conditioned on such information are also much harder to detect than conventional ones.

AB - Online reviews are a growing market, but it is struggling with fake reviews. They undermine both the value of reviews to the user, and their trust in the review sites. However, fake positive reviews can boost a business, and so a small industry producing fake reviews has developed. The two sides are facing an arms race that involves more and more natural language processing (NLP). So far, NLP has been used mostly for detection, and works well on human-generated reviews. But what happens if NLP techniques are used to generate fake reviews as well? We investigate the question in an adversarial setup, by assessing the detectability of different fake-review generation strategies. We use generative models to produce reviews based on meta-information, and evaluate their effectiveness against deceptiondetection models and human judges. We find that meta-information helps detection, but that NLP-generated reviews conditioned on such information are also much harder to detect than conventional ones.

M3 - Article in proceedings

VL - 2

SP - 351

EP - 356

BT - Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics

PB - Association for Computational Linguistics

Y2 - 7 August 2016 through 12 August 2016

ER -

The enemy in your own camp: how well can we detect statistically-generated fake reviews - an adversarial study

Abstract

Conference

Access to Document

Fingerprint

Cite this