Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection

Sudhanshu, S; Saito Stenetorp, PLEPS; Riedel, S; (2018) Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. (pp. 4977-4983). Association for Computational Linguistics: Brussels, Belgium. Green open access

Abstract

Grammatical error correction, like other machine learning tasks, greatly benefits from large quantities of high-quality training data, which is typically expensive to produce. While writing a program to automatically generate realistic grammatical errors would be difficult, one could learn the distribution of naturally occurring errors and attempt to introduce them into other datasets. Initial work on inducing errors in this way using statistical machine translation has shown promise; we investigate cheaply constructing synthetic samples, given a small corpus of human-annotated data, using an off-the-shelf attentive sequence-to-sequence model and a straightforward post-processing procedure. Our approach yields error-filled artificial data that helps a vanilla bi-directional LSTM to outperform the previous state of the art at grammatical error detection, and a previously introduced model to gain further improvements of over 5% F0.5 score. When attempting to determine if a given sentence is synthetic, a human annotator at best achieves 39.39 F1 score, indicating that our model generates mostly human-like instances.
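
The abstract reports results as F0.5 and F1 scores. As a minimal sketch of how these metrics relate, the snippet below computes the general F-beta score from precision and recall. The function name `f_beta` and the precision and recall values are illustrative assumptions, not taken from the paper; they only show that F0.5 weights precision more heavily than recall, while F1 weights them equally.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Weighted harmonic mean of precision and recall.

    beta < 1 favours precision, beta > 1 favours recall, beta = 1 gives F1.
    """
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when nothing is predicted correctly
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical values for illustration only, not results from the paper.
p, r = 0.60, 0.30
print(f"F0.5 = {f_beta(p, r, beta=0.5):.4f}")
print(f"F1   = {f_beta(p, r, beta=1.0):.4f}")
```

With precision higher than recall, as in this hypothetical case, the F0.5 value comes out above the F1 value, which is why the two scores in the abstract are not directly comparable.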

Type: Proceedings paper
Title: Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection
Event: 2018 Conference on Empirical Methods in Natural Language Processing
Open access status: An open access version is available from UCL Discovery
DOI: 10.18653/v1/D18-1541
Publisher version: https://www.aclweb.org/anthology/D18-1541/
Language: English
Additional information: This article is published under Creative Commons licence https://creativecommons.org/licenses/by/4.0/
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10057432
Downloads since deposit: 62
