The MegaNegRaising dataset

Authors: Hannah YoungEun An and Aaron Steven White


Version: 1.0

Release date: Aug 14, 2019


The MegaNegRaising dataset consists of neg-raising judgments and acceptability judgments for 925 English clause-embedding verbs in six surface-syntactic frames. The data were collected on Amazon’s Mechanical Turk using Ibex.

For a detailed description of the dataset, the item construction and collection methods, and a discussion of how to use a dataset of this scale to address questions in linguistic theory, please see the following paper:

An, H. Y. & White, A. S. 2019. The lexical and grammatical sources of neg-raising inferences. arXiv:1908.05253 [cs.CL]

If you make use of this dataset in a presentation or publication, we ask that you please cite this paper.

Version history

1.0: first public release, 14 Aug 2019.


| Column | Description | Values |
|---|---|---|
| participant | anonymous integer identifier for participant that provided the response | 0…1107 |
| list | integer identifier for list participant was responding to | 0…247 |
| presentationorder | relative position of item in list | 1…32 |
| verb | clause-embedding verb found in the item | see paper |
| frame | clausal complement found in the item | see paper |
| tense | matrix tense found in the item | present, past |
| subject | matrix subject person found in the item | first, third |
| sentence | sentence that was judged | see paper |
| negraising | neg-raising response | 0…1 |
| acceptability | acceptability response | 0…1 |
| nativeenglish | whether the participant reported speaking American English natively | True, False |
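
The columns above can be read with Python's standard library. The following is a minimal sketch, not an official loader. It assumes a tab-delimited file with a header row that matches the column names, and that `nativeenglish` is stored as the literal strings `True` and `False`. The rows in `SAMPLE_TSV`, including the frame label `frame_a` and the sentence text, are made up for illustration and are not taken from the dataset. The function names `load_rows` and `mean_negraising` are hypothetical.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Made-up rows for illustration only; they are NOT real data from the release.
# The tab delimiter and the frame label "frame_a" are assumptions.
SAMPLE_TSV = (
    "participant\tlist\tpresentationorder\tverb\tframe\ttense\tsubject\t"
    "sentence\tnegraising\tacceptability\tnativeenglish\n"
    "0\t0\t1\tthink\tframe_a\tpresent\tfirst\tplaceholder\t0.75\t0.9\tTrue\n"
    "1\t0\t2\tthink\tframe_a\tpresent\tfirst\tplaceholder\t0.25\t0.8\tTrue\n"
    "2\t1\t1\tthink\tframe_a\tpresent\tfirst\tplaceholder\t1.0\t0.7\tFalse\n"
    "0\t0\t3\tknow\tframe_a\tpast\tthird\tplaceholder\t0.0\t0.9\tTrue\n"
)


def load_rows(handle, delimiter="\t"):
    """Read rows from an open text handle and convert columns to Python types."""
    rows = []
    for raw in csv.DictReader(handle, delimiter=delimiter):
        row = dict(raw)
        for name in ("participant", "list", "presentationorder"):
            row[name] = int(raw[name])
        for name in ("negraising", "acceptability"):
            row[name] = float(raw[name])
        # Assumes the column holds the literal strings "True" / "False".
        row["nativeenglish"] = raw["nativeenglish"] == "True"
        rows.append(row)
    return rows


def mean_negraising(rows, native_only=True):
    """Average the neg-raising response for each (verb, frame) pair."""
    grouped = defaultdict(list)
    for row in rows:
        if native_only and not row["nativeenglish"]:
            continue
        grouped[(row["verb"], row["frame"])].append(row["negraising"])
    return {key: mean(values) for key, values in grouped.items()}


rows = load_rows(io.StringIO(SAMPLE_TSV))
means = mean_negraising(rows)
```

To use this on the released data, replace `io.StringIO(SAMPLE_TSV)` with an open file handle and adjust `delimiter` if the file is not tab-separated. Whether to restrict the analysis to self-reported native speakers of American English is a choice for the analyst; `native_only=False` keeps all responses.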