The MegaNegRaising dataset

Authors: Hannah YoungEun An and Aaron Steven White

Contact: yan2@ur.rochester.edu, aaron.white@rochester.edu

Version: 1.1

Release date: 29 Aug 2020

Overview

This MegaNegRaising dataset consists of neg-raising judgments and acceptability judgments for 925 clause-embedding verbs of English in six surface-syntactic frames. The data were collected on Amazon’s Mechanical Turk using Ibex on Mechanical Turk.

For a detailed description of the dataset, the item construction and collection methods, and discussion of how to use a dataset on this scale to address questions in linguistic theory, please see the following paper:

An, H.Y. & A.S. White. 2019. The lexical and grammatical sources of neg-raising inferences. Proceedings of the Society for Computation in Linguistics 3:23, 220-233.

If you make use of this dataset in a presentation or publication, we ask that you please cite this paper.

Version history

1.0 (14 Aug 2019): first public release 1.1 (29 Aug 2020): adds sentence with embedded negation

Description

mega-negraising-v1.tsv contains the raw data.

Column Description Values
participant anonymous integer identifier for participant that provided the response 0…1107
list integer identifier for list participant was responding to 0…247
presentationorder relative position of item in list 1…32
verb clause-embedding verb found in the item see paper
frame clausal complement found in the item see paper
tense matrix tense found in the item present, past
subject matrix subject person found in the item first, third
sentence1 the sentence with matrix negation see paper
sentence2 the sentence with embedded negation see paper
negraising neg-raising response 0…1
acceptability acceptability response for sentence1 0…1
nativeenglish whether the participant reported speaking American English natively True, False

mega-negraising-v1-normalized.tsv contains data normalized using the procedure described in An & White 2020.

Column Description Values
verb lemma of clause-embedding verb found in the item see paper
subject matrix subject person found in the item first, third
tense matrix tense found in the item present, past
frame clausal complement found in the item see paper
sentence1 the sentence with matrix negation see paper
sentence2 the sentence with embedded negation see paper
negraising normalized neg-raising response 0…1
acceptability normalized acceptability response for sentence1 0…1