The MegaNegRaising dataset
Authors: Hannah YoungEun An and Aaron Steven White
Contact: yan2@ur.rochester.edu, aaron.white@rochester.edu
Version: 1.1
Release date: 29 Aug 2020
Overview
This MegaNegRaising dataset consists of neg-raising judgments and acceptability judgments for 925 clause-embedding verbs of English in six surface-syntactic frames. The data were collected on Amazon’s Mechanical Turk using Ibex on Mechanical Turk.
For a detailed description of the dataset, the item construction and collection methods, and discussion of how to use a dataset on this scale to address questions in linguistic theory, please see the following paper:
An, H.Y. & A.S. White. 2019. The lexical and grammatical sources of neg-raising inferences. Proceedings of the Society for Computation in Linguistics 3:23, 220-233.
If you make use of this dataset in a presentation or publication, we ask that you please cite this paper.
Version history
1.0 (14 Aug 2019): first public release 1.1 (29 Aug 2020): adds sentence with embedded negation
Description
mega-negraising-v1.tsv contains the raw data.
| Column | Description | Values |
|---|---|---|
| participant | anonymous integer identifier for participant that provided the response | 0…1107 |
| list | integer identifier for list participant was responding to | 0…247 |
| presentationorder | relative position of item in list | 1…32 |
| verb | clause-embedding verb found in the item | see paper |
| frame | clausal complement found in the item | see paper |
| tense | matrix tense found in the item | present, past |
| subject | matrix subject person found in the item | first, third |
| sentence1 | the sentence with matrix negation | see paper |
| sentence2 | the sentence with embedded negation | see paper |
| negraising | neg-raising response | 0…1 |
| acceptability | acceptability response for sentence1 | 0…1 |
| nativeenglish | whether the participant reported speaking American English natively | True, False |
mega-negraising-v1-normalized.tsv contains data normalized using the procedure described in An & White 2020.
| Column | Description | Values |
|---|---|---|
| verb | lemma of clause-embedding verb found in the item | see paper |
| subject | matrix subject person found in the item | first, third |
| tense | matrix tense found in the item | present, past |
| frame | clausal complement found in the item | see paper |
| sentence1 | the sentence with matrix negation | see paper |
| sentence2 | the sentence with embedded negation | see paper |
| negraising | normalized neg-raising response | 0…1 |
| acceptability | normalized acceptability response for sentence1 | 0…1 |