The Santa Cruz Sluicing Dataset
Pranav Anand, Daniel Hardt, James McCloskey
January 2021
 

This paper describes a new research resource: a searchable database of 4700 naturally occurring instances of sluicing in English, annotated so as to shed light on the questions that have shaped research on ellipsis since the 1960s. The paper describes the dataset and how it can be obtained, how it was constructed, how it is organized, and how it can be queried. It also highlights some initial empirical findings, first describing general characteristics of the data, then focusing more closely on issues concerning antecedents and possible mismatches between antecedents and ellipsis sites.
Reference: lingbuzz/005673
(please use that when you cite this article)
Published in: to appear
Keywords: ellipsis, sluicing, English, corpus annotation, semantics, syntax
Previous versions: v1 [August 2020]
Downloaded: 1039 times