University of Sussex

File(s) not publicly available

Automatic extraction of subcategorization from corpora

posted on 2023-06-07, 21:39 authored by Ted Briscoe, John Carroll
We describe a novel technique and implemented system for constructing a subcategorization dictionary from textual corpora. Each dictionary entry encodes the relative frequency of occurrence of a comprehensive set of subcategorization classes for English. An initial experiment, on a sample of 14 verbs which exhibit multiple complementation patterns, demonstrates that the technique achieves accuracy comparable to previous approaches, which are all limited to a highly restricted set of subcategorization classes. We also demonstrate that a subcategorization dictionary built with the system improves the accuracy of a parser by an appreciable amount.


Publication status

  • Published

Page range


Presentation Type

  • paper

Event name

Proceedings of the 5th ACL Conference on Applied Natural Language Processing (ANLP'97) Washington DC.

Event location

Washington DC.

Event type


Department affiliated with

  • Informatics Publications

Full text available

  • No

Peer reviewed?

  • Yes

Legacy Posted Date


Usage metrics

    University of Sussex (Publications)


    No categories selected