University of Sussex
Browse
DOCUMENT
set-weeds-dowdall.pdf (358.87 kB)
DOCUMENT
set-weeds-dowdall.pdf (467.59 kB)
DOCUMENT
set-weeds-dowdall.pdf (456.72 kB)
1/0
3 files

Using distributional similarity to organise biomedical terminology

journal contribution
posted on 2023-06-07, 14:11 authored by Julie WeedsJulie Weeds, James Dowdall, Gerold Schneider, Bill Keller, David WeirDavid Weir
We investigate an application of distributional similarity techniques to the problem of structural organisation of biomedical terminology. Our application domain is the relatively small GENIA corpus. Using terms that have been accurately marked-up by hand within the corpus, we consider the problem of automatically determining semantic proximity. Terminological units are dened for our purposes as normalised classes of individual terms. Syntactic analysis of the corpus data is carried out using the Pro3Gres parser and provides the data required to calculate distributional similarity using a variety of dierent measures. Evaluation is performed against a hand-crafted gold standard for this domain in the form of the GENIA ontology. We show that distributional similarity can be used to predict semantic type with a good degree of accuracy.

History

Publication status

  • Published

Journal

Terminology

ISSN

0929-9971

Issue

1

Volume

11

Page range

107-141

Department affiliated with

  • Informatics Publications

Full text available

  • Yes

Peer reviewed?

  • Yes

Legacy Posted Date

2008-02-27

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC