Using distributional similarity to organise biomedical terminology

Weeds, Julie; Dowdall, James; Schneider, Gerold; Keller, Bill; Weir, David

Using distributional similarity to organise biomedical terminology

journal contribution

posted on 2023-06-07, 14:11 authored by Julie WeedsJulie Weeds, James Dowdall, Gerold Schneider, Bill Keller, David WeirDavid Weir

We investigate an application of distributional similarity techniques to the problem of structural organisation of biomedical terminology. Our application domain is the relatively small GENIA corpus. Using terms that have been accurately marked-up by hand within the corpus, we consider the problem of automatically determining semantic proximity. Terminological units are dened for our purposes as normalised classes of individual terms. Syntactic analysis of the corpus data is carried out using the Pro3Gres parser and provides the data required to calculate distributional similarity using a variety of dierent measures. Evaluation is performed against a hand-crafted gold standard for this domain in the form of the GENIA ontology. We show that distributional similarity can be used to predict semantic type with a good degree of accuracy.

History

Publication status

Published

Journal

Terminology

ISSN

0929-9971

External DOI

https://doi.org/10.1075/term.11.1.06wee

Issue

1

Volume

11

Page range

107-141

Department affiliated with

Informatics Publications

Full text available

Yes

Peer reviewed?

Yes

Legacy Posted Date

2008-02-27

Using distributional similarity to organise biomedical terminology

History

Publication status

Journal

ISSN

External DOI

Issue

Volume

Page range

Department affiliated with

Full text available

Peer reviewed?

Legacy Posted Date

Usage metrics

Categories

Keywords

Licence

Exports