DOCUMENT
DOCUMENT
1/1
Using distributional similarity to organise biomedical terminology
journal contribution
posted on 2023-06-07, 14:11 authored by Julie WeedsJulie Weeds, James Dowdall, Gerold Schneider, Bill Keller, David WeirDavid WeirWe investigate an application of distributional similarity techniques to the problem of structural organisation of biomedical terminology. Our application domain is the relatively small GENIA corpus. Using terms that have been accurately marked-up by hand within the corpus, we consider the problem of automatically determining semantic proximity. Terminological units are dened for our purposes as normalised classes of individual terms. Syntactic analysis of the corpus data is carried out using the Pro3Gres parser and provides the data required to calculate distributional similarity using a variety of dierent measures. Evaluation is performed against a hand-crafted gold standard for this domain in the form of the GENIA ontology. We show that distributional similarity can be used to predict semantic type with a good degree of accuracy.
History
Publication status
- Published
Journal
TerminologyISSN
0929-9971External DOI
Issue
1Volume
11Page range
107-141Department affiliated with
- Informatics Publications
Full text available
- Yes
Peer reviewed?
- Yes