Testing Paraphrase Models on Recognising Sentence Pairs at Different Degrees of Semantic Overlap

Peng, Qiwei; Weir, David; Weeds, Julie

conference contribution

posted on 2023-08-14, 12:35 authored by Qiwei Peng, David Weir, Julie Weeds

Paraphrase detection is useful in many natural language understanding applications. Current work typically formulates this problem as a sentence pair binary classification task. However, this setup is not a good fit for many of the intended applications of paraphrase models. In particular, such applications often involve finding the closest paraphrases of a target sentence from a group of candidate sentences that exhibit different degrees of semantic overlap with the target. To be applied to this paraphrase retrieval scenario, a model must be sensitive to the degree to which two sentences are paraphrases of one another. However, many existing datasets ignore this setup and fail to test models in it. In response, we propose adversarial paradigms for creating evaluation datasets that examine sensitivity to different degrees of semantic overlap. Empirical results show that, while paraphrase models and different sentence encoders appear successful on standard evaluations, measuring the degree of semantic overlap remains a major challenge for them.

History

Publication status

Published

File Version

Published version

Journal

Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023)

Publisher

Association for Computational Linguistics

Publisher URL

http://dx.doi.org/10.18653/v1/2023.starsem-1.24

External DOI

https://doi.org/10.18653/v1/2023.starsem-1.24

Page range

259-269

Pages

10

Event name

Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023)

Event type

conference

Event start date

2023-07-01

Event finish date

2023-07-01

Place of publication

Toronto

Department affiliated with

Informatics Publications

Full text available

Yes
