2022.coling-1.361.pdf (786.97 kB)
Towards structure-aware paraphrase identification with phrase alignment using sentence encoders
conference contribution
posted on 2023-06-10, 04:51 authored by Qiwei Peng, David WeirDavid Weir, Julie WeedsJulie WeedsPrevious works have demonstrated the effectiveness of utilising pre-trained sentence encoders based on their sentence representations for meaning comparison tasks. Though such representations are shown to capture hidden syntax structures, the direct similarity comparison between them exhibits weak sensitivity to word order and structural differences in given sentences. A single similarity score further makes the comparison process hard to interpret. Therefore, we here propose to combine sentence encoders with an alignment component by representing each sentence as a list of predicate-argument spans (where their span representations are derived from sentence encoders), and decomposing the sentence-level meaning comparison into the alignment between their spans for paraphrase identification tasks. Empirical results show that the alignment component brings in both improved performance and interpretability for various sentence encoders. After closer investigation, the proposed approach indicates increased sensitivity to structural difference and enhanced ability to distinguish non-paraphrases with high lexical overlap.
History
Publication status
- Published
File Version
- Published version
Journal
29th International Conference on Computational LinguisticsPublisher
International Committee on Computational LinguisticsPublisher URL
Page range
4113-4123Event name
29th International Conference on Computational LinguisticsEvent location
KoreaEvent type
conferenceEvent date
October 12-17, 2022Series
COLING'2022Department affiliated with
- Informatics Publications
Full text available
- Yes
Peer reviewed?
- Yes
Legacy Posted Date
2022-09-27First Open Access (FOA) Date
2022-10-19First Compliant Deposit (FCD) Date
2022-09-27Usage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC