Open Domain Response Generation Guided by Retrieved Conversations

Chang Shu; Zijian Zhang; Youxin Chen; Jing Xiao; Jey Han Lau; Qian Zhang; Zheng Lu

doi:10.1109/ACCESS.2022.3225647

Open Domain Response Generation Guided by Retrieved Conversations

Chang Shu, Zijian Zhang, Youxin Chen, Jing Xiao, Jey Han Lau, Qian Zhang, Zheng Lu

School of Computer Science

Research output: Journal Publication › Article › peer-review

Abstract

Open domain response generation is the task of creating a response given a user query in any topics/domain. Limited by context and reference information, responses generated by current systems are often 'bland' or generic. In this paper, we combine a response generation model with a retrieval system that searches for relevant utterances and responses. The generation model has two main components: a keyword extraction module and a two-stage transformer. The keyword extraction module aims to extract two types of keywords in an unsupervised fashion from the retrieved results: (1) keywords in the query not found in the retrieved utterances (DiffKey), and (2) overlapping keywords among the retrieved responses (SimKey). Given these keywords, the two-stage transformer first decides where to insert the keywords in the response, and the second generates the full response given the location of the keywords. The keyword extraction module and the two-stage transformer are connected in a single network, and so our system is trained end-to-end. Experimental results on Cornell Movie-Dialog corpus, Douban and Weibo demonstrate that our model outperforms state-of-the-art systems in terms of ROUGE, relevance scores and human evaluation.

Original language	English
Pages (from-to)	99365-99375
Number of pages	11
Journal	IEEE Access
Volume	11
DOIs	https://doi.org/10.1109/ACCESS.2022.3225647
Publication status	Published - 2023

Keywords

Dialogue generation
deep learning
hybrid retrieval-generation

ASJC Scopus subject areas

General Engineering
General Computer Science
General Materials Science

Access to Document

10.1109/ACCESS.2022.3225647

Cite this

@article{cdfe1fd810e141498b4879ca7abc94ae,

title = "Open Domain Response Generation Guided by Retrieved Conversations",

abstract = "Open domain response generation is the task of creating a response given a user query in any topics/domain. Limited by context and reference information, responses generated by current systems are often 'bland' or generic. In this paper, we combine a response generation model with a retrieval system that searches for relevant utterances and responses. The generation model has two main components: a keyword extraction module and a two-stage transformer. The keyword extraction module aims to extract two types of keywords in an unsupervised fashion from the retrieved results: (1) keywords in the query not found in the retrieved utterances (DiffKey), and (2) overlapping keywords among the retrieved responses (SimKey). Given these keywords, the two-stage transformer first decides where to insert the keywords in the response, and the second generates the full response given the location of the keywords. The keyword extraction module and the two-stage transformer are connected in a single network, and so our system is trained end-to-end. Experimental results on Cornell Movie-Dialog corpus, Douban and Weibo demonstrate that our model outperforms state-of-the-art systems in terms of ROUGE, relevance scores and human evaluation.",

keywords = "Dialogue generation, deep learning, hybrid retrieval-generation",

author = "Chang Shu and Zijian Zhang and Youxin Chen and Jing Xiao and Lau, {Jey Han} and Qian Zhang and Zheng Lu",

note = "Publisher Copyright: Author",

year = "2023",

doi = "10.1109/ACCESS.2022.3225647",

language = "English",

volume = "11",

pages = "99365--99375",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Open Domain Response Generation Guided by Retrieved Conversations

AU - Shu, Chang

AU - Zhang, Zijian

AU - Chen, Youxin

AU - Xiao, Jing

AU - Lau, Jey Han

AU - Zhang, Qian

AU - Lu, Zheng

N1 - Publisher Copyright: Author

PY - 2023

Y1 - 2023

N2 - Open domain response generation is the task of creating a response given a user query in any topics/domain. Limited by context and reference information, responses generated by current systems are often 'bland' or generic. In this paper, we combine a response generation model with a retrieval system that searches for relevant utterances and responses. The generation model has two main components: a keyword extraction module and a two-stage transformer. The keyword extraction module aims to extract two types of keywords in an unsupervised fashion from the retrieved results: (1) keywords in the query not found in the retrieved utterances (DiffKey), and (2) overlapping keywords among the retrieved responses (SimKey). Given these keywords, the two-stage transformer first decides where to insert the keywords in the response, and the second generates the full response given the location of the keywords. The keyword extraction module and the two-stage transformer are connected in a single network, and so our system is trained end-to-end. Experimental results on Cornell Movie-Dialog corpus, Douban and Weibo demonstrate that our model outperforms state-of-the-art systems in terms of ROUGE, relevance scores and human evaluation.

AB - Open domain response generation is the task of creating a response given a user query in any topics/domain. Limited by context and reference information, responses generated by current systems are often 'bland' or generic. In this paper, we combine a response generation model with a retrieval system that searches for relevant utterances and responses. The generation model has two main components: a keyword extraction module and a two-stage transformer. The keyword extraction module aims to extract two types of keywords in an unsupervised fashion from the retrieved results: (1) keywords in the query not found in the retrieved utterances (DiffKey), and (2) overlapping keywords among the retrieved responses (SimKey). Given these keywords, the two-stage transformer first decides where to insert the keywords in the response, and the second generates the full response given the location of the keywords. The keyword extraction module and the two-stage transformer are connected in a single network, and so our system is trained end-to-end. Experimental results on Cornell Movie-Dialog corpus, Douban and Weibo demonstrate that our model outperforms state-of-the-art systems in terms of ROUGE, relevance scores and human evaluation.

KW - Dialogue generation

KW - deep learning

KW - hybrid retrieval-generation

UR - http://www.scopus.com/inward/record.url?scp=85144052544&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2022.3225647

DO - 10.1109/ACCESS.2022.3225647

M3 - Article

AN - SCOPUS:85144052544

SN - 2169-3536

VL - 11

SP - 99365

EP - 99375

JO - IEEE Access

JF - IEEE Access

ER -

Open Domain Response Generation Guided by Retrieved Conversations

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this