Access the full text.
Sign up today, get DeepDyve free for 14 days.
Andras Farag (2010)
Association for Computing Machinery
Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova (2019)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
M. Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdel-rahman Mohamed, Omer Levy, Veselin Stoyanov, Luke Zettlemoyer (2019)
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
Lifeng Shang, Zhengdong Lu, Hang Li (2015)
Neural Responding Machine for Short-Text ConversationArXiv, abs/1503.02364
H. Shum, Xiaodong He, Di Li (2018)
From Eliza to XiaoIce: challenges and opportunities with social chatbotsFrontiers of Information Technology & Electronic Engineering, 19
Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Kurt Shuster, Eric Smith, Y.-Lan Boureau, J. Weston (2020)
Recipes for Building an Open-Domain Chatbot
Maria Tsimpoukelli, Jacob Menick, Serkan Cabi, S. Eslami, Oriol Vinyals, Felix Hill, Zacharias Janssen (2021)
Multimodal Few-Shot Learning with Frozen Language Models
Min Yang, Zhou Zhao, Wei Zhao, Xiaojun Chen, Jia Zhu, Lianqiang Zhou, Zigang Cao (2017)
Personalized Response Generation via Domain adaptationProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Margaret Mitchell, Jian-Yun Nie, Jianfeng Gao, W. Dolan (2015)
A Neural Network Approach to Context-Sensitive Generation of Conversational ResponsesArXiv, abs/1506.06714
Telmo Pires, Eva Schlinger, Dan Garrette (2019)
How Multilingual is Multilingual BERT?ArXiv, abs/1906.01502
Jiwei Li, Michel Galley, Chris Brockett, Georgios Spithourakis, Jianfeng Gao, W. Dolan (2016)
A Persona-Based Neural Conversation ModelArXiv, abs/1603.06155
Zhaojiang Lin, Andrea Madotto, Chien-Sheng Wu, Pascale Fung (2019)
Personalizing Dialogue Agents via Meta-Learning
[ (2019)
Retrieval-enhanced adversarial training for neural response generationProceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics
Juntao Li, Chang Liu, Chongyang Tao, Zhangming Chan, Dongyan Zhao, Min Zhang, Rui Yan (2021)
Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based ChatbotsACM Transactions on Information Systems (TOIS), 39
Hao Zhou, Minlie Huang, Tianyang Zhang, Xiaoyan Zhu, Bing-Qian Liu (2017)
Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory
Alec Radford, Karthik Narasimhan (2018)
Improving Language Understanding by Generative Pre-Training
[ (1994)
Some simple effective approximations to the 2-poisson model for probabilistic weighted retrievalSIGIR’94
Matthew Crosby, Ronald Petrick (2014)
Association for the Advancement of Artificial Intelligence
Lili Mou, Yiping Song, Rui Yan, Ge Li, Lu Zhang, Zhi Jin (2016)
Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation
Yixuan Su, Yan Wang, Simon Baker, Deng Cai, Xiaojiang Liu, A. Korhonen, Nigel Collier (2020)
PROTOTYPE-TO-STYLE: Dialogue Generation With Style-Aware Editing on Retrieval MemoryIEEE/ACM Transactions on Audio, Speech, and Language Processing, 29
Rui Yan, Dongyan Zhao, E. Weinan (2017)
Joint Learning of Response Ranking and Next Utterance Suggestion in Human-Computer Conversation SystemProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
Yixin Nie, Mary Williamson, Mohit Bansal, Douwe Kiela, J. Weston (2020)
I like fish, especially dolphins: Addressing Contradictions in Dialogue ModelingArXiv, abs/2012.13391
Hao Zhou, Tom Young, Minlie Huang, Haizhou Zhao, Jingfang Xu, Xiaoyan Zhu (2018)
Commonsense Knowledge Aware Conversation Generation with Graph Attention
Xiao Sun, Jia Li, Xing Wei, Changliang Li, J. Tao (2019)
Emotional Conversation Generation Based on a Bayesian Deep Neural NetworkACM Transactions on Information Systems (TOIS), 38
S. Welleck, J. Weston, Arthur Szlam, Kyunghyun Cho (2018)
Dialogue Natural Language Inference
Ruqing Zhang, Jiafeng Guo, Yixing Fan, Yanyan Lan, Xueqi Cheng (2020)
Dual-factor Generation Model for ConversationACM Transactions on Information Systems (TOIS), 38
Liang Xu, Xuanwei Zhang, Lu Li, Hai Hu, Chenjie Cao, Weitang Liu, Junyi Li, Yudong Li, Kai Sun, Yechen Xu, Yiming Cui, Cong Yu, Qianqian Dong, Yin Tian, Dian Yu, Bo Shi, Jun-jie Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhen-Yi Yang, Kyle Richardson, Zhenzhong Lan (2020)
CLUE: A Chinese Language Understanding Evaluation BenchmarkArXiv, abs/2004.05986
Deng Cai, Yan Wang, Victoria Bi, Zhaopeng Tu, Xiaojiang Liu, Wai Lam, Shuming Shi (2018)
Skeleton-to-Response: Dialogue Generation Guided by Retrieval MemoryArXiv, abs/1809.05296
Haoyu Song, Yan Wang, Kaiyan Zhang, Weinan Zhang, Ting Liu (2021)
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized DataArXiv, abs/2106.06169
Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Zhao, Dianhai Yu, Hua Wu (2018)
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network
Chen Xing, Wei Wu, Yu Wu, Jie Liu, Yalou Huang, M. Zhou, Wei-Ying Ma (2016)
Topic Aware Neural Response Generation
Chunyuan Yuan, W. Zhou, Mingming Li, Shangwen Lv, Fuqing Zhu, Jizhong Han, Songlin Hu (2019)
Multi-hop Selector Network for Multi-turn Response Selection in Retrieval-based Chatbots
In The NeurIPS'18 Competition
[ (2020)
The second conversational intelligence challenge (ConvAI2)The NeurIPS’18 Competition. Springer International Publishing
Rui Yan, Yiping Song, Hua Wu (2016)
Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation SystemProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval
Zhiliang Tian, Rui Yan, Lili Mou, Yiping Song, Yansong Feng, Dongyan Zhao (2017)
How to Make Context More Useful? An Empirical Study on Context-Aware Neural Conversational Models
Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, J. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, T. Henighan, Rewon Child, A. Ramesh, Daniel Ziegler, Jeff Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, S. Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei (2020)
Language Models are Few-Shot LearnersArXiv, abs/2005.14165
Deng Cai, Yan Wang, Wei Bi, Zhaopeng Tu, Xiaojiang Liu, Shuming Shi (2019)
Retrieval-guided Dialogue Response Generation via a Matching-to-Generation Framework
Ilya Sutskever, Oriol Vinyals, Quoc Le (2014)
Sequence to Sequence Learning with Neural NetworksArXiv, abs/1409.3215
Haoyu Song, Weinan Zhang, Yiming Cui, Dong Wang, Ting Liu (2019)
Exploiting Persona Information for Diverse Generation of Conversational Responses
Yiping Song, Zequn Liu, Wei Bi, Rui Yan, Ming Zhang (2019)
Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks
Yinhe Zheng, Guanyi Chen, Minlie Huang, Song Liu, Xuan Zhu (2019)
Personalized Dialogue Generation with Diversified TraitsArXiv, abs/1901.09672
Pei-hao Su, M. Gašić, N. Mrksic, L. Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, S. Young (2016)
On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue SystemsArXiv, abs/1605.07669
S. Robertson, S. Walker (1994)
Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
Thomas Wolf, Victor Sanh, Julien Chaumond, Clement Delangue (2019)
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational AgentsArXiv, abs/1901.08149
Liu Yang, Minghui Qiu, Chen Qu, J. Guo, Yongfeng Zhang, W. Croft, Jun Huang, Haiqing Chen (2018)
Response Ranking with Deep Matching Networks and External Knowledge in Information-seeking Conversation SystemsThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, M. Zhou, H. Hon (2019)
Unified Language Model Pre-training for Natural Language Understanding and Generation
Xueliang Zhao, Wei Wu, Chongyang Tao, Can Xu, Dongyan Zhao, Rui Yan (2020)
Low-Resource Knowledge-Grounded Dialogue GenerationArXiv, abs/2002.10348
Qiao Qian, Minlie Huang, Haizhou Zhao, Jingfang Xu, Xiaoyan Zhu (2018)
Assigning Personality/Profile to a Chatting Machine for Coherent Conversation Generation
Yuan Zhang, David Weiss (2016)
Stack-propagation: Improved Representation Learning for SyntaxArXiv, abs/1603.06598
[ (2018)
Neural approaches to conversational AIThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. Association for Computing Machinery
R. Barzilay, Min-Yen Kan (2017)
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2
Qian Liu, Yihong Chen, B. Chen, Jian-Guang Lou, Zixuan Chen, Bin Zhou, Dongmei Zhang (2020)
You Impress Me: Dialogue Generation via Mutual Persona Perception
Siqi Bao, H. He, Fan Wang, Hua Wu (2019)
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
Shuai Yang, Jiaying Liu, Wenjing Wang, Zongming Guo (2019)
Promoting Diversity for End-to-End Conversation Response GenerationArXiv, abs/1901.09444
Alec Radford, Jeff Wu, Rewon Child, D. Luan, Dario Amodei, Ilya Sutskever (2019)
Language Models are Unsupervised Multitask Learners
Rami Al-Rfou, Marc Pickett, Javier Snaider, Yun-Hsuan Sung, B. Strope, R. Kurzweil (2016)
Conversational Contextual Cues: The Case of Personalization and History for Response RankingArXiv, abs/1606.00372
Semih Yavuz, Abhinav Rastogi, Guan-Lin Chao, Dilek Hakkani-Tür (2019)
DeepCopy: Grounded Response Generation with Hierarchical Pointer NetworksArXiv, abs/1908.10731
Mor Naaman, Jeffrey Boase, Chih‐Hui Lai (2010)
Is it really about me?: message content in social awareness streams
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter Liu (2019)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text TransformerJ. Mach. Learn. Res., 21
Jiwei Li, Will Monroe, Alan Ritter, Dan Jurafsky, Michel Galley, Jianfeng Gao (2016)
Deep Reinforcement Learning for Dialogue GenerationArXiv, abs/1606.01541
Oriol Vinyals, Quoc Le (2015)
A Neural Conversational ModelArXiv, abs/1506.05869
L. Humphreys, Phillipa Gill, B. Krishnamurthy (2014)
Twitter: a content analysis of personal informationInformation, Communication & Society, 17
Sergey Golovanov, R. Kurbanov, S. Nikolenko, Kyryl Truskovskyi, Alexander Tselousov, Thomas Wolf (2019)
Large-Scale Transfer Learning for Natural Language Generation
Samuel Bowman, Gabor Angeli, Christopher Potts, Christopher Manning (2015)
A large annotated corpus for learning natural language inference
Yinhe Zheng, Rongsheng Zhang, Xiaoxi Mao, Minlie Huang (2019)
A Pre-training Based Personalized Dialogue Generation Model with Persona-sparse Data
Margaret Li, Stephen Roller, Ilia Kulikov, S. Welleck, Y-Lan Boureau, Kyunghyun Cho, J. Weston (2019)
Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood TrainingArXiv, abs/1911.03860
Adina Williams, Nikita Nangia, Samuel Bowman (2017)
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Minlie Huang, Xiaoyan Zhu, Jianfeng Gao (2019)
Challenges in Building Intelligent Open-domain Dialog SystemsACM Transactions on Information Systems (TOIS), 38
Zongcheng Ji, Zhengdong Lu, Hang Li (2014)
An Information Retrieval Approach to Short Text ConversationArXiv, abs/1408.6988
A. Turing (1950)
Computing Machinery and IntelligenceMind, LIX
Haoyu Song, Yan Wang, Weinan Zhang, Xiaojiang Liu, Ting Liu (2020)
Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue GenerationArXiv, abs/2004.07672
Kevin Knight, A. Nenkova, Owen Rambow (2016)
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Haoyu Song, Yan Wang, Weinan Zhang, Zhengyu Zhao, Ting Liu, Xiaojiang Liu (2020)
Profile Consistency Identification for Open-domain Dialogue AgentsArXiv, abs/2009.09680
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan Gomez, Lukasz Kaiser, Illia Polosukhin (2017)
Attention is All you Need
Nouha Dziri, Ehsan Kamalloo, K. Mathewson, Osmar Zaiane (2019)
Evaluating Coherence in Dialogue Systems using Entailment
Emily Dinan, V. Logacheva, Valentin Malykh, Alexander Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, A. Black, Alexander Rudnicky, Jason Williams, Joelle Pineau, M. Burtsev, J. Weston (2019)
The Second Conversational Intelligence Challenge (ConvAI2)ArXiv, abs/1902.00098
Jianfeng Gao, Michel Galley, Lihong Li (2018)
Neural Approaches to Conversational AIThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, W. Dolan (2015)
A Diversity-Promoting Objective Function for Neural Conversation ModelsArXiv, abs/1510.03055
Andrea Madotto, Zhaojiang Lin, Yejin Bang, Pascale Fung (2020)
The Adapter-Bot: All-In-One Controllable Conversational Model
Eric Smith, Mary Williamson, Kurt Shuster, J. Weston, Y-Lan Boureau (2020)
Can You Put it All Together: Evaluating Conversational Agents’ Ability to Blend Skills
S. Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, J. Weston (2019)
Neural Text Generation with Unlikelihood TrainingArXiv, abs/1908.04319
Jiatao Gu, Zhengdong Lu, Hang Li, V. Li (2016)
Incorporating Copying Mechanism in Sequence-to-Sequence LearningArXiv, abs/1603.06393
Libo Qin, Wanxiang Che, Yangming Li, Haoyang Wen, Ting Liu (2019)
A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding
Emily Dinan, Stephen Roller, Kurt Shuster, Angela Fan, Michael Auli, J. Weston (2018)
Wizard of Wikipedia: Knowledge-Powered Conversational agentsArXiv, abs/1811.01241
Yizhe Zhang, Siqi Sun, Michel Galley, Yen-Chun Chen, Chris Brockett, Xiang Gao, Jianfeng Gao, Jingjing Liu, W. Dolan (2019)
DIALOGPT : Large-Scale Generative Pre-training for Conversational Response GenerationArXiv, abs/1911.00536
Saizheng Zhang, Emily Dinan, Jack Urbanek, Arthur Szlam, Douwe Kiela, J. Weston (2018)
Personalizing Dialogue Agents: I have a dog, do you have pets too?ArXiv, abs/1801.07243
Weinan Zhang, Ting Liu, Yifa Wang, Qingfu Zhu (2017)
Neural personalized response generation as domain adaptationWorld Wide Web, 22
Yu Wu, Wei Wu, Ming Zhou, Zhoujun Li (2016)
Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, M. Lewis, Luke Zettlemoyer, Veselin Stoyanov (2019)
RoBERTa: A Robustly Optimized BERT Pretraining ApproachArXiv, abs/1907.11692
With the resurgent interest in building open-domain dialogue systems, the dialogue generation task has attracted increasing attention over the past few years. This task is usually formulated as a conditional generation problem, which aims to generate a natural and meaningful response given dialogue contexts and specific constraints, such as persona. And maintaining a consistent persona is essential for the dialogue systems to gain trust from the users. Although tremendous advancements have been brought, traditional persona-based dialogue models are typically trained by leveraging a large number of persona-dense dialogue examples. Yet, such persona-dense training data are expensive to obtain, leading to a limited scale. This work presents a novel approach to learning from limited training examples by regarding consistency understanding as a regularization of response generation. To this end, we propose a novel stack-propagation framework for learning a generation and understanding pipeline. Specifically, the framework stacks a Transformer encoder and two Transformer decoders, where the first decoder models response generation and the second serves as a regularizer and jointly models response generation and consistency understanding. The proposed framework can benefit from the stacked encoder and decoders to learn from much smaller personalized dialogue data while maintaining competitive performance. Under different low-resource settings, subjective and objective evaluations prove that the stack-propagation framework outperforms strong baselines in response quality and persona consistency and largely overcomes the shortcomings of traditional models that rely heavily on the persona-dense dialogue data.
ACM Transactions on Information Systems (TOIS) – Association for Computing Machinery
Published: Apr 4, 2023
Keywords: Open-domain dialogue
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.