中国综合性科技类核心期刊(北大核心)

中国科学引文数据库来源期刊(CSCD)

美国《化学文摘》(CA)收录

美国《数学评论》(MR)收录

俄罗斯《文摘杂志》收录

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

GRS: 一种面向电商领域智能客服的生成-检索式对话模型

郭晓哲 彭敦陆 张亚彤 彭学桂

郭晓哲, 彭敦陆, 张亚彤, 彭学桂. GRS: 一种面向电商领域智能客服的生成-检索式对话模型[J]. 华东师范大学学报(自然科学版), 2020, (5): 156-166. doi: 10.3969/j.issn.1000-5641.202091010
引用本文: 郭晓哲, 彭敦陆, 张亚彤, 彭学桂. GRS: 一种面向电商领域智能客服的生成-检索式对话模型[J]. 华东师范大学学报(自然科学版), 2020, (5): 156-166. doi: 10.3969/j.issn.1000-5641.202091010
GUO Xiaozhe, PENG Dunlu, ZHANG Yatong, PENG Xuegui. GRS: A generative retrieval dialogue model for intelligent customer service in the field of e-commerce[J]. Journal of East China Normal University (Natural Sciences), 2020, (5): 156-166. doi: 10.3969/j.issn.1000-5641.202091010
Citation: GUO Xiaozhe, PENG Dunlu, ZHANG Yatong, PENG Xuegui. GRS: A generative retrieval dialogue model for intelligent customer service in the field of e-commerce[J]. Journal of East China Normal University (Natural Sciences), 2020, (5): 156-166. doi: 10.3969/j.issn.1000-5641.202091010

GRS: 一种面向电商领域智能客服的生成-检索式对话模型

doi: 10.3969/j.issn.1000-5641.202091010
详细信息
    通讯作者:

    彭敦陆,男,教授,博士生导师,研究方向为自然语言处理、计算机视觉等. E-mail:pengdl@usst.edu.cn

  • 中图分类号: TP391

GRS: A generative retrieval dialogue model for intelligent customer service in the field of e-commerce

  • 摘要: 目前大多数智能聊天系统的实现主要有两种方式. 检索式得到的回复准确且有意义, 但回复内容和回复类型却受限于所选择的语料库. 生成式可以获得语料库中没有的回复, 更具灵活性, 但是容易产生一些错误或是无意义的回复内容. 为了解决上述问题, 本文提出一种新的模型GRS(Generative-Retrieval-Score), 此模型可以同时训练检索模型和生成模型, 并用一个打分模块对检索模型和生成模型的结果进行打分排序, 将得分最高的回复作为整个对话系统的输出, 进而巧妙地将两种方法的优点结合起来, 使最终得到的回复具体多样, 且生成的回复形式灵活多变. 在真实的京东智能客服对话数据集上的实验表明, 本文提出的模型比现有的检索式模型和生成式模型在多轮对话建模上有着更优异的表现.
  • 图  1  JDDC数据集轮数分布直方图

    Fig.  1  Round distribution histogram of the JDDC corpus

    图  2  GRS模型框架图

    Fig.  2  GRS model frame diagram

    图  3  Transformer结构图

    Fig.  3  Transformer model frame diagram

    图  4  自注意力权重的可视化

    Fig.  4  Visualization of the self-attention weight

    表  1  基于检索式和基于生成式对话系统各自的特征

    Tab.  1  Characteristics of retrieval-based and generative dialogue systems

    名称优点不足
    检索式倾向于人类正常对话; 表达方式具有多样性不适合随机对话; 语料库的大小是“瓶颈”
    生成式适合随机对话; 具有高度一致性信息不足; 回复内容单一
    下载: 导出CSV

    表  2  JDDC数据集统计信息

    Tab.  2  Statistics on the JDDC corpus

    总对话数总句子数总单词数句子的平均单词数对话的平均轮数最大轮数最小轮数
    1 024 19620 451 337150 716 1727.420832
    下载: 导出CSV

    表  3  自动评估结果

    Tab.  3  Automatic evaluation results

    BLEURouge-LDist-1Dist-2
    BM25 9.94 19.47 5.03% 28.89%
    BERT-Retrieval 10.27 19.90 5.23% 30.85%
    Vanilla Seq2Seq 9.02 17.11 1.49% 4.25%
    Seq2Seq-Attention 14.15 22.17 1.79% 6.31%
    Seq2Seq-Copy 14.27 23.62 1.79% 6.14%
    GRS 15.53 25.37 4.87% 27.54 %
    下载: 导出CSV
  • [1] KEARNS L I J M, P KORMANN D, SINGH S, et al. Cobot in LambdaMOO: A social statistics agent [C]// Proceedings of the Seventeenth National Conference on Artificial Intelligence. 2001: 36-41.
    [2] JI Z, LU Z, LI H. An information retrieval approach to short text conversation [EB/OL].(2014-08-29)[2020-07-01] https://arxiv.org/pdf/1408.6988.pdf.
    [3] SORDONI A, GALLEY M, AULI M, et al. A neural network approach to context-sensitive generation of conversational responses [C]// Proceeding of NAACL-HLT. 2015: 196-205.
    [4] SERBAN I V, SORDONI A, BENGIO Y. et al. Building end-to-end dialogue systems using generative hierarchical neural network models [C]// Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence. 2016: 3776-3784.
    [5] HOCHREITER S, SCHMIDHUBER J. Long short-term memory [J]. Neural Computation, 1997, 9(8): 1735-1780.
    [6] CHO K, MERRIENBOER B V, GULCEHRE C, et al. Learning phrase representations using RNN encoder decoder for statistical machine translation[EB/OL]. (2014-09-03)[2020-07-01]. https://arxiv.org/pdf/1406.1078.pdf.
    [7] YAN R, SONG Y P, WU H. Learning to respond with deep neural networks for retrieval-based human-computer conversation system [C]// Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval. 2016: 55-64.
    [8] WU Y, WU W, XING C, et al. Sequential matching network: A new architecture for multi-turn response selection in retrieval-based chatbots [EB/OL]. (2016-12-06)[2020-07-01]. https://arxiv.org/pdf/1612.01627.pdf.
    [9] YANG L, QIU M, QU C, et al. Response ranking with deep matching networks and external knowledge in information-seeking conversation systems [EB/OL]. (2018-05-01)[2020-07-01]. https://arxiv.org/pdf/1805.00188.pdf.
    [10] LI X, MOU L L, YAN R, ZHANG M. StalemateBreaker: A proactive content introducing approach to automatic human-computer conversation[EB/OL]. (2016-04-15)[2020-07-01]. https://arxiv.org/pdf/1604.04358.pdf.
    [11] GAO J F, GALLEY M, LI L H. Neural approaches to conversational AI [EB/OL]. (2019-09-10)[2020-07-01]. https://arxiv.org/pdf/1809.08267.pdf.
    [12] RITTER A, CHERRY C, DOLAN W B. Data-driven response generation in social media[C]//EMNLP '11: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011: 583–593.
    [13] ZOPH B, KNIGHT K. Multi-source neural translation[EB/OL]. (2019-01-05)[2020-07-01]. https://arxiv.org/pdf/1601.00710.pdf.
    [14] SUTSKEVER I, VINYALS O, LE Q V. Sequence to sequence learning with neural networks[EB/OL]. (2014-12-14)[2020-07-01]. https://arxiv.org/pdf/1409.3215.pdf.
    [15] VINYALS O, LE Q. A neural conversational model [EB/OL]. (2015-06-19)[2020-07-01]. https://arxiv.org/pdf/1506.05869.pdf.
    [16] SERBAN I V, SORDONI A, LOWE R, et al. A hierarchical latent variable encoder-decoder model for generating dialogues [EB/OL].(2016-06-14)[2020-07-01]. https://arxiv.org/pdf/1605.06069.pdf.
    [17] BAHDANAU D, CHO K, BENGIO Y. Neural machine translation by jointly learning to align and translate [EB/OL]. (2016-05-19)[2020-07-01]. https://arxiv.org/pdf/1409.0473.pdf.
    [18] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017: 6000-6010.
    [19] DEVLIN J, CHANG M W, LEE K, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[EB/OL]. (2019-05-24)[2020-07-01]. https://arxiv.org/pdf/1810.04805.pdf.
    [20] SABOUR S, FROSST N, HINTON G E. Dynamic routing between capsules[EB/OL]. (2017-11-01)[2020-07-01]. https://arxiv.org/pdf/1710.09829.pdf.
    [21] YANG M, ZHAO W, YE J, et al. Investigating capsule networks with dynamic routing for text classification[EB/OL]. (2018-09-03)[2020-07-01]. https://arxiv.org/pdf/1804.00538.pdf.
    [22] ZHANG N, DENG S, SUN Z, et al. Attention-based capsule networks with dynamic routing for relation extraction [EB/OL]. (2018-12-29)[2020-07-01]. https://arxiv.org/pdf/1812.11321.pdf.
    [23] CHEN M, LIU R X, SHEN L. The JDDC corpus: A large-scale multi-turn Chinese dialogue dataset for e-commerce customer service [EB/OL]. (2019-11-25)[2020-07-01]. https://arxiv.org/pdf/1911.09969v2.pdf.
  • 加载中
图(4) / 表(3)
计量
  • 文章访问数:  98
  • HTML全文浏览量:  200
  • PDF下载量:  10
  • 被引次数: 0
出版历程
  • 收稿日期:  2020-08-12
  • 网络出版日期:  2020-09-24
  • 刊出日期:  2020-09-24

目录

    /

    返回文章
    返回