Gate-Enhanced Multi-domain Dialog State Tracking for Task-Oriented Dialogue Systems

Yu, Changhong; Zhang, Chunhong; Hu, Zheng; Zhan, Zhiqiang

doi:10.1007/978-981-19-8493-8_43

Changhong Yu⁴¹,
Chunhong Zhang⁴¹,
Zheng Hu⁴² &
…
Zhiqiang Zhan⁴¹

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 984))

409 Accesses

Abstract

Dialogue state tracking (DST) task is a core component of task-oriented dialogue systems. Recently, several open-vocabulary-based models were proposed in multi-domain setting, which relies on copy mechanism and slot gate classification modules. However, as the ontology gets more and more complex, it becomes challenging to fill values to slots with type-specific features fully utilized, apply appropriate operation, and tackle the over-trivial carryover problem which used to be neglected. To address the above issues, a hierarchical gate-enhanced DST framework called DSA-Gate DST is proposed in this paper. Domain activity prediction and semantic confirming recognition modules are introduced to track slots from different domains discriminately. Experiment results on multi-domain task-oriented dialog corpora are conducted to show that our model outperforms various baseline algorithms in widely various language settings. Meanwhile, we conduct a comprehensive analysis on the noisy annotation in the MultiWoZ dataset from multiple aspects to explore the potential reasons which limiting DST’s performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 229.00; Price excludes VAT (USA)

Softcover Book: USD 299.99; Price excludes VAT (USA)

Hardcover Book: USD 299.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Novel Transformer-Based Model for Dialog State Tracking

Hierarchical temporal slot interactions for dialogue state tracking

Article 07 November 2022

Hierarchical Dialog State Tracking with Unknown Slot Values

Article 01 November 2018

References

Nouri E, Hosseiniasl E (2018) Toward scalable neural dialogue state tracking model. In: 32nd Conference on neural information processing systems (NeurIPS 2018)
Google Scholar
Ramadan O (2018) Large-scale multi-domain belief tracking with knowledge sharing. arXiv preprint arXiv:1807.06517v1
Gao S, Sethi A, Aggarwal S, Chung T, Hakkani-Tur D (2019) Dialog state tracking: a neural reading comprehension approach. arXiv preprint arXiv:1908.01946.
Kim S, Yang S, Kim G, Lee S-W (2019) Efficient dialogue state tracking by selectively overwriting memory. arXiv preprint arXiv:1911.03906
Wu C-S, Madotto A, Hosseini-Asl E, Xiong C, Socher R, Fung P (2019a) Transferable multi-domain state generator for task-oriented dialogue systems. arXiv preprint arXiv:1905.08743
Wu C-S, Socher R, Xiong C (2019b) Global-to-local memory pointer networks for task-oriented dialogue. arXiv preprint arXiv:1901.04713
Lei W, Jin X, Kan M-Y, Ren Z, He X, Yin D (2018) Sequicity: simplifying task-oriented dialogue systems with single sequence-to-sequence architectures. In: Proceedings of the 56th annual meeting of the association for computational linguistics, vol 1, Long Papers, pp 1437–1447
Google Scholar
Ma Y, Zeng Z, Zhu D, Li X, Yang Y, Yao X, Zhou K, Shen J (2019) An end-to-end dialogue state tracking system with machine reading comprehension and wide & deep classification. arXiv preprint arXiv:1912.09297v1
Zhang L, Wang H (2021) Learn to focus: hierarchical dynamic copy network for dialogue state tracking. arXiv preprint arXiv:2107.1177825
Google Scholar
Zhang J, Hashimoto K, Wu C, Wan Y, Yu PS, Socher R, Xiong C (2019) Find or classify? Dual strategy for slot-value predictions on multi-domain dialog state tracking. arXiv preprint arXiv:1910.03544
Goel R, Paul S, Hakkani-Tür D (2019) Hyst: a hybrid approach for flexible and accurate dialogue state tracking. arXiv preprint arXiv:1907.00883
Lee H, Lee J, Kim T-Y (2019) SUMBT: slot-utterance matching for universal and scalable belief tracking. arXiv preprint arXiv:1907.07421
Shan Y, Li Z, Zhang J, Meng F, Feng Y, Niu C, Zhou J (2020) A contextual hierarchical attention network with adaptive objective for dialogue state tracking. arXiv preprint arXiv:2006.01554
Cho K, van Merrienboer B, Bahdanau D, Bengio Y (2014) On the properties of neural machine translation: encoder-decoder approaches. arXiv preprint arXiv:1409.1259
Tsengy B-H, Daiz Y, Kreyssigy F, Byrne B (2021) Transferable dialogue systems and user simulators. arXiv preprint arXiv:2107.11904
Kim S, Chang M, Lee S-W (2021) NeuralWOZ: learning to collect task-oriented dialogue via model-based simulation. arXiv preprint arXiv:2105.14454
Le HT, Sahoo D, Liu C, Chen NF, Hoi SCH (2020b) Uniconv: a unified conversational neural architecture for multi-domain task-oriented dialogues. arXiv preprint arXiv:2004.14307
Ren L, Ni J, McAuley J (2019) Scalable and accurate dialogue state tracking via hieorarchical sequence generation, pp 1876–1885
Google Scholar
Devlin J, Chang M, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Article Google Scholar
Lin T-Y, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
Google Scholar
Eric M, Goel R, Paul S, Kumar A, Sethi A, Ku P, Goyal AK, Agarwal S, Gao S, Hakkani-Tur D (2019) MultiWOZ 2.1: a consolidated multi-domain dialogue dataset with state corrections and state tracking baselines. arXiv preprint arXiv:1907.01669v1.
Kingma DP, Ba J (2014). Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15:1929–1958
MathSciNet MATH Google Scholar
Budzianowski P, Wen T-H, Tseng B-H, Casanueva I, Ultes S, Ramadan O, Gašić M (2018) Multiwoz—a large-scale multi-domain wizard-of-oz dataset for task-oriented dialogue modelling. arXiv preprint arXiv:1810.00278.

Download references

Acknowledgements

This work was supported by the National Key R&D Program of China under grant 2019YFF0302601.

Author information

Authors and Affiliations

Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing Univerity of Post and Telecommunications, Beijing, 100876, China
Changhong Yu, Chunhong Zhang & Zhiqiang Zhan
State Key Laboratory of Networking and Switching Technology, Beijing University of Post and Telecommunications, Beijing, 100876, China
Zheng Hu

Authors

Changhong Yu
View author publications
You can also search for this author in PubMed Google Scholar
Chunhong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zheng Hu
View author publications
You can also search for this author in PubMed Google Scholar
Zhiqiang Zhan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chunhong Zhang .

Editor information

Editors and Affiliations

Department of Mechanical Engineering, MCKV Institute of Engineering, Howrah, West Bengal, India
Prasenjit Chatterjee
Department of Operations Research and Statistics, Faculty of Organizational Sciences, University of Belgrade, Belgrade, Serbia
Dragan Pamucar
Universidad Internacional de Valencia, Valencia, Spain
Morteza Yazdani
Department of Mechanical Engineering, National Institute of Technology Kurukshetra, Kurukshetra, Haryana, India
Dilbagh Panchal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yu, C., Zhang, C., Hu, Z., Zhan, Z. (2023). Gate-Enhanced Multi-domain Dialog State Tracking for Task-Oriented Dialogue Systems. In: Chatterjee, P., Pamucar, D., Yazdani, M., Panchal, D. (eds) Computational Intelligence for Engineering and Management Applications. Lecture Notes in Electrical Engineering, vol 984. Springer, Singapore. https://doi.org/10.1007/978-981-19-8493-8_43

Download citation

DOI: https://doi.org/10.1007/978-981-19-8493-8_43
Published: 30 April 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-8492-1
Online ISBN: 978-981-19-8493-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Gate-Enhanced Multi-domain Dialog State Tracking for Task-Oriented Dialogue Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Novel Transformer-Based Model for Dialog State Tracking

Hierarchical temporal slot interactions for dialogue state tracking

Hierarchical Dialog State Tracking with Unknown Slot Values

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Gate-Enhanced Multi-domain Dialog State Tracking for Task-Oriented Dialogue Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Novel Transformer-Based Model for Dialog State Tracking

Hierarchical temporal slot interactions for dialogue state tracking

Hierarchical Dialog State Tracking with Unknown Slot Values

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation