Abstract
The task of resolving repeated objects in natural languages is known as coreference resolution, and it is an important part of modern natural language processing. It is classified into two categories depending on the resolved objects, namely entity coreference resolution and event coreference resolution. Predicting coreference connections and identifying mentions/triggers are the major challenges in coreference resolution, because these implicit relationships are particularly difficult in natural language understanding in downstream tasks. Coreference resolution techniques have experienced considerable advances in recent years, encouraging us to review this task in the following aspects: current employed evaluation metrics, datasets, and methods. We investigate 10 widely used metrics, 18 datasets and 4 main technical trends in this survey. We believe that this work is a comprehensive roadmap for understanding the past and the future of coreference resolution.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
either as entity mentions or event mentions.
References
Abzaliev A (2019) On GAP coreference resolution shared task: insights from the 3rd place solution. In: Proceedings of the first workshop on gender bias in natural language processing, Florence, pp 107–112. Association for Computational Linguistics
Agarwal O, Subramanian S, Nenkova A, Roth D (2019) Evaluation of named entity coreference. In: Proceedings of the second workshop on computational models of reference, anaphora and coreference, Minneapolis, pp 1–7. Association for Computational Linguistics
Angeli G, Johnson Premkumar MJ, Manning CD (2015) Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (Volume 1: Long Papers) Beijing, pp 344–354. Association for Computational Linguistics
Aralikatte R, Lent H, Gonzalez AV, Herschcovich D, Qiu C, Sandholm A, Ringaard M, Søgaard A (2019) Rewarding coreference resolvers for being consistent with world knowledge. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong pp 1229–1235. Association for Computational Linguistics
Atkinson J, Salas G, Figueroa A (2015) Improving opinion retrieval in social media by combining features-based coreferencing and memory-based learning. Inform Sci 299:20–31. https://doi.org/10.1016/j.ins.2014.12.021
Attree S (2019), August. Gendered ambiguous pronouns shared task: Boosting model confidence by evidence pooling. In: Proceedings of the first workshop on gender bias in natural language processing, Florence, pp 134–146. Association for Computational Linguistics
Bagga A, Baldwin B (1998) Algorithms for scoring coreference chains. In: The first international conference on language resources and evaluation workshop on linguistics coreference, Volume 1, pp 563–566. Citeseer
Bamman D, Lewke O, Mansoor A (2020) An annotated dataset of coreference in English literature. In: Proceedings of the 12th Language Resources and Evaluation Conference, Marseille, pp 44–54. European Language Resources Association
Beltagy I, Peters ME, Cohan A (2020) Longformer: the long-document transformer. CoRR abs/2004.05150. arXiv:2004.05150
Bhattacharjee S, Haque R, de Buy Wenniger GM, Way A (2020) Investigating query expansion and coreference resolution in question answering on BERT. In: International conference on applications of natural language to information systems, pp 47–59. Springer
Bornstein A, Cattan A, Dagan I (2020) CoRefi: a crowd sourcing suite for coreference annotation. In; Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations
Brasoveanu A (2008) Donkey pluralities: plural information states versus non-atomic individuals. Linguist Philos 31(2):129–209
Bussmann H, Kazzazi K, Trauth G (2006) Routledge dictionary of language and linguistics. Routledge, London
Caciularu A, Cohan A, Beltagy I, Peters M, Cattan A, Dagan I (2021) November. CDLM: cross-document language modeling. In: Findings of the association for computational linguistics: EMNLP 2021, Punta Cana, Dominican Republic, pp 2648–2662. Association for Computational Linguistics
Cambria E, Liu Q, Decherchi S, Xing F, Kwok K (2022) SenticNet 7: a commonsense-based neurosymbolic AI framework for explainable sentiment analysis. In: Proceedings of the 13th language resources and evaluation conference, pp 3829–3839
Cattan A, Eirew A, Stanovsky G, Joshi M, Dagan I (2020) Streamlining cross-document coreference resolution: evaluation and modeling. CoRR abs/2009.11032. arXiv:2009.11032
Cattan A, Eirew A, Stanovsky G, Joshi M, Dagan I (2021) Cross-document coreference resolution over predicted mentions. In: Findings of the association for computational linguistics: ACL-IJCNLP 2021, Online, pp 5100–5107. Association for Computational Linguistics
Chaturvedi I, Satapathy R, Cavallari S, Cambria E (2019) Fuzzy commonsense reasoning for multimodal sentiment analysis. Pattern Recognit Lett 125:264–270
Chen G, Van DeemterK, Lin C (2018) Modelling pro-drop with the rational speech acts model. In: Proceedings of the 11th international conference on natural language generation, pp 57–66. Association for Computational Linguistics (ACL)
Clark K, Manning CD (2015) Entity-centric coreference resolution with model stacking. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (Volume 1: Long Papers), Beijing, pp 1405–1415. Association for Computational Linguistics
Clark K, Manning CD (2016) Deep reinforcement learning for mention-ranking coreference models. In: Proceedings of the 2016 conference on empirical methods in natural language processing, Austin, pp 2256–2262. Association for Computational Linguistics
Cybulska A , Vossen P (2014) Using a sledgehammer to crack a nut? lexical diversity and event coreference resolution. In: Proceedings of the ninth international conference on language resources and evaluation (LREC’14), Reykjavik, pp 4545–4552. European Language Resources Association (ELRA)
Dai Z, Fei H, Li P (2019) Coreference aware representation learning for neural named entity recognition. In: Proceedings of the Twenty-eighth international joint conference on artificial intelligence, IJCAI-19, pp 4946–4953. International Joint Conferences on Artificial Intelligence Organization
Dakle PP, Desai T, Moldovan D (2020) A study on entity resolution for email conversations. In: Proceedings of the 12th language resources and evaluation conference, Marseille, pp 65–73. European Language Resources Association
Dasigi P, Liu NF, Marasović A, Smith NA, Gardner M (2019) QUOREF: a reading comprehension dataset with questions requiring coreferential reasoning. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong, pp 5925–5932. Association for Computational Linguistics
Davis E, Morgenstern L, Ortiz C (2017) The first Winograd schema challenge at IJCAI-16. AI Mag 38(3):97–98. https://doi.org/10.1609/aimag.v38i4.2734
de Marneffe MC, Rafferty AN, Manning CD (2008) Finding contradictions in text. In: Proceedings of ACL-08: HLT, Columbus, pp 1039–1047. Association for Computational Linguistics
Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American chapter of the association for computational linguistics: human language technologies, Volume 1 (Long and Short Papers), Minneapolis, pp 4171–4186. Association for Computational Linguistics
Ding X, Liu B (2010) Resolving object and attribute coreference in opinion mining. In: Proceedings of the 23rd international conference on computational linguistics (COLING 2010), Beijing, pp 268–276. COLING 2010 Organizing Committee
Dobrovolskii V (2021) Word-level coreference resolution. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Online and Punta Cana, Dominican Republic, pp 7670–7675. Association for Computational Linguistics
Durrett G, Klein D (2014) A joint model for entity analysis: coreference, typing, and linking. Trans Assoc Comput Linguist 2:477–490. https://doi.org/10.1162/tacl_a_00197
Eirew A, Cattan A, Dagan I (2021) WEC: deriving a large-scale cross-document event coreference dataset from Wikipedia. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 2498–2510. Association for Computational Linguistics
Ellis J, Getman J, Fore D, Kuster N, Song Z, Bies A, Strassel SM (2015) Overview of linguistic resources for the TAC KBP 2015 evaluations: Methodologies and results. In: Proceedings of the 2015 text analysis conference, TAC 2015, Gaithersburg, November 16–17, 2015, 2015. NIST
Ellis J, Getman J, Kuster N, Song Z, Bies A, Strassel SM (2016) Overview of linguistic resources for the TAC KBP 2016 evaluations: Methodologies and results. In: Proceedings of the 2016 Text analysis conference, TAC 2016, Gaithersburg, November 14–15, 2016. NIST
Ellis K, Albright A, Solar-Lezama A, Tenenbaum JB, O’Donnell TJ (2022) Synthesizing theories of human language with Bayesian program induction. Nat Commun 13(1):1–13
Emami A, Trichelair P, Trischler A, Suleman K, Schulz H, Cheung JCK (2019) The KnowRef coreference corpus: removing gender and number cues for difficult pronominal anaphora resolution. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, pp. 3952–3961. Association for Computational Linguistics
Emami A, Trischler A, Suleman K, Cheung JCK (2018), June. A generalized knowledge hunting framework for the Winograd schema challenge. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, New Orleans, Louisiana, USA, pp. 25–31. Association for Computational Linguistics
Fabbri A, Li I, She T, Li S, Radev D (2019) Multi-news: a large-scale multi-document summarization dataset and abstractive hierarchical model. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, pp 1074–1084. Association for Computational Linguistics
Ferracane E, Marshall I, Wallace BC, Erk K (2016) Leveraging coreference to identify arms in medical abstracts: an experimental study. In: Proceedings of the seventh international workshop on health text mining and information analysis, Auxtin, pp 86–95. Association for Computational Linguistics
Gardner M, Grus J, Neumann M, Tafjord O, Dasigi P, Liu NF, Peters M, Schmitz M, Zettlemoyer LS (2017) AllenNLP: a deep semantic natural language processing platform. In: Proceedings of workshop for NLP open source software (NLP-OSS)
Ge M, Mao R, Cambria E (2022) Explainable metaphor identification inspired by conceptual metaphor theory. In: Proceedings of the 36th AAAI conference on artificial intelligence, pp 10681–10689
Ghaddar A , Langlais P (2016) WikiCoref: an English coreference-annotated corpus of Wikipedia articles. In: Proceedings of the tenth international conference on language resources and evaluation (LREC’16), Portorož, Slovenia, pp 136–142. European Language Resources Association (ELRA)
Graves A, Schmidhuber J (2005) Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw 18(5):602–610. https://doi.org/10.1016/j.neunet.2005.06.042
Hinton G, Vinyals O, Dean J (2015) Distilling the knowledge in a neural network. In: NIPS deep learning and representation learning workshop
Hovy E, Marcus M, Palmer M, Ramshaw L, Weischedel R (2006) OntoNotes: the 90% solution. In: Proceedings of the human language technology conference of the NAACL, companion volume: short papers, New York City, pp 57–60. Association for Computational Linguistics
Huang YJ, Lu J, Kurohashi S, Ng V (2019) Improving event coreference resolution by learning argument compatibility from unlabeled data. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (Long and Short Papers), Minneapolis, pp 4171–4186. Association for Computational Linguistics
Joshi M, Chen D, Liu Y, Weld DS, Zettlemoyer L, Levy O (2020) SpanBERT: improving pre-training by representing and predicting spans. Trans Assoc Comput Linguist 8:64–77. https://doi.org/10.1162/tacl_a_00300
Joshi M, Levy O, Zettlemoyer L, Weld D (2019) BERT for coreference resolution: Baselines and analysis. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong, pp 5803–5808. Association for Computational Linguistics
Khashabi D, Chaturvedi S, Roth M, Upadhyay S, Roth D (2018) Looking beyond the surface: a challenge set for reading comprehension over multiple sentences. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (Long Papers), New Orleans, pp 252–262. Association for Computational Linguistics
Khosla S, Rose C (2020) Using type information to improve entity coreference resolution. In: Proceedings of the first workshop on computational approaches to discourse, pp 20–31. Association for Computational Linguistics
Kirstain Y, Ram O, Levy O (2021) Coreference resolution without span representations. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (Volume 2: Short Papers), pp 14–19. Association for Computational Linguistics
Kocijan V, Camburu OM, Cretu AM, Yordanov Y, Blunsom P, Lukasiewicz T (2019) WikiCREM: a large unsupervised corpus for coreference resolution. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong, pp 4303–4312. Association for Computational Linguistics
Kopeć M (2014) MMAX2 for coreference annotation. In: Proceedings of the demonstrations at the 14th conference of the European chapter of the association for computational linguistics, Gothenburg, pp 93–96. Association for Computational Linguistics
Krishna MH, Rahamathulla K, Akbar A (2017) A feature based approach for sentiment analysis using SVM and coreference resolution. In: 2017 International conference on inventive communication and computational technologies (ICICCT), pp 397–399
Kuhn HW (1955) The Hungarian method for the assignment problem. Naval Res Logist Q 2(1–2):83–97
Kundu G, Sil A, Florian R, Hamza W (2018) Neural cross-lingual coreference resolution and its application to entity linking. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 2: short papers), Melbourne, pp 395–400. Association for Computational Linguistics
Lai T, Ji H, Bui T, Tran QH, Dernoncourt F, Chang W (2021) A context-dependent gated module for incorporating symbolic semantics into event coreference resolution. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 3491–3499. Association for Computational Linguistics
Lai TM, Bui T, Kim DS (2022) End-to-end neural coreference resolution revisited: a simple yet effective baseline. In: ICASSP 2022-2022 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 8147–8151
Lan Z, Chen M, Goodman S, Gimpel K, Sharma P, Soricut R (2019) ALBERT: a lite BERT for self-supervised learning of language representations. In: International conference on learning representations
Lee K, He L, Lewis M, Zettlemoyer L (2017) End-to-end neural coreference resolution. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, pp. 188–197. Association for Computational Linguistics
Lee K, He L, Zettlemoyer L (2018) Higher-order coreference resolution with coarse-to-fine inference. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 2 (short papers), New Orleans, pp 687–692. Association for Computational Linguistics
Levesque H, Davis E, Morgenstern L (2012). The Winograd schema challenge. In: Thirteenth international conference on the principles of knowledge representation and reasoning. Citeseer
Levesque HJ (2011) The Winograd schema challenge. In: Logical formalizations of commonsense reasoning, Papers from the 2011 AAAI spring symposium, Technical Report SS-11-06, Stanford, March 21–23, 2011. AAAI
Levy S, Lazar K, Stanovsky G (2021) Collecting a large-scale gender bias dataset for coreference resolution and machine translation. In: Findings of the Association for Computational Linguistics: EMNLP 2021, Punta Cana, pp 2470–2480. Association for Computational Linguistics
Li X, Van Deemter K, Lin C (2018) Statistical NLG for generating the content and form of referring expressions. In: Proceedings of the 11th international conference on natural language generation. Association for Computational Linguistics (ACL)
Lin Q, Mao R, Liu J, Xu F, Cambria E (2023) Fusing topology contexts and logical rules in language models for knowledge graph completion. Inform Fus 90:253–264
Lin Y, Ji H, Huang F, Wu L (2020) A joint neural model for information extraction with global features. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp. 7999–8009. Association for Computational Linguistics
Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) RoBERTa: a robustly optimized BERT pretraining approach. CoRR abs/1907.11692. arXiv:1907.11692
Liu Z, Shi K, Chen N (2021) Coreference-aware dialogue summarization. In: Proceedings of the 22nd annual meeting of the special interest group on discourse and dialogue, Singapore, pp. 509–519. Association for Computational Linguistics
Lu J, Ng V (2018) Event coreference resolution: a survey of two decades of research. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence, IJCAI-18, pp 5479–5486. International Joint Conferences on Artificial Intelligence Organization
Lu, J, Ng V (2020) Conundrums in entity coreference resolution: Making sense of the state of the art. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp. 6620–6631. Association for Computational Linguistics
Lu J, Ng V, (2021a) Constrained multi-task learning for event coreference resolution. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 4504–4514. Association for Computational Linguistics
Lu J, Ng V, (2021b) Conundrums in event coreference resolution: Making sense of the state of the art. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Punta Cana, pp 1368–1380. Association for Computational Linguistics
Lu J, Ng V. (2021c) Span-based event coreference resolution. In: Proceedings of the AAAI conference on artificial intelligence 35(15): 13489–13497. https://doi.org/10.1609/aaai.v35i15.17591
Lu Y, Lin H, Tang J, Han X, Sun L (2022) End-to-end neural event coreference resolution. Artificial Intell 303:103632. https://doi.org/10.1016/j.artint.2021.103632
Luo X (2005) On coreference resolution performance metrics. In: Proceedings of human language technology conference and conference on empirical methods in natural language processing, Vancouver, pp 25–32. Association for Computational Linguistics
Luo X, Pradhan S (2016) Evaluation metrics, Anaphora resolution. Springer, Berlin, pp 141–163. https://doi.org/10.1007/978-3-662-47909-4_5
Mao R, Li X (2021) Bridging towers of multi-task learning with a gating mechanism for aspect-based sentiment analysis and sequential metaphor identification. Proc AAAI Conf Artif Intell 35(15):13534–13542
Mao R, Li X, Ge M, Cambria E (2022) MetaPro: a computational metaphor processing model for text pre-processing. Inform Fus 86–87:30–43
Mao R, Lin C, Guerin F (2018) Word embedding and WordNet based metaphor identification and interpretation. In: Proceedings of the 56th annual meeting of the association for computational linguistics, vol 1, pp 1222–1231
Mao R, Lin C, Guerin F (2019) End-to-end sequential metaphor identification inspired by linguistic theories. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 3888–3898
Miltsakaki E (2007) A rethink of the relationship between salience and anaphora resolution. In: Proceedings of the 6th discourse anaphora and anaphor resolution colloquium, pp 91–96
Mitamura T, Liu Z, Hovy EH (2016) Overview of TAC-KBP 2016 event nugget track. In: Proceedings of the 2016 text analysis conference, TAC 2016, Gaithersburg, November 14–15, 2016. NIST
Mitamura T, Liu Z, Hovy EH (2017) Events detection, coreference and sequencing: what’s next? Overview of the TAC KBP 2017 event track. In: TAC
Mitkov R (1999) Anaphora resolution: the state of the art. Citeseer
Moosavi NS, Strube M (2016) Which coreference evaluation metric do you trust? a proposal for a link-based entity aware metric. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: Long Papers), Berlin, pp 632–642. Association for Computational Linguistics
Murugesan K, Atzeni M, Kapanipathi P, Shukla P, Kumaravel S, Tesauro G, Talamadupula K, Sachan M, Campbell M (2021) Text-based RL agents with commonsense knowledge: new challenges, environments and baselines. In: Thirty fifth AAAI conference on artificial intelligence
Ng V (2010) Supervised noun phrase coreference research: The first fifteen years. In: Proceedings of the 48th annual meeting of the association for computational linguistics, Uppsala, pp 1396–1411. Association for Computational Linguistics
Oberle B (2018) SACR: a drag-and-drop based tool for coreference annotation. In: NCC Chair, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, PiperidisS, Tokunaga T (eds.), Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018), Miyazaki. European Language Resources Association (ELRA)
O’Gorman T, Wright-Bettner K, Palmer M (2016) Richer event description: Integrating event coreference with temporal, causal and bridging annotation. In: Proceedings of the 2nd workshop on computing news storylines (CNS 2016), Austin, pp 47–56. Association for Computational Linguistics
OpenAI (2022) Introducing ChatGPT
OpenAI (2023) GPT-4 technical report
Peng H, Chang KW, Roth D (2015) A joint framework for coreference resolution and mention head detection. In: Proceedings of the nineteenth conference on computational natural language learning, Beijing, pp 12–21. Association for Computational Linguistics
Pennington J, Socher R, Manning C (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), Doha, Qatar, pp 1532–1543. Association for Computational Linguistics
Poesio M, Stuckardt R, Versley Y (2016) Anaphora resolution-algorithms, resources, and applications. Theory and applications of natural language processing. Springer, New York
Pradhan S, Moschitti A, Xue N, Uryupina O, Zhang Y (2012) CoNLL-2012 shared task: modeling multilingual unrestricted coreference in OntoNotes. In: Joint conference on EMNLP and CoNLL - Shared Task, Jeju Island, pp 1–40. Association for Computational Linguistics
Raghunathan K, Lee H, Rangarajan S, Chambers N, Surdeanu M, Jurafsky D, Manning C (2010a) A multi-pass sieve for coreference resolution. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Cambridge, MA, pp 492–501. Association for Computational Linguistics
Raghunathan K, Lee H, Rangarajan S, Chambers N, Surdeanu M, Jurafsky D, Manning C (2010b) A multi-pass sieve for coreference resolution. In: Empirical methods in natural language processing (EMNLP)
Rahman A, Ng V (2012) Resolving complex cases of definite pronouns: the Winograd schema challenge. In: Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning, Jeju, pp 777–789. Association for Computational Linguistics
Rajpurkar P, Zhang J, Lopyrev K, Liang P (2016) SQuAD: 100,000+ questions for machine comprehension of text. In: Proceedings of the 2016 conference on empirical methods in natural language processing, Austin, pp 2383–2392. Association for Computational Linguistics
Recasens M, Hovy E (2011) BLANC: implementing the rand index for coreference evaluation. Nat Language Eng 17(4):485–510
Reiter N (2018) CorefAnnotator: a new annotation tool for entity references. Data in the Digital Humanities. In: Abstracts of EADH
Riedel S, Yao L, McCallum A, Marlin BM (2013) Relation extraction with matrix factorization and universal schemas. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, Atlanta, pp 74–84. Association for Computational Linguistics
Ross S, Gordon G, Bagnell D (2011) A reduction of imitation learning and structured prediction to no-regret online learning. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp 627–635. JMLR Workshop and Conference Proceedings
Rudinger R, Naradowsky J, Leonard B, Van Durme B (2018) Gender bias in coreference resolution. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 2 (Short Papers), New Orleans, pp 8–14. Association for Computational Linguistics
Shlain M, Taub-Tabib H, Sadde S, Goldberg Y (2020) Syntactic search by example. In : Proceedings of the 58th annual meeting of the association for computational linguistics: system demonstrations, pp 17–23. Association for Computational Linguistics
Stenetorp P, Pyysalo S, Ananiadou S, Tsujii J (2011) Almost total recall: semantic category disambiguation using large lexical resources and approximate string matching. In: Proceedings of the fourth international symposium on languages in biology and medicine. Citeseer
Stenetorp P, Pyysalo S, Topić G, Ohta T, Ananiadou S, Tsujii J (2012) April. BRAT: a web-based tool for NLP-assisted text annotation. In: Proceedings of the demonstrations at the 13th conference of the european chapter of the association for computational linguistics, Avignon, pp. 102–107. Association for Computational Linguistics
Stoyanov V, Gilbert N, Cardie C, Riloff E (2009), August. Conundrums in noun phrase coreference resolution: Making sense of the state-of-the-art. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP, Suntec, Singapore, pp 656–664. Association for Computational Linguistics
Sukthanker R, Poria S, Cambria E, Thirunavukarasu R (2020) Anaphora and coreference resolution: a review. Inform Fus 59:139–162
Sun Y, Wang S, Feng S, Ding S, Pang S, Shang J, Liu J, Chen X, Zhao Y, Lu Y, Liu W, Wu Z, Gong W, Liang J, Shang Z, Sun P, Liu W, Ouyang X, Yu D, Tian H, Wu H, Wang H (2021) ERNIE 3.0: large-scale knowledge enhanced pre-training for language understanding and generation. CoRR abs/2107.02137. arXiv:2107.02137
Teh Y, Bapst V, Czarnecki WM, Quan J, Kirkpatrick J, Hadsell R, Heess N, Pascanu R (2017) Distral: robust multitask reinforcement learning. In: Advances in neural information processing systems, pp. 4496–4506
Thirukovalluru R, Monath N, Shridhar K, Zaheer M, Sachan M, McCallum A (2021) Scaling within document coreference to long texts. In: Findings of the association for computational linguistics: ACL-IJCNLP 2021, pp 3921–3931. Association for Computational Linguistics
Turian J, Ratinov LA, Bengio Y (2010), July. Word representations: a simple and general method for semi-supervised learning. In: Proceedings of the 48th annual meeting of the association for computational linguistics, Uppsala, pp 384–394. Association for Computational Linguistics
Uzuner O, Bodnari A, Shen S, Forbush T, Pestian J, South BR (2012) Evaluating the state of the art in coreference resolution for electronic medical records. JAMIA 19(5):786–791
Varkel Y, Globerson A (2020) Pre-training mention representations in coreference models. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 8534–8540. Association for Computational Linguistics
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, pp 5998–6008
Verga P , McCallum A (2016) Row-less universal schema. In: Proceedings of the 5th workshop on automated knowledge base construction, San Diego, pp 63–68. Association for Computational Linguistics
Vilain M, Burger J, Aberdeen J, Connolly D, Hirschman L (1995) A model-theoretic coreference scoring scheme. In: Sixth message understanding conference (MUC-6): proceedings of a conference held in Columbia, Maryland, November 6–8, 1995
Wang A, Pruksachatkun Y, Nangia N, Singh A, Michael J, Hill F, Levy O, Bowman S (2019). SuperGLUE: a stickier benchmark for general-purpose language understanding systems. In: Wallach H, Larochelle H, Beygelzimer A, d’ Alché-Buc, F, Fox E, Garnett R (eds.), Advances in neural information processing systems, volume 32. Curran Associates, Inc
Wang Y, Shen Y, Jin H (2021) An end-to-end actor-critic-based neural coreference resolution system. In: ICASSP 2021-2021 IEEE International conference on acoustics, speech and signal processing (ICASSP), pp 7848–7852
Webster K, Recasens M, Axelrod V, Baldridge J (2018) Mind the GAP: a balanced corpus of gendered ambiguous pronouns. Trans Assoc Comput Linguist 6:605–617. https://doi.org/10.1162/tacl_a_00240
Welbl J, Stenetorp P, Riedel S (2018) Constructing datasets for multi-hop reading comprehension across documents. Trans Assoc Comput Linguist 6:287–302
Winograd T (1972) Understanding natural language. Cognit Psychol 3(1):1–191. https://doi.org/10.1016/0010-0285(72)90002-3
Wiseman S, Rush AM, Shieber SM (2016) Learning global features for coreference resolution. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, San Diego, pp 994–1004. Association for Computational Linguistics
Wu W, Wang F, Yuan A, Wu F, Li J (2020) CorefQA: coreference resolution as query-based span prediction. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 6953–6963. Association for Computational Linguistics
Xia P, Sedoc J, Van Durme B (2020) Incremental neural coreference resolution in constant memory. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 8617–8624. Association for Computational Linguistics
Xu L, Choi JD (2020) Revealing the myth of higher-order inference in coreference resolution. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 8527–8533. Association for Computational Linguistics
Yadav N, Monath N, Angell R, McCallum A (2021) Event and entity coreference using trees to encode uncertainty in joint decisions. In: Proceedings of the fourth workshop on computational models of reference, anaphora and coreference, Punta Cana, pp 100–110. Association for Computational Linguistics
Yang Z, Dai Z, Yang Y, Carbonell J, Salakhutdinov RR, Le QV (2019) XLNet: generalized autoregressive pretraining for language understanding. In: Wallach H, Larochelle H, Beygelzimer A, d’ Alché-Buc F, Fox E, Garnett R (eds.), Advances in neural information processing systems, volume 32. Curran Associates, Inc
Ye D, Lin Y, Du J, Liu Z, Li P, Sun M, Liu Z (2020) Coreferential reasoning learning for language representation. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 7170–7186. Association for Computational Linguistics
Yu J, Bohnet B, Poesio M (2020) Neural mention detection. In: LREC
Yu X, Yin W, Roth D (2020) Pairwise representation learning for event coreference
Zeldes A (2017) The GUM corpus: creating multilayer resources in the classroom. Lang Resour Eval 59:581–612. https://doi.org/10.1007/s10579-016-9343-x
Zeng Y, Jin X, Guan S, Guo J, Cheng X (2020) Event coreference resolution with their paraphrases and argument-aware embeddings. In: Proceedings of the 28th international conference on computational linguistics, Barcelona, pp 3084–3094. International Committee on Computational Linguistics
Zhang H, Song Y, Song Y, Yu D (2019) Knowledge-aware pronoun coreference resolution. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, pp 867–876. Association for Computational Linguistics
Zhang R, Nogueira dos Santos C, Yasunaga M, Xiang B, Radev D (2018) Neural coreference resolution with deep biaffine attention by joint mention detection and mention clustering. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 2: Short Papers), Melbourne, pp 102–107. Association for Computational Linguistics
Zhang X, Zhao J, LeCun Y (2015) Character-level convolutional networks for text classification. In: Cortes C, Lawrence N, Lee D, Sugiyama M, Garnett R (eds) Advances in neural information processing systems, vol 28. Curran Associates Inc, Red Hook
Zhao J, Wang T, Yatskar M, Ordonez V, Chang KW (2018) Gender bias in coreference resolution: evaluation and debiasing methods. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 2 (Short Papers), New Orleans, pp 15–20. Association for Computational Linguistics
Zhu P, Zhang Z, Li J, Huang Y, Zhao H (2018) Lingke: a fine-grained multi-turn chatbot for customer service. In: Proceedings of the 27th international conference on computational linguistics: system demonstrations, Santa Fe, pp 108–112. Association for Computational Linguistics
Acknowledgements
This study is supported under the RIE2020 Industry Alignment Fund-Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner(s).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, R., Mao, R., Luu, A.T. et al. A brief survey on recent advances in coreference resolution. Artif Intell Rev 56, 14439–14481 (2023). https://doi.org/10.1007/s10462-023-10506-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-023-10506-3