Skip to main content
Log in

A brief survey on recent advances in coreference resolution

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

The task of resolving repeated objects in natural languages is known as coreference resolution, and it is an important part of modern natural language processing. It is classified into two categories depending on the resolved objects, namely entity coreference resolution and event coreference resolution. Predicting coreference connections and identifying mentions/triggers are the major challenges in coreference resolution, because these implicit relationships are particularly difficult in natural language understanding in downstream tasks. Coreference resolution techniques have experienced considerable advances in recent years, encouraging us to review this task in the following aspects: current employed evaluation metrics, datasets, and methods. We investigate 10 widely used metrics, 18 datasets and 4 main technical trends in this survey. We believe that this work is a comprehensive roadmap for understanding the past and the future of coreference resolution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Notes

  1. either as entity mentions or event mentions.

  2. https://github.com/huggingface/neuralcoref.

  3. https://www.media.mit.edu/projects/open-mind-common-sense/overview/.

References

  • Abzaliev A (2019) On GAP coreference resolution shared task: insights from the 3rd place solution. In: Proceedings of the first workshop on gender bias in natural language processing, Florence, pp 107–112. Association for Computational Linguistics

  • Agarwal O, Subramanian S, Nenkova A, Roth D (2019) Evaluation of named entity coreference. In: Proceedings of the second workshop on computational models of reference, anaphora and coreference, Minneapolis, pp 1–7. Association for Computational Linguistics

  • Angeli G, Johnson Premkumar MJ, Manning CD (2015) Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (Volume 1: Long Papers) Beijing, pp 344–354. Association for Computational Linguistics

  • Aralikatte R, Lent H, Gonzalez AV, Herschcovich D, Qiu C, Sandholm A, Ringaard M, Søgaard A (2019) Rewarding coreference resolvers for being consistent with world knowledge. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong pp 1229–1235. Association for Computational Linguistics

  • Atkinson J, Salas G, Figueroa A (2015) Improving opinion retrieval in social media by combining features-based coreferencing and memory-based learning. Inform Sci 299:20–31. https://doi.org/10.1016/j.ins.2014.12.021

    Article  MathSciNet  Google Scholar 

  • Attree S (2019), August. Gendered ambiguous pronouns shared task: Boosting model confidence by evidence pooling. In: Proceedings of the first workshop on gender bias in natural language processing, Florence, pp 134–146. Association for Computational Linguistics

  • Bagga A, Baldwin B (1998) Algorithms for scoring coreference chains. In: The first international conference on language resources and evaluation workshop on linguistics coreference, Volume 1, pp 563–566. Citeseer

  • Bamman D, Lewke O, Mansoor A (2020) An annotated dataset of coreference in English literature. In: Proceedings of the 12th Language Resources and Evaluation Conference, Marseille, pp 44–54. European Language Resources Association

  • Beltagy I, Peters ME, Cohan A (2020) Longformer: the long-document transformer. CoRR abs/2004.05150. arXiv:2004.05150

  • Bhattacharjee S, Haque R, de Buy Wenniger GM, Way A (2020) Investigating query expansion and coreference resolution in question answering on BERT. In: International conference on applications of natural language to information systems, pp 47–59. Springer

  • Bornstein A, Cattan A, Dagan I (2020) CoRefi: a crowd sourcing suite for coreference annotation. In; Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations

  • Brasoveanu A (2008) Donkey pluralities: plural information states versus non-atomic individuals. Linguist Philos 31(2):129–209

    Article  Google Scholar 

  • Bussmann H, Kazzazi K, Trauth G (2006) Routledge dictionary of language and linguistics. Routledge, London

    Book  Google Scholar 

  • Caciularu A, Cohan A, Beltagy I, Peters M, Cattan A, Dagan I (2021) November. CDLM: cross-document language modeling. In: Findings of the association for computational linguistics: EMNLP 2021, Punta Cana, Dominican Republic, pp 2648–2662. Association for Computational Linguistics

  • Cambria E, Liu Q, Decherchi S, Xing F, Kwok K (2022) SenticNet 7: a commonsense-based neurosymbolic AI framework for explainable sentiment analysis. In: Proceedings of the 13th language resources and evaluation conference, pp 3829–3839

  • Cattan A, Eirew A, Stanovsky G, Joshi M, Dagan I (2020) Streamlining cross-document coreference resolution: evaluation and modeling. CoRR abs/2009.11032. arXiv:2009.11032

  • Cattan A, Eirew A, Stanovsky G, Joshi M, Dagan I (2021) Cross-document coreference resolution over predicted mentions. In: Findings of the association for computational linguistics: ACL-IJCNLP 2021, Online, pp 5100–5107. Association for Computational Linguistics

  • Chaturvedi I, Satapathy R, Cavallari S, Cambria E (2019) Fuzzy commonsense reasoning for multimodal sentiment analysis. Pattern Recognit Lett 125:264–270

    Article  Google Scholar 

  • Chen G, Van DeemterK, Lin C (2018) Modelling pro-drop with the rational speech acts model. In: Proceedings of the 11th international conference on natural language generation, pp 57–66. Association for Computational Linguistics (ACL)

  • Clark K, Manning CD (2015) Entity-centric coreference resolution with model stacking. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (Volume 1: Long Papers), Beijing, pp 1405–1415. Association for Computational Linguistics

  • Clark K, Manning CD (2016) Deep reinforcement learning for mention-ranking coreference models. In: Proceedings of the 2016 conference on empirical methods in natural language processing, Austin, pp 2256–2262. Association for Computational Linguistics

  • Cybulska A , Vossen P (2014) Using a sledgehammer to crack a nut? lexical diversity and event coreference resolution. In: Proceedings of the ninth international conference on language resources and evaluation (LREC’14), Reykjavik, pp 4545–4552. European Language Resources Association (ELRA)

  • Dai Z, Fei H, Li P (2019) Coreference aware representation learning for neural named entity recognition. In: Proceedings of the Twenty-eighth international joint conference on artificial intelligence, IJCAI-19, pp 4946–4953. International Joint Conferences on Artificial Intelligence Organization

  • Dakle PP, Desai T, Moldovan D (2020) A study on entity resolution for email conversations. In: Proceedings of the 12th language resources and evaluation conference, Marseille, pp 65–73. European Language Resources Association

  • Dasigi P, Liu NF, Marasović A, Smith NA, Gardner M (2019) QUOREF: a reading comprehension dataset with questions requiring coreferential reasoning. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong, pp 5925–5932. Association for Computational Linguistics

  • Davis E, Morgenstern L, Ortiz C (2017) The first Winograd schema challenge at IJCAI-16. AI Mag 38(3):97–98. https://doi.org/10.1609/aimag.v38i4.2734

    Article  Google Scholar 

  • de Marneffe MC, Rafferty AN, Manning CD (2008) Finding contradictions in text. In: Proceedings of ACL-08: HLT, Columbus, pp 1039–1047. Association for Computational Linguistics

  • Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American chapter of the association for computational linguistics: human language technologies, Volume 1 (Long and Short Papers), Minneapolis, pp 4171–4186. Association for Computational Linguistics

  • Ding X, Liu B (2010) Resolving object and attribute coreference in opinion mining. In: Proceedings of the 23rd international conference on computational linguistics (COLING 2010), Beijing, pp 268–276. COLING 2010 Organizing Committee

  • Dobrovolskii V (2021) Word-level coreference resolution. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Online and Punta Cana, Dominican Republic, pp 7670–7675. Association for Computational Linguistics

  • Durrett G, Klein D (2014) A joint model for entity analysis: coreference, typing, and linking. Trans Assoc Comput Linguist 2:477–490. https://doi.org/10.1162/tacl_a_00197

    Article  Google Scholar 

  • Eirew A, Cattan A, Dagan I (2021) WEC: deriving a large-scale cross-document event coreference dataset from Wikipedia. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 2498–2510. Association for Computational Linguistics

  • Ellis J, Getman J, Fore D, Kuster N, Song Z, Bies A, Strassel SM (2015) Overview of linguistic resources for the TAC KBP 2015 evaluations: Methodologies and results. In: Proceedings of the 2015 text analysis conference, TAC 2015, Gaithersburg, November 16–17, 2015, 2015. NIST

  • Ellis J, Getman J, Kuster N, Song Z, Bies A, Strassel SM (2016) Overview of linguistic resources for the TAC KBP 2016 evaluations: Methodologies and results. In: Proceedings of the 2016 Text analysis conference, TAC 2016, Gaithersburg, November 14–15, 2016. NIST

  • Ellis K, Albright A, Solar-Lezama A, Tenenbaum JB, O’Donnell TJ (2022) Synthesizing theories of human language with Bayesian program induction. Nat Commun 13(1):1–13

    Article  Google Scholar 

  • Emami A, Trichelair P, Trischler A, Suleman K, Schulz H, Cheung JCK (2019) The KnowRef coreference corpus: removing gender and number cues for difficult pronominal anaphora resolution. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, pp. 3952–3961. Association for Computational Linguistics

  • Emami A, Trischler A, Suleman K, Cheung JCK (2018), June. A generalized knowledge hunting framework for the Winograd schema challenge. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, New Orleans, Louisiana, USA, pp. 25–31. Association for Computational Linguistics

  • Fabbri A, Li I, She T, Li S, Radev D (2019) Multi-news: a large-scale multi-document summarization dataset and abstractive hierarchical model. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, pp 1074–1084. Association for Computational Linguistics

  • Ferracane E, Marshall I, Wallace BC, Erk K (2016) Leveraging coreference to identify arms in medical abstracts: an experimental study. In: Proceedings of the seventh international workshop on health text mining and information analysis, Auxtin, pp 86–95. Association for Computational Linguistics

  • Gardner M, Grus J, Neumann M, Tafjord O, Dasigi P, Liu NF, Peters M, Schmitz M, Zettlemoyer LS (2017) AllenNLP: a deep semantic natural language processing platform. In: Proceedings of workshop for NLP open source software (NLP-OSS)

  • Ge M, Mao R, Cambria E (2022) Explainable metaphor identification inspired by conceptual metaphor theory. In: Proceedings of the 36th AAAI conference on artificial intelligence, pp 10681–10689

  • Ghaddar A , Langlais P (2016) WikiCoref: an English coreference-annotated corpus of Wikipedia articles. In: Proceedings of the tenth international conference on language resources and evaluation (LREC’16), Portorož, Slovenia, pp 136–142. European Language Resources Association (ELRA)

  • Graves A, Schmidhuber J (2005) Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw 18(5):602–610. https://doi.org/10.1016/j.neunet.2005.06.042

    Article  Google Scholar 

  • Hinton G, Vinyals O, Dean J (2015) Distilling the knowledge in a neural network. In: NIPS deep learning and representation learning workshop

  • Hovy E, Marcus M, Palmer M, Ramshaw L, Weischedel R (2006) OntoNotes: the 90% solution. In: Proceedings of the human language technology conference of the NAACL, companion volume: short papers, New York City, pp 57–60. Association for Computational Linguistics

  • Huang YJ, Lu J, Kurohashi S, Ng V (2019) Improving event coreference resolution by learning argument compatibility from unlabeled data. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (Long and Short Papers), Minneapolis, pp 4171–4186. Association for Computational Linguistics

  • Joshi M, Chen D, Liu Y, Weld DS, Zettlemoyer L, Levy O (2020) SpanBERT: improving pre-training by representing and predicting spans. Trans Assoc Comput Linguist 8:64–77. https://doi.org/10.1162/tacl_a_00300

    Article  Google Scholar 

  • Joshi M, Levy O, Zettlemoyer L, Weld D (2019) BERT for coreference resolution: Baselines and analysis. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong, pp 5803–5808. Association for Computational Linguistics

  • Khashabi D, Chaturvedi S, Roth M, Upadhyay S, Roth D (2018) Looking beyond the surface: a challenge set for reading comprehension over multiple sentences. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (Long Papers), New Orleans, pp 252–262. Association for Computational Linguistics

  • Khosla S, Rose C (2020) Using type information to improve entity coreference resolution. In: Proceedings of the first workshop on computational approaches to discourse, pp 20–31. Association for Computational Linguistics

  • Kirstain Y, Ram O, Levy O (2021) Coreference resolution without span representations. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (Volume 2: Short Papers), pp 14–19. Association for Computational Linguistics

  • Kocijan V, Camburu OM, Cretu AM, Yordanov Y, Blunsom P, Lukasiewicz T (2019) WikiCREM: a large unsupervised corpus for coreference resolution. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), Hong Kong, pp 4303–4312. Association for Computational Linguistics

  • Kopeć M (2014) MMAX2 for coreference annotation. In: Proceedings of the demonstrations at the 14th conference of the European chapter of the association for computational linguistics, Gothenburg, pp 93–96. Association for Computational Linguistics

  • Krishna MH, Rahamathulla K, Akbar A (2017) A feature based approach for sentiment analysis using SVM and coreference resolution. In: 2017 International conference on inventive communication and computational technologies (ICICCT), pp 397–399

  • Kuhn HW (1955) The Hungarian method for the assignment problem. Naval Res Logist Q 2(1–2):83–97

    Article  MathSciNet  MATH  Google Scholar 

  • Kundu G, Sil A, Florian R, Hamza W (2018) Neural cross-lingual coreference resolution and its application to entity linking. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 2: short papers), Melbourne, pp 395–400. Association for Computational Linguistics

  • Lai T, Ji H, Bui T, Tran QH, Dernoncourt F, Chang W (2021) A context-dependent gated module for incorporating symbolic semantics into event coreference resolution. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 3491–3499. Association for Computational Linguistics

  • Lai TM, Bui T, Kim DS (2022) End-to-end neural coreference resolution revisited: a simple yet effective baseline. In: ICASSP 2022-2022 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 8147–8151

  • Lan Z, Chen M, Goodman S, Gimpel K, Sharma P, Soricut R (2019) ALBERT: a lite BERT for self-supervised learning of language representations. In: International conference on learning representations

  • Lee K, He L, Lewis M, Zettlemoyer L (2017) End-to-end neural coreference resolution. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, pp. 188–197. Association for Computational Linguistics

  • Lee K, He L, Zettlemoyer L (2018) Higher-order coreference resolution with coarse-to-fine inference. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 2 (short papers), New Orleans, pp 687–692. Association for Computational Linguistics

  • Levesque H, Davis E, Morgenstern L (2012). The Winograd schema challenge. In: Thirteenth international conference on the principles of knowledge representation and reasoning. Citeseer

  • Levesque HJ (2011) The Winograd schema challenge. In: Logical formalizations of commonsense reasoning, Papers from the 2011 AAAI spring symposium, Technical Report SS-11-06, Stanford, March 21–23, 2011. AAAI

  • Levy S, Lazar K, Stanovsky G (2021) Collecting a large-scale gender bias dataset for coreference resolution and machine translation. In: Findings of the Association for Computational Linguistics: EMNLP 2021, Punta Cana, pp 2470–2480. Association for Computational Linguistics

  • Li X, Van Deemter K, Lin C (2018) Statistical NLG for generating the content and form of referring expressions. In: Proceedings of the 11th international conference on natural language generation. Association for Computational Linguistics (ACL)

  • Lin Q, Mao R, Liu J, Xu F, Cambria E (2023) Fusing topology contexts and logical rules in language models for knowledge graph completion. Inform Fus 90:253–264

    Article  Google Scholar 

  • Lin Y, Ji H, Huang F, Wu L (2020) A joint neural model for information extraction with global features. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp. 7999–8009. Association for Computational Linguistics

  • Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) RoBERTa: a robustly optimized BERT pretraining approach. CoRR abs/1907.11692. arXiv:1907.11692

  • Liu Z, Shi K, Chen N (2021) Coreference-aware dialogue summarization. In: Proceedings of the 22nd annual meeting of the special interest group on discourse and dialogue, Singapore, pp. 509–519. Association for Computational Linguistics

  • Lu J, Ng V (2018) Event coreference resolution: a survey of two decades of research. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence, IJCAI-18, pp 5479–5486. International Joint Conferences on Artificial Intelligence Organization

  • Lu, J, Ng V (2020) Conundrums in entity coreference resolution: Making sense of the state of the art. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp. 6620–6631. Association for Computational Linguistics

  • Lu J, Ng V, (2021a) Constrained multi-task learning for event coreference resolution. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 4504–4514. Association for Computational Linguistics

  • Lu J, Ng V, (2021b) Conundrums in event coreference resolution: Making sense of the state of the art. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Punta Cana, pp 1368–1380. Association for Computational Linguistics

  • Lu J, Ng V. (2021c) Span-based event coreference resolution. In: Proceedings of the AAAI conference on artificial intelligence 35(15): 13489–13497. https://doi.org/10.1609/aaai.v35i15.17591

  • Lu Y, Lin H, Tang J, Han X, Sun L (2022) End-to-end neural event coreference resolution. Artificial Intell 303:103632. https://doi.org/10.1016/j.artint.2021.103632

    Article  MATH  Google Scholar 

  • Luo X (2005) On coreference resolution performance metrics. In: Proceedings of human language technology conference and conference on empirical methods in natural language processing, Vancouver, pp 25–32. Association for Computational Linguistics

  • Luo X, Pradhan S (2016) Evaluation metrics, Anaphora resolution. Springer, Berlin, pp 141–163. https://doi.org/10.1007/978-3-662-47909-4_5

    Book  Google Scholar 

  • Mao R, Li X (2021) Bridging towers of multi-task learning with a gating mechanism for aspect-based sentiment analysis and sequential metaphor identification. Proc AAAI Conf Artif Intell 35(15):13534–13542

    Google Scholar 

  • Mao R, Li X, Ge M, Cambria E (2022) MetaPro: a computational metaphor processing model for text pre-processing. Inform Fus 86–87:30–43

    Article  Google Scholar 

  • Mao R, Lin C, Guerin F (2018) Word embedding and WordNet based metaphor identification and interpretation. In: Proceedings of the 56th annual meeting of the association for computational linguistics, vol 1, pp 1222–1231

  • Mao R, Lin C, Guerin F (2019) End-to-end sequential metaphor identification inspired by linguistic theories. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 3888–3898

  • Miltsakaki E (2007) A rethink of the relationship between salience and anaphora resolution. In: Proceedings of the 6th discourse anaphora and anaphor resolution colloquium, pp 91–96

  • Mitamura T, Liu Z, Hovy EH (2016) Overview of TAC-KBP 2016 event nugget track. In: Proceedings of the 2016 text analysis conference, TAC 2016, Gaithersburg, November 14–15, 2016. NIST

  • Mitamura T, Liu Z, Hovy EH (2017) Events detection, coreference and sequencing: what’s next? Overview of the TAC KBP 2017 event track. In: TAC

  • Mitkov R (1999) Anaphora resolution: the state of the art. Citeseer

  • Moosavi NS, Strube M (2016) Which coreference evaluation metric do you trust? a proposal for a link-based entity aware metric. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: Long Papers), Berlin, pp 632–642. Association for Computational Linguistics

  • Murugesan K, Atzeni M, Kapanipathi P, Shukla P, Kumaravel S, Tesauro G, Talamadupula K, Sachan M, Campbell M (2021) Text-based RL agents with commonsense knowledge: new challenges, environments and baselines. In: Thirty fifth AAAI conference on artificial intelligence

  • Ng V (2010) Supervised noun phrase coreference research: The first fifteen years. In: Proceedings of the 48th annual meeting of the association for computational linguistics, Uppsala, pp 1396–1411. Association for Computational Linguistics

  • Oberle B (2018) SACR: a drag-and-drop based tool for coreference annotation. In: NCC Chair, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, PiperidisS, Tokunaga T (eds.), Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018), Miyazaki. European Language Resources Association (ELRA)

  • O’Gorman T, Wright-Bettner K, Palmer M (2016) Richer event description: Integrating event coreference with temporal, causal and bridging annotation. In: Proceedings of the 2nd workshop on computing news storylines (CNS 2016), Austin, pp 47–56. Association for Computational Linguistics

  • OpenAI (2022) Introducing ChatGPT

  • OpenAI (2023) GPT-4 technical report

  • Peng H, Chang KW, Roth D (2015) A joint framework for coreference resolution and mention head detection. In: Proceedings of the nineteenth conference on computational natural language learning, Beijing, pp 12–21. Association for Computational Linguistics

  • Pennington J, Socher R, Manning C (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), Doha, Qatar, pp 1532–1543. Association for Computational Linguistics

  • Poesio M, Stuckardt R, Versley Y (2016) Anaphora resolution-algorithms, resources, and applications. Theory and applications of natural language processing. Springer, New York

    MATH  Google Scholar 

  • Pradhan S, Moschitti A, Xue N, Uryupina O, Zhang Y (2012) CoNLL-2012 shared task: modeling multilingual unrestricted coreference in OntoNotes. In: Joint conference on EMNLP and CoNLL - Shared Task, Jeju Island, pp 1–40. Association for Computational Linguistics

  • Raghunathan K, Lee H, Rangarajan S, Chambers N, Surdeanu M, Jurafsky D, Manning C (2010a) A multi-pass sieve for coreference resolution. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Cambridge, MA, pp 492–501. Association for Computational Linguistics

  • Raghunathan K, Lee H, Rangarajan S, Chambers N, Surdeanu M, Jurafsky D, Manning C (2010b) A multi-pass sieve for coreference resolution. In: Empirical methods in natural language processing (EMNLP)

  • Rahman A, Ng V (2012) Resolving complex cases of definite pronouns: the Winograd schema challenge. In: Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning, Jeju, pp 777–789. Association for Computational Linguistics

  • Rajpurkar P, Zhang J, Lopyrev K, Liang P (2016) SQuAD: 100,000+ questions for machine comprehension of text. In: Proceedings of the 2016 conference on empirical methods in natural language processing, Austin, pp 2383–2392. Association for Computational Linguistics

  • Recasens M, Hovy E (2011) BLANC: implementing the rand index for coreference evaluation. Nat Language Eng 17(4):485–510

    Article  Google Scholar 

  • Reiter N (2018) CorefAnnotator: a new annotation tool for entity references. Data in the Digital Humanities. In: Abstracts of EADH

  • Riedel S, Yao L, McCallum A, Marlin BM (2013) Relation extraction with matrix factorization and universal schemas. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, Atlanta, pp 74–84. Association for Computational Linguistics

  • Ross S, Gordon G, Bagnell D (2011) A reduction of imitation learning and structured prediction to no-regret online learning. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp 627–635. JMLR Workshop and Conference Proceedings

  • Rudinger R, Naradowsky J, Leonard B, Van Durme B (2018) Gender bias in coreference resolution. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 2 (Short Papers), New Orleans, pp 8–14. Association for Computational Linguistics

  • Shlain M, Taub-Tabib H, Sadde S, Goldberg Y (2020) Syntactic search by example. In : Proceedings of the 58th annual meeting of the association for computational linguistics: system demonstrations, pp 17–23. Association for Computational Linguistics

  • Stenetorp P, Pyysalo S, Ananiadou S, Tsujii J (2011) Almost total recall: semantic category disambiguation using large lexical resources and approximate string matching. In: Proceedings of the fourth international symposium on languages in biology and medicine. Citeseer

  • Stenetorp P, Pyysalo S, Topić G, Ohta T, Ananiadou S, Tsujii J (2012) April. BRAT: a web-based tool for NLP-assisted text annotation. In: Proceedings of the demonstrations at the 13th conference of the european chapter of the association for computational linguistics, Avignon, pp. 102–107. Association for Computational Linguistics

  • Stoyanov V, Gilbert N, Cardie C, Riloff E (2009), August. Conundrums in noun phrase coreference resolution: Making sense of the state-of-the-art. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP, Suntec, Singapore, pp 656–664. Association for Computational Linguistics

  • Sukthanker R, Poria S, Cambria E, Thirunavukarasu R (2020) Anaphora and coreference resolution: a review. Inform Fus 59:139–162

    Article  Google Scholar 

  • Sun Y, Wang S, Feng S, Ding S, Pang S, Shang J, Liu J, Chen X, Zhao Y, Lu Y, Liu W, Wu Z, Gong W, Liang J, Shang Z, Sun P, Liu W, Ouyang X, Yu D, Tian H, Wu H, Wang H (2021) ERNIE 3.0: large-scale knowledge enhanced pre-training for language understanding and generation. CoRR abs/2107.02137. arXiv:2107.02137

  • Teh Y, Bapst V, Czarnecki WM, Quan J, Kirkpatrick J, Hadsell R, Heess N, Pascanu R (2017) Distral: robust multitask reinforcement learning. In: Advances in neural information processing systems, pp. 4496–4506

  • Thirukovalluru R, Monath N, Shridhar K, Zaheer M, Sachan M, McCallum A (2021) Scaling within document coreference to long texts. In: Findings of the association for computational linguistics: ACL-IJCNLP 2021, pp 3921–3931. Association for Computational Linguistics

  • Turian J, Ratinov LA, Bengio Y (2010), July. Word representations: a simple and general method for semi-supervised learning. In: Proceedings of the 48th annual meeting of the association for computational linguistics, Uppsala, pp 384–394. Association for Computational Linguistics

  • Uzuner O, Bodnari A, Shen S, Forbush T, Pestian J, South BR (2012) Evaluating the state of the art in coreference resolution for electronic medical records. JAMIA 19(5):786–791

    Google Scholar 

  • Varkel Y, Globerson A (2020) Pre-training mention representations in coreference models. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 8534–8540. Association for Computational Linguistics

  • Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, pp 5998–6008

  • Verga P , McCallum A (2016) Row-less universal schema. In: Proceedings of the 5th workshop on automated knowledge base construction, San Diego, pp 63–68. Association for Computational Linguistics

  • Vilain M, Burger J, Aberdeen J, Connolly D, Hirschman L (1995) A model-theoretic coreference scoring scheme. In: Sixth message understanding conference (MUC-6): proceedings of a conference held in Columbia, Maryland, November 6–8, 1995

  • Wang A, Pruksachatkun Y, Nangia N, Singh A, Michael J, Hill F, Levy O, Bowman S (2019). SuperGLUE: a stickier benchmark for general-purpose language understanding systems. In: Wallach H, Larochelle H, Beygelzimer A, d’ Alché-Buc, F, Fox E, Garnett R (eds.), Advances in neural information processing systems, volume 32. Curran Associates, Inc

  • Wang Y, Shen Y, Jin H (2021) An end-to-end actor-critic-based neural coreference resolution system. In: ICASSP 2021-2021 IEEE International conference on acoustics, speech and signal processing (ICASSP), pp 7848–7852

  • Webster K, Recasens M, Axelrod V, Baldridge J (2018) Mind the GAP: a balanced corpus of gendered ambiguous pronouns. Trans Assoc Comput Linguist 6:605–617. https://doi.org/10.1162/tacl_a_00240

    Article  Google Scholar 

  • Welbl J, Stenetorp P, Riedel S (2018) Constructing datasets for multi-hop reading comprehension across documents. Trans Assoc Comput Linguist 6:287–302

    Article  Google Scholar 

  • Winograd T (1972) Understanding natural language. Cognit Psychol 3(1):1–191. https://doi.org/10.1016/0010-0285(72)90002-3

    Article  Google Scholar 

  • Wiseman S, Rush AM, Shieber SM (2016) Learning global features for coreference resolution. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, San Diego, pp 994–1004. Association for Computational Linguistics

  • Wu W, Wang F, Yuan A, Wu F, Li J (2020) CorefQA: coreference resolution as query-based span prediction. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 6953–6963. Association for Computational Linguistics

  • Xia P, Sedoc J, Van Durme B (2020) Incremental neural coreference resolution in constant memory. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 8617–8624. Association for Computational Linguistics

  • Xu L, Choi JD (2020) Revealing the myth of higher-order inference in coreference resolution. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 8527–8533. Association for Computational Linguistics

  • Yadav N, Monath N, Angell R, McCallum A (2021) Event and entity coreference using trees to encode uncertainty in joint decisions. In: Proceedings of the fourth workshop on computational models of reference, anaphora and coreference, Punta Cana, pp 100–110. Association for Computational Linguistics

  • Yang Z, Dai Z, Yang Y, Carbonell J, Salakhutdinov RR, Le QV (2019) XLNet: generalized autoregressive pretraining for language understanding. In: Wallach H, Larochelle H, Beygelzimer A, d’ Alché-Buc F, Fox E, Garnett R (eds.), Advances in neural information processing systems, volume 32. Curran Associates, Inc

  • Ye D, Lin Y, Du J, Liu Z, Li P, Sun M, Liu Z (2020) Coreferential reasoning learning for language representation. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 7170–7186. Association for Computational Linguistics

  • Yu J, Bohnet B, Poesio M (2020) Neural mention detection. In: LREC

  • Yu X, Yin W, Roth D (2020) Pairwise representation learning for event coreference

  • Zeldes A (2017) The GUM corpus: creating multilayer resources in the classroom. Lang Resour Eval 59:581–612. https://doi.org/10.1007/s10579-016-9343-x

    Article  Google Scholar 

  • Zeng Y, Jin X, Guan S, Guo J, Cheng X (2020) Event coreference resolution with their paraphrases and argument-aware embeddings. In: Proceedings of the 28th international conference on computational linguistics, Barcelona, pp 3084–3094. International Committee on Computational Linguistics

  • Zhang H, Song Y, Song Y, Yu D (2019) Knowledge-aware pronoun coreference resolution. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, pp 867–876. Association for Computational Linguistics

  • Zhang R, Nogueira dos Santos C, Yasunaga M, Xiang B, Radev D (2018) Neural coreference resolution with deep biaffine attention by joint mention detection and mention clustering. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 2: Short Papers), Melbourne, pp 102–107. Association for Computational Linguistics

  • Zhang X, Zhao J, LeCun Y (2015) Character-level convolutional networks for text classification. In: Cortes C, Lawrence N, Lee D, Sugiyama M, Garnett R (eds) Advances in neural information processing systems, vol 28. Curran Associates Inc, Red Hook

    Google Scholar 

  • Zhao J, Wang T, Yatskar M, Ordonez V, Chang KW (2018) Gender bias in coreference resolution: evaluation and debiasing methods. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 2 (Short Papers), New Orleans, pp 15–20. Association for Computational Linguistics

  • Zhu P, Zhang Z, Li J, Huang Y, Zhao H (2018) Lingke: a fine-grained multi-turn chatbot for customer service. In: Proceedings of the 27th international conference on computational linguistics: system demonstrations, Santa Fe, pp 108–112. Association for Computational Linguistics

Download references

Acknowledgements

This study is supported under the RIE2020 Industry Alignment Fund-Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner(s).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Erik Cambria.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, R., Mao, R., Luu, A.T. et al. A brief survey on recent advances in coreference resolution. Artif Intell Rev 56, 14439–14481 (2023). https://doi.org/10.1007/s10462-023-10506-3

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-023-10506-3

Keywords

Navigation