Abstract
Topic Detection and Tracking refers to automatic techniques for locating topically related materials in streams of data. As a core of it, story link detection is to determine whether two stories are about the same topic. Up to now, many representation models have been used in story link detection. But few of them are specific to stories. This paper proposes an event model based on the characters of stories. This model is used for story link detection and evaluated on the TDT4 Chinese corpus. The experimental results indicate that the system using the event model achieves a better performance than that using the baseline model. Furthermore, it shows a larger improvement to the former, especially when using uneven SVM as the multi-similarity integration strategy.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Allan, J.: Introduction to topic detection and tracking. In: Allan, J. (ed.) Topic Detection and Tracking - Event-based Information Organization, pp. 1–16. Kluwer Academic Publisher, Dordrecht (2002)
Connell, M., Feng, A., Kumaran, G., Raghavan, H., Shah, C., Allan, J.: Umass at tdt 2004. In: TDT 2004 Workshop (2004)
Chen, F., Farahat, A., Brants, T.: Multiple similarity measures and source-pair information in story link detection. In: HLT-NAACL, pp. 313–320 (2004)
Allan, J., Lavrenko, V., Malin, D., Swan, R.: Detections, bounds, and timelines: Umass and tdt–3. In: Proceedings of Topic Detection and Tracking (TDT–3), pp. 167–174 (2000)
Nallapati, R.: Semantic language models for topic detection and tracking. In: HLT-NAACL (2003)
Lavrenko, V., Allan, J., DeGuzman, E., LaFlamme, D., Pollard, V., Thomas, S.: Relevance models for topic detection and tracking. In: Proceedings of Human Language Technology Conference (HLT), pp. 104–110 (2002)
Van Der Walt, C.M., Barnard, E.: Data characteristics that determine classifier performance. In: Sixteenth Annual Symposium of the Pattern Recognition Association of South Africa, pp. 160–165 (2006)
Morik, K., Brockhausen, P., Joachims, T.: Combining statistical learning with a knowledgebased approach - a case study in intensive care monitoring. In: Proceedings of the 16th International Conference on Machine Learning (ICML 1999), pp. 268–277. Morgan Kaufmann, San Francisco, CA (1999)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, X., Wang, T., Chen, H. (2008). Story Link Detection Based on Event Model with Uneven SVM. In: Li, H., Liu, T., Ma, WY., Sakai, T., Wong, KF., Zhou, G. (eds) Information Retrieval Technology. AIRS 2008. Lecture Notes in Computer Science, vol 4993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68636-1_44
Download citation
DOI: https://doi.org/10.1007/978-3-540-68636-1_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68633-0
Online ISBN: 978-3-540-68636-1
eBook Packages: Computer ScienceComputer Science (R0)