Abstract
In this paper, we address multi-labeler active learning, where data labels can be acquired from multiple labelers with varying levels of expertise. Because obtaining labels for data instances can be very costly and time-consuming, it is highly desirable to model each labeler’s expertise and to query an instance’s label only from the labeler with the best expertise. However, in an active learning scenario it is very difficult to model labelers’ expertise accurately, because the number of instances labeled by all participating labelers is rather small. To solve this problem, we propose a new probabilistic model that transfers knowledge from a rich set of labeled instances in auxiliary domains to help model labelers’ expertise for active learning. Based on this model, we present an active learning algorithm that simultaneously selects the most informative instance and its most reliable labeler to query. Experiments demonstrate that transferring knowledge across related domains helps select the labeler with the best expertise and thus significantly boosts active learning performance.
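The joint instance-labeler selection described above can be illustrated with a minimal sketch. Everything here is a placeholder: the paper learns labeler expertise with a probabilistic transfer model, whereas this toy uses random reliability estimates and binary-entropy uncertainty, and all variable names (`proba`, `reliability`, `score`) are illustrative, not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy pool: 8 unlabeled instances, 3 candidate labelers.
n_instances, n_labelers = 8, 3
proba = rng.uniform(0.05, 0.95, size=n_instances)            # model's P(y=1|x)
reliability = rng.uniform(0.5, 1.0, size=(n_instances, n_labelers))

# Informativeness of each instance: binary entropy of the prediction.
entropy = -(proba * np.log(proba) + (1 - proba) * np.log(1 - proba))

# Jointly score every (instance, labeler) pair: an informative instance
# queried from a labeler estimated to be reliable on it scores highest.
score = entropy[:, None] * reliability                        # shape (8, 3)
best_instance, best_labeler = np.unravel_index(np.argmax(score), score.shape)
```

The key point the sketch conveys is that the instance and the labeler are chosen together, by maximizing over pairs, rather than picking the most informative instance first and a labeler afterwards.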
© 2013 Springer-Verlag Berlin Heidelberg
Cite this paper
Fang, M., Yin, J., Zhu, X. (2013). Knowledge Transfer for Multi-labeler Active Learning. In: Blockeel, H., Kersting, K., Nijssen, S., Železný, F. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2013. Lecture Notes in Computer Science(), vol 8188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40988-2_18
Print ISBN: 978-3-642-40987-5
Online ISBN: 978-3-642-40988-2