1 Introduction

Connected Components Labeling (CCL) is one of the fundamental operations in Computer Vision and Image Processing. With the labeling procedure, all objects in a binary image are labeled with unique values, typically integer numbers. In the last few decades, many novel proposals for CCL have appeared, but only some of them have been compared on the same data and with the same implementation [3, 11, 14]. For this reason, the benchmarking framework Yet Another Connected Components LAbeling Benchmark (YACCLAB for short) has been developed, aiming to provide the fairest possible evaluation of CCL algorithms [10, 15].

The performance evaluation task is not as easy as it may seem, as there are several aspects that can influence the performance of an algorithm. However, since CCL is a well-defined problem admitting a unique solution, the key elements influencing the “speed” of an algorithm can be reduced to: the data on which tests are performed, the quality of the implementations, the hardware capabilities, and, last but not least, the code optimization provided by the compiler.

For these reasons, the YACCLAB benchmark is based on two fundamental traits that aim to guarantee the reproducibility of the claims made in research papers:

  (i) A public dataset of binary images that covers different application scenarios, ranging from text analysis to video surveillance;

  (ii) A set of open-source C++ algorithm implementations, to which anyone can contribute extensions or improvements.

The results obtained with YACCLAB may vary when the computer architecture or the compiler change, but since the code is publicly available, anyone can test the provided algorithms on their own setup, choose the one that best suits their needs, and verify any claim found in the literature.

Following this line of work, in this paper we describe the algorithmic and implementation details of a recently developed CCL algorithm, “Connected Component Labeling on DRAGs” (Directed Rooted Acyclic Graphs) [7], focusing on its integration with YACCLAB and on the installation procedure. A detailed analysis of the influence of the parameters on the results is also provided.

The source code of the aforementioned algorithm is located at [1], whereas the benchmarking suite can be found at [4].

2 How to Test DRAG with YACCLAB

To correctly install and run YACCLAB, a few packages, libraries, and utilities are required, most notably a C++ compiler, CMake, and the OpenCV library.

The installation procedure requires the following steps:

  • Clone the GitHub repository [4];

  • Install the software using CMake, which should automatically find the OpenCV path (provided OpenCV is correctly installed on your OS), download the YACCLAB dataset, and create a C++ project for the selected compiler;

  • Edit the configuration file config.yaml placed in the installation folder, selecting the desired tests;

  • Open the project, compile and run it.

There are six different tests available: correctness tests provide an initial validation of the algorithms; average tests run the algorithms on every image of a dataset, reporting the average run-time of each method; average_with_steps tests separate the labeling time of each scan from the time required to allocate/deallocate data structures; density and granularity tests use synthetic images to evaluate the performance of the different approaches in terms of scalability with respect to the number of pixels, foreground density, and pattern granularity; finally, memory tests report an indication of the expected number of memory accesses required by an algorithm on a reference dataset.

YACCLAB stores average results in three different formats: a plain text file, histogram charts, both in color and in gray-scale, and a table that can be directly included in research papers. If an algorithm employs multiple scans, the time spent in each of them is reported separately, producing a stacked histogram chart as output.

All the algorithms included in YACCLAB derive from a common base interface and implement the following virtual methods (a minimal sketch is provided after the list):

  • PerformLabeling: includes the whole algorithm code and is required to perform the average, density, granularity, and size tests;

  • PerformLabelingWithSteps: implements the algorithm divided into steps (i.e., alloc/dealloc, first_scan, and second_scan for algorithms requiring two scans, or all_scan for the others), so that every step can be evaluated separately;

  • PerformLabelingMem: is an implementation of the algorithm that keeps track of the number of memory accesses as they occur.
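
As an illustration, the following C++ fragment is a minimal sketch of how a new algorithm can plug into this interface, under the assumption of a base class named Labeling with members img_ and img_labels_; the names and signatures are illustrative and may differ from the actual YACCLAB headers.

// Minimal sketch of the algorithm interface (illustrative names, not the
// actual YACCLAB headers): a new algorithm derives from the common base
// class and overrides the three virtual methods described above.
#include <opencv2/core.hpp>
#include <vector>
#include <cstdint>

class Labeling {                                    // hypothetical base interface
public:
    virtual ~Labeling() = default;
    virtual void PerformLabeling() = 0;             // whole algorithm (average/density/granularity tests)
    virtual void PerformLabelingWithSteps() = 0;    // alloc/dealloc, first_scan, second_scan (or all_scan)
    virtual void PerformLabelingMem(std::vector<uint64_t>& accesses) = 0;  // memory-access tracing
protected:
    cv::Mat1b img_;                                 // binary input image
    cv::Mat1i img_labels_;                          // output label image
    unsigned  n_labels_ = 0;                        // number of labels found
};

class MyNewAlgorithm : public Labeling {
public:
    void PerformLabeling() override {
        img_labels_ = cv::Mat1i(img_.size(), 0);
        // ... first scan, equivalence resolution, second scan ...
    }
    void PerformLabelingWithSteps() override { /* per-step timed variant */ }
    void PerformLabelingMem(std::vector<uint64_t>& accesses) override { /* traced variant */ }
};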

The Union-Find strategy is independent of the CCL one; therefore, all CCL algorithms invoke a templated Union-Find implementation. YACCLAB is thus able to compare each algorithm (except those for which the labels solver is built-in) with four different labels solving strategies: standard Union-Find (UF), Union-Find with Path Compression (UFPC) [21], Interleaved Rem’s algorithm with splicing (RemSP) [12], and Three Table Array (TTA) [16]. This standardization reduces code variability, allows the Union-Find data structures to be separated from those of the CCL algorithms, and provides fairer comparisons without negatively impacting the execution time.
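
As an illustration of how the labels solver is factored out of the scanning code, the following is a condensed sketch of a Union-Find solver with path compression, in the spirit of UFPC; the class name, method names, and array layout are assumptions made for the example and do not reproduce the exact YACCLAB implementation.

// Condensed sketch of a labels solver with path compression (UFPC-like).
// The scanning code of a CCL algorithm receives such a class as a template
// parameter, so the same code can be compiled against UF, UFPC, RemSP or TTA.
#include <vector>
#include <cstdint>

class UFPC {
public:
    void Setup(uint32_t max_labels) {
        parent_.assign(max_labels, 0);              // label 0 is the background
        length_ = 1;
    }
    uint32_t NewLabel() { parent_[length_] = length_; return length_++; }
    uint32_t Find(uint32_t i) {                     // find the root with path compression
        uint32_t root = i;
        while (parent_[root] < root) root = parent_[root];
        while (parent_[i] < i) { uint32_t next = parent_[i]; parent_[i] = root; i = next; }
        return root;
    }
    uint32_t Merge(uint32_t i, uint32_t j) {        // union: keep the smaller root
        uint32_t ri = Find(i), rj = Find(j);
        if (ri < rj) { parent_[rj] = ri; return ri; }
        parent_[ri] = rj;
        return rj;
    }
private:
    std::vector<uint32_t> parent_;
    uint32_t length_ = 1;
};

// Inside a scanning algorithm, the solver would be used through a template
// parameter, e.g.:
//   template <typename LabelsSolver> void FirstScan(LabelsSolver& solver) {
//       ...
//       solver.Merge(label_p, label_q);            // record an equivalence
//       ...
//   }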

The NULL labeling, also referred to as NULL, defines a lower bound for the execution time of CCL algorithms on a given machine and a reference dataset. As the name suggests, the NULL algorithm does not produce the correct connected components of an image, but only copies the pixels from the input image into the output one. This “algorithm” makes it possible to identify the minimum time required to allocate the memory of the output image, read the input image, and write the output one. In this way, all the algorithms can be compared in terms of the cost of the additional operations they require.
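
In essence, the NULL labeling reduces to allocating the output label image and copying the input pixels into it, as in the following sketch (the function and variable names are illustrative):

// Sketch of the NULL "labeling": allocate the output label image and copy the
// input pixels into it, without computing any connected component. Its
// run-time is a lower bound for every real CCL algorithm on the same data.
#include <opencv2/core.hpp>

void NullLabeling(const cv::Mat1b& img, cv::Mat1i& img_labels) {
    img_labels = cv::Mat1i(img.size());             // output allocation
    for (int r = 0; r < img.rows; ++r) {
        const uchar* row_in  = img.ptr<uchar>(r);   // read the input image
        int*         row_out = img_labels.ptr<int>(r);
        for (int c = 0; c < img.cols; ++c)
            row_out[c] = row_in[c];                 // write the output image
    }
}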

3 Experiments Reproducibility

The DRAG algorithm was tested on an Intel Core i7-4770 CPU @ 3.40 GHz (\(4\times 32\) KB L1 cache, \(4\times 256\) KB L2 cache, and 8 MB of L3 cache) running a Linux OS, using the GCC 7.2.0 compiler with the -O3 and -m32 flags enabled.

The impact of the labels solver on the overall performance is typically limited for most algorithms, so only the results obtained with the UFPC solver are reported for the state-of-the-art algorithms.

The DRAG performance has been compared on six different datasets:

  • Medical: a collection of histological images [13] with an average of 1.21 million pixels to analyze and 484 components to label;

  • Fingerprints: fingerprint images [20], collected by using low-cost optical sensors or synthetically generated, with an average of 809 components to label;

  • XDOCS: high resolution historical document images [6, 8, 9] with more than 15,000 components and a low foreground density;

  • 3DPeS: a dataset for people detection, tracking, action analysis, and trajectory analysis [5], with very low foreground density and few components to identify;

  • Tobacco800: a selection of documents [2, 18, 19] collected and scanned using a wide variety of equipment over time, with a resolution varying from 150 to 300 DPI;

  • MirFlickr: a large set of standard resolution natural images [17] taken from Flickr.

In order to execute the same experiments reported in [7], the perform, algorithms, and average_datasets fields in the configuration file must be set as follows:

[Configuration listing: perform, algorithms, and average_datasets settings]

Average tests were repeated 10 times (by setting the tests_number.average field in the configuration file), and for each image the minimum execution time was considered. The use of the minimum is justified by the fact that, in theory, an algorithm in a specific environment will always require the same time to execute. This time could be computed exactly on non-multitasking, single-core processors (e.g., the 8086 or the 80286). Nowadays, too many unpredictable events can occur in the background, independently of the specific algorithm. However, an algorithm cannot take fewer than the required clock cycles, so the best way to estimate the “real” execution time is to take the minimum value over multiple runs. The probability of obtaining a higher execution time is then equal for all algorithms. For this reason, taking the minimum is also the only way to get reproducible results from one execution of the benchmark to another on the same environment.
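
A minimal sketch of this timing policy is given below: the algorithm is run on the same image the configured number of times and the smallest measured time is kept; the helper name and the std::function-based interface are illustrative, not the actual YACCLAB code.

// Sketch of the "minimum over repeated runs" policy used by the average tests.
// RunOnce is a stand-in for executing one CCL algorithm on one image.
#include <algorithm>
#include <chrono>
#include <functional>
#include <limits>

double MinRunTimeMs(int runs, const std::function<void()>& RunOnce) {
    double best = std::numeric_limits<double>::max();
    for (int i = 0; i < runs; ++i) {
        auto start = std::chrono::steady_clock::now();
        RunOnce();                                  // one execution of the CCL algorithm
        auto stop = std::chrono::steady_clock::now();
        best = std::min(best,
            std::chrono::duration<double, std::milli>(stop - start).count());
    }
    return best;                                    // best of 'runs' repetitions
}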

4 Conclusion

This paper describes how to set up the YACCLAB project to reproduce the results reported in [7]. The processor model (in particular, the cache sizes), the RAM speed, and the background tasks will influence the execution time. Nevertheless, the relative performance of the algorithms should remain extremely similar. Changing the OS or the compiler, instead, is likely to heavily influence the outcome.