1 Introduction

The advent of smart retail stores has started shaping our everyday shopping experience. These stores can manage complex operations efficiently with the implementation of cutting-edge computer vision technologies and sensor-based systems. The retail sector faces numerous challenges despite these advancements. Among these, planogram compliance control is a crucial issue that remains to be effectively addressed. Through the application of pattern recognition and computer vision methods, we aim to tackle this particular problem. A planogram is a diagram showing the placement of products on the shelves of a retail store [20]. Planograms aim to improve sales by displaying products in a way that encourages customers to purchase them [15]. Suppliers also pay for shelves with high selling potential, or apply discounts to their products, to place them on these shelves [25]. The planogram also ensures consistency between branches of the same retail store.

Although planogram usage has considerable benefits, the placement of products on a shelf may not always comply with it. A shelf is considered planogram compliant when all products on it are in the right place and quantity. However, this may not be satisfied due to several reasons such as human error, low stock, or out-of-stock situations [1]. These situations not only result in loss of sales for the retailer [28], but also cause the retailer to pay penalty fees to the supplier. According to Shapiro [2], the planogram compliance ratio is only 70% on average in a typical retail store. This results in huge sales losses for the retailer. According to the same study, resetting the planogram, i.e., re-allocating all the products, can improve sales by up to 7.8% in two weeks. Therefore, retailers must continuously organize the shelves in compliance with the planogram to maximize overall sales. In order to explain the planogram compliance problem further, we provide a sample visual representation of the planogram and two corresponding shelf images in Fig. 1.

Fig. 1
figure 1

A sample planogram and corresponding shelf images

We provide a visual representation of the planogram for a shelf in Fig. 1(a). This is the reference planogram which shows how the products should be placed. In this planogram, there are five different products with a total of 19 items. High-resolution images for these items have been collected from the web. We provide a shelf image, taken by a camera, fully compliant with the reference planogram in Fig. 1(b). As can be seen here, although the order and number of the items on the shelf are the same as in the reference planogram, objects on the shelf have different viewing angles and non-ideal placement. Besides, the product images in the planogram and the fully compliant shelf differ in terms of brightness and color. We next provide a shelf image, taken by a camera, partially compliant with the reference planogram in Fig. 1(c). This image represents most of the planogram compliance problems. We can list them as follows. There is an extra item placed on the leftmost part of the shelf. Hence, all other items are shifted to the right accordingly. The third product group has one extra item in it. The fourth product group has two fewer items in it. The fifth product group has one fewer item in it. There is an empty slot between the fourth and fifth product groups. There is no way to determine which product this empty slot belongs to. The sixth product group contains items that resemble the expected products but are actually different. Therefore, the partially planogram compliant shelf image in Fig. 1(c) illustrates the challenges we can face in planogram compliance control.

Planogram compliance control is conventionally performed manually by an employee in retail stores. The employee walks around aisles and checks shelves once or twice a day for compliance with the planogram. If the employee detects a problem, then the products on the shelf are re-arranged according to the planogram. However, this method is labor intensive and prone to human error. In an alternative approach, the employee uses a mobile device to take photos of shelves [23]. Then, these photos are sent to the cloud to check the shelf status. Even though this method eliminates human error, it is still labor intensive. Another method for planogram compliance control is to use fixed IP cameras instead of mobile devices. Unfortunately, using multiple cameras increases the cost [11]. One recent approach to planogram compliance control uses robots [10, 26, 29]. In these systems, the mobile robot moves around aisles of the store and collects images and data from the shelves using onboard cameras and sensors. As the robot knows its exact location in the store, it can perform the planogram compliance control. Unfortunately, these systems are fairly expensive.

There are several methods in the literature based on computer vision and pattern recognition to address planogram compliance control and object detection from shelf images. Tonioni and Di Stefano [20] proposed a method to solve the problem in three steps. First, they detect objects from the shelf image via the generalized Hough transform and local feature extraction. Second, they use sub-graph matching to compare the reference planogram and actual item placements. Third, they apply an iterative approach to locate the undetected objects. Our proposed method differs from Tonioni and Di Stefano’s work in the second and third steps. Liu et al. [13] provided a planogram compliance control method based on repetitive image patterns. They did not provide an object detection step in their study. There are also related studies focusing on object detection in retail environments. George and Floerkemeier [19] formed discriminative random forests from all product images. They used SIFT with only one training image per product for image classification. Then, they used deformable dense pixel matching and genetic algorithm optimization. George and Floerkemeier did not provide a planogram compliance control step. Instead, they focused on multi-instance object detection in retail stores. Rosado et al. [30] used panoramic images to locate stocked-out products on shelves without object detection. Jund et al. [5] used a CNN to recognize products on the shelf. Unfortunately, a CNN needs several images of the same product type for training. Franco et al. [6] first extracted corners from the shelf image. They assumed that each corner is a candidate member of a rectangular object. Then, they formed a window and compared histograms to check whether an object is present there. Afterward, they used a CNN for fine selection. Karlinsky et al. [3] first extracted global information such as the location of the shelf. Then, they proposed three methods: local feature extraction (via SURF) with an implicit shape model, histogram of oriented gradients features with a sliding window, and local feature extraction (via SURF) with a bag of words model. Their first and third approaches are similar to the object detection step proposed in this study. However, Karlinsky et al. did not consider planogram compliance control in their method. This is the main difference between the two studies. Santra et al. [24] used both object and part level features to detect fine-grained objects in the shelf image. They proposed a reconstruction-classification network to find object level features. Then, they used BRISK to find keypoints on the product image. They encoded the last level of the CNN with a convolutional LSTM to describe the part level feature at each keypoint. Finally, they used both object and part level features to detect fine-grained objects. There are two related studies on detecting objects from densely packed images such as those acquired from shelves in retail stores. However, object recognition is not their aim. The first study, by Goldman et al. [27], introduces the SKU-110K dataset and uses different deep learning models on it to detect objects. The second study, by Ye et al. [18], uses the same dataset with different deep learning models for object detection. There is also a recent review paper by Santra and Mukherjee [22]. The reader can consult it for both traditional and deep learning methods for automatic identification of products in retail stores.

In this study, we propose a novel method to detect objects from densely packed shelf images and control planogram compliance. The object detection step depends on local feature extraction and an implicit shape model representation. The planogram compliance control step is based on a modified Needleman-Wunsch algorithm, which was originally introduced to align DNA sequences. We also introduce a focused and iterative search step to improve the object detection and planogram compliance control steps. These three steps differ from the ones in the literature in that they sequentially improve both object detection from shelf images and planogram compliance control.

The layout of the study is as follows. We first introduce the parts of the object detection step in the proposed method. Therefore, we start with local feature extraction and the representation of the extracted features. Afterward, we explain how brute force search and the implicit shape model can be used to detect object center points in a given shelf image. Then, we introduce a bounding box extraction method based on this information. We next explain the planogram compliance control step of the proposed study. To do so, we start with representing the extracted object information as an abstract planogram. Afterward, we introduce the modified Needleman-Wunsch algorithm for planogram compliance control. Then, we explain the focused and iterative search step to improve the performance of the proposed method. We next test the proposed method on two different datasets for the object detection and planogram compliance control steps. Finally, we summarize the key findings of the proposed method.

2 Object detection and bounding box extraction

The proposed planogram compliance control method starts with object detection from the shelf image. This operation starts with local feature extraction. Then, we represent the extracted features in vector form for further processing. Afterward, we apply brute force search to match the local features extracted from the object of interest and the shelf image. This leads to object center point detection via the implicit shape model. Finally, we extract bounding boxes for the detected objects on the shelf. We explain all these operations in this section.

2.1 Local feature extraction methods

Objects on a retail shelf generally have complex front views. Moreover, there are several nearby objects with almost the same or similar views. Hence, extracting local features and forming an object detection framework on them is more promising than using global features. Therefore, we picked five well-known local feature extraction methods: SIFT [14], SURF [4], ORB [21], AKAZE [16], and BRISK [7]. These methods extract keypoints and a corresponding feature vector at each keypoint via different approaches. We will benefit from the extracted local features and keypoints in detecting objects in the shelf image.
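
As a rough illustration of this step, the following sketch extracts keypoints and descriptors with OpenCV, which we assume as the underlying library; SURF additionally requires the opencv-contrib build and is therefore omitted here.

```python
# A minimal sketch of local feature extraction, assuming the opencv-python package.
import cv2

def extract_local_features(image_path, method="SIFT"):
    """Return keypoints and descriptors for a model or shelf image."""
    image = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)
    detectors = {
        "SIFT": cv2.SIFT_create,
        "ORB": cv2.ORB_create,
        "AKAZE": cv2.AKAZE_create,
        "BRISK": cv2.BRISK_create,
    }
    detector = detectors[method]()
    # Keypoints carry the (x, y) locations; descriptors are the local feature vectors.
    keypoints, descriptors = detector.detectAndCompute(image, None)
    return keypoints, descriptors
```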

2.2 Representing the extracted local features in a general framework

Since we picked five different local feature extraction methods in this study, we introduce a general notation to represent them in this section. This will help us describe the proposed method independently of the specific feature extraction method. Let \(I(x, y)\) represent a shelf image with J different objects in it. Assume that the \(j^{th}\) object on the shelf has a model (or planogram) image \(I_j(x, y)\) for \(j = 1, \ldots , J\). Let \(w_j\) and \(h_j\) be the width and height of \(I_j(x, y)\), respectively. We crop the model image as tightly as possible. Hence, the object has center point \((w_j/2, h_j/2)\).

We can extract local features from \(I_j(x, y)\) to represent the \(j^{th}\) object. Let \(\overrightarrow{k_{jl}}\) be the feature vector extracted at the keypoint \((x_{jl}, y_{jl})\) for \(l = 1, \ldots , L_j\) such that \(L_j\) is the total number of keypoints extracted using one of the methods given in Section 2.1. Hence, we will have the merged vector form \(\overrightarrow{f_{jl}} = (\overrightarrow{k_{jl}}, x_{jl}, y_{jl})\) for \(l = 1, \ldots , L_j\) to represent the \(j^{th}\) object with \(L_j\) keypoints.

We should also extract local features from the shelf image, \(I(x, y)\), to detect objects in it. Based on the previous definitions, we can represent the merged vector for the image as \(\overrightarrow{f_m} = (\overrightarrow{k_m}, x_m, y_m)\) for \(m = 1, \ldots , M\). Here, \(\overrightarrow{k_m}\) is the feature vector extracted at the keypoint \((x_m, y_m)\). M is the total number of keypoints extracted from the shelf image. Moreover, let \(w_s\) and \(h_s\) be the width and height of \(I(x, y)\), respectively. We will use this global information in the implicit shape model and bounding box extraction steps in Sections 2.4 and 2.5, respectively.

We can illustrate the local feature representation on a sample scenario. Let’s pick the fourth object type in the planogram and shelf image given in Fig. 1(a) and (b), respectively. We provide most of the extracted SIFT keypoints, as blue circles, in Fig. 2(a) and (b), respectively. As can be seen in these figures, the extracted keypoints are located mostly on corners and edges in the images. We will use the local feature vectors extracted at these locations for matching next.

Fig. 2
figure 2

Extracted keypoints from the fourth object type and shelf images. (a) The object image

2.3 Brute force search for local feature matching

One way of finding the object of interest in the shelf image is matching extracted local feature vectors from the \(j^{th}\) object, \(\overrightarrow{k_{jl}}\) for \(l = 1, \ldots , L_j\), and shelf image, \(\overrightarrow{k_m}\) for \(m = 1, \ldots , M\). To do so, we apply brute force search between all combinations and calculate the distance between these two local feature vector groups as \(d_{m,jl} = \Vert \overrightarrow{k_m} - \overrightarrow{k_{jl}}\Vert \). Here, we benefit from different distance metrics for different local feature vector types. To be more specific, we use the Euclidean distance for SIFT and SURF since they have float values in their feature vectors. We use the Hamming distance for BRISK, ORB, and AKAZE since they have binary values in their feature vectors.

Lowe [14] proposed a ratio test to increase the robustness of the SIFT algorithm while finding the best matching local features. The aim of this test is to eliminate features that are not distinct enough even though the distance between them may be below a given threshold. We also use the ratio test to keep only strong matches not only for SIFT, but also for all considered local feature extraction methods. Therefore, we calculate the ratio as

$$\begin{aligned} \delta _{m,j} = \frac{d_{m,jl}}{d_{m,jl'}} \end{aligned}$$
(1)

where \(d_{m,jl}\) and \(d_{m,jl'}\) denote the smallest and second smallest distances between the feature vector \(\overrightarrow{k_m}\) extracted from \(I(x, y)\) and the feature vectors \(\overrightarrow{k_{jl}}\) for \(l = 1, \ldots , L_j\) extracted from \(I_j(x, y)\).

The idea in using (1) is that there should be sufficient difference between the best and the second-best matches. Hence, if \(\delta _{m,j}\) is smaller than a matching threshold \(\tau \), then we assume that \(\overrightarrow{k_m}\) and \(\overrightarrow{k_{jl}}\) matched. Setting \(\tau \) to a high value increases the number of matches. However, this also increases false matches between feature vectors. On the other hand, setting \(\tau \) to a low value keeps only strong matches. Hence, some possible matches may be missed.

Lowe suggests that setting \(\tau = 0.8\) gives the best result for SIFT. Instead of using such a constant value, we set the matching threshold as

$$\begin{aligned} \tau _\alpha = 1 - 0.15\alpha \end{aligned}$$
(2)

where \(\alpha \) is the iteration parameter (initially set to one) to be introduced in Section 3.3.
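
A minimal sketch of this matching step, assuming OpenCV's brute-force matcher, is given below; the dynamic threshold follows (2) and the ratio test follows (1).

```python
# Brute-force matching with the ratio test of (1) and the dynamic threshold of (2),
# assuming the opencv-python package.
import cv2

def match_features(desc_object, desc_shelf, method="SIFT", alpha=1.0):
    """Return the shelf-to-object matches that pass the ratio test."""
    # Float descriptors (SIFT, SURF) use the Euclidean norm; binary ones use Hamming.
    norm = cv2.NORM_L2 if method in ("SIFT", "SURF") else cv2.NORM_HAMMING
    matcher = cv2.BFMatcher(norm)
    tau = 1.0 - 0.15 * alpha  # matching threshold from (2)
    good_matches = []
    # For each shelf feature, retrieve the two closest object features.
    for pair in matcher.knnMatch(desc_shelf, desc_object, k=2):
        if len(pair) < 2:
            continue
        best, second = pair
        if second.distance > 0 and best.distance / second.distance < tau:
            good_matches.append(best)  # ratio test satisfied
    return good_matches
```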

Fig. 3
figure 3

Matched keypoints between the fourth object type and shelf images. (a) The object image

Assume that we have \(N_j\) local feature vectors from \(I(x, y)\) satisfying the threshold constraint for the \(j^{th}\) object. Hence, we will have \(\left\{ \overrightarrow{f_{jn}}\right\} \subset \left\{ \overrightarrow{f_m}\right\} \) for \(n = 1, \ldots , N_j\) and \(N_j \le M\). Each matching feature vector has a corresponding feature vector from the \(j^{th}\) object representation. Assume that the \(n^{th}\) feature vector from \(I(x, y)\) matches the \(b^{th}\) feature vector from \(I_j(x,y)\). We can represent this match as \(\overrightarrow{f_{jn}} \Leftrightarrow \overrightarrow{f_{jb}}\).

We apply the brute force search to the object and shelf images in Fig. 2. We provide the matched keypoints, as blue circles, in Fig. 3. As can be seen in the figure, the matched keypoints lie on the correct object in the shelf image most of the time. However, there are also false matches in the image. We will handle them in the next section.

2.4 Implicit shape model

As we match local features, we can use the implicit shape model (ISM) to detect the centers of matched objects in the shelf image \(I(x, y)\) [12]. We picked ISM for this operation since it allows detecting multiple objects of the same type in the image. We modified the original ISM to serve our purposes better. Our modification is in the matching phase of the method. Leibe et al. use agglomerative clustering to search for matching image patches in their original implementation. Agglomerative clustering requires multiple training objects to group similar image patches and cluster their features. Besides, it has a high computation cost. We have only one representative image for each object in our problem. Therefore, we use only local features and keypoints to create a voting matrix in ISM. This allows us to find object centers in the shelf image fairly fast.

To implement the ISM suitable for our problem, we first create an empty voting matrix \(V_j(x, y)\) for the \(j^{th}\) object for \(j = 1, \ldots , J\). Hence, we form J distinct voting matrices for the J candidate objects to be detected in the shelf image, \(I(x, y)\). Each voting matrix has the same size as the shelf image.

The match \(\overrightarrow{f_{jn}} \Leftrightarrow \overrightarrow{f_{jb}}\) can be taken as evidence for the presence of one or more objects in the shelf image. Therefore, each \(\overrightarrow{f_{jn}}\) votes for its candidate object center location. For the \(n^{th}\) matched local feature of the \(j^{th}\) object, we will have the voting coordinate as

$$\begin{aligned} \hat{x}_n &= x_n + \beta_j r_n \cos (\Theta_n) \nonumber \\ \hat{y}_n &= y_n + \beta_j r_n \sin (\Theta_n) \end{aligned}$$
(3)

where

$$\begin{aligned} r_n &= \left[ (w_j/2 - x_{jb})^2 + (h_j/2 - y_{jb})^2\right]^{1/2} \nonumber \\ \Theta_n &= \arctan \left( \frac{h_j/2 - y_{jb}}{w_j/2 - x_{jb}}\right) \end{aligned}$$
(4)

Here \(\beta_j = h_s / h_j\), where \(h_s\) is the height of \(I(x, y)\).

Based on these definitions, we can form the voting matrix for the \(j^{th}\) object as

$$\begin{aligned} V_j(x,y) = \sum \limits _{n = 1}^{N_j} \gamma _n \exp \left[ -\frac{(x-\hat{x}_n)^2 + (y-\hat{y}_n)^2}{2\sigma ^2} \right] \end{aligned}$$
(5)

where \(\gamma _n = 1 - d'_{n,jb}\), \(d'_{n,jb}\) being the min-max normalized form of \(d_{n,jb}\) [9]. \(\sigma \) is the standard deviation of the Gaussian kernel used as the voting function. We set \(\sigma =7\) based on the average size of the objects to be detected. Since we form the voting matrix for each object type separately, it represents vote values for that object type only.
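
A minimal sketch of the voting matrix formation of (3)-(5) is given below, assuming NumPy and the OpenCV match objects from the previous step; since \(r_n \cos (\Theta_n)\) and \(r_n \sin (\Theta_n)\) reduce to the horizontal and vertical offsets to the model center, the offsets are used directly.

```python
# Voting matrix formation following (3)-(5); a sketch assuming NumPy and the DMatch
# objects returned by the brute-force matching step (shelf as query, object as train).
import numpy as np

def build_voting_matrix(matches, kp_shelf, kp_object, obj_size, shelf_size, sigma=7.0):
    """Accumulate center votes for one object type; sizes are (width, height) tuples."""
    w_j, h_j = obj_size
    w_s, h_s = shelf_size
    beta = h_s / h_j                      # scale factor between shelf and model image
    votes = np.zeros((h_s, w_s), dtype=np.float32)
    if not matches:
        return votes
    distances = np.array([m.distance for m in matches], dtype=np.float32)
    # Min-max normalize distances so that stronger matches vote with larger weight.
    d_norm = (distances - distances.min()) / (np.ptp(distances) + 1e-9)
    ys, xs = np.mgrid[0:h_s, 0:w_s]
    for m, d in zip(matches, d_norm):
        x_m, y_m = kp_shelf[m.queryIdx].pt        # matched keypoint in the shelf image
        x_b, y_b = kp_object[m.trainIdx].pt       # matched keypoint in the model image
        # Scaled offset from the model keypoint to the model center, as in (3)-(4).
        x_hat = x_m + beta * (w_j / 2.0 - x_b)
        y_hat = y_m + beta * (h_j / 2.0 - y_b)
        gamma = 1.0 - d
        votes += gamma * np.exp(-((xs - x_hat) ** 2 + (ys - y_hat) ** 2) / (2 * sigma ** 2))
    return votes
```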

We next consider the matched features given in Fig. 3. We provide the voting matrix, formed for the fourth object type in Fig. 3(a), in Fig. 4. In this figure, bright locations indicate high votes. As can be seen in this figure, the votes accumulate around possible center locations of the fourth object type in the shelf image.

Fig. 4
figure 4

Voting matrix, \(V_j(x,y)\), formed for the fourth object type in the shelf image

We can hypothesize that the modes of \(V_j(x,y)\) are possible object centers for the \(j^{th}\) object in the shelf image. However, not all modes correspond to an object center. Therefore, we take a location as a valid object center when the accumulated votes there exceed a threshold value. To do so, we define a dynamic threshold as

$$\begin{aligned} \tau _v = \frac{\alpha }{2} \max \left( V_j(x,y)\right) \end{aligned}$$
(6)

where \(\alpha \) is the iteration parameter (initially set to one) to be introduced in Section 3.3. As a result, we obtain the candidate object centers from \(V_j(x,y)\) as

$$\begin{aligned} (x_{jc},y_{jc}) = \arg \max \limits _{(x,y)} \left( V_j(x,y)\right) \end{aligned}$$
(7)

such that \(V_j(x_{jc},y_{jc}) > \tau _v\).

Assume that we obtain \(C_j\) candidate object centers for the \(j^{th}\) object. Hence, we will have \((x_{jc},y_{jc})\) for \(c = 1, \ldots , C_j\) and \(j = 1, \ldots , J\) and their vote values as \(V_j(x_{jc},y_{jc})\). We will use these in extracting bounding boxes for the objects in the shelf image in the next section.
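
A minimal sketch of this candidate center extraction, assuming NumPy and SciPy, is as follows; the neighborhood size used to isolate the modes is an assumed value, not one given in the text.

```python
# Candidate object centers from the voting matrix, following (6)-(7).
import numpy as np
from scipy.ndimage import maximum_filter

def extract_candidate_centers(votes, alpha=1.0, neighborhood=15):
    """Return (x, y, vote) triples whose votes exceed the dynamic threshold in (6)."""
    tau_v = (alpha / 2.0) * votes.max()
    # A location is taken as a mode if it is the maximum within its neighborhood.
    local_max = votes == maximum_filter(votes, size=neighborhood)
    ys, xs = np.nonzero(local_max & (votes > tau_v))
    order = np.argsort(-votes[ys, xs])   # strongest candidates first
    return [(int(xs[i]), int(ys[i]), float(votes[ys[i], xs[i]])) for i in order]
```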

We apply the candidate object center extraction procedure to the voting matrix in Fig. 4. As a reminder, this voting matrix has been formed for the fourth object type, given in Fig. 3(a), in the shelf image. We provide the extracted object centers, as green dots, in Fig. 5. As can be seen in this figure, the fourth object has been successfully located by its center coordinates in the shelf image.

Fig. 5
figure 5

Extracted object centers for the fourth object type in the shelf image

2.5 Bounding box extraction

As we detect possible object center locations in the shelf image, the next step is forming a bounding box for each location. We have the bounding box representation for the object in the planogram image, and we know the size ratio of the planogram object with respect to the shelf image. Hence, we can form the bounding box for the detected object in the shelf image directly. In this implementation, the detected object on the shelf may not have a rectangular shape; it may be shifted or rotated. Therefore, the object in the shelf image may appear smaller or larger than in the planogram image. To handle this issue, we add a 20% overlap tolerance in all operations.

Fig. 6
figure 6

Formed bounding boxes for the fully and partially planogram compliant shelf images

Let’s formalize the bounding box extraction operation based on the previous definitions. We should form a bounding box for each object center, \((x_{jc},y_{jc})\) for \(c = 1, \ldots , C_j\) and \(j = 1, \ldots , J\). To do so, we can use the global information of \(I(x, y)\) and \(I_j(x, y)\) to fit a bounding box around the object center. We represent a bounding box by its top-left and bottom-right coordinates. Since the \(c^{th}\) object center is associated with the \(j^{th}\) object, we will have

$$\begin{aligned} B_{jc} = \left[ (x_{jc} - \frac{\beta _j w_j}{2}, y_{jc} - \frac{\beta _j h_j}{2}), (x_{jc} + \frac{\beta _j w_j}{2}, y_{jc} + \frac{\beta _j h_j}{2}) \right] \end{aligned}$$
(8)

where \(\beta_j\) is the normalizing factor introduced after (4). At this point, we have the bounding box, \(B_{jc}\), and vote value, \(V_j(x_{jc},y_{jc})\), for the candidate object center \((x_{jc},y_{jc})\).

Some bounding boxes may overlap. This means that there is more than one candidate object at the given location. We should eliminate weak candidates. To do so, we check the intersection over union (IoU) of all bounding boxes [8]. If the IoU value of two bounding boxes is above an overlap threshold, experimentally set as 20%, then we discard the candidate object center with the lower vote value. Hence, we perform non-maxima suppression. At the end of this operation, we detect a total of D objects, \(D \le \sum _{j = 1}^{J} C_{j}\), from the shelf image \(I(x, y)\).
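
A minimal sketch of the bounding box formation of (8) and the IoU-based non-maxima suppression described above is as follows; the 20% overlap threshold is the experimentally set value from the text.

```python
# Bounding box formation (8) and IoU-based non-maxima suppression; a plain Python sketch.
def bounding_box(center, obj_size, beta):
    """Top-left and bottom-right corners of the box around a detected center."""
    (x_c, y_c), (w_j, h_j) = center, obj_size
    return (x_c - beta * w_j / 2, y_c - beta * h_j / 2,
            x_c + beta * w_j / 2, y_c + beta * h_j / 2)

def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def non_maxima_suppression(candidates, overlap_threshold=0.20):
    """Keep the stronger of any two candidates whose boxes overlap by more than the threshold.

    Each candidate is a (box, vote) pair."""
    kept = []
    for box, vote in sorted(candidates, key=lambda c: -c[1]):
        if all(iou(box, kept_box) <= overlap_threshold for kept_box, _ in kept):
            kept.append((box, vote))
    return kept
```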

Up to now, we have explained how to form the bounding box for detected objects in the shelf image. There may also be empty spaces or undetected (unknown) objects in the image. We should analyze these locations as well. Therefore, we next search for empty spaces in the shelf image after detecting all bounding boxes corresponding to the detected objects. To do so, we model the empty space as a black box since it will be dark compared to its surroundings. Then, we apply template matching. We then merge the detected neighboring empty space blocks. As for discriminating between empty space blocks and undetected objects, we first obtain the average width of detected objects in the shelf image. Here, we assume that the undetected object will have a similar width as its neighboring objects. If the merged empty space block width is the same as or larger than this width, then we label that region as empty space. If we neither detect an object nor an empty space in a part of the shelf image, then we label that region as unknown.

We next consider the fully and partially planogram compliant shelf images in Fig. 1(b) and (c). We provide the corresponding bounding box detection results in Fig. 6(a) and (b), respectively. We draw a green bounding box for each detected object center in these images. There may be small overlapping areas in these bounding boxes. Since they did not exceed the overlap threshold, they did not pose any problems. There were undetected objects both in the fully and partially planogram compliant images. We take them as unknown and plot the corresponding bounding boxes in red. We will correct these errors by an iterative approach in the following section. There is also an empty space in the partially planogram compliant shelf image. We plot the corresponding bounding box in blue.

3 Planogram compliance control via sequence alignment and focused iterative search

The object detection results of Section 2 lead to planogram compliance control. To do so, we first represent the detection results in an abstract planogram format. Then, we align this representation with the reference planogram via a modified sequence alignment algorithm. Finally, we propose an iterative search method to focus on undetected or empty object locations in the shelf image. We explain these steps in detail next.

3.1 Planogram formation from the extracted object information

We can form an abstract planogram representation from the object detection results. In our representation, there will be spatially sorted object types, the number of objects of each type, and a bounding box for each object group. Therefore, we should first sort the detected objects in the shelf image. We can perform sorting by using the horizontal coordinate of the center point, \(x_{jc}\), of each detected object's bounding box.

The proposed planogram compliance control method can handle objects placed on top of each other. The hard constraint for planogram formation in such cases is that these objects must be of the same type. Therefore, we can sort them. There may be two scenarios for objects placed on top of each other. First, the horizontal coordinates of these objects may be different. Then, they will be sorted as such. Second, if the horizontal coordinates of the two overlapping objects are the same, then the one detected first is placed to the left of the second one.

There is also another hard constraint for planogram formation. If there are objects of the same type on a shelf, then they should be placed side by side. Therefore, we next group sorted objects of the same type and standing side by side (or on top of each other) on the shelf. This can be done by checking bounding boxes. As a result, we form a merged bounding box for the grouped objects.

Based on the operations performed in the previous paragraphs, we can represent the detected objects in the shelf image (in sorted and grouped form) as \(L_s = \left[ o_d, q_d, B_d\right] \) for \(d = 1, \ldots , E\) where \(E \le D\). Here, \(o_d\) stands for the detected object group type, \(q_d\) for the number of objects in the object group \(o_d\), and \(B_d\) for the merged bounding box of the grouped objects of type \(o_d\). We call \(L_s\) the detected planogram representation for the shelf.
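
A minimal sketch of this sorting and grouping step is given below; the detection tuples are a simplified stand-in for the output of Section 2.

```python
# Forming the detected planogram L_s = [(o_d, q_d, B_d)] from individual detections;
# each detection is an (object_type, box) pair with box = (x1, y1, x2, y2).
def form_detected_planogram(detections):
    """Sort detections left to right and merge same-type neighbors into groups."""
    # Sort by the horizontal coordinate of the bounding box center.
    detections = sorted(detections, key=lambda det: (det[1][0] + det[1][2]) / 2.0)
    planogram = []
    for obj_type, box in detections:
        if planogram and planogram[-1][0] == obj_type:
            o, q, b = planogram[-1]
            # Same type standing side by side (or stacked): grow the count, merge the boxes.
            merged = (min(b[0], box[0]), min(b[1], box[1]),
                      max(b[2], box[2]), max(b[3], box[3]))
            planogram[-1] = (o, q + 1, merged)
        else:
            planogram.append((obj_type, 1, box))
    return planogram
```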

There should also be the reference (correct or desired) planogram for the shelf. Based on the previous definitions, we can represent it as \(L_r = \left[ o_t, q_t, B_t\right] \) for \(t = 1, \ldots , T\). Here, \(o_t\), \(q_t\), and \(B_t\) stand for the group type, number of objects in the group, and bounding box, respectively.

We can form the abstract representation of the sample planogram in Fig. 1(a). To do so, we execute all the steps described up to this point. Based on these, the abstract representation of the reference planogram, \(L_r\), will be as in Table 1.

Table 1 Reference planogram \(L_r\) for the shelf

We can also form the detected planograms for the fully and partially planogram compliant shelf images in Fig. 1(b) and (c). We provide them in Tables 2 and 3, respectively. In these tables, ‘U’ and ‘E’ represent an undetected (unknown) object and an empty space, respectively. Note that both planograms are formed before applying any iterative improvement, which is the topic of Section 3.3. Therefore, we take these as the initial planograms.

Table 2 The planogram \(L_s\) for the fully planogram compliant shelf

3.2 Needleman-Wunsch

We can control planogram compliance by comparing the reference, \(L_r\), and detected, \(L_s\), planograms for a given shelf. Unfortunately, direct comparison does not work since there can be unknown, extra, or missing objects on the shelf. Moreover, we may not know their quantity and location. Therefore, we propose using the Needleman-Wunsch (NW) algorithm [17], originally introduced for aligning two DNA sequences. We modify it for our problem as explained in detail next.

The NW algorithm uses dynamic programming to align two sequences by using a score matrix. In this matrix, the columns and rows represent the two sequences to be aligned. The NW algorithm systematically fills the matrix using a substitution score and gap penalty. In other words, the substitution score and gap penalty are used to calculate the score value to be added to the score matrix for the compared element pair of the two sequences. Hence, the score matrix keeps the scores obtained by comparing all elements of the two sequences with each other.

The NW algorithm has three steps. We modified them to suit our problem as follows. Let \(L_s\) and \(L_r\) be the two planograms to be aligned. Let F be the score matrix with size \((E+1)\times (T+1)\). Here, \(L_s\) and \(L_r\) are placed horizontally and vertically to F, respectively. For our planogram compliance control problem, we can consider four possibilities for each element pair of the two compared sequences: match, mismatch/substitute, insert, and delete. Let \(g_{ins}\) and \(g_{del}\) denote the gap penalties for insert and delete, and \(s(o_d, o_t)\) the substitution score for match and mismatch/substitute. In the original NW algorithm, these have constant values. In this study, we set the gap penalties and substitution score dynamically based on their importance as \(g_{ins} = q_d\), \(g_{del} = q_t\), and

$$\begin{aligned} s(o_d, o_t) = {\left\{ \begin{array}{ll} +q_t, o_d \Leftrightarrow o_t \\ -q_t, o_d \nLeftrightarrow o_t \end{array}\right. } \end{aligned}$$
(9)

We perform this dynamic scoring based on the required and detected numbers of items on the shelf. Therefore, we increase the importance of the detected and undetected object numbers. Hence, a non-aligned group of one object and a non-aligned group of ten objects contribute different values. We had to define two different gap penalty values for the insert and delete operations. The gap penalty for the delete operation is set as the required number of objects in the reference planogram since we could not detect them. The gap penalty for the insert operation is set as the number of extra items in the detected planogram. The aim here is that the more such false objects are inserted, the higher the penalty should be.

Table 3 The planogram \(L_s\) for the partially planogram compliant shelf

Based on the modifications applied to the NW algorithm for the planogram compliance control problem, the first step in implementation is initializing the score matrix F. Hence, we should set the first row and column of F as

$$\begin{aligned} F(0,0) &= 0 \nonumber \\ F(0,t) &= F(0,t-1) - 1 \nonumber \\ F(d,0) &= F(d-1,0) - 1 \end{aligned}$$
(10)

for \(d = 1, \ldots , E\) and \(t = 1, \ldots , T\). Note that we fill the first row and column of the F matrix manually. The initial value F(0, 0) is set to 0, and the remaining entries of the first row and column take the values -1, -2, -3, and so on.

The second step in the modified NW algorithm is the definition of iterations through \(F(d, t)\). The original algorithm defines the score for each matrix entry based on the upper, left, and diagonally up-left adjacent entries. Then, the maximum of these is assigned to the current matrix entry as the score. Hence, we can fill the matrix entries for our problem using the recursive formula

$$\begin{aligned} F(d,t) = \max {\left\{ \begin{array}{ll} F(d - 1, t) - g_{ins}, \\ F(d, t - 1) - g_{del}, \\ F(d - 1, t - 1) + s(o_d, o_t) \end{array}\right. } \end{aligned}$$
(11)

The recursion ends when the calculations reach the bottom right corner of the score matrix. In other words, the recursion ends when \(d = E\) and \(t = T\).

The third step in the modified NW algorithm is tracing back the filled matrix entries. Here, we trace back the decisions we made while calculating each matrix entry. If \(F(d, t)\) is calculated from \(F(d - 1, t)\), then there is a delete from \(o_d\) according to \(o_t\). We denote this by inserting ’D’ to \(o_d\) and 0 to \(q_d\). If \(F(d, t)\) is calculated from \(F(d, t - 1)\), then there is an insert to \(o_d\) according to \(o_t\). We denote this by inserting ’A’ to \(o_t\) and 0 to \(q_t\). Finally, if \(F(d, t)\) is calculated from the diagonal neighbor \(F(d - 1, t - 1)\), then there is a match or mismatch/substitute between \(o_d\) and \(o_t\). Hence, we can move to the next matrix entry. The trace back ends when we reach the upper left corner of the score matrix. In other words, the trace back ends when \(d = 0\) and \(t = 0\).
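
A minimal sketch of the score matrix fill and trace back is given below, assuming NumPy; the planograms are reduced to (object type, quantity) pairs, and the ’D’/’A’ gap markers are placed on the side missing a counterpart, which follows the description above loosely.

```python
# Modified Needleman-Wunsch alignment following (9)-(11); a sketch assuming NumPy.
import numpy as np

def align_planograms(detected, reference):
    """Align L_s and L_r given as lists of (object_type, quantity) pairs."""
    E, T = len(detected), len(reference)
    F = np.zeros((E + 1, T + 1))
    F[0, :] = -np.arange(T + 1)   # initialization of the first row and column, as in (10)
    F[:, 0] = -np.arange(E + 1)
    for d in range(1, E + 1):
        o_d, q_d = detected[d - 1]
        for t in range(1, T + 1):
            o_t, q_t = reference[t - 1]
            s = q_t if o_d == o_t else -q_t           # substitution score of (9)
            F[d, t] = max(F[d - 1, t] - q_d,          # gap penalty g_ins = q_d
                          F[d, t - 1] - q_t,          # gap penalty g_del = q_t
                          F[d - 1, t - 1] + s)        # match or mismatch/substitute
    # Trace back from the bottom-right corner to recover the aligned group pairs.
    aligned, d, t = [], E, T
    while d > 0 or t > 0:
        if d > 0 and t > 0:
            o_d, q_d = detected[d - 1]
            o_t, q_t = reference[t - 1]
            s = q_t if o_d == o_t else -q_t
            if F[d, t] == F[d - 1, t - 1] + s:
                aligned.append((detected[d - 1], reference[t - 1]))
                d, t = d - 1, t - 1
                continue
        if d > 0 and (t == 0 or F[d, t] == F[d - 1, t] - detected[d - 1][1]):
            aligned.append((detected[d - 1], ('D', 0)))   # detected group with no reference counterpart
            d -= 1
        else:
            aligned.append((('A', 0), reference[t - 1]))  # reference group missing from the shelf
            t -= 1
    aligned.reverse()
    return aligned
```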

Now the aligned object groups \(\hat{o}_d\) and \(\hat{o}_t\) and the corresponding object counts \(\hat{q}_d\) and \(\hat{q}_t\) are at hand. We can directly compare them to locate the correctly matched items (MT), missing items at the correct location (MI), extra items at the correct location (ME), and extra or falsely placed items, or empty spaces (NM) in \(L_r\) and \(L_s\). If \(\hat{o}_d \Leftrightarrow \hat{o}_t\), we can have MT, MI, or ME. Then, we can compare \(\hat{q}_d\) and \(\hat{q}_t\). Here, if \(\hat{q}_d = \hat{q}_t\), then we will have MT. If \(\hat{q}_d < \hat{q}_t\), then we will have MI. If \(\hat{q}_d > \hat{q}_t\), then we will have ME. If \(\hat{o}_d \nLeftrightarrow \hat{o}_t\), then we will have NM. Here, the quantity comparison is not necessary. Based on these scenarios, we can define the planogram match ratio between \(L_r\) and \(L_s\) as

$$\begin{aligned} \mu _s = \frac{\sum _{d = 1}^{E} \min (\hat{q}_t,\hat{q}_d) \delta _{dt} }{\sum _{t = 1}^{T} \hat{q}_t} \end{aligned}$$
(12)

where

$$\begin{aligned} \delta _{dt} = {\left\{ \begin{array}{ll} 1, &{} \hat{o}_d \Leftrightarrow \hat{o}_t\\ 0, &{} \hat{o}_d \nLeftrightarrow \hat{o}_t \end{array}\right. } \end{aligned}$$
(13)

In (12), \(\mu _s\) can take values between 0 and 1. When \(\mu _s = 0\), all of the items on the shelf are falsely placed compared to the reference planogram. When \(\mu _s = 1\), all items on the shelf are placed in the right place and in the correct quantity. In other words, the shelf is fully compliant with the reference planogram. When \(0< \mu _s < 1\), the shelf is partially compliant with the reference planogram and some items are either missing, extra, or incorrectly placed.
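
A minimal sketch of the match ratio computation of (12)-(13), operating on the aligned pairs produced by the alignment sketch above, is as follows.

```python
# Planogram match ratio of (12)-(13); a plain Python sketch.
def match_ratio(aligned):
    """Compute mu from aligned ((o_d, q_d), (o_t, q_t)) group pairs."""
    matched = 0
    total_required = 0
    for (o_d, q_d), (o_t, q_t) in aligned:
        total_required += q_t
        if o_d == o_t:                 # delta_dt = 1 only for matching group types
            matched += min(q_d, q_t)
    return matched / total_required if total_required else 0.0
```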

We can explain the score matrix formation using the reference and fully planogram compliant shelf planograms given in Tables 1 and 2. The formed score matrix based on these will be as in Fig. 7. We can find the alignment of the reference and detected planograms by following the red arrows in the figure.

Fig. 7
figure 7

The NW score matrix for the reference and fully compliant shelf planogram alignment

Based on Fig. 7, the alignment result will be as in Table 4. In this table, GT stands for the ground truth. When we compare the aligned planograms, we obtain \(\mu =0.89\). This value is not equal to 1 (which is the desired case) since there are still objects in the shelf image we would like to detect. We will handle them in the next section.

Table 4 Alignment result of the reference and detected planograms for the fully compliant shelf

We can repeat the score matrix formation example using the reference and partially planogram compliant shelf planograms given in Tables 1 and 3. The formed score matrix will be as in Fig. 8. Again, we can find the alignment of the reference and detected planograms by following the red arrows in the figure.

Fig. 8
figure 8

The NW score matrix for the reference and partially compliant shelf planogram alignment

Based on Fig. 8, the alignment result will be as in Table 5. In this table, GT stands for the ground truth. When we compare the aligned planograms, we obtain \(\mu =0.68\). As in the previous example, \(\mu \) is not equal to the expected value 0.73 since there are still objects in the shelf image we would like to detect. We will handle them in the next section.

3.3 Focused and iterative search

The object detection method and modified NW algorithm introduced in the previous sections may not produce an exact planogram alignment result in a single iteration. In other words, some objects may be undetected or falsely detected in the planogram in one iteration. This is the case most of the time for the planogram compliance control problem since we have only one training image per object. Therefore, we propose a focused and iterative search method to improve the results.

The proposed focused and iterative search method works as follows. We repeat the object detection operation by relaxing the detection constraints and focusing on the region of interest (ROI) in the shelf image \(I(x, y)\). We define the ROI based on the locations of falsely detected, partially detected, or undetected object groups reported by the object detection and NW algorithm steps. In other words, we discard the bounding boxes of correctly matched regions from \(I(x, y)\) and focus on the remaining regions as the ROI.

Table 5 Alignment result of the reference and detected planograms for the partially compliant shelf

As we form the ROI(s), we iteratively decrease the value of \(\alpha \) as \(\alpha _{new} = 0.75 \alpha _{old}\) in (6) and (2). As we decrease \(\alpha \) in an iteration, the number of matched features increases. We decrease \(\alpha \) multiplicatively to obtain a fast decrease in the initial iterations. We picked the multiplier 0.75 such that at most 10 iterations are needed to obtain a reasonable result.

We detect new objects based on the new \(\alpha \) value at each iteration. Then, we update \(L_s\) based on the previous (existing) and new object detection results. Afterward, we apply the NW algorithm to the updated \(L_s\) and \(L_r\). We finally calculate \(\mu \) for this iteration. We repeat this operation until \(\mu = 1\) or \(\mu \) does not change for six successive iterations. As the iterations end, we have the planogram compliance control result at hand.
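
A minimal sketch of this loop is given below; detect_objects and unmatched_regions are hypothetical helpers standing for the object detection step of Section 2 and the ROI selection described above, while form_detected_planogram, align_planograms, and match_ratio refer to the earlier sketches.

```python
# Focused and iterative search loop of Section 3.3; a sketch with hypothetical helpers
# detect_objects(...) and unmatched_regions(...) standing in for Section 2 and the ROI step.
def planogram_compliance(shelf_image, reference, max_iterations=10):
    """Iteratively relax alpha, re-detect within the ROI, and re-align the planograms."""
    alpha, mu, stale = 1.0, 0.0, 0
    roi, detections, aligned = None, [], []   # first pass searches the whole shelf image
    for _ in range(max_iterations):
        detections += detect_objects(shelf_image, reference, alpha=alpha, roi=roi)
        detected = form_detected_planogram(detections)
        aligned = align_planograms([(o, q) for o, q, _ in detected],
                                   [(o, q) for o, q, _ in reference])
        mu_new = match_ratio(aligned)
        stale = stale + 1 if mu_new == mu else 0
        mu = mu_new
        if mu == 1.0 or stale >= 6:            # stopping conditions from the text
            break
        roi = unmatched_regions(shelf_image, aligned)   # focus on regions not yet matched
        alpha *= 0.75                                   # relax the thresholds in (2) and (6)
    return aligned, mu
```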

We apply the focused and iterative search method to our fully and partially planogram compliant shelf control examples in the previous section. We provide the final detected objects and corresponding bounding boxes in Fig. 9. As can be seen in Fig. 9(a), we can detect the two initially undetected rightmost objects of the fully planogram compliant shelf after the focused and iterative search step. Hence, the \(\mu \) value becomes 1 (as expected) for this case. As for the partially planogram compliant shelf image, we provide the result in Fig. 9(b). As can be seen in this image, we can detect the undetected leftmost object and the missing second object in the shelf image after the focused and iterative search step. Hence, the \(\mu \) value increases to 0.73 for this case.

Fig. 9
figure 9

Bounding box of the detected objects in the shelf images with focused and iterative search

We provide the final obtained planogram for the fully compliant shelf in Table 6. As can be seen in this table, the focused and iterative search method led to detecting all objects in the shelf image. Hence, the alignment result became the same as the ground truth (GT) as expected.

Table 6 Alignment result of the reference and detected planograms for the fully compliant shelf

We next provide the final planogram obtained for the partially compliant shelf in Table 7. As can be seen in this table, the focused and iterative search method led to detecting all known objects in the shelf image. Hence, the alignment result became the same as the ground truth (GT) as expected. Based on the iteration stopping conditions, we did not force the unknown objects at the leftmost and rightmost sections of the planogram to match an expected object in the reference planogram. In other words, the threshold values were not relaxed so far in the focused and iterative search method as to produce false results.

4 Experiments

We test the proposed method in this section. To do so, we benefit from two datasets. We first introduce them. Then, we provide the object detection performance for the proposed method on these datasets. Afterward, we provide experimental results on the planogram compliance control step. Moreover, we provide sample results of the proposed method.

4.1 Datasets used in experiments

There are several datasets available in the literature for similar problems such as object detection in retail stores, densely packed object detection, and bounding box extraction. However, they are not suitable for planogram compliance control for the following reasons. First, they do not have the category and annotation information for the objects in the database. Second, they do not provide the reference planogram for the shelves in the database. Therefore, we are left with the Grozi grocery dataset provided by George and Floerkemeier [19]. Tonioni and Di Stefano [20] manually annotated a subset of the images (consisting of 922 objects) in the Grozi dataset with item-specific bounding boxes. We use these annotations in evaluating our method. Moreover, we were able to compare our object detection performance with theirs.

Table 7 Alignment result of the reference and detected planograms for the partially compliant shelf
Table 8 Object detection performance of the proposed method without the focused and iterative search step - the first subset of the Migros dataset

We also formed our own dataset from the conventional grocery store Migros, Turkey and conducted experiments using it. Images in this dataset are acquired by a Samsung S10 Plus mobile phone with a 16 MP ultra-wide camera. Our dataset is composed of four distinct subsets. The first subset is composed of 14 shelf images to test the overall performance of the proposed method. We crop these images such that each has only one row of objects. Hence, the total number of images became 28 with 312 objects in them. We also obtained a high-resolution image of each object from the web to form the reference planogram. Then, we annotated all the objects in the acquired shelf images. The second subset is composed of 14 shelf images with 124 objects in them. Among these, 64 objects are occluded. Therefore, this subset is specifically designed to test the performance of the proposed method on occluded objects on the shelf. The third subset is composed of 15 images, acquired from five shelves with three different viewing angles, with 75 objects in them. Hence, this subset is specifically designed to test the performance of the proposed method under different viewing angles for the shelf. The fourth subset is composed of 48 objects placed on top of each other. Therefore, it is specifically designed to test the performance of the proposed method on objects placed on top of each other. We call the combination of all these subsets the Migros dataset in the following sections. We will make this dataset public in a GitHub repository if the manuscript is accepted for publication. Hence, other researchers can test and compare their methods with this study on the same basis.

4.2 Object detection performance: the general case

We focus on the object detection performance of the proposed method in this section. Here, we assess a detection as positive when the IoU between the detected and ground-truth bounding boxes is greater than 25%. We mark the result as true if the detected and ground-truth objects are the same. Otherwise, we mark it as false. We also count undetected ground-truth bounding boxes as false negatives. Then, we compute the precision, recall, and F1 score as our performance metrics.
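
A minimal sketch of this evaluation, reusing the iou helper from the bounding box sketch above, is as follows; the greedy one-to-one matching of detections to ground-truth boxes is an assumed implementation detail.

```python
# Precision, recall, and F1 score with an IoU threshold of 25%; a sketch that reuses
# the iou(...) helper defined in the bounding box sketch above.
def detection_metrics(detections, ground_truth, iou_threshold=0.25):
    """Both inputs are lists of (object_type, box); greedy one-to-one matching is assumed."""
    matched_gt = set()
    tp = fp = 0
    for obj_type, box in detections:
        hit = None
        for i, (gt_type, gt_box) in enumerate(ground_truth):
            if i not in matched_gt and gt_type == obj_type and iou(box, gt_box) > iou_threshold:
                hit = i
                break
        if hit is None:
            fp += 1                      # wrong type or no sufficiently overlapping ground truth
        else:
            matched_gt.add(hit)
            tp += 1
    fn = len(ground_truth) - len(matched_gt)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1
```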

We provide the object detection performance of the proposed method on the first subset of the Migros dataset without the focused and iterative search step as a baseline. We provide the obtained results in Table 8. As can be seen in this table, the proposed method has a precision higher than 0.9 for all feature extraction methods considered. Therefore, true detection of the correct objects is higher than 90% and false positive rates are low. The main reason for this result is that we search for specific objects in the given shelf image. Hence, an object can be detected with high precision if present in the image. All feature extraction methods considered in this study provide recall rates around or higher than 0.9. This indicates that if an object is present in the shelf image, then it can be detected. The missing objects are either occluded or have unexpected face views. The main reason for not detecting such objects is that we only have one reference image per object type in our planogram. Finally, the F1 score for all the considered feature extraction methods is around 0.90. This is a high value. Unfortunately, it was not possible to compare this value with manual object detection by a store employee. However, we believe that such a score could not be obtained manually, since the employee would have to spend time and focus on checking the correctness of the objects on the shelf. Therefore, we would expect a high recall but low precision, and hence a low F1 score.

We next provide object detection performance of the proposed method on the first subset of the Migros dataset with the focused and iterative search step. We provide the obtained results in Table 9. As can be seen in this table, compared to the results of baseline object detection method given in Table 8, our proposed focused and iterative search method achieves higher object detection performance independent of the feature extraction method considered. To be more specific, decreasing the threshold values iteratively leads to higher recall rates. Besides, the focused and iterative search method also decreases false detections in the previous steps. This increases precision values. Therefore, we can reach F1 scores around 0.98. This is a remarkable result for the given problem.

Table 9 Object detection performance of the proposed method with the focused and iterative search step - the first subset of the Migros dataset

We also tested the proposed object detection method with the focused and iterative search on the Grozi dataset. We provide a representative image from the dataset and the extracted bounding boxes for the detected objects, when SIFT is used as the local feature extraction method, in Fig. 10. In this image, one object is only half visible. Another object is placed slightly backward compared to the others. There are also objects placed on top of each other. As can be seen in Fig. 10, all the objects in the representative image are detected correctly with the proposed method.

We provide the object detection results with the focused and iterative search step on the Grozi dataset in Table 10. As can be seen in this table, the F1 scores obtained by different local feature extraction methods decrease compared to the Migros dataset. There are two main reasons for the lower F1 scores on the Grozi dataset. First, images in this dataset are blurry and have low resolution. Second, the proposed method is sensitive to the shelf height and full visibility of the shelf. Some images in the Grozi dataset violate this constraint and hence led to low performance. Even with these negative effects, the proposed method was able to produce F1 scores around 0.96.

Fig. 10
figure 10

Bounding box of the detected objects from the representative shelf image in the Grozi dataset with focused and iterative search

Table 10 Object detection performance of the proposed method with the focused and iterative search step - the Grozi dataset

We also compared the proposed method with the results obtained by Tonioni and Di Stefano [20]. To do so, we used the same settings suggested by them, such as setting the IoU threshold to 50%. Tonioni and Di Stefano reported an F1 score of 0.904 for their object detection method, which uses BRISK and the best-buddies similarity method. We obtained F1 scores of 0.922, 0.910, 0.869, 0.917, and 0.912 for SIFT, SURF, ORB, AKAZE, and BRISK, respectively. As can be seen from these results, even with their settings, the proposed method was able to provide higher F1 scores for four of the five local feature extraction methods.

4.3 Object detection performance: occluded objects

We next consider the effect of occlusion on the object detection performance of the proposed method. To do so, we use the second subset of the Migros dataset. We provide a representative image from this subset and corresponding object detection results, when SIFT is used as the local feature extraction method, in Fig. 11. As can be seen in this figure, our proposed method can detect occluded objects in the image.

We tabulate the object detection results of the proposed method on the second subset of the Migros dataset in Table 11. As can be seen in this table, ORB and AKAZE have higher F1 scores compared to the other local feature extraction methods. The possible reason for this performance is as follows. ORB and AKAZE produce more keypoints compared to the other methods. Hence, an object can still be detected even if it is occluded. SURF and BRISK have similar and average results. SIFT performs below average compared to the other methods. The main reason for this low performance is the low number of keypoints extracted from the images via SIFT.

Fig. 11
figure 11

Bounding box of the detected objects from the representative shelf image from the second subset of the Migros dataset

Table 11 Object detection performance of the proposed method - the second subset of the Migros dataset

4.4 Object detection performance: the effect of viewing angle

We next test the proposed method using shelf images from different viewing angles. To do so, we use the third subset of the Migros dataset. We provide representative images from this subset and corresponding object detection results, when SIFT is used as the local feature extraction method, in Fig. 12. As can be seen in this figure, the proposed method can detect objects with different camera placements.

We tabulate the object detection results of the proposed method, in terms of F1 score, on the third subset of the Migros dataset in Table 12. As can be seen in this table, camera placement at the top and middle shelf levels leads to higher F1 scores compared to the bottom shelf. This result is expected since, when the camera is set at the bottom shelf level, most objects are not completely visible. Therefore, the proposed method requires placing the camera at the middle or top location.

Fig. 12
figure 12

Bounding box of the detected objects from the representative shelf images from the third subset of the Migros dataset

4.5 Object detection performance: objects on top of each other

We finally consider performance of the proposed method when objects are placed on top of each other on the shelf. To do so, we use the fourth subset of the Migros dataset. We provide a representative image from this subset and corresponding object detection results, when SIFT is used as the local feature extraction method, in Fig. 13. As can be seen in this figure, the proposed method can detect objects on top of each other.

We tabulate the object detection results of the proposed method on the fourth subset of the Migros dataset in Table 13. As can be seen in this table, all the feature extraction methods, except SIFT, provide fairly good F1 scores. The low performance of SIFT is due to extra false positives. Besides, the fourth subset of the Migros dataset has limited samples, which also contributes to the low performance. On the other hand, placement of objects on top of each other requires strict organization. Therefore, the feature extraction methods generally work fairly well on them.

Table 12 Object detection performance, in terms of F1 score, of the proposed method - the third subset of the Migros dataset
Fig. 13
figure 13

Bounding box of the detected objects from the representative shelf image from the fourth subset of the Migros dataset

Table 13 Object detection performance of the proposed method - the fourth subset of the Migros dataset

4.6 Planogram compliance control performance

In this section, we consider the performance of the proposed method on the planogram compliance control step. To do so, we benefit from the first subset of the Migros dataset and the Grozi dataset. For the Migros dataset, we asked the responsible department from Migros to generate the corresponding reference planogram for the shelf images considered. For the Grozi dataset, we benefit from the method proposed by Tonioni and Di Stefano [20]. Hence, we converted their graph-based representation to our format as ground truth. Then, we provide the planogram compliance control performance on these datasets.

There is no consensus on measuring planogram compliance control performance in the literature. The closest study for this purpose is by Liu et al. [13]. Unfortunately, that study focuses on a graph theory based representation. Therefore, we provide our own measures based on precision, recall, and F1 score values. For the planogram compliance control, we calculate these values as follows. We assess an alignment result as a true positive if it is the same as the ground truth for an object group. Otherwise, we mark it as a false positive. We also assess a result as a false positive if we find an alignment result for an object group that is not in the ground truth. We assess an alignment result as a false negative if there is an object group in the ground truth but not in the alignment result.

We provide planogram compliance control performance results for the first subset of the Migros dataset in Table 14. As can be seen in this table, SIFT and AKAZE have higher F1 scores compared to other local feature extraction methods. SURF and BRISK have similar and average results. ORB performs below average compared to other methods. We can also consider Table 14 from a general perspective. The most important observation here is that the planogram compliance control performance is directly related to the object detection performance given in Table 9. This can be expected since correctly detecting objects in the shelf image directly affects the corresponding planogram formation from the shelf.

Table 14 Planogram compliance control performance of the proposed method - the first subset of the Migros dataset

We provide planogram compliance control performance results for the Grozi dataset in Table 15. As can be seen in this table, the performance results tabulated are lower than the Migros dataset. This is because of the low object detection performance of the proposed method on this dataset.

Table 15 Planogram compliance control performance of the proposed method - the Grozi dataset

We can summarize the key findings from Tables 14 and 15 for the Migros and Grozi datasets as follows. The precision values are lower than the recall values in both tables. This indicates that we have fewer false negatives. In other words, the proposed method was able to detect most of the ground truth objects in the given image. Unfortunately, we have a high number of false positives at the same time. Therefore, we have relatively low precision values in both tables.

5 Final comments

Planogram compliance control is an important problem in the retail sector. In this study, we propose a solution to this problem via pattern recognition and computer vision tools. Our aim is to detect objects in shelf images and check their placement with respect to the corresponding reference planogram at hand. This problem is slightly different from the usual object detection applications. The main reason is that we have the reference planogram, so we know where to find a target object. However, this information may be faulty for several reasons. There may be missing or extra objects on the shelf different from the ones in the planogram. Some objects may be occluded or placed backward on the shelf. Moreover, planogram compliance control requires the relative placement of the objects as well as their number on the shelf. Therefore, we proposed a novel method different from the available ones in the literature. Our novelty lies in three steps. First, we modified and enhanced the ISM for object detection and boundary extraction from the shelf image. Second, we modified the NW algorithm for planogram compliance control. Third, we introduced a focused and iterative search to improve the object detection and planogram compliance control steps. Therefore, the proposed method is based on object detection, alignment, and focused iterative search. We tested these three steps on different datasets and tabulated their performance. While doing so, we picked challenging scenarios such as occlusion, different viewing angles, and objects placed on top of each other. We also compared the proposed method with a similar one in the literature on the same dataset. We discussed the obtained results to indicate the strengths and weaknesses of our method. Based on these, we can claim that the proposed method can be used in the retail sector and smart retail stores with confidence. The next step for us is extending our method to run on low-power embedded systems. Hence, it can be used in stand-alone form in the retail sector.