1 Introduction

In image-posting social networking services (SNS), such as Instagram, users add multiple hashtags to a single post to improve the searchability of images for other users. However, in many Instagram posts, there is a mismatch between the images and the added hashtags. This mismatch can be a result of several reasons, including intentional mislabeling, where users add popular but irrelevant hashtags to attract more views, and unintentional mislabeling, where users mistakenly use incorrect hashtags or fail to accurately describe the content. Yui et al. [1] referred to such misleading posts as “clickbait.” Cho et al. [2] stated that the effective use of hashtags requires adequate and relevant hashtags, which limits the diversity of hashtags. According to Fedushko et al. [3], “hashtags should always correspond to the subject of the post.” Therefore, when posting an image, it is crucial to choose hashtags that are “highly relevant” to the post to ensure that the content is represented accurately and easily discovered by other users. Misleading or irrelevant hashtags can reduce user satisfaction and engagement, as well as poster credibility.

Several studies have proposed “highly relevant” hashtags based on data from Twitter and other sources. Takenaka et al. [4] proposed a method for selecting appropriate hashtags for tweets using Bayesian inference. Zangerle et al. [5] proposed a method that recommends hashtags based on similar tweets and the hashtags used. In this method, the similarity between tweets and hashtags on Twitter is evaluated using term frequency-inverse document frequency (TF-IDF). Godin et al. [6] developed a method that extracts topics containing the body of tweets and related URL information based on which hashtags are suggested. Kywe et al. [7] considered user preferences and tweet content to suggest hashtags. Liu et al. [8] developed a word trigger method that recommends hashtags based on a large set of hashtag-sentence pairs. Ding et al. [9] proposed a method that suggests hashtags based on the Chinese microblog Sina-Weibo by learning a topic model.

In a previous study, the authors proposed a method for predicting the relevance of Instagram hashtags and posts [10]. First, an object detection algorithm is used to obtain the label of an object that appears in an image included in an image-posting SNS. Then, an Instagram hashtag search is performed using the object label as the hashtag to retrieve the posts. The comments are also extracted from the obtained posts and hashtags. Using the hashtags, variables, including reverse co-occurrence count, reverse co-occurrence ranking, and similarity between comments, are defined. These variables are then used to create a hashtag-post relevance prediction model. By extracting hashtags that co-occur with object labels from the retrieved posts, many hashtags that are highly related to the object labels can be obtained. Ichau et al. [11] also discovered themes related to a specific hashtag by performing co-occurrence network analysis on 1500 Instagram posts containing specific hashtags.

However, the algorithms proposed in the above-mentioned studies cannot obtain “highly relevant” hashtags for a post from a label referring to different objects, such as “mouse,” which can refer to “an animal” or “a computer input device.” Among the 91 categories in the Microsoft Common Objects in Context (COCO) dataset, there are five such labels:“spoon,” “bear,” “mouse,” “keyboard,” and “train.” In this case, hashtags related to any meaning can be retrieved from Instagram, resulting in ambiguity and poor search accuracy.

Accurate hashtag recommendations are crucial for improving the searchability of images on SNSs, like Instagram. Without relevant hashtags, it would be difficult for users to find the images they are looking for, which reduces user satisfaction and engagement. To address this search problem, we developed three algorithms that enhance the relevance of recommended hashtags.

Algorithm 1 uses the same object category from the Microsoft COCO image dataset as additional labels to retrieve the label of an object contained in a post. Algorithm 2 uses the word translated into six other languages as additional labels, and Algorithm 3 is a hybrid of Algorithms 1 and 2. The proposed algorithms disambiguate labels with multiple meanings and provide more accurate hashtag recommendations.

The remainder of this paper is organized as follows: Sect. 2 introduces the algorithm proposed in a previous study, and Sect. 3 introduces the proposed algorithms. Section 4 describes the experiments, and Sect. 5 discusses the results. Section 6 concludes the paper.

2 Existing Algorithms

Herein, we present the algorithms proposed in previous studies for comparison.

2.1 Overview of the Existing Hashtag Recommendation System

Figure 1 shows a schematic of the algorithm proposed in a previous study [10]. The terms “co-occurrence counts,” “reverse co-occurrence counts,” “reverse co-occurrence ranking values,” and “similarity between comments” in the procedure are explained in the section 2.2. The numbers on the diagram correspond to the order numbers of the algorithm process. The algorithm functions as follows:

Fig. 1
figure 1

Schematic of the hashtag recommendation algorithm proposed in a previous study [10]. The numbers in this figure correspond to the process descriptions

  1. 1.

    A user inputs the image.

  2. 2.

    The object detection algorithm YOLO [12] is applied to the image to obtain the labels of the object L contained in the image. In cases where the algorithm uses the Microsoft COCO dataset [13], 91 object labels are considered as candidates for L.

  3. 3.

    Instagram posts are searched using L as a hashtag.

  4. 4.

    Let P be a set of posts obtained by hashtag search.

  5. 5.

    Let C be the set of all comments in P.

  6. 6.

    Let H be the set of hashtags attached to C.

  7. 7.

    Elements of H are sorted in decreasing order of co-occurrence counts. The variable \(H = \{h_j\} = \{h_1, h_2, \ldots , h_X\}\) denotes the top X hashtags in the H hashtag ranking, excluding the object label L.

  8. 8.

    Instagram posts are searched using H as a hashtag.

  9. 9.

    The number of reverse co-occurrence counts, reverse co-occurrence ranking, and similarity between comments [10] with L for each H are calculated using posts retrieved in the previous process.

  10. 10.

    A user questionnaire is conducted to collect the relevance of H to L.

  11. 11.

    A post’s relevance prediction model is created using a random forest with relevance as the correct answer value and reverse co-occurrence count, reverse co-occurrence ranking, and similarity between comments as the explanatory variables (these are represented by dotted lines in Fig. 1).

  12. 12.

    Using these variables, a hashtag-post relevance prediction model is applied to H, and hashtags that are considered “highly relevant” to the post are decided.

  13. 13.

    Finally, hashtags considered “highly relevant” to the post are recommended to the user as suggested hashtags.

2.2 Terms Used in the Algorithm

In this section, we explain the terms “co-occurrence count,” “reverse co-occurrence count,” “reverse co-occurrence ranking value,” and “similarity between comments” used in the above-mentioned algorithm.

Fig. 2
figure 2

Co-occurrence count ranking for the hashtag “mouse”

2.2.1 Co-occurrence Count

In the hashtag H included in Fig. 1, the number of occurrences of each hashtag is the co-occurrence count, and its ranking is the co-occurrence count ranking.

For example, the top 10 co-occurrence counts \(h_1, h_2, \ldots , h_{10}\) obtained using the label “mouse” are shown in Fig. 2. The horizontal and vertical axes represent the number of occurrences and co-occurrence counts of the hashtag, respectively. The top three hashtags based on the co-occurrence counts for the hashtag “mouse” are \(h_1\)=“#gaming”, \(h_2\)=“#cute”, and \(h_3\)=“#mousepad.”

2.2.2 Reverse Co-occurrence Count

For each hashtag H with a label L in Fig. 1, hashtags are retrieved, and co-occurrence ranking is established. The number of occurrences of label L in the ranking is the reverse co-occurrence count of the hashtag.

2.2.3 Reverse Co-occurrence Ranking Value

For each H with a label L in Fig. 1, hashtags are retrieved, and a co-occurrence ranking is established. The rank of label L in the ranking is the reverse co-occurrence ranking value.

2.2.4 Similarity Between Comments

The label L is used to retrieve comments C in Fig. 1, and the label H is used to retrieve comments \(C' = \{C_j\} = \{C_1, C_2, \ldots , C_X\}\).

Let N be the number of words that appear inC or \(C'\); all words are then represented as \(\{t_i\} = \{t_1, t_2, \ldots , t_N\}\). Where \(n_{i,j}\) is the number of occurrences of the word \(t_i\) in the comments \(C_j\), \(\Sigma _k n_{k,j}\) is the sum of the occurrences of all words in the comments \(C_j\), |C| is the number of comments, and \(|\{C:C\ni t_{i}\}|\) is the number of comments containing the word \(t_{i}\), the TF-IDF of the word \(t_i\) in each comment is calculated as follows:

$$\begin{aligned} \text{ tf }-\text{ idf}_{i,j}= & {} \text{ tf}_{i,j} \cdot \text{ idf}_{i} \end{aligned}$$
(1)
$$\begin{aligned} \text{ tf}_{i,j}= & {} \frac{n_{i, j}}{\sum _{k} n_{k, j}} \end{aligned}$$
(2)
$$\begin{aligned} \text{ idf}_{i}= & {} \log \frac{|C|}{\left| \left\{ C: C \ni t_{i}\right\} \right| } \end{aligned}$$
(3)

Finally, if the TF-IDF values of the word vector for the comments C and \(C_j\) are given as a and b, respectively, their cosine, which is the similarity between comments of \(h_j\) to L is calculated as follows:

$$\begin{aligned} {a}= & {} \left\{ a_{1}, a_{2}, \ldots , a_{N}\right\} \end{aligned}$$
(4)
$$\begin{aligned} {b}= & {} \left\{ b_{1}, b_{2}, \ldots , b_{N}\right\} \end{aligned}$$
(5)
$$\begin{aligned} \cos \theta= & {} \frac{\sum _{k=1}^N a_k b_k}{\sqrt{\sum _{k=1}^N a_k^2} \sqrt{\sum _{k=1}^N b_k^2}} \end{aligned}$$
(6)

3 Proposed Algorithm

In the bove-decribed algorithm, when the object label L is a word referring to different objects, the hashtag H often includes hashtags that are less relevant to L. For example, if the object label L is “mouse”, the hashtags H includes the hashtags related to not only “mouse” as a computer input device but also an animal. Herein, we propose three algorithms to solve this problem.

3.1 Algorithm 1

Algorithm 1 employs labels belonging to the same object category in the Microsoft COCO dataset. There are 91 object labels in the Microsoft COCO dataset, which belong to the following categories: Person, Accessory, Animal, Vehicle, Outdoor Objects, Sports, Kitchenware, Food, Furniture, Appliance, Electronics, and Indoor objects.

Algorithm 1 collects submissions for the object label L using labels that belong to the same category. For example, in the category “Electronics,” there are six category labels: “tv,” “laptop,” “mouse,” “remote,” “keyboard,” and “cellphone.” The five category names “tv,” “laptop,” “remote,” “keyboard,” and “cellphone” are used to create a co-occurrence count ranking for “mouse.” Figure 3 shows a schematic of Algorithm 1. The part processes indicated by bold lines differ from the algorithm depicted in Fig. 1.

Fig. 3
figure 3

Schematic of Algorithm 1. Parts of the algorithm that differ from those in Fig. 1 are indicated by bold lines. The numbers in this figure correspond to the process descriptions

The process of Algorithm 1 is summarized as follows. The numbers in the diagram correspond to the numbers of the algorithm process.

  1. 1.

    A user inputs the image.

  2. 2.

    The object detection algorithm YOLO [12] is applied to the input image to obtain the label L of the object contained in the image. s

  3. 3.

    A label \(L' = \{l_1, l_2, \ldots , l_m\}\) that belongs to the same category as label L in the Microsoft COCO dataset is obtained. Here, m is the number of labels excluding L that are in the same category as L.

  4. 4.

    Instagram posts are searched using L and \(L'\) as hashtags.

  5. 5.

    Let \(P_1\) be the set of posts in which \(L'\) is included as a hashtag in the comments on posts and \(P_2\) be the set of posts in which L is included as a hashtag in the comments of posts.

  6. 6.

    Let \(C_1\) and \(C_2\) be the sets of comments obtained from \(P_1\) and \(P_2\), respectively.

  7. 7.

    After removing duplicates of \(C_1\) and \(C_2\), H is the set of all hashtags attached to the comments.

  8. 8.

    The elements of H are sorted in decreasing order of co-occurrence counts. Variable \(H = \{h_j\} = \{h_1, h_2, \ldots , h_X\}\) denote the top X hashtags in the H hashtag ranking, excluding the object label L.

  9. 9.

    Instagram posts are searched using the hashtag H.

  10. 10.

    The reverse co-occurrence count, reverse co-occurrence ranking, and similarity between comments with L for each H are calculated using the posts retrieved in the previous step.

  11. 11.

    Using these variables, a hashtag-post relevance prediction model is applied to H, and hashtags considered “highly relevant” to the post are decided.

  12. 12.

    The “highly relevant” hashtags are recommended to the user as suggested hashtags.

3.2 Algorithm 2

Algorithm 2 employs a translation term for each label.

The labels of the Microsoft COCO dataset that are considered candidates for L are in the English language. Algorithm 2 uses the translated word for L into a language other than English as a candidate for \(L'\). The target languages based on the number of language users in the world and the number of Instagram users are as follows.

  • Hindi

  • Portuguese

  • Indonesian

  • Russian

  • Turkish

  • Japanese.

For example, when the object label L is “spoon”, the candidates for \(L''\) are translated into in Hindi, “colher” in Portuguese, “sendok” in Indonesian, “ложка” in Russian, “kaşık” in Turkish and “スプーン” in Japanese.

Figure 4 shows a schematic of Algorithm 2. The parts indicated by bold lines differ from those of the algorithm depicted in Fig. 1.

Fig. 4
figure 4

Schematic of Algorithm 2. The parts of the algorithm that differ from those in the algorithm depicted in Fig. 1 are indicated with bold lines. The numbers in this figure correspond to the process descriptions of the algorithm

The process of Algorithm 2 is summarized as follows. The numbers in the diagram correspond to the numbers in the algorithm process.

  1. 1.

    A user inputs the image.

  2. 2.

    The object detection algorithm YOLO [12] is applied to the input image to obtain the label L of the object contained in the image.

  3. 3.

    Let \(L' =\{l_1, l_2, \ldots , l_6\}\) be the six translations of label L.

  4. 4.

    Instagram posts are searched using L and \(L'\) as a hashtag.

  5. 5.

    Let \(P_1\) be the set of posts on Instagram obtained using the label L as a hashtag and \(P_2\) denote the set of Instagram posts obtained using the label \(L'\) as the hashtag.

  6. 6.

    Let \(C_1\) and \(C_2\) be the sets of comments obtained from \(P_1\) and \(P_2\), respectively.

  7. 7.

    Let \(H_1\) and \(H_2\) be the sets of all hashtags attached to \(C_1\) and \(C_2\), respectively.

  8. 8.

    A set of \(H_2\) hashtags in \(H_1\) is extracted and denoted as H.

  9. 9.

    The elements of H are sorted in decreasing order of co-occurrence counts. Variable \(H = \{h_j\} = \{h_1, h_2, \ldots , h_X\}\) denote the top X hashtags in the H hashtag ranking, excluding the object label L.

  10. 10.

    Posts on Instagram are searched using the hashtag H.

  11. 11.

    The number of reverse co-occurrence counts, reverse co-occurrence ranking, and similarity between comments with L for each H are calculated using the posts retrieved in the previous step.

  12. 12.

    Using these variables, a hashtag-post relevance prediction model is applied to H, and hashtags considered “highly relevant” to the post are decided.

  13. 13.

    Finally, the “highly relevant” hashtags are recommended to the user as suggested hashtags.

3.3 Algorithm 3

This section describes the proposed Algorithm 3, which is a hybrid of Algorithms 1 and 2. Algorithm 3 leverages the strengths of Algorithms 1 and 2 to improve the relevance of suggested hashtags using both category- and translation-based approaches.

Fig. 5
figure 5

Schematic of Algorithm 3. The top and bottom parts show the processes of Algorithms 1 and 2, respectively. The final output combines the top X/2 hashtags from each algorithm to form the final set of X hashtags

Figure 5 shows a schematic of Algorithm 3. The process of the algorithm is summarized as follows. Here, X is only assumed to be an even number.

  1. 1.

    A user inputs the image.

  2. 2.

    The object detection algorithm YOLO [12] is applied to the input image to obtain a label L for the object in the image.

  3. 3.

    The label L is used to run Algorithm 1 to obtain \(H = \{h_j\} = \{h_1, h_2, \ldots , h_{X/2}\}\), which is the top X/2 hashtags in the H hashtag ranking, excluding the object label L.

  4. 4.

    The label L is used to run Algorithm 2 to obtain \(H = \{h_j\} = \{h_1, h_2, \ldots , h_{X/2}\}\), which is the top X/2 hashtags in the H hashtag ranking, excluding the object label L.

  5. 5.

    The top X/2 hashtags from each algorithm are combined to create a set of X hashtags.

4 Experiment

This study aims to obtain a set of hashtags that are “highly relevant” to a post from a label that refers to different objects. Therefore, the co-occurrence frequency ranking of the algorithm proposed in a previous study and that of Algorithms 1, 2, and 3 were used to obtain the top X hashtags H, excluding object labels L, and the relevance value, which represents the relevance between hashtags and posts for compoarison.

4.1 Determination of the Label L

To validate the effectiveness of the proposed algorithms, we assume that object labels are pre-determined by object recognition using YOLO.

First, a hashtag search is conducted on Instagram using each of the 91 Microsoft COCO dataset categories as a label. After checking the top 30 new posts and finding at least 10 posts that refer to different objects in the Microsoft COCO dataset categories, the labels “spoon”, “bear”, “mouse”, “keyboard”, and “train” are selected as the labels L to be used in the experiment. Different objects used on Instagram and the meaning of each label L in object detection based on the Microsoft COCO dataset, which is the correct value in this study, are listed in Table 1.

Table 1 Meanings of the labels “spoon”, “bear”, “mouse”, “keyboard”, and “train” used on Instagram and their labels recognized by YOLO based on the Microsoft COCO dataset

4.2 User Questionnaire

Herein, we discuss the relevance of the label L and the top 30 hashtags \(H = \{h_1, h_2, \ldots , h_{30}\}\) in terms of co-occurrence frequency for the label L to validate the proposed algorithm.

We administered a questionnaire to 15 subjects. For each of the five labels “spoon”, “bear”, “mouse”, “keyboard”, and “train,” the four algorithms were used to obtain the top 30 hashtags with the highest number of co-occurrences. For each of the 450 hashtags obtained, subjects were asked to rate the relevance between the label L (object recognition label from the Microsoft COCO dataset in Table 1) and the hashtags in three levels (high, medium, and low). Let \(R_j\) be the number of subjects who selected “high” for hashtag \(h_j\), \(S_j\) the number of subjects who selected “medium”, and \(T_j\) the number of subjects who selected “low.” The subjects were asked to choose “unknown” when it is difficult for them to determine the relevance of the hashtag.

The responses “high”, “medium”, and “low” or “unknown” were assigned values of 1, 0.5 and 0, respectively. The relevance \(r_j\) of the hashtags was calculated as follows.

$$\begin{aligned} r_j= (R_j+0.5 \cdot S_j) / 15 \end{aligned}$$
(7)

4.3 Results

For the five labels, “spoon”, “bear”, “mouse”, “keyboard”, and “train”, the results of the questionnaire for a total of 150 hashtags in the top 30 co-occurrences obtained using the algorithms were compared.

The average relevance values for the labels of hashtags with the top 10 and 30 co-occurrence counts are listed in Tables 2 and 3, respectively. For each hashtag, the largest relevance value is written in bold, and the second largest value is underlined.

Table 2 Average relevance of postings to hashtags with the top 10 co-occurrence counts obtained for the label L using the previous and proposed algorithms
Table 3 Average relevance of postings to hashtags with the top 30 co-occurrence counts hashtags obtained for the label L using the previous and the proposed algorithms

Compared to the previous algorithm, the relevance of the hashtags increased by 59% for Algorithm 1 (from 0.29 to 0.46) and 45% for Algorithm 2 (from 0.29 to 0.42), as shown in Table 2. Except for the word “keyboard", Algorithm 3 shows the second-best results, indicating that the hybrid algorithm outperforms the individual algorithms in predicting hashtags regardless of the word.

Similarly, for hashtags with the top 30 co-occurrence counts (Table 3) compared with the preivious algorithm, the average relevance of the hashtags predicted by Algorithms 1 and 2 increased by 18% (from 0.28 to 0.33) and 64% (from 0.28 to 0.46), respectively. Except for the words “mouse" and “keyboard", Algorithm 3 exhibited the second-best results, indicating that Algorithm 3 outperforms Algorithms 1 or 2 in predicting hashtags regardless of the word.

For the labels “spoon,” “bear,” and “mouse,” the relevance values obtained using Algorithm 1 in both cases of top 10 and 30 co-occurrence counts were higher than those obtained by the previous algorithm. In contrast, the relevance value obtained for the label “train” using Algorithm 2 was significantly higher than that obtained by the previous algorithm. However, the values for the labels “mouse” and “keyboard” in the top 10 co-occurrence counts and for “bear”, “mouse”, and “keyboard” in the top 30 co-occurrence counts were lower than those of the previous algorithm. For the label “spoon,” the relevance values obtained by Algorithms 1 and 2 were significantly higher than those obtained by the previous algorithm.

Algorithm 3 exhibited minimum relevance values of 0.33 and 0.27 for the hashtags with the top 10 and 30 co-occurrence counts, respectively. For all labels, Algorithm 3 exhibited no significantly low values.

Tables 4 and 5 list the relevance values for the top 10 hashtags in terms of co-occurrence counts for the labels “spoon” and “train.”

Table 4 Top 10 hashtags in terms of co-occurrence counts and relevance value for “spoon” using the previous and proposed algorithms
Table 5 Top 10 hashtags in terms of co-occurrence counts and relevance value for “train” using the existing and proposed algorithm

5 Discussion

Tables 6 and 7 summarize the results of Algorithm 1 and 2 for each label L.

Table 6 Discussion of each label in Algorithm 1

5.1 Spoon

For the label “spoon,” the top 10 hashtags obtained by the algorithm proposed in the previous study in terms of co-occurrence counts mostly represent car models or manufacturers, as shown in Table 1. However, the ranking established by Algorithms 1 and 2 includes many hashtags related to tools for eating or cooking, which correspond to “labels in the Microsoft COCO dataset” in Table 1. Algorithm 1 uses labels in the same object category in the Microsoft COCO dataset. Therefore, hashtags related to the category Kitchenware, such as “#fork,” “#knife,” and “#kitchen,” were obtained, as shown in Table 4. Alternatively, Algorithm 2 uses a translation term; thus, it retrieved hashtags that describe more specialized “spoon,” such as “#handmade,” “#woodenspoon,” and “#spooncarving,” as shown in Table 4.

Table 7 Discussion for each label in Algorithm 2

5.2 Train

For the label “train,” the top 10 hashtags predicted by the previous algorithm and Algorithm 1 are mostly related to “muscle training,” as shown in Table 1.

The average relevance of hashtags predicted by Algorithm 1 decreased, whereas that of hashtags predicted by Algorithm 2 increased significantly compared to the value for the previous algorithm. The decrease in the average relevance of the hashtags predicted by Algorithm 1 can be attributed to the high number of Instagram posts with the hashtag “#car” and hashtags related to “muscle training.” Algorithm 2, which uses “train” translated into various languages, did not consider “car” as a label. Thus, it could suggest more hashtags related to “train.”

5.3 Mouse

The average relevance of the hashtags predicted by Algorithm 2 for the label “mouse” was significantly lower than that of the previous algorithm, which is primarily because the translations also have the meanings of both “computer input device” and “animal.” Algorithm 1 overcomes this challenge by using labels that belong to the same object category “Electronics” in the Microsoft COCO dataset. Therefore, the average relevance of the predicted hashtags was significantly high.

5.4 Keyboard

The music-related hashtags predicted by the previous algorithm disappeared in Algorithm 1. However, there were no significant changes in the average relevance of the predicted hashtags because the relevance of printer-related hashtags was ranked as “low” by the subjects. In Algorithm 2, similar to the case of the label “mouse,” music-related hashtags were also retrieved because the translations have the meanings of both “computer input device” and “piano keyboard.”

5.5 Bear

The average relevance of the hashtags predicted by Algorithm 2 for “bear” was low for the top 30 cooccurrences because the Portuguese word “urso” means not only “bear” but also “male.” In contrast, the hashtags predicted by Algorithm 1 exhibited higher average relevance values due to the numerous hashtags related to “panda.”

6 Conclusion

Users of image-posting SNSs, such as Instagram, add multiple hashtags to a single post to improve image searchability by other users. However, there is a mismatch between several Instagram posts and the added hashtags. Accurate and relevant hashtags are essential for the effective use of such social media platforms; thus, there is a need to predict “highly relevant” hashtags.

An algorithm proposed in previous studies could not obtain a “high relevance” co-occurrence count ranking for labels like “mouse” due to its multiple meanings (e.g., an “animal” and a “computer input device”). Herein, we propose three algorithms to address this issue.

Algorithm 1 retrieves posts using labels from the same category in the Microsoft COCO dataset and determines hashtags from posts where such labels coexist as hashtags. Algorithm 2 determines hashtags by translating labels into six different languages and retrieving relevant posts. Algorithm 3 is a hybrid of Algorithms 1 and 2.

A questionnaire was administered to 15 subjects to evaluate the proposed algorithms in terms of predicting hashtags for five labels: “spoon,” “bear,” “mouse,” “keyboard,” and “train.” The average relevance of hashtags predicted for “spoon,” “bear,” and “mouse” using Algorithm 1 was higher than that of hashtags predicted using the algorithm reported in a previous study. The relevance of hashtags predicted by Algorithm 2 for “train” was high; however, that for “mouse” and “keyboard” was low, which can be attributed to translation ambiguities. Algorithm 3 exhibited no extremely low values for all labels.

Overall, the proposed algorithms predicted hashtags with improved relevance values for different labels. Considering the top 10 and 30 hashtags based on the co-occurrence counts, compared with the algorithm proposed in a previous study, the relevance of the hashtags predicted by Algorithm 1 increased by 59% and 18%, and that for the hashtags predicted by Algorithm 2 increased 45% and 64%, respectively. These results confirm the effectiveness of the proposed algorithms in suggesting hashtags for labels with multiple meanings. In future studies, the accuracy of the hashtag recommendation model using the Human-in-the-Loop method [14] can be improved by considering users’ opinions.