Keywords

1 Introduction

One effective approach to solve critical social problems is to collect the wisdom from a crowd of participant people. However, it grows difficult to organize a large number of people to discuss in one particular place during a particular time. With the development of the Internet, online discussion forums have attracted much attention as the platform of gathering a crowd of people together to solve common problems. The core advantage of online discussion is that people can join the discussion via the Internet from different places whenever they are free. Therefore, platforms such as Climate CoLab [1], Deliberatorium [2] and Collagree [3] have been developed to encourage people to discuss in online environments. These platforms have been utilized to organize people to participant in online discussions about topics such as global climate change, law reform and city planning. In order to facilitate the discussion to proceed smoothly and achieve consensus efficiently, many platforms [2, 3] introduce human facilitators into the discussions to conduct facilitation. Facilitators can promote the development of the discussion, integrate ideas and opinions, and help the group to build consensus [4]. However, as there is no general definition of online discussion facilitation, a human bias cannot be avoided. In addition, with the increase of the participants number in a certain discussion, scale issues and schedule issues also become critical problems for human facilitators.

As a result, it becomes more and more necessary to develop online discussion facilitation support techniques to help human facilitators relieve their burdens. The challenging part of facilitation in online discussion forums is that it is a very complicated problem, which changes significantly with the discussion development. It means that plenty of information need to be considered for facilitation and it is difficult to describe the method in a number of specific rules. Existing research [5] emphasizes the importance of experiences for human facilitators, since they reuse the successful experience they had in the past to solve new similar problems.

In this paper, we propose using CBR to support online discussion facilitation. CBR is one of the famous artificial intelligence techniques that have been successfully used in real-world applications [6, 7]. In this regard, it provides an effective reasoning paradigm for solving new problems by adopting similar solutions that have been proposed for similar problems in the past [8]. This is very similar to the human facilitator thinking paradigm. Experienced human facilitators facilitate better than novice human facilitators because they have more experience that is derived from the past facilitation they have done. And experienced human facilitators are able to utilize these sorts of experience when they try to conduct facilitation in new discussion situations. Towards this end, in this paper, we propose a CBR based approach to support online discussion facilitation. When using CBR to solve a problem, the first and most important part is to find out the essential characteristics which can be used to express this problem in order to define the problem as a case. As a result, this proposed approach introduces a novel method of defining online discussion cases from their structures. In addition, a case retrieval algorithm is implemented to retrieve the most similar cases from the case base.

The rest of this paper is organized as follows. Section 2 introduces the related work of this research. In Sect. 3, the proposed CBR based online discussion facilitation support approach is introduced. Section 4 presents the experimental settings along with the experimental results.

2 Related Work

Many research efforts have been attempted in order to support group discussion facilitation. In this regard, Dickson et al. [9] explored three human-based facilitation modes in Group Decision Support System(GDSS) meeting and showed that group consensus improved in all the three modes. In addition, Anson et al. [10] showed human facilitators maintain high quality group interactions and improved group processes and greater cohesion. Also, an automated facilitation technique has been developed to support group discussion. Limayem et al. [11] showed both human-facilitated decision making groups and automated-facilitated decision making groups experienced significantly higher post-meeting consensus and perceived decision quality than non-facilitated groups in GDSS meeting. Aiken and Vanjani [12] showed that automated facilitator is better than human facilitator for simple idea generation and voting tasks. Wong and Aiken [13] showed both expert-human facilitated groups and automated facilitated groups perform significantly better than novice-human groups in electronic meetings when faced with relatively simple idea generating and ranking tasks. Derrick et al. [14] demonstrated that automated facilitation of system requirement generation is possible and showed that the agent-facilitated groups generate more complete requirements than non-facilitated groups. As shown in the above mentioned research works, just like human facilitators, automated facilitation techniques can also support group discussions.

However, there are still some problems in group-discussion facilitation. In specific, most of the group-discussion facilitation techniques can only support tasks such as agenda preparer, timekeeper, simple idea generation and voting. Therefore, it becomes difficult to use these techniques to support high level online discussion facilitation, since high level discussion facilitation, such as proper facilitation time detection and facilitation pattern decision, needs to be generated on the basis of the dynamically changing discussion situation. As a result, it is highly critical to develop novel discussion facilitation techniques to support high level facilitation in online discussion forums.

On the other hand, Gu et al. [15] proposed a CBR based online discussion facilitation support approach that is able to adapt to different discussion situations. Specifically, they proposed to use IBIS style to model the discussion structure and to define discussion cases from the quantitative perspective of the IBIS structure elements. IBIS style is based on the principle that the design process for complex problems is fundamentally a conversation amongst several stakeholders [16]. This has been used as a visual aid to help participants and facilitators to understand the discussion structure. However, in this IBIS based discussion process, a lot of inner characteristics cannot be reflected if we just consider the quantitative features. For example, we consider the similarity between Case(a) and Case(b) that are demonstrated in Fig. 1. In this situation, if we use the quantitative features to define a case base, the differences between Case(a) and Case(b) cannot be distinguished because their quantitative features are the same. Even though, they are constituted by different structures, these differences cannot be reflected because they have the same number of issues, ideas, and arguments. On the other hand, it is critical to consider the structure characteristics in the case definition because the discussion structure reflects the relationship among the discussion contents. For example, the connections among different vertexes need to be considered because they reflect the consistency in the discussion while those vertexes which are not connected cannot reflect the consistency in the discussion. As a result, it is critical to design new approaches to define the discussion case in order to reflect the structure characteristics of the discussion.

Fig. 1.
figure 1

Two IBIS style discussion cases

3 Proposed Approach

3.1 Problem Description

In this research, we aim to develop CBR based online discussion facilitation support approach that can be used to help human facilitators conduct facilitation in online discussion platforms.

In order to design a CBR based online discussion facilitation support approach, as the first step, we need to build a reasonable case base that represents the system’s experience in online discussion facilitation. For each case, the definition consists of a problem description part and a problem solution part. In this paper, we propose a novel structure perspective method to describe the online discussion facilitation problem. The solution of the problem is a result that whether facilitation is necessary to be added or not. In addition, we propose a case retrieval algorithm in order to find the most similar case from the case base.

Specifically, we consider the discussion process that is represented in issue based information structure(IBIS) style [16]. We generate IBIS structures of the discussions by using three sorts of elements which are issue, idea, and argument. Issues are defined as the questions that need to be answered during the discussions. Ideas are defined as the possible answers to the issues. Arguments contains both pros and cons. Pros are defined as the support to an issue or an idea and cons are defined as the object to an issue or an idea. Each element in the IBIS style discussion structure is represented as a vertex. Facilitations are the posts that facilitators generated to promote the discussion. Each association between two elements is represented as a directed edge. For example, discussion in Fig. 2, (a) can be represented in IBIS style as demonstrated in Fig. 2, (b). FA represents facilitation posts and argu represents argument posts.

Fig. 2.
figure 2

IBIS style discussion expression

3.2 Discussion Case Definition

Discussion case definition has been proposed from the quantitative perspective which focuses on the numeric IBIS style discussion features such as issue number, idea number and argument number [15]. However, in order to reflect more precise features of the generated IBIS style discussion graphs, the characteristics of the structure in the graph also need to be considered. One of the examples is demonstrated in Fig. 3.

Fig. 3.
figure 3

IBIS style discussion expression

As shown in Fig. 3, one of the salient characteristics in the IBIS style discussion structure is the original vertex. The original issue vertex, which equals to the root topic of the discussion, can be considered as the parent vertex of all other vertexes. All other vertexes show the discussion details which are generated to solve the original discussion issue. Considering the discussion structure, it is obvious that the depth of the vertexes in the discussion structure can reflect status information in the discussion. For example, if two issues hold the same number of ideas, the issue which has more deep ideas can be considered as better discussed than the issue which has fewer deep ideas.

In this research, we define the original vertex’s depth as 0. All other vertexes are in a depth which is more than 0. And we define the discussion case as a sort of labeled graph on the basis of the IBIS style discussion structure. Specifically, we define a labeled discussion graph as a directed graph in which each vertex and each edge is associated with one label. A labeled graph is defined by a triple G = \({<}V, r_V, r_E{>}\), where.

  • V is a finite set of vertexes.

  • \(r_V \subseteq V \times L_V\) shows the relation between vertexes and labels. Each vertex has only one related label, which can be issue, idea or argument. The situation that \(v_i\) has label \(l_{v_i}\) is represented by a tuple \((v_i, l_{v_i})\). \(r_V\) is the set of tuples \(v_i\).

  • \(r_E \subseteq V \times L_V \times V' \times L_{V'}\) shows the relation between edges and labels. One edge \((v_i, l_{v_i}, v_j, l_{v_j})\) is defined as the combination of two labeled vertexes \((v_i, l_{v_i})\) and \((v_j, l_{v_j})\). \(r_E\) is the set of quaternaries \((v_i, l_{v_i}, v_j, l_{v_j})\).

For example, one labeled IBIS style discussion case graph as demonstrated in Fig. 4(a) can be formulated by a labeled group as demonstrated in Fig. 4(b).

Fig. 4.
figure 4

IBIS style discussion expression

3.3 Discussion Case Retrieval

In this research, we calculate the similarity between two discussion cases on the basis of labeled graph similarity algorithm [18].

When we measure the similarity between two labeled IBIS style discussion graphs, we use the idea that comparing the number of features which are common to both objects, to the total number of the features [19]. In our situation, the total number of the features of a labeled IBIS style discussion graph can be represented as a triple G. When we compare the similarity of two graphs \(G_1 = {<}V_1, r_{V_1}, r_{E_1}{>}\) and \(G_2 = {<}V_2, r_{V_2}, r_{E_2}{>}\), such that \(V_1 \cap V_2 = \emptyset \), one additional thing that we need to consider is the relation that connects the two graphs. In this research, we define the relation that connects two graphs as: if two vertexes hold same sort of label and are in the same structure depth, these two vertexes are consider to be similar. Similarity between two graphs \(G_1\) and \(G_2\) can be calculated by Eq. 1.

$$\begin{aligned} sim_S(G_1, G_2) = \frac{f(descr(G_1)\cap descr(G_2))}{f(descr(G_1)\cup descr(G_2))} \end{aligned}$$
(1)

\(descr(G_1)\) and \(descr(G_2)\) are the descriptions of labeled graph \(G_1\) and \(G_2\), respectively. Each description is made up by all the labeled graph vertex features in addition to the edge features. The similarity between the two graphs, i.e., \(G_1\) and \(G_2\), is calculated by using the common features that the two graphs share divided by the set of all the two graph features.

There is a special state where new discussion case is the subgraph of more than two cases in the case base, and the number of common features of these two cases is the same. If these two cases are retrieved from the case base together, and then the algorithm retains the subbranch that includes the new discussion case and cut off other subbranches from the origin vertex. After that, the similarity is recalculated between these two cases in order to choose the most similar case.

4 Experiments

In this section, we introduce the experimental results of comparing the proposed structural perspective discussion case definition with the quantitative discussion case definition in case retrieval results.

In order to demonstrate the ability of the proposed CBR-based approach to retrieve similar discussion cases efficiently, we built a synthetic test case base that includes seven discussion cases. The quantitative information of the case base is shown in Table 1.

Table 1. Synthetic cases in test case base

We choose four sorts of quantitative information as the parameters, number of issues, number of ideas, number of arguments and idea depth. The idea depth means the depth of the deepest idea in the discussion case. And the structural information of the case base is demonstrated in Fig. 5.

Fig. 5.
figure 5

Experiment case base

In addition, we built two synthetic test cases whose information is demonstrated in Table 2 and Fig. 6.

Table 2. Synthetic test cases
Fig. 6.
figure 6

Synthetic test cases

In this experiment, firstly, we retrieve the most similar case to our test cases from the case base by using the Nearest Neighbor (NN) algorithm [17]. In NN algorithm, similarity is calculated on the basis of the euclidean distance of each feature parameter. Two cases are more similar if the euclidean distance between them is smaller. If the euclidean distance between two cases is 0, it means that the two cases are identical, i.e., the two cases hold the same number of quantitative perspective features. Secondly, we retrieve the most similar case to our test case from the case base by using the proposed labeled graph similarity algorithm we introduced in Sect. 3. Two cases are more similar if the labeled graph similarity between them is higher.

Fig. 7.
figure 7

Experiment results

Specifically, two test cases, \(Case\_t1\) and \(Case\_t2\) are designed to test our proposed approach. \(Case\_t1\), which is identical to Case1 of the test case base, is designed to test whether identical cases can be calculated as identical by our proposed approach. \(Case\_t2\), which is similar to Case5, Case6 and Case7 of the test case base, is designed to test whether structure details can be reflected by our proposed approach.

The results of the experiments are shown in Fig. 7. From these results, we can see that proposed approach can work as good as the quantitative perspective discussion case definition method. In specific, it finds the most similar case to \(Case\_t1\) and \(Case\_t2\) from the case base, which are Case1 and Case5, respectively. \(Case\_t1\) is also identified to be identical with Case1 because the labeled graph similarity is 1. In addition, for the similarity results of Case6 and Case7, differences are not reflected when using the quantitative features, as shown in Fig. 7(a) and (b). However, this sort of difference can be reflected when using the structure perspective features and the proposed algorithm, as shown in Fig. 7(c) and (d). This reflects the situation that the quantitative features are the same, but the structures are different.

5 Conclusion

In this paper, we proposed a novel CBR based online discussion facilitation support approach to support online discussion facilitation. In the proposed approach, IBIS style discussion structure format is employed to define discussion cases on the basis of their discussion structure. Labeled graph similarity algorithm is utilized to retrieve the most similar discussion cases from the case base. Experimental results demonstrated the ability of the proposed case definition method to reflect structural discussion cases differences that the quantitative features cannot reflect and the ability of the proposed case retrieval algorithm to retrieve most similar cases from the case base. One of the directions for future work is to improve the efficiency of the proposed case retrieval algorithm when handling large-scale case base. The possible solution is to consider comparing the extracted features from each case instead of comparing two graphs. Another direction of future work is to introduce more information into the case definition in order to reflect more precise discussion situation. One possible solution is to consider the semantic information of each IBIS vertex. These additional information can be added as sub-vertexes that can ensure more precise case retrieval.