Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

There is growing interest in the computational linguistics community in identifying arguments and relations between them in natural language (see for example [13]). Consider also the First ACL Workshop on Argument Mining held in 2014 [4], and the IBM Debating Technologies being developed by IBM Research for extracting arguments from sources such as Wikipedia [5, 6].

An interesting challenge that is arising from attempts to mine arguments (seen as reasons about claims) is the choice of target formalism for representing the extracted arguments. In computational models of argument, abstract argumentation (as proposed by Dung [7]) and logical or structured argumentation (as proposed in [811]) are the two key options. Neither is ideal as a target formalism as we outline below.

Abstract Argumentation: Each argument is atomic. Thus, as a target language, it is quite weak since there is no formal differentiation of reason and claim. A ramification of this is that there can be no recursive embedding (e.g. reasons for the claim “x is a reason for claim y” cannot be represented in abstract argumentation). Also, it does not allow for Boolean combinations of reasons like “x or y is a reason for z”. In addition, it does not differentiate between i) “x being a reason for not claiming y” and ii) “x not being a reason for claiming y”. Introducing a support relation partly addresses some shortcomings of abstract argumentation, but not the ones mentioned above. Note that the notion of a supporting argument is not necessarily the same as the notion of a reason.

Logical Argumentation: Each argument is a set of formulae entailing a claim. As a target language, its expressiveness does not fit with the above needs. Firstly, all the logical formulae for the premises and claims need to be identified. It is currently not feasible to aim for translating argumentation in text and dialogues into logical formulae such that all the premises are represented explicitly and the claim follows logically from the premises. One great challenge for this is that most arguments in text and dialogues are enthymemes. That is, some or all of the premises are implicit, and even the claim may be implicit. For more on computational modelling of enthymemes, see [12]. Logical argumentation is also insufficient because it does not capture embedded relationships (like “(x is not a reason for y) is a reason for z”) nor does it explicitly capture “x is not a reason for claim y” or “x is a reason for not claiming y”. So, as a target language, logical argumentation captures details that cannot be extracted in the short term by argument mining and it lacks some features potentially valuable for argument mining.

Our proposal here is partly motivated by the need for a better formalism for representing argumentation coming from natural language. There is a second motivation. Given a set of arguments and counterarguments represented in our target language, we can infer further arguments and counterarguments. This is not possible with abstract argument since it has no inference machinery over argument graphs. This is also not possible directly with logical argumentation. Given logical arguments \(A_1\) and \(A_2\), we cannot infer a new logical argument \(A_3\) (other than by extracting the support in each argument and using that as a knowledge base for generating further logical arguments).

To address these, we propose a formal language for representing reasons and claims, as well as a framework for inferring with arguments and counterarguments in this language. The implied arguments will be deconstructions, e.g. obtained by flattening the recursive structure in an argument, and constructions obtained by combining arguments. This framework is flexible in that different choices for inference rules can be made.

2 Formal Syntax

We now present our formalism for representing arguments, inspired by Apothéloz [13]. The formalism is built upon a propositional language \({\mathbb {L}}\) with the connectives \(\lnot \), \(\vee \), \(\wedge \), \(\rightarrow \), \(\leftrightarrow \). There are also two operators \({\mathcal {R}}(.)\) and \({\mathcal {C}}(.)\) and an additional negation \(-\). Thus, two negation operators are needed: \(\lnot \) for denying propositional formulas (\(\lnot x\) denotes that x is false), and \(-\) for denying \({\mathcal {R}}(.)\) and \({\mathcal {C}}(.)\). Please note that \(\lnot \lnot x\) is identified with x and \(--{\mathcal {R}}(.)\) is identified with \({\mathcal {R}}(.)\) (similarly, \(--{\mathcal {C}}(.)\) is identified with \({\mathcal {C}}(.)\)).

Definition 1

(Formula). The set of formulas \(L\!_A{}\) is the smallest set such that a formula is of the form \((-){\mathcal {R}}(y):(-){\mathcal {C}}(x)\) where each of x and y is either a formula of \({\mathbb {L}}\) or is itself a formula.

Each formula is either an argument or a rejection of an argument, to be defined next.

Definition 2

(Argument). An argument is a formula of \(L\!_A{}\) of the form \({\mathcal {R}}(y):(-){\mathcal {C}}(x)\).

An argument is a reason for concluding a claim. It has two main parts: premises (the reason) and a conclusion. The functions \({\mathcal {R}}\) and \({\mathcal {C}}\) respectively play the roles of giving reason and concluding. Indeed, an argument is interpreted as follows: its conclusion holds because it follows, according to a given notion, from the premises. The notion refers to the nature of the link between them (e.g., the premises imply the conclusion), formally identified by the colon in the definition. However, the contents may be true while the functions do not hold and vice versa. The intuitive reading is as follows:

The first kind of expression captures two forms of argument about a proposition x: One in which x is supported (\({\mathcal {R}}(y):{\mathcal {C}}(x)\)) and one in which its negation is supported (\({\mathcal {R}}(y):{\mathcal {C}}(\lnot x)\)). As to the expression \({\mathcal {R}}(y):-{\mathcal {C}}(x)\), it encompasses two cases:

figure a

Example 1

In the dialogue next, Mary’s argument supports \(\lnot fe\), a claim opposing Paul’s but John’s argument is only meant to stop concluding fe without committing to \(\lnot fe\).

figure b

Unlike existing definitions of argument where a conclusion x follows from premises y using a notion of derivation [8], Definition 2 does not make the link explicit. We aim at a general definition that allows us to represent any argument in text or dialogue, e.g. enthymemes [12], without judging whether it is a good argument. Also, reasons can be hypothetical (assumed for the purpose of the argument). And our approach is meant to capture links of whatever nature including non-deductive links as in the causal argument

figure c

or abductive links as in the following argument:

figure d

So far, the negation operator “\(-\)” has been used to deny the concluding function. Now, the function of giving reason can be denied as well by placing “\(-\)” in front of \({\mathcal {R}}\). What is denied in this case is not the premises but rather the idea that the premises justify the conclusion of the argument. Such a form is called rejection of argument.

Definition 3

(Rejection). A rejection of an argument is a formula of \(L\!_A{}\) of the form \(-{\mathcal {R}}(y):(-){\mathcal {C}}(x)\).

The intuitive meaning for these formal expressions is as follows:

Example 2

Consider the following dialogue.

figure e

There can be many reasons for rejecting an argument \({\mathcal {R}}(y):{\mathcal {C}}(x)\) as illustrated next:

figure f

Forms 1-3 of rejection amount to blocking the conclusion of the targeted argument. They give rise to arguments whose reason justifies rejecting the targeted argument and whose conclusion inhibits the conclusion of the targeted argument: Eric’s rejection leads to \({\mathcal {R}}(-{\mathcal {R}}(sm):-{\mathcal {C}}(fe)):{\mathcal {C}}(fe)\) and John’s to \({\mathcal {R}}(-{\mathcal {R}}(\lnot wh):{\mathcal {C}}(fe)):-{\mathcal {C}}(fe)\). As to form 4, the reason and conclusion of the targeted argument are acknowledged. Such a rejection is less of a counterargument than a better argument for the conclusion.

No other logic-based approach to modelling argumentation provides a language for expressing rejection of arguments in the object language. This gives a more appropriate encoding of situations and allows us to differentiate between say \(-{\mathcal {R}}(cr):{\mathcal {C}}(bc)\) and \({\mathcal {R}}(cr):-{\mathcal {C}}(bc)\) (let cr denote “The car is red” and bc denote “We should buy the car”).

figure g

Even if we identify the rejection \(-{\mathcal {R}}(cr):{\mathcal {C}}(bc)\), it is possible that we could identify another argument for buying the car using other criteria such as \({\mathcal {R}}(ec \wedge sp):{\mathcal {C}}(bc)\) where ec denotes “The car is economical” and sp denotes “The car is spacious”.

3 Representing Mined Arguments

Our approach first aims at representing mined arguments. We want to use our language to represent arguments as they arise in natural language texts and dialogues. We believe that central to handling texts and dialogues is the need to provide support for nested arguments and rejections. To illustrate some of the richness of our approach, we give in Table 1 various forms (\(F_1\)-\(F_{12}\)) of arguments and rejections allowed by our definitions (xyzt are propositional formulas to simplify matters). The table is not exhaustive.

In our approach, we are not identifying what constitutes a good argument (or a good rejection of an argument). Rather, we are providing a representation of arguments (and rejections thereof). If an argument or rejection occurs in a text or dialogue, then we want it to be mined, and we want to be able to represent it in our language.

A list of arguments below shows that all the forms \(F_i\) can be used as a target for natural language. It indicates how to use our language, rather than suggesting that there is a canonical translation of text in to the formal target language. As with translating a natural language sentence in to any logic, there is some subjectivity in how a sentence is exactly translated, depending on various factors.

Table 1. Forms of argument
figure i

Examples in \(F_1\)-\(F_{12}\) illustrate that the outer reason and claim can be potentially identified using argument mining techniques, such as based on sentiment analysis techniques (e.g. [14]), text entailment techniques (e.g. [1]), or directly via machine learning techniques (e.g. [6]), and then by recursion, the inner reasons and claims can be identified by argument mining techniques. Thus, the nested structure appears better suited as a target language for arguments as they arise in natural language dialogues and texts.

4 Reasoning Systems

Our approach also aims at reasoning with mined arguments and rejections. We want to find what arguments and rejections follow from them. Note that deriving an argument \(\alpha \) does not mean that \(\alpha \) is accepted. Instead, inferring \(\alpha \) means that the argument(s) used while deriving \(\alpha \) cannot be held without \(\alpha \) also being held. Inference captures commitment between arguments. Hence, if a foolish argument is used as a premise then a foolish \(\alpha \) may result: If an agent holds a foolish argument, he henceforth commits to some other foolish arguments.

Our approach is to treat a set of arguments and rejections as a set of formulae from which we apply a reasoning system. A consequence operator \(\Vdash \) is the least closure of a set of inference rules extended with one meta-rule. Different reasoning systems can be defined by adopting different choices for the set of inference rules. In this paper, we only have space to introduce one reasoning system (in Sect. 5), but we will also indicate how alternative reasoning systems can be identified.

For the sake of simplicity, the rules are introduced using propositional formulas. However, they all hold in the general case. So, the variables xyz can be instantiated with propositional formulae, arguments, and rejections of arguments.

Let us introduce the meta-rule. In keeping with the meaning of \(-{\mathcal {R}}(y):{\mathcal {C}}(x)\), i.e., y is not a reason for x, which is the negation of the meaning of \({\mathcal {R}}(y):{\mathcal {C}}(x)\), i.e., y is a reason for x, the meta-rule expresses that we can reverse any inference rule of the form

$$ \frac{{\mathcal {R}}(y):\Phi }{-{\mathcal {R}}(y):\Psi } \quad \quad \quad \text{ into } \quad \quad \quad \frac{{\mathcal {R}}(y):\Psi }{-{\mathcal {R}}(y):\Phi } $$

Of course, the same reversing process takes place whenever “\(-\)” occurs in front of a leftmost “\({\mathcal {R}}\)” so that, in the general case, an inference ruleFootnote 1 where \(i, j \in \{0,1\}\)

$$ \frac{-^{(i)}{\mathcal {R}}(y):\Phi }{-^{(j)}{\mathcal {R}}(y):\Psi } \quad \quad \text{ can } \text{ be } \text{ reversed } \text{ into } \quad \quad \frac{-^{(1-j)}{\mathcal {R}}(y):\Psi }{-^{(1-i)}{\mathcal {R}}(y):\Phi } $$

Now we turn to introducing the inference rules. As a start, consistency of arguments’ reasons is a source of inferences (where x is a formula in \({\mathbb {L}}\)).

$$ \frac{{\mathcal {R}}(y):{\mathcal {C}}(x)}{-{\mathcal {R}}(y):-{\mathcal {C}}(x)} \qquad \frac{{\mathcal {R}}(y):{\mathcal {C}}(x)}{{\mathcal {R}}(y):-{\mathcal {C}}(\lnot x)} \qquad \mathbf{{(Consistency)}} $$

The leftmost rule means that if y is a reason for x then y is not a reason to doubt x. The rightmost rule means that if y is a reason for x then it is also a reason to doubt \(\lnot x\).

Proposition 1

The inference rules below are derived from (Consistency) and the meta-rule (where x is a formula in \({\mathbb {L}}\) in the first, third and fourth inference rules).

$$ \frac{{\mathcal {R}}(y):{\mathcal {C}}(x)}{-{\mathcal {R}}(y):{\mathcal {C}}(\lnot x)}\quad \!\!\frac{{\mathcal {R}}(y):-{\mathcal {C}}(x)}{-{\mathcal {R}}(y):{\mathcal {C}}(x)}\quad \!\!\frac{{\mathcal {R}}(y):{\mathcal {C}}(\lnot x)}{{\mathcal {R}}(y):-{\mathcal {C}}(x)}\quad \!\!\frac{{\mathcal {R}}(y):{\mathcal {C}}(\lnot x)}{-{\mathcal {R}}(y):{\mathcal {C}}(x)} $$

Any reasoning system for our language is to include the meta-rule and the derived rules. In Sect. 5, we consider specific inference rules for an example of a reasoning system.

5 Indicative Reasoning

Indicative reasoning is an example of a reasoning system for our language. We give for this system some simple inference rules that can be used with the meta-rule presented in Sect. 4. We discuss some inference rules that are not part of the set of rules for this example of a reasoning system and give some properties of the consequence relation.

5.1 Inference Rules

The first inference rule for indicative reasoning captures when reasons are interchangeable. This is when x is the reason for y and vice-versa. Hence the next inference rule.

$$ \frac{{\mathcal {R}}(y):{\mathcal {C}}(x) \quad {\mathcal {R}}(x):{\mathcal {C}}(y) \quad {\mathcal {R}}(y):{\mathcal {C}}(z)}{{\mathcal {R}}(x):{\mathcal {C}}(z)} \qquad \mathbf{{(Mutual\,\,Support)}} $$

As an illustration, let x stand for “Paul and Mary are married to each other” and y for “Paul and Mary are in love with each other”.

The next rule gathers different reasons for the same conclusion within a single argument where y and z are formulas in \({\mathbb {L}}\).

$$\begin{aligned} \frac{{\mathcal {R}}(y):{\mathcal {C}}(x) \qquad {\mathcal {R}}(z):{\mathcal {C}}(x)}{{\mathcal {R}}(y \vee z):{\mathcal {C}}(x)} \qquad \mathbf (Or) \end{aligned}$$

Cautious monotonicity means that the reason of an argument can be expanded with any premise it justifies. Cut expresses a form of minimality of the reason of an argument.

figure j

The two next rules concern nesting of \({\mathcal {R}}(.)\) and \({\mathcal {C}}(.)\). Exportation shows how to simplify meta-arguments (where y and z are formulas in \({\mathbb {L}}\)) and Permutation shows that for some forms of meta-arguments, permutations of reasons are possible.

figure k

We show that the consequence operator \(\Vdash \) defined upon the introduced inference rules is consistent. Indeed, \(-{\mathcal {R}}(y) : {\mathcal {C}}(x)\) cannot be schematically derived from \({\mathcal {R}}(y) : {\mathcal {C}}(x)\) and that \(-{\mathcal {R}}(y) : -{\mathcal {C}}(x)\) cannot be schematically derived from \({\mathcal {R}}(y) : -{\mathcal {C}}(x)\). By the inference rules in Property 1, Property 2 actually expresses that neither \({\mathcal {R}}(y):{\mathcal {C}}(\lnot x)\) nor \({\mathcal {R}}(y):-{\mathcal {C}}(x)\) can be schematically derived from \({\mathcal {R}}(y) : {\mathcal {C}}(x)\).

Proposition 2

There is no \(i,j \in \{0,1\}\) s.t. the following is a derived inference rule.

$$ \frac{-^{(i)}{\mathcal {R}}(y) : -^{(j)}{\mathcal {C}}(x)}{-^{(1-i)}{\mathcal {R}}(y) : -^{(j)}{\mathcal {C}}(x)} $$

5.2 Non-Inference

We turn to inference rules that are excluded from our example of a reasoning system. We explain why they are excluded but some of these could be in an alternative system.

Proposition 3

Neither the (Reflexivity) axiom, i.e., \({\mathcal {R}}(x):{\mathcal {C}}(x)\) for all \(x \in {\mathbb {L}}\), nor the following rules hold in indicative reasoning.

figure l

Reflexivity is omitted because it seems unlikely that in full generality x be a reason for x, e.g., not when the link is causality. Hence Logical Consequence is inhibited as well. Indeed, being a reason for x is far more restrictive than having x as logical consequence. Here is an illustration. By classical logic, taking x to stand for “if I am in London then I am in England” while taking y to be \(\lnot z\) where z stands for “if I am in England then I am in Paris” yields \(\lnot z~\models ~x\). However, \(\lnot z\) falls short of being a reason for x.

We refrain from adopting Right Logical Consequence in indicative reasoning again on the grounds that being a reason for x is in general far more restrictive than having x as a logical consequence. Indeed, the nature of the link plays an important role. E.g., consider the causal argument: flu is a reason for your body temperature to be in the range 39\(^{\circ }\) C\(-\)41\(^{\circ }\) C. The fact that being in the range 36\(^{\circ }\) C\(-\)41\(^{\circ }\) C is a logical consequence of being in the range 39\(^{\circ }\) C\(-\)41\(^{\circ }\) C does not make flu a reason for your body temperature to be in the range 36\(^{\circ }\) C\(-\)41\(^{\circ }\) C (it is the only possible range unless you are dead!).

And cannot be adopted either for indicative reasoning. Consider the case where y, x and z stand respectively for “Paul is standing in the middle of the road while a car is approaching”, “Paul should move forward”, and “Paul should move back”.

Transitivity does not hold either for indicative reasoning. Back to the \(F_6\) example, the fact that Paul’s car is broken does not support the conclusion “Paul is in his office”.

In indicative reasoning, blocking a reason is different from blocking a conclusion:

figure m

Argument 1: The fact that several European countries have a good economy (ge) is a reason for not concluding that an economic crisis ( ec ) in Spain is a reason for a declining value of the euro (de).

Argument 1 has the form \({\mathcal {R}}(ge):-{\mathcal {C}}({\mathcal {R}}(ec):{\mathcal {C}}(de))\) and illustrates case (a) because \({\mathcal {R}}(ge):-{\mathcal {C}}(de)\) needs not hold since an economic crisis in Germany may lead to a declining value of the euro.

Argument 2: The fact that Carl did not attend at all the course ( ac ) is a reason for him failing his exams.

That Argument 2, formally captured as \({\mathcal {R}}(\lnot ac):{\mathcal {C}}(fe)\), is doubted on the grounds that Carl is smart can be written \({\mathcal {R}}(sm):-{\mathcal {C}}({\mathcal {R}}(\lnot ac):{\mathcal {C}}(fe))\). However, the latter argument needs not hold even in the presence of \({\mathcal {R}}(sm):-{\mathcal {C}}(fe)\) (Carl being smart is a reason not to conclude him failing his exams) because \(\lnot ac\) is more specific than sm.

We insist that the indicative reasoning system we present here is only one option for a reasoning system with our approach. By giving this system, we also want to question the appropriateness of some simple inference rules that most notions of logic would adhere to. In particular, we suggest that reasoning with arguments needs not use the same inference rules as non-monotonic reasoning (cf e.g. Kraus et al. [15]). For instance, we may choose to set up a system such as the indicative reasoning system, that fails reflexivity (i.e. we do not accept that statement x is automatically a reason for claim x).

5.3 Consequence Relation

When \(\Vdash \) is the smallest inference relation obeying the rules from Sect. 5.1, reflexivity, monotonicity and cut hold, meaning that with the \(\Vdash \) consequence relation, manipulation of arguments by the inference rules is well-founded, in the logic tradition [16]. Indeed:

Proposition 4

Let \(\varDelta \) be a set of (rejections of) arguments. Let \(\alpha \), and \(\beta \) be arguments.

figure n

Also, the \(\Vdash \) consequence relation is paraconsistent in the sense that it is not trivialized by contradiction (i.e., not all formulae of the language \(L\!_A{}\) follow from contradiction).

Proposition 5

\(\{-^{(i)}{\mathcal {R}}(y):-^{(j)}{\mathcal {C}}(x), -^{(1-i)}{\mathcal {R}}(y):-^{(j)}{\mathcal {C}}(x)\} \not \Vdash L\!_A{}\).

\(\Vdash \) is monotonic but involves a non-monotonic operator in its object language: \({\mathcal {R}}\). “being a reason” is a non-monotonic relation \(\mid \!\sim \) as witnessed by transitivity failing (it can be that y is generally a reason for x although there are special cases where this fails). As an aside, please note that Mutual Support is a special instance of Transitivity. Anyway, the fact that \(\mid \!\sim \) is formalized as an operator in our formalism makes the non-monotonicity confined to failure of inferring \({\mathcal {R}}(y \wedge z) : {\mathcal {C}}(x)\) from \({\mathcal {R}}(y) : {\mathcal {C}}(x)\). There is no effect on the logic. The case is similar to conditional logics (they are monotonic) because an operator capturing a counterfactual conditional must be non-monotonic: e.g., “were I to scratch this match, it would ignite” denoted \(y\!\!\rightarrowtail \!\!x\) may hold while “were I to scratch this match, that is wet, it would ignite” denoted \(y\!\wedge \!z\!\rightarrowtail \!x\) fails to hold.

6 Conclusion

We propose a logic for representing and reasoning about arguments in a way that is just not possible with existing formalisms. We think that the formalism is a promising target language for argument mining and that if we obtain arguments (and rejections of arguments) by argument mining, we can use our inference machinery to analyse them.

Our formalism captures arguments and rejections thereof, with definitions encompassing different roles of reasons (concluding and blocking statements), various forms of reasons (factual and hypothetical) and different kinds of links (deductive, abductive, inductive, \(\ldots \)). Unlike existing computational models of argumentation where attack (support) between arguments is expressed by external relations on the set of arguments, any attack (support) can be expressed as an argument in our formalism.

Many well-known logics were first proposed as proof systems. For these logics, semantics were only obtained later. We have identified an alternative reasoning system (that preserves more of the classical inference rules such as reflexivity) with a three valued semantics. An exciting challenge is to identify a semantics for indicative reasoning. Future research includes exploring the space of reasoning systems, semantics for them, and investigating Boolean combinations of arguments (and rejections thereof).