Stimulus equivalence refers to stimulus relations that are untaught but emerge after a set of new relations have been learned in conditional-discrimination training. The stimulus relations qualify as stimulus equivalence if they have the properties of reflexivity, symmetry, and transitivity. The property of reflexivity occurs if a participant without explicit training can match a stimulus onto itself in a select relation task. Furthermore, a positive outcome on a symmetry test occurs if an individual is able to match the stimulus B to the stimulus A after having been taught to match A to B in a conditional-discrimination training. Transitivity occurs when an individual after having been taught to match A to B, and B to C, can match A to C without training (e.g., Sidman & Tailby, 1982).

Three training structures have been used to establish necessary prerequisites to test for emergent relations: linear series (LS), many-to-one (MTO), and one-to-many (OTM; e.g., Saunders et al., 1993). The LS training structure is such that conditional relations are taught from one stimulus to a comparison stimulus, after which the comparison stimulus then serves as sample stimulus to be related to a new comparison stimulus and so on. Thus, depending on the number of stimuli in the set, stimuli can serve as sample and comparison stimuli (i.e., A→B, B→C, C→D). The MTO training structure, also known as comparison-as-node, involves training many sample stimuli to one comparison stimuli (i.e., A→D, B→D, and C→D). With the OTM also referred to as sample-as-node, conditional relations are taught such that one sample stimulus is related to other comparison stimuli (i.e., A→B, A→C, and A→D; e.g., Arntzen & Hansen, 2011; Saunders & Green, 1999).

The use of different training structures has been shown to have different effects on equivalence class formation (see Arntzen, 2012). The LS has been found to produce lower equivalence class formation outcomes because, unlike the OTM and MTO, sample stimuli in LS fluctuates in comparison to previous trials (e.g., Arntzen, 2012; Arntzen & Holth, 1997, 2000; Saunders et al., 2005), therefore, it has been used mostly to examine the variables that influence class formation yields (e.g., Arntzen & Mensah, 2020, Arntzen & Nartey, 2018; Arntzen et al., 2018a; Mensah & Arntzen, 2017; Fields et al., 2012). The present experiment, therefore, employs the LS to investigate the effects of type of stimuli on equivalence class formation.

An equivalence class is made of a specified number of stimuli that are perceptually different from each other but become related to each other through conditional-discrimination training (Fields & Verhave, 1987; Sidman & Tailby, 1982). Several experiments have found that an equivalence class formation is enhanced when meaningful stimuli are used in a set of abstract stimuli relative to only abstract stimuli (e.g., Arntzen, 2004; Arntzen & Lian, 2010; Bentall et al., 1993; Holth & Arntzen, 1998). An abstract stimulus is a stimulus lacking any specific discriminative function (Arntzen et al., 2018b). The types of abstract stimuli that have been used in previous research have included Greek letters, stimuli that are presumed unfamiliar to participant (Sidman & Tailby, 1982), syllables with no meaning in language, symbols from alphabets that the participant has no knowledge of, and unnamable or difficult to name (Nedelcu et al., 2015). A meaningful stimulus, also referred to as a familiar picture or nameable stimulus, is any stimulus that has meaning, can evoke language allowing it to be named efficiently, and has several verbal functions serving as tacts and intraverbals. Thus, meaningful stimuli include nameable pictures, identifiable smell, or familiar words spoken.

A series of experiments have examined equivalence class formation as a function of the inclusion of meaningful stimuli in a set of abstract stimuli in a three 5-member linear series training structure (A→B→C→D→E). In general, the finding in this line of research is that when all stimuli are abstract shapes (the ABS group) a small number of participants form equivalence classes. In contrast, when meaningful pictures are C stimuli and the rest of the stimuli are abstract shapes (the PIC group) the number of participants who respond in accordance with stimulus equivalence is increased substantially (e.g., Arntzen & Mensah, 2020; Arntzen & Nartey, 2018; Arntzen et al., 2014, 2015a; Arntzen et al., 2018a, 2018b; Fields et al., 2012; Mensah & Arntzen, 2017; Nartey et al., 2014, 2015a, 2015b; Nedelcu et al., 2015). For example, in Fields et al. (2012), the results showed that when all the stimuli were abstract, none of 10 adult participants responded in accordance with stimulus equivalence. When the A, B, D, and E stimuli were abstract shapes, and the C stimuli were meaningful pictures, 8 of 10 participants responded in accordance with equivalence.

The nodal structure (the number of nodes and singles) of an equivalence class has been found to be a determinant of equivalence class formation (Fields & Verhave, 1987). An equivalence class, for example, represented by A→B→C→D→E contains three nodal stimuli (B, C, and D) and two singles (A and E). Fields et al. (1997) in an experiment trained necessary conditional discriminations in college students and tested if one-node three-, five-, or seven-member classes or three-node five- or seven-member classes emerged. Fields et al. (1997) found that the number of participants who formed classes was a direct function of the number of nodes in a class. Arntzen and Holth (2000) also found that participants are more likely to form classes when a fewer number of members exist in a class relative to when a larger number of members exist in a class.

Sorting has also been used as a common measure of categorization or concept formation (Arntzen et al., 2017; Arntzen, Norbom et al., 2015; Fields et al., 2014; Fields et al., 2012) but the use of this technique in stimulus equivalence research is still emerging. Findings in respect with sorting measures in equivalence class formation so far have shown a consistent performance in equivalence class formation after conditional discrimination training, and categorization or sorting. The aforementioned studies have shown that participants who showed the formation of equivalence class formation in the MTS-based test for emergent relations, showed the class formation in a postsorting class formation test. Based on these findings, this experiment also seeks to find out whether sorting still provides an alternative measure of equivalence class formation.

At present, the experiments on the effect of meaningful stimuli on class formation have used fewer members (mostly five members with three nodes; e.g., Arntzen & Nartey, 2018; Arntzen, Nartey et al., 2015; Arntzen et al., 2018a, 2018c; Fields et al., 2012; Mensah & Arntzen, 2017). These experiments have found that the inclusion of meaningful stimuli as the middle node in a set of abstract stimuli enhances class formation yields within the range of 70%–80%. Despite this finding, relatively little is known about class formation yields as a function of the inclusion of meaningful stimuli in larger classes. Sorting has also been used as a common measure of categorization or concept formation (Arntzen et al., 2017; Arntzen, Norbom et al., 2015; Fields et al., 2014; Fields et al., 2012) but the use of this technique in stimulus equivalence research is still emerging and inconclusive. Whereas findings from Mensah and Arntzen (2017) and Fields et al. (2012) show that participants who form equivalence classes in an MTS-based test for emergent relations also formed classes in a sorting test after MTS-based training and testing for emergent relations, findings from Arntzen et al. (2021) and Arntzen et al. (2017) show a discordance between performance on the MTS-based test for emergent relations and performance on the sorting test after MTS-based training and testing for emergent relations. The present experiment therefore extends the existing literature on meaningful stimuli as a function of equivalence class formation in larger classes. Thus, this study investigated (1) the extent to which equivalence class formation as a function of meaningful stimuli is modulated by larger to-be formed classes with higher number of nodes, (2) the extent to which increasing the number of meaningful stimuli in a set of abstract stimuli enhances equivalence class formation by using single-subject manipulations, and (3) the concordance between MTS performance on emergent relations tests and sorting tests.

Method

Participants

Twenty university students (12 females and 8 males) voluntarily participated in this experiment. The participants were aged 17 to 27 years (M = 21.4, SD = 2.23). When participants arrived for the experimental session, they were given an informed consent form. The participants were asked to read the informed consent form and consent to participate in the experiment. The form included general information about the experimental situation and the duration of the experiment (about 100 min). Participants were also informed that they could quit at any time without any negative consequences. All participants were fully debriefed after they had finished the experimental session and paid 50 Ghana Cedis (an equivalent of US$11) for participating in the experiment.

Apparatus and Setting

The venue for this experiment was one of the lab rooms at the University of Ghana, Legon. The lab room was approximately 7 m square and furnished with tables and chairs. This experiment used HP Compaq nc6320 computer laptop running Windows 7. The computer had a 43.18 cm screen. A computer mouse was used to click on the stimuli displayed on the screen. Conditional-discrimination training and testing sessions were all conducted with a customized matching-to-sample (MTS) software program. This software program controlled the presentation of stimuli and recorded the responses in the form of the number of training trials, the first trial number, correct or incorrect response choices, and the programmed consequences. In addition, the software program recorded a summary of all training and testing trials; baseline, symmetry, one-node, two-node, three-node, four-node, and five-node trials.

Stimuli

Abstract and meaningful pictorial stimuli were used in the conditional-discrimination training and testing. A sample of the stimulus sets used in this experiment are displayed in Figure 1 and were selected from a plethora of abstract and meaningful stimuli that have been used in previous experiments (e.g., Arntzen & Mensah, 2020). The abstract stimuli used in the present experiment included symbols from alphabets that participants were unlikely to know or be able to name. All stimuli used in this experiment were randomly selected and displayed on a white background on the screen of the computer. The size of each stimulus displayed on the screen of the computer was 9.4 cm x 3.4 cm.

Fig. 1
figure 1

An Example of the Stimulus Set Used as Members of the To-Be-Formed Equivalence Classes

Design

The experiment used a combination of between-group and within-group experimental design. Ten participants each were assigned to the ABS and PIC groups. Five participants who failed to form classes in PIC repeated the experiment in DA-as-PIC, and two participants who failed to form classes in DA-as-PIC repeated the experiment in DAG-as-PIC. The between-group experimental design was used to examine the difference in performances for the two main experimental groups: the ABS Group and the PIC Group. The within-group design was used to examine whether the inclusion of more pictures to replace abstract stimuli will facilitate equivalence class formation for participants who failed to form classes after conditional-discrimination training and testing in the PIC group. The participants in the PIC group who failed to form classes were exposed to a new experimental condition (DA-as-PIC) that had a new set of stimuli where the D and A stimuli were meaningful stimuli, with the B, C, E, F, and G stimuli being abstract stimuli. The participants who failed to respond in accordance with stimulus equivalence in the DA-as-PIC condition were exposed to a new experimental condition (DAG-as-PIC) that had a new set of stimuli where the D, A, and G stimuli were meaningful stimuli, with the B, C, E, and F stimuli being abstract stimuli.

Procedure

Preexperimental Sorting Test

Participants who agreed to participate in the experiment were given 21 plastic-laminated cards that corresponded to each of the “to-be-displayed stimuli or image” in the conditional-discrimination training of the group to which they were assigned (preexperimental sorting). On receiving the laminated cards, participants were instructed to “put the cards into groups.” Participants’ performance on this task were recorded by the experimenter. The preexperimental class formation sorting test served as a means of ensuring that participants did not have any class formation experience before the experiment.

Instruction

Once preexperimental sorting was completed, participants were instructed to click on the computer screen using the computer mouse. The computer screen displayed the following instructions:

In a moment, a stimulus will appear in the middle of the screen. Click on this by using the computer mouse. Three stimuli will then appear in the three corners of the screen. Choose one of them by clicking on it with the mouse. If you choose the stimulus we have defined as correct, words like “very good,” “excellent,” and so on will appear on the screen. If you press a wrong stimulus, the word “wrong” will appear on the screen. At the bottom of the screen, the number of correct responses you have made will be counted. During some stages of the experiment, the computer will NOT tell you if your choices are correct or wrong. However, based on what you have learned so far, you can get all of the tasks correct. Please do your best to get everything right. Thank you and good luck!

No further instructions were given before and after the experiment started. Participants were not informed on how fast or slow the selection clicks throughout the experimental session should be.

Trial Structure and Contingencies

Regardless of the experimental group or condition, participants were trained 18 conditional discriminations arranged in a linear series structure using the simultaneous protocol after the formation of three 7-member equivalence classes were tested. Fields et al. (1997) note that the simultaneous protocol involves presentation of all training trials in a randomized order in training blocks followed by a test block including all possible emergent relations in a randomized order. Training trials started with the presentation of a sample stimulus in the middle of the screen of a computer. Participants responded to the stimulus by clicking on the stimulus with a computer mouse. The click on the stimulus is followed by the display of three comparison stimuli presented at three of the four corners of the computer screen along with the sample stimulus still at the center of the computer screen. Throughout the experiment, the location of the comparison stimuli was random. A click on one of the comparison stimuli was followed by the presentation of words such as correct, very good, super, or excellent on the screen of the computer if the selected comparison stimulus was correct. However, if the selected comparison stimulus was incorrect, the word wrong was presented on the screen of the computer. Programmed consequences presented after the selection of the comparison stimulus in the training trials were displayed at the center of the computer screen for 1,000 ms. An intertrial interval of 500 ms followed the termination of program consequences. The mouse cursor was always reset to the center of the computer screen between trials.

Acquisition of Baseline Relations (Phases 1–11)

Acquisition of baseline relations in the present experiment for all the experimental groups and conditions was trained in 11 phases with a presentation of programmed consequences after the selection of a comparison in each trial (see Table 1 for an overview of each of the experimental phases). A mastery criterion of 90% correct responding was required in each phase in order to progress to the next phase. Phase 1 of the baseline relation training was for the training of AB relations in a block containing nine trials with three trials of each relation (A1B1, A2B2, and A3B3). The participant had to respond correctly on all the nine trials to achieve the mastery criterion. Baseline training relations requirements in Phase 2 were the same as Phase 1 except that this phase trained BC relations (B1C1, B2C2, and B3C3). Phase 3 consisted of an 18-trial block with mixing of AB and BC relations. The fourth phase was the same as the first phase except that this phase trained CD relations (C1D1, C2D2, and C3D3). Phase 5 consisted of a 36-trial block with mixing of AB, BC, and CD relations. Phase 6 of baseline relations training was the same as Phase 1 except that this phase trained DE relations (D1E1, D2E2, and D3E3). Phase 7 consisted of a 45-trial block with mixing of AB, BC, CD, and DE relations. Phase 8 of baseline relations training trials was the same as Phase 1 except that this phase trained EF relations (E1F1, E2F2, and E3F3). Phase 9 consisted of a 54-trial block with mixing of AB, BC, CD, DE, and EF relations. Phase 10 of baseline training was the same as Phase 1 except that this phase trained FG relations (F1G1, F2G2, and F3G3). Phase 11 consisted of a 63-trial block with mixing of all the trained relations in the first 10 phases (AB, BC, CD, DE, EF, and FG). This last phase was followed by a block with trials to ensure that all baseline relations trials were presented in an equal number of times.

Table 1 Sequence of Training and Testing

Maintenance of baseline trials

Training blocks for participants continued after the acquisition of the baseline with a reduction in the probability of programmed consequences (Phases 12–15 in Table 1). The percentage of programmed consequences trials in training blocks after the acquisition phases were systematically reduced to 75%, 50%, 25%, and then 0%. The mastery criterion was 90 % correct choices in a block of 54 training trials.

Emergent relations test block

Once a participant responded to the criterion in the last baseline maintenance phase, a test block for emergent relations was administered. The test block consisted of 378 trials made up of 54 baseline trials, 54 symmetry trials, 90 one-node trials, 72 two-node trials, 54 three-node trials, 36 four-node trials, and 18 five-node trials. All the 378 trials in this block were presented randomly and without programmed consequences (see Table 1). Furthermore, each test relation in the test block was presented three times (see Table 1). For a participant to respond in accordance with stimulus equivalence, the participant had to select at least 90% correct choices for each type of the test trials (i.e., symmetry, one-node, two-node, three-node, four-node, and five-node).

Postexperimental Sorting Test

Upon completion of the MTS-based test for emergent relations, participants were again given the 21 plastic-laminated cards that corresponded to each of the “displayed stimuli or image” in their designated conditional-discrimination training and testing experimental condition or the group to which they were assigned to. On receiving the laminated cards, participants were instructed to “put the cards into groups.” Participants’ performance on this task were recorded by the experimenter. The performance of participants on this task were also recorded. This postexperimental sorting test served as an additional measure of stimulus class formation following the MTS-based training and testing.

Results

Preexperimental Sorting

ABS and PIC Groups

No participant sorted the cards in accordance with the experimenter-defined classes prior to the conditional-discrimination training and testing regardless of the experimental group (see Figure 2).

D-as-PIC and DAG-as-PIC Conditions

Regardless of experimental conditions, none of the participants sorted the cards in the preexperimental sorting test in accordance with the experimenter-defined classes.

Acquisition of Baseline Relations

ABS and PIC Groups

A visual inspection of Figure 3 shows that participants required an average number of trials of 913.5 and 786.2 to acquire baseline for the ABS and PIC groups, respectively. However, a t-test showed no significant difference in the number of trials required to acquire baseline between the ABS and the PIC groups, t(1, 18) = 1.3, p =.21. The six participants who formed classes regardless of the experimental group required a mean of 784.5 trials to acquire the baseline relations relative to 877.86 mean trials for the 14 participants who did not form classes. A t-test showed no significant difference in the number of trials needed for baseline acquisition between participants who formed classes and participants who did not form classes, t(1, 18) = -.85, p =.41.

DA-as-PIC and DAG-as-PIC Conditions

Participants in the DA-as-PIC and the DAG-as-PIC conditions required an average of 603 and 666 trials, respectively, to acquire baseline relations (see Figure 4).

Equivalence Class Formation

ABS and PIC Groups

As shown in Panel 1 of Figure 2, 1 of 10 participants in the ABS group responded in accordance with equivalence class formation, whereas 5 of 10 participants (see Panel 2 of Figure 2) in the PIC group formed equivalence classes. A Fisher’s Exact Test showed that class formation yields for PIC relative to ABS was nonsignificant, p = .07, 95% CI [.002, 1.56].

Fig. 2
figure 2

Preexperimental Sorting, Equivalence Class Formation, and Postexperimental Sorting Tests Outcomes for Participants

Fig. 3
figure 3

Mean Number of Trials Needed to Acquire Baseline Relations in the Experimental Groups (ABS and PIC). Note. ABS = All stimuli are abstract, PIC = D stimuli are meaningful pictures in the stimulus set and the rest abstract

Fig. 4
figure 4

Mean Number of Trials Needed to Acquire Baseline Relations in the Repeated Experimental Conditions (DA-as-PIC and DAG-as-PIC). Note. DA-as-PIC = D and A stimuli are pictures in the stimulus set, DAG-as-PIC = D, A, and G stimuli are meaningful pictures in the stimulus set

Fig. 5
figure 5

Class-Consistent Responses of Participants in Relation to Number of Nodes

Fig. 6
figure 6

Class-Consistent Responses of Participants in Relation to Number of Nodes

DA-as-PIC and DAG-as-PIC Conditions

Panels 4 and 5 of Figure 2 show class formation performance for the DA-as-PIC and DAG-as-PIC conditions. Three of the five participants who did not form classes in the PIC group formed equivalence classes in the DA-as-PIC condition (see Panel 4). Furthermore, all the participants in the DAG-as-PIC condition formed equivalence classes (see Panel 5).

Postexperimental Sorting

ABS and PIC Groups

The one participant who formed equivalence classes in ABS also sorted the stimuli in accordance with the experimenter-defined classes in the postexperimental sorting test (see Panel 1 of Figure 2). Furthermore, the five participants who formed equivalence classes in PIC also sorted the stimuli in accordance with the experimenter-defined classes in the postexperimental sorting test (see Panel 2 of Figure 2). Two participants in PIC who failed to sort the stimuli in accordance with the experimenter-defined classes and also failed the MTS-based test for emergent relations, sorted the stimuli in accordance with the experimenter-defined classes in the postexperimental sorting test (see Panel 3 of Figure 2).

D-as-PIC and DAG-as-PIC Conditions

The three participants who formed equivalence classes in DA-as-PIC also sorted the stimuli in accordance with the experimenter-defined classes in the postexperimental sorting test (see Panel 4 of Figure 2). Furthermore, the two participants in DAG-as-PIC also sorted the stimuli in accordance with the experimenter-defined classes in the postexperimental sorting test (see Panel 5 of Figure 2).

Number of Nodes

Figures 5 and 6 show class formation yields in relation to the number of nodes across the test relations. The results show that except for the DAG-as-PIC condition (see Figure 6), the performance of participants was an inverse function of the number of nodes. Thus, participants responded more correctly on fewer number of nodes and more incorrectly as the number of nodes increased. Furthermore, a t-test showed significant and better performances for the PIC group relative to the ABS group across the different number of nodes: one-node, t(1,18) = 3.3, p =.00, two-node, t(1,18) = 3.0, p =.01, three-node, t(1,18) = 2.62, p =.02, four-node, t(1,18) = 1.96, p = 04, and five-node, t(1,18) = 2.23, p = 04. This finding suggests that the inclusion of meaningful pictures in a set of abstract stimuli decrease the effect of the number of nodes on test outcomes.

Different Types of Response Patterns

Three response patterns occur in the response patterns of participants who failed to form classes in the MTS test for emergent relations. These response patterns are experimenter-defined classes, participant-defined classes, and indeterminate (see Mensah & Arntzen, 2017). A participant responds in accordance with experimenter-defined classes if test trials responses are consistent with experimenter-defined relations. Participant-defined classes response include participants selecting consistently same sample-comparison pairs every time during MTS-based testing that are inconsistent with experimenter-defined classes or sample-comparison pairs (e.g., selecting B2 instead of B1 in the presence A1 on all three test relations trials). Indeterminate responding occurs if a participant selects different and incorrect comparison stimulus for every sample stimulus in test relation trials.

The results from this analysis showed that 46.24% and 58.99% of probes trials produced responses in accordance with experimenter-defined classes for the ABS and the PIC groups, respectively. This result suggests that the inclusion of meaningful stimuli in the PIC group facilitated more correct test trial responses relative to the ABS group. Also, 8.38% and 15.08% of probes trials produced responses in accordance with participant-defined classes for the ABS and PIC groups, respectively. Lastly, 45.38% and 25.93% of probes trials produced indeterminate responses for the ABS and PIC groups, respectively. Furthermore, the results showed that 63.89%, 10.71%, and 25.4% of probe trials produced responses in accordance with experimenter-defined classes, participant-defined class, and indeterminate responses for the DA-as-PIC condition. These results suggest that responding in accordance with experimenter-defined classes, which consequently led to equivalence class formation, is influenced by the inclusion of meaningful stimuli in a set of abstract stimuli.

Discussion

The present experiment investigated the differential effects of meaningful stimuli on the formation of three 7-member classes. The findings of the experiment found that 1 of 10 participants and 5 of 10 participants formed classes in the ABS group and PIC group, respectively. Furthermore, the results showed that three of the five participants who did not form classes in the PIC group and participated in the DA-as-PIC condition formed classes. Two of the five participants who did not form classes in the DA-as-PIC condition and participated in the DAG-as-PIC condition formed classes.

Equivalence Class Formation Enhancement and Meaningful Stimuli

The findings from this experiment demonstrate that class formation yields are a function of the inclusion of a meaningful stimulus in a set of abstract stimuli. This finding is consistent and extends some earlier findings with smaller classes such as the three 5-member classes experiments (Arntzen & Nartey, 2018; Arntzen et al., 2014; Arntzen, Nartey et al., 2015; Arntzen et al., 2018a, 2018c; Mensah & Arntzen, 2017; Nedelcu et al., 2015).

Meaningful stimuli have been found to serve multiple behavioral functions as a result of their acquisition of stimulus control functions (discriminative and conditional) influences equivalence class formation (Fields et al., 2012; Nartey et al., 2014). Fields et al. (2012) and Travis et al. (2014) argue that the enhancement of class formation by a meaningful stimulus is most likely due to the meaningful stimuli being a member of an already established category or equivalence class before experimental training and testing. The authors suggest that the enhanced likelihood of equivalence class formation in a set of meaningless stimuli with one meaningful stimulus reflects the expansion of an already existing class of which the meaningful stimulus is a member. Thus, the meaningless stimuli come to be added to an already existing stimulus class containing the meaningful stimulus.

Despite the findings that the simultaneous protocol produces lower yields of class formation (e.g., Buffington et al., 1997, Arntzen et al., 2010; Arntzen & Hansen, 2011), the reduction in class formation yields from 50% for PIC to 10% for ABS cannot be attributed to the simultaneous matching-to-sample training procedure because participants in both PIC and ABS were exposed to the same training procedure. It is therefore plausible to conclude that the low-class formation yields in ABS is a function of the absence of at least a meaningful stimulus in the to-be-formed classes. Furthermore, it is also plausible to assume that the low yields in ABS may be a function of the functional disparity of the abstract stimuli leading to several overlapping responses that may have conflicted the experimentally defined classes (Sidman, 1994). Participants’ performances in DA-as-PIC condition and the DAG-as-PIC condition can be attributed to reinforcement contingencies in the experiment, or experience resulting from repetition of training and testing procedures, or both. In terms of reinforcement contingencies, the present findings show that equivalence class formation is a function of the number or increasing number of meaningful stimuli in a set of abstract stimuli. This finding is consistent and replicates the findings of Arntzen and Mensah (2020) and Mensah and Arntzen (2017). As an alternative, the present findings can be attributed to order effects or retraining and retesting, and as such are consistent with findings from Buffington et al. (1997), Nartey et al. (2015a), and Wulfert et al. (1991). Buffington et al. (1997) found in their study that pretraining of equivalence classes influenced the rate baseline acquisitions and equivalence class enhancement. Thus, retraining and retesting do enhance the emergence of relational control and consequently equivalence class formation. As stated earlier, this finding can also be a function of the combined effect of reinforcement contingencies and order effects. It is, therefore, imperative that future experiments control for order effects to ascertain the class-enhancing effect of increasing numbers of meaningful stimuli in three 7-member equivalence classes.

Stimulus Control Topography and Failed Class Formation Response Pattern

In general, test probes during emergent relations testing for participants who fail to form classes lead to responses in accordance with experimenter-defined classes, participant-defined classes or indeterminate responses. Responses that are in accordance with the experimenter-defined classes can be accounted for by the stimulus control procedure in the experiment, which produces controlling relations that are consistent with those intended by the experimenter whereas participant-defined classes, or indeterminate responses are as a result of the stimulus control procedure producing controlling relations that are not in accord with those intended by the experimenter (McIlvane & Dube, 1992). Furthermore, the results also showed that the extent of responding in accordance with experimenter-defined classes was a function of the inclusion of meaningful stimuli in to-be-formed classes. This finding is consistent with those reported by Mensah and Arntzen (2017) and Arntzen, Nartey et al. (2015a) and imply that the inclusion of pictures results in the stimulus control procedure producing controlling relations that are consistent with those intended by the experimenter. It is therefore plausible to suggest that stimulus control topographies coherence occasioning experimenter-defined classes responding is enhanced by the inclusion of familiar pictures in a set of abstract stimuli.

The present finding suggests that nonclass-indicative stimulus-control topographies can influence performances on failed equivalence class formation, baseline relations acquisition, and delayed equivalence class formation (Fields, Arntzen, & Doran, 2020). Juxtaposing these findings with a recent finding from Fields and Paone (2020), which showed that despite varied error percentages of baseline relations during concurrent training (highest), serial training (lower), and constructed response training (lowest), class formation yields was about the same across the three training conditions, there is a need for further studies to shift attention to yet unknown variables that occasion equivalence class formation during testing relative to training. It is important that future studies examine the predictive power of stimulus control topography coherence theory in equivalence class formation.

Nodal Structure and Emergent Relations

The class-formation yield of 50% in the present experiment (PIC group) is a drop relative to class formation outcomes of 70%–80% in experiments that have explored meaningful stimuli as a determinant of equivalence class formation in the emergence of three 5-member equivalence classes (e.g., Arntzen & Mensah, 2020; Fields et al., 2012; Mensah & Arntzen, 2017). Experiments that have trained conditional discriminations and tested for three 5-member equivalence classes using the LS training structure have had three nodes in their nodal structure, whereas the nodal structure of the present experiment consisted of five nodes. This finding is consistent with Arntzen and Holth’s (2000) findings, which indicated that equivalence class formation probability was significantly reduced as a result of an increase in the class size. Fields et al. (1997) suggest that the number of nodes in an equivalence class is a determinant of class formation outcomes, with increasing numbers hindering class formation and vice versa. It is therefore plausible to suggest that the drop in class formation yield in the present experiment is a result of the increased number of nodes in the equivalence class, which decreases the positive effect of the inclusion of meaningful stimuli in equivalence class formation.

The present finding shows an inverse relationship between the number of nodes separating stimuli and participants’ performances in terms of correct responding. This finding is consistent with the findings of Kennedy (1991), Buffington et al. (1997), Fields et al. (1997), Fields et al. (2012), and Arntzen and Holth (2000). In general, stimulus relations test performances have been found to be linked to the number of nodes that separates the stimuli (Fields et al., 1993). Therefore, class-consistent comparisons selection in derived relations probes is a function of number of nodes or nodal proximity (Fields et al., 1993). The present finding suggests that the relations among stimuli in different conditional discriminations are a result of the training cluster, which is a function of nodal proximity (Fields & Verhave, 1987) and develops separately from the formation of equivalence class (Fields et al., 1993).

Sorting

The present experiment replicates previous findings on the sorting tests and equivalence class formation (e.g., Arntzen, Granmo, & Fields, 2017; Arntzen, Norbom et al., 2015b; Fields et al., 2014; Fields et al., 2012; Mensah & Arntzen, 2017). The results show that all participants who responded in accordance with stimulus equivalence in the MTS-based emergent relations test, sorted the cards in accordance with the three experimenter-defined classes in the postclass formation sorting test for the ABS and PIC groups, as well as in the DA-as-PIC and the DAG-as-PIC conditions. These results suggest that although the postexperimental sorting stimulus classes are not stimulus equivalence classes, performance on the postexperimental sorting test after conditional-discrimination training can serve as a predictor for performance on the MTS-based test for emergent relations. Furthermore, two participants who did not respond in accordance with stimulus equivalence in the MTS-based emergent relations test in the PIC group sorted the cards in accordance with the three experimenter-defined classes in the postclass formation sorting test. This finding suggests that the MTS test may have provided a false negative result or sorting provided a false positive result, or the sorting test may have provided a more sensitive measure of class formation relative to the MTS test for emergent relations (Fields et al., 2014).

Limitations and Further Experiments

There are some limitations to the present experiment. First, the type of stimuli used may have accounted for the present results because the abstractness and meaningfulness of stimuli are still inconclusive. Thus, despite the preexperimental sorting to test for stimuli familiarity, it is plausible to assume that some of the abstract stimuli may have evoked naming during conditional-discrimination training and testing. It is imperative that future studies test for the abstractness and meaningfulness of stimuli before using them in equivalence class formation experiments. Future studies can also adopt the tailoring of stimuli technique (see Arntzen & Eilertsen, 2020), which uses preexperimental stimuli sorting classes of stimuli to be used in the conditional-discrimination training. Furthermore, the lack of control for experience or repeated training and testing in DA-as-PIC and DAG-as-PIC conditions limits the findings. It is plausible to suggest that class formation yields in the DA-as-PIC and the DAG-as-PIC conditions are not directly a result of the inclusion of additional pictures but rather the repetition of training and testing procedures, or both. In order to clarify these results, a future experiment should control for these effects by exposing participants who failed to form classes in the PIC group to extended cycles of training and testing without including more meaningful stimuli.

Another limitation is the lack of control for repeated training and testing for the ABS group. For example, a future experiment can examine whether additional cycles of training and testing with different stimulus sets of abstract shapes will increase the yields of equivalence class formation. In addition, a group experiment should be conducted in the future to allow for a comparative analysis of both results. Lastly, because the results show a negative effect of the number of nodes on the class-enhancing effect of a meaningful stimulus, it is suggested that future experiments adopt the MTO or OTM training structures because the number of nodes does not increase as a function of the number of members in the to-be-formed classes.

Conclusion

The present experiment examined the class-enhancing effect of meaningful stimuli in a set of abstract stimuli and found that the inclusion of a meaningful stimulus enhanced the formation of an equivalence class. The findings from the present experiment extend the literature by showing the class-enhancing effect of meaningful stimuli in large equivalence class formation. Thus, whereas yields of class formation is about 80% in PIC groups for three 5-member classes, yields of class formation were 50% in the present experiment indicating a reduction in the class-enhancing effect of meaningful stimuli as the number of members in a to-be-formed class increases from five to seven. This drop can be explained as a function of the number of nodes. With this finding, it is imperative that in applied settings, optimal results for the meaningful stimulus can be obtained by the adoption of other training structures such as the MTO or OTM in conditional-discrimination training with larger classes because these training structures limit the effect of the number of nodes in tests for emergent relations.