Keywords

4.1 Introduction

With the continuous decrease in the fertility rate of Taiwanese citizens, the proportion of new-immigrant children among primary school students continues to increase every year. In the 2004–2005 school-year, there were 1,883,533 pupils, and this number dropped to 1,297,120 in the 2013–2014 school year, a decrease of 31.13 %. However, the number of new-immigrant pupils increased from 40,907 in the 2004–2005 academic year to 157,431 in the 2013–2014 academic year, an increase from 2.17  to 12.14 %. The highest number of these children had mothers with Chinese nationality, followed by mothers with Vietnamese and Indonesian nationalities (Ministry of Education 2014). Following school admission, additional attention should be paid to education-related topics such as academic performance, parent–teacher communication, and adjustment to school by children born in transnational marriage families.

The National Academy for Educational Research conducted a large-scale investigation for fourth and sixth-grade students in 2012. The results showed that children from new-immigrant families in the fourth and sixth grades performed significantly worse in mathematics than children from nonimmigrant families. However, the difference in performance seemed to decline with age. In addition, growth in academic performance could vary among children based on the differences in their family socioeconomic status (Wang et al. 2012). Besides the correlation between family socioeconomic status and performance in mathematics, which could change with time, the relationship between psychological variables and academic achievement is a subject of concern in the educational sector. To further understand the effects of the environmental context of new immigrant children and time on their performance in mathematics and to find more convincing evidence, we resampled the 2012 fourth-grade subjects to conduct a long-term longitudinal study on the same population of new-immigrant children in 2014.

To understand the growth trend in new-immigrant children’s performance in mathematics, to investigate the casual pathways of the children’s performance in mathematics, and to analyze the effect of education policies on performance in mathematics, this study developed reliable and valid evaluation tools to accurately measure the growth in the mathematics performance of new immigrant children in fourth to sixth grades.

4.2 Content Standards

We developed a mathematics framework based on the General Guidelines of Grades 1–9 Curriculum for Elementary and Junior High School Education. A mathematics curriculum equips students with an understanding of the basic concepts of figures, shapes, and quantities as well as the ability to calculate and organize and to apply such knowledge and skills in daily life. It also enables comprehending the principles of reasoning and problem solving, the ability to elaborate clearly on mathematics-related concepts, and making appropriate connections among materials and contents between this and other learning areas. The mathematics curriculum for Grades 1–9 is divided into four stages: Stage 1 begins in Grade 1 and ends in Grade 3; Stage 2 begins in Grade 4 and ends in Grade 5; Stage 3 begins in Grade 6 and ends in Grade 7; and Stage 4 begins in Grade 8 and ends in Grade 9 (Ministry of Education 2006).

4.3 Test Item Bank

The mathematics items used in the 2012 large-scale assessment of new-immigrant children were constructed by the TASA, which is the largest nationally representative and continuing assessor of Taiwanese students’ knowledge and skill-sets in five subject areas. Assessments are conducted periodically in mathematics, Mandarin, English, science, and social science. TASA assessments began in 2005. (For a detailed description of the TASA assessment plan, see Table 4.1)

Table 4.1 TASA assessment schedule

TASA classifies mathematics assessment questions into two dimensions: content area and mathematical complexity. Each question is designed to measure one of four mathematics content areas: (a) number and measurement, (b) geometry, (c) statistics and probability, and (d) algebra. Moreover, items are classified according to three types of mathematical abilities: conceptual understanding, procedural knowledge, and problem solving.

The distribution of items among the various mathematical content areas and mathematical complexities reflects the relative proportion of the mathematics curriculum. In 2012, there were 65 selected-response test items and 13 constructed-response test items for Grade 4 as well as Grade 6 students. Table 4.2 lists the distribution of selected-response items, and Table 4.3 shows the distribution of constructed-response test items for Grade 4 in 2012. In addition, Table 4.4 details the distribution of selected-response items, and Table 4.5 displays the distribution of constructed-response test items for Grade 6 in 2012. In order to link the test scale scores between Grade 4 and Grade 6, we embedded 17 common selected-response items into both item pools.

Table 4.2 Distribution of selected-response items for Grade 4 in 2012
Table 4.3 Distribution of constructed-response items for Grade 4 in 2012
Table 4.4 Distribution of selected-response items for Grade 6 in 2012
Table 4.5 Distribution of constructed-response items for Grade 6 in 2012

TASA and our 2012 large-scale assessment of new-immigrant children use a balanced incomplete block (BIB) design to assign blocks or groups of selected-response cognitive items to student booklets. A BIB design satisfies four conditions: every treatment in the booklet design is covered at most only once in a booklet; every treatment in the booklet design appears with equal frequency across all booklets; every booklet has an identical length, containing the same number of clusters; and every pair of treatments in the booklet design occurs together in the booklets with equal frequency (Frey et al. 2009). We assigned 65 selected-response items into a set of 13 booklets, and each booklet comprised four blocks (i.e., every student response to 20 items). The BIB booklet design (Table 4.6) enabled TASA and our 2012 large-scale assessment of new-immigrant children to sample a sufficient number of students to obtain precise results for each test.

Table 4.6 TASA balanced incomplete block booklet design

4.4 Method

Because of budget limitations, we chose only selected-response items from 2012 for the 2014 booklets. We propose three rules for identifying and selecting appropriate items. First, the percentage of anchor items must be 50 %, providing the basis for equating scores on different grades. Second, the Item Response Theory model of TASA is constrained to have a mean ability of zero, and thus, the TASA sample represents the whole population of Taiwanese students with a higher mathematics ability compared with new-immigrant children. Therefore, the Grade 4 items with a difficulty parameter larger than −0.5 and the Grade 6 items with a parameter between −1 and 0.5 are considered high priority. Third, the proportion of each content area should be similar to that in the General Guidelines of Grades 1–9 Curriculum for Mathematics. PARSCALE 4 is used to estimate the item parameters.

4.5 Results

Totally, we selected 78 selected-response items. The distribution of items is listed in Table 4.7. The item distribution across the content areas was 64.10, 19.23, 5.13, and 11.54 %, which is relatively similar to the original distribution of the Grade 4 and Grade 6 items. Following the 2012 BIB design, we compiled all of the items into a set of 13 booklets, and each booklet comprised four blocks (i.e., every new-immigrant student response to 24 selected-response items in 40 min). The mean difficulty in the items on each block ranged from 0.053 to 0.153, and consequently, in each booklet it ranged from 0.083 to 0.128 (Table 4.8).

Table 4.7 Distribution of selected-response items for new-immigrant children in 2014
Table 4.8 The mean difficulty of the items on each booklet

4.6 Conclusion

This study demonstrated that a detailed consideration for the percentage of anchor items, the range of item difficulties, and the distribution of content areas can be useful for constructing a measurement tool in a longitudinal study.