Epidemiological data identify suicide and alcohol use disorder (AUD) as primary determinants of health disparities among Alaska Native people, in contrast to the US and the Alaska general population. Alaska Native youth suicide constitutes a public health crisis; while suicide is 11th leading cause of death in the USA, it is fourth leading cause of death for Alaska Native people, and tragically, the leading cause of death for 15–24-year olds (Allen et al. 2011). Despite significant gaps, existing research suggests a similar unsettling trend of higher AUD among Alaska Native people, particularly in rural settings. Alaska Native people experience the highest rates of AUD in Alaska and die from alcohol-induced health conditions at four times the Alaska general population rate (Hull-Jilly and Casto 2013). Binge drinking is of high prevalence; in one Alaska Native rural community sample, 61% of men and 37% of women screened engaged in binge drinking in the past year, defined as five or more drinks on one occasion (Seale et al. 2006). Though the relationship of alcohol to suicide is complex, existing epidemiological data document high rates of co-occurrence of American Indian/Alaska Native suicide and AUD, with youth at substantial risk (Barlow et al. 2012).

Though several of the existing reviews are dated, reviews of the indigenous youth suicide (Middlebrook et al. 2001; Clifford et al. 2013) and substance use (Hawkins et al. 2004; Whitbeck et al. 2012) iterature all converge on two recommendations: (a) more culture-informed, strengths-focused interventions are needed, and (b) more rigorous designs are necessary to establish intervention efficacy. These reviews also highlight the crucial importance of cultural relevance and ongoing community involvement.

While scope and magnitude of the problem cannot be understated, there exist equally compelling Indigenous cultural strengths that are sources of resilience. Across rural Alaska, Alaska Native people maintain an aboriginal subsistence way of life as hunters and gatherers, while simultaneously engaged in a global economy. The largest Alaska Native group is the Yup’ik, representative of the highest ratio of fluent Indigenous language speakers of any Native group in the USA (Fienup-Riordan 2000). Embedded in the persistence of these and other Yup’ik practices are foundations for health and well-being, providing materials for “culture as prevention,” innovative prevention strategies that vitalize key protective elements of culture.

Three conclusions can be drawn from this brief review. First, existing data documents an enormous health disparity. Alaska Native people constitute an at-risk population in an at-risk state, with rural Alaska Native males experiencing greatest risk (Allen et al. 2011). Acute need exists for design, implementation, and testing of effective prevention strategies. Second, this same data suggests that suicide and AUD often co-occur in this population, and interventions are needed to address co-occurring risk. Third, robust cultural practices present opportunity to devise innovative cultural interventions (Ayunerak et al. 2014).

Whitbeck et al. (2012) note two worlds of prevention in tribal communities: One world consists of practices grounded in local understandings, while a second involves clinical trials that “continue to work from a Western colonial paradigm that ignores, diminishes, and reinterprets Native ways of knowing” (p. 433). Using a community-based participatory (CBPR) framework, Yup’ik communities over the past 25 years have guided a melding of these two worlds, by generating Qungasvik “Toolbox,” a cultural intervention based in a local, Indigenous theory of personal and community change, then collaborating with university researchers to describe it using the methods of western science (Allen et al. 2014a). In contrast to most American Indian and Alaska Native preventive interventions that are problem-focused and individual-level, this intervention is strengths-based and multi-level. Mohatt et al. (2014) demonstrated that the intervention produced dose-related growth on intermediate and ultimate outcome variables protective from suicide and AUD.

This report describes a test of efficacy of the Qungasvik intervention, comparing impact of high versus low intervention intensity. The study is part of a larger ongoing multisite, staggered baseline, dynamic wait-listed design (DWLD; Wyman et al. 2015) prevention trial. Our CBPR efforts with community partners include ongoing analysis and community dissemination of interim results. These efforts identified a naturally occurring comparison of effectiveness by dose; two communities in the early DWLD roll out provided a contrast of impact of higher to lower number of intervention modules at a similar time point. We present these results as a test of the Qungasvik intervention for intended intervention effects.

Method

Setting

Yup’ik communities in southwest Alaska are hundreds of miles off the road system, and accessible only by boat, small plane, or snowmobile. Ethnicity is over 90% Yup’ik. In contrast to the reservation system in the lower 48 states, most of these remote communities are federally recognized tribal entities. Residents are shareholders in the Alaska Native Regional Corporation, and the regional nonprofit health corporation provides universal medical care. Elders speak Yup’ik as a first language, while youth are English first language, and may or may not speak Yup’ik. A subsistence economy is augmented by a limited number of tribal, state, and federal jobs, primarily in government, health care, and schools.

Given high transportation costs and a precipitous decline in the salmon fishery—a long-standing income source—cost of consumer goods places heavy economic burden; these communities are among the 10 lowest per capita income counties in the USA. Food is heavily dependent on local fish, birds, and land and marine mammals. In response to the devastating impact of alcohol, as is the case for most of rural Alaska, the two communities that are the focus of the current study used the Alaska local option law at the time of this study to declare themselves dry—making importation and possession of alcohol illegal.

Participants

Treatment Arm Community (Community 1)

Sixty-one youth were recruited to participate in the intervention from approximately 100 12–17-year olds residing in Community 1. Sixty youth completed Wave 1 assessments, 46 completed Wave 2, 43 completed Wave 3, and 61-completed Wave 4; 37 youth completed all four waves, 8 completed three assessments, and 10 completed two assessments. Figure 1 provides a CONSORT flow diagram. As small sample analyses are particularly susceptible to influence from outlying observations, we identified multivariate outliers using hierarchical cluster analysis (Henry et al. 2005), a statistical method that detects homogenous clusters of cases by iterative groupings based on distance computation. One youth distant from others, along with five youth who completed only the final assessment, was dropped from the analysis, resulting in 54 participants. Mean age at Wave 1 was 14.2 years (SD = 1.72). Gender distribution was 31 females and 23 males, and there was no significant age difference between males and females (t(52) = .73, ns).

Fig. 1
figure 1

CONSORT flow diagram

Comparison Arm Community (Community 2)

Seventy-seven youth were recruited from the approximately 150 12–17-year olds residing in Community 2, all of whom completed Wave 1. Seventy-five youth completed Wave 2, 60 completed Wave 3, and 48 completed Wave 4, resulting in 34 youth that completed all four waves of assessment, 7 that completed three assessments, and 34 that completed two assessments. Using hierarchical cluster analysis, we identified one youth distant from others, resulting in 74 participants. Mean age at Wave 1 was 14.6 years (SD = 1.82), and gender distribution was 20 females and 54 males, with no significant age difference between males and females (t(72) = −1.99, ns). Demographic data by community are presented in Table 1. Samples differed in gender composition with more males in the comparison community, and while mean age was similar, the distribution in Community 2 included fewer grade 7, and more grade 11 and 12 participants. All youth reported Yup’ik ethnicity.

Table 1 Youth demographic characteristics

Qungasvik Intervention Procedures

The Qungasvik intervention implements modules that constitute episodes of Yup’ik cultural engagement. Modules are individual, family, or community level, and delivered in one or more 1–3 h sessions. Each module promotes two to four of 13 protective factors identified in a culture-specific model of protection.

The Qungasvik intervention manual is not prescriptive. Instead, it provides outlines for 26 modules, along with a process for community adaptation to local customs and circumstances, the current season, and advice of community members. Our co-researchers observed the adaptive process results in greater community ownership and intervention that is more ecologically valid to distinctive characteristics of each remote community in the region. The intervention reflects assumptions that components, or form, can vary in important ways to respond to local context. The core elements of the intervention process and delivery of protective factors are what is replicated (Henry et al. 2012). In defining systematic intervention integrity beyond repetition of component activities, the Qungasvik approach facilitates understanding of complex interventions through their interactive functions (Trickett et al. 2011). Fidelity involves adherence to functions rather than fixed components; in Qungasvik, function is defined through (a) delivery of the protective factors assigned in each module and (b) use of the Qasgiq implementation model (Rasmus et al. 2014).

Qasgiq is both name of the historical communal men’s house in each aboriginal Yup’ik community and a cultural model of social organization. Intervention fidelity includes adherence to these Yup’ik cultural protocols that guide a process of (a) always coming together as a group to plan important activities, (b) identifying those with expertise to carry out the activity, and then after the activity, (c) de-briefing on where the activity succeeded in its goals and what has been learned. In the implementation model, different cultural experts are nominated to contribute to planning and delivery of different modules. In particular, individuals recognized as “Elders” for their cultural knowledge and leadership are used extensively.

In one example of a Qungasvik module, coastal communities teach youth cultural values and provide a protective factor experience through the Maliqianeq (seal hunt) module. The module provides training in safety, team work, hunting skills, and elements of Yup’ik worldview. Youth may also learn values through the concepts of Mercecineq and Allaniuneq, which describe how the seal decides to give up life as a gift to the hunter, and how hunters in response are to do such things as give water to the seal, and to place the seal head pointing toward the river when cutting it up, to ensure its spirit safe journey home. Protective factors promoted are Ellangneq, “to become aware,” and communal mastery, or efficacy through problem-solving strategies of joining with others in the social environment. The module has been adapted by communities into beluga whale hunt, and by inland communities into moose hunt, to teach these same values and protective factors. More detailed description of the intervention, Yup’ik terms, the process of engagement, and the Qasgiq intervention implementation model can be found in Rasmus et al. (2014).

Community Enrollment

Community 1 was an intervention development community for the Qungasvik intervention; our research team was invited to co-develop the intervention here by community leadership and tribal resolution. Community 2 was enrolled into the larger DWLD outcomes study, for which we approached “larger” population communities to maximize sample size in this area in Southwestern Alaska. Communities then elected to invite our team to conduct the intervention through tribal resolution. Enrollment in the DWLD ended when three communities were recruited, and intervention start order was randomly assigned with Community 2 assigned first in order.

Measures

Gonzalez and Trickett (2014) described measure development, and Allen et al. (2014b) tested a multi-level theory of change measurement model. In the model, change in intermediate variables at the community, family, peer influence, and individual levels, as an impact of intervention activities, leads to change in two ultimate outcome variables measuring protection from suicide and AUD risk. Supplemental Table S1 summarizes this measurement model.

Intermediate Outcomes

Elluarrluni piyugngariluni: “Learning in the Mind of Doing Things in a Masterful Way”—Individual Characteristics

Two subscales from the Multicultural Mastery Scale (Fok et al. 2012) tap mastery achieved through joining with friends (Mastery-Friends) and family (Mastery-Family) as elements of communal in contrast to individual mastery.

Elluarrluteng ilakelriit: “Nurturing Family”—Family Characteristics

The Brief Family Relationship Scale (Fok et al. (2014) cultural adapts the Family Environment Scale relationship dimension (Moos and Moos 1994) Cohesion, Expressiveness, and Conflict subscales.

Nunamta: “Our Community”—Community Characteristics

The Youth Community Protective Factors Scale, adapted from the Yup’ik Protective Factors scale (Allen et al. 2006), taps youth perceptions of Support and Opportunities as two protective community subscales.

Maryarta: “One who Leads”—Peer Influences

Two scales from the American Drug and Alcohol Survey (Oetting and Beauvais 1990), used extensively with American Indian youth, were adapted to measure rural Alaska Native youth peer attitudes that discourage alcohol and other drug use as protective peer influences in the young person’s social environment.

Ultimate Outcomes

Umyuangcaryaraq: “Reflecting”—Reflective Processes

Reflective Process (RP) (Allen et al. 2012) was adapted from the adult Yup’ik Protective Factors scale (Allen et al. 2006) to tap youth reflection on potential negative consequences from drinking alcohol that have elements of culture-specific meaning. Representative items include “I would feel embarrassed to have drinking in my family” and “I do not want to lose control of myself.”

Yuuyaraqegtaar: “A Way to Live a Very Good, Beautiful Life”—Reasons for Life (RL)

RL is a cultural adaptation and strengths-based extension of the Brief Reasons for Living Inventory for Adolescents (Osman et al. 1996), which was itself based in the Reasons for Living Inventory (Linehan et al. 1983) orginally devised for adults. Items tap elements that provide meaning in life, including culture-specific beliefs and experiences that make life enjoyable and worthwhile within a rural Yup’ik context. Representative items include “My Elders teach me that life is valuable” and “People see I live my life in a Native way.”

Data Analyses

The Qungasvik intervention is intended as a community intervention. We constructed an analytic model to address challenges in implementation and design typical to community-level intervention (Trickett et al. 2011). These challenges are amplified in rural arctic Alaska in ways similar to research in other geographically remote locations, and in ways aligned with features unique to the Yup’ik cultural context. For example, in accord with Yup’ik cultural values emphasizing choice and autonomy of the individual, including young people, intervention was open to all youth to pick and choose activities they wished to attend. Similarly, consistent with values, community activities are open to all, late enrollees were accepted at any time, and youth entered the intervention at different times.

To evaluate intervention effects in the high contrasted to the lower intervention intensity community, we created mixed effects regression models (Hedeker and Gibbons 2006) across four waves of assessment. Baselines 1 and 2 (B1 and B2) were prior to intervention, Time 1 (T1) was midway through intervention, and Time 2 (T2) occurred following approximately 12 months of intervention. Also known as hierarchical linear modeling (HLM; Raudenbush and Bryk 2002), this allowed for the clustering of observations within individuals and, permited evaluation of the impact of four potential confounding variables: (a) pre-existing differences in protection for each individual, (b) duration of participation for each individual, (c) cohort in which each youth began intervention, and (d) pre-existing differences in each measure on the level of community. Pre-existing protection was estimated using Wave 1 scores on the Community Protective Factors Support subscale and then converted through a median split procedure to high and low-protection groups represented by a dummy code comparing high with low-protection participants. Because participants entered the intervention at different times, time (in months) was centered at the date each individual started intervention. The cohort in which each youth began intervention (cohort 1, 2, and 3) was represented by a dummy code that compared cohort participants.

Community consisted of a dummy code to contrast the treatment community with the comparison arm community. At level 1, the outcome variable at each of the four time points was predicted from an individual intercept, linear time slope, community, and interactions. At level 2, the individual level, the level 1 coefficients were predicted by B1 protective factor scores (pre-existing protective factors) and when the individual became involved in the intervention (cohort). In HLM notation, the model may be expressed as follows:

  • Level 1 (time):

$$ {Y}_{ij}={B}_{0j}+{B}_{1j}\left(\mathrm{time}\right)+{B}_{2j}\left(\mathrm{community}\right)+{B}_{3j}\left(\mathrm{time}\ast \mathrm{community}\right)+eij $$
$$ {B}_{0j}={G}_{00}+{G}_{03}\left(\mathrm{high}\ vs.\mathrm{low}\ \mathrm{protection}\right)+{G}_{04}\left(\mathrm{cohort}\ 2\ vs\ 1\right)+{G}_{05}\left(\mathrm{cohort}\ 3\ vs\ 1\right)+{u}_{0j} $$
$$ {B}_{1j}={G}_{10}+{G}_{13}\left(\mathrm{high}\ vs.\mathrm{low}\ \mathrm{protection}\right)+{G}_{14}\left(\mathrm{cohort}\ 2\ vs\ 1\right)+{G}_{15}\left(\mathrm{cohort}\ 3\ vs\ 1\right)+{u}_{1j} $$
$$ {B}_{2j}={G}_{20}+{G}_{23}\left(\mathrm{high}\ vs.\mathrm{low}\ \mathrm{protection}\right)+{G}_{24}\left(\mathrm{cohort}\ 2\ vs\ 1\right)+{G}_{25}\left(\mathrm{cohort}\ 3\ vs\ 1\right)+{u}_{2j} $$
$$ {B}_{3j}={G}_{30}+{G}_{33}\left(\mathrm{high}\ vs.\mathrm{low}\ \mathrm{protection}\right)+{G}_{34}\left(\mathrm{cohort}\ 2\ vs\ 1\right)+{G}_{35}\left(\mathrm{cohort}\ 3\ vs\ 1\right)+{u}_{3j} $$

Results

Psychometric Operating Characteristics of Outcome Measures

Table 2 reports number of items, and coefficient alpha, item separation, and person separation reliabilities, and scale intercorrelations by community at Wave 1. Alpha ranged from acceptable to excellent (α = .62–.96), with exception of RP in Community 1 (α = .49).

Table 2 Psychometric properties of outcome measures: reliabilities and correlations among measures at Wave 1

Fok and Henry (2015) describe how intervention measures, particularly in small samples, require high sensitivity to change. While coefficient alpha reliability taps item interrelation, a critical understudied attribute in intervention research is a measure’s sensitivity to change through item separation and person separation reliabilities. Based in item response theory conceptions of item and scale functioning, these indices make use of Rasch analysis techniques (Bond and Fox 2007). Item separation reliabilities provide an index of the extent the sample of individuals from each community is adequate to scale the item set and confirm its item hierarchy. Person separation reliabilities provide an index in each scale of the extent to which items separate individuals according to their levels of the latent trait; this taps the ability of scales to track different levels of the attribute under study, and change in the attribute in response to intervention.

The item separation reliabilities indicate that in general, each community sample displays good to excellent capabilities to scale each measure, with the lone exception of Peer Influences in Community 2. Here, item separation values fell to .29, suggesting significant limitations in the distribution of data, and the measure is unlikely to track any change among youth in the community. Person separation reliabilities generally displayed good capabilities to index individuals at different levels of the latent trait. However, three measures performed suboptimally: Individual Characteristics in Community 2 (.40), Community Characteristics in Community 2 (.20), and Reflective Processes in Community 1 (.21). Limitations in these measures’ ability to discriminate between persons at different levels of the latent trait create limitation in their ability to capture change within each community over time. On Peer Influences in Community 2, person separation was virtually nonexistent (.09), strongly suggesting that measurement limitations would prevent identification of any change.

Correlations among outcome measures were low to moderate. This suggests each tapped unique variance and separate dimensions, with exception of RL with Community Characteristics.

Evaluation of Intervention Effects

In Community 1, youth attended a mean (SD) of 6.78 (6.76) modules, while in Community 2, youth attended a mean (SD) of 2.31 (3.24) modules. Table 3 reports mean scale scores across communities and four waves of assessment over approximately 1 year. In Community 1, scores on each item were derived from an analog response format coded on a 5-point interval. In Community 2, an upgrade to our assessment software allowed coding on a 20-point interval. This change was to facilitate greater precision in our future measurement refinement efforts; scores were range-standardized prior to analyses to address this scaling issue.

Table 3 Mean scale scores at Waves 1–4 measurement points

Table 4 reports HLM results, including slope, standard error, t statistic, p value, and Cohen’s d as a measure of effect size. Due to complexity of the model described, outcomes are reported only for time, community, protection, and time × community, which examined whether the effects of time differed by community. These analyses allow us to compare effectiveness of intensive (Community 1) to less intensive (Community 2) intervention. Results indicate intensive intervention in contrast to less intensive intervention produced impact on RL (d = .28, p < .05), but not RP or intermediate variables. This can be interpreted to indicate that the intervention produced significantly greater growth in protection from suicide, but not alcohol risk, within the treatment arm, in contrast to the comparison arm. Analyses found significant growth over time within Community 1 but not Community 2, on RL (d = .43, p < .05) but not RP, and on the individual characteristics intermediate variable (d = .34, p < .05), but not family or community characteristics, while peer effects grew in Community 2 but not Community 1 (d = .50, p < .01) (Supplemental Tables S2 and S3).

Table 4 Summary of mixed model comparative effectiveness results (N = 128)

Functional data analysis (Ramsay and Silverman 2005) was used to explore the RL finding over time with greater precision. Figure 2 displays function estimates derived from RL scores at the four time points, along time ranges beginning 2.5 months prior to intervention (−2.5), to intervention start (0), and post-intervention (+12.3). The estimated treatment RL function (solid curve) is below comparison (dotted curve) at month −2.5 and crosses comparison during month 1. At 3 months, treatment RL declines, then after 6 months, increases back to higher levels.

Fig. 2
figure 2

Estimated Reasons for Life (RL) functions for treatment (Community 1) and comparison (Community 2). Cross markers indicate raw treatment RL scores, while circles indicate raw comparison RL scores. The solid curve represents the estimated function for treatment, while the dotted curve indicates comparison. Three data points that were outliers on time in intervention beyond 12.3 months were winsorized to 12.3 months. Drop in the treatment smoothed function curve at 11–12 months is a common functional data analysis artifact when data points are lacking; treatment had few data points available at 11–12 months

Discussion

The primary finding identified impact of the Qungasvik intervention on outcomes protective from suicide risk among rural Yup’ik Alaska Native youth. A more intensive version of the Qungasvik intervention, defined as a higher dose implementation, produced significantly greater intervention impact in contrast to a lower dose. Dose was measured through number of intervention sessions each participating youth attended. Protection from suicide risk included beliefs and experiences that make life enjoyable and worthwhile, and that provide life meaning. This finding provides support for Qungasvik as a promising approach to prevention of suicide risk in these rural Yup’ik communities and additionally suggests that on average, youth begin to benefit from the intervention following attendance in approximately seven activities.

Addressing suicide and co-occurring alcohol misuse risk is a public health priority of immense proportion for American Indian and Alaska Native communities. There is similar intense interest in “culture as treatment,” approaches that use Indigenous traditional knowledge in the development of culturally commensurate, effective intervention strategies (Gone 2012). This study represents, to our knowledge, one of the first efforts in blending perspectives from Western science in an effort to explore effectiveness of a culture as intervention approach.

A second promising element involves the extent of intervention reach with young Alaska Native males, who participated in Qungasvik at rates similar to or exceeding females. Young Alaska Native men are at particularly high risk for suicide (Allen et al. 2011). Development of effective intervention strategies for this group constitutes a high services priority. We believe that the successful engagement of young men here results from the intervention development efforts to include types of activities and cultural knowledge that appeal to young men, recruitment of community staff for gender balance, and program efforts to actively seek out, then retain males.

These findings are subject to four important limitations. First, while intervention roll out order was randomly assigned in the DWLD, dose was not the result of random assignment; as a result, interpretations of differences in effects on the community level are complex. Each intervention is influenced by differing community histories of research engagement and other events, including youth suicide. Randomization resulted in Community 2 implementing the intervention immediately following intervention implementation in the most geographically contiguous community in the region. These two communities have close intercommunity relationships, including rivalries from such areas as high school sports to community and extended kinship family histories. In Community 2, this may have hindered local ownership of an intervention model initially developed in Community 1. Community members questioned this intervention roll out order selection given these considerations; however, we adhered to the randomization schedule in our design. A more protracted process of local adaptation in Community 2 than originally envisioned ensued, and at the one year point, start-up had commenced more slowly and dose was lower. The lower intervention intensity in Community 2 at 12 months reflected this organizational work as part of the implementation. Community 2 sustained implementation following the assessment time frame, highlighting the differing time needs noted in the community-level intervention literature (Trickett et al. 2011). Though suicide was no stranger to both study communities, Community 1 was responding to a recent youth suicide cluster, and implementation followed 2 years of sustained engagement as an intervention development community. In contrast, Community 2 had a briefer pre-intervention engagement and no immediate history of cluster suicide. It also had more males enrolled, who are at greater risk as a population, and this may have impacted outcomes.

Second, though this study enrolled a majority of the youth residing in the two participating communities, sample size is small for a universal prevention trial, and several youth changed community residence or were out of the community at an assessment time point, which impacted retention rates. It is possible that some of the observed nonsignificant trends in the data may have proven statistically significant with a larger sample.

Third, measures of Individual and Community Characteristics and of RP in Community 1 displayed limitations in ability to separate individuals according to level on the latent trait, while Peer Influences in Community 2 provided virtually no capability. These measurement issues could result in Type II error, failure to reject a false null hypothesis by not identifying a true intervention effect, rather than producing a false positive finding. One value of this study was in alerting the research team on the need to refine these measures and to replace the Peer Influences measure in our ongoing work with Qungasvik.

Finally, there were numerous null findings. While Qungasvik aims to promote protection from co-occurring suicide and alcohol risk, significant findings were not observed for alcohol protection, and there were no intermediate outcome differences between communities. However, within community growth occurred for the Individual Characteristics intermediate outcome in Community 1, a proposed mechanism of change variable, but not for other intermediate variables in the model. The negative findings highlight the positive value of these types of incremental small sample studies as part of ongoing larger scale prevention trials. The findings alerted our team to re-examine the alcohol intervention components and to scrutinize fidelity in the alcohol prevention efforts of our ongoing implementation, as well as to enhance family level efforts directed at parents and parenting, and community level activities, in our ongoing implementation.

Important issues surface in asking why did Community 2 produce a lower intensity implementation? Community intervention implementation requires attending to each community’s current level of organizational development. This calls forth more nuanced consideration of the concept of time and of intervention time duration (Allen et al. 2014, Trickett et al. 2014). In some communities, implementation may require a lengthier community development process; time in intervention as a variable and as a level of analysis may be better understood in terms of community position along this developmental process. In fact, we have observed sustained intervention activities, with ongoing elements of direct intervention effects and ripple effects in Community 2, in ways that have expanded our understanding of impact.

This study adopted a comparative effectiveness design in efforts to apply a cultural perspective in evaluation of research risks and benefits, one of four ethical dimensions in culturally sensitive research proposed by Trimble et al. (2010). In their evaluation of risk and benefit, several community representatives expressed discomfort with randomized controlled trials (RCTs). In particular, they observed that withholding a program was inconsistent with Yup’ik cultural values of inclusion; from a community perspective, if an intervention is thought beneficial, why would you randomly withhold it from some? From Yup’ik cultural perspectives, the community is itself one organism; why would you treat half of it? In response, for the larger prevention trial from which this study was drawn, we adopted a DWLD. Two communities at this point in time of our intervention research emerged as implementing at different dose levels. This allowed comparison of one community to another at a point earlier in its intervention process. This approach, a variation of a stepped-wedge design (Henry et al. 2015), avoided a direct comparison of quality of the intervention work across two close-knit communities, because the unit of analysis was intensity of intervention.

How might the field advance by using rigorous designs that allow for better causal inference in these extremely hard-to-reach communities? Health disparities research is often conducted with small population and culturally distinct, difficult to access populations, leading several researchers to argue that small sample studies are essential to health disparities science (Srinivasan et al. 2015; Henry et al. 2015). However, small samples create challenges to effective implementation of the RCT design. Wyman et al. (2015) describe logistical and ethical issues that can arise, and note inefficiencies in how the RCT uses information can result in low power and external validity. They propose the DWLD and regression point displacement design as alternative designs that maintain rigor while optimizing power, and Fok et al. (2015) propose the stepped-wedge design (SWD) and interrupted time-series design. Roll out designs such as the DWLD and SWD allow for the additional element of randomization of intervention start time. By prioritizing optimization in design selection, combined with measurement optimization strategies outlined by Fok and Henry (2015) and additional optimization strategies proposed by Hopkin et al. (2015), studies can more efficiently use available information to maximize power with modest sample size to advance the field. Finally, mixed methods and implementation of Bayesian approaches provide emergent alternatives (Henry et al. 2015).

The current research provides promising findings for the Qungasvik intervention and demonstrates the utility of small sample methods to address socially meaningful issues with culturally distinct, small populations in difficult to access settings. More broadly, these findings provide further window into culture as treatment and the promise of intervention approaches making direct use of culture, drawing from the world of local practices grounded in Indigenous traditional knowledge.