Introduction

Heroin dependence is a disease characterized by recurring impairment of physiological and psychological conditions, resulting from persistent maladaptive use of the psychoactive substance heroin. It is a complex disease where initiation is primarily environment-dependent, but the progression is dependent upon environmental (Rhee et al. 2003; Robinson and Berridge 2008) and genetic factors (Kobayashi and Schultz 2008). The role of the environmental factors may range widely from the availability of psychoactive substances, prevailing laws in a region, social interaction pattern to the developmental conditions of an individual at home or surrounding. Genetic factors to addiction account for 40–60% of an individual’s vulnerability to addiction and can also be influenced by gender, ethnicity, and the developmental stages (National Institute on Drug Abuse 2010).

Dopamine transporter (DAT) is a transmembrane protein responsible for the transportation of dopamine from the synapse into the cytosol. It is a major regulator of dopaminergic neurotransmission (Giros et al. 1996) and thus has been suggested to play an essential role in the dopaminergic reward pathways of the mesocorticolimbic region in substance dependence (Koob and Nestler 1997). Heroin use increases phasic dopamine neurotransmission relative to baseline by increasing dopamine release. The increase in dopamine enhances salience and learning associated with reward and addiction (Sulzer 2011). Impairments in the dopaminergic pathways that regulate the neuronal systems that are associated with self-control, conditioning, stress reactivity, reward sensitivity, and incentive motivation are the major causes of addiction (Volkow et al. 2013). The basolateral amygdala or the nucleus accumbens and the orbital prefrontal cortex (PFC) are richly interconnected, and this neuronal network has been indicated in addiction and conditioned reinforcement. Lesions in this neuronal network have been implicated in impairing the acquisition of cocaine or heroin seeking (Everitt and Robbins 2005). Interactions of the amygdala, hippocampal, and PFC projections in the nucleus accumbens are modulated by mesolimbic dopamine, which in turn can modulate the release of dopamine (Everitt and Robbins 2005). Low dopamine function has been linked to abnormal cravings (Blum et al. 2008).

The human DAT1 gene (SLC6A3) with cytogenetic location 5p15.33 consists of 15 exons and spans approximately 60 kb (Giros et al. 1992; Vandenbergh et al. 1992). The gene exhibits consensus sequences for RNA splicing at each intron-exon junction. Its protein-coding region starts within the exon 2 and ends at the beginning of exon 15 (Bannon et al. 2001). DAT1 and its variants have been reported to be associated with a responsivity of the reward-related network (Dreher et al. 2009), cocaine-induced paranoia (Gelernter et al. 1994), cocaine addiction (Brewer et al. 2015; Guindalini et al. 2006), alcoholism (Vasconcelos et al. 2015), the severity of alcohol withdrawal (Sander et al. 1997), alcohol withdrawal seizures (Le Strat et al. 2008), smoking (Wetherill et al. 2014), and smoking cessation (Ma et al. 2016) in various populations. This series of evidence suggest that DAT1 is a strong candidate gene for heroin addiction. Therefore, the present study aims to investigate the association of DAT1 with heroin abuse in a small homogeneous Indian subpopulation from Manipur, a state in India with a high prevalence of drug dependency. In this study, we have selected eight single-nucleotide polymorphisms (SNPs) and two variable number tandem repeats (VNTRs) of DAT1 to investigate the association of DAT1 variants with heroin addiction.

Materials and Methods

Subject Selection

A total of 279 (147 controls and 132 cases) individuals were involved in this study. Participants were recruited with their informed written consent, following the DSM-IV criteria (American Psychiatric Association: Diagnostic and Statiscal Manual of Mental Disorders 1994). The age of heroin-dependent cases ranged from 22 to 57 years. Among the 132 cases, 110 were males, and 22 were females. Out of the total 147 control subjects, aged between 18 and 58 years, 98 were males, and 49 were females. Heroin-dependent subjects were examined and assessed by an expert psychiatrist specialized in substance abuse. All the cases were undergoing oral substitution therapy (OST) with buprenorphine at the time of blood collection. Control participants were selected based on non-substance user criteria provided in DSM-IV. Individuals with other psychiatric diagnoses, such as psychosis, or chronic physical illness such as diabetes or other metabolic disorders were excluded from the study. The study conformed to the Declaration of Helsinki and was approved by the Institutional Human Ethics Committee of Manipur University.

DNA Isolation

Genomic DNA was prepared from WBCs isolated from the peripheral blood of the participants following the salting out procedure (Miller et al. 1988). DNAs were dissolved in TE (Tris EDTA) buffer pH − 8.0 and stored at − 80 °C to be used for downstream genotyping analysis.

Genotyping Analysis

Eight SNPs (rs40184, rs27048, rs37021, rs250683, rs250682, rs427284, rs458609, rs246997) and two VNTR polymorphisms (rs28363170 and rs3836790) located within the 3′ untranslated region (UTR) and intron 8 of the DAT1 were selected for testing their association with heroin dependence. A diagrammatic representation of DAT1 is provided in Fig. 1a (NCBI Database n.d.) and the marker details in Table 1. The genomic regions encompassing the selected SNPs and VNTRs for the DNA samples were amplified by PCR (T100™ Thermal Cycler, BIORAD) in a 25 μl reaction mixture containing OneTaq®Quick-Load® 2X Master Mix with standard buffer (New England Biolabs®) and specific oligonucleotide primers (Table 1) purchased from Sigma-Aldrich, India. The specific primers were designed using Primer3 software (Koressaar and Remm 2007). PCR cycling conditions for all the SNPs included initial denaturation at 94 °C for 3 min; 35 cycles of denaturation at 94 °C for 30 s, annealing at X °C (X, 55.2 °C to 60.1 °C depending on the primer) for 30 s; extension at 68 °C for 45 s; and a final extension at 68 °C for 5 min. Cycle conditions for VNTR rs28363170 included initial denaturation at 94 °C for 5 min followed by 35 cycles of denaturation at 94 °C for 20 s, annealing at 73 °C for 30 s; extension at 68 °C for 20 s; and a final extension at 68 °C for 5 min. For VNTR rs3836790, initial denaturation was at 94 °C for 40 s followed by 35 cycles of denaturation at 94 °C for 30 s, annealing at 63 °C for 30 s; extension at 68 °C for 40 s; and a final extension at 68 °C for 5 min.

Fig. 1
figure 1

a Diagrammatic representation of the DAT1 (SLC6A3) gene structure. Gene structure was constructed using information gathered from the NCBI gene database. The horizontal black line with vertical bars illustrates the complete gene structure, and the comparative scale in the chromosome is indicated. Dark vertical bars depict exons, and exon numbers are shown above the bar. Locations of the various studied markers are depicted in the gene structure using their respective IDs. Spaces between the exons represent introns. b Graphical representation of pairwise linkage disequilibrium (LD) of DAT1 markers in heroin-dependent cases and controls. Pairwise measures of LD, such as normalized linkage disequilibrium coefficient (D′) and correlation coefficient (r2), are estimated using Haploview program v4.2. The values in the box represent D′. The intensity of the colour is dependent on the D′/LOD value, and greater intensity of the red colour indicates higher D′ value

Table 1 Details of the markers, restriction enzymes, and sequences of the PCR primers used for genotyping analysis of DAT1 polymorphisms

After PCR, the eight SNPs were genotyped by using restriction fragment length polymorphism (RFLP) method, while only PCR was used for genotyping the two VNTRs. Both PCR and RFLP products were analysed by gel electrophoresis. For RFLP, the restriction digestions were performed by the restriction enzymes listed in Table 1, following the manufacturer’s protocols (New England Biolabs®).

Statistical Analysis

The statistical power of the present analysis was calculated as post hoc using GAS Power Calculator, assuming that a multiplicative disease model with a disease prevalence of 0.007 (0.7%), the disease allele frequency of 0.35, a significance level of 0.05, and genotype relative risk of 1.5 was 0.655 (65.5%) which is moderate. The Hardy-Weinberg equilibrium (HWE) analysis of cases and controls for each marker was performed using POPGENE version 1.32 (Yeh et al. 1997). Genotypic/allelic frequency calculation and population-based case-control association analyses based on allelic, genotypic, and haplotypic frequencies were performed by using UNPHASED program version 3.1.5 (Dudbridge 2008), and the multiple testing was corrected by 1000 permutations using the same program. The two VNTRs were also analysed based on genotypic and allelic frequencies using SPSS (version 16.0). Odds ratio (OR) and confidence interval (CI) were calculated using OR calculator (http://www.hutchon.net/confidornulhypo.htm). Linkage disequilibrium (LD) between the paired markers of DAT1 was calculated using HaploView program version 4.2 (Barret et al. 2005). Bimarker-based haplotype association analysis was carried out to identify the unique chromosomal segments that likely harbour disease-specific susceptible haplotypes. For the two VNTRs, the alleles were grouped into two, viz. L (10R and 10+R for rs28363170 and 7R and 7+R for rs3836790) and S (9R and 9-R for rs28363170, and 6R and 6-R for rs3836790), for biallelic representation before the analysis. Bivariate correlation analysis and binary logistic regression analysis were performed using SPSS (version 16.0). Bivariate correlation analysis was performed for different variables, viz. ‘years of drug use’, ‘number of times drug abstained’, ‘onset’, rs28363170 genotypes, and rs246997 genotypes with disease status. Binary logistic regression analysis was performed with heroin use disorder status as a dependent variable and rs28363170-10R/10R, rs28363170-10R/9R, rs28363170-10R/11R, rs246997-TT, rs246997-GT, and rs246997-GG as covariates.

Marker-marker interaction of the DAT1 gene was carried out using multifactor dimensionality reduction (MDR) 2.0 beta 8.4 (Hahn et al. 2003). MDR is a genetic model-free, non parametric method for detecting, characterizing, and interpreting nonlinear interactions among discrete genetic and environmental attributes.

Bioinformatics Analysis

The functional protein interaction network of DAT1 protein was retrieved from STRING version 10.5, an online computational prediction tool available at http://string-db.org (Szklarczyk et al. 2017). The confidence score was set at 0.7 and 0.9 to obtain the interaction results. Variant Effect Predictor (VEP) (McLaren et al. 2016) and Gene-Aware Variant INterpretation (GAVIN) (van der Velde et al. 2017) online tools were used to analyse the functional effect of the markers.

Results

Genotypic Distribution of DAT1 Markers in the Studied Population

The homogeneous distribution of the genotypes in the studied population was evaluated for each DAT1 marker by HWE analysis. The results showed that the genotypic distributions of all the markers except rs37021 (controls) conformed to HWE equilibrium (Table 2). All the ten markers were subjected to further analysis.

Table 2 Case-control association analysis using genotypic distribution and HWE analysis of DAT1 markers

A Biased Genotypic Distribution Between Heroin-Dependent Cases and Controls

Case-control association analysis using the genotypic frequencies demonstrated significant differences between the cases and controls for the marker rs246997. The GG genotype was over-represented in heroin-dependent cases compared with controls (p = 0.002; LRS = 12.443), and significance retained after multiple corrections by 1000 permutations (adjusted p = 0.047) as shown in Table 2. It may be noted that alleles for 3′UTR VNTR (rs28363170) and intron 8 VNTR (rs3836790) were grouped into two, for biallelic representation. ‘L’ allele represents 10R and above for rs28363170, and 7R and above for rs3836790. ‘S’ allele represents 9R and below for rs28363170, and 6R and below for rs3836790. Therefore, to analyse the individual genotypic frequencies, SPSS was used based on their presence or absence, and a significant difference was observed between heroin-dependent cases and controls for the 10R/11R genotype of rs28363170. The 10R/11R genotype was over-distributed among controls as compared with heroin-dependent cases (p = 0.007, chi-square = 7.342, OR = 0.24, df = 1), and further details are provided in Table 3. Overall genotypic analysis by SPSS with df = 5 revealed nominally significant differences between heroin-dependent cases and controls (p = 0.053, chi-square = 10.91) (Table 3).

Table 3 Population-based analysis of the individual alleles and genotypes of the two VNTR sites using SPSS

Allelic Distribution of DAT1 Markers in Heroin-Dependent Cases and Controls

Allelic frequencies of each marker were calculated separately for the cases and controls (Table 4). Allelic frequency-based case-control association analysis showed a significant association of DAT1 with heroin dependence. The G allele of rs246997 had significantly higher distribution in heroin-dependent cases as compared with controls (p = 0.0008; OR = 1.77, LRS = 11.18), and the association remained significant after multiple corrections testing with 1000 permutations (adjusted p = 0.008). SPSS analysis with df = 1 depicted a significant difference between heroin-dependent cases and controls for the 11R allele of 3′UTR VNTR (rs28363170). The 11R allele was over-distributed among controls as compared with heroin-dependent cases (p = 0.008; chi-square = 7.13; OR = 0.25; 95%CI = 0.09–0.69), but at df = 3, no significant difference was observed as shown in Table 3.

Table 4 Population-based case-control analysis of DAT1gene using allelic distribution of the various markers

Biased Linkage Disequilibrium of Paired Markers Between Case and Control

Linkage disequilibrium (LD) analysis showed contrasting LD pattern between controls and cases for some marker pairs, and the details of the results are summarized in Table 5 and Fig. 1b. Comparatively, wider LD block starting from rs37021 to rs458609 was observed in controls, whereas it is disrupted in cases by a low LD region between pair of markers involving rs250683 [rs250683-rs250682 (D′ = 0.232, r2 = 0.048), rs250683-rs427284 (D′ = 0.303, r2 = 0.087), rs250683-rs458609 (D′ = 0.441, r2 = 0.167), rs250683-rs246997 (D′ = 0.299, r2 = 0.048)]. Higher LD profile for the same marker pairs involving rs250683 was observed in controls [rs250683-rs250682 (D′ = 0.711, r2 = 0.460), rs250683-rs427284 (D′ = 0.850, r2 = 0.675), rs250683-rs458609 (D′ = 0.840, r2 = 0.687), rs250683-rs246997 (D′ = 0.428, r2 = 0.167)]. A high LD forming a haplotype block with D′ = 0.946 and r2 = 0.735 was observed for the marker pair rs427284-rs458609 in heroin-dependent cases compared with controls (D′ = 0.792, r2 = 0.602). Comparatively, lower LD was observed in heroin-dependent cases than controls for the marker pairs rs28363170-rs40184 (cases, D′ = 0.746, r2 = 0.234; controls, D′ = 0.947, r2 = 0.277) and rs28363170-rs250683 (cases, D′ = 0.311, r2 = 0.012; controls, D′ = 0.764, r2 = 0.060). It may be noted again that alleles for rs28363170 and rs3836790 were grouped into two for a biallelic representation as mentioned above.

Table 5 Linkage disequilibrium analysis between marker pairs in heroin-dependent cases and controls

Case-Control Association Analysis Using Haplotypes

Bimarker-based haplotype analysis demonstrated that out of forty-five bimarker combinations analysed, fourteen bimarker combinations were found to be significant in the bimarker-based haplotypic association analysis (Table 6). Of these, only five bimarker combinations retained their significance after the corrections for multiple testing, viz. rs37021-rs250683, rs250683-rs250682, rs250683-rs427284, rs250683-rs458609, and rs250683-rs246997. Out of the fourteen significant combinations, all marker combinations involving the marker rs246997 showed significance as also demonstrated in allelic and genotypic analyses. It may be noted that rs250683 showed significant association at the haplotypic level and this finding points towards the existence of a combinatorial effect in heroin dependence.

Table 6 Case-control association analysis using haplotypic distribution of bimarker combinations of DAT1 for heroin dependence

Bimarker-based haplotype analysis demonstrated that haplotype of marker combinations such as L-G rs28363170-rs246997 (p = 0.008, OR = 1.72), T-G rs40184-rs246997 (p = 0.004, OR = 1.30), L-G rs38336790-rs246997 (p = 0.004, OR = 1.72), T-G rs27048-rs246997 (p = 0.006, OR = 1.80), G-T and A-C rs37021-rs250683 (p = 0.0001, OR = 1.58 and 4.49), G-G rs37021-rs250682 (p = 0.006, OR = 1.43), G-G rs37021-rs246997 (p = 0.007, OR = 1.64), T-G and C-C rs250683-rs250682 (p < 0.0001, OR = 2.54 and 2.83), T-G and C-C rs250683-rs427284 (p < 0.0001, OR = 3.56 and 4.45), T-T and C-C rs250683-rs458609 (p < 0.0001, OR = 2.89 and 4.14), C-G rs250683-rs246997 (p = 0.0005, OR = 2.36), G-G rs250682-rs246997 (p = 0.002, OR = 1.82), G-G rs427284-rs246997 (p = 0.001, OR = 2.03), and T-G rs458609-rs246997 (p = 0.004, OR = 1.81) was significantly over-distributed in heroin-dependent cases compared with controls. On the other hand, haplotype of marker combinations such as L-T rs28363170-rs246997 (p = 0.008, OR = 0.55), C-T and T-T rs40184-rs246997 (p = 0.004, OR = 0.67 and 0.62), S-T and L-T rs38336790-rs246997 (p = 0.004, OR = 0.50 and 0.66), C-T rs27048-rs246997 (p = 0.006, OR = 0.60), G-C and A-T rs37021-rs250683 (p = 0.0001, OR = 0.74 and 0.56), G-C and A-G rs37021-rs250682 (p = 0.006, OR = 0.57 and 0.31), G-T and A-T rs37021-rs246997 (p = 0.007, OR = 0.64 and 0.64), T-C rs250683-rs250682 (p < 0.0001, OR = 0.52), T-C and C-G rs250683-rs427284 (p < 0.0001, OR = 0.50 and 0.66), T-C and C-T rs250683-rs458609 (p < 0.0001, OR = 0.62 and 0.68), C-T rs250683-rs246997 (p = 0.0005, OR = 0.61), C-T rs250682-rs246997 (p = 0.002, OR = 0.53), C-T rs427284-rs246997 (p = 0.001, OR = 0.58), and T-T and C-T rs458609-rs246997 (p = 0.004, OR = 0.65 and 0.65) was over-represented in controls (Table 6).

Correlation Analysis

Bivariate correlation analysis among different variables demonstrated significant positive correlation between rs246997-GG (p = < 0.001, Pearson correlation = 0.217) with heroin dependence. Significant negative correlations were observed for heroin dependence with rs246997-TT (p = 0.001, Pearson correlation = − 0.203) and disease status with rs28363170-10R/11R (p = 0.007, Pearson correlation = −0.162). No significant correlations were observed for the variables ‘years of drug use’, ‘number of times drug abstained’, and ‘onset’ with rs28363170 genotypes or rs246997 genotypes.

Regression Analysis

Binary logistic regression analysis with heroin use disorder status as dependent variable and rs28363170-10R/10R, rs28363170-10R/9R, rs28363170-10R/11R, rs246997-TT, rs246997-GT, and rs246997-GG as covariates revealed significant odds ratio for rs246997-GG (OR = 2.576, 95% CI = 1.49–4.45, p = 0.001), rs28363170-10R/11R (OR = 0.176, 95% CI = 0.04–0.81, p = 0.025), and rs246997-TT (OR = 0.392, 95% CI = 0.19–0.80, p = 0.010). The Hosmer-Lemeshow test of goodness of fit at df = 1 was not significant (p = 0.887) depicting the fitness of proposed model. ln(odds) of the derived logistic regression equation was − 0.302 with p = 0.041 and odds ratio of 0.74.

Marker-Marker Interaction Analysis

To investigate the interaction between markers of the DAT1 gene, an MDR test was performed. The interaction of all the markers in the study is presented in the dendrogram generated through MDR (Fig. 2a). The strength of the synergistic interactions between the studied markers along with their values is demonstrated by the entropy-based interaction graph (Fig. 2b). Marker rs250683 has shown a maximum synergistic effect with most of the studied markers. The estimated information gain (IG) for some pairwise combination (interaction effect) was found to be higher than that of individual markers, as seen in the case of rs250683 (individual entropy value = 0.21) with rs427284 (individual entropy value = 0.13, synergistic entropy value = 7.13), rs250682 (individual entropy value = 0.27, synergistic entropy value = 4.33), and rs37021 (individual entropy value = 0.55, synergistic entropy value = 3.80) (Fig. 2b). Marker rs246997 has shown a high individual entropy value of 3.22 but no effective synergism with any of the studied markers. Further, the best interaction models were generated based on maximum cross-validation consistency (CVC) and the lowest prediction error (PE). The result provided four best models which include a highest single-marker effect of rs246997 (p = 0.0017, OR = 2.84, CVC = 10/10, PE = 0.427), highest interactive effects of two marker combination, viz. rs250683-rs427284 (p = < 0.0001, OR = 4.36, CVC = 10/10, PE = 0.355), three marker combination, viz. rs37021-rs250683-rs427284 (p = < 0.0001, OR = 4.97, CVC = 5/10, PE = 0.322), and four marker combination, viz. rs27048-rs37021-rs250683-rs250682 (p = < 0.0001, OR = 8.94, CVC = 4/10, PE = 0.272), for heroin dependence. Genotype GG of rs246997 was more prevalent among cases than the controls (Fig. 2c). The most effective marker interaction was found to be rs250683-rs427284, where TT-GG, TT-GC, TC-GG, TC-CC, CC-GC, and CC-CC genotypes interact more among cases compared with controls (Fig. 2d).

Fig. 2
figure 2

MDR generated individual and interactive data. a Dendrogram showing interactions of all the studied markers of the DAT1 gene. Colour variation indicates the strength of interaction, and the increase in interaction is presented from blue (redundant) towards red (synergy). b Circle graph showing different entropy levels of individual and interacting markers for heroin dependence. The strong interaction is indicated when the entropy value is higher in the combined set than the individual markers. c Graphical view showing high-risk and low-risk genotype for heroin dependence in single-marker model. d Graphical view showing high-risk and low-risk genotype for heroin dependence in two marker interaction model

Bioinformatics Analysis

Variant Effect Predictor (VEP) and Gene-Aware Variant INterpretation (GAVIN) online tools were used to check the predicted effects of the markers. Both VEP and GAVIN have scored the impact of all the studied ten markers as modifier (usually non-coding variants or variants affecting non-coding genes, where predictions are difficult, or there is no evidence of impact). DAT protein interaction network depicting all known, predicted, and other interactions generated through the online search tool ‘STRING’ is provided in Fig. 3. The maximum DAT interaction score was observed with DRD2 (0.971). The remaining nine highest scoring DAT-interacting proteins were SNCA (0.968), CDH1 (0.908), FLOT2 (0.900), CAMK2A (0.870), SLC618A2 (0.860), TH (0.840), CAMK2G (0.837), CAMK2B (0.804), and PRKCA (0.804).

Fig. 3
figure 3

DAT1/SLC6A3 protein interaction model generated through the online search tool ‘STRING’. a DAT1/SLC6A3 and other protein interactions, showing evidence from various sources at a set confidence score of 0.7. b DAT1/SLC6A3 and other protein interactions, depicting molecular action at a set confidence score of 0.9

Discussion

The major results of the present study are as follows: (i) this is the first study to report the association of marker rs246997 of DAT1 gene with heroin dependence at allelic, genotypic, and haplotypic levels; (ii) association of five bimarker-based haplotype combination of DAT1 gene, viz. rs37021-rs250683, rs250683-rs250682, rs250683-rs427284, rs250683-rs458609, and rs250683-rs246997, with heroin dependence even after multiple testing correction; (iii) association of 11R allele and 10R/11R genotype of 3′UTR VNTR (rs28363170) of DAT1 gene with protection against heroin dependence, which is being reported for the first time; (iv) the finding of 7R and 6R alleles as the common alleles of intron 8 VNTR (rs3836790) of DAT1 gene in this study population indicates ethnicity-linked variation in allele frequency of this marker; and (v) this is for the first time that 13R allele (7R/13R genotype) and 14R allele (7R/14R genotype) are identified for intron 8 VNTR (rs3836790).

In the present study, the marker rs246997 of DAT1 was found to be significantly associated with heroin dependence even after correction for multiple testing with 1000 permutation, and it has not been cited or reported earlier for its association with heroin dependence in other similar studies. MDR also demonstrated the highest single-marker effect of rs246997 with a cross-validation consistency of 10/10 and a p value of 0.027. Correlation analysis and regression analysis have also demonstrated the association of this marker with heroin dependence. However, no correlation of rs246997 genotypes with possible determinants of heroin dependence, such as years of drug use, age at onset, and the number of times drug abstained, was observed. This could be because of the small sample size of the study. Moreover, the assessment and analysis of these possible determinants of heroin dependence were based on individual responses to a questionnaire. A previous study (Koijam et al. 2020) with the same population had demonstrated a significant association of an opioid receptor mu 1 (OPRM1) variant with the number of times drug abstained.

Since the VNTRs have more than two alleles and three genotypes, possible differential role of alleles and genotypes as a group and as an individual entity was checked using UNPHASED and SPSS. Analysis using UNPHASED by the grouping of all repeats into higher repeat alleles and lower repeat alleles failed to reveal any possible effect on phenotype. However, when analysed individually with df = 1 by SPSS, a significant difference between heroin-dependent cases and controls for 11R allele and 10R/11R genotype of 3′UTR VNTR (rs28363170) was demonstrated. When overall individual alleles and genotypes of 3′UTR VNTR (rs28363170) were analysed with df > 1, nominal significance was observed at the genotypic level (p = 0.053, df = 5) but not at allelic level.

A study by Hou and Li (2009) on 3′UTR VNTR (rs28363170) of the DAT1 gene in Han Chinese population found no association of the DAT1 gene with heroin dependence. Vereczkei et al. (2013) reported a lack of association of DAT1 gene variants 3′UTR VNTR (rs28363170) and intron 8 VNTR (rs3836790) with heroin dependence in the Hungarian population. Another study by Yeh et al. (2010) on Han Chinese involving 15 variants of the DAT1 gene, including rs27048, 3′UTR VNTR (rs28363170), and intron 8 VNTR (rs3836790), reported absence of association with heroin dependence. The non-association of rs27048 and intron 8 VNTR (rs3836790) with heroin dependence was replicated in the present study. The non-association results involving 3′UTR VNTR (rs28363170) from these studies were based on analysis after the dualization or grouping method of VNTR, which is similar to our result under similar analysis method. However, the present study analysed 3′UTR VNTR (rs28363170) further at individual allele and individual genotype level with df = 1 and df > 1 rather than a limited binary or grouping approach, based on which association of 3′UTR VNTR (rs28363170) was observed as described above. The contradicting results generated from different studies in different populations may be because of differences in the study design, variants studied, analysis methods, sample size, differential allele frequencies, ethnic differences, and population-specific epigenetic interactions with other genes. Further, gene polymorphisms involved in the vulnerability or association with heroin dependence may be population-specific, as demonstrated by Bhaskar et al. (2012).

The participants in the present study are from Manipur of North East India only and are traced to East Asian origin (Saraswathy et al. 2009). In this study, utmost care has been taken in the selection of the subjects. They belonged to an ethnically homogeneous and small population, which is also different from the rest of the Indian population. Controls with appropriate size from a similar background were included for proper representation. Analysis based on population stratification was not performed due to the very low frequency of ethnically different groups and overall small sample size. Population admixtures and gene flow in many generations may play a role in the actual inference of the study as well.

The present study reports for the first time the association of 11R allele and 10R/11R genotype of 3′UTR VNTR (rs28363170) with protection against heroin dependence. Future studies are suggested to look into 11R allele and 10R/11R genotype of 3′UTR VNTR towards association with heroin dependence. The study has also highlighted a differential distribution of the alleles of intron 8 VNTR (rs3836790) in the population, indicating ethnicity-oriented changes in allele frequency. The 7R and 6R alleles of this marker were found to be more common in the population, and lower repeat alleles were undetectable. The commonly reported alleles of the marker are 5R and 6R alleles. Similar allelic prevalence was reported by Kim et al. (2017) in the Korean population and by Zhou et al. (2014) in the African American population, where 6R of intron 8 VNTR was the minor allele as against 5R allele (Guindalini et al. 2006). In the case of intron 8 VNTR (rs3836790), alleles such as 6R, 7R, 8R, 9R, 11R, 12R, 13R, and 14R were detected in the study in heroin-dependent cases; though, most of the alleles showed very low frequencies. However, 11R, 13R, and 14R alleles were absent in the controls in the same study group. 13R (7R/13R genotype) and 14R (7R/14R genotype) alleles of intron 8 VNTR (rs3836790) were also detected in two heroin-dependent cases for the first time. To date, studies have reported the occurrence of only up to 12R alleles for intron 8 VNTR (Gadow et al. 2014).

VEP and GAVIN online prediction tools scored the impact of all the ten markers as modifiers. It may be noted that the mere absence of evidence of impact or difficulty in predictions for a variant cannot rule its involvement in a particular phenotype or regulation out. The 3′UTR VNTR and intron 8 VNTR have also been scored as modifiers, whereas they have been suggested to regulate or modulate DAT expression (Pinsonneault et al. 2011). DAT-interacting proteins provided by STRING like DRD2, SNCA, CAMK2A, and CAMK2B have reports of association with substance dependence (Easton et al. 2014; Janeczek et al. 2014; Lehrmann et al. 2006; Spronk et al. 2016). DAT may also act in combination with these proteins to provide an overall effect on the phenotype of heroin dependence.

Some of the limitations of the present study include relatively smaller sample size and less number of variants screened, which could be extended with higher sample size and more variants. Further, DAT expression levels in brain were not included, and some of the possible determinants of heroin dependence such as years of drug use, age at onset, number of times drug abstained, etc. used in the study were solely based on individual responses to questionnaire.

Such association studies help in the identification of genetic risk factors to a particular disease. The identified genetic risk factors may not necessarily affect expression of the genes. However, the risk factors may vary from population to population due to population-specific variations. Studies of such gene polymorphisms give better understanding of the phenotypic effects that they may induce. One of the extensively studied polymorphisms is the SNP (118A>G) in the OPRM1 (mu opioid receptor) gene whose product is involved in reward, addictive behaviour, and pain regulation. The SNP (118A>G) changes a single amino acid, Asn 40 to Asp in the receptor protein (Beyer et al. 2004), leading to reduced protein translation and availability of the receptor likely stimulating opioid addiction (Ahmed et al. 2018). Therefore, gene polymorphisms may also affect the functions of genes, which in turn may affect the pathophysiology of drug dependence. Further, the extension of such heroin dependence studies towards a metabonomic and clinical perspective (Xie et al. 2015) is highly encouraging. In the present study, individuals carrying the G allele, GG genotype at rs246997, have been identified as having a higher risk for susceptibility to drug dependence. Therefore, the study suggests prioritizing individuals with G allele, GG genotype at rs246997, during prevention, intervention, and clinical treatment of heroin dependence. Further, the 11R allele at 3′UTR VNTR (rs28363170) has been identified as a protective allele for heroin dependency. This result suggests that this marker could be used for stratification of patients with lower risk. The findings of the present study, in concurrence with other similar studies (Dang et al. 2014; Levran et al. 2015), provide strategies for the development of novel prevention, patient stratification, targeted intervention, and personalized treatment for heroin dependence. However, the strategies stemming from such association studies have an inherent problem of the inability for generalized usage across all human populations, owing to the complex nature of the disease such as ethnic susceptibility differences, individual differences, epistatic interactions, and involvement of multiple genes and other environmental conditions. Thus, as with any other complex disease, there is a constant need for understanding all the possible factors and mechanisms underlying heroin dependency.

Conclusion

The present study provides evidence that the DAT1 gene is a susceptibility gene for heroin dependence in a subpopulation from India, where such phenotype is highly prevalent. The presence of ‘G’ allele at rs246997 in an individual is a risk factor that increases the probability of getting heroin-dependent. Furthermore, the presence of 11R at 3′UTR VNTR (rs28363170) confers protection from heroin dependency. Identification of a novel marker and low-frequency allele of a commonly studied polymorphism as effectors for heroin dependence opens up further avenues for research to understand the possible mechanisms through which it modifies the phenotype to reinforce its association with heroin dependence. Although the variants identified in this study are suggested to produce low relative risk towards vulnerability to heroin dependency, it may uncover novel mechanisms of addictive behaviour.