Introduction

The human organic cation transporters, also designated as hOCT1, hOCT2, and hOCT3, mediate electrogenic transport of small organic cations with different molecular structures, independent of sodium gradient. These organic cation substrates include clinically important therapeutics (e.g., metformin, procainamide, and cimetidine), endogenous compounds (e.g., dopamine and norepinephrine), as well as toxic substances [e.g., tetraethylammonium bromide (TEA)] [1]. The genes encoding the three organic cation transporter isoforms (hOCT1, hOCT 2, and hOCT 3) are clustered together on the long arm of chromosome 6. Population genetics analyses identified numerous single-nucleotide polymorphisms (SNPs) in the three genes [2]. These polymorphisms may be the reason for inter-individual variability in pharmacokinetic disposition, efficacy, and toxicity of therapeutic drugs. Recent studies suggest that inter-patient variability in response to metformin therapy, for intense, could be related to some of the genetic polymorphisms in the OCT genes and/or the MATE genes [3].

Human organic cation transporter 1 (SLC22A1), expressed predominantly in the liver, plays a critical role in cellular uptake and elimination of numerous cationic substrates, including therapeutically important compounds [4, 5]. Numerous functionally relevant SNPs have been reported within SLC22A1. Kerb et al. [6] have reported that 25 SNPs have been identified within 57 Caucasian individuals in human SLC22A1. Furthermore, three non-synonymous SNPs (R61C, C88R, and G401S) showed reduced transport activities. A total of 15 protein-altering variants from SLC22A1 from diverse ethnic backgrounds were reported by Shu et al. [7]. Further findings indicated that four SNPs (R61C, G220V, G401S, and G465R) showed reduced transport function, whereas S14F exhibited increased transport function [7]. Two non-synonymous SNPs found in Korean and Japanese populations (P283L and P341L) showed reduced transport activity [8, 9]. Furthermore, it has been established that polymorphic variations exist within the SLC22A superfamily from different ethnic groups.

Genomic diversity within sub-Saharan Africa, and for that matter the entire African continent, is relatively under-studied, despite being home to significant portion of human genomic diversity [10]. There is thus much to be learnt from characterizing human genomic variation in this part of Africa, especially with regards to health applications [10]. South Africa in particular contains a wealth of different population groups. It is home to the indigenous Khoisan, Xhosa, Zulu, Venda, and Sotho Pedi groups, the Afrikaners and the Cape Coloured, the latter being a uniquely admixed population of immigrant Europeans, Asians and the indigenous populations [10]. Admixed groups, such as Latinos, African Americans, or Cape Coloureds from South Africa, share varying proportions of different ancestral populations and their genetic complexity can potentially complicate biomedical research studies [11]. Their mixed ancestry, however, can provide the intrinsic variability needed to untangle complex gene–environment interactions, which may help to explain the population differences in the epidemiology of complex diseases [11].

In this report, we examine the genetic diversity of the SLC22A1 gene and discuss its possible pharmacogenetic implications within the Cape Admixed population of South Africa. The genotypic and allelic frequencies of 20 SNPs in the OCT1 gene was determined from 100 healthy individuals in the Cape Admixed population, and the data was compared to other population data available in literature and relevant databases. The haplotype structure defined by the 20 SNPs in the SLC22A1 gene was also inferred for this population.

Materials and methods

Subjects

Samples were obtained from the participants with informed consent. This study was approved by the Senate Research Ethics Committee of the University of the Western Cape, South Africa. Biological samples were collected in the form of buccal swabs from 100 unrelated healthy volunteers from the Cape Admixed population. This population is known in South Africa as the “Cape Coloured” population [10]. The term Cape Coloured is used to define a uniquely admixed population which was established with contributions from Asian, European and indigenous populations [11, 12]. This population is known to have the highest rate of admixture in the world and comprises approximately 8.9 % (~4 million) of the South African population [12]. Ethnicity of volunteers was determined by self-report as obtained from their completed anonymous questionnaires. To confirm the volunteers’ mixed ancestry, the questionnaires included a short genealogy of two generations for maternal and paternal family members, as well as place of birth, home language and religious affiliation.

SNP selection

A total of 20 OCT1 gene SNPs (19 nonsynonomous and 1 intronic) were selected for this study. SNPs were selected from the literature and the Ensembl database (http://www.ensebl.org) [13]. Variants A306T; A413V; M420V, C436F; I421F: V501E, V519F and I542V were included in this study based on predicted effect on function, using the sorting intolerant from tolerant (SIFT) program [14]. To our knowledge no population data exist in the public domain for these variants.

Primer design

Multiplex PCR primers, listed in Table 1, were designed to have an annealing temperature between 55 and 60 °C using Primer3 software [15]. To test for possible non-specific amplification, primers were aligned with the NCBI sequence databases using Basic Local Alignment Search Tool (www.ncbi.nlm.nih.gov/blast/blast-cgi). Two SNaPshot® Multiplex systems were designed, successfully optimized and used for genotyping. The single base extension primer sets for multiplex 1 and 2 are listed in Tables 2 and 3.

Table 1 OCT 1 Multiplex PCR Primers
Table 2 OCT1 multiplex 1 single base extension primers
Table 3 OCT1 multiplex 2 single base extension primers

Multiplex PCR amplification

All of the OCT1 exons and the portion of intron 9 spanning rs622342 were simultaneously amplified using the primers listed in Table 1. Isolated DNA was diluted to a final concentration of 20 ng µl−1 to be used for multiplex PCR amplification. PCR amplifications were performed using the Quiagen Multiplex kit in a final volume of 10 µl according to manufacturer’s instructions. Amplification was performed on a GeneAmp 2700 (Life Technologies) under cycling conditions: 95 °C for 15 min, 94 °C for 30 s, 58 °C for 60 s, 72 °C for 90 s, for a total of 39 cycles, followed by 72 °C for 10 min and a final hold at 4 °C. PCR products were confirmed by agarose gel electrophoresis. Amplified targets were enzymatically purified using SAP (Amersham) and Exo I (USB, Cleveland, OH, USA) according to the manufacturer’s instructions.

Multiplex mini-sequencing reaction

Multiplex mini-sequencing was performed using the SNaPshot® kit (Life Technologies), according to manufacturer’s instructions as well as in-house design and optimization. Cycling conditions were as follows: 30 cycles of 96 °C for 10 s, 50 °C for 5 s and 60 °C for 30 s, followed by a 4 °C hold. Post-extension treatment was performed by adding 1 U of FastAP to the 10 µl reaction volume and incubation at 37 °C for 30 min followed by 15 min at 80 °C to deactivate the enzyme.

Electrophoresis of the minisequencing products

The purified minisequencing products (1 µl) were mixed with 8.7 µl of HiDi™ formamide and 0.3 µl of GeneScan-120 Liz size standard (Life Technologies) and denatured at 95 °C for 5 min. The fluorescently labelled fragments were separated on 36 cm-long capillaries in POP4 polymer on an ABI Prism 3500 Genetic Analyzer (Life Technologies). Data analyses were performed using GeneMapper® IDX Software Version 1.2. Typical electropherograms for multiplexes I and II are shown in Fig. 1.

Fig. 1
figure 1

Typical electropherograms for multiplex I (a) and II (b). Green peaks indicate the A nucleotide, blue peaks indicate the G nucleotide, black peaks indicate a C nucleotide and red peaks indicate the T nucleotide. (Color figure online)

Statistical analysis

Genotype and allele frequencies as well as the deviation from the Hardy–Weinberg equilibrium were calculated using GenAlEx 6.4.1. [16]. The SHEsis analysis platform was used to infer the haplotypes and determine their frequencies [17, 18]. Statistical significance was defined as having a p value greater than 0.05 (p > 0.05).

Results

The genotype and allele frequencies of the 20 investigated SNPs of the OCT1 gene within a group of 100 Cape Admixed subjects are summarized in Table 4. The allelic frequency of each SNP was in HWE (p > 0.05). Twelve out of the twenty investigated nonsynonomous SNPs were monomorphic in the Cape Admixed population (Table 4). Heterozygosity was only observed for the nonsynonomous SNP variants S14F (rs34447885), P341L (rs2282143), S189L (rs34104736), G220V (rs36103319), M440I (rs35956182), G465R (rs34059508) and V519F (rs78899680). The S14F variant genotype frequencies for homozygote wild-type (CC), heterozygote (CT) and homozygote (TT) were 98, 2 and 0.0 %, respectively. The MAF observed for S14F was 1 %. The P341L variant genotype frequencies, on the other hand, for homozygote wild-type (CC), heterozygote (CT) and homozygote (TT) were 96, 1 and 0.0 %, respectively. The S189L and G220V variants displayed the same genotype frequencies for their respective genotypes; 98, 2 and 0.0 %, respectively. Additionally, the M440I and G465R variants displayed the same genotype frequencies for homozygote wild-type (GG), heterozygote (GA) and homozygote (AA); 99, 1 and 0.0 %, respectively for each. Whereas the genotype frequencies for variant V519F were 97 % [homozygote wild-type (GG)] and 3 % [heterozygote (GT)].

Table 4 Allele and genotype frequencies of 20 SNPs of the OCT1 gene (SLC22A1) in 100 healthy individuals from the Cape Admixed population

The MAF of a selected number of the investigated OCT1 gene SNPs in different ethnic groups are summarized in Table 5. Variants R61C (rs12208357), C88R (rs55918055), P283L (rs4646277) and G401S (rs34130495) were not observed in the Cape Admixed nor the Native American/Hispanic Admixed populations. The S14F variant was observed in the Cape Admixed and other Sub-Saharan populations, but not in all other populations. The S189L and G220V variants were only observed in the Cape Admixed population at a frequency of 1 %.

Table 5 Comparison of MAF of OCT1 gene (SLC22A1) SNPs of the Cape Admixed population to other ethnic groups

The inferred haplotypes for investigated population are listed in Table 6. The most frequently observed haplotypes were CCTCGGCGCGCTAGAGCTGA (80 %), CCTCGGCGCGCTAGCGCTGA (9.9 %) and CCTCGGCGCGCGAGCGCTGA (3.5 %).

Table 6 Haplotype structure defined by 20 SNPs in the SLC22A1 gene in the Cape Admixed population

Discussion

Single nucleotide polymorphisms in OCT1 gene have been recognized as a possible mechanism explaining inter-individual variation in drug response [19]. In this study, we determined the genotypic and allelic frequencies of 20 SNPs in the OCT1 gene from 100 healthy individuals in the Cape Admixed population, and the data was compared to other population data available in literature and relevant databases.

In a study conducted by Leabman et al. [19] numerous polymorphisms within the SLC22A1 gene were identified, and have constituted the focus of many subsequent studies. Furthermore, it was determined that these variant might predict functional changes of the transporter protein [19]. Several SNPs were found to be protein-altering variants. This became important as these inherited differences may contribute to the inter-individual variability in drug response [19, 20]. Fifteen of these protein-altering SNPs were also characterized and evaluated with respect to MPP+ uptake [20]. It was determined that the variant S14F increased function of the transporter protein, whereas variants S189L, M420V and M440I were shown to exhibit the same functionality as the reference transporter. Furthermore, R61C and P341L were identified as reduced function variants, whereas variants G220V, G401S and G465R exhibited a loss of function compared to the reference sequence [20]. In subsequent studies, by Shu et al. [21] and Tzvetkov et al. [2], evaluating the effect OCT1 variants have on metformin uptake, it was shown that the R61C, G401S, M420V and G465R variants all exhibited a reduction in transporter function. In a prior study by Kerb et al. [6], the uptake of 0.1 [3H]1-methyl-4-phenylpyridinium ([3H]MPP) by R61C, C88R and G401S were reduced to 30, 1.4 and 0.9 % compared to wild-type, respectively. The transport of 1 µM [3H] serotonin by C88R and G401S, on the other hand, was reduced to only 13 and 12 % of wild-type respectively [6]. From the data obtained, Kerb et al. [6] suggested that mutants R61C, C88R and G401S could affect the disposition of OCT1 substrates in terms of affinity and selectivity. This has led them to conclude that the duration and intensity of effects of drugs and neurotransmitters may be altered for these substrates [6].

In the present study, 19 protein-altering SNPs were evaluated within the Cape Admixed population. The intronic SNP (rs622342) was also included in the study because it was previously suggested to be associated with the blood glucose-lowering effect of metformin [22].

Variant S14F was observed in the Cape Admixed, Luhya and Yoruba populations at a MAF of 1, 2.5 and 1.5 % respectively. This variant was not observed in any of the other populations included in the analysis (Table 5). Therefore, it may be suggested that OCT1 substrate drugs might have different response profiles in the Cape Admixed and African populations compared to Caucasian and Asian populations.

The G220V variant was observed in the Cape Admixed population with MAF of 1 %, whereas, the other reduced-function variants G401S and G465R were not observed in the population. This may suggest an increased therapeutic response to metformin for our population [21]. The G401S and G465R variants were only observed in the Caucasian population at a MAF of 2.3 and 3.4 % respectively. These variants were not observed in the African and Asian populations (Table 5).

The MAF of P341L observed in the Cape Admixed population (0.5 %) was lower than that of other sub-Saharan African, Asian and Caucasian populations (Table 5). Functional in vitro transport assays have shown that the P341L variant results in a decrease rate of MPP+ transport, and has no effect on the transport of the anti-diabetic drug metformin [21, 23].

The observed MAF at loci S189L, M440I and M420V was 1, 0.5 and 4 % respectively. However, S189L was only observed in the Cape Admixed population whereas M440I was observed in the Cape Admixed and Native American/Hispanic Admixed populations at a similar frequency (Table 5). Although at low frequencies, these variants are suggested to have a reduced transporter function with respect to metformin [21].

The observed MAF for variant V519P in our population was 1.5 %. This value was lower than that of the Luhya and Yoruba, with respective values of 2 and 6 % (Table 5). However, this variant was not observed in any of the other populations, indicating that it might be specific to African populations. The impact of this variant on transport function or drug efficacy has not yet been determined and requires further investigation.

The intronic SNP (rs622342) was observed in the Cape Admixed population at lower MAF (18 %) compared to previously reported frequencies for the European population (Table 5). This includes the Dutch diabetic patients cohort where the variant was initially reported [22]. In this population-based cohort study, Becker et al. [22] concluded that an association existed between genetic variation in the gene encoding for the OCT1 transporter protein and the glucose lowering effect of metformin in diabetes mellitus patients, and that metformin therapy was less effective in patients carrying the minor C allele.

Twelve out of the nineteen investigated nonsynonomous SNPs were monomorphic in the study population. No polymorphisms were observed for these SNPs and they were all homozygous for the ancestral allele. This includes variants R61C, C88S, P283L, R287G, A306T, G401S, A413V, M420V, I421F, C436F, V501E and I542V. The population data available in the databases and the literature for variants R61C, C88S, P283L and G401S is limited, and low MAFs are usually reported for these SNPs. To our knowledge, this study is the first to report population data for A306T; A413V; M420V, C436F; I421F: V501E, V519F and I542V. Further investigations are needed to explore the genetic variations among these SNPs in world populations.

The haplotype structure defined by 20 SNPs in the SLC22A1 gene was also inferred for the Cape Admixed population. A total of 12 haplotypes were observed in the study population (Table 6). The most frequently observed haplotypes were CCTCGGCGCGCTAGAGCTGA (80 %), CCTCGGCGCGCTAGCGCTGA (9.9 %) and CCTCGGCGCGCGAGCGCTGA (3.5 %). Haplotype structure could provide a more complete picture of genetic variations at multiple loci important in drug response at the individual and population level [24]. In our study population, it could be interesting to investigate the overall effect of the observed rare haplotype on the transport efficiency of the OCT1 protein, as well as possible clinical implications related to therapeutic drugs including metformin. This rare haplotype includes variants S14F, S189L, G220V and the intronic SNP rs622342, where further investigations are required to determine the cumulative effect of these variants on substrate transport efficiency and glucose-lowering effect of metformin.

Conclusion

To our knowledge, this is the first study that investigated the allele and genotype frequency distributions of SNPs in the OCT1 gene of the Cape Admixed population from South Africa. This uniquely admixed population is known to have the highest rate of admixture in the world. This study reports also the observed haplotypes in the investigated population. This is the first world population data report for variants A306T, A413V, M420V, C436F, I421F, V501E, V519F and I542V. The reduced-function variants G401S and G465R were not observed in the population, nor have been previously reported in other African populations. This may suggest an increased therapeutic response to metformin for these populations. However, it should be noted that this was only a descriptive study and that no associations are made between any diseases or treatment outcomes. It could be interesting to investigate the overall effect of the rare haplotypes observed in this admixed population on the transport efficiency of the OCT1 protein, as well as possible clinical implications related to therapeutic drugs.