Introduction

The Family Sciaenidae in the Order Perciformes is widely distributed throughout the world with approximately 70 genera and 300 species including about 30 species from Indian waters [1, 2]. The Sciaenidae is a species rich family of primarily marine fishes with particularly high diversity in estuaries of the Atlantic, Pacific and Indian Oceans [3]. The sciaenids, popularly known as croakers or drums, contribute significantly to the world marine fisheries particularly of the warm shallow seas and estuaries of North and South America, West Africa, South and South-East Asia [4]. The Indian sciaenids, including the present commercially important species (Otolithes cuvieri, Otolithes ruber, Johnius dussumieri, Johnius elongatus, Johnieops vogleri, Otolithoides biauritus and Protonibea diacanthus), contribute approximately 4.6% to the total Indian marine fish production.

Conventionally, sciaenid fishes are identified using morphological, meristic and anatomical characters [5, 6]. However, considerable ambiguity exists due to morphological similarity and overlapping meristic counts [6, 7]. Synonym citations in FishBase indicate the possibility of ambiguous identification with respect to some sciaenid species of genus Johnieops and Johnius [8]. Accurate identification of morphologically similar species is essential for population dynamic assessment and fisheries management. Application of molecular tools can provide valuable information for species identification to complement the taxonomic data and validation of systematic positions, phylogeny and other applications like fish product identification in trade monitoring [9]. Among the earlier studies on Indian sciaenids, allozyme markers have been used to document genetic variation in Johnieops dussumieri, Kathala axillaries, Pennahia macropthalmus and Otoloithes ruber [7]. Recently, RAPD markers have been used for molecular identification of five Indian sciaenids [10].

Mitochondrial DNA has been extensively studied in fish phylogenetics since mitochondrial 16S rRNA gene and the protein coding cytochrome c oxidase subunit I (COI) gene are highly conserved. These mitochondrial genes have been sequenced in various invertebrate and vertebrate taxa [1117]. The well characterized COI gene has proved to be a robust evolutionary marker for the analysis of intraspecific and interspecific relationships in many marine fish and shellfish [1619]. The 16S rRNA mitochondrial gene has also been shown to be a good marker to differentiate fish species and has been used in comparative intergeneric and interspecific studies in several families of Perciformes [2024].

The success of conservation programs and effective management policies depend on the levels of genetic divergence within and between species and developing strategies to maintain the natural genetic diversity. In the present study, 16S rRNA and COI genes were selected to study phylogenetic relationship and genetic relatedness among seven commercially important Indian sciaenids.

Materials and methods

Sample collections

Seven sciaenids (O. cuvieri, O. ruber, J. dussumieri, J. elongatus, J. vogleri, O. biauritus and P. diacanthus) were collected from Versova (Latitude 19 °08′ N and Longitude 72 °50′ E) and New Ferry Wharf, Mumbai (Latitude 18 °96′ N and Longitude 72 °85′ E) on the North Western coast of India. Species identification and nomenclature followed the synopsis of Indian sciaenid fishes [6]. Samples were collected during July 2006 and approximately 100 mg of white muscle tissue from five individuals of each species was preserved in 95% ethanol until used.

DNA isolation

The DNA was isolated following [25] with minor modifications. The concentration of isolated DNA was estimated using a UV spectrophotometer. The DNA was diluted to get a final concentration of 100 ng/μl.

Amplification and sequencing

The mitochondrial 16S rRNA gene was amplified in a 50 μl reaction volume with 5 μl of 10× Taq polymerase buffer, 0.2 mM of each dNTP, 0.4 μM of each primer, 2.5 U of Taq polymerase and 5 μl genomic DNA using the thermal cycler PTC 200 (MJ Research). The primers used for the amplification of the partial 16S rRNA gene were 16SAR (5′-CGCCTGTTTATCAAAAACAT-3′) and 16SBR (5′-CCGGTCTGAACTCAGATCACGT-3′) [26]. The thermal profile used was 36 repetitions of a three step cycle consisting of denaturation at 94°C for 1 min, annealing at 55°C for 1 min and extension at 72°C for 1.5 min including 4 min for initial denaturation at 94°C and 7 min for the final extension at 72°C.

COI gene was also amplified in a final concentration of 50 μl volume with a final concentration of 5 μl of 10× Taq polymerase buffer, 2 μl of MgCl2 (50 mM), 0.25 μl of each dNTP (0.05 mM), 0.5 μl of each primer (0.01 mM), 0.6 U of Taq polymerase and 5 μl of genomic DNA. The primers used for the amplification of the COI gene were FishF1–5′-TCAACCAACCACAAAGACATTGGCAC-3′ and FishR1-5′-TAGACTTCTGGGTGGCCAAAGAATCA-3′ [16]. The thermal regime consisted of an initial step of 2 min at 95°C followed by 35 cycles of 40 s at 94°C, 40 s at 54°C and 1 min 10 s at 72°C followed in turn by final extension of 10 min at 72°C.

PCR products were visualized on 1.2% agarose gels and the most intense products were selected for sequencing. Products were labeled using the BigDye Terminator V.3.1 Cycle sequencing Kit (Applied Biosystems, Inc) and sequenced bidirectionally using an ABI 3730 capillary sequencer following manufacturer’s instructions.

Sequence analysis

Sequences were aligned using ClustalW [27] and submitted to GenBank under the accession numbers (EF051049, EF028701, EF528197–EF528233, EF534109-EF534127, EF536890-EF536896). The extent of sequence differences between species was calculated by averaging pair-wise comparisons of sequence difference across all individuals. The 16S rRNA sequences of the five individuals of each species were aligned to yield a final alignment varying from 577 bp (P. diacanthus) to 590 bp (O. biauritus). The COI sequences of the five individuals of each species were aligned to yield a final alignment of 655 bp. Pair-wise evolutionary distance among haplotypes was determined by the Kimura 2-Parameter method [28] using the software program MEGA 3.1 (Molecular Evolutionary Genetics Analysis) [29]. The number of polymorphic sites and nucleotide diversity (Pi), nucleotide composition and number of transition and transversion between species were determined by DnaSp ver 3 [30]. Gaps were considered as missing data on the phylogenetic reconstructions. Neighbor Joining (NJ) and Maximum Parsimony (MP) trees were constructed using MEGA 3 using Thunnus orientalis from NCBI (GenBank Accession No. AB185022) as an out-group. To verify the robustness of the internal nodes of NJ and MP trees, bootstrap analysis was carried out using 1000 pseudoreplications [31].

Results

A total of 35 individuals from 5 genera (Otolithes, Johnius, Johnieops, Protonibea and Otolithoides) were used for partial sequence analysis of 16S rRNA and COI genes, which yielded 70 sequences. Simplicity and un-ambiguity were observed among the sequences of the both mitochondrial regions.

Sequencing of the 16SrRNA gene produced an average of 583 (range 577–590) nucleotide base pairs per taxon. Multiple alignments resulted in a consensus length of 597 sites including base pairs and gaps. Two haplotypes were observed per species. Of the 597 sites, 427, 170, 165 and 5 were conserved, variable, parsimony informative and singleton respectively. The polymorphic sites are given in Fig. 1. The analysis revealed nucleotide frequencies as A = 28.10%, T = 25.50%, G = 22.50% and C = 24.00%. As expected, average transitional pairs (si = 48) were more frequent than transversional pairs (sv = 23) with an average ratio of 2.10.

Fig. 1
figure 1

Alignment of partial DNA sequences of the mitochondrial gene, 16S rRNA of seven Indian sciaenids (only variable sites are reported) (H1: Haplotype 1, H2: Haplotype 2, Oc: Otolithes cuvieri, Or: Otolithes ruber, Je: Johnius elongatus, Jd: Johnius dussumieri, Ob: Otolithoides biauritus, Pd: Protonibea diacanthus, Jv: Johnieops vogleri)

Intergeneric and interspecies sequence divergences are given in Table 1. Average sequence diversity among the sciaenids was 0.139 whereas average interspecies sequence diversity was 0.137. The highest intergeneric sequence diversity (0.195) was between P. diacanthus and J. dussumieri where as the lowest value (0.0710) was between species J. vogleri and J. elongatus. The highest interspecies sequence diversity with in genera (0.0873) was between O. ruber and O. cuvieri and the lowest value (0.0588) was between J. dussumieri and J. elongatus. Interspecies nucleotide differences ranged from 34 to 51 and intergeneric nucleotide difference ranged from 41 to 111 (Table 1).

Table 1 Pair-wise nucleotide differences (below diagonal) and sequence divergence (above diagonal) in 16S rRNA sequences from seven Indian sciaenids

Pair-wise genetic distance values (Kimura 2 parameter) based on 16S rRNA using MEGA 3.1 are given in Table 2. The average genetic distance of individuals among sciaenid species was estimated as 0.139 and within species as 0.002. Congeneric interspecies distance ranged from 0.057 to 0.087 and the intergeneric distance ranged from 0.089 to 0.230. The highest congeneric interspecies genetic distance (0.087) was between O. cuvieri and O. ruber and the highest inter generic distance (0.230) was between P. diacanthus and J. dussumieri.

Table 2 Pair-wise genetic distances (Kimura 2-parameter) of seven Indian sciaenids based on 16S rRNA sequences from seven Indian sciaenids

All five sequences for each species were included in the phylogenetic analysis. The NJ (Fig. 2) and MP trees revealed identical phylogenetic relationship among the species. Three major clusters were obtained with the first cluster formed by the congeneric species under the genus Johnius (J. dussumieri and J. elongatus) with Johnieops vogleri a subcluster. The second cluster was formed by two congeneric species under the genus Otolithes (O. cuvieri and O. ruber) and the third cluster formed by P. diacanthus and O. biauritus. In both the trees, these clusters were supported by high bootstrap values (NJ 88–100% and MP 84–99%).

Fig. 2
figure 2

Neighbour Joining (NJ) phylogenetic tree of Indian sciaenids inferred from DNA sequences of mitochondrial gene 16S rRNA

Sequencing the COI gene produced an average of 655 nucleotide base pairs per taxon. No insertions, deletions or stop codons were observed in any sequence and multiple alignments resulted in a consensus length of 655 sites including base pairs and gaps. One to two haplotypes were observed in all the seven species. Of the 655 sites, 436, 219, 217 and 2 were conserved, variable, parsimony informative and singleton respectively. The polymorphic sites are given in Fig. 3. As expected, all variable changes within species were third codon position transitional substitutions. The analysis revealed nucleotide frequencies as A = 23.6%, T = 28.9%, G = 18.8% and C = 28.7%. As expected, average transitional pairs (si = 61) were more frequent than transversional pairs (sv = 30) with an average ratio of 2.00.

Fig. 3
figure 3

Alignment of partial DNA sequences of the mitochondrial gene, COI of seven Indian sciaenids (only variable sites are reported) (H1: Haplotype 1, H2: Haplotype 2, Oc: Otolithes cuvieri, Or: Otolithes ruber, Je: Johnius elongatus, Jd: Johnius dussumieri Ob: Otolithoides biauritus, Pd: Protonibea diacanthus, Jv: Johnieops vogleri)

Average intergeneric and interspecies sequence divergence values are given in Table 3. Average sequence diversity among the sciaenids was 0.163 whereas average congeneric interspecies sequence diversity was 0.147. The highest intergeneric sequence diversity (0.2137) was observed between O. biauritus and J. vogleri and the lowest value (0.1023) was between J. vogleri and J. elongatus. The highest interspecies sequence diversity (0.0580) was between O. ruber and O. cuvieri and the lowest value (0.0259) was between J. dussumieri and J. elongatus. The average inter-species nucleotide differences ranged from 16 to 36 and intergeneric nucleotide difference ranged from 67 to 140 (Table 3).

Table 3 Pair-wise nucleotide differences (below diagonal) and sequence divergence (above diagonal) in COI sequences from seven Indian Sciaenids

Pair-wise genetic distance values (K2P) based on COI sequences using MEGA 3.1 were given in Table 4. The average genetic distance between species was estimated as 0.160. The average distance within species was 0.002. The interspecies distance with COI ranged from 0.025 to 0.059 and the intergeneric distance ranged from 0.111 to 0.258. The highest intergeneric distance (0.258) was between Otolithoides biauritus and J. vogleri whereas the highest interspecies distance (0.059) was found between O. cuvieri and O. ruber. All five COI sequences for each species were included in the phylogenetic analysis. The NJ (Fig. 4) and MP tree revealed identical phylogenetic relationship among the species. As with 16S rRNA, three major clusters were obtained with the first cluster formed by the congeneric species under the genus Johnius (J. dussumieri and J. elongatus) with Johnieops vogleri as asubcluster of this first clade. The second clade was formed by two congeneric Otolithes species (O. cuvieri and O. ruber). The third cluster was formed by P. diacanthus and O. biauritus. As with 16S rRNA, the clusters were supported by high bootstrap values (NJ 98–100% and MP 95–99%) except at the node formed by P. diacanthus and other species in the MP tree.

Table 4 Pair-wise genetic distances (Kimura 2-parameter) of seven Indian sciaenids based on COI sequences from seven Indian sciaenids
Fig. 4
figure 4

Neighbour Joining (NJ) phylogenetic tree of Indian sciaenids inferred from DNA sequences of mitochondrial gene COI

Discussion

Phylogenetic relationships based on morphological characters and molecules are mostly concordant [16, 22]. The seven species of sciaenids from the Indian coast were found genetically distinct from each other and partitioned into three groups without any haplotypes sharing or overlapping, based on the partial sequence information of both 16S rRNA and COI genes. The sequences of both regions of mtDNA demonstrated simplicity and unambiguity. Stop codons were absent from all amplified sequences of COI and all the amplified sequences were 655 bp in length. This suggests that NUMTs (nuclear DNA sequences originating from mtDNA sequences) were not sequenced (vertebrate NUMTs are typically smaller than 600 bp; [16]). NUMTs in Actinopterygii has not been reported except in the puffer fish, Fugu rubripes [32].

In the present study, the sciaenid species that are morphologically and meristically almost identical,–J. dussumieri and J. elongatus on one hand and O. cuvieri and O. ruber on the other are sister clades in the NJ and MP trees with both 16 Sr RNA and COI gene sequences. Similarly, the large-bodied sciaenids, O. biauritus and P. diacanthus, formed sister clades. Another species J. vogleri was separated into a sister lineage of Johnius with a high bootstrap value. The high genetic divergence values between Johnius and Johnieops (with 16S rRNA, average 7.1% between J. elongatus and J. vogleri and 8.9% between J. dussumieri and J. vogleri; with COI, average 11.1% between J. elongatus and J. vogleri and 10.3% between J. dussumieri and J. vogleri) suggest J. vogleri as a distinct species. By comparing the allozyme and RAPD profiles of nine sciaenids belonging to 6 genera (Johnieops, Johnius, Otolithes, Protonibea, Kathala and Perrahia), Menezes et al. and Lakra et al. have also supported the distinct generic status of Johnieops [7, 10].

In general, the genetic divisions of all the sciaenid species, identified by our data correspond with the taxonomic sub-divisions previously proposed by Mohan [6] and FAO species Identification Sheets [33]. Estimates of genetic divergence with both 16S rRNA and COI genes were sufficient enough to discriminate individuals of different sciaenid species. These values correspond well with the reports in other sciaenids for 16S rRNA (4.8% to 15.3%; [15]) and in other teleosts for COI (1.04 to 20.63%; [16]) partial sequences. In all seven species, levels of intra-specific variation were low (average 0.2% for 16 SrRNA and 0.2% for COI) and this was reflected in the low number of haplotypes (maximum two). This may be due to the high proportion of the identified haplotype in the samples or the limited number of individuals (5 each) collected for the present study. Ward et al. also reported very low within-species genetic diversity in several marine teleosts [16].

The observed transition versus transversion ratios in sciaenids are also comparable to those of many teleosts [16, 34]. Transitions outnumbered transversions in the present study in accordance with the previous reports on mtDNA in fish [13, 15]. Generally for teleost mtDNA, a much larger excess of transitions related to transversion is typically observed [16]. The GC content of the 583 bp 16Sr RNA (49.5%) and 655 bp COI (46.7%) region was on relatively high in all the seven sciaenid species. Ward et al. reported an overall higher GC content in fishes based on complete mtDNA genome ranging from 38.4–43.2% and with COI alone, 42.2–47.1%, which was mostly attributable to 3rd base variation [16]. In our study also, the sciaenids exhibited more nucleotide changes at 3rd position, consistent with most mutations being synonymous.

An interesting observation was the high degree of K2P nucleotide divergence with 16S rRNA gene (inter species 5.9–8.7%; intergeneric 8.9 to 23.0%), indicating its ability to adequately describe interrelationships of sciaenid species. Generally, 16S rRNA is considered more conservative than COI and is used to investigate the relationship among different species and genera. However, sequence analyses of the 16S rRNA gene lacked the ability to resolve relationships in some marine taxa such as sparids and percoid [35]. However, Vinson et al. reported high nucleotide divergence (8.3%) among the sciaenid species in Northern Brazil using 16S r RNA gene sequences and Chakraborty et al. also showed similar results in ribbon fishes and silver biddies, indicating the usefulness of this gene sequence for accurate identification of species [15, 34, 36]. DNA barcoding based on partial sequence information of COI gene has been widely used for species identification of fishes [16, 19]. It provides a framework for the development of simple, low expense, PCR-based assays that can clearly differentiate virtually all fish species. Our results revealed that COI barcoding is an effective marker for identification of sciaenids from Indian seas. The study has also supported the claim of robustness of universal primers for 16S rRNA and COI gene. The primer pairs used in the study following Palumbi et al. and Ward et al. could successfully amplify an approximately 583 bp 16S rRNA gene and 655 bp segment of COI respectively in all sciaenids [16, 26].

In conclusion, our results are congruent with the taxonomic divisions of sciaenids, based on morphological characters as reported by [6]. The species-diagnostic profiles inferred from both 16S rRNA and COI gene sequences were consistent with Johnieops being a distinct genus and Johnieops vogleri being a separate species from Johnius dussumieri and J. elongatus. This is a good starting point for examining the status of sciaenids especially those belonging to the genera Johnieops and Johnius documented in FishBase. The study also supports the earlier results on genetic relationships of sciaenids obtained using allozyme and RAPD markers [7, 10]. It is concluded that partial sequence information of both the mitochondrial genes-16S rRNA and COI can be used as a diagnostic molecular marker in identification and resolution of taxonomic ambiguity of sciaenids.