Introduction

Genus Mystus Scopoli 1777 (Teleostei: Bagridae) comprises small to medium-sized freshwater and estuarine catfishes distributed from the Middle East to South, and South East Asia [1]. Currently, 42 species are considered valid within the genus, of which, 15 species are reported in India. The taxonomic validity of additional six species, described from India, requires confirmation as they have been published in ‘predatory journals’ and are considered ‘unavailable’ [2]. The taxonomy of members of the genus Mystus is in flux, as many species are morphologically similar, and subtle diagnostic characters have been used to delimit the species [1]. Therefore, accurate species-level identification using morphological characters alone is problematic [3]. Further, as the monophyly of the genus has been considered doubtful [4], several studies continue to be carried out on the molecular phylogenetics and genetic- based resolution of species-level identities [5, 6].

Mystus tengara, M. vittatus, and M. carcio, three common ‘striped’ bagrid catfishes distributed on the Indian subcontinent, are used as both food fish and in the aquarium trade [7]. The three species have ambiguous taxonomic history, and thus their identity is confusing as they share similar and often overlapping morphological characters [7, 8]. Initially, Bloch [9] described M. vittatus from Tranquebar (Tamil Nadu), India, and subsequently, M. tengara and M. carcio were described from the erstwhile Bengal Presidency by Hamilton [10]. As the original description of M. tengara and M. carcio were based on the limited number of diagnostic characters [11], many subsequent authors considered M. carcio as a junior synonym of either M. tengara [3, 12] or M. vittatus [12,13,14,15,16,17]. Some researchers also considered M. tengara as a junior synonym of M. vittatus [14,15,16, 18].

Several authors attempted to clarify the long-standing confusion in the literature by re-describing M. carcio and M. tengara. They also confirmed that M. tengara, M. carcio, and M. vittatus are distinct species. Nevertheless, the molecular phylogeny and geographical distribution of these three species have not been studied. Further, studies reported the occurrence of these species far away from their type locality. For example, M. vittatus, described from the south-eastern part of India (Tamil Nadu) was subsequently recorded from North–East India [20,21,22]. Similarly, various authors have recorded M. tengara, a species described from Bengal, in Southern peninsular India [23]. Though, the validity of these records has been debated [11, 24], several genetic sequences presumably of the three species collected from distinct geographical regions are available; thus, necessitating a study to understand and clarify the identity and distribution of M. tengara, M. carcio, and M. vittatus. In the present study, we have attempted to fill this knowledge gap using an integrated taxonomic approach.

Materials and methods

Study area and sampling

Specimens of M. tengara were collected from the Sodepur fish market (n=5), West Bengal, and from Nath Sagar (n=9), Godavari River (19°32′05.9″ N, 75°20′09.7″ E), Maharashtra, India. For molecular analysis, muscle tissue along the left side of the specimens were stored in 95% ethanol. All samples were preserved in 10% formalin for morphological studies.

Morphometrics and meristics

The morphometric characters were measured with an automated digital caliper (to the nearest 0.1 mm), and counts were recorded from the left side of the fish, following Chakraborty and Ng [25]. Measurements were reported as percentages of standard length (SL), whereas subunits along the head region were presented as percentages of head length (HL). Species-level identification was confirmed by using available taxonomic literature [3, 10, 11, 17, 19, 26, 27

DNA isolation and PCR amplification

Total genomic DNA was isolated from the muscle tissue (n=6) using the Phenol-Chloroform method [28]. The partial fragment of mitochondrial cytochrome c oxidase (COI) gene was amplified out using the method described by Ward et al. [29]. The PCR amplified products were purified using GelExtraction Kit (Qiagen, Germany) and both sense and antisense strands were sequenced by Xcelris Lab Limited (Gujarat, India). The generated sequences were deposited in GenBank with accession numbers MT928144- MT928148 and MT928150.

Molecular data analysis

A dataset was prepared including sequences generated in the present study (Five COI sequences of Mystus tengara and one COI sequence of Mystus cf. tengara) and those reported in NCBI GenBank (M. tengara-27, M. vittatus-41, M. carcio-8 and other species of the genus Mystus-22) (Online Resource 1). Sequences of Hemibagrus menoda and H. punctatus were used as outgroup. All the sequences were aligned using Clustal W program [30] (Online Resource 2). The phylogenetic tree was built using the maximum likelihood (ML) approach employing PhyML plugin and Bayesian (BI) approach using MrBayes plugin in Genious Prime v 2019.1.3. The most appropriate model was selected employing jModeltest v2.1 [31] under the Akaike information criterion (AIC), [32]. The best-fit model of sequence evolution was HKY+I+G. The gamma distribution parameter was obtained using jModeltest v2.1, and the robustness of tree topology was estimated by bootstrap analysis based on 1000 replicates. Intra and inter-specific genetic distance values were estimated using the Kimura 2-parameter model using MEGA7 software [29, 33].

Results

Description

A comparative description of morphometric characters of M. tengara (those determined from the present study and published literature), M. carcio, and M. vittatus are presented for differentiation (Table 1). Mystus tengara (Fig. 1) can be distinguished from all other congeners by the following combination of characters: Eye diameter 20.06–26.80% of HL; dorsal spine length 11.44–17.79% of SL; length of adipose fin base 22.62–35.31% of SL; post adipose distance 11.92–15.80% of SL; 12–17 serrae along the posterior margin of pectoral fin; tympanic spot present; presence of four longitudinal stripes separated by three pale interspaces.

Table 1 Morphological features of M. tengara, M. vittatus and M. carcio
Fig. 1
figure 1

Mystus tengara collected from West Bengal (67.95 mm SL)

Fig. 2
figure 2

Phylogenetic tree showing major clades of species of Mystus identified as M. tengara, M. vittatus, and M. carcio and their phylogenetic position within the genus

Body moderately compressed. Dorsal profile rising evenly from tip of snout to origin of the dorsal fin and sloping ventrally from the origin of the dorsal fin to end of caudal peduncle. Ventral profile more convex up to anal fin base, then sloping slightly dorsally to end of caudal peduncle. Bony elements of the dorsal surface of head covered with thin skin. Anterior cranial fontanel extending from the level of posterior nasal opening to posterior orbital margin. Posterior cranial fontanel long, invading the region of supraoccipital bone and reaching the base of the occipital process in juvenile specimens. Occipital process reaching basal bone of dorsal fin (West Bengal specimens), and in some cases a considerable gap seen between occipital process and basal bone of dorsal fin (Maharashtra specimens). Eyes located on the dorsal half of head. Gill membranes free from isthmus. Mouth subterminal, with moderately fleshy lips. Teeth small and villiform. Barbels 4 pairs; maxillary barbel reaching anal fin, sometimes extending just beyond anal fin or reaching caudal fin base in juvenile specimens; nasal barbel reaching base of the occipital process; inner mandibular barbel reaching pectoral fin base, and outer mandibular barbel reaching the posterior tip of pectoral fin.

Skin smooth. Lateral line complete and mid-lateral in position. Dorsal fin with a spinelet, one spine and 7 branched rays; dorsal fin spine moderately long (11.44–17.79% SL) with 7 serrations on its posterior edge. Pectoral fin with the stout spine, sharply pointed at its tip with 7(3)–8(11) rays. Anterior spine margin smooth; posterior spine margin with 12 (4), 14 (4), or 15 (6) serrations along its entire length. Distal margin of pectoral fin straight. Pelvic fin short, slightly convex with i,5 rays. Adipose fin not reaching base of last dorsal fin ray, length of its base about 22.62–35.31% of SL. Anal fin with ii, 8 (5); iii 8 (9) rays. Caudal fin forked with i, 7, 7, i (8); i, 7, 8, i (3); i, 8, 8, i (3) rays, upper lobe slightly longer than lower.

Coloration

In fresh condition, body greenish to bright yellow with dark brown to black stripes on either side of the body along with a dark tympanic spot above the pectoral fin. In 10% formalin, the dorsal surface of the head and body pale brown; the ventral surface of the head and body dirty white. Dark spot in tympanic region present. Four pale brown lateral stripes separated by pale interspaces on both sides.

Phylogenetic and genetic distance analysis

The maximum likelihood (Figs.  2 , 3, and 4) and Bayesian tree (Online Resource 3) revealed a similar topology. In the phylogenetic tree, sequences labeled as M. tengara, M. vittatus, and M. carcio formed four paraphyletic clades with significant bootstrap values However, these values were not high to signify the relationship between clades.

Fig. 3
figure 3

Phylogenetic tree showing clade I (M. tengara sensustricto)

Fig. 4
figure 4

Phylogenetic tree showing clade II, III, IV (Mystus identified as M. tengara, M. vittatus, and M. carcio) and clade V (M. tengara)

In the maximum likelihood tree, Clade I comprises Mystus tengara (samples collected from Maharashtra, Western India, and West Bengal, Eastern India, as a part of present study; published sequences from Assam, North-eastern India, and Bangladesh); M. vittatus (reported from Assam, Tripura, Manipur, Meghalaya—North–east India; West Bengal, Andhra Pradesh–Eastern India; Madhya Pradesh, Telangana—Central India; Maharashtra—Western India and Korea), M. carcio (Assam) and M. horai (Uttar Pradesh). Clade II includes species of M. carcio (Bangladesh) and M. tengara (North–east India, Bangladesh, and Korea). Mystus tengara, recorded from Assam (MH156942), formed a separate branch in the tree, which was observed to be a sister group to clade II. Clade III comprised exclusively of various populations of M. vittatus recorded from northern India (Uttarakhand), north-east India (Arunachal Pradesh), central India (Madhya Pradesh), and Nepal. Clade IV comprised of specimens of Mystus cf. tengara (eastern India: West Bengal) collected in the present study, and M. vittatus (reported from north–east India). Clade III and Clade IV were observed to be sister groups with significant bootstrap values. The published sequences of M. tengara from Uttar Pradesh formed a distinct clade that occupied the basal position in the phylogenetic tree (Clade V).

The average genetic distance values within and between clades are provided in Table 2. Within M. tengara (the present study samples), the genetic distance values are ranged from 0.2 %(West Bengal- Maharashtra) 0.4% (West Bengal). Between Mystus cf. tengara and M. tengara, the average genetic distance value was 12.6 %. The genetic divergence value among clades ranged from 9.0 to 11.3 (Clade I-II), 12.1–18.0 (Clade I–II), 12.6–13.8 (Clade I–IV), 17.8–19.0 (Clade I–V), 9.1–10.3 (Clade II-III), 8.2–9.3 (Clade II–Clade IV), 16.7–18.4 (Clade II–V),2.8-3.8 (Clade III–IV), 15.0-15.7 (clade III–V) and 15.9–17.4 (Clade IV–V) (Online Resource 2).

Table 2 Pair-wise average genetic distance values within and among the clades in the phylogenetic tree

Discussion

Comparative morphometric evaluation

The present study used an integrated taxonomic approach to resolve the identity and distribution of Mystus tengara. Comparative morphological evaluation of freshly collected specimens of M. tengara, from West Bengal, showed close similarity to the original description [10], in having four longitudinal stripes separated by 3 pale interspaces, presence of large tympanic spot above the pectoral fin, length of four barbels longer than the head, and occipital process reaching basal bone of dorsal fin. Specimens of M. tengara collected from Maharashtra also match with the description of Hamilton for M. tengara, except with the presence of a small interspace between the occipital process and dorsal fin base.

M. tengara is differentiated from M. vittatus [10] by the absence of serrations in the dorsal spine (vs. presence), a character which is suggested to be an error [27], based on Gunther’s description of “Macrones tengara”. M. tengara further differentiated from M. vittatus [26] by median longitudinal groove reaching to base of occipital process and occipital process reaching basal bone of dorsal fin vs. median longitudinal groove reaching midway behind the hind edge of the eye and base of the occipital process and the short interspace between occipital process and basal bone of dorsal fin. In the present study, we observed variations in the median longitudinal groove with size and geographical locations.

Darshan et al. [11] re-described M. tengara to establish and confirm its taxonomic identity and differentiated this species from M. vittatus. The specimens of M. tengara identified by Darshan et al. [11] varied from M. vittatus in having a longer maxillary barbel (254.5–360.5 vs. 214.3–244.9), longer dorsal spine (12.3–17.2 vs. 10.7–12.2), and length of the median longitudinal groove (reaching base of occipital process in juvenile or reaching anterior one-third of supraoccipital bone in adult vs. terminating at the anterior border of supraoccipital bone not invading the supraoccipital region). In the present study, specimens of M. tengara, from Eastern India (West Bengal), showed variations from specimens collected from Western India (Maharashtra) in maxillary barbel length, dorsal spine length (14.23–17.79 vs. 11.44–13.31), and extent of the occipital process (reaching dorsal fin base vs. not reaching) but, without any variations from the morphometric description of M. tengara and M. vittatus given by Darshan et al., and Sudasinghe et al., [11, 24], which shows that the effects of environmental variations in this trait cannot be ruled out.

DNA barcoding and phylogenetic study

DNA barcodes have been used for confirming the identity of species and their distribution [35]. Previous studies have shown that a genetic divergence value of 2–3% at DNA barcoding gene (COI) could be used as a threshold value to discriminate species [36, 37]. Accordingly, conspecific individuals show a genetic divergence value of <3%, while congeneric species >3%. During the present study, in Clade I, sequences identified as M. vittatus, M. carcio, and M. horai from different geographical locations, were clustered with ‘M. tengara’ (collected in the present study) having a genetic distance of <3%. The average genetic distance value among M. tengara specimens of the present study is 0.3%. These observations suggest that the sequences identified and labeled as M. carico/M. vittatus/M. horai in GenBank could be misidentifications of M. tengara.

Hamilton [10] described M. carcio and distinguished it from M. tengara in the length of maxillary barbel (extending beyond pectoral vs. reaching to end of caudal) and serrations on dorsal spine (presence vs. absence). Darshan et al. [19] revalidated M. carcio and distinguished it from M. tengara and M. vittatus based on shorter adipose-fin base length (8.5–11.9 vs. 24.0–31.7 and 21.5–26. 0 respectively) and posterior fontanel length (reaching the base of supraoccipital process vs. not reaching the middle of supra-occipital bone vs. terminating at the anterior tip of supraoccipital respectively). The description of the median longitudinal groove in M. vittatus terminating at the anterior border of supraoccipital bone, not invading the supraoccipital region by Darshan et al. [19], is not in agreement with Day [26]. Further, the description of the median longitudinal groove in adult specimens of M. tengara not reaching beyond the middle of supra-occipital bone by Day [26], is not in agreement with the present study. However, M. carcio, re-described by Darshan et al. [19], was distinct in other characters from M. tengara identified in the present study. The re-description of M. carcio [19] was based on specimens collected from Assam, Tripura, and Bangladesh but without any molecular evidence. In our phylogenetic analysis, specimens identified as M. carcio from Assam were grouped with M. tengara sensustricto, whereas, specimens identified as M. carcio, from Bangladesh, grouped with M. tengara clade II with significant genetic variation.

Further, specimens collected from West Bengal and Maharashtra could be distinguished in morphometric characters such as dorsal spine length (14.23–17.79 vs. 11.44–13.31), pectoral spine length (18.76–20.19 vs. 16.55–17.77), eye diameter (25.05–26.8 vs. 20.06–26.00), maxillary barbel length (reaching the posterior tip of anal fin base or to caudal fin base in smaller specimens vs. reaching anal fin base) and occipital process (reaching basal bone of dorsal fin vs. a considerable gap present), which reveals geographical variations in these diagnostic phenotypic characters. Similar to our observations, there is a geographical variation in maxillary barbel length [26] in M. tengara from Punjab and Assam (reaching to the middle of the pectoral fin vs. reaching to the base of pelvic fin). These finding indicates that these characters may be influenced by the size of the fish and the environment, in which they inhabit and cannot be considered as good diagnostic characters for these species [3].

Molecular evidence reveals cryptic species

In clade II, sequences identified and labeled as M. tengara could likely be misidentifications as the genetic distance of these sequences with those in clade I are higher than 3%. M. tengara recorded from Assam (MH156942) also showed a higher genetic distance with sequences in clade I and could be a distinct species. Clade III is comprised of M. vittatus and this species was confirmed to be distributed in northern, north-eastern, western, and central India. Interestingly, specimens of Mystus cf. tengara, the focus of the present study clustered with M. vittatus recorded from north–east India and formed clade IV. Though a sister group relationship was observed between clade III and clade IV, the average genetic distance between these two clades was 3.2% suggesting the occurrence of distinct lineages or cryptic species in this group. Further based on morphometric and meristic data (Online Resource 4), we could not differentiate it from M. tengara and requires further confirmation based on more number of specimens. Species names and identities in clade V are also likely to be erroneous due to morphological ambiguities. Studies on generating reference DNA barcodes without morphological taxonomy could often lead to species misidentification [38, 39], which has been demonstrated recently in hill stream loaches of the Western Ghats [40]. Due to overlapping diagnostic characters and morphological similarities, various authors could have misidentified M. tengara, M. vittatus, and M. carcio resulting in the deposition of erroneous sequences in NCBI GenBank.

Based on morphological and genetic evidence of freshly collected M. tengara, from its type locality, we consider sequences that form part of clade I to be M. tengara sensustricto, with the distribution extending from North East India to West Bengal, North India, Central India, Northern peninsular India, and Bangladesh. Further confirmation on the identity of M. vittatus and M. carcio, by an integrated taxonomic approach based on freshly specimens collected from the type locality, is required. The observation of paraphyletic subclades and evaluation of genetic distance between subclades reveals that there could be at least four cryptic species in this group, opening up avenues for future research on the group.