Introduction

The pigmentation of animal skin, eyes, and hair is mainly determined by the quality, quantity, and distribution of melanin. Melanosomes are located in melanocyte and are specifically responsible for the transfer of melanin to surrounding epidermal cells (also known as keratinocyte) (Ito and Wakamatsu 2008). The iridophore deposits guanine crystals which are assembled into multilayer reflective plates to produce the silver reflectivity. The xanthophore and erythrophore are similar in shape and size and have the same pteridine and carotenoid (Funt et al. 2017). The pigment cells of fish originate from neural crest cells and principally divided into melanocyte, iridophore, xanthophore, and erythrophore. The changes in skin coloration are jointly regulated by the quantity and quality of these pigment cells, which play an important role in communication, species formation, and ecological interaction in fish (Sefc et al. 2014). The skin coloration of fish is composed of one or more pigment cells. The different numbers and shapes of pigment cells determine the monotonous or colorful skin coloration of fish. For example, the skin coloration of large yellow croaker (Larimichthys crocea) is composed of melanocyte and xanthophore (Han et al. 2019; Luo et al. 2020). The pattern of alternating blue and golden stripes displayed by adult zebrafish is composed of three kinds of pigment cells: black melanophore, yellow xanthophore, and silvery-blue iridophore (Mahalwar et al. 2014; Parichy 2003). Generally, there are two ways in which pigment cells cause fish skin coloration. One is physiological changes, involving the diffusion or aggregation of pigment particles, as well as hormone and nerve regulation. The other is morphological changes, involving changes in the amount of pigment particles and pigment cells and the migration of pigment cells. No matter what kind of skin coloration changes, it is inseparable from the participation and regulation of a series of key genes, which is closely related to heredity.

In fish, humans, mice, and other vertebrates, melanocytes are the main or even the only pigment cells. Therefore, the research on the activities of melanocytes is extensive and in-depth. The melanocortin-1 receptor (Mc1r) is a dominant regulatory point of melanin synthesis to demonstrate the function of seven transmembrane G-protein coupled receptors (Guida et al. 2021; Rees 2000). Adrenocorticotropic hormone (ACTH), melanocortin peptides derived from the proopiomelanocortin (pomc) gene, α-melanocyte stimulating hormone (α-MSH), and the endogenous antagonist agouti signaling protein (Asip1) can regulate Mc1r to participate in the control of pigmentation (Cal et al. 2017). The adenylate cyclase is activated after α-MSH binded to Mc1r, and then the cAMP increases which can induce the expression of tyrosinase (Tyr), tyrosinase-related protein 1 (Tyrp1), and tyrosinase-related protein 2 (Tyrp2) (D’Mello et al. 2016; Qomaladewi et al. 2019; Xu et al. 2013). The tyrosinase-related family genes, Tyr, Tyrp1, and Tyrp2, are directly involved in the production and deposition of melanin (Zheng et al. 2010). Tyrp1 is an important downstream regulatory factor for body coloration formation in animals, and the expression of Tyrp1 will be reduced, and then the production of melanin will be inhibited after inhibiting the activity of melanocyte transcription factors such as microphthalmia-associated transcription factor (Mitf) (Fang et al. 2002; Wu et al. 2021a). Tyr enhancer was induced and activated by Mitf and sex-determining region Y-box protein 10 (Sox10) in mouse melanocyte, and then melanin synthesis was initiated (Murisier et al. 2007). Different from melanocyte, the pigment cells of frog liver can synthesize melanin through melanogenesis when Tyr-like (Tyrl) is activated (Cicero et al. 1989). Similarly, it has been confirmed that tyrosinase is one of the regulatory pathways involved in the regulation of cytochrome synthesis by Mitf in most animals. Mitf increases the expression of pre melanosome protein (Pmel) and then regulates the synthesis of melanin (Gutknecht et al. 2015; Zhang et al. 2013). In other ways, Mitf is regulated by the Gpnmb gene with the expression level of Mitf downregulated after overexpression of Gpnmb, and the expression level of Gpnmb is decreased after treatment of active small molecule inhibitors of Mitf (Gutknecht et al. 2015; Zhang et al. 2013). Therefore, Mitf is a key target site for the study of melanin synthesis in mammals. In addition, the development of melanocyte requires non autonomous or autonomous signal transduction through endothelin receptor type B (Ednrb) (Hou et al. 2004; Lee et al. 2003). As an upstream regulatory element of tyrosinase family genes, Ednrb is closely related to melanin synthesis and expressed in many cell types (Hou et al. 2004; Lee et al. 2003). In a word, a series of key skin coloration genes jointly regulates the development, variation, migration, and apoptosis of pigment cells which determine the skin coloration of fish single or colorful.

As a new transcriptome research technology, RNA-Seq technology has attracted more and more attention for its accuracy, efficiency, and speed. RNA-Seq technology can not only annotate and map the fish transcriptome, but also help to understand the biological information of fish, such as immunology, toxicology, physiology, evolutionary biology, and developmental biology (Qian et al. 2014). The DEGs in the skin tissues of seawater and freshwater sticklebacks (Gasterosteus aculeatus) and the light and black skin tissues of freshwater stickleback were screened by RNA-Seq technology, and then the differentially expressed genes (DEGs) were verified by qRT-PCR to clarify the mechanism of skin coloration variation (Greenwood et al. 2012). The transcriptome of the caudal fin of the wild-type and the albino-type of Aulonocara baenschi was sequenced by RNA-Seq to identify their pigment-related genes and analyze the expression patterns of the genes at the transcriptome level (Lee and Lee 2020). In the study of Yellow mutant rainbow trout (YR), YR skin transcriptomes were analyzed to explore temporal expression patterns of pigmentation-related genes in three different stages of skin color development (Wu et al. 2022). The key color variation-associated mRNAs and miRNAs between wild-type (WR) and yellow mutant rainbow trout (YR) were sequenced and identified, which also laid a solid foundation for genetic selection and breeding of fish skin coloration (Wu et al. 2021b). Comparing the fin transcriptome in the wild-type and albino-type of Pelteobagrus fulvidraco, it was found that the expressions of Tyr, Tyrp1, and Ednrbla were upregulated in albino types; this may result from the upregulation of cystine/glutamate transporter in albino types (Zou et al. 2015). By analyzing the transcriptome in the specific skin pigment pattern in trout (Salmo species) with the labyrinthine skin pattern and spotted skin pattern respectively, the DEGs possibly associated with skin pigment pattern were identified, and the expression of Gja5, Clcn2, Cdkn1a, and Tjp1 was detected (Djurdjevic et al. 2019). The application of transcriptome has greatly expanded the basic data of skin coloration research of fish, which is not only conducive to the screening of main genes and regulatory pathways, but also provides reliable auxiliary molecular markers for breeding.

Centropyge has a wide variety of colorations and shapes, which is favored by consumers. Pearlscale angelfish (Centropyge vrolikii), commonly known as the black-tail angelfish, is a unique marine ornamental fish with huge market potential in China and is mainly found near coral reefs in the Indian Ocean and the Pacific Ocean (Dibattista et al. 2016; Fernandez-Silva et al. 2018). In our previous research, the DEGs in gonads at different stages in C. vrolikii were screened using RNA-Seq technology for further analysis (Zhong et al. 2021), and the pluripotent transcription factor Nanog was cloned and analyzed (Zhong, et al. 2022a). Understanding the molecular regulation mechanism of skin coloration is an important prerequisite for the research of skin coloration inheritance and improvement in fish. However, little is known about its skin coloration at the molecular level. In the model organism, the core regulatory network of their skin coloration has been deeply studied, but the molecular mechanism and related technical means of the regulatory network in C. vrolikii have not been reported. In this study, the different colored tissues at different parts of C. vrolikii were sequenced by RNA-Seq technology, and the key genes affecting the development and variation of skin coloration were screened and then verified by qRT-PCR. The results will provide a new perspective on the mechanism of skin coloration in small sea angelfish and have important significance for the skin coloration breeding of C. vrolikii in the future.

Materials and methods

Sample collection

The tissue samples of C. vrolikii were obtained from the Aquatic Experimental Center of Fisheries College of Jimei University (Xiamen China). A total of 24 individual fish (more than 3.5 cm in length, 3 + years old) were caught in the Philippine Sea area, which was disinfected and fed temporarily with circulating seawater, and were taken and carefully dissected for the pectoral fin (PF), anal fin (AF), tail fin (TF), eyeballs (EB), and other tissues. Then the tissues were collected in RNAlater (Ambion) and stored at 4℃ for 24 h and then transferred to − 80℃ until RNA extraction. The squamae and fins were washed with normal saline and directly placed on the culture dish for observation by stereoscopic microscope.

RNA extraction and transcriptome sequencing

A total of 24 fish with a body length of more than 3.5 cm were selected, and the tissues of each 6 fish in the same part were taken as a sample for RNA extraction. RNA was extracted from PF, AF, TF, and EB using the Total RNA Kit II (OMEGA, USA). RNA concentration and purity were determined by the NanoDrop1000 Spectrophotometer (Thermo Scientific, MA, USA). Samples with A260/280 values of 1.8 ~ 2.0 could be utilized for downstream experiments. The integrity of the RNA was tested by 1.0% agarose gel electrophoresis. Subsequently, one part of RNA (PF, AF, TF, and EB) was performed reverse transcription immediately, and others were sent to Biomarker Technologies Co., Ltd. (Beijing, China) for quality testing and transcriptome sequencing.

Transcription sequence acquisition and gene annotation

Low-quality data were removed from raw reads, and clean reads were subjected to further analysis Trinity is an assembly software specially designed for high-throughput transcriptome sequencing. First, the sequenced reads are broken into shorter fragments (k-mer), then these small fragments were extended into longer fragments (contig), and the fragment set (component) was obtained by using the overlap among these fragments. Finally, transcripts were identified in each fragment set by using the de Bruijn diagram method and sequencing read information. The sequence of unigene was compared with NR, Swiss-Prot, COG, KOG, eggNOG4.5, and KEGG databases by using DIAMOND software, and the KEGG ontology results of unigene in KEGG were obtained by using KOBAS. The gene ontology (GO) results of new genes were analyzed by InterProScan (A software package that allows users to scan sequences against member database signatures) using the database integrated by InterPro, and the annotation information of unigene was obtained by comparing with the Pfam database in the HMMER software after the amino acid sequence of unigene was predicted. Bowtie was used to compare the sequenced reads with the UniGene library. According to the comparison results, the expression level was estimated in combination with RSEM (an accurate tool for quantifying transcript abundances from RNA-Seq data). The expression abundance of the corresponding unigene was indicated by the FPKM value. The Pearson’s correlation coefficient R was used as the evaluation index of correlation between samples.

Screening and enrichment of differential expression genes

To compare differences in the gene expression in TF, PF, AF, and EB groups of C. vrolikii, a method described by Benjamini–Hochberg was adopted to evaluate the tag frequency in the different RNA-Seq libraries. The threshold P value in multiple tests was determined by measuring the FDR (false discovery rate). In the process of screening DEGs, the FDR was less than 0.01, and the difference multiple FC (fold change) was greater than or equal to 2 as the screening standard. The DEGs were subjected to the enrichment analysis by GO (Gene Ontology) functional group, KEGG, COG (Cluster of Orthologous Groups of proteins) database, and EggNOG database. Moreover, the Volcano Plot can intuitively show the relationship between FDR and FC of all genes and quickly view the difference and statistical significance of gene expression level between the two groups of samples. The path enrichment analysis of DEGs can analyze whether the DEGs were over-presentation on a certain path. The enrichment degree of the pathway was analyzed by the enrichment factor, and the enrichment significance was calculated by the hypergeometric test method.

qRT-PCR verification of transcriptome sequencing results

In order to verify the accuracy of differential gene expression from transcriptome sequencing, 13 genes were screened for qRT-PCR and primers as shown in Table 1. At the same time, the expression levels of 6 genes related to pigment synthesis were verified in various tissues. And the STRING is a database of protein–protein interactions predicted and experimentally verified by multiple species, including direct physical interactions and indirect functional correlations. Combined with the results of DEGs, qRT-PCR analysis, and the interaction pairs recorded in the database, the interaction network of DEGs was constructed. Using BLAST software, the protein sequence of the target genes (the skin coloration–related genes of C. vrolikii) was compared in the database to find the homologous proteins for the construction of a protein interaction network.

Table 1 All-unigene quality indicators of of TF, PF, AF, and EB in C. vrolikii

The first strand of cDNA was synthesized using the RevertAid First Strand cDNA Synthesis Kit (Thermo Scientific, MA, USA) from 1.0 μg total RNA samples. The product of reverse transcription was diluted 100 times with sterile water and then stored at − 20℃ as a qRT-PCR template. The qRT-PCR was based on the QuantiFast SYBR Green PCR Kit (Qiagen, Nangjing, China) on 384-well PCR plates. The 10 μL of reaction system was adopted: SYBR Green Realtime PCR Mster Mix was 5 μL, the positive and negative primers were 0.25 μL (10 μmoL/L), the cDNA template was 4.5 μL, and the reaction was conducted at 95℃ for 5 min, circulated for 45 times as follows: denaturation at 95℃ for 10 s, annealing at 59℃ for 10 s, extension at 72℃ for 10 s. In this experiment, there were 6 biological replicates for each tissue and 3 technical replicates for each sample. Rps29 was used as a reference gene. The relative fold change of the gene expression level was calculated by the 2−ΔΔCt method (Zhong, et al. 2022b).

Statistical analysis

All statistical analyses were performed using SPSS20.0 software, and one-way ANOVA was used to compare the relative expression levels. The level of significance (P) was less than 0.05.

Result

Pigment distribution

The skin coloration of C. vrolikii was primarily grayish black over the entire. The pectoral fin and ventral fin were both bright yellow color, and the anal fin and tail fin were both entirely black, while the dorsal fin was solid black in the back half and golden brown in the front half (Fig. 1A). Similar to the anal fin, the dorsal fin were primarily made up of melanocytes with a few xanthophore scattered across the base of the fin (Fig. 1B and C). Melanocytes predominated over the whole of the tail fin, and a circle of iridophore was distributed around its margin with blue patches (Fig. 1D and D-1). The morphology of pectoral fin and ventral fin was comparable, and the majority of the pigment cells were xanthophore, which were dispersed in the fin rays and uniformly distributed in the area between the fins and spines (Fig. 1E, E-1, and E-2, and F, F-1, and F-2).

Fig. 1
figure 1

The pigment cell distribution in the fins. Note: A the biological form of fish; B dorsal fin; C anal fin; D and D-1 tail fin; E and E-1 ventral fin; F and F-1 pectoral fin; X, xanthophore; M, melanocyte; I, iridophore

The tail part of C. vrolikii was dark and black in color caused by its scales. The melanocytes and xanthophore were visible as the pigment cells of scales in the tail, which were mostly gathered at the base region of scales (Fig. 2 to A-4). There were two types of melanocytes: type I cells were big, scattered, diffuse, or dendritic; type II cells were smaller than type I cells in size, absence of dendritic structure, and have high-density clumps and granules (Fig. 2).

Fig. 2
figure 2

The pigment cell distribution in the squamae. Note: A and A-1 the squamae of the posterior ocular head; B and B-1 the squamae in the middle of the fish body; C and C-1 the squamae at the front end of the tail; D and D-1 the squamae of the tail. X, xanthophore; M, melanocyte

The color of the abdomen was gray. And the melanocytes, xanthophore, and iridophore made up the majority of pigment cells of scales in C. vrolikii. On the comb side, the pigment cells were uniformly dispersed (Fig. 2 to B-4). The iridophore was substantially larger than other pigment cells, primarily consisted of long oval or polygonal short rods, and exhibited reflecting qualities with transparency under light (Fig. 2 to B-4). The pigment cells of the base of the tail did not seem rich (Fig. 2 to B-4).

The coloration of the pectoral had light yellow sections on the front and back of C. vrolikii. The concentration of pigment granules of scales in the base of the front and back of the pectoral was the most dispersed relative to the tail and abdominal, and there was no pigment cell granular zone at the end of the basal area of pectoral (Fig. 2 to C-4). The red and yellow masses of scales were dispersed in the base region, and there were just a few sporadic pigment cells in the top region of the pectoral (Fig. 2 to C-4). The structure of erythrophore was thicker than melanocytes without evident branching (Fig. 2).

Assembly of transcriptome sequencing data

The RNA with good quality of each sample was used for downstream experiments. Illumina HiSeq platform was used for transcriptome sequencing of PF, AF, TF, and EB from C. vrolikii. The samples were divided into four groups with three parallels: PF group (PF-1, PF-2, PF-3), AF group (AF-1, AF-2, AF-3), TF group (TF-1, TF-2, TF-3), and EB group (EB-1, EB-2, EB-3). After the quality control, a total of 76.27 GB of clean data was obtained, and the Q30 base percentage of each sample was not less than 93.30%. After the removal of low-quality and sub sequences, the clean reads were further analyzed and mapped to the reference genome sequence obtained, and the GC content of all samples in these four groups is shown in Table S2. The clean data of each sample was sequenced with the assembled transcript or unigene library, and the alignment results were compared as shown in Table S2. The reads written to the transcript or unigene were described as mapped reads shown in Table 1 and will be used for subsequent analysis. The total number of all-unigene is 159,213, the total length was 258,944,660, the mean length is 1626.40, and the N50 length (the length of unigene’s N50) is 2928.

Function annotation of all-unigene

A total of 159,213 all-unigene samples were assembled from the transcriptome of PF, AF, TF, and EB in C. vrolikii and compared to the functional database of COG, GO, KEGG, KOG, Pfam, Swissprot, eggNOG, and NR, respectively (Table 2). The annotation results of unigenes by the four databases (KOG, Pfam, SwissProt, and NR) were depicted by an E-value threshold of 1e-5 (1 × 10−5) in the Venn diagram (Fig. 3). The results of all-unigene annotated by NR can be further transferred to the GO database which includes three levels: cellular component, molecular function, and biological process (Fig. 4A). The cellular component contains intracellular, protein-containing complex, cellular anatomical entity, etc.; the molecular function contains the catalytic activity, the structural molecular activity, the transporter activity, the binding, etc.; the biological process contains reproduction, immune system process, behavior, metabolic process, cellular process, etc. (Fig. 4A). The COG database was used for functional annotation of unigene, and 26 COG functional classification information of gene homologous was obtained (Fig. 4B); among them, the top three were the general function prediction only, the carbohydrate transport and metabolism, and the signal transduction mechanisms. The result of BLAST top-hit species distributions for matches between the assembled unigenes of C. vrolikii and genes in the NCBINR database showed that 5046 unigenes of C. vrolikii were significantly similar to Morone saxatilis (17.20%), followed by L. crocea (13.16%), Sparus aurata (9.26%), and Lates calcarifer (5.70%) (Fig. 4C).

Table 2 The summary of all-unigene annotation of TF, PF, AF, and EB in C. vrolikii
Fig. 3
figure 3

Venn diagram of annotation results of PF, AF, TF, and EB in C. vrolikii against four protein databases. Note: The number in each colored block indicates the number of unigenes that can be annotated by single or multiple databases (color Fig. online)

Fig. 4
figure 4

The functional distribution statistics of the transcriptome of C. vrolikii. Note: A: all-unigene GO functional distribution statistics of the transcriptome of C. vrolikii; B: all-unigene COG functional distribution statistics of transcriptome of C. vrolikii; C: Top-hit species distributions for sequences from C. vrolikii submitted to BLASTX against the NCBI-NR database

Analysis of gene expression

The gene expression was detected by the transcriptome data with high sensitivity. The FPKM value of protein-coding gene expression level can be sequenced ranging from 10−2 to 104, the expression levels were similar between different samples (Fig. 5A), and the overall discrete angle of gene expression level distribution of a single sample is not far different (Fig. 5B), which can measure the expression level of each sample from the perspective of the overall dispersion of expression. The closer R2 is to 1, the stronger the correlation between the two samples. The correlation between the gene expression of each sample is illustrated in Fig. 5C. There was no significant difference between the samples of fin rays in each group, but there was a great difference among the samples of fin rays and eyeballs.

Fig. 5
figure 5

The relationship between samples and SNP site prediction. Note: A the FPKM value of protein-coding gene expression level; B the FPKM distribution box diagram of each sample; C the correlation of gene expression of each group of samples

Analysis of differential expression genes

By comparing the transcriptome data of PF, AF, TF, and EB in C. vrolikii, the genes with a difference multiple greater than two and a Q-value < 0.001 were identified as significantly differential expression genes (DEGs). The transcriptome data of PF, AF, TF, and EB were compared in pairs, and the number of DEGs was obtained (Fig. 6; Table 3). The classification of GO functional annotation of DEGs was conducted (see Fig. S1 in Supplementary). In organisms, the different gene products coordinate with each other to perform the biological functions. The analysis of pathway annotation of DEGs is helpful to further clarify the gene functions. The DEGs were classified according to KEGG Pathway annotations (see Fig. S2 in Supplementary), and the first 20 KEGG pathways with the smallest Q-value were made into enrichment bubble diagrams (see Fig. S3 in Supplementary). According to the pathway enrichment analyses, the DEGs identified from a number of KEGG pathways such as “Melanogenesis,” “Tyrosine metabolism,” “Glycosphingolipid biosynthesis-ganglio series,” “Axon guidance,” “Cell adhesion molecules,” “Osteoclast differentiation,” “Nitrogen metabolism,” “Wnt signaling pathway,” “Metabolism of xenobiotics by cytochrome P450,” and “Drug metabolism-cytochrome P450.”

Fig. 6
figure 6

Volcano map of DEGs. Note: A to F volcano map of DEGs in PF vs AF, PF vs TF, PF vs EB, AF vs TF, AF vs EB, TF vs EB, respectively

Table 3 The number of DEGs of PF, AF, TF, and EB in C. vrolikii

Expression of DEGs and identification of protein interaction

The DEGs in different tissues were screened by transcriptome sequencing, and some potential genes regulating skin coloration were obtained, including Slc45a2, Slc24a5, Tryp1, Xdh, Tryp1a, Hps5, Pomc, Pmel, Sox10, Mitf, and Tyr (Table 4). Among them, Slc24a5, Mitf, Hes1-b, and Tyr were highly expressed in EB, Hps5, Alk, and Map2k4a were highly expressed in AF; Xdh, Tryp1, Tryp1a, Pomc, Pmel, and Sox10 were highly expressed in PF, while Flt1 and Pdgfrl were highly expressed in TF (Table 4) . Based on the results of differential expression analysis and the interaction pairs included in the database, the differential expression protein interaction network was constructed. During the analysis, the differentially expressed proteins were mapped into the STRING database of version 11.0 to obtain the interaction relationship information of differentially expressed proteins (Fig. 7). The interaction of Tyr, Mc1r, Mitf, and Sox10 jointly regulate the production and accumulation of melanin. In addition, Tyr, one of the differentially expressed genes, may be a key factor in the core regulation of melanin production, and its dense interaction network with a large number of proteins in the biological system indicates that Tyr actively participates in the pathway of melanin production.

Table 4 DEGs related to skin color in different tissues of C. vrolikii (transcriptome data)
Fig. 7
figure 7

The protein–protein interaction by STRING

qRT-PCR verification

The expression of thirteen DEGs from the transcriptome data of C. vrolikii was verified by qRT-PCR using Rps29 as a reference gene. Among them, in PF vs AF, the mRNA expression levels of Slc24a5, Tryp1, Pomc, Pmel, Sox10, Flt4, and Tyr were increased, while those of Hps5, Alk, and Mib1 were reduced, and those of Xdh, Mitf, and Trpm7 were basically unchanged (Fig. 8A); in PF vs TF, the mRNA expression levels of Slc24a5, Tryp1, Pomc, Pmel, Sox10, Mitf, Tyr, and Flt4 were increased, and other genes were reduced (Fig. 8B); in PF vs EB, the mRNA expression levels of Slc24a5, Tryp1, Pomc, Pmel, Mitf, and Tyr were increased, while other genes were reduced (Fig. 8C); in AF vs TF, the mRNA expression levels of all thirteen DEGs were reduced (Fig. 8D); in AF vs EB, the mRNA expression levels of Slc24a5, Tryp1, Mitf, and Tyr were increased, and other genes were reduced (Fig. 8E); in TF vs EB, the mRNA expression levels of Slc24a5, Tryp1, Hps5, Mitf, and Tyr were increased, while other genes were reduced (Fig. 8F). The above results showed similar patterns of mRNA abundance in RNA-Seq and qRT-PCR analysis, and the R square of the correlation between RNA-seq and qRTPCR was 0.95 (see Fig. S4 in Supplementary), indicating a strong correlation between RNA-seq and qRT-PCR, demonstrating the reliability of the RNA-Seq results in C. vrolikii.

Fig. 8
figure 8

DEG verification by qRT-PCR and comparison between RNA-Seq results and qRT-PCR. Note: The abscissa represents the validated genes, and the ordinate represents the log2 ratios of expression in PF vs AF (A), PF vs TF (B), PF vs EB (C), AF vs TF (D), AF vs EB (E) and TF vs EB (F), respectively

Six color genes, namely Mitf, Mc1r, Sox10, Tyr, Tyrp1, and Pomc, were identified as upregulated candidates in black fins or eyeballs. Based on the transcriptome data of different parts of fins and tissues, the DEGs related to melanin production (Mitf, Mc1r, Sox10, Tyr, Tyrp1, and Pomc) were screened to verify their expression in various tissues (Fig. 9). The qRT-PCR results showed that Mitf and Tyr were highly expressed in eyeballs, and from then on, the expression levels of Mitf and Tyr were also relatively high in the dark fin (tail fin, anal fin) and skin; Tyrp1 was highly expressed in the skin; Sox10 and Pomc were highly expressed in tail fin; Mc1r was highly expressed in the heart and tail fin. And the expression levels of these genes in the dark tissues or black fins were higher than that in the light-colored tissue or yellow fins.

Fig. 9
figure 9

The expression of the DEGs related to melanin production in various tissues by qRT-PCR. Note: “Adjacent letters (a, b, b, c or c, d)” means 0.01 < p < 0.05, “Alternating letters (a, c or b, d)” means 0.001 < p < 0.01, “separated by 2 letters (a, d)” represents p < 0.001

Discussion

In fish, skin pigmentation is associated with several biological processes, including camouflage, UV protection, and nutritional (Colihueque 2010). The skin colorations and patterns displayed by many kinds of fish depend on the presence of several pigment cells, such as melanophore and xanthophore (Bar et al. 2013). The pigment cells of the squamae and fins of C. vrolikii include melanocyte, xanthophore, and iridophore (Figs. 1 and 2). In fish, the skin coloration is an important commercial trait, which is determined by a single gene or multiple genes. Therefore, controlling melanin production is intensely important for regulating skin pigmentation. Melanin production was a biosynthetic pathway in which melanocyte synthesizes melanin through a series of complex chemical and enzymatic catalytic reactions (Pillaiyar et al. 2017). The cAMP/PKA pathway was an important pathway to regulate melanin synthesis by stabilizing and activating the MAPK, Wnt, and p53 signal pathway (Kadekaro et al. 2012; Kingo et al. 2008).

In recent years, with the continuous high-quality development of molecular biology technology, RNA-Seq technology, as a new generation of transcriptome sequencing technology, has been gradually applied to the study of the molecular mechanism of fish coloration development, such as carp (Zhang et al. 2017b) and catfish (Zhang et al. 2019b). In this study, RNA-seq technology was used for the transcriptome analysis, and many genes related to skin coloration development and variation were identified and screened in C. vrolikii for understanding their corresponding functions from a biological perspective. After sequencing, a total of 159,213 unigenes were assembled in our study, and the N50 value of the assembled unigenes was 2263 bp (Table 1). The results of all-unigene function annotation showed that 17.20% of unigenes had significant similarity to M. saxatilis (5046) (Fig. 4C), which indicated that the gene homology was conserved between C. vrolikii and M. saxatilis, and C. vrolikii was more closely related to M. saxatilis. Consistent with this, both of them belong to the order Perciformes in the natural classification system. Through the analysis of the KEGG pathway in the transcriptome of skin in red and white crucian carp, the DEGs are mainly enriched in the Wnt signal pathway, tyrosine metabolism pathway, MAPK signal pathway, and melanogenesis pathway (Zhang et al. 2017b). In the transcriptome data of skin in red and yellow river carp, the skin coloration was regulated by the melanin production pathway, MAPK signal pathway, and Wnt signal pathway (Zhang et al. 2019a). Similarly, the KEGG pathway analysis in the transcriptome of C. vrolikii showed that the DEGs were mainly concentrated in the melanogenesis pathway, tyrosine metabolism, pyruvate metabolism, MAPK signal pathway, Wnt signal pathway, and melanin production pathway (Fig. S3 in Supplementary). The results showed that the tyrosine metabolism pathway, melanogenesis pathway, MAPK signaling pathway, and Wnt signaling pathway were closely related to the variation of skin coloration in fish.

Skin is an important barrier for higher animals to isolate from the external environment. The melanocytes dominate the coloration of skin to some extent. The function of melanocytes was regulated by a series of genes in the heredity. The genes related to melanin production regulate the growth and development of melanocytes and result in the blackening of skin coloration (D’Mello et al. 2016). The transcription of Tyr and Tyrp1 is stimulated by Mitf, a key target of the melanin synthesis pathway, which can lead to the production of melanin (Liu and Fisher 2010). The A5 member of the solute carrier 24 family (Slc24a5) is distributed in the inner membrane of cells, whose golden mutation site may lead to the mutation of skin coloration to gold in zebrafish (Lamason et al. 2005). The transcriptome of the squamae by RNA-Seq in three different skin colorations of orange double crown cichlid (Midas cichlids) showed that Slc24a5 was significantly expressed in gray and golden squamae, which may play a regulatory role in the transformation of skin coloration from gray to gold (Henning et al. 2013). In Red tilmpa, the mRNA expression level of Slc24a5 was upregulated in pink skin with black spots compared with all pink skin (Zhu et al. 2016). Similar to these results, the expression level of Slc24a5 in the transcriptome and qRT-PCR of C. vrolikii was significantly higher in the black fins and tissues than that in the yellow fins or light tissues (Fig. 8 and Table 4). The results suggested that Slc24a5 may regulate the blackening of the skin coloration in fish. In addition, Xdh plays the important role in pteridine coloration, and the expression level of Xdh in the red tail fin was significantly higher than that in the yellow tail fin of Nothobranchius furzeri (Ng’oma et al. 2014). The transcriptome data of C. vrolikii showed that the expression levels of Xdh and Pomc were upregulated in the process of skin coloration variation from light gray or yellow to black (Fig. 8 and Table 4). It is speculated that these genes may be involved in the synthesis of melanin.

In order to further study the molecular mechanism of skin coloration of fish, the expression levels of genes related to melanin production in various tissues of fish were analyzed by qRT-PCR. The mRNA expression level of Sox10 in light and black fins was not significantly different, while the mRNA expression levels of Tyr, Mc1r, Mitf, Pomc, and Tyrp1 in black fins or tissues were significantly higher than those in light fins or tissues (Fig. 9). Studies on the expression of Mitf in Oryzias latipes (Li et al. 2013a), Cyprinus carpio and Carassius auratus (Xu et al. 2014 and Liu et al. 2015), and zebrafish (Elworthy et al. 2003) have also been carried out, and the role of Mitf in fish skin coloration formation was preliminarily explored. The skin coloration–related genes, such as Sox10, Tyrp1, Tyr, and Mc1r, have also been reported in Cyprinus carpio var. color (Gu 2016; Li et al. 2013b and Li 2014), Cyprinus carpio (Bar et al. 2013), and other species with similar evolutionary relationship (Selz et al. 2007; Gross et al. 2009; Kobayashi et al. 2016; Sánchez et al. 2010). In melanocyte, Sox10 directly activated Mitf to induce Tyr and Tyrp2 to produce pigmentation (Hou et al. 2008), and Wnt3a promoted melanin production by upregulating the expression of Mitf, Tyrp1, and Tyr (Guo et al. 2012). Tyr and Tyrp1 are members of the tyrosine gene family, encoding proteins related to melanin production (Camacho-Hubner et al. 2002). Tyr mRNA was expressed in the liver, cecum, gill, muscle, and skin of adult Japanese amphioxus (Branchiostoma japonicum), but not in the intestine, neural tube, and testis (Pang et al. 2013), which is similar to the expression level of Tyr in our study (Fig. 9 B). Through transcriptome analysis of catfish skin of albino and wild-type, Tyrl and Tyrp1 were significantly upregulated in albino (Zhang et al. 2019a). The mRNA expression level of Tyrp1 in gray or black skin was upregulated than that in red skin of red crucian carp (Carassius auratus) and red fin snapper (Lutjanus erythropterus), respectively (Zhang et al. 2015 and 2017). In C. vrolikii, the mRNA expression levels of Tyr, Mc1r, and Mitf in black fin were significantly higher than that in yellow fin, which may be one of the reasons for its regulation of melanin and blackening (Fig. 9). It can be seen that the mRNA expression level of Tyrp1 in gray black skin of fish is higher than that in light coloration skin. Similarly, the mRNA expression levels of Tyr and Tyrp1 in black fins or dark tissues were significantly upregulated compared with yellow fins or light tissues of C. vrolikii. All these showed that Tyr and Tyrp1 were significantly expressed in the process of light coloration blackening, indicating they may play a key role in the process of skin coloration blackening.

Interestingly, these six genes showed high expression level in spleen tissue and some in heart tissue and low expression level in liver tissue. Research has shown that Sox10 has been shown to be extensively expressed in adult tissues such as the brain and heart, with little or no expression in the prostate, bladder, pancreas, liver, and stomach of mammals (Volff and Schartl 2003). In Cynoglossus semilaevis, in addition to a large expression in the brain, gills, skin, and eyes, there is also a high expression in the spleen, heart, head, and muscles, while there is a small expression in the kidneys and intestines, but not in the liver and gonads (Potterf et al. 2001). This study found that Sox10 was mostly in the anal fin, caudal fin, and skin, followed by the kidneys, pectoral fins, heart, eyes, and spleen, with little or no expression in the ovaries, intestines, and testes. Mitf regulates the transcription of tyrosinase (Tyr), and the degree of Mitf expression directly influences melanin production (Curran et al. 2010). In Yellow River carp, it is abundantly expressed in the intestines, brain, gills, liver, and pancreas, as well as the skin, buttocks, and tail (Zhang et al. 2017a); they believe Mitf is engaged in a variety of cellular processes and has several impacts. In this study, Mitf was shown to be expressed to diverse degrees in many tissues, with comparatively high levels of expression in the eyes, kidneys, tail fin, and anal fin, this supports the notion that Mitf participates in a variety of cellular processes and has a wide range of biological roles and consequences. Compared to muscle and liver, the zebrafish’s Mc1r is significantly expressed in the brain, eyes, gills, skin, and testes (Selz et al. 2007). This study demonstrates that there are variations in the expression of Mc1r in various fish tissues, with the heart and spleen exhibiting the highest levels of expression. In contrast to other tissues, where it is little expressed, the gills, skin, and pectoral fin do exhibit some expression. The expression of Mc1r varies from that of other fish, demonstrating the species-specific nature of gene expression. Tyr, Tyrp1, and Pomc were widely expressed in a variety of fish tissues, which implies that they may be involved in a number of cellular processes. Further evidence that these genes may be involved in the growth and differentiation of melanocytes comes from the high expression levels of these genes in the skin, anal fin, and tail fin. The function of these genes in the development of melanocytes and the production of fish body color requires more study.

Conclusion

The differential expression genes (DEGs) related to the skin coloration in the transcriptome data of the pectoral fin (PF), anal fin (AF), tail fin (TF), and eyeballs (EB) of C. vrolikii have been screened and analyzed, respectively. The variation trend of qRT-PCR was consistent with that of transcriptome data. Among the results, some genes related to the skin coloration variation were screened and enriched through the KEGG pathway. It was found that they were involved in the regulation of melanogenesis pathway, tyrosine metabolism, pyruvate metabolism, MAPK signal pathway, and Wnt signal pathway and then affected the skin coloration of C. vrolikii. The information on the interaction between the fish skin coloration and the regulatory pathways is worthy of further investigation. The expression levels of the DEGs related to melanin production (Mitf, Mc1r, Sox10, Tyr, Tyrp1, and Pomc) in various tissues of C. vrolikii were screened and validated. The interaction of these genes may affect the skin coloration under the influence of abiotic factor such as environments during the fish growth. These results are helpful to understand the molecular regulatory mechanism of skin coloration, also present basic support for skin coloration control of C. vrolikii in the aquaculture industry, and provide a theoretical basis and a data source for fish developmental biology.