Leersia perrieri (A. Camus) Launert belongs to the genus Leersia of the tribe Oryzeae in the grass family, which is closely related to the genus Oryza with an estimated divergence time of nearly 14 million years ago (Guo and Ge 2005; Tang et al. 2010). In recent years, human activities have seriously destroyed wetland habitats, making the loss and quick decline of natural populations of L. perrieri, which usually occur in the wetlands, ditches and marshlands of Africa and Madagascar (Raimondo et al. 2009). As chloroplast sequences in plants can offer useful information for the reconstruction of complex evolutionary relationships, they have been extensively applied to species identification and phylogenetic analysis over the last years (Jansen et al. 2007; Moore et al. 2007; Parks et al. 2009; Ruhsam et al. 2015). The information of chloroplast genomes has been extensively applied in understanding plant genetic diversity and evolution that are of great importance in the conservation genetics (Ye et al. 2014; Zhang et al. 2016). The availability of chloroplast genome of L. perrieri will greatly enhance the conservation efforts of this endangered plant species that harbors valuable gene resources for rice breeding programs in the future.

In this study, we sequenced, assembled and annotated the chloroplast genome of L. perrieri using high-throughput genome sequencing technology. Live plants of L. perrieri were provided by the Genetic Resources Center of the International Rice Research Institute (IRRI) at Los Banos, Philippines. Total genomic DNA was extracted from fresh young leaves with a modified CTAB method (Li et al. 2013). According to the Illumina’s protocol, the pair-end libraries were constructed and then sequenced using Hiseq2000 platform, obtaining a total of about 12.94 million high-quality clean reads (~1.09 Gb). We assembled and annotated the cp genome using CLC Genomic Workbench v6.5 (CLC bio, Denmark) and Dogma (https://dogma.ccbb.utexas.edu/), respectively. The annotated genomic sequence had been deposited into the GenBank database under the Accession No. of KY347906.

The circular genome of L. perrieri is 136,196 bp in size, and comprises a pair of inverted repeat (IR) regions of 21,321 bp each, a large single-copy region of 81,587 bp and a small single-copy region of 11,968 bp (Fig. 1). The chloroplast genome harbors 130 genes, including 85 protein-coding genes, 37 transfer RNA genes and 8 ribosomal RNA genes. Of them, 70 genes are located on the negative strand and 60 resided on the positive strand. We found 13 genes contain introns, including rpl2, ndhB, ndhA, rpl16, petD, petB, rps12, ycf3, atpF, and rps16. The base composition is asymmetric (A: 30.70%; T: 30.40%; C: 19.38%; G: 19.52%) with an overall GC content of 38.90%.

Fig. 1
figure 1

The chloroplast genome of L. perrieri

To examine the phylogenetic position of L. perrieri, we constructed the phylogenetic tree between L. perrieri and 18 other closely related grass species, including 13 from Oryzinae, 4 from Zizaniinae and 1 from Leersia, using MEGA software (version 5.05) (Tamura et al. 2011) based on whole chloroplast genome sequences with maximum likelihood method (Fig. 2). Brachypodium distachyon from the subfamily Brachypodieae was selected as outgroup. Our results show that the topology of the phylogenetic tree agrees with the traditional tribe-level taxonomy of the genus Oryza. L. perrieri together with L. Tisserantii were clustered into a single phylogenetic cluster and formed the most close outgroup of the genus Oryza. The complete chloroplast genome of L. perrieri will build a strong foundation for its efficient conservation that greatly help to protect the precious rice gene pool from loss.

Fig. 2
figure 2

Maximum likelihood (ML) tree inferred using MEGA (version 5.05) based on whole chloroplast genome sequences from L. perrieri and 19 other grass species