Search tips
Search criteria 


Logo of peerjLatest ArticlesFor AuthorsEditorial BoardPeerJPeerJ
PeerJ. 2017; 5: e3820.
Published online 2017 September 18. doi:  10.7717/peerj.3820
PMCID: PMC5609524

The complete chloroplast genome of Cinnamomum camphora and its comparison with related Lauraceae species

Academic Editor: Rui Feng


Cinnamomum camphora, a member of the Lauraceae family, is a valuable aromatic and timber tree that is indigenous to the south of China and Japan. All parts of Cinnamomum camphora have secretory cells containing different volatile chemical compounds that are utilized as herbal medicines and essential oils. Here, we reported the complete sequencing of the chloroplast genome of Cinnamomum camphora using illumina technology. The chloroplast genome of Cinnamomum camphora is 152,570 bp in length and characterized by a relatively conserved quadripartite structure containing a large single copy region of 93,705 bp, a small single copy region of 19,093 bp and two inverted repeat (IR) regions of 19,886 bp. Overall, the genome contained 123 coding regions, of which 15 were repeated in the IR regions. An analysis of chloroplast sequence divergence revealed that the small single copy region was highly variable among the different genera in the Lauraceae family. A total of 40 repeat structures and 83 simple sequence repeats were detected in both the coding and non-coding regions. A phylogenetic analysis indicated that Calycanthus is most closely related to Lauraceae, both being members of Laurales, which forms a sister group to Magnoliids. The complete sequence of the chloroplast of Cinnamomum camphora will aid in in-depth taxonomical studies of the Lauraceae family in the future. The genetic sequence information will also have valuable applications for chloroplast genetic engineering.

Keywords: Chloroplast genome, Cinnamomum camphora, Lauraceae, Illumina sequencing, Phylogeny


Cinnamomum, contains 250–300, or more, species worldwide and is a species-rich genus of evergreen aromatic tree and shrub belonging to the Lauraceae family. As the representative species of Cinanamomum, the camphor tree (Cinnamomum camphora) is a broad-leaved evergreen characterized by aromatic oils in all of the plant parts. This species originated in the southern parts of China and Japan, and has a widespread naturalized distribution in many other countries. Camphor trees can reach up to 40 m in height and live to be over 1,000 years old. They have oval or elliptical leaves with wavy margins that give off a distinctive aromatic smell when crushed. In addition to providing sources for spices and essential oils, the species also has a notable commodity value as lumber and is ecologically significant in garden construction (Babu et al., 2003).

Chloroplast serves as the metabolic center of plant life by converting solar energy to carbohydrates through photosynthesis and oxygen release (Neuhaus & Emes, 2000). The advent of high-throughput sequencing technology has facilitated rapid progress in the field of chloroplast genetics and genomics. Approximately 800 complete chloroplast genomes from a variety of land plants have been retained in the National Center for Biotechnology Information (NCBI) organelle genome database since the first chloroplast genome of tobacco (Nicotiana tabacum) (Shinozaki et al., 1986) and liverwort (Marchantia polymorpha) (Ohyama et al., 1986), which were sequenced simultaneously in 1986. The sequenced chloroplast genomes have improved our understanding of plant biology and evolutionary relationships. Chloroplast genomes of land plants have a high degree of conservation in size, structure, gene content and the gene’s linear order. They comprise a single circular chromosome, typically ranging in size from 107 kb (Cathaya argyrophylla) to 218 kb (Pelargonium) (Chumley et al., 2006; Lin et al., 2010). Chloroplast genomes have a quadripartite structure, with a pair of inverted repeats (IRs) separated by one large and one small single copy region (Yurina & Odintsova, 1998). Several plant chloroplast genomes also show significant structural rearrangements, with evidence of the loss of IR regions or entire gene families (Hirao et al., 2008; Yi et al., 2013). Additionally, the presence of IRs might stabilize the chloroplast genomes organization (Chang et al., 2006; Wicke et al., 2011). The chloroplast genome consists of 120–130 genes divided into three functional categories, protein-coding genes, introns and intergenic spacers. Most genes primarily participate in photosynthesis, transcription and translation.

While morphological and palynological studies of the phylogeny of Lauraceae family have been performed (Rohwer, 1993; Shang & Tang, 1994), the classification systems have not been widely accepted or approved. The use of morphological and palynological characteristics generally resolves the majority plant classifications but there is currently insufficient information on the Lauraceae family to provide the high-resolution necessary to differentiate some within-species taxa, whose taxonomic relationships are controversial. Therefore, research has been conducted on the relationships among the members of the Lauraceae family using multiple short sequences inferred from the chloroplast (Chanderbali, van der Werff & Renner, 2001; Lee, Lee & Choi, 2013; Li et al., 2004; Rohwer, 2000), which provides important molecular information that can be applied to deciphering evolutionary relationships between closely related taxa with phylogenetic clades.

In this study, we sequenced and analyzed the complete chloroplast genome of Cinnamomum camphora based on illumina high-throughput sequencing technology. In addition to describing the plastic features of the chloroplast genome, we compared the gene content, repeat structures and sequence divergence with other reported species in the Lauraceae family. We also presented results of a phylogenetic analysis of protein sequences from Cinnamomum camphora and 25 other plant species. The complete chloroplast genome of Cinnamomum camphora, in conjunction with previously reported chloroplast genome sequences, will improve our understanding of the evolution relationships of genera in the Lauraceae family, especially regarding the position of Cinnamomum camphora in evolution and plant systematics. Moreover, the complete genome sequence of Cinnamomum camphora provides valuable data for that can be used in chloroplast genetic engineering.

Materials and Methods

Samples and genome sequencing

Fresh young leaves of Cinnamomum camphora were obtained from the campus of Nanjing Forestry University. The chloroplasts of Cinnamomum camphora were isolated using the Sigma Chloroplast DNA Isolation kit (Sigma, St. Louis, MO, USA), and chloroplast DNA was extracted using DNaesy Plant Mini Kit (QIAGEN, Hilden, Germany). The purified DNA was subjected to hydroshearing, end repair and then interrupted randomly to construct 350 bp libraries. The complete library, with an average read length of 150 bp, was sequenced using Illumina Hiseq2000 platform.

Chloroplast genome assemble and annotation

To ensure accurate and reliable analyses, raw reads were proofread and assembled with SOAP denovo (Li et al., 2008). The generated contigs were assembled using the chloroplast genome sequence of Cinnamomum micranthum (KT348516.1) as a reference. The chloroplast genome sequences of Cinnamomum camphora were annotated through the online program Dual Organellar Genome Annotator (Solovyev et al., 2006). The annotation results were manually checked, including the start and stop codons, and adjusted by comparison to homologous genes from other sequenced chloroplast genomes. Transfer RNA (tRNA) genes were verified using tRNA scan-SE in organellar search mode with default parameters (Schattner, Brooks & Lowe, 2005). The circular chloroplast genome map of Cinnamomum camphora was drawn using the OGDRAW program (Lohse, Drechsel & Bock, 2007). Nucleotide frequency and relative synonymous codon usage (RSCU) (Sharp, Tuohy & Mosurski, 1986) were analyzed using DAMBE (Xia, 2013) on the protein-coding genes and only genes in inverted repeat region A (IRA) were used to represent repeated genes.

Repeat structures and simple sequence repeats (SSRs) analysis

REPuter was used to identify forward and palindromic repeats with a minimal size of 30 bp, hamming distance of three and over 90% identity (Kurtz et al., 2001). Tandem repeat sequences were identified in Cinnamomum camphora using Tandem Repeats Finder with default parameters (Benson, 1999). Simple sequence repeats (SSRs) were detected using the microsatellite identification tool MISA ( with the following thresholds: minimum SSR motif length of 10 bp, and 10 repeat units for mononucleotide SSRs, five repeat units for dinucleotide SSRs, four repeat units for trinucleotide SSRs, and three repeat units for tetra-, penta- and hexanucleotide SSRs. The maximum size of interruption allowed between two different SSRs in a compound SSR was 100 bp (von Stackelberg, Rensing & Reski, 2006). All of the repeats identified with the above programs were manually verified to remove redundant results.

Comparative analysis of different Lauraceae plastomes

To encompass the complete nucleotide diversity among Lauraceae species, the complete chloroplast genome sequences were aligned using MAFFT 7.222 software (Katoh & Standley, 2013), and manually adjusted with BioEdit software (Hall, 1999). DnaSP 5.0 software was used to conduct a sliding window analysis to calculate the nucleotide variability (Pi) values (Librado & Rozas, 2009). Pi is defined as the average number of nucleotide differences per site between two DNA sequences in all possible pairs in the sample population (Nei & Li, 1979). The window length was set to 600 bp, and the step size was set to 200 bp.

Phylogenetic analysis

A molecular phylogenetic tree was constructed using 26 different plant species with TreeBeST ( Among these 26 taxa, Abies koreana, Picea sitchensis, Pinus taiwanensis, Podocarpus lambertii, Cycas revolute and Ginkgo biloba were set as the outgroup. The 26 completed chloroplast genome sequences representing the lineages of angiosperms were downloaded from the NCBI Organelle Genome Resource database. The protein-coding sequences of related Lauraceae species were determined by MUSCLE (Edgar, 2004). A phylogenetic tree was constructed based on a neighbor joining analysis. The bootstrap probability of each branch was calculated by 1,000 replications.

Results and Discussion

Genome organization and gene features

The complete chloroplast genome size of Cinnamomum camphora was 152,570 bp, the same as Cinnamomum micranthum (Wu, Ho & Chang, 2016). The chloroplast genome had a single circular chromosome with a quadripartite structure, which included a pair of IR regions (19,886 bp) that were separated by a large single copy (LSC, 93,705 bp) and a small single copy (SSC, 19,093 bp) regions (Fig. 1). Coding regions (83,429 bp), comprising protein-coding genes (72,222 bp), tRNA genes (2,271 bp) and rRNA genes (8,936 bp) accounted for 54.68% of the genome, whereas non-coding regions (69,141 bp) accounted for the remaining 45.32% of the genome. The overall GC content of the Cinnamomum camphora was 39.13%. The IR regions had a higher GC content of 44.42%, while the GC contents of the LSC and SSC were 37.96% and 33.92%, respectively. The high GC percentage in the IR regions was similar to most reported chloroplast genomes, which could be the result of ribosomal RNA in this region (Asaf et al., 2016; Qian et al., 2013).

Figure 1
Gene map of the Cinnamomum camphora chloroplast genome.

A total of 123 coding regions were identified in the chloroplast genome of Cinnamomum camphora, of which 107 were unique (Fig. 1 and Table 1). In total, 79 (73 unique) protein-coding genes were involved in the processes related to photosynthesis, the genetic system and several currently unknown functions. Additionally, 36 (30 unique) genes encoded tRNAs and eight (four unique) rRNA genes. Like the genes, the introns of chloroplast genomes were basically conserved. In Cinnamomum camphora, seven protein-coding genes (rps16, petD, ndhB (2×), atpF, rpoC1 and rpl2) contain one intron, and four protein-coding genes (ycf3, clpP and rps12 (2×)) contain two introns (Table S1). The loss of introns within the protein-coding genes reported in other plant species has not been found in the chloroplast genome of Cinnamomum camphora (Daniell et al., 2008; Jansen et al., 2008; Wu et al., 2009). As in many other land plants, rps12 was located with a single 5′ end in the LSC region and a repeated 3′ end in both of the IR regions (Raman & Park, 2016; Redwan, Saidin & Kumar, 2015; Yang et al., 2013). There were 15 genes duplicated in the IR regions, including three protein-coding genes, eight tRNAs and four rRNAs. Overall, the gene content, number and structure in chloroplast genomes were generally similar to those of other reported Lauraceae species (Hinsinger & Strijk, 2016; Song et al., 2015, 2016; Wu, Ho & Chang, 2016).

Table 1
List of genes in the chloroplast genome of Cinnamomum camphora.

The sequence analysis indicates 79 protein-coding genes in this genome represented 63,654 bp and 21,218 codons. On the basis of the sequences of protein-coding genes, the frequency of codon usage was calculated (Table 2). Among these codons 2,203 (10.87%) encode leucine, while 255 (1.25%) encode cysteine, which are the most and least used amino acids, respectively. The overall codon bias pattern in of the Cinnamomum camphora genome tended to use A/U—ending codons, and among the 29 preferred codons in the Cinnamomum camphora chloroplast genome (RSCU > 1.0), 27 ended with A/U. This phenomenon is similarly observed in many other chloroplast genomes (Nie et al., 2012; Zhang et al., 2016).

Table 2
Relative synonymous codon usage (RSCU) for protein coding in the chloroplast genome of Cinnamomum camphora.

Comparative analysis of different Lauraceae plastomes

We compared four other reported chloroplast genomes of representative taxa in the Lauraceae family with that of Cinnamomum camphora (Table S2). These taxa included Cinnamomum micranthum (152,570 bp, KR014245.1), Persea americana (152,723 bp, KX437771.1), Machilus yunnanensis (152,721 bp, KT348516.1) and Litsea glutinosa (152,618 bp, KU382356.1). To examine the level of sequence divergence of the Lauraceae family, the Pi values within 600 bp were calculated with DnaSP 5.0 software among the five chloroplast genomes of Cinnamomum camphora, Cinnamomum micranthum, M. yunnanensis, L. glutinosa and P. americana. Between the two Cinnamomum species, Pi values varied from 0 to 0.025 (ycf2) with a mean of 0.00362, indicating that the differences between the two Cinnamomum genomes were small (Fig. 2A). However, five of the genes (ycf2, rrn23, ycf1, trnL-UAG and ndhF) showed high levels of variation, which were much higher than the values of other regions (Pi > 0.02). Among the five Lauraceae species, the Pi values ranged from 0 to 0.3178 (ndhH) with a mean of 0.03361, indicating that the differences among different species of the genera in the Lauraceae family were greater than those between congeneric species (Fig. 2B). Particularly, the entire SSC regions were highly variable among the different genera in the Lauraceae family. Coding regions, including those of ycf1, psaC, ccsA, rpl32 and a set of genes named ndh, have been identified as highly variable regions in the SSC region. The SSCs often have a higher nucleotide substitution rates relative to the IRs in land plants (Perry & Wolfe, 2002; Zhang, Ma & Li, 2011). These comparatively highly variable loci are good for exploiting molecular markers and evaluating interspecies phylogenetic relationships (Chen et al., 2015; Dong et al., 2012; Timme et al., 2007).

Figure 2
Comparison of the nucleotide variability (Pi) values of the whole plastomes in the (A) Cinnamomum and (B) Lauraceae (window length: 600 bp, step size: 200 bp).

Repeat and SSR analyses

Repeat motifs are very useful in the analysis of genome rearrangements and play an important role in phylogenetic analyses (Cavalier-Smith, 2002). A total of 40 repeat structures with a length at least 30 bp were detected in the chloroplast genome of Cinnamomum camphora. Similarly, 61, 45, 46 and 40 repeat structures were detected in Cinnamomum micranthum, L. glutinosa, M. yunnanensis and P. americana (Fig. 3D). Cinnamomum micranthum had the greatest total number of repeats, and the other four Lauraceae species showed similar numbers and patterns of repeats. For repeat analysis of Cinnamomum camphora, 12 forward and 12 palindromic repeats were found with a size of 30–44 bp, whereas only one forward and one palindromic repeats were 15–29 bp in length. Similarly, 13 tandem repeats were 15–29 bp, and one tandem repeat was 30–44 bp in length (Figs. 3A3C). The presence of these repeats indicated that the region is a potential hotspot for genome reconfiguration (Asano et al., 2004; Gao et al., 2009). Additionally, these repeats were informative resources for developing genetic markers for phylogenetic and population genetic studies (Nie et al., 2012).

Figure 3
Number and type of repeated sequences in five plastomes of Lauraceae.

Simple sequence repeats, also known as microsatellites, are 1–6 bp repeating sequences that are widely distributed throughout the chloroplast genome. SSRs are typically co-dominant and have a higher degree of polymorphism. Here, we detected perfect SSRs with a minimum size of 10 bp in Cinnamomum camphora and four other Lauraceae species. The SSRs were interrupted by a maximum distance of 100 bp. Based on the analysis, 83 perfect SSRs were detected in the chloroplast genome of Cinnamomum camphora. Similarly, 88, 81, 82 and 86 were identified in Cinnamomum micranthum, L. glutinosa, M. yunnanensis and P. americana (Fig. 4A). The majority of the SSRs in these chloroplast genomes were mononucleotides, ranging from 54 in L. glutinosa to 65 in Cinnamomum micranthum. Additionally, only one hexanucleotide was present in all of the Lauraceae species. In Cinnamomum camphora, the most abundant motif, at 74.50%, was a run of mononucleotide A/T (Fig. 4B). This result confirmed that the chloroplast SSRs found in most plants are generally composed of polythymine or polyadenine repeats, and infrequently contain cytosine and guanine repeats (Choi, Chung & Park, 2016; Kuang et al., 2011). The total output consisted of 83 SSRs: 76% (63 SSRs) in the LSC region, 19% (16 SSRs) in the SSC region, and 5% (four SSRs) in the IR regions (Table S3). In comparison with the IR region, the SSRs were more prevalent in the LSC and SSC regions. SSRs in coding regions are prone to mutation and cause frame-shifts to occur, which renders the gene non-functional (Wang, Barkley & Jenkins, 2009). Chloroplast SSRs have been used to evaluate genetic variations among plant genotypes (Vendramin et al., 1999) and to investigate the genetic diversity of Lauraceae species (Santos, Spironello & Sampaio, 2008; Zhai et al., 2010). The SSRs in this analysis are good resource for developing molecular markers and will be applied to molecular marker-assisted breeding, population genetics and genetic linkage map construction (Deguilloux, Pemonge & Petit, 2004; Huang et al., 2014).

Figure 4
Number and type of simple sequence repeats (SSRs) in five plastomes of Lauraceae.

Phylogenetic analysis

Chloroplast genome sequences are useful for deciphering phylogenetic relationships among closely related taxa and for clarifying the evolutionary patterns of plant species (Jansen et al., 2007; Kyunghee et al., 2015). To examine the phylogenetic position of Cinnamomum camphora in the Lauraceae family, the core protein-coding genes that are common to all 26 chloroplasts were used to infer their phylogenetic relationships (Fig. 5). Several species in the gymnosperm were set as outgroups. The alignment analysis was conducted by MUSCLE. A neighbor joining analysis was performed with TreeBeST using 1,000 bootstrap replicates. The long branches indicated faster rates of plastid sequence evolution compared with other members in the tree. The tree suggested the correct phylogeny inference, followed by the latest angiosperm phylogeny group III (Bremer et al., 2009). The phylogenetic tree indicated that Calycanthus is most closely related to Lauraceae, which are both members of Laurales and Laurales forms a sister group to Magnoliids. In corroboration with other studies, Endiandra discolor were first separated from the other Lauraceae species (Hinsinger & Strijk, 2016; Rohwer & Rudolph, 2005). Furthermore, the position of Cinnamomum camphora was clustered with Cinnamomum micranthum, both of which are members of Cinnamomum. The chloroplast genome of Cinnamomum camphora will provide valuable and essential genetic information to further the phylogenetic resolution among angiosperms (Leebens-Mack et al., 2005; Moore et al., 2007; Ruhfel et al., 2014).

Figure 5
Molecular phylogenetic tree of 26 species based on a neighbor joining analysis.


We successfully assembled, annotated and analyzed the complete chloroplast sequence of Cinnamomum camphora. As an ancient tree species, the chloroplast genome of Cinnamomum camphorais was still conserved and found to be very similar to its sister taxon, Cinnamomum micranthum. The repeat sequences identified in Cinnamomum camphora could be selected for developing markers, population studies and phylogenetic analyses. A phylogenetic analysis suggested that Calycanthus is most related to Lauraceae, with both of them being members of Laurales, which forms a sister group to Magnoliids. The availability of the Cinnamomum camphora chloroplast genome will aid in for further investigations of this woody plant and will also, in conjunction with previously published chloroplast genome sequences, help to expand our understanding of the evolutionary history of Lauraceae chloroplast genomes, including the position of Cinnamomum camphora in plant systematics and evolution. In addition, it will assist in making other molecular biology applications, such as chloroplast gene transformation, feasible.

Supplemental Information


Supplemental Information 1

Table S1.

The lengths of introns and exons for genes in the C. camphora chloroplast genome.


Supplemental Information 2

Table S2.

Comparison of chloroplast genome characteristics of C. camphora and four species of Lauraceae.


Supplemental Information 3

Table S3.

SSRs for Cinnamomum camphora chloroplast genome.


Supplemental Information 4


Supplemental Information 5


Funding Statement

This project was supported by the National Natural Science Foundation of China (31460209), the Science and Technology Innovation project of Jiangxi Province (20152BCB24014), the Science and Technology Planning Project of Jiangxi Province (20151BBF60018), the Forestry Science and Technology Innovation Special Project of Jiangxi Forestry Department (201701), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD) and the Collaborative Innovation Plan of Jiangsu Higher Education (CIP). There was no additional external funding received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

The following grant information was disclosed by the authors:

National Natural Science Foundation of China 31460209.
Science and Technology Innovation project of Jiangxi Province 20152BCB24014.
Science and Technology Planning Project of Jiangxi Province 20151BBF60018.
Forestry Science and Technology Innovation Special Project of Jiangxi Forestry Department 201701.
Priority Academic Program Development of Jiangsu Higher Education Institutions .
Collaborative Innovation Plan of Jiangsu Higher Education .

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Caihui Chen performed the experiments, wrote the paper, prepared figures and/or tables.

Yongjie Zheng analyzed the data, wrote the paper, prepared figures and/or tables.

Sian Liu performed the experiments.

Yongda Zhong analyzed the data.

Yanfang Wu contributed reagents/materials/analysis tools.

Jiang Li contributed reagents/materials/analysis tools.

Li-An Xu conceived and designed the experiments.

Meng Xu conceived and designed the experiments, wrote the paper, reviewed drafts of the paper.

DNA Deposition

The following information was supplied regarding the deposition of DNA sequences:

NCBI: MF156716

The sequences have been provided as Supplemental Dataset Files.


Asaf et al. (2016) Asaf S, Khan AL, Khan AR, Waqas M, Kang SM, Khan MA, Lee SM, Lee IJ. Complete chloroplast genome of Nicotiana otophora and its comparison with related species. Frontiers in Plant Science. 2016;7:843. doi: 10.3389/fpls.2016.00843. [PMC free article] [PubMed] [Cross Ref]
Asano et al. (2004) Asano T, Tsudzuki T, Takahashi S, Shimada H, Kadowaki K. Complete nucleotide sequence of the sugarcane (Saccharum officinarum) chloroplast genome: a comparative analysis of four monocot chloroplast genomes. DNA Research. 2004;11(2):93–99. doi: 10.1093/dnares/11.2.93. [PubMed] [Cross Ref]
Babu et al. (2003) Babu KN, Sajina A, Minoo D, John CZ, Mini PM, Tushar KV, Rema J, Ravindran PN. Micropropagation of camphor tree (Cinnamomum camphora) Plant Cell, Tissue and Organ Culture. 2003;74:179–183.
Benson (1999) Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Research. 1999;27(2):573–580. doi: 10.1093/nar/27.2.573. [PMC free article] [PubMed] [Cross Ref]
Bremer et al. (2009) Bremer B, Bremer K, Chase M, Fay M, Reveal J, Soltis D, Soltis P, Stevens P. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Botanical Journal of the Linnean Society. 2009;161(2):105–121. doi: 10.1111/j.1095-8339.2009.00996.x. [Cross Ref]
Cavalier-Smith (2002) Cavalier-Smith T. Chloroplast evolution: secondary symbiogenesis and multiple losses. Current Biology. 2002;12(2):R62–R64. doi: 10.1016/s0960-9822(01)00675-3. [PubMed] [Cross Ref]
Chanderbali, van der Werff & Renner (2001) Chanderbali AS, van der Werff H, Renner SS. Phylogeny and historical biogeography of Lauraceae: evidence from the chloroplast and nuclear genomes. Annals of the Missouri Botanical Garden. 2001;88(1):104–134. doi: 10.2307/2666133. [Cross Ref]
Chang et al. (2006) Chang CC, Lin HC, Lin IP, Chow TY, Chen HH, Chen WH, Cheng CH, Lin CY, Liu SM, Chang CC, Chaw SM. The chloroplast genome of Phalaenopsis aphrodite (Orchidaceae): comparative analysis of evolutionary rate with that of grasses and its phylogenetic implications. Molecular Biology and Evolution. 2006;23(2):279–291. doi: 10.1093/molbev/msj029. [PubMed] [Cross Ref]
Chen et al. (2015) Chen J, Hao Z, Xu H, Yang L, Liu G, Sheng Y, Zheng C, Zheng W, Cheng T, Shi J. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng. Frontiers in Plant Science. 2015;6:447. doi: 10.3389/fpls.2015.00447. [PMC free article] [PubMed] [Cross Ref]
Choi, Chung & Park (2016) Choi KS, Chung MG, Park S. The complete chloroplast genome sequences of three Veroniceae species (Plantaginaceae): comparative analysis and highly divergent regions. Frontiers in Plant Science. 2016;7:355. doi: 10.3389/fpls.2016.00355. [PMC free article] [PubMed] [Cross Ref]
Chumley et al. (2006) Chumley TW, Palmer JD, Mower JP, Fourcade HM, Calie PJ, Boore JL, Jansen RK. The complete chloroplast genome sequence of Pelargonium x hortorum: organization and evolution of the largest and most highly rearranged chloroplast genome of land plants. Molecular Biology and Evolution. 2006;23(11):2175–2190. doi: 10.1093/molbev/msl089. [PubMed] [Cross Ref]
Daniell et al. (2008) Daniell H, Wurdack KJ, Kanagaraj A, Lee S-B, Saski C, Jansen RK. The complete nucleotide sequence of the cassava (Manihot esculenta) chloroplast genome and the evolution of atpF in Malpighiales: RNA editing and multiple losses of a group II intron. Theoretical and Applied Genetics. 2008;116(5):723–737. doi: 10.1007/s00122-007-0706-y. [PMC free article] [PubMed] [Cross Ref]
Deguilloux, Pemonge & Petit (2004) Deguilloux M-F, Pemonge M-H, Petit RJ. Use of chloroplast microsatellites to differentiate oak populations. Annals of Forest Science. 2004;61(8):825–830. doi: 10.1051/forest:2004078. [Cross Ref]
Dong et al. (2012) Dong W, Liu J, Yu J, Wang L, Zhou S. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding. PLOS ONE. 2012;7(4):e35071 doi: 10.1371/journal.pone.0035071. [PMC free article] [PubMed] [Cross Ref]
Edgar (2004) Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32(5):1792–1797. doi: 10.1093/nar/gkh340. [PMC free article] [PubMed] [Cross Ref]
Gao et al. (2009) Gao L, Yi X, Yang YX, Su YJ, Wang T. Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes. BMC Evolutionary Biology. 2009;9(1):130. doi: 10.1186/1471-2148-9-130. [PMC free article] [PubMed] [Cross Ref]
Hall (1999) Hall TA. BioEdit: a user-friendly biological sequence alignment program for Windows 95/98/NT. Nucleic Acids Symposium. 1999;41:95–98.
Hinsinger & Strijk (2016) Hinsinger DD, Strijk JS. Toward phylogenomics of Lauraceae: the complete chloroplast genome sequence of Litsea glutinosa (Lauraceae), an invasive tree species on Indian and Pacific Ocean islands. Plant Gene. 2016;9:71–79. doi: 10.1016/j.plgene.2016.08.002. [Cross Ref]
Hirao et al. (2008) Hirao T, Watanabe A, Kurita M, Kondo T, Takata K. Complete nucleotide sequence of the Cryptomeria japonica D. Don. chloroplast genome and comparative chloroplast genomics: diversified genomic structure of coniferous species. BMC Plant Biology. 2008;8(1):70. doi: 10.1186/1471-2229-8-70. [PMC free article] [PubMed] [Cross Ref]
Huang et al. (2014) Huang H, Shi C, Liu Y, Mao S-Y, Gao L-Z. Thirteen Camellia chloroplast genome sequences determined by high-throughput sequencing: genome structure and phylogenetic relationships. BMC Evolutionary Biology. 2014;14(1):151. doi: 10.1186/1471-2148-14-151. [PMC free article] [PubMed] [Cross Ref]
Jansen et al. (2007) Jansen RK, Cai Z, Raubeson LA, Daniell H, Depamphilis CW, Leebens-Mack J, Müller KF, Guisinger-Bellian M, Haberle RC, Hansen AK, Chumley TW, Lee S-B, Peery R, McNeal JR, Kuehl JV, Boore JL. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proceedings of the National Academy of Sciences of the United States of America. 2007;104(49):19369–19374. doi: 10.1073/pnas.0709121104. [PubMed] [Cross Ref]
Jansen et al. (2008) Jansen RK, Wojciechowski MF, Sanniyasi E, Lee S-B, Daniell H. Complete plastid genome sequence of the chickpea (Cicer arietinum) and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae) Molecular Phylogenetics and Evolution. 2008;48(3):1204–1217. doi: 10.1016/j.ympev.2008.06.013. [PMC free article] [PubMed] [Cross Ref]
Katoh & Standley (2013) Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution. 2013;30(4):772–780. doi: 10.1093/molbev/mst010. [PMC free article] [PubMed] [Cross Ref]
Kyunghee et al. (2015) Kim K, Sang-Choon L, Junki L, Yu Y, Yang K, Beom-Soon C, Hee-Jong K, Espinosa WN, Hong-Il C, Nam-Hoon K, Jang W, Park H-S, Lee J, Lee HO, Joh HJ, Lee HJ, Park JY, Perumal S, Jayakodi M, Lee YS, Kim B, Capetti D, Kim S, Kim S, Lim K-B, K Y-D, Lee J, Cho K-S, Park B-S, Wing RA, Yang T-J. Complete chloroplast and ribosomal sequences for 30 accessions elucidate evolution of Oryza AA genome species. Scientific Reports. 2015;5(1):15655. doi: 10.1038/srep15655. [PMC free article] [PubMed] [Cross Ref]
Kuang et al. (2011) Kuang D-Y, Wu H, Wang Y-L, Gao L-M, Zhang S-Z, Lu L. Complete chloroplast genome sequence of Magnolia kwangsiensis (Magnoliaceae): implication for DNA barcoding and population genetics. Genome. 2011;54(8):663–673. doi: 10.1139/g11-026. [PubMed] [Cross Ref]
Kurtz et al. (2001) Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Research. 2001;29(22):4633–4642. doi: 10.1093/nar/29.22.4633. [PMC free article] [PubMed] [Cross Ref]
Lee, Lee & Choi (2013) Lee J-H, Lee D-H, Choi B-H. Phylogeography and genetic diversity of East Asian Neolitsea sericea (Lauraceae) based on variations in chloroplast DNA sequences. Journal of Plant Research. 2013;126(2):193–202. doi: 10.1007/s10265-012-0519-1. [PubMed] [Cross Ref]
Leebens-Mack et al. (2005) Leebens-Mack J, Raubeson LA, Cui L, Kuehl JV, Fourcade MH, Chumley TW, Boore JL, Jansen RK, dePamphilis CW. Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one’s way out of the Felsenstein zone. Molecular Biology and Evolution. 2005;22(10):1948–1963. doi: 10.1093/molbev/msi191. [PubMed] [Cross Ref]
Li et al. (2004) Li J, Christophel D, Conran J, Li H-W. Phylogenetic relationships within the ‘core’ Laureae (Litseacomplex, Lauraceae) inferred from sequences of the chloroplast gene matK and nuclear ribosomal DNA ITS regions. Plant Systematics and Evolution. 2004;246(1–2):19–34. doi: 10.1007/s00606-003-0113-z. [Cross Ref]
Li et al. (2008) Li R, Li Y, Kristiansen K, Wang J. SOAP: short oligonucleotide alignment program. Bioinformatics. 2008;24(5):713–714. doi: 10.1093/bioinformatics/btn025. [PubMed] [Cross Ref]
Librado & Rozas (2009) Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–1452. doi: 10.1093/bioinformatics/btp187. [PubMed] [Cross Ref]
Lin et al. (2010) Lin CP, Huang JP, Wu CS, Hsu CY, Chaw SM. Comparative chloroplast genomics reveals the evolution of Pinaceae genera and subfamilies. Genome Biology and Evolution. 2010;2:504–517. doi: 10.1093/gbe/evq036. [PMC free article] [PubMed] [Cross Ref]
Lohse, Drechsel & Bock (2007) Lohse M, Drechsel O, Bock R. OrganellarGenomeDRAW (OGDRAW): a tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Current Genetics. 2007;52(5–6):267–274. doi: 10.1007/s00294-007-0161-y. [PubMed] [Cross Ref]
Moore et al. (2007) Moore MJ, Bell CD, Soltis PS, Soltis DE. Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms. Proceedings of the National Academy of Sciences of the United States of America. 2007;104(49):19363–19368. doi: 10.1073/pnas.0708072104. [PubMed] [Cross Ref]
Nei & Li (1979) Nei M, Li W-H. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proceedings of the National Academy of Sciences of the United States of America. 1979;76(10):5269–5273. [PubMed]
Neuhaus & Emes (2000) Neuhaus HE, Emes MJ. Nonphotosynthetic metabolism in plastids. Annual Review of Plant Physiology and Plant Molecular Biology. 2000;51:111–140. doi: 10.1146/annurev.arplant.51.1.111. [PubMed] [Cross Ref]
Nie et al. (2012) Nie X, Lv S, Zhang Y, Du X, Wang L, Biradar SS, Tan X, Wan F, Song W. Complete chloroplast genome sequence of a major invasive species, Crofton weed (Ageratina adenophora) PLOS ONE. 2012;7(5):e36869 doi: 10.1371/journal.pone.0036869. [PMC free article] [PubMed] [Cross Ref]
Ohyama et al. (1986) Ohyama K, Fukuzawa H, Kohchi T, Shirai H, Sano T, Sano S, Umesono K, Shiki Y, Takeuchi M, Chang Z, Aota S-I, Inokuchi H, Ozeki H. Chloroplast gene organization deduced from complete sequence of liverwort Marchantia polymorpha chloroplast DNA. Nature. 1986;322(6079):572–574. doi: 10.1038/322572a0. [Cross Ref]
Perry & Wolfe (2002) Perry AS, Wolfe KH. Nucleotide substitution rates in legume chloroplast DNA depend on the presence of the inverted repeat. Journal of Molecular Evolution. 2002;55(5):501–508. doi: 10.1007/s00239-002-2333-y. [PubMed] [Cross Ref]
Qian et al. (2013) Qian J, Song J, Gao H, Zhu Y, Xu J, Pang X, Yao H, Sun C, Li X, Li C. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza. PLOS ONE. 2013;8(2):e57607 doi: 10.1371/journal.pone.0057607. [PMC free article] [PubMed] [Cross Ref]
Raman & Park (2016) Raman G, Park SJ. The complete chloroplast genome sequence of ampelopsis: gene organization, comparative analysis, and phylogenetic relationships to other angiosperms. Frontiers in Plant Science. 2016;7:341. doi: 10.3389/fpls.2016.00341. [PMC free article] [PubMed] [Cross Ref]
Redwan, Saidin & Kumar (2015) Redwan RM, Saidin A, Kumar SV. Complete chloroplast genome sequence of MD-2 pineapple and its comparative analysis among nine other plants from the subclass Commelinidae. BMC Plant Biology. 2015;15(1):1–20. doi: 10.1186/s12870-015-0619-x. [PMC free article] [PubMed] [Cross Ref]
Rohwer (1993) Rohwer JG. Lauraceae. In: Kubitski K, Rohwer JG, Bittrich V, editors. The Families and Genera of Flowering Plants. Flowering Plants Dicotyledons–Magnoliid, Hamamelid and Caryophyllid Families. Vol. 2. Berlin: Springer; 1993. pp. 366–391.
Rohwer (2000) Rohwer JG. Toward a phylogenetic classification of the Lauraceae: evidence from matK sequences. Systematic Botany. 2000;25(1):60–71. doi: 10.2307/2666673. [Cross Ref]
Rohwer & Rudolph (2005) Rohwer JG, Rudolph B. Jumping genera: the phylogenetic positions of Cassytha, Hypodaphnis, and Neocinnamomum (Lauraceae) based on different analyses of trnK intron sequences. Annals of the Missouri Botanical Garden. 2005;92(2):153–178.
Ruhfel et al. (2014) Ruhfel BR, Gitzendanner MA, Soltis PS, Soltis DE, Burleigh JG. From algae to angiosperms–inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes. BMC Evolutionary Biology. 2014;14(1):23. doi: 10.1186/1471-2148-14-23. [PMC free article] [PubMed] [Cross Ref]
Santos, Spironello & Sampaio (2008) Santos RP, Spironello WR, Sampaio PdTB. Genetic diversity in rosewood saplings (Aniba rosaeodora ducke, Lauraceae): an ecological approach. Acta Amazonica. 2008;38(4):707–714. doi: 10.1590/s0044-59672008000400014. [Cross Ref]
Schattner, Brooks & Lowe (2005) Schattner P, Brooks AN, Lowe TM. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Research. 2005;33:W686–W689. doi: 10.1093/nar/gki366. [PMC free article] [PubMed] [Cross Ref]
Shang & Tang (1994) Shang Q, Tang G. Pollen morphology of the family Lauraceae. Cathaya. 1994;7:53–62.
Sharp, Tuohy & Mosurski (1986) Sharp PM, Tuohy TMF, Mosurski KR. Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes. Nucleic Acids Research. 1986;14(13):5125–5143. doi: 10.1093/nar/14.13.5125. [PMC free article] [PubMed] [Cross Ref]
Shinozaki et al. (1986) Shinozaki K, Ohme M, Tanaka M, Wakasugi T, Hayashida N, Matsubayashi T, Zaita N, Chunwongse J, Obokata J, Yamaguchi-Shinozaki K, Ohto C, Torazawa K, Meng BY, Sugita M, Deno H, Kamogashira T, Yamada K, Kusuda J, Takaiwa F, Kato A, Tohdoh N, Shimada H, Sugiura M. The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression. EMBO Journal. 1986;5:2043–2049. [PubMed]
Solovyev et al. (2006) Solovyev V, Kosarev P, Seledsov I, Vorobyev D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biology. 2006;7(S1):S10. doi: 10.1186/Gb-2006-7-S1-S10. [PMC free article] [PubMed] [Cross Ref]
Song et al. (2015) Song Y, Dong W, Liu B, Xu C, Yao X, Gao J, Corlett RT. Comparative analysis of complete chloroplast genome sequences of two tropical trees Machilus yunnanensis and Machilus balansae in the family Lauraceae. Frontiers in Plant Science. 2015;6:662. doi: 10.3389/fpls.2015.00662. [PMC free article] [PubMed] [Cross Ref]
Song et al. (2016) Song Y, Yao X, Tan Y, Gan Y, Corlett RT. Complete chloroplast genome sequence of the avocado: gene organization, comparative analysis, and phylogenetic relationships with other Lauraceae. Canadian Journal of Forest Research. 2016;46(11):1293–1301. doi: 10.1139/cjfr-2016-0199. [Cross Ref]
Timme et al. (2007) Timme RE, Kuehl JV, Boore JL, Jansen RK. A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: identification of divergent regions and categorization of shared repeats. American Journal of Botany. 2007;94(3):302–312. doi: 10.3732/ajb.94.3.302. [PubMed] [Cross Ref]
Vendramin et al. (1999) Vendramin GG, Degen B, Petit RJ, Anzidei M, Madaghiele A, Ziegenhagen B. High level of variation at Abies alba chloroplast microsatellite loci in Europe. Molecular Ecology. 1999;8(7):1117–1126. doi: 10.1046/j.1365-294x.1999.00666.x. [PubMed] [Cross Ref]
von Stackelberg, Rensing & Reski (2006) von Stackelberg M, Rensing SA, Reski R. Identification of genic moss SSR markers and a comparative analysis of twenty-four algal and plant gene indices reveal species-specific rather than group-specific characteristics of microsatellites. BMC Plant Biology. 2006;6:9. [PMC free article] [PubMed]
Wang, Barkley & Jenkins (2009) Wang ML, Barkley NA, Jenkins TM. Microsatellite markers in plants and insects. Part I: applications of biotechnology. Genes, Genomes and Genomics. 2009;3:54–67.
Wicke et al. (2011) Wicke S, Schneeweiss GM, dePamphilis CW, Muller KF, Quandt D. The evolution of the plastid chromosome in land plants: gene content, gene order, gene function. Plant Molecular Biology. 2011;76(3–5):273–297. doi: 10.1007/s11103-011-9762-4. [PMC free article] [PubMed] [Cross Ref]
Wu, Ho & Chang (2016) Wu CC, Ho CK, Chang SH. The complete chloroplast genome of Cinnamomum kanehirae Hayata (Lauraceae) Mitochondrial DNA. 2016;27:2681. [PubMed]
Wu et al. (2009) Wu F-H, Kan D-P, Lee S-B, Daniell H, Lee Y-W, Lin C-C, Lin N-S, Lin C-S. Complete nucleotide sequence of Dendrocalamus latiflorus and Bambusa oldhamii chloroplast genomes. Tree Physiology. 2009;29(6):847–856. doi: 10.1093/treephys/tpp015. [PMC free article] [PubMed] [Cross Ref]
Xia (2013) Xia X. DAMBE5: a comprehensive software package for data analysis in molecular biology and evolution. Molecular Biology and Evolution. 2013;30(7):1720–1728. doi: 10.1093/molbev/mst064. [PMC free article] [PubMed] [Cross Ref]
Yang et al. (2013) Yang JB, Tang M, Li HT, Zhang ZR, Li DZ. Complete chloroplast genome of the genus Cymbidium: lights into the species identification, phylogenetic implications and population genetic analyses. BMC Evolutionary Biology. 2013;13(1):84. doi: 10.1186/1471-2148-13-84. [PMC free article] [PubMed] [Cross Ref]
Yi et al. (2013) Yi X, Gao L, Wang B, Su Y-J, Wang T. The complete chloroplast genome sequence of Cephalotaxus oliveri (Cephalotaxaceae): evolutionary comparison of Cephalotaxus chloroplast DNAs and insights into the loss of inverted repeat copies in gymnosperms. Genome Biology and Evolution. 2013;5(4):688–698. doi: 10.1093/gbe/evt042. [PMC free article] [PubMed] [Cross Ref]
Yurina & Odintsova (1998) Yurina NP, Odintsova MS. Comparative structural organization of plant chloroplast and mitochondrial genomes. Genetika. 1998;34:1–16.
Zhai et al. (2010) Zhai S-N, Yan X-L, Nakamura K, Mishima M, Qiu Y-X. Isolation of compound microsatellite markers for the endangered plant Neolitsea sericea (Lauraceae) American Journal of Botany. 2010;97(12):e139–e141. doi: 10.3732/ajb.1000348. [PubMed] [Cross Ref]
Zhang, Ma & Li (2011) Zhang Y-J, Ma P-F, Li D-Z. High-throughput sequencing of six bamboo chloroplast genomes: phylogenetic implications for temperate woody bamboos (Poaceae: Bambusoideae) PLOS ONE. 2011;6(5):e20596 doi: 10.1371/journal.pone.0020596. [PMC free article] [PubMed] [Cross Ref]
Zhang et al. (2016) Zhang Y, Du L, Ao L, Chen J, Li W, Hu W, Wei Z, Kim K, Lee SC, Yang TJ. The complete chloroplast genome sequences of five epimedium species: lights into phylogenetic and taxonomic analyses. Frontiers in Plant Science. 2016;7:306. doi: 10.3389/fpls.2016.00306. [PMC free article] [PubMed] [Cross Ref]

Articles from PeerJ are provided here courtesy of PeerJ, Inc