UPM Institutional Repository

Development of chloroplast genome library for DNA tracking and conservation of endangered Aquilaria species native to Malaysia


Citation

Hishamuddin, Muhammad Syahmi (2022) Development of chloroplast genome library for DNA tracking and conservation of endangered Aquilaria species native to Malaysia. Doctoral thesis, Universiti Putra Malaysia.

Abstract

Aquilaria Lam., species [Aquilaria beccariana Tiegh., (gaharu buaya), Aquilaria crassna Pierre ex Lecomte, (gaharu), Aquilaria hirta Ridl., (chandan bulu), Aquilaria malaccensis Lam., (ching karas), Aquilaria microcarpa Baill., (gaharu putih), Aquilaria rostrata Ridl., (candan gunung), Aquilaria sinensis (gaharu) and Aquilaria subintegra Ding Hou, (gaharu)] is a naturally distributed in the Indomalesian region and are protected against over-exploitation. Aquilaria is highly prized for its unique scented resin, agarwood, which is often the subject of unlawful trade activities. Survival of the tree is heavily threatened by destructive harvesting and agarwood poaching, leading to its protection under the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). Ambiguous species delimitation and limited genetic information within Aquilaria are among the impediments to conservation efforts. In this study, a comparative analysis was performed on eight Aquilaria species complete chloroplast (cp) genomes, of which seven were newly sequenced using Illumina HiSeq X Ten platform followed by de novo assembly. All Aquilaria species utilised in this study were collected during fieldwork and the Aquilaria germplasm of Forest Research Institute of Malaysia (FRIM), and grown in the greenhouse of the Faculty of Forestry and Environment, Universiti Putra Malaysia. All Aquilaria species have been authenticated by experts. To conduct comparative cp genome analysis, the sequences of all eight Aquilaria cp genomes were aligned using MAFFT v7 and then imported into DnaSP v5.10.1 to determine nucleotide diversity in the total genome, LSC, SSC, and IR regions. The boundaries between the IR and SC regions were manually examined to determine the differences in length variation between Aquilaria cp genomes. Aquilaria cp genomes possess a typical quadripartite structure including gene order and genomic structure. The length of each of the cp genome is about 174 kbp and encoded between 89 and 92 proteins, 38 tRNAs, and 8 rRNAs, with 27 duplicated in the IR (inverted repeat) region. Besides, 832 repeats (forward, reverse, palindrome and complement repeats) and nine highly variable regions (HVR) were also identified. The phylogenetic analysis performed using Maximum likelihood (ML) and Bayesian inference (BI) suggests that the topology structure of Aquilaria cp genomes were well presented with strong support values based on the cp genomes data set and matches their geographic distribution pattern. Five of the nine HVR (matK-rps16, ndhF-rpl32, psbJ-petA, trnD, and trnT-trnL) were selected based on a cut off value of >0.01. These regions were further analyzed using the neighbor-joining (NJ) method to assess their ability at discriminating the eight species. Coupled with in silico primer design, two potential barcoding regions, psbJ-petA and trnT-trnL, were identified. Their strengths in species delimitation were evaluated individually and in combination, via DNA barcoding analysis. The results showed that the combined dataset, psbJ-petA+trnT-trnL, effectively resolved members of the genus Aquilaria by clustering all species into their respective clades. In addition, this study show great potential for agarwood identification that the newly proposed DNA barcode was capable at identifying the species of origin of six commercial agarwood samples that were included as unknown samples. Such achievement offers a new technical advancement, useful in the combat against illicit agarwood trades and in assisting the conservation of these valuable species in natural populations. The identification of simple sequence repeats (SSR) across complete cp genome sequences from eight Aquilaria species has never been reported before. Perhaps because developing these genetic are regarded as laborious and expensive. To overcome this issue, the current work also aims to find SSRs in Aquilaria cp genomes using silico methods. For identification of the simple sequence repeats (SSRs), MISA PERL script which had a repeat length of 12 for mononucleotides (mono-), 6 for dinucleotides (di-), 4 for trinucleotides (tri-), 3 for tetranucleotides (tetra-), pentanucleotides (penta-), and hexanucleotides (hexa-), respectively, along with frequency were utilized. From a total of 312 SSRs that were discovered, merely 50 (16%) were found localized within the coding region while the majority (84%) were within the intergenic regions, with an average of one SSR per 4.5 kb. The mean length of the SSRs were 11.63 bp. Mono- repeats were the predominant motifs (29.2%), followed by tetra- (28.8%), di- (20.5%), tri- (19.9%), and penta- (1.6%). Whereas the most recurring motifs were A/T (97.8%) for mono-, AT/AT (87.5%) for di-, AAT/ATT (48.4%) for tri-, and AAAT/ATTT (45.6%) for tetra-. GO analysis using the REVIGO software identified four molecular functions, six biological processes and three cellular components. In conclusion, findings of this SSR offer a scientific foundation for future phylogenetics, evolutionary genetics, diversity studies and breeding programs on Aquilaria species.


Download File

[img] Text
FPAS 2022 17 IR.pdf

Download (1MB)

Additional Metadata

Item Type: Thesis (Doctoral)
Subject: Aquilaria - Malaysia
Subject: Plant genetics
Call Number: FPAS 2022 17
Chairman Supervisor: Professor Rozi Mohamed, PhD
Divisions: Faculty of Forestry and Environment
Depositing User: Editor
Date Deposited: 11 Oct 2023 08:07
Last Modified: 11 Oct 2023 08:07
URI: http://psasir.upm.edu.my/id/eprint/104753
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item