UPM Institutional Repository

Proteogenomic gene structure validation in the pineapple genome


Citation

Ariffin, Norazrin and Newman, David Wells and O’cualain, Ronan and Nelson, Michael G. and Hubbard, Simon J. Proteogenomic gene structure validation in the pineapple genome. Journal of Proteome Research, 23 (5). pp. 1583-1592.

Abstract

MD2 pineapple (Ananas comosus) is the second most important tropical crop that preserves crassulacean acid metabolism (CAM), a pathway with high water-use efficiency, and it is fast becoming the most consumed fresh fruit worldwide. Despite this environmental efficiency and popularity, until very recently its genome sequence had not been determined and a high-quality annotated proteome was not available. Here, we have undertaken a pilot proteogenomic study, analyzing the proteome of MD2 pineapple leaves using liquid chromatography-tandem mass spectrometry (LC-MS/MS), which validates 1781 predicted proteins in the annotated F153 (V3) genome. In addition, a further 603 peptide identifications map exclusively to an independent MD2 transcriptome-derived database and are not found in the standard F153 (V3) annotated proteome. Peptide identifications derived from these MD2 transcripts are also cross-referenced to a more recent and complete MD2 genome annotation, resulting in 402 nonoverlapping peptides, which in turn support 30 high-quality gene candidates novel to both pineapple genomes. Many of the validated F153 (V3) genes are also supported by an independent proteomics data set collected for an ornamental pineapple variety. The contigs and peptides have been mapped to the current F153 genome build and are available as BED files to display a custom gene track on the Ensembl Plants region viewer. These analyses add to the knowledge of experimentally validated pineapple genes and demonstrate the utility of transcript-derived proteomics to discover both novel genes and genetic structure in a plant genome, adding value to its annotation.
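The two bookkeeping steps described in the abstract, selecting peptides that match only the transcriptome-derived database and exporting mapped peptides as BED records, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, peptide sequences, contig name, and coordinates below are hypothetical and chosen only for illustration.

```python
def exclusive_peptides(transcript_hits, reference_hits):
    """Return peptides identified against the transcript-derived database
    that do not occur among the reference-proteome identifications."""
    return set(transcript_hits) - set(reference_hits)


def bed_line(chrom, start, end, name, strand="+", score=0):
    """Format one BED6 record (chrom, start, end, name, score, strand).

    BED coordinates are 0-based and half-open, so start must be < end.
    """
    if start < 0 or end <= start:
        raise ValueError("BED requires 0 <= start < end")
    fields = (chrom, start, end, name, score, strand)
    return "\t".join(str(field) for field in fields)


# Hypothetical peptide identifications from two database searches.
md2_transcript_hits = ["AAGLDEK", "LLSPTVR", "VNDEFYK"]
f153_reference_hits = ["LLSPTVR"]

novel = exclusive_peptides(md2_transcript_hits, f153_reference_hits)

# Hypothetical genome placement for one transcript-only peptide.
record = bed_line("contig_1", 1200, 1221, "AAGLDEK", strand="-")
```

A file of such tab-separated records, one per mapped peptide or contig, is the general form that genome browsers accept as a custom track.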


Download File

116163.pdf - Published Version
Available under License Creative Commons Attribution.

Additional Metadata

Item Type: Article
Divisions: Faculty of Agriculture
DOI Number: https://doi.org/10.1021/acs.jproteome.3c00675.s003
Publisher: American Chemical Society
Keywords: Proteomics; Genomics; Proteogenomics; Computational biology; Genome annotation
Depositing User: Ms. Azian Edawati Zakaria
Date Deposited: 19 Mar 2025 08:32
Last Modified: 19 Mar 2025 08:32
URI: http://psasir.upm.edu.my/id/eprint/116163