UPM Institutional Repository

Few-shot font generation via denoising diffusion and component-level fine-grained style


Citation

Liu, Yu and Ding, Yang and Khalid, Fatimah and Wang, Cunrui and Wang, Lei (2025) Few-shot font generation via denoising diffusion and component-level fine-grained style. Expert Systems with Applications, 296. art. no. 128987. pp. 1-16. ISSN 0957-4174 (In Press)

Abstract

Few-shot font generation aims to create new fonts using a small number of style examples. It is increasingly gaining attention due to its significant reduction in labor costs. Existing methods rely on GAN-based image-to-image style-transfer frameworks, which are prone to training collapse and struggle to maintain consistency between character content and style. Moreover, they capture only the global style while overlooking fine-grained features of radicals, components, and strokes. To address these challenges, we propose a diffusion model-based image-to-image font generation method.We fully consider the component styles between content glyphs and reference glyphs, assigning appropriate fine-grained styles to content glyphs through a multi-character style aggregation module. Additionally, in order to better preserve the integrity of character structures during the denoising iteration process, we propose leveraging an offset-enhanced multi-head attention mechanism. This mechanism adaptively samples and embeds multi-scale glyph content features into the diffusion model. Comprehensive experiments demonstrate that our method outperforms existing font generation methods both qualitatively and quantitatively.


Download File

[img] Text
122473.pdf - Published Version
Restricted to Repository staff only

Download (9MB)
[img] Text
122473-Accepted.pdf - Accepted Version
Restricted to Repository staff only

Download (2MB)

Additional Metadata

Item Type: Article
Subject: Engineering (all)
Subject: Computer Science Applications
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.1016/j.eswa.2025.128987
Publisher: Elsevier
Keywords: Attention mechanism; Deformable convolution; Diffusion model; Few-shot font generation
Depositing User: Ms. Che Wa Zakaria
Date Deposited: 19 Jan 2026 04:43
Last Modified: 26 Jan 2026 00:52
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1016/j.eswa.2025.128987
URI: http://psasir.upm.edu.my/id/eprint/122473
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item