FineDiffusion: Fine-tuning text-to-image Diffusion Models with  Subject Personalization and Conditional Control

Muthunayaka, Geeth

dc.contributor.author	Muthunayaka, Geeth
dc.date.accessioned	2026-03-26T06:55:34Z
dc.date.available	2026-03-26T06:55:34Z
dc.date.issued	2025
dc.identifier.citation	"Muthunayaka, Geeth (2025) FineDiffusion: Fine-tuning text-to-image Diffusion Models with Subject Personalization and Conditional Control. BSc. Dissertation, Informatics Institute of Technology"	en_US
dc.identifier.issn	20200508
dc.identifier.uri	http://dlib.iit.ac.lk/xmlui/handle/123456789/3070
dc.description.abstract	In recent years, text-to-image generation has gained a lot of attention. Diffusion models have been proven to be the state-of-the-art in this domain. However, due to their high computational demands, most of the current research focuses on improving their efficiency and image quality. It has also been identified that existing text-to-image solutions have very limited usability and applicability due to their lack of control. This project aims to address the lack of control and customization in text-to-image diffusion models by developing a solution that enhances their controllability and customizability. This Project proposes a unified architecture and pipeline that combines multiple fine-tuning techniques to enables both subject personalization and conditional control. Subject personalization allows for customized image generation of specific subjects, and conditional control enables the diffusion model to utilize conditioning images during the image generation process. The diffusion model must be fine-tuned with multiple datasets to enable these techniques. The prototype implementation successfully demonstrates the core functionalities of the proposed solution. Based on the qualitative self-evaluation, the implemented architecture and pipeline demonstrates the primary fine-tuning techniques with satisfactory results. The fine tuned latent diffusion model utilised in the prototype achieved a quantitative CLIP Score of 71.15	en_US
dc.language.iso	en	en_US
dc.subject	Text to image	en_US
dc.subject	Diffusion	en_US
dc.subject	Controllability	en_US
dc.subject	Customizability	en_US
dc.title	FineDiffusion: Fine-tuning text-to-image Diffusion Models with Subject Personalization and Conditional Control	en_US
dc.type	Thesis	en_US