Digital Repository

FineDiffusion: Fine-tuning text-to-image Diffusion Models with Subject Personalization and Conditional Control

Show simple item record

dc.contributor.author Muthunayaka, Geeth
dc.date.accessioned 2026-03-26T06:55:34Z
dc.date.available 2026-03-26T06:55:34Z
dc.date.issued 2025
dc.identifier.citation "Muthunayaka, Geeth (2025) FineDiffusion: Fine-tuning text-to-image Diffusion Models with Subject Personalization and Conditional Control. BSc. Dissertation, Informatics Institute of Technology" en_US
dc.identifier.issn 20200508
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/3070
dc.description.abstract In recent years, text-to-image generation has gained a lot of attention. Diffusion models have been proven to be the state-of-the-art in this domain. However, due to their high computational demands, most of the current research focuses on improving their efficiency and image quality. It has also been identified that existing text-to-image solutions have very limited usability and applicability due to their lack of control. This project aims to address the lack of control and customization in text-to-image diffusion models by developing a solution that enhances their controllability and customizability. This Project proposes a unified architecture and pipeline that combines multiple fine-tuning techniques to enables both subject personalization and conditional control. Subject personalization allows for customized image generation of specific subjects, and conditional control enables the diffusion model to utilize conditioning images during the image generation process. The diffusion model must be fine-tuned with multiple datasets to enable these techniques. The prototype implementation successfully demonstrates the core functionalities of the proposed solution. Based on the qualitative self-evaluation, the implemented architecture and pipeline demonstrates the primary fine-tuning techniques with satisfactory results. The fine tuned latent diffusion model utilised in the prototype achieved a quantitative CLIP Score of 71.15 en_US
dc.language.iso en en_US
dc.subject Text to image en_US
dc.subject Diffusion en_US
dc.subject Controllability en_US
dc.subject Customizability en_US
dc.title FineDiffusion: Fine-tuning text-to-image Diffusion Models with Subject Personalization and Conditional Control en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account