Few-Shot Learning in Writer Identification: Achieving Cross- Lingual Generalization with Limited Data

Ranawaka, Imesh

dc.contributor.author	Ranawaka, Imesh
dc.date.accessioned	2026-03-11T03:27:47Z
dc.date.available	2026-03-11T03:27:47Z
dc.date.issued	2025
dc.identifier.citation	Ranawaka, Imesh (2025) Few-Shot Learning in Writer Identification: Achieving Cross- Lingual Generalization with Limited Data. Msc. Dissertation, Informatics Institute of Technology	en_US
dc.identifier.issn	20221787
dc.identifier.uri	http://dlib.iit.ac.lk/xmlui/handle/123456789/2909
dc.description.abstract	Writer identification, a crucial task in forensic analysis and historical document authentication. faces significant challenges with data scarcity and cross-lingual application Existing systems struggle when confronted with limited handwriting samples per writer or diverse languages. limiting their real-world applicability. Traditional methods exhibit reduced accuracy with minimal sample sizes, while practical applications inherently deal with limited data due to resource constraınts. Researchers have proposed various machine learning approaches to address these challenges, but current systems mostly focus on supervised learning requiring extensive datasets. which are frequently unavailable in real-world scenarios. The adaptation of few-shot learning techniques specifically for writer identification across multiple languages remaıns largely unexplored. In this project, the author proposes a novel few-shot learning system combining Prototypical Networks with CNN backbones, specifically adapted for writer identification tasks The system employs a three-tiered architecture comprising a Python-based ML core implementing episodic training, a.NET Core backend providing REST APIs and a React frontend for end users., Key innovations include adaptation of pre-trained GoogLeNet models for grayscale handwriting analysis through weight averaging techniques and scalable microservices architecture. Date collection utilizes publicly available datasets (IAM, CVL, CERUG, Firemaker) across English, German, Chinese and Dutch languages The final model achieved 87% accuracy with high precision, recall and F1-scores using only 5 samples per writer during training. This study advances few-shot learning applications for robust writer identification in data-constrained multilingual environments, demonstrating practica] viability through scalable architecture that bridges research and forensic applications	en_US
dc.language.iso	en	en_US
dc.subject	Writer Identification	en_US
dc.subject	Combining Prototypical Networks	en_US
dc.subject	Machine Learning	en_US
dc.title	Few-Shot Learning in Writer Identification: Achieving Cross- Lingual Generalization with Limited Data	en_US
dc.type	Thesis	en_US