Digital Repository

Image Caption: Design and Implementation of an Image Caption generating Screen Reader

Show simple item record

dc.contributor.author Liyanage, Onel
dc.date.accessioned 2025-06-27T03:20:24Z
dc.date.available 2025-06-27T03:20:24Z
dc.date.issued 2024
dc.identifier.citation Liyanage, Onel (2024) Image Caption: Design and Implementation of an Image Caption generating Screen Reader. BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2019155
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/2707
dc.description.abstract " The rapid growth of visual content on the internet presents challenges and opportunities for accessibility and information retrieval. This project aims to enhance web accessibility by developing an intelligent browser extension that automates the process of generating descriptive captions for images on websites. The extension scrapes images from web pages, uploads them to a Flask server equipped with a pre-trained image captioning model, and replaces the alt text of each image with the generated captions. The image captioning model is trained on a large dataset of images and corresponding captions, leveraging deep learning techniques to understand and describe visual content accurately. By integrating this model into a browser extension, the project seeks to improve the accessibility of web content for visually impaired users, enhance search engine optimization (SEO), and provide better content descriptions for various applications. The implementation involves creating a seamless user experience where the extension runs in the background, identifying images, processing them, and updating the webpage without user intervention. The Flask server handles image processing and caption generation, ensuring scalability and efficiency. This project demonstrates the potential of combining web scraping, machine learning, and browser extensions to create tools that enhance the web's usability and accessibility. Through this innovative solution, the project addresses the need for automated image descriptions, providing a practical application of artificial intelligence in improving web accessibility and enriching user experiences. The development and deployment processes, along with the challenges faced and solutions implemented, are documented to provide a comprehensive understanding of the project's scope and impact." en_US
dc.language.iso en en_US
dc.subject Web Accessibility en_US
dc.subject Image Captioning en_US
dc.subject Image Processing en_US
dc.title Image Caption: Design and Implementation of an Image Caption generating Screen Reader en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account