Digital Repository

CA-VQA: Context Aware - Visual Question Answering

Show simple item record

dc.contributor.author Nuha, M.N
dc.date.accessioned 2022-03-11T05:48:27Z
dc.date.available 2022-03-11T05:48:27Z
dc.date.issued 2021
dc.identifier.citation Nuha, M.N (2021) CA-VQA: Context Aware - Visual Question Answering. BSc. Dissertation Informatics Institute of Technology en_US
dc.identifier.issn 2017039
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/912
dc.description.abstract " VQA has emerged as a multidisciplinary challenge that integrates both vision and language processing. It has gained a lot of interest from both computer vision and natural language processing communities. Simply put, given an image and a natural language question about the image, a VQA task requires the system to find the correct answer by combining visual features of the image with inferences drawn from the question. A successful system must be able to comprehend an image semantically, understand the textual input, and generate a response based on its visual, textual, and logical interpretation of the inputs. This is typically done by making use of deep learning based techniques that extract the visual features of the image and the textual features of the question. Many researchers are interested in VQA because of its various application. Multiple methods and approaches have been utilized to propose a number of VQA models over time. The author believes that given the complexity of an image, simply using the image and the question is insufficient for a VQA model to perform well. Considering this, the author proposes CA-VQA, an experiment that aims to validate the use of additional textual information about an image in order to produce better results. A third input is used by CA-VQA to define the image. The accuracy of the proposed method is 59.19%, which is considered acceptable" en_US
dc.language.iso en en_US
dc.subject Natural Language Processing en_US
dc.subject Visual Question Answering en_US
dc.subject Computer Vision en_US
dc.subject Deep Learning en_US
dc.title CA-VQA: Context Aware - Visual Question Answering en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account