dc.contributor.author |
De Silva, Vathila |
|
dc.contributor.author |
Sumanathilaka, T.G.D.K. |
|
dc.date.accessioned |
2025-04-21T08:15:24Z |
|
dc.date.available |
2025-04-21T08:15:24Z |
|
dc.date.issued |
2024 |
|
dc.identifier.citation |
De Silva, V. and Sumanathilaka, T.G.D.K. (2024) ‘A Survey on Image Captioning Using Object Detection and NLP’, in 2024 4th International Conference on Advanced Research in Computing (ICARC), pp. 270–275. Available at: https://doi.org/10.1109/ICARC61713.2024.10499755. |
en_US |
dc.identifier.uri |
http://dlib.iit.ac.lk/xmlui/handle/123456789/2250 |
|
dc.description.abstract |
Image captioning has emerged in recent years as a significant technical development that combines computer vision and natural language processing. Integrating the two domains makes it possible to generate descriptive captions for images, supporting a deeper understanding of visual content. This study offers a thorough analysis of the rapidly developing field of image captioning, examining its uses in settings such as social media, online platforms, assistive technology, and content indexing. It emphasizes the importance of advanced techniques such as You Only Look Once (YOLO) for accurate object detection and Natural Language Processing (NLP) for generating nuanced captions. YOLO provides precise object detection, which improves caption accuracy, while NLP adds contextual depth to the resulting text. The use of these methods improves the overall efficacy of image captioning systems and represents a noteworthy trend in the literature. In tracing the evolution of image captioning, the review highlights its importance for multimodal comprehension. Image captioning improves the readability of visual content and shapes contemporary digital experiences in various contexts. In summary, the survey outlines the development of image captioning and the trend toward techniques that produce more accurate and nuanced captions. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
IEEE |
en_US |
dc.subject |
Computer Vision |
en_US |
dc.subject |
Natural Language Processing |
en_US |
dc.subject |
Object Detection |
en_US |
dc.title |
A Survey on Image Captioning Using Object Detection and NLP |
en_US |
dc.type |
Article |
en_US |