A Survey on Image Captioning Using Object Detection and NLP

De Silva, Vathila; Sumanathilaka, T.G.D.K.

Home
→
Conference Papers, Journal Articles
→
2024 Conference Papers & Journal Articles
→
Conferance Papers
→
View Item

dc.contributor.author	De Silva, Vathila
dc.contributor.author	Sumanathilaka, T.G.D.K.
dc.date.accessioned	2025-04-21T08:15:24Z
dc.date.available	2025-04-21T08:15:24Z
dc.date.issued	2024
dc.identifier.citation	De Silva, V. and Sumanathilaka, T.G.D.K. (2024) ‘A Survey on Image Captioning Using Object Detection and NLP’, in 2024 4th International Conference on Advanced Research in Computing (ICARC). 2024 4th International Conference on Advanced Research in Computing (ICARC), pp. 270–275. Available at: https://doi.org/10.1109/ICARC61713.2024.10499755.	en_US
dc.identifier.uri	http://dlib.iit.ac.lk/xmlui/handle/123456789/2250
dc.description.abstract	Recent years have seen the emergence of image captioning as a revolutionary technical development that seamlessly blends computer vision and natural language processing. The integration of various domains facilitates the production of captivating captions for pictures, promoting a more profound comprehension of visual material. This study offers a thorough analysis of the rapidly developing field of image captioning, examining its uses in various settings such as social media, online platforms, assistive technology, and content indexing. It emphasizes the critical importance of advanced techniques like You Only Look Once (YOLO) for accurate item detection and Natural Language Processing (NLP) for creating subtle captions. While NLP gives the resulting text a layer of contextual depth, YOLO guarantees precise object detection, which helps with caption accuracy. Using these cutting-edge methods boosts the overall efficacy of image captioning systems and represents a noteworthy trend in the literature. Examining the evolution of image captioning, the review paper highlights how important it is for multimodal comprehension. Image captioning is a transformative tool that enhances the readability of visual content and influences contemporary digital experiences in various contexts. The development of image captioning is summarized in this abstract, which also highlights the tendency to use cutting-edge techniques for more accurate and nuanced caption production.	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.subject	Computer vision	en_US
dc.subject	Natural Language Processing	en_US
dc.subject	Object Detection	en_US
dc.title	A Survey on Image Captioning Using Object Detection and NLP	en_US
dc.type	Article	en_US