Digital Repository

Fine-tuning Pre-trained Deep Bidirectional BERT for Web Page Classification

Show simple item record

dc.contributor.author Fernando, Sumudu
dc.date.accessioned 2024-04-02T06:22:11Z
dc.date.available 2024-04-02T06:22:11Z
dc.date.issued 2023
dc.identifier.citation Fernando, Sumudu (2023) Fine-tuning Pre-trained Deep Bidirectional BERT for Web Page Classification. BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2018328
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1964
dc.description.abstract "With its vast amounts and wide variety of information, the World Wide Web has become one of the richest and most widely available information sources in the current information-driven society. With the advancements in computer networking technologies and with the accessibility and advancements of the internet, the information on the internet is constantly growing at a fast rate which is beneficial for its users in many ways. However, the excessive information on the internet has become the cause of several problems in the areas of web information management, retrieval and integration, web content filtering, parental control systems, and many more. Also, the excessive amount of information can be detrimental to regular users of the internet. The research project proposes a novel approach for web page classification using Bidirectional Encoder Representations from Transformers (BERT). BERT uses deep bidirectional self-attention to generate contextual representations for text sequences. Unlike unidirectional or pseudo-bidirectional models, BERT learns the context of a word concerning its surroundings rather than the sequence of words and produces accurate results compared to directional models. As one of the few research approaches using deep learning techniques for web page classification, this novel approach could provide a valuable contribution to the research domain. The research has conducted extensive experiments to identify the optimal conditions for the research project and has achieved satisfactory results in comparison to the other research approaches in the domain of web classification with limited resources and within a limited time frame." en_US
dc.language.iso en en_US
dc.subject BERT Fine-Tuning en_US
dc.subject Web Page Classification en_US
dc.subject Web Classification en_US
dc.title Fine-tuning Pre-trained Deep Bidirectional BERT for Web Page Classification en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account