Digital Repository

A System to Detect Correlations Between Two Research Papers Using Natural Language Processing


dc.contributor.author Silva, Gayani
dc.date.accessioned 2024-03-04T06:34:40Z
dc.date.available 2024-03-04T06:34:40Z
dc.date.issued 2023
dc.identifier.citation Silva, Gayani (2023) A System to Detect Correlations Between Two Research Papers Using Natural Language Processing. BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2018832
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1822
dc.description.abstract People who regularly work with academic papers frequently summarize and compare the contents of research papers, which is an exhausting and challenging task that may also contribute to mental stress or depression in students. To address this, the work investigates a novel approach for measuring the semantic similarity of scientific paper texts. The approach fine-tunes SciBERT, a deep learning transformer-based model, for the semantic textual similarity task. Fine-tuning is done by training the model on the SICKR-STS dataset and optimizing hyperparameters as required. The final model consists of two phases, combining cross-encoder and bi-encoder techniques to achieve better results than previous work. The proposed system, SIMILARS, is evaluated on the test data of the SICKR-STS benchmark dataset. Performance is determined by evaluating the predicted similarity scores using the Pearson and Spearman correlation metrics. Fine-tuning improved the Pearson correlation score from 0.65 to 0.91 and the Spearman correlation score from 0.61 to 0.84. en_US
dc.language.iso en en_US
dc.publisher IIT en_US
dc.subject Natural Language Processing en_US
dc.subject Semantic Textual Similarity en_US
dc.subject Information Retrieval en_US
dc.title A System to Detect Correlations Between Two Research Papers Using Natural Language Processing en_US
dc.type Thesis en_US
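
The abstract reports Pearson and Spearman correlation scores as the evaluation metrics. As a minimal sketch of how such scores can be computed from gold and predicted similarity values, the following uses only the Python standard library. The function names and the sample scores are hypothetical illustrations and are not taken from the dissertation or from the SICKR-STS dataset.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation: covariance divided by the product of spreads."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """Assign 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical gold similarity labels and model predictions for five pairs.
gold = [1.0, 2.5, 3.0, 4.2, 5.0]
predicted = [0.10, 0.35, 0.30, 0.80, 0.95]

print("Pearson:", round(pearson(gold, predicted), 3))
print("Spearman:", round(spearman(gold, predicted), 3))
```

Pearson measures how linearly the predicted scores track the gold labels, while Spearman depends only on whether the pairs are ordered the same way, which is one reason the two scores for a model can differ.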

