Digital Repository

A Machine Learning Approach to Detect Image Based Web Phishing Attempts

Show simple item record

dc.contributor.author Abeysooriya, S.R
dc.date.accessioned 2022-03-16T06:19:06Z
dc.date.available 2022-03-16T06:19:06Z
dc.date.issued 2021
dc.identifier.citation "Abeysooriya, S.R (2021) A Machine Learning Approach to Detect Image Based Web Phishing Attempts. BSc. Dissertation Informatics Institute of Technology" en_US
dc.identifier.issn 2017459
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1003
dc.description.abstract " Phishing is one of the oldest type of cybercrimes which tricks people to obtain their sensitive information such as usernames, passwords, and credit card information. Time to time the attackers have changed their methods for phishing to avoid being detected by the security protocols. If a phishing website is hosted under a legitimate domain or if the website content contains only images, it would be able to avoid being detected by the security system algorithms since there’s only a slight chance of identifying any suspicious elements from such sites to make a prediction. In this proposed machine learning approach to detect image-based web phishing attempts, the main aim was to develop a system that would be able to detect phishing websites that uses image-based attempts to avoid being detected. This image-based phishing detection system has taken a novel approach by using combined sets of features to identify any suspicious elements from the site such as URL, domain, content, and images. A brand new dataset has been compiled with previously addressed element data in order to train a high accurate model with the use of best fitting ml algorithm. In this case random forest algorithm has been selected for the model classification after comparing with several other algorithms. Compiled dataset was fed up with enough phishing and legitimate URL data from various source with unique elements to maintain a higher accuracy. A finalized prototype application has been developed based on the proposed solution for the end users. The development was carried out in a Python environment since it goes well with data science related projects. Before finalizing the project, the implemented system was tested properly according to a certain criteria and evaluated with the guidance from domain and technical experts in order to confirm that the taken approach was a success. " en_US
dc.language.iso en en_US
dc.subject Python en_US
dc.subject Random Forest en_US
dc.subject Machine Learning en_US
dc.subject Phishing Detection en_US
dc.title A Machine Learning Approach to Detect Image Based Web Phishing Attempts en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account