Digital Repository

Molecules Identification from 1H NMR Spectrum Using Deep Learning Techniques

Show simple item record

dc.contributor.author Chamali, Madduma Hallinna Liyanage
dc.date.accessioned 2022-02-25T09:14:48Z
dc.date.available 2022-02-25T09:14:48Z
dc.date.issued 2021
dc.identifier.citation Chamali, Madduma Hallinna Liyanage (2021) Molecules Identification from 1H NMR Spectrum Using Deep Learning Techniques. MSc. Dissertation Informatics Institute of Technology en_US
dc.identifier.issn 2018284
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/773
dc.description.abstract Integrating the chemistry concepts about NMR and the deep learning technology advance concepts, in here predict the SMILES structure of molecules for the given 1H NMR spectrum. The NMR is the principal method to identify the molecular structure and identify the content and purity of a sample. 1H NMR which is also called proton NMR is more popular for identifying organic molecule structures among chemistry experts and analyzing it manually, so it is time-consuming. However, it is really challenging to implement a system to predict molecular structure for a given NMR because it is required to identify the 1H NMR image correctly and required to generate a correct sequence of SMILES with a meaningful molecule. That means this research is a combination of computer vision and natural language processing (NLP) capabilities. Manually downloaded 1H NMR public data set, and open-source SMILES data set was used to carry out this research. The resized NMR image was converted into its feature vector using the InceptionV3 model in CNN referring to the transfer learning techniques. Before training the model, SMILES was tokenized into the characters and combined with the 1H NMR image feature vector. Then SMILES was converted into a vector. The LSTM technique in deep learning was used to identify the hidden pattern of SMILES characters. The Seq2seq approach is utilized to merge this CNN and RNN model and return a single model following the encoder-decoder concepts to predict molecule structure based on the given 1H NMR spectrum. The hyper-parameter techniques were carried out to maximize the predictivity of this deep learning model. This implemented model was evaluated using ROUGH techniques and with the benchmark study. The implemented system predicts molecular structure in SMILES notation for provided 1H NMR spectrum less than 2.5 seconds and the model accuracy around 90%. en_US
dc.language.iso en en_US
dc.subject LSTM en_US
dc.subject RNN en_US
dc.subject CNN en_US
dc.subject Encoder-decoder en_US
dc.subject Seq2Seq en_US
dc.subject SMILES en_US
dc.subject 1H NMR speccturm en_US
dc.subject Deep Learning en_US
dc.subject Computational Chemistry en_US
dc.title Molecules Identification from 1H NMR Spectrum Using Deep Learning Techniques en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account