Digital Repository

LawE (Tokenization of legal text for much efficient search results)

Show simple item record

dc.contributor.author Perera, Malik Praveen
dc.date.accessioned 2020-05-19T15:27:24Z
dc.date.available 2020-05-19T15:27:24Z
dc.date.issued 2019
dc.identifier.citation Perera, Malik Praveen (2019) LawE (Tokenization of legal text for much efficient search results). BSc. Dissertation Informatics Institute of Technology. en_US
dc.identifier.other 2015124
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/381
dc.description.abstract Law and legal documents have language at its heart, similar to what we find in the languages we use in day to day life but varies with numerous technical and nontechnical words used. It is due to these reasons that such a domain with ubiquity and societal importance has not received much attention in the world of Natural Processing Language. There have been certain attempts in trying to use the tools and libraries used for languages in the domain of law and they have succeeded up to a certain degree, but the accuracy levels have not been satisfactory. The past couple of years there has been some major strides in the improvement of NLP, NLTK libraries and tools etc. specifically targeting the legal domain. LexNLP is such library that tokenizers a verity of keywords, numbers, conditions etc. specifically structured according to the language structure found in legal text. Yet the world is lacking a proper system where even a user with no prior domain knowledge can easily access structured information from the trillions of unstructured data available in this already overloaded data era. This research is about structuring a data set so as to be used in a system that can easily be searched and categorized with respective to basic English irrespective of the heavy technical terms found in legal text and documents. en_US
dc.subject Text Processing and Indexing en_US
dc.subject Domain-specific search knowledge en_US
dc.subject Information Retrieval en_US
dc.title LawE (Tokenization of legal text for much efficient search results) en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account