OPTIMIZATION OF NAÏVE BAYES USING LEVENSHTEIN DISTANCE FOR TYPOGRAPHICAL ERROR CORRECTION IN SENTIMENT ANALYSIS

Utomo, Aziz Yogo (2022) OPTIMIZATION OF NAÏVE BAYES USING LEVENSHTEIN DISTANCE FOR TYPOGRAPHICAL ERROR CORRECTION IN SENTIMENT ANALYSIS. S1 - Sarjana thesis, Universitas AMIKOM Yogyakarta.

[img] Text (COVER-ABSTRAK)
COVER.pdf

Download (2MB)
[img] Text (BAB I)
BAB I.pdf

Download (291kB)
[img] Text (BAB II)
BAB II.pdf
Restricted to Registered users only

Download (632kB)
[img] Text (BAB III)
BAB III.pdf
Restricted to Registered users only

Download (291kB)
[img] Text (BAB IV)
BAB IV.pdf
Restricted to Registered users only

Download (554kB)
[img] Text (BAB V)
BAB V.pdf

Download (44kB)
[img] Text (DAFTAR PUSTAKA)
Daftar Pustaka.pdf
Restricted to Registered users only

Download (147kB)
[img] Archive (SOURCE CODE)
Source Code-18.61.0149-Aziz Yogo Utomo.zip
Restricted to Repository staff only

Download (62kB)
[img] Text (PUBLIKASI)
Publikasi-18.61.0149-AzizYogoUtomo.pdf
Restricted to Repository staff only

Download (413kB)

Abstract

Now we live in an era with a tremendous amount of unstructured data such as text data. Naïve Bayes is an algorithm that is well-performed for dealing with text data. In processing text data, there are several problems, such as the vast amount and the data dimension. Text data derived from human fingers allows typographical errors in writing; this typographical error becomes another problem because it will make the data dimension bigger and change the semantics of the word itself. In addition, typographical errors liable the calculation of the Naïve Bayes Likelihood to be 0 and affect the performance of the model. Levenshtein Distance is one method to correct typographical errors. This research indicates that the Levenshtein Distance has succeeded in increasing the performance of the Naïve Bayes model with optimal results using distance 2 of 5,9%

Item Type: Thesis (S1 - Sarjana)
Contributor:
Pembimbing
Baita, Anna
Uncontrolled Keywords: Text Mining, Classification, Naive Bayes Classifier, Levenshtein Distance, Typographical Error Correction
Subjects: 000 - Komputer, Informasi dan Referensi Umum > 000 Ilmu komputer, ilmu pengetahuan dan sistem-sistem > 005 Pemrograman komputer, program dan data
Divisions: Fakultas Ilmu Komputer > Informatika
Depositing User: RC Universitas AMIKOM Yogyakarta
Date Deposited: 07 Jun 2022 07:13
Last Modified: 22 Aug 2023 03:10
URI: http://eprints.amikom.ac.id/id/eprint/340

Actions (login required)

View Item View Item