KLASIFIKASI DAN ANALISIS SENTIMEN PADA DATA TWITTER MENGGUNAKAN ALGORTIMA NAÏVE BAYES (STUDI KASUS: PEKAN OLAHRAGA NASIONAL XX 2021)

Krisna Jonathan Sitorus, . (2022) KLASIFIKASI DAN ANALISIS SENTIMEN PADA DATA TWITTER MENGGUNAKAN ALGORTIMA NAÏVE BAYES (STUDI KASUS: PEKAN OLAHRAGA NASIONAL XX 2021). Skripsi thesis, Universitas Pembangunan Nasional Veteran Jakarta.

[img] Text
ABSTRAK.pdf

Download (129kB)
[img] Text
AWAL.pdf

Download (350kB)
[img] Text
BAB I.pdf

Download (113kB)
[img] Text
BAB II.pdf
Restricted to Repository UPNVJ Only

Download (185kB)
[img] Text
BAB III.pdf
Restricted to Repository UPNVJ Only

Download (201kB)
[img] Text
BAB IV.pdf
Restricted to Repository UPNVJ Only

Download (615kB)
[img] Text
BAB V.pdf

Download (96kB)
[img] Text
DAFTAR PUSTAKA.pdf

Download (113kB)
[img] Text
RIWAYAT HIDUP.pdf
Restricted to Repository UPNVJ Only

Download (93kB)
[img] Text
LAMPIRAN.pdf
Restricted to Repository UPNVJ Only

Download (690kB)
[img] Text
HASIL PLAGIARISME.pdf
Restricted to Repository staff only

Download (7MB)
[img] Text
ARTIKEL KI.pdf
Restricted to Repository staff only

Download (680kB)

Abstract

The world of technology is currently developing very rapidly, people can now freely access and express themselves using social media as an example, namely Twitter. Many Twitter users express their opinions through the tweets they send on social media, especially regarding the XX 2021 National Sports Week (PON) ago. Several types of tweets that are appreciative of the event are often seen, but not infrequently there are also tweets that are complaining about the implementation of the XX PON. From this problem, a research was conducted on sentiment analysis on twitter data regarding PON XX using the Naïve Bayes method. The amount of data that is crawled is 1000 data and then the data goes through a duplicate data removal process so that it produces 218 tweets and has not been labeled. Before the process of classifying the data obtained, the data must be labeled and cleaned first before entering the text processing stage, then the data is given a weight for each word with a Term Frequency–Inverse Document Frequency (TF-IDF) which will be used in the future.as a feature. Then, because the data with positive and negative labels have significantly different amounts, the Synthetic Minority Oversampling Technique (SMOTE) method was used to balance the data. The next stage is the distribution of data by 80% 20% and classified by the Naive Bayes method. The results obtained from this study are obtained that the test data get 99% accuracy, 100% precision, 98% recall.

Item Type: Thesis (Skripsi)
Additional Information: [No.Panggil: 1810511090] [Pembimbing 1: Anita Muliawati] [Pembimbing 2: Sarika] [Penguji 1: Henki Bayu Seta] [Penguji 2: Mayanda Mega Santoni]
Uncontrolled Keywords: Sentiment Analysis, Classification, PON XX, Naïve Bayes
Subjects: Q Science > Q Science (General)
T Technology > T Technology (General)
Divisions: Fakultas Ilmu Komputer > Program Studi Informatika (S1)
Depositing User: Krisna Jonathan Sitorus
Date Deposited: 01 Aug 2022 03:33
Last Modified: 01 Aug 2022 03:33
URI: http://repository.upnvj.ac.id/id/eprint/19780

Actions (login required)

View Item View Item