Krisna Jonathan Sitorus, . (2022) KLASIFIKASI DAN ANALISIS SENTIMEN PADA DATA TWITTER MENGGUNAKAN ALGORTIMA NAÏVE BAYES (STUDI KASUS: PEKAN OLAHRAGA NASIONAL XX 2021). Skripsi thesis, Universitas Pembangunan Nasional Veteran Jakarta.
Text
ABSTRAK.pdf Download (129kB) |
|
Text
AWAL.pdf Download (350kB) |
|
Text
BAB I.pdf Download (113kB) |
|
Text
BAB II.pdf Restricted to Repository UPNVJ Only Download (185kB) |
|
Text
BAB III.pdf Restricted to Repository UPNVJ Only Download (201kB) |
|
Text
BAB IV.pdf Restricted to Repository UPNVJ Only Download (615kB) |
|
Text
BAB V.pdf Download (96kB) |
|
Text
DAFTAR PUSTAKA.pdf Download (113kB) |
|
Text
RIWAYAT HIDUP.pdf Restricted to Repository UPNVJ Only Download (93kB) |
|
Text
LAMPIRAN.pdf Restricted to Repository UPNVJ Only Download (690kB) |
|
Text
HASIL PLAGIARISME.pdf Restricted to Repository staff only Download (7MB) |
|
Text
ARTIKEL KI.pdf Restricted to Repository staff only Download (680kB) |
Abstract
The world of technology is currently developing very rapidly, people can now freely access and express themselves using social media as an example, namely Twitter. Many Twitter users express their opinions through the tweets they send on social media, especially regarding the XX 2021 National Sports Week (PON) ago. Several types of tweets that are appreciative of the event are often seen, but not infrequently there are also tweets that are complaining about the implementation of the XX PON. From this problem, a research was conducted on sentiment analysis on twitter data regarding PON XX using the Naïve Bayes method. The amount of data that is crawled is 1000 data and then the data goes through a duplicate data removal process so that it produces 218 tweets and has not been labeled. Before the process of classifying the data obtained, the data must be labeled and cleaned first before entering the text processing stage, then the data is given a weight for each word with a Term Frequency–Inverse Document Frequency (TF-IDF) which will be used in the future.as a feature. Then, because the data with positive and negative labels have significantly different amounts, the Synthetic Minority Oversampling Technique (SMOTE) method was used to balance the data. The next stage is the distribution of data by 80% 20% and classified by the Naive Bayes method. The results obtained from this study are obtained that the test data get 99% accuracy, 100% precision, 98% recall.
Item Type: | Thesis (Skripsi) |
---|---|
Additional Information: | [No.Panggil: 1810511090] [Pembimbing 1: Anita Muliawati] [Pembimbing 2: Sarika] [Penguji 1: Henki Bayu Seta] [Penguji 2: Mayanda Mega Santoni] |
Uncontrolled Keywords: | Sentiment Analysis, Classification, PON XX, Naïve Bayes |
Subjects: | Q Science > Q Science (General) T Technology > T Technology (General) |
Divisions: | Fakultas Ilmu Komputer > Program Studi Informatika (S1) |
Depositing User: | Krisna Jonathan Sitorus |
Date Deposited: | 01 Aug 2022 03:33 |
Last Modified: | 01 Aug 2022 03:33 |
URI: | http://repository.upnvj.ac.id/id/eprint/19780 |
Actions (login required)
View Item |