KLASIFIKASI DAN ANALISIS SENTIMEN PADA DATA TWITTER MENGGUNAKAN ALGORTIMA NAÏVE BAYES (STUDI KASUS: PEKAN OLAHRAGA NASIONAL XX 2021)

Krisna Jonathan Sitorus, . (2022) KLASIFIKASI DAN ANALISIS SENTIMEN PADA DATA TWITTER MENGGUNAKAN ALGORTIMA NAÏVE BAYES (STUDI KASUS: PEKAN OLAHRAGA NASIONAL XX 2021). Skripsi thesis, Universitas Pembangunan Nasional Veteran Jakarta.

	Text ABSTRAK.pdf Download (129kB)
	Text AWAL.pdf Download (350kB)
	Text BAB I.pdf Download (113kB)
	Text BAB II.pdf Restricted to Repository UPNVJ Only Download (185kB)
	Text BAB III.pdf Restricted to Repository UPNVJ Only Download (201kB)
	Text BAB IV.pdf Restricted to Repository UPNVJ Only Download (615kB)
	Text BAB V.pdf Download (96kB)
	Text DAFTAR PUSTAKA.pdf Download (113kB)
	Text RIWAYAT HIDUP.pdf Restricted to Repository UPNVJ Only Download (93kB)
	Text LAMPIRAN.pdf Restricted to Repository UPNVJ Only Download (690kB)
	Text HASIL PLAGIARISME.pdf Restricted to Repository staff only Download (7MB)
	Text ARTIKEL KI.pdf Restricted to Repository staff only Download (680kB)

Official URL: http://repository.upnvj.ac.id

Abstract

The world of technology is currently developing very rapidly, people can now freely access and express themselves using social media as an example, namely Twitter. Many Twitter users express their opinions through the tweets they send on social media, especially regarding the XX 2021 National Sports Week (PON) ago. Several types of tweets that are appreciative of the event are often seen, but not infrequently there are also tweets that are complaining about the implementation of the XX PON. From this problem, a research was conducted on sentiment analysis on twitter data regarding PON XX using the Naïve Bayes method. The amount of data that is crawled is 1000 data and then the data goes through a duplicate data removal process so that it produces 218 tweets and has not been labeled. Before the process of classifying the data obtained, the data must be labeled and cleaned first before entering the text processing stage, then the data is given a weight for each word with a Term Frequency–Inverse Document Frequency (TF-IDF) which will be used in the future.as a feature. Then, because the data with positive and negative labels have significantly different amounts, the Synthetic Minority Oversampling Technique (SMOTE) method was used to balance the data. The next stage is the distribution of data by 80% 20% and classified by the Naive Bayes method. The results obtained from this study are obtained that the test data get 99% accuracy, 100% precision, 98% recall.

Item Type:	Thesis (Skripsi)
Additional Information:	[No.Panggil: 1810511090] [Pembimbing 1: Anita Muliawati] [Pembimbing 2: Sarika] [Penguji 1: Henki Bayu Seta] [Penguji 2: Mayanda Mega Santoni]
Uncontrolled Keywords:	Sentiment Analysis, Classification, PON XX, Naïve Bayes
Subjects:	Q Science > Q Science (General) T Technology > T Technology (General)
Divisions:	Fakultas Ilmu Komputer > Program Studi Informatika (S1)
Depositing User:	Krisna Jonathan Sitorus
Date Deposited:	01 Aug 2022 03:33
Last Modified:	01 Aug 2022 03:33
URI:	http://repository.upnvj.ac.id/id/eprint/19780

Actions (login required)

View Item