Analysis of K-Gram Use on Characters, Words, and Sentences for Detecting Document Similarity
Keywords:
Python, Character K-Gram, Word K-Gram, Sentence K-Gram, Winnowing Algorithm, Document Similarity
Abstract
Digital technology is now a necessity, and documents are one of its components. Document similarity can be detected with a variety of methods, including fingerprinting. Fingerprinting works by combining hashing techniques with k-grams. This research focuses on a detection model that uses k-grams with the winnowing algorithm, implemented in Python. The k-gram parsing tests use five values of k: 2, 3, 4, 5, and 6. Character-level parsing yields a similarity percentage higher than the manually computed percentage. Word-level parsing yields the percentage closest to the manual percentage, while sentence-level parsing yields the lowest percentage relative to the manual percentage.
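The three parsing levels named in the abstract (character, word, and sentence k-grams) followed by winnowing can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the CRC32 hash, the window size of 4, the text normalization, and the use of the Jaccard coefficient as the similarity score are all assumptions made for illustration.

```python
import re
import zlib


def char_kgrams(text, k):
    """Character k-grams over lowercased text, non-alphanumerics removed."""
    s = re.sub(r"[^a-z0-9]", "", text.lower())
    return [s[i:i + k] for i in range(len(s) - k + 1)]


def word_kgrams(text, k):
    """Word k-grams: k consecutive words joined by a single space."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [" ".join(words[i:i + k]) for i in range(len(words) - k + 1)]


def sentence_kgrams(text, k):
    """Sentence k-grams: k consecutive sentences, split on '.', '!' or '?'."""
    sentences = [s.strip().lower() for s in re.split(r"[.!?]+", text) if s.strip()]
    return [" | ".join(sentences[i:i + k]) for i in range(len(sentences) - k + 1)]


def winnow(grams, window=4):
    """Hash every k-gram and keep the minimum hash of each sliding window.

    Simplification (assumption): fingerprints are kept as a set of hash
    values, without the positions that full winnowing usually records.
    """
    hashes = [zlib.crc32(g.encode("utf-8")) for g in grams]
    if not hashes:
        return set()
    if len(hashes) < window:
        # Too few hashes for one full window: keep the global minimum.
        return {min(hashes)}
    return {min(hashes[i:i + window]) for i in range(len(hashes) - window + 1)}


def jaccard(a, b):
    """Jaccard coefficient |A ∩ B| / |A ∪ B|; 0.0 by convention if both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similarity(doc1, doc2, parser, k, window=4):
    """Similarity in percent between two documents for one parser and one k."""
    f1 = winnow(parser(doc1, k), window)
    f2 = winnow(parser(doc2, k), window)
    return 100.0 * jaccard(f1, f2)


if __name__ == "__main__":
    a = "Documents are compared by fingerprint. Hashing makes this fast."
    b = "Documents are compared by fingerprint. Hashing keeps this quick."
    for k in (2, 3, 4, 5, 6):  # the five k values listed in the abstract
        print(k, similarity(a, b, char_kgrams, k), similarity(a, b, word_kgrams, k))
```

Passing `char_kgrams`, `word_kgrams`, or `sentence_kgrams` as the `parser` argument switches the parsing level without changing the rest of the pipeline, which makes it straightforward to compare the three levels for each value of k.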
Copyright (c) 2022 Prosiding Seminar Nasional Teknoka
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.