Indriati Indriati, Achmad Ridok


Indonesia a potential market for business because of a large number of smartphone users, especially developers of mobile applications. Each application stores allow the user to provide a review of the application used. The review is not only beneficial for prospective users of the application but also beneficial for the application developer. Review of the applications that are influenced by emotion (sentiment) are grouped or classified to determine positive and negative polarization. Therefore, it is necessary to have an application that can perform sentiment analysis for the mobile app reviews using Neighbor-Weighted K-Nearest Neighbor (NWKNN) classification method with high accuracy results. NWKNN method is able to classify mobile application review documents on the balanced data with current value of k = 20 gives the best f-measure average value of 0.9 with ratio of training data and test data 80%: 20%. However, for the imbalanced data with value of k = 45 gives the best f-measure average value of 0.797 with a ratio of training data and test data 80%: 20%. Based on the results, the effect of imbalanced data to  the accuracy of the NWKNN methods by comparing NWKNN and KNN methods, it was found that the value of F-Measure NWKNN method is better than KNN method with gap of 0,27, due to the added weight on class minority overcome misclassification problem on minority class.

Full Text:



CHAOVALIT, P., ZHOU, L. 2005. Moview review mining: A comparison between supervised and unsupervised classification approaches, in Proceedings of the Hawaii International Conference on System Sciences.

FELDMAN, R., & SANGER, J. 2007. The Text Mining Handbook. New York: Cambridge University Press.

LAROSE, DANIEL T. 2005. Discovering Knowledge in Data: An Introduction to Data Mining.

LIU, BING. 2010. Sentiment Analysis and Subjectivity, Handbook of natural language processing 2: 627-666.

LIU, BING. 2012. Sentiment Analysis and Opinion Mining. Morgan & Claypool Publishers.

MUSTAFA, ATIKA, ALI AKBAR, and AHMER SULTAN. 2009, Knowledge Discovery using Teks Mining: A Programmable Implementation on Information Extraction and Categorization, international Journal of Multimedia and Ubiquitous Engineering Vol. 4, No. 2, April.

NURFALAH, ADIYASAN. 2013, Sentiment Analysis Menggunakan Pendekatan Lexicon-Based,

PRASETYO, EKO. 2012. Data Mining: Konsep dan Aplikasi menggunakan Matlab, Andi

TAN, SONGBO. 2005. Neighbor-weighted K-nearest neighbor for unbalanced text corpus, Expert Systems with Applications 28 (2005) 667–671.



  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.