Hyperparameter Optimization Analysis of MultinomialNB and Logistic Regression in Multi‑Feature Text‑Based Film Genre Classification

Shabrio Cahyo Wardoyo; Umniy Salamah

doi:10.21108/indojc.v10i1.9100

Shabrio Cahyo Wardoyo Universitas Mercu Buana
Umniy Salamah

Issue	Vol. 10 No. 1 (2025)
Release	01 August 2025
Section	Articles

description 9100

subject Abstract

This study aims to analyze and compare the performance of two text classification algorithms Multinomial Naive Bayes (MNB) and Logistic Regression (LR)—for film genre classification using multi-feature text data, both with and without hyperparameter optimization. Film genres play a crucial role in digital content recommendation systems; however, manual classification is subjective and time-consuming. The dataset, obtained from Letterboxd via Kaggle, includes film titles, descriptions, and themes. After preprocessing and text normalization (tokenization, lemmatization, and stemming), the text data were transformed into numerical features using the TF-IDF method. Two modeling scenarios were applied: the first using default parameters, and the second employing GridSearchCV to find the optimal hyperparameter settings. Model performance was evaluated using accuracy, precision, recall, and F1-score. The results indicate that the optimized LR model achieved the highest accuracy of 0.847, followed by the optimized MNB model with an accuracy of 0.837. This study concludes that hyperparameter optimization significantly improves model performance and that LR outperforms MNB in the context of multi-feature text-based genre classification.

label Categories

Theory of Computation, and Computing Methodologies

format_quoteCitationfile_copyCopy

[1]

Wardoyo, S.C. and Umniy Salamah 2025. Hyperparameter Optimization Analysis of MultinomialNB and Logistic Regression in Multi‑Feature Text‑Based Film Genre Classification. Indonesian Journal on Computing (Indo-JC). 10, 1 (Aug. 2025). DOI:https://doi.org/10.21108/indojc.v10i1.9100.

ACM ACS APA ABNT Chicago Harvard IEEE MLA Turabian Vancouver

download Endnote/Zotero/Mendeley (RIS) download BibTeX

Downloads

Download data is not yet available.

Main Menu

Hyperparameter Optimization Analysis of MultinomialNB and Logistic Regression in Multi‑Feature Text‑Based Film Genre Classification

Downloads

Indonesian Journal on Computing (Indo-JC)