A Comparative Analysis of Selected Data Mining Algorithms and Programming Languages

dc.contributor.authorDymora, Paweł
dc.contributor.authorMazurek, Mirosław
dc.contributor.authorSmyła, Łukasz
dc.date.accessioned2025-02-05T10:17:16Z
dc.date.available2025-02-05T10:17:16Z
dc.date.issued2024-12
dc.description.abstractThis paper evaluates the performance of ten selected data mining algorithms in the context of classification and regression and the effectiveness between two popular programming languages used in data science: Python and R. The algorithms included in the study were Naive Bayes Classi fier, K-Nearest Neighbors (k-NN), Support Vector Machine (SVM), Decision Tree, Random Forest, Gradient Boosting Machine (GBM), Logistic Regression, Linear Regression, Ridge Re gression, and LASSO Regression. The study aimed to evaluate how the various algorithms per form in classification and regression tasks in the context of a specific problem, in this case fraud detection. The performance of the algorithms was evaluated based on key metrics such as accura cy, execution time, the difference between the best and worst results, and in terms of mean square error (MSE). Moreover, learning tools such as R and Python enable students not only to perform multidimensional data analysis, but also to predict future trends and changes. The ability to work with data, modelling and visualisation are key competences in the context of many areas of mo dern life and to support the making of accurate business decisions.eng
dc.identifier.citationJournal of Education, Technology and Computer Science 5(35)2024, s. 69-83
dc.identifier.doi10.15584/jetacomps.2024.5.7
dc.identifier.eissn2719-7417
dc.identifier.issn2719-6550
dc.identifier.urihttps://repozytorium.ur.edu.pl/handle/item/11314
dc.language.isoeng
dc.publisherThe University of Rzeszów Publishing House
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectdata mining algorithms
dc.subjectR accuracy
dc.subjectmean square error
dc.titleA Comparative Analysis of Selected Data Mining Algorithms and Programming Languages
dc.typearticle

Pliki

Oryginalny pakiet

Aktualnie wyświetlane 1 - 1 z 1
Ładowanie...
Obrazek miniatury
Nazwa:
07_A+Comparative+Analysis.pdf
Rozmiar:
430.07 KB
Format:
Adobe Portable Document Format

Pakiet licencji

Aktualnie wyświetlane 1 - 1 z 1
Nazwa:
license.txt
Rozmiar:
1.28 KB
Format:
Item-specific license agreed upon to submission
Opis: