Show simple item record

dc.contributor.authorMrozek, Petr
dc.contributor.authorPanneerselvam, John
dc.contributor.authorBagdasar, Ovidiu
dc.date.accessioned2021-02-01T11:09:34Z
dc.date.available2021-02-01T11:09:34Z
dc.date.issued2020-12-30
dc.identifier.citationP. Mrozek, J. Panneerselvam and O. Bagdasar. (2020). 'Efficient resampling for fraud detection during anonymised credit card transactions with unbalanced datasets'. 13th International Conference on Utility and Cloud Computing (UCC), Leicester, 7-10 December. New York: IEEE, pp. 426-43.en_US
dc.identifier.isbn9780738123943
dc.identifier.doi10.1109/ucc48980.2020.00067
dc.identifier.urihttp://hdl.handle.net/10545/625574
dc.description.abstractThe rapid growth of e-commerce and online shopping have resulted in an unprecedented increase in the amount of money that is annually lost to credit card fraudsters. In an attempt to address credit card fraud, researchers are leveraging the application of various machine learning techniques for efficiently detecting and preventing fraudulent credit card transactions. One of the prevalent common issues around the analytics of credit card transactions is the highly unbalanced nature of the datasets, which is frequently associated with the binary classification problems. This paper intends to review, analyse and implement a selection of notable machine learning algorithms such as Logistic Regression, Random Forest, K-Nearest Neighbours and Stochastic Gradient Descent, with the motivation of empirically evaluating their efficiencies in handling unbalanced datasets whilst detecting credit card fraud transactions. A publicly available dataset comprising 284807 transactions of European cardholders is analysed and trained with the studied machine learning techniques to detect fraudulent transactions. Furthermore, this paper also evaluates the incorporation of two notable resampling methods, namely Random Under-sampling and Synthetic Majority Oversampling Techniques (SMOTE) in the aforementioned algorithms, in order to analyse their efficiency in handling unbalanced datasets. The proposed resampling methods significantly increased the detection ability, the most successful technique of combination of Random Forest with Random Under-sampling achieved the recall score of 100% in contrast to the recall score 77% of model without resampling technique. The key contribution of this paper is the postulation of efficient machine learning algorithms together with suitable resampling methods, suitable for credit card fraud detection with unbalanced dataset.en_US
dc.description.sponsorshipN/Aen_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.urlhttps://ieeexplore.ieee.org/document/9302819en_US
dc.source2020 IEEE/ACM 13th International Conference on Utility and Cloud Computing (UCC)
dc.subjectCredit cardsen_US
dc.subjectVegetationen_US
dc.subjectMachine learning algorithmsen_US
dc.titleEfficient resampling for fraud detection during anonymised credit card transactions with unbalanced datasetsen_US
dc.typeMeetings and Proceedingsen_US
dc.contributor.departmentUniversity of Derbyen_US
dc.contributor.departmentUniversity of Leicesteren_US
dc.identifier.journal2020 IEEE/ACM 13th International Conference on Utility and Cloud Computing (UCC)en_US
dcterms.dateAccepted2020-10-06
refterms.dateFOA2021-02-01T11:09:35Z
dc.author.detail782275en_US


Files in this item

Thumbnail
Name:
(2020) UCC - Mrozek et al - Fraud ...
Size:
521.8Kb
Format:
PDF
Description:
Accepted Manuscript

This item appears in the following Collection(s)

Show simple item record