What ML/AI algorithms can I use to analyze Bank statement and detect frauds??

09 January 2020 6 10K Report

Ml algorithms which can be useful to analyze bank statement and then extract information about that frauds.

Dharmin pareshkumar Dave You are looking for an anomaly - detection model, perhaps an Autoencoder. You can get lots of references by googling it.

Sudipta Roy

Data scientists have access to a range of techniques, which can be broken down in terms of problems they solve: classification and regression. Both can be used to analyse data and provide the answer to whether a transaction was genuine or fraudulent. The typical supervised machine learning algorithms used to solve these problems are logistic regression, decision trees, random forests, and neural networks.

Logistic regression is a popular method, which determines the strength of cause and effect relationships between variables in data sets. It can be used to create an algorithm which predicts whether a transaction is ‘good’ or not.
Decision trees can be used to create a set of rules that model customers’ normal behavior and can be trained, using examples of fraud, to detect anomalies.
Random forests (boosting techniques) ensemble multiple weak classifiers into one strong classifier – they can be built using an ensemble of decision trees.
Neural networks are a powerful technique inspired by the workings of the human brain. Able to learn and adapt to patterns of normal behavior, neural networks can identify fraud in real-time.

Unsupervised techniques are based on clustering algorithms, which group similar data points together – they are used for anomaly detection. Algorithms used in the unsupervised approach are K-means clustering, Local Outlier Factor and One-Class SVM.

K-means clustering divides a dataset into clusters. The algorithm works iteratively and assigns data points to one of the predefined number of classes (k), based on the features that are in the dataset. Data points are clustered based on feature similarity.
Local Outlier Factor, is an algorithm that calculates the local density of data points and allows for identifying regions with similar density in the data set. By using the locality concept, one can distinguish points with much lower density than other neighbours. These points are outliers (fraudulent transactions)
One-Class SVM learns a function used for novelty detection. The idea of novelty detection is to detect rare events, i.e. events that happen rarely, and hence, of which you have very little samples. The problem is then that the usual way of training a classifier will not work.

Although machine learning represents a huge leap forward compared to traditional methods of fraud detection, it is not without its limitations.

Machine learning models are only as good as the data they are provided with. While financial services have access to massive data sets, there are relatively few fraudulent transactions within these, which can reduce a system’s predictive capability. There are several approaches to dealing with this problem.

Cristian Ramos-Vera

http://delab.csd.auth.gr/papers/ESWA07ksm.pdf

https://pdfs.semanticscholar.org/15a9/b99aa207ea5b6615a245ced5f105738acf3a.pdf

http://www.rkproject24.com/Abstract/cseit/JV36_BP.pdf

https://pdfs.semanticscholar.org/439e/d8def87292f7161d834b79cb491808b8eeef.pdf

https://arxiv.org/pdf/1309.3944

Article The application of data mining techniques in financial fraud...

https://pdfs.semanticscholar.org/249e/8889c58971f64fc393b6293bfb59602900db.pdf

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.14.7153&rep=rep1&type=pdf

https://ersj.eu/dmdocuments/2017-xx-4-b-36.pdf

Bryar Shareef

Please read this link:

https://github.com/yzhao062/anomaly-detection-resources

D A Gayan Nayanajith

following could be used to analyse data and provide the answer to whether a transaction was genuine or fraudulent. The typical supervised machine learning algorithms used to solve these problems are logistic regression, decision trees, random forests, and neural networks

Christopher C Kelly

Benford's Law can help in spotting non-logarithmic anomalies in financial data including bank transactions. Here is a short article explaining it:

Article Crunching the logarithms on Benfords Law to unmask unusual t...

If this article is useful to your research please give it a 'recommendation'.

Seismic Line Interpretation in Sedimentary Basin?

What are needs of Research in English language and literature? What are the methodologies for that?

Does anyone know the epitope of CD2 mAb 9.6 and 9-1 ?

Anyone willing to help use with a Phase 2 grant questionaire for cancer?

Operational Management and technology resources?

What are the contour units in the attached seismic reflection mapping?

How can I digitize and enhance a raster seimic reflection survey?

Help required in interpreting seismic mapping?

Is it possible to see which papers have cited my papers?

Has anyone upgraded the turbo pump in a Bruker MicroTOF Q2 using the new generation Agilent twist torr pumps?

Feedback defines the constitution of an organism?

If Banks do not provide credit facility, what are the options available for FPOs and impact on producer’s income?

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Can we mark 'EFL Learners shifting from general digital to AI technologies' as technological transition?

What are examples of AI for good projects a teacher can assign to students?

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

How to design human-centered classroom in the age of A.I.?

Do experts have journals in the field of artificial intelligence and big data that are not indexed by SCI or EI?

Measuring the Intelligence of a Species?

What's the role of IT & AI in Telecommunication Industry?