33
Primjena strojnog učenja i rudarenja podataka u bankarstvu

Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

Primjena strojnog učenja i rudarenja podataka u bankarstvu

Page 2: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Banking operations in 17 countries – Europe CEE market leader

• Global player in asset management: 218.7 bn in managed assets

• Customers: over 25 million

• Market leader in Central and Eastern Europe leveraging on the region's structural strengths

• One of the global systemically important financial institutions identified by the Financial Stability Board

O UniCredit Group

Over 143.000 employees

7.839 branches

Internationalnetwork that spans 50 markets

Page 3: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

3/25

UniCredit Group International network

Page 4: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Member of UniCredit Group since March 2002

• 26,4 % market share - 1st Croatian bank

• Leading bank in terms of products, services quality and technology innovations

• Total assets of HRK 106 billion

• 4.200 employees, 125 branches, 867 ATMs

• 1 million customers

• Undisputed market leader

• Total loans – 27,8%

• Total deposits – 26,4%

• Investment funds – 23,9%

• Pension funds – obligatory 39,9%; voluntary 53,4%

O ZABI1

1 figures as of Dec 2015

Page 5: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

O sadržaju

5

Page 6: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

6

O autoru

2012. - danasPoslijediplomski doktorski studijEkonomski fakultet u Zagrebu

2010. – 2011.Diplomski doktorski studijEkonomski fakultet u Zagrebu

2006. – 2010.Preddiplomski studijEkonomski fakultet u Zagrebu

2016. - danasRazvoj poslovne inteligencijeZagrebačka banka d.d.

2013. – 2016.Modeliranje kreditnih rizikaZagrebačka banka d.d.

2011. – 2013.Planiranje i kontrolingZagrebačka banka d.d.

Page 7: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

7

O arhitekturi

CPU vs GPU

Page 8: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

Rudarenje podataka

Page 9: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

9

Proces

CRISP-DMSEMMA

Page 10: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• SNA se bavi analizom mreže odnosa između različitih entiteta

• Predmet istraživanja mogu biti bilo koji entiteti koji imaju interakciju:

• Tvrtke, fizičke osobe, serveri, web stranice, životinje, pojave…

• Odnosi unutar mreže i informacije o mreži se pokušavaju kvantificirati različitim mjerama koje imaju različito tumačenje zavisno o kontekstu predmeta istraživanja

10

SNA

Page 11: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

11

SNA

Page 12: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Analiza socijalnih mreža (Facebook, Twitter, itd.)

• relativno slabo razvijena u RH

• detektiranje ključnih individua koje utječuna ostale

• Mogućnost razvoja marketinških kampanja(npr. ženidba potencijalan stambeni kredit)

12

SNA – primjena u bankarstvu

Page 13: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Analiza transakcijskog sustava:

• Identificiranje entiteta koji imajuznačajne priljeve, značajne odljeve,spajaju industrije, itd.

• Analiza propagacije rizika

13

SNA – primjena u bankarstvu

Metrika Klijent Ego Klika Portfelj

Broj transakcija

Iznos transakcija

Broj klijenata

Gustoća mreže

Asortativnost

Recipročnost

Međupovezanost

Bliskost

Stupanj centralnosti

pageRank

Page 14: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

Vrste strojnog učenja i algoritmi

Machine Learning

Supervised learning

1

Unsupervised learning

2

Reinforcementlearning

3

Page 15: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

Nenadzirano učenje

Page 16: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Kontrolirana redukcija varijabli

• Popunjavanje missing vrijednosti

• Detekcija anomalija

• Segmentacija klijenata

16

Clustering

Page 17: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• input = output

• sigmoid, tanh, rectifier,…

• sparse vs bottleneck

17

ANN - Autoencoder

Page 18: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Autoencoder vs PCA

18

ANN - Autoencoder

Page 19: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

19

Fraud detection - Autoencoder

• Nebalansiran set podataka

• Jedinstvenost događaja

• Tradicionalno rule based

• Skupa potencijalna false negative klasifikacija

• Detekcija anomalija

Page 20: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

20

ANN - Autoencoder

Page 21: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

21

Fraud detection - PayPal (2014) prezentacija

Page 22: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

22

Fraud detection - PayPal (2014) prezentacija

Page 23: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

Nadzirano učenje

Page 24: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Klasifikacija puno više zastupljena u praksi

• Često se regresijski problemi diskretiziraju i pretvaraju u klasifikaciju

• U bankarstvu se najčešće koristi za predikcije (prediktivna analitika)

24

Tipovi nadziranog učenja

Page 25: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Smanjenje marketinških troškova i povećanje prihoda

• Tipični problemi primjene:

• Churn

• Xsell / Upsell / Next best offer

• Nema restrikcija na modeliranje

• Testiranje modela pomoću kontrolnih skupina

• Performanse modela ponekad može teško biti izmjeriti

25

Primjena u Retailu

Page 26: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• Smanjenje troškova za banku

• Visoko regularizirano područje

• Nedopustiv tzv. black-box model

• Modeliranje PD, LGD, EAD parametara

• Stabilnost modela i frekvencija mijenjanja modela

• Kalibracija modela

• Potreban uzorak: 5-7 godina

26

Primjena u Risku

Page 27: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

• FFNN vs RNN/LSTM

27

ANN

Page 28: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

28

Deep learning algoritmi

• Sentiment analysis

• Voice recognition (emotion detection)

Page 29: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

29

Deep learning algoritmi

• OCR

• Face recognition

Page 30: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

30

ANN– investicijsko bankarstvo

James Harris Simons

Page 31: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

31

ANN– investicijsko bankarstvo

HFT?

Page 32: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

32

ANN– investicijsko bankarstvo

Ray Dalio David Ferrucci

Page 33: Primjena strojnog učenja i rudarenja podataka u bankarstvu · •Member of UniCredit Group since March 2002 •26,4 % market share - 1st Croatian bank •Leading bank in terms of

Hvala na pažnji!