17
1

AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

1

Page 2: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

AI & MLLessons Learnt & real-world challenges

Avishkar Misra PhDPractice Director – Data Science & Analytics for Americas

Page 3: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

3

So far we have delivered2000+ current projects at any time$700M + in Revenue.

6000+ employees (2018)

Founded in Open Source -- Eye towards analytic outcome.

Full spectrum consulting, data engineering, data science & BI support.

Apache Hadoop and cloud ecosystem integration.

Founded in 2010industry thought leader focused on Business Results.

We are the SI business of Teradata.

Who Is Think Big?1st Big Data provider 100%

focused around open source.

Page 4: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

4

Data Science & Analytics

Teradata Aster

SQL

Go

location

Analytic tools Languages Data sources Analytic engines

Page 5: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

5

Five lessons

Page 6: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

6

Amazon Go: No Checkout. Pick up and walk out.

source: amazon.com/go

Page 7: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

7

Amazon Go: No Checkout. Pick up and walk out.

source: amazon.com/go

#1 – Flow-on benefits (& consequences) extend beyond the obvious.

Save time for customers

Improve customer service

Prevent empty shelf events

Coupons 2.0

Page 8: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

8

Deep Learning for Recommendations

#2 – Formulate and solve the right problem, even if it is not how its done now.

Deep Learning

Deep Learning

A & B had identical neural network structure, size and training parameters.

The difference is how the problem was formulated.

Details: Rybakov et al., The Effectiveness of a Two-Layer Neural Network for Recommendations, ICLR 2018 https://openreview.net/forum?id=B1lMMx1CW

Page 9: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

9

Deep Learning for Fraud Detection

Neural Networks (CovNets, LSTM, Autoencoders) to transaction images

A Non-Fraud

Features

Fraud

B

Tim

e

Non-Fraud

Fraud

Page 10: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

10

Deep Learning for Fraud Detection Tr

ue

Po

sitiv

e R

ate

Legacy Rule System

Deep Learning

Machine Learning

False Positive Rate

#3 – Avoid domain myopia and look for inspiration across domains.

50% Fraud Detection Rate

60% False Positive Rates

Page 11: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

11

Across the board – Getting to Production

#4 – Work backwards. Think production first. The lab is not enough.

Analytics Operations

Analytics

Ops

flexibility

exploration

discovery

modeling

blue-skies

external data

iteration

data-mining

statistics

value-driven

security

governance

compliance

curation

deployment

maintenance

integration

testing

engineering

process-driven

5 monthsAverage to

develop, test,

validate, deploy

and scale new

analytical models

2 weeks

Page 12: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

12

Across the board – Failure is inevitable

#5 – De-risk failure by speeding and scaling up experimentation

It may will NOT work the first time…

... or even the 4th time.

Mitigate the cost of failure.

Page 13: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

13

#1 – Flow-on benefits (& consequences) extend beyond the obvious.

#2 – Formulate and solve the right problem, even if it is not how its done now.

#3 – Avoid domain myopia and look for inspiration across domains.

#5 – De-risk failure by speeding and scaling up experimentation

#4 – Work backwards. Think production first. The lab is not enough.

Page 14: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

14

Risks in adopting AI & ML

Page 15: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

15

Biased Data and Poor Quality Control

One of these photos failed automatic quality checks

Page 16: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

16

What can we do…?

#1 – Train Humans

More CS grads AND key decision makers in organizations.

#2 – Establish Standards

Processes and methods to verify and validate data & models holistically.

#3 – Automated Tools

Automated tools, verification and validation to surface potential data & model biases.

Page 17: AI & ML - Analytics Frontiers 2020 | The Analytics ......industry thought leader focused on Business Results. We are the SI business of Teradata. Who Is Think Big? 1st Big Data provider

17

Lets talk…Contact:• Email: [email protected]• Blog: http://a-misra.com• LinkedIn: https://www.linkedin.com/in/avishkarmisra• Twitter: @AvishkarMisra

References: Rybakov et al., The Effectiveness of a Two-Layer Neural Network for Recommendations, ICLR 2018, https://openreview.net/forum?id=B1lMMx1CW

Image Credits:• Photo by felipe lopez on Unsplash• five fingers by H Alberto Gongora from the Noun Project• Danger by Gregor Cresnar from the Noun Project

• education by Rockicon from the Noun Project• tools by Aleksandr Vector from the Noun Project• Checklist by Ralf Schmitzer from the Noun Project• amazon.com/go