33
Text Text Wil van der Aalst Professor Information Systems TU/e

TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Text Text Wil van der Aalst Professor Information Systems TU/e

Page 2: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

t

A new profession is emerging, just like computer science in the early 1980-ties!

Industry and Society need Data Scientists!

Page 3: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

t

Page 4: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

EIT-Digital Data Science Major

5 universities involved

Page 5: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

EIT-Digital Data Science Major

3 entry universities

Page 6: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

EIT-Digital Data Science Major

5 exit universities (specializations)

Page 7: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

EIT-Digital Data Science Major

5 specializations

Distributed Systems & Data Mining for Really Big Data at KTH

Multimedia & Web Science for Big Data at UNS

Design, Implementation, and Usage of Data Science Instruments at TUB

Process Mining in High Tech Systems, Healthcare, Visual Analytics, or Big Software at TUE

Internet of Things (IoT) at UPM

Page 8: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Statistics on the first intake

• Total number of students: 46

• EU students 54%

• Age :20-29

• Female students: 17%

DSC

Page 9: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

t

DSC/e

Page 10: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

t

Data Science Center Eindhoven

http://www.tue.nl/dsce/

Page 11: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

11

DSC/e: Competences and Research Programs 28 groups and 420+ people involved

Context: Why are we using data science, does it have the intended effect, and will

people accept it?

Analysis: How to turn data into real value (models, answers/decisions, and

visualizations/insights)?

Enabling technologies: How to get the data and deal with computational/

infrastructural challenges (big data and hard questions)?

Probability and Statistics

Stochastic Networks

Data Mining

Process Mining

Visualization

Large-Scale Distributed Systems

Data-Intensive Algorithms

Data-Driven Operations Management

Data-Driven Innovation and Business

Human and Social Analytics

Privacy, Security, Ethics, and Governance

Internet of Things

[RP1] Process Analytics: Improving Service While Cutting Costs

[RP2] Customer Journey: Correlating Events to Learn and Influence Customer Behavior

[RP3] Smart Maintenance & Diagnostics: Safeguarding Availability

[RP4] Quantified Self: Improving Performance and Well-Being

[RP5] Data Value and Privacy: Economic and Legal Aspects of Data Science

[RP6] Smart Cities: Ensuring Safety and Convenience for Citizens

[RP7] Smart Grids: Data Intensive Infrastructures

Page 12: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Data Science Flagship (Philips & DSC/e)

4 Strategic topics • Data Driven Value Propositions

• Healthcare Smart Maintenance

• Optimizing Healthcare Workflows

• Continuous Personal Health

4 TU/e departments

16 PhD students

30 Data science specialists

Many more organizations

• BrandLoyalty

• Vanderlande Industries

• ASML

• SynerScope

• Magnaview

• Fluxicon

• Adversitement

• Rabobank

• ING

• SAP

• IBM

• PwC

• AMC

• …

Page 13: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining

Page 14: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Example: Process Mining as the Bridge Between Data Science and Process Science

Page 15: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior

• Input: events (“things that have happened”)

• Mandatory per event:

• case identifier

• activity name

• timestamp/date

• Optional

• resource

• transaction type

• costs

• …

case

identifier

activity

name timestamp

resource row = event

Page 16: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior

208 cases

5987 events

74 activities

Page 17: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior

batching for activities

“opstellen eindnota” and

“archiveren”

Page 18: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Loesje van

der Aalst

desire line

Process Discovery

Page 19: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior process discovery

NO

modeling

needed!

Page 20: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior process discovery

NO

modeling

needed!

74 act.

11 act.

3 act.

Page 21: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

event data process

model

Conformance Checking

Page 22: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

desire line

very safe

system

Conformance Checking

Page 23: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior conformance checking

?

discovered or

hand-made

Page 24: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior conformance checking

fitness of

93.5%

Page 25: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior

conformance checking

final inspection is

skipped 40 times

Page 26: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior conformance checking

move on model

(something should have

happened, but did not)

move on log

(something happened that

should not happen)

Page 27: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior performance analysis

average

flowtime is

1.92 months

bottleneck

NO

modeling

needed!

Page 28: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior

performance analysis

waiting time of

15.74 days

NO

modeling

needed!

Page 29: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior animating reality

NO

modeling

needed!

real cases

Page 30: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior

16 cases are

queueing

animating reality

Page 31: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

Process Mining: Spreadsheet for behavior

Deviations

Where?

Why? time

costs

What?

Page 32: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

32

Conclusion

•Need for Data Scientists!

•Wonderful Data Science Master Program with 3

entry points and 5 specializations

• Ask Farideh Heidari ([email protected]) for details!

•Zoomed-in on the Data Science ecosystem in

Eindhoven: Data Science Center Eindhoven (DSC/e)

•Zoomed-in on a particular Data Science topic:

Process Mining (linking processes and data)

Page 33: TextText Wil van der Aalst - EIT Digital · Statistics on the first intake •Total number of students: 46 •EU students 54% •Age :20-29 •Female students: 17% DSC

masterschool.eitdigital.eu

More information?

http://www.masterschool.eitdigital.eu/programmes/dsc/

https://www.coursera.org/course/procmin/

http://www.processmining.org/

http://www.tue.nl/dsce/

http://vdaalst.com/