16
BI from open sources with GT data mining By: Edith Ohri, Datalert Home of GT data mining [email protected]

Bi analytics with gt data mining

Embed Size (px)

Citation preview

Page 1: Bi analytics with gt data mining

BI from open sourceswith

GT data mining

By: Edith Ohri, DatalertHome of GT data mining

[email protected]

Page 2: Bi analytics with gt data mining

AboutEdith Ohri – Developer of GT data mining and founder of Datalert startup for early detection. Industrial & Management Eng. from the Technion, MSc from NY Polytech.Management member of IE group in Association of Engineers and Architects in Israel, and a Liaison to Israel Society for Quality.

GT applications include:

SMU Singapore – early detection of top students and dropouts.RAFAE”L – root cause of late deliveries in PurchasingSCD Israel – root cause of a quality issue in productionDetection of earthquakes seismology patterns of behavior - Israel

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 2

Page 3: Bi analytics with gt data mining

The challenge

How to exploit free data?

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 3

Page 4: Bi analytics with gt data mining

Open data integrity issues

1. Unsupervised records2. Interdependencies3. “Long tail”4. “Overfitting”5. BIG DATA concerns,

such as inconsistencies and dynamics...

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 4

Page 5: Bi analytics with gt data mining

The GT data mining solutionGT is about patterns detection* in unsupervised complex data, including rare patterns and newly emerging ones.

*Patterns always provide further resolution, associations and insight.

s27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 5

Page 6: Bi analytics with gt data mining

Some of GT new principles1. They shall not clean input data!

2. Always prefer unsupervised data!

3. Include exceptions;

4. Include the data environment;

5. No pre-assumption about data behavior… * Consider variables as interdependent unless proved

otherwise;

6. Conclusions have to be explicit and fully traceable.

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 6

Page 7: Bi analytics with gt data mining

Example: predicting market prices

The target: pricing of new products, based on historic price lists.

The client doubts if analysis can help at all, since there is no data on competitors prices and clients’ behavior. Currently, Marketing determines prices by trial-and-error.

*See how GT resolve that issue in slide #13

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 7

Page 8: Bi analytics with gt data mining

Cont. example – predicting prices, input data

The input contains ~20,000 lines and 22 inter -dependent variables (YET NO data about competitors or clients behavior)

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 8

Page 9: Bi analytics with gt data mining

Cont. example – predicting prices, patterns

GT identifies the 3 product families and 9 subgroups - Inserts, Tools and Solid Carbide, and in them 5 sgr defined by specific functions: 6Q CUT-Grip, 30B Turning-with-hole, 21T Milling-new, 3 sgr defined by their typical Prod.Type, Grade, Shape and Size, and 1 Exceptions sgr.

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 9

Page 10: Bi analytics with gt data mining

Cont. example – predicting prices, key factors

Key-factors in addition to the already existing definitions of Sales-group and Item group:₋ Type of product₋ Geometric shape₋ Type of Chipbreaker₋ Grade₋ Product Radius (Length though has no effect..) *GT arrives to similar key factors on two independent sets.

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 10

Non-quantitative

Page 11: Bi analytics with gt data mining

Cont. example – predicting prices, test

Projecting prices with GT formulas:

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 11

Sub-group

Price by formula

Actual price $

Description Item

Gr732 44.11 45.74 38U drilling Inserts 5505456Gr750 7.52 9.84 1E self-grip Inserts 6002918Gr736 18.10 18.88 12n~A do-grip Inserts 6002410Gr736 16.17 15.96 12n~A do-grip Inserts 6095285

Anticipated prices are very close to true prices, see full simulation in next slide

Page 12: Bi analytics with gt data mining

Cont. example – predicting prices, simulation

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 12

Page 13: Bi analytics with gt data mining

Cont. example – predicting prices, GT insights

Answer to Slide-7: pricing can be done without data on competitors and clients, by reverse-engineering their old item price lists. It means that we may know the competitors’ prices more than themselves!

PS, GT groups characteristics also help improve existing Sales-group and Item-group definitions.

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 13

Page 14: Bi analytics with gt data mining

Cont. example – predicting prices, GT Benefits

1. Shorter time to market.

2. Improved specifications of new products.

3. Marketing competitive edge.

4. Extra windfall from data...

27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 14

Page 15: Bi analytics with gt data mining

GT features Sum-upDiscovery, Root-causes

Early detection

Automation, fits all platforms

Low cost application method

Fast adaptation to changes

Scalability (by fast & affordable adaptation).27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 15

Page 16: Bi analytics with gt data mining

ThanksEdith Ohri

Home of GT data mining

*Imported pictures are from free web sources.27 Oct 2015 BI from open sources with GT data mining. All rights reserved © Edith Ohri 2015 Slide 16