22
Weka(DataMining Tool) Ajou University

Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Embed Size (px)

Citation preview

Page 1: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Weka(DataMining Tool)

Ajou University

Page 2: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Introduction to Weka

• Developed by University of Waikato(Keep being updated since 1999)

• Java based Open source application(Free to download)

• Uses for Datamining & Machine learning

Page 3: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Installation

• Download Link: http://www.cs.waikato.ac.nz/ml/weka/downloading.html

Page 4: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Installation

Page 5: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

How to Start Weka

or

Page 6: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Options of Weka

Preprocess- open / edit / save data- modify data by preprocessing to use it

Classify- Select classifier to perform classification

or regression- Training & Testing the data

Cluster- Select clustering algorithm to make

clusters for the data

Associate- Analyze the data by using associator

to make association rules

Select Attribute- Select effective attributes

Visualize- Show 2D graph or plot of the data

Page 7: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Preprocessing

Useful & Most used Preprocessing Filters

- DiscretizeDiscretize a range of numeric attributes in the dataset into nominal attribute

- NormalizeNormalize(-1 ~ 1) all numeric values in the given dataset

- NumericToNominalTurning numeric attributes into nominal ones

- ReplaceMissingValuesReplacing all missing values for nominal and numeric attributes in dataset withthe modes and means from the training data

- StandardizeStandardizes all numeric attributes in the given dataset to have zero mean and unit variance

- StringToNominalTurning string attributes into nominal ones

- SwapValuesSwaps two values of a nominal attributes

Page 8: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Preprocessing

Example> Discretize Function(using “diabetes.arff”)

13

20

Discretized

Page 9: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Classification

Useful & Most used Classifier

- AttributeSelectedClassifierDimensionality of training and test data is reduced by attribute selectionbefore being passed on to a classifier

- ClassificationViaClustering(Regression)A simple meta-classifier that uses a clusterer(regression model) for classification

- DecisionTableClass for building and using a simple decision table majority classifier

- IBk(k-NN)K-nearest neighbor classifier

- LibSVMA wrapper class for using a LibSVM(Library for Support Vector Machine)

- NaiveBayesA Naïve Bayes classifier using estimator class

Page 10: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Classification

Example> IBk(k-NN) Classifier(using “KDDCup99_sample.arff”)

Page 11: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Classification

Example> IBk(k-NN) Classifier(using “KDDCup99_sample.arff”)

Page 12: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Classification

Example> IBk(k-NN) Classifier(using “KDDCup99_sample.arff”)

Page 13: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Classification

Saving, Loading, and Using of built Model

Page 14: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Classification

Saving, Loading, and Using of built Model

Page 15: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Clustering

Useful & Most used Clusterer

- EMUsing simple EM(Expectation Maximization) algorithm

- HierarchicalClustererUsing Agglomerative clustering algorithm

- SimpleKMeansUsing K-means clustering algorithm

Page 16: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Clustering

Example> SimpleKmean(using “KDDCup99_sample.arff”)

Page 17: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Clustering

Example> SimpleKmean(using “KDDCup99_sample.arff”)

Page 18: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Visualization

Page 19: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Select attributions

Example> CfsSubsetEval/BestFirst(using “KDDCup99_sample.arff”)

Page 20: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Select attributions

Example> CfsSubsetEval/BestFirst(using “KDDCup99_sample.arff”)

Page 21: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Association

Example> Apriori(using “KDDCup99_sample.arff”)

Page 22: Weka(DataMining Tool) - ICS Labics.ajou.ac.kr/~aislab/Weka_hands_on_practice.pdf ·  · 2014-10-22Introduction to Weka • Developed by University of Waikato (Keep being updated

Association

Example> Apriori(using “KDDCup99_sample.arff”)