Upload
ram-pratap-singh
View
175
Download
6
Embed Size (px)
Citation preview
Clustering Technique & its Products
Presented By :Shikha Mishra-142Sonal Pal-149Vikram Singh-292
Clustering
It is the task of assigning a set of objects into groups (called clusters) so that the objects in the same cluster are more similar (in some sense or another) to each other than to those in other clusters.
Difference
O Freeware
O Shareware
O Commercial Software
WEKAO Waikato Environment for Knowledge
Analysis is a popular suit for machine learning software written in Java.
O Weka is a free software available under the GNU general public license.
Advantages of WekaO Free availability under the GNU general
public license.O Portability, since it is fully implemented
in the java programming language and thus runs on almost any modern computing platform.
O A comprehensive collection of data processing and modeling techniques.
O Ease of use due to its graphical user interfaces.
KEY FEATURES OF WEKA
Weka supports several standard data mining task-
Data processing.ClusteringClassificationRegression
VisualizationFeature selectionAccess to SQL databases using
JAVA database connectivity.It is not capable of multi-relational
data mining but there is separate software for converting a collection of linked database tables into a single table that is suitable for processing using Weka.
Weka productWeka product
Different algorithms for data mining and machine learning
Different algorithms for data mining and machine learning
Easily useable Easily useable
Platform-independe
nt
Platform-independe
nt
Open source and freely available
Open source and freely available
Flexible facilities for
scripting experiment
Flexible facilities for
scripting experiment
WEKA INTERFACE-O Explorer : An environment for
exploring data with WEKA .
O Experimenter : An environment for performing experiments and conducting statistical tests between learning schemes.
OKnowledge Flow : This environment supports essentially the same functions as the Explorer but with a drag-and-drop interface. One advantage is that it supports incremental learning.
OSimple CLI : Provides a simple command-line interface that allows direct execution of WEKA
commands for operating systems that do not provide their own command line interface.
ARFF FILEAttribute Relationship File Format
(ARFF) is the text format file used by weka to store data in data base.
The ARFF file contains two sections: the header and the data section. The first line of the header tells us the relation name. Then there is the list of the attributes (@attribute...).
BMW CLUSTER DATA IN WEKA
BMW CLUSTER ALGORITHM
CLUSTER ATTRIBUTES
THANK YOU