16
Multi-Relational Data Mining: An Introduction Joe Paulowskey

Multi-Relational Data Mining: An Introduction

  • Upload
    lali

  • View
    22

  • Download
    0

Embed Size (px)

DESCRIPTION

Multi-Relational Data Mining: An Introduction. Joe Paulowskey. Overview. Introduction to Data Mining Relational Data Patterns Inductive Logic Programming (ILP) Relational Association Rules Relational Decision Trees Relation Distance-Based Approaches. Relation Data. Relational Database - PowerPoint PPT Presentation

Citation preview

Page 1: Multi-Relational Data Mining: An Introduction

Multi-Relational Data Mining: An Introduction

Joe Paulowskey

Page 2: Multi-Relational Data Mining: An Introduction

Overview

Introduction to Data Mining Relational

Data Patterns

Inductive Logic Programming (ILP) Relational Association Rules Relational Decision Trees Relation Distance-Based Approaches

Page 3: Multi-Relational Data Mining: An Introduction

Relation Data

Relational DatabaseMultiple TablesDefined

Views Tables

Page 4: Multi-Relational Data Mining: An Introduction

Relational Pattern

Multiple Relations from a relational databaseMore Expressive

Opens upClassificationAssociationRegression

Page 5: Multi-Relational Data Mining: An Introduction

Relational Pattern (Cont.)

Expressed in Subsets of First Order Logic

Page 6: Multi-Relational Data Mining: An Introduction

Data Mining

Look for patterns in data What do you discover?

Associations Sequences Classifications

Goals of Data Mining Predict Identify Classify Optimize

Uses Business Data Environmental/Traffic

Engineering Web Mining Drug Design

Page 7: Multi-Relational Data Mining: An Introduction

Data Mining: Relational Databases Most Data Mining approaches deal with

single tablesNot safe to merge multiple tables into one

single table Number of patterns increases

Explicit constraints required

Page 8: Multi-Relational Data Mining: An Introduction

Inductive Logic Programming (ILP)

Logic Programs used to find patterns Clauses

Head and BodyLiteralsTypes

Definite Program

Page 9: Multi-Relational Data Mining: An Introduction

ILP (Cont)

PredicateRelations in relational databaseArguments -> Attributes

Attributes are Typed

Database Clauses are typed program clauses

Deductive Database

Page 10: Multi-Relational Data Mining: An Introduction

Relational Rule Induction ILP

Learn logical definitions of relations Classification

Rules can be found by decision treesSimple Algorithm

Dealing with noisy/incomplete data

Page 11: Multi-Relational Data Mining: An Introduction

ILP Problems to Propositional Forms Propositional

attribute-value Use Single Table Data Mining algorithms LINUS

Background Knowledge

Page 12: Multi-Relational Data Mining: An Introduction

ILP/RDM Algorithms

ShareLearning as a Search Paradigm

DifferencesRepresentation of Data, PatternsRefinement operatorsTesting Coverage

Upgrading from Propositional to Relational

Page 13: Multi-Relational Data Mining: An Introduction

Relational Association Rules

Frequent PatternsDetermining Frequency Itemsets

Association RulesObtained by frequent itemsets

Page 14: Multi-Relational Data Mining: An Introduction

Relational Decision Trees

Used for Prediction Binary Trees First Order Decision List

Page 15: Multi-Relational Data Mining: An Introduction

Relational Distance-Based Approaches Calculated distance between two objects Statistical Approaches

Page 16: Multi-Relational Data Mining: An Introduction

Conclusion