Heterogeneous Consensus Learning via Decision Propagation and Negotiation

Jing Gao† Wei Fan‡ Yizhou Sun†Jiawei Han†

†University of Illinois at Urbana-Champaign‡IBM T. J. Watson Research Center

KDD’09 Paris, France

Information Explosion

Fan SiteDescriptions

PicturesVideos

Not only at scale, but also at available sources!

descriptions reviews

Multiple Source Classification

Image Categorization Like? Dislike? Research Area

images, descriptions, notes, comments, albums, tags…….

movie genres, cast, director, plots…….

users viewing history, movie ratings…

publication and co-authorship network, published papers, …….

Model Combination helps!

Some areas share similar keywordsSIGMOD

People may publish in relevant but different areas

There may be cross-discipline co-operations

supervised

unsupervised

Supervised or unsupervised

Motivation

• Multiple sources provide complementary information– We may want to use all of them to derive better classification

solution

• Concatenation of information sources is impossible– Information sources have different formats

– May only have access to classification or clustering results due to privacy issues

• Ensemble of supervised and unsupervised models– Combine their outputs on the same set of objects – Derive a consolidated solution– Reduce errors made by individual models– More robust and stable

Consensus Learning

Related Work

• Ensemble of Classification Models– Bagging, boosting, ……– Focus on how to construct and combine weak classifiers

• Ensemble of Clustering Models– Derive a consolidated clustering solution

• Semi-supervised (transductive) learning

• Link-based classification– Use link or manifold structure to help classification

– One unlabeled source

• Multi-view learning– Construct a classifier from multiple sources

Problem Formulation

• Principles– Consensus: maximize agreement among

supervised and unsupervised models– Constraints: Label predictions should be close

to the outputs of the supervised models

• Objective function

Consensus Constraints

NP-hard!

MethodologyStep 1: Group-level predictions

Step 2: Combine multiple models using local weights

How to propagate and negotiate?

How to compute local model weights?

Group-level Predictions (1)

• Groups:– similarity: percentage of common members– initial labeling: category information from supervised models

Group-level Predictions (2)

• Principles– Conditional probability estimates smooth over the graph– Not deviate too much from the initial labeling

[0.16 0.16 0.98]

[0.93 0.07 0]

Labeled nodes Unlabeled nodes

Local Weighting Scheme (1)

• Principles– If M makes more accurate prediction on x,

M’s weight on x should be higher

• Difficulties– “unsupervised” model combination—cannot

use cross-validation

Local Weighting Scheme (2)• Method

– Consensus• To compute Mi’s weight on x, use M1,…, Mi-1, Mi+1, …,

Mr as the true model, and compute the average accuracy

• Use consistency in x’s neighbors’ label predictions between two models to approximate accuracy

– Random• Assign equal weights to all the models

consensus random

Algorithm and Time Complexity

Compute similarity and local consistency

for each pairs of groups

for each group

iterate f steps

Compute probability estimates based on the weighted average of neighbors

Compute local weights

for each example

for each model

Combine models’ predictions using local weights

O(fcs2)

linear in the number of examples!

Experiments-Data Sets• 20 Newsgroup

– newsgroup messages categorization– only text information available

• Cora– research paper area categorization– paper abstracts and citation information available

• DBLP– researchers area prediction– publication and co-authorship network, and publication content– conferences’ areas are known

• Yahoo! Movie– user viewing interest analysis (favored movie types)– movie ratings and synopses– movie genres are known

Experiments-Baseline Methods

• Single models– 20 Newsgroup:

• logistic regression, SVM, K-means, min-cut

– Cora• abstracts, citations (with or without a labeled set)

– DBLP• publication titles, links (with or without labels from conferences)

– Yahoo! Movies• Movie ratings and synopses (with or without labels from movies)

• Ensemble approaches– majority-voting classification ensemble – majority-voting clustering ensemble– clustering ensemble on all of the four models

Experiments-Evaluation Measures

• Classification Accuracy– Clustering algorithms: map each cluster to th

e best possible class label (should get the best accuracy the algorithm can achieve)

• Clustering quality– Normalized mutual information– Get a “true” model from the groudtruth labels– Compute the shared information between th

e “true” model and each algorithm

Empirical Results -Accuracy

20 Newsgroup Cora DBLP

Empirical Results-NMI

20 Newsgroup Cora DBLP

Empirical Results-

DBLP data

Empirical Results-Yahoo! Movies

Empirical Results-Scalability

Conclusions• Summary

– We propose to integrate multiple information sources for better classification

– We study the problem of consolidating outputs from multiple supervised and unsupervised models

– The proposed two-step algorithm solve the problem by propagating and negotiating among multiple models

– The algorithm runs in linear time.– Results on various data sets show the improvements

• Follow-up Work– Algorithm and theory– Applications

Thanks!

• Any questions?

http://www.ews.uiuc.edu/~jinggao3/kdd09clsu.htm

jinggao3@illinois.edu

Office: 2119B

Heterogeneous Consensus Learning via Decision Propagation and Negotiation

Documents

Output Consensus Control for Heterogeneous Multi-Agent Systems

Expert consensus document: Cholangiocarcinoma: current ... · Cholangiocarcinoma (CCA) is a heterogeneous group of malignancies that can emerge at every point of the biliary tree,

PACES/HRS Expert Consensus Statement on the Evaluation …processes, “Currently no standard diagnostic approach exists, and management is heterogeneous.”1 The Pediatric and Congenital

Negotiating Statehood in a Hybrid Political Order: The ... · Somaliland: Negotiating Statehood in a Hybrid Political Order 725 particular negotiation process is heterogeneous in

Negotiation Documentation: Pre-negotiation Plan & the

Group consensus for heterogeneous multi-agent systems with ...xuanqi-net.com/Papers/HuNeuro14.pdf · Group consensus for heterogeneous multi-agent systems with parametric uncertainties

MARYLAND PUBLIC POLICY CONFLICT RESOLUTION FELLOWS …€¦ · verse group of influential Maryland leaders to expand their negotiation, conflict resolution, and consensus-building

Towards an Open Negotiation Architecture for Heterogeneous …ii.tudelft.nl/~catholijn/publications/sites/default/files... · 2014-08-15 · Towards an Open Negotiation Architecture

Negotiation Documentation: Pre-negotiation Plan & the ... · Negotiation Documentation: Pre-negotiation Plan ... While the applicability and ... that were established in the pre-negotiation

CONFLICT RESOLUTION NEGOTIATION AND MEDIATION · Conflict, Negotiation and Mediation The Keystone Center – 9 INTEREST-BASED CONSENSUS-BUILDING PROCESS Ł Consider your own interests

Conflict Prevention Due Diligence Negotiation & Consensus Building Strategies for Foreign-Investment Projects

Leader-Follower Pose Consensus for Heterogeneous Robot Networks with Variable …folk.ntnu.no/skoge/prost/proceedings/ifac2014/media/... · 2014-07-16 · Leader-follower Pose Consensus

Deregulation with Consensus · the consensus of the groups protected by the regulation. Such groups are often fairly large and heterogeneous, and the relevant information private

Deakin Research Online - Home - DROdro.deakin.edu.au/eserv/DU:30005808/cybulski-stakeholderbargaining... · negotiation, and consensus ... also believe perception and cognition are

Ferme, M. the Violence of Numbers Consensus, Competition, And the Negotiation of Disputes in Sierra Leone. (Cahiers d'Etudes Africaines)

Lecture 1 Negotiation skills. Contents Negotiation tactics 3 The negotiation process 1 Negotiation styles 2

NEGOTIATION & CONSENSUS-BUILDING STRATEGIES FOR FOREIGN-INVESTMENT PROJECTS: CONFLICT PREVENTION DUE DILIGENCE

The Negotiation of Multimedia Content Services in Heterogeneous Environments

Heterogeneous Missions Accessibility - Earth Online · Earth observation (EO) systems are varied in design and purpose, ... consensus standards process for geospatial technology stakeholders

CHAPTER 2 SOFTWARE REQUIREMENTSsce.uhcl.edu/helm/SWEBOK_IEEE/data/swebok_chapter_02.pdf · 1 Introduction ... consensus between heterogeneous groups of stakeholders ... ♦ Functional