21
Collaborative Data Analysis and Multi-Agent Systems Robert W. Thomas CSCE 824 15 APR 2013

Collaborative Data Analysis and Multi-Agent Systems

  • Upload
    teness

  • View
    14

  • Download
    0

Embed Size (px)

DESCRIPTION

Collaborative Data Analysis and Multi-Agent Systems. Robert W. Thomas CSCE 824 15 APR 2013. Agenda. Problem Description Existing Research Overview Limitation of Existing Results Future Research Suggestions. Problem Description. Information Overload Divide and Conquer; Reconcile - PowerPoint PPT Presentation

Citation preview

Page 1: Collaborative Data Analysis and Multi-Agent Systems

Collaborative Data Analysisand Multi-Agent Systems

Robert W. ThomasCSCE 824

15 APR 2013

Page 2: Collaborative Data Analysis and Multi-Agent Systems

2

Agenda

• Problem Description• Existing Research Overview• Limitation of Existing Results• Future Research Suggestions

Page 3: Collaborative Data Analysis and Multi-Agent Systems

3

Problem Description

• Information Overload• Divide and Conquer; Reconcile• Recommender Systems and Social Media– Content Filtering– Collaborative Filtering– Collaborative Data Analysis through Agents

Page 4: Collaborative Data Analysis and Multi-Agent Systems

4

Content Filtering

• Recommendations based on items similar to what has been preferred previously

Page 5: Collaborative Data Analysis and Multi-Agent Systems

5

Collaborative Filtering (CF)

• Recommendations based on what others in a network prefer

• Different Techniques– Memory-Based– Model-Based– Hybrid

Page 6: Collaborative Data Analysis and Multi-Agent Systems

6

Memory-Based CF

• Similarity Computation• Prediction and Recommendation Computation• Top-N Recommendations

Page 7: Collaborative Data Analysis and Multi-Agent Systems

7

Similarity Computation

• Compares Users or Items• Correlation-Based (Pearson correlation)

• Vector Cosine-Based

Two users: u,vTwo items: i,j= items both u and v have rated= avg rating of co-rated items of the user= users who rated both i and j= avg rating of the item by those users

R = m x n user-item matrix are n dimensional vectors corresponding to i and j column of R

Page 8: Collaborative Data Analysis and Multi-Agent Systems

8

Prediction and Recommendation Computation

• Weighted Sum of Others’ Ratings

• Simple Weighted AveragePrediction P for active user a, on item i= avg rating of user u= weight between user a and user u= users who rated item i

Prediction P for user u on item i= all other rated items for user u = weight between items i and n= rating for user u on item n

Page 9: Collaborative Data Analysis and Multi-Agent Systems

9

Top-N Recommendations

• Item-Based• User-Based

Page 10: Collaborative Data Analysis and Multi-Agent Systems

10

Model-Based CF

• Bayesian Belief Net• Clustering• Regression-Based• Markov Decision Process (MDP) –Based• Latent Semantic

Page 11: Collaborative Data Analysis and Multi-Agent Systems

11

Bayesian Belief Net

• Bayesian logic – decision making and inferential statistics• Simple Bayesian

– Memory-Based

– Laplace Estimator to avoid a conditional probability of 0

• Tree Augmented naïve Bayes and naïve Bayes optimized by Extended Logic Regression (ELR)– Require extended training periods to produce results beyond

simple Bayesian and Pearson correlation

Page 12: Collaborative Data Analysis and Multi-Agent Systems

12

Clustering

• Cluster: collection of similar objects, dissimilar to objects in other clusters– Pearson correlation can be used

• Three Categories– Partitioning– Density-based– Hierarchal

• Often an Intermediate Step

Page 13: Collaborative Data Analysis and Multi-Agent Systems

13

Regression-Based

• Use approximation of ratings to make predictions against a regression model

• Apply to situations where rating vectors have large Euclidean distances but very high Similarity Computation scores

Page 14: Collaborative Data Analysis and Multi-Agent Systems

14

MDP-Based

• Sequential Optimization Problem• <S,A,R,Pr>– S = {states}– A = {actions}– R = {rewards} for r(s,a,s’)– Pr = {transition probabilities} for pr(s,a,s’)

• Partially Observable MDP (POMDP)

Page 15: Collaborative Data Analysis and Multi-Agent Systems

15

Latent Semantic

• Uses statistical modeling to discover additional communities or profiles

Page 16: Collaborative Data Analysis and Multi-Agent Systems

16

Network Trust

• We’re all mad here; I’m mad; you’re mad.• Opinions of different contacts are valued more

than others under certain conditions• Accounting for this can increase CF accuracy• Semantic Knowledge• Social Tie-Strength

Page 17: Collaborative Data Analysis and Multi-Agent Systems

17

Hybrid CF

• CF + Content-Based• CF + CF• CF + CF and/or Content-Based

Page 18: Collaborative Data Analysis and Multi-Agent Systems

18

Limitations of Existing Solutions

• Time / Accuracy Trade Offs• Noisy Data• Data Sparsity (New User)• Scalability• Synonymy• Gray Sheep• Shilling Attacks• Privacy

Page 19: Collaborative Data Analysis and Multi-Agent Systems

19

Future Research Suggestions

• Hybrids• Semantics• Trust• Parallel Processing– Multi-Agent Systems

Page 20: Collaborative Data Analysis and Multi-Agent Systems

20

BACKUP

Page 21: Collaborative Data Analysis and Multi-Agent Systems

21

References• Su, Xiaoyuan, and Taghi M. Khoshgoftaar. "A survey of

collaborative filtering techniques." Advances in Artificial Intelligence 2009 (2009): 4.

• Chen, Wei, and Simon Fong. "Social network collaborative filtering framework and online trust factors: a case study on Facebook." Digital Information Management (ICDIM), 2010 Fifth International Conference on. IEEE, 2010.

• O'Donovan, John, and Barry Smyth. "Trust in recommender systems." Proceedings of the 10th international conference on Intelligent user interfaces. ACM, 2005.