Jonathas Magalhães 2 , Rubens Pessoa, Cleyton Souza, Evandro Costa, Joseana Fechine The 2014 RecSys Challenge [1] consists of ordering tweets shared by users on IMDb according to the amount of interaction that they received. The interaction of a tweet is defined by the sum of the number of retweets and favorites that it received.Our objective is to present a contestant approach to the 2014 RecSys Challenge. INTRODUCTION 1 More information at http://www.grouptips.org. 2 Corresponding author, e-mail: [email protected]. RECSYS CHALLENGE 2014 FEDERAL UNIVERSITY OF CAMPINA GRANDE FEDERAL UNIVERSITY OF ALAGOAS Intelligent, Personalized and Social Technologies Group 1 A RECOMMENDER SYSTEM FOR PREDICTING USER ENGAGEMENT IN TWITTER [1] A. Said, S. Dooms, B. Loni, and D. Tikk. Recommender systems challenge 2014. In Proceedings of the eighth ACM conference on Recommender systems, RecSys ’14, New York, NY, USA, 2014. ACM. [2] S. Dooms, T. De Pessemier, and L. Martens. Movietweetings: a movie rating dataset collected from twitter. In Workshop on Crowdsourcing and Human Computation for Recommender Systems, CrowdRec at RecSys 2013, 2013. REFERENCES We use two datasets: ● The expanded MovieTweetings dataset [2] distributed by the organizers of the challenge, with the following attributes: movie id, movie rating, crawled time, tweet time, followers count, statuses count, favourites count and engagement. ● The IMDb dataset which consists of additional information about movies referenced by tweets in order to complement the MovieTweetings dataset, with the following attributes: IMDb rating, IMDb votes count, Movie year. COMPOSING AND PRE-PROCESSING THE DATASET In this work we use three different regressors: Linear Regression, Pace Regression and induction model trees algorithm M5Base that is an extension of the Quinlan’s algorithm to the regression task. Table 2: Regression models and their parameters. Besides the models presented in Table 2, we implemented three methods to combine them: Average, Median and Ranking. REGRESSION STEP Our approach is divided into three steps: ● Classification; ● Regression and; ● Ordering Results. In the classification and regression steps we use the Weka API to train the models. Figure 1: Overview of the Recommender System. OVERVIEW OF THE RECOMMENDER SYSTEM We use three classifiers, Naïve Bayes, Support Vector Machines (SVM) and the Nearest Neighbor algorithm Ibk. Table 1: Classification models and their parameters. We also implement a classifier that combine them using Voting. In other words, an instance will be classified in a given class if it has obtained the required majority of the models presented. CLASSIFICATION STEP Table 3 summarizes the factors and the levels used in each one. Considering the factors and levels used, we have an experimental design with 2 * 7 * 9 = 126 treatments without replication. We use the metric normalized Discounted Cumulative Gain (nDCG) to compare the methods. Table 3: Experimental factors and their levels. METHODOLOGY Table 4 presents the NDCG@10 results of the ten best configurations of our approach. Table 4: The nDCG@10 of the 10 best configurations. RESULTS

A Recommender System for Predicting User Engagement in Twitter

Download PDF Report

Upload
jonathas-magalhaes
View
83
Download
1

Embed Size (px)

Citation preview

Page 1: A Recommender System for Predicting User Engagement in Twitter

Jonathas Magalhães2, Rubens Pessoa, Cleyton Souza, Evandro Costa, Joseana Fechine

The 2014 RecSys Challenge [1] consists of ordering tweets shared by users on IMDb according to the amount of interaction that they received. The interaction of a tweet is defined by the sum of the number of retweets and favorites that it received.Our objective is to present a contestant approach to the 2014 RecSys Challenge.

INTRODUCTION

1 More information at http://www.grouptips.org. 2 Corresponding author, e-mail: [email protected].

RECSYS CHALLENGE 2014FEDERAL UNIVERSITY OF CAMPINA GRANDE

FEDERAL UNIVERSITY OF ALAGOASIntelligent, Personalized and Social Technologies Group1

A RECOMMENDER SYSTEM FOR PREDICTINGUSER ENGAGEMENT IN TWITTER

[1] A. Said, S. Dooms, B. Loni, and D. Tikk. Recommender systems challenge 2014. In Proceedings of

the eighth ACM conference on Recommender systems, RecSys ’14, New York, NY, USA, 2014. ACM.

[2] S. Dooms, T. De Pessemier, and L. Martens. Movietweetings: a movie rating dataset collected from

twitter. In Workshop on Crowdsourcing and Human Computation for Recommender Systems,

CrowdRec at RecSys 2013, 2013.

REFERENCES

We use two datasets:

● The expanded MovieTweetings dataset [2] distributed by the organizers of the

challenge, with the following attributes: movie id, movie rating, crawled time, tweet

time, followers count, statuses count, favourites count and engagement.

● The IMDb dataset which consists of additional information about movies

referenced by tweets in order to complement the MovieTweetings dataset, with

the following attributes: IMDb rating, IMDb votes count, Movie year.

COMPOSING AND PRE-PROCESSING THE DATASET

In this work we use three different regressors: Linear Regression, Pace Regression

and induction model trees algorithm M5Base that is an extension of the Quinlan’s

algorithm to the regression task.

Table 2: Regression models and their parameters.

Besides the models presented in Table 2, we implemented three methods to combine them: Average, Median and Ranking.

REGRESSION STEP

Our approach is divided into three steps:

● Classification;

● Regression and;

● Ordering Results.

In the classification and regression steps we use the Weka API to train the models.

Figure 1: Overview of the Recommender System.

OVERVIEW OF THE RECOMMENDER SYSTEM

We use three classifiers, Naïve Bayes, Support Vector Machines (SVM) and the

Nearest Neighbor algorithm Ibk.

Table 1: Classification models and their parameters.

We also implement a classifier that combine them using Voting. In other words, an

instance will be classified in a given class if it has obtained the required majority of

the models presented.

CLASSIFICATION STEP

Table 3 summarizes the factors and the levels used in each one. Considering the

factors and levels used, we have an experimental design with 2 * 7 * 9 = 126

treatments without replication. We use the metric normalized Discounted Cumulative

Gain (nDCG) to compare the methods.

Table 3: Experimental factors and their levels.

METHODOLOGY

Table 4 presents the NDCG@10 results of the ten best configurations of our approach.

Table 4: The nDCG@10 of the 10 best configurations.

RESULTS

Thought Bubbles: a conceptual prototype for a Twitter based recommender system for research 2.0

Documents

On Predicting Election Results using Twitter and Linked ... Predicting Election Results... · 3 The Framework for Predicting Elections In this section we describe in detail the framework

Documents

Predicting retail website anomalies using Twitter datacs229.stanford.edu/proj2012/Farren-PredictingRetail...1 Predicting retail website anomalies using Twitter data Derek Farren [email protected]

Documents

US-13-Sumner-Predicting-Susceptibility-to-Social · PDF filePredicting Susceptibility to Social Bots on Twitter ... Welcome to Predicting Susceptibility to Social ... existing social

Documents

Predicting Susceptibility to Social Bots on Twitter

Documents

Predicting Bitcoin price fluctuation with Twitter …1110776/FULLTEXT01.pdfDEGREE PROJECT IN TECHNOLOGY, FIRST CYCLE, 15 CREDITS STOCKHOLM , SWEDEN 2017 Predicting Bitcoin price fluctuation

Documents

Predicting Cancer Drug Response Using a Recommender System · Predicting Cancer Drug Response Using a Recommender System Chayaporn Supahvilai1,2, Denis Bertrand2 and Niranjan Nagarajan2*

Documents

Predicting Flu Trends using Twitter Data

Documents

Predicting the Political Alignment of Twitter Users - CNetScnets.indiana.edu/wp-content/uploads/conover_prediction_socialcom... · Predicting the Political Alignment of Twitter Users

Documents

PERSONALIZED RECOMMENDER SYSTEM ON WHOM TO FOLLOW IN TWITTER · Twitter is a directed social network. In Facebook, the friendship link is bidirectional whereas, in Twitter, the link

Documents

Predicting Missing Ratings in Recommender Systems ...joans/journals/09 IJEC Predicting missing...Predicting Missing Ratings in Recommender Systems: Adapted Factorization Approach Carme

Documents

Predicting and Interpolating State-level Polling using ...kenbenoit.net/pdfs/NDATAD2013/Beauchamp_twitterpolls_2.pdf · Predicting and Interpolating State-level Polling using Twitter

Documents

Clinical Recommender System: Predicting Medical Specialty ...Clinical Recommender System: Predicting Medical Specialty Diagnostic Choices with Neural Network Ensembles Morteza Noshad1,

Documents

Predicting performance in Recommender Systems - Poster

Technology

Predicting Potential Responders in Twitter: A Query Routing Algorithm

Technology

Scalable Optimization Algorithms for Recommender Systems · Recommender systems have now gained signiﬁcant popularity and been widely used in many e-commerce applications. Predicting

Documents

A network based model for predicting a hashtag break out in twitter

Data & Analytics

Predicting Personality from Twitter 1 Predicting Personality with Social Media 2 Jennifer Golbeck, Cristina Robles, Michon Edmondson 1, Karen Turner SocialCom

Documents

Predicting the Demographics of Twitter Users from Website …cs.tulane.edu/~aculotta/pubs/culotta15predicting.pdf · 2020. 5. 19. · Predicting the Demographics of Twitter Users

Documents

Predicting Susceptibility to Social Bots in Twitter

Social Media

Personalized News Recommender using Twitter

Documents

Predicting election trends with Twitter: Hillary Clinton ... · Predicting election trends with Twitter: Hillary Clinton versus Donald Trump Alexandre Bovet, Flaviano Morone, Hern

Documents

Predicting opinion leadership on twitter

Education

Predicting American Idol with Twitter Sentiment0 Predicting American Idol with Twitter Sentiment This document is written by Sivan Alon, Simon Perrigaud, and Meredith Neyrand, who

Documents

A Review on a Semantic Recommender System for IT · PDF filejokes, hotels, financial services,[4] life insurance, persons and twitter followers[5]. Recommender systems [1], which are

Documents

Recommender Systems Recommender Systems

Documents

Personalized recommender system on whom to follow in Twitter

Documents

Predicting Bitcoin price fluctuation with Twitter ...1110776/FULLTEXT01.pdf · Predicting Bitcoin price fluctuation with Twitter sentiment analysis ... indicate a price change in

Documents

Defining and Predicting the Localness of Volunteered ...Geographic Information and Twitter The vast majority of prior work on Twitter, including most localness work, has used Twitter

Documents

CSE 258 Web Mining and Recommender Systems · predicting and ranking likely purchases) Week 4: Recommender Systems ... • One assignment on recommender systems (after week 5), worth

Documents