Thesis Progress Presentation
İnanç Arın
09/02/2015
Extracting meaningful information from massive amounts of data.
Designing a framework for
◦ Streaming data (Twitter Java API)
◦ Storing and accessing the data efficiently (Elasticsearch)
◦ Analysis (Text Analytics, Classification Algorithms, Sentiment Analysis...)
◦ Visualization of data (Kibana, Highcharts...)
Big Data
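As a rough illustration of the storage step, the sketch below turns collected tweets into the newline-delimited JSON body that the Elasticsearch bulk endpoint accepts: one action line followed by one document line per tweet. The index name `tweets`, the helper name `to_bulk_ndjson`, and the choice of stored fields are assumptions made for this example, not details from the slides; `id_str`, `text`, and `created_at` are fields of a standard tweet object. Older Elasticsearch releases may also expect a document type in the action line, which is left out here.

```python
import json

def to_bulk_ndjson(tweets, index="tweets"):
    """Build a bulk request body: an action line, then a source line, per tweet."""
    lines = []
    for tweet in tweets:
        # Using the tweet id as the document id keeps re-indexing idempotent.
        action = {"index": {"_index": index, "_id": tweet["id_str"]}}
        # Hypothetical field selection: keep only what later analysis needs.
        source = {"text": tweet["text"], "created_at": tweet["created_at"]}
        lines.append(json.dumps(action, ensure_ascii=False))
        lines.append(json.dumps(source, ensure_ascii=False))
    # Every line of a bulk body, including the last, ends with a newline.
    return "".join(line + "\n" for line in lines)

# Invented sample tweet, for illustration only.
sample = [{
    "id_str": "1",
    "text": "tarife çok pahalı",
    "created_at": "Mon Feb 09 10:00:00 +0000 2015",
}]
payload = to_bulk_ndjson(sample)
```

The resulting string would be sent as the body of a POST to the cluster's `_bulk` endpoint. Batching many tweets per request is usually what makes indexing a stream affordable.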
Politics (parties, elections...)
Telco (Turkcell, Vodafone, Avea, TTNET...)
Social Incidents (Gezi Parkı, Soma)
Others (finance, universities...)
Domains
Politics
◦ Call Center Data
Telco (Churn analysis)
[Figure: category breakdown for two tariffs, Süper Kamu Tarifesi and NOTACTIVE Yeni Nesil 500 Tarife. Categories: likely-to-churn, price-sensitive, tarife araştırıyor (researching tariffs), N/S.]
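CHURN                                  NON-CHURN
fatura (bill)                          iyi (good)
yok (none)                             tarife (tariff)
kapatıyorum (I am closing the line)    öğrenebilir (can find out)
kapattırmak (to have it closed)        tekrar (again)
arıza (fault)                          kampanya (campaign)
pahalı (expensive)                     almak (to get)
yüksek (high)                          uygun (suitable, affordable)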
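The slides do not say how the CHURN and NON-CHURN word lists were obtained. One plausible way to get lists of this kind is to rank words by how much more often they appear in one class of call transcripts than in the other. The sketch below does this with a smoothed log-odds ratio over document frequencies. The function name `top_terms`, the smoothing constant `alpha`, and the toy transcripts are all assumptions for illustration, not the method or data behind the slide.

```python
import math
from collections import Counter

def top_terms(docs, labels, target, k=3, alpha=1.0):
    """Rank words by smoothed log-odds of occurring in `target` docs vs. the rest."""
    in_counts, out_counts = Counter(), Counter()
    n_in = n_out = 0
    for text, label in zip(docs, labels):
        # Document frequency: each word counts once per transcript.
        # Real Turkish text would first need Turkish-aware lowercasing.
        words = set(text.split())
        if label == target:
            in_counts.update(words)
            n_in += 1
        else:
            out_counts.update(words)
            n_out += 1

    def score(word):
        # Additive smoothing keeps both probabilities strictly inside (0, 1).
        p_in = (in_counts[word] + alpha) / (n_in + 2 * alpha)
        p_out = (out_counts[word] + alpha) / (n_out + 2 * alpha)
        return math.log(p_in / (1 - p_in)) - math.log(p_out / (1 - p_out))

    vocab = set(in_counts) | set(out_counts)
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(vocab, key=lambda w: (-score(w), w))[:k]

# Invented toy transcripts, for illustration only.
docs = [
    "fatura pahalı kapatıyorum",
    "fatura yüksek arıza",
    "tarife uygun kampanya",
    "kampanya iyi tarife",
]
labels = ["churn", "churn", "non-churn", "non-churn"]
churn_words = top_terms(docs, labels, "churn", k=1)
non_churn_words = top_terms(docs, labels, "non-churn", k=2)
```

On this toy input the top churn term is `fatura` and the top non-churn terms are `kampanya` and `tarife`. With real transcripts, stop-word removal and stemming would likely matter more than the exact scoring formula.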
◦ Twitter Data
◦ http://somatech.sabanciuniv.edu/operatorResults
◦ Customer Journey
Telco (Churn analysis)
“Soma” vs “Gezi Parkı”
Discovering patterns, graph analysis
Social Incidents
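The slides do not specify which graph is analyzed. As one hedged example of what graph analysis on tweets can look like, the sketch below builds a weighted hashtag co-occurrence graph: two hashtags share an edge whenever they appear in the same tweet, and the edge weight counts how often that happens. The function name `cooccurrence_edges` and the sample tweets are invented for this example.

```python
import re
from collections import Counter
from itertools import combinations

# \w is Unicode-aware in Python 3, so Turkish letters are matched as well.
HASHTAG = re.compile(r"#(\w+)")

def cooccurrence_edges(tweets):
    """Count how often each unordered pair of hashtags shares a tweet."""
    edges = Counter()
    for text in tweets:
        # Sorting gives every pair one canonical orientation.
        tags = sorted(set(HASHTAG.findall(text)))
        edges.update(combinations(tags, 2))
    return edges

# Invented toy tweets, for illustration only.
tweets = [
    "#soma #haber son durum",
    "#soma #haber #istanbul",
    "#gezi #istanbul",
]
edges = cooccurrence_edges(tweets)
```

The resulting edge list can be loaded into a graph library or a visualization tool to look for clusters of hashtags that are specific to one incident or shared between several.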
Thanks