13
Hands-on-Workshop Big (Twitter) Data Damian Trilling [email protected] @damian0604 www.damiantrilling.net Afdeling Communicatiewetenschap Universiteit van Amsterdam 30 January 2014 13.15 #bigdata Damian Trilling

Analyzing social media with Python and other tools (3/4)

Embed Size (px)

Citation preview

Page 1: Analyzing social media with Python and other tools (3/4)

Hands-on-WorkshopBig (Twitter) Data

Damian Trilling

[email protected]@damian0604

www.damiantrilling.net

Afdeling CommunicatiewetenschapUniversiteit van Amsterdam

30 January 201413.15

#bigdata Damian Trilling

Page 2: Analyzing social media with Python and other tools (3/4)

In this session (3/4):

What we’ll do

1 A bunch of exercises2 If you want to, the opportunity to develop an own script

Björn and I will help you.

#bigdata Damian Trilling

Page 3: Analyzing social media with Python and other tools (3/4)
Page 4: Analyzing social media with Python and other tools (3/4)

I’ll now show you some example scripts you can use for doing theexercises and for inspiration for an own project. You find everythingyou need at http://beehub.nl/bigdata-cw/workshop.

Or in the future at https://github.com/uvacw/py-examples

#bigdata Damian Trilling

Page 5: Analyzing social media with Python and other tools (3/4)

RE exercise 1: Automated coding

See example from this morning

#bigdata Damian Trilling

Page 6: Analyzing social media with Python and other tools (3/4)

RE exercise 2: Freqencies

netvizz ⇒ engeltjes.tab ⇒ engeltjes.py ⇒ screen output +engeltjes_count.csv

something new: The package nltk and the removal of stopwords

www.nltk.org

#bigdata Damian Trilling

Page 7: Analyzing social media with Python and other tools (3/4)

RE exercise 2: Freqencies

netvizz ⇒ engeltjes.tab ⇒ engeltjes.py ⇒ screen output +engeltjes_count.csv

something new: The package nltk and the removal of stopwords

www.nltk.org

#bigdata Damian Trilling

Page 8: Analyzing social media with Python and other tools (3/4)
Page 9: Analyzing social media with Python and other tools (3/4)

RE exercise 3: Sentiment analysis

The pattern-modulepattern.nl | en | es | de | fr | it | nl

http://www.clips.ua.ac.be/pages/pattern

#bigdata Damian Trilling

Page 10: Analyzing social media with Python and other tools (3/4)
Page 11: Analyzing social media with Python and other tools (3/4)

RE exercise 4: Your own ideas

1 Have a look at the examples on beehub or github.2 Ask google.3 Ask us for advice.

#bigdata Damian Trilling

Page 12: Analyzing social media with Python and other tools (3/4)

Before you start

Common errors

indention error Pay attention to TAB and SPACE.error in line YYY Have a close look at line YYY in your editor.index out of range Maybe you want to read column 5 from a table

with 4 columns?

Try your script on a small dataset first!

#bigdata Damian Trilling

Page 13: Analyzing social media with Python and other tools (3/4)

Vragen of opmerkingen?

Damian Trilling

[email protected]@damian0604

www.damiantrilling.net

#bigdata Damian Trilling