Data Mining for Libraries: What are the Possibilities? Elaine M. Lasda Bergman, MLS Twitter: @ElaineLibrarian [email protected] Subject Librarian for Social Welfare University at Albany, SUNY SUNYLA Midwinter Conference January 30, 2015


Page 1: Data Mining for Libraries

Data Mining for Libraries: What are the Possibilities?

Elaine M. Lasda Bergman, MLS
Twitter: @ElaineLibrarian

[email protected]

Subject Librarian for Social Welfare
University at Albany, SUNY

SUNYLA Midwinter Conference
January 30, 2015

Page 2: Data Mining for Libraries

What is Data Mining?

http://pixabay.com/en/helmet-mine-mining-headgear-155632/

Page 3: Data Mining for Libraries

Knowledge Discovery In Databases (KDD)

Input data → Data preprocessing → Data mining → Postprocessing → Information

Adapted from Tan, et al. (2006), p.3

Page 4: Data Mining for Libraries

A note about data collection

• It’s the kicker: GIGO (garbage in, garbage out)

• Cleaning

• Preprocessing
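The cleaning and preprocessing bullets above can be sketched in a few lines of plain Python. This is only an illustration of the idea, not a Weka workflow: the field names (`status`, `sessions`) and the sample rows are hypothetical, and real survey data would likely need more rules than these.

```python
def clean_records(records, required_fields):
    """Return cleaned copies of records, dropping incomplete or duplicate rows."""
    cleaned = []
    seen = set()
    for record in records:
        # Normalize: trim whitespace and lowercase every string value.
        row = {
            key: value.strip().lower() if isinstance(value, str) else value
            for key, value in record.items()
        }
        # Drop rows where a required field is missing or empty.
        if any(row.get(field) in (None, "") for field in required_fields):
            continue
        # Drop exact duplicates (compared after normalization).
        fingerprint = tuple(sorted(row.items()))
        if fingerprint in seen:
            continue
        seen.add(fingerprint)
        cleaned.append(row)
    return cleaned


# Hypothetical survey rows, invented for this example.
raw = [
    {"status": " Part Time ", "sessions": 2},
    {"status": "part time", "sessions": 2},     # duplicate once normalized
    {"status": "", "sessions": 1},              # missing status
    {"status": "On Campus", "sessions": None},  # missing session count
]
result = clean_records(raw, required_fields=["status", "sessions"])
print(result)
```

Only the first row survives here, which is the GIGO point in miniature: three of the four rows would have added noise to whatever is mined next.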

Page 5: Data Mining for Libraries

What is Weka?

http://www.cs.waikato.ac.nz/ml/weka/

Page 6: Data Mining for Libraries

Weka for Prediction

Mackenzie, Ian: https://www.flickr.com/photos/madmack/165933656/

Page 7: Data Mining for Libraries

Decision Tree From Weka

Page 8: Data Mining for Libraries

[Figure: decision tree. Likelihood of graduate students using library resources, based on survey questions]

Questions (nodes) and answers (branches) shown in the tree:

• Did student use email/IM reference: Yes, No

• Did student receive instruction: 0 sessions, 1-2 sessions, 3+ sessions

• Time between grad/undergrad: None, 1-5 years, 5+ years

• Student’s residency status: On campus full time, Off campus full time, Part time

Outcomes shown at the leaves: 100% yes, 45% yes, 100% yes
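One way a tree like this can be grown is by repeatedly picking the survey question that best separates the "yes" answers from the "no" answers. The slide does not say which Weka algorithm or settings produced the tree, so the following is only a minimal sketch of an information-gain split in plain Python, not Weka's implementation. The field names (`email_ref`, `residency`, `uses_library`) and the four rows are made up for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Reduction in entropy of `target` after splitting rows on `attribute`."""
    base = entropy([row[target] for row in rows])
    remainder = 0.0
    for value in {row[attribute] for row in rows}:
        subset = [row[target] for row in rows if row[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return base - remainder


def best_split(rows, attributes, target):
    """Pick the attribute with the highest information gain."""
    return max(attributes, key=lambda a: information_gain(rows, a, target))


# Hypothetical survey responses, invented for this example.
rows = [
    {"email_ref": "yes", "residency": "on campus", "uses_library": "yes"},
    {"email_ref": "yes", "residency": "part time", "uses_library": "yes"},
    {"email_ref": "no", "residency": "on campus", "uses_library": "no"},
    {"email_ref": "no", "residency": "part time", "uses_library": "no"},
]
root = best_split(rows, ["email_ref", "residency"], "uses_library")
print(root)
```

In this toy data the email/IM question separates the two outcomes completely, so it would become the root; residency tells us nothing and would not be chosen.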

Page 9: Data Mining for Libraries

Weka for Classification

http://www.geograph.org.uk/photo/971476

Page 10: Data Mining for Libraries

Animal Clusters
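To give a feel for how items end up in groups, here is a minimal k-means sketch in plain Python. The slide does not name the algorithm behind the animal clusters, so k-means is an assumption chosen for illustration. The two features (number of legs, weight in kg) and the four data points are invented.

```python
def kmeans(points, k, iterations=10):
    """Group points into k clusters; returns a list of k lists of points."""
    # Naive, deterministic start: use the first k points as centroids.
    centroids = [points[i] for i in range(k)]
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        # Assignment step: each point joins its nearest centroid.
        for p in points:
            nearest = min(
                range(k),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])),
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [
            tuple(sum(dim) / len(c) for dim in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return clusters


# Hypothetical animals as (legs, weight in kg), invented for this example.
animals = [(2, 0.03), (4, 30.0), (2, 0.05), (4, 25.0)]
clusters = kmeans(animals, k=2)
print(clusters)
```

Nobody told the code which animals belong together; the groups fall out of the numbers. Note that weight dominates the distance here because the features are not scaled, which is one more reason preprocessing matters.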

Page 11: Data Mining for Libraries

Weka for Association Analysis

http://analytics-arena.blogspot.com/2012/12/the-famous-beer-diaper-planogram.html

Page 12: Data Mining for Libraries

Association Rules
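Association rules are usually judged by two numbers: support (how often the items appear together) and confidence (how often the rule holds when its left side is present). The sketch below computes both in plain Python. It is not Weka's output format, and the checkout "baskets" of subject areas are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= set(t) for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Of the transactions containing antecedent, the share that also contain consequent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical checkout records, one set of subjects per patron visit.
checkouts = [
    {"statistics", "research methods"},
    {"statistics", "research methods", "social policy"},
    {"statistics"},
    {"social policy"},
]
rule_support = support(checkouts, {"statistics", "research methods"})
rule_confidence = confidence(checkouts, {"statistics"}, {"research methods"})
print(rule_support, rule_confidence)
```

Read the rule as "patrons who borrow statistics titles also tend to borrow research methods titles": it covers half of all visits, and it holds for two out of three visits that include a statistics title.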

Page 13: Data Mining for Libraries

(Anomaly Detection)

https://www.flickr.com/photos/fonalite/2780198933/
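A very simple form of anomaly detection flags values that sit unusually far from the average. The sketch below uses a z-score with a cutoff of 2 standard deviations; both the method and the cutoff are assumptions for illustration, since the slide does not specify a technique. The daily counts are made up.

```python
from statistics import mean, pstdev


def find_anomalies(values, threshold=2.0):
    """Return values whose z-score magnitude exceeds the threshold."""
    center = mean(values)
    spread = pstdev(values)
    if spread == 0:
        # All values are identical, so nothing stands out.
        return []
    return [v for v in values if abs(v - center) / spread > threshold]


# Hypothetical daily reference-desk question counts, invented for this example.
daily_questions = [10, 12, 11, 13, 12, 95]
anomalies = find_anomalies(daily_questions)
print(anomalies)
```

The flagged day is not necessarily an error. It might be a data entry mistake, or it might be the day a large class assignment was due, which is exactly the kind of thing worth investigating.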

Page 14: Data Mining for Libraries

How Can Libraries Use Data Mining?

http://dlg.galileo.usg.edu/dahlonega/dahlonega_logo.jpg

Page 15: Data Mining for Libraries

Circling Back: It All Starts With Data Collection

http://www.navigatingthetension.com/2012/02/circle-wagons.html

Page 16: Data Mining for Libraries

Questions?

Me: Elaine Lasda Bergman, Subject Librarian for Social Welfare, University at Albany
email: [email protected]
Twitter: @ElaineLibrarian

Resources used: Tan, P. et al. (2006). Introduction to Data Mining. Boston: Pearson Education, Inc.

Newton, et al. (2012). Your Statistical Consultant: Answers to Your Data Analysis Questions. Thousand Oaks: SAGE Publications.

Two good Weka tutorials: http://www.cs.ccsu.edu/~markov/weka-tutorial.pdf

http://www.uh.edu/~smiertsc/4397cis/WEKA_Data_Mining_Tool.pdf

Data Mining for the Masses: https://rapidminer.com/wp-content/uploads/2013/10/DataMiningForTheMasses.pdf