Upload
elaine-lasda
View
119
Download
0
Embed Size (px)
Citation preview
Data Mining for Libraries:What are the Possibilities?
Elaine M. Lasda Bergman, MLSTwitter: @ElaineLibrarian
Subject Librarian for Social WelfareUniversity at Albany, SUNY
SUNYLA Midwinter ConferenceJanuary 30, 2015
What is Data Mining?
http://pixabay.com/en/helmet-mine-mining-headgear-155632/
Knowledge Discovery In Databases (KDD)
Input dataData
PreprocessingData Mining Postprocessing Information
Adapted from Tan, et al. (2006), p.3
A note about data collection
• It’s the kicker: GIGO
• Cleaning
• Preprocessing
What is Weka?
http://www.cs.waikato.ac.nz/ml/weka/
Weka for Prediction
Mackenzie, Ian: https://www.flickr.com/photos/madmack/165933656/
Decision Tree From Weka
Did Student use Email/IM
reference
Did student Receive
instruction
0 sessions1-2 session
Time between grad/undergrad
1-5 years
100% yes
None
45% yes
5+ years
100% yes
3+ sessions
Student’ s residency
status
On campus full time
Off campus full time Part time
Likelihood of graduate students
using library resources based on survey questions
YesNo
Weka for Classification
http://www.geograph.org.uk/photo/971476
Animal Clusters
Weka for Association Analysis
http://analytics-arena.blogspot.com/2012/12/the-famous-beer-diaper-planogram.html
Association Rules
(Anomaly Detection)
https://www.flickr.com/photos/fonalite/2780198933/
How Can Libraries Use Data Mining?
http://dlg.galileo.usg.edu/dahlonega/dahlonega_logo.jpg
Circling Back: It All Starts With Data Collection
http://www.navigatingthetension.com/2012/02/circle-wagons.html
Questions?
Me:Elaine Lasda Bergman, Subject Librarian for Social Welfare, University at Albanyemail: [email protected]: @ElaineLibrarian
Resources used: Tan, P. et al. (2006). Introduction to Data Mining. Boston: Pearson Education, Inc.
Newton, et al. (2012). Your Statistical Consultant: Answers to Your Data Analysis Questions. Thousand Oaks: SAGE Publications.
Two good Weka Tutorials:http://www.cs.ccsu.edu/~markov/weka-tutorial.pdf
http://www.uh.edu/~smiertsc/4397cis/WEKA_Data_Mining_Tool.pdf
Data Mining for the Masses:https://rapidminer.com/wp-content/uploads/2013/10/DataMiningForTheMasses.pdf