This demo presents Woogle4MediaWiki, an extension for MediaWiki, which improves its core search functionality and introduces Wiki-style collaboration about information needs (each search query may have its own Wiki page) and result lists. Furthermore, aggregate statistics from the search logs are shown in pop-up windows attached to “red links” to guide users contributing new information.
Woogle4MediaWiki: From Searchers to Contributors
Hans-Jörg Happel, FZI Karlsruhe, Germany
2010-07-08 @ WikiSym 2010; Gdansk, Poland
Agenda
• Motivation
• Features / Demo
• Outlook & Summary
Woogle4MediaWiki: From Searchers to Contributors - WikiSym 2010; Gdansk, Poland 2
MOTIVATION
Wikis
• Web-based repositories for sharing consensual knowledge in a collaborative fashion
• In enterprises, Wikis often act as a layer of "glue" code relating content from other information systems and capturing all the rest
– Wikis are a typical entry point for information seeking
– People are often confused about why and what to put into the Wiki
There are two kinds of people in a Wiki…
Readers
• Have information needs
– But only few means for articulating such needs (e.g. search)
• Have collaboration needs
– Search is an inherently social activity (40.4% of users interact with others before and during search; 58.7% share information after search [EC08])
Contributors
• Have limited resources (time/effort) and face further barriers (motivation, privacy)
• Choices on contributions are seldom based on demand of readers (lack of guidance)
• Wiki communities stress the relevance of content (“what is worth documenting”)
• Knowledge sharing is a communication process
• Artificial decoupling of readers and contributors in Wikis
• Only a few readers become contributors
SEARCHING
• Search in MediaWiki
– No query logs
– “Red links” (links to non-existing pages) are the basic means to “demand” content
– No collaboration
Woogle4MediaWiki
• Can replace MediaWiki built-in search or act as an additional/parallel search engine (user configurable!)
• Features
– Search suggestions based on fulltext
– Can index Wikitext or rendered text
– Can index certain files (e.g. PDF)
– Shows previous search attempts based on query log
– Social search (Wiki-page per query, social ranking)
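The "search suggestions based on fulltext" feature could be approximated as follows. This is only a minimal sketch, assuming suggestions are prefix completions drawn from the indexed words and ranked by frequency; the slides do not say how Woogle computes them, and the names `build_vocabulary` and `suggest` are hypothetical.

```python
import re
from collections import Counter


def build_vocabulary(pages):
    """Count how often each lower-cased word occurs across all page texts."""
    counts = Counter()
    for text in pages.values():
        counts.update(re.findall(r"\w+", text.lower()))
    return counts


def suggest(prefix, vocabulary, limit=5):
    """Return up to `limit` indexed words starting with `prefix`, most frequent first."""
    prefix = prefix.lower()
    matches = [(word, n) for word, n in vocabulary.items() if word.startswith(prefix)]
    # Sort by descending frequency, then alphabetically for a stable order.
    matches.sort(key=lambda item: (-item[1], item[0]))
    return [word for word, _ in matches[:limit]]


# Hypothetical page texts standing in for the Wiki's fulltext index.
pages = {
    "Search": "Lucene search index for wiki search",
    "Indexing": "The indexer can index wikitext or rendered text",
}
vocabulary = build_vocabulary(pages)
print(suggest("ind", vocabulary))  # ['index', 'indexer']
```

A real index (e.g. Lucene) would serve such completions from its term dictionary rather than from an in-memory counter.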
Every query can be a Wiki page…
• Allows for
– Wiki-Text (“Social Did-You-Mean”)
– Discussion page
– Watches/Notifications (WoogleAlerts)
• Shows basic statistics about query
– “How many people searched for that?”
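The per-query statistics ("How many people searched for that?") can be illustrated with a small aggregation over a query log. This is a sketch under assumptions: the log is taken to be a list of (user, query) pairs, and queries are normalised by case and whitespace; neither detail is confirmed by the slides.

```python
from collections import defaultdict


def query_statistics(log):
    """Aggregate (user, query) log entries into per-query counts."""
    stats = defaultdict(lambda: {"searches": 0, "users": set()})
    for user, query in log:
        key = " ".join(query.lower().split())  # normalise case and whitespace
        stats[key]["searches"] += 1
        stats[key]["users"].add(user)
    # Replace the user sets by their sizes so the result is easy to display.
    return {
        query: {"searches": s["searches"], "users": len(s["users"])}
        for query, s in stats.items()
    }


# Hypothetical log entries.
log = [
    ("alice", "Lucene setup"),
    ("bob", "lucene  setup"),
    ("alice", "lucene setup"),
    ("carol", "vpn"),
]
stats = query_statistics(log)
print(stats["lucene setup"])  # {'searches': 3, 'users': 2}
```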
Social Search
• Add, remove and re-rank query results
• Add comments
• Beyond that:
– How to support truly collaborative information seeking (for researchers, lawyers etc.)?
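Adding, removing and re-ranking results can be pictured as a list of community edits replayed on top of the engine's ranking. The sketch below is illustrative only; the edit tuples and the function `apply_social_edits` are hypothetical and not taken from Woogle's code.

```python
def _clamp(position, length):
    """Keep an insert position within the bounds of the list."""
    return max(0, min(position, length))


def apply_social_edits(results, edits):
    """Replay (action, page, position) edits on an engine-ranked result list.

    `position` is only used by "add" and "move"; edits that do not apply
    (e.g. removing a page that is not listed) are ignored.
    """
    ranked = list(results)
    for action, page, position in edits:
        present = page in ranked
        if action == "remove" and present:
            ranked.remove(page)
        elif action == "add" and not present:
            ranked.insert(_clamp(position, len(ranked)), page)
        elif action == "move" and present:
            ranked.remove(page)
            ranked.insert(_clamp(position, len(ranked)), page)
    return ranked


engine_results = ["Install", "FAQ", "Changelog"]
edits = [
    ("remove", "Changelog", None),
    ("add", "Troubleshooting", 0),
    ("move", "FAQ", 0),
]
print(apply_social_edits(engine_results, edits))
# ['FAQ', 'Troubleshooting', 'Install']
```

Storing edits per query, rather than a final list, keeps the community ranking applicable when the underlying engine results change.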
Two flavours of Woogle
• Native PHP (using Zend Lucene Search)
• Can handle Wikis up to 50,000 pages (Wikipedia size) and serves normal queries in >1 sec
• Accesses Java (Lucene) backend via Web services
• Supports searching additional sources (e.g. Ticket systems, SVN, other Wiki engines)
CONTRIBUTING
Contributing in MediaWiki
• Are Wikis really contribution-friendly?
– “Red links” do not tell how much this information is required
– The “empty-page” phenomenon (how to start an article)
Demand guidance
• Woogle uses queries and further metadata to qualify desired content
• Users are informed how strong the information need is, and in which context it arises
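One way to picture demand guidance is a summary computed for a missing page from the query log: how often and by how many users its title was searched, and from which pages. This is a sketch under assumptions; the log keys `query`, `user` and `source_page` and the exact-title matching are hypothetical simplifications.

```python
from collections import Counter


def demand_for(title, query_log):
    """Summarise logged searches whose query equals a missing page's title."""
    wanted = title.lower()
    hits = [entry for entry in query_log if entry["query"].lower() == wanted]
    contexts = Counter(entry["source_page"] for entry in hits)
    return {
        "searches": len(hits),                              # strength of the need
        "users": len({entry["user"] for entry in hits}),    # distinct searchers
        "top_contexts": [page for page, _ in contexts.most_common(3)],
    }


# Hypothetical query log entries.
query_log = [
    {"query": "VPN setup", "user": "alice", "source_page": "IT Helpdesk"},
    {"query": "vpn setup", "user": "bob", "source_page": "IT Helpdesk"},
    {"query": "vpn setup", "user": "alice", "source_page": "Onboarding"},
    {"query": "printer", "user": "carol", "source_page": "IT Helpdesk"},
]
print(demand_for("VPN Setup", query_log))
# {'searches': 3, 'users': 2, 'top_contexts': ['IT Helpdesk', 'Onboarding']}
```

Such a summary is the kind of information that could be shown in a pop-up attached to a red link.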
ADDITIONAL FEATURES
Statistics
Woogle Configuration
Preferences & Instrumentation
• Preferences might be defined globally or by each individual user
• Scientific instrumentation framework
– Logs certain kinds of user interactions
– Customize system/override configuration for experimental groups
– Randomized group assignment
– Informed consent (user acceptance dialog)
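The interplay of global preferences, user preferences and experimental groups might look like the sketch below. It assumes a layering order (global, then user, then group override) and a hash-based pseudo-random assignment that keeps each user in the same group; the slides specify neither, and all names and settings here are hypothetical.

```python
import hashlib

# Hypothetical settings and experimental groups.
GLOBAL_CONFIG = {"suggestions": True, "red_link_popups": True}
GROUP_OVERRIDES = {
    "control": {"red_link_popups": False},
    "treatment": {},
}


def assign_group(user_id, groups=("control", "treatment"), salt="study-1"):
    """Pseudo-randomly but reproducibly map a user to one experimental group."""
    digest = hashlib.sha256(f"{salt}:{user_id}".encode("utf-8")).digest()
    return groups[int.from_bytes(digest[:8], "big") % len(groups)]


def effective_config(user_id, consented, user_prefs=None):
    """Layer user preferences and, after consent, group overrides on the defaults."""
    config = dict(GLOBAL_CONFIG)
    config.update(user_prefs or {})
    if consented:  # only users who accepted the dialog take part in the experiment
        config.update(GROUP_OVERRIDES[assign_group(user_id)])
    return config


print(assign_group("alice") == assign_group("alice"))  # True: assignment is stable
print(effective_config("alice", consented=False))
```

Hashing with a per-study salt avoids storing the assignment, at the cost of not being able to rebalance groups later.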
OUTLOOK & SUMMARY
Outlook
• Roadmap for Woogle4MediaWiki
– Implement social search backend
– Request feature
– Support users creating redirects
– Lightweight integration with SMW (i.e. using semantics in result snippets)
• Application of the same principles to Semantic MediaWiki (“Semantic Need” extension)
Summary
• Woogle addresses the artificial separation of information seeking and information provision in Wikis
– Offers information seekers means to express needs
– Guides contributors by aggregating demands
• Find downloads, demos, slides etc. at
– http://www.mediawiki.org/wiki/Extension:Woogle4MediaWiki
– http://www.teamweaver.org
– http://amazonas.fzi.de/wooglenative/ (Demo)
– Feedback & additional evaluation partners sought!
• Questions?
BACKUP SLIDES
(Media)Wikis can do better with Woogle
• Improve search with collaborative features
• Allow people to express information needs
• Provide a first-order representation for information needs
• Guide contributions by aggregating information needs (“need-driven knowledge sharing”)
Collaborative description of the information need
Notifications concerning search activities
Discussion page
Directly create new knowledge from the search dialog
Search has a URI and can be linked from within the Wiki (one page per search term)
Search across different data sources/types (WoogleRemote)
Additional clues with meta-information about a need
http://localhost/wiki/Woogle:Woogle4MediaWiki
Demand guidance
Literature
• [EC08] Evans, B. M. and Chi, E. H. 2008. Towards a model of understanding social search. In Proceedings of the ACM 2008 Conference on Computer Supported Cooperative Work (San Diego, CA, USA, November 08 - 12, 2008). CSCW '08. ACM, New York, NY, 485-494.
• [Hap08a] Hans-Jörg Happel: Growing the Semantic Web with Inverse Semantic Search. In Proceedings of the 1st Workshop on Incentives for the Semantic Web (INSEMTIVE '08).
• [Hap08b] Hans-Jörg Happel: Closing Information Gaps with Inverse Search. In Proceedings of the 7th International Conference on Practical Aspects of Knowledge Management (PAKM2008).
• [Hap09a] Hans-Jörg Happel: Woogle – On Why and How to Marry Wikis with Enterprise Search. In Proceedings of the 2nd Workshop on Integrated Knowledge Management Systems (IKMS2009).
• [Hap09b] Hans-Jörg Happel: Towards Need-driven Knowledge Sharing in Distributed Teams. In Proceedings of the 9th International Conference on Knowledge Management (I-KNOW 2009).
• [Hap09c] Hans-Jörg Happel: Social search and need-driven knowledge sharing in Wikis with Woogle. In Proceedings of the 5th International Symposium on Wikis and Open Collaboration (Orlando, Florida, October 25 - 27, 2009). WikiSym '09. ACM, New York, NY, 1-10.
Aggregate query & click logging
Implementation Trivia
• Search UI
– Completely intercepts MW search process; independent of MW search interfaces
• Red Links
– Realized via JavaScript/jQuery (parses red links, retrieves needs from backend batch-wise) to avoid patching MW
• Very basic implementation of access rights (via MediaWiki groups)
• Instrumentation addon
– Can re-configure most system features based on user groups
– Provides evaluation participation dialog
– Logs various user behaviour data
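The red-link handling (parse red links, then retrieve needs from the backend batch-wise) is realized in JavaScript/jQuery in Woogle itself; the logic can be sketched in Python as below. The sketch assumes red links are anchors carrying the CSS class "new", as in MediaWiki's rendered HTML, and uses a hypothetical `lookup` callable in place of the real backend request.

```python
from html.parser import HTMLParser


class RedLinkCollector(HTMLParser):
    """Collect the link texts of anchors that carry the red-link class "new"."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_red_link = False

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "a" and "new" in classes:
            self._in_red_link = True
            self.titles.append("")

    def handle_data(self, data):
        if self._in_red_link:
            self.titles[-1] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_red_link = False


def batches(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def fetch_needs(html, lookup, batch_size=2):
    """Look up the need for every red link, issuing one request per batch."""
    collector = RedLinkCollector()
    collector.feed(html)
    needs = {}
    for batch in batches(collector.titles, batch_size):
        needs.update(lookup(batch))
    return needs


# Hypothetical page fragment and backend stand-in that records its calls.
html = ('<p><a class="new" href="#">VPN setup</a>, <a href="#">FAQ</a>, '
        '<a class="new" href="#">Printer</a>, <a class="new" href="#">Backup</a></p>')
demand = {"VPN setup": 12, "Printer": 3}
calls = []


def fake_lookup(titles):
    calls.append(list(titles))
    return {title: demand.get(title, 0) for title in titles}


needs = fetch_needs(html, fake_lookup)
print(calls)  # [['VPN setup', 'Printer'], ['Backup']]
print(needs)  # {'VPN setup': 12, 'Printer': 3, 'Backup': 0}
```

Batching keeps the number of backend round trips low on pages with many red links.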
Big Picture
[Architecture diagram. Components: documents of User 1 and User 2; twIS Backend (Public Index), twIS Backend 2, twIS Backend 3; twIS Eclipse / twIS Eclipse RCP, JSF Frontend, Web Browser, Woogle; Push Indexer. Connection labels: DirectAuth, Auth via Wiki, Search, Crawl, Push Index.]
TeamWeaver Integrated Search: Simple Lucene-based toolkit for searching (crawling + Web and Eclipse frontends) different data sources (e.g. files, SVN, CVS, ticket systems, Wikis); also provides DB-based query logging and partially semantic features (metadata extraction, ontology-based working context model in Eclipse)
Inverse Search is an Eclipse-based desktop search tool for selecting local/private documents for sharing based on a TeamWeaver Integrated Search backend
Woogle is an extension for MediaWiki, which can access a TeamWeaver Integrated Search backend. It leverages query statistics to guide knowledge sharing in the Wiki. Woogle has further collaboration features upcoming (e.g. Google SearchWiki-style social ranking).