Woogle4MediaWiki - From Searchers to Contributors


DESCRIPTION

This demo presents Woogle4MediaWiki, an extension for MediaWiki, which improves its core search functionality and introduces Wiki-style collaboration about information needs (each search query may have its own Wiki page) and result lists. Furthermore, aggregate statistics from the search logs are shown in pop-up windows attached to “red links” to guide users contributing new information.


Woogle4MediaWiki From Searchers to Contributors

Hans-Jörg Happel, FZI Karlsruhe, Germany
2010-07-08 @ WikiSym 2010; Gdansk, Poland

Agenda
• Motivation
• Features / Demo
• Outlook & Summary

Woogle4MediaWiki: From Searchers to Contributors - WikiSym 2010; Gdansk, Poland 2

MOTIVATION


Wikis
• Web-based repositories for sharing consensual knowledge in a collaborative fashion
• In enterprises, Wikis often act as a layer of "glue" code relating content from other information systems and capturing all the rest
– Wikis are a typical entry point for information seeking
– People are often unsure why to contribute and what to put into the Wiki


There are two kinds of people in a Wiki…

Readers
• Have information needs
– But only few means for articulating such needs (e.g. search)
• Have collaboration needs
– Search is an inherently social activity (40.4% of users interact with others before and during search; 58.7% share information after search [EC08])

Contributors
• Have limited resources (time/effort) and face further barriers (motivation, privacy)
• Choices on contributions are seldom based on the demand of readers (lack of guidance)
• Wiki communities stress the relevance of content ("what is worth documenting")

• Knowledge sharing is a communication process
• Artificial decoupling of readers and contributors in Wikis
• Only few readers become contributors

SEARCHING


• Search in MediaWiki
– No query logs
– "Red links" (links to non-existing pages) are the basic means to "demand" content
– No collaboration


Woogle4MediaWiki
• Can replace the MediaWiki built-in search or act as an additional/parallel search engine (user configurable!)
• Features
– Search suggestions based on full text
– Can index wikitext or rendered text
– Can index certain files (e.g. PDF)
– Shows previous search attempts based on the query log
– Social search (Wiki page per query, social ranking)


Every query can be a Wiki page…
• Allows for
– Wiki text ("Social Did-You-Mean")
– Discussion page
– Watches/Notifications (WoogleAlerts)
• Shows basic statistics about the query
– "How many people searched for that?"


Social Search
• Add, remove and re-rank query results
• Add comments
• Beyond that:
– How to support truly collaborative information seeking (for researchers, lawyers etc.)?
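The add, remove and re-rank operations listed above could be modelled roughly as follows. This is a minimal sketch, not Woogle's actual implementation: the `QueryPage` record, its fields and the vote-based ordering are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class QueryPage:
    """Hypothetical per-query record of community edits to a result list."""
    added: list[str] = field(default_factory=list)        # pages users added by hand
    removed: set[str] = field(default_factory=set)        # pages users marked as irrelevant
    votes: dict[str, int] = field(default_factory=dict)   # net up/down votes per page


def social_rank(engine_results: list[str], page: QueryPage) -> list[str]:
    """Apply the community edits of one query page to the engine's result list."""
    merged = list(engine_results)
    # Append user-added pages that the engine did not return itself.
    for title in page.added:
        if title not in merged:
            merged.append(title)
    # Drop results the community removed.
    kept = [title for title in merged if title not in page.removed]
    # Stable sort: higher net votes first, ties keep the engine's order.
    return sorted(kept, key=lambda title: -page.votes.get(title, 0))
```

Because Python's sort is stable, results without votes keep the order in which the search engine returned them, so the social layer only changes what users explicitly touched.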


Two flavours of Woogle

• Native PHP (using Zend Lucene Search)
– Can handle Wikis up to 50,000 pages (Wikipedia size) and serves normal queries in >1 sec
• Accesses Java (Lucene) backend via Web services
– Supports searching additional sources (e.g. ticket systems, SVN, other Wiki engines)


CONTRIBUTING


Contributing in MediaWiki

• Are Wikis really contribution-friendly?
– "Red links" do not tell how much this information is required
– The "empty-page" phenomenon (how to start an article)


Demand guidance
• Woogle uses queries and further metadata to qualify desired content
• Users are informed how strong the information need is, and in which context it arises
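To illustrate how a query log might be turned into such demand figures, here is a small sketch. The log schema (`user`, `query`, `context`), the title normalisation and the reported fields are assumptions for illustration only; the slides do not specify how Woogle stores or aggregates its logs.

```python
from collections import Counter, defaultdict


def aggregate_demand(query_log, existing_titles):
    """Summarise unmet demand per missing page title.

    query_log: iterable of (user, query, context) tuples (hypothetical schema).
    existing_titles: set of normalised titles that already exist in the wiki.
    """
    searches = Counter()
    users = defaultdict(set)
    contexts = defaultdict(Counter)
    for user, query, context in query_log:
        # Normalise so that "Deploy  Guide" and "deploy guide" count as one need.
        title = " ".join(query.split()).casefold()
        if title in existing_titles:
            continue  # the page exists, so this query is not an unmet need
        searches[title] += 1
        users[title].add(user)
        contexts[title][context] += 1
    # Strongest needs first.
    return {
        title: {
            "searches": count,
            "distinct_users": len(users[title]),
            "top_context": contexts[title].most_common(1)[0][0],
        }
        for title, count in searches.most_common()
    }
```

A summary of this kind is what a pop-up on a red link could display: how often the missing page was searched for, by how many people, and in which context.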


ADDITIONAL FEATURES


Statistics


Woogle Configuration


Preferences & Instrumentation
• Preferences can be defined globally or by each individual user
• Scientific instrumentation framework
– Logs certain kinds of user interactions
– Customize system/override configuration for experimental groups
– Randomized group assignment
– Informed consent (user acceptance dialog)
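The interplay of global preferences, user preferences and experimental-group overrides might look like the sketch below. All names (`GLOBAL_CONFIG`, `GROUP_OVERRIDES`, the group labels, the salt) are hypothetical. A salted hash stands in for a random draw here, so that a user stays in the same group without the assignment being stored; whether Woogle assigns groups this way is not stated in the slides.

```python
import hashlib

# Hypothetical global defaults.
GLOBAL_CONFIG = {"social_ranking": True, "red_link_popups": True}

# Hypothetical experimental groups and the settings they override.
GROUP_OVERRIDES = {
    "control": {"red_link_popups": False},
    "treatment": {},
}


def assign_group(user_id: str, salt: str = "woogle-study-1") -> str:
    """Pseudo-randomly but reproducibly map a user to an experimental group."""
    digest = hashlib.sha256(f"{salt}:{user_id}".encode("utf-8")).digest()
    groups = sorted(GROUP_OVERRIDES)
    return groups[int.from_bytes(digest[:8], "big") % len(groups)]


def effective_config(user_id: str, user_prefs: dict, consented: bool) -> dict:
    """Layer settings: global defaults, then user preferences, then group overrides."""
    config = dict(GLOBAL_CONFIG)
    config.update(user_prefs)
    # Only users who accepted the consent dialog take part in the experiment.
    if consented:
        config.update(GROUP_OVERRIDES[assign_group(user_id)])
    return config
```

Applying the group overrides last reflects the "override configuration" bullet: within a study, the experimental condition wins over individual preferences, while users who did not consent keep their own settings.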


OUTLOOK & SUMMARY


Outlook
• Roadmap for Woogle4MediaWiki
– Implement social search backend
– Request feature
– Support users creating redirects
– Lightweight integration with SMW (i.e. using semantics in result snippets)
• Application of the same principles to Semantic MediaWiki ("Semantic Need" extension)


Summary
• Woogle addresses the artificial separation of information seeking and information provision in Wikis
– Offers information seekers means to express needs
– Guides contributors by aggregating demands
• Find downloads, demos, slides etc. at
– http://www.mediawiki.org/wiki/Extension:Woogle4MediaWiki
– http://www.teamweaver.org
– http://amazonas.fzi.de/wooglenative/ (Demo)
– Feedback & additional evaluation partners sought!
• Questions?


BACKUP SLIDES


(Media)Wikis can do better with Woogle
• Improve search with collaborative features
• Allow people to express information needs
• Provide a first-order representation for information needs
• Guide contributions by aggregating information needs ("need-driven knowledge sharing")


Collaborative description of the information need

Notifications concerning search activities

Discussion page

Directly create new knowledge from the search dialog

Search has a URI and can be linked from within the Wiki (one page per search term)

Search across different data sources/types (WoogleRemote)

Additional clues with meta-information about a need

http://localhost/wiki/Woogle:Woogle4MediaWiki
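The "one page per search term" idea, with a URI like the one above, can be sketched as a mapping from a query string to a page title. The `Woogle:` prefix and the base URL are taken from the example URI; the normalisation rules (collapsing whitespace, underscores for spaces, capitalised first letter, as MediaWiki does for titles by default) are assumptions about how queries might be mapped, not a description of the extension's code.

```python
from urllib.parse import quote


def query_page_title(query: str, namespace: str = "Woogle") -> str:
    """Map a search query to a wiki page title, one page per search term."""
    words = query.split()  # also collapses repeated whitespace
    if not words:
        raise ValueError("empty query")
    # MediaWiki titles use underscores instead of spaces.
    name = "_".join(words)
    # MediaWiki capitalises the first letter of a title by default.
    name = name[0].upper() + name[1:]
    return f"{namespace}:{name}"


def query_page_url(query: str, base: str = "http://localhost/wiki/") -> str:
    """Build a linkable URI for the query page (base URL is illustrative)."""
    # Keep ':' and '/' readable; percent-encode other reserved characters.
    return base + quote(query_page_title(query), safe=":/")
```

Normalising before building the title means that trivially different spellings of the same query end up on the same page, which is what lets statistics and discussion accumulate in one place.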


Demand guidance


Literature
• [EC08] Evans, B. M. and Chi, E. H. 2008. Towards a model of understanding social search. In Proceedings of the ACM 2008 Conference on Computer Supported Cooperative Work (San Diego, CA, USA, November 08 - 12, 2008). CSCW '08. ACM, New York, NY, 485-494.
• [Hap08a] Hans-Jörg Happel: Growing the Semantic Web with Inverse Semantic Search. In Proceedings of the 1st Workshop on Incentives for the Semantic Web (INSEMTIVE '08)
• [Hap08b] Hans-Jörg Happel: Closing Information Gaps with Inverse Search. In Proceedings of the 7th International Conference on Practical Aspects of Knowledge Management (PAKM2008)
• [Hap09a] Hans-Jörg Happel: Woogle – On Why and How to Marry Wikis with Enterprise Search. In Proceedings of the 2nd Workshop on Integrated Knowledge Management Systems (IKMS2009)
• [Hap09b] Hans-Jörg Happel: Towards Need-driven Knowledge Sharing in Distributed Teams. In Proceedings of the 9th International Conference on Knowledge Management (I-KNOW 2009)
• [Hap09c] Hans-Jörg Happel: Social search and need-driven knowledge sharing in Wikis with Woogle. In Proceedings of the 5th International Symposium on Wikis and Open Collaboration (Orlando, Florida, October 25 - 27, 2009). WikiSym '09. ACM, New York, NY, 1-10.


Aggregate query & click logging


Implementation Trivia
• Search UI
– Completely intercepts the MW search process; independent of MW search interfaces
• Red Links
– Realized via JavaScript/jQuery (parses red links, retrieves needs from backend batch-wise) to avoid patching MW
• Very basic implementation of access rights (via MediaWiki groups)
• Instrumentation addon
– Allows re-configuring most system features based on user groups
– Provides evaluation participation dialog
– Logs various user behaviour data
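The batch-wise retrieval mentioned for red links is done in JavaScript/jQuery in the extension; the batching idea itself is language-independent and is sketched here in Python. The `fetch_needs` callable and the batch size are hypothetical stand-ins for the backend request, which the slides do not describe.

```python
from itertools import islice


def batches(items, size):
    """Yield consecutive chunks of at most `size` items."""
    iterator = iter(items)
    while chunk := list(islice(iterator, size)):
        yield chunk


def annotate_red_links(red_link_titles, fetch_needs, batch_size=20):
    """Return {title: need info} using one backend call per batch.

    fetch_needs: hypothetical callable taking a list of titles and
    returning a dict of need information for those titles.
    """
    # A page may link to the same missing title several times; ask only once.
    unique_titles = list(dict.fromkeys(red_link_titles))
    needs = {}
    for chunk in batches(unique_titles, batch_size):
        needs.update(fetch_needs(chunk))
    return needs
```

Grouping titles this way keeps the number of backend requests proportional to the number of batches rather than to the number of red links on a page.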


Big Picture

[Architecture diagram. Components shown: documents (doc), User 1, User 2, twIS Eclipse, twIS Eclipse RCP, JSF Frontend, Web Browser, Woogle, twIS Backend (Public Index), twIS Backend 2, twIS Backend 3, Push Indexer. Connection labels: DirectAuth, Auth via Wiki, Search, Crawl, Push Index.]

TeamWeaver Integrated Search: Simple Lucene-based toolkit for searching different data sources (e.g. files, SVN, CVS, ticket systems, Wikis), with crawling plus Web and Eclipse frontends. It also provides DB-based query logging and partially semantic features (metadata extraction, ontology-based working context model in Eclipse).

Inverse Search is an Eclipse-based desktop search tool for selecting local/private documents for sharing, based on a TeamWeaver Integrated Search backend.

Woogle is an extension for MediaWiki that can access a TeamWeaver Integrated Search backend. It leverages query statistics to guide knowledge sharing in the Wiki. Further collaboration features are upcoming (e.g. Google SearchWiki-style social ranking).
