26
Heinrich Hartmann [email protected] WeST Institute Related-Work.net a scientific discussion platform WeST Koblenz 21.2.2012 Heinrich Hartmann

Related-Work.net at WeST Oberseminar

Embed Size (px)

DESCRIPTION

Presentation on Related-Work.net given at WeST Institute Oberseminar

Citation preview

Page 1: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Related-Work.net

a scientific discussion platform

WeST Koblenz

21.2.2012

Heinrich Hartmann

Page 2: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Plan

Academic knowledge discovery

Vision of Related-Work.net

System details and open problems

Demo

Page 3: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Academic knowledge discovery

Finding and filtering publications

Connect people

interested in the samepaper

Page 4: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Solution: The Academic Graph

Page 5: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Problem: No Open Access!

No Open Access:* Citation data * Full-text

Page 6: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Problem: No Open Access!

No Open Access:* Citation data * Full-text

* Social information needs to be provided by community

Page 7: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Existing services have shortcomings

Citation Data Community Open Source/Data

Google Scholar Yes No No/No

Microsoft Academic Yes No No/No

SciVerse (Elsevier) Yes No No/No

Mendeley Yes Yes No/No

ResearchGate Yes Yes No/No

CiteSeerX Yes (quality?) No Yes/Yes (broken)

dblp No No -/Yes

Bibsonomy No Yes Yes/Yes

Related-Work.net Yes Yes Yes

Page 8: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Plan

Academic knowledge discovery

Vision of Related-Work.net

System details and open problems

Demo

Page 9: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Vision for Related-Work.net

Social community for scientists

Open database of papers and citations

Free software

Strong data mining:

Recommender, Auto completion, News feed

Page 10: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Vision of Related-Work.net

Page 11: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

History

March '12 Idea Heinrich & Rene: write proposal

Sept '12 Heinrich quit Maths Writes Citation extraction for Arxiv.org Networking in Oxford (Akorn, OpenCitations)

Dec '12 Merger with OpenCitationsCorpus by David Shotton JISC Grant: Cottage Labs GWTP protoype development

Page 12: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Team: Related-Work.net / OpenCitations.net

RenéPickhardt

Mathematics / Computer Science

co-founder RW.net

metalcon.de

HeinrichHartmann

Mathematics / Computer Science

co-founder RW.net

David Shotton

Oxford Zoologist

OpenCitations.netCiTO/SPAR Ontologies

JISC Grant:Cottage Labs

Page 13: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Open Citations and Semantic Publishing

Page 14: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Cottage Labs

Richard Jones

Mark MacGillivray

Martyn Whitewell

Page 15: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Plan

Academic knowledge discovery

Vision of Related-Work.net

System details and open problems

Demo

Page 16: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Data Ingest Pipeline

Page 17: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Example Matching Problem

A.G. Bashkirov, Physica A 340 , 153 (2000)→ very little information

Robert L. Pego and Michael I. Weinstein. Eigenvalues, and instabilities of solitary waves. Philos. Trans. Roy. Soc. London Ser. A , 340(1656):47--94, 1992. → Not Arxiv

D. V. Shirkov and I. L. Solovtsov, Theor. Math. Phys. 150 , 132 (2007) arXiv:hep-ph/0611229 .→ found ID: hep-ph/0611229

G.J. Galloway, Maximum principles for null hypersurfaces and null splitting theorems , Journal APPT 1 543-567 2000 .→ year author title heuristic: math/9909158

Page 18: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Example Matching Problem

A.G. Bashkirov, Physica A 340 , 153 (2000)→ very little information

Robert L. Pego and Michael I. Weinstein. Eigenvalues, and instabilities of solitary waves. Philos. Trans. Roy. Soc. London Ser. A , 340(1656):47--94, 1992. → Not Arxiv

D. V. Shirkov and I. L. Solovtsov, Theor. Math. Phys. 150 , 132 (2007) arXiv:hep-ph/0611229 .→ found ID: hep-ph/0611229

G.J. Galloway, Maximum principles for null hypersurfaces and null splitting theorems , Journal APPT 1 543-567 2000 .→ year author title heuristic: math/9909158

Extracted16 Mio. citation strings

only 2 Mio. currently matched!

Page 19: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Author Identification Problem

Two authors w. same name

Author changes name

Approaches:

Official author IDs (ORCID / Arxiv / PMC )

Graph Mining

Email addresses (!)

Page 20: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Email addresses are in the FullText!

Low Redshift QSO Lyman alpha Absorption Line Systems Associated with Galaxies

W.P. Lin G. Boerner H.J. Mo

[email protected] [email protected] [email protected] [email protected]

Found750.000

Email addresses

Page 21: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Standard Architecture for the Front End

Page 22: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Data Mining Examples 1

Page 23: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Data Mining Example 2

Page 24: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Data Mining Example 3

Page 25: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Open Problems

Crowdsourcing citation dataImprove matching algorithms (by ML? ActiveLearning?)

Community building / policy modeling

Add further data sources (cur. ArXiv/PMC/CrossRef)

→ Automated schema detection

Page 26: Related-Work.net at WeST Oberseminar

Heinrich [email protected]

WeST Institute

Plan

Academic knowledge discovery

Vision of Related-Work.net

System details and open problems

Demo