Data Analysis in the Hebrew Bible

DATA ANALYSIS INTHE HEBREW BIBLE

CLIN 2014-01-17Dirk Roorda (DANS/TLA), Martijn Naaijer and Gino Kalkman (VU ETCBC)

RESEARCH @

just started

EXEGESIS

preaching the word of God

the devil is in the details

meanings of specific words

DISTANT READING

scan large quantities of text

find patterns

signals in the noise

study other aspects than meaning

text transmission

linguistic variation

literary form

VARIATION IN BIBLICAL HEBREW

Timespan of Hebrew Bible writing: ~1000 years

Assumption: we can divide the books in 2 groups

EBH (early biblical Hebrew)

LBH (late biblical Hebrew)

"PROOF"

Select some features that differ for EBH and LBH

Risk of circularity

We need data analysis that is

comprehensive (not eclectic)

critical (not everything is a signal)

SYNTACTIC VARIATION

syntactic features

phrase, clause, text

large units

chapters

books

drivers of change

diachrony

geography

demography

variation

THE HEBREW BIBLE AS DATA

THE HEBREW BIBLE IN LAF

LAF ISO 24612:2012

SHEBANQ (github)

2.27 GB

1.5 M nodes

1.5 M edges

40 M features

400 K words

13 M XML ids

http://www.iso.org/iso/catalogue_detail.htm?csnumber=37326

http://wivu2laf.readthedocs.org/

PROCESSING LAF

it is XML

but not document-like (not asTEI)

and not database like (not nice for XQUERY)

it is graph-like

PROCESSING LAF

eXist (>30min loading time, simple queries >60min)

indexes needed: but which ones

tried POIO (>60min loading time, needs >20GB RAM)

straightforward object oriented in Python

scripting language overhead

http://media.cidles.eu/poio/graf-python/

LAF-FABRIC

LAF-Fabric

loads in a few seconds

executes in a few seconds

on a laptop

can run

in a Terminal

as an IPython notebook

also Python

uses C-like arrays

http://laf-fabric.readthedocs.org/en/latest/

http://ipython.org/

gender notebook

http://nbviewer.ipython.org/github/dirkroorda/laf-fabric/blob/master/notebooks/gender.ipynb

COOCCURRENCES

1 Common Nouns

2 Proper Nouns

Nodes are books

Edges are cooccurrences of lexemes (1 or 2)

WEIGHTED EDGES

S(lex): number of books containing lex

C(b1, b2): intersection of lexemes of b1 and b2

L(b1, b2): union of lexemes of b1 and b2

cooccurrences notebook

Common Nouns

no weight

Common Nouns

with weight

Proper Nouns

no weight

Proper Nouns

with weight

DATA-DRIVEN THEOLOGYm.naaijer@vu.nl

g.j.kalkman@vu.nl

dirk.roorda@dans.knaw.nl

Thank You

mailto:m.naaijer@vu.nl

mailto:g.j.kalkman@vu.nl

mailto:dirk.roorda@dans.knaw.nl

Data Analysis in the Hebrew Bible

Education

Hebrew bible new testament

Berachah Bible Institute Hebrew Grammar I

Berachah Bible Institute Hebrew Grammar I Chapter 10: Hebrew Construct Chain

accents in hebrew bible

TANAKH (HEBREW BIBLE)

Berachah Bible Institute Hebrew Grammar I Chapter 7: Hebrew Adjectives

The Hebrew Bible (Booklet)

METHODOLOGY, SPEECH, SOCIETY The Hebrew Bible...The Hebrew Bible Yehoshua Gitay . Methodology, Speech, Society – The Hebrew Bible Published by SUN MeDIA Stellenbosch under the imprint

WHICH HEBREW BIBLE?

Hebrew bible gospel of john

Hebrew Bible - Gospel of John

Holy bible new testament in hebrew

“The Divine Council in the Hebrew Bible” Chapter 2 of: hebrew

THE HEBREW BIBLE IN EUROPE A PRELIMINARY … · THE HEBREW BIBLE IN EUROPE IN THE MIDDLE AGES: A PRELIMINARY TYPOLOGY* DAVID STERN** What did the Hebrew Bible—the book that Jews

Bible Interpretation: Hebrew Scriptures

Hebrew Bible as Data: Laboratory, Sharing, Lessons

The Ancient Israelites and the Hebrew Bible

1. What is the Hebrew Bible?

Berachah Bible Institute Hebrew Grammar I Chapter 12: Introduction to Hebrew Verbs

Hebrew Bible / Old Testament