Digital Text and Data Processing Week 7

Challenges

□ POS: total counts: normalise by token count

□ Unicode support

□ Synchronic and diachronic variation (dialects and historical changes)

□ Not knowing beforehand what is possible / relevant
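
The first two challenges can be illustrated with a minimal sketch. It assumes the tagger output is already available as a list of (token, tag) pairs; the sample data and the function names `normalise_token` and `relative_pos_frequencies` are hypothetical and not part of the course material. Dividing each tag count by the token count makes texts of different lengths comparable, and Unicode normalisation keeps visually identical strings from being counted as different words.

```python
import unicodedata
from collections import Counter


def normalise_token(token: str) -> str:
    """Return a canonical form so visually identical strings compare equal."""
    # NFC composes a base letter plus combining mark into one code point;
    # casefold() then removes case distinctions.
    return unicodedata.normalize("NFC", token).casefold()


def relative_pos_frequencies(tagged_tokens):
    """Map each POS tag to its share of all tokens in the text."""
    total = len(tagged_tokens)
    if total == 0:
        return {}
    counts = Counter(tag for _token, tag in tagged_tokens)
    return {tag: count / total for tag, count in counts.items()}


# Hypothetical tagger output for two texts of different length.
short_text = [("Whales", "NNS"), ("swim", "VBP")]
long_text = [("The", "DT"), ("old", "JJ"), ("whale", "NN"), ("swims", "VBZ")]

print(relative_pos_frequencies(short_text))
print(relative_pos_frequencies(long_text))

# "e" + combining acute accent versus the precomposed character.
print(normalise_token("Cafe\u0301") == normalise_token("caf\u00e9"))
```

Raw counts would make the longer text look richer in every category; the relative frequencies can be compared directly.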

□ Digital humanities methodology often demands experimentation

□ The method is mostly inductive (cf. the deductive approach advocated by Stanley Fish)

□ When experiments are not motivated, there is a risk that the research simply exposes "a correlation between a formal feature the computer program just happened to uncover and a significance that has simply been declared, not argued for".

□ Also see Chris Anderson, The End of Theory

http://nyti.ms/wjEPm7

□ The DH methodology is partly inductive and partly deductive

□ Computational analyses often lead to unexpected results

□ Techniques can help scholars to generate hypotheses

Phases

□ Data acquisition

□ Clean up and enrichment (removal of stopwords, POS, lemmatisation)

□ Quantification

□ Data analysis
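
As a rough sketch of how these phases chain together, the example below runs a hard-coded sentence through tokenisation, stopword removal, counting and a simple ranking. Everything in it is an assumption made for illustration: the sample sentence stands in for data acquisition, the stopword list is deliberately tiny, and POS tagging and lemmatisation are left out because they need an external tagger.

```python
import re
from collections import Counter

# Illustrative stopword list; a real project would use a fuller,
# language-specific list.
STOPWORDS = {"the", "a", "an", "and", "of", "in", "to", "is", "was"}


def tokenise(text):
    """Split text into lowercase word tokens made of letters only."""
    return re.findall(r"[^\W\d_]+", text.lower())


def clean(tokens):
    """Clean-up phase: drop stopwords."""
    return [token for token in tokens if token not in STOPWORDS]


def quantify(tokens):
    """Quantification phase: count how often each token occurs."""
    return Counter(tokens)


def analyse(counts, n=3):
    """Analysis phase: return the n most frequent tokens with their counts."""
    return counts.most_common(n)


# Stand-in for the data acquisition phase.
raw = "The whale and the sea: a tale of the whale."

counts = quantify(clean(tokenise(raw)))
print(analyse(counts, 2))
```

Keeping each phase in its own function makes it easy to swap one step, for example replacing the stopword list, without touching the others.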

Data acquisition

□ Page images and machine-readable text (removal of typography and of paratext)

□ Low quality of OCR; see, e.g., Laura Mandell, How to Read a Literary Visualisation

□ Motivation of the choice of a specific edition

Text mining (TM) on recent scientific articles

□ Text2Genome

□ OSCAR

□ NeuroElectro

□ Peter Murray-Rust’s work on Chemical Compounds

Licences

□ The right to read does not imply the right to mine

□ Study commissioned by the EC, led by Prof. Ian Hargreaves

Google Books Settlement

Article 7.2 of the Settlement:

□ Creation of a “Research Corpus”;

□ Solely for “non-consumptive” reading, or research “in which computational analysis is performed on one or more Books, but not research in which a researcher reads or displays substantial portions of a Book to understand the intellectual content presented within the Book”

Database and Narrative

□ Lev Manovich, The Language of New Media

□ Textual narrative: linearity and reliance on typography

□ Database: random access, non-linear, no form

The Semantic Web

□ Envisaged by Tim Berners-Lee as “a web of data that can be processed directly and indirectly by machines”

□ RDF triples

□ Example:

Subject: “Book-URI” Predicate: “hasISBN” Object: “978-0-252-07829-0”
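
The triple above can be modelled in a few lines to show why the subject-predicate-object shape is easy for machines to process. This is a plain-Python sketch, not an RDF library: the `Triple` class, the `match` helper and the second triple (with its `hasTitle` predicate) are hypothetical, and "Book-URI" remains a placeholder for a real URI.

```python
from typing import List, NamedTuple, Optional


class Triple(NamedTuple):
    """One RDF-style statement: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str


store = [
    # The example from the slide; "Book-URI" is a placeholder.
    Triple("Book-URI", "hasISBN", "978-0-252-07829-0"),
    # Hypothetical second statement about the same subject.
    Triple("Book-URI", "hasTitle", "Example Title"),
]


def match(triples: List[Triple],
          subject: Optional[str] = None,
          predicate: Optional[str] = None,
          obj: Optional[str] = None) -> List[Triple]:
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (subject is None or t.subject == subject)
        and (predicate is None or t.predicate == predicate)
        and (obj is None or t.obj == obj)
    ]


# "Which ISBN does Book-URI have?"
for triple in match(store, subject="Book-URI", predicate="hasISBN"):
    print(triple.obj)
```

Pattern matching with wildcards is, in much simplified form, what a SPARQL query does against a triple store.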

DBpedia
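
DBpedia can be queried with SPARQL over HTTP. The sketch below only builds the request URL and does not send it, so it runs without a network connection. It assumes the public endpoint at https://dbpedia.org/sparql and the standard `query` parameter of the SPARQL protocol; the query itself and the helper name `build_sparql_url` are illustrative choices, not taken from the slides.

```python
from urllib.parse import urlencode

# Assumed public DBpedia endpoint.
ENDPOINT = "https://dbpedia.org/sparql"

# Illustrative query: the English label of one DBpedia resource.
QUERY = """
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?label WHERE {
  <http://dbpedia.org/resource/Tim_Berners-Lee> rdfs:label ?label .
  FILTER (lang(?label) = "en")
}
"""


def build_sparql_url(endpoint: str, query: str) -> str:
    """Return a GET URL that carries the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = build_sparql_url(ENDPOINT, QUERY)
print(url)
```

The resulting URL could be fetched with any HTTP client; the result format is usually negotiated through the `Accept` request header.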

Nano-Publications

Semantic Publishing

STCN SPARQL Endpoint
