37
Elhanan Adler [email protected] English language subject headings in the catalog of the National Library of Israel

Elhanan Adler [email protected] English language subject headings in the catalog of the National Library of Israel

Embed Size (px)

Citation preview

Elhanan [email protected]

English language subject headings in the catalog of the National

Library of Israel

Background

2

• Subject retrieval at the NLI has traditionally been based on a classified catalog using a system which is a mixture of Dewey Classification, Scholem Classification (expansion of Dewey for Judaica), UDC and descriptive text (including many synonyms)

• This system is unique to the NLI. Much effort was expended in its upkeep.

• Most Israeli libraries use alphabetic subject headings (either LCSH or Hebrew)

• The stacks of the NLI (closed) are arranged by inventory number. Reading rooms have unique shelving systems.

Examples of the current system

3

001.091.767.1(08) Knowledge (History, description, critical appraisal of intellectual activity in general) - Muslim, Islamic, Arab countries - Collections, congresses

616(01)(09)(P) Human pathology; Clinical medicine; Diseases; Illnesses - Works of historic value; Sources

296.52 Halakha and poseqim before Moshe ben Maimon (Maimonides), Geonic literature and responsa, tshuvot ha-geonim

296.562.22(42) Shabbat - Halakha - Eruv hatserot (eruv of courtyards), eruv tehumin (eruv of boundaries) - England, Great Britain

E114(020.8) Yerushalayim, Jerusalem - History and description - Audiovisual treatment including video cassettes

Example of the classified catalog

4

Not all had supplementary text

5

The major problems

6

• The system is not known or understood outside the Library

• Today, most users of the NLI catalog are remote users

• Other libraries cannot use this data in their cataloging

• The NLI cannot make use of subject data from other libraries

• High maintenance cost

The logical conclusion

7

The NLI should switch to a subject retrieval system understandable to all users, particularly remote users

A widely used system which does not require local development should be used

The system chosen should allow transfer of subject data to and from the NLI

Why LCSH?

8

Most widely used system in Israeli academic libraries

Most widely used system in academic libraries in the English speaking world

Scholars worldwide are familiar with the system

Use of LCSH will enable the NLI to receive and provide usable subject data

Why not Hebrew?

9

• Creation of a Hebrew subject thesaurus for a large research library (even based on existing thesauri) is a major, expensive professional task and would take years

• It is unlikely other Israeli academic libraries would adopt it (another unique system!)

• It would not serve non-Hebrew speaking scholars (the Library has a universal mission)

• But there is still a need for Hebrew subject access – particularly for the non-academic segment of the NLI’s local user community

Isn’t LCSH passé?

10

Calhoun report (2006): “Abandon the attempt to do comprehensive subject analysis manually with LCSH in favor of subject keywords; urge LC to dismantle LCSH”

But LC has decided to continue LCSH with some simplifications: “Following the Calhoun report, LC did an analysis and determined we would continue with pre-coordinated subject strings (and LCSH for the foreseeable future) complemented by social tagging when that becomes feasible in our firewall environment.” (communication from Barbara Tillett, Feb. 2010)

When change does come, presumably there will be LCSH based conversion tools

The NLI can lead Israeli libraries into future systems only if it starts out together with them

How do we get from here to there?

11

Matching NLI records against LC MARC database (to 2006) held by the MALMAD Consortium [after 2006 MALMAD ceased purchasing LC data and joined OCLC]

Matching NLI records against Israel ULI Union List records from LCSH-using libraries

Creating conversion tables from NLI classification to LCSH

80+% of incoming data from major card conversion project (outsourced to Backstage Library Works) contains LCSH

Not uniform but…“Quick and dirty” with ongoing cleanup procedures

12

Example of conversion table$$a296(07) = 650 $$aJudaism$$xStudy and teaching$$a296.01 = 650 $$aJudaism$$xBibliography$$a296.01(020.8) = 650 $$aJudaism$$xBibliography @@ 655 $$aAudio-visual$$a296.01(04) = 650 $$aJudaism$$xBibliography$$a296.01(05) = 650 $$aJudaism$$xBibliography$$vPeriodicals$$a296.01(06) = 650 $$aJudaism$$xBibliography$$xSocieties, etc.$$a296.01(07) = 650 $$aJudaism$$xBibliography$$xStudy and teaching$$a296.01(08) = 650 $$aJudaism$$xBibliography$$a296.01(09) = 650 $$aJudaism$$xBibliography$$xHistory$$a296.012[100] = 650 $$aJewish philosophy$$vBibliography @@ 650 $$aJewish philosophers$$vBibliography$$a296.012[400] = 650 $$aJewish linguists$$vBibliography$$a296.012[530] = 650 $$aJewish scientists$$vBibliography @@ 650 $$aPhysicists$$vBibliography$$a296.012[574] = 650 $$aJewish scientists$$vBibliography @@ 650 $$aBiologists$$vBibliography$$a296.012[800] = 650 $$aJewish authors$$vBibliography$$a296.012[933.09] = 650 $$aJewish historians$$vBibliography$$a296.014 = 650 $$aAnonyms and pseudonyms, Hebrew @@ 650 $$aAnonyms and pseudonyms, Jewish

Progress in 1.5 years

13

Jan. 2010 – 50,000 NLI records with LCSHJune 2010 – 500,000 NLI records with LCSH

(over 50% of relevant records)June 2011 – 1,127,000 NLI records with LCSH

(over 90% of relevant records)

Authority controlNLI purchased the retrospective LCSH

authority file (370,000 records) from LCSet up as a separate ALEPH authority file

controlling the NLIs subject headingsGeneral policy – make minimal changes and

only in the NLIs core collection areas (Judaica & Israel)Add xrefs from more common (in Israel) termPrefer Israeli standard termAdd specific terms (kosher and/or ‘kosher-

style’)Quantity of records being processed

precluded going through SACO, etc.14

Special considerations/challenges/problems

15

Personal and corporate subjects

16

NLI already has 600 & 610 subjects – but in all 4 scripts, and usually without subdivision

English/romanized 600s and 610s are added when found but this is not a major goal of the current project

Inconsistent romanization (LC vs. Academy of the Hebrew Language)

Future linking of personal and corporate names in various scripts (a separate project) will improve cross-alphabet searching

Uniform title subjectsFollow standard Israeli usage:

Uniform Headings in Judaica authority filee.g. Talmud Bavli and Talmud Yerushalmirather than Talmud and Talmud Yerushalmi

17

http://aleph3.libnet.ac.il/F/?func=file&file_name=find-b&local_base=uhj10

18

Theological issues - Bible

Bible used for Jewish Bible (Tanakh)Bible. O.T. changed to BibleBible. O.T. Apocrypha changed to

ApocryphaBible.N.T. changed to stand-alone New

TestamentRequires parallel series of subjects :

… in the New Testament… $xNew Testament teachingetc.

19

Other theological issuesB.C./A.D. or B.C.E./C.E.?Personal subjects: Jesus, Mary, Muhammad,

saints, etc.

20

Political issues: Israel/Palestine

LC policy: Israel = 1948/1967 bordersPre-1948 = PalestinePost-1948 area of British mandate (Israel + West

Bank) = PalestinePalestine not to be used for post-Oslo Palestinian

Authority“680 __ |i Here are entered works on the region

on the eastern coast of the Mediterranean Sea that in ancient times was called the Land of Canaan, later the kingdoms of Israel and Judah, and in modern times comprises the entire state of Israel, as well as the various disputed territories. Works on a specific jurisdiction or territory within this region are entered under its respective name”21

However

22

Existing Israeli solutions Follow LCIsrael for all periods and all borders (just as

Persia = Iran)Israel (State) vs. IsraelIsrael and Eretz Israel (NLI choice)

Problems distinguishing between periods and geographic boundariesAntiquitiesGeography, geology, etc.Overlapping periods (20th century)

23

Other geo-political issuesWest Bank or “Judea and Samaria”Jerusalem (Israel)-- Israel -- JerusalemNablus or Shechem (and other multiple

place name issues)

24

Historical subjectsIsrael – History -- War of Independence,

1948-1949not: Israel-Arab War, 1948-1949

Six Day War, 1967not: Israel-Arab War, 1967

Yom Kippur War, 1973not: Israel-Arab War, 1973

Operation Peace for Galilee, 1982-1985not: Lebanon -- History -- Israeli intervention, 1982-1985

25

Uniquely Israeli topicsImmigrant absorptionPre-army yeshivotYeshivot hesderBagrut examinations (Matriculation

examinations) Aliyah/Yeridah (much more than Emigration

and immigration), also:Aliyah, 1st (1882-1903)Aliyah, 2nd (1904-1914)Aliyah, 3rd (1919-1923)Aliyah, 4th (1924-1929)Aliyah, 5th (1929-1939)Aliyah Bet (1933-1948)

26

Extension/expansion of LCSH terms

Judaism – [Moroccan, Tunisian, etc.] rite[subject] (Jewish law)Chronological subdivisions under Hasidism,

Cabala, Jewish philosophy, etc.More groups of Hasidim

27

28

Alexander HasidimAmshinov HasidimBelz HasidimBiala HasidimBobov HasidimBoyan HasidimBratslav HasidimBuhusi HasidimChernobyl HasidimChortkov HasidimGrodzhisk HasidimGur HasidimHusiatyn HasidimIzbica HasidimKaliv HasidimKarlin HasidimKomarno HasidimKozhnitz HasidimKretshniff HasidimKuzmir HasidimLelov HasidimModzitz HasidimMogielnica Hasidim

Munkacs HasidimNadvorna HasidimPiaseczno HasidimPremishlan HasidimPrshischa HasidimRachmastrivka HasidimRadomsk HasidimRuzhin HasidimSadigura HasidimSatmar HasidimSighet HasidimSkulen HasidimSkvira HasidimSlonim HasidimSochaczew HasidimSpinka HasidimStefanesti HasidimToldot Aharon Hasidim (LCSH = Guardian-of-the-Faithful Hasidim)Tolna HasidimTosh HasidimVizhnitz HasidimZanz HasidimZidichov HasidimZvolin Hasidim

Red = LCSHBlack = NLI

Problem of subjects which require Jewish + general subjectJewish … educationJewish scientists + Physicists, etc.

29

Standard 667 (internal) notesChanged to standard Israeli formCross references added by NLIHeading created by NLIHeading changed by NLI

30

Use of Genre/Form headingsPrimarily in conversion from Dewey literature

numbers655_7 with $2localExample:

Polish dramaPolish essaysPolish fictionPolish lettersPolish poetryPolish prose literaturePolish wit and humor

Also: Audio-visual, Israeli themes in literature, Jewish themes in literature

31

Ongoing cleanupCreate initial files of valid main terms ($a)

and subfields (vxyz) from LC LCSH fileAdd terms & subfields as validatedRun test against all 630/650/651 2nd

indicator 0/4 (existing records and new records being added)

Report errors & automatically fix manyGroup of programs and data files

32

33

LCSH options 1 - Retest LCSH records 2 - Run nnl01_fix_lcsh on lcsh_retest.dat creating rep.dat 3 - Edit 082_to_lcsh.dat 4 - Edit lcsh_subfielda.dat 5 - Edit 082_to_lcsh_subdivisions.dat 6 - Run nnl01_lcsh_subfield_clash_retest 9 - Run nnl01_match_082_to_lcsh 11- Edit nnl01_fix_lcsh.dat 12- Edit vxyz_subfields.dat 15- Edit nnl01_lcsh_test.dat 16- Edit israel_eretz_israel.dat 17- Edit nnl01_lcsh_test_ok.dat 19- Run nnl01_drop_lcsh on safety creating rep.dat 22- Run nnl01_reselect_no_lcsh 23- Drop app.dat records from nnl01_no_lcsh.dat 30- Retrieve lcsh.toload from ram1 and process to app.dat 35- Make ram1:lcsh.650 from scratch file safety 40- match nnl01_no_lcsh.dat to nnl6xx_bib_fields.dat 41- match nnl01_no_lcsh.dat to nnl6xx_bib_fields2.dat 50- retest 650 0/650 4 subfield a against NNL15 (for lcsh_fix_a routine) 51- Edit lcsh_fix_a.dat 52- Run lcsh_fix_a 99- Pretest datafile ram2 before adding to 082_to_lcsh.dat

34

Correct a to b$$aBible.$$pApocrypha. = $$aApocrypha.$$aBible.$$pN.T.$$ = $$aNew Testament$$$$aBible.$$pN.T.$$p = $$aNew Testament.$$p$$aBible.$$pO.T.$$ = $$aBible$$$$aBible.$$pO.T.$$p = $$aBible.$$p$$aYehuda weShomeron$$ = $$aWest Bank$$$$xAddresses, essays, lectures = ~$$zYehuda weShomeron = $$zWest Bank

correct $$aIsrael/$$aEretz Israel based on string in subfieldE $$xAntiquities…E $$y18E $$y190…E $$yTalmudicE $$yWorld WarI $$y195I $$y196I $$y197…

35

Example – test of subfields:000385128 Unknown subfield: $$v Bibligraphy at: 000385128 650 4 L $$aJewish periodicals$$xIndexes$$vBibligraphy.000273103 Unknown subfield: $$v Bibligrpahy at: 000273103 650 0 L $$aCommunism$$xPeriodicals$$vBibligrpahy.002808570 Unknown subfield: $$v Bibliographie at: 002808570 650 0 L $$aJuifs$$xHistoire$$vBibliographie$$vCatalogs.002617234 Unknown subfield: $$v Biografias at: 002617234 650 4 L $$aEconomistas chilenos$$vBiografias.002916818 Unknown subfield: $$v Biyografi at: 002916818 650 4 L $$aIslam$$vBiyografi.002997635 Unknown subfield: $$v Biyografi at: 002997635 651 4 L $$aTurkey$$vBiyografi.002204402 Unknown subfield: $$v Bubliography at: 002204402 650 0 L $$aGeology$$zSoviet Union$$vBubliography.002858437 Unknown subfield: $$v Chrestomathies and readers at: 002858437 650 4 L $$aRussian language$$vChrestomathies and readers.002975569 Unknown subfield: $$v Chrestomathies and readers at: 002975569 650 4 L $$aGreek language$$vChrestomathies and readers.

Ongoing & future plansSubjects are being sent to Worldcat as 650_4Continued mass loading from conversion projectsConversion of subject headings from NLI special

collections (manuscripts, music, Scholem, others) to LCSH

Migrate RAMBI to LCSHBegin work on Hebrew language search to LCSH

headingsUltimately provide single search interface (Hebrew

and English) to all NLI bibliographic dataIntroduction of LC classification for open shelf

collections

36

תודה רבהThank you

37