Logging in Digital Libraries

  • Published on
    14-Jan-2016

  • View
    35

  • Download
    0

Embed Size (px)

DESCRIPTION

Logging in Digital Libraries. Last week . Introduction to quality indicators and the way in which these are formalized and made computable, according to one view. Making a digital library as good as it cam be requires understanding what it is and how it is being used. - PowerPoint PPT Presentation

Transcript

  • Logging in Digital Libraries

  • Last week .Introduction to quality indicators and the way in which these are formalized and made computable, according to one view.Making a digital library as good as it cam be requires understanding what it is and how it is being used.Information comes from logs

  • Another aspectA category of quality indicator that comes from seeing what happens when users visit the libraryAn important tool -- the logsAll web based systems have logs of interaction from the outside world to the web serverNot specifically designed for digital librariesWe will look at a proposed standard for digital library specific log analysis

  • This workDone by Marcos Andr Gonalves, Ming Luo, Rao Shen, Mir Farooq Ali, and Edward A. Fox -Virginia TechandLillian N. Cassel, Filip Jagodzinski - Villanova

  • MotivationLog analysisSource of information about:How patrons really use DL servicesHow systems behave while supporting user information seeking activitiesExamples: patternsUsed to:EvaluateEnhance servicesHelp design user interfacesBetter allocation of resourcesCommon practice in the web settingSupported by web servers, proxy caching

  • Motivation (cont.)DLs differ from the webDL collections are explicitly organized, described, managed, and preservedUsers with more specific tasks and needsDigital objects and collections more structuredDL Logging should offer much richer information and opportunitiesTradeoff : user privacyCurrent DL logs Differences in formats and recorded informationProblems:Lack of interoperabilityNo reuse of analysis toolsComparability of log analysis results

  • Related WorkProblems with existing DL logsIncompatibilityIncompletenessComplexity of analysisLack of organizationAmbiguityInflexibilityVerboseness

    --- Generally, lack of a global view of need for understanding how the DL is or is not serving its users

  • The Digital Library Standardized Log FormatComprehensiveReflective of the actual DL system behaviorEasily readablePreciseFlexible to accommodate varying systemsSuccinct enough to be implementedConcern: user privacy

  • DL Standarized Log Format DesignCapture high level user and system behaviorsHierarchical organizationEncapsulated in transactionsInteractions between the users and the system or among the system componentsLog format designed to record a number of different kinds of transactionsExamples:Login to the system Submission of search queryBrowsing a result listRecording of a user failure

  • Log format design (cont.)DesignReflective of DL functionBased on the 5S formal theoryUnifying, mathematical theory to describe formally the semantics of DL componentsGuidance for how to organize the log structure

  • Log design and 5S

    5SDefinitionUse in Log DesignStreamsRepresent static and dynamic multimedia contentStructuresLabeled directed graphs; provide organization within the DLSpacesSets, properties and operations on those sets Scenariossequences of events that modify states of a computation in order to accomplish some functional requirement.Societies Sets of communities and relationships among them

  • Log design and 5S

    5SDefinitionUse in Log DesignStreamsRepresent static and dynamic multimedia contentTemporal events, types of digital objectsStructuresLabeled directed graphs; provide organization within the DLSpacesSets, properties and operations on those setsScenariossequences of events that modify states of a computation in order to accomplish some functional requirement.Societies Sets of communities and relationships among them

  • Log design and 5S

    5SDefinitionUse in Log DesignStreamsRepresent static and dynamic multimedia contentTemporal events, types of digital objectsStructuresLabeled directed graphs; provide organization within the DLStructured documents and metadata; structured searches, collection, metadata catalog; hypertext, classification schemeSpacesSets, properties and operations on those setsScenariossequences of events that modify states of a computation in order to accomplish some functional requirement.Societies Sets of communities and relationships among them

  • Log design and 5S

    5SDefinitionUse in Log DesignStreamsRepresent static and dynamic multimedia contentTemporal events, types of digital objectsStructuresLabeled directed graphs; provide organization within the DLStructured documents and metadata; structured searches, collection, metadata catalog; hypertext, classification schemeSpacesSets, properties and operations on those setsRetrieval mode, Presentation information, Scenariossequences of events that modify states of a computation in order to accomplish some functional requirement.Societies Sets of communities and relationships among them

  • Log design and 5S

    5SDefinitionUse in Log DesignStreamsRepresent static and dynamic multimedia contentTemporal events, types of digital objectsStructuresLabeled directed graphs; provide organization within the DLStructured documents and metadata; structured searches, collection, metadata catalog; hypertext, classification schemeSpacesSets, properties and operations on those setsRetrieval mode, Presentation information, Scenariossequences of events that modify states of a computation in order to accomplish some functional requirement.Organization of the user and system actions into transactions, statements, events and actions; DL services as sets of scenarios.Societies Sets of communities and relationships among them

  • Log design and 5S

    5SDefinitionUse in Log DesignStreamsRepresent static and dynamic multimedia contentTemporal events, types of digital objectsStructuresLabeled directed graphs; provide organization within the DLStructured documents and metadata; structured searches, collection, metadata catalog; hypertext, classification schemeSpacesSets, properties and operations on those setsRetrieval mode, Presentation information, Scenariossequences of events that modify states of a computation in order to accomplish some functional requirement.Organization of the user and system actions into transactions, statements, events and actions; DL services as sets of scenarios.Societies Sets of communities and relationships among themUser information

  • DL Log Format SpecificationOrganization in structured logical way XML- XML SchemaStandard syntaxGuarantee quality, correctnessRich set of basic types help standardizationAbundance of XML parsers helps construction of analysis tools

  • Log Format - StructureTop Level HierarchyLogLog EntryTransactionSessionIdMachineInfoTimeStampStatement. . .. . .

  • DL Log Format - Structure (cont)Decomposition of statement into different typesAdmInfoStatementSessionInfoEventErrorInfoHelpInfoRegisterInfo

  • Log Format - Structure (cont.)Decomposition of event

    AdmInfoStatementSessionInfoEventErrorInfoHelpInfoRegisterInfoActionStatusInfoSearchBrowseStoreSysInfoUpdate

  • DL Log Format Structure (cont)Search attributesSearch QueryString TimeFrame PresentationInfo SearchByFormat NumberOfResultsSortByCutOffCollectionCatalog

  • DL Log Tool Implementation

    Digital LibraryUser LayerXMLLogManagerwriteLogEntry(parameters)c1XMLLogDatac2Log middlewareSystemeventstorelogData(parameters)UsereventAnalysistoolgetLogData(parameters)logDataAnalysisrequestresultDLpatronDLanalyst

  • Log Tool example: loginExample 1: Login to the system

    < TransId = "3452"> 987654usr3 Start mhabib 2002-05-31T20:10:55.000-05:00 128.173.244.56 8000

  • Log tool example: query a collectionExample 2: query all Dirline records about low back pain.. Dirline CommunityRecord SearchByAnyParts NonPersistant low back pain 2002-05-31T20:11:07.000-05:00 2002-05-31T20:11:09.000-05:00 List ByRank 217 20 ...Ref to GMT

  • Log Analyzer OverviewXML LogLog Data Parser/ Error Checker Routine module

    987654usr3 2002-05-31T20:10:55.0-05.00 low back pain 5114

    Step 1: ExtractBrowseQuery StringUser ID Search ErrorDoc ID Step 3: Populate Databases, Increment Global Variables, etc. Step 4: Create Final Statistics module module moduleStep 2: Parse XML; Send Log LineFinal Report/StatisticsDatabasesStep 1: Extract Log Data, SAX parser can be employed here

    Step 2: Parse Log Data and check for log errors (eg. server stalls and incomplete log line is output to XML Log)

    Step 3: The different modules populate various databases and/or increment the appropriate counters. Each module can adjust various databases, as for example the module, which increments the appropriate month hit counter and records that the user 987654usr3 made a request at time T.

    Step 4: Aggregate data and output final statistics; all databases are made available

  • Summarizing this class and last weekLooked at a view of DL qualityBy examining the components of the DL independent of usage (explicit computation)By looking at the view of the DL obtained by a visitor (log analysis)Each is a view that has been widely promulgated and well received, but is not an industry standard

  • Next weekJoseph Lucia, Director of Villanovas Falvey Library will talk about what is happening in this very innovative and significant digital library.Come prepared with questions and ready to comment and discuss what he presents.

  • ReferencesGonalves, M. A., Luo, M., Ali, M. F., and Fox, E. A. An XML Log Standard and Tool for Digital Library Logging Analysis In Research and Advanced Technology for Digital Libraries, 6th European Conference, ECDL 2002, Rome, Italy, September 16-18, 2002, Proceedings

    Klas, C., et al "A Logging Scheme for Comparative Digital Library Evaluation Research and Advanced Technology for Digital Libraries, 10th European Conference, ECDL 2006, Alicante,Spain, September 17-29, 2006, Proceedings