Intermediate and Application Services

eSciDoc services

Natasa Bulatovic, [email protected]

Max Planck Digital Library (MPDL), Amalienstrasse 33, 80799 Muenchen (DE), http://mpdl.mpg.de/

June 2009

eSciDoc Infrastructure

Services

• manage diverse content via generalized resources (Items, Contexts, Containers)
• enable versioning, persistent identification, searching, statistics, authorization
• easy to use by developers, end users and non-human service requestors

Solutions

• visualize and reuse services and add value (e.g. data mash-ups, specific views)
• enable different workflows
• work with specialized resources, such as scanned books, language resources, transcriptions, translations, publication items, images and image albums

Core Services - Highlights

Open formats

Content resources are stored internally as Fedora FOXML documents, so the structure and description of each resource are readable by any human or machine user.

Contextual information

Content resources are associated with a content model description that defines what the data actually represents, for example a publication, a transcription of a scanned book page, a language resource or a face description.

Provenance

Each resource is associated with PREMIS metadata that holds richer information on the version history and events in addition to the natively provided Fedora audit trail.

Metadata

A metadata schema can be chosen separately for each content model, in accordance with user needs. As a minimum, every resource has a Dublin Core metadata record associated with it. Metadata validation can be enforced against different levels of quality and different schemata by means of the validation service and rules based on Schematron. Technical metadata extraction based on the JHove/JStore framework is applied to a selected set of binary content.

Persistent identification

Users may decide what to identify (resource or resource version) and when to assign the persistent identifier (during creation, updates, or publishing).
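The two choices described above (what to identify, and when) can be pictured as a small policy object. This is only an illustrative sketch: `PidPolicy`, `PidTarget` and `PidEvent` are hypothetical names, not part of the eSciDoc API.

```python
from dataclasses import dataclass
from enum import Enum


class PidTarget(Enum):
    """What receives the persistent identifier (hypothetical names)."""
    RESOURCE = "resource"
    VERSION = "version"


class PidEvent(Enum):
    """Lifecycle events at which a PID may be assigned (hypothetical names)."""
    CREATE = "create"
    UPDATE = "update"
    PUBLISH = "publish"


@dataclass(frozen=True)
class PidPolicy:
    """Hypothetical policy: which object gets a PID and on which event."""
    target: PidTarget
    assign_on: PidEvent

    def should_assign(self, event: PidEvent, already_has_pid: bool) -> bool:
        # Assign only on the configured event, and never twice for one object.
        return event is self.assign_on and not already_has_pid


# Example: identify each version, and do so when it is published.
policy = PidPolicy(target=PidTarget.VERSION, assign_on=PidEvent.PUBLISH)
```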

Security

Local or distributed authentication and authorization system. The eSciDoc AA services allow roles and user groups to be defined and privileges to be granted (based on XACML). Fine-grained access control to resources and service interfaces can be established for roles ranging from Depositor, Moderator and Metadata Editor to Collaborator and Audience, plus a set of administrative roles. New roles can be added and customized in accordance with user needs and workflow requirements.
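As a rough illustration of role-based access decisions, the sketch below uses a plain Python table in place of XACML policies. The role names come from the text above; the action names and the mapping of roles to actions are assumptions made for the example and do not reflect the actual eSciDoc policies.

```python
# Hypothetical role-to-action table; real eSciDoc policies are XACML documents.
ROLE_ACTIONS = {
    "Depositor": {"create", "update", "submit"},
    "Moderator": {"update", "release", "withdraw"},
    "Metadata Editor": {"update-metadata"},
    "Collaborator": {"retrieve", "update"},
    "Audience": {"retrieve"},
}


def is_permitted(roles, action):
    """Permit if any of the user's roles grants the action; deny otherwise."""
    return any(action in ROLE_ACTIONS.get(role, set()) for role in roles)
```

Adding a new role in this sketch means adding one entry to the table, which mirrors the idea that roles can be added and customized without changing the decision logic.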

eSciDoc Project

eSciDoc is a joint project between the Max Planck Society (MPS) and Fachinformationszentrum (FIZ) Karlsruhe, funded by the Bundesministerium für Bildung und Forschung (BMBF, the German Federal Ministry of Education and Research).

The aim of the project is to develop a reusable service infrastructure and various discipline-specific end-user solutions for scientific communication and publication in research organizations. This includes guaranteeing long-term open access to multiple kinds of scholarly material (research data and publications) and supporting innovative ways of scientific and interdisciplinary collaboration.

For more information, visit http://escidoc.org and http://colab.mpdl.mpg.de/mediawiki/Portal:ESciDoc

Cone (Control of Named Entities) service

Cone is used to manage controlled vocabularies of persons, journals, subjects (DDC), languages, MIME types and CC licenses. Data are offered in several formats, such as XML/RDF, HTML, JSON and a simple text format.
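To illustrate offering one vocabulary entry in more than one format, here is a minimal sketch covering JSON and a simple text format. The entry fields and the `render` function are hypothetical and are not the Cone data model or interface.

```python
import json


def render(entry, fmt):
    """Serialize a vocabulary entry (a flat dict) in the requested format."""
    if fmt == "json":
        return json.dumps(entry, sort_keys=True)
    if fmt == "text":
        # One "key: value" pair per line, in a stable (sorted) order.
        return "\n".join(f"{key}: {value}" for key, value in sorted(entry.items()))
    raise ValueError(f"unsupported format: {fmt}")


# Hypothetical entry from a languages vocabulary.
language = {"id": "en", "name": "English"}
```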

Validation service

The validation service defines validation points and validation rules (Schematron) for content resources. Resources are validated during creation, metadata editing or publishing (depending on the workflow). Validation rules can be customized for a particular administrative context, metadata profile and publication workflow.
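The idea of validation points with their own rule sets can be sketched as follows. Plain Python predicates stand in for Schematron assertions here; the rule texts, field names and validation point names are assumptions chosen for illustration.

```python
# Each validation point maps to (message, check) pairs. The checks are plain
# Python predicates standing in for Schematron assertions (hypothetical rules).
RULES = {
    "create": [
        ("title is required", lambda md: bool(md.get("title"))),
    ],
    "publish": [
        ("title is required", lambda md: bool(md.get("title"))),
        ("at least one creator is required", lambda md: len(md.get("creators", [])) > 0),
        ("date is required", lambda md: bool(md.get("date"))),
    ],
}


def validate(metadata, point):
    """Return the messages of all rules that fail at the given validation point."""
    return [message for message, check in RULES.get(point, []) if not check(metadata)]
```

A record can therefore pass at creation time and still be rejected at publishing, where the stricter rule set applies.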

Data acquisition (DaaS) service

DaaS harvests data from external sources (e.g. arXiv, PubMed Central, SPIRES, BioMed Central, eSciDoc) via the unAPI interface. The results can then be forwarded to the transformation service for processing and transformation into, for example, a publication item.
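For orientation, the sketch below builds unAPI request URLs. Under the unAPI specification, a request with only `id` asks which formats are available for that object, and a request with `id` and `format` asks for the object in that format. The endpoint URL and the identifier are placeholders, not real DaaS addresses.

```python
from urllib.parse import urlencode


def unapi_url(endpoint, identifier, fmt=None):
    """Build a unAPI request URL for an identifier and an optional format."""
    params = {"id": identifier}
    if fmt is not None:
        params["format"] = fmt
    return f"{endpoint}?{urlencode(params)}"


# Placeholder endpoint and identifier, for illustration only.
ENDPOINT = "https://example.org/unapi"
list_formats = unapi_url(ENDPOINT, "example:123")
fetch_bibtex = unapi_url(ENDPOINT, "example:123", "bibtex")
```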

Transformation service

The service transforms a resource representation from one format to another. Supported formats include common publication formats (e.g. BibTeX, EndNote), structural metadata formats (e.g. eSciDoc, arXiv, PubMed, TEI), citation styles (e.g. APA, AJP), microformats (COinS) and RSS/Atom feed formats. The service is used by the Data acquisition service, the Search&Export service and the Syndication manager, and also by the Depositing service (based on the SWORD interface) for the creation of publication items.
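One common way to organize such a service is a registry keyed by (source format, target format). The sketch below shows that pattern with a single toy transformation to BibTeX. The registry, the decorator and the `dict` input shape are assumptions for illustration, not the eSciDoc implementation.

```python
# Registry mapping (source format, target format) to a transformation function.
TRANSFORMERS = {}


def transformer(source, target):
    """Decorator that registers a function for one source/target format pair."""
    def register(func):
        TRANSFORMERS[(source, target)] = func
        return func
    return register


@transformer("dict", "bibtex")
def dict_to_bibtex(item):
    # Toy conversion: expects {"type": ..., "key": ..., "fields": {...}}.
    fields = ",\n".join(f"  {name} = {{{value}}}" for name, value in item["fields"].items())
    return f"@{item['type']}{{{item['key']},\n{fields}\n}}"


def transform(data, source, target):
    """Look up and apply the transformation registered for the format pair."""
    try:
        func = TRANSFORMERS[(source, target)]
    except KeyError:
        raise ValueError(f"no transformation from {source} to {target}") from None
    return func(data)
```

With this layout, callers such as a harvesting or export component only need to name the two formats; supporting a new format means registering one more function.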

Search&Export service

A service that extends the eSciDoc core search service: it calls the Transformation service to deliver search results in the desired output format (eSciDoc XML, a citation style format, or a search-based Atom/RSS feed).

Syndication service

Generates three types of publication item feeds in several Atom/RSS formats: recently published resources in the repository, published resources for an organization, and published resources matching an arbitrary search request.
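A minimal sketch of producing an Atom feed from a list of published items, using only the Python standard library. The item fields and identifiers are hypothetical, and the feed is deliberately bare: a fully valid Atom feed would also need author information, among other things.

```python
import xml.etree.ElementTree as ET

ATOM = "http://www.w3.org/2005/Atom"
ET.register_namespace("", ATOM)  # serialize Atom as the default namespace


def build_feed(feed_id, title, updated, items):
    """Return an Atom feed as a string; each item needs id, title and updated."""
    feed = ET.Element(f"{{{ATOM}}}feed")
    for tag, text in (("id", feed_id), ("title", title), ("updated", updated)):
        ET.SubElement(feed, f"{{{ATOM}}}{tag}").text = text
    for item in items:
        entry = ET.SubElement(feed, f"{{{ATOM}}}entry")
        for tag in ("id", "title", "updated"):
            ET.SubElement(entry, f"{{{ATOM}}}{tag}").text = item[tag]
    return ET.tostring(feed, encoding="unicode")


# Hypothetical items, for illustration only.
recent_items = [
    {"id": "urn:example:item:1", "title": "First item", "updated": "2009-06-01T00:00:00Z"},
    {"id": "urn:example:item:2", "title": "Second item", "updated": "2009-06-02T00:00:00Z"},
]
feed_xml = build_feed("urn:example:feed:recent", "Recently published",
                      "2009-06-02T00:00:00Z", recent_items)
```

The three feed types differ only in how the item list is selected (repository-wide, per organization, or by search request); the serialization step stays the same.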

Citation style formatting service

Enables a citation style (such as APA or AJP) to be defined in XML, and resource metadata to be transformed into that citation style in the desired output format, such as RTF, PDF, HTML, ODT or an HTML snippet.

Depositing service

Enables the deposit of publication items via the SWORD interface. Supports the standard eSciDoc authorization mechanisms and PubMan workflows. Uses the transformation service to transform foreign formats into publication items.

See also http://colab.mpdl.mpg.de/mediawiki/ESciDoc_SOA_AtGlance