API « Jakoblog — Das Weblog von Jakob VoÃŸ

Blog
About

Ãœberlegungen zur Modellierung von Bibliotheksdienstleistungen

13. März 2013 um 12:44 3 Kommentare

P.S: Mit BeschrÃ¤nkung auf Bibliotheksdienstleistungen, die sich auf Dokumente beziehen, habe ich inzwischen mit der Spezifikation einer Document Service Ontology begonnen.

Im Rahmen der Entwicklung der Patrons Account Information API (PAIA) und der Document Availability Information API (DAIA) bin ich wiederholt auf die Frage gestoÃŸen, was fÃ¼r Arten von Dienstleistungen Bibliotheken eigentlich anbieten. Auf meine Frage bei Libraries.StackExchange antwortete Adrian Pohl, der sich in seiner Masterarbeit fÃ¼r lobid.org mit dem Thema beschÃ¤ftigt hat.

Meine Fragestellung ist natÃ¼rlich einseitig und zwar ausgerichtet auf die mÃ¶gliche Verwendung in APIs, mit denen Nutzer und Programme verschiedene Bibliotheksdienstleistungen abrufen oder anfordern kÃ¶nnen sollen. Das bekannteste Beispiel ist die Dienstleistung der Ausleihe. PAIA und DAIA wurden primÃ¤r dafÃ¼r entworfen, damit Dienste wie die Ausleihe von BÃ¼chern aus Bibliotheken offen und maschinenlesbar mÃ¶glich ist. Offen heisst, dass praktisch jeder oder zumindest alle Bibliotheksnutzer direkt per gut dokumentierter API auf die Dienste zugreifen kÃ¶nnen statt nur Ã¼ber bestimmte BenutzeroberflÃ¤chen.

Dieser Blogartikel ist erst einmal ein lautes Denken, bei dem ich Adrians Klassifikation auf Eignung fÃ¼r meine Fragestellung untersuche. Wie jede Klassifikation sind sowohl die von Adrian bereitgestellte Liste als auch meine Kommentare also nicht neutral sondern nur eine mÃ¶gliche Sichtweise.

1. Webbased services without direct personnel involvement:

OPAC and other research services

List of recent acquisitions

Online Tutorials

Place an order

Renewal of a loan

Online acquisition request service

Digitization Service

Consulting Service

…

Ein Katalog ist keine Dienstleistung die angefordert oder reserviert werden mÃ¼sste, ebenso sieht es fÃ¼r andere Informationen auf der Webseite der Bibliothek aus. Die Punkte „place an order“ und „renewal of a loan“ gehÃ¶ren zur Dienstleistung Ausleihe (bzw. Ansicht bei PrÃ¤senzexemplaren). Zu klÃ¤ren ist noch, wie sich AnschaffungsvorschlÃ¤ge einordnen lassen. Die Digitalisierung von BibliotheksbestÃ¤nden (z.B. DigiWunschbuch ist vermutlich eine eigene Dienstleistung, die zu den bestehenden DAIA-Services hinzukommen kÃ¶nnte. Wieso „Consulting Service“ hier eingeordnet ist, verstehe ich nicht.

2. Webbased services with direct personnel involvement:

Chat reference/consulting

Ist ein Chat eine Dienstleistung? Wie sieht es mit anderen Kommunikationswegen (Telefon, GesprÃ¤ch vor Ort…) aus)? Ich wÃ¼rde das alles unter Auskunft zusammenfassen.

3. In-house services with direct personnel involvement

In-house guided tours and courses

Digitization Service

Microfiche readers

Reference Desk

Loan Desk

Registration Desk

Consulting Service

3D printer

Makerspace

…

Schulungen und FÃ¼hrungen sind sicher eine eigene Art von Dienstleistung, allerdings gibt es hier verschiedene Arten. Ausleihtheke und Anmeldung sind keine eigene Dienstleistung sondern Teil anderer Services. Bei den Arbeitsmitteln wie Mirofiche-Leser, 3D-Drucker etc. kommt es darauf an, ob diese direkt nutzbar sind oder reserviert werden mÃ¼ssen und ob sie nur vor Ort nutzbar oder auch ausleihbar sind.

4. In-house services without direct personnel involvement

Reading Room

Study room

Wifi

Computers with Internet Access

Photocopier

Scanner for use by visitors

3D printer

Computer for self-loan

Cafeteria

Drinks machine

…

ArbeitsrÃ¤ume und ArbeitsplÃ¤tze, Kopierer, Scanner, Kaffeeautomat etc. gehÃ¶ren zu den oben genannten Arbeitsmitteln. Ich denke dass sich DAIA einfach erweitern lÃ¤sst, so dass Nutzer per API abrufen kÃ¶nnen, ob ein Arbeitsmittel frei ist und PAIA erweitern lÃ¤sst, so dass Nutzer Arbeitsmittel per API reservieren kÃ¶nnen.

Tags: API, DAIA, PAIA 3 Kommentare

Access to library accounts for better user experience

8. Februar 2013 um 11:10 5 Kommentare

I just stumbled upon ReadersFirst, a coalition of (public) libraries that call for a better user experience for library patrons, especially to access e-books. The libraries regret that

the products currently offered by e-content distributors, the middlemen from whom libraries buy e-books, create a fragmented, disjointed and cumbersome user experience.

One of the explicit goals of ReadersFirst is to urge providers of e-content and integrated library systems for systems that allow users to

Place holds, check-out items, view availability, manage fines and receive communications within individual library catalogs or in the venue the library believes will serve them best, without having to visit separate websites.

In a summary of the first ReadersFirst meeting at January 28, the president of Queens Library (NY) is cited with the following request:

The reader should be able to look at their library account and see what they have borrowed regardless of the vendor that supplied the ebook.

This goal matches well with my activity at GBV: as part of a project to implement a mobile library app, I designed an API to access library accounts. The Patrons Account Information API (PAIA) is current being implemented and tested by two independent developers. It will also be used to provide a better user experience in VuFind discovery interfaces.

During the research for PAIA I was surprised by the lack of existing methods to access library patron accounts. Some library systems not even provide an internal API to connect to the loan system – not to speak of a public API that could directly be used by patrons and third parties. The only example I could find was York University Libraries with a simple, XML-based, read-only API. This lack of public APIs to library patron accounts is disappointing, given that its almost ten years after the buzz around Web 2.0, service oriented architecture, and mashups. All all major providers of web applications (Google, Twitter, Facebook, StackExchange, GitHub etc.) support access to user accounts via APIs.

The Patrons Account Information API will hopefully fill this gap with defined methods to place holds and to view checked out items and fines. PAPI is agnostic to specific library systems, aligned with similar APIs as listed above, and designed with RDF in mind (without any need to bother with RDF, apart from the requirement to use URIs as identifiers). Feedback and implementations are very welcome!

Tags: API, PAIA 5 Kommentare

XML Schema vs. Library APIs (OAI-PMH/SRU/unAPI…)

24. Februar 2011 um 18:33 2 Kommentare

Much of our work at GBV library network has to do with record formats and APIs. We harvest or get metadata records in a wide range of formats (with many different interpretations and misconstructions of these formats), convert records to a wide range of formats (with many special request how to interpret this formats), and provide records through various APIs. Some of these APIs allow you to select different record formats, for instance OAI-PMH (first published 2001), SRU (2003), and unAPI (2006). These APIs are based on HTTP for transport and XML for encoding of the records. There are also older APIs and encoding formats like Z39.50 and newer APIs like pure Linked Data and SPARQL for RDF. unAPI also supports non-XML formats, but in this article I will concentrate on XML-based formats.

The basic question (that I deal with since years) is „what exactely is a format and how do you refer to it?“. All three APIs provide a method for listing of all formats that are supported by a particular server. unAPI provides a „list of object formats“. Each format has a „name“, a „type“ (which must be an official Internet media type), and an optional documentation URL („docs“), which may refer to some human-readable documentation, or to an XML Schema (XSD) file. Here are three examples:

<format name="oai_dc" type="application/xml"
  docs="http://www.openarchives.org/OAI/2.0/oai_dc.xsd" 
/>
<format name="pubmed" type="application/xml" 
  docs="http://www.nlm.nih.gov/bsd/licensee/elements_descriptions.html"
/>
<format name="mods" type="application/xml"
  docs="http://www.loc.gov/standards/mods/" 
/>
<format name="marcxml" type="application/xml" 
  docs="http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"
/>

To avoid the uncertainty whether „docs“ references a formal schema or a plain document, there should have been a „schema“ attribute (first problem). To refer to a format in an unAPI request, you use the format’s „name“. In OAI-PMH you refer to a format by its „metadataPrefix“. You can get a list of supported formats with the ListMetadataFormats request. In addition to the „metadataPrefix“ each format has the location of an XML Schema („schema“) and an XML Namespace URI („metadataNamespace“). In theory the latter is dispensable, because each XSD document declares a namespace URI in its „targetNamespace“ attribute: Given a format with a schema that defines namespace „http://example.org/“ like this

<xs:schema targetNamespace="http://example.org/">

I would expect records in this format to use this namespace, at least for the XML root element:

<record xmlns="http://example.org/">

The OAI-PMH specification does not explicitly say that the „metadataNamespace“ must match the namespace in the schema file „schema“. What does it mean if they differ? (second problem).

In SRU a format is known as „schema“. A list of supported formats is contained in an explain request. Each schema has an optional „title“, a „name“ (used to refer to schemas in the „recordSchema“ HTTP parameter when doing a search query), an „identifier“, and an optional „location“. The „identifier“ contains an additional URI, and the „location“ contains a link to an XML Schema file or to some human-readable documentation (like the „docs“ attribute in unAPI). There is a list of known schemas at the SRU page, for instance:

title and location	name	identifier
MODS Schema Version 3.0	mods	info:srw/schema/1/mods-v3.0
MODS Schema Version 3.3	mods	info:srw/schema/1/mods-v3.3
MARCXML	marcxml	info:srw/schema/1/marcxml-v1.1

Note that one name (for instance „mods“) can refer to several schemas, but one particular SRU server can only provide one particular format under this name. The additional identifier neither refers to a particular XML Schema (Third problem). The identifier may only give a hint which particular version or interpretation of a format is provided.

Does anyone really need this diverse methods to refer to formats? I found in practice you cannot rely on the claimed format anyway, unless you can automatically validate it. That’s what XML Schema can be used for. I don’t say that XML Schema is the best or only method to formally describe an XML-based format (personally I much bettter like RELAX NG), but if there is an XML Schema – shouldn’t this schema be enough to identify the format?. Is there really a need of four independent identifiers to refer to an XML-based format? In the worst case we have:

Schema Name (e.g. mods)
Schema Location (e.g. http://www.loc.gov/standards/mods/v3/mods-3-3.xsd)
Schema Identifier (e.g. info:srw/schema/1/mods-v3.3)
Schema Namespace (e.g. http://www.loc.gov/mods/v3)

This is bad design, because you cannot say which of the four is the right one and how they relate to each other. A clean solution would only have two identifiers for XML-based formats:

The local name, which is only unique for a particular API and a particular server
The global schema Location, which is a cool URI that resolves to an XML Schema file.

The Schema Namespace is included as „targetNamespace“ in the XML Schema, and the Schema Identifier is delusion anyway. Either you can identify a format by a formal schema (that can also be used to validate records) or you just cannot guarantee which format your records will be in. Sure you can give some hints by linking to documentations, examples, and guidelines. But adding more identifiers is a fakery of control. You are still allowed to provide more specific formats, variants, application profiles, and interpretations under different names. But these formats don’t get more clear or usable if you give them a „Schema Identifier“. Does anyone uses SRU’s Schema Identifiers anyway? I think for XML we can better live with XML Schemas that the XML namespaces can be extracted from. An application can identify a format by its schema location, by the XML namespace, and/or by other information contained in the schema. Additional pointers to human-readable documentation are great. But don’t confuse description with identification if you need to refer to a data format.

P.S. At Code4lib mailing list Rob Sanderson pointed to our discussion we had about the same topic in 2009, and one of my earlier postings on XML4Lib also deals with SRU and namespaces.

Tags: API, Metadata, XML 2 Kommentare

How to encode the availability of documents

23. Oktober 2009 um 12:50 2 Kommentare

Since almost a year I work on a simple encoding format and API to just get the current (!) availability status of documents in libraries. Together with Reh Uwe (hebis network) and Anne Christensen (beluga project) we created the Document Availability Information API (DAIA) which is defined as data model with encoding in XML and JSON (whichever you prefer).

This week I finished and published a reference implementation of the DAIA protocol as open source Perl-module at CPAN. The implementation includes a simple DAIA validator and converter. A public installation of this validator is also available. The next tasks include implementing server and client components for several ILS software. Every library has its own special rules and schemas – Jonathan Rochkind already wrote about the problems to implement DAIA because of ILS complexity. We cannot erase this complexity by magic (unless we refactor and clean the ILS), but at least we can try to map it to a common data model – which DAIA provides.

With the DAIA Perl package you can concentrate on writing wrappers from your library systems to DAIA and easily consume and evaluate DAIA-encoded information. Why should everyone write its own routines to grab for instance the HTML OPAC output and parse availability status? One mapping to DAIA should fit most needs, so others can build upon. DAIA can not only be helpful to connect different library systems, but also to create mashups and services like „Show me on a map, where a given book is currently hold and available“ or „Send me a tweet if a given books in my library is available again“ – If you have more cool ideas for client applications, just let me know!

In the context of ILS Discovery Interface Task Force and their official recommendation DAIA implements the GetAvailability method (section 6.3.1). There are numerous APIs for several tasks in library systems (SRU/SRW, Z39.50, OpenSearch, OAI-PMH, Atom, unAPI etc.) but there was no open, usable standard way just to query whether a copy of given publication – for instance book – is available in a library, in which department, whether you can loan it or only use it in the library, whether you can directly get it online, or how long it will probably take until it is available again (yes, I looked at alternatives like Z39.50, ISO 20775, NCIP, SLNP etc. but they were hardly defined, documented, implemented and usable freely on the Web). I hope that DAIA is easy enough so non-librarians can make use of it if libraries provide an API to their system with DAIA. Extensions to DAIA can be discussed for instance in Code4Lib Wiki but I’d prefer to start with this basic, predefined services:

presentation: an item can be used inside the institution (in their rooms, in their intranet etc.).
loan: an item can be used outside of the institution (by lending or online access).
interloan: an tem can be used mediated by another institution. That means you do not have to interact with the institution that was queried for this item. This include interlibrary loan as well as public online ressources that are not hosted or made available by the queried institution.
openaccess: an item can be used imediately without any restrictions by the institution, you don’t even have to give it back. This applies for Open Access publications and free copies.

Tags: API, DAIA, Library, Perl 2 Kommentare

Link Resolver und Widgets im OPAC

26. November 2008 um 13:05 Keine Kommentare

Jonathan Rochkind hat in seinem Blog SeeAlso und Umlaut verglichen, die beide eine Form von Link Resolver darstellen. WÃ¤hrend Ã¼ber SeeAlso ausgehend von einer ID (ISBN, ISSN, DOI, PPN, …) eine Liste von passenden EintrÃ¤gen mit Links zurÃ¼ckgeliefert wird, liefert Umlaut ausgehend von einer OpenURL auch leicht komplexere Inhalte, wie zum Beispiel ein Formular, mit dem in verschiedenen Quellen im Volltext eines Buches gesucht werden kann. Die Abfrage geschieht jedoch leider nicht Ã¼ber eine standardisierte API sondern proprietÃ¤r (d.h. nur Umlaut kann lesen was Umlaut liefert).

Nach meiner Auffassung ist Umlaut (wie Link Resolver im Allgemeinen) eine spezielle Form einer Metasuche. Ãœber eine OpenURL-Anfrage kÃ¶nnen mehrere Dienste und Quellen (SFX, CrossRef, Amazon, OCLC Worldcat, ISBNdb, LibraryThing, Google Books, HathiTrust, OpenLibrary, the Internet Archive, Worldcat Identities …) gemeinsam abgefragt werden und die Ergebnisse werden auf einer Seite zusammengefasst (siehe Beispiel). Das Zusammenfassen von Inhalten aus mehreren Quellen kann allerdings auch im OPAC bei einer Titelanzeige geschehen (siehe Beispiel).

Allgemein ist fÃ¼r die Zusammenfassung von Inhalten aus unterschiedlichen Quellen (Mashup-Prinzip) hilfreich, wenn auf einheitliche APIs und Datenformate zurÃ¼ckgegriffen wird. Das populÃ¤rste Beispiel ist RSS (bzw. ATOM). SeeAlso kann als API und Format fÃ¼r einfachere Listen dienen und fÃ¼r komplexere Inhalte kÃ¤me vielleicht die Universal Widget API (UWA) in Frage. UWA-Widgets kÃ¶nnen auch in eigene Seiten eingebunden werden. Einen OPAC als Widget gibt es ja schon – wie wÃ¤re es umgekehrt einen OPAC aus Widgets zusammenzusetzen? Ich denke, dass es noch einige Zeit dauern wird, bis sich neben ATOM/RSS weitere gute Standards fÃ¼r die Aggregierung von Inhalten auf Webseiten durchsetzen. Praktische Beispiele gibt es ja inzwischen immer mehr, gerade hat Lorcan Dempsey einige davon zusammengefasst.

Tags: API, Mashup, Seealso, Umlaut, Widget Keine Kommentare

Jangle-API fÃ¼r Bibliothekssysteme

10. Juli 2008 um 15:00 2 Kommentare

Im VuFind-Projekt wird Ã¼berlegt, Jangle als allgemeine Schnittstelle fÃ¼r Bibliothekssysteme zu verwenden, anstatt fÃ¼r jedes System die Anbindung neu anzupassen. Jangle wird von Ross Singer (Talis) vorangetrieben und steht anscheinend sowohl in Konkurrenz als auch in Kooperation mit der DLF working group on digital library APIs. Ob Jangle etwas wird und ob es sich durchsetzt, ist noch offen – zumindest ist mehr technischer Sachverstand dabei als bei anderen bibliothekarischen Standards.

Ich halte es zumindest fÃ¼r wichtig, in etwa auf dem Laufenden zu sein und bei Bedarf zu versuchen, Einfluss zu nehmen, falls die Entwicklung vÃ¶llig an den eigenen BedÃ¼rfnissen vorbeigeht. So ganz verstanden habe ich Jangle allerdings bisher noch nicht. Statt eine neue Gesamt-API zu entwickeln, sollte meiner Meinung nach besser bei den existierenden Schnittstellen aufgerÃ¤umt werden – anschlieÃŸend kÃ¶nnen diese dann ja in Jangle o.Ã„. zusammengefasst werden.

Larry stellt Jangle und Primo gegenÃ¼ber: auf der einen Seite der systematische OpenSource-Ansatz, in den man sich erstmal einarbeiten muss und der dafÃ¼r insgesamt effizienter und flexibler ist und auf der anderen Seite das teure, unflexible Gesamtprodukt, das dafÃ¼r schick aussieht. Leider geht es bei den EntscheidungstrÃ¤gern meist primÃ¤r um die Fassade, so dass sich OpenSource nur langsam (aber sicher) durchsetzt.

Tags: API, digital library, Jangle, Standards 2 Kommentare

Working group on digital library APIs and possible outcomes

13. April 2008 um 14:48 3 Kommentare

Last year the Digital Library Federation (DLF) formed the „ILS Discovery Interface Task Force„, a working group on APIs for digital libraries. See their agenda and the current draft recommendation (February, 15th) for details [via Panlibus]. I’d like to shortly comment on the essential functions they agreed on at a meeting with major library system (ILS) vendors. Peter Murray summarized the functions as „automated interfaces for offloading records from the ILS, a mechanism for determining the availability of an item, and a scheme for creating persistent links to records.“

On the one hand I welcome if vendors try to agree on (open) standards and service oriented architecture. On the other hand the working group is yet another top-down effort to discuss things that just have to be implemented based on existing Internet standards.

1. Harvesting: In the library world this is mainly done via OAI-PMH. I’d also consider RSS and Atom. To fetch single records, there is unAPI – which the DLF group does not mention. There is no need for any other harvesting API – missing features (if any) should be integrated into extensions and/or next versions of OAI-PMH and ATOM instead of inventing something new. P.S: Google Wave shows what to expect in the next years.

2. Search: There is still good old overblown Z39.50. The near future is (slightly overblown) SRU/SRW and (simple) OpenSearch. There is no need for discussion but for open implementations of SRU (I am still waiting for a full client implementation in Perl). I suppose that next generation search interfaces will be based on SPARQL or other RDF-stuff.

2. Availability: The announcement says: „This functionality will be implemented through a simple REST interface to be specified by the ILS-DI task group“. Yes, there is definitely a need (in december I wrote about such an API in German). However the main point is not the API but to define what „availability“ means. Please focus on this. P.S: DAIA is now available.

3. Linking: For „Linking in a stable manner to any item in an OPAC in a way that allows services to be invoked on it“ (announcement) there is no need to create new APIs. Add and propagate clean URIs for your items and point to your APIs via autodiscovery (HTML link element). That’s all. Really. To query and distribute general links for a given identifier, I created the SeeAlso API which is used more and more in our libraries.

Furthermore the draft contains a section on „Patron functionality“ which is going to be based on NCIP and SIP2. Both are dead ends in my point of view. You should better look at projects outside the library world and try to define schemas/ontologies for patrons and patron data (hint: patrons are also called „customer“ and „user“). Again: the API itself is not underdefined – it’s the data which we need to agree on.

Tags: API, ATOM, digital library, OAI, Seealso, SOA, Standards 3 Kommentare

BÃ¼cher, die ich schon mal ausgeliehen hatte…

15. März 2008 um 15:38 1 Kommentar

Haftgrund berichtet, dass es in Wiener BÃ¼chereien „aus DatenschutzgrÃ¼nden“ nicht mehr mÃ¶glich ist, im Bibliothekssystem auf Wunsch die Titel zu speichern, die man in der Vergangenheit schon mal ausgeliehen hatte. SchÃ¶n, dass Bibliotheken anders als die meisten Webanbieter auf Datenschutz achten und nicht bis in alle Ewigkeiten speichern, wer wann was gemacht hat. In diesem Fall ist die BegrÃ¼ndung aber wohl eher vorgeschoben, schlieÃŸlich klingt „Datenschutz“ auch kompetenter als „unsere Software kann das nicht und/oder wir wissen nicht wie man sowas technisch umsetzt“. Die MÃ¶glichkeit, alle ausgeliehenden BÃ¼cher automatisch in einer Liste abzuspeichern, ist meinem Eindruck nach in Bibliotheken ebenso nÃ¼tzlich wie selten.

Wie Edlef bemerkt, kann man ausgeliehene BÃ¼cher ja auch „in citavi, Librarything oder beliebigen anderen Literaturverwaltungssystemen fÃ¼r sich selbst [speichern]“ – was prinzipiell tatsÃ¤chlich die bessere Variante ist. Nur sollten Bibliotheken Nutzer dabei nicht in „kenn wa nich, wolln wa nicht, ham wa nich, geht nich“-Manier abspeisen, sondern aktiv fÃ¼r die VerknÃ¼pfung von Bibliothekssystemen mit Literaturverwaltungssystemen sorgen. Alles was Ã¼ber ein Feld „Ausleihen automatisch bei BibSonomy/LibraryThing/etc. eintragen“ im Benutzerkonto hinausgeht, ist nicht zumutbar und Ã¼berflÃ¼ssig.

Ein LÃ¶sungsansatz gemÃ¤ÃŸ Serviceorientierter Architektur sÃ¤he folgendermaÃŸen aus:

Das Ausleihsystem bekommt eine Webschnitttstelle, Ã¼ber die Nutzer abfragen kÃ¶nnen, welche Medien sie zur Zeit ausgeliehen haben. Wenn dabei RSS oder ATOM eingesetzt wird (natÃ¼rlich nur Ã¼ber HTTPS und mit Passwort), sollten gÃ¤ngige Feed-Aggregatoren damit klarkommen. Bisher muss der Zugang zum Ausleihsystem fÃ¼r jede Anwendung wie BÃ¼cherwecker einzeln gehackt werden, was mÃ¼hsam und fehleranfÃ¤llig ist. Die ausgeliehenen Werke sollten (per URL-Parameter steuerbar) in mÃ¶glichst vielen Datenformaten beschrieben werden. Prinzipiell reicht aber auch die Standard-Kurztitelanzeige und ein Identifier, mit dem z.B. Ã¼ber unAPI weitere Details in verschiedenen Datenformaten angefordert werden kÃ¶nnen. ZusÃ¤tzlich zu diesem Pull-Verfahren sollte die MÃ¶glichkeit gegeben werden, Ã¡ la Pingback andere Dienste zu benachrichtigen sobald ein Benutzer ein Medium ausgeliehen hat.

Den Rest (das automatische Eintragen in eine Literaturverwaltung) kann – und sollte – bei gegebenen APIs eine eigene Anwendung Ã¼bernehmen. Es wÃ¼rde mich nicht wundern, wenn die fixen Entwickler von LibraryThing das selber basteln oder sich ein Student im Rahmen einer Diplom- oder Bachelorarbeit daran setzt. Sobald saubere, einfache Schnittstellen verfÃ¼gbar sind, ist der Aufwand fÃ¼r neue „Mashups“ und Zusatzdienste minimal.

Tags: API, Bibliothek, Feed, Katalog, Mashup, SOA 1 Kommentar

Schnittstelle fÃ¼r VerfÃ¼gbarkeitsdaten von BibliotheksbestÃ¤nden

21. Januar 2008 um 11:22 4 Kommentare

Letzen Dezember habe ich Ã¼ber Serviceorientierte Architektur geschrieben und bin unter Anderem auf den Heidelberger UB-Katalog eingegangen. Dabei ging es darum, wie Daten einzelner Exemplare von BibliotheksbestÃ¤nden – speziell VerfÃ¼gbarkeitsdaten – Ã¼ber eine Schnittstelle abgefragt werden kÃ¶nnen. Bislang gibt es dafÃ¼r keinen einfachen, einheitlichen Standards sondern hÃ¶chstens verschiedende proprietÃ¤re Verfahren. Till hat mich zu Recht auf den Artikel „Beyond OPAC 2.0: Library Catalog as Versatile Discovery Platform“ hingewiesen, in dem die API-Architektur des Katalogs an der North Carolina State University vorgestellt wurde. Die Beispiele zeigen gut, was mit dem Buzzword „Serviceorientierte Architektu“ eigentlich gemeint ist, wie sowas in Bibliotheken umgesetzt werden kann und was fÃ¼r Vorteile der Einsatz von einfachen, webbasierten Schnittstellen bringt. Die als CatalogWS bezeichnete API ist – wie es sich gehÃ¶rt – offen dokumentiert. CatalogWS enthÃ¤lt einen Catalog Availability Web Service, der ausgehend von einer ISBN ermittelt, in welchen (Teil)bibliotheken ein Titel verfÃ¼gbar oder ausgeliehen ist.

Bei Bedarf kÃ¶nnte ich mal versuchen, diese API fÃ¼r die GBV-Kataloge zu implementieren. Andererseits sollte man sich vielleicht erstmal Gedanken darÃ¼ber machen, was es noch fÃ¼r Kandidaten fÃ¼r eine VerfÃ¼gbarkeitsschnittstelle gibt und welche Daten Ã¼ber so eine Schnittstelle abfragbar sein sollten: Das NCIP-Protokoll scheint mir wie Z39.50 nicht wirklich zukunftsfÃ¤hig zu sein. Janifer Gatenby macht in ihrem Vortrag „Bridging the gap between discovery and delivery“ (PPT) weitere durchdachte VorschlÃ¤ge. Auf den Mailinglisten CODE4LIB und PERL4LIB habe ich letzte Woche herumposaunt, wie wichtig eine Holding-API wÃ¤re und dass das doch alles eigentlich ganz einfach sei. Neben Iinteressanten Bemerkungen zu FRBR bin ich daraufhin auf Holding-data in Z39.50 hingewiesen worden. In den PICA-LBS-Systemen stehen die VerfÃ¼gbarkeitsdaten soweit ich es herausgefunden, habe im Feld 201@, aber nur teilweise. FÃ¼r die weitere Umsetzung wÃ¤re es wahrscheinlich sinnvoll, erstmal alle in der Praxis vorkommenden VerfÃ¼gbarkeits-Stati (ausleihbar, PrÃ¤senzbestand, Kurzausleihe, ausgeliehen, unbekannt…) zu ermitteln. FÃ¼r elektronische Publikationen sollte die Schnittstelle auÃŸerdem irgendwie mit existierenden Linkresolvern zusammenarbeiten kÃ¶nnen. Eine einfache Schnittstelle fÃ¼r VerfÃ¼gbarkeitsdaten von Bibliotheken ist also nicht ganz trivial, aber solange nicht jeder Spezialfall berÃ¼cksichtigt wird oder erstmal ein Gremium eingesetzt werden muss, dÃ¼rfte es machbar sein. Hat sonst noch jemand Interesse?

Tags: API, Bibliothek, GBV, Katalog, PICA, SOA 4 Kommentare

thingISBN is getting better – so does SeeAlso

17. Januar 2008 um 00:05 Keine Kommentare

Tim Spalding points out that thingISBN is getting better [via FRBR blog] – a good reason to update the thingISBN data of our SeeAlso Linkserver isbn2librarything. It now includes 2994299 ISBNs, that’s almost 12 times the size of the isbn2wikipedia linkserver. But what’s a linkserver? A linkserver is a server that gets an identifier (for instance an ISBN) and gives you links (for instance a link to a work at LibraryThing). You can try out some SeeAlso linkservers at a demo page or start to implement your own.

To get an impression, look in GBV union catalog, for instance this record: after a short moment, a link to German Wikipedia is included. More and more German libraries follow to include such additional links in their catalog. The Catalog of University Bremen includes both links to (German) Wikipedia and links to LibraryThing (you have to click at „weitere Informationen…“ at a selected record):

There is also an experimental ISBN2WorldCat linkserver. But linkservers that only send one or zero links are not very effective. I could combine multiple service to something like ISBN2SocialCataloging or such. Any suggestions for other services that libraries might not be frightened to link to? I’d also like to include more data in the link to LibraryThing (for instance the number of reviews) but unless German libraries wake up to ask for LibraryThing for Libraries I better not reimplement services that LibraryThing already offers.

The linkserver model could also be used to provide links to library holding and thus implement a FRBR-Manifestation to FRBR-Item mapping. As I just wrote at CODE4LIB and PER4LIB, a simple data format for holding information is needed to do so.

Tags: API, GBV, ISBN, LibraryThing, Seealso, Webservices, Wikipedia Keine Kommentare

Nächste Seite »

Jakoblog — Das Weblog von Jakob VoÃŸ