<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jakoblog &#187; digital library</title>
	<atom:link href="http://jakoblog.de/tag/digital-library/feed/" rel="self" type="application/rss+xml" />
	<link>http://jakoblog.de</link>
	<description>Das Weblog von Jakob Voß</description>
	<lastBuildDate>Tue, 24 Jan 2012 11:02:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.4</generator>
		<item>
		<title>TPDL 2011 Doctoral Consortium – part 3</title>
		<link>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-%e2%80%93-part-3/</link>
		<comments>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-%e2%80%93-part-3/#comments</comments>
		<pubDate>Sun, 25 Sep 2011 15:36:48 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[tpdl]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1134</guid>
		<description><![CDATA[See also part 1 and part 2 of conference-blogging and #TPDL2011 on twitter. My talk about general patterns in data was recieved well and I got some helpful input. I will write about it later. Steffen Hennicke, another PhD student of my supervisor Stefan Gradman, then talked about his work on modeling Archival Finding Aids, [...]]]></description>
			<content:encoded><![CDATA[<p><em>See also <a href="http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium/">part 1</a> and <a href="http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-part-2/">part 2</a> of conference-blogging </em> and <a href="http://twitter.com/search?q=%23tpdl2011">#TPDL2011 on twitter</a>.</p>
<p><a href="http://aboutdata.org/bibliography">My talk about general patterns in data</a> was recieved well and I got some helpful input. I will write about it later. <a href="http://www.ibi.hu-berlin.de/institut/mitarbA-Z/akadmitarb/hennicke">Steffen Hennicke</a>, another PhD student of my supervisor Stefan Gradman, then talked about his work on modeling Archival Finding Aids, which are possibly expressed in <a href="http://en.wikipedia.org/wiki/Encoded_Archival_Description">EAD</a>. The structure of EAD is often not suitable to answer user needs. For this reason Hennicke analyses EAD data and reference questions, to develope better structures that users can follow to find what they look for in archives. This is done in CIDOC-CRM as a high-level ontology and the main result will be an expanded EAD model in RDF. To me the problem of &#8220;semantic gaps&#8221; is interesting, and I think about using some of Hennicke data as example to explain data patterns in my work.</p>
<p>The last talk by <a href="http://www-iw.uni-regensburg.de/promowiki/Profiles/RitaStrebe">Rita Strebe</a> was about aesthetical user experience of websites. One aim of her work is to measure the significance of aesthetical perception. In particular her hypothesis to be evaluated by experiments are:</p>
<blockquote><p>
H1: On a high level, the viscerally perceived visual aesthetics of websites effects<br />
approach behaviour.<br />
H2: On a low level, the viscerally perceived visual aesthetics of websites effects<br />
avoidance behaviour.
</p></blockquote>
<p>Methods and preliminary results look valid, but the relation to digital libraries seems low and so was the expertise of Strebe&#8217;s motivation and methods among the participants. I suppose her work better fits to <a href="http://en.wikipedia.org/wiki/Human%E2%80%93computer_interaction">Human-Computer Interaction</a>.</p>
<p>After the official part of the program <a href="http://www.peachnote.com/about.html">Vladimir Viro</a> briefly presented his music search engine <a href="http://www.peachnote.com/">peachnote.com</a>, that is based on scanned muscial scores. If I was working in or with musical libraries, I would not hesitate to contact Viro! I also though about a search for free musical scores in Wikimedia framework. The Doctoral Consortium ended with a general discussion about dissertation, science, libraries, users, and everything, as it should be <img src='http://jakoblog.de/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-%e2%80%93-part-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digital libraries sleep away the web 2.0</title>
		<link>http://jakoblog.de/2008/10/01/digital-libraries-sleep-away-the-web-20/</link>
		<comments>http://jakoblog.de/2008/10/01/digital-libraries-sleep-away-the-web-20/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 21:58:31 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[EPrints]]></category>
		<category><![CDATA[repository]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=464</guid>
		<description><![CDATA[Frome time to time still publish on paper, so I have to deposit the publication in a repository to make it (and its metadata) available; mostly I use the &#8220;open archive for Library and Information Science&#8221; named E-LIS. But each time I get angry because uploading and describing a submission is so complicated &#8211; especially [...]]]></description>
			<content:encoded><![CDATA[<p>Frome time to time still publish on paper, so I have to deposit the publication in a repository to make it (and its metadata) available; <a href="http://eprints.rclis.org/perl/user?userid=903">mostly I use</a> the &#8220;open archive for Library and Information Science&#8221; named <a href="http://eprints.rclis.org/">E-LIS</a>. But each time I get angry because uploading and describing a submission is so complicated &#8211; especially compared to popular commercial repositories like flickr, slideshare youtube and such. These web applications pay a lot attention to usability &#8211; which sadly is of low priority in many digital libraries.</p>
<p><a href="http://www.eprints.org/software/"><img src="http://jakoblog.de/wp-content/uploads/2008/10/eprints.gif" alt="" title="eprints" width="154" height="57" class="alignright size-full wp-image-465" /></a>I soon realized that E-LIS uses a very old version (2.13.1) of GNU EPrints &#8211; <a href="http://www.eprints.org/software/v3/">EPrints 3</a> is available since December 2006 and there have been <a href="http://files.eprints.org/view/type/release.html">many updates</a> since then. To find out whether it is usual to run a repository with such an outdated software, I did a quick study. The <a href="http://roar.eprints.org/">Registry of Open Access Repositories</a> (ROAR) should list all relevant public repositories <a href="http://roar.eprints.org/index.php?action=browse#version">that run with EPrints</a>. With 30 lines of Perl I fetched the list (271 repositories), and queried each repository <a href="http://www.openarchives.org/OAI/openarchivesprotocol.html">via OAI</a> to find out the version number. Here the summarized result in short:</p>
<p>76 x unknown (script failed to get or parse OAI response), 8 x 2.1, 18 x 2.2, 98 x 2.3, 58 x 3.0, 13 x 3.1</p>
<p>Of 195 repositories (that I could successfully query and determine the version number of) only 13 use the newest version 3.1 (released September 8th). Moreover 124 still use version 2.3 or older. EPrints 2.3 was released before the web 2.0 hype in 2005! One true point of this web 2.0 bla is the concept of &#8220;perpetual beta&#8221;: release early but often and follow user feedback, so your application will quickly improve. But most repository operators do not seem to have a real interest in improvement and in their users!</p>
<p>Ok, I know that managing and updating a repository server is work &#8211; <em>I</em> would not be the right guy for such a job &#8211; but then don&#8217;t wail over low acceptance or wonder why libraries have an antiquated image. For real progress one should perpetually do user studies and engage in the developement of your software. Digital libraries with less resources should at least <a href="http://www.eprints.org/community/">join the Community</a> and follow updates to keep up to date.</p>
<p><b>P.S:</b> E-LIS has updated its software now (November 2008). A lot of missing features remain but those need to be implemented in EPrints first.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2008/10/01/digital-libraries-sleep-away-the-web-20/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Insight into the Digital Library Reference Model</title>
		<link>http://jakoblog.de/2008/09/18/insight-into-the-digital-library-reference-model/</link>
		<comments>http://jakoblog.de/2008/09/18/insight-into-the-digital-library-reference-model/#comments</comments>
		<pubDate>Thu, 18 Sep 2008 16:31:56 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[DELOS]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[ecdl]]></category>
		<category><![CDATA[ecdl2008]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=438</guid>
		<description><![CDATA[On the first post-ECDL-conference day I participated in the Third Workshop on Foundation of Digital Libraries (DLFoundations 2008) that was organized by the DELOS Network of Excellence on Digital Libraries. One major work of DELOS is the DELOS Digital Library Reference Model (DLRM). The DLRM is an abstract model to describe digital libraries; it can [...]]]></description>
			<content:encoded><![CDATA[<p>On the first post-<a href="http://www.ecdl2008.org/">ECDL-conference</a> day I participated in the <em><a href="http://www.delos.info/DLFoundations2008">Third Workshop on Foundation of Digital Libraries</a></em> (DLFoundations 2008) that was organized by the <em><a href="http://www.delos.info">DELOS Network of Excellence on Digital Libraries</a></em>. One major work of DELOS is the <em><a href="http://www.delos.info/ReferenceModel">DELOS Digital Library Reference Model</a></em> (DLRM). The DLRM is an abstract model to describe digital libraries; it can be compared to the <em>CIDOC Conceptual Reference Model</em> (CIDOC-CRM) and the <em>Functional Requirements for Bibliographic Records</em> (FRBR) &#8211; and it shares some of their problems: a lack of availability and (resulting) a lack of implementations.</p>
<p>The DLRM is &#8220;defined&#8221; in a 213-page PDF-file &#8211; this is just not usable! Have a look at W3C or IETF how nowadays standards are defined. As DLRM is a conceptual model, you must also provide a RDF representation or it is just inavailable for serious applications on the web. And of course the silly copyright statement should be removed in favor of a CC-license. That&#8217;s the formal part. Summarizing the content of DLRM there are 218 concepts and 52 relations &#8211; which is <em>far</em> too much to start with. But: there are some really useful ideas behind DLRM.</p>
<p><img src="http://jakoblog.de/wp-content/uploads/2008/09/dl-dls-dlms.png" title="Three levels of a digital library" width="500" height="224" class="aligncenter size-full wp-image-439" /></p>
<p>The reference model includes a division of &#8220;digital library&#8221; into three levels of conceptualization (see image): first the visible <em>digital library</em> (the collection that users work with), second the <em>digital library system</em> (the software application that is installed and configured to run a digital library), and third the <em>digital library management system</em> (the software system that supports the production and administration of digital libraries). This division can be helpful to understand and talk about digital libraries &#8211; although I am not sure whether the division between digital library systems and digital library management systems is a such a good one.</p>
<p>Beside general talks about the Digital Library Reference Model the workshop provided some experience from practise by Wolfram Hostmann (<a href="http://www.driver-support.eu/">DRIVER project</a>) and by Georg Eckes (<a href="http://www.deutsches-filminstitut.de/">Deutsches Filminstitut</a>) &#8211; never underestimate good real world examples! The most refreshing talk was given by <a href="http://www.cni.org/staff/joan_index.html">Joan Lippincott</a> (<a href="http://www.cni.org/">Coalition of Networked Information</a>). She pointed out that much more then traditional repositories can be viewed as digital libraries. Especially user-generated content can constitute a digital library. A useful model for digital libraries should also fit for collections at Flickr, YouTube, Wikis, Weblogs etc. and user can mash up resources to create new digital library services, for instance the species search engine <a href="http://ispecies.org/">iSpecies</a>. She is <em>sooo</em> right! In addition Joan mentioned initiatives to broaden the use of authority files and identity management. Another direct hit! If digital libraries only focus on interoperability with other &#8220;official&#8221; digital libraries they will not remain. Libraries are only one little player in the digital knowledge environment and their infrastructure is <em>not</em> defined only by them.</p>
<p>I enjoyed the workshop, I really like the digital library community and I am happy to be part of it. But some parts still seem to live in an ivory tower. If the digital library reference model does not quickly get adopted to real applications (<em>both</em> repositories like those build with DSpace, Fedora, EPrints etc. and open systems like YouTube, Wikipedia, Slideshare&#8230;), it is nothing but an interesting idea. The digital revolution is taking place anyway, so let&#8217;s better be part of it!</p>
<p><b>P.S:</b> The slides will soon be available at <a href="http://www.delos.info/DLFoundations2008">the Workshop&#8217;s website</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2008/09/18/insight-into-the-digital-library-reference-model/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Jangle-API für Bibliothekssysteme</title>
		<link>http://jakoblog.de/2008/07/10/jangle-api-fuer-bibliothekssysteme/</link>
		<comments>http://jakoblog.de/2008/07/10/jangle-api-fuer-bibliothekssysteme/#comments</comments>
		<pubDate>Thu, 10 Jul 2008 13:00:26 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[de]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[Jangle]]></category>
		<category><![CDATA[Standards]]></category>

		<guid isPermaLink="false">http://jakoblog.de/2008/07/10/jangle-api-fuer-bibliothekssysteme/</guid>
		<description><![CDATA[Im VuFind-Projekt wird überlegt, Jangle als allgemeine Schnittstelle für Bibliothekssysteme zu verwenden, anstatt für jedes System die Anbindung neu anzupassen. Jangle wird von Ross Singer (Talis) vorangetrieben und steht anscheinend sowohl in Konkurrenz als auch in Kooperation mit der DLF working group on digital library APIs. Ob Jangle etwas wird und ob es sich durchsetzt, [...]]]></description>
			<content:encoded><![CDATA[<p>Im <a href="http://www.vufind.org/">VuFind-Projekt</a> wird überlegt, <a href="http://www.jangle.org/">Jangle</a> als allgemeine Schnittstelle für Bibliothekssysteme zu verwenden, anstatt für jedes System die Anbindung neu anzupassen. Jangle wird von Ross Singer (Talis) <a href="http://blogs.talis.com/panlibus/archives/2008/05/the-iron-fist-of-interoperability.php">vorangetrieben</a> und steht anscheinend sowohl in Konkurrenz als auch in Kooperation mit der <a href="http://jakoblog.de/2008/04/13/working-group-on-digital-library-apis-and-possible-outcomes/">DLF working group on digital library APIs</a>. Ob Jangle etwas wird und ob es sich durchsetzt, ist noch offen &#8211; zumindest ist mehr technischer Sachverstand dabei als bei anderen bibliothekarischen Standards.</p>
<p>Ich halte es zumindest für wichtig, in etwa auf dem Laufenden zu sein und bei Bedarf zu versuchen, Einfluss zu nehmen, falls die Entwicklung völlig an den eigenen Bedürfnissen vorbeigeht. So ganz verstanden habe ich Jangle allerdings bisher noch nicht. Statt eine neue Gesamt-API zu entwickeln, sollte meiner Meinung nach besser bei den <a href="http://jakoblog.de/2007/11/30/relevant-apis-for-digital-libraries/">existierenden Schnittstellen</a> aufgeräumt werden &#8211; anschließend können diese dann ja in Jangle o.Ä. zusammengefasst werden.</p>
<p>Larry stellt <a href="http://nodalib.wordpress.com/2008/06/05/jangle-vs-primo/">Jangle und Primo</a> gegenüber: auf der einen Seite der systematische OpenSource-Ansatz, in den man sich erstmal einarbeiten muss und der dafür insgesamt effizienter und flexibler ist und auf der anderen Seite das teure, unflexible Gesamtprodukt, das dafür schick aussieht. Leider geht es bei den Entscheidungsträgern meist primär um die Fassade, so dass sich OpenSource nur langsam (aber sicher) durchsetzt.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2008/07/10/jangle-api-fuer-bibliothekssysteme/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>First impressions from the Europeana meeting</title>
		<link>http://jakoblog.de/2008/06/23/first-impressions-from-the-europeana-meeting/</link>
		<comments>http://jakoblog.de/2008/06/23/first-impressions-from-the-europeana-meeting/#comments</comments>
		<pubDate>Mon, 23 Jun 2008 10:02:12 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[EU]]></category>
		<category><![CDATA[Europeana]]></category>

		<guid isPermaLink="false">http://jakoblog.de/2008/06/23/first-impressions-from-the-europeana-meeting/</guid>
		<description><![CDATA[Today and tomorrow I participate in a Europeana/EDLnet conference in Den Haag. Europeana is a large EU-funded project to create a &#8220;European Digital Library&#8221;. Frankly speaking I cannot give a simple definition of Europeana because in the first instance it is just a buzzword. You can ask whether library portals have a future at all, [...]]]></description>
			<content:encoded><![CDATA[<p>Today and tomorrow I participate in a <a href="http://www.europeana.eu/conference.php">Europeana/EDLnet conference</a> in Den Haag. Europeana is a large EU-funded project to create a &#8220;European Digital Library&#8221;. Frankly speaking I cannot give a simple <a href="http://openobjects.blogspot.com/2008/04/notes-from-museums-and-europeana.html">definition of Europeana</a> because in the first instance it is just a buzzword. You can ask <a href="http://telfleur.wordpress.com/2008/06/20/outcomes-user-board/">whether library portals have a future</a> at all, and Europeana has many ingredients that may help to make it fail: a large ambitious plan, a tight schedule, and many different participants with different languages, cultures and needs. The current Europeana Outline Function Specification partly reads like a magic wishlist. Instead of following one simple, good idea from bottom-up, it looks like the attempt to follow many ideas in a bottom-up way.</p>
<p>But if you see Europeana less as a monolithical project but as a network of participants (libraries, museums, and archives) that try to agree on standards to improve interoperability, the attempt seems more promising. In his introductionary speech Jill Cousins (director of Europeana) stressed the importance of APIs and ways to export information from Europeana so other institutions can build their websites and mashups with services and content from Europeana. I hope that open content respositories like <a href="http://commons.wikimedia.org">Wikimedia Commons</a> and <a href="http://wikisource.org">Wikisource</a> can act as both as source and as target of exchange with the European Digital Library. What I also found interesting is that Google and Wikipedia are seen as the default role models or at least important examples of portals to compare with. The talks I have seen so far give me the impression that the view on possibilities and roles for libraries, museums, and archives are more realistic than I thought. One of their strength is that they hold the content and are responsible for it &#8211; in contrast to user generated collections like YoutTube, Slideshare, and (partly) Wikipedia. But in general cultural institutions are only one player among others on the web &#8211; so they also need to develop further and &#8220;let the data flow&#8221;. If the Europeana project helps in this development, it is a good project, no matter if you call it a &#8220;European Digital Library&#8221; or not.</p>
<p>The next short talks were about CIDOC-CRM and about OAI-ORE &#8211; both complex techniques that you cannot easily describe in short time (unless you are a really good Wikipedia author <img src='http://jakoblog.de/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> , but at the least my imagination of them has been improved.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2008/06/23/first-impressions-from-the-europeana-meeting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digitale Bibliotheken vor 8 Jahren</title>
		<link>http://jakoblog.de/2008/05/27/digitale-bibliotheken-vor-8-jahren/</link>
		<comments>http://jakoblog.de/2008/05/27/digitale-bibliotheken-vor-8-jahren/#comments</comments>
		<pubDate>Tue, 27 May 2008 00:52:56 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[de]]></category>
		<category><![CDATA[digital library]]></category>

		<guid isPermaLink="false">http://jakoblog.de/2008/05/27/digitale-bibliotheken-vor-8-jahren/</guid>
		<description><![CDATA[Ich weiß nicht mehr wie genau Google mich auf diese Präsentation gebracht hat und eigentlich ist es ja uncool, auf Folien zu verweisen, die nicht auf Slideshare liegen (archiviert das eigentlich endlich mal jemand); aber irgendwie hat mich schon etwas beeindruckt, wie wenig von dem Grundlagen-Workshop zum Thema &#8220;Digitale Bibliotheken&#8221; auf der zweiten CARNet-Konferenz (Zagreb, [...]]]></description>
			<content:encoded><![CDATA[<p>Ich weiß nicht mehr wie genau Google mich auf <a href="http://www.carnet.hr/CUC/cuc2000/radovi/prezentacije/WS/WS2/index.html">diese Präsentation</a> gebracht hat und eigentlich ist es ja uncool, auf Folien zu verweisen, die nicht auf <a href="http://www.slideshare.com">Slideshare</a> liegen (archiviert das eigentlich endlich mal jemand); aber irgendwie hat mich schon etwas beeindruckt, wie wenig von dem <a href="http://www.carnet.hr/CUC/cuc2000/radovi/ws2.html">Grundlagen-Workshop zum Thema &#8220;Digitale Bibliotheken&#8221;</a> auf der <a href="http://www.carnet.hr/CUC/cuc2000/">zweiten CARNet-Konferenz</a> (Zagreb, 2000) bislang veraltet ist. Trotz aller Berechtigung von Initiativen wie &#8220;Bibliothek 2.0&#8243; gilt nämlich: die <em>grundlegenden </em> Entwicklungen im Bibliothekswesen sind längerfristig und schon seit mehr als 10 Jahren abzusehen. Die Automatisierung schreitet (auch) in Bibliotheken kontinuierlich immer weiter voran und was nicht digital wird, wird marginal. Zugriff sind nicht mehr auf einen Ort beschränkt, alles ist beliebig kopierbar und auch die Kommunikation findet immer mehr im digitalen Raum statt. Soweit, so vorhersagbar.</p>
<p>Ebenso beeindruckend wie die Kontinuität dieser Entwicklung, ist allerdings, dass bei den Beteiligten in (digitalen) Bibliotheken oft elementarste Grundlagen fehlen. Mein Standardbeispiel ist (immer wieder durch die Praxis bestätigt!) die verbreitete Unkenntnis der grundlegenden Unterschiede zwischen einer <a href="http://de.wikipedia.org/wiki/Suchmaschine#Realisierung">einfachen Suchmaschine</a> und einer <a href="http://de.wikipedia.org/wiki/Metasuchmaschine">Metasuchmaschine</a>. Um unabhängig von allen Hypes ein eigenes Gefühl für diese und andere Grundbegriffe zu bekommen (weitere ad-hoc-Beispiele: &#8220;Datenformat&#8221;, &#8220;Harvesting&#8221;,  &#8220;Feed&#8221;, &#8220;Community&#8221;&#8230;) muss man sich immer wieder damit beschäftigen. Gut, wenn es also Fortbildungen, wie den Eingangs genannten Workshop gibt. Einen kleinen Beitrag liefert die Reihe <a href="http://lernen20.wordpress.com/">Lernen 2.0</a> zum Selbststudium.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2008/05/27/digitale-bibliotheken-vor-8-jahren/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Working group on digital library APIs and possible outcomes</title>
		<link>http://jakoblog.de/2008/04/13/working-group-on-digital-library-apis-and-possible-outcomes/</link>
		<comments>http://jakoblog.de/2008/04/13/working-group-on-digital-library-apis-and-possible-outcomes/#comments</comments>
		<pubDate>Sun, 13 Apr 2008 12:48:50 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[ATOM]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[OAI]]></category>
		<category><![CDATA[Seealso]]></category>
		<category><![CDATA[SOA]]></category>
		<category><![CDATA[Standards]]></category>

		<guid isPermaLink="false">http://jakoblog.de/2008/04/13/working-group-on-digital-library-apis-and-possible-outcomes/</guid>
		<description><![CDATA[Last year the Digital Library Federation (DLF) formed the &#8220;ILS Discovery Interface Task Force&#8220;, a working group on APIs for digital libraries. See their agenda and the current draft recommendation (February, 15th) for details [via Panlibus]. I&#8217;d like to shortly comment on the essential functions they agreed on at a meeting with major library system [...]]]></description>
			<content:encoded><![CDATA[<p>Last year the <a href="http://www.diglib.org/">Digital Library Federation</a> (DLF) formed the &#8220;<a href="http://blogs.lib.berkeley.edu/shimenawa.php/2008/04/04/ils_basic_discovery">ILS Discovery Interface Task Force</a>&#8220;, a working group on <a href="http://jakoblog.de/2007/11/30/relevant-apis-for-digital-libraries/">APIs for digital libraries</a>. See <a href="https://project.library.upenn.edu/confluence/display/ilsapi/Charge+and+Agenda">their agenda</a> and the <a href="https://project.library.upenn.edu/confluence/display/ilsapi/Draft+Recommendation">current draft recommendation</a> (February, 15th) for details [<a href="http://blogs.talis.com/panlibus/archives/2008/04/ils-vendors-support-berkeley-accord-on-apis.php">via Panlibus</a>]. I&#8217;d like to shortly comment on the essential functions they agreed on at a meeting with major library system (ILS) vendors. <a href="http://dltj.org/article/dlf-ils-statement/">Peter Murray summarized</a> the functions as &#8220;automated interfaces for offloading records from the ILS, a mechanism for determining the availability of an item, and a scheme for creating persistent links to records.&#8221;</p>
<p>On the one hand I welcome if vendors try to agree on (open) standards and service oriented architecture. On the other hand the working group is yet another top-down effort to discuss things that just have to be implemented based on existing Internet standards.</p>
<p><b>1. Harvesting</b>: In the library world this is mainly done via <a href="http://www.openarchives.org/OAI/openarchivesprotocol.html">OAI-PMH</a>. I&#8217;d also consider <a href="http://en.wikipedia.org/wiki/RSS">RSS</a> and  <a href="http://atompub.org/rfc4287.html">Atom</a>. To fetch single records, there is <a href="http://unapi.info/">unAPI</a> &#8211; which the DLF group does not mention. There is no need for <em>any</em> other harvesting API &#8211; missing features (if any) should be integrated into extensions and/or next versions of OAI-PMH and ATOM instead of inventing something new. P.S: <a href="http://www.jasonkolb.com/weblog/2009/09/why-google-wave-is-the-coolest-thing-since-sliced-bread.html">Google Wave</a> shows what to expect in the next years.</p>
<p><b>2. Search</b>: There is still good old overblown Z39.50. The near future is (slightly overblown) <a href="http://www.loc.gov/standards/sru/">SRU/SRW</a> and (simple) <a href="http://www.opensearch.org/">OpenSearch</a>. There is no need for discussion but for open implementations of SRU (I am still waiting for a full client implementation in Perl). I suppose that next generation search interfaces will be based on SPARQL or other RDF-stuff.</p>
<p><b>2. Availability</b>: The <a href="http://blogs.lib.berkeley.edu/shimenawa.php/2008/04/04/ils_basic_discovery">announcement</a> says: &#8220;This functionality will be implemented through a simple REST interface to be specified by the ILS-DI task group&#8221;. Yes, there is definitely a need (in december I <a href="http://jakoblog.de/2007/12/23/heidelberger-katalog-auf-dem-weg-zu-serviceorientierter-architektur/">wrote about such an API</a> in German). However the main point is not the API but to define what &#8220;availability&#8221; means. Please focus on this. P.S: <a href="http://purl.org/NET/DAIA">DAIA</a> is now available.</p>
<p><b>3. Linking:</b> For &#8220;Linking in a stable manner to any item in an OPAC in a way that allows services to be invoked on it&#8221; (announcement) there is no need to create new APIs. Add and propagate clean URIs for your items and point to your APIs via autodiscovery (HTML link element). That&#8217;s all. Really. To query and distribute general links for a given identifier, I created <a href="http://www.gbv.de/wikis/cls/SeeAlso_Simple_Specification">the SeeAlso API</a> which is used more and more in our libraries.</p>
<p>Furthermore the draft contains a section on &#8220;<a href="https://project.library.upenn.edu/confluence/display/ilsapi/Patron+Functionality">Patron functionality</a>&#8221; which is going to be based on <a href="http://www.niso.org/committees/committee_at.html">NCIP</a> and <a href="http://multimedia.mmm.com/mws/mediawebserver.dyn?6666660Zjcf6lVs6EVs66S0LeCOrrrrQ-">SIP2</a>. Both are dead ends in my point of view. You should better look at projects <em>outside the library world</em> and try to define schemas/ontologies for patrons and patron data (hint: patrons are also called &#8220;customer&#8221; and &#8220;user&#8221;). Again: the API itself is not underdefined &#8211; it&#8217;s the data which we need to agree on.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2008/04/13/working-group-on-digital-library-apis-and-possible-outcomes/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>OCLC Grid Services &#8211; first insights</title>
		<link>http://jakoblog.de/2007/11/28/oclc-grid-services-first-insights/</link>
		<comments>http://jakoblog.de/2007/11/28/oclc-grid-services-first-insights/#comments</comments>
		<pubDate>Wed, 28 Nov 2007 09:58:22 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[Mashup]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[PICA]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://jakoblog.de/2007/11/28/oclc-grid-services-first-insights/</guid>
		<description><![CDATA[I am just sitting at a library developer meeting at OCLC&#124;PICA in Leiden to get to know more about OCLC Service Grid, WorldCat Grid, or whatever the new service-oriented product portfolio of OCLC will be called. As Roy Tennant pointed out, our meeting is &#8220;completely bloggable&#8221; so here we are &#8211; a dozen of European [...]]]></description>
			<content:encoded><![CDATA[<p>I am just sitting at a library developer meeting at <a href="http://www.oclcpica.org">OCLC|PICA</a> in Leiden to get to know more about OCLC Service Grid, WorldCat Grid, or whatever the new service-oriented product <a href="http://www.oclc.org/services/">portfolio of OCLC</a> will be called. As <a href="http://roytennant.com/">Roy Tennant</a> pointed out, our meeting is &#8220;completely bloggable&#8221; so here we are &#8211; a dozen of European kind-of system librarians.</p>
<p>The &#8220;Grid Services&#8221; that OCLC is going to provide is based on the &#8220;OCLC Services Architecture&#8221; (OSA), a framework by which network services are built &#8211; I am fundamentally sceptical on additional frameworks, but let&#8217;s have a look. </p>
<p>The basic idea about services is to provide a set of small methods for a specific purpose that can be accessed via HTTP. People can then use this services and build and share unexpected application with them &#8211; a principle that is called <em>Mashups</em>.</p>
<p>The OCLC Grid portfolio will have four basic pillars:</p>
<p><strong>network services</strong>: search services, metadata extraction, identity management, payment services, social services (voting, commenting, tagging&#8230;) etc.</p>
<p><strong>registries and data resources</strong>: bibliographic registries, knowledge bases, registries of institutions etc. (see <a href="http://www.worldcat.org/registry">WorldCat registries</a>)</p>
<p><strong>reusable components</strong>: a toolbox of programming components (clients, samples, source code libraries etc.)</p>
<p><strong>community</strong>: a developer network, involvement in open source developement etc. </p>
<p>Soon after social services were mentioned, at heavy discussion on reviews, and commenting started &#8211; I find the questions raised with user generated content are less technical but more social. Paul stressed that users are less and less interested in metadata but directly want the content of an information object (book, article, book chapter etc.). The community aspect is still somehow vague to me, we had some discussion about it too. Service oriented architecture also implies a different way of software engineering, which can partly be described by the &#8220;perpetual beta&#8221; principle. I am very exited about this change and how it will be practised at OCLC|PICA. Luckily I don&#8217;t have to think about the business model and legal part which is not trivial: everyone wants to use services for free, but services need work to get established and maintained, so how do we best distribute the costs among libraries?</p>
<p>That&#8217;s all for the introduction, we will get into more concrete services later.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2007/11/28/oclc-grid-services-first-insights/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Second day at MTSR</title>
		<link>http://jakoblog.de/2007/10/18/second-day-at-mtsr/</link>
		<comments>http://jakoblog.de/2007/10/18/second-day-at-mtsr/#comments</comments>
		<pubDate>Thu, 18 Oct 2007 16:46:04 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[DCAP]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[Dublin Core]]></category>
		<category><![CDATA[MTSR07]]></category>
		<category><![CDATA[MTSR2007]]></category>
		<category><![CDATA[OAI]]></category>
		<category><![CDATA[ontology]]></category>
		<category><![CDATA[Overlay Journal]]></category>
		<category><![CDATA[RIOJA]]></category>
		<category><![CDATA[Science 2.0]]></category>
		<category><![CDATA[SOA]]></category>
		<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://jakoblog.de/2007/10/18/second-day-at-mtsr/</guid>
		<description><![CDATA[It is already a week ago (conference blogging should be published immediately) so I better summarize my final notes of the MTSR conference 2007: The second day started with the keynote speech Using semantics to enhance b2b integration by Jorge Cardoso. He reported from a survey about creation and usage of Ontologies &#8211; I hope [...]]]></description>
			<content:encoded><![CDATA[<p>It is already a week ago (conference blogging should be published immediately) so I better summarize my final notes of the <a href="http://www.mtsr.ionio.gr/">MTSR conference 2007</a>: <span id="more-190"></span> The second day started with the keynote speech <i>Using semantics to enhance b2b integration</i> by Jorge Cardoso. He reported from a survey about creation and usage of Ontologies &#8211; I hope he soon publishes the slides or a paper because there were too many details to remember. The second part of his speech was about Service Oriented Architecture (SOA) and Webservices &#8211; all highly relevant but less new to me.</p>
<p>After coffee break Haibo Jia presented <i>A new Formal Concept Analysis based learning approach to Ontology building</i>. He extracted 2201 different author keywords and 107 different classification terms from 900 documents of the <a href="http://portal.acm.org/dl.cfm">ACM</a>, added stemming and used the result to create an ontology for query expansion. I hope to follow his work with extraction and analyzing author keywords and classification terms too.</p>
<p>I could not follow the following presentation <i>Formalizing Dublin Core Application Profiles Description Set Profiles and Graph Constraints</i> by Fredrik Enoksson although I would have liked to get to now Alistair&#8217;s work. <a href="<br />
http://www.typepad.com/t/trackback/1073940/16536526">Dublin Core Application Profiles</a> will surely become more important in the future but this presentation did not help me a lot &#8211; maybe I was just inattentive because of my own presentation.</p>
<p>So I switched to another session and saw one of my highlights: Panayiota Polydoratou  presented the project <a href="http://www.ucl.ac.uk/ls/rioja/">Repository Interface for Overlaid Journal Archives</a> (RIOJA). To prepare the establishement of an &#8220;Overlay Journal&#8221; a detailed survey was held among researchers in the astrophysics and cosmology community. The overlay journal concept was already <a href="http://xxx.lanl.gov/blurb/pg96unesco.html">coined by Ginsparg in 1996</a>: a peer-reviewed journal is organized on top of a repository like <a href="http://arXiv.org">arXiv.org</a>. The survery showed that most scientists are not interested in a printed journal but in quality control and copyediting. Everything else can already be done with the repository (which the asked scientists already use a lot). </p>
<p>By the way: RIOJA is not the only project to prepare and create an <a href="http://openaccess.eprints.org/index.php?/archives/171-Research-Journals-Are-Already-Just-Quality-Controllers-and-Certifiers-So-What-Are-Overlay-Journals.html">overlay journal</a>. I found the JISC founded <a href="http://www.jisc.ac.uk/whatwedo/programmes/programme_rep_pres/repositories_sue/ojims.aspx">Overlay journal infrastructure for Meteorological Sciences</a> (OJIMS) and the <a href="http://www.lmcs-online.org">Logical Methods in Computer Science</a> (LCMS) which exists <a href="http://whigmaleerie.spaces.live.com/blog/cns!C6149B019D236BF5!341.entry">since 2005</a>. <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=503">Peter Murray pointed</a> out that the blog <a href="http://www.totallysynthetic.com/blog/">TotallySynthetic.com</a> and the digital library <a href="http://wwmm.ch.cam.ac.uk/crystaleye/">Crystaleye</a> for instance can be called overlay journals. Looks like the concept of a journal is highly evolving (Wake up librarians! When will you finally start to collect weblogs? The Times They Are A-Changin&#8217;!). Ok, back to the conference, I will later think more about Overlay Journals, OpenAccess, Science 2.0, OAI-ORE&#8230;</p>
<p>In the next session I saw Fredrik Enoksson with <i>An RDF Modification Protocol, based on the Needs of Editing Tools</i>. He explainded that usually RDF is stored in triple stores and there is a need for a remotely editing RDF. In the <a href="http://www.luisa-project.eu">luisa project</a> a remote editing protocol for RDF is developed and the existing SPARQL update language needs to be extended to fullfill the needs of editing applications.</p>
<p>Beside the low number of participants my own presentation went very well &#8211; I got two good questions that helped me to think more about an SKOS encoding of country codes. The slides are <a href="http://www.slideshare.net/nichtich/encoding-changing-country-codes-in-rdf-with-iso-3166-and-skos/">at slideshare</a> and a paper will follow.</p>
<p>Finally there were also some posters &#8211; but no real poster session to find the corresponding authors of a particular poster. A poster titled <i>Combining Collaborative Tagging and Ontologies in Image Retrieval Systems</i> attracted my attention because of the topic, but it looked very drafted and the explanation contained several gaps so I cannot say whether it contained some valuable research or not. I think the basic idea was to enrich a query with additional synonyms and query a combined index of automatically generated index terms, controlled terms assigned by the creator, and uncontrolled tags.</p>
<p>Video recordings of the presentations will be published in the next weeks and post-proceedings will be published in springer&#8217;s <a href="http://www.springer.com/dal/home/computer/lncs?SGWID=1-164-6-73659-0">Lecture Notes in Computer Science</a> (LNCS) series.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2007/10/18/second-day-at-mtsr/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digital library zeno.org launched</title>
		<link>http://jakoblog.de/2007/09/30/digital-library-zenoorg-launched/</link>
		<comments>http://jakoblog.de/2007/09/30/digital-library-zenoorg-launched/#comments</comments>
		<pubDate>Sun, 30 Sep 2007 19:25:11 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[de]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[wikisource]]></category>
		<category><![CDATA[zeno.org]]></category>
		<category><![CDATA[ZVDD]]></category>

		<guid isPermaLink="false">http://jakoblog.de/2007/09/30/digital-library-zenoorg-launched/</guid>
		<description><![CDATA[Around 10 years after Directmedia Publishing started to sell electronic text editions on CD, its sister company Zenodot Verlagsgesellschaft mbH launched the digital library zeno.org. Yesterday I joined the zeno.org party and I can tell you that the guys know both: how to celebrate, and to keep up with the time and the future of [...]]]></description>
			<content:encoded><![CDATA[<p>Around 10 years after <a href="http://en.wikipedia.org/wiki/Directmedia_Publishing">Directmedia Publishing</a> started to sell electronic text editions on CD, its sister company <a href="http://www.zenodot.de/">Zenodot Verlagsgesellschaft mbH</a> launched the digital library <a href="http://www.zeno.org">zeno.org</a>. Yesterday I joined <a href="http://blog.zeno.org/?p=65">the zeno.org party</a> and I can tell you that the guys know both: how to celebrate, and to keep up with the time and the future of digital libraries!</p>
<p><a href="http://www.heise.de/newsticker/meldung/96739">According to heise</a> Zeno.org probably contains the largest free German fulltext collection on the Web with around 600 Million words. The German <a href="http://gutenberg.spiegel.de/">Projekt Gutenberg</a> (not to confuse with the English project!) is not much smaller, but its quality much is less (missing sources and page numbers) and the publisher Hille &#038; Partner, tries to put copyright restrictions on it. Zeno.org knows better and explicetely states if a work is in the public domain. However full downloads of large parts of the library are not allowed &#8211; so zeno.org is more free then Projekt Gutenberg (DE) and <a href="http://en.wikipedia.org/wiki/Google_Book_Search">Google Book Search</a> but less free then <a href="http://de.wikisource.org/">Wikisource</a> and <a href="http://www.gutenberg.org/">Project Gutenberg</a>. I wonder how it would fit into the <a href="http://www.opencontentalliance.org/">Open Content Alliance</a>.</p>
<p>Somehow between in the jungle of German digitized material and digitization projects there is the <a href="http://www.digitalisiertedrucke.de/">Central Index of Digitized Imprints</a> (ZVDD), a portal that was funded by the Deutsche Forschungsgemeinschaft (DFG) according to the principle &#8220;Too many cooks spoil the broth&#8221;: A <a href="http://www.zvdd.de/sammlungen.html">large number of texts</a> were digitized in Germany with much money and little use because no simple, consistent infrastructure was provided to distribute and archive the content. It could be so so simple with accessible metadata (OAI-PMH) and persistent identifiers&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2007/09/30/digital-library-zenoorg-launched/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

