<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jakoblog &#187; en</title>
	<atom:link href="http://jakoblog.de/category/en/feed/" rel="self" type="application/rss+xml" />
	<link>http://jakoblog.de</link>
	<description>Das Weblog von Jakob Voß</description>
	<lastBuildDate>Tue, 24 Jan 2012 11:02:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.4</generator>
		<item>
		<title>Embedded diagrams and pandoc</title>
		<link>http://jakoblog.de/2012/01/24/embedded-diagrams-and-pandoc/</link>
		<comments>http://jakoblog.de/2012/01/24/embedded-diagrams-and-pandoc/#comments</comments>
		<pubDate>Tue, 24 Jan 2012 11:02:56 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[pandoc]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1226</guid>
		<description><![CDATA[If you don&#8217;t know John MacFarlane&#8217;s Pandoc, the &#8220;Swiss army knife of document formats&#8221;, you should definitely give it a try! Pandoc&#8217;s abstract document model and its serialization in an extended variant of Markdown markup let you focus on the structure and content of a text instead of dealing with formats and user interfaces. In [...]]]></description>
			<content:encoded><![CDATA[<p>If you don&#8217;t know John MacFarlane&#8217;s <a href="http://johnmacfarlane.net/pandoc/">Pandoc</a>, the &#8220;Swiss army knife of document formats&#8221;, you should definitely give it a try! Pandoc&#8217;s abstract document model and its serialization in an extended variant of <a href="http://en.wikipedia.org/wiki/Markdown">Markdown markup</a> let you focus on the structure and content of a text instead of dealing with formats and user interfaces. In my opinion pandoc is the best tool for document creation invented since (La)TeX (moreover, pandoc is a good argument to finally learn programming in Haskell). Images in pandoc markdown documents, however, are only referenced by file name. This requires some preprocessing if you want to create different files for different document formats, especially bitmap images and vector images. So I hacked <a href="https://github.com/nichtich/ditaa-markdown/blob/master/mddia">a little preprocessing script</a> that lets you embed images in pandoc&#8217;s markup language. For instance you write</p>
<blockquote><p>
~~~~ {.dot .Grankdir:LR}<br />
digraph {<br />
A -> B -> C;<br />
A -> C;<br />
}<br />
~~~~
</p></blockquote>
<p>and you get</p>
<p><img src="http://jakoblog.de/wp-content/uploads/2012/01/image-2.png" alt="" title="GraphViz diagram" width="323" height="89" class="aligncenter size-full wp-image-1227" /></p>
<p>or based on <a href="http://search.cpan.org/perldoc?rdfdot">rdfdot</a> you write</p>
<blockquote><p>
~~~~ {.rdfdot}<br />
@prefix foaf: &lt;http://xmlns.com/foaf/0.1/&gt; .<br />
@base &lt;http://example.com/&gt; .<br />
&lt;alice&gt; foaf:name "Alice" ;<br />
foaf:knows [ foaf:name "Bob" ] .<br />
~~~~
</p></blockquote>
<p>and you get</p>
<p><img src="http://jakoblog.de/wp-content/uploads/2012/01/image-3.png" alt="" title="RDF/Turtle depicted by rdfdot" width="531" height="121" class="aligncenter size-full wp-image-1228" /></p>
<p>A detailed description is included <a href="https://github.com/nichtich/ditaa-markdown/blob/master/README.md">in the manual</a> which has been transformed automatically <a href="http://nichtich.github.com/ditaa-markdown/README.html">to HTML</a> and <a href="http://nichtich.github.com/ditaa-markdown/README.pdf">to PDF</a>. Compare both documents to see that HTML includes PNG images and PDF contains vector images!</p>
<p>Feel free to reuse and modify the script, for instance by adding more diagram types! For instance how about <a href="http://en.wikipedia.org/wiki/ASCII_tab">ASCII tabs</a> and <a href="http://en.wikipedia.org/wiki/ABC_notation">ABC notation</a> if you write about music?</p>
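<p>To make the preprocessing idea concrete, here is a minimal sketch in Python. It is not the mddia script itself: the function name <code>embed_diagrams</code>, the file naming scheme and the simplified attribute matching are assumptions made for illustration. It only finds fenced diagram blocks, replaces each one with an image reference, and returns the diagram sources so that a renderer such as GraphViz could be called on them afterwards.</p>

```python
import re

# Matches a pandoc-style fenced block such as:
#   ~~~~ {.dot}
#   digraph { A; }
#   ~~~~
# Group 1 is the first class name (the diagram type), group 2 the diagram source.
# Attribute handling is deliberately simplified for this sketch.
FENCE = re.compile(
    r"^~~~~ *\{\.([a-z]+)[^}]*\}\n(.*?)\n~~~~ *$",
    re.MULTILINE | re.DOTALL,
)


def embed_diagrams(markdown, extension="png"):
    """Replace each diagram block by an image reference.

    Returns the rewritten text and a list of (kind, filename, source)
    jobs that still have to be rendered by an external tool.
    """
    jobs = []

    def replace(match):
        kind, source = match.group(1), match.group(2)
        # Hypothetical naming scheme: image-1.png, image-2.png, ...
        filename = "image-{}.{}".format(len(jobs) + 1, extension)
        jobs.append((kind, filename, source))
        return "![]({})".format(filename)

    return FENCE.sub(replace, markdown), jobs
```

<p>Calling the function with <code>extension="pdf"</code> instead of the default would produce references to vector images for PDF output, which matches the PNG/PDF split described above.</p>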
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2012/01/24/embedded-diagrams-and-pandoc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Linked local library data simplified</title>
		<link>http://jakoblog.de/2012/01/10/linked-local-library-data-simplified/</link>
		<comments>http://jakoblog.de/2012/01/10/linked-local-library-data-simplified/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 12:53:39 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[Library]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[lld]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1214</guid>
		<description><![CDATA[A few days ago Lukas Koster wrote an article about local library linked data. He argues that bibliographic data published by libraries as linked data is not &#8220;the most interesting type of data that libraries can provide&#8221;. Instead &#8220;library data that is really unique and interesting is administrative information about holdings and circulation&#8221;. So libraries [...]]]></description>
			<content:encoded><![CDATA[<p>A few days ago Lukas Koster wrote an article about <a href="http://commonplace.net/2012/01/local-library-data-in-the-new-global-framework/">local library linked data</a>. He argues that bibliographic data published by libraries as linked data is not &#8220;the most interesting type of data that libraries can provide&#8221;. Instead &#8220;library data that is really unique and interesting is administrative information about holdings and circulation&#8221;. So libraries &#8220;should focus on holdings and circulation data, and for the rest link to available bibliographic metadata as much as possible.&#8221; I fully agree with these statements, but not with the exact method of publishing local library data.</p>
<p>Among other projects, Koster points to <a href="http://www.librarycloud.org/">LibraryCloud</a> to aggregate and deliver library metadata, but it looks like they reinvent yet more wheels in the form of their own APIs and formats <a href="http://librarylab.law.harvard.edu/librarycloud_wiki/index.php/LibraryCloud_API_Documentation">for search</a> and <a href="http://librarylab.law.harvard.edu/librarycloud_wiki/index.php/Item_data">for bibliographic description</a>. Maybe I am wrong about this project, as they just started to collect <a href="http://www.librarycloud.org/deposit">holding and circulation data</a>. </p>
<p>At the recent <a href="http://swib.org/">Semantic Web in Bibliotheken</a> conference, Magnus Pfeffer gave a presentation about &#8220;Publishing and consuming library loan information as linked open data&#8221; (<a href="http://swib.org/swib11/vortraege/swib11-magnus-pfeffer.pdf">see slides</a>) and I talked about a <a href="http://jakoblog.de/2011/12/02/can-sobr-help-publishing-library-holdings/">Simplified Ontology for Bibliographic Resources</a> (SOBR) which is mainly based on the <a href="http://jakoblog.de/2012/01/06/request-for-comments-final-specification-of-daia/">DAIA</a> data model. We are going to align both data models and I hope that the next libraries will first look at these existing solutions instead of inventing yet another data format or ontology. Koster&#8217;s proposal, however, looks like yet another such solution: he argues that &#8220;we need an extra explicit level to link physical Items owned by the library or online subscriptions of the library to the appropriate shared network level&#8221; and suggests introducing a &#8220;holding&#8221; level. So there would be <em>five</em> levels of description:</p>
<ul>
<li>Work</li>
<li>Expression</li>
<li>Manifestation</li>
<li>Holding</li>
<li>Item</li>
</ul>
<p>Apart from the fact that at least one of Work, Expression, Manifestation is dispensable, I disagree with a Holding level above the Item. My current model consists of at most three levels of documents:</p>
<ul>
<li>document as abstract work (frbr:Work, schema:CreativeWork&#8230;)</li>
<li>bibliographic document (frbr:Manifestation, sobr:Edition&#8230;)</li>
<li>item as concrete single copy (frbr:Item&#8230;)</li>
</ul>
<p>The term &#8220;level&#8221; is misleading because these classes are not disjoint. I depicted their relationship in a simple Venn diagram:</p>
<p><img src="http://jakoblog.de/wp-content/uploads/2012/01/sobrmodel.png" alt="" title="SOBR data model" width="515" height="419" class="aligncenter size-full wp-image-1216" /></p>
<p>For local library data, we are interested in single items, which are copies of general documents or editions. Where do Koster&#8217;s &#8220;holding&#8221; entities fit into this model? He writes &#8220;a specific Holding in this way would indicate that a specific library has one or more copies (Items) of a specific edition of a work (Manifestation), or offers access to an online digital article by way of a subscription.&#8221; The core concepts as I read them are:</p>
<ul>
<li>&#8220;one or more copies (Items)&#8221; = frbr:Item</li>
<li>&#8220;specific edition of a work (Manifestation)&#8221; = sobr:Edition or frbr:Manifestation</li>
<li>&#8220;has one [...] or offer access to&#8221; = ???</li>
</ul>
<p>Instead of creating another entity for holdings, you can express the ability &#8220;to have one or offer access to&#8221; by DAIA Services. The class <a href="http://purl.org/ontology/daia/Service">daia:Service</a> can be used for an unspecified service and more specific subclasses such as loan, presentation, and openaccess can be used if more is known. Here is a real example with all &#8220;levels&#8221;:</p>
<pre><code>
&lt;http://dbpedia.org/resource/Tractatus_Logico-Philosophicus&gt;
    a bibo:Book ;
    daia:edition &lt;urn:nbn:de:eki:GBV128382600&gt; ;
    daia:exemplar
        &lt;http://uri.gbv.de/document/opac-de-23:epn:266449999&gt; .

&lt;urn:nbn:de:eki:GBV128382600&gt; a bibo:Book ;
    daia:exemplar
        &lt;http://uri.gbv.de/document/opac-de-23:epn:266449999&gt; .

&lt;http://uri.gbv.de/document/opac-de-23:epn:266449999&gt;
    a bibo:Book, daia:Item ;
    daia:heldBy &lt;http://uri.gbv.de/organization/isil/DE-23&gt; ;
        daia:availableFor [
            a daia:Service ;
            daia:providedBy &lt;http://uri.gbv.de/organization/isil/DE-23&gt;
        ] .
</code></pre>
<p>I have only made up the RDF property daia:edition from the SOBR proposal because FRBR relations are too strict. If you know a better relation to directly relate an abstract work to a concrete edition, please let me know.</p>
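<p>As a rough illustration of how a client could use this structure, the following Python sketch stores the example above as plain tuples and looks up who holds a copy of a given document. The helper names <code>objects</code> and <code>holders</code> are made up for this sketch, the blank node for the service is left out, and a real application would rather use an RDF library.</p>

```python
# URIs copied from the Turtle example above.
WORK = "http://dbpedia.org/resource/Tractatus_Logico-Philosophicus"
EDITION = "urn:nbn:de:eki:GBV128382600"
ITEM = "http://uri.gbv.de/document/opac-de-23:epn:266449999"
LIBRARY = "http://uri.gbv.de/organization/isil/DE-23"

# The example as (subject, predicate, object) tuples, service node omitted.
TRIPLES = [
    (WORK, "daia:edition", EDITION),
    (WORK, "daia:exemplar", ITEM),
    (EDITION, "daia:exemplar", ITEM),
    (ITEM, "daia:heldBy", LIBRARY),
]


def objects(subject, predicate):
    """Return all objects of triples with the given subject and predicate."""
    return [o for s, p, o in TRIPLES if s == subject and p == predicate]


def holders(document):
    """Return the organizations that hold an exemplar of the document."""
    found = []
    for item in objects(document, "daia:exemplar"):
        for organization in objects(item, "daia:heldBy"):
            if organization not in found:
                found.append(organization)
    return found
```

<p>Both the abstract work and the edition lead to the same item and library here, without any separate holding entity in between.</p>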
<p><center><br />
<a href="http://jakoblog.de/wp-content/uploads/2012/01/linked-local-library-example.png"><img src="http://jakoblog.de/wp-content/uploads/2012/01/linked-local-library-example.png" alt="" title="linked-local-library-example" width="500" height="50" class="aligncenter size-medium wp-image-1220" /></a><br /><small>image created with <a href="http://search.cpan.org/perldoc?rdfdot">rdfdot</a></small><br />
</center></p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2012/01/10/linked-local-library-data-simplified/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Request for comments: final specification of DAIA</title>
		<link>http://jakoblog.de/2012/01/06/request-for-comments-final-specification-of-daia/</link>
		<comments>http://jakoblog.de/2012/01/06/request-for-comments-final-specification-of-daia/#comments</comments>
		<pubDate>Fri, 06 Jan 2012 10:13:49 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[code4lib]]></category>
		<category><![CDATA[DAIA]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1204</guid>
		<description><![CDATA[When I started to create an API for availability lookup of documents in libraries in 2008, I was surprised that such a basic service was so poorly defined. The best I could find was the just-published recommendation of the Digital Library Federation (DLF-ILS). Even there availability status was basically a plain text message (section 6.3.1 [...]]]></description>
			<content:encoded><![CDATA[<p>When I started to create an API for availability lookup of documents in libraries in 2008, I was surprised that such a basic service was so poorly defined. The best I could find was the just-published <a href="http://old.diglib.org/architectures/ilsdi/">recommendation of the Digital Library Federation</a> (DLF-ILS). Even there, availability status was basically a plain text message (section 6.3.1 and appendix 4 and 5). Other parts of the DLF-ILS GetAvailability response were more helpful, so they are all part of the <a href="http://purl.org/NET/DAIA">Document Availability Information API</a> (DAIA). Here is a simple mapping from DLF-ILS to DAIA:</p>
<ul>
<li>bibliographicIdentifer (string) → document (URI)</li>
<li>itemIdentifier (string) → item (URI)</li>
<li>dateAvailable (dateTime) → expected (xs:dateTime or  xs:date or &#8220;unknown&#8221;) or delay (xs:duration or &#8220;unknown&#8221;)</li>
<li>location (string) → storage (URI and/or string, plus optional URL)</li>
<li>call number (string) → label (string)</li>
<li>holdQueueLength (int) → queue (xs:nonNegativeInteger)</li>
<li>status (string) and circulating (boolean) → available/unavailable (with service type and additional information)</li>
</ul>
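<p>The mapping can be sketched as a small conversion function. This is only an illustration in Python: the input record is a hypothetical dictionary keyed by the DLF-ILS field names as spelled in the list above, the output is a flat dictionary rather than real DAIA/JSON, and treating <code>circulating</code> as availability of the loan service is an assumption.</p>

```python
# Simple renamings taken from the list above (keys spelled as in the list).
DLF_TO_DAIA = {
    "bibliographicIdentifer": "document",
    "itemIdentifier": "item",
    "location": "storage",
    "call number": "label",
    "holdQueueLength": "queue",
}


def to_daia(record):
    """Rename the simple fields and derive a coarse availability entry."""
    result = {}
    for old, new in DLF_TO_DAIA.items():
        if old in record:
            result[new] = record[old]
    # dateAvailable becomes "expected"; DAIA also allows the value "unknown".
    if "dateAvailable" in record:
        result["expected"] = record["dateAvailable"] or "unknown"
    # Assumption: a circulating item is available for the loan service.
    if "circulating" in record:
        key = "available" if record["circulating"] else "unavailable"
        result[key] = [{"service": "loan"}]
    return result
```

<p>The free-text <code>status</code> field is left out on purpose, because mapping a plain text message to a service type cannot be done reliably without knowing the local conventions.</p>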
<p>So you could say that DAIA implements the abstract GetAvailability function from DLF-ILS. I like abstract, language-independent specifications, but they must be precise and testable (see Meek&#8217;s forgotten paper <a href="http://dx.doi.org/10.1109/SESS.1995.525971">The seven golden rules for producing language-independent standards</a>). DAIA is more than an implementation: it provides both an abstract standard and bindings to several data languages (XML, JSON, and RDF). The conceptual DAIA data model defines some basic concepts and relationships (document, items, organisations, locations, services, availabilities, limitations&#8230;) <em>independent</em> of whether they are expressed in XML elements, attributes, RDF properties, classes, or any other <a href="http://aboutdata.org/">data structuring method</a>. The only reference to specific formats is the requirement that <em>all unique identifiers must be URIs</em>. Right now there is an <a href="http://purl.org/NET/DAIA/schema.xsd">XML Schema</a> if you want to express DAIA in XML and <a href="http://purl.org/ontology/daia/">an OWL ontology</a> for RDF.</p>
<p>In its fourth year of development (see my previous posts <a href="http://jakoblog.de/2009/10/23/how-to-encode-the-availability-of-documents/">from 2009</a>) DAIA seems to have enough momentum to finally get accepted in practice. We use it in the GBV library union (public server at <a href="http://daia.gbv.de/">http://daia.gbv.de/</a>), there are independent implementations such as in <a href="http://doctor-doc.com">Doctor-Doc</a>, there is client support in VuFind and I heard rumors that DAIA capabilities will be built into EBSCO and Summon Discovery Services. Native support in Integrated Library Systems, however, is still lacking &#8211; I have already given up hope and prefer a clean DAIA wrapper over a broken DAIA implementation anyway. If you are interested in creating your own DAIA server/wrapper or client, have a look at my reference implementation <a href="http://search.cpan.org/perldoc?DAIA">DAIA</a> and <a href="http://search.cpan.org/perldoc?Plack::App::DAIA">Plack::App::DAIA</a> at CPAN and Oliver Goldschmidt&#8217;s PHP implementation in <a href="https://github.com/gbv/daia">our common github repository</a>. A conceptual overview as tree (DAIA/JSON, DAIA/XML) and as graph (DAIA/RDF) <a href="http://www.gbv.de/wikis/cls/Document_Availability_Information_API_(DAIA)#Overview">can be found here</a>.</p>
<p>Still there are some details to be defined and I&#8217;d like to solve these issues to come to a version DAIA 1.0. These are</p>
<ul>
<li>How to deal with <strong>partial publications</strong> (you requested an article but only get the full book or you requested a series but only get a single volume).</li>
<li>How to deal with <strong>digital publications</strong> (especially its possible service types: is &#8220;download&#8221; a service distinct from &#8220;loan&#8221; or is &#8220;presentation&#8221; similar to online access restricted to the library&#8217;s intranet?).</li>
<li>Final agreement on <strong>service types</strong> (now there are <strong>presentation</strong>: item can be used <em>in</em> the institution, <strong>loan</strong>: item can be used outside of the institution for a limited time, <strong>interloan</strong>: item can be sent to another institution, <strong>openaccess</strong>: item can be accessed without restriction, just get a free copy). Some <a href="http://wiki.code4lib.org/index.php/DAIA_extensions">extensions</a> have been proposed.</li>
<li>A set of common <strong>limitation types</strong> (for instance IP-based access restriction, permission-based access etc.).</li>
</ul>
<p>I&#8217;d be happy to get some more feedback on these issues, especially concrete use cases. We are already discussing them on the <a href="http://sourceforge.net/mailarchive/forum.php?forum_name=daia-devel">daia-devel mailing list</a> but you can also comment in your own blog, at <a href="http://lists.w3.org/Archives/Public/public-lld/">public-lld</a>, <a href="http://dewey.library.nd.edu/mailing-lists/code4lib/">code4lib</a>, <a href="https://groups.google.com/forum/#!forum/ils-di">ils-di</a> etc.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2012/01/06/request-for-comments-final-specification-of-daia/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Can SOBR help publishing library holdings?</title>
		<link>http://jakoblog.de/2011/12/02/can-sobr-help-publishing-library-holdings/</link>
		<comments>http://jakoblog.de/2011/12/02/can-sobr-help-publishing-library-holdings/#comments</comments>
		<pubDate>Thu, 01 Dec 2011 23:08:32 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[DAIA]]></category>
		<category><![CDATA[FRBR]]></category>
		<category><![CDATA[lod]]></category>
		<category><![CDATA[sobr]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1176</guid>
		<description><![CDATA[I just participated in the German conference Semantic Web in Bibliotheken which took place in Hamburg this week. This year there were two slots for lightning talks, but unfortunately participants did not catch on, so we only had four of them. Lightning talks are a good chance to present something unfinished that you need input [...]]]></description>
			<content:encoded><![CDATA[<p>I just participated in the German conference <a href="http://swib.org/">Semantic Web in Bibliotheken</a> which took place in Hamburg this week. This year there were two slots for lightning talks, but unfortunately participants did not catch on, so we only had four of them. Lightning talks are a good chance to present something unfinished that you need input for, <a href="http://www.slideshare.net/nichtich/voss-111129-sobr">so I presented</a> the Simplified Ontology for Bibliographic Resources (SOBR) as &#8220;FRBR light&#8221;. You can find the current  specification of SOBR <a href="https://gist.github.com/1331983">at github</a>, which means the specification is still evolving and I&#8217;d like to get more feedback.</p>
<p>SOBR was prompted by a discussion on the <a href="http://lists.w3.org/Archives/Public/public-lld/">Library Linked Data mailing list</a> about the (disputed) disjointedness of FRBR classes. SOBR has a history in the <a href="http://purl.org/NET/DAIA">Document Availability Information API</a> (DAIA), which SOBR might be merged into. The use case of both is publishing information about holdings from library catalogs as Linked Open Data. The information most requested is probably connected to holdings: library users only ask &#8220;where is it?&#8221; and &#8220;how can I get it?&#8221;. In these questions, the little word &#8220;it&#8221; refers to a specific publication. In the answers, however, &#8220;it&#8221; usually refers to some holding or copy of this publication. Sometimes the holding contains more than the publication (for instance if you ask for an article in a book) and sometimes you get multiple holdings (for instance if you ask for a large work that is split in multiple volumes). Sometimes there are multiple holdings with different content to choose from, because there are different editions, forms, translations etc. of the requested publication.</p>
<p>A long time ago, some librarians thought about similar questions and answers and came up with the Functional Requirements for Bibliographic Records (FRBR). I tried hard to accept FRBR (I even drew those ugly diagrams that people find when they look up <a href="http://en.wikipedia.org/wiki/Functional_Requirements_for_Bibliographic_Records">FRBR in Wikipedia</a>). But FRBR does not help me to publish existing library catalogs as Linked Open Data. In <a href="http://uri.gbv.de/database/">our catalog databases</a> we have <b>records that refer to editions</b>, connected with <b>records that refer to holdings</b> (I&#8217;ll ignore the little exceptions and nasty special cases such as multiple holdings described by one look-like-a-holding-record). In addition there are some records that refer to series, works, and other types of abstract <b>documents without direct holdings</b>, which are connected to records that refer to editions.</p>
<p>Maybe we can simplify this to two entities: general documents (<a href="http://purl.org/ontology/bibo/Document">bibo:Document</a>) and items (with <a href="http://purl.org/vocab/frbr/core#Item">frbr:Item</a>) as a special kind of document. The current design of SOBR also contains a class for editions, but I am not sure whether this class is also needed. At least we need <b>three properties</b> to relate documents to items (<a href="http://purl.org/ontology/daia/exemplar">daia:exemplar</a>), to relate documents to editions (daia:edition?) and to relate documents to their parts (dcterms:hasPart). To avoid the need of blank nodes, I&#8217;d also define properties that relate documents to partial items (<a href="http://purl.org/ontology/daia/exemplar">daia:extract</a> = dcterms:hasPart + daia:exemplar) and to relate documents to partial editions (daia:editionPart?).</p>
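<p>The shortcut properties can be read as compositions of two simpler properties. The following Python sketch derives such shortcut triples from plain tuples. Reading the &#8220;+&#8221; as a chain in the written order (first dcterms:hasPart, then daia:exemplar) is an assumption, and the function name <code>compose</code> as well as the sample identifiers in the usage note are made up for illustration.</p>

```python
def compose(triples, first, second, derived):
    """Derive (s, derived, o) wherever s --first-- x and x --second-- o.

    triples is a list of (subject, predicate, object) tuples. The
    intermediate node x may be a blank node, which the derived
    shortcut property makes unnecessary for consumers of the data.
    """
    new = []
    for s, p, x in triples:
        if p != first:
            continue
        for s2, p2, o in triples:
            if s2 == x and p2 == second and (s, derived, o) not in new:
                new.append((s, derived, o))
    return new
```

<p>For example, given the tuples <code>("doc", "dcterms:hasPart", "_:part")</code> and <code>("_:part", "daia:exemplar", "item1")</code>, calling <code>compose(triples, "dcterms:hasPart", "daia:exemplar", "daia:extract")</code> yields a single triple that relates <code>doc</code> directly to <code>item1</code>.</p>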
<p>Feedback on SOBR is welcome, especially if you provide examples with existing URIs (or at least local identifiers to already existing data) instead of theoretical FRBR-like-made-up examples. The best way to find a good ontology for publishing library holdings is to actually publish data that describes library holdings! The following image is based on an example that connects a work <a href="http://www.librarything.com/work/70394">from LibraryThing</a> and <a href="http://dbpedia.org/resource/Living_My_Life">from DBPedia</a> with a partial edition <a href="http://www.worldcat.org/oclc/656754414">from Worldcat</a>, a full edition <a href="http://d-nb.info/1001703464">from German National Library</a>, and a holding <a href="http://uri.gbv.de/document/opac-de-18:epn:1220640794">from Hamburg University</a>:</p>
<pre>@prefix bibo: &lt;http://purl.org/ontology/bibo/&gt; .
@prefix daia: &lt;http://purl.org/ontology/daia/&gt; .
@prefix frbr: &lt;http://purl.org/vocab/frbr/core#&gt; .
@prefix owl: &lt;http://www.w3.org/2002/07/owl#&gt; .
@prefix dct: &lt;http://purl.org/dc/terms/&gt; .
@prefix rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt; .

&lt;http://www.librarything.com/work/70394&gt; a bibo:Document ;
  owl:sameAs &lt;http://dbpedia.org/resource/Living_My_Life&gt; ;
  daia:edition &lt;http://d-nb.info/1001703464&gt; , [
    a bibo:Collection # , daia:Document
    ; dct:hasPart &lt;http://www.worldcat.org/oclc/656754414&gt;
  ]
    ;
  # daia:exemplar &lt;http://d-nb.info/1001703464&gt; ; ?
  daia:editionPart &lt;http://www.worldcat.org/oclc/656754414&gt; .

&lt;http://d-nb.info/1001703464&gt; a frbr:Item , bibo:Document ;
   daia:exemplar &lt;http://uri.gbv.de/document/opac-de-18:epn:1220640794&gt; .
</pre>
<p><center><br />
<a href="http://jakoblog.de/wp-content/uploads/2011/12/sobr-example.png"><img src="http://jakoblog.de/wp-content/uploads/2011/12/sobr-example-1024x258.png" alt="" title="sobr-example" width="512" height="129" class="aligncenter size-large wp-image-1181" /></a><br />
<small>image created with <a href="http://search.cpan.org/perldoc?rdfdot">rdfdot</a></small><br />
</center></p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/12/02/can-sobr-help-publishing-library-holdings/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>URI namespace lookup with prefix.cc and RDF::NS</title>
		<link>http://jakoblog.de/2011/11/03/uri-namespace-lookup-with-prefixcc-and-rdf-ns/</link>
		<comments>http://jakoblog.de/2011/11/03/uri-namespace-lookup-with-prefixcc-and-rdf-ns/#comments</comments>
		<pubDate>Thu, 03 Nov 2011 15:13:22 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[CPAN]]></category>
		<category><![CDATA[namespace]]></category>
		<category><![CDATA[Perl]]></category>
		<category><![CDATA[prefix]]></category>
		<category><![CDATA[URI]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1152</guid>
		<description><![CDATA[Probably the best feature of RDF is that it forces you to use Uniform Resource Identifiers (URI) instead of private, local identifiers which only make sense in some context. URIs are long and cumbersome to type, so popular URIs are abbreviated with namespace prefixes. For instance foaf:Person is expanded to http://xmlns.com/foaf/0.1/Person, once you have [...]]]></description>
			<content:encoded><![CDATA[<p>Probably the best feature of RDF is that it forces you to use <b>Uniform Resource Identifiers</b> (URI) instead of private, local identifiers which only make sense in some context. URIs are long and cumbersome to type, so popular URIs are abbreviated with namespace prefixes. For instance <code>foaf:Person</code> is expanded to <code><a href="http://xmlns.com/foaf/0.1/Person">http://xmlns.com/foaf/0.1/Person</a></code>, once you have defined prefix <code>foaf</code> for namespace <code>http://xmlns.com/foaf/0.1/</code>. In theory URI prefixes in RDF are arbitrary (in contrast to XML, where <a href="http://www.w3.org/TR/xml-infoset/#infoitem.element">prefixes can matter</a>, contrary to popular belief). In practice people prefer to agree on one or two known prefixes for common URI namespaces (unless they want to confuse human readers of RDF data).</p>
<p>So URI prefixes and namespaces are vital for handling RDF. However, you still have to define them in almost every file and application. In the end people have to copy &#038; paste the same prefix definitions again and again. Fortunately <a href="http://richard.cyganiak.de/blog/?s=prefix.cc">Richard Cyganiak created</a> a registry of popular URI namespaces, called <a href="http://prefix.cc">prefix.cc</a> (<a href="https://github.com/cygri/prefix.cc">it&#8217;s open source</a>), so people at least know where to copy &#038; paste from. I had enough of copying the same URI prefixes from prefix.cc over and over again, so <a href="http://search.cpan.org/dist/RDF-NS/">I created a Perl module</a> that includes snapshots of the prefix.cc database. It includes a simple command line client that is installed automatically:</p>
<pre>
  $ sudo cpanm RDF::NS
  $ rdfns rdf,foaf.ttl
  @prefix foaf: &lt;http://xmlns.com/foaf/0.1/&gt; .
  @prefix rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt; .
</pre>
<p>In your Perl code, you can use it like this:</p>
<pre>
  use RDF::NS;
  my $NS = RDF::NS->new('20111102');
  $NS->foaf_Person; # returns "http://xmlns.com/foaf/0.1/Person"
</pre>
<p>If you miss a URI prefix, just add it at http://prefix.cc, and it will be included in the next release.</p>
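<p>The lookup itself is simple enough to sketch in a few lines. The following Python snippet is not how RDF::NS is implemented; it only illustrates the idea with a hand-written two-entry table instead of a prefix.cc snapshot, and the function name <code>expand</code> is made up for this sketch.</p>

```python
# A tiny stand-in for a prefix.cc snapshot (two entries only).
PREFIXES = {
    "foaf": "http://xmlns.com/foaf/0.1/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
}


def expand(name):
    """Expand "foaf:Person" or "foaf_Person" to a full URI.

    Returns None if the name has no known prefix.
    """
    for separator in (":", "_"):
        prefix, found, local = name.partition(separator)
        if found and prefix in PREFIXES:
            return PREFIXES[prefix] + local
    return None
```

<p>Accepting the underscore form next to the colon form mirrors the method-call style shown in the Perl example above, where a colon cannot be part of a method name.</p>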
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/11/03/uri-namespace-lookup-with-prefixcc-and-rdf-ns/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TPDL 2011 Doctoral Consortium – part 3</title>
		<link>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-%e2%80%93-part-3/</link>
		<comments>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-%e2%80%93-part-3/#comments</comments>
		<pubDate>Sun, 25 Sep 2011 15:36:48 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[digital library]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[tpdl]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1134</guid>
		<description><![CDATA[See also part 1 and part 2 of conference-blogging and #TPDL2011 on twitter. My talk about general patterns in data was received well and I got some helpful input. I will write about it later. Steffen Hennicke, another PhD student of my supervisor Stefan Gradman, then talked about his work on modeling Archival Finding Aids, [...]]]></description>
			<content:encoded><![CDATA[<p><em>See also <a href="http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium/">part 1</a> and <a href="http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-part-2/">part 2</a> of conference-blogging </em> and <a href="http://twitter.com/search?q=%23tpdl2011">#TPDL2011 on twitter</a>.</p>
<p><a href="http://aboutdata.org/bibliography">My talk about general patterns in data</a> was received well and I got some helpful input. I will write about it later. <a href="http://www.ibi.hu-berlin.de/institut/mitarbA-Z/akadmitarb/hennicke">Steffen Hennicke</a>, another PhD student of my supervisor Stefan Gradman, then talked about his work on modeling Archival Finding Aids, which are possibly expressed in <a href="http://en.wikipedia.org/wiki/Encoded_Archival_Description">EAD</a>. The structure of EAD is often not suitable to answer user needs. For this reason Hennicke analyses EAD data and reference questions, to develop better structures that users can follow to find what they look for in archives. This is done in CIDOC-CRM as a high-level ontology and the main result will be an expanded EAD model in RDF. To me the problem of &#8220;semantic gaps&#8221; is interesting, and I think about using some of Hennicke&#8217;s data as an example to explain data patterns in my work.</p>
<p>The last talk by <a href="http://www-iw.uni-regensburg.de/promowiki/Profiles/RitaStrebe">Rita Strebe</a> was about the aesthetic user experience of websites. One aim of her work is to measure the significance of aesthetic perception. In particular, her hypotheses to be evaluated by experiments are:</p>
<blockquote><p>
H1: On a high level, the viscerally perceived visual aesthetics of websites effects<br />
approach behaviour.<br />
H2: On a low level, the viscerally perceived visual aesthetics of websites effects<br />
avoidance behaviour.
</p></blockquote>
<p>Methods and preliminary results look valid, but the relation to digital libraries seems weak, and the participants had little expertise in Strebe&#8217;s motivation and methods. I suppose her work fits better into <a href="http://en.wikipedia.org/wiki/Human%E2%80%93computer_interaction">Human-Computer Interaction</a>.</p>
<p>After the official part of the program <a href="http://www.peachnote.com/about.html">Vladimir Viro</a> briefly presented his music search engine <a href="http://www.peachnote.com/">peachnote.com</a>, which is based on scanned musical scores. If I were working in or with musical libraries, I would not hesitate to contact Viro! I also thought about a search for free musical scores within the Wikimedia framework. The Doctoral Consortium ended with a general discussion about dissertations, science, libraries, users, and everything, as it should be <img src='http://jakoblog.de/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-%e2%80%93-part-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TPDL 2011 Doctoral Consortium &#8211; part 2</title>
		<link>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-part-2/</link>
		<comments>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-part-2/#comments</comments>
		<pubDate>Sun, 25 Sep 2011 10:42:09 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[tpdl]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1127</guid>
		<description><![CDATA[The TPDL 2011 Doctoral Consortium, which I already blogged about in part 1, continued with 15 minutes of delay: Christopher Gibson also talked about eBooks &#8211; I wonder why his talk was not combined with Luca Colombo&#8217;s work in eBook reading experiences. Gibson&#8217;s specific topic is eBook lending services in UK public libraries. To quote [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.tpdl2011.org/">TPDL 2011</a> Doctoral Consortium, which I already <a href="http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium/">blogged about in part 1</a>, continued after a 15-minute delay: <a href="http://www.cis.strath.ac.uk/cis/staff/index.php?uid=cg">Christopher Gibson</a> also talked about eBooks &#8211; I wonder why his talk was not combined with Luca Colombo&#8217;s work on eBook reading experiences. Gibson&#8217;s specific topic is eBook lending services in UK public libraries. To quote the research questions from <a href="http://dx.doi.org/10.1007/978-3-642-24469-8_60">his paper</a>:</p>
<blockquote><p>
Q1. How have public libraries addressed ebook service provision in the UK?<br />
Q2. What challenges and opportunities exist in incorporating ebook lending into other reader services?<br />
Q3. Is it feasible to lend ebook reading devices from public libraries?<br />
Q4. How can the effectiveness of ebook lending services be measured?<br />
Q5. How do library users view the provision of ebook lending services?<br />
Q6. How can effective ebook lending services be developed?
</p></blockquote>
<p>To me an interesting aspect of his methodology was the use of targeted FOI (freedom of information) requests to gather data about eBook lending services. I cannot imagine this in Germany, where &#8220;<a href="http://de.wikipedia.org/wiki/Informationsfreiheit">Informationsfreiheit</a>&#8221; is still in its infancy. One result from another survey done by Gibson: most eBooks are not included in library catalogs. I think this failure is found in German libraries too. In summary, the PhD project looked very thorough, with real practical value for libraries. On the other hand, the theoretical contribution, for instance the question of what &#8220;lending&#8221; can mean in a digital library world, was only added in the discussion afterwards.</p>
<p>The next PhD student to present was <a href="http://twitter.com/adamsofronije">Adam Sofronijevic</a>. I am sorry that I could not fully concentrate on his talk about a <a href="http://collaborativelibrarianship.wordpress.com/2011/09/13/chapter-new-paradigm-of-library-collaboration/">New Paradigm of Library Collaboration</a>, although it seemed very interesting. My talk is next <img src='http://jakoblog.de/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TPDL 2011 Doctoral Consortium</title>
		<link>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium/</link>
		<comments>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium/#comments</comments>
		<pubDate>Sun, 25 Sep 2011 09:21:41 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[tpdl]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1120</guid>
		<description><![CDATA[Today the International Conference on Theory and Practice of Digital Libraries 2011 started with tutorials and a Doctoral Consortium that I participate with a talk. The seven talks and discussions on ongoing PhD topics were rather diverse and interesting. I tried to briefly summarize at least some of them. Luco Colombo started with his work [...]]]></description>
			<content:encoded><![CDATA[<p>Today the <a href="http://www.tpdl2011.org">International Conference on Theory and Practice of Digital Libraries 2011</a> started with tutorials and a <a href="http://www.tpdl2011.org/index.php?option=com_content&#038;view=article&#038;id=52&#038;Itemid=54">Doctoral Consortium</a> in which I participate <a href="http://www.slideshare.net/nichtich/revealing-digital-documents-concealed-structures-in-data">with a talk</a>. The seven talks and discussions on ongoing PhD topics were rather diverse and interesting. I tried to briefly summarize at least some of them.</p>
<p><a href="http://www.mendeley.com/profiles/luca-colombo/">Luca Colombo</a> started with his work on developing and evaluating the eBook reading experience for children. Reading &#8220;traditional&#8221; books has been extensively investigated &#8211; this is not true for eBooks. Children in particular are rarely involved in eBook studies. Colombo explained how the eBook reading experience is different because it directly involves searching, browsing, sharing, and recommending, among other activities. A good reading experience results in a &#8220;<a href="http://en.wikipedia.org/wiki/Flow_%28psychology%29">flow state</a>&#8221; where the reader gets positively lost in a book. Colombo&#8217;s method is a <a href="http://en.wikipedia.org/wiki/Cooperative_inquiry">cooperative inquiry</a>. It is not clear whether and by what means eBooks are more engaging to children (age 9-11 in this study) than traditional books &#8211; maybe this PhD will show. The following discussion was dominated by the participating mentors Jose Borbinha, Milena Dobreva, Stefan Gradmann and Giuseppina Vullo.</p>
<p>In the second talk <a href="http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/i/Ivanova:Krassimira.html">Krassimira Ivanova</a> presented her dissertation on (<a href="http://en.wikipedia.org/wiki/Content-based_image_retrieval">content-based</a>) image retrieval utilizing color models. Image retrieval on art images is difficult because it includes very different aspects (artistic styles, depicted objects etc.). Even aspects of color (contrasts, intensity, diversity, harmony etc.) are manifold &#8211; maybe this is why <a href="http://plato.stanford.edu/entries/color/">philosophy of color</a> has a long history. Nevertheless Ivanova developed several machine learning methods for these color aspects that can be used for image retrieval. I am not sure whether the resulting APICAS system (&#8220;Art Painting Image Colour Aesthetics and Semantics&#8221;) has been evaluated with a user study. As with the first talk, the focus could be improved by narrowing down the topic and making the specific contribution clear. Finally we had some real discussion, but little time.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/09/25/tpdl-2011-doctoral-consortium/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Modeling is difficult</title>
		<link>http://jakoblog.de/2011/09/21/modeling-is-difficult/</link>
		<comments>http://jakoblog.de/2011/09/21/modeling-is-difficult/#comments</comments>
		<pubDate>Tue, 20 Sep 2011 22:33:20 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[Data Modeling]]></category>
		<category><![CDATA[rdf]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1116</guid>
		<description><![CDATA[Yesterday Pete Johnston wrote a detailed blog article about difficulties of &#8220;the right&#8221; modeling with SKOS, and FOAF in general, and about the proposed RDF property foaf:focus in particular. As Dan Brickley wrote in a recent mail &#8220;foaf:focus describes a link from a skos:Concept to &#8216;the thing itself&#8217;. Not every SKOS concept (in a thesauri [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday Pete Johnston wrote <a href="http://efoundations.typepad.com/efoundations/2011/09/things-their-conceptualisations-skos-foaffocus-modelling-choices.html">a detailed blog article</a> about difficulties of &#8220;the right&#8221; modeling with SKOS, and FOAF in general, and about the proposed RDF property <a href="http://wiki.foaf-project.org/w/term_focus">foaf:focus</a> in particular. As Dan Brickley wrote in <a href="http://lists.foaf-project.org/pipermail/foaf-dev/2011-September/010761.html">a recent mail</a> &#8220;foaf:focus describes a link from a skos:Concept to &#8216;the thing itself&#8217;. Not every SKOS concept (in a thesauri of classification scheme) will have such a direct &#8220;thing&#8221;, but many do, especially concepts for people and places.&#8221; </p>
<p>Several statements in this discussion made me laugh and smile. Don&#8217;t get me wrong &#8211; I honor Pete, Dan, and the whole Semantic Web community, but there is a regular lack of philosophy and information science. There is no such thing as &#8216;the thing itself&#8217; and all SKOS concepts are equal. Even the distinction between an RDF &#8216;resource&#8217; and an SKOS &#8216;concept&#8217; is artificial. The problem originates not from wrong modeling, which could be solved by the right RDF properties, but from different paradigms and cultures. There will always be different ways to describe the same ideas with RDF, because neither RDF nor any other technology will ever fully capture our ideas. These technologies are not about things but only about data. As <a href="http://jakoblog.de/2010/04/28/william-kent-ein-weiterer-seelenverwandter/">William Kent</a> wrote in <em>Data and Reality</em> (1978): &#8220;The map is not the territory&#8221; (by the way, last year Chris Rusbridge <a href="http://digitalcuration.blogspot.com/2010/03/linked-data-and-reality.html">quoted Kent</a> in the context of linked data). As Erik Wilde and Robert J. Glushko <a href="http://dret.net/netdret/docs/wilde-cacm2008-xml-fever.html">wrote in a great article</a> (2008):</p>
<blockquote><p>RDF has succeeded beyond the wildest expectations as a convenient format for encoding information in an open and easily computable fashion. But it is just a format, and the difficult work of analysis and modeling information has not and will never go away.</p></blockquote>
<p>Ok, they referred not to &#8220;RDF&#8221; but to &#8220;XML&#8221;, so the quotation is wrong. But the statement is right for both data structuring methods. No matter if you put your data in XML, in RDF, or carve it in stone &#8211; there will never be a final model, because <a href="http://www.well.com/~doctorow/metacrap.htm#2.7">there&#8217;s more than one way to describe something</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/09/21/modeling-is-difficult/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Query parts in Linked Data URIs considered harmful</title>
		<link>http://jakoblog.de/2011/05/29/query-parts-in-linked-data-uris-considered-harmful/</link>
		<comments>http://jakoblog.de/2011/05/29/query-parts-in-linked-data-uris-considered-harmful/#comments</comments>
		<pubDate>Sat, 28 May 2011 23:40:44 +0000</pubDate>
		<dc:creator>jakob</dc:creator>
				<category><![CDATA[en]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[URI]]></category>

		<guid isPermaLink="false">http://jakoblog.de/?p=1045</guid>
		<description><![CDATA[The generic syntax of URI (and of IRI with slightly different definition of the elements) is: &#60;scheme name> : &#60;hierarchical part> [ ? &#60;query> ] [ # &#60;fragment> ] RDF is based on &#8220;URI references&#8221; which have a different syntax, but all practical RDF data should follow the generic URI syntax. This is also implied [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://tools.ietf.org/html/rfc3986#section-1.1.1">generic syntax of URI</a> (and of IRI, with a slightly different definition of the elements) is:</p>
<pre>
&lt;scheme name> : &lt;hierarchical part> [ ? &lt;query> ] [ # &lt;fragment> ]
</pre>
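<p>To make the four elements concrete, here is a minimal Python sketch that splits a URI along this generic syntax with the standard library. The helper name <code>uri_parts</code> and the example URI (including its <code>format=rdf</code> query) are only assumptions for illustration, and the sketch glosses over edge cases such as URIs with an empty authority:</p>

```python
from urllib.parse import urlsplit


def uri_parts(uri):
    """Split a URI into the four elements of the generic syntax.

    Hypothetical helper for illustration: urlsplit reports authority and
    path separately, so they are joined again into one hierarchical part.
    """
    parts = urlsplit(uri)
    hierarchical = ("//" + parts.netloc if parts.netloc else "") + parts.path
    return {
        "scheme": parts.scheme,
        "hierarchical": hierarchical,
        "query": parts.query,
        "fragment": parts.fragment,
    }


# Example URI made up for this sketch
print(uri_parts("http://example.org/foobar?format=rdf#section"))
```

<p>For a URI without query and fragment, both of these entries are simply empty strings.</p>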
<p>RDF is based on &#8220;URI references&#8221; which have a different syntax, but all practical RDF data <a href="http://answers.semanticweb.com/questions/8244/how-do-you-handle-non-iri-compatible-urirefs">should follow the generic URI syntax</a>. This is also implied by the <a href="http://www.w3.org/DesignIssues/LinkedData.html">Linked Data Principles</a>, which encourage you to use HTTP URIs. Furthermore it is <a href="http://blog.iandavis.com/2007/11/29/its-ok-to-use-uris-with-fragments-in-rdf/">good advice not to include fragments</a> in your URIs if you care about coexistence of the Web and the Semantic Web. Yes, there is RDF data with fragment parts and the so-called <a href="http://www.w3.org/TR/cooluris/#hashuri">Hash URIs</a> are even given as one design pattern, but you can avoid a lot of trouble if you use URIs without them. By the way, fragment identifiers are also used a lot in JavaScript-based web applications that spoil the whole concept of REST, <a href="http://www.w3.org/QA/2011/05/hash_uris.html">as described in this recent in-depth article on Hash URIs</a>.</p>
<p>I would even go further and say that well-designed URIs for Linked Data should also forgo the query part. A clean URI to be used as identifier in Linked Data should stick to this limited generic syntax:</p>
<pre>
http: &lt;hierarchical part>
</pre>
<p>I do not argue for disallowing URIs with a query part, but I wonder how much they are really used and needed for resources other than Web resources. URIs for <i>non-information resources</i> in particular should not have a query part. Most web applications will not distinguish between these two:</p>
<pre>http://example.org?foo=1&#038;bar=2

http://example.org?bar=2&#038;foo=1
</pre>
<p>These are different URIs but equivalent URLs. Choosing URIs that are not equivalent to other URIs in common non-RDF applications is good advice, isn&#8217;t it? If you choose a cleaner URI like <code>http://example.org/foobar</code> you could use additional query parts to refer to different representations (<i>information resources</i>) and services around the resource that is referenced by the base URI.</p>
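<p>The difference between comparing the two example URIs as identifiers and as URLs can be sketched in a few lines of Python. The function name <code>same_for_web_app</code> is made up for this sketch, and it assumes a web application that parses the query into unordered key-value pairs, which is common but not guaranteed:</p>

```python
from urllib.parse import urlsplit, parse_qs


def same_for_web_app(uri_a, uri_b):
    """True if two URIs differ at most in the order of their query parameters.

    Hypothetical helper: it models an application that reads the query as
    unordered key-value pairs. Fragments are ignored in this comparison.
    """
    a, b = urlsplit(uri_a), urlsplit(uri_b)
    same_location = (a.scheme, a.netloc, a.path) == (b.scheme, b.netloc, b.path)
    return same_location and parse_qs(a.query) == parse_qs(b.query)


first = "http://example.org?foo=1&bar=2"
second = "http://example.org?bar=2&foo=1"

# As identifiers, URIs are compared as plain strings
print(first == second)                  # False
# As URLs, both lead to the same set of parameters
print(same_for_web_app(first, second))  # True
```

<p>An RDF store compares the two as identifiers and treats them as different resources, so statements about one are not statements about the other.</p>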
]]></content:encoded>
			<wfw:commentRss>http://jakoblog.de/2011/05/29/query-parts-in-linked-data-uris-considered-harmful/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

