<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Labs &#187; licenses</title>
	<atom:link href="http://labs.creativecommons.org/category/licenses/feed/" rel="self" type="application/rss+xml" />
	<link>http://labs.creativecommons.org</link>
	<description>by Creative Commons</description>
	<lastBuildDate>Mon, 09 Nov 2009 17:29:34 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>License-oriented metadata validator and viewer: summertime is winding up</title>
		<link>http://labs.creativecommons.org/2008/08/16/license-oriented-metadata-validator-and-viewer-summertime-is-winding-up/</link>
		<comments>http://labs.creativecommons.org/2008/08/16/license-oriented-metadata-validator-and-viewer-summertime-is-winding-up/#comments</comments>
		<pubDate>Sat, 16 Aug 2008 20:41:50 +0000</pubDate>
		<dc:creator>Hugo Dworak</dc:creator>
				<category><![CDATA[development]]></category>
		<category><![CDATA[license]]></category>
		<category><![CDATA[licenses]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[rdfa]]></category>
		<category><![CDATA[summer of code]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[opensource]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://labs.creativecommons.org/?p=157</guid>
		<description><![CDATA[Google Summer of Code 2008 approaches its end, as less than forty-eight hours are left to submit the code that will then be evaluated by mentors, therefore it is fitting to pause for a moment and sum up the work that has been done with regard to the license-oriented metadata validator and viewer and to [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://code.google.com/soc/2008/">Google Summer of Code 2008</a> approaches <a href="http://code.google.com/soc/2008/faqs.html#0.1_timeline" title="Google Summer of Code 2008 Timeline">its end</a>, as less than forty-eight hours are left to submit the code that will then be evaluated by mentors, therefore it is fitting to pause for a moment and sum up the work that has been done with regard to the license-oriented metadata validator and viewer and to confront it with <a href="http://wiki.creativecommons.org/Rewrite_Metadata_Validator/SoC_2008#Proposed_timeline">the original proposal for the project</a>.</p>
<p>A Web application capable of parsing and displaying license information embedded in both <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-well-formed">well-formed</a> and ill-formed Web pages has been developed. It supports the following means of embedding license information: <a href="http://dublincore.org/documents/dcq-html/">Dublin Core metadata</a>, <a href="http://www.w3.org/TR/rdfa-syntax/">RDFa</a>, <a href="http://www.w3.org/TR/rdf-syntax-grammar/">RDF/XML</a> linked externally or embedded (utilising <a href="http://www.ietf.org/rfc/rfc2397.txt">the <span style="font-family: monospace">data</span> <abbr title="Uniform Resource Locator">URL</abbr> scheme</a>) using the <span style="font-family: monospace">link</span> and <span style="font-family: monospace">a</span> elements, and RDF/XML embedded in a comment or as an element (the last two being deprecated). This functionality has been proven by unit testing. The source code of a Web page can be uploaded or pasted by a user, there is also a possibility to provide a <a href="http://www.ietf.org/rfc/rfc3986.txt"><abbr title="Uniform Resource Identifier">URI</abbr></a> for the Web application to analyse it. The software has been written in <a href="http://www.python.org/">Python</a> and uses <a href="http://pylonshq.com/">the Pylons Web Framework</a> and <a href="http://genshi.edgewall.org/">the Genshi toolkit</a>. Should you be willing to test this <a href="http://lynx.isc.org/">Lynx</a>-friendly application, please visit <a href="http://validator-beta.creativecommons.org/" title="The temporary address of the license validator" rel="nofollow">its Web site</a>.</p>
<p>The Web application itself uses a library called “libvalidator”, which in turn is powered by cc.license (a library developed by Creative Commons that returns information about a given license), <a href="http://www.w3.org/2007/08/pyRdfa/">pyRdfa</a> (a distiller that generates the <a href="http://www.w3.org/RDF/"><abbr title="Resource Description Framework">RDF</abbr></a> triples from an (X)HTML+RDFa file), <a href="http://code.google.com/p/html5lib/">html5lib</a> (an <abbr title="HyperText Markup Language">HTML</abbr> parser/tokenizer), and <a href="http://rdflib.net/">RDFLib</a> (a library for working with RDF). The choice of this set of tools has not been obvious and the library had undergone several redesigns, which included removing the code that employed <a href="http://cthedot.de/encutils/">encutils</a>, <a href="http://www.w3.org/TR/xml-c14n"><abbr title="Extensible Markup Language">XML</abbr> canonicalization</a>, <a href="http://utidylib.berlios.de/">µTidylib</a>, and <a href="http://www.crummy.com/software/BeautifulSoup/">the BeautifulSoup</a>. The idea of using <a href="http://librdf.org/">librdf</a>, <a href="http://rdfa.digitalbazaar.com/librdfa/">librdfa</a>, rdfadict has been abandoned. The source code of both the Web application (licensed under <a href="http://www.gnu.org/licenses/agpl-3.0.txt">the GNU Affero General Public License version 3</a> or newer) and its core library (licensed under <a href="http://www.gnu.org/licenses/lgpl-3.0.txt">the GNU Lesser General Public License version 3</a> or newer) is available through <a href="http://code.creativecommons.org/viewgit">the Git repositories of Creative Commons</a>.</p>
<p>In contrast to the contents of the original proposal, the following goals have not been met: traversal of special links, syndication feeds parsing, statistics, and cloning the layout of the Creative Commons Web site. However, these were never <a href="http://wiki.creativecommons.org/Rewrite_Metadata_Validator">mandatory requirements for the Web application</a>. It is also worth noting that the software has been written from scratch, although a now-defunct metadata validator existed. Nevertheless, the development does not end with Google Summer of Code — these and several new features (such as validation of multimedia files via liblicense and support for different language versions) are planned to be added, albeit at a slower pace.</p>
<p>After the test period, the validator will be available under <a href="http://validator.creativecommons.org/">http://validator.creativecommons.org/</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://labs.creativecommons.org/2008/08/16/license-oriented-metadata-validator-and-viewer-summertime-is-winding-up/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>License-oriented metadata validator and viewer: libvalidator</title>
		<link>http://labs.creativecommons.org/2008/07/08/license-oriented-metadata-validator-and-viewer-libvalidator/</link>
		<comments>http://labs.creativecommons.org/2008/07/08/license-oriented-metadata-validator-and-viewer-libvalidator/#comments</comments>
		<pubDate>Tue, 08 Jul 2008 12:20:56 +0000</pubDate>
		<dc:creator>Hugo Dworak</dc:creator>
				<category><![CDATA[development]]></category>
		<category><![CDATA[license]]></category>
		<category><![CDATA[licenses]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[opensource]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[summer of code]]></category>
		<category><![CDATA[rdfa]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://labs.creativecommons.org/?p=110</guid>
		<description><![CDATA[As the <a href="http://code.google.com/soc/2008/">Google Summer of Code 2008</a> midterm evaluation <a href="http://code.google.com/soc/2008/faqs.html#0.1_timeline">deadline is approaching</a>, it is a good time to report the progress when it comes to the license-oriented metadata validator and viewer.]]></description>
			<content:encoded><![CDATA[<p>As the <a href="http://code.google.com/soc/2008/">Google Summer of Code 2008</a> midterm evaluation <a href="http://code.google.com/soc/2008/faqs.html#0.1_timeline">deadline is approaching</a>, it is a good time to report the progress when it comes to the license-oriented metadata validator and viewer.</p>
<p>The source code is located in two dedicated <a href="http://git.or.cz/">git</a> repositories. The first being <a href="http://code.creativecommons.org/viewgit?r=validator.git">validator</a>, which contains the source code of the Web application based on <a href="http://pylonshq.com/">Pylons</a> and <a href="http://genshi.edgewall.org/">Genshi</a>. The second repository is <a href="http://code.creativecommons.org/viewgit?r=libvalidator.git">libvalidator</a>, which hosts the files that constitute the core library that the project will utilise. This is the component that the development focuses on right now.</p>
<p>The purpose of the aforementioned library is to parse input files, scan them for relevant license information, and output the results in a machine-readable fashion. More precisely, its workflow is the following: parse the file and associated <a href="http://www.w3.org/RDF/"><abbr title="Resource Description Framework">RDF</abbr></a> information so that a complete set of RDF data is available, filter the results with regard to license information (not only related to the document itself, but also to other objects described within it), and return the results in a manner preferable for the usage by the Web application.</p>
<p><a href="http://www.w3.org/2007/08/pyRdfa/">pyRdfa</a> seems to be the best tool for the parsing stage so far. It handles the current recommendation for embedding license metadata (namely <a href="http://www.w3.org/TR/rdfa-syntax/">RDFa</a>) as well as other non-deprecated methods: linking to an external or embedded (using <a href="http://www.ietf.org/rfc/rfc2397.txt">the &#8220;data&#8221; URL scheme</a>) RDF files and utilising the <a href="http://dublincore.org/">Dublin Core</a>. The significant lacking is handling of the invalid direct embedding of <a href="http://www.w3.org/TR/rdf-syntax-grammar/">RDF/XML</a> within the <a href="http://www.w3.org/MarkUp/"><abbr title="HyperText Markup Language">HTML</abbr>/<abbr title="Extensible Hypertext Markup Language">XHTML</abbr></a> source code (as an element or in a comment) and this is resolved by first capturing all such instances using a regular expression and then parsing the data just as external RDF/XML files.</p>
<p>Once the RDF triples are extracted, one can use <a href="http://www.w3.org/TR/rdf-sparql-query/"><acronym title="SPARQL Protocol and RDF Query Language">SPARQL</acronym></a> to narrow the results just to the triples related to the licensed objects. Both <a href="http://librdf.org/">librdf</a> and <a href="http://rdflib.net/">rdflib</a> support this language. Moreover, the RDF/XML related to the license must be parsed, so that its conditions (permissions, requirements, and restrictions) are then presented to the user.</p>
<p>The library takes advantage of standard Python tools such as <a href="http://pypi.python.org/pypi/zc.buildout/">Buildout</a> and <a href="http://www.somethingaboutorange.com/mrl/projects/nose/">nose</a>. When it is completed, the project will be all about writing a Web application that will serve as an interface to libvalidator.</p>
]]></content:encoded>
			<wfw:commentRss>http://labs.creativecommons.org/2008/07/08/license-oriented-metadata-validator-and-viewer-libvalidator/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Destination: Sanity</title>
		<link>http://labs.creativecommons.org/2008/03/27/destination-sanity/</link>
		<comments>http://labs.creativecommons.org/2008/03/27/destination-sanity/#comments</comments>
		<pubDate>Thu, 27 Mar 2008 22:45:12 +0000</pubDate>
		<dc:creator>Nathan Yergler</dc:creator>
				<category><![CDATA[internal]]></category>
		<category><![CDATA[licenses]]></category>
		<category><![CDATA[sanity]]></category>
		<category><![CDATA[tools]]></category>

		<guid isPermaLink="false">http://techblog.creativecommons.org/2008/03/27/destination-sanity/</guid>
		<description><![CDATA[Last week Kinkade asked me for a brief overview of how the license engine, web services and other bits of code all fit together to create the joy that is creativecommons.org.  &#8220;Sure,&#8221; I thought; &#8220;that&#8217;s simple!&#8221;
Er, maybe not.  Fourty-five minutes, five marker colors and multiple digressions later, I had the following diagram of [...]]]></description>
			<content:encoded><![CDATA[<p>Last week <a href="http://creativecommons.org/about/people#75">Kinkade</a> asked me for a brief overview of how the <a href="http://code.creativecommons.org/viewsvn/cc.engine">license engine</a>, <a href="http://api.creativecommons.org">web services</a> and other <a href="http://code.creativecommons.org/viewsvn/license_xsl">bits</a> of <a href="http://code.creativecommons.org/viewsvn/licenses.rdf">code</a> all fit together to create the joy that is <a href="http://creativecommons.org">creativecommons.org</a>.  &#8220;Sure,&#8221; I thought; &#8220;that&#8217;s simple!&#8221;</p>
<p>Er, maybe not.  Fourty-five minutes, five marker colors and multiple digressions later, I had the following diagram of life as it is today.</p>
<p><a href="http://www.flickr.com/photos/nathan_y/2347987536/" title="The Present by Nathan Y, on Flickr"><img src="http://farm3.static.flickr.com/2297/2347987536_5515b1f963.jpg" width="500" height="375" alt="The Present" /></a></p>
<caption>&#8220;The Present&#8221;, by <a href="http://flickr.com/photos/nathan_y">Nathan Y.</a>; <a href="http://creativecommons.org/licenses/by-sa/2.0/">CC BY-SA 2.0</a></caption>
<p>Life sucks.</p>
<p><a href="http://creativecommons.org/about/people#83">Asheesh</a> joined us and we started talking about how we can make this better.  The above, while eminently sucky, has grown up during my time at Creative Commons.  All those decisions made sense at the time, but in aggregate we&#8217;ve got lots of duplicated code, a branch of code named the <code>gradually-increasing-sanity-branch</code> which doesn&#8217;t (I take the blame for that one), and plenty of unnecessary complexity.  Half an hour later, we had mapped out <strong>The Glorious Future&reg;</strong>:</p>
<p><a href="http://www.flickr.com/photos/nathan_y/2347986388/" title="The Glorious Future by Nathan Y, on Flickr"><img src="http://farm4.static.flickr.com/3267/2347986388_dd1c466a5e.jpg" width="500" height="375" alt="The Glorious Future" /></a></p>
<caption>&#8220;The Glorious Future&#8221;, by <a href="http://flickr.com/photos/nathan_y">Nathan Y.</a>; <a href="http://creativecommons.org/licenses/by-sa/2.0/">CC BY-SA 2.0</a></caption>
<p>A little simpler, huh?  And the &#8220;future&#8221; diagram shows all the functionality of the present, <em>plus</em> three packages not displayed on the original diagram.  Our immediate goal in moving in this direction is the completion of <code>cc.license</code> (labeled as &#8220;cc.licenze&#8221; in the diagrams to distinguish it from the existing implementation) which will replace the existing XSLT processing using for issuing licenses and wraps the RDF (which is the canonical representation of the licenses anyway).  We&#8217;ll also manage to dramatically reduce the number of <code>svn:externals</code> we use, which is good since we&#8217;re moving <a href="http://git.or.cz/">away</a> from <a href="http://subversion.tigris.org">Subversion</a> for some projects.  My goal is to get this upgrade done as soon as possible so we can focus on things that are actually interesting instead of our own infrastructure.</p>
]]></content:encoded>
			<wfw:commentRss>http://labs.creativecommons.org/2008/03/27/destination-sanity/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
