<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Software Research and the Industry &#187; Wikimedia</title>
	<atom:link href="http://dirkriehle.com/category/wikimedia/feed/" rel="self" type="application/rss+xml" />
	<link>http://dirkriehle.com</link>
	<description>Dirk Riehle&#039;s blog about everything computer science, applied and more</description>
	<lastBuildDate>Sun, 05 Feb 2012 20:26:14 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Call for Papers: WikiSym 2012</title>
		<link>http://dirkriehle.com/2012/01/17/call-for-papers-wikisym-2012/</link>
		<comments>http://dirkriehle.com/2012/01/17/call-for-papers-wikisym-2012/#comments</comments>
		<pubDate>Tue, 17 Jan 2012 15:24:41 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Announcement]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Wikimedia]]></category>
		<category><![CDATA[Wikis]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2764</guid>
		<description><![CDATA[8th International Symposium on Wikis and Open Collaboration August 27-29, 2012 &#124; Linz, Austria The International Symposium on Wikis and Open Collaboration (WikiSym) is the premier conference on open collaboration and related technologies. In 2012, WikiSym celebrates its 8th year &#8230; <a href="http://dirkriehle.com/2012/01/17/call-for-papers-wikisym-2012/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p dir="ltr"><em><strong>8th International Symposium on Wikis and Open Collaboration</strong></em></p>
<p dir="ltr"><strong>August 27-<strong id="internal-source-marker_0.941593733150512">29</strong>, 2012</strong> | Linz, Austria</p>
<p>The International Symposium on Wikis and Open Collaboration (<a href="http://www.wikisym.org" target="_blank">WikiSym</a>) is the premier conference on open collaboration and related technologies. In 2012, WikiSym celebrates its 8th year of scholarly, technical and community innovation in <strong>Linz, Austria</strong>.  We are excited this year to be collocated with <strong><a href="http://www.aec.at/festival/en/">Ars Electronica</a></strong>, the premier digital art and science meeting that attracts over 35,000 attendees per year.</p>
<p>Submissions are invited for the following categories:</p>
<p><span id="more-2764"></span></p>
<table style="margin: 0px; border: 1px solid black; border-collapse: collapse;">
<colgroup>
<col width="132" />
<col width="401" /></colgroup>
<tbody>
<tr>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">April 7, 2012 [1]</td>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">Research Papers, Panels, Workshops and Experience Reports</td>
</tr>
<tr>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">April 27, 2012 [1]</td>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">Doctoral Symposium</td>
</tr>
<tr>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">May 30, 2012</td>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">Notification of Acceptance for Research Papers, Panels, Workshops and Experience reports</td>
</tr>
<tr>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">June 8, 2012</td>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">Posters and Demos due</td>
</tr>
<tr>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">June 22, 2012</td>
<td style="vertical-align: top; padding: 4px; border: 1px solid black;">Posters and Demos announced</td>
</tr>
</tbody>
</table>
<p><small>[1] As determined at the International Date Line. In other words, as long as it&#8217;s still April 7th or April 27 somewhere on Earth, the system will accept your submissions.</small></p>
<p>The conference program will include a peer-reviewed <em>research track</em>, <em>experience reports</em>, <em>workshops</em>, <em>posters</em>, <em>demos</em>, a <em>doctoral consortium</em>, <em>invited keynotes</em> and <em>panel speakers</em>. As always, the participant-organized Open Space track will run throughout the conference. Evening social events will follow, because wiki folks know the value of a good party for sparking conversation and collaboration. Finally, WikiSym co-occurs with Ars Electronica, and we are arranging experiences where conference attendees can enjoy this innovative and unusual event.</p>
<p>Topics appropriate for submissions include all aspects of the people, tools, contexts, and content that comprise open collaboration systems. For example:</p>
<ul>
<li>Collaboration tools and processes</li>
<li>Social and cultural aspects of collaboration</li>
<li>Collaboration beyond text: images, video, sound, etc.</li>
<li>Communities and workgroups</li>
<li>Knowledge and information production</li>
<li>New media literacies</li>
<li>Uses and impact of wikis and other open resources, tools, and practices in fields and application areas, for example:</li>
<ul>
<li>Open source software development and use</li>
<li>Education and Open Educational Resources</li>
<li>E-government, open government, and public policy</li>
<li>Law/Intellectual Property (including Creative Commons)</li>
<li>Journalism (including participatory journalism)</li>
<li>Art and Entertainment (including collaborative and audience-involved art)</li>
<li>Science (including collaboratories)</li>
<li>Publishing (including open access and open review models)</li>
<li>Business (including open and collaborative management styles)</li>
</ul>
</ul>
<p>In addition to research and development topics, WikiSym also invites innovative proposals for open, collaborative art and performance.  These proposals should be made directly to the conference chairs.</p>
<h2 dir="ltr">General submission instructions and information</h2>
<p>All accepted submissions will be published in the WikiSym proceedings and archived in the ACM Digital Library. Long and short research papers will be rigorously peer reviewed and treated as archival publications. Submissions to other tracks will also be reviewed and appear in the ACM DL, but they are considered to be non-archival and may be used as the basis for later publications. Authors of research papers should use the ACM/CHI SIG Proceedings Format, and other contribution types will use the ACM/CHI Extended Abstracts Format. Templates for both formats are available at <a href="http://www.acm.org/sigs/pubs/proceed/template.html">http://www.acm.org/sigs/pubs/proceed/template.html</a>.</p>
<p>General submission instructions will be posted and the conference submission site opened around March 1. Instructions for the various contribution types are below.</p>
<h3>Research Papers – Long (up to 10 pages) and Short (up to 4 pages)</h3>
<p>Research papers present integrative reviews or original reports of substantive new work: theoretical, empirical, and/or in the design, development and/or deployment of novel systems.</p>
<p>Research papers will be reviewed by the Program Committee to meet rigorous academic standards of publication. Papers will be reviewed for relevance, conceptual quality, innovation and clarity of presentation. They should be written in English and must not exceed 10 pages (for full papers) or 4 pages (for short papers). At least one author of accepted papers is required to attend the conference in order to present the paper.</p>
<h3>Workshops (up to 6 pages, Extended Abstracts format)</h3>
<p>Workshops provide an opportunity for researchers and practitioners to discuss and learn about topics that require in-depth, extended engagement such as new systems, research methods, standards, and formats.</p>
<p>Workshop proposals should describe what you intend to do and how your session will meet the criteria described above. It should include a concise abstract, proposed time frame (half-day or full-day), what you plan to do during the workshop, and one-paragraph biographies of all organizers. Workshop proposals will be reviewed and selected for their interest to the community. Each accepted workshop will be provided with a meeting room for either a half or full day. Organizers may also request technology and materials (projector, flip pads, etc).</p>
<h3>Panels (up to 6 pages, Extended Abstracts format)</h3>
<p>Panels provide an interactive forum for bringing together people with interesting points of view to discuss compelling issues around open collaboration. Panels involve participation from both the panelists and audience members in a lively discussion. Proposals for panels should describe the topics and goals and explain how the panel will be organized and how the Wikisym community will benefit. It should include a concise abstract and one-paragraph biographies of panelists and moderators. Panel submissions will be reviewed and selected for their interest to the community. Each panel will be given a 90-minute time slot.</p>
<h3>Experience Reports (up to 16 pages, Extended Abstracts format)</h3>
<p>Experience reports are an integral part of the conference program. These are opportunities to discuss how ideas that sound good on paper (and at conferences!) work in real life projects and deployments. Many attendees want to learn from people on the front lines what it is like to do things like start a company wiki, use open collaboration tools in a classroom, or build a political campaign around open collaboration systems. Experience reports are not research papers; their goal is to present experience and reflections on a particular case, and they are reviewed for usefulness, clarity and reflection. Strong experience reports discuss both benefits and drawbacks of the approaches used and clearly call out lessons learned. Reports may focus on a particular aspect of technology usage and practice, or describe broad project experiences.</p>
<h3>Posters (up to 4 pages, Extended Abstracts format)</h3>
<p>Poster presentations enable researchers to present late-breaking results, significant work in progress, or work that is best communicated in conversation. WikiSym&#8217;s lively poster sessions let conference attendees exchange ideas one-on-one with authors, and let authors discuss their work in detail with those attendees most deeply interested in the topic. Poster proposals may describe original research, engineering, or experience reports. Successful applicants will display their posters, up to 1x2m in size, at a special session during the Symposium.</p>
<h3>Demos (up to 4 pages, Extended Abstracts format)</h3>
<p>No format is better suited for demonstrating the utility of new collaboration technologies than showing and using them. Demonstrations give presenters an opportunity to show running systems and gather feedback. Demo submissions should provide a setup for the demo, a specific description of what you plan to demo, what you hope to get out of demoing, and how the audience will benefit. A short note of any special technical requirements should be included. Demo submissions will be reviewed based on their relevance to the community.</p>
<h3>Doctoral Symposium</h3>
<p>The WikiSym 2012 Doctoral Symposium is a forum in which Ph.D. students can meet and discuss their work with each other and a panel of experienced researchers and practitioners. The symposium will be held on Tuesday August 28 on the campus of Johannes Kepler University. More information about the symposium’s leaders, goals, submission process and criteria, and funding will be posted shortly.</p>
<h3>Open Space</h3>
<p>For short and informal opportunities to organize discussion, brain-storming, and other collaborative activities, the Open Space track will run throughout WikiSym. Open Space is an entirely participant-organized track and requires no submission or review.</p>
<h3>Note on Publications</h3>
<p>Work submitted to Wikisym is published in the ACM digital library. This means it is not open access. However, ACM has a very new service called ACM Author-izer which allows authors to post official copies of their papers on personal websites for people to access, even if those people do not have access to the ACM digital library. We see this as a step to open access and are pleased to support this service.</p>
<p><a href="http://www.acm.org/publications/acm-author-izer-service">http://www.acm.org/publications/acm-author-izer-service</a></p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2012/01/17/call-for-papers-wikisym-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Design and Implementation of the Sweble Wikitext Parser: Unlocking the Structured Data of Wikipedia</title>
		<link>http://dirkriehle.com/2011/07/29/design-and-implementation-of-the-sweble-wikitext-parser/</link>
		<comments>http://dirkriehle.com/2011/07/29/design-and-implementation-of-the-sweble-wikitext-parser/#comments</comments>
		<pubDate>Fri, 29 Jul 2011 15:47:45 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Publication]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Wikimedia]]></category>
		<category><![CDATA[Wikis]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2581</guid>
		<description><![CDATA[Abstract:&#160;The heart of each wiki, including Wikipedia, is its content. Most machine processing starts and ends with this content. At present, such processing is limited, because most wiki engines today cannot provide a complete and precise representation of the wiki’s &#8230; <a href="http://dirkriehle.com/2011/07/29/design-and-implementation-of-the-sweble-wikitext-parser/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><strong>Abstract:</strong>&nbsp;The heart of each wiki, including Wikipedia, is its content. Most machine processing starts and ends with this content. At present, such processing is limited, because most wiki engines today cannot provide a complete and precise representation of the wiki’s content. They can only generate HTML. The main reason is the lack of well-defined parsers that can handle the complexity of modern wiki markup. This applies to MediaWiki, the software running Wikipedia, and most other wiki engines. This paper shows why it has been so difficult to develop comprehensive parsers for wiki markup. It presents the design and implementation of a parser for Wikitext, the wiki markup language of MediaWiki. We use parsing expression grammars where most parsers used no grammars or grammars poorly suited to the task. Using this parser it is possible to directly and precisely query the structured data within wikis, including Wikipedia. The parser is available as open source from <a href="http://sweble.org">http://sweble.org</a>.</p>
<p><strong>Keywords:</strong>&nbsp;Wiki, Wikipedia, Wiki Parser, Wikitext Parser, Parsing Expression Grammar, PEG, Abstract Syntax Tree, AST, WYSIWYG, Sweble.</p>
<p><strong>Reference:</strong>&nbsp;Hannes Dohrn and Dirk Riehle. &#8220;Design and Implementation of the Sweble Wikitext Parser: Unlocking the Structured Data of Wikipedia.&#8221; In <em>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</em> (WikiSym 2011). ACM Press, 2011.</p>
<p>The paper is available as a <a href="/wp-content/uploads/2011/07/diwp.pdf">PDF file</a> (preprint).</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/07/29/design-and-implementation-of-the-sweble-wikitext-parser/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Technical Report on WOM: An Object Model for Wikitext</title>
		<link>http://dirkriehle.com/2011/07/29/technical-report-on-wom-an-object-model-for-wikitext/</link>
		<comments>http://dirkriehle.com/2011/07/29/technical-report-on-wom-an-object-model-for-wikitext/#comments</comments>
		<pubDate>Fri, 29 Jul 2011 15:40:08 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Publication]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Wikimedia]]></category>
		<category><![CDATA[Wikis]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2576</guid>
		<description><![CDATA[Abstract:&#160;Wikipedia is a rich encyclopedia that is not only of great use to its contributors and readers but also to researchers and providers of third party software around Wikipedia. However, Wikipedia&#8217;s content is only available as Wikitext, the markup language &#8230; <a href="http://dirkriehle.com/2011/07/29/technical-report-on-wom-an-object-model-for-wikitext/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><strong>Abstract:</strong>&nbsp;Wikipedia is a rich encyclopedia that is not only of great use to its contributors and readers but also to researchers and providers of third party software around Wikipedia. However, Wikipedia&#8217;s content is only available as Wikitext, the markup language in which articles on Wikipedia are written, and whoever needs to access the content of an article has to implement their own parser or has to use one of the available parser solutions. Unfortunately, those parsers which convert Wikitext into a high-level representation like an abstract syntax tree (AST) define their own format for storing and providing access to this data structure. Further, the semantics of Wikitext are only defined implicitly in the MediaWiki software itself. This situation makes it difficult to reason about the semantic content of an article or exchange and modify articles in a standardized and machine-accessible way. To remedy this situation we propose a markup language, called XWML, in which articles can be stored and an object model, called WOM, that defines how the contents of an article can be read and modified.</p>
<p><strong>Keywords:</strong>&nbsp;Wiki, Wikipedia, Wikitext, Wikitext Parser, Open Source, Sweble, Mediawiki, Mediawiki Parser, XWML, HTML, WOM</p>
<p><strong>Reference:</strong>&nbsp;Hannes Dohrn and Dirk Riehle. <em>WOM: An Object Model for Wikitext.</em> University of Erlangen, Technical Report CS-2011-05 (July 2011).</p>
<p>The technical report is available as a <a href="/wp-content/uploads/2011/07/wom-tr.pdf">PDF file</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/07/29/technical-report-on-wom-an-object-model-for-wikitext/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Teaching Note for Case &quot;User-Generated Content Systems at Intuit(A)&quot; E-381(A)</title>
		<link>http://dirkriehle.com/2011/07/26/teaching-note-for-intuit-case-e-381a/</link>
		<comments>http://dirkriehle.com/2011/07/26/teaching-note-for-intuit-case-e-381a/#comments</comments>
		<pubDate>Tue, 26 Jul 2011 08:12:58 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[CC-BY-SA 3.0]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Open Content]]></category>
		<category><![CDATA[Product Management]]></category>
		<category><![CDATA[Wikimedia]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2532</guid>
		<description><![CDATA[Abstract:&#160;This is a teaching note for the free case “User-Generated Content Systems at Intuit(A)”, E-381(A), from the Stanford Free Case collection available at ECCH. The original case is a product management case in which Intuit, maker of consumer and small &#8230; <a href="http://dirkriehle.com/2011/07/26/teaching-note-for-intuit-case-e-381a/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><strong>Abstract:</strong>&nbsp;This is a teaching note for the free case “User-Generated Content Systems at Intuit(A)”, E-381(A), from the <a href="http://www.ecch.com/educators/casemethod/resources/freecases/stanfordfreecases">Stanford Free Case collection available at ECCH</a>. The original case is a product management case in which Intuit, maker of consumer and small business financial software, faces the decision to “go social or not” for user help in its tax preparation software. The original case discusses the pros and cons of such a disruptive innovation. This teaching note provides pertinent questions to ask your students as well as my summary answers to these questions. I could not find an original teaching note hence I wrote this one. This is my first such note so any suggestions for improvement are welcome. The note is licensed <a href="http://creativecommons.org/licenses/by-sa/3.0/legalcode">CC-BY-SA 3.0</a>; feel free to use it in your own teaching. The note&#8217;s home is <a href="http://dirkriehle.com/">my website</a>. For attribution, please link to it.</p>
<p><span id="more-2532"></span></p>
<p><strong>Introduction:</strong>&nbsp;This case, in my interpretation, is about the failure of traditional market research methods when applied to disruptive innovation. Market research methods like focus groups, interviews and surveys, and usability labs only tell product managers about potential incremental improvements of a product. Market research is good for sustaining incremental innovation. Market research doesn&#8217;t tell a product manager whether a proposed disruptive innovation is just a crazy idea or the next big thing. For such an innovation, product managers need to turn to introspection, outsiders or experiments. Also, dealing with disruptive innovation is particularly hard for organizations like Intuit that are intensely customer focused. As the case reports, Intuit spends significant amounts of time and resources on usability labs. Ironically, coping and introducing disruptive innovation is much easier for engineering-centric organizations, but that is another case.</p>
<p>The teaching note is available as a <a href="/wp-content/uploads/2011/07/Teaching-Note-E-381A.pdf">PDF file</a> and an <a href="/wp-content/uploads/2011/07/Teaching-Note-E-381A.odt">ODT file</a>. The original free cases can be downloaded from <a href="http://www.ecch.com/educators/casemethod/resources/freecases/stanfordfreecases">ECCH</a>. For your convenience, here are PDFs for <a href="/wp-content/uploads/2011/07/Intuit-A-E381AS.pdf">E-381(A)</a> and <a href="/wp-content/uploads/2011/07/Intuit-A-E381BS.pdf">E-381(B)</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/07/26/teaching-note-for-intuit-case-e-381a/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Java IP Story</title>
		<link>http://dirkriehle.com/2011/06/30/the-java-ip-story/</link>
		<comments>http://dirkriehle.com/2011/06/30/the-java-ip-story/#comments</comments>
		<pubDate>Thu, 30 Jun 2011 18:30:36 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[CC-BY-SA 3.0]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Wikimedia]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2361</guid>
		<description><![CDATA[Every year, I teach the AMOS class, a lab course on &#8220;Agile Methods and Open Source&#8221; that combines lectures with a real software project that ideally turns into a startup (see the AMOS Project concept, in German). To explain open &#8230; <a href="http://dirkriehle.com/2011/06/30/the-java-ip-story/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Every year, I teach the AMOS class, a lab course on &#8220;Agile Methods and Open Source&#8221; that combines lectures with a real software project that ideally turns into a startup (see <a href="/2010/12/06/das-amos-projektkonzept-2011/">the AMOS Project concept</a>, in German). To explain open source, I have to introduce students to intellectual property rights, of which most have been blissfully unaware of until then. Nothing teaches concepts better than a colorful story, and so I have been using the IP strategies around Java to make this dry topic come alive. For fun, comments, and corrections, I&#8217;m providing the short version of my talk below, including commentary. (You can also download <a href="/wp-content/uploads/2011/06/The Java IP Story.pdf">a PDF version of the talk</a>, with our without notes, licensed as CC-BY 3.0. If you find this useful for teaching, please tell me.) Students at this point have a basic working understanding of intellectual property and exclusion rights. Please let me know what you think! Finally, IANAL.</p>
<p><img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot01.png" width="100%" border="1px" /></p>
<p>Java is an important technology powering the modern web and in particular enterprise applications. It has a checkered intellectual property history, and with the recent acquisition of Sun, the Java creator and owner, by Oracle, things only stand to heat up. This slide set discusses some of the more interesting issues around Java intellectual property and its strategic use in business.</p>
<ol>
<li><a href="/2011/06/30/the-java-ip-story/#S02">What is Java?</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S03">Short Java IP Story Time-Line</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S04">Three Substories</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S06">Java&#8217;s Challenge to the Windows Platform</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S07">Microsoft and Java</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S08">The OpenJDK Strategy (Open Core Model)</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S09">Certification of Compatible Implementations</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S10">Threats to Commercial Revenue</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S11">Main Tools to Curtail “Competitors”</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S12">Problems for Alternative Implementations</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S13">Problems for OpenJDK Forks</a></li>
<li><a href="/2011/06/30/the-java-ip-story/#S14">Thank you! and References</a></li>
</ol>
<p><span id="more-2361"></span></p>
<p><a name="S02"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot02.png" width="100%" border="1px" /><br />
</a></p>
<p>&#8220;Java&#8221; consists of multiple components [2].</p>
<ul>
<li>Java, the programming language, which has been standardized</li>
<li>the JRE, which contains the necessary infrastructure to run Java applications</li>
<li>the JDK, which is the JRE including its libraries and runtime, plus some tools</li>
<li>in addition, there are many third party libraries and development tools</li>
</ul>
<p>Java comes in multiple editions, which target different execution environments, most notably embedded, desktop, and enterprise systems.</p>
<p>Java, the programming language, is changing only slowly. However, the libraries are evolving at a rapid pace. To make them useful to industry, the JCP, the Java Community Process, defines specifications and develops reference implementations and compatibility test suites. The JCP was set up by Sun and brought together all relevant industry players with an interest in Java. To be allowed into the process, vendors have to sign the JSPA, the Java Specification Participation Agreement, which gives Java&#8217;s owner various rights to the specification output. The individual specifications are called the JSRs for Java Specification Requests.</p>
<p>Originally Sun, now Oracle, owns the following intellectual property rights:</p>
<ul>
<li>the Java trademark; this allows it to stop labeling other parties software as &#8220;Java&#8221; in the domain of information technology</li>
<li>various JSR test suites; this output of the JCP serves to certify Java component implementations as specification compatible</li>
<li>many patents in the implementation of the core Java programming language and runtime</li>
</ul>
<p>These property rights are being used in various scenarios.</p>
<p><a name="S03"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot03.png" width="100%" border="1px" /><br />
</a></p>
<p>To get started, lets review a few key events in Java&#8217;s history [1]. Java was released to the public in 1996 under a proprietary license. Developers could use Java, but it wasn&#8217;t open source. With fast-rising industry interest, Sun announced and formalized the JCP in 1998 to help move Java forward by involving other industry players.</p>
<p>Microsoft was one of the industry players that adopted Java on its platform early on. Already in 1997 Sun filed a lawsuit alleging that Microsoft was not playing by the licensing agreement, and in 2001 Sun won this lawsuit. Microsoft has since removed Java from its platform leaving the integration to 3rd parties.</p>
<p>A first open source implementation of the Java tools, runtime, and libraries was begun in 1998 as the GNU Classpath project; it has yet to reach completion. The project uses the GPL open source license with a modification, the classpath exception, that allows linking of applications so that they don&#8217;t fall under the GPL.</p>
<p>Another open source implementation of Java was begun under the auspices of the ASF, the Apache Software Foundation, in 2005, labeled &#8220;Project Harmony&#8221;. Sun offers the non-profit community free Java certification if it was willing to accept certain field-of-use restriction for its software. The ASF objected on principled (open source) grounds, preventing it from using the Java trademark.</p>
<p>Sun itself released most of the Java tools, libraries, and runtime in 2006, finishing it in 2007, under the GPLv2 open source license. Called the OpenJDK, this is still the only complete open source implementation of Java available today.</p>
<p><a name="S04"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot04.png" width="100%" border="1px" /><br />
</a></p>
<p>I am going to tell three stories, in which intellectual property and business strategy come together:</p>
<ul>
<li>The Java vs Windows platform war</li>
<li>Generating Revenues with the Open Core Model</li>
<li>Curtailing Competition using Patents and Field-of-use restrictions</li>
</ul>
<p><a name="S05"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot05.png" width="100%" border="1px" /><br />
</a></p>
<p>Questions of intellectual property and the legal system have a habit of boring one&#8217;s brain out; I hope this won&#8217;t happen in this case.</p>
<p><a name="S06"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot06.png" width="100%" border="1px" /><br />
</a></p>
<p>Java&#8217;s original marketing slogan (and promise) to software developers was to &#8220;write once, run anywhere&#8221; (WORA). It points to the most basic of industry strategies, the platform strategy.</p>
<p>In a platform strategy, a software vendor tries to make its software become the platform that everyone else builds on. By building applications on top of someone&#8217;s platform, these applications become dependent on the platform, and whenever a copy of the application is bought, a copy of the platform has to be bought too.</p>
<p>This is why Windows is so powerful and Bill Gates became so rich.</p>
<p>Java now challenged the Windows platform by completely hiding the Windows programming interface, the Win32 API, behind the Java class libraries. Thus, applications could be implemented using Java class libraries only without knowing whether these are run on a Windows or a Linux platform. This made applications portable across different operating systems and led to Java&#8217;s rallying cry of &#8220;write once, run anywhere&#8221;.</p>
<p>Of course, Java was simply trying to become the next platform to build on by hiding the underlying operating system, be it Windows or MacOS. Scott McNealy, the Sun CEO, probably wanted to become as rich as Bill Gates.</p>
<p><a name="S07"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot07.png" width="100%" border="1px" /><br />
</a></p>
<p>Microsoft didn&#8217;t like Sun&#8217;s Java strategy as it would reduce the lock-in of applications to the Win32 API and hence make application developers less dependent on Windows. Still, customers kept asking for Java on Windows. Eventually, Microsoft complied: Microsoft licensed Java from Sun and deployed it on the Windows platform.</p>
<p>According to a 1997 lawsuit by Sun, Microsoft did so only by violating the license agreement. Sun stated that Microsoft modified the JDK by omitting libraries that were needed to run Java well. This required application developers to go directly to the Win32 API. This strategic move was to maintain the lock-in to the underlying Windows platform that Java was trying to do away with in the first place.</p>
<p>Sun won the lawsuit in 2001, requiring Microsoft to pay damages. In the aftermath, Microsoft removed Java from its platform and introduced alternative technologies. Today, Java on Windows is provided by third parties only [3].</p>
<p><a name="S08"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot08.png" width="100%" border="1px" /><br />
</a></p>
<p>Knowing that Sun released Java as open source in 2007 and knowing that there were alternative implementations, you may be wondering how Sun ever intended to make money on Java. One answer to this is called <a href="/2010/12/06/the-single-vendor-commercial-open-source-business-model/">the open core model</a> today.</p>
<p>The release of the OpenJDK by Sun in 2007 gives developers a full Java development and runtime environment under the GPLv2 open source license. The license even comes with a modification called the Classpath Exception that allows software vendors to combine their application with the OpenJDK without having to open source their application code.</p>
<p>To not fully loose their (potential) revenue stream, Sun does two things, which taken together represent the core intellectual property strategy behind the open core model:</p>
<ul>
<li>Sun maintains the full copyright and thereby ownership of the OpenJDK code base. Any outside contributors first have to sign a joint copyright agreement [8]. While this curtails code contributions, it ensures that Sun remains the owner of the JDK.</li>
<li>Sun then uses their ownership rights to offer the OpenJDK under a commercial license to those who don&#8217;t like the GPL (dual license strategy) while at the same time providing additional non-open-sourced functionality (open core model).</li>
</ul>
<p>Thus, to get full service as well as enhanced features, Java application developers have to pay for the commercial license rather than use the open source license.</p>
<p><a name="S09"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot09.png" width="100%" border="1px" /><br />
</a></p>
<p>To win against Microsoft early on, Java needed to achieve rapid growth and industry buy-in during the 1990s.</p>
<p>Java could only grow as fast as it did, because Sun brought other major industry players on board. This was done through the JCP, the Java Community Process. In that process, companies like IBM, SAP, and anyone else who cares can influence the development of Java through library specifications and make sure it works well with their products.</p>
<p>One tangible output of the JCP is a stream of JSRs, the Java Specification Requests. Each JSR specifies a library or similar component. To each JSR specification belongs a TCK, the technology compatibility kit for the JSR. A TCK is effectively a test suite that checks a given implementation of the library for compliance with the specification. If the library passes, it is considered a compliant implementation. A Java TCK combines various JSR TCKs into a full test suite for a particular Java configuration.</p>
<p>Anyone with a Java implementation who wants a Java trademark license from Oracle first has to pass the matching Java TCK.</p>
<p><a name="S10"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot10.png" width="100%" border="1px" /><br />
</a></p>
<p>Rapid growth and the open core model meant that there could be serious business.</p>
<p>There was only one threat to those revenues: Alternative implementations. These are implementations of Java, including a JVM, tools, and class libraries. The first of these implementations, the GNU Classpath project, started as early as 1998 [17]. A later alternative implementation is the Apache Harmony project [16].</p>
<p>Because of the broad industry involvement, Sun and later Oracle couldn&#8217;t simply close off Java again after it grew to dominance as the leading enterprise programming language. Thus, Oracle today provides a defined way for industry to acquire a Java trademark license if it wants to label their Java implementation “Java”.</p>
<p>Today, Oracle makes the Java trademark license available in two main variants:</p>
<ol>
<li>For paying customers, a full trademark license is granted, if the Java implementation passes the relevant Java TCK</li>
<li>For non-paying open source projects like GNU Classpath or Apache Harmony, a restricted license is granted</li>
</ol>
<p>Option 1 works very well for Oracle, after all, it receives presumably appropriate revenues. Option 2 is not really an option for open source projects as we will see, but rather an attempt to prevent these projects from gaining traction.</p>
<p><a name="S11"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot11.png" width="100%" border="1px" /><br />
</a></p>
<p>With the OpenJDK, Oracle provides a full open source implementation of Java. However, in addition, Sun, now Oracle, made two strategic moves to curtail alternative implementations:</p>
<ul>
<li>The choice of the (1991) GPLv2 license because of its poor coverage of patents</li>
<li>The field-of-use restrictions in the trademark license for open source projects</li>
</ul>
<p><a name="S12"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot12.png" width="100%" border="1px" /><br />
</a></p>
<p>An open source Java implementation that wants to call itself “Java” has to pass the appropriate Java TCK. The use of the TCK and the Java trademark license is only granted for free, if the project (more precisely the organization behind it) accepts certain field-of-use restrictions for the project. As of today (2011), the main restriction is that the alternative implementation may not be used for embedded systems development. Enterprise applications, in contrast, are fine.</p>
<p>An example of an alternative implementation is the Apache Harmony project, which provides a comprehensive Java implementation under the Apache License 2.0.</p>
<p>The definition of Open Source Software requires that no field-of-use restrictions be imposed on the user. Oracle&#8217;s field-of-use restrictions are not acceptable to an open source project that wants to call itself that way [11] [12]. (And the same applies to free software.)</p>
<p>Game over for unrestricted free and open source Java implementations.</p>
<p><a name="S13"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot13.png" width="100%" border="1px" /><br />
</a></p>
<p>But what about forking the OpenJDK, wrestling it off the hands of Oracle?</p>
<p>In forking the OpenJDK, you face two options:</p>
<ul>
<li>If you want to put the “Java” label on your fork, you&#8217;ll have to pass the TCK and accept Oracle&#8217;s field-of-use restrictions. Since you are required to keep the software under the original GPLv2 license, this may hand you a GPL violation lawsuit, after all, you just accepted field-of-use restrictions which are incompatible with the license.</li>
<li>If you don&#8217;t care about the “Java” label, you can forgo certification. However, without the certification, you don&#8217;t get a license to the software patents buried in the OpenJDK code. Thus, any patent owner who happens not to like you might decide to sue your customers who will turn back on you and will find you without patent defense.</li>
</ul>
<p>One hope is Google&#8217;s Dalvik, once it digs itself out of the siege of patent lawsuits it is being buried under, but that will then be the Dalvik IP Story, not the Java IP story&#8230;</p>
<p>Game over, again.</p>
<p>For now.</p>
<p><a name="S14"><br />
<img src="http://dirkriehle.com/wp-content/uploads/2011/06/Screenshot14.png" width="100%" border="1px" /><br />
</a></p>
<p>[1]&nbsp;Oracle. <a href="http://www.oracle.com/technetwork/java/javase/overview/javahistory-timeline-198369.html">The Java History Timeline.</a><br />
[2]&nbsp;Wikipedia. <a href="http://en.wikipedia.org/wiki/Java_%28programming_language%29">Java (Programming Language).</a><br />
[3]&nbsp;Microsoft. <a href="http://www.microsoft.com/About/Legal/EN/US/Interoperability/Java/Default.aspx">Microsoft Java Virtual Machine Support.</a><br />
[4]&nbsp;Oracle. <a href="http://openjdk.java.net/legal/gplv2+ce.html">GNU General Public License, version 2, with the Classpath Exception.</a><br />
[5]&nbsp;Wikipedia. <a href="http://en.wikipedia.org/wiki/GPL_linking_exception">GPL Linking Exception.</a><br />
[6]&nbsp;Oracle. <a href="http://www.oracle.com/us/corporate/press/176988">Oracle and IBM Collaborate to Accelerate Java Innovation Through OpenJDK.</a><br />
[7]&nbsp;Oracle. <a href="http://www.oracle.com/technetwork/java/javase/overview/index.html">Java SE at a Glance.</a><br />
[8]&nbsp;Oracle. <a href="http://openjdk.java.net/contribute/">How to Contribute.</a><br />
[9]&nbsp;Oracle. <a href="http://openjdk.java.net/faq/">OpenJDK FAQ.</a><br />
[10]&nbsp;Doug Lea. <a href="http://gee.cs.oswego.edu/dl/html/jcp22oct10.html">Email to JCP Executive Committee Members.</a><br />
[11]&nbsp;Apache Software Foundation. <a href="https://blogs.apache.org/foundation/entry/statement_by_the_asf_board1">Our Participation in the Java Community Process.</a><br />
[12]&nbsp;Apache Software Foundation. <a href="http://www.apache.org/jcp/sunopenletter.html">Open Letter to Sun Microsystems.</a><br />
[13]&nbsp;Paul Querna. <a href="http://journal.paul.querna.org/articles/2010/10/12/java-trap-2010-edition/">Java Trap, 2010 Edition.</a><br />
[14]&nbsp;Oracle. <a href="http://www.jcp.org/en/resources/tdk">Java(tm) Compatibility Test Tools (Java CTT) Info.</a><br />
[15]&nbsp;Wikipedia. <a href="http://en.wikipedia.org/wiki/Technology_Compatibility_Kit">Technology Compatibility Kit.</a><br />
[16]&nbsp;Wikipedia. <a href="http://en.wikipedia.org/wiki/Apache_Harmony">Apache Harmony.</a><br />
[17]&nbsp;Wikipedia. <a href="http://en.wikipedia.org/wiki/GNU_Classpath">GNU Classpath.</a><br />
[18]&nbsp;Brian McCallister. <a href="http://skife.org/java/jcp/2010/12/07/the-tck-trap.html">The TCK Trap.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/06/30/the-java-ip-story/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Open Source Big Bang</title>
		<link>http://dirkriehle.com/2011/06/21/the-open-source-big-bang/</link>
		<comments>http://dirkriehle.com/2011/06/21/the-open-source-big-bang/#comments</comments>
		<pubDate>Tue, 21 Jun 2011 20:40:03 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Industry]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Wikimedia]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2271</guid>
		<description><![CDATA[Open source is not only software, but also an approach to software development. The public nature of open source projects lets us show how open source software development scales to the largest project sizes. The following figure illustrates the scalability &#8230; <a href="http://dirkriehle.com/2011/06/21/the-open-source-big-bang/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Open source is not only software, but also an approach to software development. The public nature of open source projects lets us show how open source software development scales to the largest project sizes. The following figure illustrates the scalability of open source software development. I call it <strong>the big bang of open source</strong>.</p>
<p><a href="/wp-content/uploads/2011/06/Open-Source-Big-Bang.png"><img src="/wp-content/uploads/2011/06/Open-Source-Big-Bang.png" width="100%" /></a></p>
<p><span id="more-2271"></span></p>
<p>The figure shows the growth of active well-working open source projects of all sizes over time, as captured in our database [1]. Each line represents a particular year, from 1995 to 2008. Each line shows how many projects of a particular size existed in that given year. The x-axis shows the size of projects, and the y-axis shows the number of projects of that size. So, for 1995, we can see that there were 10 projects of size 1 committer. (The scales are logarithmic [2].) Using the number of registered committers as a proxy for a project&#8217;s size is most certainly a conservative assumption. In 1995, there were also 4 projects of size 10 committers. In 1996, there were already more smaller projects and also more larger projects.</p>
<p>As you can see, the number of smallest projects (one committer) kept growing over time and reached about 3.200 in 2008 in our sample. At the same time, some of these smallest projects kept growing, migrating to the right in the figure. In 2008, there were 10 projects of size 1.000 committers! (While in 1995 there were none.) I find this continued growth of open source intriguing. Speculating from the expansion of the year lines there is a constant supply of new projects, and each project grows to the size right for it, including some very large project sizes.</p>
<p>Mathematically, of interest is the gradient over the year lines. The gradient is the formula that captures the year-over-year growth. I call the figure an illustration of the open source big bang, because the gradient captures the expansion speed of the growing open source universe. We have not yet been able to develop an appropriate mathematical model for this apparent growth. However, the figure illustrates <strong>how open source projects <em>consistently</em> scale to the largest project sizes</strong>. We may not yet know exactly why, but we are measuring that they do.</p>
<p>If you liked this blog post, you might also like reading about</p>
<ol>
<li><a href="/publications/2009/open-collaboration-within-corporations-using-software-forges/">Open source practices for internal software development</a> (a.k.a. inner source)</li>
<ul>
<li>Inner Source / Benefits / <a href="/services/inner-source/benefits/platform-development/">Platform Development Scenario</a></li>
<li>Inner Source / Benefits / <a href="/services/inner-source/benefits/selbstorganisation">Selbstorganisationszenario</a> (in German)</li>
</ul>
<li><a href="/publications/2009/the-commercial-open-source-business-model/">How to go to to market with an open source strategy</a></li>
<li><a href="/publications/2010/the-economic-case-for-open-source-foundations/">The economic case for open source foundations</a></li>
<li><a href="/presentations/current-talks/">My current presentations on open source</a></li>
</ol>
<h2>Footnotes and References </h2>
<p>[1] The data used to generate the figure was taken from an Ohloh.net database snapshot from March 2008. That snapshot contains about 30% of all active open source projects at that time, using Carlo Daffara&#8217;s <a href="http://flossmetrics.org/news/11">estimate of total population as well as activity</a>. The year lines in the figure are not the result of precise mathematical modeling, rather they are a linear regression fitted into the logarithmic data. Thus, this figure serves eyeballing purposes only. The figure itself was created by my Ph.D. student Carsten Kolassa.</p>
<p>[2] A short reminder on logarithms, in case it got rusty: 10^0 = 1, 10^0.6 = 4 (roughly), 10^1 = 10, etc. The exponents are to be found on the x and y-axes.</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/06/21/the-open-source-big-bang/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>The Parser that Cracked the MediaWiki Code</title>
		<link>http://dirkriehle.com/2011/05/01/the-parser-that-cracked-the-mediawiki-code/</link>
		<comments>http://dirkriehle.com/2011/05/01/the-parser-that-cracked-the-mediawiki-code/#comments</comments>
		<pubDate>Sun, 01 May 2011 20:52:07 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Announcement]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Open Collaboration]]></category>
		<category><![CDATA[Open Content]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Wikimedia]]></category>
		<category><![CDATA[Wikis]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2230</guid>
		<description><![CDATA[I am happy to announce that we finally open sourced the Sweble Wikitext parser. You can find the announcement on the OSR Group blog or directly on the Sweble project site. This is the work of Hannes Dohrn, my first &#8230; <a href="http://dirkriehle.com/2011/05/01/the-parser-that-cracked-the-mediawiki-code/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I am happy to announce that we finally open sourced the Sweble Wikitext parser. You can find the announcement on the <a href="http://group.riehle.org">OSR Group blog</a> or directly on <a href="http://sweble.org">the Sweble project site</a>. This is the work of Hannes Dohrn, my first Ph.D. student, who I hired in 2009 to implement a Wikitext parser.</p>
<p><strong>So what about this &#8220;cracking the MediaWiki code&#8221;?</strong></p>
<p>Wikipedia aims to bring the (encyclopedic) knowledge of the world to all of us, for free. While already ten years old, the Wikipedia community is just getting started, and we have barely seen the tip of the iceberg, there is so much more to come. All that wonderful content is being written by volunteers using a (seemingly) simple language called Wikitext (the stuff you type in once you click on edit). Until today, Wikitext had been poorly defined.</p>
<p><span id="more-2230"></span></p>
<p>There was no grammar, no defined processing rules, and no defined output like a DOM tree based on a well defined document object model. This is to say, the content of Wikipedia is stored in a format that is not an open standard. The format is defined by 5000 lines of php code (the parse function of MediaWiki). That code may be open source, but it is incomprehensible to most. That&#8217;s why there are 30+ failed attempts at writing alternative parsers. That&#8217;s why a respected long-time community member asked in exasperation: <a href="http://www.gossamer-threads.com/lists/wiki/foundation/145073#145073">Can anyone really edit Wikipedia?</a></p>
<p>The common answer given is to hide the complexity of Wikitext behind a visual editor, but that is not an answer. It doesn&#8217;t work: A visual editor, like any other algorithm that wants to work with Wikipedia content, needs a well-understood specification of the language that content is written in. This is where the Sweble parser comes in. Following well-understood computer science best practices, it uses a well-defined grammar that a parser generator uses to create a parser. It uses well-understood object-oriented design patterns (the Visitor pattern, prominently) to build a processing pipeline that transforms source Wikitext into whatever the desired output format is. And most importantly, it defines an abstract syntax tree (AST), document object model (DOM) tree soon, and works off that tree. We have come a long way from 5000 lines of php code.</p>
<p><strong>So what does creating an AST and DOM tree for Wikitext buy us?</strong></p>
<p>In short, it buys us interoperability and evolvability. In <a href="http://dirkriehle.com/2008/07/19/a-grammar-for-standardized-wiki-markup/">a 2007 paper, using the then hopeful wiki markup community standard WikiCreole</a>, we explained the need for such interoperability and evolvability as defined by an open standard. Different tools can gather around that format and evolve independently. Today, everything has to go lock-step with MediaWiki. By untying the content and data from MediaWiki, we are enabling an ecosystem of tools and technology around Wikipedia (and related project) content so these projects can gain more speed and breadth.</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/05/01/the-parser-that-cracked-the-mediawiki-code/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>Open Commons Region Linz is Starting</title>
		<link>http://dirkriehle.com/2011/03/28/open-commons-region-linz-is-starting/</link>
		<comments>http://dirkriehle.com/2011/03/28/open-commons-region-linz-is-starting/#comments</comments>
		<pubDate>Mon, 28 Mar 2011 09:37:40 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Announcement]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Open Collaboration]]></category>
		<category><![CDATA[Open Content]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Wikimedia]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2175</guid>
		<description><![CDATA[The region of and around Linz, Austria, has declared itself the Open Commons Region Linz. The opening festivities, including talks, free-of-charge, will take place on April 11th, 2011, in Linz (naturally). Read more about it on the blog of the &#8230; <a href="http://dirkriehle.com/2011/03/28/open-commons-region-linz-is-starting/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The region of and around <a href="http://en.wikipedia.org/wiki/Linz">Linz, Austria</a>, has declared itself the <a href="http://www.opencommons.linz.at/">Open Commons Region Linz</a>. The opening festivities, including talks, free-of-charge, will take place on April 11th, 2011, in Linz (naturally). Read more about it on <a href="http://www.blog.opencommons.at/">the blog of the Open Commons Region Linz</a>! I&#8217;m a member of the academic advisory council of the Open Commons Region Linz and applaud and support the effort. I&#8217;m also happy to say that it will me bring to Linz in person once in a while.</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/03/28/open-commons-region-linz-is-starting/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Upcoming Talk, Tsinghua University: Open Source Research</title>
		<link>http://dirkriehle.com/2011/03/01/upcoming-talk-tsinghua-university-open-source-research/</link>
		<comments>http://dirkriehle.com/2011/03/01/upcoming-talk-tsinghua-university-open-source-research/#comments</comments>
		<pubDate>Tue, 01 Mar 2011 11:15:39 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Announcement]]></category>
		<category><![CDATA[Open Collaboration]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Presentation]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Wikimedia]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2138</guid>
		<description><![CDATA[报告题目 Open Source Research 报告人 Prof. Dr. Dirk Riehle, University of Erlangen, Germany 时间 2011年03月17日（周一） 10:00am-noon 地点 FIT大楼 4-302 Abstract:&#160;Open source is not just software but also represents a new approach to software development. This type of software development is &#8230; <a href="http://dirkriehle.com/2011/03/01/upcoming-talk-tsinghua-university-open-source-research/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<table style="width:95%">
<tr>
<td>报告题目</td>
<td>Open Source Research</td>
</tr>
<tr>
<td>报告人</td>
<td>Prof. Dr. Dirk Riehle, University of Erlangen, Germany</td>
</tr>
<tr>
<td>时间</td>
<td>2011年03月17日（周一）  10:00am-noon</td>
</tr>
<tr>
<td>地点</td>
<td>FIT大楼  4-302</td>
</tr>
</table>
<p><span id="more-2138"></span></p>
<p><strong>Abstract:</strong>&nbsp;Open source is not just software but also represents a new approach to software development. This type of software development is different from traditional plan-driven and agile methods and scales up to the largest project sizes. In this talk, I&#8217;ll show how open source differs from prior approaches and addresses questions of globally distributed software development. I&#8217;ll first present surprising results from quantitatively analyzing open source projects that show how open source software development actually proceeds. I&#8217;ll then use these insights to motivate new software engineering tools and show some examples. Finally I&#8217;ll discuss how software forges, a novel type of platform for collaborative software development, can complement current project management approaches to improve code reuse and knowledge sharing and to more effectively use developer resources.</p>
<p><strong>Biography:</strong>&nbsp;Prof. Dr. Dirk Riehle, M.B.A., is the Professor for Open Source Software at the Friedrich-Alexander University of Erlangen-Nürnberg. Before joining academia, Riehle led the Open Source Research Group at SAP Labs, LLC, in Palo Alto, California (Silicon Valley). Riehle founded the Wiki Symposium, a conference dedicated to wiki research and practice. He was also the lead architect of the first UML virtual machine. He is interested in open source software engineering and agile methods, complexity science and human collaboration, and software design. Prof. Riehle holds a Ph.D. in computer science from ETH Zürich and an M.B.A. from Stanford Business School. He welcomes email at dirk@riehle.org, blogs at http://dirkriehle.com, and tweets as @dirkriehle.</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/03/01/upcoming-talk-tsinghua-university-open-source-research/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Curating, Preserving, and Showing Software at the Computer History Museum</title>
		<link>http://dirkriehle.com/2011/02/11/curating-preserving-and-showing-software/</link>
		<comments>http://dirkriehle.com/2011/02/11/curating-preserving-and-showing-software/#comments</comments>
		<pubDate>Fri, 11 Feb 2011 16:50:16 +0000</pubDate>
		<dc:creator>Dirk Riehle</dc:creator>
				<category><![CDATA[Education]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Wikimedia]]></category>

		<guid isPermaLink="false">http://dirkriehle.com/?p=2041</guid>
		<description><![CDATA[Last Saturday I visited the Computer History Museum&#8217;s new exhibition &#8220;R&#124;Evolution: The First 2000 Years of Computing&#8221;. The exhibition is fantastic, and they&#8217;ve come a long way from the early days of their &#8220;visible storage&#8221; exhibition. If you live in &#8230; <a href="http://dirkriehle.com/2011/02/11/curating-preserving-and-showing-software/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Last Saturday I visited the <a href="http://computerhistory.org">Computer History Museum&#8217;s</a> new exhibition &#8220;R|Evolution: The First 2000 Years of Computing&#8221;. The exhibition is fantastic, and they&#8217;ve come a long way from the early days of their <a href="/humorous-takes/fun-photos/spaghetti-code-then.html">&#8220;visible storage&#8221;</a> exhibition. If you live in or visit the Silicon Valley, I highly recommend you pay it a visit.</p>
<p>That said, every time I visit the museum, I ask about the state of curating, preserving, and showing not only hardware, but also software. Like most exhibitions, the R|Evolution exhibition focusses on physical objects and complements them with textual explanations on plates as well as videos. Software is being discussed in the Software Theatre and in some smaller videos. However, these videos are about software and programming in general, not about actual software artifacts. Software is mostly shown through physical objects, i.e. the boxes they came in as packaged software.</p>
<div align="center">
<img width="100%" src="/wp-content/uploads/2011/02/P2060079.jpg" /></p>
<p>Packaged Software Box Arc</p>
</div>
<p><span id="more-2041"></span></p>
<p>This is hardly a form of presentation that brings out the beauty of software code, how software works, and how paradigms changed over time. Also, I saw no presentation of non-packaged software and open source and open data was completely missing.</p>
<p>The significance of software is being recognized though. At the exit of the exhibition, you&#8217;ll find a video wall with short snippets and statements of notable people, and my impression was that whenever they focussed on what computers do for us, software was at the forefront.</p>
<div align="center">
<p><img src="/wp-content/uploads/2011/02/P2060086.jpg" /></p>
<p>Grady Booch at R|Evolution exhibition</p>
</div>
<p>This brings me to the question of software preservation and show-casing. Clearly, the way it is currently being done is sub-optimal. Videos and boxes don&#8217;t really convey much of what people need to understand about software.</p>
<p><!--</p>
<p>From a couple of email exchanges back in 2002 I remember that Grady Booch was trying to drive the Computer History Museum to get serious about software preservation and show-casing it. When I went to see the Superbowl game last Sunday at Richard Gabriel's place, I asked him about it. The story he told me was exasperating:</p>
<blockquote<br />
Back then the museum had formed a task force to tackle the issue of software preservation. Grady had brought me onto the committee, but all the committee ever did was talk about ontologies for capturing and cataloging software and related information. At the same time, many great artifacts remained untouched in some storage space. Upon my insistence we finally decided to take a look at the many stored boxes to assess the actual state of the software. As an example, in one box, we found tapes marked as containing the Whirlwind software. The state of the tapes was abysmal, they had been slowly rotting away, and it seemed difficult to get to the actual digital data on them. Upon further inquiry, I was told that before the museum had put the tapes into their current storage space, they had been kept in some aluminum shed basking in the California sun. I quit the committee soon thereafter, and nothing much seems to have happened since then.<br />
</blockquote</p><p>--></p>
<p>I looked at the CHM&#8217;s <a href="http://softwarehistory.org">Software History</a> site to see whether they have any progress to report but this doesn&#8217;t seem to be the case: The last report dates 2004 and the site&#8217;s footer itself says 2007. Seems this effort is stalled.</p>
<p><!--</p>
<p>It is disappointing to hear that important software and data has not yet been converted from their original fragile physical form into better preservable forms and is literally rotting away.</p>
<p>--></p>
<p>Preserving the raw data is only the obvious first step. Preservation and show-casing seem interesting topics to me. Software doesn&#8217;t mean much without context, so you&#8217;d need to collect emulators, configuration information, and usage patterns to run the software. But even running software doesn&#8217;t help much, and it certainly doesn&#8217;t speak to the average museum visitor. So you&#8217;d want visualization of program execution, configuration management history, etc. A rich topic, and I encourage everyone to ask the museum staff about the state of software preservation.</p>
<p>I hope that the next exhibition at the CHM will be able to unlock the beauty of the Linux Kernel&#8217;s code, Microsoft Word&#8217;s interaction design, and Apache&#8217;s httpd subversion history in front of my eyes.</p>
]]></content:encoded>
			<wfw:commentRss>http://dirkriehle.com/2011/02/11/curating-preserving-and-showing-software/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

