<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Aust Gate &#187; xml</title>
	<atom:link href="http://austgate.co.uk/tags/xml/feed/" rel="self" type="application/rss+xml" />
	<link>http://austgate.co.uk</link>
	<description>Open Knowledge and Literature</description>
	<lastBuildDate>Mon, 23 Jan 2012 18:10:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Marking up Open Correspondence with TEI XML</title>
		<link>http://austgate.co.uk/2011/03/marking-up-open-correspondence-with-tei-xml/</link>
		<comments>http://austgate.co.uk/2011/03/marking-up-open-correspondence-with-tei-xml/#comments</comments>
		<pubDate>Sun, 20 Mar 2011 11:03:26 +0000</pubDate>
		<dc:creator>iain_emsley</dc:creator>
				<category><![CDATA[Open Knowledge]]></category>
		<category><![CDATA[projects]]></category>
		<category><![CDATA[Text Mining]]></category>
		<category><![CDATA[open_correspondence]]></category>
		<category><![CDATA[tei]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://austgate.co.uk/?p=303</guid>
		<description><![CDATA[As part of the next version of Open Correspondence, I&#8217;ve been working on the XML and JSON mark-up. As part of the XML, I&#8217;ve been using the TEI mark-up for the letters. I once heard this described as &#8220;XML for people who don&#8217;t think XML is flexible enough&#8221;. Now I can see why. It is [...]]]></description>
			<content:encoded><![CDATA[<p>As part of the next version of <a title="Open Correspondence site" href="http://www.opencorrespondence.org" target="_blank">Open Correspondence</a>, I&#8217;ve been working on the XML and JSON mark-up.</p>
<p>As part of the XML, I&#8217;ve been using the <a title="TEI P5 XML mark-up" href="http://www.tei-c.org/release/doc/tei-p5-doc/en/html/DS.html" target="_blank">TEI mark-up</a> for the letters. I once heard this described as &#8220;XML for people who don&#8217;t think XML is flexible enough&#8221;. Now I can see why. It is a highly flexible solution to digitising texts but can be confusing, especially when switching between versions. I believe the original model that I had been working on was P4 but the current one is P5, so I had to negotiate that change and make sure that I had the correct elements in each block. Even then, there can be two or three different versions of the same element in a section, and I do wonder about the wisdom of that compared with simplifying the set down to extensible elements that may or may not be used. I&#8217;m intending to use the schema again and to really get my head around it rather than tinkering at the edges.</p>
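<p>To make the shape of a P5 letter a little more concrete, here is a minimal sketch in Python. It is not the Open Correspondence code: the helper name <code>build_letter</code>, its arguments and the placeholder header text are all assumptions made for illustration. It only uses the standard P5 namespace and a handful of TEI elements (a header, then an opener, paragraphs and a closer inside a letter division), and it has not been validated against the schema.</p>

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"  # namespace used by TEI P5


def build_letter(title, salute, paragraphs, signed):
    """Return a TEI root element wrapping one letter (hypothetical helper)."""
    # Serialise TEI as the default namespace rather than with an ns0: prefix.
    ET.register_namespace("", TEI_NS)

    def tei(tag):
        # ElementTree expects namespaced tags in {uri}name form.
        return "{%s}%s" % (TEI_NS, tag)

    root = ET.Element(tei("TEI"))

    # Header: the three parts of fileDesc, filled with placeholder text.
    header = ET.SubElement(root, tei("teiHeader"))
    file_desc = ET.SubElement(header, tei("fileDesc"))
    title_stmt = ET.SubElement(file_desc, tei("titleStmt"))
    ET.SubElement(title_stmt, tei("title")).text = title
    publication = ET.SubElement(file_desc, tei("publicationStmt"))
    ET.SubElement(publication, tei("p")).text = "Placeholder publication note"
    source = ET.SubElement(file_desc, tei("sourceDesc"))
    ET.SubElement(source, tei("p")).text = "Placeholder source description"

    # Body: one division per letter, with opener, paragraphs and closer.
    text = ET.SubElement(root, tei("text"))
    body = ET.SubElement(text, tei("body"))
    letter = ET.SubElement(body, tei("div"), {"type": "letter"})
    opener = ET.SubElement(letter, tei("opener"))
    ET.SubElement(opener, tei("salute")).text = salute
    for paragraph in paragraphs:
        ET.SubElement(letter, tei("p")).text = paragraph
    closer = ET.SubElement(letter, tei("closer"))
    ET.SubElement(closer, tei("signed")).text = signed
    return root
```

<p>Calling <code>ET.tostring(build_letter(...), encoding="unicode")</code> gives a string that could then be checked against the P5 schema, which is where any mistakes in the choice of elements would show up.</p>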
<p>I&#8217;ve attempted this conversion before but think that I&#8217;ve finally got it to a point which is nearly there. What I would really like to do is to put together some sort of tool kit as a core to the Open Correspondence project. Clearly this would be a long-term project and would need more research but it might be useful to other projects.</p>
<p>As well as marking up texts, it would be useful to use the XML mark-up to convert the text into other formats such as Mobipocket or the Kindle formats to allow a user to create their own e-publication. It would also be useful to find a way of using the XML in conjunction with the <a title="psbook command pages" href="http://www.tardis.ed.ac.uk/~ajcd/psutils/psbook.html" target="_blank">psbook</a> command to create a print version of a letter or collection. This does mean that I need to convert the XML into a PostScript file (which raises a host of questions at the moment &#8211; such as how to convert a structured format into a layout format) and then print it.</p>
<p>I&#8217;ve also been playing around with the correspondent collections and the way of marking up collections in TEI. I had thought of this as working on creating printable collections and making the data re-usable for printing. Equally it might allow the data to be used in answer to Jonathan Gray&#8217;s question regarding identifying the letters written to a particular correspondent.</p>
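<p>As a rough illustration of the correspondent question, the sketch below groups letter records by recipient. The record layout (<code>id</code>, <code>author</code>, <code>recipient</code>) and the sample values are made up for the example and are not the project data model.</p>

```python
from collections import defaultdict


def letters_by_recipient(letters):
    """Group letter records by the person they were written to."""
    grouped = defaultdict(list)
    for letter in letters:
        grouped[letter["recipient"]].append(letter)
    # Return a plain dict so a missing name raises KeyError instead of
    # silently creating an empty collection.
    return dict(grouped)


# Made-up records standing in for rows from the real data.
sample_letters = [
    {"id": 1, "author": "Author A", "recipient": "Correspondent X"},
    {"id": 2, "author": "Author A", "recipient": "Correspondent Y"},
    {"id": 3, "author": "Author B", "recipient": "Correspondent X"},
]

grouped = letters_by_recipient(sample_letters)
ids_for_x = [letter["id"] for letter in grouped["Correspondent X"]]
print(ids_for_x)  # prints [1, 3]
```

<p>Each group could then be written out as its own collection, whether for printing or for re-use elsewhere.</p>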
<p>When I can get the XML working and validated, I&#8217;ll look at the JSON output. That would draw a line under this part of the project and allow me to move on. I&#8217;m aiming for a release towards the end of March or the middle of April, in keeping with a six-week release schedule.</p>
<p>The next thing after that is to begin answering Jonathan&#8217;s questions in terms of a tool kit to identify weaknesses and to try and write some code to re-use and re-mix the data. I would hope that would be in the next release towards the end of May.</p>
]]></content:encoded>
			<wfw:commentRss>http://austgate.co.uk/2011/03/marking-up-open-correspondence-with-tei-xml/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Weeknotes: documentation, prototyping and cats</title>
		<link>http://austgate.co.uk/2010/07/weeknotes-documentation-prototyping/</link>
		<comments>http://austgate.co.uk/2010/07/weeknotes-documentation-prototyping/#comments</comments>
		<pubDate>Sun, 11 Jul 2010 15:31:20 +0000</pubDate>
		<dc:creator>iain_emsley</dc:creator>
				<category><![CDATA[Open Knowledge]]></category>
		<category><![CDATA[weeknotes]]></category>
		<category><![CDATA[open_correspondence]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://austgate.co.uk/?p=181</guid>
		<description><![CDATA[I&#8217;ve spent most of the week trying to persuade colleagues that rewrites are needed to existing services. I&#8217;ve also finally managed to get the initial promise of working from home so hopefully I&#8217;ll be able to get the rewrite started on the &#8220;quiet&#8221; days away from the office. (Although the cat can drive me [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve spent most of the week trying to persuade colleagues that rewrites are needed to existing services. I&#8217;ve also finally managed to get the initial promise of working from home so hopefully I&#8217;ll be able to get the rewrite started on the &#8220;quiet&#8221; days away from the office. (Although the cat can drive me nuts before she goes to sleep at 10am).</p>
<p>Still working on the accounts project which keeps unravelling a series of underlying problems. Most of them we know about but they appear in all sorts of odd places.</p>
<p>Assuming the world doesn&#8217;t fall on my head next time I&#8217;m in the office, I&#8217;m going to try and spend the day at home on a &#8220;Fedex&#8221; day. I&#8217;m taking the notion from an issue of Wired where they were talking about different ways of working and Atlassian mentioned &#8220;Fedex&#8221; days where you spend a day building a prototype. What I&#8217;d really like to get prototyped is the service bus / queuing system. So fingers crossed.</p>
<p>The impetus came from updating the disaster recovery documentation and writing the first department of the service status documentation (which I wrote after getting the last bit of debugging finished). I know that documentation is not everybody&#8217;s favourite thing but I find it useful in rethinking the system and making sure it fits together.</p>
<p>I&#8217;ve made time to rewrite the load function for Open Letters. I&#8217;ve got the document building the letters in XML and have written a rough upload script. The next task is to rewrite the main.py script, test the XML loading and then finish tidying up the initial document.</p>
<p>I&#8217;m also looking forward to Textcamp so it&#8217;ll be great to get the load finished (as it normalises the function) and get on with doing a presentation for the camp.</p>
<p>I&#8217;m also coming to the end of writing my book on children&#8217;s fantasy. Whilst not technical in an IT sense, I&#8217;m thinking of the next project on the New Weird and how to use IT to visualise influences and timelines. The part that worries me is archiving the web pages needed for the research, which I need to look into as I&#8217;m not sure whether it is technically illegal.</p>
]]></content:encoded>
			<wfw:commentRss>http://austgate.co.uk/2010/07/weeknotes-documentation-prototyping/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Weeknotes: PHP, SOAP, and Open Letters</title>
		<link>http://austgate.co.uk/2010/06/weeknotes-php-soap-and-open-letters/</link>
		<comments>http://austgate.co.uk/2010/06/weeknotes-php-soap-and-open-letters/#comments</comments>
		<pubDate>Sun, 20 Jun 2010 10:15:22 +0000</pubDate>
		<dc:creator>iain_emsley</dc:creator>
				<category><![CDATA[weeknotes]]></category>
		<category><![CDATA[open_correspondence]]></category>
		<category><![CDATA[php]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://austgate.co.uk/?p=171</guid>
		<description><![CDATA[It has been a fairly quiet week with the boss away. I&#8217;ve managed to complete a service to upload details from spreadsheets sent via email. I&#8217;ve also managed to complete a SOAP service in PHP to listen for status updates and am just doing the final tests on it now. Once it&#8217;s up it can be [...]]]></description>
			<content:encoded><![CDATA[<p>It has been a fairly quiet week with the boss away. I&#8217;ve managed to complete a service to upload details from spreadsheets sent via email.</p>
<p>I&#8217;ve also managed to complete a SOAP service in PHP to listen for status updates and am just doing the final tests on it now. Once it&#8217;s up it can be repurposed for other companies. One of the things that I think will come up is how to store XML files most efficiently as MySQL 5 appears to be tied to uploading files rather than just taking POST strings. I&#8217;m thinking of using something like <a title="Oracle Berkeley DB XML" href="http://www.oracle.com/database/berkeley-db/xml/index.html" target="_blank">Oracle&#8217;s BDB XML</a> database (though the license appears to preclude our uses) or <a title="eXist sourceforge page" href="http://exist.sourceforge.net/index.html" target="_blank">eXist</a> but that is something to come back to much later.</p>
<p>I&#8217;ve been thinking about the Open Correspondence site and the best way to allow it to be extended by other people. I think that the best way forward is to create an internal XML format which the load command can use and which anybody can use to create their own files and databases. It&#8217;s along the lines of the work I partially did in the Open Shakespeare project.</p>
<p>Given the boss is away, next week is time for finishing more things off. I&#8217;ve also created a <a title="Trac website" href="http://trac.edgewall.org" target="_blank">Trac</a> instance for internal purposes but I think it&#8217;ll help with that bane of developing live &#8211; documentation.</p>
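<p>A minimal sketch of what such an internal format might look like, written in Python. The layout is purely an assumption for illustration: a <code>letters</code> root holding <code>letter</code> elements, each with an <code>id</code> attribute and <code>author</code>, <code>recipient</code> and <code>body</code> children. Nothing here is the format the project settled on.</p>

```python
import xml.etree.ElementTree as ET


def dump_letters(records):
    """Serialise letter records into the hypothetical internal XML layout."""
    root = ET.Element("letters")
    for record in records:
        letter = ET.SubElement(root, "letter", {"id": str(record["id"])})
        ET.SubElement(letter, "author").text = record["author"]
        ET.SubElement(letter, "recipient").text = record["recipient"]
        ET.SubElement(letter, "body").text = record["body"]
    return ET.tostring(root, encoding="unicode")


def load_letters(xml_text):
    """Parse the same layout back into dictionaries a load command could use."""
    root = ET.fromstring(xml_text)
    return [
        {
            "id": int(letter.get("id")),
            "author": letter.findtext("author"),
            "recipient": letter.findtext("recipient"),
            "body": letter.findtext("body"),
        }
        for letter in root.findall("letter")
    ]
```

<p>The point of the round trip is that anybody could write a file in this layout by hand or from their own data, and the same loader would read it without knowing where it came from.</p>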
]]></content:encoded>
			<wfw:commentRss>http://austgate.co.uk/2010/06/weeknotes-php-soap-and-open-letters/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

