Posts Tagged ‘open_literature’

Weeknotes: All quiet on the accounting front

Sunday, June 27th, 2010

It’s been a week of relative frustration with priorities suddenly being shifted and the infrastructure road map looking more and more unclear.

The soap server is largely debugged and ready for more extensive testing on the server and the back end has now been rewritten to capture more data. I cannot help feeling that it will change once more services go online to scale more efficiently but right now I don’t have the expertise to do it. I’ll get there.

On a different tack, I’m back on the accounting project that I was on several months ago and making some headway in that. Its grown since I was last involved in it but nothing that a decent set of specs and roadmaps cannot solve in terms of making it manageable.

I’ve been thinking about my next book project which is on the New Weird and genre over the last 15 years and wondering how to use dbpedia’s influencedBy and influence terms in terms of showing how writers influence each other over a century. I’m tempted to put the data into a large rdf sheet and then use javascript or PHP to transform it into JSON to see if you can use the Simile timeline software usefully or if I need to find / write something more appropriate. It does have to wait for me to finish the current book.

I forgot to link to the Open Correspondence blog post on the Open Knowledge Foundation’s blog which was posted a few days ago.

Weeknotes: Pylons, Python and printing

Sunday, May 30th, 2010

I’ve been doing some more work to the Open Correspondence website (which is now functional  thanks to Rufus Pollock’s help). In part I’ve been cleaning up the urls for the data controller (which is still coming along) and trying to tie the views in together. Being happier with Apache and PHP I spent some time looking for how to rewrite the urls until I came across Andre Kollel’s blog post about the internal workings of the middleware in the Pylons framework.  The more I do on the project, the more I learn about both Python and Pylons.

One of the next things to do is to reformat the dates into human readable format. I had thought of using Python’s datetime strftime to reformat the date from its current ISO format (YYYY-MM-DD) into day, month year. Unfortunately, the method states ” years before 1900 cannot be used.” A slight cramp in the plan. However there is an Activestate recipe by Andrew Dalke which might do the trick or at least point me in the right direction. It is one of the things to be tidied up at some point.

It is a good feeling to have the site running now. The next task is to write the tests and then  to refactor the code. It is very PHPish and needs to be made more Pythonic. I’ve got an idea for trying to create a dendrogram around the textReferred element and to discover the letters and correspondents around the books that Dickens was writing. One of the tings is to continue loading the other volumes of Dickens’s letters into the site. So version 0.2 is a little way off but the light at the end of the tunnel is not a train this time.

Workwise has been a little hectic. I must make some time to write a method to allow our admin team to resubmit applications. Like so many things it is a balance between a five minute job and the two hour ones that need to be done. The major job for the week though was getting the automated printing working.

One of the jobs that admin do is to go through each client and create the packs for them. Using HTMLtools, I’ve managed to compile the html into PDF and then convert the PDF into a PostScript file for a printer. I’ve managed to use the Line Printer Remote protocol to send the job to the printer. It is a simple enough command:

lpr -S <ip address/name of printer>  -P <name of print job> (-o <optional -o 1 sets file to binary>) <name of file>

Windows doesn’t appear to support the full protocol but enough to be useful. The -o switch appears to only define whether the file is binary or not rather than specifying the paper type and so on. Annoying but it can be got around.

Anyhow it got me thinking about other ways of using commands to explore how texts can be converted and changed into useful objects. It brings me back to the use of psbook for printing but how to make it useful for an average user who does not necessarily want to run various commands. Having had a conversation with my friend Darren Nash ,editorial director of Orbit books,  about the future of publishing; he opined that small presses would come to the fore. I think, certainly in genre that this is correct. It would be interesting to see how existing tools could be used towards these ends rather than constantly re-invent the wheel.

Now that the first version of letters is out the way, time to go over other projects. I’ve got a yen to try and create something from Milton’s Areopagitica, appropriate I think as it is a cry for free presses.

Date set for Textcamp

Wednesday, May 5th, 2010

The provisional date for Textcamp has been set for August 21st on the twitter feed.

Digitising books and mumblings on open literature

Sunday, February 14th, 2010

Robert McCrum, an associate editor of the Observer, has this remarkably sane blog post regarding the nature of digitisation and Google Books. Perhaps it is only my interpreation but it does seem to be a slight volte face on his part, as I’ve always interpreted his stance as slightly anti-digitised books.

Having read Adrian Johns’ book Piracy, he considers that

From print culture’s beginnings to the rise of the internet, there has been a succession of intellectual property wars for which the English language has just one word: piracy.

There is a temptation to see digitisation and free culture as piracy. It is certainly easier in the short term to do so but that ignores the wider issues of remix and re-use. Print culture was no stranger to piracy. McCrum uses the second book of Don Quixote where the errant knight escapes from the unauthorised translations and re-uses in a print culture which was far more limited than it is now (although one which was broadening through the use of printing).

Google Books has just stepped in and is doing what publishers and libraries should have been doing: making books available for a new medium. Granted this effort will be expensive (I would have thought) but in the long run it makes sharing and reusing knowledge easier and more useful. This aids intellectual thought and allows ideas and research to be more rigorous.

I’m not sure I share McCrum’s equation of free culture with piracy. He writes that

sometimes piracy can be an engine of social and intellectual innovation as much as it has been an enemy of authors’ rights.

We need to come to a better cultural and social understanding of piracy and free culture. I do believe in sharing and reuse but not piracy in terms of stealing. Google Books is in the vanguard of this understanding but believers in open literature should be working to extend this and ensuring that we do not swap one hegemony for another.

In his piece on the Guardian’s Comment is Free David Drummond, Google’s senior vice president for corporate development and chief legal officer, wrote:

If you love books and care about the knowledge they contain, there is a problem that needs to be solved. Somewhere in the region of 175m books exist in the world today. A tiny fraction of those are in print and for sale in bookshops or on the web. ­Another small portion are so old that they are out of copyright and anyone can use them.

This is the core problem and Google are indeed to be praised for doing this and, to a certain extent, publishers and libraries damned. But it is a question of expense and Google have the funds and technical expertise whilst libraries may not.

They are indeed trail blazing and it is easy to write jeremiads against this (I’m aware that this blog post might come across as this). I like the fact that books are being made available and appreciate that there is a cultural clash and legal issues which needs to be dealt with such as orphan works and monetisation for the concerned parties. All parties need to address these issues. I doubt that there will be a mechanism for orphan works which satisfies every one and that it will need to be adapted for each legal jurisdiction but motion towards is better than no motion at all.

Drummond writes:

The truth is that readers around the world who seek the information locked in millions of out-of-print books currently have little choice other than to travel to a small number of libraries in the hope of finding what they are looking for. And if you’re an author, you have no way to make money from your work if it’s out of print.

That paragraph brings up at least two points. There are a small number of academic research libraries like the Bodleian or Cambridge University Library and independent researcher not attached to academic institutions can be locked out of them. There is the marvellous (and this has meant that I’ve had access to several books that othersie I could not get hold of) inter-library loans service that local libraries can offer. When I went to university, one of the first things I was advised to do was to register with the local library. At the time I didn’t use it a huge amount but now… They are lifelines and also help the use case argument for libraries, i.e you use them, you keep them.

How do we get ourselves out of the quagmire that we begin to see ourselves in? At the moment, I don’t know but I’m trying to stumble towards some sort of answer.

Firstly we need to break out the idea that free/remix/reuse culture = piracy. It does not, Mr McCrum. Piracy is perhaps one subset or occasional union but it does represent one view. Of course piracy exists and it always will.

Secondly we need a wider debate as users and providers as to what we want.

Thirdly I think we need to experiment with models. Google provides one way of doing it but not, I think, the only one.