The Aust Gate

Weeknotes: Pylons, Python and printing

I’ve been doing some more work to the Open Correspondence website (which is now functional thanks to Rufus Pollock’s help). In part I’ve been cleaning up the urls for the data controller (which is still coming along) and trying to tie the views in together. Being happier with Apache and PHP I spent some time […]

May 30, 2010 – 10:22 am | By iain_emsley | Posted in Open Knowledge | Tagged open_correspondence, open_literature, printing, Python | Comments (0)

Weeknotes: Data mining, XML and bibliographies

It seems to be have been a week of frantic completion and refactoring. The first half was spent frantically converting html pages into PDFs using Verypdf’s HTMLtools server product. All in all the manual is very helpful and the test server could be set up quickly. It might have helped the other end if I’d […]

May 23, 2010 – 10:57 am | By iain_emsley | Posted in Information Retrieval, Open Knowledge, projects | Tagged open_bibliography, open_correspondence, rdf, redis | Comments (0)

Weeknotes: Redis, RDF, rdflib and openletters

I’ve been trying to play catch up this week at work. One of the projects that I’ve been working on is the temporary storage of information. For one reason or another, one of the workers has decided to occasionally throw a fit and not do its job properly (on top of a connection that appears […]

May 15, 2010 – 2:57 pm | By iain_emsley | Posted in Open Knowledge, projects | Tagged open_correspondence, redis | Comments (0)

Date set for Textcamp

The provisional date for Textcamp has been set for August 21st on the twitter feed.

May 5, 2010 – 8:45 am | By iain_emsley | Posted in Open Knowledge | Tagged open_literature, textcamp | Comments (0)

Data curation in real time

Robert Scoble’s blog has this intriguing post on real-time curation which has made me think. At the moment I’m working in curating and archiving gigabytes of information at work (and usually on ways of generating more data from the systems). Whilst this is not necessarily real time, I’d like it to be or at least […]

April 1, 2010 – 8:29 pm | By iain_emsley | Posted in Information Retrieval | Comments (0)

A change to the Letters project

During the previously blogged dinner with Ben and Rufus, we talked about the nascent work on the letters project. Both have “encouraged” me (it didn’t take too much persuasion, it must be said) to move the project to the Open Knowledge Foundation and to port it to Python with a Redis backend rather than the […]

March 28, 2010 – 11:19 am | By iain_emsley | Posted in Open Knowledge, projects, Text Mining | Tagged letters | Comments (0)

Textcamp announced

Had dinner with Rufus Pollock and Ben O’Steen on Monday in Oxford. As part of the dicussions, the notion of Textcamp was raised and Ben has created the Textcamp website with an associated blog. It is a slightly bigger concept than I had had but the approach, I think, will allow the creation of a […]

March 28, 2010 – 11:16 am | By iain_emsley | Posted in Information Retrieval, Text Mining | Tagged textcamp | Comments (0)

Exporting and querying Dickens data

As a follow up to the posting regarding the propsed ontology, I’ve started to try and create a SPARQL endpoint. At some point soon, I want to use the new version of ARC as the version I’ve got here is a little out of date. After that the next thing should be to allow the […]

March 21, 2010 – 12:15 pm | By iain_emsley | Posted in Information Retrieval, projects | Tagged charles dickens, rdf | Comments (0)

Creating the text ontology

I’ve been working quietly on ideas for an ontology to describe relationships in a letter from the correspondent to people referred in the text. It is intended to complement and extend the Dublin Core and Foaf (Friend of a Friend) namespaces. Anyhow I’ve decided to publish a first set of thoughts on it having sat […]

March 18, 2010 – 8:34 pm | By iain_emsley | Posted in Open Knowledge | Tagged ontology, rdf | Comments (3)

Growing and using data

Just seen an article on Techcrunch by Bradford Cross of Flightcaster regarding the growth of data on the Web. He appears to argue that data and its uses will drive the Web soon, writing: the data age is less about the raw size of your data, and more about the cool stuff you can do […]

March 17, 2010 – 7:57 pm | By iain_emsley | Posted in Information Retrieval | Tagged data mining | Comments (0)

The Aust Gate

Weeknotes: Pylons, Python and printing

Weeknotes: Data mining, XML and bibliographies

Weeknotes: Redis, RDF, rdflib and openletters

Date set for Textcamp

Data curation in real time

A change to the Letters project

Textcamp announced

Exporting and querying Dickens data

Creating the text ontology

Growing and using data

Elsewhere on the web

Categories

Archives

Search

Open Knowledge

RSS Feeds

Meta