Information Retrieval – The Aust Gate

Category Archives: Information Retrieval

Weeknotes: Documents and data

The main project this week (apart from hte onging one of moving and virtualising servers) is to begin work on our technical documents. I’m trying to move them onto the web and make the useful, not only in terms of reading about them but also to make them linkable. I’m trying to get them out […]

July 3, 2011 – 2:39 pm | By iain_emsley | Posted in Information Retrieval, weeknotes | Tagged documents, drupal, linked_data | Comments (0)

Research Databases in the Humanities

I went to the Research Databases in the Humanities workshop, organised by Sudamih, which was an excellent afternoon and time well spent. An Oxford heavy event, there were a number of interesting directions that came out of the afternoon. Firstly James Wilson, project manager of Sudamih at Oxford University Computing Services, outlined the Database as […]

January 23, 2011 – 12:11 pm | By iain_emsley | Posted in Information Retrieval | Tagged database, digital_humanities, rdf | Comments (2)

Searching Open Correspondence with Xapian

As part of the continuing work on Open Correspondence, I managed to install Xapian to act as a full text search engine. I’ve been looking to do this for a while and had started on working on a remote back end (as blogged here) but decided not to use it as it appears to have […]

January 9, 2011 – 3:01 pm | By iain_emsley | Posted in Information Retrieval, projects | Tagged search, xapian | Comments (0)

Finding the data signal in the noise

Marshall Kirkpatrick, on ReadWriteWeb, poses the question A web of infinite information: does that sound like a scary problem of “just too much”? in a “Mamas, Don’t Let Your Babies Grow Up to Be Data Wranglers” where he discusses an interview with Evan Williams on GigaOm. (I’m not going to discuss the interview here (but […]

December 30, 2010 – 9:40 am | By iain_emsley | Posted in Information Retrieval | Tagged data mining | Comments (0)

Hacking Arts Council data

I lost my hackday cherry yesterday and went to the Open Data hackathon to look at the South East arts council data found at the data.gov.uk site (http://data.gov.uk/dataset/grants-for-the-arts-awards-arts-council-england). Our hosts, White October, were fantastic and welcoming (and put the kettle on as soon as I came in!) and Incuna provided the much needed pizzas for […]

December 5, 2010 – 12:31 pm | By iain_emsley | Posted in Information Retrieval, Open Knowledge | Tagged arts_council, open_data, visualisation | Comments (2)

Weeknotes: Open Correspondence, Xapian and Linked Data

After last week’s server move, we discovered one or two things that needed to be changed before they could go live. The main thing was the Xapian search which I had been working on. The initial version kept the Xapian server on the local machine and used that to index and search the letters butt […]

November 7, 2010 – 10:58 am | By iain_emsley | Posted in Information Retrieval, projects, weeknotes | Tagged charles dickens, open_correspondence, xapian | Comments (0)

Tweeting changes with Node.js

As a break from Open Correspondence, I’ve been looking at node.js, the server side Javascript library. I’ve been thinking about the document stuff that I’ve been working on with Milton. One of the things that I had mooted as an idea was reading Twitter and pushing them back to the document. I’ve been playing with […]

November 3, 2010 – 9:01 pm | By iain_emsley | Posted in Information Retrieval, projects | Tagged node.js, twitter | Comments (0)

Weeknotes: Ubuntu, messaging and Open Correspondence

It has been a while since the last weeknotes. I’ve finally made the move to Linux, or at least dual booting, by installing Ubuntu so I’m currently learning a little the OS and getting a development environment set up for it. I’ve nearly finsihed the ongoing accounts project at work. The framework is up and […]

August 29, 2010 – 11:04 am | By iain_emsley | Posted in Information Retrieval, projects, weeknotes | Tagged javascript, messaging, open_correspondence, ubuntu | Comments (1)

Creating bibliographic resources from web pages

Given the increasingly digital nature of research, including not only websites but blogs, forums, wikis, the (in my view), beloved moleskin is becoming increasingly outdated. I’ve just finished writing my first book and had the joy of using moleskin notebooks to note down urls and make notes. I like moleskins a lot but pen and […]

August 15, 2010 – 6:52 pm | By iain_emsley | Posted in Information Retrieval, Open Knowledge, projects | Tagged archiving, warc | Comments (0)

Finding a space for NoSQL

ReadWriteWeb have a post on NoSQL (again?) by Audrey Watters which is a brief overview of the area. The original post points the Heroku blog, where Adam Wiggins outlines the uses of NoSQL. I’m not an expert by any means but use Redis on a daily basis with the Rediska PHP library. I remember having […]

July 20, 2010 – 7:11 pm | By iain_emsley | Posted in Information Retrieval | Tagged database, nosql, redis | Comments (0)

The Aust Gate

Category Archives: Information Retrieval

Weeknotes: Documents and data

Research Databases in the Humanities

Searching Open Correspondence with Xapian

Finding the data signal in the noise

Hacking Arts Council data

Weeknotes: Open Correspondence, Xapian and Linked Data

Tweeting changes with Node.js

Weeknotes: Ubuntu, messaging and Open Correspondence

Creating bibliographic resources from web pages

Finding a space for NoSQL

Elsewhere on the web

Categories

Archives

Search

Open Knowledge

RSS Feeds

Meta