Text Mining – The Aust Gate

Category Archives: Text Mining

Extracting Music Streams from Printouts

I have gone back to work on the Eric Sunderland archive. I also sent in a poster abstract to DMRN + 20 (Digital Music Research Network) with initial comments. It will become part of a talk to be given next year. A focus for this trip was to take some more photos of the printouts […]

November 16, 2025 – 8:04 pm | By iain_emsley | Posted in Programming, Text Mining | Tagged sound, text_mining | Comments (0)

Strava, segments, and tracking

A few years ago, Strava visualised the GPS co-ordinates in their data and displayed the locations of secret bases. A change of privacy settings later and, apparently, all was secret again. The Guardian has just run a story on using segments and GPS locations to show individuals within the bases through re-purposing the segment function. […]

June 21, 2022 – 7:58 am | By iain_emsley | Posted in algorithms, Text Mining | Tagged postdigital | Comments (0)

Jane Austen’s word choices

A Facebook friend had a link to an NY Times piece on Jane Austen’s word choices. Using Franco Moretti’s techniques, it begins showing how Digital Humanities can be useful. There are one of two of his books that I am waiting for before I can get into the pros and cons but I do have […]

July 9, 2017 – 1:15 pm | By iain_emsley | Posted in Text Mining | Tagged digital_humanities | Comments (0)

A simple experiment in Sound and Vision for Hamlet

The aim of this hack is to explore turning the structures of the First Folio texts marked up using Text Encoding Initiative XML (TEI) into notes using the Chuck , PHP and Processing languages. I wanted to explore the processes for transforming the texts for the user and explore different ways of presenting the textual […]

May 4, 2015 – 11:44 am | By iain_emsley | Posted in projects, Text Mining | Tagged data mining, php, sonification | Comments (0)

Harmonising the Heterogeneous at Cultures of Knowledge

Harmonising the Heterogeneous at the Cultures of Knowledge seminar series with Eero Hyvönen. Notes are unedited. Two forms of the Web : WWW for humans, GGG (Giant Global Graph) for data. Core data set 1048 data sets and 59 billion triples. Google’s Knowledge Graph and Microsoft’s Satori – graph engines in the search giants. Why […]

November 17, 2014 – 6:25 pm | By iain_emsley | Posted in Information Retrieval, Text Mining | Tagged linked_data | Comments (1)

Future of Editing – some reflections on Nicole Pohl on Sarah Scott

The seminar in today’s The Future of Editing series, “An Editor’s duty is indeed that of most danger’ (Piozzi): editing Sarah Robinson Scott“, by Nicole Pohl that the Bodleian Digital Library Systems and Services is holding at the Oxford e-Research Centre was a thought provoking one in terms the questions raised a series of points […]

November 12, 2014 – 9:39 pm | By iain_emsley | Posted in publishing, Text Mining | Tagged editing, open_correspondence, open_data | Comments (0)

Transcribing Bentham seminar notes

Melissa Terras talked about the Transcribing Bentham , a collaborative project to transcribe the volumes of Bentham, at University College London at the first seminar in the Cultures of Knowledge seminars. Bentham believed in education for all who could afford it in London. UCL has 60,000 volumes and BL has 30,000. 40,000 volumes were untranscribed […]

October 20, 2014 – 5:31 pm | By iain_emsley | Posted in Programming, Text Mining | Comments (0)

A quick skim into mining Twitter data

This is a variant on the text prepared for a short talk at the Open Science evening at the Oxford e-Research Centre on Wednesday 27th November. Peter Murray-Rust also spoke at the event on the AMI software and the Chemical Tagger. This is a brief talk about some work that I have been doing in […]

November 30, 2013 – 8:33 pm | By iain_emsley | Posted in Programming, Text Mining | Tagged twitter | Comments (0)

Weeknotes – Scripting and scraping

It has been a while since I last posted a week note, so I thought I would try and get back in the habit. I’ve been involved in glueing together profiling tools to run so that I can have a vaguely generic framework to profile software at the IO level and the CPU level. Shell […]

August 23, 2013 – 11:40 am | By iain_emsley | Posted in Text Mining, weeknotes | Comments (0)

Attending the Open Humanities Hack

I’ve just come back from a couple of excellent days of Humanities Hacking, organised by the King’s College, London Digital Humanities department and the Open Knowledge Foundation. To be fair, it went slightly differently than I thought it would. After an interesting start trying to find the room we were in, a few of us […]

November 22, 2012 – 8:26 pm | By iain_emsley | Posted in Open Knowledge, Programming, Text Mining | Tagged digital_humanities, javascript, visualisation | Comments (0)

The Aust Gate

Category Archives: Text Mining

Extracting Music Streams from Printouts

Strava, segments, and tracking

Jane Austen’s word choices

A simple experiment in Sound and Vision for Hamlet

Harmonising the Heterogeneous at Cultures of Knowledge

Future of Editing – some reflections on Nicole Pohl on Sarah Scott

Transcribing Bentham seminar notes

A quick skim into mining Twitter data

Weeknotes – Scripting and scraping

Attending the Open Humanities Hack

Elsewhere on the web

Categories

Archives

Search

Open Knowledge

RSS Feeds

Meta