Wednesday, April 28, 2010

Libdem manifesto key concepts


And finally the key concepts for the Libdem manifesto.

Labour key concept cloud


And here's the key concept cloud from the Labour manifesto ... there seems to be much more variety of concepts here and hence the cloud is much bigger. I had to shrink it further to get it all in one screenshot.

Conservative key concept cloud


In addition to key words, Wmatrix can produce key concepts by comparing a frequency list of the semantic fields automatically tagged in the data against a reference corpus, here again the BNC written sampler. This cloud shows the statistically key concepts in the Conservative manifesto.
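For anyone curious how this kind of comparison can be computed, here is a minimal sketch in Python. It assumes the log-likelihood statistic, a common choice for keyness, and a cut-off of 6.63 (roughly p < 0.01 with one degree of freedom); the post itself does not say which statistic or threshold was used. The function names and the semantic tag labels in the usage lines are made up for illustration and are not Wmatrix's own.

```python
import math


def log_likelihood(freq_study, size_study, freq_ref, size_ref):
    """Log-likelihood keyness score for one item across two corpora."""
    total = size_study + size_ref
    combined = freq_study + freq_ref
    # Expected frequencies if the item were spread evenly over both corpora.
    expected_study = size_study * combined / total
    expected_ref = size_ref * combined / total
    score = 0.0
    for observed, expected in ((freq_study, expected_study),
                               (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            score += observed * math.log(observed / expected)
    return 2 * score


def key_items(study_counts, ref_counts, threshold=6.63):
    """Return (item, score) pairs overused in the study corpus, highest first.

    The threshold is an assumed cut-off, not one taken from the post.
    """
    size_study = sum(study_counts.values())
    size_ref = sum(ref_counts.values())
    results = []
    for item, freq_study in study_counts.items():
        freq_ref = ref_counts.get(item, 0)
        overused = freq_study / size_study > freq_ref / size_ref
        score = log_likelihood(freq_study, size_study, freq_ref, size_ref)
        if overused and score >= threshold:
            results.append((item, score))
    return sorted(results, key=lambda pair: pair[1], reverse=True)


# Hypothetical semantic-field counts for a manifesto and a reference corpus.
manifesto = {"Politics": 50, "Other": 50}
reference = {"Politics": 5, "Other": 95}
for tag, score in key_items(manifesto, reference):
    print(tag, round(score, 2))
```

The same function works for key words or key concepts; only the items being counted change (word forms in one case, semantic tags in the other).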

Tuesday, April 20, 2010

TEI versions of UK election manifestos

Meanwhile, somewhere deep in France with a laptop, Lou Burnard has created TEI-encoded versions of the UK election manifestos, and has tagged and lemmatised them with TreeTagger. Download them from http://ucrel.lancs.ac.uk/wmatrix/ukmanifestos2010/TEIversion/

Thanks Lou!

Updated Libdem manifesto and cloud


Lou Burnard spotted some conversion errors in the Libdem manifesto (extra spaces after ligatures, e.g. 'fi'), and Martin has now changed smart quotes to straight ones. The new text version of the Libdem manifesto is at http://ucrel.lancs.ac.uk/wmatrix/ukmanifestos2010/ and here is the updated key word cloud. You'll notice the main difference is that "Britains" is no longer key: it was actually "Britain's", and now that the apostrophe is fixed, its frequency is combined with that of "Britain".
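As a rough illustration of the smart-quote fix, here is a small Python sketch that maps curly quotes to straight ones before a text is tagged. It is not the script that was actually used on the manifestos, and the function name is invented for this example; it also leaves the ligature spacing problem alone.

```python
# Map the four common curly quote characters to their straight equivalents.
SMART_TO_STRAIGHT = str.maketrans({
    "\u2018": "'",   # left single quotation mark
    "\u2019": "'",   # right single quotation mark (also the curly apostrophe)
    "\u201c": '"',   # left double quotation mark
    "\u201d": '"',   # right double quotation mark
})


def straighten_quotes(text):
    """Return text with curly quotes replaced by straight ones."""
    return text.translate(SMART_TO_STRAIGHT)


print(straighten_quotes("Britain\u2019s future"))
```

With a straight apostrophe in place, a tokeniser has a better chance of treating "Britain's" as "Britain" plus a possessive marker rather than as a separate word form.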
