Beavan, D;
(2008)
Visualisation of textual data through collocate clouds.
Presented at: Finding the hidden knowledge: text mining for biology and medicine, Glasgow, UK.
Preview |
PDF
Visualisation of textual data through collocate clouds - David Beavan.pdf Available under License : See the attached licence file. Download (1MB) |
Abstract
Focussing on a particular term (node word) the entire corpus is searched, finding all its collocates (surrounding words) by retrieving the five words preceding the node word, and five words following it. These are then aggregated and presented to the user in cloud form, listing the one hundred most frequent collocates in alphabetical order, showing the frequency of them by varying the font size. In addition, collocational strength (the likelihood of two words co-occurring) is shown as brightness. Therefore collocates which are large and bright are found frequently and principally around the node word. These clouds promote browsing of the resource, as each collocate can be clicked on to form the node word of a new cloud. This allows the user to explore the language used in the corpus, and how terms interact with each other. While primarily aimed at language research, this visualisation may be useful to the text mining community.
Type: | Poster |
---|---|
Title: | Visualisation of textual data through collocate clouds |
Event: | Finding the hidden knowledge: text mining for biology and medicine |
Location: | Glasgow, UK |
Dates: | 21-22 February 2008 |
Open access status: | An open access version is available from UCL Discovery |
Language: | English |
Additional information: | © 2008 The Author |
UCL classification: | UCL > Provost and Vice Provost Offices UCL > Provost and Vice Provost Offices > UCL SLASH |
URI: | https://discovery.ucl.ac.uk/id/eprint/1404372 |
Archive Staff Only
View Item |