HAMLET II
Analysis of joint frequencies
of words in a text



Polonius: "What do you read, my lord?"
Hamlet: "Words, words, words."
W.Shakespeare : Hamlet (II,ii,194)

General Principles

The main procedure HAMLET II - Joint Frequencies searches a text file for words in a given vocabulary list, and computes matrices of raw and standardised joint frequencies with respect to a chosen unit of context, or of joint occurrences within a given number of words. The procedure should be used when there are good grounds for searching for inter-connections between a number of key words, the spelling of which in the original text is already reasonably well known. Alternatively, it is possible to attempt a completely empirical preliminary identification of 'nodal' words in a given text, according to the density of their co-occurrences within a given unit of context. These can then form the basis of a suitable vocabulary list for more advanced use.

HAMLET II includes a Cluster Analysis procedure and a Correspondence Analysis option to explore word-associations in individual texts. The multi-dimensional scaling procedure MINISSA can be applied to the matrix of joint word frequencies in a text. Configurations of word usage derived in this way from a number of texts can be compared using Procrustean Individual Differences Scaling (PINDIS). A series of matrices derived from different texts can also be compared using Individual Differences Scaling (INDSCAL).  The unique graphics of  HAMLET II summarise the results of each of these analyses, for inclusion in other documents and reports.

The options WORDLIST and COMPARE may first be selected to help to determine the broad characteristics of word usage in the text(s) of interest. KWIC offers Key-Word-In-Context listings for any given string. If you are uncertain which words to specify as a vocabulary list in HAMLET II, choose WORDLIST to determine those which occur most frequently in the text. COMPARE lists words common to pairs of texts, and is also useful in generating lists including synonyms for use with HAMLET II  in comparing a number of texts.  These are intended as simple free-standing tools for the exploration of word usage which can be applied to any text file with a minimum of fuss and can help to generate vocabulary lists for analysis of joint occurrences with the main HAMLET II - Joint Frequencies procedure.

When a vocabulary list has been defined which is considered suitable for use with a number of texts, a fast-track procedure can be used to automatically carry out all stages of their comparison using PINDIS or INDSCAL.

References and further reading