1. Language Independent Methods of Clustering Similar Contexts (with applications)
Ted Pedersen
University of Minnesota, Duluth
[email_address]
http://www.d.umn.edu/~tpederse/SCTutorial.html
54. First Order Vectors of Unigrams

        child   magic   curse   black   island
x1        1       1       1       1       1
x2        1       1       0       1       0
x3        0       0       0       0       0
x4        1       0       1       0       1
56. First Order Vectors of Bigrams

      voodoo child   serious error   military might   island curse   black magic
x1         1               0               0                1              1
x2         1               0               0                0              1
x3         0               1               1                0              0
x4         1               0               1                1              0
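The binary first-order vectors can be illustrated with a minimal Python sketch. It assumes that a feature is marked 1 when it occurs anywhere in the context, that a bigram must be two adjacent words, and that context x1 is the example sentence used elsewhere in these slides. The names `tokenize` and `first_order_vector` are hypothetical and are not taken from the tutorial's software.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())

def first_order_vector(context, features):
    """Return a 0/1 vector marking which features occur in the context.

    Each feature is a tuple of words: length 1 for a unigram, length 2 for
    a bigram. Assumption: a bigram counts only when its words are adjacent.
    """
    tokens = tokenize(context)
    vector = []
    for feature in features:
        n = len(feature)
        # All contiguous n-grams of the context, as tuples of words.
        ngrams = {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}
        vector.append(1 if feature in ngrams else 0)
    return vector

x1 = "There was an island curse of black magic cast by that voodoo child."
unigrams = [("child",), ("magic",), ("curse",), ("black",), ("island",)]
bigrams = [("voodoo", "child"), ("serious", "error"), ("military", "might"),
           ("island", "curse"), ("black", "magic")]

print(first_order_vector(x1, unigrams))  # [1, 1, 1, 1, 1]
print(first_order_vector(x1, bigrams))   # [1, 0, 0, 1, 1]
```

Both printed vectors agree with the x1 rows of the unigram and bigram tables.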
60. Word by Word Matrix

            child   error   might   curse   magic
voodoo      120.0     0      69.4     0       0
serious       0      89.2     0      21.2     0
military      0      54.9   100.3     0       0
island       73.2     0       0     189.2     0
black        43.2     0       0       0     123.5
62. There was an island curse of black magic cast by that voodoo child.

            child   error   might   curse   magic
voodoo      120.0     0      69.4     0       0
island       73.2     0       0     189.2     0
black        43.2     0       0       0     123.5
65. There was an island curse of black magic cast by that voodoo child.

        child   error   might   curse   magic
x1       78.8     0      24.4    63.1    41.2
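The x1 vector appears to be the average of the word-by-word matrix rows for the context words that have a row (voodoo, island, black). The slide text does not name the operation, so averaging is an assumption here, and `second_order_vector` is a hypothetical helper. Under that assumption the sketch below reproduces the child, error, curse and magic entries to one decimal place; the might entry comes out as 23.1 rather than the 24.4 shown, so either that slide value or the averaging assumption may be slightly off.

```python
import re

# Word-by-word matrix from the slides; columns are the co-occurring words.
columns = ["child", "error", "might", "curse", "magic"]
word_matrix = {
    "voodoo":   [120.0, 0.0, 69.4, 0.0, 0.0],
    "serious":  [0.0, 89.2, 0.0, 21.2, 0.0],
    "military": [0.0, 54.9, 100.3, 0.0, 0.0],
    "island":   [73.2, 0.0, 0.0, 189.2, 0.0],
    "black":    [43.2, 0.0, 0.0, 0.0, 123.5],
}

def second_order_vector(context, matrix, width):
    """Average the matrix rows of every context word that has a row."""
    tokens = re.findall(r"[a-z]+", context.lower())
    rows = [matrix[t] for t in tokens if t in matrix]
    if not rows:
        # No known words in the context: return an all-zero vector.
        return [0.0] * width
    # zip(*rows) walks the selected rows column by column.
    return [sum(col) / len(rows) for col in zip(*rows)]

x1 = "There was an island curse of black magic cast by that voodoo child."
x1_vector = second_order_vector(x1, word_matrix, len(columns))
print([round(v, 1) for v in x1_vector])  # [78.8, 0.0, 23.1, 63.1, 41.2]
```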
68. First Order Vectors of Unigrams

        child   magic   curse   black   island
x1        1       1       1       1       1
x2        1       1       0       1       0
x3        0       0       0       0       0
x4        1       0       1       0       1
The main idea is to assume a null hypothesis of a single cluster and to test whether the data reject it in favor of the alternative hypothesis of k > 1 clusters.
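One well-known way to set up such a test is the gap statistic, which compares the within-cluster dispersion of the observed data with that of reference data drawn from a distribution with no cluster structure. The sketch below uses it as a stand-in; the text does not say this is the exact procedure intended. It is deliberately simplified: one-dimensional data, k limited to 1 or 2, a uniform reference distribution, and no standard-error correction. All function names are hypothetical.

```python
import math
import random

def dispersion(points):
    """Sum of squared deviations from the mean of one cluster."""
    if not points:
        return 0.0
    mean = sum(points) / len(points)
    return sum((p - mean) ** 2 for p in points)

def best_dispersion(points, k):
    """Minimum total within-cluster dispersion for k = 1 or k = 2 in 1-D.

    In one dimension the best two-cluster partition is a split of the
    sorted points, so every split position is tried exhaustively.
    """
    pts = sorted(points)
    if k == 1:
        return dispersion(pts)
    if k == 2:
        return min(dispersion(pts[:i]) + dispersion(pts[i:])
                   for i in range(1, len(pts)))
    raise ValueError("this sketch only handles k = 1 or k = 2")

def gap(points, k, n_refs=20, seed=0):
    """Mean log dispersion of uniform reference sets minus the observed one."""
    rng = random.Random(seed)  # fixed seed: same reference sets for every k
    lo, hi = min(points), max(points)
    ref_logs = []
    for _ in range(n_refs):
        ref = [rng.uniform(lo, hi) for _ in points]
        ref_logs.append(math.log(best_dispersion(ref, k)))
    return sum(ref_logs) / n_refs - math.log(best_dispersion(points, k))

# Two tight, well-separated groups of ten points each.
two_groups = [0.1 * i for i in range(10)] + [10 + 0.1 * i for i in range(10)]
gap_1 = gap(two_groups, 1)
gap_2 = gap(two_groups, 2)
print("gap(1) =", gap_1)
print("gap(2) =", gap_2)
```

A gap at k = 2 that is clearly larger than the gap at k = 1 is evidence against the single-cluster null hypothesis. A full gap-statistic procedure would also account for the variability of the reference sets before rejecting the null.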