Pinto, David; Rosso, Paolo; Jiménez-Salazar, Héctor (Oxford University Press (OUP): Policy A - Oxford Open Option A, 2011)
Clustering narrow domain short texts is considered to be a complex task because of the intrinsic features of the corpus to be clustered: (i) the low frequencies of vocabulary terms in short texts, and (ii) the high vocabulary ...