ABSTRACT
Traditional text clustering methods require enormous computing resources, which make them inappropriate for processing large scale data collections. In this paper we present a clustering method based on the word category map approach using a two-level Growing Self-Organising Map (GSOM). A significant part of the clustering task is divided into separate subtasks that can be executed on different computers using the emergent Grid technology. Thus enabling the rapid analysis of information gathered globally. The performance of the proposed method is comparable to the traditional approaches while improves the execution time by 15 times.
- Honkela, T., Kaski, S., Lagus, K., Kohonen, T.: Newsgroup Exploration with WEBSOM Method and Browsing Interface, Tech. Rep. A32, Helsinki University of Technology, Laboratory of Computer and Information Science, Espoo, Finland (1996).Google Scholar
- Kaski, S., Honkela, T., Lagus, K., Kohonen, T.: WEBSOM--Self-Organizing Maps of Document Collections, Neurocomputing 21 (1998) 101-117.Google Scholar
Cross Ref
- Foster, I., Kesselman, C. (eds.): The grid : blueprint for a new computing infrastructure, Amsterdam, Boston, Elsevier, 2004 Google Scholar
Digital Library
- Kohonen, T.: self-organizing maps, Springer-Verlag, Berlin, 1995 Google Scholar
Digital Library
- Alahakoon. D., Halgamuge, S.K., Srinivasan, B.: Dynamic Self-Organising Maps with Controlled Growth for Knowledge Discovery, IEEE Transactions on Neural Networks, Special Issue on Knowledge Discovery and Data Mining, vol. 11, no. 3, 2000. Google Scholar
Digital Library
- Lagus, K., Kaski, S., Kohonen, T.: Mining Massive Document Collections by the WEBSOM Method, Information Sciences, Vol 163/1-3, pp. 135-156, 2004. Google Scholar
Digital Library
- Honkela, T.: Self-Organizing Maps in Natural Language Processing, Ph.D. thesis, Helsinki University of Technology, Neural Networks Research Center, Espoo, Finland, 1997.Google Scholar
- Nürnberger, A.: Interactive Text Retrieval Supported by Growing Self-Organizing Maps, Proc. of the International Workshop on Information Retrieval, pp. 61-70, 2001.Google Scholar
- Larsen, B., Aone, C.: Fast and Effective Text Mining using Linear Time Document Clustering. Proceedings of the conference on Knowledge Discovery and Data Mining, pages 16-22, 1999. Google Scholar
Digital Library
- Depoutovitch, A., Wainstein, A.: Building Grid Enabled Data-Mining Applications, http://www.ddj.com/184406345, 2005Google Scholar
- Salton G.: Developments in Automatic Text Retrieval, Science, Vol 253, pages 974-979, 1991.Google Scholar
- Hsu, A., Halgamuge, S.K.: Enhancement of Topology Preservation and Hierarchical Dynamic Self-Rrganising Maps for Data Visualisation, International Journal of Approximate Reasoning, vol. 32/2-3 pp. 259-279, Feb 2003.Google Scholar
Cross Ref
- Hsu, A., Tang, S., Halgamuge, S.K.: An Unsupervised Hierarchical Dynamic Self-Organising Approach to Class Discovery and Marker Gene Identification in Microarray Data, Bioinformatics, Oxford University Press, November 2003Google Scholar
- Alahakoon, D.: Controlling the Spread of Dynamic Self Organising Maps, Neural Computing and Applications, 13(2), pp 168-174, Springer Verlag, 2004 Google Scholar
Digital Library
- Wickramasinghe, L.K., Alahakoon, L.D.: Dynamic Self Organizing Maps for Discovery and Sharing of Knowledge in Multi Agent Systems in Web Intelligence and Agent Systems: An International Journal, (IOS Press), Vol.3, No.1, 2005. Google Scholar
Digital Library
Index Terms
- Scalable dynamic self-organising maps for mining massive textual data
Recommendations
Controlling the spread of dynamic self-organising maps
The growing self-organising map (GSOM) has recently been proposed as an alternative neural network architecture based on the traditional self-organising map (SOM). The GSOM provides the user with the ability to control the spread of the map by defining ...
Applying dynamic self organizing maps for identifying changes in data sequences
Design and application of hybrid intelligent systemsThe Growing Self Organizing Map (GSOM), a variant of the Self Organizing Map has been mainly used to cluster and identify relationships in static data in an unsupervised manner. In this paper we discuss about the capabilities of the GSOM in identifying ...
Data mining using rule extraction from Kohonen self-organising maps
The Kohonen self-organising feature map (SOM) has several important properties that can be used within the data mining/knowledge discovery and exploratory data analysis process. A key characteristic of the SOM is its topology preserving ability to map a ...




Comments