skip to main content
10.5555/1986861.1986894guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Scalable dynamic self-organising maps for mining massive textual data

Authors Info & Claims
Published:03 October 2006Publication History

ABSTRACT

Traditional text clustering methods require enormous computing resources, which make them inappropriate for processing large scale data collections. In this paper we present a clustering method based on the word category map approach using a two-level Growing Self-Organising Map (GSOM). A significant part of the clustering task is divided into separate subtasks that can be executed on different computers using the emergent Grid technology. Thus enabling the rapid analysis of information gathered globally. The performance of the proposed method is comparable to the traditional approaches while improves the execution time by 15 times.

References

  1. Honkela, T., Kaski, S., Lagus, K., Kohonen, T.: Newsgroup Exploration with WEBSOM Method and Browsing Interface, Tech. Rep. A32, Helsinki University of Technology, Laboratory of Computer and Information Science, Espoo, Finland (1996).Google ScholarGoogle Scholar
  2. Kaski, S., Honkela, T., Lagus, K., Kohonen, T.: WEBSOM--Self-Organizing Maps of Document Collections, Neurocomputing 21 (1998) 101-117.Google ScholarGoogle ScholarCross RefCross Ref
  3. Foster, I., Kesselman, C. (eds.): The grid : blueprint for a new computing infrastructure, Amsterdam, Boston, Elsevier, 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Kohonen, T.: self-organizing maps, Springer-Verlag, Berlin, 1995 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Alahakoon. D., Halgamuge, S.K., Srinivasan, B.: Dynamic Self-Organising Maps with Controlled Growth for Knowledge Discovery, IEEE Transactions on Neural Networks, Special Issue on Knowledge Discovery and Data Mining, vol. 11, no. 3, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Lagus, K., Kaski, S., Kohonen, T.: Mining Massive Document Collections by the WEBSOM Method, Information Sciences, Vol 163/1-3, pp. 135-156, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Honkela, T.: Self-Organizing Maps in Natural Language Processing, Ph.D. thesis, Helsinki University of Technology, Neural Networks Research Center, Espoo, Finland, 1997.Google ScholarGoogle Scholar
  8. Nürnberger, A.: Interactive Text Retrieval Supported by Growing Self-Organizing Maps, Proc. of the International Workshop on Information Retrieval, pp. 61-70, 2001.Google ScholarGoogle Scholar
  9. Larsen, B., Aone, C.: Fast and Effective Text Mining using Linear Time Document Clustering. Proceedings of the conference on Knowledge Discovery and Data Mining, pages 16-22, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Depoutovitch, A., Wainstein, A.: Building Grid Enabled Data-Mining Applications, http://www.ddj.com/184406345, 2005Google ScholarGoogle Scholar
  11. Salton G.: Developments in Automatic Text Retrieval, Science, Vol 253, pages 974-979, 1991.Google ScholarGoogle Scholar
  12. Hsu, A., Halgamuge, S.K.: Enhancement of Topology Preservation and Hierarchical Dynamic Self-Rrganising Maps for Data Visualisation, International Journal of Approximate Reasoning, vol. 32/2-3 pp. 259-279, Feb 2003.Google ScholarGoogle ScholarCross RefCross Ref
  13. Hsu, A., Tang, S., Halgamuge, S.K.: An Unsupervised Hierarchical Dynamic Self-Organising Approach to Class Discovery and Marker Gene Identification in Microarray Data, Bioinformatics, Oxford University Press, November 2003Google ScholarGoogle Scholar
  14. Alahakoon, D.: Controlling the Spread of Dynamic Self Organising Maps, Neural Computing and Applications, 13(2), pp 168-174, Springer Verlag, 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Wickramasinghe, L.K., Alahakoon, L.D.: Dynamic Self Organizing Maps for Discovery and Sharing of Knowledge in Multi Agent Systems in Web Intelligence and Agent Systems: An International Journal, (IOS Press), Vol.3, No.1, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Scalable dynamic self-organising maps for mining massive textual data
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image Guide Proceedings
        ICONIP'06: Proceedings of the 13th international conference on Neural information processing - Volume Part III
        October 2006
        1215 pages
        ISBN:3540464840
        • Editors:
        • Irwin King,
        • Laiwan Chan,
        • Jun Wang,
        • DeLiang Wang

        Publisher

        Springer-Verlag

        Berlin, Heidelberg

        Publication History

        • Published: 3 October 2006

        Qualifiers

        • Article