DOI: 10.1145/1352694.1352703
research-article

Digital libraries and engines of search: new information systems in the context of the digital preservation

ABSTRACT

The first library projects involving digitization began some years ago, but it was only in 1996 that the first web archive initiatives appeared. These were driven by the growth of the Internet and its increasing use, which proved to be an opportunity to transform and readapt traditional library services. In this context, search engines play a fundamental supporting role in the new paradigm of knowledge by capturing, storing, and providing access to resources, allowing a digital library to exist on every computer with Internet access. In this article we analyze the ways of developing a digital library, paying particular attention to the web harvesting technique, and present the capabilities and limitations of digital libraries. We then summarize relevant projects and initiatives and, finally, study the role of search engines in digital preservation, access, and information diffusion.

