ABSTRACT
The first's library projects occur some years ago with digitization, but just in 1996, the first's web archive initiatives start occurring. Such, was based in the Internet growth and in its increasing use, items that revealed to be an opportunity to transform and readapt the traditional library services. In this context, search engines play a fundamental role of support to the new paradigm of knowledge, by capturing, storing and providing access to the resources, allowing the existence of a digital library in each computer with internet access. In this article we analyze the ways of developing a digital library, taking higher attention to the web harvesting technique, and presenting digital libraries capabilities and limitations. Then we fully summarize relevant projects and initiatives, to finally study the role of search engines in what concerns to, digital preservation, access and information diffusion.
References
- Abiteboul, S., Cobéna, G., Masanes, J. and Sedrati, G. "A First Experience in Archiving the French Web", In Proceedings of the 6th European Conference on Research and Advances Technology for Digital Libraries, Rome, Italy, September 16--18, (2002). Google Scholar
Digital Library
- Alexa. www.alexa.com,Google Scholar
- Biblioteca do Conhecimento On-Line. www.b-on.ptGoogle Scholar
- Biblioteca Nacional Digital. http://bnd.bn.ptGoogle Scholar
- Brin, S. and Page, L. "The Anatomy of a Large-scale Hypertextual Web Search Engine", In Proceedings of the 7th International World Wide Web Conference (WWW7), Bisbane, Australia, April 14--18, (1998). Google Scholar
Digital Library
- Campos, F., "Seleccionar recursos para bibliotecas digitais: princípios orientadores", (2005).Google Scholar
- Campos, R. and Marques, C. "O Governo Electrónico e os Sistemas de Informação Públicos em Portugal", Actas da 1.a Conferência de Sistemas e Tecnologias de Informação, pp 421--437 (Volume I), Ofir, Portugal, Junho 21--23, (2006).Google Scholar
- Campos, R., Dias, G. and Nunes, C. "WISE: Hierarchical Soft Clustering of Web Page Search Results based on Web Content Mining Techniques", In Proceedings of the 2006 IEEE / WIC / ACM International Conference on Web Intelligence, Hong Kong, China, Dezembro 18--22, (2006). Google Scholar
Digital Library
- CAMiLEON. www.si.umich.edu/CAMILEON/domesday/domesday.htmlGoogle Scholar
- Christensen, N. "Preserving the bits of the Danish Internet", In Proceedings of the 5th International Web Archiving Workshop, Vienna, Austria, September 22--23, (2005).Google Scholar
- Combine harvester. http://combine.it.lth.seGoogle Scholar
- European Commission, "Comimission Recommendation on the digitisation and online accessibility of cultural material and digital preservation", (2006). http://ec.europa.eu/information_society/newsroom/cf/itemlongdetail.cfm?item_id=2782Google Scholar
- European Commission, "i2010: Digital Libraries", (2005). http://ec.europa.eu/information_society/activities/digital_librariesGoogle Scholar
- European Library. www.europeanlibrary.orgGoogle Scholar
- European Archive. www.europarchive.orgGoogle Scholar
- Gomes, D., Freitas, S. and Silva, M. "Design and Selection Criteria for a National Web Archive", In Proceedings of the 10th European Conference Research and Advances Technology for Digital Libraries, Alicante, Spain, September 17--22, (2006). Google Scholar
Digital Library
- Google Books. http://books.google.comGoogle Scholar
- Google Books. http://scholar.google.comGoogle Scholar
- Hallgrímsson, P., Bang, S. and Mannerheim, J. "Nordic Web Archive", In Proceedings of the 3th International Web Archiving Workshop, Trondheim, Norway, 21 August, (2003).Google Scholar
- Heritrix. http://crawler.archive.orgGoogle Scholar
- HTTrack. www.httrack.comGoogle Scholar
- IFLA., ICA. "Guidelines for Digitization Projects for Collections and Holdins in the Public Domain, particularly those held by libraries and archives", (2002).Google Scholar
- International Internet Preservation Consortium. www.netpreserve.orgGoogle Scholar
- International Web Archiving Workshop. www.iwaw.netGoogle Scholar
- Jodelis, R. "Harvesting and Archiving of Electronic Resources in Lithuania: towards Virtual Library", In Proceedings of the 9th Conference on Professional Information Resources, Prague, Czech Republic, May 27--29, (2003).Google Scholar
- Kenney, A. and Oya, R. "Moving theory into practice: digital imaging for libraries and archives", Mountain View, Calif.: Research Libraries Group, (2000).Google Scholar
- Koerbin, P., "Managing Web Archiving in Australia: a Case Study", In Proceedings of the 3th International Web Archiving Workshop, Norway, 21 August, (2003).Google Scholar
- Kosala, R. and Blockeel, H. "Web Mining Research: a Survey", In ACM SIGKDD Exploration, 2(1), 1--15, (2000). Google Scholar
Digital Library
- Lampos, C., Eirinaki, M., Jevtuchova, D. and Vazirgiannis, M. "Archiving the Greek Web", In Proceedings of the 4th International Web Archiving Workshop, Bath, UK, 16 September, (2005).Google Scholar
- Lyman, P. "Archiving the World Wide Web", School of Information Management and Systems University of California, Berkeley, (2002).Google Scholar
- Marill, J., Boyko, A. and Ashenfelder, M. "Tools and Techniques for Harvesting the World Wide Web", In Proceedings of the JCDL, (2004). Google Scholar
Digital Library
- Marill, J., Boyko, A. and Ashenfelder, M. "Web Harvesting Survey", International Internet Preservation Consortium, 20 July, (2004).Google Scholar
- Masanès, J. "Towards Continuous Web Archiving", D-Lib Magazine, Volume 8 Number 12, ISSN 1082 - 9873, December, (2002).Google Scholar
Cross Ref
- NetCraft. http://news.netcraft.comGoogle Scholar
- Northest Document Conservation Center Andover Massachusetts, "Handbook for digital projects: A management tool for preservation and access", Maxine K. Sitts, Editor, (2000).Google Scholar
- Ntoulas, A., Cho, J., Cho, K., Cho, H. and Cho, Y. "A study on the evolution of the web", In Proceedings of the 2005 UKC Conference, August, (2005).Google Scholar
- Ntoulas, A., Zerfos, P. and Cho, J. "Downloading Hidden Web Content", Technical Report, UCLA, (2004).Google Scholar
- Open Content Alliance. /www.opencontentalliance.orgGoogle Scholar
- Paradigma. www.nb.no/paradigmaGoogle Scholar
- Pereira, A. "O Advento Digital e a nova missão da Biblioteca Pública", (2005). Biblioteca Municipal Afonso Lopes Vieira, Câmara Municipal de Leiria. http://sapp.telepac.pt/apbad/congresso8/comm6.pdfGoogle Scholar
- Persson, N. Arvidson, A. and Mannerheim, J. "The Kulturarw3 Project The Royal Swedish Web Archive", In Proceedings of the 66th IFLA Conference, Jerusalem, Israel, August 13--18, (2000).Google Scholar
- Preserving Access to Digital Information. www.nla.gov.au/padi/index.htmlGoogle Scholar
- Projecto Nórdico. http://nwa.nb.noGoogle Scholar
- Rauber, A., Aschenbrenner, A. and Witvoet, O. "Austrian On-Line Archive Processing Analyzing Archives of the World Wide Web", In Proceedings of the 6th European Conference on Research and Advances Technology for Digital Libraries, Rome, Italy, September 16--18, (2002). Google Scholar
Digital Library
- Tumba. www.tumba.ptGoogle Scholar
- UK Web Archiving Consortium. www.webarchive.org.ukGoogle Scholar
- Webb, C. "Who will save the Olympics?", OCLC/Preservation Resources Symposium, Digital Past, Digital Future: an Introduction to Digital Preservation, OCIC, Dublin, Ohio, 15 June, (2001).Google Scholar
- Web aRchive Access. http://archive-access.sourceforge.net/projects/weraGoogle Scholar
- Web Archive discussion list. http://listes.cru.fr/sympa/info/web-archiveGoogle Scholar
- Web Archiving Project. http://warp.ndl.go.jpGoogle Scholar
- Xyleme crawler. www.xyleme.comGoogle Scholar
- Žabička, P. "Archiving the Czech Web: Issues and Challenges", In Proceedings of the 3th International Web Archiving Workshop, Norway, 21 August, (2003).Google Scholar
Index Terms
(auto-classified)Digital libraries and engines of search


Ricardo Campos

Comments