Author image not provided
 Ismet Zeki Yalniz

Authors:
Add personal information
  Affiliation history
Bibliometrics: publication history
Average citations per article3.00
Citation Count21
Publication count7
Publication years2010-2013
Available for download4
Average downloads per article270.00
Downloads (cumulative)1,080
Downloads (12 Months)64
Downloads (6 Weeks)7
SEARCH
ROLE
Arrow RightAuthor only


AUTHOR'S COLLEAGUES
See all colleagues of this author

SUBJECT AREAS
See all subject areas




BOOKMARK & SHARE


7 results found Export Results: bibtexendnoteacmrefcsv

Result 1 – 7 of 7
Sort by:

1
August 2013 ICDAR '13: Proceedings of the 2013 12th International Conference on Document Analysis and Recognition
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 0

This paper evaluates an automated scheme for aligning and combining optical character recognition (OCR) output from three scans of a book to generate a composite version with fewer OCR errors. While there has been some previous work on aligning multiple OCR versions of the same scan, the scheme introduced in ...
Keywords: OCR error correction, sequence alignment, scanned book collections

2 published by ACM
August 2012 SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 1,   Downloads (12 Months): 13,   Downloads (Overall): 204

Full text available: PDFPDF
This paper describes an approach for identifying translations of books in large scanned book collections with OCR errors. The method is based on the idea that although individual sentences do not necessarily preserve the word order when translated, a book must preserve the linear progression of ideas for it to ...
Keywords: book collections, translation detection, sequence alignment, unique words

3 published by ACM
August 2012 SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 4,   Downloads (12 Months): 12,   Downloads (Overall): 143

Full text available: PDFPDF
Conventional retrieval systems view documents as a unit and look at different retrieval types within a document. We introduce Proteus, a frame-work for seamlessly navigating books as dynamic collections which are defined on the fly. Proteus allows us to search various retrieval types. Navigable types include pages, books, named persons, ...
Keywords: navigation, object search, proteus

4
March 2012 DAS '12: Proceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 4

An efficient word spotting framework is proposed to search text in scanned books. The proposed method allows one to search for words when optical character recognition (OCR) fails due to noise or for languages where there is no OCR. Given a query word image, the aim is to retrieve matching ...
Keywords: document image search, image retrieval, word spotting

5 published by ACM
October 2011 CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
Publisher: ACM
Bibliometrics:
Citation Count: 5
Downloads (6 Weeks): 0,   Downloads (12 Months): 12,   Downloads (Overall): 201

Full text available: PDFPDF
A framework is presented for discovering partial duplicates in large collections of scanned books with optical character recognition (OCR) errors. Each book in the collection is represented by the sequence of words (in the order they appear in the text) which appear only once in the book. These words are ...
Keywords: unique words, partial duplicate detection, sequence matching

6
September 2011 ICDAR '11: Proceedings of the 2011 International Conference on Document Analysis and Recognition
Publisher: IEEE Computer Society
Bibliometrics:
Citation Count: 5

This paper aims to evaluate the accuracy of optical character recognition (OCR) systems on real scanned books. The ground truth e-texts are obtained from the Project Gutenberg website and aligned with their corresponding OCR output using a fast recursive text alignment scheme (RETAS). First, unique words in the vocabulary of ...
Keywords: OCR evaluation, sequence alignment, digital libraries

7 published by ACM
January 2010 Journal on Computing and Cultural Heritage (JOCCH): Volume 2 Issue 3, December 2009
Publisher: ACM
Bibliometrics:
Citation Count: 5
Downloads (6 Weeks): 2,   Downloads (12 Months): 27,   Downloads (Overall): 532

Full text available: PDFPDF
This article presents Ottoman Archives Explorer, a Content-Based Retrieval (CBR) system based on character recognition for printed and handwritten historical documents. Several methods for character segmentation and recognition stages are investigated. In particular, sliding-window and histogram segmentation methods are coupled with recognition approaches using spatial features, neural networks, and a ...
Keywords: Content-Based Retrieval (CBR), Information Retrieval (IR), Optical Character Recognition (OCR), historical document analysis



The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us