 Jonathan Schler

Bibliometrics: publication history
Average citations per article: 14.25
Citation Count: 228
Publication count: 16
Publication years: 2000-2016
Available for download: 10
Average downloads per article: 710.60
Downloads (cumulative): 7,106
Downloads (12 Months): 506
Downloads (6 Weeks): 46


1 published by ACM
December 2016 Journal on Computing and Cultural Heritage (JOCCH): Volume 9 Issue 4, December 2016
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 2,   Downloads (12 Months): 57,   Downloads (Overall): 60

Full text available: PDF
A cross-period (diachronic) thesaurus enables users to search for information using modern terminology and obtain semantically related terms from earlier historical periods. The complex task of supporting the construction of a diachronic thesaurus by a domain expert lexicographer has hardly been addressed computationally until now. In this article, we introduce ...
Keywords: Hebrew, cultural heritage, semantic similarity, Diachronic thesaurus

2 published by ACM
April 2013 Journal on Computing and Cultural Heritage (JOCCH): Volume 6 Issue 1, March 2013
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 1,   Downloads (12 Months): 19,   Downloads (Overall): 258

Full text available: PDF
This article describes methods for semiautomatic thesaurus construction, for a cross generation, cross genre, and cross cultural corpus. Semiautomatic thesaurus construction is a complex task, and applying it on a cross generation corpus brings its own challenges. We used a Jewish juristic corpus containing documents and genres that were written ...
Keywords: Hebrew, Language model, cultural heritage

3
June 2012 SemEval '12: Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Publisher: Association for Computational Linguistics
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 1,   Downloads (12 Months): 7,   Downloads (Overall): 31

Full text available: PDF
Corpus-based thesaurus construction for Morphologically Rich Languages (MRL) is a complex task, due to the morphological variability of MRL. In this paper we explore alternative term representations, complemented by clustering of morphological variants. We introduce a generic algorithmic scheme for thesaurus construction in MRL, and demonstrate the empirical benefit of ...

4
March 2011 Language Resources and Evaluation: Volume 45 Issue 1, March 2011
Publisher: Springer-Verlag New York, Inc.
Bibliometrics:
Citation Count: 22

Most previous work on authorship attribution has focused on the case in which we need to attribute an anonymous document to one of a small set of candidate authors. In this paper, we consider authorship attribution as found in the wild: the set of known candidates is extremely large (possibly ...
Keywords: Randomized feature set, Authorship attribution, Open candidate set

5 published by ACM
February 2009 Communications of the ACM - Inspiring Women in Computing: Volume 52 Issue 2, February 2009
Publisher: ACM
Bibliometrics:
Citation Count: 32
Downloads (6 Weeks): 20,   Downloads (12 Months): 224,   Downloads (Overall): 1,699

Full text available: HTML, PDF

6
January 2009 Journal of the American Society for Information Science and Technology: Volume 60 Issue 1, January 2009
Publisher: John Wiley & Sons, Inc.
Bibliometrics:
Citation Count: 51

Statistical authorship attribution has a long history, culminating in the use of modern machine learning classification methods. Nevertheless, most of this work suffers from the limitation of assuming a small closed set of candidate authors and essentially unlimited training text for each. Real-life authorship attribution problems, however, typically fall short ...
Keywords: authors, information attribution, machine learning, text mining, automatic classification

7
December 2007 The Journal of Machine Learning Research: Volume 8, 12/1/2007
Publisher: JMLR.org
Bibliometrics:
Citation Count: 25

In the authorship verification problem, we are given examples of the writing of a single author and are asked to determine if given long texts were or were not written by this author. We present a new learning-based method for adducing the "depth of difference" between two example sets and ...

8 published by ACM
August 2006 SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 13
Downloads (6 Weeks): 7,   Downloads (12 Months): 32,   Downloads (Overall): 724

Full text available: PDF
In this paper, we use a blog corpus to demonstrate that we can often identify the author of an anonymous text even where there are many thousands of candidate authors. Our approach combines standard information retrieval methods with a text categorization meta-learning scheme that determines when to even venture a ...
Keywords: blog analysis, authorship attribution

9
July 2006 Knowledge and Information Systems: Volume 10 Issue 1, July 2006
Publisher: Springer-Verlag New York, Inc.
Bibliometrics:
Citation Count: 4

Typographic and visual information is an integral part of textual documents. Most information extraction (IE) systems ignore most of this visual information, processing the text as a linear sequence of words. Thus, much valuable information is lost. In this paper, we show how to make use of this visual information ...
Keywords: Text analysis, Information extraction, PDF analysis, Wrapper induction

10 published by ACM
August 2005 KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Publisher: ACM
Bibliometrics:
Citation Count: 11
Downloads (6 Weeks): 8,   Downloads (12 Months): 95,   Downloads (Overall): 954

Full text available: PDF
In this paper, we show that stylistic text features can be exploited to determine an anonymous author's native language with high accuracy. Specifically, we first use automatic tools to ascertain frequencies of various stylistic idiosyncrasies in a text. These frequencies then serve as features for support vector machines that learn ...
Keywords: text mining, author profiling

11
July 2005 IJCAI'05: Proceedings of the 19th international joint conference on Artificial intelligence
Publisher: Morgan Kaufmann Publishers Inc.
Bibliometrics:
Citation Count: 4

Sentiment analysis is an example of polarity learning. Most research on learning to identify sentiment ignores "neutral" examples and instead performs training and testing using only examples of significant polarity. We show that it is crucial to use neutral examples in learning polarity for a variety of reasons and show ...

12
May 2005 ISI'05: Proceedings of the 2005 IEEE international conference on Intelligence and Security Informatics
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 5

Text authored by an unidentified assailant can offer valuable clues to the assailant's identity. In this paper, we show that stylistic text features can be exploited to determine an anonymous author's native language with high accuracy.

13 published by ACM
November 2004 CIKM '04: Proceedings of the thirteenth ACM international conference on Information and knowledge management
Publisher: ACM
Bibliometrics:
Citation Count: 6
Downloads (6 Weeks): 0,   Downloads (12 Months): 10,   Downloads (Overall): 821

Full text available: PDF
This paper describes a hybrid statistical and knowledge-based information extraction model, able to extract entities and relations at the sentence level. The model attempts to retain and improve the high accuracy levels of knowledge-based systems while drastically reducing the amount of manual labor by relying on statistics drawn from a ...
Keywords: HMM, rules based system, text mining, information extraction

14 published by ACM
July 2004 ICML '04: Proceedings of the twenty-first international conference on Machine learning
Publisher: ACM
Bibliometrics:
Citation Count: 46
Downloads (6 Weeks): 7,   Downloads (12 Months): 55,   Downloads (Overall): 993

Full text available: PDF
In the authorship verification problem, we are given examples of the writing of a single author and are asked to determine if given long texts were or were not written by this author. We present a new learning-based method for adducing the "depth of difference" between two example sets and ...

15 published by ACM
October 2001 CIKM '01: Proceedings of the tenth international conference on Information and knowledge management
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 0,   Downloads (12 Months): 3,   Downloads (Overall): 1,025

Full text available: PDF
Text-Mining is a growing area of interest within the field of Data Mining and Knowledge Discovery. Given a collection of text documents, most approaches to Text Mining perform knowledge-discovery operations either on external tags associated with each document, or on the set of all words within each document. Both approaches ...

16 published by ACM
August 2000 KDD '00: Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Publisher: ACM
Bibliometrics:
Citation Count: 7
Downloads (6 Weeks): 0,   Downloads (12 Months): 4,   Downloads (Overall): 541

Full text available: PDF
Keywords: text mining, theory revision, user guided revision, information extraction



The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.