 Ben He

Bibliometrics: publication history
Average citations per article: 14.35
Citation Count: 373
Publication count: 26
Publication years: 2003-2009
Available for download: 14
Average downloads per article: 446.14
Downloads (cumulative): 6,246
Downloads (12 Months): 179
Downloads (6 Weeks): 17

26 results found

Result 1 – 20 of 26

1 published by ACM
November 2009 CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 0,   Downloads (12 Months): 7,   Downloads (Overall): 195

Full text available: PDF
Enterprise intranets are often sparse in nature, with limited use of alternative lexical representations between authors, making query expansion (QE) ineffective. Hence, for some enterprise search queries, it can be advantageous to instead use the well-known collection enrichment (CE) method to gather higher quality pseudo-feedback documents from a more diverse ...
Keywords: selective application, collection enrichment, enterprise search, query expansion

2 published by ACM
November 2009 CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
Publisher: ACM
Bibliometrics:
Citation Count: 20
Downloads (6 Weeks): 3,   Downloads (12 Months): 20,   Downloads (Overall): 304

Full text available: PDF
Pseudo-relevance feedback finds useful expansion terms from a set of top-ranked documents. It is often crucial to identify those good feedback documents from which useful expansion terms can be added to the query. In this paper, we propose to detect good feedback documents by classifying all feedback documents using a ...
Keywords: feedback document classification, relevance feedback

3
September 2009 ICTIR '09: Proceedings of the 2nd International Conference on Theory of Information Retrieval: Advances in Information Retrieval Theory
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 2

Query Expansion (QE) often improves the retrieval performance of an Information Retrieval (IR) system. However, as enterprise intranets are often sparse in nature, with limited use of alternative lexical representations between authors, it can be advantageous to use Collection Enrichment (CE) to gather higher quality pseudo-feedback documents. In this paper, ...

4 published by ACM
July 2009 SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 2
Downloads (6 Weeks): 0,   Downloads (12 Months): 7,   Downloads (Overall): 340

Full text available: PDF
Current blog opinion retrieval approaches cannot be applied if the topic relevance and opinion score distributions by rank are dissimilar. This problem severely limits the feasibility of these approaches. We propose to tackle this problem by fitting the distribution of opinion scores, which replaces the original topic relevance score distribution ...
Keywords: blog search, opinion finding, distribution fitting

5
April 2009 ECIR '09: Proceedings of the 31st European Conference on IR Research on Advances in Information Retrieval
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 11

Query expansion is an effective technique in improving the retrieval performance for ad-hoc retrieval. However, query expansion can also fail, leading to a degradation of the retrieval performance. In this paper, we aim to provide a better understanding of query expansion by an empirical study on what factors can affect ...

6
April 2009 ECIR '09: Proceedings of the 31st European Conference on IR Research on Advances in Information Retrieval
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 16

Opinion finding is a challenging retrieval task, where it has been shown that it is especially difficult to improve over a strongly performing topic-relevance baseline. In this paper, we propose a novel approach for opinion finding, which takes into account the proximity of query terms to subjective sentences in a ...

7 published by ACM
October 2008 CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management
Publisher: ACM
Bibliometrics:
Citation Count: 32
Downloads (6 Weeks): 4,   Downloads (12 Months): 16,   Downloads (Overall): 950

Full text available: PDF
Finding opinionated blog posts is still an open problem in information retrieval, as exemplified by the recent TREC blog tracks. Most of the current solutions involve the use of external resources and manual efforts in identifying subjective features. In this paper, we propose a novel and effective dictionary-based statistical approach, ...
Keywords: statistics, subjectivity, retrieval, blog, opinion, sentiment

8 published by ACM
July 2008 SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 1,   Downloads (12 Months): 13,   Downloads (Overall): 672

Full text available: PDF
Various measures, such as binary preference (bpref), inferred average precision (infAP), and binary normalised discounted cumulative gain (nDCG) have been proposed as alternatives to mean average precision (MAP) for being less sensitive to the relevance judgements completeness. As the primary aim of any system building is to train the system ...
Keywords: evaluation, inferred average precision, binary preference, mean average precision, training, normalised discounted cumulative gain, optimisation, relevance judgements

9 published by ACM
July 2008 SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 6
Downloads (6 Weeks): 0,   Downloads (12 Months): 15,   Downloads (Overall): 381

Full text available: PDF
In opinion-finding, the retrieval system is tasked with retrieving documents that are not just relevant, but that also express an opinion towards the query target entity. Most opinion-finding systems are based on a two-stage approach, where initially the system aims to retrieve relevant documents, which are then re-ranked according to the extent ...
Keywords: opinion finding, baselines

10 published by ACM
July 2008 SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 20
Downloads (6 Weeks): 2,   Downloads (12 Months): 19,   Downloads (Overall): 664

Full text available: PDF
The aim of an opinion finding system is not just to retrieve relevant documents, but to also retrieve documents that express an opinion towards the query target entity. In this work, we propose a way to use and integrate an opinion-identification toolkit, OpinionFinder, into the retrieval process of an Information ...
Keywords: opinion finding, blogs

11
March 2008 ECIR'08: Proceedings of the 30th European Conference on IR Research: Advances in Information Retrieval
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 1


12 published by ACM
November 2007 CIKM '07: Proceedings of the sixteenth ACM conference on information and knowledge management
Publisher: ACM
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 0,   Downloads (12 Months): 3,   Downloads (Overall): 258

Full text available: PDF
The term frequency normalisation parameter sensitivity is an important issue in the probabilistic model for Information Retrieval. A high parameter sensitivity indicates that a slight change of the parameter value may considerably affect the retrieval performance. Therefore, a weighting model with a high parameter sensitivity is not robust enough to ...
Keywords: relevance feedback, parameter sensitivity, query term reweighing

13
September 2007 Information Processing and Management: an International Journal: Volume 43 Issue 5, September, 2007
Publisher: Pergamon Press, Inc.
Bibliometrics:
Citation Count: 25

In this paper, we aim to improve query expansion for ad-hoc retrieval, by proposing a more fine-grained term reweighting process. This fine-grained process uses statistics from the representation of documents in various fields, such as their titles, the anchor text of their incoming links, and their body content. The contribution ...
Keywords: Information retrieval, Query expansion on fields, Pseudo relevance feedback, TREC experiments, Adaptive query expansion, External expansion

14 published by ACM
July 2007 SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 24
Downloads (6 Weeks): 2,   Downloads (12 Months): 27,   Downloads (Overall): 416

Full text available: PDF
Term dependency, or co-occurrence, has been studied in language modelling, for instance by Metzler & Croft who showed that retrieval performance could be significantly enhanced using term dependency information. In this work, we show how term dependency can be modelled within the Divergence From Randomness (DFR) framework. We evaluate our term dependency ...
Keywords: DFR, term dependency

15 published by ACM
July 2007 ACM Transactions on Information Systems (TOIS): Volume 25 Issue 3, July 2007
Publisher: ACM
Bibliometrics:
Citation Count: 9
Downloads (6 Weeks): 1,   Downloads (12 Months): 13,   Downloads (Overall): 602

Full text available: PDF
The setting of the term frequency normalization hyper-parameter suffers from the query dependence and collection dependence problems, which remarkably hurt the robustness of the retrieval performance. Our study in this article investigates three term frequency normalization methods, namely normalization 2, BM25's normalization and the Dirichlet Priors normalization. We tackle the ...
Keywords: TREC experimentation, information retrieval models, collection-dependence, relevance feedback, Query-dependence, term frequency normalization

16
May 2007 RIAO '07: Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Publisher: LE CENTRE DE HAUTES ETUDES INTERNATIONALES D'INFORMATIQUE DOCUMENTAIRE
Bibliometrics:
Citation Count: 4
Downloads (6 Weeks): 0,   Downloads (12 Months): 2,   Downloads (Overall): 62

Full text available: PDF
Query-independent features (also called document priors), such as the number of incoming links to a document, its Page-Rank, or the type of its associated URL, have been successfully integrated into Web Information Retrieval systems in order to enhance the retrieval effectiveness. The combination of several document priors could further enhance ...

17
April 2007 ECIR'07: Proceedings of the 29th European conference on IR research
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 1

Per-field normalisation has been shown to be effective for Web search tasks, e.g. named-page finding. However, per-field normalisation also suffers from having hyper-parameters to tune on a per-field basis. In this paper, we argue that the purpose of per-field normalisation is to adjust the linear relationship between field length and ...

18
November 2006 Information Systems: Volume 31 Issue 7, November 2006
Publisher: Elsevier Science Ltd.
Bibliometrics:
Citation Count: 54

The prediction of query performance is an interesting and important issue in Information Retrieval (IR). Current predictors involve the use of relevance scores, which are time-consuming to compute. Therefore, current predictors are not very suitable for practical applications. In this paper, we study six predictors of query performance, which can ...
Keywords: query performance prediction, experiments, information retrieval

19
September 2005 CLEF'05: Proceedings of the 6th international conference on Cross-Language Evaluation Forum: Accessing Multilingual Information Repositories
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 17

We participated in the WebCLEF 2005 monolingual task. In this task, a search system aims to retrieve relevant documents from a multilingual corpus of Web documents from Web sites of European governments. Both the documents and the queries are written in a wide range of European languages. A challenge in ...

20
September 2005 CLEF'05: Proceedings of the 6th international conference on Cross-Language Evaluation Forum: Accessing Multilingual Information Repositories
Publisher: Springer-Verlag
Bibliometrics:
Citation Count: 1

In the CLEF 2005 Ad-Hoc Track we addressed the problem of retrieving information in morphologically rich languages, by experimenting with language-specific morphosyntactic processing and light Natural Language Processing (NLP). The diversity of the languages processed, namely Bulgarian, French, Italian, English, and Greek, allowed us to measure the effect of system-specific ...



The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.