Author image not provided
 Houqiang Li

Authors:
Add personal information
  Affiliation history
Bibliometrics: publication history
Average citations per article1.25
Citation Count15
Publication count12
Publication years2013-2017
Available for download6
Average downloads per article144.00
Downloads (cumulative)864
Downloads (12 Months)246
Downloads (6 Weeks)76
SEARCH
ROLE
Arrow RightAuthor only


AUTHOR'S COLLEAGUES
See all colleagues of this author

SUBJECT AREAS
See all subject areas




BOOKMARK & SHARE


12 results found Export Results: bibtexendnoteacmrefcsv

Result 1 – 12 of 12
Sort by:

1 published by ACM
December 2017 ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM): Volume 14 Issue 1, January 2018
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 29,   Downloads (12 Months): 29,   Downloads (Overall): 29

Full text available: PDFPDF
In sign language recognition (SLR) with multimodal data, a sign word can be represented by multiply features, for which there exist an intrinsic property and a mutually complementary relationship among them. To fully explore those relationships, we propose an online early-late fusion method based on the adaptive Hidden Markov Model ...
Keywords: online algorithm, query-adaptive, Sign language recognition, multi-modal feature fusion, HMM

2 published by ACM
October 2017 MM '17: Proceedings of the 2017 ACM on Multimedia Conference
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 13,   Downloads (12 Months): 61,   Downloads (Overall): 61

Full text available: PDFPDF
We are creating multimedia contents everyday and everywhere. While automatic content generation has played a fundamental challenge to multimedia community for decades, recent advances of deep learning have made this problem feasible. For example, the Generative Adversarial Networks (GANs) is a rewarding approach to synthesize images. Nevertheless, it is not ...
Keywords: cnns, video captioning, video generation, gans

3 published by ACM
October 2017 MM '17: Proceedings of the 2017 ACM on Multimedia Conference
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 11,   Downloads (12 Months): 27,   Downloads (Overall): 27

Full text available: PDFPDF
Approximate Nearest Neighbour (ANN) search is an important research topic in multimedia and computer vision fields. In this paper, we propose a new deep supervised quantization method by Self-Organizing Map (SOM) to address this problem. Our method integrates the Convolutional Neural Networks (CNN) and Self-Organizing Map into a unified deep ...
Keywords: supervised quantization, self-organizing map, approximate nearest neighbour search

4 published by ACM
August 2017 SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
Publisher: ACM
Bibliometrics:
Citation Count: 0
Downloads (6 Weeks): 19,   Downloads (12 Months): 79,   Downloads (Overall): 79

Full text available: PDFPDF
We demonstrate a video captioning bot, named Seeing Bot, which can generate a natural language description about what it is seeing in near real time. Specifically, given a live streaming video, Seeing Bot runs two pre-learned and complementary captioning modules in parallel - one for generating image-level caption for each ...
Keywords: chitchat bot, video captioning, multi-view embedding, deep convolutional neural networks, image captioning

5
April 2017 Signal Processing: Volume 133 Issue C, April 2017
Publisher: Elsevier North-Holland, Inc.
Bibliometrics:
Citation Count: 0

Total variation and its variants have been widely used in the video/image restoration area in the past decades. Among them, the nonlocal total variation model introduces penalization on nonlocal gradients and demonstrates remarkable performance gain in many applications. However, this approach tends to suppress intensity-changes of visual contents, and hence ...
Keywords: Regularization Modeling, Nonlocal Total Variation, Video Restoration

6
April 2017 IEEE Transactions on Information Theory: Volume 63 Issue 4, April 2017
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0

Recently, a secrecy measure based on list-reconstruction has been proposed, in which a wiretapper is allowed to produce a list of $2^{mR_{L}}$ reconstruction sequences and the secrecy is measured by the minimum distortion over the entire list. In this paper, we show that this list secrecy problem ...

7
October 2016 IEEE Transactions on Information Theory: Volume 62 Issue 10, October 2016
Publisher: IEEE Press
Bibliometrics:
Citation Count: 0

Recently, Tian et al. [1] considered joint source-channel coding of transmitting a Gaussian source over $K$ -user Gaussian broadcast channel, and derived an outer bound on the admissible distortion region. In [1] , they stated “due to its nonlinear form, it appears difficult to ...

8
July 2016 IJCAI'16: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence
Publisher: AAAI Press
Bibliometrics:
Citation Count: 1

Learning video representation is not a trivial task, as video is an information-intensive media where each frame does not exist independently. Locally, a video frame is visually and semantically similar with its adjacent frames. Holistically, a video has its inherent structure--the correlations among video frames. For example, even the frames ...

9
February 2016 IEEE Transactions on Circuits and Systems for Video Technology: Volume 26 Issue 2, February 2016
Publisher: IEEE Press
Bibliometrics:
Citation Count: 1

Recently, image representation by vector of locally aggregated descriptors (VLADs) has been demonstrated to be super efficient in image representation. Due to the coarse division in the feature space, its discriminative power is limited. One intuitive way to address this issue is to construct a VLAD with a larger vocabulary, ...

10
January 2015 Advances in Multimedia: Volume 2015, January 2015
Publisher: Hindawi Limited
Bibliometrics:
Citation Count: 1
Downloads (6 Weeks): 0,   Downloads (12 Months): 5,   Downloads (Overall): 7

Full text available: PDFPDF
The support for region of interest (ROI) browsing, which allows dropping background part of video bitstreams, is a desirable feature for video applications. With the help of the slice group technique provided by H.264/SVC, rectangular ROI areas can be encoded into separate ROI slices. Additionally, by imposing certain constraints on ...

11
October 2014 Signal Processing: Volume 103 Issue C, October 2014
Publisher: Elsevier North-Holland, Inc.
Bibliometrics:
Citation Count: 0


12 published by ACM
February 2013 ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM): Volume 9 Issue 1, February 2013
Publisher: ACM
Bibliometrics:
Citation Count: 11
Downloads (6 Weeks): 4,   Downloads (12 Months): 45,   Downloads (Overall): 661

Full text available: PDFPDF
Most large-scale image retrieval systems are based on the bag-of-visual-words model. However, the traditional bag-of-visual-words model does not capture the geometric context among local features in images well, which plays an important role in image retrieval. In order to fully explore geometric context of all visual words in images, efficient ...
Keywords: geometric square coding, geometric fan coding, large scale, partial duplicate, Image retrieval, rotation-invariant



The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2018 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us