skip to main content
10.1145/3343031.3350535acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

The VIA Annotation Software for Images, Audio and Video

Published: 15 October 2019 Publication History

Abstract

In this paper, we introduce a simple and standalone manual annotation tool for images, audio and video: the VGG Image Annotator (VIA). This is a light weight, standalone and offline software package that does not require any installation or setup and runs solely in a web browser. The VIA software allows human annotators to define and describe spatial regions in images or video frames, and temporal segments in audio or video. These manual annotations can be exported to plain text data formats such as JSON and CSV and therefore are amenable to further processing by other software tools. VIA also supports collaborative annotation of a large dataset by a group of human annotators. The BSD open source license of this software allows it to be used in any academic project or commercial application.

References

[1]
Sharib Ali, Felix Zhou, Christian Daul, Barbara Braden, Adam Bailey, Stefano Realdon, James East, Georges Wagnières, Victor Loschenov, Enrico Grisan, Walter Blondel, and Jens Rittscher. 2019. Endoscopy artifact detection (EAD 2019) challenge dataset. arXiv preprint arXiv:1905.03209.
[2]
BigParticle.Cloud. 2018. How-To: Generate primary object masks. https://www.bigparticle.cloud/index.php/how-to-generate-primary-object-masks/. Accessed: Mar 2019.
[3]
Julia Brasch, Kerry M Goodman, Alex J Noble, Micah Rapp, Seetha Mannepalli, Fabiana Bahna, Venkata P Dandey, Tristan Bepler, Bonnie Berger, Tom Maniatis, Clinton S Potter, Bridget Carragher, Barry Honig, and Lawrence Shapiro. 2019. Visualization of clustered protocadherin neuronal self-recognition complexes. Nature, Vol. 569, 7755, 280.
[4]
Qiong Cao, Omkar M. Parkhi, Mark Everingham, Josef Sivic, and Andrew Zisserman. 2019. VGG Face Tracker. http://www.robots.ox.ac.uk/vgg/software/face_tracker/. Accessed: Mar 2019.
[5]
Joon Son Chung, Arsha Nagrani, and Andrew Zisserman. 2018. VoxCeleb2: Deep speaker recognition. INTERSPEECH.
[6]
Michael Ferlaino, Craig A Glastonbury, Carolina Motta-Mejia, Manu Vatish, Ingrid Granne, Stephen Kennedy, Cecilia M Lindgren, and Christoffer Nellåker. 2018. Towards deep cellular phenotyping in placental histology. arXiv preprint arXiv:1804.03270.
[7]
Sarah Griffin. 2018. Diagram and Dimension: Visualising Time in the Drawings of Opicinus De Canistris (1296-c. 1352). Ph.D. Dissertation. University of Oxford.
[8]
Matilde Malaspina. 2018. 15th-century printed Italian editions of Aesopian texts. Ph.D. Dissertation. University of Oxford.
[9]
Milind Naphade, David C Anastasiu, Anuj Sharma, Vamsi Jagrlamudi, Hyeran Jeon, Kaikai Liu, Ming-Ching Chang, Siwei Lyu, and Zeyu Gao. 2017. The NVIDIA AI City Challenge. In IEEE SmartWorld. 1--6.
[10]
William Pascoe and Kaspar Paseko. 2019. Scriptopict. https://c21ch.newcastle.edu.au/scriptopict/. Accessed: May 2019.
[11]
Matthieu Pizenberg, Axel Carlier, Emmanuel Faure, and Vincent Charvillat. 2018. Web-Based Configurable Image Annotations. In 2018 ACM Multimedia Conference on Multimedia Conference. ACM, 1368--1371.
[12]
Alexander Rakhlin and Sergey Nikolenko. 2018. Neuromation Research: Pediatric Bone Age Assessment with Convolutional Neural Networks. https://medium.com/neuromation-blog/. Accessed: Mar 2019.
[13]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In Advances in Neural Information Processing Systems (NIPS).
[14]
Bryan C Russell, Antonio Torralba, Kevin P Murphy, and William T Freeman. 2008. LabelMe: a database and web-based tool for image annotation. International journal of computer vision, Vol. 77, 1--3, 157--173.
[15]
Chuanhai Zhang, Kurt Loken, Zhiyu Chen, Zhiyong Xiao, and Gary Kunkel. 2018. Mask Editor: an Image Annotation Tool for Image Segmentation Tasks. arXiv preprint arXiv:1809.06461.

Cited By

View all
  • (2025)Utilising affordable smartphones and open-source time-lapse photography for pollinator image collection and annotationJournal of Pollination Ecology10.26786/1920-7603(2025)77837(1-21)Online publication date: 10-Jan-2025
  • (2025)Breast Cancer Detection and Localization Using a Novel Multimodal ApproachIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2024.350288374(1-13)Online publication date: 2025
  • (2025)Automatic Identification of Facial Tics Using Selfie-VideoIEEE Journal of Biomedical and Health Informatics10.1109/JBHI.2024.348828529:1(409-419)Online publication date: Jan-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '19: Proceedings of the 27th ACM International Conference on Multimedia
October 2019
2794 pages
ISBN:9781450368896
DOI:10.1145/3343031
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. audio/video annotation
  2. image annotation
  3. manual annotation

Qualifiers

  • Short-paper

Funding Sources

  • The Engineering and Physical Sciences Research Council (EPSRC)

Conference

MM '19
Sponsor:

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)226
  • Downloads (Last 6 weeks)20
Reflects downloads up to 11 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Utilising affordable smartphones and open-source time-lapse photography for pollinator image collection and annotationJournal of Pollination Ecology10.26786/1920-7603(2025)77837(1-21)Online publication date: 10-Jan-2025
  • (2025)Breast Cancer Detection and Localization Using a Novel Multimodal ApproachIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2024.350288374(1-13)Online publication date: 2025
  • (2025)Automatic Identification of Facial Tics Using Selfie-VideoIEEE Journal of Biomedical and Health Informatics10.1109/JBHI.2024.348828529:1(409-419)Online publication date: Jan-2025
  • (2025)A systematic survey of public computer vision datasets for precision livestock farmingComputers and Electronics in Agriculture10.1016/j.compag.2024.109718229(109718)Online publication date: Feb-2025
  • (2025)Monitoring installation of partially occluded subassemblies in modular construction factories using BIM, ray tracing, and computer visionConstruction Robotics10.1007/s41693-024-00148-49:1Online publication date: 3-Jan-2025
  • (2025)Deep Networks Based Approach for Automatic Counting Panicles on UAV Captured Paddy RGB ImageryMachine Learning and Principles and Practice of Knowledge Discovery in Databases10.1007/978-3-031-74633-8_20(301-311)Online publication date: 1-Jan-2025
  • (2024)Data Readiness and Data Exploration for Successful Power Line InspectionDeep Learning - Recent Findings and Research10.5772/intechopen.112637Online publication date: 29-May-2024
  • (2024)Clustering of lithotypes based on visual features of cores using convolutional neural networks and K-MeansKazakhstan journal for oil & gas industry10.54859/kjogi1087206:2(25-38)Online publication date: 12-Jul-2024
  • (2024)Soybean crop yield estimation using artificial intelligence techniquesActa Scientiarum. Agronomy10.4025/actasciagron.v46i1.6704046:1(e67040)Online publication date: 9-Aug-2024
  • (2024)Development of Automatic Tree Seedling Detection Method in UAV Aerial Images Using Deep Learning:深層学習を用いたUAV空撮画像からの植栽木自動確認手法の開発Journal of the Japanese Forest Society10.4005/jjfs.106.31106:2(31-36)Online publication date: 1-Feb-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media