Video Interaction Tools: A Survey of Recent Work

Authors:
Klaus Schoeffmann, Klagenfurt University, Austria
Marco A. Hudelist, Klagenfurt University, Austria
Jochen Huber, Synaptics, Switzerland

ACM Computing Surveys, Volume 48, Issue 1, Article No. 14, pp. 1–34. https://doi.org/10.1145/2808796

Published: 29 September 2015

Abstract

Digital video enables manifold ways of interacting with multimedia content. Over the last decade, many proposals for improving and enhancing video content interaction have been published. More recent work particularly leverages highly capable devices such as smartphones and tablets that embrace novel interaction paradigms, for example, touch, gesture-based, or physical content interaction. In this article, we survey literature at the intersection of Human-Computer Interaction and Multimedia. We integrate literature from video browsing and navigation, direct video manipulation, video content visualization, as well as interactive video summarization and interactive video retrieval. We classify the reviewed works by the underlying interaction method and discuss the improvements achieved so far. We also outline a set of open problems that the video interaction community should address in the future.

Zsolt Palotai, Miklos Lang, Andras Sarkany, Zoltan Toser, Daniel Sonntag, Takumi Toyama, and Andras Lorincz. 2014. Labelmovie: Semi-supervised machine annotation tool with quality assurance and crowd-sourcing options for videos. In 12th International Workshop on Content-Based Multimedia Indexing. 1--4. DOI:http://dx.doi.org/10.1109/CBMI.2014.6849850Google ScholarCross Ref
Amy Pavel, Colorado Reed, Björn Hartmann, and Maneesh Agrawala. 2014. Video digests: A browsable, skimmable format for informational lecture videos. In Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology (UIST’14). ACM, New York, NY, 573--582. DOI:http://dx.doi.org/10.1145/2642918.2647400 Google ScholarDigital Library
Benjamin Petry and Jochen Huber. 2015. Towards effective interaction with omnidirectional videos using immersive virtual reality headsets. In Proceedings of the 6th Augmented Human International Conference (AH’15). ACM, New York, NY, 217--218. DOI:http://dx.doi.org/10.1145/2735711.2735785 Google ScholarDigital Library
Suporn Pongnumkul, Jue Wang, Gonzalo Ramos, and Michael Cohen. 2010. Content-aware dynamic timeline for video browsing. In Proceedings of the 23rd Annual ACM Symposium on User Interface Software and Technology (UIST’10). ACM, New York, NY, 139--142. DOI:http://dx.doi.org/10.1145/1866029.1866053 Google ScholarDigital Library
Manfred J. Primus, Klaus Schoeffmann, and Laszlo Boszormenyi. 2013. Segmentation of recorded endoscopic videos by detecting significant motion changes. In 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI). 223--228. DOI:http://dx.doi.org/10.1109/CBMI.2013.6576587Google ScholarCross Ref
Luca Rossetto, Ivan Giangreco, Heiko Schuldt, Stéphane Dupont, Omar Seddati, Metin Sezgin, and Yusuf Sahillioğlu. 2015. IMOTION a content-based video retrieval engine. In MultiMedia Modeling, Xiangjian He, Suhuai Luo, Dacheng Tao, Changsheng Xu, Jie Yang, and Muhammad Abul Hasan (Eds.). Lecture Notes in Computer Science, Vol. 8936. Springer International Publishing, 255--260. DOI:http://dx.doi.org/10.1007/978-3-319-14442-9_24Google Scholar
Gustavo Alberto Rovelo Ruiz, Davy Vanacken, Kris Luyten, Francisco Abad, and Emilio Camahort. 2014. Multi-viewer gesture-based interaction for omni-directional video. In Proceedings of the 32nd Annual ACM Conference on Human Factors in Computing Systems (CHI’14). ACM, New York, NY, 4077--4086. DOI:http://dx.doi.org/10.1145/2556288.2557113 Google ScholarDigital Library
Wang Ruihu, Bi Hongwei, Liu Jiachen, Wu Lingguo, and Fang Bin. 2009. Interactive intelligent media player based on head motion recognition. In 2nd International Symposium on Electronic Commerce and Security (ISECS’09), Vol. 2, 81--84. DOI:http://dx.doi.org/10.1109/ISECS.2009.34 Google ScholarDigital Library
Klaus Schoeffmann. 2014. A user-centric media retrieval competition: The video browser showdown 2012--2014. IEEE MultiMedia 21, 4 (Oct 2014), 8--13. DOI:http://dx.doi.org/10.1109/MMUL.2014.56Google ScholarCross Ref
Klaus Schoeffmann, David Ahlström, Werner Bailer, Claudiu Cobârzan, Frank Hopfgartner, Kevin McGuinness, Cathal Gurrin, Christian Frisson, Duy-Dinh Le, Manfred Del Fabro, Hongliang Bai, and Wolfgang Weiss. 2013. The video browser showdown: A live evaluation of interactive video search tools. International Journal of Multimedia Information Retrieval 3 (2013), 1--15. DOI:http://dx.doi.org/10.1007/s13735-013-0050-8Google Scholar
Klaus Schoeffmann, David Ahlström, and Laszlo Böszörmenyi. 2012. Video browsing with a 3D thumbnail ring arranged by color similarity. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 660--661. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_70 Google ScholarDigital Library
Klaus Schoeffmann, David Ahlström, and Marco A. Hudelist. 2014a. 3D Interfaces to Improve the Performance of Visual Known-Item Search. IEEE Transactions on Multimedia 16 7 (2014), 1942--1951. DOI:http://dx.doi.org/10.1109/TMM.2014.2333666Google Scholar
Klaus Schoeffmann and Werner Bailer. 2012. Video browser showdown. ACM SIGMultimedia Records 4, 2 (2012), 1--2. Google ScholarDigital Library
Klaus Schoeffmann and Laszlo Boeszoermenyi. 2011. Image and video browsing with a cylindrical 3D storyboard. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval (ICMR’11). ACM, New York, NY, Article 63, 2 pages. DOI:http://dx.doi.org/10.1145/1991996.1992059 Google ScholarDigital Library
Klaus Schoeffmann, Kevin Chromik, and Laszlo Böszörmenyi. 2014b. Video navigation on tablets with multi-touch gestures. In Proceedings of the 3rd International Workshop on Emerging Multimedia Systems and Applications (EMSA) at the IEEE International Conference on Multimedia & Expo (ICME’’14). IEEE, 6.Google ScholarCross Ref
Klaus Schoeffmann and Claudiu Cobârzan. 2013. An evaluation of interactive search with modern video players. In Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW). 1--4. DOI:http://dx.doi.org/10.1109/ICMEW.2013.6618282Google ScholarCross Ref
Klaus Schoeffmann, Manfred Del Fabro, Tibor Szkaliczki, Laszlo Böszörmenyi, and Jörg Keckstein. 2014c. Keyframe extraction in endoscopic video. Multimedia Tools and Applications, 1--20. DOI:http://dx.doi.org/10.1007/s11042-014-2224-7Google Scholar
Klaus Schoeffmann, Frank Hopfgartner, Oge Marques, Laszlo Boeszoermenyi, and Joemon M. Jose. 2010a. Video browsing interfaces and applications: A review. SPIE Reviews 1, 1 (2010), 018004. DOI:http://dx.doi.org/10.1117/6.0000005Google Scholar
Klaus Schoeffmann, Mario Taschwer, and Laszlo Boeszoermenyi. 2010b. The video explorer: A tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the First Annual ACM SIGMM Conference on Multimedia Systems (MMSys’10). ACM, New York, NY, 247--258. DOI:http://dx.doi.org/10.1145/1730836.1730867 Google ScholarDigital Library
David Scott, Junlin Guo, Colum Foley, Frank Hopfgartner, Cathal Gurrin, and Alan F. Smeaton. 2011. TRECVid 2011 experiments at Dublin city university. In TRECVID’11.Google Scholar
David Scott, Jinlin Guo, Cathal Gurrin, Frank Hopfgartner, Kevin McGuinness, Noel E. O’Connor, Alan F. Smeaton, Yang Yang, and Zhenxing Zhang. 2013. DCU at MMM 2013 video browser showdown. In Advances in Multimedia Modeling, Shipeng Li, Abdulmotaleb Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, and Cathal Gurrin (Eds.). Lecture Notes in Computer Science, Vol. 7733. Springer, Berlin, 541--543. DOI:http://dx.doi.org/10.1007/978-3-642-35728-2_63Google Scholar
David Scott, Jinlin Guo, Hongyi Wang, Yang Yang, Frank Hopfgartner, and Cathal Gurrin. 2012. Clipboard: A visual search and browsing engine for tablet and PC. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, AlexanderG. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 646--648. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_65 Google ScholarDigital Library
David Scott, Zhenxing Zhang, Rami Albatal, Kevin McGuinness, Esra Acar, Frank Hopfgartner, Cathal Gurrin, Noel E. O’Connor, and Alan F. Smeaton. 2014. Audio-visual classification video browser. In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8326. Springer International Publishing, 398--401. DOI:http://dx.doi.org/10.1007/978-3-319-04117-9_45 Google ScholarDigital Library
Jianbo Shi and Carlo Tomasi. 1994. Good features to track. In Proceedings of the 1994 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’94). 593--600. DOI:http://dx.doi.org/10.1109/CVPR.1994.323794Google Scholar
Mats Sjöoberg, Markus Koskela, Milen Chechev, and Jorma Laaksonen. 2010. PicSOM experiments in TRECVID 2010. In TRECVID’10.Google Scholar
Alan F. Smeaton, Colum Foley, Daragh Byrne, and Gareth J. F. Jones. 2008. iBingo mobile collaborative search. In Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval (CIVR’08). ACM, New York, NY, 547--548. DOI:http://dx.doi.org/10.1145/1386352.1386424 Google ScholarDigital Library
Alan F. Smeaton, Paul Over, and Wessel Kraaij. 2006. Evaluation campaigns and TRECVid. In Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval (MIR’06). ACM, New York, NY, 321--330. DOI:http://dx.doi.org/10.1145/1178677.1178722 Google ScholarDigital Library
Michael A. Smith and Takeo Kanade. 1998. Video skimming and characterization through the combination of image and language understanding. In Proceedings of the 1998 IEEE International Workshop on Content-Based Access of Image and Video Database. 61--70. DOI:http://dx.doi.org/10.1109/CAIVD.1998.646034 Google ScholarDigital Library
Cees G. M. Snoek, Koen E. A. van de Sande, Ork de Rooij, Bouke Huurnink, Jasper R. R. Uijlings, Michiel van Liempt, Miguel Bugalho, Isabel Trancoso, Fei Yan, Muhammad A. Tahir, and others. 2009. The mediamill TRECVID 2009 semantic video search engine. In TRECVID Workshop.Google Scholar
Cees G. M. Snoek, Marcel Worring, Ork de Rooij, Koen E. A. van de Sande, Rong Yan, and Alexander G. Hauptmann. 2008. VideOlympics: Real-time evaluation of multimedia retrieval systems. IEEE MultiMedia 15, 1 (Jan. 2008), 86--91. DOI:http://dx.doi.org/10.1109/MMUL.2008.21 Google ScholarDigital Library
Cees G. M. Snoek, Marcel Worring, Dennis C. Koelma, and Arnold W. M. Smeulders. 2007. A learned lexicon-driven paradigm for interactive video retrieval. IEEE Transactions on Multimedia 9, 2 (Feb. 2007), 280--292. DOI:http://dx.doi.org/10.1109/TMM.2006.886275 Google ScholarDigital Library
Deqing Sun, Siegmar Roth, and Michael J. Black. 2010. Secrets of optical flow estimation and their principles. In Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2432--2439. DOI:http://dx.doi.org/10.1109/CVPR.2010.5539939Google Scholar
Mario Taschwer. 2012. A key-frame-oriented video browser. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 655--657. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_68 Google ScholarDigital Library
Ba Tu Truong and Svetha Venkatesh. 2007. Video abstraction: A systematic review and classification. ACM Transactions on Multimedia Computing Communications and Applications 3, 1, Article 3 (Feb. 2007). DOI:http://dx.doi.org/10.1145/1198302.1198305 Google ScholarDigital Library
Carles Ventura, Manel Martos, Xavier Giró-i Nieto, Verónica Vilaplana, and Ferran Marqués. 2012. Hierarchical navigation and visual search for video keyframe retrieval. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 652--654. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_67 Google ScholarDigital Library
Marie-luce Viaud, Olivier Buisson, Agnes Saulnier, and Clement Guenais. 2010. Video exploration: From multimedia content analysis to interactive visualization. In Proceedings of the International Conference on Multimedia (MM’10). ACM, New York, NY, 1311--1314. DOI:http://dx.doi.org/10.1145/1873951.1874209 Google ScholarDigital Library
Marie-Luce Viaud, Jéröme Thièvre, Hervé Goëau, Agnes Saulnier, and Olivier Buisson. 2008. Interactive components for visual exploration of multimedia archives. In Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval (CIVR’08). ACM, New York, NY, 609--616. DOI:http://dx.doi.org/10.1145/1386352.1386440 Google ScholarDigital Library
Stefanos Vrochidis, Anastasia Moumtzidou, Paul King, Anastasios Dimou, Vasileios Mezaris, and Ioannis Kompatsiaris. 2010. VERGE: A video interactive retrieval engine. In Proceedings of the 2010 International Workshop on Content-Based Multimedia Indexing (CBMI). 1--6. DOI:http://dx.doi.org/10.1109/CBMI.2010.5529884Google ScholarCross Ref
Marcel Worring, Paul Sajda, Simone Santini, David A. Shamma, Alan F. Smeaton, and Qiang Yang. 2012. Where is the user in multimedia retrieval? IEEE MultiMedia 19, 4 (Oct. 2012), 6--10. DOI:http://dx.doi.org/10.1109/MMUL.2012.53 Google ScholarDigital Library
Yue Wu, Tao Mei, Nenghai Yu, and Shipeng Li. 2012. Accelerometer-based single-handed video browsing on mobile devices: Design and user studies. In Proceedings of the 4th International Conference on Internet Multimedia Computing and Service (ICIMCS’12). ACM, New York, NY, 157--160. DOI:http://dx.doi.org/10.1145/2382336.2382381 Google ScholarDigital Library
Qing Xu, Yu Liu, Xiu Li, Zhen Yang, Jie Wang, Mateu Sbert, and Riccardo Scopigno. 2014. Browsing and exploration of video sequences: A new scheme for key frame extraction and 3D visualization using entropy based Jensen divergence. Information Sciences 278 (2014), 736--756. DOI:http://dx.doi.org/10.1016/j.ins.2014.03.088Google ScholarCross Ref
Qing Xu, Pengcheng Wang, Bin Long, M. Sbert, M. Feixas, and R. Scopigno. 2010. Selection and 3D visualization of video key frames. In 2010 IEEE International Conference on Systems Man and Cybernetics (SMC). 52--59. DOI:http://dx.doi.org/10.1109/ICSMC.2010.5642204Google Scholar
Jin Yuan, Huanbo Luan, Dejun Hou, Han Zhang, Yan-Tao Zheng, Zheng-Jun Zha, and Tat-Seng Chua. 2012. Video browser showdown by NUS. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 642--645. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_64 Google ScholarDigital Library
Jin-Kai Zhang, Cui-Xia Ma, Yong-Jin Liu, Qiu-Fang Fu, and Xiao-Lan Fu. 2013a. Collaborative interaction for videos on mobile devices based on sketch gestures. Journal of Computer Science and Technology 28, 5 (2013), 810--817. DOI:http://dx.doi.org/10.1007/s11390-013-1379-4Google ScholarCross Ref
Lei Zhang, Qian-Kun Xu, Lei-Zheng Nie, and Hua Huang. 2013b. VideoGraph: A non-linear video representation for efficient exploration. The Visual Computer 30, 10 (2013), 1123--1132. DOI:http://dx.doi.org/10.1007/s00371-013-0882-5 Google ScholarDigital Library
Yuan Zhou and Takashi Yukawa. 2011. A touch-panel based user interface and utilization of user’s memories for known-item search (KIS) task in TRECVID 2011. In TRECVID’11.Google Scholar

Index Terms

Video Interaction Tools: A Survey of Recent Work
1. Computing methodologies
  1. Artificial intelligence
    1. Philosophical/theoretical foundations of artificial intelligence
      1. Cognitive science
2. Human-centered computing
  1. Human computer interaction (HCI)
    1. HCI design and evaluation methods

Published in
ACM Computing Surveys Volume 48, Issue 1
September 2015
592 pages
ISSN:0360-0300
EISSN:1557-7341
DOI:10.1145/2808687
Editor:
Sartaj Sahni
Department of Computer and Information Science and Engineering / University of Florida / Gainesville, FL 32611
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 29 September 2015
- Accepted: 1 May 2015
- Revised: 1 March 2015
- Received: 1 July 2014
Author Tags
Video search and retrieval
human-computer interaction
mobile devices
Qualifiers
- survey
- Research
- Refereed