Abstract
Digital video enables manifold ways of multimedia content interaction. Over the last decade, many proposals for improving and enhancing video content interaction were published. More recent work particularly leverages on highly capable devices such as smartphones and tablets that embrace novel interaction paradigms, for example, touch, gesture-based or physical content interaction. In this article, we survey literature at the intersection of Human-Computer Interaction and Multimedia. We integrate literature from video browsing and navigation, direct video manipulation, video content visualization, as well as interactive video summarization and interactive video retrieval. We classify the reviewed works by the underlying interaction method and discuss the achieved improvements so far. We also depict a set of open problems that the video interaction community should address in future.
- Brett Adams, Stewart Greenhill, and Svetha Venkatesh. 2012. Towards a video browser for the digital native. In Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops (ICMEW). 127--132. DOI:http://dx.doi.org/10.1109/ICMEW.2012.29 Google ScholarDigital Library
- John Adcock, Matthew Cooper, and Jeremy Pickens. 2008. Experiments in interactive video search by addition and subtraction. In Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval (CIVR’08). ACM, New York, NY, 465--474. DOI:http://dx.doi.org/10.1145/1386352.1386412 Google ScholarDigital Library
- Abir Al-Hajri, Gregor Miller, Sidney Fels, and Matthew Fong. 2013. Video navigation with a personal viewing history. In Human-Computer Interaction INTERACT 2013, Paula Kotz, Gary Marsden, Gitte Lindgaard, Janet Wesson, and Marco Winckler (Eds.). Lecture Notes in Computer Science, Vol. 8119. Springer, Berlin, 352--369. DOI:http://dx.doi.org/10.1007/978-3-642-40477-1_22Google ScholarCross Ref
- Robin Aly, Kevin McGuinness, Shu Chen, Noel E. O’Conner, Ken Chatfield, Omkar M. Parkhi, Relja Arandjelovic, Andrew Zisserman, Basura Fernando, Tinne Tuytelaars, Dan Oneata, Matthijs Douze, Jerome Revaud, Jochen Schwenninger, Danila Potapov, Heng Wang, Zaid Harchaoui, Jakob Verbeek, and Cordelia Schmid. 2012. AXES at TRECVid 2012: KIS, INS, and MED. In TRECVID’12.Google Scholar
- Leif Azzopardi, Douglas Dowie, and Kelly Ann Marshall. 2012. YooSee: A video browsing application for young children. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’12). ACM, New York, NY, 1017--1017. DOI:http://dx.doi.org/10.1145/2348283.2348442 Google ScholarDigital Library
- Hongliang Bai, Lezi Wang, Yuan Dong, and Kun Tao. 2013. Interactive video retrieval using combination of semantic index and instance search. In Advances in Multimedia Modeling, Shipeng Li, Abdulmotaleb Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, and Cathal Gurrin (Eds.). Lecture Notes in Computer Science, Vol. 7733. Springer, Berlin, 554--556. DOI:http://dx.doi.org/10.1007/978-3-642-35728-2_67Google Scholar
- Werner Bailer, Wolfgang Weiss, Christian Schober, and Georg Thallinger. 2012. A video browsing tool for content management in media post-production. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, AlexanderG. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 658--659. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_69 Google ScholarDigital Library
- Werner Bailer, Wolfgang Weiss, Christian Schober, and Georg Thallinger. 2013. An approach for browsing video collections in media production. In Advances in Multimedia Modeling, Shipeng Li, Abdulmotaleb Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, and Cathal Gurrin (Eds.). Lecture Notes in Computer Science, Vol. 7733. Springer, Berlin, 538--540. DOI:http://dx.doi.org/10.1007/978-3-642-35728-2_62Google Scholar
- Werner Bailer, Wolfgang Weiss, Christian Schober, and Georg Thallinger. 2014. Browsing linked video collections for media production. In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8326. Springer International Publishing, 407--410. DOI:http://dx.doi.org/10.1007/978-3-319-04117-9_47 Google ScholarDigital Library
- KaiUwe Barthel, Nico Hezel, and Radek Mackowiak. 2015. Graph-based browsing for large video collections. In MultiMedia Modeling, Xiangjian He, Suhuai Luo, Dacheng Tao, Changsheng Xu, Jie Yang, and MuhammadAbul Hasan (Eds.). Lecture Notes in Computer Science, Vol. 8936. Springer International Publishing, 237--242. DOI:http://dx.doi.org/10.1007/978-3-319-14442-9_21Google Scholar
- Frank R. Bentley and Michael Groble. 2009. TuVista: Meeting the multimedia needs of mobile sports fans. In Proceedings of the 17th ACM International Conference on Multimedia. ACM, 471--480. Google ScholarDigital Library
- Adam Blazek, Jakub Lokoc, Filip Matzner, and Tomas Skopal. 2015. Enhanced signature-based video browser. In MultiMedia Modeling, Xiangjian He, Suhuai Luo, Dacheng Tao, Changsheng Xu, Jie Yang, and MuhammadAbul Hasan (Eds.). Lecture Notes in Computer Science, Vol. 8936. Springer International Publishing, 243--248. DOI:http://dx.doi.org/10.1007/978-3-319-14442-9_22Google Scholar
- Christoph Brachmann and Rainer Malaka. 2009. Keyframe-less integration of semantic information in a video player interface. In Proceedings of the 7th European Conference on European Interactive Television Conference (EuroITV’09). ACM, New York, NY, 137--140. DOI:http://dx.doi.org/10.1145/1542084.1542109 Google ScholarDigital Library
- Shelley Buchinger, Ewald Hotop, Helmut Hlavacs, Francesca De Simone, and Touradj Ebrahimi. 2010. Gesture and touch controlled video player interface for mobile devices. In Proceedings of the International Conference on Multimedia (MM’10). ACM, New York, NY, 699--702. DOI:http://dx.doi.org/10.1145/1873951.1874055 Google ScholarDigital Library
- Andrei Bursuc, Titus Zaharia, and Françoise Prêteux. 2010. Mobile video browsing and retrieval with the OVIDIUS platform. In Proceedings of the International Conference on Multimedia (MM’10). ACM, New York, NY, 1659--1662. DOI:http://dx.doi.org/10.1145/1873951.1874315 Google ScholarDigital Library
- Andrei Bursuc, Titus Zaharia, and Françoise Prêteux. 2012. OVIDIUS: A web platform for video browsing and search. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 649--651. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_66 Google ScholarDigital Library
- Juan Casares, A. Chris Long, Brad A. Myers, Rishi Bhatnagar, Scott M. Stevens, Laura Dabbish, Dan Yocum, and Albert Corbett. 2002. Simplifying video editing using metadata. In Proceedings of the 4th Conference on Designing Interactive Systems: Processes, Practices, Methods, and Techniques. ACM, 157--166. Google ScholarDigital Library
- Renan G. Cattelan, Cesar Teixeira, Rudinei Goularte, and Maria Da Graça C. Pimentel. 2008. Watch-and-comment as a paradigm toward ubiquitous interactive video editing. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 4, 4 (2008), 28. Google ScholarDigital Library
- Lekha Chaisorn, Kong-Wah Wan, Yan-Tao Zheng, Yongwei Zhu, Tian-Shiang Kok, Hui-Li Tan, Zixiang Fu, and Susanna Bolling. 2010. TRECVID 2010 known-item search (KIS) task by I2R. In TRECVID’10.Google Scholar
- Bisheng Chen, Jingdong Wang, Qinghua Huang, and Tao Mei. 2012. Personalized video recommendation through tripartite graph propagation. In Proceedings of the 20th ACM International Conference on Multimedia (MM’12). ACM, New York, NY, 1133--1136. DOI:http://dx.doi.org/10.1145/2393347.2396401 Google ScholarDigital Library
- Xiu Y. Chen and Zary Segall. 2009. XV-Pod: An emotion aware, affective mobile video player. In 2009 WRI World Congress on Computer Science and Information Engineering, Vol. 3. 277--281. DOI:http://dx.doi.org/10.1109/CSIE.2009.982 Google ScholarDigital Library
- Michael G. Christel and Rong Yan. 2007. Merging storyboard strategies and automatic retrieval for improving interactive video search. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval (CIVR’07). ACM, New York, NY, 486--493. DOI:http://dx.doi.org/10.1145/1282280.1282351 Google ScholarDigital Library
- Claudiu Cobârzan, Marco A. Hudelist, and Manfred Del Fabro. 2014. Content-based video browsing with collaborating mobile clients. In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8326. Springer International Publishing, 402--406. DOI:http://dx.doi.org/10.1007/978-3-319-04117-9_46 Google ScholarDigital Library
- Claudiu Cobârzan and Klaus Schoeffmann. 2014. How do users search with basic HTML5 video players? In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8325. Springer International Publishing, 109--120. DOI:http://dx.doi.org/10.1007/978-3-319-04114-8_10 Google ScholarDigital Library
- Collabracam. 2015. http://collabracam.com/.Google Scholar
- Peng Cui, Zhiyu Wang, and Zhou Su. 2014. What videos are similar with you?: Learning a common attributed representation for video recommendation. In Proceedings of the ACM International Conference on Multimedia (MM’14). ACM, New York, NY, 597--606. DOI:http://dx.doi.org/10.1145/2647868.2654946 Google ScholarDigital Library
- Bruna C. R. Cunha, Diogo Pedrosa, Rudinei Goularte, and Maria da Graça Campos Pimentel. 2012. Video annotation and navigation on mobile devices. In Proceedings of the 18th Brazilian Symposium on Multimedia and the Web (WebMedia’12). ACM, New York, NY, 261--264. DOI:http://dx.doi.org/10.1145/2382636.2382691 Google ScholarDigital Library
- Christoph Czepa, Shelley Buchinger, Helmut Hlavacs, Ewald Hotop, and Yohann Pitrey. 2012. Towards an energy-efficient attention-aware mobile video player with sensor and face detection support. In 2012 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM). 1--6. DOI:http://dx.doi.org/10.1109/WoWMoM.2012.6263801Google ScholarCross Ref
- Stamatia Dasiopoulou, Eirini Giannakidou, Georgios Litos, Polyxeni Malasioti, and Yiannis Kompatsiaris. 2011. A survey of semantic image and video annotation tools. In Knowledge-Driven Multimedia Information Extraction and Ontology Evolution, Georgios Paliouras, Constantine D. Spyropoulos, and George Tsatsaronis (Eds.). Lecture Notes in Computer Science, Vol. 6050. Springer, Berlin, 196--239. DOI:http://dx.doi.org/10.1007/978-3-642-20795-2_8 Google ScholarDigital Library
- James Davidson, Benjamin Liebald, Junning Liu, Palash Nandy, Taylor Van Vleet, Ullas Gargi, Sujoy Gupta, Yu He, Mike Lambert, Blake Livingston, and Dasarathi Sampath. 2010. The YouTube video recommendation system. In Proceedings of the 4th ACM Conference on Recommender Systems (RecSys’10). ACM, New York, NY, 293--296. DOI:http://dx.doi.org/10.1145/1864708.1864770 Google ScholarDigital Library
- Ork de Rooij, Cees G. M. Snoek, and Marcel Worring. 2008. Balancing thread based navigation for targeted video search. In Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval (CIVR’08). ACM, New York, NY, 485--494. DOI:http://dx.doi.org/10.1145/1386352.1386414 Google ScholarDigital Library
- Ork de Rooij, Cees G. M. Snoek, and Marcel Worring. 2007. Query on demand video browsing. In Proceedings of the 15th International Conference on Multimedia (MULTIMEDIA’07). ACM, New York, NY, 811--814. DOI:http://dx.doi.org/10.1145/1291233.1291417 Google ScholarDigital Library
- Ork de Rooij, J. J. van Wijk, and M. Worring. 2010. Mediatable: Interactive categorization of multimedia collections. IEEE Computer Graphics and Applications 30, 5 (Sept. 2010), 42--51. DOI:http://dx.doi.org/10.1109/MCG.2010.66 Google ScholarDigital Library
- Manfred Del Fabro and Laszlo Böszörmenyi. 2012. AAU video browser: Non-sequential hierarchical video browsing without content analysis. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 639--641. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_63 Google ScholarDigital Library
- Manfred Del Fabro, Bernd Münzer, and Laszlo Böszörmenyi. 2013. AAU video browser with augmented navigation bars. In Advances in Multimedia Modeling, Shipeng Li, Abdulmotaleb Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, and Cathal Gurrin (Eds.). Lecture Notes in Computer Science, Vol. 7733. Springer, Berlin, 544--546. DOI:http://dx.doi.org/10.1007/978-3-642-35728-2_64Google Scholar
- Manfred Del Fabro, Mathias Lux, Klaus Schoeffmann, and Mario Taschwer. 2012. ITEC-UNIKLU known-item search submission 2012. In TRECVID’12.Google Scholar
- Marco de Sá, David A. Shamma, and Elizabeth F. Churchill. 2014. Live mobile collaboration for video production: Design, guidelines, and requirements. Personal and Ubiquitous Computing 18, 3 (2014), 693--707. Google ScholarDigital Library
- Niloofar Dezuli, Jochen Huber, Elizabeth F. Churchill, and Max Mühlhäuser. 2013. CoStream: Co-construction of shared experiences through mobile live video sharing. In Proceedings of the 27th International BCS Human Computer Interaction Conference. British Computer Society, 6. Google ScholarDigital Library
- Arvid Engström, Mattias Esbjörnsson, and Oskar Juhlin. 2008. Mobile collaborative live video mixing. In Proceedings of the 10th International Conference on Human Computer Interaction with Mobile Devices and Services. ACM, 157--166. Google ScholarDigital Library
- Arvid Engström, Goranka Zoric, Oskar Juhlin, and Ramin Toussi. 2012. The mobile vision mixer: A mobile network based live video broadcasting system in your mobile phone. In Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia. ACM, 18. Google ScholarDigital Library
- Colum Foley, Jinlin Guo, David Scott, Peter Wilkins, Cathal Gurrin, Alan F. Smeaton, Paul Ferguson, Kealan McCusker, Emma Sesmero Diaz, Kevin McGuinness, Noel E. O’Connor, Xavier Giró i Nieto, and Ferran Marqués. 2010. TRECVID 2010 experiments at Dublin city university. In TRECVID’10.Google Scholar
- Gerald Friedland, Luke Gottlieb, and Adam Janin. 2009. Joke-o-mat: Browsing sitcoms punchline by punchline. In Proceedings of the 17th ACM International Conference on Multimedia (MM’09). ACM, New York, NY, 1115--1116. DOI:http://dx.doi.org/10.1145/1631272.1631525 Google ScholarDigital Library
- Christian Frisson, Stéphane Dupont, Alexis Moinet, Cécile Picard-Limpens, Thierry Ravet, Xavier Siebert, and Thierry Dutoit. 2013. VideoCycle: User-friendly navigation by similarity in video databases. In Advances in Multimedia Modeling, Shipeng Li, Abdulmotaleb Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, and Cathal Gurrin (Eds.). Lecture Notes in Computer Science, Vol. 7733. Springer, Berlin, 550--553. DOI:http://dx.doi.org/10.1007/978-3-642-35728-2_66Google Scholar
- Vineet Gandhi, Remi Ronfard, and Michael Gleicher. 2014. Multi-clip video editing from a single viewpoint. In Proceedings of the 11th European Conference on Visual Media Production. ACM, 9. Google ScholarDigital Library
- Roman Ganhör. 2012. ProPane: Fast and precise video browsing on mobile phones. In Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia (MUM’12). ACM, New York, NY, Article 20, 8 pages. DOI:http://dx.doi.org/10.1145/2406367.2406392 Google ScholarDigital Library
- P. Geetha and Vasumathi Narayanan. 2008. A survey of content-based video retrieval. Journal of Computer Science 4, 6 (2008), 474.Google ScholarCross Ref
- Andreas Girgensohn, John Boreczky, Patrick Chiu, John Doherty, Jonathan Foote, Gene Golovchinsky, Shingo Uchihashi, and Lynn Wilcox. 2000. A semi-automatic approach to home video editing. In Proceedings of the 13th Annual ACM Symposium on UserInterface Software and Technology. ACM, 81--89. Google ScholarDigital Library
- Andreas Girgensohn, Frank Shipman, and Lynn Wilcox. 2011. Adaptive clustering and interactive visualizations to support the selection of video clips. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval (ICMR’11). ACM, New York, NY, Article 34, 8 pages. DOI:http://dx.doi.org/10.1145/1991996.1992030 Google ScholarDigital Library
- Mieke Haesen, Jan Meskens, Kris Luyten, Karin Coninx, Jan Hendrik Becker, Tinne Tuytelaars, Gert-Jan Poulisse, Phi The Pham, and Marie-Francine Moens. 2013. Finding a needle in a haystack: An interactive video archive explorer for professional video searchers. Multimedia Tools and Applications 63, 2 (2013), 331--356. DOI:http://dx.doi.org/10.1007/s11042-011-0809-y Google ScholarDigital Library
- Peter E. Hart, Kurt Pierson, and Jonathan J. Hull. 2005. Refocusing multimedia research on short clips. IEEE MultiMedia 12, 3 (July 2005), 8--13. DOI:http://dx.doi.org/10.1109/MMUL.2005.55 Google ScholarDigital Library
- Luis Herranz and Jose M. Martinez. 2010. A framework for scalable summarization of video. IEEE Transactions on Circuits and Systems for Video Technology 20, 9 (Sept. 2010), 1265--1270. DOI:http://dx.doi.org/10.1109/TCSVT.2010.2057020 Google ScholarDigital Library
- Markus Hoeferlin, Benjamin Hoeferlin, Gunther Heidemann, and Daniel Weiskopf. 2013. Interactive schematic summaries for faceted exploration of surveillance video. IEEE Transactions on Multimedia 15, 4 (2013), 908--920. Google ScholarDigital Library
- Weiming Hu, Nianhua Xie, Li Li, Xianglin Zeng, and S. Maybank. 2011. A survey on visual content-based video indexing and retrieval. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 41, 6 (Nov. 2011), 797--819. DOI:http://dx.doi.org/10.1109/TSMCC.2011.2109710 Google ScholarDigital Library
- Xian-Sheng Hua, Lie Lu, and Hong-Jiang Zhang. 2003. AVE: Automated home video editing. In Proceedings of the 11th ACM International Conference on Multimedia. ACM, 490--497. Google ScholarDigital Library
- Jochen Huber, Jürgen Steimle, Roman Lissermann, Simon Olberding, and Max Mühlhäuser. 2010b. Wipe’n’Watch: Spatial interaction techniques for interrelated video collections on mobile devices. In Proceedings of the 24th BCS Interaction Specialist Group Conference (BCS’10). 423--427. http://dl.acm.org/citation.cfm?id=2146303.2146367 Google ScholarDigital Library
- Jochen Huber, Jürgen Steimle, and Max Mühlhäuser. 2010a. Toward more efficient user interfaces for mobile video browsing: An in-depth exploration of the design space. In Proceedings of the International Conference on Multimedia. ACM, 341--350. Google ScholarDigital Library
- Marco A. Hudelist, Claudiu Cobârzan, and Klaus Schoeffmann. 2014. OpenCV performance measurements on mobile devices. In Proceedings of the 4th ACM International Conference on Multimedia Retrieval (ICMR’14). ACM, New York, NY, 4. Google ScholarDigital Library
- Marco A. Hudelist, Klaus Schoeffmann, and Laszlo Boeszoermenyi. 2013a. Mobile video browsing with a 3D filmstrip. In Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval (ICMR’13). ACM, New York, NY, 299--300. DOI:http://dx.doi.org/10.1145/2461466.2461515 Google ScholarDigital Library
- Marco A. Hudelist, Klaus Schoeffmann, and Laszlo Boeszoermenyi. 2013b. Mobile video browsing with the thumbbrowser. In Proceedings of the 21st ACM International Conference on Multimedia (MM’13). ACM, New York, NY, 405--406. DOI:http://dx.doi.org/10.1145/2502081.2502242 Google ScholarDigital Library
- Wolfgang Hürst. 2006. Interactive audio-visual video browsing. In Proceedings of the 14th Annual ACM International Conference on Multimedia. ACM, 675--678. Google ScholarDigital Library
- Wolfgang Hürst and Dimitri Darzentas. 2012. HiStory: A hierarchical storyboard interface design for video browsing on mobile devices. In Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia (MUM’12). ACM, New York, NY, Article 17, 4 pages. DOI:http://dx.doi.org/10.1145/2406367.2406389 Google ScholarDigital Library
- Wolfgang Hürst, Georg Götz, and Philipp Jarvers. 2004. Advanced user interfaces for dynamic video browsing. In Proceedings of the 12th Annual ACM International Conference on Multimedia (MULTIMEDIA’04). ACM, New York, NY, 742--743. DOI:http://dx.doi.org/10.1145/1027527.1027694 Google ScholarDigital Library
- Wolfgang Hürst, Georg Götz, and Martina Welte. 2007. Interactive video browsing on mobile devices. In Proceedings of the 15th International Conference on Multimedia (MULTIMEDIA’07). ACM, New York, NY, 247--256. DOI:http://dx.doi.org/10.1145/1291233.1291284 Google ScholarDigital Library
- Wolfgang Hürst and Konrad Meier. 2008. Interfaces for timeline-based mobile video browsing. In Proceedings of the 16th ACM International Conference on Multimedia (MM’08). ACM, New York, NY, 469--478. DOI:http://dx.doi.org/10.1145/1459359.1459422 Google ScholarDigital Library
- Wolfgang Hürst and Philipp Merkle. 2008. One-handed mobile video browsing. In Proceedings of the 1st International Conference on Designing Interactive User Experiences for TV and Video (UXTV’08). ACM, New York, NY, 169--178. DOI:http://dx.doi.org/10.1145/1453805.1453839 Google ScholarDigital Library
- Wolfgang Hürst, Rob van de Werken, and Miklas Hoet. 2015. A storyboard-based interface for mobile video browsing. In MultiMedia Modeling, Xiangjian He, Suhuai Luo, Dacheng Tao, Changsheng Xu, Jie Yang, and Muhammad Abul Hasan (Eds.). Lecture Notes in Computer Science, Vol. 8936. Springer International Publishing, 261--265. DOI:http://dx.doi.org/10.1007/978-3-319-14442-9_25Google Scholar
- Cisco Visual Networking Index. 2013. Global mobile data traffic forecast update, 2012--2017.Google Scholar
- Dan Jackson, James Nicholson, Gerrit Stoeckigt, Rebecca Wrobel, Anja Thieme, and Patrick Olivier. 2013. Panopticon: A parallel video overview system. In Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology (UIST’13). ACM, New York, NY, 123--130. DOI:http://dx.doi.org/10.1145/2501988.2502038 Google ScholarDigital Library
- Alejandro Jaimes and Nicu Sebe. 2007. Multimodal human--computer interaction: A survey. Computer Vision and Image Understanding 108, 1--2 (2007), 116--134. DOI:http://dx.doi.org/10.1016/j.cviu.2006.10.019. Special Issue on Vision for Human-Computer Interaction. Google ScholarDigital Library
- Oskar Juhlin, Goranka Zoric, Arvid Engström, and Erika Reponen. 2014a. Video interaction: A research agenda. Personal and Ubiquitous Computing 18, 3 (2014), 685--692. Google ScholarDigital Library
- Oskar Juhlin, Goranka Zoric, Arvid Engström, and Erika Reponen. 2014b. Video interaction: A research agenda. Personal and Ubiquitous Computing 18, 3 (March 2014), 685--692. DOI:http://dx.doi.org/10.1007/s00779-013-0705-8 Google ScholarDigital Library
- Thorsten Karrer, Malte Weiss, Eric Lee, and Jan Borchers. 2008. DRAGON: A direct manipulation interface for frame-accurate in-scene video navigation. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI’08). ACM, New York, NY, 247--250. DOI:http://dx.doi.org/10.1145/1357054.1357097 Google ScholarDigital Library
- Thorsten Karrer, Moritz Wittenhagen, and Jan Borchers. 2009. PocketDRAGON: A direct manipulation video navigation interface for mobile devices. In Proceedings of the 11th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI’09). ACM, New York, NY, Article 47, 3 pages. DOI:http://dx.doi.org/10.1145/1613858.1613917 Google ScholarDigital Library
- Duy-Dinh Le, Vu Lam, Thanh Duc Ngo, Vinh Quang Tran, Vu Hoang Nguyen, Duc Anh Duong, and Shin’ichi Satoh. 2013. NII-UIT-VBS: A video browsing tool for known item search. In Advances in Multimedia Modeling, Shipeng Li, Abdulmotaleb Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, and Cathal Gurrin (Eds.). Lecture Notes in Computer Science, Vol. 7733. Springer, Berlin, 547--549. DOI:http://dx.doi.org/10.1007/978-3-642-35728-2_65Google Scholar
- Duy-Dinh Le, Cai-Zhi Zhu, Sebastien Poullot, Vu Q. Lam, Vu H. Nguyen, Nhan C. Duong, Thanh D. Ngo, Duc A. Duong, and Shin’ichi Satho. 2012. National Institute of Informatics, Japan at TRECVID 2012. In TRECVID’12.Google Scholar
- Roman Lissermann, Simon Olberding, Benjamin Petry, Max Mühlhäuser, and Jürgen Steimle. 2012. PaperVideo: Interacting with videos on multiple paper-like displays. In Proceedings of the 20th ACM International Conference on Multimedia (MM’12). ACM, New York, NY, 129--138. DOI:http://dx.doi.org/10.1145/2393347.2393372 Google ScholarDigital Library
- Suzanne Little, Iveel Jargalsaikhan, Cem Direkoglu, Noel E. O’Conner, Alan F. Smeaton, Kathy Clawson, Hao Li, Marcos Nieto, Aitor Rodriguez, Pedro Sanchez, Karina Villarroel Panzia, Ana Martinez Llorens, Roberto Gimenez, Raul Santuos de la Camara, and Anna Mereu. 2012. SAVASA project @ TRECVID 2012: Interactive surveillance event detection. In TRECVID’12.Google Scholar
- Jakub Lokoc, Adam Blazek, and Tomas Skopal. 2014. Signature-based video browser. In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8326. Springer International Publishing, 415--418. DOI:http://dx.doi.org/10.1007/978-3-319-04117-9_49 Google ScholarDigital Library
- David G. Lowe. 2004. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 2 (2004), 91--110. DOI:http://dx.doi.org/10.1023/B:VISI.0000029664.99615.94 Google ScholarDigital Library
- Huanbo Luan, Yan-Tao Zheng, Meng Wang, and Tat-Seng Chua. 2011. VisionGo: Towards video retrieval with joint exploration of human and computer. Information Sciences 181, 19 (2011), 4197--4213. DOI:http://dx.doi.org/10.1016/j.ins.2011.05.018 Google ScholarDigital Library
- Mathias Lux and Michael Riegler. 2013. Annotation of endoscopic videos on mobile devices: A bottom-up approach. In Proceedings of the 4th ACM Multimedia Systems Conference (MMSys’13). ACM, New York, NY, 141--145. DOI:http://dx.doi.org/10.1145/2483977.2483996 Google ScholarDigital Library
- Xiaoqiang Ma, Haiyang Wang, Haitao Li, Jiangchuan Liu, and Hongbo Jiang. 2014. Exploring sharing patterns for video recommendation on YouTube-like social media. Multimedia Systems 20, 6 (2014), 675--691. DOI:http://dx.doi.org/10.1007/s00530-013-0309-1 Google ScholarDigital Library
- Justin Matejka, Tovi Grossman, and George Fitzmaurice. 2012. Swift: Reducing the effects of latency in online video scrubbing. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI’12). ACM, New York, NY, 637--646. DOI:http://dx.doi.org/10.1145/2207676.2207766 Google ScholarDigital Library
- Justin Matejka, Tovi Grossman, and George Fitzmaurice. 2013. Swifter: Improved online video scrubbing. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI’13). ACM, New York, NY, 1159--1168. DOI:http://dx.doi.org/10.1145/2470654.2466149 Google ScholarDigital Library
- Kevin McGuinness, Robin Aly, Shu Chen, Mathieu Frappier, Martijn Kleppe, Hyowon Lee, Roeland Ordelman, Relja Arandjelović, Mayank Juneja, C. V. Jawahar, Andrea Vedaldi, Jochen Schwenninger, Sebastian Tschöpel, Daniel Schneider, Noel E. O’Conner, Andrew Zisserman, Alan Smeaton, and Henri Beunders. 2011. AXES at TRECVID 2011. In TRECVID’11.Google Scholar
- Mary Meeker and Liang Wu. 2013. Internet Trends D11 Conference.Google Scholar
- Tao Mei, Bo Yang, Xian-Sheng Hua, and Shipeng Li. 2011. Contextual video recommendation by multimodal relevance and user feedback. ACM Transactions on Information Systems 29, 2, Article 10 (April 2011), 24 pages. DOI:http://dx.doi.org/10.1145/1961209.1961213 Google ScholarDigital Library
- Britta Meixner, Johannes Köstler, and Harald Kosch. 2011. A mobile player for interactive non-linear video. In Proceedings of the 19th ACM International Conference on Multimedia (MM’11). ACM, New York, NY, 779--780. DOI:http://dx.doi.org/10.1145/2072298.2072453 Google ScholarDigital Library
- Gregor Miller, Sidney Fels, Matthias Finke, Will Motz, Walker Eagleston, and Chris Eagleston. 2009. MiniDiver: A novel mobile media playback interface for rich video content on an iPhone. In Proceedings of the 8th International Conference on Entertainment Computing (ICEC). Lecture Notes in Computer Science, Vol. 5709. Springer, Berlin, 98--109. DOI:http://dx.doi.org/10.1007/978-3-642-04052-8_9 Google ScholarDigital Library
- Arthur G. Money and Harry Agius. 2008. Video summarisation: A conceptual framework and survey of the state of the art. Journal of Visual Communication and Image Representation 19, 2 (2008), 121--143. DOI:http://dx.doi.org/10.1016/j.jvcir.2007.04.002 Google ScholarDigital Library
- Anastasia Moumtzidou, Konstantinos Avgerinakis, Evlampios Apostolidis, Vera Aleksić, Fotini Markatopoulou, Christina Papagiannopoulou, Stefanos Vrochidis, Vasileios Mezaris, Reinhard Busch, and Ioannis Kompatsiaris. 2014. VERGE: An interactive search engine for browsing video collections. In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8326. Springer International Publishing, 411--414. DOI:http://dx.doi.org/10.1007/978-3-319-04117-9_48 Google ScholarDigital Library
- Anastasia Moumtzidou, Konstantinos Avgerinakis, Evlampios Apostolidis, Fotini Markatopoulou, Konstantinos Apostolidis, Theodoros Mironidis, Stefanos Vrochidis, Vasileios Mezaris, Ioannis Kompatsiaris, and Ioannis Patras. 2015. VERGE: A multimodal interactive video search engine. In MultiMedia Modeling, Xiangjian He, Suhuai Luo, Dacheng Tao, Changsheng Xu, Jie Yang, and MuhammadAbul Hasan (Eds.). Lecture Notes in Computer Science, Vol. 8936. Springer International Publishing, 249--254. DOI:http://dx.doi.org/10.1007/978-3-319-14442-9_23Google Scholar
- Anastasia Moumtzidou, Nikolaos Gkalelis, Panagiotis Sidiropoulos, Michail Dimopoulos, Spiros Nikolopoulos, Stefanos Vrochidis, Vasileios Mezaris, and Ioannis Kompatsiaris. 2012. ITI-CERTH participation to TRECVID 2012. In TRECVID’12.Google Scholar
- Anastasia Moumtzidou, Panagiotis Sidiropoulos, Stefanos Vrochidis, Nikolaos Gkalelis, Spiros Nikolopoulos, Vasileios Mezaris, Ioannis Kompatsiaris, and Ioannis Patras. 2011. ITI-CERTH participation to TRECVID 2011. In TRECVID’11.Google Scholar
- Bernd Münzer, Klaus Schoeffmann, and Laszlo Boszormenyi. 2013. Improving encoding efficiency of endoscopic videos by using circle detection based border overlays. In 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW). 1--4. DOI:http://dx.doi.org/10.1109/ICMEW.2013.6618304Google ScholarCross Ref
- Bernd Münzer, Klaus Schoeffmann, Laslzo Böszörmenyi, J. F. Smulders, and Jack J. Jakimowicz. 2014. Investigation of the impact of compression on the perceptional quality of laparoscopic videos. In Proceedings of the Computer-Based Medical Systems (CBMS). 153--158. DOI:http://dx.doi.org/10.1109/CBMS.2014.58 Google ScholarDigital Library
- Luís A. R. Neng and Teresa Chambel. 2010. Get around 360° hypervideo. In Proceedings of the 14th International Academic MindTrek Conference: Envisioning Future Media Environments (MindTrek’10). ACM, New York, NY, 119--122. DOI:http://dx.doi.org/10.1145/1930488.1930512 Google ScholarDigital Library
- Thanh Duc Ngo, Vu Hoang Nguyen, Vu Lam, Sang Phan, Duy-Dinh Le, Duc Anh Duong, and Shin’ichi Satoh. 2014. NII-UIT: A tool for known item search by sequential pattern filtering. In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8326. Springer International Publishing, 419--422. DOI:http://dx.doi.org/10.1007/978-3-319-04117-9_50 Google ScholarDigital Library
- Thanh Duc Ngo, Vinh-Tiep Nguyen, Vu Hoang Nguyen, Duy-Dinh Le, Duc Anh Duong, and Shinichi Satoh. 2015. NII-UIT browser: A multimodal video search system. In MultiMedia Modeling, Xiangjian He, Suhuai Luo, Dacheng Tao, Changsheng Xu, Jie Yang, and Muhammad Abul Hasan (Eds.). Lecture Notes in Computer Science, Vol. 8936. Springer International Publishing, 278--281. DOI:http://dx.doi.org/10.1007/978-3-319-14442-9_28Google Scholar
- Cuong Nguyen, Yuzhen Niu, and Feng Liu. 2012. Video summagator: An interface for video summarization and navigation. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI’12). ACM, New York, NY, 647--650. DOI:http://dx.doi.org/10.1145/2207676.2207767 Google ScholarDigital Library
- Cuong Nguyen, Yuzhen Niu, and Feng Liu. 2013. Direct manipulation video navigation in 3D. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI’13). ACM, New York, NY, 1169--1172. DOI:http://dx.doi.org/10.1145/2470654.2466150 Google ScholarDigital Library
- Paul Over, George Awad, Martial Michel, Jonathan Fiscus, Greg Sanders, Wessel Kraaij, Alan F. Smeaton, and Georges Quéenot. 2013. TRECVID 2013—An overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID 2013. NIST.Google Scholar
- Zsolt Palotai, Miklos Lang, Andras Sarkany, Zoltan Toser, Daniel Sonntag, Takumi Toyama, and Andras Lorincz. 2014. Labelmovie: Semi-supervised machine annotation tool with quality assurance and crowd-sourcing options for videos. In 12th International Workshop on Content-Based Multimedia Indexing. 1--4. DOI:http://dx.doi.org/10.1109/CBMI.2014.6849850Google ScholarCross Ref
- Amy Pavel, Colorado Reed, Björn Hartmann, and Maneesh Agrawala. 2014. Video digests: A browsable, skimmable format for informational lecture videos. In Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology (UIST’14). ACM, New York, NY, 573--582. DOI:http://dx.doi.org/10.1145/2642918.2647400 Google ScholarDigital Library
- Benjamin Petry and Jochen Huber. 2015. Towards effective interaction with omnidirectional videos using immersive virtual reality headsets. In Proceedings of the 6th Augmented Human International Conference (AH’15). ACM, New York, NY, 217--218. DOI:http://dx.doi.org/10.1145/2735711.2735785 Google ScholarDigital Library
- Suporn Pongnumkul, Jue Wang, Gonzalo Ramos, and Michael Cohen. 2010. Content-aware dynamic timeline for video browsing. In Proceedings of the 23rd Annual ACM Symposium on User Interface Software and Technology (UIST’10). ACM, New York, NY, 139--142. DOI:http://dx.doi.org/10.1145/1866029.1866053 Google ScholarDigital Library
- Manfred J. Primus, Klaus Schoeffmann, and Laszlo Boszormenyi. 2013. Segmentation of recorded endoscopic videos by detecting significant motion changes. In 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI). 223--228. DOI:http://dx.doi.org/10.1109/CBMI.2013.6576587Google ScholarCross Ref
- Luca Rossetto, Ivan Giangreco, Heiko Schuldt, Stéphane Dupont, Omar Seddati, Metin Sezgin, and Yusuf Sahillioğlu. 2015. IMOTION a content-based video retrieval engine. In MultiMedia Modeling, Xiangjian He, Suhuai Luo, Dacheng Tao, Changsheng Xu, Jie Yang, and Muhammad Abul Hasan (Eds.). Lecture Notes in Computer Science, Vol. 8936. Springer International Publishing, 255--260. DOI:http://dx.doi.org/10.1007/978-3-319-14442-9_24Google Scholar
- Gustavo Alberto Rovelo Ruiz, Davy Vanacken, Kris Luyten, Francisco Abad, and Emilio Camahort. 2014. Multi-viewer gesture-based interaction for omni-directional video. In Proceedings of the 32nd Annual ACM Conference on Human Factors in Computing Systems (CHI’14). ACM, New York, NY, 4077--4086. DOI:http://dx.doi.org/10.1145/2556288.2557113 Google ScholarDigital Library
- Wang Ruihu, Bi Hongwei, Liu Jiachen, Wu Lingguo, and Fang Bin. 2009. Interactive intelligent media player based on head motion recognition. In 2nd International Symposium on Electronic Commerce and Security (ISECS’09), Vol. 2, 81--84. DOI:http://dx.doi.org/10.1109/ISECS.2009.34 Google ScholarDigital Library
- Klaus Schoeffmann. 2014. A user-centric media retrieval competition: The video browser showdown 2012--2014. IEEE MultiMedia 21, 4 (Oct 2014), 8--13. DOI:http://dx.doi.org/10.1109/MMUL.2014.56Google ScholarCross Ref
- Klaus Schoeffmann, David Ahlström, Werner Bailer, Claudiu Cobârzan, Frank Hopfgartner, Kevin McGuinness, Cathal Gurrin, Christian Frisson, Duy-Dinh Le, Manfred Del Fabro, Hongliang Bai, and Wolfgang Weiss. 2013. The video browser showdown: A live evaluation of interactive video search tools. International Journal of Multimedia Information Retrieval 3 (2013), 1--15. DOI:http://dx.doi.org/10.1007/s13735-013-0050-8Google Scholar
- Klaus Schoeffmann, David Ahlström, and Laszlo Böszörmenyi. 2012. Video browsing with a 3D thumbnail ring arranged by color similarity. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 660--661. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_70 Google ScholarDigital Library
- Klaus Schoeffmann, David Ahlström, and Marco A. Hudelist. 2014a. 3D Interfaces to Improve the Performance of Visual Known-Item Search. IEEE Transactions on Multimedia 16 7 (2014), 1942--1951. DOI:http://dx.doi.org/10.1109/TMM.2014.2333666Google Scholar
- Klaus Schoeffmann and Werner Bailer. 2012. Video browser showdown. ACM SIGMultimedia Records 4, 2 (2012), 1--2. Google ScholarDigital Library
- Klaus Schoeffmann and Laszlo Boeszoermenyi. 2011. Image and video browsing with a cylindrical 3D storyboard. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval (ICMR’11). ACM, New York, NY, Article 63, 2 pages. DOI:http://dx.doi.org/10.1145/1991996.1992059 Google ScholarDigital Library
- Klaus Schoeffmann, Kevin Chromik, and Laszlo Böszörmenyi. 2014b. Video navigation on tablets with multi-touch gestures. In Proceedings of the 3rd International Workshop on Emerging Multimedia Systems and Applications (EMSA) at the IEEE International Conference on Multimedia & Expo (ICME’’14). IEEE, 6.Google ScholarCross Ref
- Klaus Schoeffmann and Claudiu Cobârzan. 2013. An evaluation of interactive search with modern video players. In Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW). 1--4. DOI:http://dx.doi.org/10.1109/ICMEW.2013.6618282Google ScholarCross Ref
- Klaus Schoeffmann, Manfred Del Fabro, Tibor Szkaliczki, Laszlo Böszörmenyi, and Jörg Keckstein. 2014c. Keyframe extraction in endoscopic video. Multimedia Tools and Applications, 1--20. DOI:http://dx.doi.org/10.1007/s11042-014-2224-7Google Scholar
- Klaus Schoeffmann, Frank Hopfgartner, Oge Marques, Laszlo Boeszoermenyi, and Joemon M. Jose. 2010a. Video browsing interfaces and applications: A review. SPIE Reviews 1, 1 (2010), 018004. DOI:http://dx.doi.org/10.1117/6.0000005Google Scholar
- Klaus Schoeffmann, Mario Taschwer, and Laszlo Boeszoermenyi. 2010b. The video explorer: A tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the First Annual ACM SIGMM Conference on Multimedia Systems (MMSys’10). ACM, New York, NY, 247--258. DOI:http://dx.doi.org/10.1145/1730836.1730867 Google ScholarDigital Library
- David Scott, Junlin Guo, Colum Foley, Frank Hopfgartner, Cathal Gurrin, and Alan F. Smeaton. 2011. TRECVid 2011 experiments at Dublin city university. In TRECVID’11.Google Scholar
- David Scott, Jinlin Guo, Cathal Gurrin, Frank Hopfgartner, Kevin McGuinness, Noel E. O’Connor, Alan F. Smeaton, Yang Yang, and Zhenxing Zhang. 2013. DCU at MMM 2013 video browser showdown. In Advances in Multimedia Modeling, Shipeng Li, Abdulmotaleb Saddik, Meng Wang, Tao Mei, Nicu Sebe, Shuicheng Yan, Richang Hong, and Cathal Gurrin (Eds.). Lecture Notes in Computer Science, Vol. 7733. Springer, Berlin, 541--543. DOI:http://dx.doi.org/10.1007/978-3-642-35728-2_63Google Scholar
- David Scott, Jinlin Guo, Hongyi Wang, Yang Yang, Frank Hopfgartner, and Cathal Gurrin. 2012. Clipboard: A visual search and browsing engine for tablet and PC. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, AlexanderG. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 646--648. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_65 Google ScholarDigital Library
- David Scott, Zhenxing Zhang, Rami Albatal, Kevin McGuinness, Esra Acar, Frank Hopfgartner, Cathal Gurrin, Noel E. O’Connor, and Alan F. Smeaton. 2014. Audio-visual classification video browser. In MultiMedia Modeling, Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, and Noel O’Connor (Eds.). Lecture Notes in Computer Science, Vol. 8326. Springer International Publishing, 398--401. DOI:http://dx.doi.org/10.1007/978-3-319-04117-9_45 Google ScholarDigital Library
- Jianbo Shi and Carlo Tomasi. 1994. Good features to track. In Proceedings of the 1994 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’94). 593--600. DOI:http://dx.doi.org/10.1109/CVPR.1994.323794Google Scholar
- Mats Sjöoberg, Markus Koskela, Milen Chechev, and Jorma Laaksonen. 2010. PicSOM experiments in TRECVID 2010. In TRECVID’10.Google Scholar
- Alan F. Smeaton, Colum Foley, Daragh Byrne, and Gareth J. F. Jones. 2008. iBingo mobile collaborative search. In Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval (CIVR’08). ACM, New York, NY, 547--548. DOI:http://dx.doi.org/10.1145/1386352.1386424 Google ScholarDigital Library
- Alan F. Smeaton, Paul Over, and Wessel Kraaij. 2006. Evaluation campaigns and TRECVid. In Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval (MIR’06). ACM, New York, NY, 321--330. DOI:http://dx.doi.org/10.1145/1178677.1178722 Google ScholarDigital Library
- Michael A. Smith and Takeo Kanade. 1998. Video skimming and characterization through the combination of image and language understanding. In Proceedings of the 1998 IEEE International Workshop on Content-Based Access of Image and Video Database. 61--70. DOI:http://dx.doi.org/10.1109/CAIVD.1998.646034 Google ScholarDigital Library
- Cees G. M. Snoek, Koen E. A. van de Sande, Ork de Rooij, Bouke Huurnink, Jasper R. R. Uijlings, Michiel van Liempt, Miguel Bugalho, Isabel Trancoso, Fei Yan, Muhammad A. Tahir, and others. 2009. The mediamill TRECVID 2009 semantic video search engine. In TRECVID Workshop.Google Scholar
- Cees G. M. Snoek, Marcel Worring, Ork de Rooij, Koen E. A. van de Sande, Rong Yan, and Alexander G. Hauptmann. 2008. VideOlympics: Real-time evaluation of multimedia retrieval systems. IEEE MultiMedia 15, 1 (Jan. 2008), 86--91. DOI:http://dx.doi.org/10.1109/MMUL.2008.21 Google ScholarDigital Library
- Cees G. M. Snoek, Marcel Worring, Dennis C. Koelma, and Arnold W. M. Smeulders. 2007. A learned lexicon-driven paradigm for interactive video retrieval. IEEE Transactions on Multimedia 9, 2 (Feb. 2007), 280--292. DOI:http://dx.doi.org/10.1109/TMM.2006.886275 Google ScholarDigital Library
- Deqing Sun, Siegmar Roth, and Michael J. Black. 2010. Secrets of optical flow estimation and their principles. In Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2432--2439. DOI:http://dx.doi.org/10.1109/CVPR.2010.5539939Google Scholar
- Mario Taschwer. 2012. A key-frame-oriented video browser. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 655--657. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_68 Google ScholarDigital Library
- Ba Tu Truong and Svetha Venkatesh. 2007. Video abstraction: A systematic review and classification. ACM Transactions on Multimedia Computing Communications and Applications 3, 1, Article 3 (Feb. 2007). DOI:http://dx.doi.org/10.1145/1198302.1198305 Google ScholarDigital Library
- Carles Ventura, Manel Martos, Xavier Giró-i Nieto, Verónica Vilaplana, and Ferran Marqués. 2012. Hierarchical navigation and visual search for video keyframe retrieval. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 652--654. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_67 Google ScholarDigital Library
- Marie-luce Viaud, Olivier Buisson, Agnes Saulnier, and Clement Guenais. 2010. Video exploration: From multimedia content analysis to interactive visualization. In Proceedings of the International Conference on Multimedia (MM’10). ACM, New York, NY, 1311--1314. DOI:http://dx.doi.org/10.1145/1873951.1874209 Google ScholarDigital Library
- Marie-Luce Viaud, Jéröme Thièvre, Hervé Goëau, Agnes Saulnier, and Olivier Buisson. 2008. Interactive components for visual exploration of multimedia archives. In Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval (CIVR’08). ACM, New York, NY, 609--616. DOI:http://dx.doi.org/10.1145/1386352.1386440 Google ScholarDigital Library
- Stefanos Vrochidis, Anastasia Moumtzidou, Paul King, Anastasios Dimou, Vasileios Mezaris, and Ioannis Kompatsiaris. 2010. VERGE: A video interactive retrieval engine. In Proceedings of the 2010 International Workshop on Content-Based Multimedia Indexing (CBMI). 1--6. DOI:http://dx.doi.org/10.1109/CBMI.2010.5529884Google ScholarCross Ref
- Marcel Worring, Paul Sajda, Simone Santini, David A. Shamma, Alan F. Smeaton, and Qiang Yang. 2012. Where is the user in multimedia retrieval? IEEE MultiMedia 19, 4 (Oct. 2012), 6--10. DOI:http://dx.doi.org/10.1109/MMUL.2012.53 Google ScholarDigital Library
- Yue Wu, Tao Mei, Nenghai Yu, and Shipeng Li. 2012. Accelerometer-based single-handed video browsing on mobile devices: Design and user studies. In Proceedings of the 4th International Conference on Internet Multimedia Computing and Service (ICIMCS’12). ACM, New York, NY, 157--160. DOI:http://dx.doi.org/10.1145/2382336.2382381 Google ScholarDigital Library
- Qing Xu, Yu Liu, Xiu Li, Zhen Yang, Jie Wang, Mateu Sbert, and Riccardo Scopigno. 2014. Browsing and exploration of video sequences: A new scheme for key frame extraction and 3D visualization using entropy based Jensen divergence. Information Sciences 278 (2014), 736--756. DOI:http://dx.doi.org/10.1016/j.ins.2014.03.088Google ScholarCross Ref
- Qing Xu, Pengcheng Wang, Bin Long, M. Sbert, M. Feixas, and R. Scopigno. 2010. Selection and 3D visualization of video key frames. In 2010 IEEE International Conference on Systems Man and Cybernetics (SMC). 52--59. DOI:http://dx.doi.org/10.1109/ICSMC.2010.5642204Google Scholar
- Jin Yuan, Huanbo Luan, Dejun Hou, Han Zhang, Yan-Tao Zheng, Zheng-Jun Zha, and Tat-Seng Chua. 2012. Video browser showdown by NUS. In Advances in Multimedia Modeling, Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, Yiannis Andreopoulos, and Christian Breiteneder (Eds.). Lecture Notes in Computer Science, Vol. 7131. Springer, Berlin, 642--645. DOI:http://dx.doi.org/10.1007/978-3-642-27355-1_64 Google ScholarDigital Library
- Jin-Kai Zhang, Cui-Xia Ma, Yong-Jin Liu, Qiu-Fang Fu, and Xiao-Lan Fu. 2013a. Collaborative interaction for videos on mobile devices based on sketch gestures. Journal of Computer Science and Technology 28, 5 (2013), 810--817. DOI:http://dx.doi.org/10.1007/s11390-013-1379-4Google ScholarCross Ref
- Lei Zhang, Qian-Kun Xu, Lei-Zheng Nie, and Hua Huang. 2013b. VideoGraph: A non-linear video representation for efficient exploration. The Visual Computer 30, 10 (2013), 1123--1132. DOI:http://dx.doi.org/10.1007/s00371-013-0882-5 Google ScholarDigital Library
- Yuan Zhou and Takashi Yukawa. 2011. A touch-panel based user interface and utilization of user’s memories for known-item search (KIS) task in TRECVID 2011. In TRECVID’11.Google Scholar
Index Terms
- Video Interaction Tools: A Survey of Recent Work
Recommendations
Extending interaction for smart watches: enabling bimanual around device control
CHI EA '14: CHI '14 Extended Abstracts on Human Factors in Computing SystemsThe size of a smart watch limits the available interactive surface for the user. Most current smart watches use a combination of a touch screen and physical buttons. Unfortunately, a small touch screen's usability is limited when it can be easily ...
Integrating Point and Touch for Interaction with Digital Tabletop Displays
TractorBeam is a hybrid point-touch interaction technique for tabletop computer displays that seamlessly combines remote pointing and local touch. Results from studies investigating its use for target selection, docking, and puzzle tasks give some ...
Investigating the effectiveness of peephole interaction for smartwatches in a map navigation task
MobileHCI '14: Proceedings of the 16th international conference on Human-computer interaction with mobile devices & servicesWith the increasing availability of smartwatches the question of suited input modalities arises. While direct touch input comes at the cost of the fat-finger problem, we propose to use a dynamic peephole to explore larger content such as websites or ...
Comments