research-article

Understanding How to Administer Voice Surveys through Smart Speakers

Authors:
Jing Wei

University of Melbourne, Melbourne, VIC, Australia

University of Melbourne, Melbourne, VIC, Australia
View Profile

,
Weiwei Jiang

University of Melbourne, Melbourne, VIC, Australia

University of Melbourne, Melbourne, VIC, Australia
View Profile

,
Chaofan Wang

University of Melbourne, Melbourne, VIC, Australia

University of Melbourne, Melbourne, VIC, Australia
View Profile

,
Difeng Yu

University of Melbourne, Melbourne, VIC, Australia

University of Melbourne, Melbourne, VIC, Australia
View Profile

,
Jorge Goncalves

University of Melbourne, Melbourne, VIC, Australia

University of Melbourne, Melbourne, VIC, Australia
View Profile

,
Tilman Dingler

University of Melbourne, Melbourne, VIC, Australia

University of Melbourne, Melbourne, VIC, Australia
View Profile

,
Vassilis Kostakos

University of Melbourne, Melbourne, VIC, Australia

University of Melbourne, Melbourne, VIC, Australia
View Profile

Proceedings of the ACM on Human-Computer Interaction Volume 6 Issue CSCW2Article No.: 548pp 1–32https://doi.org/10.1145/3555606

Published:11 November 2022Publication History

Proceedings of the ACM on Human-Computer Interaction

Abstract

Smart speakers have become exceedingly popular and entered many people's homes due to their ability to engage users with natural conversations. Researchers have also looked into using smart speakers as an interface to collect self-reported health data through conversations. Responding to surveys prompted by smart speakers requires users to listen to questions and answer in voice without any visual stimuli. Compared to traditional web-based surveys, where users can see questions and answers visually, voice surveys may be more cognitively challenging. Therefore, to collect reliable survey data, it is important to understand what types of questions are suitable to be administered by smart speakers. We selected five common survey questionnaires and deployed them as voice surveys and web surveys in a within-subject study. Our 24 participants answered questions using voice and web questionnaires in one session. They then repeated the same study session after 1 week to provide a "retest'' response. Our results suggest that voice surveys have comparable reliability to web surveys. We find that, when using 5-point or 7-point scales, voice surveys take about twice as long as web surveys. Based on objective measurements, such as response agreement and test-retest reliability, and subjective evaluations of user experience, we recommend that researchers consider adopting the binary scale and 5-point numerical scales for voice surveys on smart speakers.

References

[n.d.]. WaveNet: A generative model for raw audio. https://deepmind.com/blog/article/wavenet-generative-modelraw-audioGoogle Scholar
Mike Allen. 2017. The SAGE encyclopedia of communication research methods. Sage Publications.Google Scholar
Duane F Alwin. 1992. Information transmission in the survey interview: Number of response categories and the reliability of attitude measurement. Sociological methodology (1992), 83--118.Google Scholar
Scott Barge and Hunter Gehlbach. 2012. Using the theory of satisficing to evaluate the quality of survey data. Research in Higher Education 53, 2 (2012), 182--200.Google ScholarCross Ref
Frank Bentley, Chris Luvogt, Max Silverman, Rushani Wirasinghe, Brooke White, and Danielle Lottridge. 2018. Understanding the long-term use of smart speaker assistants. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 2, 3 (2018), 1--24.Google ScholarDigital Library
John Brooke et al. 1996. SUS-A quick and dirty usability scale. Usability evaluation in industry 189, 194 (1996), 4--7.Google Scholar
S Tamer Cavusgil and Lisa A Elvey-Kirk. 1998. Mail survey response behavior: A conceptualization of motivating factors and an empirical study. European journal of marketing (1998).Google ScholarCross Ref
Irene Celino and Gloria Re Calegari. 2020. Submitting surveys via a conversational interface: an evaluation of user acceptance and approach effectiveness. International Journal of Human-Computer Studies 139 (2020), 102410.Google ScholarCross Ref
Narae Cha, Auk Kim, Cheul Young Park, Soowon Kang, Mingyu Park, Jae-Gil Lee, Sangsu Lee, and Uichin Lee. 2020. Hello there! is now a good time to talk? Opportune moments for proactive interactions with smart speakers. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 4, 3 (2020), 1--28.Google ScholarDigital Library
Dipanjan Chakraborty, Indrani Medhi, Edward Cutrell, and William Thies. 2013. Man versus machine: evaluating IVR versus a live operator for phone surveys in India. In Proceedings of the 3rd ACM Symposium on Computing for Development. Association for Computing Machinery, New York, NY, USA, 1--9.Google ScholarDigital Library
Ti-Chung Cheng, Tiffany Wenting Li, Yi-Hung Chou, Karrie Karahalios, and Hari Sundaram. 2021. " I can show what I really like." Eliciting Preferences via Quadratic Voting. Proceedings of the ACM on Human-Computer Interaction 5, CSCW1 (2021), 1--43.Google Scholar
Jane Chung, Michael Bleich, David C Wheeler, Jodi M Winship, Brooke McDowell, David Baker, and Pamela Parsons. 2021. Attitudes and Perceptions Toward Voice-Operated Smart Speakers Among Low-Income Senior Housing Residents: Comparison of Pre-and Post-Installation Surveys. Gerontology and Geriatric Medicine 7 (2021), 23337214211005869.Google ScholarCross Ref
Richard L Clayton and Debbie LS Winter. 1992. Speech data entry: results of a test of voice recognition for survey data collection. JOURNAL OF OFFICIAL STATISTICS-STOCKHOLM- 8 (1992), 377--377.Google Scholar
John Dawes. 2008. Do data characteristics change according to the number of scale points used? An experiment using 5-point, 7-point and 10-point scales. International journal of market research 50, 1 (2008), 61--104.Google ScholarCross Ref
Don C Des Jarlais, Denise Paone, Judith Milliken, Charles F Turner, Heather Miller, James Gribble, Qiuhu Shi, Holly Hagan, and Samuel R Friedman. 1999. Audio-computer interviewing to measure risk behaviour for HIV among injecting drug users: a quasi-randomised trial. The Lancet 353, 9165 (1999), 1657--1661.Google Scholar
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).Google Scholar
Ed Diener, Derrick Wirtz, Robert Biswas-Diener, William Tov, Chu Kim-Prieto, Dong-won Choi, and Shigehiro Oishi. 2009. New measures of well-being. In Assessing well-being. Springer, 247--266.Google Scholar
Don A Dillman and Leah Melani Christian. 2005. Survey mode as a source of instability in responses across surveys. Field methods 17, 1 (2005), 30--52.Google Scholar
Don A Dillman, Glenn Phelps, Robert Tortora, Karen Swift, Julie Kohrell, Jodi Berck, and Benjamin L Messer. 2009. Response rate and measurement differences in mixed-mode surveys using mail, telephone, interactive voice response (IVR) and the Internet. Social science research 38, 1 (2009), 1--18.Google Scholar
Tilman Dingler, Dominika Kwasnicka, Jing Wei, Enying Gong, and Brian Oldenburg. 2021. The Use and Promise of Conversational Agents in Digital Health. Yearbook of Medical Informatics 30, 01 (2021), 191--199.Google ScholarCross Ref
Radhika Garg and Subhasree Sengupta. 2020. He is just like me: a study of the long-term use of smart speakers by parents and children. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 4, 1 (2020), 1--24.Google ScholarDigital Library
Khalil G Ghanem, Heidi E Hutton, Jonathan M Zenilman, Rebecca Zimba, and Emily J Erbelding. 2005. Audio computer assisted self interview and face to face interview modes in assessing response bias among STD clinic patients. Sexually transmitted infections 81, 5 (2005), 421--425.Google Scholar
Moshe M Givon and Zur Shapira. 1984. Response to rating scales: A theoretical model and its application to the number of categories problem. Journal of Marketing Research 21, 4 (1984), 410--419.Google ScholarCross Ref
Katharina Graben, Bettina K Doering, Franziska Jeromin, and Antonia Barke. 2020. Problematic mobile phone use: Validity and reliability of the Problematic Use of Mobile Phone (PUMP) Scale in a German sample. Addictive behaviors reports 12 (2020), 100297.Google Scholar
Danula Hettiachchi, Zhanna Sarsenbayeva, Fraser Allison, Niels van Berkel, Tilman Dingler, Gabriele Marini, Vassilis Kostakos, and Jorge Goncalves. 2020. "Hi! I Am the Crowd Tasker" Crowdsourcing through Digital Voice Assistants. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI '20). Association for Computing Machinery, New York, NY, USA, 1--14. https://doi.org/10.1145/3313831.3376320Google ScholarDigital Library
Allyson L Holbrook, Melanie C Green, and Jon A Krosnick. 2003. Telephone versus face-to-face interviewing of national probability samples with long questionnaires: Comparisons of respondent satisficing and social desirability response bias. Public opinion quarterly 67, 1 (2003), 79--125.Google Scholar
Azra Ismail and Neha Kumar. 2018. Engaging solidarity in data collection practices for community health. Proceedings of the ACM on Human-Computer Interaction 2, CSCW (2018), 1--24.Google ScholarDigital Library
Jiepu Jiang, Wei Jeng, and Daqing He. 2013. How do users respond to voice input errors? Lexical and phonetic query reformulation in voice search. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. Association for Computing Machinery, New York, NY, USA, 143--152.Google Scholar
Aman Khullar, Priyadarshi Hitesh, Shoaib Rahman, Deepak Kumar, Rachit Pandey, Praveen Kumar, Rajeshwari Tripathi, Prince Prince, Ankit Akash Jha, Himanshu Himanshu, et al. 2021. Costs and Benefits of Conducting Voice-based Surveys Versus Keypress-based Surveys on Interactive Voice Response Systems. In ACM SIGCAS Conference on Computing and Sustainable Societies. Association for Computing Machinery, New York, NY, USA, 288--298.Google Scholar
Soomin Kim, Joonhwan Lee, and Gahgene Gweon. 2019. Comparing data from chatbot and web surveys: Effects of platform and conversational style on survey response quality. In Proceedings of the 2019 CHI conference on human factors in computing systems. Association for Computing Machinery, New York, NY, USA, 1--12.Google ScholarDigital Library
Bret Kinsella. 2019. Loup Ventures says 75% of U.S. households will have smart speakers by 2025, Google to surpass Amazon in market share. https://voicebot.ai/2019/06/18/loup-ventures-says-75-of-u-s-households-will-have-smartspeakers-by-2025-google-to-surpass-amazon-in-market-share/Google Scholar
Rafal Kocielnik, Daniel Avrahami, Jennifer Marlow, Di Lu, and Gary Hsieh. 2018. Designing for workplace reflection: a chat and voice-based conversational agent. In Proceedings of the 2018 designing interactive systems conference. Association for Computing Machinery, New York, NY, USA, 881--894.Google ScholarDigital Library
Allison Koenecke, Andrew Nam, Emily Lake, Joe Nudell, Minnie Quartey, Zion Mengesha, Connor Toups, John R Rickford, Dan Jurafsky, and Sharad Goel. 2020. Racial disparities in automated speech recognition. Proceedings of the National Academy of Sciences 117, 14 (2020), 7684--7689.Google ScholarCross Ref
Jon A Krosnick. 1991. Response strategies for coping with the cognitive demands of attitude measures in surveys. Applied cognitive psychology 5, 3 (1991), 213--236.Google Scholar
Jon A Krosnick. 2018. Questionnaire design. In The Palgrave handbook of survey research. Springer, 439--455.Google Scholar
Jon A Krosnick and Matthew K Berent. 1993. Comparisons of party identification and policy preferences: The impact of survey question format. American Journal of Political Science (1993), 941--964.Google Scholar
Jon A Krosnick, Sowmya Narayan, and Wendy R Smith. 1996. Satisficing in surveys: Initial evidence. New directions for evaluation 1996, 70 (1996), 29--44.Google Scholar
J Richard Landis and Gary G Koch. 1977. The measurement of observer agreement for categorical data. biometrics (1977), 159--174.Google Scholar
Kwan Min Lee and Jennifer Lai. 2005. Speech versus touch: A comparative study of the use of speech and DTMF keypad for navigation. International Journal of Human-Computer Interaction 19, 3 (2005), 343--360.Google ScholarCross Ref
Adam Lerer, Molly Ward, and Saman Amarasinghe. 2010. Evaluation of IVR data collection UIs for untrained rural users. In Proceedings of the first ACM symposium on computing for development. Association for Computing Machinery, New York, NY, USA, 1--8.Google ScholarDigital Library
Rensis Likert. 1932. A technique for the measurement of attitudes. Archives of psychology (1932).Google Scholar
Yuhan Luo, Bongshin Lee, and Eun Kyoung Choe. 2020. TandemTrack: Shaping Consistent Exercise Experience by Complementing a Mobile App with a Smart Speaker. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI '20). Association for Computing Machinery, New York, NY, USA, 1--13. https://doi.org/10.1145/3313831.3376616Google ScholarDigital Library
Kelly L'Engle, Eunice Sefa, Edward Akolgo Adimazoya, Emmanuel Yartey, Rachel Lenzi, Cindy Tarpo, Nii Lante Heward-Mills, Katherine Lew, and Yvonne Ampeh. 2018. Survey research with a random digit dial national mobile phone sample in Ghana: methods and sample quality. PloS one 13, 1 (2018), e0190902.Google Scholar
Raju Maharjan, Per Bækgaard, and Jakob E Bardram. 2019. " Hear me out" smart speaker based conversational agent to monitor symptoms in mental health. In Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers. Association for Computing Machinery, New York, NY, USA, 929--933.Google Scholar
Raju Maharjan, Darius Adam Rohani, Per Bækgaard, Jakob Bardram, and Kevin Doherty. 2021. Can we talk? Design Implications for the Questionnaire-Driven Self-Report of Health and Wellbeing via Conversational Agent. In CUI 2021--3rd Conference on Conversational User Interfaces. Association for Computing Machinery, New York, NY, USA, 1--11.Google ScholarDigital Library
Gloria Mark, Shamsi Iqbal, Mary Czerwinski, and Paul Johns. 2014. Capturing the mood: facebook and face-to-face encounters in the workplace. In Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing. Association for Computing Machinery, New York, NY, USA, 1082--1094.Google ScholarDigital Library
Gloria Mark, Shamsi Iqbal, Mary Czerwinski, and Paul Johns. 2015. Focused, aroused, but so distractible: Temporal perspectives on multitasking and communications. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing. Association for Computing Machinery, New York, NY, USA, 903--916.Google ScholarDigital Library
David Markland and Vannessa Tobin. 2004. A modification to the behavioural regulation in exercise questionnaire to include an assessment of amotivation. Journal of Sport and Exercise Psychology 26, 2 (2004), 191--196.Google ScholarCross Ref
John A McCarty and Larry J Shrum. 2000. The measurement of personal values in survey research: A test of alternative rating procedures. Public Opinion Quarterly 64, 3 (2000), 271--298.Google ScholarCross Ref
Lisa J Merlo, Amanda M Stone, and Alex Bibbey. 2013. Measuring problematic mobile phone use: development and preliminary psychometric properties of the PUMP scale. Journal of addiction 2013 (2013).Google Scholar
Elizabeth T Miller, Dan J Neal, Lisa J Roberts, John S Boer, Sally O Cresskr, Jane Metrik, and G Alan Marlatt. 2009. Test-retest reliability of alcohol measures: is there a difference between internet-based assessment and traditional methods? (2009).Google Scholar
Chelsea Myers, Anushay Furqan, Jessica Nebolsky, Karina Caro, and Jichen Zhu. 2018. Patterns for how users overcome obstacles in voice user interfaces. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 1--7.Google ScholarDigital Library
Charles Egerton Osgood, George J Suci, and Percy H Tannenbaum. 1957. The measurement of meaning. Number 47. University of Illinois press.Google Scholar
Debajyoti Pal, Chonlameth Arpnikanondt, Suree Funilkul, and Vijayakumar Varadarajan. 2019. User experience with smart voice assistants: the accent perspective. In 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT). IEEE, 1--6.Google ScholarCross Ref
Josh Pasek and Jon A Krosnick. 2010. Optimizing survey questionnaire design in political science. In The Oxford handbook of American elections and political behavior. Oxford University Press.Google Scholar
Neil Patel, Sheetal Agarwal, Nitendra Rajput, Amit Nanavati, Paresh Dave, and Tapan S Parikh. 2009. A comparative study of speech and dialed input voice interfaces in rural India. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 51--54.Google ScholarDigital Library
Theresa E Perlis, Don C Des Jarlais, Samuel R Friedman, Kamyar Arasteh, and Charles F Turner. 2004. Audiocomputerized self-interviewing versus face-to-face interviewing for research data collection at drug abuse treatment programs. Addiction 99, 7 (2004), 885--896.Google ScholarCross Ref
Martin Porcheron, Joel E Fischer, Stuart Reeves, and Sarah Sharples. 2018. Voice interfaces in everyday life. In proceedings of the 2018 CHI conference on human factors in computing systems. Association for Computing Machinery, New York, NY, USA, 1--12.Google ScholarDigital Library
Alisha Pradhan, Leah Findlater, and Amanda Lazar. 2019. " Phantom Friend" or" Just a Box with Information" Personification and Ontological Categorization of Smart Speaker-based Voice Assistants by Older Adults. Proceedings of the ACM on Human-Computer Interaction 3, CSCW (2019), 1--21.Google Scholar
Aung Pyae and Tapani N Joelsson. 2018. Investigating the usability and user experiences of voice user interface: a case of Google home smart speaker. In Proceedings of the 20th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct. Association for Computing Machinery, New York, NY, USA, 127--131.Google ScholarDigital Library
Ling Qiu, Bethany Kanski, Shawna Doerksen, Renate Winkels, Kathryn H Schmitz, and Saeed Abdullah. 2021. Nurse AMIE: Using Smart Speakers to Provide Supportive Care Intervention for Women with Metastatic Breast Cancer. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 1--7.Google ScholarDigital Library
Juan C Quiroz, Tristan Bongolan, and Kiran Ijaz. 2020. Alexa depression and anxiety self-tests: a preliminary analysis of user experience and trust. In Adjunct Proceedings of the 2020 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2020 ACM International Symposium on Wearable Computers. Association for Computing Machinery, New York, NY, USA, 494--496.Google Scholar
Shan M Randhawa, Tallal Ahmad, Jay Chen, and Agha Ali Raza. 2021. Karamad: A Voice-based Crowdsourcing Platform for Underserved Populations. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 1--15.Google ScholarDigital Library
Melanie Revilla, Mick P Couper, Oriol J Bosch, and Marc Asensio. 2020. Testing the use of voice input in a smartphone web survey. Social Science Computer Review 38, 2 (2020), 207--224.Google ScholarDigital Library
Jungwook Rhim, Minji Kwak, Yeaeun Gong, and Gahgene Gweon. 2022. Application of humanization to survey chatbots: Change in chatbot perception, interaction experience, and survey data quality. Computers in Human Behavior 126 (2022), 107034.Google ScholarDigital Library
George Robinson and Clive Morley. 2006. Call centre management: responsibilities and performance. International Journal of Service Industry Management (2006).Google ScholarCross Ref
John P Robinson, Phillip R Shaver, and Lawrence S Wrightsman. 1999. Measures of political attitudes. Academic Press.Google Scholar
Steven J Rosenstone, John Mark Hansen, and Donald R Kinder. 1986. Measuring change in personal economic well-being. Public Opinion Quarterly 50, 2 (1986), 176--192.Google ScholarCross Ref
Mariah L. Schrum, Michael Johnson, Muyleng Ghuy, and Matthew C. Gombolay. 2020. Four Years in Review: Statistical Practices of Likert Scales in Human-Robot Interaction Studies. In Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction (Cambridge, United Kingdom) (HRI '20). Association for Computing Machinery, New York, NY, USA, 43--52. https://doi.org/10.1145/3371382.3380739Google ScholarDigital Library
Norbert Schwarz, Fritz Strack, Hans-J Hippler, and George Bishop. 1991. The impact of administration mode on response effects in survey measurement. Applied Cognitive Psychology 5, 3 (1991), 193--212.Google ScholarCross Ref
Jahanzeb Sherwani, Nosheen Ali, Sarwat Mirza, Anjum Fatma, Yousuf Memon, Mehtab Karim, Rahul Tongia, and Roni Rosenfeld. 2007. Healthline: Speech-based access to health information by low-literate users. In 2007 International Conference on Information and Communication Technologies and Development. IEEE, 1--9.Google ScholarCross Ref
Eunjung Shin, Timothy P Johnson, and Kumar Rao. 2012. Survey mode effects on data quality: Comparison of web and mail modes in a US national panel survey. Social Science Computer Review 30, 2 (2012), 212--228.Google ScholarDigital Library
Alicia D Simmons and Lawrence D Bobo. 2015. Can non-full-probability internet surveys yield useful data? A comparison with full-probability face-to-face surveys in the domain of race and social inequality attitudes. Sociological Methodology 45, 1 (2015), 357--387.Google ScholarCross Ref
Ulla Sonn, Kristina Törnquist, and Elisabeth Svensson. 1999. The ADL taxonomy-from individual categorical data to ordinal categorical data. Scandinavian Journal of Occupational Therapy 6, 1 (jan 1999), 11--20. https://doi.org/10.1080/ 110381299443807Google Scholar
Venkat Srinivasan and Amiya K Basu. 1989. The metric quality of ordered categorical data. Marketing Science 8, 3 (1989), 205--230.Google ScholarDigital Library
Ayushi Srivastava, Shivani Kapania, Anupriya Tuli, and Pushpendra Singh. 2021. Actionable UI Design Guidelines for Smartphone Applications Inclusive of Low-Literate Users. Proceedings of the ACM on Human-Computer Interaction 5, CSCW1 (2021), 1--30.Google ScholarDigital Library
Jeff Szymanski and William O'Donohue. 1995. Fear of spiders questionnaire. Journal of behavior therapy and experimental psychiatry 26, 1 (1995), 31--34.Google ScholarCross Ref
Roger Tourangeau and Kenneth A Rasinski. 1988. Cognitive processes underlying context effects in attitude measurement. Psychological bulletin 103, 3 (1988), 299.Google Scholar
Priyamvada Tripathi and Winslow Burleson. 2012. Predicting creativity in the wild: Experience sample and sociometric modeling of teams. In Proceedings of the ACM 2012 conference on computer supported cooperative work. Association for Computing Machinery, New York, NY, USA, 1203--1212.Google ScholarDigital Library
Charles F Turner, Leighton Ku, Susan M Rogers, Laura D Lindberg, Joseph H Pleck, and Freya L Sonenstein. 1998. Adolescent sexual behavior, drug use, and violence: increased reporting with computer survey technology. Science 280, 5365 (1998), 867--873.Google Scholar
Niels Van Berkel, Denzil Ferreira, and Vassilis Kostakos. 2017. The experience sampling method on mobile devices. ACM Computing Surveys (CSUR) 50, 6 (2017), 1--40.Google ScholarDigital Library
Morgan Vigil-Hayes, Ann Futterman Collier, Shelby Hagemann, Giovanni Castillo, Keller Mikkelson, Joshua Dingman, Andrew Muñoz, Jade Luther, and Alexandra McLaughlin. 2021. Integrating cultural relevance into a behavioral mHealth intervention for Native American youth. Proceedings of the ACM on human-computer interaction 5, CSCW1 (2021), 1--29.Google ScholarDigital Library
Jinping Wang, Hyun Yang, Ruosi Shao, Saeed Abdullah, and S Shyam Sundar. 2020. Alexa as coach: Leveraging smart speakers to build social agents that reduce public speaking anxiety. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 1--13.Google ScholarDigital Library
Jing Wei, Tilman Dingler, and Vassilis Kostakos. 2021. Understanding User Perceptions of Proactive Smart Speakers. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 5, 4 (2021), 1--28.Google ScholarDigital Library
Jing Wei, Benjamin Tag, Johanne R Trippas, Tilman Dingler, and Vassilis Kostakos. 2022. What Could Possibly Go Wrong When Interacting with Proactive Smart Speakers? A Case Study Using an ESM Application. In CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 1--15.Google ScholarDigital Library
Philip M Wilson, Wendy M Rodgers, and Shawn N Fraser. 2002. Examining the psychometric properties of the behavioral regulation in exercise questionnaire. Measurement in physical education and exercise science 6, 1 (2002), 1--21.Google Scholar
PhilipMWilson,WendyMRodgers, Christina C Loitz, and Giulia Scime. 2006. ?It's Who I Am.. . Really!'The importance of integrated regulation in exercise contexts 1. Journal of Applied Biobehavioral Research 11, 2 (2006), 79--104.Google Scholar
Ziang Xiao, Michelle X Zhou, Q Vera Liao, Gloria Mark, Changyan Chi, Wenxi Chen, and Huahai Yang. 2020. Tell me about yourself: Using an AI-powered chatbot to conduct conversational surveys with open-ended questions. ACM Transactions on Computer-Human Interaction (TOCHI) 27, 3 (2020), 1--37.Google ScholarDigital Library
Yukang Yan, Chun Yu, Wengrui Zheng, Ruining Tang, Xuhai Xu, and Yuanchun Shi. 2020. FrownOnError: Interrupting Responses from Smart Speakers by Facial Expressions. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, 1--14.Google ScholarDigital Library
Cong Ye, Jenna Fulton, and Roger Tourangeau. 2011. More positive or more extreme? A meta-analysis of mode differences in response choice. Public Opinion Quarterly 75, 2 (2011), 349--365.Google ScholarCross Ref

Index Terms

Understanding How to Administer Voice Surveys through Smart Speakers
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction devices
      1. Sound-based input / output
    2. Interaction paradigms
      1. Natural language interfaces
  2. Interaction design
    1. Empirical studies in interaction design

Recommendations

Understanding User Perceptions of Proactive Smart Speakers

Voice assistants, such as Amazon's Alexa and Google Home, increasingly find their way into consumer homes. Their functionality, however, is currently limited to being passive answer machines rather than proactively engaging users in conversations. ...
Read More
Intelligibility Issues Faced by Smart Speaker Enthusiasts in Understanding What Their Devices Do and Why
OzCHI '20: Proceedings of the 32nd Australian Conference on Human-Computer Interaction

Studies of smart speakers highlight issues people face with understanding why unexpected behaviour occurs and with recovering from mistakes due to uninformative responses. Yet, our understanding of such intelligibility issues in smart speakers — ...
Read More
Machine Body Language: Expressing a Smart Speaker’s Activity with Intelligible Physical Motion
DIS '21: Proceedings of the 2021 ACM Designing Interactive Systems Conference

People’s physical movement and body language implicitly convey what they think and feel, are doing or are about to do. In contrast, current smart speakers miss out on this richness of body language, primarily relying on voice commands only. We present ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
Proceedings of the ACM on Human-Computer Interaction Volume 6, Issue CSCW2
CSCW
November 2022
8205 pages
EISSN:2573-0142
DOI:10.1145/3571154
Editor:
Jeff Nichols
Google
Issue’s Table of Contents
Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 November 2022
Published in pacmhci Volume 6, Issue CSCW2

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
chatbots
conversational user interfaces
smart speakers
survey methodology
voice user interfaces
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 11
  Total Citations
  View Citations
- 207
  Total Downloads
- Downloads (Last 12 months)93
- Downloads (Last 6 weeks)11
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Understanding How to Administer Voice Surveys through Smart Speakers

Proceedings of the ACM on Human-Computer Interaction

Abstract

References

Cited By

Index Terms

Recommendations

Understanding User Perceptions of Proactive Smart Speakers

Intelligibility Issues Faced by Smart Speaker Enthusiasts in Understanding What Their Devices Do and Why

Machine Body Language: Expressing a Smart Speaker’s Activity with Intelligible Physical Motion

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Understanding How to Administer Voice Surveys through Smart Speakers

Proceedings of the ACM on Human-Computer Interaction

Abstract

References

Cited By

Index Terms

Recommendations

Understanding User Perceptions of Proactive Smart Speakers

Intelligibility Issues Faced by Smart Speaker Enthusiasts in Understanding What Their Devices Do and Why

Machine Body Language: Expressing a Smart Speaker’s Activity with Intelligible Physical Motion

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media