Special Section on Touching the 3rd Dimension
A survey of 3D object selection techniques for virtual environments

https://doi.org/10.1016/j.cag.2012.12.003

Abstract

Computer graphics applications controlled through natural gestures are gaining increasing popularity these days due to recent developments in low-cost tracking systems and gesture recognition technologies. Although interaction techniques through natural gestures have already demonstrated their benefits in manipulation, navigation and avatar-control tasks, effective selection with pointing gestures remains an open problem. In this paper we survey the state-of-the-art in 3D object selection techniques. We review important findings in human control models, analyze major factors influencing selection performance, and classify existing techniques according to a number of criteria. Unlike other components of the application's user interface, pointing techniques need a close coupling with the rendering pipeline, introducing new elements to be drawn, and potentially modifying the object layout and the way the scene is rendered. Conversely, selection performance is affected by rendering issues such as visual feedback, depth perception, and occlusion management. We thus review existing literature paying special attention to those aspects in the boundary between computer graphics and human–computer interaction.

Highlights

  • We review major 3D object selection techniques for virtual environments.
  • Important findings in human control models for 3D object selection are reviewed.
  • We analyze major factors influencing selection performance.

Introduction

In recent decades we have witnessed enormous improvements in spatial input devices and motion tracking systems. These advances have motivated the development of a plethora of interaction techniques relying on six-degrees-of-freedom (DoF) input devices and user gestures. Interaction through natural gestures has gained further popularity since the recent mass commercialization of low-cost solutions for full-body tracking, which is enabling the deployment of natural interfaces outside virtual reality labs. We will use the term 3D interaction to refer to interaction tasks requiring users to make gestures in free (unconstrained) 3D space. These gestures typically involve one or both hands, and might also involve the user's head and other parts of the body.

The design of appropriate 3D interaction techniques for virtual environments (VEs) is a challenging problem [19], [51]. On the positive side, interacting in free space with natural gestures opens up a world of possibilities for exploiting the richness and expressiveness of the interaction, allowing users to control more DoFs simultaneously and to exploit well-known real-world actions. On the negative side, 3D interaction is more physically demanding and might hinder user tasks by increasing the required dexterity. Compare, for example, the act of selecting an object with a mouse pointer to that of grasping a 3D object in free space. Mouse movement involves small, fast muscles, whereas grasping often requires a complex arm movement involving larger and slower muscles [23], [48]. Furthermore, current immersive VEs, even the most sophisticated ones, fail to provide the same level of cues for understanding the environment as the real world, and do not faithfully reproduce its physical constraints [74]. For this reason, although humans are used to performing 3D interaction gestures in the real world, users of immersive VEs often encounter difficulties in understanding 3D spatial relationships and controlling multiple DoFs simultaneously.

Object selection is one of the fundamental tasks in 3D user interfaces [19] and the initial task for most common user interactions in a VE. Manipulation tasks often depend on (and are preceded by) selection tasks. As a consequence, poorly designed selection techniques often have a significant negative impact on overall user performance. In this survey, we review major 3D interaction techniques intended for 3D object selection tasks. We do not consider indirect selection techniques, e.g. selecting from a menu or performing semantic queries. A 3D object selection technique requires the user to gesture in 3D space, e.g. grabbing an object or pointing at something (see Fig. 1). Two main 3D selection metaphors can be identified: virtual hand [78] and virtual pointing [63], [54]. In the early days, virtual hand techniques were more popular, as they map virtual tasks directly onto their real-world counterparts, resulting in more natural interaction. It has since been shown that overcoming the physical constraints of the real world provides substantial benefits, e.g. letting the user select objects out of reach by enlarging the user's virtual arm [75], or using virtual pointing techniques such as raycasting [63]. In fact, raycasting is one of the most popular techniques for 3D object selection tasks [16]. A number of user studies in the literature have found that virtual pointing techniques often result in better selection effectiveness than competing 3D selection metaphors [19]. Unlike classical virtual hand techniques, virtual pointing techniques allow the user to select objects beyond their area of reach and require less physical movement.
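At its core, the raycasting metaphor reduces selection to an intersection query: a ray is shot from the user's hand along the pointing direction, and the closest intersected object is selected. The sketch below is not taken from any particular system surveyed here; it assumes spherical object proxies and a unit-length pointing direction.

```python
import math

def ray_sphere_hit(origin, direction, center, radius):
    """Distance along the ray to the nearest intersection, or None.
    The direction is assumed to be unit-length."""
    # Solve |origin + t*direction - center|^2 = radius^2 for t >= 0.
    oc = [o - c for o, c in zip(origin, center)]
    b = 2.0 * sum(d * e for d, e in zip(direction, oc))
    c = sum(e * e for e in oc) - radius * radius
    disc = b * b - 4.0 * c
    if disc < 0:
        return None
    t = (-b - math.sqrt(disc)) / 2.0
    return t if t >= 0 else None

def raycast_select(origin, direction, objects):
    """Pick the object whose intersection lies closest to the ray origin."""
    best, best_t = None, math.inf
    for name, center, radius in objects:
        t = ray_sphere_hit(origin, direction, center, radius)
        if t is not None and t < best_t:
            best, best_t = name, t
    return best

scene = [("near", (0.0, 0.0, 2.0), 0.5), ("far", (0.0, 0.0, 6.0), 0.5)]
print(raycast_select((0.0, 0.0, 0.0), (0.0, 0.0, 1.0), scene))  # -> near
```

In a real application the intersection test would run against the scene's actual geometry or bounding volumes, typically through the rendering engine's picking facilities rather than hand-rolled sphere tests.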

Selection through virtual pointing, though, is not free from difficulties. The selection of small or distant objects through virtual pointing remains a difficult task. Some techniques address the selection of small objects by increasing the size of the selection tool [36], [73], at the expense of requiring disambiguation mechanisms to guess which object the user aims to select [30]. Noise from tracking devices and the fact that the interaction takes place in free space, with no physical support for the hands [55], further hinder the accurate selection of small targets [43]. The user also has to keep the tool orientation steady until the selection confirmation is triggered, for example by a button press. Such a confirmation action is likely to produce a change in the tool orientation, nicknamed the Heisenberg effect [20], potentially causing a wrong selection. Occlusion is another major handicap for accomplishing spatial tasks [33]. Most interaction techniques for 3D selection and manipulation require the involved objects to be visible. A common solution for selecting occluded objects is to navigate to an appropriate location so that the targets become unoccluded. However, this navigate-to-select approach is impractical for selection-intensive applications. Therefore, occlusion management techniques are often essential for helping users discover and access potential targets.
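Enlarged selection tools of the kind mentioned above can be illustrated with a cone cast from the hand: every target inside the cone's aperture is a candidate, and a ranking heuristic disambiguates among them. The angular-distance heuristic and the 5-degree aperture below are illustrative assumptions, not the behavior of any specific published technique.

```python
import math

def angle_to_ray(origin, direction, point):
    """Angle (radians) between the pointing direction (unit-length)
    and the vector from the origin to a target point."""
    v = [p - o for p, o in zip(point, origin)]
    norm = math.sqrt(sum(c * c for c in v))
    dot = sum(d * c for d, c in zip(direction, v))
    return math.acos(max(-1.0, min(1.0, dot / norm)))

def cone_select(origin, direction, targets, aperture_deg=5.0):
    """Collect every target inside the selection cone and disambiguate
    by angular distance: the target closest to the ray axis wins."""
    limit = math.radians(aperture_deg)
    angles = [(angle_to_ray(origin, direction, pos), name)
              for name, pos in targets]
    inside = [(a, n) for a, n in angles if a <= limit]
    return min(inside)[1] if inside else None

targets = [("a", (0.05, 0.0, 2.0)), ("b", (0.01, 0.0, 2.0))]
print(cone_select((0.0, 0.0, 0.0), (0.0, 0.0, 1.0), targets))  # -> b
```

Techniques such as IntenSelect replace this one-shot ranking with a dynamic score accumulated over time, which makes the selection more stable under tracker noise.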

A number of approaches have been proposed to improve user performance in terms of task completion times and error counts [15]. A common strategy is to apply human control models such as the optimized initial impulse model [62] and Fitts' Law [34], [35]. While the optimized initial impulse model describes the accuracy a user can achieve given the movement required to perform an action, Fitts' Law estimates the time required to acquire a target. However, as users are bounded by their motor skills, there is a natural trade-off between speed and accuracy: in a typical scenario, high accuracy entails long task completion times, and vice versa.
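Fitts' Law models acquisition time as a linear function of an index of difficulty that grows with the distance to the target (amplitude A) and shrinks with its width W. The sketch below uses the common Shannon formulation, MT = a + b * log2(A/W + 1); the coefficients a and b are device- and user-dependent regression constants, and the values used here are purely illustrative.

```python
import math

def fitts_movement_time(amplitude, width, a=0.1, b=0.15):
    """Predicted movement time (seconds) under Fitts' Law, Shannon
    formulation: MT = a + b * log2(A/W + 1). The coefficients a and b
    must be fitted per device and user; these defaults are illustrative."""
    index_of_difficulty = math.log2(amplitude / width + 1.0)  # bits
    return a + b * index_of_difficulty

# Doubling the distance or halving the target width raises the index of
# difficulty, and with it the predicted acquisition time.
print(round(fitts_movement_time(0.64, 0.04), 3))  # -> 0.713
```

This is why the techniques above that enlarge either the target or the selection tool help: both effectively increase W, lowering the index of difficulty.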

In real-world usage of 3D interfaces, users' subjective impressions of an interaction technique can play a larger role than speed alone. The inability to select objects precisely may prove overly annoying and thus frustrate users. A performance increase might not be desirable if it is achieved at the expense of increasing the cognitive load of the task, or through techniques requiring extensive training.

The rest of this paper is organized as follows. Section 2 reviews existing human pointing models. In Section 3 we review major techniques for 3D object selection and extend previously proposed classifications [18], [76], [29] with a number of additional criteria to further elucidate the potential benefits and drawbacks of existing selection techniques. A comprehensive summary of the reviewed techniques is given in Table 1. Section 4 analyzes major factors influencing selection performance and proposes some usability guidelines. Finally, Section 5 provides some concluding remarks and future research directions.

Section snippets

Human pointing models

In order to point to (acquire) an object (the target), the user is required to perform a set of gestures (movements) to position the selection tool (e.g. his finger) over it. For each movement, the final position of the selection tool (endpoint) determines whether the acquisition is accomplished (the endpoint is inside the target) or not (the endpoint is outside the target). Once the target is acquired, the user has to trigger some selection mechanism to confirm the acquisition (e.g. pressing a button).
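The acquisition criterion just described (the movement endpoint must land inside the target) is easy to state in code. Under the optimized initial impulse model, a fast ballistic movement is followed by slower corrective submovements until an endpoint falls within the target. The submovement trail below is fabricated for illustration, and the target is modeled as a sphere.

```python
import math

def acquired(endpoint, target_center, target_radius):
    """An acquisition succeeds when the tool endpoint lies inside the
    target, here modeled as a sphere of the given radius."""
    return math.dist(endpoint, target_center) <= target_radius

# A ballistic movement followed by corrective submovements, homing in on
# a target of radius 0.02 centered at the origin (coordinates in meters):
trail = [(0.30, 0.00, 0.0), (0.05, 0.01, 0.0), (0.01, 0.005, 0.0)]
print([acquired(p, (0.0, 0.0, 0.0), 0.02) for p in trail])
# -> [False, False, True]
```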

Classification of selection techniques

A number of taxonomies have been proposed to classify existing 3D selection techniques. In Bowman et al. [18] classification, interaction techniques are decomposed into subtasks and classified according to them (see Fig. 3). Following [18], a selection technique has to provide means to indicate an object (object indication), a mechanism to confirm its selection (confirmation of selection) and visual, haptic or audio feedback to guide the user during the selection task (feedback). One limitation

Factors influencing performance

A number of usability guidelines exist for 2D user interfaces; in general, however, they are not directly applicable to 3D user interfaces, which are significantly more difficult to design, implement and use than their 2D counterparts. 3D user interfaces build on real-world characteristics such as naive physics, body awareness, environmental awareness, social awareness and social skills [46].

There are a few works explicitly focusing on usability guidelines for 3D user interfaces, being the

Conclusions and future outlook

The act of pointing to graphical elements is one of the fundamental tasks in human–computer interaction. Although 3D interaction techniques for target selection have been used for many years, they still exhibit major limitations regarding effective, accurate selection of targets in real-world applications. Some of these limitations are concerned with visual feedback issues (occlusion, visibility mismatch, depth perception in stereoscopic displays) and the inherent features of the human motor system.

References (108)

  • Argelaguet F, Andujar C. Improving 3D selection in immersive environments through expanding targets. In: SG'08:...
  • F. Argelaguet et al.

    Efficient 3D pointing selection in cluttered virtual environments

    IEEE Comput Graph Appl

    (2009)
  • Argelaguet F, Andujar C. Visual feedback techniques for virtual pointing on stereoscopic displays. In: Proceedings of...
  • Argelaguet F, Andujar C, Trueba R. Overcoming eye-hand visibility mismatch in 3D pointing selection. In: VRST '08:...
  • Argelaguet F, Kunert A, Kulik A, Froehlich B. Improving co-located collaboration with show-through techniques. In: IEEE...
  • Bederson BB. Fisheye menus. In: UIST '00: proceedings of the 13th annual ACM symposium on user interface software and...
  • Boeck JD, Weyer TD, Raymaekers C, Coninx K. Using the non-dominant hand for selection in 3D. In: IEEE symposium on 3D...
  • Bolt RA. “Put-that-there”: voice and gesture at the graphics interface. In: SIGGRAPH '80: proceedings of the seventh...
  • Bowman DA, Badillo B, Manek D. Evaluating the need for display-specific and device-specific 3D interaction techniques....
  • D.A. Bowman et al.

    A survey of usability evaluation in virtual environments: classification and comparison of methods

    Presence: Teleop Virt Environ

    (2002)
  • Bowman DA, Hodges LF. An evaluation of techniques for grabbing and manipulating remote objects in immersive virtual...
  • D.A. Bowman et al.

    The virtual venue: user-computer interaction in information-rich virtual environments

    Presence: Teleop Virt Environ

    (1998)
  • Bowman DA, Johnson DB, Hodges LF. Testbed evaluation of virtual environment interaction techniques. In: VRST '99:...
  • D.A. Bowman et al.

    3D user interfaces: theory and practice

    (2004)
  • Bowman DA, Wingrave CA, Campbell J. Using pinch gloves for both natural and abstract interaction techniques in virtual...
  • S. Brewster

    Multimodal feedback for the acquisition of small targets

    Ergonomics

    (2005)
  • M. Burns et al.

    Adaptive cutaways for comprehensible rendering of polygonal scenes

    ACM Trans Graph

    (2008)
  • S.K. Card et al.

    A morphological analysis of the design space of input devices

    ACM Trans Inf Syst

    (1991)
  • J. Cashion et al.

    Dense and dynamic 3D selection for game-based virtual environments

    IEEE Trans Visualization Comput Graph

    (2012)
  • Casiez G, Vogel D, Balakrishnan R, Cockburn A. The impact of control–display gain on user performance in pointing...
  • Cockburn A, Brock P. Human on-line response to visual and motor target expansion. In: GI '06: proceedings of graphics...
  • Cournia N, Smith JD, Duchowski AT. Gaze vs hand-based pointing in virtual environments. In: CHI '03: CHI '03 extended...
  • Dachselt R, Hübner A. A survey and taxonomy of 3D menu techniques. In: EGVE 06: proceedings of the 12th eurographics...
  • Dang N-T. A survey and classification of 3D pointing techniques. In: 2007 IEEE international conference on research,...
  • de Haan G, Koutek M, Post FH. IntenSelect: using dynamic object rating for assisting 3D object selection; 2005. p....
  • Elmqvist N. BalloonProbe: reducing occlusion in 3D using interactive space distortion. In: VRST '05: proceedings of the...
  • N. Elmqvist et al.

    A taxonomy of 3D occlusion management for visualization

    IEEE Trans Visualization Comput Graph

    (2008)
  • P. Fitts

    The information capacity of the human motor system in controlling the amplitude of movement

    J Exp Psychol

    (1954)
  • P. Fitts et al.

    Information capacity of discrete motor responses

    J Exp Psychol

    (1964)
  • Forsberg A, Herndon K, Zeleznik R. Aperture based selection for immersive virtual environments. In: UIST '96:...
  • S. Frees et al.

    Precise and rapid interaction through scaled manipulation in immersive virtual environments

    IEEE Virtual Reality

    (2005)
  • S. Frees et al.

    PRISM interaction for enhancing control in immersive virtual environments

    ACM Trans Comput–Hum Interact

    (2007)
  • Gabbard JL. A taxonomy of usability characteristics for virtual environments. Master's thesis, Department of Computer...
  • Grossman T, Balakrishnan R. Pointing at trivariate targets in 3D environments. In: CHI '04: proceedings of the SIGCHI...
  • Grossman T, Balakrishnan R. The design and evaluation of selection techniques for 3D volumetric displays. In: UIST '06:...
  • Grossman T, Wigdor D. Going deeper: a taxonomy of 3D on the tabletop. In: IEEE TABLETOP '07, 2007. p....
  • K.P. Herndon et al.

    The challenges of 3D interaction: a CHI'94 workshop

    SIGCHI Bull

    (1994)
  • Hinckley K, Pausch R, Goble JC, Kassell NF. A survey of design issues in spatial input. In: UIST '94: Proceedings of...
  • Hinckley K, Pausch R, Goble JC, Kassell NF. Passive real-world interface props for neurosurgical visualization. In: CHI...
  • Jacob RJK, Girouard A, Hirshfield LM, Horn MS, Shaer O, Solovey ET, Zigelbaum J. Reality-based interaction: a framework...