[go: up one dir, main page]

Academia.eduAcademia.edu

GColl: A Flexible Videoconferencing Environment for Group-to-Group Interaction

2009

In this paper, we present GColl, a group-to-group videoconferencing environment concept, which aims to provide a natural communication channel even for ad-hoc groups or other teams that require frequent changes in the number of participants or videoconferencing locations. GColl supports mutual gaze as well as partial gaze awareness for all participants while still retaining very modest technical requirements: a camera and an echo-canceling microphone at each site; and a notebook with two USB cameras for each user. A working prototype is available for download.

GColl: A Flexible Videoconferencing Environment for Group-to-Group Interaction Petr Slovák1,2, Pavel Troubil1,2, and Petr Holub2 1 Faculty of Informatics, Masaryk University, Botanicka 68a, 60200 Brno, Czech Republic 2 CESNET, z.s.p.o, Zikova 4, 16000 Prague, Czech Republic {slovak,pavel,hopet}@ics.muni.cz Abstract. In this paper, we present GColl, a group-to-group videoconferencing environment concept, which aims to provide a natural communication channel even for ad-hoc groups or other teams that require frequent changes in the number of participants or videoconferencing locations. GColl supports mutual gaze as well as partial gaze awareness for all participants while still retaining very modest technical requirements: a camera and an echo-canceling microphone at each site; and a notebook with two USB cameras for each user. A working prototype is available for download. Keywords: Videoconferencing, CMC, Gaze Awareness, Mutual Gaze, Partial Gaze Awareness, Mixed Presence, Presence Disparity. 1 Introduction In the last few years, videoconferencing environments and tools came once again into the focus of human-computer interaction researchers. While videoconferencing between individuals has become quite common (e.g., Skype), attention is now turned to videoconferencing environments that would support collaboration among remote groups of people. In this setting, problems known from previous research (e.g., preservation of gaze awareness and other non-verbal cues) become even more salient and creation of specialized systems is therefore necessary. Such systems might be used by distributed teams both in academic and commercial sphere. Although some of these teams might be stable enough to settle for a fixed system in a dedicated room, others require the possibility of frequent changes in the number of attendants and videoconferencing locations. Moreover, some might even be created only for very short-term tasks, thus meeting just a few times altogether. Throughout the rest of this paper, we will denote groups with these needs as ad-hoc groups. To support ad-hoc groups, our goal was to create a videoconferencing environment, which would be a compromise between the need for preserving the non-verbal cues and the requirement of low-cost and flexibility. Our resulting system, GColl, supports mutual gaze (knowledge, that someone else is looking at you) as well as partial gaze awareness (knowing general direction someone else is looking) for all participants. At the same time, it requires only a camera and an echo-canceling microphone at each remote site, and a common laptop with two USB cameras for each user. Users can join or leave GColl without any changes to the physical layout at any site; this can be done even during a videoconference meeting. T. Gross et al. (Eds.): INTERACT 2009, Part II, LNCS 5727, pp. 165–168, 2009. © IFIP International Federation for Information Processing 2009 166 P. Slovák, P. Troubil, and P. Holub Related Work. Concept of gaze awareness has been studied in great detail in the literature due to its importance for effective communication as well as other group activities (e.g., in [7, 3]). In most videoconferencing environments, gaze information is not conveyed easily due to the usual discrepancy between the camera position and the visualization of the other person’s eyes. Several multi-person videoconferencing systems were invented to mediate some or all forms of gaze awareness (e.g., GAZE2 [8], MAJIC [5], and Hydra [6]). The most recent one, Multiview [4], even achieved a full spatial faithfulness and reported very positive user evaluations. None of them are, however, directly suitable for ad-hoc groups due to either the lack of support for group-to-group interaction ([8, 5, 6]), or problems with mobility and flexibility ([4]). In all of these systems, a change of one site (e.g., an increase in the number of participants) calls for non-trivial changes of the physical layout at remote sites. Also, a change of site location is not easy due to the need of transferring a complex structure of cameras and viewing screens. Once set up, these designs provide the users with excellent communication environment; however, ad-hoc groups might be forced to spend a lot of energy on re-installment of the system. GColl mediates a slightly restricted variant of mutual gaze and partial gaze awareness than, e.g., Multiview. On the other hand, we believe the restriction to be very well counter-weighted by the highest mobility and flexibility of GColl among all similar systems known to the authors. 2 Environment Description 2.1 Basic Structure Support of mutual gaze was one of our main priorities during the design phase of GColl. To fulfill this requirement, the videoconferencing system has to supply each user with her own visualization device as otherwise those users, who are sharing a common device, are in a sense sharing the “same eyes”—either all of them experience a feeling of eye-contact with a remote participant, or none does. We could not use designs based on several projectors or monitors as this would severely limit flexibility and mobility of GColl. Instead, we opted for assumption that each user has a personal notebook or other computer at her disposal, which, as we believe, is not too restrictive for the anticipated class of possible users. A basic structure diagram of a GColl installFig. 1. A scheme of site installament at a site with three users is depicted in Figure tion 1. An image of the whole group is recorded by a single camera G. An echo-canceling microphone is used to capture audio at this site. Additionally, each user uses her notebook, two common web cameras and a headset with a microphone: the focus camera is attached to the top of the notebook screen itself while the side camera is placed on the desk next to the notebook. If available, a shared screen can be used for slides or any other material the group needs. GColl: A Flexible Videoconferencing Environment for Group-to-Group Interaction 167 Fig. 2. A scheme of the visualization layout and a photo of its prototype implementation (with two remote sites) A scheme of GColl visualization screen and a photo of our prototype implementation is shown in Figure 2. The screen is divided into three parts: the top left image provides a “whole room” view of the remote group (or splits into subframes if more than two groups are communicating); images of other participants (remote as well as co-located) are shown on the bottom of the screen and after clicking on any of these small images, the chosen stream is then shown in higher quality in the top right (focus) frame. A click on the whole group frame itself unselects the chosen stream (and restores it to the original size) once it is not needed. 2.2 Functionality The focus frame allows each participant to easily select some other user, on whom she wants to currently focus, and see him in more detail. The system informs other users whether or not they are “in focus” of each of the other participants by selecting the appropriate videostreams to be shown in the bottom (or top right) frames of their visualization screen: those users, who are not selected, receive the side camera image; the selected user receives the stream from the focus camera. For example, the user whose screen is depicted on Figure 2 knows that she has been selected by the participants A, C and F, and that she is herself focusing on the participant A. This setup allows us to mediate a sensation of mutual gaze. It is based on findings by Chen [2] that show asymmetry in human sensitivity to eye-contact—that is, people would still perceive eye-contact if the other person’s gaze is directed less than 5◦ below the camera. By having a fixed place for the focus window in GColl, it is easy to attach the focus camera appropriately close to the visualized eyes. Thus, if you are selected by another participant and that participant looks at you in his focus frame, you will perceive eye-contact as can be seen in Figure 2. 3 Conclusions and Future Work In this paper, we have presented the GColl system for group-to-group collaboration with mutual gaze support. GColl is designed to be usable even for ad-hoc communicating groups due to its flexibility and modest technical requirements. Fully functional (yet still prototype) implementation is available for download at [1]. 168 P. Slovák, P. Troubil, and P. Holub Currently, we are analyzing data gathered in a usability study comprising more than 90 people and the preliminary results seem to be quite promising. A real world case study is being run simultaneously, involving working teams from several universities and a national research organization. To bring GColl closer to face-to-face communication, we are now exploring the possibilities of using USB cameras streams for eye-tracking. If we can overcome an inaccuracy of this gaze-estimation method, it might be used for user selection instead of the current mouse/keyboard interface. We believe that this method could be more natural and therefore ease users’ interaction with GColl. Once we have this modification ready, we plan to conduct another user study to verify our hypothesis. GColl currently supports group-to-group communication, but has no inherent shared workplace or any other means of task support. We presume that such extensions of the design are possible (e.g., a shared tabletop at each site) and would enhance the collaboration experience greatly. Acknowledgments. We would like to thank Milan Kabát for his help with the implementation of the GColl prototype. This project has been supported by a research project CESNET FR 254/2007. References 1. SourceForge.net: GColl, http://www.sourceforge.net/projects/gcoll 2. Chen, M.: Leveraging the asymmetric sensitivity of eye contact for videoconference. In: CHI 2002, pp. 49–56. ACM, New York (2002) 3. Monk, A., Gale, C.: A Look Is Worth a Thousand Words: Full Gaze Awareness in VideoMediated Conversation. Discourse Processes 33, 257–278 (2002) 4. Nguyen, D.T., Canny, J.: Multiview: improving trust in group video conferencing through spatial faithfulness. In: CHI 2007, pp. 1465–1474. ACM, New York (2007) 5. Okada, K.-I., Maeda, F., Ichikawaa, Y., Matsushita, Y.: Multiparty videoconferencing at virtual social distance: MAJIC design. In: Furuta, R., Neuwirth, C. (eds.) CSCW 1994: Proceedings of the 1994 ACM conference on Computer supported cooperative work, pp. 385– 393. ACM, New York (1994) 6. Sellen, A., Buxton, B., Arnott, J.: Using spatial cues to improve videoconferencing. In: CHI 1992, pp. 651–652. ACM, New York (1992) 7. Vertegaal, R., van der Veer, G., Vons, H.: Effects of Gaze on Multiparty Mediated Communication. In: Moller, T., Ware, C. (eds.) Graphics Interface, pp. 95–102. Morgan Kaufmann, San Francisco (2000) 8. Vertegaal, R., Weevers, I., Sohn, C., Cheung, C.: Gaze-2: conveying eye contact in group video conferencing using eye-controlled camera direction. In: Cockton, G., Korhonen, P. (eds.) CHI 2003, pp. 521–528. ACM, New York (2003)