1 Introduction

The recent availability of personal devices with built-in video cameras, such as smartphones, high-resolution photo cameras, action cameras, and other wearables, has enabled users to capture, store, and share their everyday life in the form of pictures and videos on social web platforms. Every minute, 500 hours of video are uploaded to YouTube, and over one billion hours of YouTube video are watched each day [123]. The advent of wearable video cameras that can be attached to clothes, such as Google Clips [38] or Narrative Clip 2 [86], as well as of smartglasses with built-in video cameras, such as Spectacles by Snapchat [124], has amplified the phenomenon known as “lifelogging,” a new form of personal big data [39]. Consequently, the practice of recording and sharing one’s life, visual reality, and experiences on video has created the need for advances in video streaming technology to support such user behavior [113, 133], but has also raised new topics in need of scientific investigation, which the community has started to address, such as video literacy, privacy, and the impact on bystanders of the use of video recording devices in public places [54, 55, 57, 58, 59].

Mobile and wearable devices are also being increasingly used for Virtual Reality (VR), Augmented Reality (AR), and Mixed Reality (MR) applications [70], in which the physical world intertwines with the virtual [9, 10, 15] to create novel interactive experiences for users. Also, applications of Mediated Reality [67] delivered by smart eyewear devices enable their users to perceive new facets of physical reality, such as physical phenomena taking place outside the visible spectrum [68], while applications in Alternate Reality (AltR) [19] use multimedia technology to engage users in new experiences, such as “living” other people’s stories and interacting in different places, times, situations, and contexts. At the same time, a new type of ubiquitous mixed reality has emerged from the fusion of networked sensor and actuator infrastructure with shared online virtual worlds: Cross-Reality [94].

We believe that it is only a matter of time before lifelogging and cross-reality technology, used in conjunction, will provide practical means for users to express themselves in new ways, share their everyday life activities, and connect to remote audiences in creative ways for broadcasting and consuming AltR realities. In this work, we explore such aspects at the intersection of lifelogging, cross-reality, and multimedia alternate reality technology, and we focus on the specific topic of broadcasting personal visual realities to a remote audience. Figure 1 shows an illustration of the concept: the visual reality corresponding to the user’s field of view is captured by the video camera embedded in a smart device, such as a pair of smartglasses, processed by lifelogging [39], life abstraction [4], augmentation [15], and/or mediation [67], and streamed to a video server, from where it is broadcast to a remote audience. The audience may consist of friends, family members, co-workers, peers, or any followers [111, 134] recognized and authorized by the broadcaster. The members of the audience may experience the broadcast as a video stream, audio narration, or even haptic feedback connected to specific events that occur in the life of the broadcaster. The palette of design possibilities for systems implementing broadcasting and consumption of personal visual realities supported by lifelogging devices, cross-reality ubiquitous sensor and actuator infrastructure, and AltR modalities is large, with many opportunities for innovation. In this context, it is important to analyze, structure, and discuss design possibilities for such applications. Our practical contributions are:

  1. We introduce the “Alternate Reality Broadcast-Time” matrix (AltR-BT) in the form of a conceptual space to characterize the possibilities for sharing personal visual realities with remote audiences. In this space, we identify the broadcast (what is shared) and the time of broadcasting (when sharing occurs) as two important dimensions for broadcasting personal visual realities to third-party viewers for systems positioned at the intersection of lifelogging, AltR, and cross-reality technology.

  2. We describe three prototypes that implement design options from the AltR-BT space: (1) video streaming from Wi-Fi camera glasses using established protocols, e.g., HLS and DASH; (2) video streaming of mediated and augmented vision implemented with the HoloLens Head-Mounted Display (HMD) and YouTube Live; and (3) broadcasting of life abstraction in the form of concepts automatically detected from the video captured by camera glasses, which are rendered visually, aurally, and as vibrotactile feedback to the members of the remote audience.

As technology advances in terms of cross-reality infrastructure of ubiquitous sensors and actuators, new concepts emerge for AltR realities, and lifelogging devices become prevalent, we are about to see more applications at the intersection of these areas, creating new opportunities for self-expression and new ways for users to connect to remote audiences. Our AltR-BT conceptual space will help guide such developments by informing researchers and practitioners about the design possibilities for broadcasting and consumption of personal visual realities.

Fig. 1

A diagram overview of broadcasting personal visual realities to remote audiences. Notes: In this illustration, streaming and broadcasting include, but are not limited to, video content. For example, concepts can be automatically extracted from first-person video and narrated to an audience wearing smart earbuds; see our broadcast-time matrix in Fig. 3

2 Related work

We relate to prior work in computer-generated, mediated, and cross-reality, overview established protocols for video streaming over the Internet, and discuss prior work on smartglasses with embedded video cameras and video streaming applications, including the lifelogging phenomenon.

2.1 Computer-generated and computer-mediated realities

Prior work has addressed many facets and practical aspects of the ways in which physical reality can be captured, processed, and rendered in modified forms and formats to users. These computer-generated, computer-mediated, and computer-enhanced realities include Virtual Reality [51], Augmented Reality [9, 10], Mixed Reality [77, 78], Mediated Reality [67], Multimediated Reality [68], Alternate Reality [19], and Cross-Reality [94], to name a few of the most common forms of hybrid physical-virtual realities that are relevant to our scope of investigation.

We are interested in such hybrid realities from the point of view of the visual perception they create for their users, in relation to which we discuss the opportunity of broadcasting personal visual realities to remote audiences. To this end, we overview definitions of the various forms of computer realities by focusing on visual perception. For example, Pausch et al. [96] adopted a visual perspective for defining VR as “any system that allows the user to look in all directions and updates the user’s viewpoint by passively tracking head motion” (p. 13), while Mann et al. [68] provided a shorter definition referring to systems that replace the real world with a virtual one, and introduced a classification system for many types of realities, including mediated, augmediated, and multimediated. Azuma [9] introduced a practical definition of AR by enumerating the characteristics of systems that combine the real and the virtual, are interactive in real time, and operate in 3-D. Regarding MR, definitions are still divided among experts [130], although Milgram et al.’s [77, 78] distinction between Augmented Reality and Augmented Virtuality (AV) as instances of Mixed Reality in the Reality-Virtuality Continuum seems to be the preferred one, i.e., “display systems [that] provide users with the opportunity to move back and forth between real world and virtual world scenes” [77] (p. 19) or, respectively, a “mix of real and virtual worlds in various proportions,” according to Mann et al. [68] (p. 1). Speicher et al. [130] concluded that MR “can be many things and its understanding is always based on one’s context” (p. 12).

Mediated Reality (X-Y.R) goes a step further with respect to VR, AR, and MR by mixing, blending, and modifying reality [68], such as by enabling users to filter out undesirable aspects of the visual world. A special application of X-Y.R is “diminished reality” [67], where objects from the background are removed or replaced so that visual attention can focus on a subset of objects of interest; see Zokai et al. [149] for examples. In this regard, Mori et al. [81] described diminished reality as a “set of methodologies for concealing, eliminating, and seeing through objects in a perceived environment in real time to diminish the reality” (p. 1). When mediation and augmentation occur simultaneously, the result is an augmented mediated reality or, for short, Augmediated Reality [50, 68], which Scourboutakos et al. [115] characterized as “an experience where by the means of a system technology, people are able to seamlessly modify (mediate) their perception of reality, while it is also being augmented” (p. 752).

2.2 Cross-Reality and Alternate Reality

Paradiso and Landay [94] introduced Cross-Reality as the union between ubiquitous sensing and actuator technology and shared online worlds. Within Cross-Reality, “sensor networks can tunnel dense real-world information into virtual worlds” and “interaction of virtual participants can incarnate into the physical world through a plenitude of diverse displays and actuators” [94] (p. 14). Although Paradiso and Landay did not provide an acronym for Cross-Reality, Mann et al. [68] referred to it using XR-like notations, while also providing context and discussing the history of the “X” in “XR.” According to Mann et al., the “X” has been used to refer to extrapolation, interpolation, and extension of the physical world. To prevent confusion between these uses of the acronym, Mann et al. [68] provided three definitions for XR realities that received distinct names: Type 1a (or XR1a), Type 1b (XR1b), and Type 2 (XR2) realities, respectively. In Type 1 worlds, “X” is a mathematical variable that denotes either extrapolation past the physical reality (a Type 1a or XR1a reality) or interpolation in Milgram’s [77, 78] sense of the reality-virtuality continuum (a Type 1b or XR1b reality). In Type 2 worlds, “X” denotes Paradiso and Landay’s [94] “cross” prefix from “cross-reality.” Another use of the XR acronym has been the umbrella term “Extended Reality” to cumulatively denote VR, AR, and MR realities. In fact, the community has been using XR to denote both Cross-Reality [26, 48, 84, 85, 118, 126] and Extended Reality [20, 69, 71, 91, 143]. To avoid such confusion, in this work we use the acronym XR2, following the taxonomy and notations of Mann et al. [68], to refer to the cross-reality concept of Paradiso and Landay [94], which is relevant to the scope of our investigation.

Another type of reality relevant for our work is Alternate Reality (AltR). According to Chambel et al. [19], an alternate reality denotes “different spaces, times or situations that can be entered thanks to multimedia contents and systems, that coexist with our current reality, and are sometimes so vivid and engaging that we feel we are living in them ... immersive experiences that may involve the user in a different or augmented world, as an alternate reality.” Thus, AltR goes beyond the focus of VR, AR, and MR by addressing new forms of media and delivering new kinds of immersive experiences to users. Consequently, of all the various forms of combining the real and the virtual, we relate most to XR2 and AltR, since our scope addresses broadcasting personal visual realities, which implicitly requires the support of ubiquitous computing technology (e.g., networked infrastructure, communications protocols, video streaming technology, wearable devices) and multimedia techniques, including interaction techniques, to deliver to a remote audience the visual experience of the broadcaster. Figure 2 illustrates the scope of our work as a Venn diagram adapted from Mann et al. [68]. While XR2, a subset of MR [68], is a combination of virtual worlds and ubiquitous infrastructure, AltR uses multimedia technology to engage users in experiences of other realities corresponding to different places, times, situations, and contexts [19]. Thus, the intersection of lifelogging, AltR, and XR2 delineates the scope of our investigation.

Fig. 2

Venn diagram showing the scope of our work at the intersection of Cross-Reality [94] and Multimedia Alternate Reality [19]. Note: the taxonomy of various types of realities, including Virtual Reality (VR), Augmented Reality (AR), Cross-Reality (XR2), Mixed Reality (MR), and Mediated Reality (X-Y.R), uses the representation formalism from the diagram of Mann et al. [68] (p. 13), to which we added Multimedia Alternate Reality (AltR)

2.3 An overview of video streaming protocols

We relate to video streaming protocols since streaming represents the minimal functionality expected from systems that implement broadcasting of personal visual realities to remote viewers. According to Cisco’s Annual Internet Report for 2018-2023 [22], video streaming is forecast to account for 82% of Internet traffic by 2022, of which 22% will represent UHD IP video. The increased availability of personal video camera devices and the growing adoption of lifelogging practices [39] will lead to even more creation and consumption of video-based content.

Many protocols are available for streaming content, including video; see [113, 133] for overviews. Common protocols include the Real-time Transport Protocol (RTP) [49], the Real-Time Streaming Protocol (RTSP) [103], the User Datagram Protocol (UDP) [100], and the Hypertext Transfer Protocol (HTTP) [104]. For instance, RTP [49] is a push-based protocol for video and audio streaming over IP-based networks, used in communication and entertainment systems that involve streaming media, such as telephony, video teleconferencing applications, television services, and web-based push-to-talk features. RTSP [103] is employed in communication systems that require real-time sessions between end users or between streaming servers and clients. HTTP [104] can be employed to transfer various types of content, including text, images, graphics, audio, and video.

Some communication protocols have been specifically devised for video in order to optimize particular aspects of the delivery chain. An example is HTTP Adaptive Streaming (HAS) [119], which supports most of the video traffic on the Internet due to its reliable transmission, ability to reuse existing cache infrastructure, and firewall traversal. HAS is employed by Over-The-Top Players (OTTP), such as Netflix and YouTube. To improve the performance of adaptive streaming over HTTP, several rate adaptation algorithms have been introduced [64, 65, 79, 136, 147], and the literature has equally focused on improving specific aspects of HAS, such as adaptive bit rate selection [112], buffer-based policies [63], and QoE models [11, 12, 119]. Other protocol examples include Microsoft Smooth Streaming (MSS) [120], Apple HTTP Live Streaming (HLS) [93], and HTTP Dynamic Streaming (HDS) [2]. For instance, HLS was designed to dynamically adapt to network conditions by optimizing playback at the client level. The typical length of media segments in HLS is 10 seconds, which determines its latency. To improve on this aspect, the Low-Latency extension of HLS [102] was proposed. Also, the MPEG group introduced Dynamic Adaptive Streaming over HTTP (MPEG-DASH) to unify existing solutions (e.g., MSS, HLS, HDS) by defining the format of media segments and the Media Presentation Description (MPD).
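To illustrate how a web client consumes such segment-based streams, the following minimal browser-side sketch uses the open-source hls.js player (which we also employ in Sect. 4); the stream URL is a placeholder, and the snippet is an illustrative example rather than part of our prototypes:

// Minimal HLS playback sketch; the stream URL is a placeholder.
import Hls from 'hls.js';

const video = document.querySelector('video');
const streamUrl = 'https://example.com/live/stream.m3u8';

if (Hls.isSupported()) {
  const hls = new Hls();
  hls.loadSource(streamUrl);     // fetches the playlist, then its media segments
  hls.attachMedia(video);
  hls.on(Hls.Events.MANIFEST_PARSED, () => video.play());
} else if (video.canPlayType('application/vnd.apple.mpegurl')) {
  video.src = streamUrl;         // e.g., Safari plays HLS natively
  video.play();
}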

2.4 Smart eyewear devices with video streaming capabilities

A large number of commercially available smartglasses feature video streaming functionality [7, 30, 66]. Applications for smartglasses address a wide range of uses, including improving the quality of life of people with visual impairments [42, 131, 135], delivering information about one’s surroundings [110], streaming the wearer’s first-person video to remote viewers [3, 72, 105], consuming media projected on see-through lenses [52, 89, 90], and lifelogging [4, 5]. For example, Stearns et al. [131] introduced a system that used the HoloLens HMD to render video captured by a smartphone, enabling users with low vision to see a magnified version of their visual reality. Tanuwidjaja et al. [135] developed a smartglasses-based system for color substitution to assist colorblind people in distinguishing color information. Rio and McCullough [72, 105] discussed telementoring and teleproctoring applications. Other systems have employed video streaming to provide access to remote events rendered via HMDs [90], to view favorite parts of videos recorded in 360\(^\circ\) [89], for end-to-end VR streaming [52], and to deliver super-multi-view 3-D content from an autostereoscopic device to a web browser [117].

2.5 Lifelogging

Lifelogging is the practice of using devices with embedded sensors, such as wearable video cameras [4, 16, 44] or activity and location trackers [75, 148], to record aspects of everyday life [40]. As wearable devices have become increasingly affordable and available, lifelogging has become a phenomenon [35], with many people adopting this technology as a life practice. Whereas in 1996 JenniCam [41] streamed student Jennifer Kaye Ringley’s daily activities at a rate of one snapshot every three minutes, today’s high-speed action cameras can capture 1080p images at the impressive speed of 240 fps [41]. Several commercial products are available for lifelogging enthusiasts, among them Narrative Clip 2 [86], MeCam [74], and SnapCam [125]. Lifelogging prototypes based on wearable video cameras [16, 28, 44] are well represented in the scientific literature, which has targeted specific applications, such as food-logging [56], vehicular lifelogging [5, 73], thing-logging for the Internet-of-Things [36], logs of computer usage [43] and sleep patterns [75], and applications that monitor and report aspects of the quality of life [148]. Among these applications, lifelogging has been used for people with cognitive disabilities [6, 14, 132]; e.g., Berry and Stix [14, 132] discussed two applications for people with Alzheimer’s that enabled users to relive recent visual experiences with the help of the data stored in the lifelog, and Al Mahmud et al. [6] proposed an image capturing device for people with aphasia that captured photographs and added tags automatically.

2.6 Privacy aspects about the use of video cameras in public places

With the increasing availability of mobile and wearable devices with built-in video cameras that feature video recording and streaming functionality, ensuring the privacy of bystanders represents an important ethical aspect of using such devices in public places. Many studies have examined concerns regarding public video recording with wearable video cameras [21, 27, 31, 45, 46, 57, 58, 59, 101, 122]. Some studies have focused on the social acceptability of smartglasses [27, 59], while others analyzed sensitive lifelog data [45] or documented the behavior of lifeloggers [46, 58]. Also, the legal and ethical implications of video streaming from public places have been addressed [31, 101], including practical solutions for bystanders in the form of privacy mediation [27, 45, 57]. For example, Koelle et al. [57] proposed a set of free-hand gesture commands to enable bystanders to signal their preferences about being recorded or not by users of wearable video cameras. By using a gesture vocabulary that video camera devices could detect and understand, passers-by could explicitly express their consent to or disapproval of video recording. Some systems, such as Life-Tags [4], considered privacy aspects as part of their design requirements and replaced video recording with concepts (tags) extracted from videos.

3 The Alternate Reality broadcast-time matrix

We introduce in this section a conceptual space with two axes, broadcast and time, in the form of a discrete two-dimensional matrix, in which we identify multiple opportunities to design systems for broadcasting users’ personal visual realities to remote audiences. We draw inspiration from Johansen’s 2 \(\times\) 2 time-space groupware matrix [53] for group interactions, which depicts all possible combinations of location (where communication between individuals takes place) and time (when communication takes place). For example, among the four categories permitted by the time-space matrix [53], Johansen discussed face-to-face interactions that occur in the same place and at the same time, as well as communication and coordination between the members of a group that can be implemented remotely and asynchronously, i.e., in a different place and at a different time. From Johansen’s matrix, we borrow the “time” axis (i.e., when the broadcast of the personal visual reality takes place), which we extend to cover practical aspects regarding streaming content in relation to the latency of video streaming protocols [141]. Also, since broadcasting to a remote audience implicitly specifies distinct locations, i.e., the locations of the broadcaster and of the members of the audience, we replace Johansen’s “space” axis with “broadcast” to distinguish between various types of content related to the visual reality that is shared. With these two axes, broadcast and time, our matrix specifies what part of one’s personal visual reality is shared and when sharing occurs in terms of the synchronicity between the broadcasters and their audiences. By specifying all the possible combinations between the categories of these two axes, our matrix generates a two-dimensional conceptual space, the Alternate Reality Broadcast-Time space (AltR-BT), in which life broadcasting systems can be positioned, characterized, and compared, and in which ideas regarding future versions of such systems can be generated by following a systematic approach.

Figure 3 shows a visual illustration of the AltR-BT conceptual space for broadcasting personal visual realities to remote audiences, in which 40 combinations of broadcast \(\times\) time have been identified. However, a larger number of possibilities exists in this space, and systems may implement more than one category from each axis, e.g., there are \(\binom{10}{2} = 45\) ways to select two categories from the broadcast axis that, when combined with each category from the time axis, generate 180 new design possibilities; also, there are \(\binom{10}{3} = 120\) ways to select three broadcast categories, leading to 480 designs by combining them with each time category, and so on. Next, we discuss each axis in detail.
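The counts above follow directly from the binomial coefficient; the short node.js sketch below (a plain script with no external packages, provided for illustration only) reproduces them:

// Counting design combinations in the AltR-BT space.
function binomial(n, k) {
  let result = 1;
  for (let i = 1; i <= k; i++) {
    result = (result * (n - k + i)) / i;  // the running product stays an integer
  }
  return result;
}

const broadcastCategories = 10;
const timeCategories = 4;

console.log(broadcastCategories * timeCategories);  // 40 single-category combinations
console.log(binomial(10, 2) * timeCategories);      // 45 x 4 = 180 designs with two broadcast categories
console.log(binomial(10, 3) * timeCategories);      // 120 x 4 = 480 designs with three broadcast categories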

Fig. 3

The Alternate Reality Broadcast-Time matrix, a conceptual space to inform design and implementation of systems for broadcasting personal visual realities. Note: in Sect. 4, we discuss a few demonstrative prototypes, highlighted in this space

3.1 The time axis

By drawing from the time axis of Johansen’s [53] matrix, we distinguish between synchronous and asynchronous communication between the broadcaster and their remote audience. On this axis, we consider different application requirements for the interactivity between the broadcaster and the audience, and identify four categories, as follows (see Fig. 3):

  1. High-level interactivity expected. This category includes applications for which two-way live interactivity between the broadcaster and the audience is critical [107]. For example, applications for video conferencing, collaborative work, and remote labs, where synchronization is key in order for the task to be accomplished effectively, fall in this category. Under the requirements of a highly interactive life broadcasting system, a practical technical implication is a very low latency for the broadcast, delivered with a high-performing streaming protocol. One possible implementation at the moment of this writing is the Web Real-Time Communication protocol (WebRTC) [106], regularly employed for two-way web conferencing and telepresence applications; see the sketch after this list.

  2. Medium-level interactivity expected. This category contains applications with less stringent requirements on the interactivity expected between the broadcaster and their audience. Strict synchronization is not required for the life broadcasting system to operate effectively and for tasks to be accomplished. Examples of applications include user-generated content live streams, game streaming, and e-sports. Existing video streaming protocols that could be employed to support this category of applications include Low-Latency HLS [102] (latency of at least about two seconds), Low-Latency CMAF for DASH [109] (at least two seconds), and RTSP/RTP (at least one second); see [141] for details.

  3. Low-level interactivity expected. With this category, we enter the asynchronous part of the time axis. It covers applications that do not necessarily expect interactions between the broadcaster and the audience, or for which interactions need not be synchronized with the broadcast, such as streaming news, sport events, or one-way streams of reality shows and live events to large audiences. Examples of video streaming protocols currently in use that are suitable to support applications from this category include Apple HLS [93] and MPEG-DASH [82], with latencies of around eighteen seconds, according to reports from [141].

  4. No interactivity expected between the broadcaster and the audience. For these applications, the content is archived and broadcast only on demand. YouTube video streaming of content that was previously uploaded by creators represents a relevant example. In this case, the audience can query the archive to locate video recordings that match specific keywords, descriptions, or time intervals. Streaming Netflix content also falls into this category. Searching through the lifelog [39] for recollecting, reminiscing, retrieving information, reflecting, and remembering is another example.
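As an illustration of the first category, the following browser-side sketch shows how a broadcaster could publish a low-latency camera stream with WebRTC; the application-specific signaling channel is omitted, and the STUN server is an illustrative choice rather than a requirement:

// Minimal WebRTC publishing sketch for high-interactivity broadcasts.
async function startBroadcast() {
  const stream = await navigator.mediaDevices.getUserMedia({ video: true, audio: true });
  const pc = new RTCPeerConnection({ iceServers: [{ urls: 'stun:stun.l.google.com:19302' }] });
  stream.getTracks().forEach(track => pc.addTrack(track, stream));  // publish camera and microphone
  const offer = await pc.createOffer();
  await pc.setLocalDescription(offer);
  // The offer/answer exchange with the remote viewer happens over a separate
  // signaling channel (e.g., WebSockets), which is omitted here.
  return pc;
}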

From the first to the last category on the time axis, the interaction between the broadcaster and their remote audience becomes less constrained, from high-level interactivity to no interactivity expected at all. To exemplify possible implementations of life broadcasting systems along the time axis, we have enumerated in this section several streaming protocols that could be employed at the moment of this writing to meet various application requirements. However, our time axis should be viewed from the higher-level perspective of specifying interactions that need to take place either synchronously or asynchronously, rather than from the perspective of the latency delivered by one streaming protocol or another. Moreover, these requirements are inherent to human cognition and do not change as streaming protocol technology evolves.

3.2 The broadcast axis

The time axis specifies how the time zones of the broadcasters and their audiences overlap during the broadcast, with implications regarding the interactivity permitted between the broadcaster and the audience. In the following, we look at the video content that is broadcast in relation to one’s personal visual reality, for which we consider various forms to transmit it and make it available to the remote audience. We distinguish two major categories on the broadcast axis, video and non-video, and ten subcategories (see Fig. 3 for a visual illustration), as follows:

  1. Original, unmodified video. The video that is broadcast is not processed in any way and, thus, the audience receives the original version of the broadcaster’s visual reality as captured by the video acquisition device. This category contains applications of all types of video cameras: fixed, wearable, or embedded in various devices. For example, wearable video cameras can be attached to the body to record and stream video from a first-person perspective. Commercial products for lifelogging, such as Narrative [86], Google Clips [38], MeCam [74], and SnapCam [125], and videos captured with photo cameras and smartphones are also part of this category [1, 18, 60]. First-person, eye-level video also falls into this category, but is distinct from the previous examples in that the video camera is worn at eye level so that video is recorded and streamed from the eye height of the wearer [108]; an example is the Life-Tags prototype of Aiordăchioae and Vatavu [4]. The importance of a correct perspective and point of view for lifelogging systems has been explicitly stated in the scientific literature, see Gurrin et al. [39] (pp. 20, 31, 39), but most commercial products for lifelogging [38, 74, 86, 125] have not yet implemented such recommendations. Other examples of applications from this category use video cameras integrated into other devices that follow the user around, such as personal drones [83], or that are employed by the user, such as cars [99].

  2. Mediated video. The video that is broadcast may be processed, either locally or at the video streaming server, so that the audience receives a mediated view of the personal visual reality of the broadcaster. According to Mann [67], mediation offers a wide range of options, from processing video using computer vision algorithms (e.g., contrast enhancement, color correction, etc.) to diminished reality (where objects from the background are faded out so that visual attention can focus on a subset of objects of interest). For example, the full visual reality of the broadcaster may be accessible to a selected group of members from the audience, while the rest have access to a diminished version only, from which sensitive and personal content is removed. For instance, the “Audience Silhouettes” [137] prototype broadcasts 3-D body silhouettes of TV viewers to the members of their social group, but removes the background and any personal traits (e.g., face, clothes, etc.), leaving just their body silhouettes for non-verbal communication.

  3. Augmented video. Different from mediation [67, 68], augmentation refers to adding new information, in the form of computer-generated content, on top of the video capture of the physical reality. Examples include highlighting faces in video streams, displaying the names and descriptions of identified persons, showing details of specific objects, and so on. AR applications [15] fall into this category.

  4. Virtual-world video. In this category, the audience is presented with a completely virtual version of the broadcaster’s visual reality. Milgram et al. [77] discussed transitions between different mixtures of the physical and the virtual leading to completely virtual worlds, and Vatavu et al. [139] presented the virtual world/conventional TV and virtual world/virtual TV examples for AR television, where TV viewers are immersed into virtual environments, from where they watch television either in the form of a video stream of their physical TV set or on a virtual TV inside the VR environment.

  5. Broadcast summaries. Represent individual snapshots that are broadcast instead of actual video. The advantage of this approach is reduced network bandwidth and flexibility in terms of the events that trigger streaming. For example, Google Clips [38] automatically starts recording when it detects a familiar face or a pet, and video surveillance systems start recording/streaming when motion is detected, etc.

  6. Life abstractions. Represent summaries of the broadcaster’s visual reality over a specified period of time. The concept of life abstraction was introduced by the Life-Tags prototype of Aiordăchioae and Vatavu [4] to present users with short summaries of their lifelog in the form of word clouds of concepts automatically extracted from video.

  7. Visual news tickers. Represent a particular form of life abstraction in which the concepts automatically detected from video are broadcast to the audience and displayed in the form of a narration. DeepDiary [32] is an example of such a system that employs automatic image captioning techniques to produce textual representations of video lifelogging data. Such narrations can be rendered as text to the audience as if they were a “news ticker” reporting on the life of the broadcasters.

  8. Audio news tickers. Render the content of the previous category using text-to-speech techniques. For example, Soundscape [76] is an application that provides information about one’s surroundings, the “Sotto voce” [8] system enables sharing of audio information, and audio AR systems [13, 142] fall into this category as well.

  9. Haptic news tickers. Convert content to vibrotactile feedback delivered on the body of members from the audience, such as on the wrist (e.g., by a smartwatch), the arm (by a smart armband), and so on. Tactons [17], digital vibrons [138], Flex-n-Feel [121], and haptic AR systems [61] are examples that relate to this category.

  10. Multimodal news tickers. Represent multiple possible combinations of the previously mentioned modalities to broadcast life to the remote audience. For example, the audience may experience both visual and haptic news tickers, where the latter accentuate particular events.

From the first to the fourth category of the video part of the broadcast axis, the relationship between the video content that is broadcast and the corresponding visual reality of the broadcaster becomes weaker. From the original, unmodified version of the video captured by the acquisition device to the virtual reconstruction and representation of the broadcaster’s visual reality, the audience is presented with different versions of what the original visual experience of the broadcaster has been. In this work, we are primarily interested in broadcasting visual life, but video is not the only modality to attain this goal. Consequently, the non-video part of the broadcast axis specifies other modalities by means of which the visual experience of the broadcaster can be delivered to their audience, from textual descriptions of visual life to video rendered using haptic feedback. The broadcast axis thus reveals a variety of modalities in which personal visual realities can be communicated to the members of the broadcaster’s audience. These modalities go beyond mere video streaming toward richer forms of self-expression for the broadcaster and consumption opportunities for the audience. Other categories could be added to the broadcast axis, such as categories that address the gustatory or olfactory senses [88] of the audience. However, since research regarding these modalities is still scarce compared to the large body of work on rendering visual, audio, and haptic feedback, we opted to limit, for now, the number of categories on the broadcast axis to keep it manageable from a practical point of view. Future work can incorporate such new categories to address new applications of broadcasting personal visual realities.

4 Demonstrative prototypes

We describe in this section three prototypes that demonstrate various design possibilities from our AltR-BT conceptual space for sharing personal visual realities with remote audiences. Each prototype builds on prior work and extends it by implementing various AltR-BT categories.

4.1 Broadcasting first-person, eye-level video

Aiordăchioae [3] presented a simple yet effective solution for sharing first-person, eye-level video, captured from a pair of camera glasses, with remote viewers. Aiordăchioae’s technical solution employed HTTP, WebSockets, JavaScript, and node.js technology to transfer individual snapshots from the camera glasses to clients running in the web browsers of the remote viewers’ desktop PCs, smartphones, and tablets; see [3] for details regarding the software architecture and implementation. Using concepts and categories from the AltR-BT space, we can characterize Aiordăchioae’s system as the design combination {original, unmodified video} \(\times\) {low-level interactivity expected}. Thus, this prototype reflects the intersection between XR2 and AltR, where the remote audience can experience the broadcaster’s visual reality, but little interactivity is required.
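For context, a snapshot relay in the spirit of [3] can be sketched as follows; this is our own minimal reconstruction rather than the original implementation, and the HTTP route, port, and the ws npm package are assumptions:

// Sketch: snapshots received over HTTP POST are pushed to browser clients over WebSockets.
const http = require('http');
const WebSocket = require('ws');

const wss = new WebSocket.Server({ noServer: true });

const server = http.createServer((req, res) => {
  if (req.method === 'POST' && req.url === '/snapshot') {
    const chunks = [];
    req.on('data', chunk => chunks.push(chunk));
    req.on('end', () => {
      const jpeg = Buffer.concat(chunks);
      // Forward the latest snapshot to every connected viewer.
      wss.clients.forEach(client => {
        if (client.readyState === WebSocket.OPEN) client.send(jpeg);
      });
      res.end('ok');
    });
  } else {
    res.statusCode = 404;
    res.end();
  }
});

server.on('upgrade', (req, socket, head) => {
  wss.handleUpgrade(req, socket, head, ws => wss.emit('connection', ws, req));
});

server.listen(8080);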

In the following, we show how the prototype from [3] can be extended by considering more categories from the time axis of the AltR-BT space. Specifically, we re-implemented the technical solution from [3], but replaced the request-response communications for the transfer of individual snapshots from the camera glasses with established video streaming protocols, which were discussed in Sect. 2. We employed the same model of video camera glasses [30], which feature a full-HD micro video camera with a 90\(^\circ\) field of view and Wi-Fi operation; see Fig. 4, top-left image for a snapshot of the user (broadcaster) wearing the glasses. The software implementation employed the node-media-server [80] npm package for node.js, which provides server-based implementations of the RTMP, HTTP-FLV, WS-FLV, HLS, and DASH video streaming protocols; see a description of these protocols in Sect. 2. The server-side application receives an incoming video stream from the camera glasses in the Advanced Streaming Format (ASF) [34], which is converted to RTMP [95] using the FFmpeg [33] module for node.js. This conversion enabled us to use node-media-server to publish live streams delivering HTTP-FLV, WS-FLV, HLS, and DASH video. To access these live streams from a web browser, we implemented web clients using dash.js [25], hls.js [140], and flv.js [47]; see Fig. 4, top-right image for an illustration. On a smartphone client (Samsung Galaxy J6, Octa-Core 1.6 GHz Cortex-A53 CPU, 3 GB RAM, Android 9, Chrome), we received video with an average latency of 6 seconds for RTMP, 23 seconds for HLS, 42 seconds for DASH, and 8 seconds for HTTP-FLV and WS-FLV. Figure 4, bottom illustrates the categories from the AltR-BT space implemented for this prototype.
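The server-side setup can be summarized by the sketch below; the ports, media root, and segmentation flags are illustrative assumptions, and the FFmpeg conversion of the glasses' ASF feed into the published RTMP stream is configured as described above:

// Sketch: node-media-server republishing an incoming RTMP stream as HLS and DASH.
const NodeMediaServer = require('node-media-server');

const config = {
  rtmp: { port: 1935, chunk_size: 60000, gop_cache: true, ping: 30, ping_timeout: 60 },
  http: { port: 8000, mediaroot: './media', allow_origin: '*' },
  trans: {
    ffmpeg: '/usr/bin/ffmpeg',   // path to the FFmpeg binary used for repackaging
    tasks: [
      {
        app: 'live',             // streams are published to rtmp://<host>/live/<stream-key>
        hls: true,
        hlsFlags: '[hls_time=2:hls_list_size=3:hls_flags=delete_segments]',
        dash: true,
        dashFlags: '[f=dash:window_size=3:extra_window_size=5]'
      }
    ]
  }
};

new NodeMediaServer(config).run();

The web clients built with hls.js, dash.js, and flv.js then point to the playlists and FLV endpoints exposed by this server.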

Fig. 4

First-person, eye-level video streaming using camera glasses. Top-left: the broadcaster wearing a pair of camera glasses. Top-right: snapshot of a smartphone rendering DASH and HLS video streams. Bottom: visual illustration of the AltR-BT concepts involved

4.2 Broadcasting third-person perspectives using a personal drone

We extended the prototype from the previous section to work with video captured from a personal drone, providing a third-person perspective. To this end, we used the Parrot Mambo Fly drone, to which we attached a 720p HD video camera; see Fig. 5. We kept all the technical details of the software implementation (node.js, HLS, DASH, HTTP-FLV, WS-FLV) and, instead of connecting to the pair of glasses, we connected to the drone’s video camera. The result is life broadcasting of original video captured and delivered from a third-person, aerial perspective.

Fig. 5

Third-person perspective video streaming with a personal drone (top-left image) to a laptop (top-right). Bottom: visual illustration of the AltR-BT concepts involved

4.3 Broadcasting first-person, eye-level mediated and augmented video

To further explore the design categories of the broadcast axis, we revisited an application for Microsoft HoloLens from Pamparău and Vatavu [92] that implemented mediated and augmented vision; see Fig. 6, top-left. This prototype implements several image processing algorithms, such as color correction, edge highlighting, contrast and brightness adjustment, and face detection, which were informed by prior work on vision rehabilitation and augmentation [29, 62, 97, 98, 135, 144, 145, 146]. The processing techniques, called visual filters in [92], are applied to the video frames captured by the camera embedded in the HoloLens HMD. The result is aligned with the visual reality perceived via the see-through lenses; see Fig. 6, top-right and middle images and Pamparău and Vatavu [92] for more details. From the perspective of our scope of investigation, this prototype is at the intersection of XR2 and AltR, enabling the remote audience to experience not only an original version of the broadcaster’s visual reality, but also its mediation and augmentation. We implemented broadcasting of the user’s mediated and augmented vision with the OBS software [129] and a YouTube channel for live streaming. This solution enables a remote audience to access the augmented and mediated video broadcast from the web browser of any device, e.g., desktop PC, laptop, smartphone, smartwatch, etc. The bottom part of Fig. 6 illustrates the categories from the AltR-BT space, i.e., {original video, mediated video, augmented video} \(\times\) {medium-level interactivity expected}.

Fig. 6

The HoloLens HMD prototype implementing augmented and mediated vision. Top left: the broadcaster wears HoloLens that delivers a contrast-adjusted and color-corrected version of the visual reality. Top right and middle: the YouTube live video streaming enabling other viewers to access the mediated visual reality of the broadcaster. Bottom: visual illustration of the AltR-BT concepts implemented by this prototype

4.4 Broadcasting life abstractions

Aiordăchioae and Vatavu [4] introduced Life-Tags, a smartglasses-based system designed to automatically capture snapshots and to abstract them in the form of word clouds of visual concepts (tags). Life-Tags was the first wearable system implementing life abstraction in the context of lifelogging applications. However, Life-Tags was designed to operate mainly by saving the snapshots and associated tag clouds for the user’s benefit and less for sharing them with a remote audience. According to our broadcast-time matrix, Life-Tags implements {life abstraction} \(\times\) {no interactivity required (archived content)} and is found at the intersection between lifelogging technology and XR2.

Fig. 7

The extended version of Life-Tags [4]. Top: the broadcaster wearing camera glasses, an example of a tag cloud, and a member of the audience wearing the Myo smart armband. Bottom: visual illustration of the AltR-BT concepts implemented by this prototype

In the following, we show how Life-Tags [4] can be extended for live streaming of tag clouds and for other output modalities from the broadcast axis. We re-implemented the Life-Tags pipeline described in [4] as follows: snapshots automatically captured from micro video camera glasses are stored temporarily on the user’s smartphone, from where they are offloaded to permanent storage, such as a desktop PC, and a third-party processing service [23] is used to generate descriptive tags, which form word clouds [24]; see Fig. 7, top-right for an example. We implemented a node.js server-side application using the Socket.IO library [127] for real-time, bidirectional, event-based communications, and represented tag clouds as JSON objects; see the sketch below. For the client side, we employed the Android Socket.IO client library [128] to implement the communication with the server. On the client side, multiple options are available to render the tags: visual display of tag clouds, a visual news ticker where tags are scrolled into view, audio rendering of the tags via a cloud text-to-speech API [37], and haptic feedback. Regarding the latter, our choice for a haptic rendering device was the Myo smart armband [87]; see Fig. 7, middle image. Myo is a lightweight (93 grams) armband, expandable between 19 and 34 centimeters, that provides haptic feedback in the form of short, medium, and long vibrations. We implemented associations of specific concepts, e.g., “travel” or “working,” with specific vibration patterns, e.g., medium pulse, pause of 500 ms, short pulse, pause of 500 ms, long pulse; see examples in the literature [138] and previous results regarding user perception of vibrotactile feedback at forearm level [114]. Figure 7, bottom illustrates the categories from the AltR-BT space, i.e., {life abstraction, visual news ticker, audio news ticker, haptic news ticker} \(\times\) {no interactivity expected}.
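A minimal sketch of the server-side broadcasting step is shown below; the event name, port, and the concept-to-vibration mapping are illustrative assumptions rather than the exact implementation:

// Sketch: pushing tag clouds to connected clients with Socket.IO.
const { Server } = require('socket.io');

const io = new Server(3000, { cors: { origin: '*' } });

// Hypothetical mapping from detected concepts to Myo vibration patterns.
const hapticPatterns = {
  travel:  ['medium', 'pause500ms', 'short', 'pause500ms', 'long'],
  working: ['short', 'pause500ms', 'long']
};

function broadcastTagCloud(tags) {   // tags: [{ word: 'travel', weight: 0.8 }, ...]
  io.emit('tagcloud', {
    timestamp: Date.now(),
    tags,
    haptics: tags.map(t => hapticPatterns[t.word]).filter(Boolean)
  });
}

module.exports = { broadcastTagCloud };

On the client side, the Android Socket.IO library subscribes to the same event and dispatches the payload to the visual, audio, or haptic renderer.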

5 Conclusion and future work

We introduced in this paper the broadcast-time matrix as a conceptual space to guide researchers and practitioners interested in implementing life broadcasting to remote audiences. Beyond video broadcasting, however, our conceptual space enumerates many design options and covers a wide palette of possibilities to broadcast and consume personal visual realities, from mediated video to life abstractions and delivery of haptic feedback connected to specific events tagged in the visual life of the broadcaster. The broadcast-time matrix represents an opportunity to explore new designs of interactive systems located at the intersection of lifelogging, alternate reality, and cross-reality technology, as illustrated in our Venn diagram from Fig. 2. The implications for lifelogging concern new ways to store, stream, and query information collected about one’s visual life for the purposes of recollecting, reminiscing, retrieving information, reflecting, and remembering, i.e., the five R’s of memory access with lifelogging systems [116]. For alternate reality, where users connect to other people’s life events that take place in remote locations and even at different times, our broadcast-time matrix enables a systematization of the possibilities to deliver such experiences from the practical perspective of available modalities, from mediated video streams to news tickers conveying just the key aspects of the broadcaster’s visual life. For cross-reality, our broadcast-time matrix puts together requirements about ubiquitous networked infrastructure (e.g., synchronous communications) with characteristics of virtual experiences (e.g., mediated video and virtually reconstructed worlds) so that practitioners can better inform the technical designs of their prototypes.

Future work will look at extending the broadcast-time space with new categories, e.g., other modalities on the broadcast axis to deliver gustatory and olfactory experiences [88] to the remote audience, but also at adaptations to specific application domains, such as new television environments [137, 139]. More practical applications are also envisaged, incorporating new sensors to acquire various aspects of visual life, addressing new contexts of use, and targeting specific user categories. We hope that our contributions, at the intersection of lifelogging, alternate reality, and cross-reality, will draw the attention of the scientific community interested in multimedia research toward the many opportunities enabled by putting together concepts from these areas of investigation.