Introduction

The modern Internet contains incredible quantities of digital video available for viewing.  With the advent of inexpensive consumer-level digital video cameras many people have chosen to record their moments in the form of digital video.  The proliferation of surveillance cameras has only added to this growing mountain.  Video can be viewed as a very large three-dimensional data set[1] across the axes of two-dimensional space and time.

It is often inpractical to view this data set in its original form: the moving image.  A security agent may wish to quickly comprehend the motion in several long video sequences.  In many cases, the content of a video or animation needs to be shown in a single image for a printed medium such as a newspaper or article.  A particularily common requirement in the field of computer graphics is the need to show human figure animation in a figure to form part of an academic publication.  Finally, video consumes significant bandwidth; in many cases it is more economical to visualize the video in a single image.

The field of video summarization is a large and heavily researched one.  There has also been significant research into the visualization of very large data sets.  However, few in the the research communities have considered the problem of visualization of video.  If video is a very large data set, perhaps novel slices and views of this data could yield insights?  Indeed, planar slices of the video cube have been explored [1]; the results, however, are not effective as a visualization technique.

Instead, we take the approach of selecting data from video in a non-planar fashion.  We consider the restricted case of video displaying human motion, with the goal of creating a single image that conveys the motion to the viewer.  This is an incredibly challenging problem; information filling three dimensions must be compressed into a two-dimensional form.  It is easy to quickly overwhelm the limited real estate of a single image with the information of human motion.

Indeed, the scope of the problem quickly overwhelmed our original plan to automate the creation of one or two types of visualizations.  The range of possible visualization components and techniques is immense, and we would have little justification in choosing one.  Instead, we chose to explore the space of possible motion visualization techniques, creating exemplars by hand using both available software and mini-tools created for specific tasks.  In this way, we were able to evaluate the effectiveness of different visualization techniques, narrowing down our toolbox through informal user feedback.  Throughout the process of creating these images, we held interviews with fellow grad students.  We presented them with the video clip, followed by several visualizations, and asked them to comment on which they felt best captured the essence of the motion and which they found aesthetically pleasing.  At the same time, we encouraged them to critize the techniques and to suggest alternative ways to encode the motion in 2D.  Through this process, we were about to single out certain visualization techniques as being particularly promising, and hope that this exploration can form a springboard for future research automating the most effective of the methods studied.

Related work

While few in the scientific communities have considered the problem of visualizing human motion, artists have long been interested the problem.  We relate a few of the relevant efforts which influenced our approach.

Strobing

The most relevant historical reference to our research is the work of French photographer Etienne-Jules Marey (1830-1904).  Marey devoted his entire career to developing photographic techniques to capture movement across time.  An excellent summarization of his work can be found in the book "Picturing Time." [2].  He developed a variety of special-purpose devices to accomplish multiple-exposure photographs that captured a subject in multiple positions across time in a single photograph.  An example of his work is shown in Figure 1.


Marey photograph
Second marey photograph

Figure 1: Two multiple exposure photographs by Etienne-Jules Marey.


Marey's work inspired two other relevant photographers.  One is Harold Edgerton, who used electronic strobe flashes to achieve similar effects.  Indeed, most of our visualization techniques are inspired by this multiple-exposure and strobing approach.  However, both Marey and Edgerton were limited by analog technology; we are able to expand the range of possibilities using digital approaches.  The photographer Eadweard Muybridge was also influenced by Marey.  He was interested in using photography to study human motion, and captured several studies similar to the one showed in Figure 2.  Muybridge and Marey influenced a variety of artists, perhaps most notably in Duchamp's famous motion study "Nude Descending a Staircase." (Figure 3)  Muybridge added to Marey's approach by using multiple simultaneous perspectives.  His style of separating individual moments into frames is highly reminiscent of comic books, which brings us to our other major inspiration.


Muybrige

Figure 2: Muybridge motion study entitled "Descending stairs and turning around"



Nude descending a staircase

Figure 3: Marcel Duchamp's "Nude Descending a Staircase"

Comic books

Comic book artists have long been faced with the challenge of conveying human motion in static form.  An excellent discussion of this issue is found in Scott McCloud's "Understanding Comics." [4]  He notes that a single frame in a comic book rarely depicts a single moment in time, but rather slices across the space-time volume.  An example of such a space-time frame from this book is shown in Figure 4.


COmic 1
Figure 4: A comic strip frame that covers a range of time.


Comic book artists have developed a number of techniques to convey the notion of human motion in a single frame.  The most common are "motion lines"; diagrammatic elements that indicate motion.  Examples are shown in Figure 5.  We will attempt to make such of such motion indicators in our visualizations.


Comic 2

Figure 5: Several examples of motion lines.


We will focus on the contrasting techniques of comic strips and multiple-exposure strobing.  Strobing is highly effective in showing motion in a compressed form and can situate the relevant action poses in a unified context. Unfortunately, in slow moving sequences, overlap between images can be confusing and overload information into too small a space.  Comic strips do not suffer from this overloading, but require more space and, more importantly, can lose spatial context between the images.

NEXT: Strobing