Laura Ring, Eric Grose, Chris Forsythe
Statistics and Human Factors, MS-0829
Sandia National Laboratories
Albuquerque, NM 87185
(ltring, emgrose, jcforsy)@sandia.gov
We quantitatively compared performance with the Pad++ Web browser and Netscape in several different scenarios. We examined how quickly users could answer questions about a specific Web site designed for this test. Initially we found that subjects answered questions slightly slower with Pad++ than with Netscape.
After analyzing the results of this study, we implemented several changes to the Pad++ Web browser, and repeated one Pad++ condition. After improvements were made to the Pad++ browser, subjects using Pad++ answered questions 23% faster than those using Netscape.
People with diverse talents, interests, and experiences will be on-line via the Web. Many will be novices with little prior experience using computers. A simple click of the mouse can bring a user from their friend's home page to unknown destinations across the world. This can be disorienting because traditionally, following a cross reference meant shuffling across the library to find another volume. While time-consuming, this reinforced the transition that was taking place. The difficulties that novice users confront can be instructional to developers. While experts may not have as much difficulty, they experience similar cognitive burdens, and may just have a higher threshold before they experience similar difficulties.
While the immediacy of traversing information links offers many advantages, it can also make it difficult to maintain an intuitive sense of where one is, and how one got there. This is a classic problem of hypertext systems. Part of the problem can be attributed to windows-based interfaces. Current window systems don't readily support showing more than a few pages at a time. In addition, each page is usually in a separate window with no depiction of relationships to other windows. Popular WWW browsers, like other applications built according to windowing philosophies, also have this problem, although they do offer some methods to aid navigation by keeping track of interesting sites - usually in hierarchical sets of hotlists or bookmarks.
There are a variety of human factors issues to be considered []. Some of these have been addressed by proposed alternatives and extensions to browsers. WebMap is a browser extension that shows a graphical relationship between web pages []. Each page is represented by a small circle that can be selected to display the actual page. The links between pages are colored to indicate information about links, such as whether it is a link to a different server or whether the destination page has already been read. These graphs may be saved and used by others. WebBook is a three-dimensional Web browser from Xerox Parc that allows multiple pages to be viewed simultaneously and supports collections of pages in books that replace bookmark files [].
WWW pages often have links to pages that link back to the starting point. It is difficult to visualize these cycles, and so many Web visualizations are based on hierarchies extracted from a graph of the Web. Some interesting work focuses on alternative visualizations []. Furnas [] shows how multitrees can be used to represent a collection of hierarchies sharing parts of the underlying data. One application of multitrees is visualization of bookmarks from multiple individuals [].
Another approach to visualizing large information spaces that can be applied to web browsing and navigation involves techniques to show detail at particular nodes while maintaining context. One general approach, fisheye views [], has been extended with graphics [], three dimensions [], hyperbolic representations [], animation [], and zooming [][][]. Other techniques include exploiting a large virtual space [], using lenses or filters [][], and visualizing two dimensional layouts [][].
In the sections that follow, we describe our zooming Web browser, discuss user studies comparing it to Netscape, and show improvements we made based on the initial study which resulted in a substantial speedup in the Pad++ Web browser.
We are building a new zoomable Web browser using Pad++, a substrate for building multiscale dynamic user interfaces [][]. Pad++ provides an extensive graphical workspace where dynamic objects can be placed at any position and at any scale. Pad++ supports panning and zooming. Zooming can involve simple geometric scaling, or what we term context sensitive rendering, in which rendering of objects can vary based on factors in addition to scale, such as context of the task or complexity of the information being displayed. Pad++ is built as a component for Tcl/Tk, a scripting language and user-interface library [].
Pad++ allows Web pages to remain visible at varying scales so the viewer may examine many pages at once. In addition, Pad++ allows the user to zoom in and out, enabling explicit control of how much context is viewed at any time. To orient themselves, users can simply zoom out to view a number of web pages. To get more detailed views of a particular page, they can zoom in. We think this variable scale contextual display of Web pages can provide important support for navigation. When using our Web browser, users navigate a space filled with familiar objects, not iconified representations of those objects.
The "forward" and "back" buttons from traditional Web browsers are slightly modified to allow access to all the pages previously visited, and not just the ones that are direct ancestors of the current page. Traditionally, the "back" button goes to the page from which the current page was accessed.
We replaced the back button with "previous" and "parent" buttons. The "parent" button does the same thing as the original "back" button, but the "previous" button goes to the previously focused page - whether it was accessed through a link , or panning and zooming.
There is also a button called "show all" which zooms out just enough so all of the pages are visible on the screen. Then, a single click on any page will focus on it, and bring it to a readable size.
While the left mouse button is used for following hyperlinks by pressing on them, it is also used for panning by dragging on the surface. Pressing on the middle button zooms in until the button is released. Pressing on the right button zooms out until it is released. Pad++ zooms around the mouse cursor, so the user can move the mouse while zooming to control which point the view is zoomed around.
Our Web browser combines a basic focus-driven layout with automatic zooming and panning to support navigation. The software allows the user to select a focus page. That selection animates the page to occupy a larger section of the display. Pages farther from the focus page get increasingly smaller, resulting in a graphical fisheye view. See Figures 1 and 2 for snapshots of the Pad++ Web browser with different pages focused upon where the view is zoomed out so many pages can be seen. Figure 3 shows a zoomed in view where a single page may be read. Ancestor and descendant pages are visible on the sides, and can be clicked on to bring them into focus.
Figure 1: A snapshot of the Pad++ Web browser
Figure 2: The same Web pages with a different focus
Figure 3: Zoomed into a single page
Clicking on a link brings up a new page, adds it to the tree of pages, the and causes the tree to restructure itself. Unlike other Web browsers that immediately replace the current page with a new page, the restructuring process is animated so that users can understand how the tree is being. The animation helps maintain object constancy and the graphical depiction of links highlights relationships between pages. The new page becomes the current focus and is moved to the center of the screen at a size suited for viewing. The user may designate any existing page to be the current focus by clicking on it.
Clicking on a link to a page that has already been loaded focuses on that page. Even if it was originally accessed from another part of the tree, its position within the tree is not changed.
As in earlier fisheye displays, our basic layout function assigns a degree-of-interest value to each node in the tree based on its distance from the focus page. We define the distance to be the shortest path between two pages. This value is then used to determine the size of each node. Because long pages have such a big effect on the layout, we added scrollbars to pages longer than a given size. This way, you can still see the structure of the pages, but long pages aren't as disruptive.
The usability testing was designed to test how well the Pad++ Web browser could perform as a viewer for this type of database. More specifically, we created a visual database of Web pages. We then compared how long it took to access information from these pages using the two browsers, Pad++ and Netscape.
We designed a hypothetical database containing the evolution of a manufacturing process over a series of six versions. The entire stimuli set consists of 31 Web pages (Figure 4). The stimuli contains an Overall page which contains information on the database and links to the Design and Manufacturing Process pages and links to the Budget and Schedule pages for each of the six versions. Six Budget and Schedule pages (Figure 5) give an overall estimate of time and the budget for each version. Six Design and Manufacturing Process pages contain the process flowcharts for each version (Figure 6). The process flowcharts are WWW image maps which link to notes about the process. The final 18 pages contain these notes.
Figure 4: Overview of entire stimuli
Figure 5: A Budget and Schedule page from stimuli
Figure 6: A Design and Manufacturing Process page
Task 1 required the subjects to navigate to specified pages in the data base and enter a page number found on the top of each page. For example, the subjects were asked to enter the page number of Version 5's Budget and Schedule page. This tested speed of navigation within the browser.
Task 2 and Task 4 tested the users ability to retrieve and comprehend information about the whole database. In Task 2 the subjects were asked to count the number of versions with certain features or where certain events occurred. For example, the subjects were asked to indicate the number of versions that had a new step added. Task 4 required the user to identify versions with certain features or where certain events occurred. For example, the subjects were asked which version had a problem with fabrication.
Task 3 tested the users ability to retrieve and integrate information found within different pages of the database. The subjects were asked to compare different versions. For example, the subjects were asked to indicate which version, out of versions one, two and three had the highest material cost. This required the subject to look at the budget and schedule page for the three versions and compare each of their material costs.
Four experimental conditions were examined: Pad++ Prebuilt, Pad++ Interactive, Netscape Regular and Netscape Graphic. In the Pad++ Prebuilt condition, the entire network of pages was loaded into Pad++ prior to beginning the task. In the Pad++ Interactive condition, only the Overall page was loaded and the network of pages was built as the user answered the questions. In the Netscape Graphic Condition a WWW image map of the entire database was created by doing a screen capture in Pad++. This condition was added to see if Pad++'s navigational tools caused improvement in performance independent of that caused by the visual representation of the database. The image map used contains a snapshot of the pages depicted in Figure 4.
If a subject reported using Netscape less than 2 hours a week, they were asked to complete a brief training task. The task consisted of browsing through a set series of pages from the University of New Mexico's Web Site. This took approximately 10 minutes. During this task the subjects were instructed on the use of the "forward" and "back" buttons, and how to click on hypertext and image maps in order to follow links.
All but one subject had never used Pad++, although a few had seen brief demonstrations of the software. Three training tasks were used to familiarize the subjects with panning and zooming in Pad++ and using the Pad++ browser. Training tasks 1 and 2 were used to familiarize the subjects with a three-button mouse. In training task 1 the subjects were asked to zoom in and out. In training task 2 the subjects were asked to zoom, pan and select objects. The subjects performed both tasks until they were able to perform 10 operations without error. The third training task asked the subject to open a series of pages within the University of New Mexico's Web Site and then navigate to different open pages. During the training, the subjects were informed of different strategies to use when navigating in the Pad environment.
The experiment was carried out at the University of New Mexico. There were 32 participants, but only 30 participants yielded usable data. The two participants whose data was not used had over 50% of the questions incorrect. All of the subjects had completed at least two years of college and all use a computer at least two hours a week. The majority of subjects, 73%, used a computer more than an hour a day. 93 percent reorganized of the subjects had browsed the World Wide Web before, and 80% of subjects browsed the World Wide Web at least an hour a week. 90 percent of the subjects were students and the other 10% were professionals.
Prior to beginning the experimental tasks, the subjects were trained if needed and then asked to read through an instruction page. The instruction page explained the task and database, and had links to example pages. The subject was instructed to click on all links and look at all the examples. The subjects then carried out trials for the four experimental tasks. In one window, the browser was open with the appropriate initial pages, and in another window, the questions were displayed. A program written in Tcl displayed the questions, and recorded the time spent on each question and the subjects' response. After each question, the entire screen went blank so they could not see the browser and database pages. The subjects were instructed to click on a bar to proceed. This action started the timer and displayed the question and the browser window. Subjects clicked on another bar to enter their response and stop the timer. After the experimental task, a questionnaire was displayed which asked questions about the browser, for information on computer usage and for some demographic information.
In order to determine the quality of the questions, we looked at the percentage of subjects who answered each question correctly. If over 20 percent of the subjects missed the question, it was considered too difficult and thrown out of further analysis. Three questions out of 24 were thrown out for this reason. Two of these questions were in Task 2 and one of these questions was in Task 3
The dependent variable was the average time it took the subjects to answer each question. Only the times for questions answered correctly were used in calculating the average time. An average time to correct response was calculated first for each of the four tasks. The average times collapsing across tasks were also calculated. The number of correct responses ranged from 16 out of 21 to 21 out of 21 with the majority of subjects having one or fewer incorrect responses.
A one-way analysis of variance was performed on each task's average time to correct response. We are aware that in using an alpha of 0.10 for each test, we may be inadvertently increasing our Type I error rate, but it is necessary to do so in order to have enough statistical power given the limited number of subjects.
Figure 7 shows a graphical representation of the average time to correct response for each task and collapsed across tasks. The overall F test was only significant for Tasks 1 and 2, using an alpha level of 0.10, F3,26=2.74, p = 0.064 and F3,25=2.66, p=0.070 respectively. These results were significant because subjects in the Pad++ Interactive Condition took longer to carry out the tasks than did the subjects in the Netscape Graphic condition.
Figure 7: Average Time to Correct Response by Task and Browser Condition
after the First Experiment.
These results were largely attributable to technical problems with Pad++ that made it run considerably slower than Netscape. Based on the qualitative data collected by observation and from the subjects responses to the questionnaire, we made various improvements to the Pad++ browser described in the next section.
The speed at which Pad++ was running caused some discontent, but many comments suggested that subjects felt the interface to be intuitive and fun to use. Some of the quotes that indicate the positive aspects of Pad++ follow:
"ZOOM is fun!" | ||||||||||
"The fact that one can get the 'whole picture' in varying degrees of closeness is fantastic." | ||||||||||
"I love the zoom in and out. I also liked Show All very much." | ||||||||||
"The layout of the pages made the task very easy once one figured out that the pages were laid out in the order listed on the parent page. With the color scheme used, the zooming browser allowed answering of some questions merely by panning and examining the pages from a low resolution view." | ||||||||||
"Easy to find what I need to know. I did not have to search very much." | ||||||||||
"I liked the tree structure, but disliked the occasional delays in updating the display. Once I knew where the pages were in the tree structure, I could quickly find what I wanted." |
Another interesting observation made by the experimenter during testing was that subjects varied in their use of panning and zooming available in Pad++. Due to their familiarity with browsing in Netscape, they seemed reluctant to try some of the other features available to them in Pad++. They stuck to what they knew, despite that in training, the experimenter encouraged the subjects to use all available navigational tools. Furthermore, it appeared that there was a positive relationship between subjects use of these navigation features and task performance. In future studies it would be interesting to quantify this difference between subjects and to see if it is positively correlated with task performance.
Following the first experiment, we realized that we needed to make some changes in order to evaluate Pad++ more fairly. First of all, we made the above mentioned improvements. Secondly, since almost all of the subjects in the Netscape conditions were regular Netscape users, we felt that to evaluate Pad++ in a comparable manner, we should use subjects that were familiar with Pad++.
We ran the Pad++ Prebuilt condition again with seven Pad++ users using the same procedures as discussed previously. All seven of these subjects worked as part of the Pad++ development team. All were familiar with navigation in the Pad++ environment, and did not need to be trained.
A one-way analysis of variance was performed to compare the new subjects performance to those subjects in the Netscape conditions. The average time to correct responses for each task and averaged across all the tasks are displayed in Figure 8.
Figure 8: Average Time to Correct Response for Improved Pad++
versus the Netscape Conditions by Task.
The overall F-test approached significance for Task 1 and for the average across all tasks, F2,19=2.33, p=0.124 and F2,19=2.43, p=0.115 respectively. Furthermore, tests of the a-priori hypothesis that Pad++ would do better on average than the Netscape conditions yielded a p-value of 0.0599, F1,19=4.00 for Task 1 and a p-value of 0.088, F1,19=3.23 for the average time to correct response across all tasks.
These results demonstrate that subjects can perform this browsing task 23% faster on average using the Pad++ browser than using Netscape. Many subjects also found the Pad++ browser intuitive and fun to use. In a manufacturing setting, use of the Pad++ browser could increase workers productivity by speeding up the time it takes them to access historical information about a product's process development over time. Furthermore, since using the Pad++ browser allows the user to access such information in a fun and intuitive manner, it may lead to an increased use of historical information with consequent improvements in product and process quality.
Pad++ and the zooming Web browser will be made generally available in
the near future. To find current information, send mail to
pad-info@cs.unm.edu, or look at
We appreciate the work of our collaborators at the New York University
Media Research Lab, especially that of Jon Meyer, in helping us
develop Pad++.
[2] Benjamin B. Bederson, James D. Hollan, Ken Perlin, Jon Meyer, David
Bacon, and George Furnas. "Pad++: A Zoomable Graphical Sketchpad for
Exploring Alternate Interface Physics", Journal of Visual Languages
and Computing (7), 1996, 3-31.
[3] Stuart K. Card, George G. Robertson, and William York, "The WebBook
and the Web Forager: An Information Workspace for the World-Wide Web",
Proceedings of CHI'96, ACM Press, 111-117.
[4] Bay-Wei Chang and David Ungar. "Animation: From Cartoons to the
User Interface", Proceedings of UIST'93, ACM Press, 45-55.
[5] Peter Doemel, "WebMap - A Graphical Hypertext Navigation Tool", 2nd
International Conference on the World-Wide Web, Chicago, IL, 1994,
785-789.
[6] William C. Donelson. "Spatial Management of Information",
Proceedings of SIGGRAPH'78, ACM Press, 203-209.
[7] George W. Furnas. "Generalized Fisheye Views", Proceedings of
CHI'86, ACM Press, 16-23.
[8] George W. Furnas and Jeff Zacks. "Multitrees: Enriching and Reusing
Hierarchical Structure", Proceedings of SIGCHI'94, ACM Press, 330-336.
[9] Wendy A. Kellogg and John T. Richards. "The Human Factors of
Information on the Internet", in Advances in Human Computer
Interaction, Ed. J. Nielsen, Ablex Press, (5), 1-36.
[10] John Lamping, Ramana Rao, and Peter Pirolli. "A Focus+Context
Technique Based on Hyperbolic Geometry for Visualizing Large
Hierarchies", Proceedings of CHI'95, ACM press, 401-408.
[11] Jock D. Mackinlay, George G. Robertson, and Stu K. Card. "The
Perspective Wall: Detail and Context Smoothly Integrated", Proceedings
of CHI'91, ACM press, 173-179.
[12] Sougata Mukherjea, James D. Foley, and Scott Hudson. "Visualizing
Complex Hypermedia Networks through Multiple Hierarchical Views",
Proceedings of CHI'95, ACM press, 331-337.
[13] John K. Ousterhout. Tcl and the Tk Toolkit, Addison-Wesley, 1994.
[14] Ken Perlin and David Fox. "Pad: An Alternative Approach to the
Computer Interface", Proceedings of SIGGRAPH'93, ACM Press, 57-64.
[15] Monojit Sarkar and Marc H. Brown. "Graphical Fisheye Views",
Communications of the ACM, 37 (12), December, 1994.
[16] Maureen C. Stone, Ken Fishkin, and Eric A. Bier. "The Movable
Filter as a User Interface Tool", Proceedings of SIGCHI'94, ACM Press.
[17] David Ungar and Randy B. Smith. "Self: The Power of Simplicity",
Proceedings of OOPSLA'87, 227-241.
[18] Kent Wittenburg, Duco Das, Will Hill, and Larry Stead. "Group
Asynchronous Browsing on the World Wide Web", Proceedings of the 4th
International World Wide Web Conference, Boston, MA, International WWW
Conference.
ACKNOWLEDGEMENTS
We acknowledge generous support from DARPA's Human-Computer
Interaction Initiative (Contract #N66001-94-C-6039). This work also
supported by the United States Department of Energy under Contract
DE-AC04-96AL85000.
REFERENCES
[1] Benjamin B. Bederson and James D. Hollan. "Pad++: A Zooming
Graphical Interface for Exploring Alternate Interface Physics",
Proceedings of UIST'94 ACM Press, 17-26.