Beruflich Dokumente
Kultur Dokumente
Saverio Debernardis
Antonio E. Uva
Giuseppe Monno
Dipartimento di Meccanica,
Matematica e Management
(DMMM)
Politecnico di Bari
70126 Bari, Italy
Abstract
The application of augmented reality in industrial environments requires an effective
visualization of text on a see-through head-mounted display (HMD). The main contribution of this work is an empirical study of text styles as viewed through a monocular
optical see-through display on three real workshop backgrounds, examining four colors
and four different text styles. We ran 2,520 test trials with 14 participants using a
mixed design and evaluated completion time and error rates. We found that both presentation mode and background influence the readability of text, but there is no interaction effect between these two variables. Another interesting aspect is that the presentation mode differentially influences completion time and error rate. The present
study allows us to draw some guidelines for an effective use of AR text visualization in
industrial environments. We suggest maximum contrast when reading time is important, and the use of colors to reduce errors. We also recommend a colored billboard
with transparent text where colors have a specific meaning.
Introduction
doi:10.1162/PRES_a_00146
2013 by the Massachusetts Institute of Technology
*Correspondence to fiorentino@poliba.it.
posed graphical elements (see Figure 1). The main problem of the current technology is that only bright objects
can overlap on the background. In practice, dark colors
appear as semi-transparent and they mix in with the
background. This makes the use of see-through HMD
very challenging, especially in outdoor environments,
where the brightness of the background overcomes the
brightness of the display.
Industrial environments usually are indoor and characterized by controlled lighting, as given by the standard
ISO 8995-1 (ISO, 2002), but visibility problems commonly arise in the readability of the technical text labels
and these limit the effectiveness of this technology.
Literature on the readability of simple text in optical
see-through HMDs is scattered among different disciplines (computer graphics, humancomputer interaction, etc.) and usually it addresses general problems
without satisfying the specific requirements and constraints of industrial workspaces (e.g., standard color
coding, industrial practice, and workshop backgrounds).
It is common practice in industry to follow standard or
personalized color coding rules. For example, the 5S,
one of the most popular workplace organization methods, suggests the use of colors in workspace to enforce
sorting, straightening, systematic cleaning, standardizing, and sustaining (Hirano, 1995). A very common
practice of industrial data visualization which can be supported by AR technology is the use of shop floor paper
tags (see Figure 2). The tags carry important production
information in a cheap, simple, and effective way by text
change, as opposed to printed signs; and as color perception varies, so would readability. Industrial applications
would benefit from these guidelines.
Previous work reports general optimization of visualization, without providing color-based readability
guidelines. The main goal of the presented work is to
study the readability of textual information in indoor
industrial environments with an optical see-through
HMD. Different colors and text styles were combined
to investigate textual visualization on industrial backgrounds. For this purpose, we developed an opensource test workspace to support readability test experiments and made it available to the academic community
(Fiorentino, 2012).
The paper is organized as follows. In Section 2, we
present previous literature, followed by the description
of our approach in Section 3. In Sections 4 and 5, we
present the design of experiments, the results, and a
related discussion. Finally, we present a conclusion and
future work in Section 6.
2
Related Work
egy expressly did not preserve the registration of the digital information on the real objects.
Jankowski, Samp, Irzynska, Jozwowicz, and Decker
(2010) explored the effects of varying four text drawing
styles (plain, billboard, anti-interference, and shadow),
image polarity (positive when dark characters are on a
light-colored panel and conversely for negative), and
two backgrounds: the first one with videos recorded in
urban and outdoor environments and the second one
recorded in 3D video games. They found out that there
was little difference in reading performance for video and
3D backgrounds. Furthermore, they concluded that
negative presentation is faster and more accurate than
positive presentation. Therefore, billboard styles resulted
in the easiest to read and the most immune to background distractions.
From the presented works, we can conclude that the
knowledge on the readability of text on HMD is scattered among different disciplines, application fields, and
hardware setups, and, at the moment, it is not adequate
to provide standard and reliable guidelines for the application developers. In particular, we found no previous
work addressing the specific industrial environments.
This study draws inspiration from Gabbards experiments that address mainly outdoor environments and
textures. Our idea is to apply a similar approach to industrial context. Therefore, the motivation of this work is to
study and find effective text styles for monocular optical
see-through presentation, specifically for the industrial
context.
3
Our Approach
he or she can simulate the result by loading a background image on a desktop screen.
The user can change and test the following text style
parameters: font size, text color and transparency, text
billboard color and transparency, outline width, and outline color and transparency. In our preliminary phase, we
simulated different configurations using a library of 100
pictures downloaded from the internet using Google
Images with specific keywords (i.e., workshop, shop
floor, manufacturing, etc.; see Figure 4). In this way, we
were able to evaluate the experimental settings and to
plan the user tests.
The HMD testbed automated the execution of tests in
player mode: the test configurations are retrieved from
the test template file, shuffled randomly, and then displayed on the HMD. The application acquires and archives the following data in a simple log file: subject
username, time and date of the test, displayed text
strings, text style, users answer, and response time.
During the test, the performance, the progress bar,
the current score, and the top score are displayed on the
service desktop screen to monitor the test (see Figure
5). The score is added to motivate the user to maximize
performances during the test; since it is not visible to
the participants, it cannot influence the test results. The
software is publicly available on our website
(Fiorentino, 2012). We are interested in comparing the
results from other researchers using different display
configurations.
4
Design of Experiment
Figure 3. HMD testbed in editor mode: user can design the text style and preview it on the screen or HMD.
pants should input the result on a provided numeric keypad. The possible answers are 1, 2, 3, or 0 in the case of
unreadable letter not found. The alphabet is restricted to
the following letters: C, K, M, O, P, S, U, V, W, X, Z.
These letters have graphical similarity in uppercase and
lowercase, therefore this restriction makes the difficulty
associated with the target identification uniform. Our
software generates and visualizes the text blocks on the
HMD and records response time and user errors. A
crosshair viewfinder is displayed, and the user must point
to a specific target in the real scene (see Figure 1). This
solution avoids the chance that users may turn the HMD
to a more favorable position (i.e., to choose a specific
background point).
4.1 Measures
We focused on the following experiment independent variables (see Table 1) and the dependent variables (see Table 2) that we collected for the subsequent
statistical analysis.
Apart from measuring efficiency (completion time)
and effectiveness (error rate), a 5-point Likert scale is
used to measure user preferences with a post-experiment
questionnaire.
4.2 Backgrounds
Most of the industrial backgrounds we have
encountered, especially those related to production
with a lux meter and we registered an average illumination over the work area of about 300 lux.
4.3 Colors
Our setup required users to be concentrated for
the whole task. Studies on the length of human sustained
attention reported a maximum of around 20 min for
adults (Cornish & Dukette, 2009). To keep the experiment time within 20 min, we limited the color range to
only four options. In particular, following the specifications defined by the ISO 3864 standard (colors for safety
signs; ISO, 2011a), we decided to use the red and the
green as safety colors (i.e., colors with special properties
to which a safety meaning is attributed), and white and
black as contrast colors. Apart from their general messages (safety and prohibition), green and red are worth a
deeper investigation in AR setup because the literature
Figure 5. Player mode: during the test, the service screen (not seen by the participant) shows the list of configurations
(center), progress bar (lower left), and the user performance and top score.
14
3
4
4
5
2,520
11 males, 3 females
Testbed frame, tool workbench, engine
Text only, with outline, with billboard, with outline and billboard
Black, green, red, white (when applicable)
Five for each presentation mode and background
14 3 12 5
in ms
Correct task completion 1,
wrong task completion 0
Figure 6. The three real backgrounds used in the tests: testbed frame (left), tool workbench (center), motorbike engine (right).
Figure 7. Normalized luminance histograms (number of pixels for each luminance value) of the pictures of the background used in the test:
negative (testbed frame), positive (tool workbench), and neutral (engine).
modes, the color black can only be used as text or as outline on a differently colored billboard, because a black
billboard is equivalent to no billboard. Our experimental
indoor environment has a low illuminance (around 300
lux); therefore, the transparent stroke of black text or of
the outline is perceived as dark enough to be called
black. With this meaning we considered black as a color
in our experiment. Our purpose is to study its performance as contrast color to background colors (i.e., white,
red, and green), as indicated by ISO standards (ISO
2011a, 2011b, 2004).
1
2
3
4
5
6
7
8
9
10
11
12
Text
color
Outline
color
Billboard
color
Black
Black
Black
Green
Green
Green
Red
Red
Red
White
White
White
Black
Black
Green
Red
Green
Red
White
White
White
White
White
4.5 Participants
Fourteen unpaid participants were recruited for the
study among undergraduates in technical subjects. They
were 11 males and three females with the following age
distribution: seven from 21 to 25 and seven from 26 to
30, with an average age of 25. Six participants wore
glasses but none had color deficiency. All were right-eye
dominant. The users wore the HMD in front of the right
eye, and they received adequate instruction and performed a trial session. The participants could discontinue
the test at any time and the break time was not limited.
The subjects performed a total of 2,520 trials (14 participants 12 permutation modes 3 backgrounds 5
Figure 10. Box plot of the completion times for the three backgrounds
(the X marks the mean of samples).
Results
Number of errors
100
Number of targets
Testbed frame
ShapiroWilk test
W 0.996
p .031
Levene test
Mean rank
Friedman test
2.95
Figure 11. Box plot of error rate for the three backgrounds (the X
marks the mean of samples).
Tool workbench
W 0.997
p .097
F(2,2442) 9.895; p < .001
2.01
w2(2) 1518.9; p < .001
Engine
W 0.995
p .015
1.03
cation of the Welch ANOVA test for all of the 12 combinations, and the GamesHowell test for the 66 pair comparisons. Differences among all samples were statically
shown, thus hypothesis H2 was supported (see Figure
12).
The fastest presentation modes resulted from the black
text, no outline, and white billboard (mode 3). The best
performance of mode 3 was statistically confirmed
against modes 9, 7, 4, 11, and 8 (d 0.092, p .001).
It is important to note that mode 5 (green, no, white)
and mode 6 (green, black, white) showed no statistical
difference, F(1) 0.012, p 0.912, while mode 7 (red,
no, white) and mode 9 (red, black, white) are not
strongly different, F(1) 3.962; p .047. This confirms
that the black color is effectively not a color since it
shows no main effect if used as an outline.
An error-rate comparison (see Figure 13) among all
the 12 combinations revealed a statistically significant
difference, w2 (11) 37.811 > 19.675, allowing us to
accept hypothesis H2; but in this case, the performance
distribution is different from the completion-time analysis. The best performing modes were modes 2, 5, and
12. An interesting result is that presentation modes displaying red text (modes 7, 8, and 9) had bad scoring,
both for completion times and error rates.
5.2.1 Interaction BackgroundPresentation
Mode. We tested the interaction between the background and the presentation-mode effects with a twoways unbalanced ANOVA, which showed that there is
no interaction effect, F(22,2427) 0.778; p 0.756.
Every presentation mode displays results respecting the
background ranking (with the partial exception of modes
2 and 8) as shown in the radar plot in Figure 14.
Table 5. Completion-Time Analysis of the Presentation Modes, Sorted by the Mean Response Time
Presentation
mode
Mean response
time (ms)
ShapiroWilk test
3 (black,-,white)
12 (white,red,-)
1 (black,-,green)
6 (green,-,white)
10 (white,-,-)
5 (green,-,white)
2 (black,-,red)
8 (red,-,-)
11 (white,green,-)
4 (green,-,-)
7 (red,-,white)
9 (red,black,white)
5,441
5,909
6,108
6,109
6,112
6,139
6,239
6,728
6,864
6,966
7,276
7,914
W(205) 0.996
W(209) 0.989
W(207) 0.992
W(203) 0.992
W(197) 0.994
W(208) 0.995
W(206) 0.990
W(206) 0.990
W(204) 0.990
W(209) 0.995
W(205) 0.997
W(204) 0.996
p .841
p .119
p .306
p .385
p .599
p .764
p .189
p .189
p .184
p .763
p .927
p .813
Levene
test
Welch
ANOVA test
F(11,2451) 4.359
p < .001
F(11) 12.653
p < .001
Figure 12. Box plot of completion time for each presentation mode (the X marks the mean of samples).
Figure 13. Box plot of error rate for each presentation mode (the X marks the mean of samples).
Figure 14. Radar plot of response times (ms) for the three backgrounds in the 12 presentation modes.
Text style
Presentation modes
ShapiroWilk test
Levene test
Mean (ms)
Welch-ANOVA test
GamesHowell test
Text only
(T)
Text and
outline (TO)
4, 8, 10
W 0.996
p .101
Text and
billboard (TB)
11, 12
1, 2, 3, 5, 7
W 0.994
W 0.997
p .092
p .072
F(3,2249) 5.944; p < .001
6,622
6,368
6,237
F(3) 5.977; p .001
TB better than TOB d 0.042, p .001
TB better than T d 0.026, p .022
TO better than TOB d 0.034, p .035
Figure 15. Box plot about scattered completion-time data referring to each text color group (the X marks
the mean of samples).
Green
ShapiroWilk test
W(617) 0.998
p .653
Levene test
Means (ms)
Welch-ANOVA test
6,412
Red
W(611) 0.997
p .248
F (2,1831) 3.688 p .025
7,228
F(2) 21.344 p < .001
White
W(606) 0.996
p .103
6,281
Figure 16. Box plot of error rate for each grouped text color presentation mode (the X marks the mean of samples).
5.4.3 Billboard Color. Next, we wanted to analyze what was the best billboard among the three combinations available under the black text group. Therefore,
we kept the black text and focused our attention on conveying the color information using the billboards (green,
W(207) 0.992
p .306
6,109
Red
W(206) 0.990
p .189
F(2, 615) 0.126 p .882
6,237
F(2, 615) 7.061 p .001
White
W(205) 0.996
p .841
5,445
5.6 Discussion
Figure 17. Box plot of scattered completion-time data for black text
color and the three different billboard colors (the X marks the mean of
samples).
Figure 18. Cumulative marks given by participants at the end of their test trials (range 15).
Conclusion
Acknowledgments
The authors would like to thank Michele Mazzoccoli and
Michele Gattullo for the useful help provided in the test design
and execution, and all the students who took part in the test.
References
ASME. (2007). Scheme for the identification of piping systems. (ASME A13.1) Retrieved September 4, 2012 from
https://www.asme.org/products/codes-standards/
scheme-for-the-identification-of-piping-systems
Cornish, D., & Dukette, D. (2009). The essential 20: Twenty
components of an excellent health care team (pp. 7273).
Pittsburgh, PA: RoseDog Books.
Fiorentino, M. (2012). HMD test [software]. Retrieved from
Polytechnic of Bari, Department of Mechanics, Mathematics
and Management, Vr3Lab website: http://www.dimeg
.poliba.it/vr3lab/
Fukuzimi, S., Yamazaki, T., Kamijo, K., & Hayashi, Y. (1998).
Physiological and psychological evaluation for visual display
colour readability: A visual evoked potential study and a subjective evaluation study. Ergonomics, 41(1), 89108.
doi:10.1080/001401398187341
Gabbard, J. L., Swan, J. E., II, & Hix, D. (2006). The effects
of text drawing styles, background textures, and natural
lighting on text legibility in outdoor augmented reality.
Presence: Teleoperators and Virtual Environments, 15(1),
1632. doi:10.1162/pres.2006.15.1.16
Gabbard, J. L., Swan, J. E., II, Hix, D., Kim, S.-J., & Fitch, G.
(2007). Active text drawing styles for outdoor augmented
reality: A user-based study and design implications. Proceedings of the IEEE Virtual Reality Conference, VR 07, 3542.
doi:10.1109/VR.2007.352461
Gabbard, J. L., Zedlitz, J., Swan, J. E., II, & Winchester, W.
W. III (2010). More than meets the eye: An engineering
study to empirically examine the blending of real and virtual
color spaces. Technical Papers, Proceedings of IEEE Virtual
Reality, 10, 7986. doi:10.1109/VR.2010.5444808
Harrison, B. L., & Vicente, K. J. (1996). An experimental evaluation of transparent menu usage. Proceedings of the SIGCHI
Conference on Human Factors in Computing Systems: Com-
Copyright of Presence: Teleoperators & Virtual Environments is the property of MIT Press
and its content may not be copied or emailed to multiple sites or posted to a listserv without
the copyright holder's express written permission. However, users may print, download, or
email articles for individual use.