Beruflich Dokumente
Kultur Dokumente
CONTENTS
57-1
2. Screening............................................................................................................................................57-13
3. Training ..............................................................................................................................................57-14
4. Testing................................................................................................................................................57-14
5. Data Analysis ....................................................................................................................................57-14
D. Sensory Spectrum Analysis ......................................................................................................................57-14
E. Free-Choice Profiling................................................................................................................................57-14
F. Other Methods ..........................................................................................................................................57-15
VII. Time-Intensity Scaling......................................................................................................................................57-15
A. Measurement ............................................................................................................................................57-15
B. Analysis ....................................................................................................................................................57-16
VIII. Data Analysis ....................................................................................................................................................57-16
A. Descriptive Statistics ................................................................................................................................57-16
1. Central Tendency ..............................................................................................................................57-16
2. Dispersion ..........................................................................................................................................57-17
3. Frequency Distributions ....................................................................................................................57-17
B. Inferential Statistics ..................................................................................................................................57-17
1. Non-Parametric ..................................................................................................................................57-17
2. Parametric ..........................................................................................................................................57-18
C. Correlation and Regression ......................................................................................................................57-18
1. Correlation ........................................................................................................................................57-18
2. Multiple Correlation (R) ....................................................................................................................57-19
3. Regression..........................................................................................................................................57-19
D. Additional Multivariate Methods..............................................................................................................57-19
1. Multivariate Analysis-of-Variance (MANOVA) ................................................................................57-19
2. Discriminate Analysis ........................................................................................................................57-19
3. Principle Components Analysis ........................................................................................................57-19
4. Factor Analysis ..................................................................................................................................57-19
5. Cluster Analysis ................................................................................................................................57-19
IX. Visual Presentation ..........................................................................................................................................57-19
A. Histograms ................................................................................................................................................57-20
B. QDA Spider Charts ..................................................................................................................................57-21
C. Sensory Maps............................................................................................................................................57-21
X. Conclusion ........................................................................................................................................................57-23
References ....................................................................................................................................................................57-24
I. INTRODUCTION AND OVERVIEW the 1960s. The original group of psychologists chose to
not participate in the move to Natick. They disseminated
Measuring the sensory properties of foods and relating these their knowledge and interest in sensory evaluation further
properties to consumer acceptance is a major objective and into the food industry by joining product companies (e.g.,
accomplishment for sensory evaluation. These achieve- Coca Cola, Pillsbury, and General Mills), contract research
ments are possible as a direct result of advances in the sci- organizations (e.g., Battelle Memorial Institute) or founding
ence, development of valid and reliable methods, education research companies (e.g., Peryam and Kroll Corporation).
of research and application professionals, development of Government and industry interest in sensory evalua-
cost efficient and well run sensory programs, and institu- tion encouraged additional developments. For example, in
tions and commercial companies that support these pro- the 1950s the Arthur D. Little Company introduced the
grams. The latter is directly related to the expected and Flavor Profile Method (discussed below), a qualitative form
realized commercial benefit of sensory tests. of descriptive analysis that did not rely on an individual
This chapter provides the reader with an overview of expert, and formalized subject screening and training pro-
sensory evaluation, its uses and limitations. We highlight cedures. Private contract research companies continue to
important developments, current thinking and the current play an important role through sponsoring workshops,
status of the science. The interested reader will find more developing proprietary methods, assisting product compa-
detail in several books published on the topic, most nota-
Downloaded by [University of Newcastle] at 11:02 31 October 2015
physical properties. To this latter point, the definition where mechanical devices for measuring texture have been
focuses on the importance of perception for understanding described in the literature for more than a century. Bourne
the impact of a product’s physical properties; implicit in this (6) provided an excellent review of the subject describing
is the notion that in sensory evaluation “perception is real- mechanical devices to measure product changes, particu-
ity.” This has enormous implications on determining how larly the fruit ripening process, or to simulate the chewing
subjects are selected, and, when necessary, trained, what process to measure meat tenderness. The interested reader
sensory evaluation should measure, and the measurement is also referred to the Journal of Texture Studies, especially
method. the publications of A.S. Szczesniak and her collaborators.
The evolution of sensory evaluation as an independent While such mechanical devices have undergone substan-
discipline in the 1940s and 1950s produced unique meth- tive change in terms of sensitivity, measurement, design,
ods for determining difference. These and other sensory etc., the basic principles have remained unchanged, to wit,
method developments are a reminder that although they creating a mechanical device to measure what humans
have a common lineage, psychophysics and sensory eval- perceive.
uation have very different objectives. Sensory evaluation The main difficulty in simulating sensory behavior by
focuses on measuring the responses of people to products, an instrument is due to the complex simultaneous activity
and application of findings to action-oriented decisions ongoing in the mouth during eating and drinking. The upper
Downloaded by [University of Newcastle] at 11:02 31 October 2015
about products. Psychophysics focuses on the measure- jaw remains stationary while the lower jaw is capable of sev-
ment process and explaining the behavioral mechanisms eral different motions to hold, bite, tear, and masticate. At
behind sensory responses. Both objectives are important, the same time, the highly muscled tongue gathers, moves,
and each has its own criteria for success. and compresses stimuli, the salivary glands contribute mois-
ture and lubricant to the process, and swallowing provides
control of volume in the mouth while momentarily limit-
C. CLASSIFICATION OF SENSORY METHODS
ing intake to the mouth. To our knowledge, this multi-
There are several sensory methods, differentiated primarily dimensional activity cannot be reproduced with current
by objective, measurement procedure, subject qualification, instrumentation.
and various combinations of these factors; and new methods The history of odor detection has always included both
continue to be developed. As reported in Stone and Sidel (2), human and instrument systems. The earliest instruments
sensory tests are classified primarily by type of test in a three- began with measuring total volatiles. This was followed by
category classification system, which are: discriminative, development of chromatographic techniques and gas-liquid
descriptive, and affective tests. (We view discriminative and chromatography (GLC) which enabled separation of chem-
descriptive as analytical tests and thus could consider two icals as well as ability to separate specific chemicals at
categories; analytical and affective tests.) There are several very low concentrations; e.g., ppm (parts per million), ppb
different tests within each class, and they are described in (parts per billion) and less, to facilitate their identification.
greater detail later. Discrimination tests measure whether Subsequent developments enabled integration of the chro-
there is a perceived difference between products, descriptive matographic separation with even more sophisticated
tests measure perceived attributes and their intensities for instrumentation such as mass spectroscopy to more pre-
a product, and affective tests measure product acceptance or cisely clarify chemical structure, and so forth. Amerine et
preference. Affective tests are discussed in the current al. (3) describe limitations and early efforts to combine
handbook under the topic of Consumer Testing. GLC with sensory panel results. Volatiles from foods are
trapped and collected for sensory evaluation. Later refine-
ments included development of equipment containing mul-
D. SENSORY VS. INSTRUMENTAL TESTING
tiple odor ports that delivered the separated odorant at
The development of instruments to measure food and bev- prescribed dilutions simultaneously to sensory panel mem-
erage characteristics such as appearance, aroma, taste, and bers. Gas chromatography-mass spectrometry are used
texture has a very long history, as has research into corre- to analyze perceived odors, and Noble (7) described the
lating instrumental and sensory measures. Interest in this use of instrumental analysis for the sensory properties of
relationship in the food and beverage industry is based in foods.
part because it represents a potential means of reducing During this same time period, Schutz, Veley, and Iden
reliance on the human judgment, which some view as vari- (8) and Wilkens and Hartman (9) described their efforts to
able and subjective. Value, efficiency, and economy are devise other systems for the detection of odorants. Schutz
possible where instrumental measures can be effectively and co-workers investigated use of specific enzyme sys-
substituted for sensory measures. This is especially true in tems associated with olfactory tissue that were reactive
product quality assessment. to odorants and correlated results with the physical and
An extensive literature is available describing sen- chemical properties of those odorants. At the time, the con-
sory/instrumental relationships for texture measurement, straints of separation technology, enzyme stability, and
sensory characteristics and their relationship to the qual- tions or where they wait prior to proceeding to an assigned
ity of apple juices measured by consumers.” The degree to sensory booth. A blackboard or bulletin board is located
which these odor and taste detection instruments, or those on a wall near the receptionist desk and identifies the prod-
yet to be developed, are incorporated into routine product uct category and number of products that are to be tested
development and quality control activities will depend on in the session. A table is also provided for those situations
how well they correlate to important sensory measures, where subjects are offered a reward, or “treat” (juice, cof-
and are economical and safe to operate in those respective fee, cookies, candy, etc.), immediately following participa-
environments. tion in a test. The room should allow easy access to, yet
Correlating sensory flavor attributes and chemical visual separation from, the sensory booth area to minimize
measures has become almost routine in research and qual- distraction from early arriving subjects to subjects still
ity control, as has correlating sensory and Hunter color seated in the sensory booths. The dimensions for the recep-
measures. Instrumental and sensory correlations also have tion area depend on the number of subjects it has to accom-
been described for visual clarity and turbidity (14). modate prior to their proceeding to the sensory booths.
Before leaving this topic, we are reminded that sensory
information is uniquely different than information obtained
2. Test Kitchen Area
from a physical instrument (a point often overlooked when
creating a mechanical system to substitute for a physiolog- The test kitchen area and materials used in this area typi-
ical system). It represents the relatively simultaneous inte- cally represent a well-equipped commercial, or heavy duty
gration of information from all the senses, in conjunction home, kitchen. The area must accommodate preparation
with cognitive factors that include learning and expectation. and cleanup equipment, equipment and supply storage,
In this sense, sensory information is multidimensional product preparation, holding and serving activities, and
whereas instrumental information is almost entirely one workstations for kitchen staff. One or two large central
dimensional. Although we can instruct subjects to not pay islands for product preparation, showings, and holding for
attention to certain attributes, it is false to assume that those serving are recommended, and these must allow for unen-
attributes do not influence the subject or his response. cumbered foot traffic flow around them during product
Instruments can be useful in sensory evaluation when there preparation and serving. Airflow from this area is an impor-
is a good understanding of the relationship between those tant consideration, and food preparation aromas from here
measures, sensory perception, and consumer behavior. At must be exhausted away from the sensory booths. Serving
present such relationships are the exception, and consid- counters need to be high enough to minimize posture
erably more needs to be accomplished before instruments stress during serving, and wide enough to accommodate
represent a practical alternative to sensory measures. products, supplies, and scorecards for subjects seated in sen-
sory booths with pass through into in the serving area. All
under- and above-counter space should be utilized for stor-
II. SENSORY EVALUATION FACILITIES
age, and an adjacent storage room is recommended where
The typical sensory evaluation facility in a R&D center possible. A door from the kitchen to the reception area will
consists of separate areas for testing, panel training, and facilitate any needed communication between the recep-
administration. Satellite facilities are sometimes required tionist and kitchen staff, and provide easy access to treats
to accommodate testing that cannot be accomplished in and materials needed to clean and maintain the reception
the immediate vicinity of the research center. area. Telephones in this area should be equipped with a
visual alert, so that the auditory alert may be turned off rules for maintaining these locations apply; that is, a clean,
during testing. quiet, and distraction-free test environment.
3. Sensory Booths
III. ORGANIZING SENSORY PANELS
Individual sensory booths need to minimize visual and other
There are several steps to developing and organizing effec-
potential distractions to the subjects from one another, from
tive sensory panels. Subjects must be recruited and quali-
the kitchen, and from the reception area during the evalu-
fied, must participate in tests, be motivated, and have their
ation process. Dimensions, surface colors and materials,
performance monitored.
lighting, ventilation, pass-through structure, expectoration
system, signal system, and direct data entry systems must
be considered when designing sensory booths for food A. RECRUITING SUBJECTS
research. Clean filtered air should flow into the booth area, Subjects for sensory panels may be recruited from a gen-
and air should flow from the booths to the kitchen, and not eral population of consumers local to the sensory facility,
the reverse. The interested reader will find useful booth or from a company’s employee list. Because sensory panel
design information in several texts, and is directed to membership is performance related, the decision about
Eggert and Zook (15) and Stone and Sidel (2) for additional
Downloaded by [University of Newcastle] at 11:02 31 October 2015
category. Screening should involve at least ten product Positive motivation begins with informing subjects that
pairs, given in replicate, covering all sensory modalities the task is important, maintaining an efficient and profes-
and increasing in difficulty from easy (85 to 100% success sional test program, and providing positive reinforcement
rate) to very difficult (50% or less success rate). where possible, without disclosing information that can
influence upcoming tests. Following testing, a daily treat,
C. SCHEDULING SUBJECTS such as a beverage, baked good, confectionery, fruit, veg-
etable and dip immediately reinforce subject participation.
Subjects should be scheduled to arrive at the sensory facility These items need to be thoughtfully presented and fresh.
at the same time, receive their instructions as a group, pro- Once they become routine or are leftovers from previous
ceed to their assigned booth, and begin their evaluation. tests, they lose their reinforcing value. Where there is insuf-
Scheduling avoids conflicts, assures similar instructions and ficient space for a treats table, or where one would interfere
products are given, provides order, and establishes this as a with testing, a coupon of equal value and redeemable at the
serious professional activity. Allowing subjects to arrive at company cafeteria can be substituted. Holidays provide the
the facility and begin testing whenever they like introduces opportunity to incorporate a special theme into the daily
potential error from a variety of sources including product treat.
stability, extending the session too long, and traffic conges- Long term incentives include various forms of recogni-
tion in the test area. Most testing is done in the morning
Downloaded by [University of Newcastle] at 11:02 31 October 2015
discovery, and is complete when marketing research current product. A difference usually results in rejection
demonstrates sufficient consumer interest in the new prod- of the cost-reduced product. Where a difference is found,
uct. The amount and detail of direction consumer research descriptive tests may describe differences to determine
will provide about the sensory characteristics for a new whether the difference is for an important attribute or one
product will differ based on a company’s ability to obtain that may be readily corrected. Caution is advisable rela-
that information. Trained panel (e.g., QDA) evaluations in tive to using a series of discrimination tests where a suc-
these consumer studies will provide an accurate description cessful cost-reduced product replaces the control. For
of product differences and similarities, and when combined example, if A is not significantly different compared to B
with consumer preferences will identify sensory attributes and B is not significantly different compared to C, and so
and intensities to optimize consumer acceptance. The sen- forth, A very well may be significantly different when
sory description for the optimized product becomes the compared to Z. This transitivity can occur when a series
development target. Trained descriptive panels and quali- of non-significant differences result in a product that is
fied discrimination panels are used to determine when one significantly different from the original control. This pos-
or more products satisfactorily achieve or approximate a sibility can be minimized through the use of descriptive
sensory optimized target. analysis, comparing the descriptive profile of the proposed
new control to that of reproduced original control or to the
Downloaded by [University of Newcastle] at 11:02 31 October 2015
range for each, and these results are used to train quality continuing through tracking product performance, and
control panels at manufacturing sites. contributing to special assignments such as developing tests
and data to support or challenge advertising claims. We
2. Production Benchmarking briefly describe the role of sensory evaluation in those activ-
ities.
Once a new product has been accepted for manufacture
and distribution, production samples are evaluated by a
1. Monitor Competition
trained descriptive panel to establish the sensory profile
for the product when first released for sale. The bench- Discrimination and descriptive panel information are used
marking documentation is used for future comparisons to provide immediate intelligence about the similarity and
and references to the original product. differences between competitive products and also for
tracking changes and trends over time.
3. Manufacture Quality
2. Advertising/Claim Support
In a manufacturing facility, subjects are screened and trained
to evaluate a product at one or more stages of production Sensory claims often appear on television, radio, and in
(and where permitted according to GMP and HACCP mass media print and trade oriented publications. Most
Downloaded by [University of Newcastle] at 11:02 31 October 2015
requirements). These evaluations can include ingredient claims are comparative and preference related, others may
receipt and storage, during manufacture, and finished prod- identify specific sensory differences. In either case, the
uct. For finished product, trained sensory panels evaluate claims can be challenged and damages assessed where the
attributes included in the sensory specification. A sensory claim is not substantiated. All phases of the research,
professional coordinates the activity but is not a panel including subject recruitment, selection, training, testing
participant. Subjects evaluate attribute intensities, and the and the data analysis are subject to examination and chal-
sensory professional uses that information to determine lenge. Therefore, it is important that sensory claims are
whether to recommend accept, hold for further evaluation, supported by appropriate research that can be defended in
or reject the product. the legal system. The research must demonstrate that it is
free of bias that could influence the outcome, and that sound
testing principals are used.
4. Shelf-Life and Stability
The sensory quality of products deteriorates over time, and 3. Category Review
the rate of deterioration depends on factors related to ingre-
dients, packaging, handling, and storage. Several different A broad array of products, often between 12 to 24 and
sensory related criteria are possible for determining when representing the competitive and sensory range for a cate-
the end of shelf-life occurs, and no one criterion is univer- gory, are evaluated by a trained descriptive panel and by
sally accepted. Using a qualified descriptive or discrimi- consumers. Reviews for a rapidly changing product cate-
nation panel, any statistically significant difference for a gory may be scheduled as frequently as 6 month or 12
sensory attribute compared to a control product is a popular month intervals. The information is used to monitor the
choice for determining end of shelf-life, albeit a conserva- competitive market on a broad scale and to track changes
tive one. Where a sensory specification is available, stored and trends over time.
product is evaluated to access whether it deviates from
that specification. A loss of consumer acceptance is another 4. Product Optimization
popular criterion used to determine end of shelf-life.
This category of sensory research is conducted for the
purpose of optimizing consumer liking for a product. Two
5. Distribution Product different approaches are used for selecting products for
Products may experience abuse after leaving the manu- this research; one is a design of experiments (DOE), and
facturing facility; therefore, many companies audit product the other is a category review (CR). The DOE limits prod-
quality at the retail level. Results from qualified descrip- ucts to those the researcher modifies according to a pre-
tive and discrimination sensory panels are used to measure scribed experimental design (e.g., factorial, RSM); the CR
differences compared to data from the product benchmark includes a range of products having sensory differences
or sensory specification. representing the product category. A qualified discrimina-
tion panel may be used to eliminate redundant products
and a trained sensory panel to provide a descriptive analy-
C. MARKETING
sis of the products. The statistical analysis relates the con-
Sensory evaluation supports marketing and marketing sumer, trained panel, and analytical (if included) data sets
research activities beginning with new product development, to determine combinations of sensory and analytical
attributes that best predict optimal consumer liking. The characteristics of a control product that is a benchmark. For
descriptive panel is used for the follow-up research to eval- each trial in this fixed order scenario, two of the three prod-
uate test products until one or more satisfy the optimized ucts are the control and the third is the test product.
sensory target. Sensory analytical tests typically use small numbers of
subjects screened for their sensory skills, and the discrimi-
nation model is no exception. About 12 screened and qual-
V. DISCRIMINATION METHODS
ified subjects are optimal, and together with a recommended
Discrimination tests are classified as analytical methods, replicate trial provide sufficient data for statistical analysis.
and are used primarily to determine whether there is a per- Replication is recommended for analytical sensory tests,
ceived difference between products. The tests may be because each subject contributes a large percent of the data
directional, where the attribute of difference is named, or and it is important to know which subjects have difficulty
non-directional, where the attribute of difference is not replicating their response. An inability of a subject or a panel
named. Discrimination tests are relatively uncomplicated to replicate responses can provide important information
to administer (most use a simultaneous sample presenta- about problems with the subjects, procedure, and products.
tion), analyze, interpret, and report. They are used in sensory
studies primarily to screen products as a prelude to other 3. Data Analysis
Downloaded by [University of Newcastle] at 11:02 31 October 2015
A. GENERAL REQUIREMENTS
1. A-Not-A Sequential Presentation
Following are some requirements for discrimination tests.
We are reminded that these tests are sensory analytical Subjects are provided with two 3-digit coded products,
tests, and as such use small numbers of screened and qual- one followed by the other, and indicate whether the sec-
ified subjects. By small numbers, we mean 20 to 25; how- ond product is the same, or different from the first. The
ever, there is no rule that precludes more or fewer subjects. first product is removed prior to serving the second to
eliminate a direct comparison of the two. This method is
used primarily to minimize observation of unimportant
1. Subject Selection visual differences between products, and to control expo-
Subjects for sensory analytical tests are screened for their sure time between products having significant carryover
sensory skill, and for discrimination tests this means for effects. The probability of a correct judgement is 0.50;
their ability to detect differences between products. Using data analysis is straightforward and based on the binomial
consumers or other subjects that have not been screened expansion, for which there are several published tables.
for sensory acuity is not common practice or is it recom- The A not-A method can be expanded to where two or
mended. Unskilled subjects increase the risk of a Type II more coded products are given to the subject, one at a
error (i.e., missing a real difference); large numbers of sub- time, without reintroduction of the first product. Although
jects, typical for consumer tests, increases the risk of a Type this further reduces carryover effects, selection of this
I error (i.e., finding a false difference). A previous section serving option must be balanced against unwanted differ-
provides a detailed discussion about subject screening. ences in subject’s memory skills.
then the product on the right. This design should include They are instructed to try the product on the left first, and
an equal number of pairs in which the products are in fact then the product on the right. The order of tasting is bal-
the same, to reduce the impact of false hits from subjects anced across subjects and replications.
that always respond that there is a difference. The primary criticisms of this method, and of all direc-
tional discrimination tests, is the time and effort required
3. Duo-Trio to train subjects to evaluate the specific attribute, and know-
ing whether subjects actually selected a product based on
Three products are presented simultaneously to the subject,
the specified criteria. Commercial products are often quite
where one is coded “R,” for reference, and the other two
complex, where an ingredient or process difference can
products have unique 3-digit codes. The subject is instructed
result in several sensory differences. Limiting the subject’s
to select which coded product is most similar to the refer-
response to a single attribute may result in a correct choice
ence. Similar to the A-not-A method, it does not require
based on incorrect criteria. For example, different sweet-
selection based on a named attribute or training with that
ener types and amounts for a beverage or baked product
attribute, has a p ⫽ 0.50, and one can use published tables
may alter visual and textural characteristics. Products scored
for the binomial expansion. As we will see, this method
different for sweetness actually may differ on some other
requires less re-tasting and consequent potential for carry-
attribute. Several investigators (1,18,19,20) have convinc-
over effects than does the Triangle method. A sequential
Downloaded by [University of Newcastle] at 11:02 31 October 2015
consumer response behavior and for identifying those for selected chemical compounds, and a personal inter-
product attributes that are most important to consumer view to evaluate interest, attitude and availability. Training
preferences. Before discussing specific methods, it is nec- takes 6 months or more and includes instructional infor-
essary to define what is meant here by descriptive analy- mation about the senses, introduction to the components
sis. Descriptive analysis is a sensory methodology that of the flavor profile, and direct experience evaluating
provides quantitative descriptions of products based on selected products. Training also includes threshold deter-
the perceptions from a group of qualified subjects. Based mination for the basic tastes, skills having little or no con-
on the method, it provides information about a specific nection with evaluation of products containing mixtures
sensory modality or provides a total sensory description, of supra-threshold stimuli. Selected references are intro-
taking into account all the sensations that are perceived, duced to demonstrate specific sensations, and practice
visual, auditory, gustatory, olfactory, and kinesthetic — profiles obtained for products evaluated by an experi-
when the product is evaluated. The evaluation is usually enced panel. During training all subjects serve as panel
(but not always) done in a controlled environment, and leader until one is selected. The panel leader is key to this
can include product handling, and, in this sense, it is a total method; they coordinate the testing and report results. This
experience. The evaluation also can be done in a home individual assumes a leadership role, directing the conver-
environment when normal preparation, use, and consump- sation and providing a consensus conclusion based on the
Downloaded by [University of Newcastle] at 11:02 31 October 2015
tion are required over a period of time. results. The role of panel leader as defined in the method
The origins of descriptive information can be traced to can have significant consequences without some inde-
early brewmasters, perfumers, flavorists, and other product pendent controls. Subjects could be led to a conclusion
specialists. These technical specialists described products, without being aware this had occurred.
made recommendations about the purchase of specific raw The method had considerable appeal because results
materials, and also evaluated the effect of process variables could be obtained rapidly. Subjects meet, as a group, for
on product quality (as they determined product quality). about an hour to evaluate a product, reach a consensus about
Companies used that information to determine that a par- its sensory properties, and provide the requestor with a
ticular product met their criteria for manufacture and sale to result. The developers of the method emphasized that there
the consumer. These early activities were the basis for the would be confidence based on the collective professional
foundation of sensory evaluation as a science, although at judgment of the panel and this would obviate the need for
the time it was not considered within that context. Formal statistics. The method is unique in that no direct judgement
descriptive analysis received its major impetus from “flavor was made concerning consumer acceptance of the product,
profile,” an approach that demonstrated it was possible to although most investigators assumed consumer acceptance
select and train individuals to describe the sensory proper- based on a result. An early recognized shortcoming of the
ties of a product in some agreed sequence, leading to action- method was its focus on the single sensory modality of fla-
able results, without requiring years as an apprentice to an vor. Subsequent methods would extend screening, training,
expert. The method attracted considerable interest and was a and evaluation to one or more additional modalities.
milestone in the development of the science of sensory
evaluation. It also was the source of much controversy, and
B. TEXTURE PROFILE
since then other descriptive methods have been developed.
Chronologically the next descriptive method of impor-
tance was the Texture Profile® method developed at the
A. FLAVOR PROFILE
General Foods Research Center (24, 25, 26). The method
The Flavor Profile® method described by Cairncross and was based on Flavor Profile, and extended descriptive analy-
Sjöstrom (21), Sjöstrom and Cairncross (22), and Caul sis into a second modality, texture. Brandt and co-workers
(23) was developed at the Arthur D. Little Co. and is the (24) defined a texture profile as “the sensory analysis of
only existing formal qualitative descriptive procedure. the texture complex of a food in terms of its mechanical,
The method utilizes a panel of four to six screened and geometrical, fat and moisture characteristics, the degree of
selected subjects who examine, discuss, and evaluate prod- each present and the order in which they appear from first
uct in an open session. Product attributes are scored for bite through complete mastication.” Several of the proce-
intensity in the order of their perception. The intensity dures introduced by Texture Profile raise behavioral issues
scale was: 0 ⫽ none,) (⫽ threshold,) (⫺1, 1 ⫽ slight, related to panel training.
1–2, 2 ⫽ moderate, 2–3, and 3 ⫽ strong. Once agreement If the panel is to be used also for flavor assessment, sub-
is reached on the description of the product, the panel ject screening includes tasks identical to those described for
leader summarizes and reports the results without appli- Flavor Profile. Texture screening tests were devised for hard-
cation of any statistical analysis. ness and viscosity ranking, and for geometric matching. As
Six subjects are selected for training based on technical many as 40 to 50 subjects may be screened, with 10 to 15
background and a series of screening tests, including basic selected for training. Full or partial denture wearers were
taste recognition and intensity ranking, odor identification not considered for panel membership.
Texture training included a two week orientation phase, and eliminates the opportunity to assess the importance of
a three month practice phase, and a three month expansion individual differences for understanding and generalizing
phase. A standard set of references was used to demon- results to consumers.
strate taste, aroma and mouthfeel sensations. The concept Interest in eliminating subject variability is, on the sur-
of the components of flavor profile was extended to include face, a reasonable (although not likely or practical) idea
what were described as mechanical, geometrical, and a cat- but it must not give way to procedures that sacrifice the
egory of “other” (moisture and fat) characteristics, inten- validity of the measurement system. The human is a living
sity, and the order of appearance for the texture properties. and changing organism, influenced by physiological and
An effort was made to describe the physical properties of psychological conditions that better support a concept of
texture and to provide a sensory definition for each des- expected variability. Products also are themselves variable.
cribed physical property. For these reasons, a panel is used rather than a single sub-
Initially, the texture terms and definitions were com- ject, and replication has become a mainstay in most cur-
piled, sorted, and categorized from both scientific/technical rent descriptive methods.
and popular terms, the latter from laboratory or field situ-
ations in which common usage terms were used to describe C. QUANTITATIVE DESCRIPTIVE ANALYSIS (QDA)
textural sensations. This led to the development of a clas-
Development of the Flavor and Texture Profile methods
Downloaded by [University of Newcastle] at 11:02 31 October 2015
stimuli are used, increasing in difficulty from the first pair and differences, and has become a standard in the food
(easy to discriminate) to the last pair (difficult to discrim- industry. A proprietary statistical package was made avail-
inate), covering all sensory modalities, and including any able for analyzing QDA data, and included in Tragon’s
differences known to be relevant to consumers. Correct training program for panel leaders. The statistical package
selection of the different product in more than about 65% uses one-way and two-way Analysis of Variance models
of the trials, and consistency in correct selection across in prescribed ways to assess subject performance. Current
replications are used as a guide for subject selection. innovations to the package include algorithms for assess-
ing crossover and magnitude interaction, scale use, and
3. Training multivariate analysis and mapping techniques. QDA data
is well suited to the recent advances in statistical analysis
QDA training focuses on language development and the and computer technology.
evaluation process, and provides subjects experience with
the range of products they will evaluate. Language devel-
opment is a consensus building process. The panel leader’s D. SENSORY SPECTRUM ANALYSIS
function is to keep the panel focused on developing objec- This method positions itself as a hybrid of the three methods
tive terms (in contrast to attitudinal terms) that can be described previously; however, it is primarily an extension
measured and obtain agreement amongst subjects on the
Downloaded by [University of Newcastle] at 11:02 31 October 2015
important, differences. The method does not demonstrate Time-intensity studies have been reported for the taste,
any substantially unique advantages and may be better aroma, and texture of foods, and for a variety of products
suited to classification and comparison to other consumer including bread, chewing gum, beer, wine, ice cream, chili
procedures than to trained panel procedures. pepper, gelatin, and chocolate. Commercial interest in syn-
thetic sweeteners in the 1970s prompted significant pub-
F. OTHER METHODS lished and unpublished research into the temporal effects
of these sweeteners compared to sucrose.
Descriptive information is recognized as important, if not
crucial, as a means for providing a comprehensive product
assessment. Descriptive panels are used in several venues A. MEASUREMENT
including, and not limited to, product development, qual- Interest in time and intensity relationships in sensory eval-
ity control, advertising claim support, and food science uation is evident by the almost universal practice of meas-
education. Different needs have given rise to descriptive uring aftertaste or aftereffect in descriptive analysis and
panel variations other than the methods described above. consumer research. Subjects in time-intensity studies typi-
Many descriptive methods are generic and retain in varying cally receive instruction and practice with the test procedure,
degree elements of the core methods; a few other methods scale, and any special apparatus used for data collection,
have established, or attempted to establish, themselves as
Downloaded by [University of Newcastle] at 11:02 31 October 2015
perceived intensity for the attribute has returned to base- that make the use of statistics especially challenging. First,
line. Data from early generation strip chart recorders was there are many different statistical procedures from which to
handled much like that from similar tests using paper bal- choose, and new tests are regularly introduced and described
lots. The subject’s intensity score at several points in time in the literature. Second, statisticians as well as sensory sci-
were hand measured and converted to numerical value for entists differ in their view about how responses should be
data analysis. This was a labor-intensive process, depend- treated, which type of analysis is more or less appropriate
ing on the number of time points examined. and useful, the extent of any data transformation, and so
Desktop computers linked to a joystick (35), or simi- forth. Third, software packages and the power of the PC
lar device, for measuring perceived intensity introduced make all methods for analysis easily obtainable in terms
significant improvements in the time required to collect, of time and cost, but that does not make them appropriate.
enter, and analyze data, and simplified the subject’s task This section provides an introduction to the use of sta-
compared to strip chart recorders. These advances and tistics as a guideline for making sensory and business deci-
developments in statistical analysis and graphing software sions about products of interest. The focus is on statistical
that could be used with time-intensity data encouraged applications rather than statistical theory and the proofs,
greater use of the method for sensory research. or various algorithms that would take far more than the
space allocated here for this topic. The interested reader
Downloaded by [University of Newcastle] at 11:02 31 October 2015
a. Range
Downloaded by [University of Newcastle] at 11:02 31 October 2015
beneficial to modify the theory and assumptions, rather than test may not agree with practical experience using that test.
forgo the application. Where this is the case, paying attention to the risk one can
still proceed cautiously. Frequently used multiple range
a. t-test tests include; Fisher’s LSD (least significance difference),
The t-test is used to determine whether the means for two Dunnett’s test, Duncan’s test, some tests by Tukey, the
products are significantly different. Different formulae are S-N-K (Student-Newman-Keuls) test, the Scheffé test, and
available to accommodate various conditions such as: inde- the Bonferroni test. Whichever test is selected, the results
pendent and dependent observations, large and small N, must be examined to determine whether they agree with the
proportions, and equal and unequal number of observations. knowledge and expectation the researcher has about the
The t-test is not the best choice for sensory tests where the products and the sensory test.
same subjects evaluate both products, the analysis-
of-variance can be substituted and it extracts more from the
error variance. The t-test also is inappropriate where more
C. CORRELATION AND REGRESSION
than two products are to be evaluated and statistically com- Sensory scientists are often interested in the relationships
pared to one another. This constitutes multiple t-tests of the between different sets of data. The interest may be bet-
same data, a common error. ween different groups of subjects, products, attributes, or
combinations thereof. Because correlation does not imply
b. Analysis of variance (AOV) causation, summary statistics and graphs depicting asso-
The analysis of variance is the primary statistical test used ciation are useful for data reduction, substitution, predic-
for scaled sensory data. It is a versatile statistical procedure tion, and generally improving understanding about
that readily accommodates multiple products, treatment variables. As with inference testing, different statistics are
conditions and groups of subjects. It is used to determine used for different levels of measurement (e.g., nominal,
whether an effect is statistically significant, and when a rank order, or continuous).
difference noted is followed by a multiple range test to These measures are used to examine and better under-
determine which means are different. Several AOV models stand sensory and instrumental relationships, and in opti-
are extensively used in sensory evaluation; these include the mization research where sensory attributes are used as
one-way for independent means, the two-way for depend- predictor variables for consumer acceptance.
ent means, treatment x levels for different treatment con-
ditions (e.g., x-levels of different variables), and split-plot
1. Correlation
models for different subject groups evaluating the same
and different treatments. To understand a simple two-variable correlation it is best
The primary challenges for selecting the best AOV to first graph results as a scatter plot with the data from
model are to select one that is appropriate for the specific one variable on the X-axis and data from the second variable
test conditions, to accurately account for all the sources of on the Y-axis. For example, descriptive scores for sweet-
variance in the test, and to select the appropriate error term ness and bitterness for a set of products can be graphed
for testing the significance of an effect. Statistical signifi- and a correlation for the relationship between the two attrib-
cance in the AOV is based on a calculated F ratio consisting utes calculated. The Pearson product moment correlation
of an effect variance in the numerator and an error variance coefficient is used for continuous scaled data and is depicted
in the denominator. Probability tables for the significance by r. Calculated r-values range from ⫺1 to 0 to ⫹1. The
closer to 0, the less is the association between the two vari- 2. Discriminate Analysis
ables. The minus sign depicts an opposite relationship,
Discriminate analysis is a methodology for finding linear
whereas a plus sign depicts a directionally similar rela-
combinations of the independent variables that can act as
tionship; unity represents a perfect correlation. Pearson’s is
scoring functions to estimate to which of several classifi-
useful only for linear data, curvilinear data can produce low
cation categories an observation belongs. Sensory scientists
values approaching 0, misleading the casual observer to
use this and related procedures (e.g., canonical correlation)
conclude the variables are unrelated. Correlation can be
in optimization research to identify lifestyle, attitude, and
determined for non-continuous scaled data as well; an
classification information that best identify membership
example is Spearman’s rank order correlation coefficient
in different consumer preference groups.
(ρ Greek lowercase rho).
3. Principle Components Analysis
2. Multiple Correlation (R)
This data reduction technique transforms the original set
Multiple Correlation is used to assess the degree of asso-
of variables into as few linear combinations as possible to
ciation between a dependent variable and a set of predictor
explain as much of the total variation in the original data.
variables. In product optimization research several sensory
The linear combinations, identified as the factors or compo-
attributes (independent variables) may be included as
Downloaded by [University of Newcastle] at 11:02 31 October 2015
Puckery Mf Biting Mf
A. HISTOGRAMS
Downloaded by [University of Newcastle] at 11:02 31 October 2015
4
Frequency
0
1 5 10 1 5 10 1 5 10 1 5 10
Sweetness intensity
(N = 12; Scale = 1 to 10)
FIGURE 57.1 Illustration of distribution histograms for examining serving order effect. The graph represents 24 subjects that
evaluated two products in two serving orders. Twelve subjects evaluated Product A first, and 12 subjects evaluated Product B first.
differences for Product A when served in the second order descriptive panel data. The maps often group well liked
would alert the sensory researcher that it may be risky, or products together with sensory attributes and consumer
even inappropriate, to conduct statistical tests for this information (e.g., lifestyle and attitude) identified as
result if homogeneity of variances is an issue. highly related to those product preferences. In the case of
descriptive panel data, it is often helpful to visually dis-
B. QDA SPIDER CHARTS play products by attribute results. Figure 57.3 is such a
graph, and demonstrates how products may differ, or
Figure 57.2 represents a popular way for graphing
group, based on several attributes across different modal-
descriptive panel results (2,27). Figure 57.2 displays results
ities. Here we see that Wines K and D are scored high for
only for mouthfeel from a study that included additional
several different attributes; whereas, Wine G falls between
modalities. Examining Figure 57.2, we readily see product
the two. We would conclude that Wine K is a strong and
differences for individual attributes, relationships among
high impact product in several areas.
the attributes, and the overall tendency for a product to score
Figure 57.4 is an example of a density map based on a
low (Wine D), high (Wine K), or in the middle (Wine G)
cluster analysis of consumer acceptance scores for a broad
compared to other products evaluated.
range of products of the type typically included in a cate-
gory review optimization. The cluster map reveals three
C. SENSORY MAPS
Downloaded by [University of Newcastle] at 11:02 31 October 2015
0.6
0.4
Metallic Ar
Mouthdry Af t
Sour Aft
Coating Aft
Discriminant function 2 (21.2%)
–0.2 Tears
Clear Ap
–0.4
G
FIGURE 57.3 Attribute factor loading for trained panel evaluation of three wines; D, G, and K.
Group 2
Group 3
Group 1
Downloaded by [University of Newcastle] at 11:02 31 October 2015
Preference group 1
Preference group 2
Group centroids
Preference group 3
FIGURE 57.4 Cluster Analysis density map for aggregate acceptance scores. Three unique preference groups were identified.
2
Wine D Wine D Wine K
Wine K
Wine D
Wine D Wine D Wine K
Wine K Wine K
Discriminat axis 2 (21.2%)
Wine D
Wine K
Wine K
Wine D Wine G
0 Wine K
Wine G Wine D Wine K
Wine G
Wine D Wine G
Wine G
Wine D
Wine G Wine K
Wine G G
Wine
–2
Wine G
Wine G
–4
–2 0 2 4
Discriminate axis 1 (78.8%):
FIGURE 57.5 Discriminate plot for three wines across sensory attributes scored by each of 10 subjects. Arrows show two subjects that
differ compared to the other subjects for their evaluation of Products G and K. Filled arrow is Subject #167; open arrow is Subject #542.
Sensory Map
Grape flavor
Fruity aftertaste
Consumer liking
Current control
Market leader
Prototype A
Perfumy aroma
Competitor
Smoky flavor
Downloaded by [University of Newcastle] at 11:02 31 October 2015
Tears
Bitter flavor
Prototype B
Biting mouthfeel
Factor 1
FIGURE 57.6 Sensory map integrating consumer liking and sensory panel attributes for selected products.
that teach and employ them understand that sensory eval- of Foods. 2nd ed. (JR Piggott, ed.). London: Elsevier
uation is about perception; and how best to measure, ana- Applied Science, 1988.
lyze and communicate sensory information. 21. WE Cairncross, LB Sjöström. Flavor profile — a
new approach to flavor problems. Food Technol 4:
308–311, 1950.
22. LB Sjöström, SE Cairncross. The descriptive analysis of
REFERENCES flavor. In: DR Peryam, FJ Pilgrim, MS Peterson. eds.
Food Acceptance Testing Methodology. Washington, DC:
1. HT Lawless, H Heymann. Sensory Evaluation of Food: Natl Acad Sci — Natl Res Counc 1954 pp 25–30.
Principles and Practices Gaithersburg, Maryland: Aspen 23. JF Caul. The profile method of flavor analysis. Adv Food
Publishers, Inc., 1998. Res 7:1–40, 1957.
2. H Stone, JL Sidel. Sensory Evaluation Practices, 3rd ed. 24. MA Brandt, E Skinner, J Coleman. Texture profile
San Diego: Academic Press, 2004. method. J Food Sci 28:404–410, 1963.
3. MA Amerine, RM Pangborn, EB Roessler. Principles of 25. AS Szczesniak. Classification of textural characteris-
Sensory Evaluation of Food. New York: Academic tics. J Food Sci 28:385–389, 1963.
Press, 1965. 26. AS Szczesniak, MA Brandt, HH Friedman. Development
4. DR Peryam, FJ Pilgrim, MS Peterson, (eds.). Food of standard rating scales for mechanical parameters of
Acceptance Testing Methodology. Washington D.C.: texture and correlation between the objective and the sen-
Downloaded by [University of Newcastle] at 11:02 31 October 2015
Natl. Acad. Sci — Natl. Res. Counc, (1954). sory methods of texture evaluation. J Food Sci 28:
5. Anonymous. Minutes of Division Business Meeting. 397–403, 1963.
Institute of Food Technologists — Sensory Evaluation 27. H Stone, J Sidel, S Oliver, A Woolsey, RC Singleton.
Division, IFT, Chicago, Illinois, 1975. Sensory evaluation by quantitative descriptive analysis.
6. MC Bourne. Food Texture and Viscosity: Concept and Food Technol 28:24–34, 1974.
Measurement. New York: Academic Press, 1982. 28. AA Williams, SP Langron. The use of free-choice pro-
7. A Noble. Instrumental analysis of the sensory properties filing for the evaluation of commercial ports. J Sci Food
of food. Food Technol 29:56–60, 1975. and Agric 35:558–568, 1984.
8. HG Schutz, VG Vely, RB Iden. Relation of odor to 29. N Larson-Powers, RM Pangborn. Descriptive analysis
physicochemical properties of compounds (Interim. of the sensory properties of beverages and gelatins con-
Reprot). QM Food and Container Institute for the Armed taining sucrose or synthetic sweeteners. J Food Sci 43:
Forces. (Contract No. DA-19-129-QM-1500), 1961. 47–51, 1978.
9. WF Wilkens, JD Hartmann. An electronic analog for the 30. JE Hanson, DA Kendall, NF Smith. The missing link:
olfactory process. J of Food Sci 29:372–378, 1964. correlation of consumer and professional sensory
10. PN Bartlett, JM Elliott, JWGardner. Electronic noses descriptions. Beverage World 102:108–116, 1983.
and their application in the food industry. Food Technol 31. CR Stampanoni. The quantitative flavor profiling tech-
51: 44–48, 1997. nique. Perfumer and Flavorist 18:19–24, 1993.
11. RN Bleibaum, H Stone, T Tan, S Labreche, E Saint- 32. H Hahn. Die adaptation des geschmackssinnes.
Martin, S Isz. Comparison of sensory and consumer Z Sinnesphys 65:105–145, 1934.
results with electronic nose and tongue sensors for apple 33. S van Buuren. Analyzing time-intensity responses in sen-
juices. Food Quality and Preference 13:409–422, 2002. sory evaluation. Food Technol 46:101–104, 1992.
12. NS Lewis. The Caltech Electronic Nose Project. 34. N Larson-Powers, RM Pangborn. Paired comparison
Engineering & Science, No 3, 1996. and time-intensity measurements of the sensory proper-
13. G. Taubes. The Electronic Nose. Discover 40–50, 1996. ties of beverages and gelatins containing sucrose or syn-
14. L Malcolmson, L Jeffrey, DD Sharma, PKW Ng. thetic sweeteners. J Food Sci 43:41–46, 1978.
Relationship between sensory clarity and turbidity 35. JX Guinard, RM Pangborn, CF Shoemaker. Computer-
values of apple juice. Canadian Institute of Science and ized procedure for time-intensity sensory measurements.
Technology Journal 22:129–132, 1989. J Food Sci 50:543–546, 1985.
15. J Eggert, K Zook, eds. Physical Requirement Guidelines 36. MC Gacula Jr, J Singh. Statistical Methods in Food and
for Sensory Evaluation Laboratories. ASTM STP 913. Consumer Research. Orlando: Academic Press, 1987.
Am Soc Test Mater, Philadelphia, Pennsylvania, 1986. 37. MC Gacula Jr. Design and Analysis of Sensory Optimiza-
16. EB Roessler, RM Pangborn, JL Sidel, H Stone. tion. Connecticut: Food and Nutrition Press, 1993.
Expanded statistical tables for estimating significance in 38. J Cohen, P Cohen. Applied Multiple Regression/Correla-
paired-preference, paired-difference, duo-trio and trian- tion Analysis for the Behavioral Sciences. 2nd ed. New
gle tests. J Food Sci 43:940–943, 1978. Jersey: Lawrence Erlbaum Associates, 1983.
17. M O’Mahony. Sensory Evaluation of Food: Statistical 39. WR Dillon, M Goldstein. Multivariate Analysis: Methods
Methods and Procedures. New York: Dekker, 1986. and Applications. New York: John Wiley & Sons, 1984.
18. DM Ennis. The power of sensory discrimination meth- 40. WS Cleveland. Visualizing Data, Summit, New Jersey:
ods. J. Sensory Studies 8:353–370, 1993. Hobart Press, 1994.
19. DM Ennis, K Mullen. Theoretical aspects of sensory 41. K Greenhoff, HJH MacFie. Preference mapping in prac-
discrimination. Chem. Senses 11:513–522, 1986. tice. In: Measurement of Food Preferences. (HJH
20. JER Fritjers. Sensory difference testing and the meas- MacFie and DMH Thompson, eds.). Blackie, Academic
urement of sensory discriminability. In Sensory Analysis and Professional. 137–166, 1994.