Sie sind auf Seite 1von 14

Empirical Accounting Research Design for Ph. D.

Students
Author(s): William R. Kinney, Jr.
Source: The Accounting Review, Vol. 61, No. 2 (Apr., 1986), pp. 338-350
Published by: American Accounting Association
Stable URL: http://www.jstor.org/stable/247264 .
Accessed: 09/05/2014 16:35

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp

.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.

American Accounting Association is collaborating with JSTOR to digitize, preserve and extend access to The
Accounting Review.

http://www.jstor.org

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
THE ACCOUNTING REVIEW
Vol. LXI, No. 2
April 1986

EDUCATION RESEARCH
Frank H. Selto, Editor

Empirical Accounting Research Design


For Ph.D. Students
WilliamR. Kinney, Jr.

ABSTRACT: This paper discusses an approach to introducing empirical accounting research


design to Ph.D. students. The approach includes a framework for evaluating accounting ex-
periments as well as studies based on passive observation of subjects or data. Alternative
methods of isolating the effect of the "independent" variable of interest from effects of prior-
to-the-study-period variables and contemporaneous variables are discussed along with the
advantages and limitations of each method. Also discussed is the relationship between type I
and type 11error risks, sample size, and research design. The importance of research design,
including theory development and means for mitigating the effects of extraneous variables, is
emphasized as perhaps the only practical way to achieve research objectives in empirical re-
search in accounting.

A FREQUENTLY encountered problem The approach is generic-it is not tied to


in accounting Ph.D. programs is an area of accounting and doesn't depend
that first-year students do not have on prior knowledge of a particular para-
background in empirical research in ac- digm.'
counting. Few B.B.A., M.B.A. or M.Acc. ' Illustrations and extensions from applied areas of
programs include courses in empirical re- accounting are also helpful. Good sources for financial
search and many students have not seri- accounting are Ball and Foster [1982], Lev and Ohlson
ously considered its nature. Yet, such an [1982], and Abdel-khalikand Ajinkya [1979].Good sources
for behavioral work are Ashton [1982] and Libby [1981].
introduction is necessary if Ph.D students
are to efficiently relate other courses to I would like to acknowledge the helpful comments and
substantive problems in accounting and suggestions of Vic Bernard, Dan Collins, Grant Clowery,
Bob Libby, JerrySalamon and two anonymous reviewers.
be able to take full advantage of account- An earlier version of this paper was presented at the
ing workshops. American Accounting Association's Doctoral Consor-
The purpose of this paper is to show tium in Toronto, Ontario, in August 1984.
how a basic framework for evaluating
empirical research in accounting can be William R. Kinney, Jr. is Price Water-
obtained in a short introduction. This can house Auditing Professor at the Univer-
be done at the start of the first term course sity of Michigan.
and provides a context for further work in Manuscript received September 1984.
philosophy of science and statisticaldesign Revisions received April 1985 and August 1985.
as well as substantive areas of accounting. Accepted September 1985.
338

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
Kinney 339

The generic approach focuses attention for things make a difference?"' Clearly,
on the essence of scientific inquiry in ac- the accounting for items affecting tax pay-
counting. Many of the problems faced by ments makes a difference in our individ-
accounting experimenters who can man- ual and collective lives. But does the ac-
ipulate some (but not all) of the levels of counting for, say, depreciation in internal
variables to be studied are similar to those or external reports affect decisions within
faced by "passive observers" of the levels a business firm or affect stock prices? If
of all variables as set by Nature.2 Thus, it does, then the size of the effect and why
the generic approach may help to avoid it occurs are important follow-up ques-
premature specialization [Boulding, tions. Additionally, the accounting re-
1956, p. 199]. searcher must separate the underlying
Section I presents a definition of empir- economic event (or state) from the account-
ical accounting research and theory, hy- ing report of the event. Thus, while a fi-
pothesis, and fact. It also defines "depen- nance researcher may be concerned only
dent," "independent," and prior and with firm characteristics, the accounting
contemporaneous influence variables. researcher must also be concerned with
Section II discusses alternative means for the costs and benefits of alternative ac-
separating the effects of prior and con- counting reports of those characteristics.4
temporaneous influence variables from In essence, empirical research involves
the independent variable(s), and in Section theory, hypothesis, andfact. "Facts" are
III the interrelationships among signifi- states or events that are observable in the
cance, power, and research design are real world. A "theory" offers a tentative
explored. Section IV gives a summary and explanation of the relationship between
conclusions. or among groups of facts in general. "Hy-
potheses" are predictions (or assertions)
I. A FRAMEWORK
FOREMPIRICAL
about the "facts" that will occur in a par-
RESEARCH IN ACCOUNTING
ticular instance assuming that the theory
Research is a purposive activity and its is valid. Finally, observing "facts" con-
purpose is to allow us to understand, pre- sistent with the prediction or assertion
dict, or control some aspect of the envi-
ronment. Research will be defined here 2 The problems are not identical. For example, while

as the development and testing of new experimenters have the advantage of being able to specify
the values of some variables, they also face the risk of
theories of 'how the world works' or refu- choosing values that are too close together (or too far
tation of widely held existing theories. apart) to allow precise estimates of treatment effects or to
For accounting research, the theories con- allow generalization of conclusions to the real world.
3 Within this definition, relevant questions for
cern how the world works with respect to
auditing include, "Does how precisely we audit and
accounting practices. Watts and Zimmer- report the state of a firm make a difference?" and,
man [1984, p.1], state: "The objective of "How can audits of a given precision be conducted ef-
accounting theory is to explain and predict ficiently?" The first auditing question is related to the
accounting question through the concept of materiality.
accounting practice." This positive, how- Parallel questions involving the design of accounting
the-world-is approach is in contrast with systems also could be developed.
the more traditional normative view that 4Accounting professors may conduct research in
finance, economics, behavioral science, or statistics. If
accounting "theory" is concerned with the accounting question is not addressed, however, the
what accounting practices ought to be. accounting professor may face the disadvantage of being
Empirical accounting research (broadly undertrained relative to other researchers. Also, he or she
ignores a comparative advantage in the knowledge of
considered) addressesthe question: "Does accounting institutions and the sometimes subtle role of
how we as a firm or as a society account information.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
340 The Accounting Review, April 1986

made in the hypothesis lends credibility to if firm "performance" is the theoretical


the theory. principal and earnings is chosen as the
Ordinarily, researchbegins with a real- surrogate measure of firm performance,
world problem or question. One thinks then straight-line depreciation is a com-
about or studies the problem, reads about ponent of the surrogate.
seemingly similar problems in other areas As a surrogate, straight-line deprecia-
or disciplines such as economics, psy- tion contains two sources of potential
chology, organizational behavior, or po- error that may require consideration by
litical science. By immersing himself or the researcher. One is the surrogation
herself in the problem, the researcher error due to the fact that straight-line de-
may, either by genius or by adapting a preciation does not "correctly" reflect
solution from another area, develop a the relevant performance of the firm for
general theory to explain relationships the purpose at hand. The other is applica-
among facts [Simon, 1976, Chapter 7 and tion error due to mistakes or imprecision
Boulding, 1956]. From this statement of in applying straight-line depreciation.
the general relationship among facts, hy- On the other hand, in evaluating possi-
potheses about what should be observed ble determinants of managerial behavior,
in a particular situation can be derived. audited earnings using straight-linedepre-
An experiment or passive observation ciation may be specified in a contract and
study then can be designed to support or may serve as a principal. For example, if a
deny the hypotheses. manager is to receive a bonus or profit
For example, suppose it is observed share of one percent of audited earnings,
that a stock price increase usually follows then audited earnings is the principal.
the announcementthat a firm has changed Surrogation error, and any application
from straight-line to accelerated depre- error not detected and corrected by the
ciation. Why should a mere bookkeep- auditor, is ignored for contract purposes.
ing change seem to lead to an increase The same number used as a measure of
in firms' values? An explanation might firm performance will likely contain both
be that market participants believe that surrogation and application error.5
eventsleadingto such an accountingchange To add credibility to a theory, one must
also typically lead to better prospects for not only be able to show hypothesis test
the firm in the future. With development resultsthat are consistent with the theory's
and elaboration of such a theory, the predictions, but also have a basis to rule
researchermight develop a passive obser- out alternativeexplanationsof the observed
vation study of past changes or an exper- facts. This requiresconsideration of a rea-
iment to test hypotheses based on the pro- sonably comprehensive list of alternative
posed explanation. explanations. Again, knowledge of related
Theories are usually stated in terms disciplines is useful in generating alterna-
of theoretical variables or "principals" tive explanations for accounting-related
while empirical measurement requires ob- "facts." Some possible explanations can,
servable variables. The difference between of course, easily be ruled out as being of
the principal and real-world observable 5 Accounting systems designers and financial
variables presents difficulties for account- accounting standards setters can control the first type of
ing researchers since accounting mea- measurement error, while auditors and auditing stan-
dards setters can control the second. Problems relating to
surements may be either surrogates for the interaction of accounting and auditing standards
some underlying principal of interest or setters, users, and auditors is, of course, a matter for
may be the principal itself. For example, accounting research.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
Kinney 341

likely negligible effect, but others will re- DISENTANGLINGTHE EFFECTS FROM VS
quire attention. AND ZS FROMTHE EFFECT OF X
To be more specific, let Y denote the
For simplicity, assume that X is mea-
"dependent" variable to be understood,
suredat only two levels. Eitherthe observed
explained, or predicted. Variables caus-
Yis from the "control" group that receives
ing Y(or at least related to Y) can be clas-
no "treatment" or from the treatment
sified into three broad groups as follows:
group that receives the treatment. Alter-
X=the "independent" variable that natively, the two groups could simply be
the proposed theory states should different on some relevant dimension
effect Y, (e.g., to test theories about accelerated
Vs = prior-influence (prior-to-study depreciation, the control group might be
period) factors that may affect Y, defined as those firms that use straight-
and line and the treatment group as those that
Zs= contemporaneous factors (other use accelerated).
than X) that may affect Y.6 Also for simplicity, assume that there is
a singleprior-influencefactor Vthat effects
That is, Y=f(X, V,, V2, . ..,ZIZ2, ... Yand Vhas the same effect on Ywhether
A common Yfor addressingan account- the subject is from the control or treat-
ing question is the change in a firm's stock ment group. Furthermore, there are no
price. Others are a manager's act or deci- contemporaneous Zs that affect Y and
sion. A common X is a change or differ- the model determining Y is:
ence in accounting method, whether by Yij=Bo+BX,+B2Vj+ ej,, (1)
management's choice or by a regulatory
where Yij is the value of the dependent
directive. Prevalent Vs are the firm's prior
variable for the "j"th subject in the "i"th
period state variablessuch as profitability,
leverage, liquidity, and size. For tests of treatment, Bo is the intercept for the con-
trol group, Xi is an indicator variable
theories about decision-making behavior
(zero for the control group and one for
of particular human subjects, relevant
Vs often include the subject's personality treatment), Bo+ B1 is the intercept for the
treatment group (that is, B1is the effect of
traits, mathematical ability, education,
treatment), B2is the regression coefficient
training, age, firm association, and exper-
relating Vto Y, and eij is a random error
ience.
term. The eij term will include the effects
The most common Z factor in account-
of other Vs and Zs that are here assumed
ing research studies involving stock prices
to be negligible and randomly distributed
is the market return (Rm). Another com-
between the two groups, and eijis assumed
mon group of Zs for external reporting
to have expectation zero and be uncorre-
and managerial performance studies is
lated with either X or V.
the unexpected portion of contemporan-
For the simple model of equation 1, a
eous accounting measures for other firms
plot of the expected values of Y given V
or other divisions. Finally, since the ac-
for both groups will be parallel lines with
counting researcher is concerned with the
possibly different intercepts. The differ-
effect of accounting reports, Zs may be
ence in intercepts is the effect of the treat-
underlying characteristics of the "true
ment (B1). Figure 1 shows the components
state" of the firm at the time of the study
as measured by contemporaneous nonac- 6 Some Zs may be expectations,at the time of the

counting reports about the firm. study,of still futurevaluesof X, Y, V, and otherZs.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
342 The Accounting Review, April 1986

FIGURE 1
Y VALUES GROUPSUNDERANOVA, ANCOVA, ANDMATCHING
FORNONEQUIVALENT

I ~~~~~~~~~~~~Y=00+
X+02V

X
ANOVA ;
/- | andMatching
~~~NCOVA V

~~~~~~~~~~I I

lI _~~~~~~ I I
I I
V. V Vs

Matches

of equation 1 along with ellipsesthat ap- may assign to the treatmentgroup, say,
proximatethe locus of membersof the those with high values of V and to the
two groups. controlgroupthose withlow V. In analy-
An experimentermay ignore V and zingresults,the effect of the high Vvalues
may randomlyassignsubjectsto groups. (i.e., B2VJ) will be mixed with the treat-
On average,the groupswill be equivalent ment effect. An experimentermay rule
on V. For small samples,however,there out the possible effect of V by random
is nontrivialrisk that such a procedure assignmentof subjects measuredon V

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
Kinney 343

between groups. The sample subjects are Ignoring V


measured on V, matched into pairs ac- As discussed above, ignoring a poten-
cording to their V values, and one from tial V is generally inadvisable due to the
each pair is randomly assigned to treat- unknown effect of V. A negligible effect
ment. Thus, even for small samples the is the hopedfor result for any unmatched,
two groups will be approximately equiva- unmeasured, or unknown Vsor Zs. How-
lent on V. ever, most real-worldevents have multiple
The passive observerhas no opportunity causes and a negligible overall effect is
to randomlyassignsamplesubjectsto treat- unlikely. Furthermore, larger samples
ment. Even experimenters may have dif- will not help in researchdesignsthat ignore
ficulty developing a satisfactory random- systematic effects of V.
ized design due to having too many While expost matching and covariance
potentially important Vsand Zs that must analysis can't account for all possible Vs
be simultaneously matched. Thus, in gen- and Zs, they can reduce the risk that some
eral, researchersface the problem of treat- potentially important Vs and Zs disguise
ment groups that are not equivalent with the true effect of X. Figure 1 shows the
respect to V. relevant sampling distributions for the
For nonequivalent groups, there are three approaches applied to the example.
three basic ways in which the researcher As shown in the relativelyflat distributions
can mitigate the possible effects of the V on the left margin, ANOVA is based on
factor in the model of equation 1. These the marginal distribution of Y with no
are: consideration of V.
1. ignoring V(i.e., assuming or hoping Ex Post Matching
that Vis randomized with respect to
In many situations, the researcher se-
X),
lects a sample after the phenomenon of
2. matching on Vex post (i.e., match- interest has taken place. Often, the re-
ing after X has been chosen by the searcher selects a sample of subjects from
subject or assigned by Nature), and the treatment group and then selects a
3. using covariance analysis to statis- subject from the control group with V
tically estimate and remove the ef- equal or similar to V for each treatment
fect of V. subject. This ex post matching on V is
probably the most commonly used design
The first approach ignores V, and results
for passive observational studies in ac-
can be analyzed with a single-factor anal-
counting.
ysis of variance (ANOVA). The second
For ex post matching, the model as-
approach physically equates the treatment
sumed to determine Y is:
groups with respect to V, and results can
be analyzed using a randomized block de- m
sign. The third approach "statistically" Y, = Bo +BIX,+ E BjMj +eij, (2)
j2
equates the groups, and results can be
analyzed using analysis of covariance where Bo is the overall mean of Yplus the
(ANCOVA). effect of (arbitrarilydesignated) match 1,
Each of these approaches is discussed in Bj (for j> 1) is the differential effect of
turn, along with some of the advantages match compared to match 1, Mj is an in-
and limitations of each for experimenters dicator variable (equal to one if the sam-
and passive observers. ple subject is a member of match j and

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
344 The Accounting Review, April 1986

zero otherwise), and m is the number of ance (ANCOVA) statistically estimates


matches. ' the effect of V on Y and removes it.
For passive observational studies, it is ANCOVA can be viewed as the result
impossible to randomly assign subjects to of projecting observed Y values along
treatments since by definition the subjects the regression line to a common point
have already either "self-selected" into on the V axis, such as V, to yield the
treatmentgroups, or have been so selected conditional distributions as shown in Fig-
by Nature. It is possible that there will be ure 1.1
few or even no matches. For example, all Figure 1 shows that part of the differ-
the firms using straight-line depreciation ence between the marginal distributions
may be small firms and all firms using ac- of Yoand Y1(as estimated using ANOVA)
celerated depreciation may be large. A is due to a larger Vfor the treatmentgroup.
firm's choice of accounting method (or Matching (equation 2) accounts for the
decision to change methods) may merely difference by subtracting B O+1 BjMj
reflect its V value. Figure 1 illustrates
such a possibility in that only the bottom from Y for each subject, and ANCOVA
half of group 1 can be matched with a accounts for the difference by subtracting
member of group 0 due to the difference B0+ B2 Vijfrom Y for each subject. Thus,
in V for the groups. both matching and ANCOVA are seen to
Even experimenters using ex ante mitigate the differential effect of V. Fig-
matching with random assignment of ure 1 also shows that control group sub-
subjects often face the lack-of-matches jects with relatively high Vfor the control
problem for at least some Vsand Zs. Sup- group are matched to subjects with rela-
pose, for example, that a researcherbelieves tively low Vfor the treatment group. For
that an auditor's response in a profession- matched designs, all other potential sample
al task experiment may be related to the subjects must be omitted due to lack of
auditor's professional training (X) after matches.
accounting for his or her mathematical Matching and ANCOVA yield more
abilities ( V). It may be difficult to match efficient (more precise) estimates of the
subjects from different firms based on treatment effect than does ANOVA. In
mathematical abilities. This is because general, however, it is unclear which of
firms may hire and thus train (or students the two will be more precise. This is due to
may choose firms and be trained) on the the fact that while the difference in Voand
basis of mathematical ability.8 V, reduces the precision of ANCOVA,
As will be discussedbelow, the efficiency the reduction in sample size due to lack of
of matching may be less than that for co- matches reduces precision for matching.
varianceanalysis. However, expost match- ' The matches may be by individual subject ("pre-
ing is likely to be superior to covariance cision" or "caliper" matching) as discussed above, or by
analysis when the functional form of the frequency distribution (e.g., equal mean and variance
with respect to V for both groups). A test of equality on V
YlIV relationshipis nonlinearor unknown. is often used as a justification for ignoring V in the
Given that the treatment effect is not cor- statistical analysis.
I An alternative design is to limit all subjects in an
related with V, matching can be used for
experiment to a fixed level of V. This equalizes the effect
any functional form of Y and V (known of V but greatly reduces the generalizability of results
or not) and analyzed using a blocked de- over the range of reasonable V values that might occur.
I The sampling distributions for matching and
sign.
ANCOVA are shown as the same in Figure 1 since the
Covariance Analysis expectations of estimates of the treatment effects are the
same for both. As discussed, however, their standard
A researcher using analysis of covari- errors will differ.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
Kinney 345

It is often difficult to predict which will be the proposed theory implies a large effect
the greater problem. 10 on Yand/or the researcheris clever in de-
Equation 1 and Figure I present a very signing a plan to disentangle the effects of
simple situation even for a single V. For the Vsand Zs. For example, usingANOVA
example, the YI Vrelationship may differ (no matching) in a single test with target
depending on whether X is at level zero or ac=.05, 03=.1, and D=6/a=.5, the re-
one. Furthermore, the occurrence of a quired sample size is 70 for each of the two
given level of Vat time t-1 may have a di- groups. If the researcher has a theory
rect effect on Y at time t but may also yielding a larger 6 that would increase D
affect the level of X at time t which, in to .75, then the sample would be 32 each,
turn, affects Y at t. Thus, there may be and if D is 1.0 then the sample size is 18
two paths by which V affects Y. A com- per group. Alternatively, for D=.5 and
plete approach would include a model of holding 6 constant, if matching is used
the "selection" process by which V affects and the Y., Y, correlation is .25 then
X as well as the direct effects of X and V the required sample is approximately 57
on Y."1 pairs."3 If the Yo, Y1 correlation is .5
(implying a reduction in aof about 18 per-
III. ALPHA, BETA, SAMPLE SIZE
cent), then the required sample is 36 pairs.
AND RESEARCH DESIGN
The four factors and their implications
Planning researchto disentangle X, Vs, for accounting research will be discussed
and Zs involves four related factors. throughtwo subtopics.These are: 1) power,
These are alpha (ce),beta (@),sample size, and 2) prejudice against the null hypoth-
and what will be called the "research de- esis.
sign" factor (denoted D). In a given situa-
tion, setting any three of them sets the Power
fourth. The statistical factors of a (the Consider a researcherwho has a theory
probability of a type I error or incorrectly that the treatment effect (B1) is positive,
rejectinga true null hypothesis of no treat- and who therefore is interested in testing
ment effect), f3(the probability of a type the (null) hypothesis that the true effect
II error or not rejecting the null hypothe- of treatment is less than or equal to zero
sis when, in fact, there is a treatment against the alternative that the true effect
effect), and sample size are well known.
The research design factor is the ratio of '0 ANCOVA will usually be less biased, however (see
two subfactors. Its numerator is the hy- Cook and Campbell [1979, pp. 177-182], and Cochran
[1983, pp. 127-128]).
pothesized magnitude of the (X) treat- " See Cochran's comments on R. A. Fisher's advice to
ment effect (denoted 6), and its de- "Make your theories elaborate. " According to Cochran,
nominator is the standard deviation of Fisher meant that when "constructing a causal hypoth-
esis one should envisage as many different consequences
the residuals in the equation used to esti- of its truth as possible, and plan observational studies to
mate B1 (denoted a). Thus, D = 6/a. The discover whether each of these consequences is found to
numerator depends on the researcher's hold" [Rosenbaum, 1984, p. 43]. This advice is consis-
tent with Boulding's exhortation to develop and test
theory, and the denominator depends on theories that are at the level of the real world.
how the researcher disentangles the Vs 12 The required sample size is:

and Zs and the inherent variability in the n =2[(t., ) 2


2X-2 + 6, 2,-2 (1 /D)]
phenomenon under examination.
See Ostle [1963, p. 553] for a table.
The required sample size is a decreasing '" The required sample size is:
function of a, 3,6 and an increasing func-
tion of a. Therefore, for a given a and 3, n= + to55]) (f aD)]b'
(1963,p
the required sample size will be small if See Ostle [1963, p. 55 1] for a table.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
346 The Accounting Review, April 1986

is greaterthan zero. Assume that a sample tive hypothesis is greater than .5, and
of treatment subjects has been matched power is less than .5. Even if Ho is false
or "paired" on V with control subjects. (i.e., B = 6). and thus the researcher's
Also, based on an assessment of the ap- theory of a positive treatment effect is
propriatesignificance level for the issue at correct, the researcherhas a less than even
hand, the researcherhas set a at .05, and chance of rejecting it!
the researcher has a research design in Suppose that the researcher in panel b
mind. observes a test statistic that is almost sig-
What the researcher may not consider nificant, and the sample estimate of the
at the planning stage is the magnitude of treatment effect is equal to 5. He or she
the hypothesized effect (i.e., a particular then decides to take a follow-up sample.
6 for the alternative hypothesis) and the The follow-up sample is also likely to in-
allowable 3 for that 6. 14 A 6 may not be dicate nonrejection due to its small size
considered since most theories suggest [Tversky and Kahneman, 1971, p. 107].
only the direction of an effect and not its The real culprit, of course, is the low
magnitude, and f3 is not considered be- power of the test. If the low power is anti-
cause no particular 6 is specified. The re- cipated at the planning stage, an attempt
searcher may proceed to testing with little can be made to mitigate its negative effects
consideration of whether the planned test or else abort the project. In general, power
has an adequate chance to reject the null can be increased by 1) increasing the sam-
hypothesis even if it is false. ple size, or 2) increasing the design factor
To illustrate, consider the sampling dis- D by developing better theories (yielding
tributions in Figure 2. For both panels of larger 6) or by making better use of agiven
Figure 2, the left-hand distribution is for sample size and theory by careful atten-
the mean of the paired differences if the tion to the Vs and Zs (yielding smaller a).
null hypothesis (B1=0) is true, and the As a practical matter, improved design
right-hand distribution applies if the par- is often the only alternative in accounting
ticular alternative (B1= 6) is true. Also for research since the size of samples in ac-
both panels, k is the point that yields counting frequently is effectively fixed.
a =.05 or five percent of the area to the For experiments, the pool of available au-
right of the point under the left-hand dis- ditors, accountants, financial statement
tribution (i.e., Ho). In panel a, the re- users, and even students is effectively lim-
search design and sample size yield a ited to fairly small numbers. Subjects' time
sampling distribution with a large area is not free, and the supply is not inexhaust-
(1 - 3)to the right of k under the alterna- ible. For passive observation, the number
tive hypothesis. Thus, there is high proba- of firms for which particular accounting
bility of rejecting Ho when the alternative and other required economic data are
hypothesis is true. In other words, the available may be relatively small. Thus,
power of the test (1 - f) is high. accounting researchers need to be aware
In Figure 2, panel b, 6 is the same as in of a variety of analytical techniques appli-
panel a, but the sampling distributions
4 See Tversky and Kahneman [19711. This is in
are much flatter due either to small sam- contrast to classical or normal distribution theory-based
ple sizes or a large standard deviation due audit sampling where, in addition to setting a to control
to remaining effects of Vsand Zs. Rather the risk of incorrect rejection, the auditor sets i to con-
than the relatively high power test of trol the risk of incorrect acceptance and sets 6 based on
"intolerable" error (materiality). The auditor then
panel a, the researcher faces a low power selects an estimator and calculates the minimum sample
test. At a(=.05, 13for the simple alterna- size subject to the target a, j, and 6.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
Kinney 347

FIGuRE2
SAMPLINGDIsRIuBUTIONSOF d * FORHIGH AND Low POWER TESTS

a. HighPower I I

OI _ _ _ _ _

Rt
b. LowPower

l~~~~~~~~~~~ l.0
l l~~~~~~~~
RejectHo~ Reec
I
l~~~~~~~~~~~~~Rc Ho
lY~o)M l
jlal

cable to a variety of research problems. 15 degreeof owner versusmanagercontrol


Furthermore, for a given research par- as an X that affectedaccountingchoices.
adigm, the problem of low power is likely Laterstudieshave usedthe samevariable
to become more difficult over time. Other as a Vor Z. Absentdevelopmentsthat re-
things equal, as knowledge of the effects structurethe way a particularproblem
of accounting expands, the likely size of is addressed, future researcherswill be
the effect of each new or more refined faced with discoveringnew Xs that have
theory (B1)will tend to have less addition- less potentialexplanatorypower.
al explanatory power. As knowledge ex-
's In debate on the preferability of parametric vs. non-
pands, the best potential Xs are investi- parametric statistics in research, the ability of parametric
gated and become Vsor Zs. For example, methods to accommodate more Vs and Zs through co-
early studies tested hypotheses about the variance analysis is an often overlooked advantage.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
348 The Accounting Review, April 1986

In fact, it may be unreasonableto expect results against the null are "signifi-
that a particulartheory based on account- cant." Especially interesting or in-
ing methods will yield a true differential novative results may be submitted
effect that is very large relative to the vari- on higher than .05 significance (or
ance of Y. How things are accounted for alternatively, the probability at
simply can't be expected to explain a large which the results are significant are
portion of stock price variability or man- reported), but rarely does an editor
agerial or investor behavior. Under some see results with significance levels
conditions, the sample size required to above .15. This prejudice need not
yield reasonable power exceeds the size of exist if not rejecting the null gives
the known population! reasonable credibility to the null."7
A researcher can get some protection 2. Ancillary hypotheses will be ele-
by making a tentative calculation of power vated in the exposition of results.
before investing in expensive data collec- Secondary hypotheses that are sig-
tion or in experimentation. For example, nificant ex post will receive more
passive observers of accounting changes attention than other secondary hy-
and stock returns may be able to make potheses and perhaps the primary
reasonable estimates of the standard devi- hypotheses. Suggestions will be
ation of return residuals and might make made that these results warrant
power estimates for various levels of 6. 16 further study, when in fact one
If the estimated power is inadequate even would expect about one in ten non-
for the maximum 6 that might reasonably sense relationships to be significant
exist, then the research can be redesigned at the .10 level.
or aborted. Experimenters are perhaps 3. Alternative operationalization of
more familiar with prospective power cal- variables or their functional form
culations and frequently use a pilot sam- will be conducted only if "prelimi-
ple to assist with the sample size and re- nary" results are insignificant. The
search design development. extent to which this search activity is
justified is open to debate since
Prejudice Against the Null Hypothesis most theories don't imply a single
A theory usually specifies the direction measurement or functional form.
of the treatment effect and a researcher 4. The search for errors will be asym-
generally sets out to reject hypotheses metric. Outliers that impede re-
based on the assumptionthat the treatment jection of the null hypothesis will
effect is zero or in the opposite direction tend to receive more diligent atten-
from what the theory predicts. The focus tion than those that favor rejecting
on rejecting the null hypothesis is the the null. If significant results are
source of a number of "biases" against
the null that may lead to dysfunctional 16 The choice of 6 is somewhat arbitrary, but in plan-

consequences. Greenwald [1975] lists ning it is useful to consider reasonable or plausible values
for the true treatment effect of X. Alternatively, one
eight such consequences; four that seem might choose the smallest effect that informed persons
most important for accounting research- would agree is empirically "important" and therefore
ers are discussed in order to be better able worth knowing about if it exists, or the largest amount
that one could reasonably expect.
to avoid them. " In classical statistics, not rejecting the null is not
equivalent to accepting the null. However, non-rejection
1. A paper will not be submitted for by a reasonably powerful test or series of tests does
publication consideration unless the increase one's subjective degree of belief in the null.

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
Kinney 349

obtained on the first analysis of a are considered? Are there better ways to
problem, the neophyte researcher account for the effects of Vs and Zs?
may not consider a search for out- What are other Vsand Zs that might have
liers or for other violations of sta- important effects?
tistical assumptions underlying the The same approach can be applied by
analysis. Nonrejection may lead one the student to his or her own dissertation
to consider such explanations and proposal. While the basic development of
to search for programming errors a research proposal is the responsibility of
and data coding errors. the student, there is much to warrantearly
faculty discussion of planned dissertation
V. SUMMARY AND CONCLUSIONS
research. That is, the faculty can evaluate
In this paper we have stressed consider- a proposal by considering the reasonable-
ation of planning for Vsand Zs to be able ness of the theory and the adequacy of
to isolate the effect of X as a potential ex- control of potential Vs and Zs. The
planation of differences in Y. This consid- faculty should be asked: Is the magnitude
eration may allow increased power in tests of the hypothesized effect plausible? Are
and may allow more to be learned from a all importantcompetingexplanationslisted
given sample. Planning for Vs and Zs can and adequately dealt with in the plan?
reduce the risk of not rejecting the null Will the proposed tests likely uncover evi-
hypothesis when it is false. Such planning dence of a difference equal to 6 if it exists?
may also allow a basis to argue that non- Will nonrejection lend credibility to the
rejection of the null hypothesis may sup- null?
port acceptance of the null. That is, if the Faculty approval of planned disserta-
treatment has an important effect, then it tion research reduces the student's risk by
should be revealedby the test. Thus, some- 1) ruling out potential topics that have
thing may be learned whether results are little chance of successful completion, 2)
statistically significant or not. This should gathering the right data on the first at-
increase the objectivity of the researcher, tempt, 3) eliminatingoutcome dependence
since the work is valuable whatever the (thus reducing moral hazard for the stu-
empirical results. dent), and 4) reducing the temptation of
There are at least two ways in which the the student (and faculty) to pursue num-
approach discussed in this paper can be erous tangents that may come to light as
useful to Ph.D. students. One is in evalu- the research progresses.
ating the researchdesign of others, and the
other is in planning the student's disserta- 1 In planning research or evaluating the research of
tion. 18 Students must evaluate the work others, a useful practice is to give early attention to the
of others whether in published articles, purpose of the research through preparation of a three-
short-paragraph abstract, synopsis, or working model of
working papers, or accounting research the research. The first paragraph answers the question
workshops. A student applying the ap- "What is the problem?" The second asks, "Why is it an
proach to the work of others might try to important problem?" and the third, "How will it be
solved?" Alternatively, the questions might be: "What
answer the following questions: What is are you (or the researcher) trying to find out?",
the Y and what is the X? What Vs and Zs "Why?", and "How will it be done?"

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions
350 The Accounting Review, April 1986

REFERENCES
Abdel-khalik, A. R. and B. B. Ajinkya, Empirical Research in Accounting: A Methodological Viewpoint,
Accounting Education Series No. 4 (American Accounting Association, 1979).
Ashton, R. H., Human Information Processing in Accounting, Accounting Research Study, No. 17
(American Accounting Association, 1982).
Ball, R. and G. Foster, "Corporate Financial Reporting: A Methodological Review of Financial
Research," Studies in Current Research Methodologies in Accounting: A Critical Evaluation," Journal
of Accounting Research (Supplement 1982), pp. 161-234.
Boulding, K. E., "General Systems Theory-The Skeleton of Science," Management Science (April 1956),
pp. 197-208.
Cochran, W. G., Planning and Analysis of Observational Studies, edited by L. E. Moses and F. Mosteller
(John Wiley & Sons, Inc. 1983).
Cook, T. D. and D. T. Campbell, Quasi-Experimentation Design & Analysis Issues for Field Settings
(Houghton-Mifflin Company, 1979) especially chapters 3 and 4.
Greenwald, A. G., "Consequences of Prejudice Against the Null Hypothesis," Psychological Bulletin
(January 1975), pp. 1-20.
Lev, B. and J. A. Ohlson, "Market-Based Empirical Research in Accounting: A Review, Interpretation,
and Extension," "Studies in Current Research Methodologies in Accounting: A Critical Evaluation,"
Journal of Accounting Research (Supplement 1982), pp. 249-322.
Libby, R., Accounting and Human Inlormation Processing: Theory and Applications (Prentice-Hall,
Inc., 1981).
Ostle, B., Statistics in Research, 2nd Edition (The Iowa State University Press, 1963).
Rosenbaum, P. R., "From Association to Causation in Observational Studies: The Role of Tests of
Strongly Ignorable Treatment Assignment," Journal of the American Statistical Association (March
1984), pp. 41-48.
Simon, J. L., Basic Research Methods in Social Science: The Art of Empirical Investigation, 2nd Edition
(Random House, Inc., 1978) especially chapters 3, 7, and 11.
Tversky, A. and D. Kahneman, "Belief in the Law of Small Numbers," Psychological Bulletin (August
1971), pp. 105-1 10.
Watts, R. L. and J. L. Zimmerman, Positive Accounting Theory (Prentice-Hall, 1986).

This content downloaded from 169.229.32.138 on Fri, 9 May 2014 16:35:44 PM


All use subject to JSTOR Terms and Conditions

Das könnte Ihnen auch gefallen