Beruflich Dokumente
Kultur Dokumente
STRUCTURAL EQUATION
MODELING (SEM)
CONCEPTS, APPLICATIONS
AND MISCONCEPTIONS
No part of this digital document may be reproduced, stored in a retrieval system or transmitted in any form or
by any means. The publisher has taken reasonable care in the preparation of this digital document, but makes no
expressed or implied warranty of any kind and assumes no responsibility for any errors or omissions. No
liability is assumed for incidental or consequential damages in connection with or arising out of information
contained herein. This digital document is sold with the clear understanding that the publisher is not engaged in
rendering legal, medical or any other professional services.
MATHEMATICS RESEARCH
DEVELOPMENTS
STRUCTURAL EQUATION
MODELING (SEM)
CONCEPTS, APPLICATIONS
AND MISCONCEPTIONS
LARRY RIVERA
EDITOR
New York
Copyright © 2015 by Nova Science Publishers, Inc.
All rights reserved. No part of this book may be reproduced, stored in a retrieval system or transmitted
in any form or by any means: electronic, electrostatic, magnetic, tape, mechanical photocopying,
recording or otherwise without the written permission of the Publisher.
We have partnered with Copyright Clearance Center to make it easy for you to obtain permissions to
reuse content from this publication. Simply navigate to this publication‟s page on Nova‟s website and
locate the “Get Permission” button below the title description. This button is linked directly to the
title‟s permission page on copyright.com. Alternatively, you can visit copyright.com and search by
title, ISBN, or ISSN.
For further questions about using the service on copyright.com, please contact:
Copyright Clearance Center
Phone: +1-(978) 750-8400 Fax: +1-(978) 750-4470 E-mail: info@copyright.com.
Independent verification should be sought for any data, advice or recommendations contained in this book. In
addition, no responsibility is assumed by the publisher for any injury and/or damage to persons or property arising
from any methods, products, instructions, ideas or otherwise contained in this publication.
This publication is designed to provide accurate and authoritative information with regard to the subject matter
covered herein. It is sold with the clear understanding that the Publisher is not engaged in rendering legal or any other
professional services. If legal or any other expert assistance is required, the services of a competent person should be
sought. FROM A DECLARATION OF PARTICIPANTS JOINTLY ADOPTED BY A COMMITTEE OF THE
AMERICAN BAR ASSOCIATION AND A COMMITTEE OF PUBLISHERS.
Additional color graphics may be available in the e-book version of this book.
Preface vii
Chapter 1 The Determinants of Capital Structure Choice for
Chinese Listed Companies Based on Structural
Equation Modeling Approach 1
Xin-Dan Li, Xiang-Nan Feng,
Bin Lu and Xin-Yuan Song
Chapter 2 An Examination of Predictors and Outcomes Related
to School Climate Using Latent Class Analysis 35
Christine DiStefano, Elizabeth Leighton,
Mihaela Ene and Diane M. Monrad
Chapter 3 Assessing Mediation in Simple and Complex Moels 69
Thomas Ledermann and Siegfried Macho
Index 103
PREFACE
Chapter 1
ABSTRACT
This chapter proposes a Bayesian approach based on structural equation
modeling (SEM) to empirically test the determinants of capital structure
choice for the Chinese listed companies. The chapter investigates major
unobservable theoretical attributes identified by capital structure theories
and constructs proxies for these attributes considering specific
institutional settings in China. The findings suggest that some firm-
specific factors relevant to explaining capital structure in developed
economies are also related to the Chinese economy. Unique determinants
of capital structure choice for Chinese listed companies are also
Corresponding author: Xin-Yuan Song is Associate Professor, Department of Statistics, the
Chinese University of Hong Kong, Hong Kong, China, xysong@sta.cuhk.edu.hk.
2 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
1. INTRODUCTION
As one of the most important areas in corporate finance, capital structure
analysis has attracted significant attention in the literature. Modigliani and
Miller (1958) proposed an M-M theory stating that the capital structure does
not affect firms‟ value in the perfect capital market, which is a very restrictive
assumption. Since then, many efforts have been made to relax the assumptions
of the M-M theory. The development of agency theory (Jensen and Mecking,
1976), coupled with thorough research of bankruptcy costs, suggested that
corporations act as if there is a unique, optimal capital structure. The main
competing theories in explaining firms‟ capital structure choice are the static
trade-off hypothesis (Kraus and Litzenberger, 1973) and the pecking order
hypothesis (Myers and Majluf, 1984). Static trade-off models assume the
optimal capital structure does exist, while the pecking order hypothesis states
that there is no well-defined target debt ratio based on the information
asymmetry. Diverse signaling models have also been proposed to address the
asymmetric information problems (Ross, 1977).
Over the past several decades, numerous research have been conducted to
investigate the determinants of capital structure choice. Bradley et al. (1984)
adopted cross-sectional, firm-specific data to test for the existence of an
optimal capital structure by considering some influential factors. Titman and
Wessels (1988) analyzed the impact of unobservable attributes on the choice
of corporate leverages based on a factor-analytic technique. Homaifar et al.
(1994) applied a general autoregressive distributed lag model to the US data to
estimate the long-run steady state determinants of firms‟ capital structure.
Rajan and Zingales (1995) compared the capital structure decisions across G-7
countries. Wald (1999) examined the factors correlated with the capital
structure in France, Germany, Japan, the United Kingdom, and the United
States. Ozkan (2001) investigated the determinants of capital structure choice
for 390 companies in UK and the role of the adjustment process. Drobetz and
Wanzenriedb (2006) studied 90 Swiss firms‟ data from 1991 to 2001 to
analyze the impact of firm-specific characteristics and macroeconomic factors
The Determinants of Capital Structure Choice for Chinese … 3
on firms‟ speed of adjustment to the target debt ratio. Feidakis and Rovolis
(2007) examined whether there are any robust determinants that affect the
capital structure of the large listed construction firms in the European Union.
Chang et al. (2009) applied a Multiple Indicators and Multiple Causes
(MIMIC) model, with refined indicators, to a pooled sample for the period
1988 to 2003 and found more convincing results than those obtained by
Titman and Wessels (1988). Kayo and Kimura (2011) proposed the use of
hierarchical linear model and random effect model for analyzing the influence
of time-, firm-, industry-, and country-level determinants of capital structure.
Öztekin (2014) analyzed a large sample of firms drawn from 37 countries to
examine the international determinants of capital structure.
Although the major research of the determinants of capital structure
choice have been focused on the developed economies, the works are
becoming noticeable in developing countries and transitional economies in
recent years. Booth et al. (2001) analyzed capital structure choices of firms in
ten developing countries. Deesomsak et al. (2004) investigated the
determinants of capital structure of firms operating in four Asia Pacific
countries. Agarwal and Mohtadi (2004) studied the effect of financial market
development on the financing choice of firms in developing countries using a
dynamic panel approach with aggregate firm level data. Fattouh et al. (2005)
examined the capital structure of listed firms in South Korea from 1992 to
2001. Nivorozhkin (2005) presented evidence on the actual and target capital
structures of firms in five EU accession countries of Central and Eastern
Europe and the former Soviet Union. De Haas and Peeters (2006) examined
the capital structure dynamics of firms in Central and Eastern Europe (CEE).
Nguyen and Ramachandran (2006) identified the determinants influencing the
capital structure of small and medium size enterprises (SMEs) in Vietnam.
Manos et al. (2007) investigated the effect of group affiliation on firm‟s capital
structure decision with data from the Indian economy. Delcoure (2007) also
considered the capital structure determinants in emerging CEE economies to
test whether the traditional theories are still useful. Kim and Berger (2008)
studied the determinants of the capital structure of large companies
headquartered in the United States and the Republic of Korea. Fan et al.
(2012) investigated the influence of the institutional environment on capital
structure and debt maturity choices for firms located in 39 developed and
developing countries. Öztekin and Flannery (2012) showed that certain
associations between institutional arrangements and leverage adjustment
speeds are consistent with dynamic trade-off theory of capital structure choice
with a dynamic panel data set spans 37 developed and developing counties.
4 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
China is now the largest developing economy in the world which rapidly
expanding capital market attracts many international investors. As a transition
economy, however, China has distinct institutional features. Therefore, it is
interesting to test whether the capital structure determinants of Chinese firms
differ from those in the western countries. Consequently, studying the effect of
Chinese unique institutional features on its listed companies‟ financing is of
great importance.
This study mainly extends the available empirical works in two ways.
Firstly, we investigate more attributes that have been tested for the firms in
developed countries but not yet for those in China. Secondly, the attributes
identified as the determinants of firms‟ capital structure choice are often not
directly observable; instead they are reflected by multiple indicators.
Therefore, the structural equation modeling (SEM) technique is adopted to
analyze the interrelationships among the latent attributes. Compared to an
ordinary regression analysis, the SEM approach has the following appealing
features. First, through grouping multiple indicators into a few latent
attributes, the SEM reduces the model dimension significantly. Second, based
on the condensed information, the SEM provides clearer and simpler model
interpretation. Finally, by assigning data-driven weights (factor loadings) to
highly correlated predictors via factor analysis, the SEM avoids the
multicollinearity problem encountered in the ordinary regression analysis (see
Section 5.3).
The rest of the chapter is organized as follows. Section 2 presents a brief
discussion of the specific institutional settings in China. Section 3 discusses
the attributes that are identified as the determinants of firms‟ capital structure
choice. Section 4 describes the data set and the methodology. Section 5
presents the empirical results. Section 6 concludes the major findings.
B. Assets Structure
Assets of a firm include tangible and intangible assets, both of which are
important factors relevant to the capital structure. However, the effects of
tangible and intangible assets on the capital structure are diverse. A firm with
more tangible assets is expected to possess a larger collateral value, and is
capable to access more debt. This relation is not obvious for intangible assets.
8 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
Previous empirical studies argued that the ratio of fixed to total assets is
an essential determinant of leverage (Titman and Wessels, 1988; Rajan and
Zingales, 1995; Fama and French, 2002). The trade-off theory argues that the
revaluation of assets is less when a company goes bankrupt, which reveals a
positive relationship between tangible assets and leverage. The pecking order
theory (Myers and Majluf, 1984) demonstrates that firms with assets to be
collateral may tend to issue more debt to take advantage of the information
asymmetry. Indicators of assets structure include the ratio of intangible assets
to total assets (INTG) and the ratio of inventory plus fixed assets to total assets
(TANG).
Although some evidence (Homaifar, et al., 1994; Kim and Berger, 2008)
showed that non-debt tax shields are weakly related to firms‟ capital structure
choice, many works support the importance of the non-debt tax shields.
DeAngelo and Masulis (1980) presented that non-debt tax shields are
substitutes for the tax benefits of debt financing. Following studies (e.g., Wald,
1999; Ozkan, 2001; Korajczyk and Levy, 2003; Sogorb-Mira, 2005) argued
that a firm possessing larger non-debt tax shields is expected to use less debt.
Delcoure (2007) further found strong relations between the total, long-term,
and short-term debt ratio and non-debt tax shields. Indicators of non-debt tax
shields include the depreciation over total assets (DEPR) and a direct estimate
of non-debt tax shields over total assets (NDTS).
D. Growth Opportunity
opportunity include the percentage change in total assets (TAPC) and the
percentage change in prime operating revenue (PORPC).
E. Size
Based on the trade-off model, large firms are expected to have a high debt
capacity and be able to reduce transaction costs associated with long-term debt
issuance. Rajan and Zingales (1995) further argued that larger firms tend to
disclose more information than smaller ones, which makes it easier for larger
firms to have access to loans. However, pecking-order hypothesis suggests that
the complexity of the large firms increases the cost of information asymmetry,
and thus makes their debt financing more difficult. Indicators of size include
the natural logarithm of total assets (LTA) and the natural logarithm of prime
operating revenue (LPOR).
F. Profitability
From the pecking order theory (Myers and Majluf, 1984), internal
financing is favored by firms compared with debt financing. It can be expected
that profitable firms tend to have more retained earnings, and thus will use less
debt financing. Alternatively, tax effects predicted by the trade-off model
suggest that profitable firms should borrow more, given that they have greater
needs to shield income from corporate tax. On the other hand, the agency cost
theory describes that an increase in the debt ratio of profitable companies
signals the quality of financial management. Therefore, managers will attempt
to reduce the agency cost of the equity by increasing the company‟s debt ratio.
We use return on equity (ROE), return on assets (ROA), earnings per share
(EPS), sales gross profit rate (NGR), and net profit margin on sales (NSR) as
the indicators of profitability.
G. Liquidity
The pecking order theory pointed out that managers can manipulate liquid
assets in favor of shareholders against the interest of debt holders, which
increases the agency costs of debt. However, if firms generate substantial free
cash flows, shareholders would be motivated to cooperate with banks or
10 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
H. Ownership Structure
I. Uniqueness
Titman (1984) suggested that a firm with unique products might require
its customers, suppliers and workers to undertake investments that lose value if
the firm goes into liquidation. Under this setting, lower leverage commits the
firm to a liquidation policy that takes into account the effects on its customers,
suppliers and workers. Further, customers, suppliers and workers may not be
willing to deal with a highly levered firm. For these reasons, uniqueness is
expected to be negatively associated with debt ratios. Considering the
limitation of the data, we select operating expenses ratio (OER) as the
indicator of uniqueness.
J. Operational Risk
K. Signal
Firm-
specific Accounting Indicator Measurement
Factors
Capital Total Debt Ratio (TDR) Ratio of Total Debt to Total
Structure Assets
( ) Long-term Debt Ratio Ratio of Long-term Debt to Total
(LDR) Assets
Short-term Debt Ratio Ratio of Short-term Debt to Total
(SDR) Assets
Assets Tangible Assets (TANG) (Fixed Assets + Inventory)/ Total
Structure assets
( 1 ) Intangible Assets (INTG) Intangible Assets/Total Assets
Non-debt Depreciation (DEPR) Depreciation/Total Assets
Tax Shields Non-debt Tax Shields (Prime Operating Revenue -
( 2 ) (NDTS) Interest Payments - Income
Tax)/Total Assets
Growth ( 3 ) Percentage Change in (Ending Balance of Total Asset –
Total Asset (TAPC) Beginning Balance of Total
Asset)/ Beginning Balance of
Total Asset
Percentage Change in (Ending Balance of Prime
Prime Operating Revenue Operating Revenue – Beginning
(PORPC) Balance of Prime Operating
Revenue)/Beginning Balance of
Prime Operating Revenue
Size ( 4 ) LTA Natural logarithm of Total Assets
LPOR Natural logarithm of Prime
Operating Revenue
Profitability ROE Net Profit/(Total Assets - Total
( 5 ) Debt)
ROA Net Profit/Total Assets
Net Profit Margin on Net Profit/Prime Operating
Sales (NSR) Revenue
Sales Gross Profit Sale Revenue/(Sale Revenue –
Rate(NGR) Cost of Sales)
EPS Net Profit/Issued and Outstanding
Shares
The Determinants of Capital Structure Choice for Chinese … 13
Firm-
specific Accounting Indicator Measurement
Factors
Liquidity Liquidity Ratio (LR) Current Assets / Current
( 6 ) Liabilities
Quick Ratio(QR) (Current Assets - Inventory)/
Current Liabilities
Ownership STATEP State Shares/ Issued and
Structure Outstanding Shares
( 7 ) CORPP Legal Persons Shares/ Issued and
Outstanding Shares
CIRA Circulation Shares/ Issued and
Outstanding Shares
Uniqueness Operating Expenses Ratio Operating Expenses/Prime
( x1 ) (OER) Operating Revenue
Operation Prime Operating Revenue Standard Deviation of Prime
Risk ( x2 ) Ratio (SDPOR) Operating Revenue / Average of
Prime Operating Revenue
Signal ( x3 ) Dividend Payout Ratio Dividends Per Share/Earnings Per
(DPR) Share
Tax Shields Income Tax Rate (ITR) Income Tax/Total Profit
( x4 )
L. Tax Shields
Modigliani and Miller (1963) indicated that the optimal leverage ratio of
firms is determined by the trade-off between the tax shield benefit of debt and
the higher bankruptcy costs implied by the higher degree of corporate
indebtedness. Some following studies fail to find plausible or significant tax
effects on financing behaviors, which is partially explained by the theory that
the debt-equity ratio is the cumulative result of the separate decisions during a
period (Mackie-Mason, 1990). The indicator of tax shields is the income tax
rate (ITR). The above-mentioned indicators and variables are listed in Table 1.
14 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
In this study, the data were collected from the annual reports of Chinese
public-listed companies on Shanghai and Shenzhen stock exchanges over the
period 1998-2006, provided by the Wind Database. The year 2006 was chosen
because a new accounting system for business enterprises became operational
on January 1, 2007. To avoid some confounding effects, all listed companies
were required to date back to the period before 2006. In addition, we imposed
several criteria to obtain our data set: (1) all firms were set up before 1998 and
not in the financial industry. (2) We excluded the listed firms that were
suspended or withdrawn from the stock market. (3) We disregarded firms that
did not have a complete record on the variables required in our analysis. In
total, we procured 852 firms for the analysis, of which 489 were in
manufacturing and the rest in non-manufacturing that reflected the current
situation of industry of China. The distribution of industrial sectors is listed in
Table 2.
4.2. Methodology
y , (1)
X , (2)
The Proportion
No. of
Industry Classification of the
Firms
Total Sample
Communication and Cultural 6 0.70
Mining and Quarrying 8 0.94
Construction 11 1.29
Transportation and Warehousing 16 1.88
Agriculture/Forestry/Husbandry/Fishing 16 1.88
Social Services 25 2.93
Production of Electric,Coal,Gas and Water
29 3.40
Supplying
Information Technology 54 6.34
Real Estate 59 6.92
Comprehensive 64 7.51
Wholesale and Retail 75 8.80
Manufacturing 489 57.39
Total 852 100
1 2 3 4 5 6 7
TANG 1 0 0 0 0 0 0
INTG 2,1 0 0 0 0 0 0
DEPR 0 1 0 0 0 0 0
NDTS 0 4, 2 0 0 0 0 0
PORPC 0 0 1 0 0 0 0
TAPC 0 0 6 , 3 0 0 0 0
LTA 0 0 0 1 0 0 0
LPOR 0 0 0 8, 4 0 0 0
ROA 0 0 0 0 1 0 0
ROE 0 0 0 0 10,5 0 0
NSR 0 0 0 0 11,5 0 0
NGR 0 0 0 0 12,5 0 0
EPS 0 0 0 0 13,5 0 0
18 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
Table 3. (Continued)
1 2 3 4 5 6 7
LR 0 0 0 0 0 1 0
QR 0 0 0 0 15, 6 0
STATEP 0 0 0 0 0 0 1
CORPP 0 0 0 0 0 0 17, 7
CIRA 0 0 0 0 0 0 18,7
5. EMPIRICAL ANALYSIS
The descriptive statistics of the debt ratios and indicators are reported in
Table 4. The debt ratios and indicators were averaged from 1998 through 2006
to reduce the noise. As it takes time for firms to move toward the target level,
we adopted the average of debt levels to reduce the effect of this adjusting
process.
1
China Securities Regulatory Commission.
20 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
This makes the Chinese managers of state-owned firms more risk-averse than
their western counterparts, which may partially explain the low debt level.
To verify this finding, we performed a formal test described as follows.
Given that the data in developed countries are difficult to obtain and the
management incentive system in Chinese private firms is relatively similar to
that in the developed world, we divided our samples into two groups: state-
owned firms and private firms. A two-sample t-test was performed to
determine whether the average debt levels of the two types of firms differ. We
find (i) no significant difference between the long-term debt ratios (LDR) in
the two types of firms; and (ii) the total and short-term debt ratios in private
firms are significantly (at 1% level) higher than those in state-owned firms.
This result may be explained by the industry difference between state-owned
and private firms as well as provide evidence for the statement that the
managers of state-owned firms are more risk-averse.
Variable
TANG INTG DEPR NDTS TAPC PORPC LTA LPOR ROE
1 1.654 1 2.434 1 -0.110 1 1.155 1
- (4.504) - (7.356) - (-1.142) - (39.406) -
*** *** ***
ROA NSR NGR EPS LR QR STATEP CORPP CIRA
3.883 1.073 1.031 3.749 1 1.003 1 -0.962 -0.148
(9.062) (5.726) (5.585) (9.040) - (75.131) - (-22.542) (-2.792)
*** *** *** *** *** *** ***
Residual Variance
TANG INTG DEPR NDTS TAPC PORPC LTA LPOR ROE
0.976 0.882 0.977 0.603 0.817 0.989 0.286 0.075 0.937
(20.423) (16.032) (20.247) (11.884) (19.599) (20.769) (17.177) (9.653) (20.723)
*** *** *** *** *** *** *** *** ***
ROA NSR NGR EPS LR QR STATEP CORPP CIRA
0.143 0.926 0.930 0.201 0.066 0.065 0.165 0.238 0.972
(10.007) (20.594) (20.662) (12.656) (12.548) (12.440) (5.626) (7.869) (20.309)
*** *** *** *** *** *** *** *** ***
Note: t statistics are listed in parenthesis. *** stand for statistically significant at the 1%.
Table 7. Estimates of Structural Coefficients
Debt Attributes
Measures 1 2 3 4 5 6 7 x1 x2 x3 x4
TDR 0.540 2.684 3.251 -1.003 -4.419 -0.937 0.348 0.019 0.147 0.027 -0.011
(0.701) (4.397) (9.096) (-4.197) (-6.826) (-6.771) (2.499) (0.675) (5.416) (0.912) (-0.411)
*** *** *** *** *** ** ***
LDR 3.491 -1.002 3.410 0.154 -3.397 -0.528 0.465 -0.039 0.107 0.081 -0.007
(4.173) (- (7.283) (0.495) (-5.072) (-2.602) (2.850) (-1.166) (3.156) (2.287) (-0.201)
*** 1.161) *** *** ** *** *** **
SDR -1.023 3.804 2.368 -1.315 -3.818 -0.903 0.205 0.043 0.130 -0.006 -0.010
(-1.370) (6.593) (6.576) (-6.516) (-6.646) (-6.624) (2.638) (1.627) (5.076) (- (-0.424)
*** *** *** *** *** *** *** 0.204)
Note: t statistics are in parenthesis. ***, **, * stand for statistically significant at the 1%, 5%, 10%.
24 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
Types of Attributes
Leverages 1 3 4 7
Company 2 5 6 x1 x2 x3 x4
State- TDR 0.024 1.800 1.509 -0.484 -2.113 -0.595 0.073 0.062 0.118 -0.033 0.074
owned (0.046) (5.458) (5.224) (-3.663) (-6.395) (-4.876) (0.937) (3.023) (5.599) (-1.686) (1.180)
*** *** *** *** *** *** *** *
LDR 0.771 -1.359 0.987 0.333 -0.783 -0.280 0.068 -0.051 0.107 0.035 0.095
(1.700) (-3.769) (4.366) (2.219) (-2.220) (-2.229) (0.820) (-2.003) (2.759) (1.483) (1.260)
* *** *** ** ** ** ** *** **
SDR -0.344 2.883 0.831 -0.760 -2.233 -0.601 0.057 0.102 0.112 -0.057 0.045
(-0.469) (6.846) (3.538) (-5.279) (-5.985) (-3.950) (0.670) (4.357) (4.724) (-2.522) (0.627)
*** *** *** *** *** *** *** **
Private TDR 0.046 1.766 5.226 -0.599 -6.567 -0.589 0.164 0.047 0.086 0.134 -0.024
(1.334) (1.693) (9.291) (-1.563) (-6.120) (-2.516) (0.339) (0.806) (1.378) (1.626) (-
* *** *** ** * 0.652)
LDR -2.911 5.703 4.713 -0.619 -2.773 -0.058 -0.834 0.044 0.113 0.138 -0.023
(-1.734) (5.558) (4.216) (-1.030) (-2.114) (-0.166) (-1.361) (0.592) (1.362) (1.311) (-
* *** *** ** 0.458)
SDR 2.122 -0.581 4.177 -0.440 -6.774 -0.700 0.607 0.036 0.052 0.099 -0.019
(1.597) (-0.593) (7.342) (-1.368) (-7.500) (-3.696) (1.504) (0.724) (0.958) (1.364) (-
*** *** *** 0.596)
Note: t statistics are in parenthesis. ***, **, * stand for statistically significant at the 1%, 5%, 10%.
28 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
The effects of non-debt tax shields ( 2 ) on long-term debt ratio of the two
groups have reverse directions. The negative impact on state-owned firms is in
accordance with the findings in many studies (e.g., Wald, 1999; Ozkan, 2001;
Korajczyk and Levy, 2003; Sogorb-Mira, 2005). The positive impact on
private firms is also reasonable and confirmed (e.g., Bradley et al., 1984;
Delcoure, 2007). We also noticed that the relationship between non-debt tax
shield and short-term debt ratio is positively significant in SF but insignificant
in PF.
Firm size ( 4 ) has no significant impact on debt ratios for private firms,
which can be attributed to their small size. The relationships between firm size
and total and short-term debt ratios for state-owned firms are the same as those
in Section 5.2. Notably, firm size has a positive impact on long-term debt ratio
for the state-owned firms, which supports the asymmetric information theory.
While in the previous analysis, this positive impact is insignificant.
Government policies that support large state-owned companies to use debt also
contribute to the positive relationship.
The effects of uniqueness(x1) on debt ratios are significant in the SF group
but insignificant in the PF group because most of the companies in
monopolistic industries are state-owned. Furthermore, uniqueness in state-
owned firms is positively related to total and short-term debt ratios, and
negatively related to long-term debt ratio. This result confirms Titman and
Wessels (1988), which states that monopolistic companies can easily obtain
short-term loans.
The effects of operational risk (x2) on debt ratios are positively significant
in the SF group but insignificant in the PF group. This confirms our
explanation in Section 5.2, where we stated that the size of state-owned
companies is usually very large. If these companies go bankrupt, huge social
costs will be generated. Thus, the government usually implements effective
measures to help these companies survive bankruptcy. Consequently, state-
owned banks are more willing to provide loans to state-owned companies even
though the operational risk of these companies may be high.
Dividend payout ratio (x3) has negative effects on total and short-term
debt ratios in state-owned firms. This means that if the state-owned firms are
in good financial standing, they will have enough retained surplus to sustain
their operations. For the private firms, however, dividend payout ratio is
positively related to total debt ratio, indicating that dividends are used as a
signal of good financial condition to investors. With positive signals, private
firms can easily access debt financing.
The Determinants of Capital Structure Choice for Chinese Listed … 29
Finally, similar to the preceding result, tax shields (x4) have no impact on
debt ratios in both groups. In conclusion, with the effects of unique Chinese
institutional settings and industry difference between Chinese private and
state-owned firms, some similarities and differences in the influential patterns
of the determinants on capital structure choice are identified.
CONCLUSION
China has special characteristics including the imperfect capital market
and banking system, poorly specified property rights and laws, and
institutional uncertainty. In this chapter, we employed a structural equation
model with Bayesian approach to analyze the determinants of capital structure
choice for Chinese listed companies. We find a remarkable difference between
the capital structure of firms in China and developed countries, which lies in
the Chinese firms‟ low overall debt levels with a small portion of the long-
term debt. Three possible reasons have been identified. First, constraints of
debt financing in China, especially for the long-term loans, are rather
restrictive. Second, Chinese listed companies use equity more frequently than
debt in raising long-term investment capital. Third, compare with their
counterparts in developed countries, Chinese firm managers are more risk-
averse, and tend to take less debt.
The results of this empirical study show that some insights from modern
finance theories are applicable to China, especially the pecking order theory.
However, there are also some unique patterns of the determinants of capital
structure choice for Chinese listed firms. First, the most influential attributes
of the capital structure choice are growth opportunities, profitability, and
liquidity. Profitability and liquidity of the firms have negative impacts on the
three debt ratios. This supports the pecking order theory. Growth has positive
impacts on debt ratios, which verifies the agency theory. Second, asset
structure has no significant effect on total debt ratio, but has significantly
positive effect on long-term debt ratio, which agrees with the pecking order
theory. Third, leverages, as measured by short-term debt and total debt ratios,
increase with non-debt tax shield. Fourth, firm size has negative impacts on
short-term debt and total debt ratios, which agrees with the pecking order
theory. Fifth, the ownership structure significantly affects the firms‟ capital
structure choice. This unique pattern is consistent with the special
characteristics of China. Sixth, leverages increase with operational risk, which
is inconsistent with the pecking order theory. This unique feature is related to
30 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
ACKNOWLEDGMENT
The research was supported by the NSFC 11471277, and 4053087 from
the Direct Grant of the Chinese University of Hong Kong.
REFERENCES
Agarwal, S. & Mohtadi, H. (2004). Financial Markets and the Financing
Choice of Firms: Evidence from Developing Countries. Global Finance
Journal, 15(1), 57-70.
Bayarri, M. J. & Berger, J. O. (2000). P Values for Composite Null Models.
Journal of the American Statistical Association, 95, 1127-1142.
Belsley, D. A., Kuh, E. & Welsch, R. E. (1980). Regression Diagnostics. New
York: Wiley.
The Determinants of Capital Structure Choice for Chinese Listed … 31
Berkman, H., Cole, R. & Fu, J. (2002). From State to State: Improving
Corporate Governance When the Government is a Large Block Holder.
University of Auckland, New Zealand, Working Paper.
Bevan, A. A. & Danbolt, J. (2002). Capital Structure and Its Determinants in
the United Kingdom–a Decompositional Analysis. Applied Financial
Economics, 12(3), 159-170.
Booth, L., Aivazian, V., Demirguc-Kunt, A. & Maksimovic, V. (2001).
Capital Structures in Developing countries. Journal of Finance, 56(1), 87-
130.
Bradley, M., Jarrell, G. A. & Kim, E. H. (1984). On the Existence of an
Optimal Capital Structure: Theory and Evidence. Journal of Finance,
39(3), 857-878.
Chang, C. F., Lee, A. C. & Lee, C. F. (2009). Determinants of Capital
Structure Choice: a Structural Equation Modeling Approach. The
Quarterly Review of Economics and Finance, 49(2), 197-213.
Chen, J. J. (2004). Determinants of Capital Structure of Chinese Listed
Companies. Journal of Business Research, 57(12), 1341-1351.
DeAngelo, H. & Masulis, R. (1980). Optimal Capital Structure under
Corporate and Personal Taxation. Journal of Financial Economics, 8(1),
3-29.
Deesomsak, R., Paudyal, K. & Pescetto, G. (2004). The Determinants of
Capital Structure: Evidence from the Asia Pacific Region. Journal of
Multinational Financial Management, 14(4, 5), 387-405.
Delcoure, N. (2007). The Determinants of Capital Structure in Transitional
Economies. International Review of Economics and Finance, 16(3), 400-
415.
De Haas, R. & Peeters, M. (2006). The Dynamic Adjustment towards Target
Capital Structures of Firms in Transition Economies. Economics of
Transition, 14(1), 133-169.
Drobetz, W. & Wanzenried, G. (2006). What Determines the Speed of
Adjustment to the Target Capital Structure?. Applied Financial
Economics, 16(13), 941-958.
Fama, E. & French, K. (2002). Testing Trade-off and Pecking Order
Predictions about Dividends and Debt. Review of Financial Studies, 15(1),
1-33.
Fan, J. P., Titman, S. & Twite, G. (2012). An International Comparison of
Capital Structure and Debt Maturity Choices. Journal of Financial and
Quantitative Analysis, 47(1), 23-56.
32 Xin-Dan Li, Xiang-Nan Feng, Bin Lu et al.
Chapter 2
AN EXAMINATION OF PREDICTORS
AND OUTCOMES RELATED TO SCHOOL
CLIMATE USING LATENT CLASS ANALYSIS
ABSTRACT
A favorable school climate provides the structure within which students,
teachers, administrators, and parents function cooperatively and
constructively. Measures of school climate, however, have received only
passing interest from policy makers as critical elements in accountability
reporting. This study used a state-wide dataset of climate ratings from
610 elementary schools and considered multidimensional information
from both teachers and students to produce latent classes of school
climate. Two variables, school size and a school’s poverty index, were
used as covariates when creating latent classes. In addition, two measures
of school performance were examined as distal outcomes. The study
identified four classes, where classes were distinguished based upon
school climate scores. Differences in outcome variables and covariates
were observed across the classes. The information may be used by school
Correspondence: 138 Wardlaw Hall, Columbia, SC 29208, distefan@mailbox.sc.edu, 803-777-
4362
36 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
INTRODUCTION
Declining aptitude test results during the 1970s and 1980s, combined with
unflattering international comparisons, led legislators across the United States
to enact far reaching educational accountability measures for public schools.
Perhaps the most controversial, No Child Left Behind (NCLB) Act of 2001,
set a goal for all children to demonstrate achievement at least equal to their
grade level by the year 2014. While the merits of NCLB and other high stakes
accountability systems have been heatedly argued, there is no doubt that in
every state there has been an intense focus on academic performance of both
the students and the school.
Along with a focus on academic performance, the “school report card” or
“school profile” has become ubiquitous in the accountability movement.
Report cards, required under the provisions of NCLB, most typically include
mandated information on student achievement, teacher qualifications,
attendance, and other variables that provide descriptive information about the
school and its programs.
All the states have some form of a school-level reporting system
accessible over the world-wide web, providing a dizzying array of information
and data. Student attendance (and dropout data for secondary schools), student
behavior indices (e.g., incidents of tardiness, cutting class, and disruptive or
criminal acts), and teacher qualifications are customarily reported. Measures
of school climate, however, have received only passing interest from policy
makers as critical elements in accountability reporting.
Too often, the importance of school climate as a critical contextual factor
in which teachers teach and students learn has tended to fade into the
background, a casualty of other "priorities." And yet, among the preconditions
for school success, few rival the motivation to teach and the motivation to
learn: Do students wish to attend school and engage in learning activities? Do
teachers want to return to the same school next year? Do parents desire to
become involved with their child's school programs? The answers to these
questions very often hinge on the levels of support, challenge, collaboration,
and partnership provided to them by the school: in short, its climate.
An Examination of Predictors and Outcomes … 37
relationships (Brown & Medway, 2007). Teacher and staff perceptions were
pivotal in measuring school climate in early research, however, there has been
an increasing interest in examining students‟ perceptions of school climate
(e.g., Koth, Bradshaw, & Leaf, 2008; Way, Reddy, & Rhodes, 2007).
Previous research suggested the existence of a relationship between school
climate and school poverty level, with higher levels of poverty being
associated with a less positive school climate (Bernstein, 1992; “School
Climate, Discipline, and Safety”, 2013). In addition, many researchers have
noted the negative impact of poverty on educational outcomes such as
academic achievement (Sirin, 2005; Malecki & Demaray, 2006; Monrad et al.,
2008; Hopson & Lee, 2011), behavior problems (Hopson & Lee, 2011),
dropout rates (Cataldi, Laird, & Kewal-Ramani, 2009), and graduation rates
(Monrad et al., 2008). More specifically, these studies suggested that higher
levels of poverty are associated with lower grades and overall GPA, lower
scores on standardized tests, lower graduation rates, as well as higher rates of
problem behavior and dropout rates. Furthermore, results showed a strong
negative relationship between school poverty and school absolute value which
is the basis for determining school absolute rating (Monrad et al., 2008).
Larger Scale Studies of School Climate. Most studies involving school
climate administer a survey to participants at a smaller level, such as one
school or one district, and identify factors of climate that are important to
consider. See Sackney (1998) for a comprehensive review of factors identified
with previous school climate studies. School climate, as a factor to increase
student achievement, has been receiving increased attention in the school
improvement literature. The Consortium on Chicago School Research (CCSR)
used information from principals, teachers, and students across over 200
schools to identify “five essential supports for school improvement” (Sebring
et al., 2006). The CCSR found the important factors to be: leadership,
professional capacity (e.g., knowledge, skills, and disposition of faculty),
parent-community ties, climate, and instruction. To gain a greater
understanding of the impact of the five supports on school achievement, the
CCSR examined the relationship between the five factors and student
achievement as measured by a standardized test. The findings showed that
schools strong in most (e.g., 3 to 5) essential areas were up to 10 times likely
to make gains in both reading and mathematics standardized test scores on the
Iowa Test of Basic Skills (grades 3-8). Sebring et al. (2006) also found that
improvements in the essential supports also led to improved achievement.
An earlier study of elementary schools in Chicago also highlighted the
importance of positive school climate characterized by mutual trust and
40 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
respect. According to Bryk and Schneider (2002), schools with a high degree
of “relational trust” between administrators, teachers, and parents are far more
likely to make the kinds of changes needed to improve student achievement
than schools where relationships are poor. Bryk and Schneider compared 100
schools that made the greatest improvement on achievement tests (reading and
mathematics) between 1991 and 1996 with 100 schools that made little or no
improvement. They discovered that schools with high levels of trust at the
beginning of reform efforts had a 1 in 2 chance of making significant
improvements in reading and math achievement, while schools with low levels
of trust had a 1 in 7 chance of making achievement gains. Among the schools
with initially low levels of trust, only those schools where trust was
strengthened over the course of reform efforts showed achievement gains. No
school that continued to have low levels of relational trust improved student
achievement levels to any appreciable degree.
There have been few studies that have investigated school climate on a
state or national level. Using exploratory factor analysis (EFA) techniques
with the California School Climate and Safety Survey and 7,524 students in
grades 6 to 12, Furlong et al., (2005) identified two broad categories of
climate: school climate (support from teachers, enforcement of school rules)
and school safety (perceptions of safety and gang activity). However, EFA
techniques were conducted within each dimension separately rather than
across the survey.
Also, Greenberg (2004) used a national dataset for 4th, 8th, and 12th graders
to determine how NAEP mathematics scores were affected by school climate.
Using EFA, three components of school climate were identified: student
behavior, parental involvement, and school morale. Further, regression
analyses showed that NAEP mathematics scores were increased as climate
scores increased, even when school characteristics (e.g., poverty, urbanicity,
type of school, school size) were controlled.. While this study provided an
investigation of the relationship between student achievement and school
climate, only mathematics was studied.
The studies by Greenberg (2004), Furlong et al. (2005), and Sebring et al.
(2006) recognize the dimensional nature of school climate and its relationship
to achievement. Compared with other barriers which cannot be controlled by
schools, such as high child poverty, previous work has supported the notion
that school climate is not a fixed school condition and that climate can be
changed (Greenberg, 2004), potentially affecting accountability ratings.
Considering the benefits, both socially and academically, of a positive school
climate, it would be of interest for states or districts to group schools based
An Examination of Predictors and Outcomes … 41
upon the level of school climate and to intervene for those schools suffering
from negative climate.
South Carolina is currently one of only a few states in the country that
includes climate data from surveys of students, teachers, and/or parents on
their school report cards. South Carolina‟s report card was developed in
response to requirements of the state‟s Education Accountability Act of 1998
(SC Code of Laws, Section 2, Chapter 18, Title 59). The specific variables and
data elements were selected by the General Assembly‟s Education Oversight
Committee working in collaboration with the State Department of Education
and the State Board of Education. The inclusion of school climate data from
“evaluations of the school by parents, teachers, and students” in the school,
district, and state report cards is a specific requirement of the state‟s
accountability legislation (SC Code of Law, 59-18-900 (D)). School climate
data in South Carolina is collected annually from questionnaires administered
to parents, teachers, and students.
Using a state-wide database of both teacher and student responses, cluster
analysis was used to identify groups of schools related to climate (DiStefano et
al., 2007). These groups were replicated over a two-year period. The authors
identified four categories of schools, where schools differed in the degree to
which they had positive school climate. In addition, schools were differentially
related to report card outcomes, where schools with the most positive average
climate ratings also showed the most positive report card factors, such as
higher standardized test scores, lower teacher turnover, higher student
attendance, and higher AYP scores. Also, schools within each cluster had
varying levels of poverty, showing that low-poverty schools do not necessarily
have poor school climate.
Other cluster analysis studies supported some of these findings and also,
showed that the relationship between school climate cluster membership and
student achievement is consistent across core content areas such as English,
mathematics, history, and science (Bergren, 2014; Smith, 2005). In addition to
supporting the relationship between school climate group membership and
school performance, researchers also used cluster analysis to examine the
moderating effects of school climate on school interventions. Specifically,
findings showed differential effects of a violence prevention intervention by
school climate type, with the intervention having a more positive effect on
student behavior in schools with conducive climate than in schools with
average or distressed climate (Dymnicki & the Multisite Violence Prevention
Project, 2013). Whereas these findings were able to provide evidence of
climate groups and relations among school climate and report card outcomes
42 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
class cluster analysis has been called other names in the literature, such as
finite mixture modeling (Pastor & Gagne, 2013; McLachlan & Peel, 2000),
model-based clustering (e.g., Banfield & Raftery, 1993), and mixture
likelihood approach to clustering (Everitt, 1993). Latent class clustering
encompasses a broad family of methods that use the same general model,
including latent class analysis, latent profile analysis, mixed-mode clustering,
and latent transition analysis. Although the latent class methods have been
available for many years (e.g., Gibson, 1959; Lazarsfeld & Henry, 1968), the
techniques are enjoying increased popularity through improved computer
capabilities and available software (Vermunt & Magidson, 2002). While there
are similarities between latent class cluster analysis and cluster analysis, there
are also distinctions between the two methods.
As with cluster analysis, latent class cluster (LCC) analysis has a similar
overarching goal: to classify cases into groups where members within a group
are similar to each other and different from individuals in other groups
(Vermunt & Magidson, 2002). Similarly, cases are thought to belong to one of
K groups underlying the dataset where the number of groups is unknown a
priori. The goal is to uncover the total number of classes (termed K)
underlying the dataset, where each class (noted as k) may be thought of as a
sub-population which is discrete and mutually exclusive (Clogg, 1995;
Heinen, 1996).
Given that k different classes underlie a population, individuals within a
certain class have the same probability distribution with respect to the
categorical latent variable. However, while each case is grouped into only one
class, k, the LCC model recognizes that there may be uncertainty in the
classification. Therefore, each case is given a probability value of belonging to
each of the K groups. Values for the weights range between 0 and 1 per class
and sum to 1 across the set of classes.
The use of latent class clustering models has additional advantages over
traditional clustering methods. One advantage is that LCC models use fit
indices which help researchers select a model and report how well the model
performs to fit the data. Another advantage is that LCC allows researchers to
select parameters of specific interest to be included in the model estimation or
to restrict parameters that are not of interest to estimate. The former is referred
to as freeing parameters, the latter as fixing parameters. Model parameters
may be fixed or freed in line with assumptions, model characteristics, or
relationships among variables. Third, covariates and outcomes may be
included in the same model. This allows LCC to incorporate important
covariates which may affect the creation of the latent classes and to examine
44 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
of as a global fit index, providing information about the overall fit of the
model to the data. Relative fit indices are useful to use when comparing
alternative models to determine which model illustrates a better fit to the data.
Indices in this class may be used to compare models which differ in the
number of groups requested and/or the model specifications. When comparing
models, there is a need to balance information from fit indices and also the
principle of parsimony. Parsimony suggests selecting the solution with the
minimum number of classes possible while achieving an acceptable model fit.
Additional fit indices may be used to compare competing models to select a
model.
The Akaike Information Criteria (AIC) and the Bayesian Information
Criteria (BIC) offer comparative evidence to evaluate different solutions
(Muthén, 2001; Vermunt & Magidson, 2002). The AIC and BIC are
"parsimony criteria" used to compare different model solutions (i.e., different
numbers of groups underlying the data) in order to determine which model fits
the data best. For these indices, the more parameters that are estimated, the
higher the value of AIC/BIC. In addition for adjusting for the number of
parameters, BIC adjusts for the sample size (N), yielding larger values as
sample size increases, all other factors held constant. With AIC and BIC,
comparatively lower values indicate better fitting models (Pastor et al., 2013).
Other LCC-based fit indices measure the degree of uncertainness in the
classifications. These methods require that individuals are grouped to
determine how well the model works to classify cases. Posterior probabilities
denote the probability of class membership and are computed using both the
model characteristics across the set of K classes and a case‟s pattern of
observed scores (see Vermunt & Magidson, 2002 for more details). Under
mixture modeling, cases may associate with more than one group through the
mixing weight, and can have fractional group membership across all groups.
To judge model uncertainty, cases are typically assigned to the one group with
which it has the highest posterior probability of association. This type of group
classification is termed modal assignment (e.g., Pastor & Gagné, 2013). At the
individual level, the higher probability value, the greater confidence one may
have concerning a case's class assignment. A better fitting solution will have
higher classification rates (i.e., fewer cases which are difficult to classify) for
each class, interpreted as a greater certainty of the classification. Perfect
classification would be illustrated by probability values of 1 for each class.
Entropy is a measure of uncertainness or randomness in the classification
procedure, and provides a summary of the information presented in a
classification table with one index (Pastor et al., 2007). Within LCC analyses,
46 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
entropy values indicate how well the model predicts class memberships
(Akaike, 1977). Values range from 0 to 1, where entropy values closer to 1
illustrate better prediction (Vermunt & Magdison, 2002). Entropy is computed
as the maximum of the probability density distribution underlying the latent
class cluster model (Akaike, 1977) and is calculated using the posterior
probability information, as well as the number of classes K modeled and
sample size N.
As suggested by latent class researchers (Collins & Lanza, 2010; Muthén,
2001; Muthén & Muthén, 2000, 2007; Vermunt & Magdison, 2002), each
class solution can be evaluated using multiple criteria. Finally, an optimal
solution can be “named” through interpreting patterns of high and low
parameter estimates within each class to ensure that each latent class has
substantive meaning (Muthén & Muthén, 2000) and matches to theory. It is
noted that while interpretability relies on the judgment of the researcher rather
than fit indices, it is still a very important component. This is similar to factor
analysis, where a researcher evaluates the sensibility of the solutions when
deciding between different numbers of factors (Crocker & Algina, 1986).
When both covariates and outcomes are available, a three-step procedure
is recommended (Asparouhov & Muthén, 2014; Vermunt, 2010). Generally,
identifying the model is the first, and most important, step in the mixture
modeling process. This involves identifying the optimal number of classes
using the procedures described above. First, the models are estimated where
no covariate or outcomes are included. These are called unconditional models,
as they are not ”conditional” or dependent upon additional auxiliary variables.
This information is used to estimate different class models. Based on this
information, the preferred model may be chosen. Second, using modal class
assignment, cases are assigned to latent classes. Modal assignment information
provides classification information to include with the auxiliary variables (i.e.,
covariates and outcomes). Third, fixed values are obtained from the latent
classification procedures in step 1 (i.e., log odds of the classification
probabilities; Asparouhov & Muthén, 2014). Fixing measurement
relationships between the latent class variable and the most likely class
variable helps to account for the imprecision in the classification and produces
correct estimates and standard errors for the relationships of class membership
with other variables (Asparouhov & Muthén, 2014). Upon completion of these
three steps, auxiliary information may be included without influencing the
measurement of the latent classes. After performing these three steps,
covariates and distal outcomes may be incorporated. Covariates may help to
predict class membership and may be added into the model using the set
An Examination of Predictors and Outcomes … 47
number of classes, k, found in the first step. Finally, the distal outcome
variables related to the latent class variable and/or indicators may be included
in the last step. Outcome variables (e.g., standardized test scores) may be
assessed to determine if latent groups provide differences with regard to
performance and accountability measures.
METHODS
Schools and Participants
Instrumentation
PRELIMINARY ANALYSES
Statistical analyses of the imputed teacher and student data sets began
with confirmatory factor analysis (CFA). This multivariate statistical
procedure aims to determine how well the survey items measure the climate
constructs. This procedure is appropriate to use when researchers hold prior
knowledge of the underlying latent structure of an instrument (Benson, 1998;
Byrne, 1998; Hoyle & Panter, 1993). CFA was appropriate as it was preceded
by exploratory factor analysis (EFA), which yielded the same factor structure
across two consecutive years (2006 and 2007) and CFA for teachers, students,
and parents (Monrad et al., 2008) with additional independent samples (survey
data collected in 2008, 2009, and 2010).
For each data set, the factor structure derived from exploratory procedures
was used to specify the measurement model in CFA. The confirmatory factor
analyses conducted in this study included only the teacher items present in the
optimal exploratory factor solutions (53 teacher items and 43 student items).
Subsequent item analysis showed that CFA results of the current study
replicated closely the prior EFA solutions. CFAs of the teacher, student, and
parent data sets were conducted using the CALIS procedure provided in the
SAS 9.2 statistical software package. Parameters and model fit indices were
estimated using the Maximum Likelihood procedure. This estimator is
frequently used in CFA studies with (distributionally) normally distributed
An Examination of Predictors and Outcomes … 49
is primarily associated with parent involvement with the school and student
learning. The Safety factor for students is comparable to that outlined for
teachers: the perception of security both at school and coming to and going
from school.
CFA Factor Scores. Each CFA run was followed by the computation of
factor scores using a least squares regression approach (Thurstone, 1935).
Regression factor scores predict the location of each survey participant on the
distribution of each of the climate factors, and may be used for subsequent
statistical analysis. They are standardized scale scores developed from the
factor structure and based upon the weights assigned to individual items.
Values generally range from a low of -3 to a high of 3, representing three
standard deviations from the mean, where values near zero represent an
average performance. With respect to climate, positive factor scores depict
above average ratings whereas negative scores describe a climate rating that is
below average. To identify climate characteristics within each school, as well
as to compare these characteristics across schools, factor scores were
aggregated at the school level for the latent class clustering analysis. Resulting
analyses, thus, included students‟ and teachers‟ perceptions of their school‟s
climate across multiple domains. This procedure provided a school average
estimate for each climate dimension, and it allowed researchers to determine
where each school is located on every teacher and student factor. These
10factor scores across student and teacher solutions were used to create latent
classes of school climate. Thus, we recognize latent profile analysis was used
as the variables used to create the classes were continuous in nature.
receiving Medicaid and/or reduced meal plans. Its values range from 0 to 100,
where higher values indicate higher levels of poverty.
Outcomes. School climate data was compared to the school report card
information. Specifically, the school Absolute index and the school‟s
composite score or “grade” used for federal accountability purposes were
included as outcomes. For elementary schools, the Absolute index provides a
measure of a school‟s performance on standardized tests and attendance (e.g.,
http://www.eoc.sc.gov/Information%20for%20Educators/Accountability%20
Manuals/2012/Ratings%20for%20School%20Districts%2011-12.pdf).
Beginning in 2012, South Carolina applied for, and received, a waiver
for several of the requirements for the Elementary and Secondary Education
Act (ESEA). This “ESEA waiver” allowed for more flexibility in reporting for
federal accountability purposes. As such, schools now receive a composite
score calculated based on student performance on state standardized tests. This
composite score takes into account both students who meet the pre-defined
proficiency goal, as well as students who do not meet the proficiency goal but
show improvement in test scores from the previous school year (SC
Department of Education, 2013). Schools are then assigned a letter grade
based on this composite score where scores 90-100 = “A”, 80-89.9 = “B”, 70-
79.9= “C”, 60-69.9 = “D”, and below 60 = “F” (SC Department of Education,
2013).
Statistical Methods
To conduct analyses, the software package Mplus (v. 7.3) was used
(Muthén & Muthén, 1998-2014). Within the current software version, Mplus
includes automated procedures for the three-step approach when mixture
models consist of a single latent class variable and auxiliary information
consisting of either covariates or outcomes. In this study, however, the model
included both covariates and distal outcomes, so it was necessary to perform
the three-step method manually. In addition, cases which were missing
covariates, outcomes, or school-level climate scores for more than four scales
were eliminated from the analyses.
First, a series of unconditional models, that is, models which did not
include covariates or outcomes, were fit. Here, it is typical practice to start
with a one-class model and then successively increase the number of classes
by one to find the optimal number of classes. Fit statistics were collected for
each model and compared to the previous model (i.e., the model with one less
52 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
RESULTS
After examination of the preliminary dataset, 40 cases were deleted due to
missing data. Overall, the analyzed sample included 52,436 student responses
and 17,843 teacher responses from 610 elementary schools. These schools had
complete profiles of cases among the set of 10 factor scores as well as
information on covariates and outcomes. Across schools, the average Absolute
index was 3.20, average school size was 513 students, the average ESEA
waiver composite score was 83.13, and the average poverty index was 75.56.
As the first step of the procedures, the latent profile analyses ignored
covariate and outcome information and concentrated on finding the optimal
class solution. To begin, two latent classes were extracted and successive
classes were added to the solution until no additional solutions could be
extracted due to nonconvergence problems (Nylund et al., 2007; Vermunt &
Magidson, 2002). Based on this process, two to a maximum of five class
solutions were extracted. Fit indices were examined to help identify the
An Examination of Predictors and Outcomes … 53
optimal number of classes. Table 1 presents fit information for the two through
five class solutions examined.
set covariance terms to zero. Model 4B allowed variance terms to vary across
classes, and covariance terms to be estimated, but constrained to be equal
across the set of classes. The last model tested, Model 4C, allowed all variance
and covariance terms to vary freely within class. For all models, vectors of
mean scores (i.e., centroid values) were uniquely estimated within each class.
Fit indices, as well as the interpretability of the class solution, were examined
for each model.
As shown at the bottom of Table 1, Model 4B produced the highest
entropy level and the lowest AIC value. Model 4C yielded lower information
fit criteria, but showed low entropy and was hard to interpret. The difficulty in
interpreting solutions with correlated variables has been recognized previously
(Muthén, 2002). Based on the information, the baseline model, Model 4B was
used for the three-step procedure as it illustrated acceptable fit, interpretability,
and match to prior research with state-wide datasets using both cluster analysis
(DiStefano et al., 2007) and latent profile analysis (Mindrila et al., 2010).
Model 4B was used for the remaining steps in the procedure. Probability
information was used to assign schools to classes. Subsequently, class
probabilities were constrained while the influence of covariates and distal
outcomes were included. Estimated parameter values for the classes are
provided in Table 2 and a representation of the class centroids is provided in
Graph 1.
Class one (n=129, 21% of elementary school sample) was named “Poor
Climate” based on the mean profile of the student and teacher factor scores.
There were relatively few classification problems, with an average prior
probability of .974. The Poor Climate group had the lowest scores of the set of
classes, with negative teacher and student scores. Here, teacher scores were
lower than student scores, showing greater levels of dissatisfaction with the
school environment. Average teacher scores also reported higher levels of
variability than average student scores.
Class two (n=186, 30% of elementary schools) was termed “Average
Climate”. Again, class average prior probability values were high, .949. All of
the mean values for this class were positive, but close to zero. This group also
had low variability, with small variances reported for both teacher and student
parameters.
Table 2. Average Latent Profiles of School Climate Variables, by Class (N = 610 Elementary Schools)
School grade
A B C D F
n % within n % within n % within n % within n % within
Latent class class class class class class
Poor Climate 23 17.8% 25 19.4% 24 18.6% 16 12.4% 41 31.8%
Average Climate 71 38.2% 60 32.3% 19 10.2% 19 10.2% 17 9.1%
Average Teacher/ 81 52.3% 48 31.0% 13 8.4% 2 1.3% 11 7.1%
Positive Student
Positive Climate 98 70.0% 33 23.6% 6 4.3% 3 2.1% 0 0.0%
Note. The ESEA composite score or index can be converted into a letter grade as follows: scores 90-100 = “A”, scores 80-89.9 = “B”,
scores of 70-79.9= “C”, scores 60-69.9 = “D”, and scores below 60 = “F”.
58 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
For example, schools belonging to the Poor Climate class (class 1) were
associated with the highest poverty index, smallest school size, and lowest
Absolute index and ESEA composite score. Conversely, schools belonging to
the Positive Climate group (class 4) have the lowest poverty index, largest
school size, and highest Absolute index and ESEA composite score.
Considering the influence of the covariates on the latent classes, analyses
were conducted, where classes are compared to a referent class; here, the
Positive Climate (class 4) was used as the comparison. Compared to the most
Positive Climate class (class 4), there was a significant effect of poverty on
Poor Climate (class 1) and Average Climate (class 2) groups, where these
classes were more likely to have higher poverty. School size was also
significant; however, coefficients were close to zero and odds ratios
(comparing each class to the referent group) were 1, showing no great impact
of school size on school climate. The Average Teacher/High Student Climate
class did not report significantly different effects on poverty or school size as
compared to the Positive Climate class.
Outcomes. Table 3 provides information about the relationship between
class membership and performance indicators. Results indicated that schools
with the poorest climate, as defined by negative, below average factor scores,
did worst on achievement outcomes; schools in successively more favorable
climate groups showed progressively higher achievement outcomes. One
feature of the ESEA waiver composite score is that it can be converted into a
letter grade for each school.
Table 4 highlights the school “grade” distribution between the identified
latent classes. As climate profiles become more positive, the proportion of
schools receiving a school grade of an “A” increases, while the proportion of
schools receiving an “F” decreases. For example, approximately 32% of
schools belonging to the Poor Climate class received a school grade of an “F”,
while 70% of schools belonging to the Positive Climate class received an “A.”
DISCUSSION
The current study used a state-wide sample of climate ratings from 610
elementary schools. Survey results from teachers and students were aggregated
to the school level to identify the number of underlying latent classes and
estimate the impact of a categorical school climate variable on school absolute
ratings and composite scores based on student achievement ESEA, while
taking into account the effect of school size and poverty on the classification
An Examination of Predictors and Outcomes … 59
SCHOLARLY SIGNIFICANCE
Wang, Haertel, and Walberg (1997) conducted a meta-analysis using a
database consisting of 11,000 statistical findings and determined that
instruction and climate affect learning as much as student characteristics. Their
work supports “the idea that climate is a real factor in the lives of learners and
that it is measurable, malleable and material to those that work and learn in
schools” (Freiberg, 1999, p. 17). There is a compelling body of literature
providing support for the importance of school climate. Compared to other
barriers which are not within the locus of control of schools, such as high child
poverty and low state funding, negative school climate factors can be
improved. Although there is a growing literature dealing with the assessment
of school climate, efforts to systematically improve it have been limited.
Changing school climate “requires explicit, targeted, and aligned change
efforts at the leverage points” (McGuigan, 2008, p. 112). Results from this
study may be used to foster such efforts by providing greater insight about
how climate may impact selected accountability outcomes.
While some obstacles, such as poverty, are not easily surmountable,
school climate can be improved with limited expenses. Therefore, training on
changing the school climate should be provided at the school level for teachers
and administrators. These efforts should be stronger in high poverty areas, as
poverty has a negative impact on both school climate and school performance.
Therefore, with an increased focus on accountability and academic
improvement nationwide, the current research provides support for an
increased attention to school climate as a critical dimension for school leaders
to focus school improvement efforts. By evaluating the practices at the school-
level to determine which are promoting positive school climate, schools may
also see improvement in achievement outcomes.
The current work provides a framework for evaluating school climate data
as well as providing direction for the potential application of school climate
data for use in school improvement. For example, an extension of the current
work includes utilizing the climate data to develop multi-year school climate
profiles that could provide low-performing schools with a practical tool to use
in identifying critical areas for school improvement. Assessment and
An Examination of Predictors and Outcomes … 61
REFERENCES
Akaike, H. (1977). On the entropy maximization principle. In P. R. Krishnaiah
(Ed.), Applications of statistics (pp. 27-41). Amsterdam: North-Holland.
Aldenderfer, M. S., & Blashfield, R. K. (1984). Cluster analysis. Beverly
Hills, CA: Sage.
Allen, S.M., Thompson, R.H., Hoadley, M., Engelking, J., & Drapeaux, J.
(1997, March). Improving school climate: Creating a circle of
communication between educators and families. Paper presented at the
meeting of the American Educational Research Association, Chicago, IL.
Anderson, C.S. (1982). The search for school climate: A review of the
research. Review of Educational Research, 52(3), 368-420.
Ashforth, S.J. (1985). Climate formations: Issues and extensions. Academy of
Management Review, 25, 837-847.
Asparouhov, T., & Muthén, B. (2014). Auxiliary variables in mixture
modeling: Three-Step approaches using Mplus. Structural Equation
Modeling: A Multidisciplinary Journal, 21(3), 329-341.
Bacher, J. (2000). A probabilistic clustering model for variables of mixed type.
Quality & Quantity, 34, 223-235.
Banfield, J.D., & Raftery, A.E. (1993). Model-based Gaussian and non-
Gaussian clustering. Biometrics, 49, 803-821.
Bensmail, H., Celeux, G., Raftery, A. E., & Robert, C. P. (1997). Inference in
model based clustering. Statistics and Computing, 7, 1-10.
Benson, J. (1998). Developing a strong program of construct validation: A test
anxiety example. Educational Measurement: Issues and Practice, 17, 10-
22.
Bergren, D. A. (2014). The impact of school climate on student achievement in
the middle schools of the commonwealth of Virginia: A quantitative
analysis of existing data (Doctoral dissertation). Retrieved from ProQuest.
(Publication Number 3615518)
Bernstein, L. (1992). Where is reform taking place? An analysis of policy
changes and school climate. Educational Evaluation and Policy Analysis,
14(3), 297-302.
Blashfield, R. K., & Aldenderfer, M. S. (1988). The methods and problems of
cluster analysis. In J. R. Nesselroade & R. B. Cattell (Eds.), International
handbook of multivariate experimental psychology (pp. 311-359). New
York: Plenum.
Brand, S., Felner, R., Shim, M., Seitsinger, A., & Dumas, T. (2003). Assessing
school climate middle school improvement and reform: Development and
An Examination of Predictors and Outcomes … 63
Hopson, L.M. & Lee, E. (2011). Mitigate the effect of family poverty on
academic and behavioral outcomes: The role of school climate in middle
and high school. Children and Youth Services Review, 33, 2221-2229.
Hoy, W.K. & Miskel, C.G. (1982). Educational administration: Theory,
research, and practice. (2nd ed., pp. 185-219). New York: Random
House.
Hoy, W.K. (1990). Organizational climate and culture: A conceptual analysis
of the school workplace. Journal of Educational and Psychological
Consultation, 1(2), 149-168.
Hoy, W.K., Tarter, C.J., & Kottkamp, R.B. (1991). Open schools, healthy
schools: Measuring organizational climate. Newbury Park, CA: Sage.
Hoyle, R. H., & Panter, A. T. (1993). Writing about structural equation
models. In R. H. Hoyle (Ed.), Structural equation modeling: Concepts,
issues, and applications (pp. 158–176). Newbury Park, CA: Sage.
Koth, C.W., Bradshaw, C.P., & Leaf, P.J. (2008). A multilevel study of the
predictors of student perceptions of school climate: The effect of
classroom-level factors. Journal of Educational Psychology, 100(1), 96-
104.
Lazarsfeld, P. F., & Henry, N. W. (1968). Latent structure analysis. Boston:
Houghton Mifflin.
Lee, V.E. & Burkham, D.T. (1996, February). Gender differences in middle-
grade science achievement: Subject, domain, ability level, and course
emphasis. Science Edition, 80(6), 613–650.
Lezotte, L.W. (1990). A guide to the school improvement process based on
effective schools research. Okemos, MI: Effective Schools Products.
Loukas, A. & Murphy, J.L. (2007). Middle school perceptions of school
climate: Examining protective functions on subsequent adjustment
problems. Journal of School Psychology, 45, 293-309.
Ma, X. & MacMillan, R.B. (1999). Influences of workplace conditions on
teachers‟ job satisfaction. The Journal of Educational Research, 93(1), 39-
47.
Macneil, A.J., Prater, D.L., & Busch, S. (2009). The effects of school culture
and climate on student achievement. International Journal of Leadership
in Education, 12(1), 73-84.
Malecki, C.K. & Demaray, M.K. (2006). Social support as a buffer in the
relationship between socioeconomic status and academic performance.
School Psychology Quarterly, 21(4), 375-395.
McGuigan, L. (2008). Systems thinking and culture change in urban school
districts. In Hoy, W.K., & DiPaola, M. (Eds.), Improving schools: Studies
66 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
Pastor, D. A., & Gagné, P. (2013). Mean and covariance structure mixture
models. In G. R. Hancock & R. O. Mueller (Eds.), Structural equation
modeling: A second course (2ndEd.). Greenwich, CT: Information Age.
Perkins, B.K. (2006). Where we learn: The CUBE survey of urban school
climate. Retrieved June 4, 2009 from http://www.nsba.org/
SecondaryMenu/CUBE/Publications/CUBEResearchReports/WhereWeLe
arnReport/WhereWeLearnFullReport.aspx
Roney, K., Coleman, H., & Schlichtin, K.A. (2007). Linking the
developmental health of middle grades schools to student achievement.
NASSP Bulletin, 91(4), 289-321.
Rumberger, R.W. (1995). Dropping out of middle school: A multilevel
analysis of students and schools. American Educational Research Journal,
32(3), 583-625.
Sackney, L. (1988). Enhancing school learning climate: Theory, research and
practice. In D. O. E. Administration (Ed.) (Vol. 180, pp. 146): SSTA
Research Centre.
School climate, discipline, and safety: Gauging educator attitudes. (2013).
Retrieved July 16, 2013, from http://www.edweek.org/ew/qc/2013/school-
climate-gauging-attitudes.html
Sebring, P.B., Allensworth, E., Bryk, A.S., Easton, J.Q., & Luppescu S.
(2006). The essential supports for school improvement. Retrieved March
9, 2007, from http://ccsr.uchicago.edu/content/publications.php?
pub_id=86.
Sirin, S.R. (2005). Socioeconomic status and academic achievement: A meta-
analytic review of research. Review of Educational Research, 75(3), 417-
453.
Smith, J. J. (2005). The relationship between school division climate and
student achievement of school divisions in the commonwealth of Virginia
(Unpublished doctoral dissertation). Virginia Polytechnic Institute and
State University, Blacksburg, VA.
South Carolina Department of Education. (2013). ESEA federal accountability
system: Calculating the grades for districts and schools: Technical
manual. Retrieved from http://ed.sc.gov/data/esea/2012/
documents/ESEA_Technical_Documentation-Draft.pdf
Stewart, E.B. (2007, June). School structural characteristics, student effort,
peer associations, and parental involvement: The influence of school- and
individual-level factors on academic achievement. Education and Urban
Society 2008, 40, 179-204.
68 Christine DiStefano, Elizabeth Leighton, Mihaela Ene et al.
Chapter 3
ABSTRACT
This chapter addresses the testing of specific effects and contrasts in three
types of mediation models: models with up to four simultaneous
(parallel) mediators, models with two sequential mediators, and single-
mediator models with two initial variables. We use the delta method and
provide equations to calculate standard errors for simple and total indirect
effects, total effects, and specific contrasts in each type of model. We also
demonstrate how bootstrap interval estimates of specific effects and
contrasts can be obtained using phantom models and how indirect effects
involving different initial variables can be compared in a scale-free
fashion. Testing contrasts, we show how common requirements for
complete mediation can be made stronger. Limitations of both, statistics
using standard errors based on normal theory and bootstrapping to test
mediation, along with new methods are discussed. The methods are
illustrated using publicly available datasets. Supplementary material
*
Corresponding author: Thomas Ledermann, thomas.ledermann@unibas.ch.
70 Thomas Ledermann and Siegfried Macho
available online includes Amos, OpenMx, and Mplus files to estimate the
models and an Excel spreadsheet to calculate the effects.
X c′ Y
Model B 1 r3
M3
a3
1 r2
M2 b3
a2
1 r1 b2
a1 M1 b1 1 r4
X c′ Y
1 r1 1 r2
Model C b12
M1 M2
a1 b b2 r3
a2 1
b1
X c′ Y
Model D
X1 c′1
a1 1 r1 1 r2
b
M Y
a2 c′2
X2
Figure 1. Model A: Simple mediation model. Model B: Mediation model with three
simultaneous mediators. Model C: Mediation model with two sequential mediators.
Model D: Mediation model with two initial variables.
model, there are two simple indirect effects, a1b and a2b, sharing the direct
effect b, two direct effects c′, c′1 and c′2, and two total effects, a1b + c′1 and a2b
+ c′2. We next address the assessment of mediation and the testing of specific
effects and contrasts in these types of mediation models.
ASSESSING MEDIATION
For a good understanding of the mediation process all direct effects and
indirect effects in a model should be estimated and tested (e.g., Ledermann &
Macho, 2009).
There are two reasons to test the direct effects separately (Judd & Kenny,
2010). First, for mediation to occur all direct effects that constitute an indirect
effect have to be substantial because it makes no sense to speak of mediation if
not all direct effects that make up an indirect effect are substantial. Second,
mediation can be inconsistent (Maassen & Bakker, 2001; MacKinnon, Krull,
& Lockwood, 2000). Inconsistent mediation, also called suppression, occurs
when an indirect effect and the respective direct effect c′ have opposite signs.
To assess whether consistent mediation (i.e., the indirect effect and the
respective direct effect c′ are of the same sign) or inconsistent mediation has
occurred one need to know the sign of the direct effects.
Contrasts
3
In mediation models with a direct effect c′, a quantity commonly reported is the relative
proportion of the indirect effect in the total effect. For the simple mediation model this is
ab/(ab + c′). However, it has been found that the accuracy of this ratio is poor unless the
sample size is at least 500 (MacKinnon, Warsi, & Dwyer, 1995). Additionally, this ratio can
74 Thomas Ledermann and Siegfried Macho
may wish to know whether two indirect effects differ in magnitude. For
example, a researcher might find that the effect of sleep problems on school
achievement is stronger through tiredness than through negative mood.
In mediation models, very often the effects being compared have the same
initial and outcome variable and so the same metric (e.g., contrasting an
indirect effect with the respective direct effect c′ or two indirect effects in a
model with multiple sequential mediators). In this case, the effects are
quantified by the same units of measurement (e.g., Cheung, 2007; MacKinnon,
2000; Preacher & Hayes, 2008) and, so, the comparison of the effects is
independent whether the variables are standardized or unstandardized.
Sometimes the effects being compared do not have the same initial or
outcome variable. In such situations, one can use either unstandardized or
standardized estimates that are likely to produce different results. Consider a
simple model where an outcome Y is regressed on X1 and X2. If we want to
know whether an increase in X1 by one unit has the same effect on Y than an
increase by one unit in X2 we use unstandardized estimates. This practice
presupposes that the unit of measurement of both predictors is meaningful in
itself and in comparison to each other. 4 For instance, in a study on the
influence of time spent with family and close friends on people‟s satisfaction,
we might be interested in whether satisfaction is more influenced by hours
spent with family vs. hours spent with close friends and so use the
unstandardized predictors with hours as unit of measurement.
When the predictors have different scales the unit of measurement for
each predictor has to be chosen in such a way that the comparison makes most
sense on a practical level. For example, a researcher may want to know
whether weight loss is more influenced by the reduction of calories consumed
or sport work and find that reducing the consumption of food energy by one
kilocalorie a week has a bigger effect on weight loss than increasing sport
work by one hour. Measuring food energy in calories, it is most likely that
increasing sport work by one hour has a bigger influence on weight loss than
reducing food energy by one calorie.
If in models with multiple initial (or outcome) variables the units of
measurement of the variables have no definitive meaning, a comparison of
effects makes most sense if done in an independent fashion of the units of
measurement (e.g., Raykov, Brennan, Reinhardt, & Horowitz, 2008). This can
be large when the total effect is very small and can be greater than one when inconsistent
mediation occurs.
4
In models with one outcome, the outcome‟s unit of measurement has no influence on the
comparison of the effects.
Assessing Mediation … 75
5
Lau and Cheung (2012) and Raykov et al. (2008) provide details to estimate and test
standardized effects using a SEM software program, such as OpenMx, LISREL (Jöreskog &
Sörbom, 2006), or RAMONA (Browne & Mels, 2005), that allow researchers to specify the
models on the level of the matrices. For users of Mplus or lavaan (Rosseel, 2012), Cheung
(2009) showed how standardized effects can be estimated in simple mediation models.
76 Thomas Ledermann and Siegfried Macho
TESTING METHODS
Indirect effects are most often tested by either the z-statistic or Efron‟s
(1979) bootstrap method. We describe both methods that can also be used to
test direct effects, total effects, and contrasts among effects.
X1
l1
s2 = 1
Xl1
c′1
a1 1 r1 1 r2
b
M Y
a2 c′2
Xl2
l2 s2 = 1
X2
The Z Statistic
Bootstrapping
The bootstrap method has been widely advocated for testing indirect
effects (e.g., Bollen & Stine, 1990; Preacher & Hayes, 2008; Shrout & Bolger,
2002) as well as contrasts among effects (Williams & MacKinnon, 2008). In
several popular SEM software packages, including Amos and EQS (Bentler,
2000-2008), the built-in bootstrap procedure for testing effects is limited to
direct effects, total indirect effects, and total effects. As we have seen, in
complex mediation models, simple indirect effects are often part of a total
indirect.
To test specific effects and contrasts using a SEM software program with
limited capabilities to do so the phantom model approach can be used (Macho
& Ledermann, 2011). This method provides a flexible means that stands out
due to its ease of use in obtaining both point and bootstrap interval estimates
for specific effects and contrasts in structural equation models and multiple
78 Thomas Ledermann and Siegfried Macho
group analysis. It can be used with all software packages that report point
estimates of total effects, have bootstrap functions, and allow for latent
variables and parameter constraints. Phantom models are set up along and
estimated simultaneously with the main model. For each effect that cannot be
classified as direct, total indirect, and total effect, a phantom model is to build
whose structure represents the effect being estimated. Further details are given
in Appendix B. For more applications see Ledermann, Macho, and Kenny
(2011) and Perera (2013).
The decision of which method to use should be based on the data and
variables being analyzed. The z-statistic for testing contrasts and indirect
effects works well in large samples (i.e., in the high hundreds) and if the data
are normally distributed. Using maximum likelihood (ML) estimation the z-
statistic can be applied if data are available in raw form or in form of the
variances-covariance matrix and if they are incomplete. Though bootstrapping
dominates these days the testing of mediation effects, it is not without
limitations. The ordinary nonparametric bootstrap method requires access to
raw data whose empirical distribution function is assumed to be a good
representation of the population distribution function. Although the
nonparametric bootstrap method makes no assumption about the distribution
in the population it is likely to fail to provide consistent estimates when the
original sample has an extreme distribution and outliers and when there are
many missing values. Dichotomous variables in combination with a small
sample are also likely to lead to estimation problems or inconsistent results.
Depending on the distribution of the data, the missing values, and the types of
variables, bootstrapping can require fairly large sample sizes with extreme
distributions, many missing values, and dichotomous variables all require
larger samples (see Chernick, 2008). Indeed, in small samples of 20 to 80
cases, bootstrap confidence intervals can be inconsistent (Efron & Tibshirani,
1993). Koopman, Howe, Hollenbeck, and Sin (2015) evaluated the bootstrap
method for small samples and note that even for moderate effect sizes samples
of 100 cases were required. The use of bootstrapping can further be
cumbersome when standardized effects should be analyzed because the
standardization would need to be done separately for each bootstrap sample. In
addition, some programs, including Amos, require complete data to perform
bootstrap analysis but parametric bootstrapping may be used in lieu of the
Assessing Mediation … 79
ILLUSTRATIONS
We illustrate the assessment of mediation for a model with three
simultaneous mediators, a model with two sequential mediators, and a single-
mediator model with two initial variables using publicly available datasets. For
illustrative purpose, we used both the z-statistic and the bootstrap method to
test specific effects and contrasts. The bootstrap estimates presented here are
based on 5,000 bootstrap samples. To determine whether an effect is
statistically significant, we followed Cheung‟s recommendation (2007) and
reported the bias-corrected (BC) bootstrap confidence intervals for the
unstandardized effects. Cases with missing data were excluded prior to the
analysis. We analyzed the data using the Amos software program.
80 Thomas Ledermann and Siegfried Macho
The data for the model with three simultaneous mediators (Figure 1B)
were taken from the study Quality of American Life conducted by Campbell
and Converse (1978). The purpose of this study was to investigate the
perceived quality of life of Americans 18 years of age and older. Here, we
used interview data from 1350 persons who provided complete data on the
selected variables. We used satisfaction with oneself as a person (X) to predict
satisfaction with the life (Y) through the simultaneous mediators satisfaction
with the job (M1), satisfaction with marriage (M2), and degree to which
respondents enjoy their life (M3). The satisfaction measures could range from
1 (completely dissatisfied) to 7 (completely satisfied) and the enjoyment of life
measure could range from 1 (rarely) to 4 (all the time). Due to the large sample
size we report 99 percent confidence intervals.
The effect estimates, the standard errors (estimated by Amos and
Equations A2, A4, A11, A15, and A17 for the variances), and the normal and
BC bootstrap confidence intervals of the effects are presented in Table 1
(phantom models were set up to obtain point and interval estimates for a1b1,
a2b2, a3b3, a1b1 – c′, a2b2 – c′, a3b3 – c′, a1b1 + a2b2 + a3b3 – c′, a1b1 – a2b2, a1b1
– a3b3, and a2b2 – a3b3). We found that all six direct effects that make up an
indirect effect were positive and statistically significant. The direct effect c′
was positive and significant too, which suggests that self-satisfaction had an
effect on life satisfaction over and above the effects of the three mediators. For
the indirect effects, all three simple indirect effects and the total indirect effect
were statistically significant indicating that the effect from satisfaction with
oneself on satisfaction with the life was simultaneously mediated through job
satisfaction, marriage satisfaction, and enjoyment of life. The total effect was
also positive and significant. The finding that all indirect effects as well as the
direct effect c′ had the same sign and were statistically significant indicates
that partial mediation occurred. Contrasting the three simple indirect effects
and the total indirect effect with the direct effect c′, we found that c′ was
significantly stronger than any indirect effect. Among the three indirect
effects, those through enjoyment of life and marital satisfaction were
significantly stronger than the one through job satisfaction.
Table 1. Testing mediation of satisfaction with oneself on satisfaction with the life through the simultaneous
mediators satisfaction with job, satisfaction with marriage, and enjoyment of life
z-statistic Bootstrapping
Effect Estimate
SE z p 99% CI BC 99% CI
Direct effects
self sat job sat (a1) 0.362 0.034 10.795 <.001 [0.275, 0.448] [0.264, 0.461]
self sat mar sat (a2) 0.318 0.028 11.342 <.001 [0.246, 0.390] [0.233, 0.408]
self sat enjoy (a3) 0.201 0.013 15.182 <.001 [0.167, 0.235] [0.163, 0.237]
job sat life sat (b1) 0.110 0.017 6.288 <.001 [0.065, 0.155] [0.056, 0.169]
mar sat life sat (b2) 0.206 0.021 9.698 <.001 [0.151, 0.261] [0.126, 0.287]
enjoy life sat (b3) 0.395 0.045 8.736 <.001 [0.279, 0.511] [0.268, 0.521]
self sat life sat (c′) 0.375 0.024 15.763 <.001 [0.314, 0.436] [0.286, 0.458]
Indirect effects and total effect
a1b1 0.040 0.007 5.433 <.001 [0.021, 0.058] [0.020, 0.067]
a2b2 0.065 0.009 7.371 <.001 [0.043, 0.088] [0.038, 0.099]
a3b3 0.079 0.010 7.572 <.001 [0.052, 0.106] [0.052, 0.112]
a1b1 + a2b2 + a3b3 0.184 0.015 12.501 <.001 [0.146, 0.222] [0.137, 0.236]
a1b1 + a2b2 + a3b3 + c′ 0.559 0.023 23.922 <.001 [0.499, 0.619] [0.482, 0.625]
Contrasts
a1b1 – c′ -0.335 0.026 12.885 <.001 [-0.402, -0.268] [-0.428, -0.239]
a2b2 – c′ -0.309 0.026 11.689 <.001 [-0.377, -0.241] [-0.406, -0.204]
a3b3 – c′ -0.295 0.028 10.464 <.001 [-0.368, -0.223] [-0.392, -0.195]
Table 1. (Continued)
z-statistic Bootstrapping
Effect Estimate
SE z p 99% CI BC 99% CI
a1b1 + a2b2 + a3b3 – c′ -0.190 0.032 5.963 <.001 [-0.273, -0.108] [-0.310, -0.071]
a1b1 – a2b2 -0.026 0.012 2.224 .026 [-0.056, 0.004] [-0.065, 0.013]
a1b1 – a3b3 -0.040 0.013 3.024 .002 [-0.073, -0.006] [-0.078, -0.003]
a2b2 – a3b3 -0.014 0.014 0.970 .332 [-0.050 0.023] [-0.057, 0.030]
Note: SE = standard error; BC CI = bias-corrected confidence interval. The formula used to compute normal 99% CI is estimate
2.58 SE.
Table 2. Testing mediation of satisfaction with payment on satisfaction with the life through the sequential
mediators satisfaction with job and satisfaction with family life
z-statistic Bootstrapping
Effect Estimate
SE z P 95% CI BC 95% CI
Direct effects
pay sat job sat (a1) 0.244 0.028 8.658 <.001 [0.189, 0.299] [0.181, 0.308]
pay sat fam sat (a2) 0.057 0.030 1.908 .056 [-0.002, 0.116] [-0.004, 0.118]
job sat fam sat (b12) 0.194 0.040 4.858 <.001 [0.116, 0.272] [0.109, 0.282]
job sat life sat (b1) 0.403 0.050 8.062 <.001 [0.305, 0.501] [0.305, 0.511]
fam sat life sat (b2) 0.437 0.049 8.989 <.001 [0.342, 0.533] [0.342, 0.532]
pay sat life sat (c′) 0.104 0.037 2.808 .005 [0.031, 0.177] [0.026, 0.183]
Indirect effects and total effect
a1 b1 0.098 0.017 5.900 <.001 [0.066, 0.131] [0.064, 0.139]
a2 b2 0.025 0.013 1.867 .062 [-0.001, 0.051] [-0.001, 0.055]
a1b12b2 0.021 0.005 3.832 <.001 [0.010, 0.031] [0.011, 0.035]
a1b12b2 + a1b1 + a2b2 0.144 0.022 6.478 <.001 [0.101, 0.188] [0.099, 0.193]
a1b12b2 + a1b1 + a2b2 + c′ 0.248 0.040 6.271 <.001 [0.171, 0.326] [0.156, 0.338]
Contrasts
a1b1 – c′ -0.006 0.044 0.131 .896 [-0.092, .080] [-0.096, .085]
a2b2 – c′ -0.079 0.040 1.994 .046 [-0.157, -.001] [-0.161, .005]
a1b12b2 – c′ -0.083 0.038 2.216 .027 [-0.157, -.010] [-0.164, .004]
a1b12b2 + a1b1 + a2b2 – c′ 0.040 0.047 0.859 .390 [-0.051, .131] [-0.051, .135]
Note: SE = standard error; BC CI = bias-corrected confidence interval. The formula used to compute normal 95% CI is estimate
1.96 SE.
84 Thomas Ledermann and Siegfried Macho
In sum, the effect of satisfaction with oneself on satisfaction with the life
seemed partially transmitted through the simultaneous mediators job
satisfaction, marriage satisfaction, and enjoyment of life. In addition, evidence
indicated that the direct effect of self-satisfaction on life satisfaction was
stronger than the indirect effects through the three mediators and that the
mediators enjoyment of life and martial satisfaction seemed to be more
important than the mediator job satisfaction for the association between
satisfaction with oneself and satisfaction with the life.
The data for the model with two sequential mediators (Figure 1C) are part
of the Quality of Employment Survey conducted by Quinn and Graham
(1977). The aim of this survey was to investigate the working conditions in
American labor force by workers aged 16 or older. There were 640 people
who provided complete data for the variables used in this example. We used
the variable satisfaction with payment (X) to predict satisfaction with the life
(Y) through the mediators satisfaction with the job (M1) and satisfaction with
family life (M2) that act in turn. The answers for payment, job, and family
satisfaction could range from 1 to 4, the answers for life satisfaction from 1 to
3, with higher scores indicating greater satisfaction. Table 2 presents the effect
estimates, the standard errors (estimated by Amos and Equations A2, A6, A7,
A13, A15, A19 and A20 for the variances), and the normal and BC bootstrap
confidence limits (phantom models were set up to obtain point and interval
estimates for a1b1, a2b2, a1b12b2, a1b1 – c′, a2b2 – c′, a1b12b2 – c′, and a1b12b2 +
a1b1 + a2b2 – c′). All direct effects constituting the indirect effects were
positive and statistically significant with the exception of the effect from
satisfaction with payment on family life satisfaction (a2), which was not
significant. These direct effects make up two positive simple indirect effects
(a1b1 and a2b2) and a positive three-path indirect effect (a1b12b2). Because a2
was trivial the focus here is on a1b1 and a1b12b2. These two indirect effects
were both statistically significant, which indicates that the effect from
satisfaction with payment on life satisfaction was mediated by both the effect
through the sequential mediators job and family satisfaction (a1b12b2) and the
effect through job satisfaction (a1b1). Also, the total indirect effect and the
total effect were significant. The direct effect c′ was positive and statistically
significant, which means that the mediators accounted partially for the effect
of satisfaction with payment on life satisfaction. For the comparisons of the
Assessing Mediation 85
indirect effects with the direct effect c′, we found that the total indirect effect
(a1b12b2 + a1b1 + a2b2) and the simple indirect effect through job satisfaction
(a1b1) were equally strong as c′. However, c′ was significantly stronger than
the simple indirect effect through family satisfaction (a2b2) and the three-path
indirect effect a1b12b2. In sum, these results revealed that the effect of
satisfaction with payment on satisfaction with the life was partially mediated
by job satisfaction and satisfaction with family life. The simple mediating
effect through job satisfaction and the total mediating effect were as important
as the direct effect from satisfaction with payment on satisfaction with the life.
For the model with two initial variables (Figure 1D), we used cross-
sectional data collected in 2000 that are part of the study Marital Instability
Over the Life Course conducted by Booth, Johnson, Amato, and Rogers
(2010). For the variables used in this example, 711 people provided complete
data. As X1 and X2 we used respondent‟s income and partner‟s income,
respectively, to predict the degree of happiness (Y) through self-esteem (M).
Income can take the values: 0, 2.5, 7.5, 12.5, 17.5, 22.5, 27.5, 35.0, 45.0, 55.0,
65.0, 75.0, 85.0, 95.0, and 105.0 (unit of measurement is $1,000). Self-esteem
was measured by six items (1= strongly agree, 4 = strongly disagree;
Cronbach‟s alpha = .806) that were combined to a composite score with higher
scores indicating higher self-esteem. The answers for degree of happiness
could range from 1 to 3 with higher scores reflecting greater happiness. Here,
we used the unstandardized effects because the initial variables, respondent‟s
and partner‟s income, were measured by the same scale and the unit of
measurement of this scale was meaningful. This is a common practice in
dyadic research where dyad members typically provide information on the
same variables (Kenny & Ledermann, 2010). The effect estimates, the
standard errors (estimated by Amos and Equations A2, A8, A14, A15, A22,
and A23 for the variances), and the normal and BC bootstrap confidence
intervals of the effects are given in Table 3 (phantom models were set up to
obtain point and interval estimates for a1b + a2b, a1b + c′1 + a2b + c′2, a1b – c′1,
a2b – c′2, a1b – a2b, and a1b + c′1 – (a2b + c′2). We found that a1, a2, and b were
positive and statistically significant. These direct effects constitute two
positive simple indirect effects (a1b and a2b) that were statistically significant.
Table 3. Testing mediation of respondent’s and partner’s income on degree of happiness through the
mediator self esteem
z-statistic Bootstrapping
Effect Estimate
SE z p 95% CI BC 95% CI
Direct effects
R‟s income self-esteem (a1) 0.0182 0.0040 4.5834 <.0001 [0.0104, 0.0260] [0.0110, 0.0260]
P‟s income self-esteem (a2) 0.0148 0.0040 3.6765 .0002 [0.0069, 0.0227] [0.0071, 0.0228]
self-esteem happiness (b) 0.0909 0.0081 11.2685 <.0001 [0.0751, 0.1067] [0.0741, 0.1076]
R‟s income happiness (c'1) 0.0016 0.0009 1.8154 .0695 [-0.0001, 0.0033] [-0.0002, 0.0033]
P‟s income happiness (c'2) 0.0016 0.0009 1.7874 .0739 [-0.0002, 0.0033] [-0.0002, 0.0034]
Indirect and total effects
a1b 0.0017 0.0004 4.2456 .0001 [0.0009, 0.0024] [0.0010, 0.0025]
a2b 0.0013 0.0004 3.4952 .0005 [0.0006, 0.0021] [0.0007, 0.0022]
a1b + c'1 0.0032 0.0009 3.4818 .0005 [0.0014, 0.0050] [0.0014, 0.0050]
a2b + c'2 0.0029 0.0009 3.0938 .0020 [0.0011, 0.0048] [0.0010, 0.0048]
a1b + a2b 0.0030 0.0007 4.5547 <.0001 [0.0017, 0.0043] [0.0019, 0.0045]
a1b + c'1 + a2b + c'2 0.0061 0.0018 3.3604 .0008 [0.0026, 0.0097] [0.0029, 0.0092]
Contrasts
a1b – c'1 0.0001 0.0010 0.0845 .9327 [-0.0018, 0.0020] [-0.0018, 0.0021]
a2b – c'2 -0.0002 0.0010 0.2228 .8237 [-0.0021, 0.0017] [-0.0022, 0.0018]
a1b – a2b 0.0003 0.0004 0.7575 .4487 [-0.0005, 0.0011] [-0.0005, 0.0011]
a1b + c'1 – (a2b + c'2) 0.0003 0.0010 0.3060 .7596 [-0.0017, 0.0024] [-0.0016, 0.0023]
Note: SE = standard error; BC CI = bias-corrected confidence interval. The formula used to compute normal 95% CI is estimate 1.96
SE.
Assessing Mediation 87
The two total effects, their sum, and the sum of the two indirect effects
were all positive and statistically significant. The two direct effects c′ were not
significant. The fact that both indirect effects were not significantly stronger
than the respective direct effects c′ does not support complete mediation. The
finding that the two indirect effects and the two total effects were equal in size
indicated that a person‟s degree of happiness seemed to be affected to the
same extent by one‟s own income and that of the partner. That is, an increase
of the partner‟s income by one unit would have had the same effect on a
person‟s degree of happiness as an increase of his or her own income.
In sum, results suggested that the effect of both one‟s own income and the
partner‟s income on his or her happiness was mediated by his or her self-
esteem. Moreover, partner‟s income affected the degree of happiness to the
same extent as one‟s own income.
DISCUSSION
The assessment of specific indirect effects and contrasts of effects, as
discussed in this chapter, offers much promise for a good understanding of the
mechanism through which one or multiple initial variables affect an outcome.
Specifically, the information whether a specific indirect effect is stronger than,
weaker than, or equal in strength to its respective direct effect c′ or whether in
a model with multiple mediators one indirect effect is stronger or weaker than
another indirect effect can (a) help to foster a better understanding of the
significance of the mediators in a model and (b) provide insights that allow a
researcher to draw firm conclusions about where it is appropriate to intervene.
Moreover, comparing the indirect effect with the respective direct effect c′ and
requiring for complete mediation that the indirect effect is bigger in size than
the respective direct effect c′ adds to the requirements for complete mediation
that proponents of the distinction between partial and complete mediation may
find useful.
Mediation analysis has been criticized because mediation implies a causal
mechanism that often remains uncertain (Bullock, Green, & Ha, 2010) due to
the existence of statistically equivalent models (e.g., Lee & Hershberger, 1990;
MacCallum, Wegener, Uchiono, & Fabrigar, 1993). This problem of causal
inference is alleviated, if theoretical considerations preclude alternative
models or if experimental designs provide strong evidence for the underlying
mediational process (e.g., Spencer, Zanna, & Fong, 2005). A method widely
discussed to reduce the number of statistically equivalent models and to detect
88 Thomas Ledermann and Siegfried Macho
, (A1)
6
The partial derivative of a function (e.g., ab – c′) with respect to one of its parameters (e.g., a) is
the ordinary derivative of the function with respect to that parameter with all other
parameters considered as constants.
Assessing Mediation 89
simplicity, the hats above the parameter estimates are dropped in the
remainder of this section. We note that in mediation models, the a parameters
are independent from the b and c′ parameters. Consequently, the covariances
between the a parameters and the b and c′ parameters are zero.
Indirect Effects
Using the first order delta method, Sobel (1982) derived a formula to
calculate the asymptotic variance for simple indirect effects. For the effect ab
this is
where a and b are estimated parameters, sa2 and sb2 are estimated variances of
a and b, respectively. This equation can be used to test any simple indirect
effect in the models of Figure 1.
Var(a1b1 a 2 b 2 ) a12 sb21 a 22 sb22 b12 s a21 b22 s a22 2a1 a 2 sb1b 2 2b1b2 s a1a 2 , (A3)
Var(a1b1 a 2 b2 a3 b3 ) a12 sb21 a 22 sb22 a32 sb23 b12 s a21 b22 s a22 b32 s a23
2a1 a 2 sb1b 2 2a1 a3 sb1b3 2a 2 a3 sb 2b 3
2b1b2 s a1a 2 2b1b3 s a1a 3 2b2 b3 s a 2 a 3 , (A4)
and
Var(a1b1 a2b2 a3b3 a4b4 ) a12 sb21 a22 sb22 a32 sb23 a42 sb24 b12 sa21 b22 sa22
b32 sa23 b42 sa24 2a1a2 sb1b 2 2a1a3sb1b3 2a1a4 sb1b 4 2a2a3sb 2b3 , (A5)
2a2a4 sb 2b 4 2a3a4 sb3b 4 2b1b2 sa1a 2 2b1b3sa1a 3 2b1b4 sa1a 4
c 2b2b3sa 2 a 3 2b2b4 sa 2 a 4 2b3b4 sa 3a 4
where sa1a2, sa1a3, … sb3b4 are the covariance between the estimated parameters.
90 Thomas Ledermann and Siegfried Macho
Var(a1b12b2 ) a12 b122 sb22 a12 b22 sb212 b122 b22 s a21
(A6)
and
Var(a1b12b2 a1b1 a2 b2 ) a12 sb21 a22 sb22 b12 sa21 b22 sa22 a12b122 sb22
a12b22 sb212 b122 b22 sa21 2a12b12 sb1b 2 2a1b22 sa 2b12 2a1a2b12 sb22 .(A7)
2b1b12b2 sa21 2a1a2 sb1b 2
Mediation models with two initial variables (Figure 1D). The asymptotic
variance for the sum of the indirect effects a1b and a2b is
Var(a1b a2b) a1 a2 sb2 b 2 sa21 sa22 2b 2 sa1a 2 .
2
(A8)
Total Effects
Simple mediation model and models with multiple mediators (Figure 1A-
1C). The asymptotic variances for the total effects ab + c′, a1b1 + a2b2 + c′,
a1b1 + a2b2 + a3b3 + c′, a1b1 + a2b2 + a3b3 + a4b4 + c′, and a1b12b2 + a1b1 + a2b2
+ c′ are
Var(a1b1 a2b2 c' ) a12 sb21 a22 sb22 b12 sa21 b22 sa22
2a1a2 sb1b 2 2b1b2 sa1a 2 2a1sb1c ' 2a2 sb 2c ' sc2' , (A10)
Var(a1b1 a2b2 a3b3 c' ) a12 sb21 a22 sb22 a32 sb23 b12 sa21 b22 sa22
b32 sa23 2a1a2 sb1b 2 2a1a3 sb1b3 2a2 a3 sb 2b3 2b1b2 sa1a 2 , (A11)
2b1b3 sa1a 3 2b2b3 sa 2 a 3 2a1 sb1c ' 2a2 sb 2c ' 2a3 sb 3c ' sc2'
Assessing Mediation 91
Var(a1b1 a2 b2 a3b3 a4b4 c' ) a12 sb21 a22 sb22 a32 sb23 a42 sb24
b12 sa21 b22 sa22 b32 sa23 b42 sa24 2a1a2 sb1b 2 2a1a3 sb1b3
, (A12)
2a1a4 sb1b 4 2a2 a3 sb 2b3 2a2 a4 sb 2b 4 2a3 a4 sb 3b 4
2b1b2 sa1a 2 2b1b3 sa1a 3 2b1b4 sa1a 4 2b2b3 sa 2 a 3 2b2b4 sa 2 a 4
2b3b4 sa 3a 4 2a1sb1c ' 2a2 sb 2c ' 2a3 sb3c ' 2a4 sb 4c ' sc2'
and
Var(a1b12b2 a1b1 a2 b2 c' ) a12 sb21 a22 sb22 b12 sa21 b22 sa22
a12 b122 sb22 a12 b22 sb212 b122 b22 s a21 2a12 b12 sb1b 2 2a1b22 s a 2b12 . (A13)
2a1a2 b12 sb22 2b1b12b2 sa21 2a1a2 sb1b 2 2a1 sb1c '
2a2 sb 2 c ' 2a1b12 sb 2 c ' sc2'
Mediation model with two initial variables (Figure 1D). The asymptotic
variance for the sum of the two total effects (i.e., a1b + c′1 + a2b + c′2) is
Var (a1b c'1 a2b c'2 ) a12 sb2 a22 sb2 b 2 sa21 b 2 sa22 2b 2 sa1a 2 2a1a2 sb2 . (A14)
2a1sbc'1 2a2 sbc'1 2a1sbc'2 2a2 sbc'2 2sc '1c '2 sc2'1 sc2'2
Contrasts
Simple mediation model (Figure 1A). The asymptotic variances for the
contrasts ab – c′ (see MacKinnon, 2000, p. 151) is
Var(a1b1 a2 b2 c' ) a12 sb21 a22 sb22 b12 s a21 b22 sa22
, (A16)
2a1a2 sb1b 2 2b1b2 sa1a 2 2a1 sb1c ' 2a2 sb 2 c ' sc2'
92 Thomas Ledermann and Siegfried Macho
Var(a1b1 a2b2 a3b3 c' ) a12 sb21 a22 sb22 a32 sb23 b12 sa21 b22 sa22 b32 sa23
2a1a2 sb1b 2 2a1a3 sb1b3 2a2 a3 sb 2b3 2b1b2 sa1a 2 , (A17)
2b1b3 sa1a 3 2b2b3 sa 2 a 3
2a1sb1c ' 2a2 sb 2c ' 2a3 sb 3c ' sc2'
and
Var (a1b12b2 c' ) a12b22 sb212 a12b122 sb22 b122 b22 sa21 2a1b12 sb 2c ' sc2' , (A19)
and
Vara1b12b2 a1b2 a 2 b2 c' a12 s b21 a 22 s b22 b12 s a21 b22 s a22
a12 b122 s b22 a12 b22 s b212 b122 b22 s a21 . (A20)
2a b s
2
1 12 b1b 2 2a b s 2
1 2 a 2b12 2a1 a b s 2
2 12 b 2 2b b b s 2
1 12 2 a1
Vara1b1 a2b2 a12 sb21 a22 sb22 b12 sa21 b22 sa22 2a1a2 sb1b 2 2b1b2 sa1a 2 (A21)
Assessing Mediation 93
and
Var(a1b a2 b) a1 a 2 sb2 b 2 s a21 s a22 2b 2 sa1a 2 .
2
(A22)
Contrasting two total effects in models with two initial variables (Figure
1D). The asymptotic variance for the contrast a1b + c′1 – (a2b + c′2) is
Model A
a1 b1
Pin P1 Pout
Model B
a1 P1 b12 b2
Pin P2 Pout
Figure B1.Phantom models for assessing indirect effects. Model A: Simple indirect
effect a1b1. Model B: Three-path indirect effect a1b12b2.
94 Thomas Ledermann and Siegfried Macho
specific effect, the bootstrap confidence limit of the phantom model‟s total
effect is estimated. We note that in contrast to the equations for the standard
errors, phantom models do not depend whether the residual covariances in the
main model are constrained or freely estimated.
Phantom models for assessing specific contrasts are shown in Figure B2
and B3. The models of Figure B2 are designed to assess the difference
between a specific indirect effect and the direct effect c′. Model B2A contrasts
Model A
P1
a1 b1
Pin c′ -1 Pout
P2
Model B
P3
a3 b3
P2
a2 b2
a1 P1 b1
Pin Pout
c′ -1
P4
Model C
P1 b12
P2
a1 b2
c′ -1
Pin P3 Pout
Model D
P1 b12
P2
a1 P3 b2
a1 b1
a2 P4 b2
Pin c′ -1 Pout
P5
Figure B2. Phantom models for assessing contrasts involving the direct effect c′.
Model A: Contrast ab – c′. Model B: Contrast a1b1 + a2b2 + a3b3 – c′. Model C:
Contrast a1b12b2 – c′. Model D: Contrast a1b12b2 + a1b12b2 + a2b2 – c′.
Assessing Mediation 95
the simple indirect effect a1b1 with c′. Model B2B tests a1b1 + a2b2 + a3b3 – c′,
Model B2C a1b12b2 – c′, and Model B2D a1b1b2 + a1b1 + a2b2 – c′. For
mediation models containing fewer or more simultaneous mediators the
phantom model B2B can readily be adapted: Excluding a phantom variable,
say P3, with the connecting paths a3 and b3 yields a phantom model with two
simultaneous mediators; adding a new phantom variable, say P4, connected by
the paths a4 and b4 results in a phantom model with four simultaneous
mediators. The models of Figure B3 enable the comparison of two simple
indirect effects and two total effects: Model B3A tests a1b1 – a2b2, Model B3B
b(a1 – a2), and Model B3C a1b + c′1 – (a2b + c′2).
Model A
a1 b1
Pin P1 Pout
a2 -1
b2
P2 P3
Model B
a1 b
Pin P1 Pout
a2 -1
P2 b
P3
Model C
a1 P1 b
Pin c′1 Pout
a2 c′2 -1
b
P2 P3
Figure B3.Phantom model for contrasting simple indirect effects and total effects.
Model A: Contrast a1b1 – a2b2. Model B: Contrast a1b – a2b. Model C: Contrast a1b +
c′1 – (a2b + c′2).
96 Thomas Ledermann and Siegfried Macho
Estimating a model with two initial variables, a researcher may also wish
to assess the sum of the two indirect effects and the sum of the two total
effects. This can be achieved by changing in Model B3B and B3C the path P3
Pout from -1 to 1. The total effects are then b(a1 + a2) and a1b + c′1 + a2b +
c′2, respectively.
AUTHOR NOTE
The research used three data sets: Quality of American Life 1978; the
Quality of Employment Survey 1977; and Marital Instability Over the Life
Course: A six-Wave Panel Study, 1980, 1983, 1988, 1992-1994, 1997, 2000.
The data collected by Angus Campbell and Philip Converse; Robert Quinn and
Staines Graham; and Alan Booth, David Johnson, Paul Amato, and Stacy
Rogers were made available through Ann Arbor, MI: Inter-university
Consortium for Political and Social Research.
Supplementary material is available at http://thomasledermann.com
/mscm/.
We thank David A. Kenny for helpful comments on an earlier version of
this chapter.
REFERENCES
Arbuckle, J. L. (1995–2012). IBM SPSS Amos 20 user's guide. Chicago, IL:
SPSS Inc.
Baron, R. M. & Kenny, D. A. (1986). The moderator-mediator variable
distinction in social psychological research: Conceptual, strategic, and
statistical considerations. Journal of Personality and Social Psychology,
51, 1173-1182.
Bentler, P. M. (2000-2008). EQS 6 structural equations program manual.
Encino, CA: Multivariate Software.
Bishop, Y. M. M., Feinberg, S. E. & Holland, P. W. (1975). Discrete
multivariate analysis: Theory and practice. Cambridge, MA: MIT Press.
Boker, S., Neale, M., Maes, H., Wilde, M., Spiegel, M., Brick, T., . . . Fox, J.
(2011). OpenMx: An open source extended structural equation modeling
framework. Psychometrika, 76, 306-317.
Assessing Mediation 97
Kruschke, J. K., Aguinis, H. & Joo, H. (2012). The time has come: Bayesian
methods for data analysis in the organizational sciences. Organizational
Research Methods, 15, 722-752.
Lau, R. S. & Cheung, G. W. (2012). Estimating and comparing specific
mediation effects in complex latent variable models, Organizational
Research Methods, 15, 3-16.
Ledermann, T. & Macho, S. (2009). Mediation in dyadic data at the level of
the dyads: A Structural Equation Modeling approach. Journal of Family
Psychology, 23, 661-670.
Ledermann, T., Macho, S. & Kenny, D. A. (2011). Assessing mediation in
dyadic data using the actor-partner interdependence model. Structural
Equation Modeling, 18, 595-612.
Lee, D. & Hershberger, S. (1990). A simple rule for generating equivalent
models in covariance structure modeling. Multivariate Behavioral
Research, 25, 313-334.
Maassen, G. D. & Bakker, A. B. (2001). Suppressor variables in path models:
Definitions and interpretations. Sociological Methods and Research, 30,
241-270.
MacCallum, R. C., Wegener, D. T., Uchiono, B. N. & Fabrigar, L. R. (1993).
The problem of equivalent models in applications of covariance structure
analysis. Psychological Bulletin, 114, 185-199.
Macho, S. & Ledermann, T. (2011). Estimating, testing, and comparing
specific effects in structural equation models: The phantom model
approach. Psychological Methods, 16, 34-43.
MacKinnon, D. P. (2000). Contrasts in multiple mediator models. In J. Rose,
L. Chassin, C. C. Presson & S. J. Sherman (Eds.), Multivariate
applications in substance use research: New methods for new questions
(141-160). Mahwah, NJ: Lawrence Erlbaum.
MacKinnon, D. P. (2008). Introduction to statistical mediation analysis. New
York: Erlbaum.
MacKinnon, D. P., Krull, J. L. & Lockwood, C. M. (2000). Equivalence of the
mediation, confounding and suppression effect. Prevention Science, 1,
173-181.
MacKinnon, D. P., Lockwood, C. M. & Williams, J. (2004). Confidence limits
for the indirect effect: Distribution of the product and resampling
methods. Multivariate Behavioral Research, 39, 99-128.
MacKinnon, D. P. & Pirlott, A. G. (2015). Statistical approaches for enhancing
causal interpretation of the M to Y relation in mediation analysis.
Personality and Social Psychology Review, 19, 30-43.
100 Thomas Ledermann and Siegfried Macho
attitudes, 59, 67
A
CEE, 3 criticism, 42
Chicago, 39, 62, 64, 68, 96, 98 crowds, 4
child development, 64 CT, 64, 67
child poverty, 40, 60 culture, 37, 65, 68
children, 36, 71, 97 curriculum, 61
China, vii, 1, 4, 5, 6, 7, 10, 14, 19, 21, 24, customers, 11
25, 26, 29, 32, 34
Chinese firms, 4, 5, 19, 24, 29
Chinese government, 5, 24 D
City, 63
data analysis, 99
clarity, 68
data set, 3, 4, 14, 25, 42, 47, 48, 70, 96
classes, vii, 35, 42, 43, 44, 45, 46, 49, 50,
database, 41, 47, 60
51, 52, 53, 54, 58, 66
debts, 30
classification, 42, 43, 45, 46, 50, 52, 53, 54,
delinquency, 64
56, 58, 59
Delta, 88, 100
classroom, 38, 47, 49, 65
Department of Education, 41, 50, 51, 63, 67
classroom management, 47
depreciation, 8
climate, vii, 35, 36, 37, 38, 39, 40, 41, 42,
depression, 38
47, 48, 50, 51, 52, 53, 56, 58, 59, 60, 61,
derivatives, 88
62, 63, 64, 65, 66, 67, 68
developed countries, 4, 19, 20, 24, 29
cluster analysis, 41, 42, 43, 54, 62, 64, 66,
developing countries, 3, 19
68
deviation, 11, 75
cluster model, 46
disclosure, 6
clustering, 42, 43, 44, 50, 62
disorder, 63, 64
collaboration, 36, 41
disposition, 39
collateral, 7, 8, 10, 26
dissatisfaction, 54
commercial, 10
distress, 11, 24
commercial bank, 10
distribution, 14, 15, 25, 43, 44, 46, 50, 58,
communication, 62
78, 101
community(s), 37, 39, 60, 61
distribution function, 78
complexity, 9
complications, 4
computation, 50 E
computer, 43
computing, 16, 77 earnings, 9, 11, 21, 24
conflict, 97 Eastern Europe, 3
construction, 3 ecology, 37
consumption, 11, 74 education, 68
controversial, 7, 10, 36 educational research, 100
cooperation, 38 educational system, 61
corporate finance, 2 educators, 38, 62
correlation, 100 elementary school, vii, 35, 39, 42, 47, 51,
cost, 8, 9, 10, 11 52, 53, 54, 58, 59
creditors, 21 emerging markets, 5
criminal acts, 36 empirical studies, 8, 14
Index 105
energy, 74
enforcement, 40, 49
G
entropy, 46, 52, 53, 54, 62
Germany, 2
environment(s), 3, 4, 37, 38, 47, 49, 54, 59,
governments, 6
61, 64
GPA, 39
EPR, 8
grades, 39, 40, 47, 67
EPS, 9, 12, 17, 18, 22, 26
group membership, 41, 45
equity, 5, 7, 9, 10, 11, 13, 19, 20, 21, 29
grouping, 4, 42
equity market, 5
growth, 5, 6, 8, 21, 26, 29, 57, 66
estimation problems, 78
ethnicity, 37
Europe, 3 H
European Union (EU), 3, 32, 33
evidence, 3, 8, 20, 41, 45, 84, 87 happiness, 85, 86, 87
examinations, viii, 36 health, 63, 67, 68, 72, 98
experimental design, 87 health care, 98
external financing, 11 health education, 68
high school, 49, 63, 65, 98
history, 41
F homogeneity, 44
Hong Kong, 1, 30
factor analysis, 4, 40, 46, 48
House, 65
families, 62
human, 37
family life, 83, 84, 85
hypothesis, 2, 8, 9, 21, 53, 59
family system, 97
hypothesis test, 53
fear, 10
financial, 3, 5, 7, 9, 10, 11, 14, 16, 20, 24,
28 I
financial condition, 28
financial crisis, 24 identification, 16, 100
financial distress, 11, 24 improvements, 38, 39, 40
financial institutions, 5 income, 6, 9, 13, 59, 85, 86, 87
financial sector, 5, 10 income tax, 6, 13
Finland, 68 independence, 44
firm size, 25, 28, 29, 30 independent variable, 71, 98
flexibility, 51 indirect effect, viii, 69, 70, 71, 73, 74, 75,
food, 74 76, 77, 78, 79, 80, 84, 85, 87, 89, 90, 91,
force, 84 92, 93, 94, 95, 96, 97, 99, 100, 101
formula, 82, 83, 86, 89 individuals, 43, 45
France, 2 industrial sectors, 14
free trade, 6 industry(s), 3, 5, 14, 20, 25, 26, 28, 29, 30
freedom, 38 inefficiency, 4
funding, 60 insider trading, 6
funds, 11 institutions, 5, 10
interdependence, 99
interest rates, 10
106 Index
prevention, 41, 64
N prior knowledge, 48
private firms, 20, 25, 28, 30
NAEP, 40
private sector, 4, 5
National Center for Education Statistics, 63
probability, 11, 43, 44, 45, 46, 54, 56, 59
negative affectivity, 98
probability distribution, 43
negative effects, 21, 28
problem behavior, 39
negative mood, 74
profit, 9
negative relation, 11, 39
profit margin, 9
net profit margin, 9
profitability, 6, 9, 21, 26, 29
New Zealand, 31
programming, 63
No Child Left Behind, 36
project, 64
normal distribution, 15
property rights, 29
normative behavior, 37
psychological stress, 72
psychology, 62, 98, 101
O psychosocial climate, 68
public schools, 36
obstacles, 60
operations, 28
opportunities, 8, 29
Q
ownership, 5, 6, 10, 24, 25, 26, 29
qualifications, 36
ownership structure, 5, 6, 10, 25, 26, 29
quality of life, 80
quick ratio, 10
P
R
parallel, viii, 69, 70
parameter estimates, 20, 25, 46, 75, 88, 89
reading, 39, 40
parental involvement, 40, 67
reality, 30
parents, vii, 35, 36, 37, 38, 40, 41, 47, 48,
recommendations, 101
49, 59, 61
reform, 5, 10, 40, 61, 62, 98
participants, 39
regression, 4, 14, 15, 20, 25, 40, 50, 79, 101
path model, 99
regression analysis, 4, 14, 25
pathways, 73, 88
regression equation, 14
performance indicator, 58
regression model, 25
personality, 37
regulations, 5
Philadelphia, 63, 66
reputation, 21
physical environment, 49
requirements, viii, 10, 41, 51, 69, 87
physical health, 72
researchers, 5, 14, 37, 39, 41, 42, 43, 44, 46,
pluralism, 63
48, 50, 61, 70, 75, 97
policy, vii, 6, 11, 24, 35, 36, 61, 62
residuals, 71
policy makers, vii, 35, 36, 61
resources, 4
population, 43, 78
response, 16, 41
positive relationship, 8, 28
restrictions, 44
poverty, vii, 35, 39, 40, 41, 42, 50, 52, 58,
retained earnings, 9
59, 60, 61, 65, 66
108 Index
tax system, 24
teachers, vii, 35, 36, 37, 38, 39, 40, 41, 42,
V
47, 48, 49, 50, 56, 58, 59, 60, 61, 65
validation, 62, 63
techniques, 16, 40, 43, 79
variables, vii, viii, 13, 14, 15, 16, 25, 35, 36,
technology, 6
37, 38, 41, 42, 43, 44, 46, 50, 53, 54, 56,
test anxiety, 62
57, 59, 60, 62, 69, 70, 71, 72, 74, 75, 76,
test scores, 38, 39, 41, 47, 51, 57
77, 78, 79, 80, 84, 85, 87, 88, 90, 91, 93,
testing, vii, viii, 69, 70, 73, 75, 77, 78, 88,
96, 97, 98, 99, 100
98, 99, 100, 101
variance-covariance matrix, 44, 88
textbooks, 49
vector, 15, 44, 88
third dimension, 49
Vietnam, 3, 33
thoughts, 47
violence, 41, 64
trade, 2, 3, 6, 8, 9, 11, 13, 21, 24
volatility, 11, 24
trade-off, 2, 3, 8, 9, 11, 13, 21, 24
voting, 10
training, 60
trajectory, 66
transaction costs, 9 W
transformation, 5
turnover, 41, 59, 61 waiver, 51, 52, 58
Washington, 63, 98, 101
web, 36, 66
U weight loss, 74
workers, 11, 84, 98
uniform, 7
working conditions, 84
unique features, 5
workplace, 65
United, 2, 3, 31, 32, 36, 63
United Kingdom (UK), 2, 31, 33, 68
United States, 2, 3, 32, 36, 63 Y
urban, 40, 65, 67
urbanicity, 40 yield, 7
USA, 63