Methods and Applications of Longitudinal Data Analysis
By Xian Liu
About this ebook
Methods and Applications of Longitudinal Data Analysis describes methods for the analysis of longitudinal data in the medical, biological, and behavioral sciences. It introduces basic concepts and functions, including a variety of regression models and their practical applications across many areas of research. Statistical procedures featured within the text include:
- descriptive methods for delineating trends over time
- linear mixed regression models with both fixed and random effects
- covariance pattern models on correlated errors
- generalized estimating equations
- nonlinear regression models for categorical repeated measurements
- techniques for analyzing longitudinal data with non-ignorable missing observations
Emphasis is given to applications of these methods, using substantial empirical illustrations, designed to help users of statistics better analyze and understand longitudinal data.
Methods and Applications of Longitudinal Data Analysis equips both graduate students and professionals to confidently apply longitudinal data analysis to their particular discipline. It also provides a valuable reference source for applied statisticians, demographers and other quantitative methodologists.
- From novice to professional: this book starts with the introduction of basic models and ends with the description of some of the most advanced models in longitudinal data analysis
- Enables students to select the correct statistical methods to apply to their longitudinal data and avoid the pitfalls associated with incorrect selection
- Identifies the limitations of classical repeated measures models and describes newly developed techniques, along with real-world examples
Methods and Applications of Longitudinal Data Analysis
Xian Liu
Uniformed Services University of the Health Sciences
DoD Deployment Health Clinical Center, Defense Centers of Excellence, Walter Reed National Military Medical Center
Table of Contents
Cover
Title page
Copyright
Biography
Preface
Chapter 1: Introduction
Abstract
1.1. What is longitudinal data analysis?
1.2. History of longitudinal analysis and its progress
1.3. Longitudinal data structures
1.4. Missing data patterns and mechanisms
1.5. Sources of correlation in longitudinal processes
1.6. Time scale and the number of time points
1.7. Basic expressions of longitudinal modeling
1.8. Organization of the book and data used for illustrations
Chapter 2: Traditional methods of longitudinal data analysis
Abstract
2.1. Descriptive approaches
2.2. Repeated measures ANOVA
2.3. Repeated measures MANOVA
2.4. Summary
Chapter 3: Linear mixed-effects models
Abstract
3.1. Introduction of linear mixed models: three cases
3.2. Formalization of linear mixed models
3.3. Inference and estimation of fixed effects in linear mixed models
3.4. Trend analysis
3.5. Empirical illustrations: application of two linear mixed models
3.6. Summary
Chapter 4: Restricted maximum likelihood and inference of random effects in linear mixed models
Abstract
4.1. Overview of Bayesian inference
4.2. Restricted maximum likelihood estimator
4.3. Computational procedures
4.4. Approximation of random effects in linear mixed models
4.5. Hypothesis testing on variance component G
4.6. Empirical illustrations: linear mixed models with REML
4.7. Summary
Chapter 5: Patterns of residual covariance structure
Abstract
5.1. Residual covariance pattern models with equal spacing
5.2. Residual covariance pattern models with unequal time intervals
5.3. Comparison of covariance structures
5.4. Scaling of time as a classification factor
5.5. Least squares means, local contrasts, and local tests
5.6. Empirical illustrations: estimation of two linear regression models
5.7. Summary
Chapter 6: Residual and influence diagnostics
Abstract
6.1. Residual Diagnostics
6.2. Influence Diagnostics
6.3. Empirical Illustrations on Influence Diagnostics
6.4. Summary
Chapter 7: Special topics on linear mixed models
Abstract
7.1. Adjustment of baseline response in longitudinal data analysis
7.2. Misspecification of the assumed distribution of random effects
7.3. Pattern-mixture modeling
7.4. Summary
Chapter 8: Generalized linear mixed models on nonlinear longitudinal data
Abstract
8.1. A brief overview of generalized linear models
8.2. Generalized linear mixed models and statistical inferences
8.3. Methods of estimating parameters in generalized linear mixed models
8.4. Nonlinear predictions and retransformation of random components
8.5. Some popular specific generalized linear mixed models
8.6. Summary
Chapter 9: Generalized estimating equations (GEEs) models
Abstract
9.1. Basic specifications and inferences of GEEs
9.2. Other GEE approaches
9.3. Relationship between marginal and random-effects models
9.4. Empirical illustration: effect of marital status on disability severity in older Americans
9.5. Summary
Chapter 10: Mixed-effects regression model for binary longitudinal data
Abstract
10.1. Overview of Conventional Logistic and Probit Regression Models
10.2. Specification of Random Intercept Logistic Regression Model
10.3. Specification of Random Coefficient Logistic Regression Model
10.4. Inference of Mixed-Effects Logit Model
10.5. Approximation of Variance for Predicted Response Probability
10.6. Interpretability of Regression Coefficients and Odds Ratios
10.7. Computation of Conditional Effect and Conditional Odds Ratio for a Covariate
10.8. Empirical Illustration: Effect of Marital Status on Probability of Disability Among Older Americans
10.9. Summary
Chapter 11: Mixed-effects multinomial logit model for nominal outcomes
Abstract
11.1. Overview of multinomial logistic regression model
11.2. Mixed-effects multinomial logit models and nonlinear predictions
11.3. Estimation of fixed and random effects
11.4. Approximation of variance–covariance matrix on probabilities
11.5. Conditional effects of covariates on probability scale
11.6. Empirical illustration: marital status and longitudinal trajectories of disability and mortality among older Americans
11.7. Summary
Chapter 12: Longitudinal transition models for categorical response data
Abstract
12.1. Overview of two-time multinomial transition modeling
12.2. Longitudinal transition models with only fixed effects
12.3. Mixed-effects multinomial logit transition models
12.4. Empirical illustration: predicted transition probabilities in functional status and marital status among older Americans
12.5. Summary
Chapter 13: Latent growth, latent growth mixture, and group-based models
Abstract
13.1. Overview of structural equation modeling
13.2. Latent growth model
13.3. Latent growth mixture model
13.4. Group-based model
13.5. Empirical illustration: effect of marital status on ADL count among older Americans revisited
13.6. Summary
Chapter 14: Methods for handling missing data
Abstract
14.1. Mathematical definitions of MCAR, MAR, and MNAR
14.2. Methods handling missing at random
14.3. Methods handling missing not at random
14.4. Summary
Appendix A: Orthogonal polynomials
Appendix B: The delta method
Appendix C: Quasi-likelihood functions and properties
Appendix D: Model specification and SAS program for random coefficient multinomial logit model on health state among older Americans
References
Subject Index
Copyright
Academic Press is an imprint of Elsevier
125 London Wall, London EC2Y 5AS, UK
525 B Street, Suite 1800, San Diego, CA 92101-4495, USA
225 Wyman Street, Waltham, MA 02451, USA
The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, UK
Copyright © 2016 Higher Education Press. Published by Elsevier Inc. All rights reserved.
No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions.
This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein).
Notices
Knowledge and best practice in this field are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary.
Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility.
To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein.
British Library Cataloguing-in-Publication Data
A catalogue record for this book is available from the British Library
Library of Congress Cataloging-in-Publication Data
A catalog record for this book is available from the Library of Congress
ISBN: 978-0-12-801342-7
For information on all Academic Press publications visit our website at http://store.elsevier.com/
Biography
Xian Liu obtained a PhD degree in Sociology with specialization in Demography from the Population Studies Center, the Institute for Social Research at the University of Michigan, in 1991. Currently, he is Professor of Research in the Department of Psychiatry and Senior Scientist at the Center for the Study of Traumatic Stress, F. Edward Hebert School of Medicine at Uniformed Services University of the Health Sciences. He also serves as Research Scientist/Senior Statistician in the Deployment Health Clinical Center, Defense Centers of Excellence, at Walter Reed National Military Medical Center. His areas of expertise include longitudinal analysis in health research, survival analysis, aging and health, and the development of advanced statistical models in behavioral and medical studies. His articles have appeared in many leading scientific journals across various fields and are widely cited both in the United States and internationally. Some of Dr Liu’s papers on longitudinal modeling and survival analysis have been used as teaching materials at prominent research institutions. His book, Survival Analysis: Models and Applications, was published jointly by Wiley and the Chinese Higher Education Press in 2012. An internationally recognized scientist, Dr Liu has received several awards for his work and has served as Principal Investigator on a number of externally funded research projects. He has given numerous presentations, seminars, and special talks on related issues to experts and professionals specializing in longitudinal data analysis and survival analysis.
Preface
In recent decades, longitudinal data analysis has become a topic of tremendous interest to statisticians, demographers, policymakers, and insurance companies. Scientifically, it encompasses a wide array of disciplines that include the biomedical, gerontological, behavioral, and social sciences. The aim of this book is to provide a comprehensive account of the most relevant methods and techniques applied in longitudinal data analysis, without concentrating on any single field of application. Therefore, this book is expected to appeal to a wide variety of disciplines. Given my multidisciplinary background in training and research, the book covers models and methods applied in biomedicine, biostatistics, demography, psychology, sociology, and epidemiology, with practical examples associated with each of those disciplines. Linear mixed models have seen widespread application in longitudinal data analysis, and accordingly five chapters are devoted to this statistical perspective. In light of the growing popularity of nonlinear models in empirical research, particularly in the behavioral and social sciences, I also describe a variety of nonlinear mixed models in considerable detail. Regression modeling, linear and nonlinear predictions, and computer programming associated with different phases of longitudinal data analysis are extensively covered.
A large number of statistical models and methods are presented, starting with the most basic specifications and ending with some of the most advanced techniques applied in longitudinal data analysis. With a considerable volume of empirical illustrations, I attempt to make the transition from the introductory to the advanced levels as coherent and smooth as possible. For almost every major method or model, I include step-by-step instructions to demonstrate for the readers how to perform the technique on their own. Most methods described are supplemented with empirical practices, computing programs and output, and detailed interpretations of analytic results.
My hope is that scientists, professors, and other professionals of various disciplines can benefit from this book, whether as a reference, as a textbook in graduate courses, or in any other use that requires and promotes appropriate longitudinal data modeling. This hope stems from my observation that many researchers are using incorrect models and methods to perform longitudinal analysis, particularly in handling nonlinear longitudinal data.
Given the focus on the application and practice in this book, the audience includes professionals, academics, and graduate students who have some experience in longitudinal data analyses. Numerous illustrations on various topics permit professionals to learn new models and methods or to improve their professional skills for performing longitudinal data analysis. As it covers a wide scope of methods and techniques to measure change, from the introductory to the advanced, this book is useful as a reference book for planners, researchers, and professors who are working in settings involving change. Scientists interested in studying the pattern of change over time should find it a useful guidebook for informing the appropriate models and methods for analyzing longitudinal data in their projects.
Graduate students of various disciplines constitute another important component of the audience. Social science students can benefit from the application of the concepts and methods of longitudinal data analysis to solve problems in sociology, population studies, economics, psychology, geography, and political science. This book offers a useful framework and practical examples of applied social science, especially at a time when more questions about change are being raised. The accessibility of many large-scale longitudinal datasets in the public domain will help interested students practice the models and methods learned from this book. Graduate students of biology, medicine, and public health, who are interested in doing research in their future careers, can learn a rich variety of techniques from this book to perform mathematical simulation, clinical trials, and competing risks analyses on health, healthcare, mortality, and disease. Longitudinal data analysis and related courses have been recognized as essential components of graduate training in demography, psychology, epidemiology, and some biomedical departments. For example, in medical schools this book should hold considerable appeal for medical students who want to know how to adequately analyze data from a randomized controlled clinical trial to understand the effectiveness of a new medical treatment or a new medicine on disease.
If the reader intends to understand the entire body of methods and techniques covered by this book, the prerequisites include calculus, matrix algebra, and generalized linear modeling. Those not familiar with this material may want to skip the detailed mathematical and statistical descriptions and focus on the empirical illustrations and computer programming skills. With such a concentration, they can still command the skills to apply the various models and methods effectively, thereby adding new dimensions to their professional, research, or teaching activities. Therefore, this book can be read selectively by readers who are not well versed in higher-level mathematics and statistics.
The reviewers for this book include Kenneth Land, Duke University; David Swanson, University of California at Riverside; and several anonymous reviewers. Additionally, a number of colleagues and friends have enriched, supported, and refined the intellectual development of this book, including Bradley Belsher, James Edward McCarroll, Jichuan Wang, and Chu Zhang. Sincere thanks are given to Jill Tao of the SAS Institute for her help in SAS programming.
I owe special thanks to Charles C. Engel, whose consistent support and help has made the completion of this book possible. The staff of the Deployment Health Clinical Center, Defense Centers of Excellence, Walter Reed National Military Medical Center, provided tremendous dedication, competence, and excellence in the course of the preparation of this book. The assistance of Phoebe McCutchan in editing some of the graphs was vital.
Finally, I would like to thank my wife, Ming Dong, for her support and encouragement throughout the entire period of preparing, writing, and editing this book.
Chapter 1
Introduction
Abstract
Serving as an introduction to the book, Chapter 1 describes the definition, historical background, data features and structures, and other general specifications applied in longitudinal data analysis. The purpose of the chapter is to lead the reader into the realm of longitudinal data analysis by addressing its significance, underlying hypotheses, basic expressions of longitudinal modeling, and existing issues. The presence of missing data and intra-individual correlation are the two primary features of longitudinal data, and their impacts on longitudinal data analysis are therefore presented and discussed. The chapter also summarizes the organization of the book with a chapter-by-chapter description. Given this book’s emphasis on applications and practice, two longitudinal datasets are used for empirical illustrations throughout the text, one from a randomized controlled clinical trial and one from a large-scale longitudinal survey. In Chapter 1, these two datasets are described in detail.
Keywords
Intra-individual correlation
longitudinal data
missing data patterns
multivariate and univariate data structures
pattern of change over time
repeated measurements
Chapter outline
1.1 What is Longitudinal Data Analysis?
1.2 History of Longitudinal Analysis and its Progress
1.3 Longitudinal Data Structures
1.3.1 Multivariate Data Structure
1.3.2 Univariate Data Structure
1.3.3 Balanced and Unbalanced Longitudinal Data
1.4 Missing Data Patterns and Mechanisms
1.5 Sources of Correlation in Longitudinal Processes
1.6 Time Scale and the Number of Time Points
1.7 Basic Expressions of Longitudinal Modeling
1.8 Organization of the Book and Data Used for Illustrations
1.8.1 Randomized Controlled Clinical Trial on the Effectiveness of Acupuncture Treatment on PTSD
1.8.2 Asset and Health Dynamics among the Oldest Old (AHEAD)
1.1. What is longitudinal data analysis?
We live in a dynamic world full of change. A person grows, ages, and dies. During that process, we may contract disease, develop functional disability, and lose mental ability. Accompanying this biological life course, social change also occurs. We attend school, develop a career and retire. In the meantime, many of us experience family disruption, become involved in social activities, cultivate personal habits and hobbies, and make adjustments to our daily activities according to our physical and mental conditions. Indeed, change characterizes almost all aspects of our social lives, ranging from the aforementioned social facets to unemployment, drug use recidivism, occupational careers, and other social events. In these biological and social processes, the gradual changes and developments over a life course reflect a pattern of change over time. More formally, such changes and developments may be referred to as an individual’s trajectory.
In a wider scope, trajectories are also seen in the pattern of change referring to such phenomena as the decaying quality over time of a commercial product or the collapse of a political system in a country. In the field of business management, change in consumer purchasing behavior is generally linked both with individual characteristics and with competing products. In population studies, demographers are concerned with such longitudinal processes as internal and international migration, and intervals between successive births. In cases such as these events and in others, the pattern of change over time can be influenced and determined by various factors, such as genetic predisposition, illness, violence, environment, medical and social advancements, or the like. Therefore, each trajectory can differ significantly among individuals and other observational units, or by the variables that govern the timing and rate of change in a period of time.
Data collected at a single point in time do not suffice to analyze change and its pattern over time. Cross-sectional data, traditionally popular and widely used across the applied sciences, capture only a snapshot of a course and thus cannot reflect change, growth, or development. Aware of the limitations of cross-sectional studies, many researchers have advanced the analytic perspective by examining data with repeated measurements. By measuring the same variable of interest repeatedly at a number of time points, change is displayed, its pattern over time is revealed, and constructive findings are derived with regard to the significance of change. Data with repeated measurements are referred to as longitudinal data. In many longitudinal designs, subjects are assigned to the levels of a treatment or of other risk factors and followed over a number of time points separated by specified intervals.
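Such repeated measurements can be stored either in a "wide" (multivariate) layout, with one row per subject and one column per occasion, or in a "long" (univariate) layout, with one row per subject-occasion, which is the form most mixed-model software expects. As a minimal sketch (the subject IDs and outcome values below are invented purely for illustration), pandas can reshape between the two:

```python
import pandas as pd

# Hypothetical data: three subjects, each measured at three occasions.
# Wide (multivariate) layout: one row per subject, one column per occasion.
wide = pd.DataFrame({
    "id":   [1, 2, 3],
    "y_t0": [10.1, 12.3, 9.8],
    "y_t1": [10.9, 12.0, 10.4],
    "y_t2": [11.5, 11.8, 10.9],
})

# Long (univariate) layout: one row per subject-occasion.
long = wide.melt(id_vars="id", var_name="time", value_name="y")
long["time"] = long["time"].str.replace("y_t", "").astype(int)
print(long.sort_values(["id", "time"]).to_string(index=False))
```

The long layout makes the time index an explicit variable, so unequally spaced or unbalanced measurement schedules need no special handling in the data structure itself.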
Analyzing longitudinal data poses considerable challenges to statisticians and other quantitative methodologists because of several features inherent in such data. First, and most troublesome, is the presence of missing data in repeated measurements. In a longitudinal survey, observations on the variables of interest are frequently lost. For example, in a clinical trial on the effectiveness of a new medical treatment, patients may be lost to follow-up because of migration or health problems. In a longitudinal observational survey, some baseline respondents may lose interest in participating at subsequent waves. These missing cases may possess unique characteristics and attributes, so that data collected at later time points may bear little resemblance to the sample initially gathered. Second, repeated measurements on the same observational unit are usually related, because average responses vary randomly between individuals or other observational units, with some being fundamentally high and some fundamentally low. Consequently, longitudinal data are clustered within observational units. At the same time, an individual’s repeated measurements may respond to a time-varying, systematic process, resulting in serial correlation. Third, longitudinal data are generally ordered by time, at either equal or unequal intervals, with each scenario calling for a specific analytic approach. Even with an equal-spacing design, some respondents may enter a follow-up investigation after the specified survey date, which in turn imposes unequal intervals for different individuals.
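The second feature, clustering within observational units, can be quantified by the intraclass correlation (ICC): the share of total variance that lies between subjects rather than within them. The following sketch simulates repeated measures with subject-specific random intercepts and estimates the ICC with the classical one-way ANOVA moment estimator; all parameter values (200 subjects, 4 occasions, the variance components) are illustrative only:

```python
import numpy as np

rng = np.random.default_rng(42)
n_subjects, n_times = 200, 4
sigma_b, sigma_e = 2.0, 1.0    # between- and within-subject standard deviations

# Each subject's measurements share a random intercept b_i, inducing correlation.
b = rng.normal(0.0, sigma_b, size=n_subjects)
y = b[:, None] + rng.normal(0.0, sigma_e, size=(n_subjects, n_times))

# One-way ANOVA (moment) estimator of the ICC.
grand = y.mean()
ms_between = n_times * ((y.mean(axis=1) - grand) ** 2).sum() / (n_subjects - 1)
ms_within = ((y - y.mean(axis=1, keepdims=True)) ** 2).sum() / (n_subjects * (n_times - 1))
icc = (ms_between - ms_within) / (ms_between + (n_times - 1) * ms_within)

true_icc = sigma_b ** 2 / (sigma_b ** 2 + sigma_e ** 2)
print(f"estimated ICC = {icc:.2f}, true ICC = {true_icc:.2f}")
```

With these values the true ICC is 0.80, meaning most of the outcome variance is between subjects; ignoring this clustering by treating the 800 measurements as independent would badly understate the standard errors.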
Over the years, scientists have developed a variety of statistical models and methods to analyze longitudinal data. Most of these advanced techniques are built upon biomedical and psychological settings, and therefore, these methodologically advanced techniques are relatively unfamiliar to researchers of other disciplines. To date, many researchers still use incorrect statistical methods to analyze longitudinal data without paying sufficient attention to the unique features of longitudinal data. For these researchers, the advanced models and methods developed specifically for longitudinal data analysis can be readily borrowed for use after careful verification, evaluation, and modification. In health and aging research, for example, the pattern of change in health status is generally the main focus. In analyzing such longitudinal courses, failure to use correct, appropriate methods can result in tremendous bias in parameter estimates and outcome predictions. In these areas, the application of advanced models and methods is essential.
1.2. History of longitudinal analysis and its progress
There were some vague, sporadic discussions about the theory of random effects and growth as early as the nineteenth century (Gompertz, 1820; Ware and Liang, 1996). The year 1918 witnessed the advent of the earliest repeated measures analysis when Fisher (1918) published the celebrated article on the analysis of variance (ANOVA). In this historical masterpiece, Fisher introduced variance-component models and the concept of intraclass correlation.
Some later works extended Fisher’s approach to the domain of mixed modeling with the developments of such concepts as the split-plot design and the multilevel ANOVA (Yates, 1935; Jackson, 1939). For a long period of time, these variance decomposition methods were the major statistical tool to analyze repeated measurements. Though simplistic in many ways, the advancement of these early works provided a solid foundation for the advancement of the modern mixed modeling techniques. Around the same period, there were also some early mathematical formulations of trajectories to analyze the pattern of change over time in biological and social research (Baker, 1954; Rao, 1958; Wishart, 1938; see the summary in Bollen and Curran, 2006, Chapter 1). Until the early 1980s, however, longitudinal data analysis was largely restricted within the formulation of the classical repeated measures analysis traditionally applied in biomedical settings.
Given the substantial limitations and constraints in the traditional approaches in repeated measures analysis, many methodologists expressed grave concerns regarding how to measure and analyze the pattern of change over time correctly (see the summary in Singer and Willett, 2003, Chapter 1). Over the past 30 years, longitudinal data analysis has grown tremendously as a consequence of the rapid developments in mixed-effects modeling, multilevel analysis, and individual growth perspectives. Accompanying these developments in statistical models and methods are the equally important advancements in computer science, particularly the powerful statistical software packages. The convenience of using computer software packages to create and utilize complex statistical models has made it possible for many scientists to analyze longitudinal data by applying complex, efficient statistical methods and techniques, once considered impossible to accomplish (Singer and Willett, 2003).
As applications of various statistical techniques to longitudinal data have grown, methodological innovation has accelerated at an unprecedented pace over the past three decades. The advent of modern mixed-effects modeling and of various approaches for the analysis of longitudinal data triggered the advancement of a large number of statistical models and methods, characterized by the complex procedures of multivariate regression. The major contribution of mixed-effects models, with their capability of containing both fixed and random effects, is the provision of a flexible statistical approach to model the autoregressive process involved in the trajectories of individuals, both for average change across time and for change in each observational unit. Given such a powerful perspective, both measurable covariates and unobservable characteristics can be incorporated in the model simultaneously, thereby deriving more reliable analytic results for the description of a longitudinal process. Being robust to missing data in general circumstances, mixed-effects models also have the added advantage of permitting irregularly spaced measurements across time.
More recently, a variety of Bayes-type approximation methods have been advanced to estimate parameters in the analysis of longitudinal data characterized by nonlinear functions such as proportions and counts. Given their flexibility in modeling nonnormal outcome data, these approximation techniques have enabled researchers to estimate random effects with complex structures and to perform nonlinear predictions correctly. To date, the various approximation methods have been applied by statisticians and other quantitative methodologists to develop more statistically refined longitudinal models, thus expanding the capacity of mixed-effects modeling in longitudinal data analysis to a new dimension. Along a different track, some other methodologists have advanced growth curve modeling by introducing latent factors and/or latent classes within the framework of structural equation modeling (SEM).
1.3. Longitudinal data structures
Methodologically, longitudinal data can be regarded as a special case of the classical repeated measures data of individuals that are collected and applied in experimental studies. Strictly speaking, there are some conceptual differences between the two data types. The classical repeated measures data represent a wider concept of data type, as they sometimes involve a large number of time points and permit changing experimental or observational conditions (West et al., 2007). In contrast, longitudinal data are more specific: they are generally composed of observations for the same subject ordered by a limited number of time points with equally or unequally spaced intervals. Therefore, longitudinal data can be defined as data with repeated measurements at a limited number of time points with predetermined designs on time scale, time interval, and other related conditions. In statistics and econometrics, longitudinal data are often referred to as panel data.
In this section, longitudinal data structures are delineated. I first review the multivariate data format used traditionally in the classical repeated-measures analysis and applied presently in latent growth modeling. Second, the univariate data structure is introduced, in which an individual or an observational unit (such as a commercial product) has multiple rows of data to record repeated measurements at a limited number of time points. Lastly, balanced and unbalanced longitudinal data are defined and described.
1.3.1. Multivariate data structure
The classical repeated measures data are predominantly used in the ANOVA in experimental studies. Traditionally, the data structure for repeated measures ANOVA follows a multivariate format. In this data structure, each subject has only a single row of data, with repeated measurements recorded horizontally. That is, a column is assigned to the measurement at each time point in the data matrix. To illustrate the multivariate data structure, I provide an example using the repeated measures data of the Randomized Controlled Clinical Trial on the Effectiveness of Acupuncture Treatments on PTSD (posttraumatic stress disorder), which will be described extensively in Section 1.7. The response variable is the PTSD Checklist (PCL) score, a 17-item summary scale gauging the severity of PTSD symptoms, measured at four time points. The value range of the PCL score is from 17 to 85. In the multivariate data format, the repeated measurements for each subject are specified as four outcome variables lined up in the same row, with time points indicated as suffixes attached to the variable name. Additionally, two covariates are included in the dataset: Age and Female (male = 0, female = 1). To identify the subject for further analysis, each individual’s ID number is also incorporated. Below is the data matrix for the first five subjects in the multivariate data format.
In Table 1.1, each subject has one row of data with four outcome variables, PCL1–PCL4, the ID number, and the two covariates, Age and Female. Among the five subjects, one person is aged below 30 years, one above 50, and the rest are aged between 38 and 44 years. There are four men and one woman. As all observations for the outcome variable are lined up horizontally in the same row, the multivariate data structure of repeated measurements contains additional columns and is therefore also referred to as the wide table format. Clearly, the cross-sectional data format is a special case of the multivariate structure, with the outcome variable observed at only one time point. The most distinctive advantage of the multivariate data structure is that each subject’s empirical growth record can be visually examined (Singer and Willett, 2003). In Table 1.1, for example, it is easy to summarize each subject’s trajectory by comparing values of the repeated measurements horizontally. Further examination of the pattern of change over time in the response variable can be performed visually. Perhaps due to this convenience, various latent growth models, which constitute an integral part of the literature on longitudinal data analysis, are designed with such a wide table perspective.
Table 1.1
Multivariate Data of Repeated Measurements
There are, however, distinctive disadvantages to the multivariate data structure in performing longitudinal data analysis. First, time is the primary covariate in analyzing the pattern of change over time in the response variable. In the wide table format, the time factor is only indirectly reflected by the suffix attached to each time point; time is not explicitly specified as an independent factor, which makes analysis of the time effect inconvenient. Sometimes, intervals between two successive waves are unequally spaced by design or vary across subjects, and the multivariate data structure obviously cannot reflect such variations in spacing. Second, in longitudinal data analysis the values of some covariates may vary over time, and failure to address the time-varying nature of these predictor variables can result in biased analytic results and erroneous predictions of longitudinal processes. There are some complex, cumbersome ways to specify time-varying covariates within the multivariate data framework; these approaches, however, are not user-friendly and are inconvenient to apply.
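To make the wide table format concrete, the following sketch builds a small multivariate-format dataset in Python with pandas. All values are hypothetical stand-ins consistent with the description of Table 1.1 (five subjects; columns ID, Age, Female, and PCL1–PCL4), not the actual trial data.

```python
import pandas as pd

# Hypothetical wide-format data in the spirit of Table 1.1
# (values are illustrative, not taken from the actual trial).
# Each subject occupies a single row; the four PCL measurements
# are spread across columns PCL1-PCL4, so time is only implicit
# in the column suffixes.
wide = pd.DataFrame({
    "ID":     [1, 2, 3, 4, 5],
    "Age":    [28, 41, 38, 44, 53],
    "Female": [0, 0, 1, 0, 0],
    "PCL1":   [62, 55, 70, 48, 66],
    "PCL2":   [58, 50, 64, 45, 60],
    "PCL3":   [51, 48, 59, 40, 57],
    "PCL4":   [47, 44, 52, 38, 50],
})

# The wide layout makes each subject's trajectory easy to scan
# horizontally, e.g. the per-subject change from baseline to wave 4:
wide["Change"] = wide["PCL4"] - wide["PCL1"]
print(wide)
```

Because each trajectory sits in one row, a per-subject summary such as the baseline-to-final change is a simple column operation; by the same token, time never appears as a variable, which is precisely the limitation noted above.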
1.3.2. Univariate data structure
Given the aforementioned disadvantages in the multivariate data format, much of the modern longitudinal modeling is performed on the basis of data of univariate structure, in which each subject has multiple rows and time is explicitly specified as a primary predictor on the developmental trajectory of individuals. That is, assuming n time points in a longitudinal data analysis, each subject is assigned n rows of data in the dataset (empirically, some of the rows may be empty due to loss of observation at follow-ups). Table 1.2 displays the same data presented in Table 1.1 but using the univariate data structure.
Table 1.2
Univariate Longitudinal Data
In Table 1.2, each subject has four rows of data, with each record corresponding to a specific time point. Therefore, in the univariate data format, each subject is assigned a block of records, rather than a single line. Correspondingly, the construction of a univariate longitudinal dataset is said to be based on a block design. The repeated measurements of the PCL score are now set vertically under the same name, PCL, with the suffixes removed. A new covariate, Time, is added to the data matrix to indicate a specific time point, and a combination of values of the PCL and the Time variables designates the repeated measurements at four time points. Given the four time points, the ID number for each subject is repeated four times in the data matrix. Likewise, as predictors measured at baseline, the same values of Age and Female for a specific subject are also recorded four times. As subject-specific observations are set vertically, fewer columns but more rows are specified than in the multivariate data structure. Therefore, the univariate longitudinal data structure is also referred to as the long table format.
Compared to the multivariate data format, the univariate longitudinal data matrix contains identical information and differs only in structure and in the addition of a time factor. Organizing longitudinal data into the univariate structure, however, offers several distinctive advantages over the multivariate format. First, as time is specified as a direct and explicitly defined predictor variable, the researcher can effectively handle intervals that are unequally spaced between time points and across individuals by setting specific values of time. Second, covariate values for the same individual are recorded vertically in multiple rows, thereby facilitating the specification of time-varying covariates in a straightforward, convenient fashion. Given these strengths, the univariate structure has become the most popular data format in longitudinal data analysis.
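As a sketch of moving between the two layouts, the following uses pandas.wide_to_long to convert a hypothetical wide-format table (same structure as Table 1.1) into the long format of Table 1.2, turning the PCL1–PCL4 suffixes into an explicit Time column.

```python
import pandas as pd

# A minimal sketch (hypothetical values) of converting the wide
# (multivariate) layout into the long (univariate) layout.
wide = pd.DataFrame({
    "ID":     [1, 2],
    "Age":    [28, 41],
    "Female": [0, 0],
    "PCL1":   [62, 55],
    "PCL2":   [58, 50],
    "PCL3":   [51, 48],
    "PCL4":   [47, 44],
})

long_df = (
    pd.wide_to_long(wide, stubnames="PCL", i="ID", j="Time")
      .reset_index()
      .sort_values(["ID", "Time"])
)
# Each subject now occupies a block of four rows; Time is an
# explicit column, and baseline covariates repeat within a block.
print(long_df[["ID", "Time", "PCL", "Age", "Female"]])
```

Once the data are in this long form, unequally spaced or subject-specific measurement times can be represented simply by storing different values in the Time column.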
1.3.3. Balanced and unbalanced longitudinal data
When conducting longitudinal data analysis, the researcher needs to determine whether the data are balanced or unbalanced. In the classical ANOVA model, balanced repeated-measures data indicate an equal number of observations for all possible combinations of factor levels (such as multiple treatment factors or marital status groups), whereas unbalanced data indicate an unequal number of observations. In longitudinal data analysis, researchers usually use a looser definition to distinguish between a balanced and an unbalanced data design by considering the number of time points, timing, and spacing of intervals. When a set of time points is designed to be common to all individuals, the study design is said to be balanced over time. If data are collected exactly in accordance with such a balanced design, with all subjects having the same set of repeated measurements and no missing observations, the resulting longitudinal data are referred to as balanced. A balanced design is regularly applied in clinical experimental studies, in which subjects are randomized into each of the treatments for evaluation with relatively narrow intervals between successive time points. If there is no delayed entry and the follow-up information is complete, such repeated measures data are considered balanced. This definition of balanced data holds for repeated measurements that are either equally or unequally spaced, as long as all individuals are followed at the same intervals.
When the number of time points is not designed to be common to all subjects, the longitudinal data design is said to be unbalanced. Longitudinal data resulting from an unbalanced design are generally unbalanced, with subjects having different sets of time points for repeated measurements. In aging and health research, longitudinal data are sometimes collected with an unbalanced design. Among older persons, for example, longitudinal attrition is usually expected to be high due to mortality and other health-related reasons, and the set of individuals under observation changes as some exit due to death, illness, out-of-scope residence, and the like. As a result, the sample size tends to dwindle rapidly at later waves, becoming too small for an efficient analysis, particularly when a random sample of individuals is followed for a long period of time. In these situations, researchers can randomly select a new sample at certain follow-up waves to replenish the dwindling sample size. Consequently, subjects in the longitudinal data are expected to have different measurement time points. Given these different sets of time points, such longitudinal data are unbalanced. An unbalanced longitudinal design can also arise when the timing of measurements is designed to be associated with certain benchmark events, such as body fat prior to and right after menarche (Fitzmaurice et al., 2004).
In longitudinal data analysis, most designs follow the balanced perspective, whereas the vast majority of longitudinal data turn out to be unbalanced. Due to the unavoidable presence of missing data, a balanced design routinely results in unbalanced longitudinal data. Those dropping out of observation have fewer time points than the completers, and consequently, the presence of missing data automatically ushers in an unbalanced data structure. In clinical experimental studies with a balanced design, some subjects may enter the study considerably later than the specified starting date, thereby being listed as delayed entries with timing of measurements different from that of other respondents; the longitudinal data then switch to an unbalanced structure. In many observational studies, all respondents at baseline are designed to be reinvestigated at fixed intervals. Such a balanced design, however, can rarely lead to balanced longitudinal data given the regular occurrence of attrition in repeated measurements. By leaving information on change incomplete, large-scale missing data have caused serious concerns in longitudinal data analysis, particularly for generating the pattern of change over time for individuals. Fortunately, statisticians and other quantitative methodologists have developed a variety of statistically efficient and robust methods to address the impact of an unbalanced data structure, as will be described and discussed in the succeeding chapters.
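A simple programmatic check of balance, under the looser longitudinal definition above, is to verify that every subject shares the same set of measurement times. The sketch below uses hypothetical long-format data in which one subject has a missing wave.

```python
import pandas as pd

# A tiny sketch (hypothetical long-format data) for checking whether a
# longitudinal dataset is balanced: every subject must share the same
# set of measurement times.
long_df = pd.DataFrame({
    "ID":   [1, 1, 1, 2, 2, 2, 3, 3],
    "Time": [1, 2, 3, 1, 2, 3, 1, 3],   # subject 3 misses wave 2
    "PCL":  [62, 58, 51, 55, 50, 48, 70, 59],
})

# Collect each subject's set of observed times and count distinct sets.
times_by_subject = long_df.groupby("ID")["Time"].apply(frozenset)
balanced = times_by_subject.nunique() == 1
print(balanced)   # attrition turned a balanced design into unbalanced data
```

Here the design called for three common waves, but the single missing observation for subject 3 is enough to make the resulting data unbalanced.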
1.4. Missing data patterns and mechanisms
As indicated in Section 1.1, the presence of missing data in repeated measurements is one of the primary problems to contend with in longitudinal data analysis. In longitudinal processes, missing data regularly occur given dropouts or unanswered items. As designed, longitudinal data are collected in a particular observational period of time in which the outcome and other relevant variables are recorded sequentially at a set of previously designed intervals. Therefore, the researcher can only observe responses for those who are available at each follow-up time point. There are different types of missing data. Some missing data represent a random sample of all cases, and more often, missing observations are not random but the information inherent in them can be addressed by certain observed variables such as age, gender, and health status. In these situations, missing data do not pose serious threats to the quality of a longitudinal data analysis. In some circumstances, however, missing data are related to missing values of the outcome variable, and ignoring such systematic missing data can be detrimental to the estimation and prediction of the pattern of change over time in an outcome variable of interest. Given these missing data types, it is essential for the researcher to understand various missing data patterns and mechanisms before proceeding with a formal longitudinal data analysis.
Missing data can be grouped according to the missing data pattern, which describes which values are observed and which values are missing in the data matrix. In general terms, missing data patterns can be roughly classified into a variety of groups, such as univariate, multivariate, monotone, nonmonotone, and file matching (Little and Rubin, 2002). A univariate missing pattern indicates the situation where missing data occur only in a single variable. As an extension of the univariate case, the multivariate missing pattern refers to missing data in a set of variables, either for the entire unit or for particular items in a questionnaire. If a variable is missing for a particular subject not only at a specific time point but also at all subsequent time occasions, the missing data pattern for this individual is said to be monotone. In contrast, if a case is missing at a given time point and then returns at a later follow-up investigation, the missing data pattern for this subject is referred to as nonmonotone. In longitudinal data analysis, the nonmonotone missing pattern can cause more problems than does the monotone pattern, and thus deserves close attention. Sometimes, variables are not observed together, and such missing data are referred to as a file-matching pattern. There are further missing data patterns, such as latent-factor patterns with variables that are never observed. For each missing data pattern, there are corresponding statistical techniques to handle its impact on the quality of a longitudinal data analysis.
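The monotone/nonmonotone distinction can be sketched as a small classification over a response indicator matrix (1 = observed, 0 = missing); the data here are hypothetical.

```python
import numpy as np

# A small sketch (hypothetical indicator matrix) for classifying
# per-subject missing data patterns. Row = subject, column = wave;
# 1 = observed, 0 = missing.
R = np.array([
    [1, 1, 1, 1],   # complete
    [1, 1, 0, 0],   # drops out after wave 2 -> monotone
    [1, 0, 1, 1],   # skips wave 2, returns  -> nonmonotone
    [1, 1, 1, 0],   # drops out after wave 3 -> monotone
])

def pattern(row):
    """Label a subject's missingness as complete, monotone, or nonmonotone."""
    if row.all():
        return "complete"
    first_miss = int(np.argmin(row))          # index of the first 0
    # monotone: once missing, missing at every later wave as well
    return "monotone" if not row[first_miss:].any() else "nonmonotone"

labels = [pattern(r) for r in R]
print(labels)
```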
Missing data mechanisms concern the relationship between missing data and the values of variables in the data matrix. Given this focus, missing data mechanisms can be categorized into three classes: missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). If missing data are unrelated to both the missing responses and the set of observed responses, the observed values are representative of the entire sample without missing values. This missing data mechanism is referred to as MCAR. If missing data depend on the set of observed responses but are unrelated to the missing values themselves, the missing data are said to be MAR. In longitudinal data analysis, the majority of missing data are categorized as MAR. Correspondingly, most longitudinal models are based on the MAR hypothesis in handling missing values.
In some special situations, missing data are related to specific missing values, referred to as MNAR. In longitudinal data analysis, ignoring the impact of this missing data mechanism can result in serious bias in analytic results and erroneous predictions. Over the years, scientists have developed a variety of empirically sound models and methods to account for MNAR in longitudinal data analysis. Compared to the methods for MAR, however, the statistical models handling MNAR are considered less mature, and research in this regard is ongoing.
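A minimal simulation can illustrate how the three mechanisms differ in their consequences. In this hedged sketch, missingness in a follow-up measurement is generated (a) independently of everything (MCAR), (b) as a function of the observed baseline (MAR), and (c) as a function of the possibly unobserved follow-up value itself (MNAR); all parameters are arbitrary choices for illustration.

```python
import numpy as np

# A hedged sketch simulating the three missing data mechanisms for a
# follow-up measurement y2 given a baseline y1 (all values hypothetical).
rng = np.random.default_rng(0)
n = 10_000
y1 = rng.normal(50, 10, n)                 # baseline response
y2 = 0.8 * y1 + rng.normal(0, 5, n)        # follow-up response

# MCAR: missingness unrelated to y1 or y2.
mcar = rng.random(n) < 0.3
# MAR: missingness depends only on the observed baseline y1.
mar = rng.random(n) < 1 / (1 + np.exp(-(y1 - 50) / 5))
# MNAR: missingness depends on the (possibly unobserved) y2 itself.
mnar = rng.random(n) < 1 / (1 + np.exp(-(y2 - 40) / 5))

# Under MCAR the observed cases remain representative of y2's mean;
# under MAR and MNAR the complete-case mean of y2 is biased downward
# here, because subjects with high responses are more likely missing.
print(y2.mean(), y2[~mcar].mean(), y2[~mar].mean(), y2[~mnar].mean())
```

The MAR bias in this sketch is recoverable by conditioning on the observed y1, whereas the MNAR bias is not, which is why MNAR demands the specialized models discussed in Chapter 14.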
With the importance of missing data analysis on longitudinal data, in this book an entire chapter (Chapter 14) is devoted to this area. In that chapter, a section is provided on the mathematical definitions and conditions associated with MCAR, MAR, and MNAR, respectively. Various statistical models handling different missing data mechanisms are described extensively in Chapter 14.
1.5. Sources of correlation in longitudinal processes
Another primary feature of longitudinal data is the presence of correlation among repeated measurements for the same subject, referred to as intraindividual correlation. Such correlation violates the conditional independence hypothesis regularly applied in multivariate regression modeling, and this lack of independence in repeated-measures data has driven statisticians and other quantitative methodologists to find ways to perform longitudinal data analysis correctly. There are two perspectives for addressing intraindividual correlation, each linked to a specific source of variability in longitudinal data. Statistically, variability in longitudinal processes can be summarized into three components: between-subjects variability, within-subject variations, and the remaining random errors. In the literature of longitudinal data analysis, the first two components of variability, between-subjects and within-subject, address the systematic part of variations in longitudinal processes, and therefore, intraindividual correlation can be modeled by including the pattern of variability from those two components. The third component is the random term for uncertainty as regularly specified in general linear and generalized linear regression models, and therefore, it can be estimated as regression residuals.
Between-subjects variability reflects individual heterogeneity in unspecified and unrecognized characteristics affecting the trajectory of the response variable. While much of the overall variability can be statistically addressed by specifying observable individual and contextual covariates, some between-subjects heterogeneity may derive from unobservable biological and environmental factors, such as genetic predisposition and physiological parameters. A sample of individuals may have different reactions to the same dose of a new medication for a specific disease; a good behavioral prototype fostered in childhood can help decelerate the functional disablement process as age progresses; the social environment is likely to affect drug use recidivism among adolescents released from a rehabilitative center; and so on. As well documented in the literature of longitudinal data analysis, ignoring such unobserved heterogeneity can result in considerable bias in parameter estimates, especially the estimates of standard errors (Diggle et al., 2002; Fitzmaurice et al., 2004; Verbeke and Molenberghs, 2000). In empirical research, a popular approach to handling between-subjects variability is to specify individual-specific random effects with the assumption of a known distribution. By accounting for individual differences in the analysis, intraindividual correlation in longitudinal data is implicitly addressed, thereby resulting in conditional independence of the repeated measurements of the response for the same observational unit. As will be described in the succeeding chapters, many statistical models and methods applied in longitudinal data analysis are designed to handle intraindividual correlation through the specification of such random effects.
The second component of variability in longitudinal processes is within-subject variations. Given various biological, genetic, and environmental predispositions, the values of repeated measurements for the same individual tend to be more similar than those obtained from several randomly selected individuals. Therefore, intraindividual changes over time in the response variable tend to be confined within a person’s physical and environmental mechanisms, resulting in positive correlation in longitudinal data. Another distinctive characteristic in such subject-specific dependence is that intraindividual correlation has frequently been observed to decay over time. For example, the repeated measurements of blood sugar for the same person are very likely to be more similar than those from a number of randomly selected persons. A person’s blood sugar measurements also tend to be more similar between two successive time occasions than between two nonadjacent time points. In the literature of repeated-measures analyses, this type of change in correlation is referred to as serial correlation. The presence of recognizable patterns of correlation has enabled researchers to account for intraindividual correlation by specifying within-subject covariance structures on repeated measurements.
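Serial correlation that decays with time lag is often sketched with an AR(1)-type structure, corr(Yij, Yik) = ρ^|j−k|; the snippet below builds such a within-subject correlation matrix for four waves with an arbitrary ρ = 0.7.

```python
import numpy as np

# A minimal sketch of serial correlation decaying with time lag: an
# AR(1)-type within-subject correlation matrix for four equally spaced
# waves, with rho chosen arbitrarily for illustration.
rho, n_waves = 0.7, 4
lags = np.abs(np.subtract.outer(np.arange(n_waves), np.arange(n_waves)))
corr = rho ** lags   # corr(Y_ij, Y_ik) = rho ** |j - k|

# Adjacent waves correlate at 0.7, waves three apart at only 0.343,
# mirroring the decay described above for repeated blood sugar readings.
print(np.round(corr, 3))
```

Covariance pattern models of this kind account for intraindividual correlation directly through the within-subject covariance structure, rather than through random effects.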
It must be emphasized that between-subjects and within-subject variations are closely interrelated. Clustering among repeated measurements of the response for the same observational unit can be understood as reflecting differences in the pattern of change over time among individuals. Likewise, individual differences reflect homogeneity of the repeated measurements within an observational unit. Therefore, in longitudinal data analysis, the researcher often needs to consider only one source of systematic variability in statistical modeling, within-subject or between-subjects, to make the longitudinal data conditionally independent. Given the close interrelation between the two sources of variability, longitudinal data very often do not support modeling all possible components simultaneously, particularly when the outcome variable of interest is qualitative rather than quantitative. Only in some special situations do both between-subjects and within-subject variability need to be specified in the same statistical model, and some cases of this sort will be described in the succeeding chapters.
If the specification of subject-specific random effects and/or a specific covariance structure primarily accounts for intraindividual correlation in longitudinal data, the remaining within-subject variability is random and noninformative. Correspondingly, this residual component can be specified as random errors in the same fashion as traditionally assumed in general linear models and generalized linear models. For qualitative response variables, however, the within-subject components of variability cannot be easily specified, and some complex statistical procedures are needed for the estimation of random errors.
1.6. Time scale and the number of time points
In designing a longitudinal data analysis, the first question in the researcher’s mind is perhaps the scale of time. Time can be measured in a variety of metric units – week, month, year, age, session, and the like. The scale that is selected should depend on the nature of the study and the targeted length of the observation period. In clinical experimental studies, the outcome measurement can be linked with events whose status changes rapidly or with events that evolve at a relatively slow pace. In cancer research, for example, the tempo of status change within a fixed time period varies significantly across different types of tumor. A study of surgical treatment of lung cancer may examine the improvement of the survival rate over 6 months; in this research, week is an appropriate time scale. In studies of more gradual processes such as prostate cancer, the survival rate should be observed for a substantially longer period because patients with this type of tumor typically live much longer than those with lung cancer; therefore, month is a better option in the second case. In the behavioral and social sciences, the occurrence of a change in status is usually a long, gradual process. Examples of such gradual life course events are recovery from disability among older persons, changes in marital status, and discontinuation of drinking alcohol among heavy drinkers. To follow those processes, month or year may be an appropriate choice of time scale. In health services research, different service types are associated with various cycles of turnover. A patient admitted to a short-stay hospital, for example, stays there only for a few days, whereas the average length of stay in a nursing home may be years (Liu et al., 1997). Accordingly, the time scale in health services research needs to be specified according to the nature of the service type.
The number of time points is another primary concern in designing a longitudinal study. While there are no clear-cut rules about the optimal number of time points, the minimum number should be more than two, or even three. From a two-point longitudinal dataset, the only pattern of change over time that can be generalized is linear, no matter how much the values of the response differ between the two time points, and important information about the trajectory of individuals can be missed. Longitudinal data with a design of three time points provide the capacity to describe a possible exponential pattern of change over time, but not a more complex function. Four time points can yield a cubic trajectory of individuals. Therefore, for thoroughly reflecting the possible pattern of change in the response variable, the minimum number of time points is four. Analytically, there does not seem to be a definite maximum number of time points for longitudinal data analysis; the desired number should be evaluated on a case-by-case basis. A larger number of time points can improve the precision of estimation of the pattern of change over time if the sample size is sufficiently large. At the same time, specifying a large number of time points can significantly increase the financial burden of conducting a longitudinal study and complicate statistical modeling. I personally recommend that for clinical experimental studies, four to six time points, consisting of the baseline and three to five follow-up waves, should be good enough to display the effectiveness of a new medication or treatment on disease. With respect to observational studies with a large sample size, 6 to 10 time points are ideal for describing a time trend in the response variable and its group differences.
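The link between the number of time points and the complexity of an identifiable trajectory can be illustrated numerically: n time points determine a polynomial of degree at most n − 1. The values below are hypothetical.

```python
import numpy as np

# A quick numerical illustration of why the number of time points caps
# the complexity of the growth curve one can fit: n points determine a
# polynomial of degree at most n - 1 (values here are hypothetical).
t = np.array([0.0, 1.0, 2.0, 3.0])          # four waves
y = np.array([62.0, 58.0, 51.0, 47.0])      # one subject's responses

# With only two points, any fitted trend is necessarily linear...
line = np.polyfit(t[:2], y[:2], 1)
# ...whereas four points can identify up to a cubic trajectory.
cubic = np.polyfit(t, y, 3)

print(line, cubic)
```

The cubic fit passes exactly through all four points, whereas the two-point fit can never reveal curvature, no matter how nonlinear the underlying process is.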
1.7. Basic expressions of longitudinal modeling
As introduced in Section 1.3, in a typical longitudinal dataset, each subject has multiple rows of data, with each record corresponding to a specific time point. Suppose that a random sample is composed of N subjects with n predesigned time points. For subject i (i = 1, 2, …, N), the observed number of time points is usually denoted by ni, which does not necessarily equal n due to missing observations. The response measurement for subject i at time point j (j = 1, 2, …, ni) is written as Yij. The repeated measurements of the response variable Y for subject i can be expressed in terms of an ni × 1 column vector, denoted by Yi and given by

Yi = (Yi1, Yi2, …, Yini)′.
Given N subjects, there are N such vectors in a longitudinal dataset. Repeated measurements of the response variable are generally specified as a function of the time factor and some other theoretically relevant covariates. In the analysis of cross-sectional data, various regression models are generally estimated by assuming conditional independence of observations given the specified model parameters. If the response variable Y has a continuous scale, it might seem that a linear regression model on Y could simply be written as
Yij = Xijβ + ɛij,     (1.1)
where Xij is a 1 × M vector of covariates for subject i at time point j, β is a vector of the regression coefficients of the M covariates with values fixed for all subjects, and ɛij is the random error term, conventionally assumed to be normally distributed. In the context of longitudinal data, however, this classical specification does not yield efficient, robust, and consistent parameter estimates, because the random error at a given time point is correlated with the random errors at other time points, even in the presence of specified observed covariates. Therefore, some additional parameters addressing intraindividual correlation need to be specified to secure the conditional independence hypothesis for the random errors.
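The failure of the conditional independence assumption can be demonstrated with a short simulation: when the data contain a subject-specific component, residuals from a pooled ordinary least squares fit of an Equation (1.1)-style model remain correlated within subjects. All parameters below are arbitrary.

```python
import numpy as np

# A small simulation (hypothetical parameters) showing why the classical
# specification fails here: with a subject-specific component in the data,
# pooled OLS residuals from the same subject remain correlated.
rng = np.random.default_rng(2)
N, n = 500, 4
t = np.arange(n, dtype=float)
X = np.column_stack([np.ones(n), t])

# Simulate Y_ij = 50 - 2*t + b_i + e_ij with a random intercept b_i.
b = rng.normal(0, 3, N)
Y = 50 - 2 * t + b[:, None] + rng.normal(0, 2, (N, n))

# Pooled OLS, ignoring which rows belong to which subject:
Xs = np.tile(X, (N, 1))
beta_ols, *_ = np.linalg.lstsq(Xs, Y.ravel(), rcond=None)
resid = (Y.ravel() - Xs @ beta_ols).reshape(N, n)

# Residuals at waves 1 and 2 for the same subject stay strongly
# correlated, violating the independence the classical model assumes.
r = np.corrcoef(resid[:, 0], resid[:, 1])[0, 1]
print(r)   # roughly sigma_b^2 / (sigma_b^2 + sigma_e^2) = 9/13
```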
The importance of addressing intraindividual correlation has triggered the development of many advanced methods in longitudinal data analysis. As summarized by some leading statisticians in this area (Diggle, 1988; Diggle et al., 2002), modeling a longitudinal process should at least consider three different sources of variability. First, average responses usually vary randomly between subjects, with some subjects being fundamentally high and some being low. Second, a subject’s observed measurement profile may be a response to time-varying processes. Third, as the individual measurements involve subsampling within subjects, the measurement process adds a component of variation to the data. Such decomposition of stochastic variations in repeated measurements facilitates the formulation of correlation between pairs of measurements on the same subject. Based on the linear regression model specified in Equation (1.1), the multivariate linear model, including all the aforementioned three sources of variability, can be written as
\[ Y_{ij} = X_{ij}\beta + b_i + W_i(T_{ij}) + \varepsilon_{ij}, \qquad (1.2) \]
where b_i represents the between-subjects random effect, W_i(T_{ij}) reflects an independent stationary process with a specified form of serial correlation, T_{ij} is the value of time for subject i at time point j, and ε_{ij} represents the subsampling uncertainty within the subject. For analytic convenience, each of the three terms b_i, W_i(T_{ij}), and ε_{ij} is usually assumed to follow a normal distribution with a well-defined variance or covariance function. With the specification of b_i and W_i(T_{ij}), the random component ε_{ij} is conditionally independent of the random errors at other time points. Thus, X_{ij}β + b_i + W_i(T_{ij}) represents the mean response for subject i at time point j.
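Writing out the second moments implied by this decomposition may help fix ideas. Assuming the three components are mutually independent with Var(b_i) = σ_b², Var(W_i(t)) = σ_W² with stationary correlation function ρ(·), and Var(ε_{ij}) = σ_ε² (a standard result under these assumptions, not stated in this passage), the induced covariance between two repeated measurements on the same subject is

```latex
\operatorname{Cov}\!\left(Y_{ij}, Y_{ik}\right)
  = \sigma_b^2 + \sigma_W^2\,\rho\!\left(\lvert T_{ij} - T_{ik}\rvert\right),
  \qquad j \neq k,
\qquad
\operatorname{Var}\!\left(Y_{ij}\right)
  = \sigma_b^2 + \sigma_W^2 + \sigma_\varepsilon^2 .
```

The constant σ_b² term produces correlation between all pairs of measurements on a subject, while the ρ(·) term lets that correlation decay as measurements grow farther apart in time.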
The three sources of variability in longitudinal data are usually intertwined, as indicated earlier; therefore, in much of the empirical research, only one source of systematic variability needs to be considered. The most popular approach in longitudinal data analysis is perhaps mixed-effects modeling, in which between-subjects random effects are specified to address intraindividual correlation. Let n_i be the number of repeated measurements for subject i in a sample of N individuals, and Y_i be the n_i-dimensional vector of repeated measurements for that subject. If only a single random-effect term is considered, a typical linear mixed model is given by
\[ Y_i = X_i\beta + \mathbf{1}_{n_i} b_i + e_i, \qquad (1.3) \]
where X_i is a known n_i × M matrix of covariates whose first column is the constant 1, β is an M × 1 vector of unknown population parameters, 1_{n_i} is an n_i × 1 vector of ones, and b_i is the unknown effect for subject i. In this specification, the primary predictor, time, is usually specified as a single continuous variable or as a set of polynomials included in X. As the specification of a single random effect does not confound the fixed regression coefficients, b_i is also referred to as the random effect for the intercept. Given the inclusion of b_i, linear mixed models are thought to yield more efficient and robust regression coefficients than general linear models, in which residuals are potentially dependent. Sometimes multiple random-effect terms (e.g., random effects for the intercept and for the time factor) need to be specified; the random effects for subject i are then expressed as a vector **b**_i, combined with a specified design matrix, in mixed-effects modeling. These extended mixed-effects models are described and discussed in many of the succeeding chapters.
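A minimal numerical sketch of the random-intercept model (1.3), using simulated balanced data and simple moment estimators of the variance components followed by generalized least squares (this is an illustrative estimation scheme, not the book's; all names and values are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)
n, t = 150, 5  # subjects and time points (balanced design, an assumption)

# Simulated data: Y_ij = 1 + 0.3*time + b_i + eps_ij
time = np.tile(np.arange(t), n)
b = np.repeat(rng.normal(0.0, 1.0, n), t)
y = 1.0 + 0.3 * time + b + rng.normal(0.0, 0.6, n * t)
X = np.column_stack([np.ones(n * t), time])

# Step 1: OLS residuals give moment estimates of the variance components
beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)
R = (y - X @ beta_ols).reshape(n, t)
s2_eps = R.var(axis=1, ddof=1).mean()       # within-subject ~ sigma_eps^2
s2_b = max(R.var(ddof=1) - s2_eps, 0.0)     # between-subject ~ sigma_b^2

# Step 2: GLS under the implied compound-symmetry covariance per subject,
# V = sigma_b^2 * J + sigma_eps^2 * I
V = s2_b * np.ones((t, t)) + s2_eps * np.eye(t)
Vinv = np.linalg.inv(V)
XtVX = sum(X[i*t:(i+1)*t].T @ Vinv @ X[i*t:(i+1)*t] for i in range(n))
XtVy = sum(X[i*t:(i+1)*t].T @ Vinv @ y[i*t:(i+1)*t] for i in range(n))
beta_gls = np.linalg.solve(XtVX, XtVy)      # close to the true (1.0, 0.3)
print(beta_gls.round(2))
```

In practice one would fit such a model by maximum or restricted maximum likelihood, for example with statsmodels' `MixedLM` in Python or `lme4` in R, rather than with this moment-based sketch; the sketch only illustrates how the random intercept induces the block-diagonal covariance structure that the estimator must account for.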
Given the specification of b_i (or the vector **b**_i), the elements of e_i are assumed to be conditionally independent. Specifically, the term e_i is an n_i × 1 column vector of random errors for subject i, given by
\[ e_i = \left( \varepsilon_{i1}, \varepsilon_{i2}, \ldots, \varepsilon_{i n_i} \right)^{\prime}. \]
Another popular perspective in longitudinal modeling is to specify the covariance structure of within-subject random errors in linear regression models while leaving the between-subject random effects unspecified. The use of such a design in modeling a longitudinal process becomes necessary when the application of the random-effects approach does not yield reliable analytic results or when the within-subject variability is sizable in comparison with the between-subjects variability. In this approach, the vector ei is assumed to follow a multivariate normal distribution with mean 0 and an ni × ni covariance matrix. To pattern the error covariance structure