Method Validation Amit PH PDF

Amit Manjure
Clinical & Scientific Excellence
Secondary
Method Validation
Secondary Method Validation
We run controls, sample, we collect data, we create graphs, charts, plots and
crunch numbers.
It is expensive and take time to it right.
Then we stuff it all in a folder and hand it to an inspector. It's called method
validation.
But what is it? How do we do it? Why do we do it?

Why Secondary MV?
§ Method validation studies verify the lab can achieve product performance
claims.
§ Provide useful information about what to expect with a new method & how
results may be different compared to the method being replaced.
§ Help Labs Detect Analytical Error

• Imprecision – Random Error
• Inaccuracy – Systematic Error
• Total Error = Combination
§ Assist in meeting regulatory requirements

Page 3
What studies are necessary?
CAP / CLIA Checklist recommends that laboratory must verify or establish

method performance specifications for each method.
Typical Studies needed for Quantitative Assay :

• Precision
• Analytical Measuring Range (AMR) Verification
• Method Comparison
• Verification of Reference Range
• Sensitivity – Functional Sensitivity
• Instrument Carry (for systems not using disposable tips)
Page 4
When is it Done
Analytical methods need to be validated
• before their introduction into routine use
& revalidated
• whenever the conditions change for which the method has been
validated (e.g., an instrument with different characteristics)
• whenever the method is changed and the change is outside the

original scope of the method
Page 5
Process of establishing a Routine Test
Select Diagnostics Test

Establishing
Routine Testing
Select - Method of
Analysis and Perform
Validate Method
Performance
Maintain Method
Implement Method
Prevent Problem
Check with Report

Acquire Sample Perform test
Statistical QC Results
A Routine Testing Process

Page 6
Agenda :
Precision
AMR
Method
Verification/
Comparison
Range
Overview & Reference

Cautions Interval
Page 7
Statistical tools
Common tools:
Mean, SD, CV, Confidence intervals, Chi Squared, Regression analysis
All calculations made with statistical tools are estimates, not absolute truth.
Estimates will vary between studies.
Points to keep in mind:
§ All statistical tools make assumptions about the data

§ All calculated estimates have limitations
§ Need to keep the assumptions & limitations in mind when
evaluating results of statistical tests
Page 8
Samples Used
Methods are designed to test fresh patient samples

Any other type of sample may perform differently
• Frozen / stored patient samples

• Accuracy validation samples
• Calibration Verification samples
• QC samples
• EQA samples
Page 9
Why is matrix an issue ?
Matrix: everything in the sample except what we are measuring
Sample matrix can be altered by handling / processing
• Matrix proteins in particular
• Proteins may denature
What can alter sample matrix?

• Refrigeration
• Freezing
• Pooling
• Diluting
• Spiking
• Manufacturing processes
Page 10
Reference Guidelines
Clinical & Laboratory Standards Institute (CLSI):
Accuracy / Range:
• EP15-A2 User Verification of Performance for Precision and Trueness (2005)
Precision:
• EP5-A2 Evaluation of Precision Performance of Quantitative Measurement
Methods (2004)
• EP15-A2 User Verification of Performance for Precision and Trueness (2005)
Method Comparison:
• EP9-A3 Measurement Procedure Comparison and Bias Estimation Using
Patient Samples (2013)
Reference Interval:
• C28-A3c Defining, Establishing, and Verifying Reference Intervals in the
Clinical Laboratory (2008)
Page 11
Agenda
Precision
AMR
Method
Verification
Comparison
/ Range

Cautions Interval
Page 12
AMR Verification / Assay Range
Analytical Range – aka: Calibration range, reportable range, analytical

measurement range, linear range
Study
Ideal: compare to true reference method using fresh patient samples
Next best: use special samples validated for the method
Study:
• Test accuracy samples in replicates, typically triplicate
• Compare mean of replicates to acceptable limits
Potential issues:
• Need to verify that assigned acceptable values are current to method and reagent lot number
• Comparing results to method in current use is NOT an accuracy study unless current method is a
true reference method
Page 13
TSH : Method Validation : Instrument name (S/N. 546722)
AMR/Linearity/Dilution Recovery using Linearity Panels Lot# 076859
Date: 12-Jan-16
Instrument: ADVIA CENTAUR
Reagent: Lot # 045 : Exp: 20 Mar 2016
Linearity Material: AUDIT
Calibrators Lot#: 076859 Exp.: 31 Jan 2014
Units of Measure: uIU/ml
Acceptability
Achieved Values LIMIT
Calibrators (Target) 1st Rep 2nd Rep 3rd Rep 4th Rep Mean bias % bias + / - 2 CV%
S0 0.40 0.35 0.32 0.40 0.38 0.36 -0.037 -9.1% 13.3% PASS
S1 2.50 2.30 2.40 2.35 2.33 2.345 -0.155 -6.2% 13.3% PASS
S2 5.50 5.60 5.67 5.45 5.89 5.652 0.152 2.8% 13.3% PASS
S3 10.00 11.00 10.40 10.54 10.50 10.61 0.61 6.1% 13.3% PASS
S4 25.50 24.50 24.70 25.50 23.50 24.55 -0.95 -3.7% 13.3% PASS
S5 45.50 43.78 44.01 43.68 43.89 43.84 -1.66 -3.6% 13.3% PASS
S6 75.00 72.08 71.09 73.00 73.23 72.35 -2.65 -3.5% 13.3% PASS
S7 105.00 99.00 99.32 100.00 98.50 99.21 -5.79 -5.5% 13.3% PASS
Maximum allowa ble CV (1/3 of Total Error Allowed of 20 %) = 6.7%
T SH Linearity
100 y = 0.9475x + 0.4558

R² = 0.9998
80
Achieved
60
40
20
0
-10 10 30 50 70 90 110
Target
0
25
Manufacturer's claimed linear range: 0.01 to 150 uIU/ml
Page 14
Conclusion: Linearity on Instrument name S/N. 546722 demonstrated from 0.0 to 150 uIU/ml.
Precision
AMR
Method
Verification
Comparison
/ Range

Cautions Interval
Page 15
Precision
Measure of the reproducibility of the results

Ability to reproduce a result to a given level of confidence
(typically 95% confidence)
Characterized using statistical tools

®®
• Primarily: SD & CV ®®
®
• All SD and CV values calculated from data are estimates ®
All estimates of precision can be expected to

cluster around the “true” precision
Page 16
Precision
Two Types of Estimates .
§ With in Run or Short term or Intra-assay Precision

§ Between Run or Long Term or Inter Assay Precision
Which Matrix to Use ?

• Calibrators
• Controls.
• Patient Sample Pools
Page 17
Precision - Recommendation
Atleast, Two or Three different control material that represent low and high
Medical decision points are used.
With in Run or Short term or Intra-assay Precision
• 20 replicates for each control material in one run.

• Calculate mean and SD and CV %
• Validate, determine if the short term precision is acceptable.
Between Run or Long Term or Inter assay Precision.
• 20/10/5 replicates of each control material one run for one day over 5
days
• Calculate mean, SD and CV %
• Validate, determine if the long term precision is acceptable.
Page 18
THCG ADVIA CENTUAR - Method Validation
Instrument Serial Number:
Date Performed:18 JAN 2008
Control Name: IMMUNOASSAY PLUS Manufactured by:
C LOT NO: 40191 Exp Date: 31 JULY 2009
Intra-Assay (Within Run) Precision Inter-Assay (Between Run) Precision
Conventional Units ( mIU/mL ) Conventional Units ( mIU/mL )
Level 1 Level 2 Level 3 Date Replicate Level 1 Level 2 Level 3
1 1
2 2
08/01/2012 Run
3 1 13:34PM
3
4 4
5 5
6 6
7 7
09/01/2012 Run
8 8
2 14:30 PM
9 9
10 10
11 11
12 12
10/01/2012 Run
13 3 15:30PM
13
14 14
15 15
16 16
17 17
11/01/2012 Run
18 4 16:50PM
18
19 19
20 20
21
n 0 0 0 22
12/01/2012 Run
Mean 5 18:45PM.
23
sd 24
CV(%) 25
n 0 0 0
Mean
sd
Page 19 CV(%)
What does the SD represent?
s=
å i
( x - x) 2
SD ?
(n -1)
Essentially the average size of the differences

between the mean and the individual values
Page 20
Precision: Assumptions & Limitations
n = 20 n = 10 n=6
93 93
96 Number of replicates affects calculated SD
100 100 100
93
89
94
89
94
§ Fewer replicates tend to make SD
87 87 appear larger
89
86 86 86 § Fewer replicates means less
88
86 86 confidence in how accurately the
91 91 calculated SD represents assay
92 92
91
performance
102 102 102
90
90 90
85 85
91 91
92
Mean 91.3 91.6 93.0
SD 4.39 5.52 7.04
%CV 4.8% 6.0% 7.6%
Page 21
How good are our estimates ?
87 Data Points
• Overall mean: 224
• Overall SD: 9.91
• Overall CV: 4.42%
How well do smaller data sets
estimate mean and SD ?
Page 22
Estimating Mean and SD
For a Mean:
Mean SD
Overall 224
8 – 10 replicates can give a useful estimate
Overall 9.90
Data Mean Δ
Data SD Δ
Groups of 10 results Groups of 10 results For the SD:
1-10 221.7 -1.1% 1-10 9.88 -0.2% t An SD calculated from a single study of 20
11-20 230.1 2.6% 11-20 11.76 18.8% replicates is an estimate of the “true” SD
21-30 224.3 0.0% 21-30 10.41 5.2% t This estimate can be from 30% smaller to
31-40 223.7 -0.2% 31-40 9.86 -0.4%
25% larger than the “true” SD and still be
equivalent to the “true” SD
41-50 224.7 0.2% 41-50 8.17 -17.5%
t To reliably estimate the SD to within ±10%
51-60 226.7 1.1% 51-60 7.72 -22.0%
of the “true” value requires 100+ values
61-70 226.1 0.8% 61-70 9.15 -7.6%
t To conclude that a problem exists anytime
71-80 220.2 -1.8% 71-80 11.42 15.4% an SD calculated from a single study
Groups of 20 results exceeds the reference SD (IFU) is
statistically incorrect and unrealistic
1-20 11.42 15.4%
21-40 9.87 -0.3%
41-60 7.80 -21.2%

61-80 10.52 6.3%
Page 23
Validating Precision
Is measured precision acceptable ?

If measured SD & CV are less than or equal to the reference precision
performance, method performance is validated
If measured precision exceeds reference precision …
u Performance may not meet requirements
u However, remember limitations on SD calculated from limited data
From CLSI EP5-A2 (Evaluation of Precision Performance of Quantitative

Measurement Methods):
“The user’s (precision) estimate can be larger than the SD claimed by the
manufacturer and still be acceptable. Since the user experiment is based on a limited
number of observations, there are expected sampling errors of the calculated (precision)
around the true values. … The chi-squared test is used to determine if your estimates are
significantly larger than those provided by the manufacturer.”
Page 24
Chi Squared
Chi squared (c2): tests whether a measured SD is statistically

comparable to a reference SD
Used to compare estimated precision to published performance
ü Calculated X2 is compared to the critical

s 2 * (n-1)
X2 = value of c2 for 95% confidence and degrees
σ2 of freedom (n-1)
ü If X2 is less than the critical value, precision
estimate is not different from reference
n = number of replicates
value
S = calculated SD
σ = reference SD
Page 25
Using Chi Squared
Replicate Sample1 Sample2 Sample3
1 93.00 93.00 100.00
2 96.00 100.00 94.00
3 100.00 89.00 86.00
Tests whether precision data collected on-
4 93.00 87.00 91.00
5 89.00 86.00 102.00
site is comparable to expected
6 94.00 86.00 85.00 performance
7 87.00 92.00
8 89.00 102.00 Provides objective test to support
9 86.00 90.00 comparability, even if study SD exceeds
10 88.00 91.00 reference SD
11 86.00
12 91.00
13 92.00
14 91.00
15 102.00
16 90.00
17 90.00
18 85.00
19 91.00
20 92.00
Mean 91.25 91.60 93.00
SD 4.39 5.52 7.04
CV 4.8% 6.0% 7.6%
N 20 10 6
IFU Mean 95 95 95
IFU claimed CV 5.2% 5.2% 5.2%
Chi-square 15.8 12.5 12.2
Crit. Chi-square 31.4 18.3 12.6
CV acceptable Yes Yes Yes
Page 26
Chi squared - Assumptions & Limitations
The calculation is based on SD, therefore it is essential that the SD’s

of the validation study and reference data be comparable
Since the SD may vary with concentration, we typically use a

reference SD at a concentration close to the mean of the precision
study data
2 * (n-1)
χ
2 = s
σ2
Page 27
Precision
AMR
Method
Verification
Comparison
/ Range

Cautions Interval
Page 28
What
Method comparison: testing the same group of patient

patientsamples
samplesby two
different methods to characterize the
relationship between the methods
Goal: to create a model of the relationship between the methods that
can be used to estimate average method to method difference
120
100
80
60
40
20
0
0 20 40 60 80 100 120
Page 29
Why
Commonly used in clinical laboratories to:

• Characterize the difference between a new method and the current one
• Understand how results will change with the new method
• Provide information to alert customers (clinicians) to a change
Validate reference interval for a new method
Page 30
Typical Scenario
§ Lab tests samples by current method

§ Identifies samples that will be used for method comparison
(typically 10 – 40 samples)
§ Writes initial results on tubes and stores the samples
§ Tests stored samples using new method
§ Uses regression analysis to evaluate results.
Page 31
Now there is an issue
§ Methods do not seem to agree

• Difference between results is more than 10% for some samples
• Regression slope is not 1.0, or within 0.9–1.1, or some other limit
• Correlation coefficient (r) is 0.998, but the results don’t agree
§ Regression slope is not the same as the method comparison published in the package
insert
§ Difference between methods not the same as between respective peer groups on EQA
reports or QC peer group reports
§ Most common conclusion:
There is something wrong with the new method or instrument
Most common actual root causes:

• Study done using less than optimal samples
• Data does not conform to assumptions of statistical tool(s) used
• Interpretation of regression statistics
Page 32
Considerations when selecting samples
Use samples compatible with both methods

• Avoid samples containing interferents known to affect either method
• lipemia, hemolysis, drugs, etc.
• anticoagulants
• Samples must meet requirements for BOTH methods
• Avoid using QC samples, proficiency testing samples, spiked samples, diluted samples,
linearity samples, etc.
Storage
• Avoid using stored specimens if possible
• Do not store samples in original collection tubes
• Must be stored per manufacturer recommendations (both methods)
• If stored samples are used, must be run by both methods after storage
DO NOT use the original results obtained with fresh samples by one method to
compare with results obtained later by another method using the same samples
after being stored, especially if stored frozen
Page 33
Considerations when testing samples
Range of concentration
• Cover as much of the shared analytical range as possible
• Focus on the range of clinical interest
• Sample concentrations should ideally be uniformly distributed over the range
covered
• Single samples at extreme limits of the range are generally not useful – can
strongly bias statistical analysis 80000
Good
Poor distribution
• Be very cautious about using proficiency testing, QC or other manufactured
70000
60000
samples to extend the range of concentration due to potential for matrix
50000
issues 40000
• Samples requiring dilution by either method should not be included

30000
20000
• If dilutions are important, keep the data for diluted samples separate
10000
0
0 10000 20000 30000 40000 50000 60000 70000 80000 90000
Page 34
Testing Protocol
§ Samples should be tested by both methods within a few hours or, at worst,
within the same day
§ If using stored samples, do not thaw or prepare samples until the day they will
be tested
§ Ideally, study should be done over several days, testing a different group of
fresh samples each day
§ Ideally, both methods should be recalibrated during course of study
§ If using pooled, diluted, spiked, or manufactured samples (EQA, QC, etc.), make
sure these samples are identified so they can be tracked throughout the study
Page 35
Reviewing the data
Step 1: Plot the data

• Scatter plot
• Difference plot
• Plots provide most of the necessary information
14
12
30
25
10
20
Difference (µg/L)
15
Method 2
8
10
5
6
0
0 100 200 300 400 500 600 700 800 900
-5
4
-10
2 -15
-20
0 Concentration (µg/L)
0 2 4 6 8 10 12 14
Method 1
Page 36
Scatter Plots
14
12
Always
the same X Y
10
scale Mean of Methodmethod
Comparative 2 7.72Test method
Prior method New method

Y 2
8 Mean of Method 1 8.19

Method
Reference method Candidate method

6 N 33
Current method Replacement method
Control method Evaluation method
4
0
0 2 4 6 8 10 12 14
X 1
Method
Page 37
Difference Plots
§ Plot can show…

• Graphically shows the difference
• Numerical difference
between the two methods
• % difference
• Useful for estimating average
§ Choice depends on method variance
difference and noting trends
• If the SD remains constant over the range – use
numerical value
• Useful when data is not optimal for
• If the CV remains constant over the range – use %
regression
Difference
analysis
• Correct choice improves ability to estimate
average difference
Page 38
Scatter Plots: Constant SD vs Constant
CV
Constant SD Constant CV
17 Scatter Plot 900 Scatter Plot
800
16
700
15
600
Test Method (µg/L)

Test Method (µg/L)
14
Identity
500
13
400 Identity
12
300
11 SD = +/- X 200 CV = +/- X%

10
At all concentrations 100
At all concentrations
0
9
0 100 200 300 400 500 600 700 800 900
9 10 11 12 13 14 15 16 17
Comparative Method (µg/L) Comparative Method (µg/L)
Typically chemistry methods Typically immunoassay methods
Page 39
Bland Altman Plot
60
Difference (New Method - Comparative Method)

§ Adaption of difference plot
40
§ Y data can be numeric or %
§ X data is average of both methods 20 +2 SD
§ Adds limits lines: 0
• Usually +/- 2 SD around the average

difference -20
• Can be any predetermined acceptable -40

limit
-2 SD
-60
50 150 250 350 450 550
Mean of Both
SD = standard deviation of the differences
Page 40
Reviewing the Data
Step 1: Plot the data

• Difference plot
• Scatter plot
• Plots provide most of the necessary information
Step 2: Regression analysis
• If appropriate – not always the best tool
• Powerful tool for estimating method to method bias
Page 41
Linear Regression
Linear regression analysis is a statistical tool
Used properly, provides an accurate, robust model of the relationship between

methods that can be very useful
Used carelessly, it can provide highly misleading misinformation
When using regression analysis, always keep two rules in mind …
Rule #1: Rule #2:

Always plot the data! Never use By eye, draw the best straight line you can
regression statistics without looking through the data evenly dividing the points on
at a graph both sides of the line.
If the calculated regression line is significantly
different from your line, do NOT trust the
calculated regression line
Page 42
Regression Analysis
Yields two parameter equation describing the relationship between

the methods: y = a*x + b
• Parameter 1: a = slope
• Proportional bias - Percentage difference
• If slope is 0.9 ð “y” results will be 90% of “x” results
• If both methods agree completely = 1.000
• Parameter 2: b = y intercept
• Constant bias - difference is same amount regardless of
concentration
• If both methods agree completely = 0
• Intercept estimated from equation: b = y – (a*x)
• using mean values for “X” and “Y” data
• Parameters are estimates

• Estimate is only as good as the data used
• More data gives a better estimate
• Different data sets can give somewhat different estimates
Page 43 • 95% confidence interval used to characterize estimate
Testing for adequate range
Correlation coefficient (r)

• Statistical estimate of how well the results of the two methods vary together
• If all data points fall exactly on the line: r = 1.000
• If data is randomly scattered with no relationship: r = 0.000
• Best used to confirm that range of values used was adequate to support use
of least squares linear regression
• To use least squares regression, ideally r ≥ 0.975
• Indicates that the range of data values used is adequate to compensate
for the actual scatter in the “X” data
• If r < 0.975 – use alternate approaches to estimate difference
• Use alternate regression models
• Estimate average difference from difference plot
Page 44
Misconceptions about correlation
coefficient
Common misconception about r….

is that it indicates agreement
Troponin I
Correlation coefficient tells us what 30
percentage of the differences between 25

methods is accounted for by the regression
line 20
Method 2
Good correlation means all the points fall on or 15
very near the line

10
It does not mean results agree numerically or

that the slope = 1 5
0
0 5 10 15 20 25 30
r ≠ Bias
Method 1
Method 2 = 0.37 (Method 1) + 0.11

r = 0.993
Page 45
Dealing with low r value
“Low” r value (less than 0.975)

• Generally indicates there is not enough data and /or it does not do not span enough range
for least squares to provide a good model of the relationship
• It also indicates scatter in the data which may point to assay specific matrix or interference
issues
• REMEMBER: the scatter is NOT always due to the new method –
current method may have poorer precision or interference / matrix issues
How to address:
• Obtain additional samples that span broader concentration range
• Often not practical – especially for some methods
• Instead of regression analysis, use difference plot and estimate average difference
• Use an alternate regression model (Weighted, Deming, Passing-Bablock)
Page 46
Case Studies
Dealing with data
Theory is great, but my data never looks like that !
Page 47
Case #1: Few data points
Method 1 Method 2 Bias % Bias

2.6 2.4 0.2 9.2%
2.8 3.0 -0.2 -8.2%
3.0 3.6 -0.6 -19.7%
3.6 4.1 -0.5 -13.6%
4.2 4.7 -0.5 -11.0%
4.9 5.2 -0.3 -6.1%
5.0 5.4 -0.4 -7.4%
6.1 6.7 -0.6 -9.3%
6.2 6.2 0.0 0.2%
7.2 9.0 -1.8 -25.4%
7.8 8.0 -0.2 -2.8%
7.9 7.8 0.1 1.8%
9.6 10.4 -0.8 -8.3%
10.9 11.3 -0.4 -3.7%
11.8 13.1 -1.3 -11.0%
11.9 12.2 -0.3 -2.5%
12.0 11.7 0.3 2.5%
12.4 11.7 0.7 5.6%
12.4 12.9 -0.5 -4.0%
13.8 14.2 -0.4 -2.9%
Average Bias: -0.4 -5.8%
Page 48
Difference plot
For regression analysis to be valid, need adequate data

Total T4
40 samples is recommended
4
20 – 40 may work, but may not 3.5 Average difference = 0.4
be reliable
Difference (M ethod 2 - M ethod 1)

3
Less than 20 … not worth doing 2.5
regression 2
1.5
How to address: 1
0.5
Test additional samples or …..
0
Estimate average difference -0.5
-1
Use difference plot
2 4 6 8 10 12 14 16
Mean of All
Page 49
Case #2: Troponin I
Number of Samples 30
14
Range of Observations 0.016 to 12.1
Correlation Coefficient (r) 0.999
12
Linear Slope 0.77
Linear Intercept 0.02

10
Most samples are “normal”

New Method
8
Only a very few are high 6
No samples are near upper assay limit

4
Limited number of high samples may
skew regression line
2
0
0 2 4 6 8 10 12 14
Comparative Method
Page 50
Troponin: Difference plot
Difference plot shows essentially no difference

between methods except for highest two 1
samples 0.5
D iffe re nc e (N e w M e thod - C om pa ra tiv e M e thod)

Regression slope, influenced by two high data 0
points, doesn’t show real relationship
-0.5
No difference noted at medical decision point

-1
Can get more samples if practical, but this shows -1.5

how methods relate at decision point
-2
-2.5
-3
-3.5
0.01 0.1 1 10 100
Comparative Method
Page 51
Troponin I: Excluding extreme samples
0.11
Correlation Coefficient (r) 0.962 0.1
Linear Slope 1.00
0.09
0.08
Allows focus on clinical decision 0.07
New Method
range 0.06
Shows method correlation at 0.05
decision point 0.04
0.03
0.02
0.01
0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 0.11
Comparative Method
Page 52
Case# 3
90,000
HCG
80,000
Range of Observations 1.82 to 68,651
Correlation Coefficient (r) 1.000 70,000
Linear Slope 0.850

60,000
Linear Intercept 168
50,000
New
40,000
30,000
20,000
10,000
0
0 10,000 20,000 30,000 40,000 50,000 60,000 70,000 80,000 90,000
Comparison
Page 53
Dealing with diluted samples
With dilutions removed agreement HCG

900
much better
More accurate model of relationship 800
Do NOT include dilutions 700
All guidelines state dilutions should 600

not be used
500
• CLSI EP9
New
• Articles/books on method comparison 400
If dilutions are important – compare 300

separately Number of Samples 12
200 Range of Observations 1.82 to 848
Correlation Coefficient (r) 1.000
Linear Slope 0.990
100
Linear Intercept -2.19
0
0 100 200 300 400 500 600 700 800 900
Comparison
Page 54
Case #4
25 Absolute Neutrophil Count

Correlation Coefficient (r) 0.991 20
Linear Slope 0.893

15
New Method
10
0
0 5 10 15 20 25
Comparison Method
Page 55
Case #4 - suggestions
Several extreme results influencing line

Strategy: 25 Absolute Neutrophil Count
1. Collect more samples to fill in range

or..
20
2. Retest samples with discrepant

results to r/o testing error
15
3. Use difference plot
New Method
10
0
0 5 10 15 20 25
Comparison Method
Page 56
Case #4: Difference plot
Difference plot shows minimal bias

between methods Absolute Neutrophil Count
1.5
Overall method agreement is good
Difference (New Method - Com parison Method)

Only a couple of samples show larger 0.5
difference
• Is that difference significant ? -0.5
• Could it be an error ?
-1.5
Can collect more samples in higher
range, but if that is not practical this
-2.5
shows overall excellent agreement
-3.5
0 5 10 15 20 25
Comparison Method
Page 57
Dealing with Outliers, etc.
What’s an outlier ?
A result that does not represent overall relationship / performance due
to some sample specific characteristic
How do you know it’s an outlier ?
• EP9-A3:
• Samples with method to method differences more than 4x the
average difference can be considered outliers.
• Up to 2.5% of data points, identified as outliers, can be deleted
without re-assessing the study (1 point in 40)
• Statisticians recommend against removing outliers if no specific cause
can be found – suggest using statistics robust to outliers
Page 58
Outliers
Whenever possible find out reason for outlier

• Problem sample – interferences, fibrin, etc.
• Error on run – short sample
• Remember – “error” or interference may not be the new method
Exclude data with identified errors

• These results do NOT represent typical performance of the methods
• Including this data will make estimate of relationship inaccurate
Handle unexplained “outliers” as for extreme values

• Outliers near upper or lower limits of data may affect regression statistics
• Outliers near middle of the data range have less effect
• If using regression, use robust regression tool
Page 59
Case # 7
1800
LD
1600
1400
1200
New Method
1000
800
Nothing 600
Indicates real method 400
to method difference 200
0
0 200 400 600 800 1000 1200 1400 1600 1800
Comparison Method
Page 60
True Method to Method Difference
Not very common, but it happens 1800

LD
Validate by looking at other comparisons for 1600
same two methods 1400
Look for data from similar studies with same 1200

method – confirm this is what is expected
New Method
1000
Alternatively, use Truth Table/ Concordance

report to evaluate clinical equivalence 800
600
Will need to update reference interval and
notify clinicians of expected change to results 400
200
0
0 200 400 600 800 1000 1200 1400 1600 1800
Comparison Method
Page 61
Truth Table and Concordance
Used with:
• qualitative assays (infectious disease)
• large method to method differences to
show clinical equivalence
Page 62
Concordance and Quantitative Methods
§ Regression analysis has slope very

different from 1.0
§ Reference intervals are not similar
§ Concordance helps focus on clinical
interpretation rather than just looking at
numbers
Comparison Method
Reference Intervals: New Method

Comparison Method: Less than 30 U/ml
New Method: Less than 60 U/ml
Page 63
Precision
AMR
Method
Verification
Comparison
& Range

Cautions Interval
Page 64
Transferring Reference Intervals
Key: reference interval is being verified not established

Guidance:
C28-A3: Defining, Establishing, and Verifying Reference Intervals in the Clinical Laboratory
• Establishing a reference interval requires at least 120 samples from reference individuals
• Verifying or transferring an established reference interval can be done with 20 samples
CLSI C28-A3: Defining, Establishing, and Verifying Reference Intervals in the Clinical Laboratory (2008), p28
Page 65
Verifying a Reference Interval (CLSI C28-
A3)
1. Regression analysis
• If regression results are robust (enough samples, good distribution,
appropriate regression model, etc.) can use regression equation
• New Reference limit = slope x old reference limit + intercept
• If calculated new limits match proposed reference interval – verified
2. Using small study
• Select 20 individuals that match criteria used for proposed reference interval
and analyze with method
• If no more than 2 results exceed limits of proposed reference interval –
verified
3. Subjective judgment
• On careful review of all data and description of how proposed reference
interval was established, lab director may accept new reference interval
Page 66
Summary
• Method Validation studies are very useful, good lab practice, and
generally required by regulation
• Remember, all studies provide only estimates of precision, method

agreement, etc. – estimates will vary from study to study
• Keep the assumptions and limitations of all statistical tools used in mind
when reviewing studies
• There is no simple single answer for all data sets; each requires thoughtful
selection of the correct tool for the situation
Page 67

Method Validation Amit PH PDF

Hochgeladen von

Dokumentinformationen

Originaltitel

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Method Validation Amit PH PDF

Hochgeladen von

Copyright:

Verfügbare Formate

Amit Manjure

Clinical & Scientific Excellence

It is expensive and take time to it right.

But what is it? How do we do it? Why do we do it?

§ Help Labs Detect Analytical Error

§ Assist in meeting regulatory requirements

CAP / CLIA Checklist recommends that laboratory must verify or establish

Typical Studies needed for Quantitative Assay :

Analytical methods need to be validated

• before their introduction into routine use

• whenever the method is changed and the change is outside the

Select Diagnostics Test

Check with Report

A Routine Testing Process

Overview & Reference

Points to keep in mind:

§ All statistical tools make assumptions about the data

Methods are designed to test fresh patient samples

• Frozen / stored patient samples

What can alter sample matrix?

Overview & Reference

Analytical Range – aka: Calibration range, reportable range, analytical

Maximum allowa ble CV (1/3 of Total Error Allowed of 20 %) = 6.7%

100 y = 0.9475x + 0.4558

Overview & Reference

Measure of the reproducibility of the results

Characterized using statistical tools

All estimates of precision can be expected to

Two Types of Estimates .

§ With in Run or Short term or Intra-assay Precision

Which Matrix to Use ?

With in Run or Short term or Intra-assay Precision

• 20 replicates for each control material in one run.

Between Run or Long Term or Inter assay Precision.

Essentially the average size of the differences

21-40 9.87 -0.3%

41-60 7.80 -21.2%

Is measured precision acceptable ?

u However, remember limitations on SD calculated from limited data

From CLSI EP5-A2 (Evaluation of Precision Performance of Quantitative

Chi squared (c2): tests whether a measured SD is statistically

Used to compare estimated precision to published performance

ü Calculated X2 is compared to the critical

The calculation is based on SD, therefore it is essential that the SD’s

Since the SD may vary with concentration, we typically use a

Overview & Reference

Method comparison: testing the same group of patient

Commonly used in clinical laboratories to:

§ Lab tests samples by current method

§ Methods do not seem to agree

Most common actual root causes:

Use samples compatible with both methods

• Samples requiring dilution by either method should not be included

Step 1: Plot the data

Prior method New method

8 Mean of Method 1 8.19

Reference method Candidate method

§ Plot can show…

17 Scatter Plot 900 Scatter Plot

Test Method (µg/L)

11 SD = +/- X 200 CV = +/- X%

Comparative Method (µg/L) Comparative Method (µg/L)

Typically chemistry methods Typically immunoassay methods