Beruflich Dokumente
Kultur Dokumente
Daniel J. Weidman, Ph.D. Advanced Electron Beams, Wilmington, MA IEEE Boston-area Reliability Society Meeting Wednesday, January 14, 2009
Daniel@Weidman.com
The pdf of this version of this presentation may be distributed freely. No proprietary information is included. If distributing any portion, please include my name as the author. Thank you, Dan Weidman Copyright 2009, D. J. Weidman
1
D. J. Weidman January 2009
3
D. J. Weidman January 2009
Practical Reliability Engineering for Semiconductor Equipment Goal and scope of this talk
Review basic reliability engineering concepts and show how they can be used successfully Applicable to equipment in the semiconductor industry, and other industries
4
D. J. Weidman January 2009
5
D. J. Weidman January 2009
6
D. J. Weidman January 2009
PVD Machine
Physical Vapor Deposition of thin metal film Wafers carried on trays to minimize handling & time to change size Up to five metals in a small footprint
7
D. J. Weidman January 2009
Practical Reliability Engineering for Semiconductor Equipment Goal Reliability program plan
Immediate issues Reactive reliability engineering
Overall process
Data gathering to record each issue Data tallying
8
D. J. Weidman January 2009
Reliability program
Failure Reporting, Analysis, and Corrective Action System
Field Service Engineer receives customer report or observes fault
customer
parts or information or both
reports fault
internet access
FSR
on new machine
FAR
10
D. J. Weidman January 2009
Practical Reliability Engineering for Semiconductor Equipment Goal Reliability program plan
Immediate issues Reactive reliability engineering
Overall process Reliability engineering metrics with examples
Fault vs. failure Pareto plot Uptime and Availability MTBF, MTBA, MTBI MTTR, MTR Additional metrics specific to industry Additional metrics
11
D. J. Weidman January 2009
faults
failures
action
Fault: anything that has gone wrong Failure: an equipment problem All failures are faults Examples: If a transport system stops due to
particles that are normal to the process, then it is a failure (and a fault). a left wrench inside, then its a fault but not a failure.
12
D. J. Weidman January 2009
Lo c Lo c Lo c Lo c Lo c Lo c Lo c Lo c at io n 8 7 at io n at io n 6 4 at io n at io n 5 3 at io n 2 at io n at io n 1
Location
D. J. Weidman January 2009
Pareto plots
Function
13
Machine cross-section
Front end: Location 1 Location 2 Location 3 Location 4
Location 8
Location 5
Location 6
Chase: Location 7
D. J. Weidman January 2009
Lo c Lo c Lo c Lo c Lo c Lo c Lo c Lo c at io n 8 6 at io n at io n 5 7 at io n at io n 3 1 at io n 4 at io n at io n 2
Location Function
15
1C
2I
2C
4I
4C
1I
1C
2I
2C
4I
4C
1I
Most of these faults are not failures: upgrades of subsystem on older machines or rebooting No predominant issue
17
D. J. Weidman January 2009
1C
2I
2C
4I
4C
1I
18
D. J. Weidman January 2009
Lo c Lo c Lo c Lo c Lo c Lo c Lo c Lo c at io n 8 7 at io n at io n 6 4 at io n at io n 5 3 at io n 2 at io n at io n 1
Location
D. J. Weidman January 2009
Function
19
Fu
April
April
nc ti Fu o n nc C t Fu ion nc I ti Fu o n nc D ti Fu o n nc H t Fu ion nc E tio Fu n G nc ti Fu o n B nc tio n Fu nc F tio n A Lo ca tio Lo n 4 ca tio n Lo 3 ca tio Lo n 7 ca ti Lo on 1 ca tio n Lo 5 ca tio Lo n 6 ca tio n Lo 2 ca tio n 8 Fu nc tio n C nc tio Fu Fu
May
May
June
June
n nc I tio n Fu nc D ti Fu o n H nc tio Fu n E nc tio n Fu F nc tio n Fu nc A tio Fu n G nc tio n B Fu nc tio Fu n D nc tio Fu n C nc t Fu ion nc I ti Fu o n nc E ti Fu o n H nc ti Fu o n G nc ti Fu o n B nc ti Fu o n nc F tio n A
20
Practical Reliability Engineering for Semiconductor Equipment Goal Reliability program plan
Immediate issues Reactive reliability engineering
Overall process Reliability engineering metrics with examples
Fault vs. failure Pareto plot Uptime and Availability MTBF, MTBA, MTBI MTTR, MTR Additional metrics specific to industry Additional metrics
21
D. J. Weidman January 2009
downtime
PM / Scheduled UM / Repairs
availability
All time: either uptime or downtime Uptime: either operating or idle time Uptime (hours) availability (%) Downtime: either PM, or Unscheduled Maintenance (Repairs) MTTR (mean time to repair) applies to PM and to UM
22
D. J. Weidman January 2009
Above plot is from SEMI E10 We assume that Total Time is Operations Time
23
D. J. Weidman January 2009
quarter
Machine Availability
Specification: availability > 85% Typical performance: 90%
quarter
Machine Availability
Specification: availability > 85% Typical performance: 90%
Measured from Field Service Reports Machine PM time approximately 7%. Customers report
At beta Customer, one machine: 94.3% avail. better than 6% PM At another customer: 6% PM reported
D. J. Weidman January 2009
25
Practical Reliability Engineering for Semiconductor Equipment Goal Reliability program plan
Immediate issues Reactive reliability engineering
Overall process Reliability engineering metrics with examples
Fault vs. failure Pareto plot Uptime and Availability MTBF, MTBA, MTBI MTTR, MTR Additional metrics specific to industry Additional metrics
26
D. J. Weidman January 2009
28
D. J. Weidman January 2009
month
29
quarter
250 hours is specified Field Service Reports indicate we exceed this Quarterly less noisy than monthly
30
D. J. Weidman January 2009
week
Per machine, averaged over two machines 1: Adjusted one of the subsystems 31 2: Installed an upgraded version of the subsystem in one machine
D. J. Weidman January 2009
MTTR (Mean Time To Repair) and MTR MTTR is Mean Time to Repair (SEMI E10 definition): the average elapsed time (not person hours) to correct a failure and return the equipment to a condition where it can perform its intended function, including equipment test time and process test time (but not maintenance delay). MTR is Mean Time to Restore: includes maintenance delays.
32
D. J. Weidman January 2009
Practical Reliability Engineering for Semiconductor Equipment Goal Reliability program plan
Immediate issues Reactive reliability engineering
Overall process Reliability engineering metrics with examples
Fault vs. failure: all failures are faults Pareto plot: location and function, sample size of several Uptime and Availability: time is up or down MTBF, MTBA, MTBI: I = F + A MTTR, MTR: working time vs. clock time Additional metrics specific to industry Additional metrics
33
D. J. Weidman January 2009
Practical Reliability Engineering for Semiconductor Equipment Goal Reliability program plan
Immediate issues Reactive reliability engineering
Overall process Reliability engineering metrics with examples
Fault vs. failure: all failures are faults Pareto plot: location and function, sample size of several Uptime and Availability: time is up or down MTBF, MTBA, MTBI: I = F + A MTTR, MTR: working time vs. clock time Additional metrics specific to industry Additional metrics
34
D. J. Weidman January 2009
Broken wafers
Goals
Ideally zero In practice, need fewer than 1 in 10k (or 1 in 100k)
>7,000k wafers/year on all machines within 1 in 300k Total reported on newer-style machines: zero
35
D. J. Weidman January 2009
12 10 8 6 4 2 0
Nimbus reporting
month Another machine arrived at customer 2 more machines both arrived at customer First machine with two new major features shipped.
36
D. J. Weidman January 2009
unplanned maintenance
Unplanned maintenance (UM) based on FSRs only Actual UM is higher Data scattered: 1 std dev values themselves All machines have reported in time shown (6 quarters) 37
D. J. Weidman January 2009
C om po ne nt C 1 om po ne nt C om 2 po ne nt C om 3 po ne nt C 4 om po ne nt C om 5 po ne nt C 6 om po ne nt 7
Failures by component
38
39
D. J. Weidman January 2009
Item 3
Item 4
Item 5
Item 6
Item 7
Item 8
Item 10
Item 11
Item 12
Item 2
Item 9
Item 1
Item 16
Item 17
Item 18
Item 19
Item 20
Item 21
Item 22
Item 13
Item 14
5 quarters
41
D. J. Weidman January 2009
Item 15
Item 23
All failures
9 8 7 6 5 4 3 2 1 0
10
11
12
13
14
15 16 17
18
19
20
21
22
23
cause Two quarters 123 reported failures, which fell into 65 categories One series and another series Categories are named by cause not by symptom Other faults were not included. If PM was required, then the fault was not counted as a failure. Failures occurring twice or more are plotted, which are in 23 categories. Failures occurring three times or more were analyzed13 categories. Next slides
42
D. J. Weidman January 2009
43
D. J. Weidman January 2009
Thank you
44
D. J. Weidman January 2009
Thank you
End of presentation
45
D. J. Weidman January 2009