Beruflich Dokumente
Kultur Dokumente
Michael Silver, MD, FACP FCCP, FCCM; Taiki Sakata; , Hua-Ching Su, MS; Charles Herman; Steven B. Dolins, PhD; Michael J. OShea
ABSTRACT Healthcare provider organizations are faced with a rising number of nancial pressures. Both administrators and physicians need help analyzing large numbers of clinical and nancial data when making decisions. To assist them, Rush-PresbyterianSt. Lukes Medical Center and Hitachi America, Ltd. (HAL), Inc., have partnered to build an enterprise data warehouse and perform a series of case study analyses. This article focuses on one analysis, which was performed by a team of physicians and computer science researchers, using a commercially available on-line analytical processing (OLAP) tool in conjunction with proprietary data mining techniques developed by HAL researchers. The initial objective of the analysis was to discover how to use data mining techniques to make business decisions that can inuence cost, revenue, and operational efciency while maintaining a high level of care. Another objective was to understand how to apply these techniques appropriately and to nd a repeatable method for analyzing data and nding business insights. The process used to identify opportunities and effect changes is described. KEYWORDS Data mining On-line analytical processing tool (OLAP) Data warehouse Business process improvement
Note: The authors would like to thank Pat Skarulis, chief information ofcer at RushPresbyterianSt. Lukes Medical Center, Yoichi Shintani, vice president at Hitachi America, Ltd., and Bob Kero, chief of business development at Hitachi America, Ltd., for providing guidance for this research. Thanks to Shinji Fujiwara and Arti Denterlein, our colleagues at Hitachi America, Ltd., for setting up the case study environment.
JOURNAL OF HEALTHCARE INFORMATION MANAGEMENT, vol. 15, no. 2, Summer 2001 Healthcare Information Management Systems Society and Jossey-Bass, A Publishing Unit of John Wiley & Sons, Inc.
155
156
Healthcare provider organizations are faced with a rising number of nancial pressures: payer reimbursements that are not covering costs, uninsured patients who are provided care at low or no reimbursement, increased labor costs, decreased admissions, and so on. Both administrators and physicians need help analyzing clinical and nancial data when making decisions.1,2 To assist administrators and physicians, Rush-PresbyterianSt. Lukes Medical Center and Hitachi America, Ltd. (HAL), Inc., have partnered to build an enterprise data warehouse and to perform a series of case study analyses; this article focuses on one analysis.
157
subset of data points with the highest average value, given a set of input attributes and a minimum size of the region specied by the user. Data records contain input variables and an output variable (variables are record attributes or derived attributes, and the output variable must be a measure). A records location in a dimensional space is based on the value of its attributes, for example, attending physician, payer, and LOS in a hospital database. PRIM nds regions where the output variable has a high average value compared to the average value for the entire set of records. PRIM could also be used to nd regions with minimum average value by maximizing the negative values of the output variable. WIS. WIS is an association rule tool that nds relationships between various attributes in a database; some of the attributes can be derived measures. The relationships are dened in terms of if-then rules that show the frequency of records appearing in the database that satises the rule. For example, ninety out of one hundred patients in the database with DRG 999 have a length of stay greater than or equal to ten days. Data mining and data warehousing are becoming more prevalent in the healthcare industry because of the large quantities of data stored in various systems at medical institutions and the number of business decisions made based on the data.8,9,10
158
numerous, complicated, and interdependent factors. For example, when do you apply OLAP, an association rule tool? This was an important decision that led to our better understanding of how to apply these techniques appropriately and to nd repeatable methods for analyzing data and nding business insights. The data mining tools and the OLAP tool were evaluated, and we attempted to use the strength of each tool. Both PRIM and WIS are capable of analyzing large numbers of records in a database. PRIM is an algorithm for solving global optimization problems. PRIM can process many dimensions simultaneously when nding the best region, that is, a subset of data points with a high average value for an output variable. A SQL statement or rule can represent this region. WIS is useful for nding patterns or associations between attributes. It does not nd optimized regions. For WIS the patterns are represented by rules; each rule describes a region that consists of data points satisfying the rules conditions. The results of neither PRIM nor WIS are easy for users to evaluate. A user cannot easily look at a SQL statement describing a PRIM region and intuitively understand the differences between the high average region and the outer region. A user can look at a WIS rule or pattern and understand the attributes and values. However, some difference between the rules region and the outer region may be missed, which could offer an explanation of the meaning of the pattern. OLAP tools cannot discover high average regions or nd new patterns in data. OLAP does allow users to drill down into detail, once a data area of focus is identied, and then lets the user visualize the result on various dimensions effectively. This means the tools can be used to complement one another. For example, PRIM nds an optimized region (a subset of data points), then OLAP can graphically display aggregated values for various dimensions for the region and points outside the region, that is, the outer region. For WIS, the algorithm nds an association rule (a subset of data points) by looking at all combinations of attributes; OLAP can then display data graphically for both the region and outer region. In WISs case the region is not an optimized region but a region made of records satisfying the criteria in the rule. We used PRIM and WIS to nd regions. In essence we ran PRIM and WIS on the entire inpatient record set: inpatients in a department and inpatients with a specic DRG. We experimented with parameter settings so that the algorithms would run efciently and nd the most accurate results. For example, for PRIM we needed to select an alpha value, which controls how fast the algorithm nds a region, and beta value, which constrains the size of the region. Based on the data mining results, we used OLAP to compare data points in a region to data points outside a region. We could run an OLAP report on all dimensions, for example, on all input variables used in PRIM. Even new measures could be dened for the OLAP analysis, for example, day of week of admission. Based on the OLAP results, further OLAP reports can be run to drill down on interesting dimensions.
159
Figure 1. DRG Analysis Using a Region Found by PRIM: Medicare and Medicaid Data for One Year
Visit, Loss, Average Loss, Average Age, Average LOS (Measures Level) Visit Loss Average Loss
Average Age
Visit 64 362
160
Numerous OLAP reports were run on the attributes, for example, nancial class, marital status, and age. The report on nancial class broke down the losses by the following categories: Medicare-Exempt Rhab/Psych/SNF, MedicareNonexempt, and Medicaid. Medicare-Exempt Rhab/Psych/SNF had large and comparable losses in the region and outside the region, but the average loss was signicantly larger in the regionalmost seven times larger. The report on age showed that inpatients between the ages of forty-six and sixty-four had signicantly larger losses than the rest of the patients in the region. This is shown in Figure 2. They also had signicantly larger losses than the patients outside the region. Based on these results, it was decided to better understand why patients between the ages of forty-six and sixty-four had poor nancial performance. A follow-up OLAP analysis was performed that investigated admission source. The analysis revealed that for inpatient visits in the region with MedicareNonexempt, patients admitted via routine admission had the highest average loss; for inpatient visits in the region with Medicare-Exempt Rhab/Psych/SNF , patients admitted via routine admission had the most visits. We further investigated Medicare-Exempt Rhab/Psych/SNF inpatients with a routine admission source. For this subset of inpatient visits inside the region and outside the region, that is, patients between the ages of forty-six and sixty-four and who entered via routine admission and whose payer is Medicare-Exempt Rhab/Psych/SNF , we ran OLAP reports on admission day of week, icd-9 procedure, icd-9 diagnosis, and surgeon department. We discovered that on Tuesdays, the average loss is signicantly greater for patients in the region. Patients in the region have two times the average loss of patients in the region admitted on days of the week other than Tuesday. The difference between patients in the region and patients outside the region is even more dramatic. Although the absolute number of patients is small, and differences may not be statistically signicant, we believe that this approach will be useful for high-volume cases. Examination of Tuesdays admitting physicians revealed that several of the physicians were in the same medical specialty. This specialty cared for patients that typically required a high level of service intensity over a long period of time. The identication of a subset of patients with disproportionately high costs has prompted the institution to reevaluate its admission criteria to this unit.
161
Figure 2. DRG Analysis Using a Region Found by PRIM: Medicare and Medicaid Data for One Year
Average Loss 100 90 80 70 60 50 40 30 20 10 0 1945 4664 Age Description and All Age Dimensions Box 0/Medicare-Exempt Rhab/Psych/SNF Box 0/Medicare-Nonexempt Outer 0/Medicaid Outer 0/Medicare-Exempt Rhab/Psych/SNF Outer 0/Medicare-Nonexempt 65120
Values Displayed Average Loss 1945 4664 65120 Medicare-Exempt Rhab/Psych/SNF $18,543.05 $26,644.60 $17,785.52 Medicare-Nonexempt $63,989.42 $14,162.33 $4,013.35 $4,323.30 Medicaid $5,606.27 $3,481.69 Medicare-Exempt Rhab/Psych/SNF $3,877.73 Medicare-Nonexempt $929.36
Box 0 Outer 0
In order to effect change we need to identify opportunities, that is, either large nancial losses that can be prevented, large nancial successes that can be identied, or moderate nancial success that should be promoted. Some action(s) must be taken. Once these opportunities are identied and action(s) taken, then we need to be able to measure change over timeto rerun these tools in the same manner repeatedly over time so that we can measure and
162
determine whether the action(s) taken have the desired effect. This is one reason that developing a repeatable methodology is important. Several case studies run at Rush-PresbyterianSt. Lukes have resulted in senior management reviewing identied issues and opportunities. In several other studies, the ndings provided additional support for actions being considered by the institution. For the case study described in this article, actions are being considered but have not yet been implemented. One observation that should have been anticipated but was not is the effect on the ow of the information with the introduction of this system. Historically, information has owed unidirectionally from Physician to Medical Record Department to Billing or Physician to Patient to Cost Center Manager to Billing. With the introduction of this tool, we have already seen physicians providing valuable feedback to the medical records department regarding how specic patient services are coded. This has created additional opportunities and challenges for the organization. A second reason for developing a repeatable methodology is to be able to semi-automate the analysesthat is, to run thorough, critical evaluations of the entire institution, departments, or DRGs on demand. We believe we are developing a process that will allow managers who are not skilled in data mining techniques to view their business units data in a format that allows them to use their domain expertise to ask additional questions or develop defensible arguments for change. This approach should be valid at the managers level, the Department of Medicine level, or the level of the charge nurse in a patient care area. A third reason for developing the repeatable methodology is so that we can easily apply these tools to other institutions and possibly other industries. However, the method still depends on human expertise to understand the OLAP reports, make inferences, run more reports, and take action(s); it is not fully automated.
Conclusion
A cross-functional team was formed that included clinical, nancial, and technical expertise. Business problems were formulated at different levels of abstraction in order to identify nancial opportunities. We had a set of tools that we developed and bought, and we tried executing these tools in various combinations with one another. We eventually came up with a repeatable methodology. We discovered that we can apply PRIM and WIS to nd optimized regions and patterns, respectively, by performing complex algorithmic steps. And then after running those tools, we could take a region and points outside the region and compare them, using OLAP, which allows us to compare the region and outer region attribute by attribute. In other words, standard OLAP reports can be run in which each dimension or attribute (payer, admission source, patient age, physician
163
perspectives, marital status, and so on) can be described. If any of the reports describing an attribute is interesting, then a set of follow-up, detailed OLAP reports can be run. We showed how to apply the methodology to an analysis of one DRG. We explained each step and the results of running each step; we illustrated how this method helped turn data into knowledge. Results were presented in a graphically appealing way that helped users determine how to use the information to make decisions. References
1. Schneider, P. How Do You Measure Success? Healthcare Informatics, 1998, 15(3), 4556. 2. Rosenstein, A. Inpatient Clinical Decision-Support Systems: Determining the ROI. Healthcare Financial Management, Feb. 1999, pp. 5155. 3. Peterson, T., Pinkelman, J., and Pfeiff, B. Microsoft OLAP Unleashed. Indianapolis, Ind.: SAMS/McMillian USA, 1999. 4. Brachman, R., Khabaza, T., Kloesgen, W., Piatetsky-Shapiro, G., and Simoudis, E. Mining Business Databases. Communications of the ACM, 1996, 39(11), 4248. 5. Brin, S., Motwani, R., Ullman, J., and Tsur, S. Dynamic Itemset Counting and Implication Rules for Market Basket Data. Paper presented at the SIGMOD Conference, Tucson, Ariz., 1997. 6. Srivastava, A., and Singh, V. Deriving Interpretable Rules for Financial Outliers in Rush Hospital Data. Unpublished internal technical report, Hitachi America, Ltd. 7. Friedman, J., and Fisher, N. Bump Hunting in High-Dimensional Data. Statistics and Computing, 1999, 9(2), 123143. 8. Scheese, R. Data Warehousing as a Healthcare Business Solution. Healthcare Financial Management, Feb. 1998, pp. 5659. 9. Borok, L. Data Mining: Sophisticated Forms of Managed Care Modeling Through Articial Intelligence. Journal of Health Care Finance, 1997, 23(3), 2036. 10. Herr, W. The Benets of Data Integration: HFMA Study Findings. Healthcare Financial Management, Sept. 1996, pp. 5256.
164
Steven B. Dolins received his BS in physics and MS in computer science from Tulane University, and his PhD in computer science from the University of Texas, Arlington. He has worked on semiconductor, military, and consumer packaged goods applications for fteen years and is currently chief researcher at the Information Technology Laboratory at Hitachi America, Ltd. Michael J. OShea is an IS project manager for Rush-PresbyterianSt. Lukes Medical Center. He has project management responsibility for the development and implementation of a data warehousing project, is responsible for the patient accounting system, and acts as liaison between the nancial management group and the IS department.