Patricia Cerrito

Patricia Cerrito (PhD) has made considerable strides in the development of data mining techniques to investigate large, complex medical data. In particular, she has developed a method to automate the reduction of the number of levels in a nominal data field to a manageable number that can then be used in other data mining techniques. Another innovation of the PI is to combine text analysis with association rules to examine nominal data. The PI has over 30 years of experience in working with SAS software, and over 10 years of experience in data mining healthcare databases. In just the last two years, she has supervised 7 PhD students who completed dissertation research in investigating health outcomes. Dr. Cerrito has a particular research interest in the use of a patient severity index to define provider quality rankings for reimbursements.

Publications

Text Mining Techniques for Healthcare Provider Quality Determination: Methods for Rank Comparisons
Patricia Cerrito. © 2010. 410 pages.
The quest for quality in healthcare has led to attempts to develop models to determine which providers have the highest quality in healthcare, with the best outcomes for...
Introduction to Ranking Models
Patricia Cerrito. © 2010. 34 pages.
Risk adjustment models only consider patient condition and not patient compliance with treatment.(Rosen, Reid, Broemeling, & Rakovski, 2003) This paper suggests that health...
Data Visualization and Data Summary
Patricia Cerrito. © 2010. 38 pages.
Patients in a clinic or geographic area are very heterogeneous. Therefore, the distribution of patient factors will not have a normal distribution. Generally, the distribution...
Statistical Methods
Patricia Cerrito. © 2010. 37 pages.
Ultimately, a patient severity index is used to compare patient outcomes across healthcare providers. If the outcome is mortality, logistic regression is used. If the outcome is...
Predictive Modeling Versus Regression
Patricia Cerrito. © 2010. 43 pages.
Predictive modeling includes regression, both logistic and linear, depending upon the type of outcome variable. It can also include the generalized linear model. However, there...
The Charleston Comorbidity Index
Patricia Cerrito. © 2010. 49 pages.
In this chapter, we consider the Charlson Comorbidity Index (CCI). This index is published, and the weights used to define risk adjustment in the logistic regression model are...
The All Patient Refined Diagnosis Related Group
Patricia Cerrito. © 2010. 37 pages.
In this chapter, we will discuss the APRDRG, or all patient refined diagnosis related group. It is another type of coding system that, unlike the Charlson Index, is proprietary...
Risk Adjustment Based Upon Resource Utilization
Patricia Cerrito. © 2010. 48 pages.
Resource utilization is based upon the assumption that patients with more severe problems will utilize more resources, and the most severe patients will require the most...
Text Mining and Patient Severity Clusters
Patricia Cerrito. © 2010. 54 pages.
Text mining diagnosis codes takes advantage of the linkage across patient conditions instead of trying to force the assumption of independence. Combinations of diagnoses are used...
Working from Claims Data
Patricia Cerrito. © 2010. 16 pages.
Claims data are more difficult to work with to extract the necessary information about patient conditions in relationship to costs. There can be multiple claims for the same...
Use of Risk Adjustment Models for Provider Reimbursements
Patricia Cerrito. © 2010. 29 pages.
In this chapter, we will focus on the use of patient severity indices to determine the reimbursement to healthcare providers. In order to do this, we must first examine the...
How to Check Measures for Adequacy
Patricia Cerrito. © 2010. 7 pages.
Perhaps the biggest problem when checking measures for adequacy, in addition to overlooking the fact that model assumptions are invalid, is the need to examine the model for...
Cases on Health Outcomes and Clinical Data Mining: Studies and Frameworks
Patricia Cerrito. © 2010. 464 pages.
With the healthcare industry becoming increasingly more competitive, there exists a need for medical institutions to improve both the efficiency and the quality of their...
Clinical Data Mining for Physician Decision Making and Investigating Health Outcomes: Methods for Prediction and Analysis
Patricia Cerrito, John Cerrito. © 2010. 370 pages.
The investigation of healthcare databases can be used to examine physician decisions and develop evidence-based treatment guidelines that optimize patient outcomes. Clinical Data...
Introduction to Data Mining Methodology to Investigate Health Outcomes
Patricia Cerrito. © 2010. 14 pages.
In this case, we provide some basic information concerning the statistical methods used throughout this text. These methods include visualization by kernel density estimation...
Analyzing Problems of Childhood and Adolescence
Patricia Cerrito, Aparna Sreepada. © 2010. 30 pages.
The study presents the analysis of the results of a health survey that focuses on the health risk behaviors and attitudes in adolescents that result in teenage obesity....
Preprocessing the Data
Patricia Cerrito, John Cerrito. © 2010. 10 pages.
In this book, we provide tools that are needed to investigate administrative and clinical databases that are routinely collected in the support of patient treatment. Often, these...
Errors and Missing Values in the Dataset
Patricia Cerrito, John Cerrito. © 2010. 8 pages.
Many of the datasets provided by the federal government have been well cleaned. However, like many other datasets collected for health outcomes research, these datasets contain...
Introduction to the Use of MEPS (Medical Expenditure Panel Survey)
Patricia Cerrito, John Cerrito. © 2010. 38 pages.
We will begin with data from the Medical Expenditure Panel Survey and use it throughout the text. This dataset has been provided since 1996 and contains yearly information...
Preprocessing Medpar Data
Patricia Cerrito, John Cerrito. © 2010. 12 pages.
Medicare data provide information that hospitals submit for billing purposes, Medpar, or Medicare Provider Analysis and Review. It is publicly available (for a fee) at...
Extracting Data from the National Inpatient Sample
Patricia Cerrito, John Cerrito. © 2010. 25 pages.
In the other type of health care database that we discuss in this chapter, there are multiple columns for each patient observation. It is more difficult to find both the most...
Creating a One-to-One Relationship in the Data from a Many-to-Many
Patricia Cerrito, John Cerrito. © 2010. 22 pages.
One of the biggest problems in dealing with healthcare data is that there can be multiple patient events for the same individual. Without finding some way to combine these many...
Merging Different Datasets to Allow for a Complete Analysis (Inpatient, Outpatient, Physician Visits, Medications)
Patricia Cerrito, John Cerrito. © 2010. 38 pages.
If we want to gain a complete picture of patient treatment, we need to examine all patient encounters with the medical profession. Then, we need to divide them into episodes of...
Introduction to Analysis Using Time Components
Patricia Cerrito, John Cerrito. © 2010. 39 pages.
The introduction of a time component requires the use of statistical methods that can utilize dependent data. The assumption of independence that is required for regression...
More Survival Data Mining of Multiple Time of Endpoints
Patricia Cerrito, John Cerrito. © 2010. 22 pages.
Survival analysis is almost always reserved for an endpoint of mortality or recurrence. (Mantel, 1966) However, it can be used for many different types of endpoints as the...
Using the Data to Define Patient Compliance
Patricia Cerrito, John Cerrito. © 2010. 19 pages.
Patient compliance with treatment is essential. However, it is difficult to examine the issue of compliance from claims and administrative databases that include no direct input...
Compression of Diagnosis and Procedure Codes
Patricia Cerrito, John Cerrito. © 2010. 15 pages.
Each of the datasets has many different diagnosis and procedure codes to represent a patient’s condition. There are thousands of potential codes, and millions of potential...
Comparisons of Patient Severity Indices
Patricia Cerrito, John Cerrito. © 2010. 38 pages.
In this section, we will briefly discuss two methods of ranking patient severity. The first method we consider is the AHRQ comorbidities, which is a collection of 30 patient...
Decision Trees and Their Development: Use of Data to Determine the Quality of Care
Patricia Cerrito, John Cerrito. © 2010. 18 pages.
Decision trees are developed to support physicians who must make treatment decisions. Risk estimates are used to find the optimal treatment pathway for a group of patients....
Example of Diabetes Using CMS Data
Patricia Cerrito, John Cerrito. © 2010. 13 pages.
We want to examine the treatment of patients with diabetes, and the reasons these patients are in the hospital. In order to do this, we must consider a cohort of patients who...
Example of Breathing Illnesses, Asthma and COPD Using MEPS Data
Patricia Cerrito, John Cerrito. © 2010. 11 pages.
Asthma and COPD (chronic obstructive pulmonary disease) require maintenance and emergency medication. We want to examine the use of the required medications, as well as to...
Example of Wound Care Using Medpar Data
Patricia Cerrito, John Cerrito. © 2010. 17 pages.
Medpar Data is used for billing Medicare; it typically is “packed” into different sheets of data. The first step required to use the data is to “unpack” it into its component...
Discussion
Patricia Cerrito, John Cerrito. © 2010. 6 pages.
Now that the data are more readily available for outcomes research and the techniques to analyze that data are available, we need to use the tools to investigate the total...
Text Mining to Define a Validated Model of Hospital Rankings
Patricia Bintzler Cerrito. © 2008. 29 pages.
The purpose of this chapter is to demonstrate how text mining can be used to reduce the number of levels in a categorical variable to then use the variable in a predictive model....