The Cleveland CAD Dataset
This dataset contains clinical data from real-life patients who may or may not be suffering from coronary artery disease (CAD).
https://csvbase.com/mdfarragher/Heart-Disease
303 rows
Details
The Cleveland CAD dataset from the University of California, Irvine (UCI) contains real-life diagnostic information on 303 anonymized patients. It was compiled in 1988 by Robert Detrano, M.D., Ph.D., of the Cleveland Clinic Foundation.
The dataset contains 13 features, which include the results of non-invasive diagnostic tests along with other relevant patient information. The label represents the result of the invasive coronary angiogram and indicates the presence or absence of coronary artery disease (CAD) in the patient. A label value of 0 indicates the absence of CAD, and label values 1-4 indicate its presence.
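Because the label only distinguishes absence (0) from presence (1-4), it is often convenient to collapse it into a binary flag. The following is a minimal sketch of that mapping; the function name `has_cad` is hypothetical, and it assumes a missing label is represented as `None`.

```python
def has_cad(diag):
    """Collapse a 0-4 label into a binary flag (hypothetical helper).

    Returns False for 0, True for 1-4, and None when the label is missing.
    """
    if diag is None:
        return None  # assumption: missing labels are passed in as None
    if diag not in range(5):
        raise ValueError(f"label must be between 0 and 4, got {diag!r}")
    return diag > 0


# Example: one value per possible label, plus a missing one.
flags = [has_cad(d) for d in [0, 1, 2, 3, 4, None]]
```

Raising on values outside 0-4 is a design choice of this sketch, so that unexpected labels surface early instead of being silently counted as disease.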
Data Schema
The dataset can be downloaded in CSV, Parquet, XLSX or JSONL format and has the following schema:
| Column name | Column type | Missing data? |
|---|---|---|
| Row ID | Integer | Not allowed |
| Age | Integer | Allowed |
| Sex | Integer | Allowed |
| Cp | Integer | Allowed |
| TrestBps | Integer | Allowed |
| Chol | Integer | Allowed |
| Fbs | Integer | Allowed |
| RestEcg | Integer | Allowed |
| Thalac | Integer | Allowed |
| Exang | Integer | Allowed |
| OldPeak | Float | Allowed |
| Slope | Integer | Allowed |
| Ca | Text | Allowed |
| Thal | Text | Allowed |
| Diag | Integer | Allowed |
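
The schema above can be applied when reading the CSV export. The sketch below uses only the Python standard library and makes several assumptions that the schema does not confirm: the CSV header names match the column names in the table, a missing value appears as an empty field or as `?`, and the Row ID column is left out because its exported header name is not given here. The two sample rows are illustrative values, not a verified extract of the dataset, and the helper names `parse_row` and `read_rows` are hypothetical.

```python
import csv
import io

# Column groups taken from the schema table (Row ID omitted, see lead-in).
INT_COLUMNS = ["Age", "Sex", "Cp", "TrestBps", "Chol", "Fbs", "RestEcg",
               "Thalac", "Exang", "Slope", "Diag"]
FLOAT_COLUMNS = ["OldPeak"]
TEXT_COLUMNS = ["Ca", "Thal"]

# Assumption: these markers denote missing data in the export.
MISSING = {"", "?"}


def _convert(value, caster):
    """Return None for a missing field, otherwise cast the stripped text."""
    value = (value or "").strip()
    return None if value in MISSING else caster(value)


def parse_row(raw):
    """Convert one csv.DictReader row (all strings) to typed values."""
    row = {}
    for name in INT_COLUMNS:
        row[name] = _convert(raw.get(name), int)
    for name in FLOAT_COLUMNS:
        row[name] = _convert(raw.get(name), float)
    for name in TEXT_COLUMNS:
        row[name] = _convert(raw.get(name), str)
    return row


def read_rows(csv_text):
    """Parse CSV text into a list of typed row dictionaries."""
    return [parse_row(r) for r in csv.DictReader(io.StringIO(csv_text))]


# Illustrative sample only; the second row shows both missing markers.
SAMPLE = (
    "Age,Sex,Cp,TrestBps,Chol,Fbs,RestEcg,Thalac,Exang,OldPeak,Slope,Ca,Thal,Diag\n"
    "63,1,1,145,233,1,2,150,0,2.3,3,0,6,0\n"
    "57,0,4,,241,0,0,123,1,0.2,2,?,7,1\n"
)

rows = read_rows(SAMPLE)
```

Since every column except Row ID allows missing data, each converter returns `None` rather than raising on an empty field. Ca and Thal stay as text, matching the schema; converting them to numbers is left to the caller.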