Dataset

The Cleveland CAD Dataset

This dataset contains clinical data of real-life patients who may or may not be suffering from cardiovascular disease.


Number of rows:
303

Details

The Cleveland CAD dataset from the University of California UCI contains real-life diagnostic information of 303 anonymized patients and was compiled by Robert Detrano, M.D., Ph.D of the Cleveland Clinic Foundation back in 1988.

The dataset contains 13 features which include the results of the aforementioned non-invasive diagnostic tests along with other relevant patient information. The label represents the result of the invasive coronary angiogram and indicates the presence or absence of cardiovascular disease (CAD) in the patient. A label value of 0 indicates absence of CAD and label values 1-4 indicate the presence of CAD.

Data Schema

The dataset can be downloaded in CSV, Parquet, XLSX or JSONL format and has the following schema:

Column nameColumn typeMissing data?
Row IDIntegerNot allowed
AgeIntegerAllowed
SexIntegerAllowed
CpIntegerAllowed
TrestBpsIntegerAllowed
CholIntegerAllowed
FbsIntegerAllowed
RestEcgIntegerAllowed
ThalacIntegerAllowed
ExangIntegerAllowed
OldPeakFloatAllowed
SlopeIntegerAllowed
CaTextAllowed
ThalTextAllowed
DiagIntegerAllowed

Labs Exploring The Cleveland CAD Dataset