Dataset

The New York TLC Dataset

This well-known dataset contains trip details of all taxi trips conducted in New York City in December 2018.


Number of rows:
9,998

Details

The New York City Taxi and Limousine Commission (TLC), created in 1971, is the agency responsible for licensing and regulating New York City’s yellow taxi cabs, for-hire vehicles, commuter vans, and paratransit vehicles. Over 200,000 TLC licensed drivers complete approximately 1,000,000 trips each day. To operate for hire, drivers must first undergo a background check, have a safe driving record, and complete 24 hours of driver training.

In partnership with the New York City Department of Information Technology and Telecommunications, TLC has published millions of trip records from both yellow and green taxis. The taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.

Data Schema

The dataset can be downloaded in CSV, Parquet, XLSX or JSONL format and has the following schema:

Column nameColumn typeMissing data?
Row IDIntegerNot allowed
VendorIDIntegerAllowed
tpep_pickup_datetimeTextAllowed
tpep_dropoff_datetimeTextAllowed
passenger_countIntegerAllowed
trip_distanceFloatAllowed
RatecodeIDIntegerAllowed
store_and_fwd_flagBooleanAllowed
PULocationIDIntegerAllowed
DOLocationIDIntegerAllowed
payment_typeIntegerAllowed
fare_amountFloatAllowed
extraFloatAllowed
mta_taxFloatAllowed
tip_amountFloatAllowed
tolls_amountFloatAllowed
improvement_surchargeFloatAllowed
total_amountFloatAllowed

Labs Exploring The New York TLC Dataset