📊 Day 17 - Data Download

Ride-Sharing Trip Analytics

Download all 5 tables with complete messy data (50+ trips, 15 riders, 10 drivers)

Window Functions RANK & PARTITION BY Complex JOINs TIMESTAMPDIFF CTEs

Table 1: trips (50 rows)

trip_idrider_iddriver_idpickup_timedropoff_timedistance_kmfare_amountstatus
T001 R101 D201 2024-01-15 08:30:002024-01-15 09:15:0012.5250.00completed
t002 r102 D20215/01/2024 10:00NULLNULLNULLCANCELLED
T003R103d2032024-01-15 14:30:002024-01-15 15:10:008.3175.50 Completed
T004r104 D204 16/01/2024 07:4516/01/2024 08:2015.2NULLcompleted
t005R105D2012024-01-16 18:00:002024-01-16 18:45:0010.8320.00COMPLETED
T006r106D2052024-01-16 09:00:002024-01-16 09:40:007.2145.00completed
t007R107 d206 17/01/2024 11:3017/01/2024 12:1511.5230.00Completed
T008R108D2072024-01-17 13:00:00NULLNULLNULLCANCELLED
t009 r109 D20818/01/2024 08:1518/01/2024 08:509.8196.00completed
T010R110 D209 2024-01-18 17:30:002024-01-18 18:20:0013.4NULLCOMPLETED
t011R111D21019/01/2024 07:0019/01/2024 07:356.5130.00completed
T012 r101 D2012024-01-19 08:30:002024-01-19 09:20:0014.2400.00Completed
t013R112 d202 20/01/2024 10:0020/01/2024 10:4510.1202.00COMPLETED
T014R113D2032024-01-20 14:00:002024-01-20 14:35:007.8156.00completed
t015R114D20421/01/2024 08:00NULLNULLNULLCANCELLED
T016 r115 D205 2024-01-21 12:30:002024-01-21 13:10:009.5190.00Completed
t017R101D20622/01/2024 07:4522/01/2024 08:2511.3226.00completed
T018R102d2072024-01-22 18:15:002024-01-22 19:00:0012.7380.00COMPLETED
t019 R103 D20823/01/2024 09:0023/01/2024 09:408.9178.00completed
T020r104D2092024-01-23 15:30:002024-01-23 16:15:0010.6NULLCompleted
t021R105 D210 24/01/2024 08:3024/01/2024 09:1011.8236.00COMPLETED
T022R106D2012024-01-24 13:00:00NULLNULLNULLCANCELLED
t023 r107 d20225/01/2024 08:4525/01/2024 09:3013.5450.00completed
T024R108D2032024-01-25 11:00:002024-01-25 11:45:009.2184.00Completed
t025R109 D204 26/01/2024 07:3026/01/2024 08:057.5150.00COMPLETED
T026R110D2052024-01-26 17:45:002024-01-26 18:30:0012.1NULLcompleted
t027 r111 D20627/01/2024 09:1527/01/2024 10:0010.4208.00Completed
T028R112d2072024-01-27 14:30:002024-01-27 15:10:008.7174.00COMPLETED
t029R113D20828/01/2024 18:0028/01/2024 18:5014.3350.00completed
T030 R114 D209 2024-01-28 08:00:002024-01-28 08:40:009.6192.00Completed
t031r115D21029/01/2024 12:00NULLNULLNULLCANCELLED
T032R101D2012024-01-29 07:30:002024-01-29 08:15:0011.9238.00completed
t033R102 d202 30/01/2024 10:3030/01/2024 11:108.4168.00COMPLETED
T034 R103 D2032024-01-30 15:00:002024-01-30 15:45:0010.7NULLCompleted
t035r104D20431/01/2024 08:4531/01/2024 09:2512.3246.00completed
T036R105 D205 2024-01-31 13:30:002024-01-31 14:15:009.8196.00COMPLETED
t037R106D20601/02/2024 07:0001/02/2024 07:408.1162.00Completed
T038 r107 d2072024-02-01 11:00:002024-02-01 11:50:0013.8276.00completed
t039R108D20802/02/2024 17:3002/02/2024 18:2015.1NULLCOMPLETED
T040R109 D209 2024-02-02 08:15:002024-02-02 09:00:0010.9218.00completed
t041R110D21003/02/2024 14:0003/02/2024 14:407.6152.00Completed
T042 r111 D2012024-02-03 09:30:002024-02-03 10:15:0011.4228.00COMPLETED
t043R112 d202 04/02/2024 12:00NULLNULLNULLCANCELLED
T044R113D2032024-02-04 07:45:002024-02-04 08:30:009.3186.00completed
t045R114D20405/02/2024 16:0005/02/2024 16:5014.6NULLCOMPLETED
T046 R115 D205 2024-02-05 08:00:002024-02-05 08:45:0010.2204.00Completed
t047r101D20606/02/2024 11:3006/02/2024 12:108.8176.00completed
T048R102d2072024-02-06 15:30:002024-02-06 16:15:0011.7234.00COMPLETED
t049R103D20807/02/2024 09:0007/02/2024 09:5013.2264.00Completed
T050 r104 D209 2024-02-07 13:00:002024-02-07 13:45:009.9198.00completed

Table 2: drivers (10 rows)

driver_iddriver_namejoin_datevehicle_typeratingtotal_trips
D201 amit sharma 2023-05-10sedan4.8523
d202PRIYA KUMAR15/06/2023 SUV NULL412
D203rajesh patel2023-07-22SEDAN4.5NULL
D204 NEHA SINGH 10-08-2023suv4.9678
d205VIKRAM RAO2023-09-15Sedan4.7345
D206kavita desai2023-10-20 sedan 4.6NULL
d207 RAHUL MEHTA 05-11-2023SUVNULL289
D208ANJALI VERMA2023-12-01sedan4.8398
d209suresh kumar15/12/2023 Sedan 4.4NULL
D210 POOJA SHAH 2024-01-05SUV4.9156

Table 3: riders (15 rows)

rider_idrider_namesignup_datecitytotal_trips
R101john doe2024-01-01 mumbai 15
r102 SARAH SMITH 05/01/2024DelhiNULL
R103MIKE WILSON2024-01-08bangalore22
R104anna garcia10-01-2024 MUMBAI 8
r105Bob Martin2024-01-12Delhi12
R106 EMMA BROWN 15/01/2024bangaloreNULL
r107david johnson2024-01-18mumbai18
R108LISA ANDERSON20-01-2024 Delhi 10
r109 james taylor 2024-01-22BANGALORENULL
R110MARIA MARTINEZ25/01/2024Mumbai14
r111william davis2024-01-28delhi9
R112 sophia rodriguez 30-01-2024 Bangalore NULL
r113ROBERT WILSON2024-02-01MUMBAI16
R114jennifer lee03/02/2024delhi7
r115 MICHAEL THOMAS 2024-02-05bangaloreNULL

Table 4: surge_pricing (20 rows)

surge_idtrip_idsurge_multipliersurge_startsurge_end
S001T0052.52024-01-16 17:30:002024-01-16 19:30:00
S002 t012 1.815/01/2024 08:0015/01/2024 10:00
S003T018NULL2024-01-17 18:00:002024-01-17 20:00:00
S004T0232.02024-01-18 08:30:002024-01-18 10:00:00
S005t0291.519/01/2024 17:0019/01/2024 19:00
S006 T012 2.22024-01-19 08:00:002024-01-19 10:00:00
S007t0181.922/01/2024 17:3022/01/2024 19:30
S008T023NULL2024-01-25 08:15:002024-01-25 10:15:00
S009T0291.728/01/2024 17:4528/01/2024 19:45
S010 t005 2.82024-01-16 18:00:002024-01-16 20:00:00
S011T0102.118/01/2024 17:0018/01/2024 19:00
S012t026NULL2024-01-26 17:30:002024-01-26 19:30:00
S013T0181.622/01/2024 18:0022/01/2024 20:00
S014 T029 2.32024-01-28 17:30:002024-01-28 19:30:00
S015t0391.902/02/2024 17:0002/02/2024 19:00
S016T012NULL2024-01-19 08:30:002024-01-19 10:30:00
S017T0232.425/01/2024 08:0025/01/2024 10:00
S018 t005 2.016/01/2024 17:3016/01/2024 19:30
S019T0261.82024-01-26 17:15:002024-01-26 19:15:00
S020t039NULL02/02/2024 18:0002/02/2024 20:00

Table 5: cancellations (15 rows)

cancel_idtrip_idcancelled_bycancel_reasoncancel_time
C001T002 rider driver too far2024-01-15 10:05:00
C002t008DRIVER NULL 16/01/2024 12:30
C003T015riderCHANGE OF PLANS2024-01-17 08:15:00
C004t022 Driver traffic jam18-01-2024 14:45
C005T031RIDERlong wait time2024-01-20 11:30:00
C006 t002 rider DRIVER TOO FAR 15/01/2024 10:10
C007T008DRIVERNULL2024-01-17 13:05:00
C008t015 Rider changed my mind21-01-2024 08:10
C009T022driverEMERGENCY2024-01-24 13:10:00
C010 T031 RIDER found another ride 29/01/2024 12:05
C011t043driverNULL2024-02-04 12:08:00
C012T002 RIDER too expensive15-01-2024 10:03
C013t008DriverVEHICLE ISSUE2024-01-17 13:02:00
C014T043rider long wait 04/02/2024 12:10
C015t031DRIVERbad weather2024-01-29 12:03:00

📌 How to Import into MySQL/PostgreSQL

MySQL:

1. Create database: CREATE DATABASE rideshare;

2. Import CSV: LOAD DATA INFILE 'path/trips.csv' INTO TABLE trips FIELDS TERMINATED BY ',' ENCLOSED BY '"' LINES TERMINATED BY '\n' IGNORE 1 ROWS;


PostgreSQL:

1. Create database: CREATE DATABASE rideshare;

2. Import CSV: \COPY trips FROM 'path/trips.csv' WITH (FORMAT csv, HEADER true);


Note: Create tables first with appropriate column types before importing!