Master SQL One Problem at a Time
โจ Daily SQL Problems
โจ Real-World Scenarios
โจ Complete Solutions
โจ Interview Ready
Get the messy e-commerce data to practice this challenge!
โฌ๏ธ Click Here to Download CSV Files3 tables: raw_transactions, product_catalog, customer_profiles
Table 1: raw_transactions - Sample data (10 rows total)
| trans_id | customer_email | product_sku | amount | trans_date | status |
|---|---|---|---|---|---|
| TX001 | JOHN@EMAIL.COM | SKU-123-A | $125.50 | 2024-01-15 14:30:00 | completed |
| TX002 | sarah.j@invalid | sku-456-b | 45 | 15/01/2024 | PENDING |
| TX003 | mike@shop.com | SKU-789-C | $0 | 2024-01-20 | Cancelled |
| TX004 | emma@store.io | sku-123-a | 125.50 | 2024/01/25 10:15 | COMPLETED |
...and 6 more messy rows! Download the full dataset above.
Table 2: product_catalog - Clean reference data
| sku | product_name | category | price |
|---|---|---|---|
| SKU-123-A | Wireless Mouse | Electronics | 125.50 |
| SKU-456-B | USB Cable | Accessories | 45.00 |
| SKU-789-C | Laptop Stand | Furniture | 299.99 |
Table 3: customer_profiles - Also messy data!
| full_name | join_date | country | |
|---|---|---|---|
| john@email.com | john smith | 2023-05-10 | USA |
| SARAH.J@INVALID | Sarah Jones | 05/10/2023 | uk |
| mike@shop.com | MIKE WILSON | 2023-06-15 | Canada |
Build a complete data cleaning pipeline using SQL to transform messy e-commerce data into clean, analytics-ready format.
๐ก SQL Skills Required: TRIM, UPPER, LOWER, REPLACE, CASE, CAST, STR_TO_DATE, REGEXP_LIKE, JOINs, CTEs, COALESCE, NULLIF
This is exactly what Data Engineers do daily - cleaning messy data from various sources!
Congratulations!
You've learned complete data cleaning using all SQL concepts! This is exactly what Data Engineers work with in real-world scenarios - messy data from multiple sources that needs standardization, validation, and quality scoring.
Join me on the 100 Days SQL journey!
๐ผ Connect on LinkedIn โญ Star on GitHub
LinkedIn: linkedin.com/in/rahulkanche
GitHub: github.com/rahulkanche/100-days-sql-data
โ Practice on messy real-world data
โ Build complete cleaning pipelines
โ Master data quality validation
โ Prepare for Data Engineering interviews
Follow the 100 Days SQL Challenge
Master SQL one problem at a time! ๐
Rahul Kanche โ exploring data, decoding clarity!