Behind the Visualizations: The Data That Powers This Story

This project combines multiple datasets to explore how food delivery has grown, what people order, and how they feel about it.

Restaurant and Ratings Data

Source: Uber Eats USA Restaurants (Kaggle)

This dataset contains restaurant-level information including ratings, categories, pricing, and locations. It was used to analyze how restaurants differ across cities and cuisines.

Key Fields

name, category, price_range
score, ratings
city, location (lat/lng)

Used For

Geography visualization
Cuisine comparisons
Restaurant distribution insights

DoorDash Orders Dataset

Source: DoorDash Dataset (Kaggle)

This dataset captures operational data such as order timing, delivery duration, and platform activity. It helps reveal when and how people use food delivery.

Key Fields

created_at, delivery time
total_items, subtotal
dashers & order volume

Used For

Time-of-day ordering trends
Weekly usage patterns
Demand vs supply insights

Google Trends and Sentiment Data

Sources: Google Trends

Combines search interest data with sentiment analysis scores to understand both popularity and public opinion of food delivery platforms.

Key Fields

search interest over time
sentiment (positive/neutral/negative)
VADER & AFINN scores

Used For

Trend growth visualization
Platform sentiment comparison
Public perception analysis

Canadian DoorDash Dataset

Source: Food Delivery in Canada (Kaggle)

Focused on Canadian cities, this dataset was used to explore regional differences in cuisine preferences and ordering behavior.

Key Fields

city, restaurant category
ratings and pricing

Used For

Canada-focused map visualization
Cuisine popularity by city

Cleaned Combined Dataset

Source: Food Delivery Apps (Mendeley)

A cleaned and merged dataset combining multiple sources. This dataset standardizes fields like city, category, and ratings for consistent analysis across visualizations.

Key Fields

restaurant, city, category
star ratings, price range

Used For

Star rating visualization
Cross-dataset comparisons
Final unified analysis