Capstone Project 2026
Predicting Game Revenue on Steam: What Drives Financial Success Among Top-Selling Titles?
What we're doing: Using a dataset of the top 1,500 revenue-generating Steam games, we analyze which game attributes are most strongly associated with higher net revenue. The project is framed as a business intelligence (BI) initiative for game developers, publishers, and platform operators.
Dataset: Top 1,500 Games on Steam by Revenue (Kaggle, September 2024). It includes price, review score, publisher class, average playtime, release date, and revenue.
Target Variable: Net Revenue (log-transformed)
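A minimal sketch of the log transform on the target, assuming a hypothetical column name `net_revenue` and made-up sample values (neither is taken from the Kaggle file). The brief does not say which log variant is used; `log1p` is shown here only as one common choice because it tolerates zeros.

```python
import numpy as np
import pandas as pd

# Made-up sample values; the real column name in the Kaggle file may differ.
games = pd.DataFrame({"net_revenue": [50_000.0, 1_200_000.0, 85_000_000.0]})

# log1p computes log(1 + x): it compresses the long right tail of revenue
# and stays defined if a value happens to be zero.
games["log_net_revenue"] = np.log1p(games["net_revenue"])

# expm1 inverts the transform so predictions can be reported in dollars.
recovered = np.expm1(games["log_net_revenue"])
print(games)
```

Modeling on the log scale means coefficients describe roughly proportional changes in revenue rather than dollar changes, which is worth stating in the insight brief.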
Dimensions (one per member):
AAA vs. AA vs. Indie (Publisher Class)
Price Tier, low to high ($0 / Under $20 / $20–$40 / $40+)
High vs. Low Review Score (above/below 70%)
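The price and review dimensions above could be derived along these lines. This is a sketch under assumptions: the column names `price` and `review_score` are hypothetical, the sample rows are invented, and the boundary handling (exactly $20 falls in the $20-$40 tier, exactly $40 in the $40+ tier, exactly 70% counts as High) is a choice the brief does not specify. Publisher class is assumed to already be a categorical column and needs no binning.

```python
import numpy as np
import pandas as pd

# Invented rows for illustration only.
games = pd.DataFrame({
    "price": [0.0, 9.99, 20.0, 39.99, 40.0, 59.99],
    "review_score": [55, 92, 70, 69, 88, 75],
})

# right=False makes each bin closed on the left: [0, 0.01), [0.01, 20), ...
price_edges = [0, 0.01, 20, 40, np.inf]
price_labels = ["$0", "Under $20", "$20-$40", "$40+"]
games["price_tier"] = pd.cut(
    games["price"], bins=price_edges, labels=price_labels, right=False
)

# Assumption: a score of exactly 70 is treated as High.
games["review_group"] = np.where(games["review_score"] >= 70, "High", "Low")
print(games)
```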
Factors: Price, Review Score, Average Playtime, Age of Game (derived from release date)
Methods: Multiple Linear Regression + Random Forest, validated with K-Fold Cross Validation (k=5)
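The two models and the 5-fold validation could be wired together roughly as follows with scikit-learn. This is a sketch, not the project's actual pipeline: the data is synthetic (not the Kaggle dataset), the feature names are hypothetical, and the relationship between features and target is made up so the example has some signal to find.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold, cross_val_score

rng = np.random.default_rng(0)
n = 200

# Synthetic stand-in data; column names are hypothetical.
X = pd.DataFrame({
    "price": rng.uniform(0, 60, n),
    "review_score": rng.uniform(40, 100, n),
    "avg_playtime": rng.uniform(1, 100, n),
    "age_years": rng.uniform(0, 10, n),
})
# Made-up log-revenue target with noise.
y = 12 + 0.02 * X["price"] + 0.03 * X["review_score"] + rng.normal(0, 0.5, n)

# k=5 folds, shuffled so fold membership does not depend on row order.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
models = {
    "linear": LinearRegression(),
    "forest": RandomForestRegressor(n_estimators=100, random_state=0),
}

# cross_val_score fits a fresh clone of each model on every fold.
scores = {
    name: cross_val_score(model, X, y, cv=cv, scoring="r2")
    for name, model in models.items()
}
for name, fold_scores in scores.items():
    print(f"{name}: mean R^2 = {fold_scores.mean():.3f} (+/- {fold_scores.std():.3f})")

# Refit the forest on all rows to read off feature importances.
forest = models["forest"].fit(X, y)
importances = pd.Series(
    forest.feature_importances_, index=X.columns
).sort_values(ascending=False)
print(importances)
```

Comparing the per-fold R² of both models on the same folds gives a like-for-like check, and the importance ranking is one possible input to the feature importance report.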
BI Deliverables: Revenue dashboard, price tier comparison panel, feature importance report, insight brief
Key Limitation: The dataset is limited to titles that are already high performers, so findings reflect patterns among top games and may not generalize to the broader Steam library.