Capstone Project 2026

Predicting Game Revenue on Steam: What Drives Financial Success Among Top-Selling Titles?

 

What we're doing: Using a dataset of the top 1,500 revenue-generating Steam games, we analyze which game attributes are most strongly associated with higher net revenue. The project is framed as a BI initiative for game developers, publishers, and platform operators.

Dataset: Top 1,500 Games on Steam by Revenue (Kaggle, September 2024). It includes price, review score, publisher class, average playtime, release date, and revenue.

Target Variable: Net Revenue (log-transformed)
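
As a minimal sketch of why the target is log-transformed: revenue among top sellers spans several orders of magnitude, and taking logs turns equal revenue ratios into equal differences. The revenue values below are hypothetical, and base 10 is an assumption, since the brief does not state which base is used.

```python
import math

# Hypothetical net revenues in USD (illustrative values, not from the dataset)
sample_revenues = [50_000, 5_000_000, 500_000_000]

# Base 10 is an assumption; the brief does not say which log base is used
log_revenues = [math.log10(r) for r in sample_revenues]

# Each 100x jump in raw revenue becomes a constant step of 2 on the log10 scale
steps = [b - a for a, b in zip(log_revenues, log_revenues[1:])]
print(log_revenues, steps)
```

One consequence is that model coefficients on the log scale are read as approximate multiplicative effects on revenue rather than dollar amounts.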

Dimensions (one per member):

  • AAA vs. AA vs. Indie (Publisher Class)

  • Price Tier ($0 / Under $20 / $20–$40 / $40+)

  • High vs. Low Review Score (above/below 70%)
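
The price and review dimensions above could be derived from the raw columns roughly as follows. This is a sketch, not the project's actual code: the function names are hypothetical, and the handling of boundary values (exactly $40 goes to the top tier, exactly 70% counts as High) is an assumption, since the brief does not specify it. Publisher class is already a column in the dataset and needs no derivation.

```python
def price_tier(price_usd: float) -> str:
    """Map a price in USD to one of the four price tiers."""
    if price_usd == 0:
        return "$0"
    if price_usd < 20:
        return "Under $20"
    if price_usd < 40:  # assumption: exactly $40 falls in the top tier
        return "$20-$40"
    return "$40+"


def review_band(score_pct: float, cutoff: float = 70.0) -> str:
    """Split review scores at the cutoff (assumption: the cutoff itself is High)."""
    return "High" if score_pct >= cutoff else "Low"


# Hypothetical example rows: (price in USD, review score in %)
for price, score in [(0.0, 85.0), (14.99, 62.0), (29.99, 70.0), (59.99, 91.0)]:
    print(price, price_tier(price), score, review_band(score))
```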

Factors: Price, Review Score, Average Playtime, Age of Game (derived from release date)

Methods: Multiple Linear Regression + Random Forest, validated with K-Fold Cross Validation (k=5)
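
A minimal sketch of this setup with scikit-learn, which is an assumed library choice since the brief does not name one. The data is synthetic stand-in data with a made-up relationship, not the Kaggle dataset, and the column order and variable names are hypothetical. Both models are scored on the same five shuffled folds so their R² values are comparable.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold, cross_val_score

rng = np.random.default_rng(0)
n = 300  # synthetic stand-in rows, not the Kaggle data

# Hypothetical factor columns matching the four factors in the brief
X = np.column_stack([
    rng.uniform(0, 60, n),    # price (USD)
    rng.uniform(40, 100, n),  # review score (%)
    rng.uniform(1, 80, n),    # average playtime (hours)
    rng.uniform(0, 10, n),    # age of game (years)
])
# Made-up log-revenue relationship, only so the models have something to fit
y = 10 + 0.02 * X[:, 0] + 0.03 * X[:, 1] + 0.01 * X[:, 2] + rng.normal(0, 0.5, n)

# k=5 folds, shuffled with a fixed seed so both models see identical splits
cv = KFold(n_splits=5, shuffle=True, random_state=0)
models = {
    "linear_regression": LinearRegression(),
    "random_forest": RandomForestRegressor(n_estimators=100, random_state=0),
}

cv_scores = {
    name: cross_val_score(model, X, y, cv=cv, scoring="r2")
    for name, model in models.items()
}
for name, scores in cv_scores.items():
    print(f"{name}: mean R^2 = {scores.mean():.3f} (std {scores.std():.3f})")
```

For the feature importance report, a forest fitted on the full data exposes `feature_importances_`, while the linear model's coefficients give direction and size of each association on the log scale.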

BI Deliverables: Revenue dashboard, price tier comparison panel, feature importance report, insight brief

Key Limitation: The dataset is limited to already high-performing titles, so findings reflect patterns among top games and may not generalize to the broader Steam library.