TaylorBall

A reproducible MLB game and season simulation project using historical data. Built as a senior project exploring how statistical modeling can predict player performance and team outcomes.

What This Project Does

TaylorBall implements three progressive stages of baseball simulation:

Stage 1: Simple Probabilistic Model — Basic win probability using league-average outcomes
Stage 2: Team-Level Adjustments — Incorporates team offensive and pitching metrics
Stage 3: Player-Level Simulation — Models individual batter-pitcher matchups using historical stats

Each stage builds on the previous one, with validation comparing simulated seasons against actual historical records.
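As a rough illustration of how the first two stages can fit together, the sketch below simulates a season as a series of independent games (Stage 1) and adjusts the per-game win probability for opponent strength with the log5 formula (one possible Stage 2 adjustment). This is a minimal sketch, not the notebook's code: the function names `log5`, `simulate_season`, and `mean_wins` are hypothetical, and the choice of log5 is an assumption.

```python
import random


def log5(p_a: float, p_b: float) -> float:
    """Chance that team A beats team B, given each team's win rate
    against a league-average opponent (log5 formula)."""
    num = p_a * (1.0 - p_b)
    den = num + p_b * (1.0 - p_a)
    return num / den


def simulate_season(win_prob: float, games: int = 162, rng=None) -> int:
    """Simulate one season of independent games and return the win total."""
    rng = rng or random.Random()
    return sum(rng.random() < win_prob for _ in range(games))


def mean_wins(win_prob: float, n_seasons: int = 2000, seed: int = 0) -> float:
    """Average win total over many simulated seasons (Monte Carlo estimate)."""
    rng = random.Random(seed)
    totals = [simulate_season(win_prob, rng=rng) for _ in range(n_seasons)]
    return sum(totals) / n_seasons
```

With a league-average team (`win_prob=0.5`) the simulated mean should sit near 81 wins. Passing `log5(0.6, 0.4)` instead gives the per-game probability for a strong team facing a weak one.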

Quick Start

Option 1: pip

pip install -r requirements.txt
jupyter notebook _TaylorBall_.ipynb

Option 2: conda

conda env create -f environment.yml
conda activate taylorball
jupyter notebook _TaylorBall_.ipynb

Data Sources

This project uses publicly available baseball data:

Lahman Database — Historical statistics from 1871-present
Retrosheet — Play-by-play game logs
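To give a sense of the team-level inputs, the sketch below parses rows shaped like Lahman's Teams table (columns `yearID`, `teamID`, `G`, `W`, `L`, `R`, `RA`) and derives win percentage and run differential. The sample rows are made up for illustration, the helper name `load_team_seasons` is hypothetical, and the standard-library `csv` module is used only to keep the example self-contained. The project itself lists pandas among its tools.

```python
import csv
import io

# Made-up rows in the shape of Lahman's Teams table; these are not real seasons.
SAMPLE_TEAMS_CSV = """yearID,teamID,G,W,L,R,RA
2019,AAA,162,100,62,850,650
2019,BBB,162,81,81,720,720
2019,CCC,162,60,102,600,800
"""


def load_team_seasons(handle):
    """Read team-season rows and add win percentage and run differential."""
    rows = []
    for rec in csv.DictReader(handle):
        wins, losses = int(rec["W"]), int(rec["L"])
        runs, runs_allowed = int(rec["R"]), int(rec["RA"])
        rows.append({
            "year": int(rec["yearID"]),
            "team": rec["teamID"],
            "win_pct": wins / (wins + losses),
            "run_diff": runs - runs_allowed,
        })
    return rows


teams = load_team_seasons(io.StringIO(SAMPLE_TEAMS_CSV))
```

To run this against a real download, replace the `io.StringIO` object with an open file handle for the Teams CSV.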

Key Findings

Results are still to be added. The items below are placeholders showing the kind of findings this section is meant to summarize:

The Stage 3 model predicted season win totals within X games for Y% of teams
Run differential proved to be the strongest single predictor of...
Monte Carlo simulations showed that playoff outcomes have higher variance than...
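One common way to relate runs scored and allowed to winning is the Pythagorean expectation, which estimates win percentage as RS^2 / (RS^2 + RA^2). The sketch below is offered only as background for the run-differential item above. It is an assumption that this relationship is relevant to the notebook's analysis, and the function names are hypothetical.

```python
def pythagorean_win_pct(runs_scored: float, runs_allowed: float,
                        exponent: float = 2.0) -> float:
    """Estimate win percentage from runs scored and allowed."""
    if runs_scored <= 0 and runs_allowed <= 0:
        raise ValueError("at least one run total must be positive")
    rs = runs_scored ** exponent
    ra = runs_allowed ** exponent
    return rs / (rs + ra)


def expected_wins(runs_scored: float, runs_allowed: float,
                  games: int = 162, exponent: float = 2.0) -> float:
    """Scale the estimated win percentage to a season win total."""
    return games * pythagorean_win_pct(runs_scored, runs_allowed, exponent)
```

Comparing `expected_wins` with actual win totals is one simple check on how much of a team's record its run totals explain.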

Project Context


Type: Senior Capstone Project
Focus: Baseball Analytics / Sports Data Science
Tools: Python, Jupyter, pandas, NumPy, matplotlib

This project demonstrates end-to-end analytical work: scoping a research question, acquiring and cleaning real-world data, building progressively complex models, and validating results against ground truth.

Limitations & Future Work

Current model uses season-level stats; pitch-by-pitch data could improve accuracy
Simulation assumes independent at-bats (doesn't model hot/cold streaks)
Could extend to project prospect performance or evaluate trades
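The independence assumption noted above can be made concrete with a small sketch: every plate appearance is drawn from the same fixed probabilities, no matter what happened before, so streaks can only arise by chance. The outcome probabilities and the base-running rule (every runner advances by the number of bases gained on the hit) are simplifying assumptions for illustration, not values or logic taken from the notebook.

```python
import random

# Hypothetical per-plate-appearance probabilities (illustrative only).
OUTCOME_PROBS = {
    "out": 0.68,
    "single": 0.22,
    "double": 0.05,
    "triple": 0.01,
    "home_run": 0.04,
}
BASES_GAINED = {"single": 1, "double": 2, "triple": 3, "home_run": 4}


def simulate_half_inning(rng: random.Random, probs=OUTCOME_PROBS) -> int:
    """Simulate plate appearances until three outs and return runs scored."""
    outcomes = list(probs)
    weights = list(probs.values())
    outs, runs = 0, 0
    runners = []  # current base (1 to 3) of each runner
    while outs < 3:
        # Each draw ignores all earlier results: independent plate appearances.
        result = rng.choices(outcomes, weights=weights)[0]
        if result == "out":
            outs += 1
            continue
        gain = BASES_GAINED[result]
        # The batter starts at base 0; everyone advances by the same amount.
        advanced = [base + gain for base in runners + [0]]
        runs += sum(base >= 4 for base in advanced)
        runners = [base for base in advanced if base < 4]
    return runs
```

Modeling hot and cold streaks would mean letting the probabilities depend on recent outcomes, which this sketch deliberately does not do.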

License

MIT License — see LICENSE for details.

Author: Josh Taylor
Contact: joshknowsbaseball
