Skip to content

RizeComputerScience/DSMII-Bike-Share-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

Bike Share Time Series Analysis

A comprehensive time series analysis of the Capital Bikeshare system in Washington D.C., demonstrating fundamental concepts in time series forecasting including data preparation, decomposition, and forecasting methods.

Dataset

This analysis uses the Bike Sharing in Washington D.C. Dataset from Kaggle, which contains daily bike rental counts for 2011-2012 along with weather and seasonal information.

What You'll Learn

This notebook demonstrates:

  1. Time Series Data Preparation

    • Loading and filtering temporal data
    • Handling missing dates
    • Creating proper time series indexes
  2. Trend Analysis with Moving Averages

    • 7-day (weekly) moving averages
    • 30-day (monthly) moving averages
    • Visual trend identification
  3. Seasonal Pattern Detection

    • Using pandas datetime accessors
    • Day-of-week analysis
    • Weekday vs. weekend patterns
  4. Time Series Decomposition

    • Separating trend, seasonal, and residual components
    • Additive vs. multiplicative models
    • Understanding seasonal patterns
  5. Simple Forecasting Methods

    • Naive baseline (simple average)
    • Seasonal naive (repeating weekly patterns)
    • Forecast accuracy evaluation (RMSE)
    • Understanding forecast limitations

Setup

Prerequisites

  • Python 3.7 or higher
  • Jupyter Notebook or VS Code with Jupyter extension

Installation

  1. Clone or download this repository

  2. Install required packages:

pip install pandas numpy matplotlib scikit-learn statsmodels

Or install all at once:

pip install -r requirements.txt
  1. Verify the data files are present:
    • data/day.csv - Daily bike rental data
    • data/hour.csv - Hourly bike rental data (optional)

Running the Analysis

  1. Open bike_share_analysis.ipynb in Jupyter Notebook or VS Code

  2. Run cells sequentially from top to bottom (the notebook is designed to be run in order)

  3. Each section builds on previous sections, so make sure to run all cells in Part 1 before moving to Part 2, etc.

Project Structure

Unit5-Practice/
├── README.md                     # This file
├── bike_share_analysis.ipynb     # Main analysis notebook
├── requirements.txt              # Python dependencies
└── data/
    └── day.csv                   # Daily aggregated bike rental data

Assignment 3: Building Time Series Forecasts

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors