model-runtime

Here is 1 public repository matching this topic...

GPT-OSS B20 Local Execution. A lightweight local environment for running GPT-OSS B20 with Python 3.12 and CUDA acceleration.

  • Run GPT-OSS B20 entirely offline
  • Optimize text generation with the GPU
  • Enable fast, secure inference on consumer hardware

  • Updated Aug 13, 2025
  • Python
