Claude/add webllm client inference 0185y8rp3h yv pg9 wn ec nks wc#1330
Open
iplanwebsites wants to merge 4 commits intovercel:mainfrom
Open
Conversation
Add support for running AI inference entirely in the browser using WebLLM, without requiring an API key. Users can now select "WebLLM (Local)" from the model dropdown to run inference locally using WebGPU. Key changes: - Add @built-in-ai/web-llm package for Vercel AI SDK integration - Add webllm model option to models.ts with isLocal flag - Create useWebLLMChat hook for client-side chat handling - Create WebLLMChat component for local inference UI - Create ChatWrapper to switch between API and local modes - Add /api/chat/webllm-save endpoint for persisting messages - Add WebLLMStatus component showing download/loading progress - Update entitlements to include webllm model for all users
Auto-formatting applied by biome linter for consistent code style.
Replace single WebLLM model with quality-based options: - webllm-draft: Fastest, uses Qwen3-0.6B - webllm-standard: Balanced, uses Llama-3.2-3B - webllm-high: Better quality, uses Qwen3-4B - webllm-best: Best quality, uses Llama-3.1-8B Users can now easily toggle between server-side (API gateway) and client-side (WebLLM) inference by selecting different models from the dropdown. WebLLM models run entirely in the browser without requiring an API key.
Contributor
|
@claude is attempting to deploy a commit to the Vercel Team on Vercel. A member of the Team first needs to authorize it. |
|
Review the following changes in direct dependencies. Learn more about Socket for GitHub.
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.