AI running 100% in your browser. No servers. No API keys. Your data stays on your device.
WebLLM runs a full language model in your browser using WebGPU. The model is downloaded once and cached locally.
After the one-time model download, all processing happens on your device. Your prompts and responses are never sent to a server, so your conversations stay private.
Requires a modern browser with WebGPU support (Chrome 113+, Edge 113+). GPU with 4GB+ VRAM recommended.
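
The requirement above can be checked at runtime before any model is downloaded. What follows is a minimal sketch, not this app's actual code: the names `NavigatorLike`, `hasWebGPUApi`, and `checkWebGPU` are hypothetical, and the small structural types stand in for the real WebGPU typings so the snippet compiles on its own. It relies only on the standard `navigator.gpu` entry point and its `requestAdapter()` method. WebGPU does not appear to expose the amount of VRAM directly, so the 4GB recommendation is not checked here.

```typescript
// Minimal structural types (assumptions for this sketch) so it compiles
// without the @webgpu/types package.
type GPUAdapterLike = object;

interface GPULike {
  requestAdapter(): Promise<GPUAdapterLike | null>;
}

interface NavigatorLike {
  gpu?: GPULike;
}

type WebGPUStatus = "ok" | "no-api" | "no-adapter";

// Synchronous check: does the browser expose the WebGPU entry point at all?
function hasWebGPUApi(nav: NavigatorLike): boolean {
  return nav.gpu != null;
}

// Asynchronous check: the API may exist while no usable GPU adapter is
// available (for example on unsupported hardware), so request one as well.
async function checkWebGPU(nav: NavigatorLike): Promise<WebGPUStatus> {
  if (!hasWebGPUApi(nav)) {
    return "no-api";
  }
  const adapter = await nav.gpu!.requestAdapter();
  return adapter ? "ok" : "no-adapter";
}
```

In a page, the global `navigator` object would be passed in. A result of `"no-api"` suggests the browser is too old or has WebGPU disabled, while `"no-adapter"` suggests the browser supports WebGPU but could not find a compatible GPU.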