SELECT MODEL TO BEGIN

Neural Chat

AI running 100% in your browser. No servers. No API keys. Your data stays on your device.

MODEL:

PERSONA:

Loading model...

Select a model and click "Start" to begin

WebLLM runs a full language model in your browser using WebGPU. The model is downloaded once and cached locally.

All processing happens on your device. No data is sent to any server. Your conversations remain completely private.

Requires a modern browser with WebGPU support (Chrome 113+, Edge 113+). GPU with 4GB+ VRAM recommended.