LlamaBoss Beta v0.1.2

LlamaBoss

A native Windows chat app for running local models. Multimodal, streaming, agentic — and entirely offline.

What’s in the box

Native C++

Bundled with llama.cpp runtimes — the same engine that powers local AI everywhere. No Docker, no Python, no external dependencies.

Multimodal

Drop in images and text files. Chat about screenshots or paste right from the clipboard.

Agentic tools

Filesystem, shell, and workspace access — with confirmation gates so nothing runs by surprise.

100% local

Runs entirely on your machine. No cloud, no telemetry, no account.

Download

Grab the latest build.

Windows installer, MIT licensed, no account needed. Open the installer, click through, and start chatting with your local models.

Download LlamaBoss
Windows 10/11 · x64 · Bundled with llama.cpp runtimes

Beta This native Windows build ships with bundled llama.cpp runtimes — no Ollama required. Download, install, and start chatting with local models right away.