🍋 Lemonade: Fast Open Source Local LLM Server

Source: Hacker News (413 points) | ★★★★☆ | 2026-04-02
open-source local-ai llm gpu npu

What is Lemonade?

A refreshingly fast local LLM server that runs on GPUs and NPUs. Open source, private, and ready in minutes on any PC.

Key Features

Unified API

One local service for every modality - chat, vision, image generation, transcription, speech generation with standard APIs.

Why It Matters

Local AI should be free, open, fast, and private. Lemonade brings enterprise-grade local AI capabilities to any desktop without cloud dependencies.

Try It

With 128GB unified RAM, you can load models like gpt-oss-120b or Qwen-Coder-Next for advanced tool use.

← Back to Home