
Foundry Local

Build once, run locally. The SDK for shipping AI-powered applications with hardware-optimized on-device inference.

Get Started in Two Steps

Step 1: Install Foundry Local
brew install microsoft/foundrylocal/foundrylocal
Step 2: Run a model
foundry model run qwen2.5-0.5b
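
The first run downloads the model, then serves it locally and drops you into an interactive prompt. To see what else is available before picking a model, the CLI can list the catalog (a minimal sketch, assuming the standard Foundry Local CLI verbs):

foundry model list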

Purpose-built for shipping AI applications

Everything you need to embed AI into your products, with the performance and reliability your users expect.

Ship to Production

Built as an SDK for shipping AI-powered applications, not just running models locally

import { FoundryLocalManager } from "foundry-local-sdk"

// Initialize the manager, then load a model from the local catalog
const manager = FoundryLocalManager.create(config)
const model = manager.getCatalog().getModel('gpt-oss-20b')
await model.load()
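
Once a model is loaded, inference goes through the service's OpenAI-compatible endpoint. The sketch below shows one way the full flow might look with the openai npm client; it assumes the JavaScript SDK's quickstart shape (new FoundryLocalManager(), init(), and the endpoint/apiKey properties), which may differ from the abbreviated snippet above:

import OpenAI from "openai"
import { FoundryLocalManager } from "foundry-local-sdk"

// Start (or attach to) the local service and load a model by alias
const foundry = new FoundryLocalManager()
const modelInfo = await foundry.init("qwen2.5-0.5b")

// Point the standard OpenAI client at the local endpoint
const client = new OpenAI({
  baseURL: foundry.endpoint,
  apiKey: foundry.apiKey, // placeholder value; requests never leave the machine
})

const response = await client.chat.completions.create({
  model: modelInfo.id,
  messages: [{ role: "user", content: "Why run inference on-device?" }],
})
console.log(response.choices[0].message.content)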

Hardware Optimized

We work directly with hardware vendors for maximum performance

NPU (Neural Engine)
GPU (Graphics Card)
CPU (Processor)

Edge-Ready

Works fully offline with no cloud dependencies

Multi-Language SDKs

Python, JavaScript, C#, and Rust


OpenAI Compatible

Drop-in API replacement for easy integration

Before: base_url="api.openai.com"
After: base_url="localhost"
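
In practice the swap is one line of client configuration. A minimal sketch with the openai npm client (the JavaScript option is spelled baseURL; the port and dummy API key below are illustrative assumptions, not guaranteed defaults):

import OpenAI from "openai"

// Cloud: requests go to OpenAI's hosted API
const cloud = new OpenAI({ baseURL: "https://api.openai.com/v1", apiKey: process.env.OPENAI_API_KEY })

// Local: the same client, pointed at the Foundry Local service
const local = new OpenAI({ baseURL: "http://localhost:5273/v1", apiKey: "unused-locally" })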

Data Privacy

Everything stays on-device