Foundry Local Android

Overview

Foundry Local for Android, developed in partnership with Microsoft, is a secure, dedicated, and flexible LLM server application that runs directly on Android devices. Its architecture lets multiple applications share a single, locally hosted model: the model runtime is centralized in one LLM server that any client application can connect to, avoiding the common pattern where each app ships its own runtime and model copy. This keeps data on-device, delivers consistent performance, and eliminates redundant memory and storage usage. Combined with DeliteAI, it provides context engineering, structured session memory, tool-calling capabilities, and multi-step agentic workflows, all running entirely on-device.

Key Highlights

  • Centralized LLM server for multiple applications
  • Eliminates redundant memory usage
  • Consistent privacy boundary with on-device processing
  • DeliteAI integration for agentic workflows

Demo Video


Features

Foundry Local Android Application (FLAA)

Runs as the on-device LLM server, hosting the model runtime, managing the model lifecycle, and handling inference requests from multiple client applications.
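
A server of this shape is typically exposed as a bound Android service whose binder implements an AIDL contract. The sketch below is illustrative only: `ILlmService` and `LlmRuntime` are hypothetical names standing in for the actual FLAA interfaces, which this document does not specify.

```kotlin
import android.app.Service
import android.content.Intent
import android.os.IBinder

class LlmServerService : Service() {

    // Single runtime instance shared by every connected client app.
    // LlmRuntime and the model id are hypothetical placeholders.
    private val runtime by lazy { LlmRuntime.load("example-model") }

    // Stub generated from a hypothetical ILlmService.aidl contract.
    private val binder = object : ILlmService.Stub() {
        override fun generate(prompt: String): String =
            runtime.complete(prompt)
    }

    // Client apps bind across process boundaries and receive this binder.
    override fun onBind(intent: Intent): IBinder = binder
}
```

Because every client binds to the same service, only one copy of the model weights needs to be resident in memory regardless of how many apps are using it.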

Foundry Local Android SDK (FLAS)

Integrated into client applications, abstracting all AIDL and IPC complexity, allowing developers to load and interact with on-device models through a simple, high-level API.
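
From the client side, the intent is that apps never touch `bindService()` or AIDL stubs directly. A minimal usage sketch, with `FoundryLocalClient` and its methods as hypothetical stand-ins for the SDK's actual high-level API:

```kotlin
// Hypothetical client-side flow; names are illustrative, not the real SDK API.
val client = FoundryLocalClient.connect(context)   // binds to FLAA over IPC
client.loadModel("example-model")                  // asks the server to make the model resident
val reply = client.generate("Summarize my unread notifications")
client.close()                                     // unbinds; the server keeps serving other apps
```

The design point is that connection management, marshalling, and process-death handling live inside the SDK, so the app only deals with prompts and responses.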

DeliteAI Integration

Acts as the bidirectional intelligence layer, maintaining structured context, interpreting model responses, detecting tool-calling intent, and invoking Kotlin functions through an efficient JNI bridge.
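
Tool calling of this kind usually means registering ordinary Kotlin functions under a name and schema the model can reference. The following is a hedged sketch; `agent.registerTool` is a hypothetical API, though `BatteryManager.BATTERY_PROPERTY_CAPACITY` is a real Android constant:

```kotlin
import android.os.BatteryManager

// Illustrative tool registration: when the intelligence layer detects a
// tool-call intent in the model's output, it dispatches to this function
// (in a real integration, via the JNI bridge) and feeds the result back.
agent.registerTool(
    name = "get_battery_level",
    description = "Returns the device battery percentage",
) { _: Map<String, Any> ->
    batteryManager.getIntProperty(BatteryManager.BATTERY_PROPERTY_CAPACITY)
}
```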

Multi-Step Agentic Workflows

Enables complex reasoning, session memory, and tool-driven interactions that allow models to perform multi-step tasks without depending on cloud connectivity.
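
The control flow behind such workflows can be pictured as a bounded loop: each step the model either emits a final answer or requests a tool, and tool results are appended to the session memory before the next inference pass. A minimal sketch under those assumptions (all names hypothetical):

```kotlin
// Hypothetical agent loop; Step, agent, and tools are illustrative types,
// not DeliteAI's actual API.
fun runAgent(task: String, maxSteps: Int = 5): String {
    val session = agent.newSession()        // structured session memory
    session.addUserMessage(task)
    repeat(maxSteps) {
        when (val step = agent.step(session)) {   // one on-device inference pass
            is Step.ToolCall -> session.addToolResult(
                step.name, tools.invoke(step.name, step.args)
            )
            is Step.Answer -> return step.text
        }
    }
    return session.summary()                // fallback if no final answer emerged
}
```

Bounding the number of steps is a common safeguard so a misbehaving model cannot loop indefinitely on-device.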

Resources