Skip to content

evilsocket/cake

Repository files navigation

cake

Documentation License

Join the project community on our server!

Cake is a Rust framework for distributed inference of large language models and image generation models based on Candle. The goal is to run big (70B+) models by repurposing consumer hardware into a heterogeneous cluster of iOS, Android, macOS, Linux and Windows devices, effectively leveraging planned obsolescence as a tool to make AI more accessible and democratic.

This is experimental code that's being actively developed and changed very quickly.

Key Features

Platform Support

OS Architectures Acceleration Status
GNU/Linux arm, arm64, x86_64 -
GNU/Linux arm, arm64, x86_64 CUDA
GNU/Linux arm, arm64, x86_64 BLAS
Windows x86_64 BLAS ⚠️
Windows x86_64 CUDA
macOS x86_64 -
macOS aarch64 -
macOS aarch64 Metal
Android arm, arm64, x86_64 -
Android arm, arm64, x86_64 CUDA ⚠️
iOS / iPadOS aarch64 -
iOS / iPadOS aarch64 Metal ✅ (A13+ / M-series)

Models

Model Type Feature Flag Status
LLaMA 3.x Text llama (default)
Qwen2 / Qwen2.5 Text qwen2 (default)
Qwen3.5 Text qwen3_5 (default)
Stable Diffusion (1.5, 2.1, XL, XL Turbo) Image -

Quick Start

cargo build --release --features cuda  # or: --features metal
cake download Qwen/Qwen2.5-Coder-1.5B-Instruct
cake master --model Qwen/Qwen2.5-Coder-1.5B-Instruct --prompt "Hello!"

To start the API server and web UI:

cake master --model Qwen/Qwen2.5-Coder-1.5B-Instruct --api 0.0.0.0:8080

For the full usage guide and API reference, check the project documentation.

Contributors

Cake project contributors

Star History

Star History Chart

License

Released under the GPL 3 license. To see the licenses of the project dependencies, install cargo license with cargo install cargo-license and then run cargo license.

About

Distributed inference for mobile, desktop and server.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Sponsor this project

  •  

Packages

 
 
 

Contributors