Nixie

Nixie is an efficient service for transparent GPU multiplexing without worrying about insufficient VRAM/DRAM capacity on Linux.

Our highlighted features include:

Optimizing for modern large AI models.
Transparent GPU multiplexing, supporting popular applications like llama.cpp, SGLang, ComfyUI and more out of the box.
Low task switching latency
Configurable maximum memory size depending on user needs.

Getting Started

Installation

Prerequisites:

Rust (>=1.90 stable)

Build the project with:

git clone https://github.com/XOR-op/nixie
cd nixie
cargo build --release

Launch Applications With Nixie

First, we need to start Nixie daemon:

nixie daemon

To configure the capacity of memory used, run with

nixie daemon --shmem <pinned-memory-size> --hostmem <paged-memory-size>
# For example, to use 16GB of pinned memory and 32GB of paged memory:
nixie daemon --shmem 16g --hostmem 32g

Then, we can launch applications with Nixie:

nixie run <app-name> <app-args>

To specify which GPU to use, assuming we use GPU 0:

nixie run -d 0 <app-name> <app-args>

CLI Reference

See CLI Reference for more details on the available commands and options.

Name		Name	Last commit message	Last commit date
Latest commit History 274 Commits
.github/workflows		.github/workflows
docs		docs
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nixie

Getting Started

Installation

Launch Applications With Nixie

CLI Reference

About

Uh oh!

Releases

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Nixie

Getting Started

Installation

Launch Applications With Nixie

CLI Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Contributors

Uh oh!

Languages