
Nixie


Nixie is an efficient service for transparent GPU multiplexing on Linux. It lets multiple applications share a GPU without worrying about insufficient VRAM or DRAM capacity.

Our highlighted features include:

  • Optimized for modern large AI models.
  • Transparent GPU multiplexing that supports popular applications such as llama.cpp, SGLang, ComfyUI, and more out of the box.
  • Low task-switching latency.
  • Configurable maximum memory size, depending on user needs.

Getting Started

Installation

Prerequisites:

  • Rust (>=1.90 stable)

Build the project with:

git clone https://github.com/XOR-op/nixie
cd nixie
cargo build --release

Launch Applications With Nixie

First, start the Nixie daemon:

nixie daemon

To configure how much pinned and paged memory the daemon uses, pass the --shmem and --hostmem options:

nixie daemon --shmem <pinned-memory-size> --hostmem <paged-memory-size>
# For example, to use 16GB of pinned memory and 32GB of paged memory:
nixie daemon --shmem 16g --hostmem 32g

Then launch applications through Nixie:

nixie run <app-name> <app-args>

To choose which GPU to use, pass -d with the GPU index. For example, to use GPU 0:

nixie run -d 0 <app-name> <app-args>
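
The commands above can be wrapped in a small helper script. This is a minimal sketch, not part of Nixie itself: it assumes the `nixie` binary is on your PATH, and the function names `run`, `start_daemon`, and `launch_app` and the `DRY_RUN` variable are hypothetical, introduced only for illustration. It uses only the options shown in this README.

```shell
#!/bin/sh
# Hypothetical helpers around the documented `nixie daemon` and `nixie run`
# commands. Assumption: `nixie` is on PATH.
# Set DRY_RUN=1 to print each command instead of executing it.

run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

start_daemon() {
    # $1 = pinned memory size, $2 = paged memory size (for example 16g and 32g)
    run nixie daemon --shmem "$1" --hostmem "$2"
}

launch_app() {
    # $1 = GPU index; the remaining arguments are the application and its arguments
    gpu="$1"
    shift
    run nixie run -d "$gpu" "$@"
}
```

After sourcing the script, `DRY_RUN=1 start_daemon 16g 32g` prints the daemon command so you can check it before running it for real. `launch_app 0 <app-name> <app-args>` then starts an application on GPU 0, typically from a second terminal while the daemon is running.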

CLI Reference

See CLI Reference for more details on the available commands and options.
