Bump benchmark to flare 0.2.15 + use next_token_async for GPU decode

d3c6643
Merged

Benchmark: async GPU decode via next_token_async (flare 0.2.15) #316
