Skip to content

New Arch, New obs.#18

Merged
l1onh3art88 merged 2 commits into
l1onh3art88:selfplayfrom
bullardji:patch-10
May 7, 2026
Merged

New Arch, New obs.#18
l1onh3art88 merged 2 commits into
l1onh3art88:selfplayfrom
bullardji:patch-10

Conversation

@bullardji

@bullardji bullardji commented May 3, 2026

Copy link
Copy Markdown

New embedding architecture
Added #include to pufferlib.cu to fix --profile

New obs (1082 → 167)
Flat byte list 0-63
each square stores a number 0..12 corrsponding to pieces

Selectable pieces and from to approach:
how many = N
list[0..N] = square IDs you can pick

how many = N
list[0..N] = legal destination square IDs

Selected square - which square is selected, 0..63 (64 if no square is selected)

EDIT: Recent edit - applied tricks found in Jonah's PR (PufferAI#522) here as Joseph mentioned would likely merged his improvements. ~200-300k SPS gain.

@bullardji bullardji marked this pull request as draft May 4, 2026 04:08
@bullardji bullardji marked this pull request as ready for review May 4, 2026 04:26
@l1onh3art88 l1onh3art88 merged commit 665dcae into l1onh3art88:selfplay May 7, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants