
ArrayBuffer Lazy-assignment for GPU Context #5076

Draft
SamSJackson wants to merge 24 commits into connorjward/pyop3 from SamSJackson/pyop3-outer

Conversation

@SamSJackson

Description

This PR introduces a context manager that allows dynamic assignment of PyOP3 arrays to a given device.
Devices are defined in the new device.py module, which also contains the internal array management.

Key modifications:

  • Introduce a device.py module that represents offloading devices and provides an offloading context manager
  • Change buffer.py to lazily evaluate data with respect to the current context (i.e. host or offloading device)
    • _lazy_data is now a dictionary mapping Device objects to their respective arrays.
    • The _data property lazily evaluates to the appropriate data for the current context. If that data is not up to date, as per the state property, it is copied.
  • All data is copied lazily: entering and exiting the context window will not automatically transfer data between devices.
  • Buffers are maintained between context windows: exiting a context window will not release the memory on the device.
  • All device-specific array management is kept within device.py so buffer.py can remain device/GPU-agnostic, apart from some type hinting (e.g. cp.ndarray).
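To make the design above concrete, here is a minimal, runnable sketch of a lazy per-device buffer. The names Device, offloading, ArrayBuffer, _lazy_data, and state mirror the PR description, but their bodies are illustrative assumptions rather than the actual pyop3 code, and NumPy stands in for both host and device arrays (no CuPy required):

```python
import numpy as np


class Device:
    """An offloading target (e.g. the host CPU or a GPU)."""
    def __init__(self, name):
        self.name = name


HOST = Device("host")
_current_device = HOST


class offloading:
    """Context manager that switches the current device (sketch)."""
    def __init__(self, device):
        self.device = device

    def __enter__(self):
        global _current_device
        self._previous = _current_device
        _current_device = self.device
        return self.device

    def __exit__(self, *exc):
        global _current_device
        _current_device = self._previous


class ArrayBuffer:
    """Buffer whose data is materialised lazily on the current device."""
    def __init__(self, data):
        # _lazy_data maps Device objects to their per-device arrays
        self._lazy_data = {HOST: np.asarray(data)}
        # state counters track which copy is most up to date
        self.state = {HOST: 0}

    @property
    def _data(self):
        device = _current_device
        if device not in self._lazy_data or self.state[device] < max(self.state.values()):
            # Copy lazily, on first access, from the most up-to-date device
            freshest = max(self.state, key=self.state.get)
            self._lazy_data[device] = self._lazy_data[freshest].copy()
            self.state[device] = self.state[freshest]
        return self._lazy_data[device]
```

Note that entering or exiting offloading transfers nothing by itself; the copy happens only when _data is first accessed inside the context, and the per-device entry in _lazy_data persists after the context exits.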

Notable issues:

  • CuPy does not support the writeable flag: the flag is discarded when converting from NumPy arrays, and is not restored when converting back.
  • A defaultdict is used for our state dictionary. Connor and I have previously discussed that neither of us likes this, but I cannot think of another approach.
    • Main issue: when an array is assigned, it has no knowledge of devices beyond the one in its current context, so no state counter is assigned for the others. This makes it difficult for the user to check the buffer on other devices, even if they are initialised. An example can be seen in ./pyop3_gpu_demo.py, in the asserts before entering the context manager.
    • Potential approaches:
      • A dictionary wrapper so we can create a stricter defaultdict (a bit extreme and needless maintenance, but it should work).
      • Do not allow users to check the state of an array on a device whose context it has not entered.
    • Open to any advice or solutions for this.
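As a sketch of the first potential approach above, a small dict subclass can behave like the registered-device part of a defaultdict while failing loudly for unknown devices. The class name StateDict and the device strings are hypothetical, chosen only for illustration:

```python
class StateDict(dict):
    """Per-device state counters: devices registered up front default to -1
    (not yet initialised); unknown devices raise instead of silently
    materialising a default entry."""

    def __init__(self, known_devices):
        # Every known device starts at state -1, matching the
        # defaultdict(lambda: -1) behaviour for registered devices
        super().__init__((d, -1) for d in known_devices)

    def __missing__(self, device):
        raise KeyError(
            f"Buffer has no state on {device!r}; it has not entered an "
            "offloading context for that device"
        )
```

This keeps the convenient "-1 means uninitialised" semantics where they are meaningful, while turning accidental lookups of never-seen devices into errors rather than fabricated state.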

@SamSJackson SamSJackson requested a review from connorjward May 4, 2026 10:38
Contributor

@connorjward left a comment


I've been strict, but in general this is fantastic. Thank you!

Comment thread pyop3/buffer.py Outdated
Comment thread pyop3/buffer.py Outdated
Comment thread pyop3/buffer.py Outdated
Comment thread pyop3/buffer.py
Comment thread pyop3/buffer.py Outdated
Comment thread pyop3/device.py Outdated
Comment thread pyop3/device.py Outdated
Comment thread pyop3/device.py Outdated
Comment thread pyop3/device.py Outdated
Comment thread pyop3/buffer.py Outdated
SamSJackson added 6 commits May 4, 2026 14:04
- defaultdict gives -1 as the default value if the device object does not exist
- the constant property was lost between cupy/numpy conversions
- fixed by passing a kwarg that is disregarded by cupy but used by numpy
- use @property for last_updated_device; it is known from state and does not
  need to be a variable
- the duplicate method only copies the most up-to-date copy, and a non-copy
  duplicate only copies for the current device
- initialisation now accepts None as an optional data input
- v3.24.5 was failing to compile petsc4py because PETSc does not support
  PCPatchSetComputeFunctionExteriorFacets
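The duplicate behaviour mentioned in the commits above can be sketched as a standalone helper. This is not the pyop3 API, only an illustration of the two modes under my reading of the commit messages: a copying duplicate keeps a deep copy of just the most up-to-date per-device array, while a non-copy duplicate shares just the current device's array:

```python
import numpy as np


def duplicate(lazy_data, state, copy=True, current_device="host"):
    """Illustrative sketch (not the pyop3 API) of duplicating a buffer.

    copy=True  -> keep only the most up-to-date array, deep-copied.
    copy=False -> keep only the current device's array, shared (no data copy).

    Returns the duplicated (lazy_data, state) pair.
    """
    if copy:
        # Find the device whose state counter is highest (freshest data)
        freshest = max(state, key=state.get)
        return (
            {freshest: lazy_data[freshest].copy()},
            {freshest: state[freshest]},
        )
    # Non-copy duplicate: share the current device's array without copying
    return (
        {current_device: lazy_data[current_device]},
        {current_device: state[current_device]},
    )
```

Dropping stale per-device copies on duplication avoids cloning arrays that would be overwritten by the next lazy copy anyway.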
@connorjward
Copy link
Copy Markdown
Contributor

For any trivial changes, please go ahead and resolve them. It saves me cross-checking things.
