Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
39 commits
Select commit Hold shift + click to select a range
a241a42
initial work on code_mode
eavanvalkenburg Mar 18, 2026
492401d
updated samples
eavanvalkenburg Mar 23, 2026
9f8eada
updates to codeact
eavanvalkenburg Mar 23, 2026
988d4a9
udpated codeact
eavanvalkenburg Mar 24, 2026
ec16c12
Draft CodeAct ADR and sample updates
eavanvalkenburg Apr 7, 2026
3d168ff
initial implementation and adr and feature
eavanvalkenburg Apr 9, 2026
8dddecc
Python: Limit Hyperlight wasm backend to Python <3.14
eavanvalkenburg Apr 9, 2026
8d70aff
Python: Fix CI for Hyperlight CodeAct PR
eavanvalkenburg Apr 9, 2026
4f77938
Python: Run Hyperlight integration when available
eavanvalkenburg Apr 9, 2026
d959537
Python: Address Hyperlight review feedback
eavanvalkenburg Apr 9, 2026
779ff14
Python: Simplify Hyperlight file mount inputs
eavanvalkenburg Apr 9, 2026
003b5f2
Python: Accept Path host paths in Hyperlight mounts
eavanvalkenburg Apr 9, 2026
e296d61
Python: Fix Hyperlight mount typing for CI
eavanvalkenburg Apr 9, 2026
66e5028
temp run integration test
eavanvalkenburg Apr 10, 2026
f880d2c
Python: Strengthen Hyperlight real sandbox tests
eavanvalkenburg Apr 10, 2026
f5cb9ea
added additional tests
eavanvalkenburg Apr 10, 2026
1763ebf
Python: Simplify Hyperlight CodeAct API
eavanvalkenburg Apr 10, 2026
07d0b9e
set tests as non-integration
eavanvalkenburg Apr 13, 2026
83fe46d
Retry Hyperlight allowed-domain registration
eavanvalkenburg Apr 14, 2026
bbb8416
Gate Hyperlight integration tests by runtime support
eavanvalkenburg Apr 14, 2026
c055acd
Fix Hyperlight skip test on Python 3.14
eavanvalkenburg Apr 14, 2026
85d8e7f
Delay Hyperlight runtime probe until test execution
eavanvalkenburg Apr 14, 2026
2380fc8
Relax Hyperlight Windows integration stdout assertion
eavanvalkenburg Apr 14, 2026
bf56d12
Scan Hyperlight output directory for artifacts
eavanvalkenburg Apr 14, 2026
7cf4091
Retry Hyperlight output artifact collection
eavanvalkenburg Apr 14, 2026
771d763
Harden Hyperlight integration output assertions
eavanvalkenburg Apr 14, 2026
9f99e03
Retry Hyperlight read-back check in integration test
eavanvalkenburg Apr 14, 2026
21615b6
Simplify Hyperlight integration write assertion
eavanvalkenburg Apr 14, 2026
2d9544b
Avoid pathlib in Hyperlight integration sandbox
eavanvalkenburg Apr 14, 2026
582bf97
Use socket network check in Hyperlight sandbox
eavanvalkenburg Apr 14, 2026
be61f6f
Replace blocked Azure AI Search blog link
eavanvalkenburg Apr 14, 2026
54f1b3a
Clarify Hyperlight guest stdlib limits
eavanvalkenburg Apr 14, 2026
d25bf1f
Use _socket in Hyperlight integration sandbox
eavanvalkenburg Apr 14, 2026
7cf71c2
Handle Hyperlight mounted file paths
eavanvalkenburg Apr 14, 2026
dafdd21
Broaden Hyperlight sandbox path fallbacks
eavanvalkenburg Apr 14, 2026
55f7403
Search Hyperlight guest mounts recursively
eavanvalkenburg Apr 14, 2026
32244b1
Split Hyperlight mount coverage
eavanvalkenburg Apr 14, 2026
92eaad2
Split Hyperlight live network tests
eavanvalkenburg Apr 14, 2026
f3dbd8c
Fix Hyperlight file-write test on Windows
eavanvalkenburg Apr 14, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 3 additions & 2 deletions .github/workflows/python-integration-tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -131,7 +131,7 @@ jobs:
--timeout=120 --session-timeout=900 --timeout_method thread
--retries 2 --retry-delay 5

# Misc integration tests (Anthropic, Ollama, MCP)
# Misc integration tests (Anthropic, Hyperlight, Ollama, MCP)
python-tests-misc-integration:
name: Python Integration Tests - Misc
runs-on: ubuntu-latest
Expand Down Expand Up @@ -162,10 +162,11 @@ jobs:
fallback_url: ${{ env.LOCAL_MCP_URL }}
- name: Prefer local MCP URL when available
run: echo "LOCAL_MCP_URL=${{ steps.local-mcp.outputs.effective_url }}" >> "$GITHUB_ENV"
- name: Test with pytest (Anthropic, Ollama, MCP integration)
- name: Test with pytest (Anthropic, Hyperlight, Ollama, MCP integration)
run: >
uv run pytest --import-mode=importlib
packages/anthropic/tests
packages/hyperlight/tests
packages/ollama/tests
packages/core/tests/core/test_mcp.py
-m integration
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/python-merge-tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -65,6 +65,7 @@ jobs:
- 'python/samples/**/providers/azure/**'
misc:
- 'python/packages/anthropic/**'
- 'python/packages/hyperlight/**'
- 'python/packages/ollama/**'
- 'python/packages/core/agent_framework/_mcp.py'
- 'python/packages/core/tests/core/test_mcp.py'
Expand Down Expand Up @@ -278,10 +279,11 @@ jobs:
fallback_url: ${{ env.LOCAL_MCP_URL }}
- name: Prefer local MCP URL when available
run: echo "LOCAL_MCP_URL=${{ steps.local-mcp.outputs.effective_url }}" >> "$GITHUB_ENV"
- name: Test with pytest (Anthropic, Ollama, MCP integration)
- name: Test with pytest (Anthropic, Hyperlight, Ollama, MCP integration)
run: >
uv run pytest --import-mode=importlib
packages/anthropic/tests
packages/hyperlight/tests
packages/ollama/tests
packages/core/tests/core/test_mcp.py
-m integration
Expand Down
227 changes: 227 additions & 0 deletions docs/decisions/0024-codeact-integration.md

Large diffs are not rendered by default.

6 changes: 6 additions & 0 deletions docs/features/code_act/dotnet-implementation.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
# CodeAct .NET implementation

This document will describe the .NET realization of the CodeAct design in
[`docs/decisions/0024-codeact-integration.md`](../../decisions/0024-codeact-integration.md).

Coming soon.
359 changes: 359 additions & 0 deletions docs/features/code_act/python-implementation.md

Large diffs are not rendered by default.

2 changes: 2 additions & 0 deletions python/.cspell.json
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,7 @@
"azuredocs",
"azurefunctions",
"boto",
"codeact",
"contentvector",
"contoso",
"datamodel",
Expand All @@ -45,6 +46,7 @@
"hnsw",
"httpx",
"huggingface",
"hyperlight",
"Instrumentor",
"logit",
"logprobs",
Expand Down
1 change: 1 addition & 0 deletions python/PACKAGE_STATUS.md
Original file line number Diff line number Diff line change
Expand Up @@ -33,6 +33,7 @@ Status is grouped into these buckets:
| `agent-framework-foundry-local` | `python/packages/foundry_local` | `beta` |
| `agent-framework-gemini` | `python/packages/gemini` | `alpha` |
| `agent-framework-github-copilot` | `python/packages/github_copilot` | `beta` |
| `agent-framework-hyperlight` | `python/packages/hyperlight` | `alpha` |
| `agent-framework-lab` | `python/packages/lab` | `beta` |
| `agent-framework-mem0` | `python/packages/mem0` | `beta` |
| `agent-framework-ollama` | `python/packages/ollama` | `beta` |
Expand Down
7 changes: 5 additions & 2 deletions python/packages/core/agent_framework/_agents.py
Original file line number Diff line number Diff line change
Expand Up @@ -1191,11 +1191,14 @@ async def _prepare_run_context(
options=opts,
service_stores_history=bool(store_),
)
provider_options = dict(opts)
if tools_ is not None:
provider_options["tools"] = tools_

session_context, chat_options = await self._prepare_session_and_messages(
session=active_session,
input_messages=input_messages,
options=opts,
options=provider_options,
)
default_additional_args = chat_options.pop("additional_function_arguments", None)
if isinstance(default_additional_args, Mapping):
Expand All @@ -1209,7 +1212,7 @@ async def _prepare_run_context(
mcp_duplicate_message = "Tool names must be unique. Consider setting `tool_name_prefix` on the MCPTool."

# Normalize tools
normalized_tools = normalize_tools(tools_)
normalized_tools = normalize_tools(session_context.options.get("tools", tools_))

# Resolve final tool list (configured tools + runtime provided tools + local MCP server tools)
final_tools = list(base_tools)
Expand Down
6 changes: 4 additions & 2 deletions python/packages/core/agent_framework/_sessions.py
Original file line number Diff line number Diff line change
Expand Up @@ -149,7 +149,8 @@ class SessionContext:
middleware: Dict mapping source_id -> chat/function middleware added by that provider.
Maintains insertion order (provider execution order).
response: After invocation, contains the full AgentResponse, should not be changed.
options: Options passed to agent.run() - read-only, for reflection only.
options: Options passed to agent.run(). Providers can inspect these and may
update ``options["tools"]`` to influence per-run tool resolution.
metadata: Shared metadata dictionary for cross-provider communication.
"""

Expand All @@ -176,7 +177,8 @@ def __init__(
instructions: Pre-populated instructions.
tools: Pre-populated tools.
middleware: Pre-populated chat/function middleware by source.
options: Options from agent.run() - read-only for providers.
options: Options from agent.run(). Providers may inspect these and can
update ``options["tools"]`` to influence per-run tool resolution.
metadata: Shared metadata for cross-provider communication.
"""
self.session_id = session_id
Expand Down
9 changes: 5 additions & 4 deletions python/packages/core/agent_framework/_tools.py
Original file line number Diff line number Diff line change
Expand Up @@ -89,6 +89,7 @@
DEFAULT_MAX_ITERATIONS: Final[int] = 40
DEFAULT_MAX_CONSECUTIVE_ERRORS_PER_REQUEST: Final[int] = 3
SHELL_TOOL_KIND_VALUE: Final[str] = "shell"
ApprovalMode: TypeAlias = Literal["always_require", "never_require"]
ChatClientT = TypeVar("ChatClientT", bound="SupportsChatGetResponse[Any]")
ResponseModelBoundT = TypeVar("ResponseModelBoundT", bound=BaseModel)

Expand Down Expand Up @@ -270,7 +271,7 @@ def __init__(
*,
name: str,
description: str = "",
approval_mode: Literal["always_require", "never_require"] | None = None,
approval_mode: ApprovalMode | None = None,
kind: str | None = None,
max_invocations: int | None = None,
max_invocation_exceptions: int | None = None,
Expand Down Expand Up @@ -1030,7 +1031,7 @@ def tool(
name: str | None = None,
description: str | None = None,
schema: type[BaseModel] | Mapping[str, Any] | None = None,
approval_mode: Literal["always_require", "never_require"] | None = None,
approval_mode: ApprovalMode | None = None,
kind: str | None = None,
max_invocations: int | None = None,
max_invocation_exceptions: int | None = None,
Expand All @@ -1046,7 +1047,7 @@ def tool(
name: str | None = None,
description: str | None = None,
schema: type[BaseModel] | Mapping[str, Any] | None = None,
approval_mode: Literal["always_require", "never_require"] | None = None,
approval_mode: ApprovalMode | None = None,
kind: str | None = None,
max_invocations: int | None = None,
max_invocation_exceptions: int | None = None,
Expand All @@ -1061,7 +1062,7 @@ def tool(
name: str | None = None,
description: str | None = None,
schema: type[BaseModel] | Mapping[str, Any] | None = None,
approval_mode: Literal["always_require", "never_require"] | None = None,
approval_mode: ApprovalMode | None = None,
kind: str | None = None,
max_invocations: int | None = None,
max_invocation_exceptions: int | None = None,
Expand Down
61 changes: 61 additions & 0 deletions python/packages/core/tests/core/test_agents.py
Original file line number Diff line number Diff line change
Expand Up @@ -841,6 +841,67 @@ async def test_per_service_call_persistence_rejects_existing_conversation_id_whe
await agent.run("Hello", session=session, options={"store": False, "conversation_id": "existing_conversation"})


async def test_context_provider_can_inspect_runtime_tools_from_run(
chat_client_base: SupportsChatGetResponse,
) -> None:
seen_tools: list[Any] = []

class RuntimeToolsProvider(ContextProvider):
def __init__(self) -> None:
super().__init__(source_id="runtime-tools")

async def before_run(self, *, agent: Any, session: Any, context: Any, state: Any) -> None:
del agent, session, state
tools = context.options.get("tools", [])
seen_tools.extend(list(tools) if isinstance(tools, list) else [tools])

runtime_tool = FunctionTool(func=lambda: "runtime", name="runtime_tool", description="Runtime tool")
agent = Agent(client=chat_client_base, context_providers=[RuntimeToolsProvider()])

await agent._prepare_run_context( # type: ignore[reportPrivateUsage]
messages="Hello",
session=agent.create_session(),
tools=[runtime_tool],
options=None,
compaction_strategy=None,
tokenizer=None,
function_invocation_kwargs=None,
client_kwargs=None,
)

assert seen_tools == [runtime_tool]


async def test_context_provider_can_remove_runtime_tools_from_run(
chat_client_base: SupportsChatGetResponse,
) -> None:
class RuntimeToolsProvider(ContextProvider):
def __init__(self) -> None:
super().__init__(source_id="runtime-tools")

async def before_run(self, *, agent: Any, session: Any, context: Any, state: Any) -> None:
del agent, session, state
context.options["tools"] = []

base_tool = FunctionTool(func=lambda: "base", name="base_tool", description="Base tool")
runtime_tool = FunctionTool(func=lambda: "runtime", name="runtime_tool", description="Runtime tool")
agent = Agent(client=chat_client_base, tools=[base_tool], context_providers=[RuntimeToolsProvider()])

ctx = await agent._prepare_run_context( # type: ignore[reportPrivateUsage]
messages="Hello",
session=agent.create_session(),
tools=[runtime_tool],
options=None,
compaction_strategy=None,
tokenizer=None,
function_invocation_kwargs=None,
client_kwargs=None,
)

tool_names = [_get_tool_name(tool_obj) for tool_obj in ctx["chat_options"]["tools"]]
assert tool_names == ["base_tool"]


async def test_chat_client_agent_run_with_session(chat_client_base: SupportsChatGetResponse) -> None:
mock_response = ChatResponse(
messages=[Message(role="assistant", contents=[Content.from_text("test response")])],
Expand Down
21 changes: 21 additions & 0 deletions python/packages/hyperlight/LICENSE
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
MIT License

Copyright (c) Microsoft Corporation.

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE
36 changes: 36 additions & 0 deletions python/packages/hyperlight/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
# agent-framework-hyperlight

Alpha Hyperlight-backed CodeAct integrations for Microsoft Agent Framework.

## Installation

```bash
pip install agent-framework-hyperlight --pre
```

This package depends on `hyperlight-sandbox`, the packaged Python guest, and the
Wasm backend package on supported platforms. If the backend is not published for
your current platform yet, `execute_code` will fail at runtime when it tries to
create the sandbox.

## Public API

- `AllowedDomain`
- `AllowedDomainInput`
- `HyperlightCodeActProvider`
- `HyperlightExecuteCodeTool`
- `FileMount`
- `FileMountInput`

## Notes

- This package is intentionally separate from `agent-framework-core` so CodeAct
usage and installation remain optional.
- Alpha-package samples live under `packages/hyperlight/samples/`.
- `file_mounts` accepts a single string shorthand, an explicit `(host_path,
mount_path)` pair, or a `FileMount` named tuple. The host-side path in the
explicit forms may be a `str` or `Path`. Use the explicit two-value form when
the host path differs from the sandbox path.
- `allowed_domains` accepts a single string target such as `"github.com"` to
allow all backend-supported methods, an explicit `(target, method_or_methods)`
tuple such as `("github.com", "GET")`, or an `AllowedDomain` named tuple.
24 changes: 24 additions & 0 deletions python/packages/hyperlight/agent_framework_hyperlight/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
# Copyright (c) Microsoft. All rights reserved.

from __future__ import annotations

import importlib.metadata

from ._execute_code_tool import HyperlightExecuteCodeTool
from ._provider import HyperlightCodeActProvider
from ._types import AllowedDomain, AllowedDomainInput, FileMount, FileMountInput

try:
__version__ = importlib.metadata.version(__name__)
except importlib.metadata.PackageNotFoundError:
__version__ = "0.0.0"

__all__ = [
"AllowedDomain",
"AllowedDomainInput",
"FileMount",
"FileMountInput",
"HyperlightCodeActProvider",
"HyperlightExecuteCodeTool",
"__version__",
]
Loading
Loading