[Draft] Prototype for Docker executor #225

aymeric-roucher · 2025-01-16T15:48:22Z

aymeric-roucher · 2025-01-16T15:51:13Z

@stackviolator if you want to talk in more detail and have Slack we can open a colab there!

stackviolator · 2025-01-16T16:03:31Z

Hey! I'll check these out. A slack collab would actually be great. In the process of disclosing a security issue to you guys, a docker executor would help remediate the issue :^)

stackviolator · 2025-01-16T21:56:45Z

Ok I have some progress on my fork (main...stackviolator:smolagents:main) that is working fairly consistently for builtin tools.

Major design differences:

Made the containers ephemeral. A new container is spun and destroyed for each code step. This was made in case an attacker escapes the interpreter and persists within the container, any data on the container could be accessed. E.g. RAG agent sorting through internal docs.
No use of a socket. Since the containers are ephemeral, there isn't a need to make a server that executes commands, can just catch stdout and stderr when the container finished its script.
Create a new instance of a LocalPythonInterpreter in the container prior to execution. The container will install smolagents and has access to all the builtins. I played with pickling and shipping the entire object but since tools are either functions or objects, it would be a big pain to ship them between the main process and the container.

I haven't tested yet with custom tools with the @tool decorator, doubt my solution will be able to handle it. But this should be a pretty good start.

aymeric-roucher · 2025-01-16T22:08:34Z

Hello @stackviolator ! Two constraints should indeed absolutely be handled

consistent state between steps (else our agents won't really be multi-step if they can't reuse in later steps what they've built in earlier ones)
being able to pass custom tools!
The pre-existing solution built by @ErikKaum already handles the consistent state, I think what it needs is mostly the custom tool part to be more secure and easily runnable!

Also check the E2B executor, it can pass custom tools (via exporting the script using Tool.save()) and has state synchronisation between local and remote machine via pickling, I think this can be applied to remote tools to.

stackviolator · 2025-01-16T22:26:02Z

In my solution, the additional_args dict which is passed into call is sent to the container and used during execution. These are then passed into the new instance of the LocalPythonInterpreter. Correct me if I'm wrong but I believe this is the state transfer that you're talking about. At least that's what it looks like is happening with the (un)pickling in e2b.

I'm also a bit hesitant to keep a single container running throughout the lifetime of an agent. If an attacker can persist on the container during execution, various artifacts (API keys, previous code, data in the prompts) will likely pile up and defeat the purpose of having the sandbox in the first place. The idea behind the ephemeral containers was to emulate something like AWS Lambda.

As for the custom tools -- yes need to support those. I'll look into how you guys are doing them for e2b and Erik's implementation :)

Prototype for Docker executor

739bb97

aymeric-roucher mentioned this pull request Jan 16, 2025

Return textboxes on Gradio file upload errors #214

Merged

albertvillanova marked this pull request as draft January 24, 2025 14:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Draft] Prototype for Docker executor #225

[Draft] Prototype for Docker executor #225

aymeric-roucher commented Jan 16, 2025

aymeric-roucher commented Jan 16, 2025

stackviolator commented Jan 16, 2025

stackviolator commented Jan 16, 2025

aymeric-roucher commented Jan 16, 2025 •

edited

Loading

stackviolator commented Jan 16, 2025

[Draft] Prototype for Docker executor #225

Are you sure you want to change the base?

[Draft] Prototype for Docker executor #225

Conversation

aymeric-roucher commented Jan 16, 2025

aymeric-roucher commented Jan 16, 2025

stackviolator commented Jan 16, 2025

stackviolator commented Jan 16, 2025

aymeric-roucher commented Jan 16, 2025 • edited Loading

stackviolator commented Jan 16, 2025

aymeric-roucher commented Jan 16, 2025 •

edited

Loading