How to best clean up artifacts? #54

Open · milancurcic opened this issue Sep 22, 2022 · 8 comments
Labels: question (Further information is requested), security (Security-related issues)

@milancurcic (Member)
When the client makes a request to the playground server, the server copies two files to the Docker container: the program source (main.f90) and input data (input.txt or similar).

Deleting these files is easy because they're tracked by the Python function that handles the request.
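(For context, a hedged sketch of what that tracked copy-in/clean-up could look like with the docker-py SDK; the helper names and the /home/fortran paths are illustrative, not necessarily the playground's actual code:)

```python
import io
import tarfile

def copy_in(container, name: str, text: str) -> None:
    # Bundle a single file into an in-memory tar archive and extract it
    # inside the container with put_archive.
    data = text.encode()
    info = tarfile.TarInfo(name=name)
    info.size = len(data)
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        tar.addfile(info, io.BytesIO(data))
    container.put_archive("/home/fortran", buf.getvalue())

def clean_up(container, names: list[str]) -> None:
    # The handler knows exactly which files it copied in,
    # so removing them afterwards is straightforward.
    container.exec_run(["rm", "-f", *[f"/home/fortran/{n}" for n in names]])
```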

However, how best to handle the artifacts that a program can create by calling execute_command_line or by writing data to a file via open() and write() statements? These could be written anywhere in the user-writable part of the container (/home/fortran).

Worse, a creative user could overwrite existing files in the container that are necessary for fpm to work.

A proposed solution that came up on the GSoC calls for this project goes along these lines (a rough sketch follows the list):

  1. Create a uniquely named directory (e.g. using uuid.uuid4()) and place all needed artifacts (e.g. fpm, gfortran, shared libs), or copies/links to them, in that directory.
  2. Run the program in the container in that unique directory under chroot and return the result. This prevents the program from creating files outside of the directory.
  3. Delete the directory when done (this part can be delegated to a separate thread so that the response can be returned to the user immediately).
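A minimal sketch of those three steps, assuming the program is launched through the chroot(8) command and that the toolchain's shared libraries are also copied into the sandbox (elided here); the paths, tool list, and timeout are all illustrative:

```python
import shutil
import subprocess
import threading
import uuid
from pathlib import Path

SANDBOX_ROOT = Path("/home/fortran/sandboxes")  # illustrative location
TOOLS = ["/usr/bin/fpm", "/usr/bin/gfortran"]   # shared libs also needed in practice

def run_sandboxed(source: str) -> subprocess.CompletedProcess:
    # 1. Uniquely named directory per request.
    sandbox = SANDBOX_ROOT / uuid.uuid4().hex
    (sandbox / "usr/bin").mkdir(parents=True)
    for tool in TOOLS:
        # Copy rather than symlink: a symlink pointing outside the new
        # root would not resolve once we are inside the chroot.
        shutil.copy2(tool, sandbox / "usr/bin")
    (sandbox / "main.f90").write_text(source)

    # 2. Run under chroot (requires root) so nothing can be written
    #    outside the sandbox directory.
    result = subprocess.run(
        ["chroot", str(sandbox), "/usr/bin/fpm", "run"],
        capture_output=True, text=True, timeout=30,
    )

    # 3. Delete the sandbox in a background thread so the response
    #    can be returned to the user immediately.
    threading.Thread(target=shutil.rmtree, args=(sandbox,), daemon=True).start()
    return result
```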

What do you think?

@arjenmarkus (Member) commented Sep 23, 2022 via email

@certik (Member) commented Sep 23, 2022

How are multiple requests handled? Do they all access the same container?

@awvwgk (Member) commented Sep 23, 2022

I think there are multiple instances, but once you create files you can find them again, after a few tries, in one of the instances.

@milancurcic (Member, Author)

Each server worker (think Python process, there are 3 currently) spins up a Docker container. For each incoming request, a server worker opens a new thread in which the request is handled. The threads share memory, so multiple requests that are incoming through the same worker use the same Docker container.

In the initial implementation, @ashirrwad had each request spin up a new container. However, this added a few seconds of overhead per request, so we pivoted to one long-running container (per worker) that is reused. Of course, this approach requires hardening because the container is stateful across requests.
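To illustrate the pattern (not the playground's actual code; the image name and function are made up), with the docker-py SDK it looks roughly like this:

```python
import docker

client = docker.from_env()

# Top-level (module) scope: each Gunicorn worker imports this module once,
# so each worker holds exactly one long-lived container.
container = client.containers.run("fortran-playground", detach=True, tty=True)

def handle_request(cmd: list[str]) -> str:
    # Called from a per-request thread. All threads in this worker share
    # the single `container` above, so state persists between requests
    # and must be cleaned up explicitly (the problem this issue is about).
    exit_code, output = container.exec_run(cmd, workdir="/home/fortran")
    return output.decode()
```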

@milancurcic (Member, Author)

I wonder, if we end up needing chroot, whether we need Docker at all.

@LKedward (Member)

I'm surprised that it's not one container instance per request. When I use Docker [on Linux] the containers start almost instantly (<1 s). Also, if there are three persistent containers, does that mean that no more than three concurrent requests can be handled in a short period of time? In addition to avoiding this persistence/cleanup problem, another advantage of one container per request is that you can limit CPU and memory usage for that container to ensure that it doesn't impact overall system performance (there is potential for a denial-of-service attack otherwise).
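For comparison, a hedged sketch of the one-container-per-request alternative with such limits (docker-py again; the image name, limits, and mount are illustrative):

```python
import docker

client = docker.from_env()

def handle_request(source_dir: str) -> bytes:
    # A fresh container per request: no state leaks between users, and
    # Docker enforces per-request CPU/memory caps.
    return client.containers.run(
        "fortran-playground",
        ["fpm", "run"],
        volumes={source_dir: {"bind": "/home/fortran", "mode": "rw"}},
        mem_limit="256m",          # cap memory for this request
        nano_cpus=1_000_000_000,   # cap at roughly one CPU
        network_disabled=True,     # user code gets no network access
        remove=True,               # remove the container when it exits
    )
```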

@certik (Member) commented Sep 23, 2022

I am surprised too, but I also suspect Docker can be slow, especially on some low-performance (free) VM on AWS.

Scaling this well is difficult; ensuring security, performance, and maintainability is a full-time job for a long time, and it's probably also quite expensive, with multiple workers etc. But we don't need all that to get started.

Yes, it seems using directories is probably fine for now to get started.

@milancurcic (Member, Author) commented Sep 23, 2022

Good question, and I don't know. I haven't made a hard measurement; the preliminary conclusion about it being slow was based on the local development environment. It would certainly be easy to try again: it's a matter of instantiating the container within the request-handling function vs. at the top-level (module) scope, as it is now.
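For example, a quick local measurement could look like this (illustrative image name; numbers will vary by host):

```python
import time
import docker

client = docker.from_env()

start = time.perf_counter()
# Start a throwaway container and time how long it takes to come up.
container = client.containers.run("fortran-playground", detach=True, tty=True)
print(f"container started in {time.perf_counter() - start:.2f} s")
container.remove(force=True)
```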

As I understand it, Gunicorn (our server) can serve many more requests than it has workers because the workers offload the work to threads. But with the current approach we may run into conflicts on the shared Docker container if there are too many users at once. Ashirwad doesn't think this happens because of how WSGI works; he may be correct, but I don't understand it well enough to tell for sure.

Brad suggested on an early GSoC call trying plain chroot instead of the Docker container. At the time, Ashirwad preferred the Docker approach as he was more familiar with it, so we went ahead with that.
