Executing Code on the Client
Jeff Taylor-Chang
Frontend Dev for Actually Colab#
The ProblemServer-side execution of code is great because it requires minimal setup from the user and potentially offers the ability to use more powerful machines and GPUs the client may not have. The issue with server-side execution is that we then take on the cost of spawning and hosting many containers in the cloud which becomes expensive quickly. This is one of the reasons that the tools on the market that offer real-time collaboration are so expensive since they have all opted to run code server-side. They are trading convenience for cost, and for plenty of companies or individuals with large budgets that's often perfectly acceptable and sometimes even preferred. However, since we are building a service aimed at keeping costs as low as possible to make the service accessible to people who otherwise couldn't afford these tools, we had to come up with a different approach.
#
Client-side ExecutionWe decided to take a page from the classic Jupyter Notebook that started it all. The Jupyter Notebook, when run locally, interacts with a IPython Kernel instance on your local machine. This means that when you are running code in the notebook editor, you are executing code using dependencies installed on your computer as well as with access to files on your local disk. Since we wouldn't host the Kernel on our servers, this dramatically reduced what our server costs would be and allowed us to build a serverless (Lambda) architecture without spawning containers. This allows us to deliver a much more cost-efficient service and those cost-savings are directly passed to our users. That being said, it has some obvious disadvantages when applied to a collaborative cloud editor, namely there is no guarantee that everyone has the same dependencies unlike in a controlled server-side environment. That's a problem we will tackle in the future, but the first step was configuring our cloud editor to run code directly on the client.
#
Initial Approach with ElectronOur first instinct was to build our editor as an Electron app which would provide us with access to the client's machine and allow us to execute commands and spawn processes. We created a hidden renderer process that spawned and managed a Kernel instance on the client's machine and communicated with the editor via IPC messaging. This worked great, and we had a working Proof-of-Concept in a matter of hours. One drawback we quickly realized however was that we had limited the accessibility to our application.
- It would require that we build MacOS, Windows, and Linux builds of the application regularly and distribute them to users, making it difficult to update the editor rapidly.
- Not everyone needs to run outputs when working on a team and so we had introduced a hard native dependency on something that otherwise could be easily accessible in the browser.
- Integrating OAuth became considerably more complex because it required spawning a browser window that then redirected to the desktop application
#
Separating the Kernel from the EditorOnce we had decided that it would be better to build a web editor, we had to figure out how to handle the connection between the kernel and the editor. What we did is separate our codebase and convert the Electron application into just a simple UI that allowed the user to monitor the kernel process without looking at a terminal. We also used the Jupyter Kernel Gateway to convert the ZeroMQ messages that the Kernel normally uses into REST and websocket endpoints that our browser-based editor could use more easily. Finally we made the browser-based version of the editor automatically check to see if it can connect to the kernel and allowed users to configure their kernel gateway URI in the editor. This had some interesting implications:
- The native Electron application became entirely optional. Users who wish to have more control over the process can run the Kernel Gateway in the terminal using commands documented in our launcher's README
- The user could, in theory, connect to a Kernel that isn't on their own machine since the gateway URI is configurable. This would allow the user to spawn the Kernel Gateway on a VM on AWS for instance and then connect remotely for more power while keeping the editor UI in their local browser