PREPRINT
8A0423BD-1AA5-40F8-A002-9E458C0C54B7

Rosetta: a container-centric science platform for resource-intensive, interactive data analysis

Stefano Alberto Russo, Sara Bertocco, Claudio Gheller, Giuliano Taffoni

Submitted on 5 September 2022

Abstract

Rosetta is a science platform for resource-intensive, interactive data analysis which runs user tasks as software containers. It is built on top of a novel architecture based on framing user tasks as microservices - independent and self-contained units - which allows to fully support custom and user-defined software packages, libraries and environments. These include complete remote desktop and GUI applications, besides common analysis environments as the Jupyter Notebooks. Rosetta relies on Open Container Initiative containers, which allow for safe, effective and reproducible code execution; can use a number of container engines and runtimes; and seamlessly supports several workload management systems, thus enabling containerized workloads on a wide range of computing resources. Although developed in the astronomy and astrophysics space, Rosetta can virtually support any science and technology domain where resource-intensive, interactive data analysis is required.

Preprint

Subjects: Astrophysics - Instrumentation and Methods for Astrophysics; Computer Science - Distributed, Parallel, and Cluster Computing

URL: https://arxiv.org/abs/2209.02003