The goal of the project is the creation of a distributed computing environment based on Python and the Ray framework, designed to unify the computational resources of the three workstations of the Lab. This setup ensures efficient management of parallel tasks and dynamic scalability according to workload demands.
The Laboratory is equipped with three high-performance workstations, each featuring an NVIDIA RTX 3080 GPU and an Intel i9 processor, designed to deliver exceptional computing power for simulation, graphic processing, and advanced scientific computation.
These systems form an ideal technological foundation for the creation of a high-performance distributed environment, capable of efficiently managing complex and parallel workloads.
The study and design of the prototype are based on a state-of-the-art open-source solution: Ray, a framework that enables the efficient management of distributed and parallel computation.
The real strength of Ray lies in its ability to create a scalable and flexible computing environment, adaptable to different infrastructures—from a single laptop to complex on-premise systems—and compatible with both Windows and Linux.
This versatility makes it suitable for both small-scale contexts and more complex infrastructures, ensuring high performance and operational reliability.
The first step of the prototype involves the use of Ubuntu on Hyper-V, a configuration that allows the efficiency of Linux to be leveraged within Windows machines, combining the power of the open-source operating system with the compatibility of the Microsoft environment.
Subsequently, Ray is installed within a Kubernetes environment, a platform that enables dynamic scalability and automatic orchestration of resources across multiple nodes.
Thanks to Kubernetes, the prototype can optimally manage distributed workloads, dynamically allocating resources and ensuring stable and high-performance operation.
This configuration represents a strategic step toward building an efficient and modular distributed infrastructure, capable of supporting advanced research and experimental development activities.
After the initial design and configuration phase, activities focused on the complete installation of the system and the verification of its operational functionality within the laboratory environment.
The main objective of this phase was to confirm the feasibility of implementing and coherently managing the infrastructure described in the project design.
The Ray system was successfully configured in a Kubernetes environment, resulting in a fully operational distributed cluster capable of managing coordinated process execution.
The platform proved consistent with the planned architecture, allowing effective communication between nodes and proper allocation of computing resources.
The installation and configuration procedures confirmed the compatibility and reliability of the Ray model within the laboratory context, ensuring stable integration with the existing infrastructure.
The system is currently operational at a prototype level and serves as a functional foundation for further experimentation and development, paving the way for future improvements to the cluster and the introduction of more advanced monitoring and management tools and remote access.