This repository is part of a teaching innovation project for the Computer Engineering Degree of the University of Zaragoza.

universidad-zaragoza/learning-experience-ray-tracing

Interactions between System Abstraction Levels: Ray Tracing

This project consists of four lab assignments from different courses of the Degree in Computer Engineering at Universidad de Zaragoza, Spain. Together, the labs form a multidisciplinary project whose main objective is to give students of the degree a global view of a computer system.

This work allows students to consolidate their knowledge of atomicity, consistency, parallelism, and concurrency in a computer system. The system to be implemented is a ray tracer that uses concurrent queues. The queues rely on mutex-based library functions, on system calls such as futex, or on synchronization code written directly in assembly language. In this way, the project covers the Application, Library, Operating System, and Instruction Set Architecture levels of abstraction, involving the courses of Computer Graphics, Distributed and Concurrent Systems Programming, Operating Systems, and Multiprocessors, respectively.

Each 2-hour lab session covers a different level of abstraction and adds pointers to the other levels. To reinforce the view of a whole computer system, the Raspberry Pi platform is used in all the labs. Because of how the courses are distributed across the academic years of the degree, development cannot proceed from the Application level down to the Instruction Set Architecture. Instead, students develop the system in the order established by the degree. In the first lab (Library lab), the concurrent queues are implemented using library functions. In the second lab (OS lab), the library functions are replaced by operating system calls. The third lab (ISA lab) replaces the system calls with assembly instructions. Finally, in the last lab (Application lab), a ray tracer makes use of the concurrent queues.

The project is structured in four folders, one for each lab session of the courses involved in the learning experience. A working lab implementation can be found in each folder.

The material is offered by the Undergraduate Student Emanuel A. Georgescu and by Professors Alejandro Valero, Rubén Gran-Tejero, Darío Suárez-Gracia, Joaquín Ezpeleta, Pedro Álvarez, Adolfo Muñoz, Luis M. Ramos, and Pablo Ibáñez. All professors are members of the Department of Computer Science and Systems Engineering (DIIS) from Universidad de Zaragoza.

OS lab

This folder contains the implementation of three OS-level mutex alternatives, referred to as Spin, Basic, and Drepper. At this level, both the Basic and Drepper solutions use futex syscalls. The folder contains a 'Makefile' and the source code 'main.c'. To compile 'main.c', run:

$prompt> make 

Run the 'main' executable according to the following usage:

$prompt> ./main [spin|basic|drepper] #threads(1<=th<=240) #MAX_SUM max_rep(>=0)
  • [spin|basic|drepper]: selects a mutex implementation to be run
  • #threads(1<=th<=240): defines the number of threads cooperating at execution
  • #MAX_SUM: number of times that the kernel code should run
  • max_rep(>=0): number of times that the entire experiment is repeated
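
As a rough illustration of the Spin alternative, here is a minimal C++ sketch of a busy-waiting mutex protecting a shared counter. It is not the code in 'main.c': the names `SpinMutex` and `run_experiment` are hypothetical, and the sketch assumes that each thread runs the kernel (one protected increment) `max_sum` times.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Hypothetical spin mutex: busy-waits until the lock word is clear.
class SpinMutex {
public:
    void lock() {
        // exchange() returns the previous value; keep looping while the
        // lock was already held by another thread.
        while (locked_.exchange(true, std::memory_order_acquire)) {
            std::this_thread::yield();  // give other threads a chance to run
        }
    }
    void unlock() { locked_.store(false, std::memory_order_release); }

private:
    std::atomic<bool> locked_{false};
};

// Hypothetical experiment (an assumption, not the lab's kernel): each of
// n_threads threads adds 1 to a shared counter max_sum times, and every
// increment is protected by the spin mutex.
long run_experiment(int n_threads, long max_sum) {
    assert(n_threads >= 1);
    SpinMutex mtx;
    long counter = 0;
    std::vector<std::thread> workers;
    for (int t = 0; t < n_threads; ++t) {
        workers.emplace_back([&] {
            for (long i = 0; i < max_sum; ++i) {
                mtx.lock();
                ++counter;  // critical section
                mtx.unlock();
            }
        });
    }
    for (auto& w : workers) w.join();
    return counter;
}
```

If mutual exclusion holds, `run_experiment(4, 10000)` returns 40000. The Basic and Drepper alternatives would replace the busy-wait loop with futex wait and wake calls, which this sketch does not attempt to show.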

Library lab

This folder holds the code for a Concurrent Bounded Queue, which uses a Semaphore class. Three implementations of the Semaphore class are provided, in the folders 'sem-library-mutex', 'sem-futex-mutex-busy-wait', and 'sem-futex-mutex-w-suspension'. The first uses a mutex from the standard library to protect critical sections, and condition variables to suspend execution in case of contention. The second and third use futex system calls to protect critical sections, with a busy-wait and a with-suspension approach, respectively. In addition, these last two alternatives rely on a mutex implementation based on spin-lock, basic, or drepper, as described in the 'OS_lab'. The folder contains a 'Makefile' and the source code 'main.c'. To compile 'main.c', run:

$prompt> make 

Running 'make' produces three executables that benchmark the Concurrent Bounded Queue, one for each of the three underlying implementations: 'main_lib', 'main_futex_busy_wait', and 'main_futex_suspension'.

$prompt> ./main QUEUE_SIZE N_READERS MUTEX_TYPE(s,b,d) max_rep
  • QUEUE_SIZE: size of the concurrent bounded queue, measured in the number of items it can hold
  • N_READERS: number of concurrent reader threads for the queue
  • MUTEX_TYPE: when applicable, it refers to the approach that implements the mutex itself (see 'OS_lab')
  • max_rep(>=0): number of times that the entire experiment is repeated
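
To make the structure described above concrete, the following C++ sketch builds a counting semaphore from a library mutex and a condition variable (in the spirit of 'sem-library-mutex') and uses three such semaphores to implement a bounded queue. All names (`Semaphore`, `BoundedQueue`, `sum_through_queue`) and the use of -1 as a stop marker are assumptions made for illustration; the classes in the repository may differ.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical counting semaphore: library mutex + condition variable.
class Semaphore {
public:
    explicit Semaphore(int initial) : count_(initial) {}
    void wait() {
        std::unique_lock<std::mutex> lk(m_);
        cv_.wait(lk, [this] { return count_ > 0; });  // suspend while no permit
        --count_;
    }
    void signal() {
        { std::lock_guard<std::mutex> lk(m_); ++count_; }
        cv_.notify_one();
    }
private:
    std::mutex m_;
    std::condition_variable cv_;
    int count_;
};

// Hypothetical bounded queue of ints stored in a ring buffer.
class BoundedQueue {
public:
    explicit BoundedQueue(std::size_t capacity)
        : buf_(capacity), free_slots_(static_cast<int>(capacity)),
          used_slots_(0), mutex_(1) {
        assert(capacity > 0);
    }
    void enqueue(int v) {
        free_slots_.wait();   // block while the queue is full
        mutex_.wait();        // enter critical section
        buf_[tail_] = v;
        tail_ = (tail_ + 1) % buf_.size();
        mutex_.signal();
        used_slots_.signal(); // one more item available
    }
    int dequeue() {
        used_slots_.wait();   // block while the queue is empty
        mutex_.wait();
        int v = buf_[head_];
        head_ = (head_ + 1) % buf_.size();
        mutex_.signal();
        free_slots_.signal(); // one more free slot
        return v;
    }
private:
    std::vector<int> buf_;
    std::size_t head_ = 0, tail_ = 0;
    Semaphore free_slots_, used_slots_, mutex_;
};

// One writer enqueues 1..n_items, then one -1 stop marker per reader;
// the readers sum everything they dequeue.
long sum_through_queue(std::size_t capacity, int n_readers, int n_items) {
    BoundedQueue q(capacity);
    std::vector<long> partial(n_readers, 0);
    std::vector<std::thread> readers;
    for (int r = 0; r < n_readers; ++r) {
        readers.emplace_back([&q, &partial, r] {
            for (int v = q.dequeue(); v != -1; v = q.dequeue()) partial[r] += v;
        });
    }
    for (int i = 1; i <= n_items; ++i) q.enqueue(i);
    for (int r = 0; r < n_readers; ++r) q.enqueue(-1);
    for (auto& t : readers) t.join();
    long total = 0;
    for (long p : partial) total += p;
    return total;
}
```

With a queue of 4 slots and 3 readers, `sum_through_queue(4, 3, 100)` returns 5050 if no item is lost or duplicated. Swapping the semaphore's internals for futex-based code would leave `BoundedQueue` unchanged, which is the layering the lab relies on.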

ISA lab

This folder holds the code for a Concurrent Bounded Queue, which uses a Semaphore class based on the 'sem-futex-mutex-busy-wait' implementation of the previous 'Library lab'. The mutex that guarantees mutual exclusion in the critical sections of this Semaphore class is coded in ARMv8 assembly. Two alternatives are provided for the assembly mutex:

  1. A spin-lock approach implemented with the ISA instructions 'ldaxr' and 'stlxr', following the load-link/store-conditional synchronization paradigm.

  2. An extension of the previous approach that uses the ISA instructions 'wfe' and 'sevl' to hint that cores may enter a low-power state under contention.

The folder contains a 'Makefile' and the source code 'main.c'. To compile 'main.c', run:

$prompt> make 

Running 'make' produces two executables that benchmark the Concurrent Bounded Queue with the two underlying mutex implementations: 'main_isa_mtx' and 'main_isa_mtx_sleep'. The former corresponds to the basic mutex implementation and the latter to the extension that hints a low-power state for cores under contention. Note that the assembly code only compiles when 'make' is run on an ARMv8 platform.

$prompt> ./main QUEUE_SIZE N_READERS max_rep
  • QUEUE_SIZE: size of the concurrent bounded queue, measured in the number of items it can hold
  • N_READERS: number of concurrent reader threads for the queue
  • max_rep(>=0): number of times that the entire experiment is repeated
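
The load-link/store-conditional pattern behind the first alternative can be approximated in portable C++. The sketch below is an analogy and not the lab's assembly: `compare_exchange_weak` is allowed to fail spuriously, much like a store-conditional that lost its exclusive reservation, so the acquire path retries in a loop. How a compiler lowers it on ARMv8 depends on the target flags. The names `LlScStyleLock` and `count_with_lock` are hypothetical.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Hypothetical lock that mimics an LL/SC retry loop with a weak CAS.
class LlScStyleLock {
public:
    void lock() {
        for (;;) {
            int expected = 0;
            // Try to move the lock word from 0 (free) to 1 (held) with
            // acquire ordering. A failure means either the lock is held
            // or the weak CAS failed spuriously; both cases retry.
            if (state_.compare_exchange_weak(expected, 1,
                                             std::memory_order_acquire,
                                             std::memory_order_relaxed)) {
                return;
            }
            // Spin on plain loads until the lock looks free again.
            while (state_.load(std::memory_order_relaxed) != 0) {
                std::this_thread::yield();
            }
        }
    }
    // Release ordering publishes the critical section's writes.
    void unlock() { state_.store(0, std::memory_order_release); }

private:
    std::atomic<int> state_{0};
};

// Hypothetical check: n_threads threads increment a shared counter
// `iterations` times each under the lock.
long count_with_lock(int n_threads, long iterations) {
    assert(n_threads >= 1);
    LlScStyleLock lk;
    long counter = 0;
    std::vector<std::thread> workers;
    for (int t = 0; t < n_threads; ++t) {
        workers.emplace_back([&] {
            for (long i = 0; i < iterations; ++i) {
                lk.lock();
                ++counter;
                lk.unlock();
            }
        });
    }
    for (auto& w : workers) w.join();
    return counter;
}
```

The second alternative would replace the inner spin with a wait-for-event hint, which has no portable C++ equivalent and is therefore left out of this sketch.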

Application lab

This folder contains the complete implementation of a ray tracer application (smallpt by Kevin Beason) that relies on the implementations of the previous labs to distribute the rendering process among threads. Work distribution is centralized in a Concurrent Bounded Queue. A main thread splits the render work into pieces and enqueues them. Meanwhile, a set of worker threads dequeue the pieces of work and render the scene in parallel.
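
The work distribution described above can be sketched as follows. This is an illustrative C++ outline and not the repository's code: the row-band split, the `WorkQueue` and `render` names, the fixed queue capacity of 8, and the negative "stop" band are all assumptions, and the per-pixel work is a placeholder instead of actual ray tracing.

```cpp
#include <algorithm>
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <mutex>
#include <queue>
#include <thread>
#include <vector>

struct RowBand { int begin; int end; };  // rows [begin, end); begin < 0 means "stop"

// Hypothetical bounded work queue (mutex + two condition variables).
class WorkQueue {
public:
    explicit WorkQueue(std::size_t capacity) : capacity_(capacity) {
        assert(capacity > 0);
    }
    void enqueue(RowBand b) {
        std::unique_lock<std::mutex> lk(m_);
        not_full_.wait(lk, [this] { return q_.size() < capacity_; });
        q_.push(b);
        not_empty_.notify_one();
    }
    RowBand dequeue() {
        std::unique_lock<std::mutex> lk(m_);
        not_empty_.wait(lk, [this] { return !q_.empty(); });
        RowBand b = q_.front();
        q_.pop();
        not_full_.notify_one();
        return b;
    }
private:
    std::size_t capacity_;
    std::mutex m_;
    std::condition_variable not_full_, not_empty_;
    std::queue<RowBand> q_;
};

// The main thread splits the image into row bands and enqueues them;
// workers dequeue bands and "render" them by marking each pixel once.
std::vector<int> render(int width, int height, int pieces, int n_workers) {
    assert(width > 0 && height > 0 && pieces > 0 && n_workers > 0);
    std::vector<int> image(static_cast<std::size_t>(width) * height, 0);
    WorkQueue queue(8);
    std::vector<std::thread> workers;
    for (int w = 0; w < n_workers; ++w) {
        workers.emplace_back([&] {
            for (RowBand b = queue.dequeue(); b.begin >= 0; b = queue.dequeue()) {
                for (int y = b.begin; y < b.end; ++y)
                    for (int x = 0; x < width; ++x)
                        // Placeholder for tracing the pixel at (x, y).
                        image[static_cast<std::size_t>(y) * width + x] += 1;
            }
        });
    }
    int rows_per_piece = (height + pieces - 1) / pieces;  // ceiling division
    for (int y = 0; y < height; y += rows_per_piece)
        queue.enqueue({y, std::min(y + rows_per_piece, height)});
    for (int w = 0; w < n_workers; ++w) queue.enqueue({-1, -1});  // one stop per worker
    for (auto& t : workers) t.join();
    return image;
}
```

Because the bands are disjoint, workers never write the same pixel, so only the queue itself needs synchronization. After `render(30, 20, 7, 4)`, every one of the 600 pixels should have been visited exactly once.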

To compile the application, run 'make' with the 'Makefile' found at './Application_lab/smallpt/Makefile'. This produces several versions of the parallel ray tracer, which differ in the underlying implementation of the mutex and semaphore objects that enforce correctness in the critical sections of the parallel algorithm.

  • 'smallpt_queue_mutex_def': the default implementation that relies on std::mutex of the C++ standard library.
  • 'smallpt_queue_mutex_d': an implementation based on the futex system call that follows Ulrich Drepper's algorithm.
  • 'smallpt_queue_mutex_b': an implementation based on the futex system call that suspends threads under contention.
  • 'smallpt_queue_mutex_s': an implementation that relies on a spin-lock approach.
  • 'smallpt_queue_mutex_asm': in this implementation, synchronization primitives have been coded in assembly (ARMv8).
  • 'smallpt_queue_mutex_asm_sleep': an improvement of the previous implementation that hints the core to enter a low-power state under contention.

The following is an example command line to run the ray tracer using mutexes based on Ulrich Drepper's algorithm:

$prompt> ./smallpt_queue_mutex_d -scene forest -threads 10 -samples 10 -width 300 -height 300 -divisions 20 -output fout
