CS 2200 Systems and Networking

CS2200 Introduction to Systems and Networks
Project 4 Implementing Processor Scheduler using pthreads


Overview

This project serves a dual purpose:
  1. You will build a simulator of a simple operating system to learn more about operating systems in general and process schedulers in specific.
  2. You will do so using threads to learn about multithreading and synchronization.
Before you begin, take note of the following: In this project, you will implement a multiprocessor operating system simulator using pthreads. The framework for the multithreaded OS simulator is nearly complete, but missing one critical component: the CPU scheduler! Your task is to implement the CPU scheduler, using three different scheduling algorithms.

We have provided you with source files that provide the framework for your simulator. You will only need to modify answers.txt and student.c. However, just because you are only modifying two files doesn't mean that you should ignore the other ones - there is helpful information in the other files.
We have provided you these files:
Scheduling Algorithms

For your simulator, you will implement the following three CPU scheduling algorithms: Process States

In our OS simulation, there are five possible states for a process, which are listed in the process_state_t enum in os-sim.h:

There is a field named state in the PCB, which must be updated with the current state of the process. The simulator will use this field to collect statistics.
Process States
Figure 1: Process States


The Ready Queue

On most systems, there are a large number of processes, but only one or two CPUs on which to execute them. When there are more processes ready to execute than CPUs, processes must wait in the READY state until a CPU becomes available. To keep track of the processes waiting to execute, we keep a ready queue of the processes in the READY state.

Since the ready queue is accessed by multiple processors, which may add and remove processes from the ready queue, the ready queue must be protected by some form of synchronization--for this project, it will be a mutex lock.

Scheduling Processes

schedule() is the core function of the CPU scheduler. It is invoked whenever a CPU becomes available for running a process. schedule() must search the ready queue, select a runnable process, and call the context_switch() function to switch the process onto the CPU.

There is a special process, the idle process, which is scheduled whenever there are no processes in the READY state.

CPU Scheduler Invocation

There are four events which will cause the simulator to invoke schedule():

  1. yield() - A process completes its CPU operations and yields the processor to perform an I/O request.
  2. wake_up() - A process that previously yielded completes its I/O request, and is ready to perform CPU operations. wake_up() is also called when a process in the NEW state becomes runnable.
  3. preempt() - When using a Round-Robin or Static Priority scheduling algorithm, a CPU-bound process may be preempted before it completes its CPU operations.
  4. terminate() - A process exits or is killed.
The CPU scheduler also contains one other important function: idle(). idle() contains the code that gets by the idle process. In the real world, the idle process puts the processor in a low-power mode and waits. For our OS simulation, you will use a pthread condition variable to block the thread until a process enters the ready queue.

The Simulator

We will use pthreads to simulate an operating system on a multiprocessor computer. We will use one thread per CPU and one thread as a "supervisor" for our simulation. The CPU threads will simulate the currently-running processes on each CPU, and the supervisor thread will print output and dispatch events to the CPU threads.

Since the code you write will be called from multiple threads, the CPU scheduler you write must be thread-safe! This means that all data structures you use, including your ready queue, must be protected using mutexes.

The number of CPUs is specified as a command-line parameter to the simulator. For this project, you will be performing experiments with 1, 2, and 4 CPU simulations.

Also, for demonstration purposes, the simulator executes much slower than a real system would. In the real world, a CPU burst might range from one to a few hundred milliseconds, whereas in this simulator, they range from 0.2 to 2.0 seconds.

Simulator Function Calls
Figure 2: Simulator Function Calls

Sample Output

Compile and run the simulator with ./os-sim 2. After a few seconds, hit Control-C to exit. You will see the output below:
      Time  Ru Re Wa      CPU 0    CPU 1        < I/O Queue <

      ===== == == ==      ======== ========     =============

      0.0   0  0  0       (IDLE)   (IDLE)        < < 

      0.1   0  0  0       (IDLE)   (IDLE)        < < 

      0.2   0  0  0       (IDLE)   (IDLE)        < <

      0.3   0  0  0       (IDLE)   (IDLE)        < <

      0.4   0  0  0       (IDLE)   (IDLE)        < <

      0.5   0  0  0       (IDLE)   (IDLE)        < <

      0.6   0  0  0       (IDLE)   (IDLE)        < <

      0.7   0  0  0       (IDLE)   (IDLE)        < <

      0.8   0  0  0       (IDLE)   (IDLE)        < <

      0.9   0  0  0       (IDLE)   (IDLE)        < <

      1.0   0  0  0       (IDLE)   (IDLE)        < <

      ......
The simulator generates a Gantt Chart, showing the current state of the OS at every 100ms interval. The leftmost column shows the current time, in seconds. The next three columns show the number of Running, Ready, and Waiting processes, respectively. The next two columns show the process currently running on each CPU. The rightmost column shows the processes which are currently in the I/O queue, with the head of the queue on the left and the tail of the queue on the right.

As you can see, nothing is executing. This is because we have no CPU scheduler to select processes to execute! Once you complete Problem 1 and implement a basic FCFS scheduler, you will see the processes executing on the CPUs.

Test Processes

For this simulation, we will use a series of eight test processes, five CPU-bound and three I/O-bound. For simplicity, we have labelled each starting with a "C" or "I" to indicate CPU-bound or I/O-bound.

PIDProcess NameCPU / I/O-boundPriorityStart Time
0IapacheI/O-bound80.0 s
1IbashI/O-bound71.0 s
2ImozillaI/O-bound72.0 s
3CcpuCPU-bound53.0 s
4CgccCPU-bound14.0 s
5CspiceCPU-bound25.0 s
6CmysqlCPU-bound36.0 s
7CsimCPU-bound47.0 s

For this project, priorities range from 0 to 10, with 10 being the highest priority. Note that the I/O-bound processes have been given higher priorities than the CPU-bound processes.

Problem 1: FCFS Scheduler

A. [50 points] - Implement the CPU scheduler using the FCFS scheduling algorithm. You may do this however you like, however, we suggest the following:

Once you successfully complete this portion of the project, test your code with ./os-sim 1, and you should see output similar to the following:
 

      Time  Ru Re Wa      CPU 0         I/O Queue 

      ===== == == ==      ========     =============

      0.0   0  0  0       (IDLE)       < <

      0.1   1  0  0       Iapache      < <

      0.2   1  0  0       Iapache      < <

      0.3   1  0  0       Iapache      < <

      0.4   0  0  1       (IDLE)       < Iapache <

      0.5   0  0  1       (IDLE)       < Iapache <

      0.6   1  0  0       Iapache      < <

      0.7   1  0  0       Iapache      < <

      0.8   1  0  0       Iapache      < <

      0.9   1  0  0       Iapache      < <

      1.0   0  0  1       (IDLE)       < Iapache <

      1.1   1  0  1       Ibash        < Iapache <

      1.2   1  0  1       Ibash        < Iapache <

      1.3   1  0  1       Ibash        < Iapache <

      1.4   1  0  1       Ibash        < Iapache <

      1.5   1  0  1       Iapache      < Ibash <

      1.6   1  0  1       Iapache      < Ibash <

      1.7   0  0  2       (IDLE)       < Ibash Iapache <

      1.8   0  0  2       (IDLE)       < Ibash Iapache <

      1.9   0  0  2       (IDLE)       < Ibash Iapache <

      2.0   1  0  1       Ibash        < Iapache <

      ....

      66.9  1  1  0       Ibash        < <

      67.0  1  1  0       Ibash        < <

      67.1  1  1  0       Ibash        < <

      67.2  1  0  0       Imozilla     < <

      67.3  1  0  0       Imozilla     < <

      67.4  1  0  0       Imozilla     < <

      67.5  1  0  0       Imozilla     < <

      

      

      # of Context Switches: 97

      Total execution time: 67.6 s

      Total time spent in READY state: 389.9 s


B. [10 points] Run your OS simulation with 1, 2, and 4 CPUs. Compare the total execution time of each. Is there a linear relationship between the number of CPUs and total execution time? Why or why not?

Problem 2: Round-Robin Scheduler

A. [10 points] Add Round-Robin scheduling functionality to your code. You should modify main() to add a command line option, -r, which selects the Round-Robin scheduling algorithm, and accepts a parameter, the length of the timeslice. For this project, timeslices are measured in tenths of seconds. E.g.:

./os-sim <# CPUs> -r 5

should run a Round-Robin scheduler with timeslices of 500 ms. While:

./os-sim <# of CPUs>

should continue to run a FCFS scheduler.

To specify a timeslice when scheduling a process, use the timeslice parameter of context_switch(). The simulator will automatically preempt the process and call your preempt() handler if the process executes on the CPU for the length of the timeslice without terminating or yielding for I/O.

B. [10 points] Run your Round-Robin scheduler with timeslices of 800ms, 600ms, 400ms, and 200ms. Use only one CPU for your tests. Compare the statistics at the end of the simulation. Show that the total waiting time decreases with shorter timeslices. However, in a real OS, the shortest timeslice possible is usually not the best choice. Why not?

Problem 3: Static Priority Scheduling

A. [10 points] Add Static Priority scheduling to your code. Modify main() to accept the -p parameter to select the Static Priority algorithm. The -r and default FCFS scheduler should continue to work.

The scheduler should use the priority specified in the static_priority field of the PCB. This priority is a value from 0 to 10, with 0 being the lowest priority and 10 being the highest priority.

For Static Priority scheduling, you will need to make use of the current[] array and force_preempt() function. The current[] array should be used to keep track of the process currently executing on each CPU. Since this array is accessed by multiple CPU threads, it must be protected by a mutex. current_mutex has been provided for you.

The force_preempt() function preempts a running process before its timeslice expires. Your wake_up() handler should make use of this function to preempt a lower priority process when a higher priority process needs a CPU.

B. [10 points] The Shortest-Job First (SJF) scheduling algorithm is proven to have the optimal average waiting time. However, it is only a theoretical algorithm; it cannot be implemented in a typical CPU scheduler, because the scheduler does not have advance knowledge of the length of each CPU burst.

Run each of your three scheduling algorithms (using one CPU), and compare the total waiting times. Which algorithm is the closest approximation of SJF? Why?

Assignment Submission



Be sure to turn in the following files, plus any more needed to make your project run. Failure to include all files will result in lost points. Turning in a project that does not compile will result in a zero. Remember, turn in everything that is required by your makefile. If your makefile doesn't work, we can't grade your project.
Keep your answers detailed enough to cover the question, including support from simulator results if appropriate. Don't write a book; but if you're not sure about an answer, err on the side of giving us too much information.
End of CS 2200 Project 4