Scheduling

Operating Systems: Internals and Design Principles
Scheduling
Based on William Stallings, Chapters 9 and 10
I take a two hour nap, from one oclock to four.

Yogi Berra
Processor Scheduling
Aim is to assign processes to be executed by the processor in a way that meets system objectives, such as response time, throughput, and processor efficiency Broken down into three separate functions:
long term scheduling
medium term scheduling
short term scheduling
Long-term scheduling Medium-term scheduling
The decision to add to the pool of processes to be executed The decision to add to the number of processes that are partially or fully in main memory The decision as to which available process will be executed by the processor The decision as to which process' pending I/O request shall be handled by an available I/O device
Short-term scheduling I/O scheduling
Scheduling and Process State Transitions
Figure 9.2 Nesting of Scheduling Functions

(Referencing figure 3.9b)
Q u e u i n g D i a g r a m
Long-Term Scheduler
Determines which programs are admitted to the system for processing Controls the degree of multiprogramming the more processes that are created, the smaller the percentage of time that each process can be executed may limit to provide satisfactory service to the current set of processes
Creates processes from the queue when it can, but must decide:
when the operating system can take on one or more additional processes
which jobs to accept and turn into processes
first come, first served
priority, expected execution time, I/O requirements
Medium-Term Scheduling

Part of the swapping function Swapping-in decisions are based on the need to manage the degree of multiprogramming
considers the memory requirements of the swapped-out processes
Short-Term Scheduling

Interacting with the dispatcher Executes most frequently Makes the fine-grained decision of which process to execute next Invoked when an event occurs that may lead to the blocking of the current process or that may provide an opportunity to preempt a currently running process in favour of another
Examples: Clock interrupts I/O interrupts Operating system calls Signals (e.g., semaphores)
Short Term Scheduling Criteria
Main objective is to allocate processor time to optimise certain aspects of system behaviour A set of criteria is needed to evaluate the scheduling policy
User-oriented criteria
relate to the behaviour of the system as perceived by the individual user or process (such as response time in an interactive system) important on virtually all systems
System-oriented criteria
focus in on effective and efficient utilisation of the processor (rate at which processes are completed) generally of minor importance on single-user systems
Short-Term Scheduling Criteria: Performance

examples:
response time throughput
Criteria can be classified into:
example:
predictability
Performance-related
Non-performance related
quantitative
easily measured
qualitative
hard to measure
Table 9.2a User-Oriented Scheduling Criteria
User Oriented, Performance Related Turnaround time This is the interval of time between the submission of a process and its completion. Includes actual execution time plus time spent waiting for resources, including the processor. This is an appropriate measure for a batch job. Response time For an interactive process, this is the time from the submission of a request until the response begins to be received. Often a process can begin producing some output to the user while continuing to process the request. Thus, this is a better measure than turnaround time from the user's point of view. The scheduling discipline should attempt to achieve low response time and to maximize the number of interactive users receiving acceptable response time. Deadlines When process completion deadlines can be specified, the scheduling discipline should subordinate other goals to that of maximising the percentage of deadlines met. User Oriented, Other Predictability A given job should run in about the same amount of time and at about the same cost regardless of the load on the system. A wide variation in response time or turnaround time is distracting to users. It may signal a wide swing in system workloads or the need for system tuning to cure instabilities.
Table 9.2b SystemOriented Scheduling Criteria
System Oriented, Performance Related Throughput The scheduling policy should attempt to maximise the number of processes completed per unit of time. This is a measure of how much work is being performed. This clearly depends on the average length of a process but is also influenced by the scheduling policy, which may affect utilization. Processor utilisation This is the percentage of time that the processor is busy. For an expensive shared system, this is a significant criterion. In single-user systems and in some other systems, such as real-time systems, this criterion is less important than some of the others. System Oriented, Other Fairness In the absence of guidance from the user or other systemsupplied guidance, processes should be treated the same, and no process should suffer starvation. Enforcing priorities When processes are assigned priorities, the scheduling policy should favour higher-priority processes. Balancing resources The scheduling policy should keep the resources of the system busy. Processes that will under-utilise stressed resources should be favoured. This criterion also involves medium-term and long-term scheduling.
Priority Queuing
Determines which process, among ready processes, is selected next for execution May be based on priority, resource requirements, or the execution characteristics of the process If based on execution characteristics then important quantities are:
w = time spent in system so far, waiting e = time spent in execution so far s = total service time required by the process, including e; generally, this
quantity must be estimated or supplied by the user
Specifies the
instants in time at which the selection function is exercised
Two categories:
Non-preemptive Preemptive
Non-preemptive

Preemptive
currently running process may be interrupted and moved to ready state by the OS preemption may occur when new process arrives, on an interrupt, or periodically
once a process is in the running state, it will continue until it terminates or blocks itself for I/O
Table 9.4 Process Scheduling Example
Simplest scheduling policy Also known as first-in-first-out (FIFO) or a strict queuing scheme When the current process ceases to execute, the process longest in the Ready queue is selected
Performs much better for long processes than short ones Tends to favour processor-bound processes over I/O-bound processes
Uses preemption based on a clock Also known as time slicing because each process is given a slice of time before being preempted Principal design issue is the length of the time quantum, or slice, to be used
Particularly effective in a generalpurpose time-sharing system or transaction processing system One drawback is its relative treatment of processor-bound versus I/O-bound processes
Figure 9.6a
Effect of Size of Preemption Time Quantum
Figure 9.6b Effect of Size of Preemption Time Quantum
Virtual Round Robin (VRR)
Feedback Scheduling
Feedback Performance
Non-preemptive policy in which the process with the shortest expected processing time is selected next A short process will jump to the head of the queue Possibility of starvation for longer processes
One difficulty is the need to know, or at least estimate, the required processing time of each process If the programmers estimate is substantially under the actual running time, the system may abort the job
Attempt to predict future run time based on past Average run time: problem with varying execution times! Exponential averageing puts more emphasis on the recent past
Tn actual run time of nth execution Sn nth estimation of execution time exponential smoothing coefficient
Exponential Smoothing Coefficients
Use Of Exponential Averageing
Use Of Exponential Averageing
Preemptive version of SPN Scheduler always chooses the process that has the shortest expected remaining processing time Risk of starvation of longer processes
Should give superior turnaround time performance to SPN because a short job is given immediate preference to a running longer job
Chooses next process with the greatest response ratio Attractive because it accounts for the age of the process
While shorter jobs are favoured, ageing without service increases the ratio so that a longer process will eventually get past competing shorter jobs
Table 9.5 Comparison of Scheduling Policies
Scheduling Policies Summary
Performance Comparison
Any scheduling discipline that chooses the next item to be served independent of service time obeys the relationship:
Table 9.6 Formulas for SingleServer Queues with Two Priority Categories
Overall Normalised Response Time
Normalised Response Time for Processes
Shorter
Normalised Response Time for Longer Processes
Results
Simulation
Fair-Share Scheduling
Scheduling Each
decisions based on the process sets
user is assigned a share of the processor
Objective
is to monitor usage to give fewer resources to users who have had more than their fair share and more to those who have had less than their fair share
Fair-Share Scheduler
Traditional UNIX Scheduling
Used in both SVR3 and 4.3 BSD UNIX
these systems are primarily targeted at the time-sharing interactive environment
Designed to provide good response time for interactive users while ensuring that low-priority background jobs do not starve Employs multilevel feedback using round robin within each of the priority queues Makes use of one-second preemption Priority is based on process type and execution history
Scheduling Formula
Bands
Used to optimise access to block devices and to allow the operating system to respond quickly to system calls In decreasing order of priority, the bands are:
Swapper
Block I/O device control File manipulation Character I/O device control User processes
Example of Traditional UNIX Process Scheduling
The operating system must make three types of scheduling decisions with respect to the execution of processes: Long-term determines when new processes are admitted to the system Medium-term part of the swapping function and determines when a program is brought into main memory so that it may be executed Short-term determines which ready process will be executed next by the processor From a users point of view, response time is generally the most important characteristic of a system From a system point of view, throughput or processor utilisation is important Algorithms:
FCFS, Round Robin, SPN, SRT, HRRN, Feedback
Multiprocessor and Real-Time Scheduling

based on William Stallings, Chapter 10

Bear in mind, Sir Henry, one of the phrases in that queer old legend which Dr. Mortimer has read to us, and avoid the moor in those hours of darkness when the powers of evil are exalted.
THE HOUND OF THE BASKERVILLES, Arthur Conan Doyle
Loosely coupled or distributed multiprocessor, or cluster

consists of a collection of relatively autonomous systems, each processor having its own main memory and I/O channels
Functionally specialised processors

there is a master, general-purpose processor; specialised processors are controlled by the master processor and provide services to it
Tightly coupled multiprocessor

consists of a set of processors that share a common main memory and are under the integrated control of an operating system
Synchronisation Granularity and Processes

Grain Size Fine Medium Coarse Very Coarse Description Parallelism inherent in a single instruction stream. Parallel processing or multitasking within a single application Multiprocessing of concurrent processes in a multiprogramming environment Distributed processing across network nodes to form a single computing environment Multiple unrelated processes Synchronization Interval (Instructions) <20 20-200 200-2000 2000-1M
Independent
not applicable
No explicit synchronisation among processes
each represents a separate, independent application or job
each user is performing a particular application
Typical use is in a time-sharing system
multiprocessor provides the same service as a multiprogrammed uniprocessor because more than one processor is available, average response time to the users will be less
Synchronisation among processes, but at a very gross level Good for concurrent processes running on a multiprogrammed uniprocessor
can be supported on a multiprocessor with little or no change to user software
Single application can be effectively implemented as a collection of threads within a single process
programmer must explicitly specify the potential parallelism of an application there needs to be a high degree of coordination and interaction among the threads of an application, leading to a medium-grain level of synchronisation
Because the various threads of an application interact so frequently, scheduling decisions concerning one thread may affect the performance of the entire application
Represents a much more complex use of parallelism than is found in the use of threads Is a specialised and fragmented area with many different approaches
Scheduling on a multiprocessor involves three interrelated issues:
The approach taken will depend on the degree of granularity of applications and the number of processors available
actual dispatching of a process
use of multiprogramming on individual processors
assignment of processes to processors
Assuming all processors are equal, it is simplest to treat processors as a pooled resource and assign processes to processors on demand If a process is permanently assigned to one processor from activation until its completion, then a dedicated short-term queue is maintained for each processor
static or dynamic needs to be determined
advantage is that there may be less overhead in the scheduling function
allows group or gang scheduling
A disadvantage of static assignment is that one processor can be idle, with an empty queue, while another processor has a backlog

to prevent this situation, a common queue can be used another option is dynamic load balancing
Both
dynamic and static methods require some way of assigning a process to a processor

Approaches:
Master/Slave Peer
Key kernel functions always run on a particular processor Master is responsible for scheduling Slave sends service request to the master Is simple and requires little enhancement to a uniprocessor multiprogramming operating system Conflict resolution is simplified because one processor has control of all memory and I/O resources
Disadvantages: failure of master brings down whole system master can become a performance bottleneck
Kernel can execute on any processor Each processor does self-scheduling from the pool of available processes
Complicates the operating system operating system must ensure that two processors do not choose the same process and that the processes are not somehow lost from the queue
Usually processes are not dedicated to processors A single queue is used for all processors
if some sort of priority scheme is used, there are multiple queues based on priority
System is viewed as being a multi-server queuing architecture
for One and Two Processors
Comparison of Scheduling Performance
Thread execution is separated from the rest of the definition of a process An application can be a set of threads that cooperate and execute concurrently in the same address space On a uniprocessor, threads can be used as a program structuring aid and to overlap I/O with processing In a multiprocessor system threads can be used to exploit true parallelism in an application Dramatic gains in performance are possible in multi-processor systems Small differences in thread management and scheduling can have an impact on applications that require significant interaction among threads
processes are not assigned to a particular processor
a set of related thread scheduled to run on a set of processors at the same time, on a one-to-one basis
Load Sharing
Four approaches for multiprocessor thread scheduling and processor assignment are:
Gang Scheduling
provides implicit scheduling defined by the assignment of threads to processors
the number of threads in a process can be altered during the course of execution
Dedicated Processor Assignment
Dynamic Scheduling
Simplest approach and carries over most directly from a uniprocessor environment
Advantages: load is distributed evenly across the processors no centralised scheduler required the global queue can be organised and accessed using any of the schemes discussed in Chapter 9
Versions of load sharing:

first-come-first-served smallest number of threads first preemptive smallest number of threads first
Central queue occupies a region of memory that must be accessed in a manner that enforces mutual exclusion
can lead to bottlenecks
Preemptive threads are unlikely to resume execution on the same processor
caching can become less efficient
If all threads are treated as a common pool of threads, it is unlikely that all of the threads of a program will gain access to processors at the same time
the process switches involved may seriously compromise performance
Simultaneous scheduling of the threads that make up a single process
Benefits: synchronisation blocking may be reduced, less process switching may be necessary, and performance will increase scheduling overhead may be reduced
Useful for medium-grained to fine-grained parallel applications whose performance severely degrades when any part of the application is not running while other parts are ready to run Also beneficial for any parallel application
Figure 10.2 Example of Scheduling Groups With Four and One Threads
When an application is scheduled, each of its threads is assigned to a processor that remains dedicated to that thread until the application runs to completion If a thread of an application is blocked waiting for I/O or for synchronisation with another thread, then that threads processor remains idle
there is no multiprogramming of processors in a highly parallel system, with tens or hundreds of processors, processor utilisation is no longer so important as a metric for effectiveness or performance the total avoidance of process switching during the lifetime of a program should result in a substantial speedup of that program
Defense of this strategy:
Figure 10.3
Application Speedup as a Function of Number of Threads
For some applications it is possible to provide language and system tools that permit the number of threads in the process to be altered dynamically
this would allow the operating system to adjust the load to improve utilisation
Both the operating system and the application are involved in making scheduling decisions The scheduling responsibility of the operating system is primarily limited to processor allocation This approach is superior to gang scheduling or dedicated processor assignment for applications that can take advantage of it
The operating system, and in particular the scheduler, is perhaps the most important component
control of laboratory experiments process control in industrial plants robotics air traffic control telecommunications military command and control systems
Examples:
Correctness of the system depends not only on the logical result of the computation but also on the time at which the results are produced Tasks or processes attempt to control or react to events that take place in the outside world These events occur in real time and tasks must be able to keep up with them
Hard real-time task

Soft real-time task
one that must meet its deadline otherwise it will cause unacceptable damage or a fatal error to the system
Has an associated deadline that is desirable but not mandatory It still makes sense to schedule and complete the task even if it has passed its deadline
Periodic
tasks
requirement may be stated as:

once per period T exactly T units apart
Aperiodic

tasks
has a deadline by which it must finish or start may have a constraint on both start and finish time
Real-time operating systems have requirements in five general areas:

Determinism Responsiveness User control Reliability Fail-soft operation
Concerned with how long an operating system delays before acknowledging an interrupt Operations are performed at fixed, predetermined times or within predetermined time intervals
when multiple processes are competing for resources and processor time, no system will be fully deterministic
The extent to which an operating system can deterministically satisfy requests depends on:
the speed with which it can respond to interrupts
whether the system has sufficient capacity to handle all requests within the required time
Together with determinism make up the response time to external events
critical for real-time systems that must meet timing requirements imposed by individuals, devices, and data flows external to the system
Concerned with how long it takes a system to service the event
Responsiveness includes: amount of time required to initially handle the interrupt and begin execution of the interrupt service routine (ISR) amount of time required to perform the ISR effect of interrupt nesting
Generally much broader in a real-time operating system than in ordinary operating systems It is essential to allow the user fine-grained control over task priority User should be able to distinguish between hard and soft tasks and to specify relative priorities within each class May allow user to specify such characteristics as:
paging or process swapping what processes must always be resident in main memory what disk transfer algorithms are to be used what rights the processes in various priority bands have
More important for real-time systems than non-real time systems Real-time systems respond to and control events in real time so loss or degradation of performance may have catastrophic consequences such as:

financial loss major equipment damage loss of life
A characteristic that refers to the ability of a system to fail in such a way as to preserve as much capability and data as possible Important aspect is stability a real-time system is stable if the system will meet the deadlines of its most critical, highest-priority tasks even if some less critical task deadlines are not always met
Real-Time
Scheduling of
Process
whether a system performs schedulability analysis Scheduling approaches depend on:
if it does, whether it is done statically or dynamically
whether the result of the analysis itself produces a scheduler plan according to which tasks are dispatched at run time
Static table-driven approaches

performs a static analysis of feasible schedules of dispatching result is a schedule that determines, at run time, when a task must begin execution
Static priority-driven preemptive approaches

a static analysis is performed but no schedule is drawn up analysis is used to assign priorities to tasks so that a traditional priority-driven preemptive scheduler can be used
Dynamic planning-based approaches

feasibility is determined at run time rather than offline prior to the start of execution one result of the analysis is a schedule or plan that is used to decide when to dispatch this task
Dynamic best effort approaches

no feasibility analysis is performed system tries to meet all deadlines and aborts any started process whose deadline is missed
Real-time operating systems are designed with the objective of starting real-time tasks as rapidly as possible and emphasize rapid interrupt handling and task dispatching Real-time applications are generally not concerned with sheer speed but rather with completing (or starting) tasks at the most valuable times Priorities provide a crude tool and do not capture the requirement of completion (or initiation) at the most valuable time
Ready time
time task becomes ready for execution
Resource resources required by the requirements task while it is executing
Starting deadline Completion deadline Processing time
time task must begin

Priority measures relative importance of the task
time task must be completed time required to execute the task to completion
Subtask scheduler a task may be decomposed into a mandatory subtask and an optional subtask
Table 10.2 Execution Profile of Two Periodic Tasks
Figure 10.5 Scheduling of Periodic Real-Time Tasks With Completion Deadlines (Based on Table 10.2)
Figure 10.6 Scheduling of Aperiodic Real-Time Tasks With Starting Deadlines
Table 10.3 Execution Profile of Five Aperiodic Tasks
Rate Monotonic Scheduling
Figure 10.7
Periodic Task Timing Diagram
Figure 10.8
Value of the RMS Upper Bound Table 10.4
Can occur in any priority-based preemptive scheduling scheme Particularly relevant in the context of real-time scheduling Best-known instance involved the Mars Pathfinder mission Occurs when circumstances within the system force a higher priority task to wait for a lower priority task
Unbounded Priority Inversion the duration of a priority inversion depends not only on the time required to handle a shared resource, but also on the unpredictable actions of other unrelated tasks
Unbounded Priority Inversion
Priority Inheritance
The three classes are:

SCHED_FIFO: First-in-first-out real-time threads SCHED_RR: Round-robin real-time threads SCHED_OTHER: Other, non-real-time threads
Within each class multiple priorities may be used
Linux Real-Time Scheduling
The Linux 2.4 scheduler for the SCHED_OTHER class did not scale well with increasing number of processors and processes
Time to select the appropriate process and assign it to a processor is constant regardless of the load on the system or number of processors
Linux 2.6 uses a new priority scheduler known as the O(1) scheduler
Kernel maintains two scheduling data structures for each processor in the system
Linux Scheduling Data Structures
Figure 10.11
A complete overhaul of the scheduling algorithm used in earlier UNIX systems
The new algorithm is designed to give: highest preference to real-time processes next-highest preference to kernel-mode processes lowest preference to other user-mode processes
Major modifications:
addition of a preemptable static priority scheduler and the introduction of a set of 160 priority levels divided into three priority classes insertion of preemption points
SVR Priority Classes
Figure 10.12
Real time (159 100)

guaranteed to be selected to run before any kernel or time-sharing process
Kernel (99 60)

guaranteed to be selected to run before any time-sharing process, but must defer to real-time processes
Time-shared (59-0)
lowest-priority processes, intended for user applications other than real-time applications
can preempt kernel and user processes
SVR4 Dispatch Queues
Figure 10.13
UNIX FreeBSD Scheduler
FreeBSD scheduler was designed to provide effective scheduling for a SMP or multicore system Design goals:
address the need for processor affinity in SMP and multicore systems
processor affinity a scheduler that only migrates a thread when necessary to avoid having an idle processor
provide better support for multithreading on multicore systems improve the performance of the scheduling algorithm so that it is no longer a function of the number of threads in the system
Windows Thread Dispatching Priorities
Figure 10.14
A thread is considered to be interactive if the ratio of its voluntary sleep time versus its runtime is below a certain threshold Interactivity threshold is defined in the scheduler code and is not configurable Threads whose sleep time exceeds their run time score in the lower half of the range of interactivity scores Threads whose run time exceeds their sleep time score in the upper half of the range of interactivity scores
Processor affinity is when a Ready thread is scheduled onto the last processor that it ran on
significant because of local caches dedicated to a single processor

an idle processor steals a thread from an nonidle processor primarily useful when there is a light or sporadic load or in situations where processes are starting and exiting very frequently a periodic scheduler task evaluates the current load situation and evens it out ensures fairness among the runnable threads
Pull Mechanism FreeBSD scheduler supports two mechanisms for thread migration to balance load: Push Mechanism
Priorities in Windows are organized into two bands or classes:

real time priority class all threads have a fixed priority that never changes all of the active threads at a a given priority level are in a round-robin queue variable priority class a threads priority begins an initial priority value and then may be temporarily boosted during the threads lifetime
Each band consists of 16 priority levels Threads requiring immediate attention are in the real-time class
include functions such as communications and real-time tasks
Windows Priority Relationship
Figure 10.15
With a tightly coupled multiprocessor, multiple processors have access to the same main memory Performance studies suggest that the differences among various scheduling algorithms are less significant in a multiprocessor system A real-time process is one that is executed in connection with some process or function or set of events external to the computer system and that must meet one or more deadlines to interact effectively and correctly with the external environment A real-time operating system is one that is capable of manageing real-time processes Key factor is the meeting of deadlines Algorithms that rely heavily on preemption and on reacting to relative deadlines are appropriate in this context

Scheduling

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Scheduling

Uploaded by

Copyright:

Available Formats

Operating Systems: Internals and Design Principles

Operating Systems: Internals and Design Principles

I take a two hour nap, from one oclock to four.

long term scheduling

medium term scheduling

short term scheduling

Long-term scheduling Medium-term scheduling

Short-term scheduling I/O scheduling

Scheduling and Process State Transitions

Figure 9.2 Nesting of Scheduling Functions

which jobs to accept and turn into processes

first come, first served

priority, expected execution time, I/O requirements

considers the memory requirements of the swapped-out processes

Short Term Scheduling Criteria

Short-Term Scheduling Criteria: Performance

Table 9.2a User-Oriented Scheduling Criteria

Table 9.2b SystemOriented Scheduling Criteria

Table 9.4 Process Scheduling Example

Figure 9.6b Effect of Size of Preemption Time Quantum

Virtual Round Robin (VRR)

Exponential Smoothing Coefficients

Use Of Exponential Averageing

Use Of Exponential Averageing

Table 9.5 Comparison of Scheduling Policies

Scheduling Policies Summary

Overall Normalised Response Time

Normalised Response Time for Processes

Normalised Response Time for Longer Processes

decisions based on the process sets

user is assigned a share of the processor

Traditional UNIX Scheduling

Used in both SVR3 and 4.3 BSD UNIX

these systems are primarily targeted at the time-sharing interactive environment

Example of Traditional UNIX Process Scheduling

FCFS, Round Robin, SPN, SRT, HRRN, Feedback

Operating Systems: Internals and Design Principles

Multiprocessor and Real-Time Scheduling

Operating Systems: Internals and Design Principles

Loosely coupled or distributed multiprocessor, or cluster

Functionally specialised processors

Tightly coupled multiprocessor

Synchronisation Granularity and Processes

No explicit synchronisation among processes

each represents a separate, independent application or job

each user is performing a particular application

Typical use is in a time-sharing system

can be supported on a multiprocessor with little or no change to user software

Scheduling on a multiprocessor involves three interrelated issues:

actual dispatching of a process

use of multiprogramming on individual processors

assignment of processes to processors

static or dynamic needs to be determined

advantage is that there may be less overhead in the scheduling function

allows group or gang scheduling

System is viewed as being a multi-server queuing architecture

for One and Two Processors

Comparison of Scheduling Performance

processes are not assigned to a particular processor

provides implicit scheduling defined by the assignment of threads to processors

Dedicated Processor Assignment

Versions of load sharing:

can lead to bottlenecks