Related Content: CS604 - VU Lectures, Handouts, PPT Slides, Assignments, Quizzes, Papers & Books of Operating Systems
The processes in the system execute concurrently and they must be created and deleted dynamically thus the operating system must provide the mechanism for the creation and deletion of processes.
A process may create several new processes via a create-process system call during the course of its execution. The creating process is called a parent process while the new processes are called the children of that process. Each of these new processes may in turn create other processes, forming a tree of processes. Figure 6.1 shows partially the process tree in a UNIX/Linux system.
In general, a process will need certain resources (such as CPU time, memory files,
I/O devices) to accomplish its task. When a process creates a sub process, also known as
a child, that sub process may be able to obtain its resources directly from the operating
system or may be constrained to a subset of the resources of the parent process. The
parent may have to partition its resources among several of its children. Restricting a process to a subset of the parent’s resources prevents a process from overloading the
system by creating too many sub processes.
When a process is created it obtains in addition to various physical and logical
resources, initialization data that may be passed along from the parent process to the child
process. When a process creates a new process, two possibilities exist in terms of
execution:
There are also two possibilities in terms of the address space of the new process:
In order to consider these different implementations let us consider the UNIX
operating system. In UNIX its process identifier identifies a process, which is a unique
integer. A new process is created by the fork system call. The new process consists of a
copy of the address space of the parent. This mechanism allows the parent process to
communicate easily with the child process. Both processes continue execution at the
instruction after the fork call, with one difference, the return code for the fork system
call is zero for the child process, while the process identifier of the child is returned to the
parent process.
Typically the execlp system call is used after a fork system call by one of the
two processes to replace the process’ memory space with a new program. The execlp
system call loads a binary file in memory –destroying the memory image of the program
containing the execlp system call.—and starts its execution. In this manner, the two
processes are able to communicate and then go their separate ways. The parent can then
create more children, or if it has nothing else to do while the child runs, it can issue a
wait system call to move itself off the ready queue until the termination of the child.
The parent waits for the child process to terminate, and then it resumes from the call to
wait where it completes using the exit system call.
A process terminates when it finishes executing its final statement and asks the operating
system to delete it by calling the exit system call. At that point, the process may return
data to its parent process (via the wait system call). All the resources of the process
including physical and virtual memory, open the files and I/O buffers – are de allocated
by the operating system.
Termination occurs under additional circumstances. A process can cause the
termination of another via an appropriate system call (such as abort). Usually only the
parent of the process that is to be terminated can invoke this system call. Therefore
parents need to know the identities of its children, and thus when one process creates
another process, the identity of the newly created process is passed to the parent.
A parent may terminate the execution of one of its children for a variety of reasons,
such as:
Considering an example from UNIX, we can terminate a process by using the exit system call, its parent process may wait for the termination of a child process by using the wait system call. The wait system call returns the process identifier of a terminated child, so that the parent can tell which of its possibly many children has terminated. If the parent terminates however all its children have assigned as their new parent, the init process. Thus the children still have a parent to collect their status and execution statistics.
When the fork system call is executed, a new process is created. The original process is
called the parent process whereas the process is called the child process. The new process
consists of a copy of the address space of the parent. This mechanism allows the parent
process to communicate easily with the child process. On success, both processes
continue execution at the instruction after the fork call, with one difference, the return
code for the fork system call is zero for the child process, while the process identifier
of the child is returned to the parent process. On failure, a -1 will be returned in the
parent's context, no child process will be created, and an error number will be set
appropriately.
The synopsis of the fork system call is as follows:
#include <sys/types.h>
#include <unistd.h>
pid_t fork(void);
Figure 6.2 shows sample code, showing the use of the fork() system call and
Figure 6.3 shows the semantics of the fork system call. As shown in Figure 6.3, fork() creates an exact memory image of the parent process and returns 0 to the child process
and the process ID of the child process to the parent process.
After the fork() system call the parent and the child share the following:
The following things are different in the parent and the child:
The fork() system may fail due to a number of reasons. One reason maybe that the
maximum number of processes allowed to execute under one user has exceeded, another
could be that the maximum number of processes allowed on the system has exceeded.
Yet another reason could be that there is not enough swap space.