Shell (Part 1). Process r A process is an instance of an application running r If there are two instances of an application running then there are two

Shell (Part 1)

Process

A process is an instance of an application running

If there are two instances of an application running then there are two processes

Example: Let us say there are two users on gaul Both run grep on a file This results in two processes

Process

A process is more than the program code (sometimes known as text section)

Process information includes: Process stack: Includes temporary data

(function parameters, return addresses and local variables)

Data Section: Includes global variables Heap: Memory that is dynamically allocated

during process run-time

Process in Memory

Process State

As a process executes it changes state. Possible states:

New: The process is created Running: Instructions are being executed Waiting: The process is waiting for some

event to occur e.g., I/O completion Terminated: Process has finished execution

Process Control Block Each process is represented in the

operating system by a process control block Information related to program execution Context information including:

• Process Identifier (PID)• Process state• Program counter and other program related

information• CPU registers• CPU-Scheduling information• Memory-Management information• I/O status information

Operating System Services

An operating system provides an environment for the execution of programs

Services are provided to programs and users of programs

Operating System Services OS services (User perspective)

User interface: Command-Line (CLI), Graphics User Interface (GUI)

Program execution: OS needs to load a program into memory, runs that program, end execution, either normally or abnormally (indicating error)

I/O operations: A running program may require I/O, which may involve a file or an I/O device

File-system manipulation: Need to read and write files and directories, create and delete them, search

Operating System Services

OS provides operations for ensuring efficient operation. Examples: Resource allocation Accounting Protection and Security

The Shell The command interpreter is a program that

accepts commands from the user and interprets the commands (type of user interface)

The shell refers to an executing command interpreter Accepts commands such as “ls” or “ps” The commands make use of system calls.

A system call allows a command to request a service from the operating system.

A shell is a process A process e.g., a program being executed

• We will overuse the term “shell” to refer to the program as well as the process

The Shell

There are different shells that you can use in a Unix-based system including: bourne shell C shell bash shell tcsh shell and many more

The Shell

When a user logs in, a shell is started up.

The shell has the terminal as standard input and standard output

The shell starts out by typing the prompt, a character such as a dollar sign or percentage sign e.g.,

hanan% User enters a command e.g., date

The Shell

The user can specify that standard output be redirected to a file e.g.,

date > file Standard input can be redirected e.g., sort < file1 > file2 The output of one program can be used

as the input to another program: cat file1 file2 file3 | sort >/dev/lp &

High-Level View of Shell Code

while (1) { get a line from the user Execute command found

in line}

Details Not Highlighted Making sure that the

line from the user is correct

How is shell termination handled?

The execution of a command is done by a separate process (child process) from the shell process

For simple commands, the shell process waits for the child process to terminate so that it can print the prompt

If a child process is put in the background (using &) then the shell process can continue without waiting for the child process to terminate

The Concept of Fork The Unix system call for process creation is

called fork(). The fork system call creates a child process

that is a duplicate of the parent. Child inherits state from parent process

• Same program instructions, variables have the same values, same position in the code

Parent and child have separate copies of that state Child has the same open file descriptors from the

parent.• Parent and child file descriptors point to a common

entry in the system open file descriptor table. More on this later

fork() as a diagram

Parent

pid = fork()

Returns a new PID: e.g. pid == 56

Data

SharedProgram Data

Copied

Child

pid == 0

Process Creation Using Fork

int main (){ pid_t pid; int status = 0; pid = fork(); if (pid < 0) perror(“fork()”); if (pid > 0) { /* parent */ printf(“I am parent\n”); wait(0); } else { /* child */ printf(“I am child\n”); } return 0;}

pid is zero which indicates a child process

The fork system call returns twice: it returns a zero to the child and the child process ID (pid) to the parent.

The perror function produces a message on the standard error output describing the last error encountered during a call to a system or library function (man page)

The wait function is used to terminate the parent process when the child terminates

Fork System Call If fork () succeeds it returns the child PID to

the parent and returns 0 to the child If fork() fails, it returns -1 to the parent (no

child is created) and sets errno A program almost always uses this difference

to do different things in the parent and child processes.

Failure occurs when the limit of processes that can be created is reached.

pid_t data type represents process identifiers Other calls:

pid_t getpid() – returns the PID of calling process Pid_t getppid() – returns the PID of parent process

fork() Example 1 #include <stdio.h>#include <sys/types.h>#include <unistd.h>

int main(){

pid_t pid; int i;

sum = 0; pid = fork(); if( pid > 0 )

{ /* parent */

for( i=0; i < 10; i++ ) sum = sum + i; printf(“parent: sum is %\

n”,sum); wait(0);

}

else { /* child */ for( i=0; i < 10; i++ ) sum = sum - i; printf(“child: sum is %d\n”,sum); } return 0;

}

fork() Example 1

What is the value of sum in the parent and child processes after pid = fork()? 0

What is the value of sum in the parent and child processes at the print statements? parent: sum is 45 child: sum is -45

fork() Example 1 Remember that sum was 0 before the

fork took place When the fork took place the process

was duplicated which means that a copy is made of each variable; sum was duplicated

Since sum was 0 just before the fork then sum is 0 right after the fork in both the child and parent processes

fork() Example 2 #include <stdio.h>#include <sys/types.h>#include <unistd.h>

int main(){ pid_t pid; int i; pid = fork(); if( pid > 0 ) {

/* parent */ for( i=0; i < 1000; i++ ) printf(“\t\t\tPARENT

%d\n”, i);

wait(0); }

else { /* child */ for( i=0; i < 1000; i++ ) {

printf( “CHILD %d\n”, i ); }

} return 0;}

What is the possible output?

fork () Example 2:Possible Output

PARENT 0 PARENT 1

PARENT 2

PARENT 3

PARENT 4

PARENT 5

PARENT 6

PARENT 7

PARENT 8

PARENT 9

CHILD 0

CHILD 1

CHILD 2

CHILD 3

CHILD 4

CHILD 5

CHILD 6

CHILD 7

CHILD 8

CHILD 9

fork () Example 2:Possible Output

PARENT 0 PARENT 1 PARENT 2 PARENT 3 PARENT 4 PARENT 5 PARENT 6CHILD 0CHILD 1CHILD 2 PARENT 7 PARENT 8 PARENT 9

CHILD 3 CHILD 4CHILD 5CHILD 6CHILD 7CHILD 8CHILD 9

Lots of possible outputs!!

Execution

Processes get a share of the CPU before giving it up to give another process a turn

The switching between the parent and child depends on many factors: machine load, system process scheduling

Output interleaving is nondeterministic Cannot determine output by looking at code

How many Processes are Created by this Program?

#include <stdio.h>#include <unistd.h>int main(){ fork(); fork(); fork();}

Process Structure?

What does the process structure look like?

Is this a tree or a chain of processes

pid_t childpid = 0;

for (i=1;i<n;i++) if (childpid = fork()) == 0) break;

Process Structure?

What does the process structure look like?

Is this a tree or a chain of processes?

pid_t childpid = 0;

for (i=1;i<n;i++) if ((childpid = fork()) <= 0) break;

Process File Descriptor Table

Every process has a process file descriptor table

Each entry represents something that can be read from or written to e.g., file Screen pipe (later)

System File Descriptor Table

The OS maintains a system file descriptor table in the OS's memory space. Every open file in the system has an entry in

this table. One file can be associated with many entries

in this table (if opened by many processes). These entries contain among other things:

• the file descriptor's permissions• # links• the file offset which is moved by read and write.

An entry is removed when 0 links point to it.

OS File Structures

01234

Parent File Descriptor table

stdinstdoutstderrin_file

System file table

File info e.g., read

offset

Assume that there was something like this in a program

FILE *in_file; in_file = fopen("list.txt", "r");

Fork and Files

In a fork what gets copied is the file descriptor table;

Thus the parent and child point to the same entry in the system file table

Fork and Files

Parent File Descriptor table

01234


System file table

File info e.g., read

offset

01234


Child File Descriptor table

Fork and Files Open a file before a fork

The child process gets a copy of its parent's file descriptor table.

The child and parent share a file offset pointer because the open came before the fork.

Open a file after a fork Assume that parent and child each open a file

after the fork They get their own entries in the System File

Descriptor table• This implies that the file position information is

different

QuestionSuppose that foobar.txt consists of the 6 ASCII characters foobar. Then what is the output of the following program?

int main(){ FILE *fd1, *fd2; char c;

fd1 = fopen("foobar.txt", “r”); fd2 = fopen("foobar.txt", “r”); fscanf(fd1, “%c”, &c); fscanf(fd2, “%c”, &c); printf("c = %c\n", c);

}

Answer

The descriptors fd1 and fd2 each have their own open file table entry, so each descriptor has its own file position for foobar.txt. Thus, the read from fd2 reads the first byte of foobar.txt, and the output is

c = fand not

c = o

Fork and Files

Be careful It is much better to open files after

forks. Even so you need to be careful if there

is writing This often requires that a process has

mutual exclusive access during the write process (more later in the course)

Wait Parents waits for a child (system call)

Blocks until a child terminates Returns pid of the child process Returns -1 if no child process exists (already

exited) status

#include <sys/types.h> #include <sys/wait.h> pid_t wait(int *status)

Parent waits for a specific child to terminate pid_t waitpid(pid_t pid, int *status, int options)

Process Creation Using Fork

int main (){ pid_t pid; int status = 0; pid = fork(); if (pid < 0) perror(“fork()”); if (pid > 0) { /* parent */ printf(“I am parent\n”); pid = wait(&status); } else { /* child */ printf(“I am child\n”); exit(status); }}

Parent uses wait to sleep until the child exits; wait returns child pid and status.

Wait variants allow wait on a specific child, or notification of stops and other signals.

The fork syscall returns twice: it returns a zero to the child and the child process ID (pid) to the parent.

More about Process Operations

In Unix-based systems, a hierarchy of processes is formed

In Unix, we can obtain a listing of processes by using the ps command

ps –el will list complete information for all processes

Exec The term exec refers to a family of functions

where each of the functions replace a process’s program (the one calling one of the exec functions) with a new loaded program

A call to a function from exec loads a binary file into memory (destroying the memory image of the program calling it)

The new program starts executing from the beginning (where main begins)

On success, exec never return; on failure, exec returns -1.

The different versions are different primarily in the way parameters are passed

Exec The exec family consists of these functions:

execvp, execlp, execv, execve, execl, execle Functions with p in their name (execvp, execlp)

search for the program in the path indicated by the PATH environment variable; functions without p must be given full path.

Functions with v in their name (execv, execvp, execve) differ from functions with l (execl, execlp, execle) in the way arguments are passed

Functions with e accept array of environment variables

Versions of exec Versions of exec offered by C library:

int execl( const char *path, const char *arg, ... );int execlp( const char *file, const char *arg, ... );int execle( const char *path, const char *arg , ..., char *const envp[] );int execv( const char *path, char *const argv[] );int execvp( const char *file, char *const argv[] );int execve( const char *filename, char *const argv [], char *const envp[] );

Exec Example

Program A:

int i = 5;printf(“%d\n”,i);execl(“B”, “B”, NULL);printf(“%d\n”,i);

Program B:

main() {

printf(“hello\n”); }

What is the output of program A?5

hello

Why is it not this? 5 hello 5

The execl command replaces the instructions in the process with instructions for program B. It starts at the first instruction (starts at main)

Exec Example

Program A:int i = 5;prog_argv[0] = "B";prog_argv[1] = NULL;printf(“%d\n”,i);execv(prog_argv[0], prog_argv);printf(“%d\n”,i);

Program B: main() {

printf(“hello\n”); }

Same functionality as the program on the previous slide

Used execv instead of execl

execv uses an array to pass arguments

execl uses a list to pass arguments

Exec Example

int main(int argc, char *argv[]){

int i = 5; int status; status = execlp(“ls”, “ls”, “-l”,

“a.c”, NULL);

if (status !=0) { perror(“you goofed\n"); printf(“errno is %d\n”,errno); } printf("%d\n",i);

}

In this example, note that the command is ls –l a.c

Each argument is in the list

Question: What would cause the

perror function to be executed?

Exec Example int main(int argc, char *argv[]){

char *prog1_argv[4]; int i = 5; prog1_argv[0] = "ls"; prog1_argv[1] = "-l"; prog1_argv[2] = "a.c"; prog1_argv[3] = NULL;

execvp(prog1_argv[0], prog1_argv);

… printf("%d\n",i);

}

Same example as that on the previous side but execvp is used which requires an array

Fork and Exec

Child process may choose to execute some other program than the parent by using one of the exec calls.

Exec overlays a new program on the existing process.

Child will not return to the old program unless exec fails. This is an important point to remember.

File descriptors are preserved

fork() and execv()execv(new_program, argv[ ])

NewCopy ofParent

Initial process

fork

Originalprocess

Continues

Returns a new PID

new_Program(replacement)

execv(new_program)

fork() returns pid=0 and runs as a cloned parent until execv is called

Example#include <sys/types.h>#include <stdio.h>#include <unistd.h>int main(){ pid_t pid; pid = fork();

if (pid < 0)

perror("fork()");

if (pid > 0) { wait(NULL); printf("Child Complete"); } else{ if (pid == 0) execlp("ls","ls", “-l, “a.c”, NULL); }}

Exec and Shell

As we can see from the example on the previous slide a process, which can be a shell, can fork a child and the child can execute a command using exec

Documents

Shell (Part 1). Process r A process is an instance of an application running r If there are two instances of an application running then there are two