Class 4
CS 202
10 February 2021

On the board
------------

1. Last time
2. The shell, part II
3. File descriptors
4. The shell, part III
5. Processes: the OS's view
6. Threads
7. Intro to concurrency

---------------------------------------------------------------------------

1. Last time

    - Stack frames
    - System calls
    - Process/OS control transfers
    - Git/lab setup
    - Process birth
    - Began shell
        
2. Shell, part II

    Redirection

    What does this do?

    $ ./first3 abcd efgh > foo

    What about this?

    $ ps xc | grep ...

    or say we wanted to extract all of your GitHub ids...how would
    you do that without pipelines?

    download html from https://github.com/nyu-cs202
    then
    $ cat blob | grep -o 'labs-21sp-[a-zA-Z0-9-]*' | sort -f | uniq > students.txt

    How are these things implemented!? Remember, the programmer of
    first3 or cat or grep is long gone, and their output is winding
    up somewhere that the original program never specified.

3. File descriptors

    --every process can usually expect to begin life with three file
    descriptors already open:
	0: represents the input to the process (e.g., tied to
	    terminal)
	1: represents the output
	2: represents the error output

	these are sometimes known as stdin, stdout, stderr

    --NOTE: Unix hides from processes the difference between a device
    and a file. this is a very powerful hiding (or abstraction), as we
    will see soon

4. Shell, part III

    - Back to 
         $ ./first3 abcd efgh > /tmp/foo

    How is that implemented?
   
    Answer: after fork() but before exec(), shell does:

            close(1)
            open("/tmp/foo", O_TRUNC | O_CREAT | O_WRONLY, 0666)

        which automatically assigns fd 1 to point to /tmp/foo

        [draw picture of fds, fd 0 /dev/tty, fd 1 now /tmp/foo]

        --now, when first3 runs, it still has in its code: write(1,...),
        but "1" now means something else.

	What about 

	    $ sh < script > tmp1

	    where script contains 
		echo abc
		echo def

	    [draw picture]

    - Pipelines

        [Leaving this mostly as an exercise.]

        [See handout. The key mechanisms are:

            - the pipe() system call. this takes a two-element array of
            ints and fills it in with two file descriptors. the first
            file descriptor is the "read end"; the second is the "write
            end". after a process writes to the "write end", the data
            written is available by reading from the "read end".

            - the actions that the shell takes when the command has the
            vertical bar (sometimes known as the pipe character). the
            character is |. (the same character as bitwise-OR in C).

                - at a high level, when the shell sees |, it uses the
                system call pipe() to "connect" a write end in one
                process with a read end in another process. it does this
                by forking, and then manipulating file descriptors (see
                handle_pipeline() on handout02.pdf).
                
        ]

        Example:

            - look at the code for 'our_head' and 'our_yes': an example of file
            descriptors in action (simplified implementation of Unix
            utilities 'head' and 'yes').

            - now, how can we arrange for 'yes' to deliver its output to
            'head', and for 'head' to take its input from 'yes'?

            - answer: with pipelines:
    
                $ ./our_yes hello | ./our_head 15

                That prints:
                    hello
                    hello
                    <repeated 13 more times>

        
    - The power of the fork/exec separation

	[an innovation from the original Unix. possibly lucky design
	choice at the time. but turns out to work really well.  allows
	the child to manipulate environment and file descriptors
	*before* exec, so that the *new* program may in fact encounter a
	different environment]

       --To generalize redirections and pipelines, there are lots of
       things the parent shell might want to manipulate in the child
       process: file descriptors, environment, resource limits.

       --yet fork() requires no arguments!

       --Contrast with CreateProcess on Windows:

	    BOOL CreateProcess(
		name,
		commandline,
		security_attr,
		thr_security_attr,
		inheritance?,
		other flags,
		new_env,
		curr_dir_name,
		.....)

	    [http://msdn.microsoft.com/en-us/library/ms682425(v=VS.85).aspx]

	    there's also CreateProcessAsUser, CreateProcessWithLogonW,
	    CreateProcessWithTokenW, ...

	* The issue is that any conceivable manipulation of the
	environment of the new process has to be passed through 
	arguments, instead of via arbitrary code.

    in other words:

	    whoever calls CreateProcess() (or one of its variants) needs
	    to perfectly configure the process before it starts running.

	    with fork(), whoever calls fork() **is still running** so
	    can arrange to do whatever it wants, without having to work
	    through a rigid interface like the above. allows arbitrary
	    "setup" of the process before exec().


    - Discussion: what makes a good abstraction?

        --simple but powerful

        --examples we've seen:

            --stdin (0), stdout (1), stderr (2) [nice by themselves, but
            when combined with the mechanisms below, things get even better]

            --file descriptors

            --fork/exec() separation

            --very few mechanisms lead to a lot of possible
            functionality


Aside:

     - Fork bomb at the bash command prompt:

            $ :(){ : | : & }; :


5. Implementation of processes

    Briefly cover the OS's view:

		 PCB
	    -----------------
	    |   process id  |
	    |   state       |   (running, ready, blocked, etc.)
	    |   user id     |
	    |   IP (ins ptr)|
	    |   open files  |
	    | VM structures |
	    |   registers   |
	    |   .....       |  (signal mask, terminal, priority, ...)
	    -----------------

	    called "proc" in Unix, "task_struct" in Linux, and
	    "process_t" in lab4.


    [draw an array of these.]

    point out that during scheduling, a core switches between
    processes. we have not yet seen the mechanism for this; will
    discuss it later.
    

    Note: these PCBs will have an analog when considering threads,
    below.


6. Threads

    Interface to threads:

        tid thread_create (void (*fn) (void *), void *); 
            Create a new thread, run fn with arg

        void thread_exit (); 

        void thread_join (tid thr); 
        
            Wait for thread with tid 'thr' to exit
        
      plus a lot of synchronization primitives, which we'll see 
      in the coming classes

    Assume for now that threads are: 
        --an abstraction created by OS
        --preemptively scheduled

    [draw abstract picture of threads: own registers, share memory]

    (later, we will explore alternatives)

7. Intro to concurrency

    There are many sources of concurrency.

    --what is concurrency?
        
        --stuff happening at the same time

    --sources of concurrency

        --computers have multiple CPUs and common memory, so
        instructions in multiple threads can happen at the same time!
 
        --on a single CPU, processes/threads can have their instructions
        interleaved (helpful to regard the instructions in multiple
        threads as "happening at the same time")
   
        --interrupts (CPU was doing one thing; now it's doing another)

    --why is concurrency hard?

        *** Hard to reason about all possible interleavings


    --handout:

        1a:  x = 1 or x = 2.
        1b:  x = 13 or x = 25.
        1c:  x = 1 or x = 2 or x = 3 

            say x is at mem location 0x5000

            f is "x = x+1;", which might compile to:

            movq 0x5000, %rbx    # load from address 0x5000 into register
            addq $1, %rbx        # add 1 to the register's value
            movq %rbx, 0x5000    # store back


            g is "x = x+2;", which might compile to:

            movq 0x5000, %rbx    # load from address 0x5000 into register
            addq $2, %rbx        # add 2 to the register's value
            movq %rbx, 0x5000    # store back


        2: incorrect list structure

        3: incorrect count in buffer

        all of these are called race conditions; not all of them are
        errors, though

        --worst part of errors from race conditions is that a program
        may work fine most of the time but only occasionally show
        problems. why?  (because the instructions of the various threads
        or processes or whatever get interleaved in a
        non-deterministic order.)

        --and it's worse than that because inserting debugging code may
        change the timing so that the bug doesn't show up


    --hardware makes the problem even harder

        [look at panel 4; what is the correct answer?]

        [answer: "it depends on the hardware"]

        --sequential consistency not always in effect

        --sequential consistency means:

            --maintaining program order on individual processors

            --ensuring that writes happen to each memory location
            (viewed separately) in the order that they are issued

    --assume sequential consistency until we explicitly relax it