================ Start Lecture #21
================
- The controller abstracts away some of the low level features of
the device.
- For disks, the controller does error checking, buffering and
handles interleaving of sectors. (Sectors are interleaved if the
controller or CPU cannot handle the data rate and would otherwise have
to wait a full revolution. I do not believe this is a concern with
modern machines where the electronics have increased in speed faster
than the devices.)
- For analog monitors (CRTs) the controller does
a great deal. Analog video is far from a bunch of ones and zeros.
- Controllers are also called adaptors.
- Typically the interface the OS sees consists of some device
registers located on the controller (address to access, read vs write,
length, data value, etc).
- Memory-mapped I/O vs. I/O
space. In memory mapped I/O the device registers are mapped
into normal memory space. So you need to know that
load 8888<--4
tells the disk read the sector whose address you previously loaded
into 8800. In the I/O space you used different instructions (not
ordinary loads and stores) to do essentially the same thing.
Homework: 2
5.1.3: Direct Memory Access (DMA)
- The disk controller gets the desired data from the disk to its
buffer
- For programmed I/O (PIO) the cpu then does loads and
stores (or I/O instructions) to copy the data from the buffer to the
desired memory location.
- With a DMA controller, the controller writes the memory without
intervention of the CPU.
- Clearly DMA saves CPU work. But this might not be important if
the CPU is limited by the memory or by system buses.
- Very important is that there is less data movement so the buses
are used less and the entire operation takes less time.
- Since PIO is pure software it is easier to change, which is an
advantage.
- DMA does need a number of bus transfers from the CPU to the
controller to specify the DMA. So DMA is most effective for large
transfers where the setup is amortized.
- Why have the buffer? Why not just go from the disk straight to
the memory.
Answer: Speed matching. The disk supplies data at a fixed rate, which might
exceed the rate the memory can accept it. In particular the memory
might be busy servicing a request from the processor or from another
DMA controller.
Homework: 5
5.2: Principles of I/O Software
As with any large software system, good design and layering is
important.
5.2.1: Goals of the I/O Software
Device independence
We want to have most of the OS, unaware of the characteristics of
the specific devices attached to the system. Indeed we also want the
OS to be largely unaware of the CPU type itself.
Due to this device independence programs are
written to read and write generic devices and then at run time
specific devices are assigned. Writing to a disk has differences from
writing to a terminal, but Unix cp and DOS copy do not see these
differences. Indeed, most of the OS, including the file system code,
is unaware of whether the device is a floppy or hard disk.
Uniform naming
Recall that we discussed the value
of the name space implemented by file systems. There is no dependence
between the name of the file and the device on which it is stored. So
a file called IAmStoredOnAHardDisk might well be stored on a floppy disk.
Error handling
There are several aspects to error handling including: detection,
correction (if possible) and reporting.
- Detection should be done as close to where the error occurred as
possible before more damage is done (fault containment). This is not
trivial.
- Correction is sometimes easy, for example ECC memory does this
automatically (but the OS wants to know about the error so that it can
schedule replacement of the faulty chips before unrecoverable double
errors occur).
Other easy cases include successful retries for failed ethernet
transmissions. In this example, while logging is appropriate, it is
quite reasonable for no action to be taken.
- Error reporting tends to be awful. The trouble is that the error
occurs at a low level but by the time it is reported the
context is lost. Unix/Linux in particular is horrible in this area.
Creating the illusion of synchronous I/O
- I/O must be asynchronous for performance. That is the OS
cannot simply wait for an I/O to complete. Instead, it
proceeds with other activities and responds to the notification when
the I/O has finished.
- Users (mostly) want no part of this. The code sequence
Read X
Y <-- X+1
Print Y
should print a value one greater than that read. But if the
assignment is performed before the read completes, the wrong value is
assigned.
- Performance junkies sometimes do want the asynchrony so that they
can have another portion of their program executed while the I/O is
underway.
Sharable vs dedicated devices
For devices like printers and tape drives, only one user at a time
is permitted. These are called serially reusable
devices and are studied next chapter.
Devices like disks and Ethernet ports can be shared by processes
running concurrently.
Layering
Layers of abstraction as usual prove to be effective. Most systems
are believed to use the following layers (but for many systems, the OS
code is not available for inspection).
- User level I/O routines
- Device independent I/O software
- Device drivers
- Interrupt handlers
We give a bottom up explanation.
5.2.2: Interrupt Handlers
We discussed an interrupt handler before when studying page faults.
Then it was called ``assembly language code''.
In the present case, we have a process blocked on I/O and the I/O
event has just completed. So the goal is to make the process ready.
Possible methods are.
- Releasing a semaphore on which the process is waiting
- Sending a message to the process.
- Inserting the process table entry onto the ready list.
5.2.3: Device Drivers
The portion of the OS that ``knows'' the characteristics of the
controller.
The driver has two ``parts'' corresponding to its two access
points. Recall the following figure from the beginning of the course.
- Access by the main line OS with an I/O request.
- Accessed by the interrupt handler when the I/O completes (this
completion is signaled by an interrupt).
Tanenbaum describes the actions of the driver assuming it is
implemented as a process (which he recommends). I give both that view
point and the self-service paradigm in which the driver is invoked by
the OS acting in behalf of a user process (more precisely the process
shifts into kernel mode).
Driver as a process (Tanenbaum)
- The user issues an I/O request. The main line OS prepares a
generic (e.g. read, not read using Buslogic BT-958 SCSI controller) request for
the driver and the driver is awakened (perhaps a message is sent to
the driver to do both jobs).
- The driver wakes up
- If the driver was idle (i.e., the controller is idle), the
driver writes device registers on the controller ending with a
command for the controller to begin the actual I/O.
- If the controller is busy (doing work the driver gave it), the
driver simply queues the current request (the driver dequeues this
below).
- The driver blocks waiting for an interrupt or for more
requests.
- An interrupt arrives (i.e. an I/O has been completed)
- The driver informs the main line perhaps passing data and
surely passing status (error, OK).
- Find next work item or block
- If the queue of requests is non-empty dequeue one and
proceed as if just received a request from the main line.
- If queue is empty, the driver blocks waiting for an
interrupt or a request from the main line.
Driver in a self-service paradigm
- The user issues an I/O request. The main line OS prepares a
generic request for the driver and calls the driver.
- The driver is called
- If the driver was idle (i.e., the controller is idle), the
driver writes device registers on the controller ending with a
command for the controller to begin the actual I/O.
- If the controller is busy (doing work the driver gave it), the
driver simply queues the current request (the driver dequeues this
below).
- The driver returns to the main line
- An interrupt arrives (i.e. an I/O has been completed)
- The driver informs the main line perhaps passing data and
surely passing status (error, OK).
- Find next work item or return
- If the queue of requests is non-empty dequeue one and
proceed as if just received a request from the main line.
- If queue is empty, the driver returns to the main line.