From this page you can download the data that we will be using in the class.
The first couple of lines in the data file
contain general purpose information. The actual genome sequence starts
just below the line labeled ORIGIN. The sequence data is organized in a
number of lines, each line containing 60 characters. Every line starts with a
number which is the index position of the first character in that line.
(Baker's yeast) DNA
S.cerevisiae is a eukaryotic genome with 16 chromosomes. Below you will
find data files for each of the chromosomes. The format is similar to that of
the E.Coli data described above.