Comp 210 Lab 7: Timing Sorting Algorithms

Table of Contents

  • Advising Missionaries and Cannibals
  • Insertion Sort vs. Mergesort
    Advising Missionaries and Cannibals

    A quick discussion of hw6. What are the possible states after one boat trip? After two?

    One notable difference between this problem and previous problems is that when you recur, your argument will often be getting larger rather than smaller. And while we tell you that your program (if written correctly) will terminate, this is not obvious. (This is in contrast to all our previous recursion, which simply decomposed the structure of the input: if written according to the template, those functions were guaranteed to terminate.)

    Insertion Sort vs. Mergesort

    In class, we've seen two different ways of sorting numbers, insertion sort (``isort'') and mergesort (``msort''). A few moments in lecture talked about how many steps each of these algorithms takes to sort a list of n numbers. Today's lab will do some timing experiments, repeat the theoretical analysis, and see if the two jibe. In addition, we also offer you code for another sorting method, called Quicksort. We will use it for timing purposes, but defer discussing it until lecture, a few weeks from now.

    So we have three fundamentally different approaches to sorting a list. It seems unsurprising that some might behave differently than others. Can we observe this difference? Can we provide explanations?

    If your programs only sort small lists a few times, the choice doesn't matter; any sort that is easy to write correctly is fine. However, for longer lists the difference really is huge. In Real Life (tm), sorting is an operation often performed (repeatedly) on very long lists. Only after we've decided that sorting is critical to having our program run reasonably fast do we worry about which sorting method is fastest.

    Making Test Lists

    In order to compare these different sorting algorithms, let's create some large lists of numbers to try them on.
    1. Write a function nums-down, where (nums-down 7) returns a descending list of 7 numbers, say (7 6 5 4 3 2 1). (Hint: reverse is a built-in Scheme function.)
    2. Now write a function nums-up which similarly creates an ascending list of any desired length.
    3. Now write a function nums-rand, which similarly creates a list of random numbers in the range from 0 to 32767. To create a single random number in the range from 0 to 9, you could use (random 10). For larger numbers, try:
      (define max-rand 32768)
      (random max-rand)
      
      (The function random is built-in; the value 32768 happens to be the biggest useful number to pass to that function. But this could conceivably change in the future; using the place-holder max-rand makes your code both easier to read and easier to maintain.)

      Note that random is certainly a function in the Scheme sense, but not in the mathematical sense (since the output is not determined by the input).

    You should be able to whip out these test-data generating functions pretty easily; one possible sketch follows.
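
    Here is a minimal sketch of all three functions, written in plain Scheme and assuming only the built-ins reverse and random mentioned above (your own versions may well differ):

      ;; nums-down : natural -> list-of-numbers
      ;; (nums-down 7) => (7 6 5 4 3 2 1)
      (define (nums-down n)
        (if (zero? n)
            '()
            (cons n (nums-down (- n 1)))))

      ;; nums-up : natural -> list-of-numbers
      ;; (nums-up 7) => (1 2 3 4 5 6 7), using the built-in reverse.
      (define (nums-up n)
        (reverse (nums-down n)))

      (define max-rand 32768)

      ;; nums-rand : natural -> list-of-numbers
      ;; Returns a list of n random numbers, each in the range 0 to 32767.
      (define (nums-rand n)
        (if (zero? n)
            '()
            (cons (random max-rand) (nums-rand (- n 1)))))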

    Timing

    There is a built-in function time-apply, which can be used to time how long it takes the computer to evaluate something.

    We'd like to be able to say (time-apply (msort big-list)), where big-list is some previously-defined list. However, this wouldn't quite achieve the effect we want. (Why not?) Instead, you must give time-apply a function of zero arguments: something like (time-apply msort-big-list), where msort-big-list takes no arguments, and predictably sorts big-list. (Remember, big-list must be previously defined.) How can we write this function? Try it! (You may want to have msort-big-list do the work of sorting, but then always return just #t, since we don't really want to see the sorted list printed out--it will be long.) Here's our solution.
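
    One possible version (which may well differ from the solution above; it assumes msort from lecture and nums-rand from earlier in this lab):

      ;; big-list : a previously-defined list to sort.
      (define big-list (nums-rand 1000))

      ;; msort-big-list : -> boolean
      ;; Takes no arguments; sorts big-list, then returns #t
      ;; so that the long sorted list isn't printed out.
      (define (msort-big-list)
        (begin (msort big-list)
               #t))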

    time-apply reports two numbers, both times in milliseconds. The second number is the actual time elapsed. The first number is the cpu time--that is, how much time Dr. Scheme itself used up, as opposed to time spent on your Netscape program, updating any mouse movements you made, handling any mail requests coming in, etc.

    Note that the time might vary for repeated evaluations of the same expression. This is because there might be different times spent waiting to load parts of Dr. Scheme from disk or cache memory, figuring out what memory is no longer being used and can be recycled, etc. You should time the same function several times to get a better idea of how much time is typically required.

    Okay, now we're set! Your section leader will divide you up into two different groups (Team iSort, Team mSort); your group is in charge of running one of the sorting algorithms on lists of size 500, 1000, and 2000, for each of the types of data (ascending, descending, random). For instance, your task might be to write a function my-task, which takes no arguments and calls msort on a list of 1000 ascending numbers, and then to evaluate (time-apply my-task).
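
    That particular task might look like this (again assuming msort from lecture and nums-up from above):

      ;; my-task : -> boolean
      ;; Sorts a list of 1000 ascending numbers; returns #t.
      (define (my-task)
        (begin (msort (nums-up 1000))
               #t))

      (time-apply my-task)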

    Advanced challenge: How would you write a function which takes a list as an argument (we'll call it nums), and returns a function which takes no arguments and calls msort on the list nums?
    Hint: use local. Try it!
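
    If you get stuck, here is one possible sketch using local as hinted (the name make-sort-task is our own, hypothetical choice; msort is assumed from lecture):

      ;; make-sort-task : list-of-numbers -> (-> boolean)
      ;; Returns a zero-argument function which msorts the list nums.
      (define (make-sort-task nums)
        (local [(define (task)
                  (begin (msort nums)
                         #t))]
          task))

      ;; Usage, e.g.: (time-apply (make-sort-task (nums-down 2000)))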

    On the board, your section leader will tally all the results. Which algorithm appears to be quickest for random data? Is this also the best sort to use if your data happens to be mostly-sorted? (Consider a bank, which has a list of your checks already sorted by transaction date, and then wants to sort them by check-number.)

    (To be fair, quicksort does somewhat better if you have data stored in a different manner (vectors), which is often true in many scientific applications. We'll talk about vectors later in lecture.)

    Analyzing the Results

    We can theoretically reason about how many steps insertion sort and mergesort each require to sort a list of n elements. Insertion sort inserts each of the n elements into an already-sorted list, and each insertion may have to scan past up to n elements, for roughly n² steps in the worst case (up to a constant factor). Mergesort instead halves the list repeatedly (about log n levels of halving) and does about n steps of merging work at each level, for roughly n·log n steps.
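
    As a rough check against your measurements (this is our arithmetic, not measured data): doubling n from 1000 to 2000 should roughly quadruple an n² running time (1000² = 1,000,000 versus 2000² = 4,000,000), but should only slightly more than double an n·log n running time (1000·log₂ 1000 ≈ 10,000 versus 2000·log₂ 2000 ≈ 21,900). See whether your tallied timings grow at about these rates.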

    Big-Oh (optional)

    Can we get a more systematic idea of how fast these different sorting algorithms run?

    Formalizing "order-of"

    So far, we've tried to get a gut feeling for the behavior of algorithms. However, there is "science" in computer science; it is important to be able to formalize these notions, without any hand-wavy arguments. This is done with the ``order-of'' concept, which has a formal description.

    We say a "function f(n) is order of n²" to mean ``f(n) < n²'' along with some fine print: ``up to some constant factor, and perhaps ignoring small values of n''.

    "Function f(n) is order of n²" is abbreviated by saying "f(n) is in big-Oh of n²", which is written "f(n) is in O(n²)".

    Of course, there is nothing special about n²; this generalizes to any function.

    The Nitty-gritty

    Enough hand-waving. What do we mean by "up to some constant factor" and "small values of n"? Don't worry, there is a technical definition:

    Definition: A function f(n) is in O(g(n)) if:
    there exist numbers c, bigEnough such that
    for all n >= bigEnough,
    it is true that f(n) <= c·g(n).
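
    For example, the function f(n) = 3n² + 10n is in O(n²): take c = 4 and bigEnough = 10. Then for all n >= 10 we have 10n <= n·n = n², so f(n) = 3n² + 10n <= 3n² + n² = 4n² = c·n².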

    (A technical note on abuse of this definition).

    Finally, we can say that merge-sort runs in time O(n·log(n)), and insertion sort in time O(n²). This refers to how long these algorithms take in the worst case. (These are also just upper bounds, so we usually try to make the strongest statement we can: saying that merge-sort runs in time O(n¹⁷) is true, but it's not a very strong statement.)

