| cos423: Theory of Algorithms | handout #18 | 
| Michael Goldwasser | |
| Princeton University | Tuesday, March 31, 1998 | 
| Homework #6: | Randomized Algorithms and Hashing | 
| Due Date: | Tuesday, April 7, 1998 (5:00pm) | 
 be a class of hash functions in which each h
maps the universe U of keys to
 be a class of hash functions in which each h
maps the universe U of keys to  .  We say that
.  We say that
 is k-universal if, for every fixed sequence of
k distinct keys
 is k-universal if, for every fixed sequence of
k distinct keys  and for any
h chosen at random from
 and for any
h chosen at random from  , the sequence
, the sequence  is equally likely to be any of the
mk sequences of length k with elements drawn from
 is equally likely to be any of the
mk sequences of length k with elements drawn from  .
. is 2-universal, then it is universal.
 is 2-universal, then it is universal.
 defined in Section 12.3.3 of CLR
is not 2-universal.
 defined in Section 12.3.3 of CLR
is not 2-universal.
 in Section
12.3.3 of CLR so that each function also contains a constant term b
also chosen uniformly at random from
 in Section
12.3.3 of CLR so that each function also contains a constant term b
also chosen uniformly at random from  ,that is, if we replace h(x) with
,that is, if we replace h(x) with

 is 2-universal.
 is 2-universal.
 .  We can use this idea
to get yet another version of balanced trees.  The difficulty,
however, is that in a dynamic setting you cannot really assume that
the items come in a random order, and you cannot afford to wait until
all items arrive before building your data structure.  In this
problem, we will design a new method, which has expected
.  We can use this idea
to get yet another version of balanced trees.  The difficulty,
however, is that in a dynamic setting you cannot really assume that
the items come in a random order, and you cannot afford to wait until
all items arrive before building your data structure.  In this
problem, we will design a new method, which has expected  time for any operation in a sequence of Insert's, 
Delete's, and Search's.
time for any operation in a sequence of Insert's, 
Delete's, and Search's.
Assume that every object x has two associated values, key[x] and priority[x]. We define a new data structure called a treap, which is a binary tree with objects stored at internal nodes, satisfying the following two properties. (we assume that keys are distinct, and that priorities are distinct)
We would like to maintain a treap dynamically through a sequence of calls to Insert, Delete, and Search, as defined below. In the following three parts, we will ask you how to implement each of these operations while maintaining the properties of a valid treap. For each operation, you should briefly justify the correctness of your implementation as well as the bound on its running time. In describing your implementations, you may feel free to refer to descriptions of basic operations on binary trees as described in Chapter 13 of the text.
Up until now, we have assumed that an object given to Insert arrives with both a key and a priority. If we want to maintain a set of objects for which only the key is defined, we are free to choose any values we want for priorities, and the above routines will be correct. For the remainder of the problem, we assume that when an object x with key[x] is inserted, we set the priority of that object uniformly at random to a real number in the range [0,1]. (The probability that two different objects choose the identical priority is equivalently zero.)
 , where n is the number of objects in the data
structure before the operation begins.
, where n is the number of objects in the data
structure before the operation begins.
Suppose that we have a hash table with n slots, with collisions
resolved by chaining, and suppose that n keys are inserted into the
table.  Each key is equally likely to be hashed to each slot.  Let M
be the maximum number of keys in any slot after all the keys have been
inserted.  Your mission is to prove an  upper
bound on E[M], the expected value of M.
 upper
bound on E[M], the expected value of M.

 .
. .  Conclude that Pk < 1/n2 for
.  Conclude that Pk < 1/n2 for
 .
.![\begin{displaymath}
E[M] \leq Pr \left\{ M \gt \frac{c \lg n}{\lg \lg n}\right\}...
 ...ac{c \lg n}{\lg \lg n}\right\} \cdot \frac{c \lg
n}{\lg \lg n}.\end{displaymath}](img14.gif)
![$E[M] = O(\lg n / \lg \lg n)$](img15.gif) .
.Copyright © 1993, 1994, 1995, 1996, 1997, Nikos Drakos, Computer Based Learning Unit, University of Leeds.