1.4 Research Areas related to Routing 11
embeddings, many results are known. An overview on results in this area has
been given by Monien and Sudborough [MS90].
The model of dynamic embeddings was first considered by Meyer auf der
Heide [Me83]. There are several results indicating the strength of these simu-
lations, see [KLM+89, Sch90, KKR93b]. E.g., the V ~ • v/n-mesh can be sim-
ulated on an n-node butterfly network with constant slowdown [KLM+89],
whereas a static embedding has dilation (and therefore slowdown) 12(log n),
as shown in [BCH+88].
Let us call a network
n-universal with slowdown s,
if it can simulate T
steps of any network of size n with slowdown at most s.T for any T _> 1. Galil
and Paul [GP83] show that every network H of size m that can sort n num-
bers in time
sort(n, m)
is n-universal with slowdown
O(sort(n, m)).
Hence
the sorting algorithm presented by Cypher and Plaxton [CP93] implies that
the shuffle-exchange network and the hypercube of size n are n-universal with
slowdown O(log n. (log log n)2). In [Me86], Meyer auf der Heide presents an
nl+~-node n-universal network with constant slowdown. Kaklamanis
et al.
[KKR93b] present, among other results, a network of size n that can simu-
late every planar network of size n with slowdown O(log log n). For restricted
classes of bounded degree networks (i.e., networks where the size of the t-
neighborhood of each node is bounded by a polynomial in t), Meyer auf der
Heide and Wanka [MW89] showed that constant slowdown simulations only
need
0 (n.poly
(log n))-size universal networks. Lower bounds for dynamic em-
beddings are shown in [Me83, Me86, KLM+89, KR94, MSW95]. In [MSW95],
Meyer auf der Heide
et al.
show that, if H is an arbitrary constant-degree
network of size m that can simulate all constant-degree networks of size n
with slowdown s, then m. s = 12(n logm).
1.4.5 Shared
Memory Simulations
Parallel machines that communicate via a shared memory, so-called
parallel
random access machines
(PRAMs), represent an idealization of a parallel
computation model. The user does not have to worry about synchronization,
locality of data, communication capacity, delay effects or memory contention.
On the other hand, PRAMs are very unrealistic from a technological point
of view; large machines with shared memory can only be built at the cost of
very slow shared memory access. Therefore there is a need for finding efficient
simulations of a PRAM on more realistic models such as bounded degree
networks. The main problem is the distribution of the shared memory cells
among the nodes of the network to allow fast accesses. A standard method is
to use universal hashing (see, e.g., [MV84, KU86, UW87, KLM93, GMR941).
For a survey on high performance hash functions see, e.g,, [Di91].
A PRAM consists of processors P1,-.., P~ and a shared memory with
cells U -- (1,..., p}, each capable of storing one integer. The processors work
synchronously and have random access to the shared memory cells. A PRAM
in which no two processors are allowed to access the same shared memory cell