138 J. Marzo et al.
5.1 Basic Concepts on Network Survivability
Network survivability reflects the ability of a network to maintain service continu-
ity during and after failures. Although this ability is of relevance for any transport
network, it is essential for operators of optical networks, where a single fiber cut,
for example, can lead to severe service disruption and to loss of revenue, affecting
thousands of end users whose traffic is transported by high-capacity DWDM links.
Naturally, failures are not limited to fiber optic cables; other components such as
multiplexers, optical cross-connects (OXC), and repeaters can also fail. Moreover,
the causes of failure are equally diverse, from aging of the physical components to
natural dissasters to errors caused by human intervention.
Recovery is the name given to the sequence of events and actions taken after the
detection of a failure in order to keep the service in operation –whether in degraded
mode or not– and return the network to the preferred state upon the completion of
the repair procedures [20]. In this chapter, we consider that the unit of service of
the optical network, and therefore the subject of recovery, is a connection, i.e., the
virtual communication channel created between two designated nodes, with a given
capacity and duration. In line with the concepts of GMPLS, we assume that any
such connection is served by paths inside the network.
The body of knowledge on network recovery for optical networks is extensive.
However, as this chapter is focused on end-to-end recovery using local failure in-
formation, the reader interested in the general topic of network recovery is referred
to [13], which surveys the major techniques for next generation networks, and to [6],
which offers several options for classifying them, as well as an evaluation of their
features from the perspective of quality of service and differentiation. Nevertheless,
we will devote the following subsections to presenting essential concepts and termi-
nology on survivability, necessary for understanding the problems addressed in the
rest of the chapter as well as the applicability of the proposed solutions.
5.1.1 Protection and Restoration
The existing recovery techniques differ in objectives (i.e., offer very fast recovery
time, maximize network utilization, accomodate conflicting QoS requirements, or
a combination of several objectives) as well as approaches to specific issues such
as the provisioning method employed, the protection scope, the signaling require-
ments, and the layers, topologies and transmission technologies to which they are
applicable. Despite their differences, however, recovery generally implies that the
traffic affected by a failure is switched to a backup path. The moment in time in
which this backup path is established gives rise to two general approaches, one is
called restoration and the other is called protection. Under restoration, backup paths
are discovered on demand, and spare capacity is dynamically allocated thereafter,
whereas under protection both steps are completed at service setup time, whether
any failure arises or not later on during the service lifetime.