Dual norm. Given the norm is L_p, where p >= 1, its
dual norm is L_q, where (1/p) + (1/q) = 1.
(If p=1, q=infinity; and, if p=infinity, q=1.) This is based on
Hölder's inequality.
Dual price. A dual variable associated with
a primal constraint.
Duality. The study and use of dual mathematical
programs.
Duality gap. Numerically, it is the difference between primal and
dual objective values. The phrase by itself means the
presence (or existence) of a gap -- i.e., that the value is not zero.
Duality theorems. These establish the fundamental
duality relations (weak or strong). The most significant ones are
given here.
Dynamic program (DP). Based on the
Principle of Optimality, this was originally concerned with
optimal decisions over time. For continuous time, it addresses
problems in
variational calculus. For discrete time, each period is sometimes
called a stage, and the DP is called a multi-stage decision
process. Here is the
Fundamental Recurrence Equation for an additive process:
F(t, s) = Opt{r(t, s, x) + aF(t', s'): x in X(t, s) and s'=T(t, s, x)},
where F(t, s) = optimal total return upon reaching time t in
state s, and
x = decision variable(s);
s, s' = state variables;
t, t' = time;
X(t, s) = decision space (usually depends only on state);
r(t, s, x) = immediate return;
T(t, s, x) = state transform.
In words, the optimal return upon reaching time t in state s equals the
optimal return from an immediate decision plus the optimal return for
the remainder of the process upon the transition to state s'. This
new state is a function of the current time, state, and decision.
For discrete time, t'=t+1 for the forward equation, and
t'=t-1 for the backward equation.
The multiplier (a) is generally a positive scalar, such as a
discount factor to yield the present
value of money. (For some problems a=1, in which case the infinite
horizon model might be ill posed.)
More generally, the process need not be additive, so that '+' could
be another composition operation.
A decision rule is a function, x*(t, s), that specifies an
optimal value of x upon reaching time t in state s. For a completely
deterministic process, backtracking is used to obtain the usual
form of an optimal solution from a known initial or final state.
The process could be stochastic, in which case r and F are expected
values, and the state transform is random with probability
distribution, P[T(t,s,x) = s' | s,x]. Thus, F(t',s') is replaced by
Sum_s'{F(t',s')P[T(t,s,x)=s'|s,x]}.
DP is a technique that applies to static problems too. For example,
consider the following recurrence equation for the
knapsack problem:
F(n, s) = Max{c(n)x + F(n-1, s-a(n)x): x in {0, 1, ..., |_s/a(n)_|}}.
In words, this says that the max total return with n objects having
total knapsack space s equals the max choice (x) of how much of the
n-th object we put into the knapsack, with immediate return c(n),
plus the max total return from the remaining n-1 objects with the
remaining space (s - a(n)x). The value of x (the stage decision
variable) is any non-negative integer up to the space allowed:
|_s/a(n)_| is the max number of objects of this type that
could fit in the knapsack. There could also be explicit bounds on
x, such as the 0-1 knapsack problem, in which case the policy set,
X(n,s) simply has only those values.