[docs] add tutorial on piecewise linear
# Copyright 2017, Iain Dunning, Joey Huchette, Miles Lubin, and contributors #src
# This Source Code Form is subject to the terms of the Mozilla Public License #src
# v.2.0. If a copy of the MPL was not distributed with this file, You can #src
# obtain one at https://mozilla.org/MPL/2.0/. #src

# # Piecewise linear functions

# The purpose of this tutorial is to explain how to represent piecewise linear
# functions in a JuMP model.

# This tutorial uses the following packages:

using JuMP
import Plots
# ## Minimizing a convex function (outer approximation)

# If the function you are approximating is convex, and you want to minimize
# "down" onto it, then you can use an outer approximation.

# For example, $f(x) = x^2$ is a convex function:

f(x) = x^2
∇f(x) = 2 * x
plot = Plots.plot(f, -2:0.01:2; ylims = (-0.5, 4), label = false)

# Because $f$ is convex, we know that for any $x_k$, we have:
# $$f(x) \ge f(x_k) + \nabla f(x_k) \cdot (x - x_k)$$
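# For example, at $x = 1.5$ with $x_k = 1$, the tangent value is
# $f(1) + \nabla f(1) \cdot 0.5 = 1 + 2 \cdot 0.5 = 2$, which lies below
# $f(1.5) = 2.25$. We can check this numerically:

f(1.5) >= f(1.0) + ∇f(1.0) * (1.5 - 1.0)

# Plotting one tangent for each $x_k$ in a grid shows the outer approximation: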
for x_k in -2:0.5:2
    g = x -> f(x_k) + ∇f(x_k) * (x - x_k)
    Plots.plot!(plot, g, -2:0.01:2; color = :gray, label = false)
end
plot
# We can use these _tangent planes_ as constraints in our model to create an
# outer approximation of the function. As we add more planes, the error between
# the true function and the piecewise linear outer approximation decreases.

# Here is the model in JuMP:

model = Model()
@variable(model, -2 <= x <= 2)
@variable(model, y)
@constraint(model, [x_k in -2:0.5:2], y >= f(x_k) + ∇f(x_k) * (x - x_k))
@objective(model, Min, y)
# !!! note
#     This formulation does not work if we want to maximize `y`, because the
#     tangents bound `y` only from below.
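# To solve the model, attach an optimizer and query the results. This sketch
# assumes the HiGHS solver is installed; any JuMP-compatible LP solver works:

import HiGHS
set_optimizer(model, HiGHS.Optimizer)
optimize!(model)
value(x), objective_value(model)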
# ## Maximizing a concave function (outer approximation)

# The outer approximation also works if we want to maximize "up" into a concave
# function.
f(x) = log(x)
∇f(x) = 1 / x
X = 0.1:0.1:2
plot = Plots.plot(f, X; ylims = (-3, 1), label = false)
for x_k in X
    g = x -> f(x_k) + ∇f(x_k) * (x - x_k)
    Plots.plot!(plot, g, X; color = :gray, label = false)
end
plot
# Here is the model in JuMP:

model = Model()
@variable(model, 0.1 <= x <= 2)
@variable(model, y)
@constraint(model, [x_k in X], y <= f(x_k) + ∇f(x_k) * (x - x_k))
@objective(model, Max, y)
# !!! note
#     This formulation does not work if we want to minimize `y`, because the
#     tangents bound `y` only from above.
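# Because the grid `X` contains the true maximizer $x = 2$, the tangent at
# $x_k = 2$ makes the outer approximation tight at the optimum. Solving the
# model (again assuming HiGHS is available) should recover $\log(2)$ exactly:

set_optimizer(model, HiGHS.Optimizer)
optimize!(model)
isapprox(objective_value(model), log(2); atol = 1e-6)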
# ## Minimizing a convex function (inner approximation)

# Instead of creating an outer approximation, we can also create an inner
# approximation. For example, given $f(x) = x^2$, we may want to approximate the
# true function by the red piecewise linear function:

f(x) = x^2
x̂ = -2:0.8:2
plot = Plots.plot(f, -2:0.01:2; ylims = (-0.5, 4), label = false, linewidth = 3)
Plots.plot!(plot, f, x̂; label = false, color = :red, linewidth = 3)
plot
# To do so, we represent the decision variables $(x, y)$ by the convex
# combination of a set of discrete points $\{x_k, y_k\}_{k=1}^K$:
# ```math
# \begin{aligned}
# x = \sum\limits_{k=1}^K \lambda_k x_k \\
# y = \sum\limits_{k=1}^K \lambda_k y_k \\
# \sum\limits_{k=1}^K \lambda_k = 1 \\
# \lambda_k \ge 0, \quad k = 1, \ldots, K
# \end{aligned}
# ```
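# To see the representation in action, pick an illustrative weight vector (the
# name `λ_demo` is ours, for demonstration only). Putting half the weight on
# each of the first two breakpoints lands on the chord between them, slightly
# above the true curve at $f(-1.6) = 2.56$:

λ_demo = [0.5, 0.5, 0.0, 0.0, 0.0, 0.0]
sum(λ_demo .* x̂), sum(λ_demo .* f.(x̂))  # (-1.6, 2.72)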
# The feasible region of the convex combination actually allows any $(x, y)$
# point inside this shaded region:

I = [1, 2, 3, 4, 5, 6, 1]
Plots.plot!(x̂[I], f.(x̂[I]); fill = (0, 0, "#f004"), width = 0, label = false)
plot

# Thus, this formulation does not work if we want to maximize $y$: the
# maximizer is free to pick a point on the upper boundary of the shaded region,
# far above the true function.
# Here is the model in JuMP:

x̂ = -2:0.8:2
ŷ = f.(x̂)
n = length(x̂)
model = Model()
@variable(model, -2 <= x <= 2)
@variable(model, y)
@variable(model, 0 <= λ[1:n] <= 1)
@constraint(model, x == sum(λ[i] * x̂[i] for i in 1:n))
@constraint(model, y == sum(λ[i] * ŷ[i] for i in 1:n))
@constraint(model, sum(λ) == 1)
@objective(model, Min, y)
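# Solving this model (HiGHS assumed, as before) lets us inspect which
# breakpoints the optimal convex combination uses:

set_optimizer(model, HiGHS.Optimizer)
optimize!(model)
value.(λ)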
# ## Maximizing a concave function (inner approximation)

# The inner approximation also works if we want to maximize "up" into a concave
# function.
f(x) = log(x)
x̂ = 0.1:0.5:1.6
plot = Plots.plot(f, 0.1:0.01:1.6; label = false, linewidth = 3)
Plots.plot!(x̂, f.(x̂), linewidth = 3, color = :red, label = false)
I = [1, 2, 3, 4, 1]
Plots.plot!(x̂[I], f.(x̂[I]); fill = (0, 0, "#f004"), width = 0, label = false)
plot

# Here is the model in JuMP:
ŷ = f.(x̂)
n = length(x̂)
model = Model()
@variable(model, 0.1 <= x <= 1.6)
@variable(model, y)
@variable(model, 0 <= λ[1:n] <= 1)
@constraint(model, sum(λ) == 1)
@constraint(model, x == sum(λ[i] * x̂[i] for i in 1:n))
@constraint(model, y == sum(λ[i] * ŷ[i] for i in 1:n))
@objective(model, Max, y)
# ## Non-convex functions

# If the function is non-convex (or non-concave), then we cannot use an outer
# approximation, and the inner approximation allows a solution far from the
# true function. For example, for $f(x) = \sin(x)$, we have:
f(x) = sin(x)
plot = Plots.plot(f, 0:0.01:2π; label = false)
x̂ = range(; start = 0, stop = 2π, length = 7)
Plots.plot!(x̂, f.(x̂), linewidth = 3, color = :red, label = false)
I = [1, 5, 6, 7, 3, 2, 1]
Plots.plot!(x̂[I], f.(x̂[I]); fill = (0, 0, "#f004"), width = 0, label = false)
plot
# We can force the inner approximation to stay on the red line by adding the
# constraint `λ in SOS2()`. The [`SOS2`](@ref) set is a Special Ordered Set of
# Type 2, and it ensures that at most two elements of `λ` can be non-zero, and
# if they are, that they must be adjacent. This prevents the model from taking
# a convex combination of points 1 and 5 to end up on the lower boundary of the
# shaded red area.

# Here is the model in JuMP:
ŷ = f.(x̂)
n = length(x̂)
model = Model()
@variable(model, 0 <= x <= 2π)
@variable(model, y)
@variable(model, 0 <= λ[1:n] <= 1)
@constraints(model, begin
    x == sum(λ[i] * x̂[i] for i in 1:n)
    y == sum(λ[i] * ŷ[i] for i in 1:n)
    sum(λ) == 1
    λ in SOS2()
end)
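# To try this model out, we might, for example, maximize `y`. Note that the
# `SOS2` constraint requires an optimizer with SOS2 support; this sketch
# assumes Gurobi is installed and licensed, but any SOS2-capable solver works:

@objective(model, Max, y)  # illustrative objective, chosen for this sketch
import Gurobi
set_optimizer(model, Gurobi.Optimizer)
optimize!(model)
value(x), value(y)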