Basic feasible solution – Wikipedia

Posted on October 24, 2022 by lordneo

before-content-x4

In the theory of linear programming, a basic feasible solution (BFS) is a solution with a minimal set of non-zero variables. Geometrically, each BFS corresponds to a corner of the polyhedron of feasible solutions. If there exists an optimal solution, then there exists an optimal BFS. Hence, to find an optimal solution, it is sufficient to consider the BFS-s. This fact is used by the simplex algorithm, which essentially travels from some BFS to another until an optimal one is found.^[1]

after-content-x4

Table of Contents

Definitions[edit]

Preliminaries: equational form with linearly-independent rows[edit]

For the definitions below, we first present the linear program in the so-called equational form:

maximize

{textstyle mathbf {c^{T}} mathbf {x} }

subject to

{displaystyle Amathbf {x} =mathbf {b} }

where:

after-content-x4

Any linear program can be converted into an equational form by adding slack variables.

As a preliminary clean-up step, we verify that:

The system ${displaystyle Amathbf {x} =mathbf {b} }$
All m rows of the matrix ${displaystyle A}$

Feasible solution[edit]

A feasible solution of the LP is any vector

{displaystyle mathbf {x} geq 0}

${displaystyle mathbf {x} geq 0}$ such that

{displaystyle Amathbf {x} =mathbf {b} }

$Amathbf {x} =mathbf {b}$ . We assume that there is at least one feasible solution. If m = n, then there is only one feasible solution. Typically m < n, so the system

{displaystyle Amathbf {x} =mathbf {b} }

$Amathbf {x} =mathbf {b}$ has many solutions; each such solution is called a feasible solution of the LP.

Basis[edit]

A basis of the LP is a nonsingular submatrix of A, with all m rows and only m<n columns.

Sometimes, the term basis is used not for the submatrix itself, but for the set of indices of its columns. Let B be a subset of m indices from {1,…,n}. Denote by

{displaystyle A_{B}}

$A_B$ the square m-by-m matrix made of the m columns of

{displaystyle A}

$A$ indexed by B. If

{displaystyle A_{B}}

$A_B$ is nonsingular, the columns indexed by B are a basis of the column space of

{displaystyle A}

$A$ . In this case, we call B a basis of the LP.

Since the rank of

{displaystyle A}

$A$ is m, it has at least one basis; since

{displaystyle A}

$A$ has n columns, it has at most

{displaystyle {binom {n}{m}}}

${displaystyle {binom {n}{m}}}$ bases.

Basic feasible solution[edit]

Given a basis B, we say that a feasible solution

{displaystyle mathbf {x} }

$mathbf {x}$ is a basic feasible solution with basis B if all its non-zero variables are indexed by B, that is, for all

{displaystyle jnot in B:~~x_{j}=0}

${displaystyle jnot in B:~~x_{j}=0}$ .

Properties[edit]

1. A BFS is determined only by the constraints of the LP (the matrix

{displaystyle A}

$A$ and the vector

{displaystyle mathbf {b} }

$mathbf {b}$ ); it does not depend on the optimization objective.

2. By definition, a BFS has at most m non-zero variables and at least n–m zero variables. A BFS can have less than m non-zero variables; in that case, it can have many different bases, all of which contain the indices of its non-zero variables.

3. A feasible solution

{displaystyle mathbf {x} }

$mathbf {x}$ is basic if-and-only-if the columns of the matrix

{displaystyle A_{K}}

${displaystyle A_{K}}$ are linearly independent, where K is the set of indices of the non-zero elements of

{displaystyle mathbf {x} }

$mathbf {x}$ .^[1]^: 45

4. Each basis determines a unique BFS: for each basis B of m indices, there is at most one BFS

{displaystyle mathbf {x_{B}} }

${displaystyle mathbf {x_{B}} }$ with basis B. This is because

{displaystyle mathbf {x_{B}} }

${displaystyle mathbf {x_{B}} }$ must satisfy the constraint

{displaystyle A_{B}mathbf {x_{B}} =b}

${displaystyle A_{B}mathbf {x_{B}} =b}$ , and by definition of basis the matrix

{displaystyle A_{B}}

$A_B$ is non-singular, so the constraint has a unique solution:

${displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}$
${displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}$

The opposite is not true: each BFS can come from many different bases. If the unique solution of

{displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}

${displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}$ satisfies the non-negativity constraints

{displaystyle mathbf {x_{B}} geq 0}

${displaystyle mathbf {x_{B}} geq 0}$ , then B is called a feasible basis.

5. If a linear program has an optimal solution (i.e., it has a feasible solution, and the set of feasible solutions is bounded), then it has an optimal BFS. This is a consequence of the Bauer maximum principle: the objective of a linear program is convex; the set of feasible solutions is convex (it is an intersection of hyperspaces); therefore the objective attains its maximum in an extreme point of the set of feasible solutions.

Since the number of BFS-s is finite and bounded by

{displaystyle {binom {n}{m}}}

${displaystyle {binom {n}{m}}}$ , an optimal solution to any LP can be found in finite time by just evaluating the objective function in all

{displaystyle {binom {n}{m}}}

${displaystyle {binom {n}{m}}}$ BFS-s. This is not the most efficient way to solve an LP; the simplex algorithm examines the BFS-s in a much more efficient way.

Examples[edit]

Consider a linear program with the following constraints:

{displaystyle {begin{aligned}x_{1}+5x_{2}+3x_{3}+4x_{4}+6x_{5}&=14\x_{2}+3x_{3}+5x_{4}+6x_{5}&=7\forall iin {1,ldots ,5}:x_{i}&geq 0end{aligned}}}

${displaystyle {begin{aligned}x_{1}+5x_{2}+3x_{3}+4x_{4}+6x_{5}&=14\x_{2}+3x_{3}+5x_{4}+6x_{5}&=7\forall iin {1,ldots ,5}:x_{i}&geq 0end{aligned}}}$

The matrix A is:

{displaystyle A={begin{pmatrix}1&5&3&4&6\0&1&3&5&6end{pmatrix}}~~~~~mathbf {b} =(14~~7)}

${displaystyle A={begin{pmatrix}1&5&3&4&6\0&1&3&5&6end{pmatrix}}~~~~~mathbf {b} =(14~~7)}$

Here, m=2 and there are 10 subsets of 2 indices, however, not all of them are bases: the set {3,5} is not a basis since columns 3 and 5 are linearly dependent.

The set B={2,4} is a basis, since the matrix

{displaystyle A_{B}={begin{pmatrix}5&4\1&5end{pmatrix}}}

${displaystyle A_{B}={begin{pmatrix}5&4\1&5end{pmatrix}}}$ is non-singular.

The unique BFS corresponding to this basis is

{displaystyle x_{B}=(0~~2~~0~~1~~0)}

${displaystyle x_{B}=(0~~2~~0~~1~~0)}$ .

Geometric interpretation[edit]

The set of all feasible solutions is an intersection of hyperspaces. Therefore, it is a convex polyhedron. If it is bounded, then it is a convex polytope. Each BFS corresponds to a vertex of this polytope.^[1]^: 53–56

Basic feasible solutions for the dual problem[edit]

As mentioned above, every basis B defines a unique basic feasible solution

{displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}

${displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}$ . In a similar way, each basis defines a solution to the dual linear program:

minimize

{textstyle mathbf {b^{T}} mathbf {y} }

subject to

{displaystyle A^{T}mathbf {y} geq mathbf {c} }

The solution is

{displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}

${displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}$ .

Finding an optimal BFS[edit]

There are several methods for finding a BFS that is also optimal.

Using the simplex algorithm[edit]

In practice, the easiest way to find an optimal BFS is to use the simplex algorithm. It keeps, at each point of its execution, a “current basis” B (a subset of m out of n variables), a “current BFS”, and a “current tableau”. The tableau is a representation of the linear program where the basic variables are expressed in terms of the non-basic ones:^[1]^: 65

{displaystyle {begin{aligned}x_{B}&=p+Qx_{N}\z&=z_{0}+r^{T}x_{N}end{aligned}}}

where

{displaystyle x_{B}}

$x_{B}$ is the vector of m basic variables,

{displaystyle x_{N}}

$x_N$ is the vector of n non-basic variables, and

{displaystyle z}

$z$ is the maximization objective. Since non-basic variables equal 0, the current BFS is

{displaystyle p}

$p$ , and the current maximization objective is

{displaystyle z_{0}}

$z_{0}$ .

If all coefficients in

{displaystyle r}

$r$ are negative, then

{displaystyle z_{0}}

$z_{0}$ is an optimal solution, since all variables (including all non-basic variables) must be at least 0, so the second line implies

{displaystyle zleq z_{0}}

${displaystyle zleq z_{0}}$ .

If some coefficients in

{displaystyle r}

$r$ are positive, then it may be possible to increase the maximization target. For example, if

{displaystyle x_{5}}

$x_5$ is non-basic and its coefficient in

{displaystyle r}

$r$ is positive, then increasing it above 0 may make

{displaystyle z}

$z$ larger. If it is possible to do so without violating other constraints, then the increased variable becomes basic (it “enters the basis”), while some basic variable is decreased to 0 to keep the equality constraints and thus becomes non-basic (it “exits the basis”).

If this process is done carefully, then it is possible to guarantee that

{displaystyle z}

$z$ increases until it reaches an optimal BFS.

Converting any optimal solution to an optimal BFS[edit]

In the worst case, the simplex algorithm may require exponentially many steps to complete. There are algorithms for solving an LP in weakly-polynomial time, such as the ellipsoid method; however, they usually return optimal solutions that are not basic.

However, Given any optimal solution to the LP, it is easy to find an optimal feasible solution that is also basic.^[2]^{: see also “external links” below.}

Finding a basis that is both primal-optimal and dual-optimal[edit]

A basis B of the LP is called dual-optimal if the solution

{displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}

${displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}$ is an optimal solution to the dual linear program, that is, it minimizes

{textstyle mathbf {b^{T}} mathbf {y} }

${textstyle mathbf {b^{T}} mathbf {y} }$ . In general, a primal-optimal basis is not necessarily dual-optimal, and a dual-optimal basis is not necessarily primal-optimal (in fact, the solution of a primal-optimal basis may even be unfeasible for the dual, and vice versa).

If both

{displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}

${displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}$ is an optimal BFS of the primal LP, and

{displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}

${displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}$ is an optimal BFS of the dual LP, then the basis B is called PD-optimal. Every LP with an optimal solution has a PD-optimal basis, and it is found by the Simplex algorithm. However, its run-time is exponential in the worst case. Nimrod Megiddo proved the following theorems:^[2]

There exists a strongly polynomial time algorithm that inputs an optimal solution to the primal LP and an optimal solution to the dual LP, and returns an optimal basis.
If there exists a strongly polynomial time algorithm that inputs an optimal solution to only the primal LP (or only the dual LP) and returns an optimal basis, then there exists a strongly-polynomial time algorithm for solving any linear program (the latter is a famous open problem).

Megiddo’s algorithms can be executed using a tableau, just like the simplex algorithm.

External links[edit]

References[edit]

after-content-x4

Basic feasible solution – Wikipedia

Definitions[edit]

Preliminaries: equational form with linearly-independent rows[edit]

Feasible solution[edit]

Basis[edit]

Basic feasible solution[edit]

Properties[edit]

Examples[edit]

Geometric interpretation[edit]

Basic feasible solutions for the dual problem[edit]

Finding an optimal BFS[edit]

Using the simplex algorithm[edit]

Converting any optimal solution to an optimal BFS[edit]

Finding a basis that is both primal-optimal and dual-optimal[edit]

External links[edit]

References[edit]

Recent Posts

Recent Comments

Archives

Categories

Meta