Basic feasible solution – Wikipedia

before-content-x4

In the theory of linear programming, a basic feasible solution (BFS) is a solution with a minimal set of non-zero variables. Geometrically, each BFS corresponds to a corner of the polyhedron of feasible solutions. If there exists an optimal solution, then there exists an optimal BFS. Hence, to find an optimal solution, it is sufficient to consider the BFS-s. This fact is used by the simplex algorithm, which essentially travels from some BFS to another until an optimal one is found.[1]

after-content-x4

Definitions[edit]

Preliminaries: equational form with linearly-independent rows[edit]

For the definitions below, we first present the linear program in the so-called equational form:

maximize
subject to

where:

after-content-x4

Any linear program can be converted into an equational form by adding slack variables.

As a preliminary clean-up step, we verify that:

  • The system
  • All m rows of the matrix

Feasible solution[edit]

A feasible solution of the LP is any vector

x0{displaystyle mathbf {x} geq 0}

such that

Ax=b{displaystyle Amathbf {x} =mathbf {b} }

. We assume that there is at least one feasible solution. If m = n, then there is only one feasible solution. Typically m < n, so the system

Ax=b{displaystyle Amathbf {x} =mathbf {b} }

has many solutions; each such solution is called a feasible solution of the LP.

Basis[edit]

A basis of the LP is a nonsingular submatrix of A, with all m rows and only m<n columns.

Sometimes, the term basis is used not for the submatrix itself, but for the set of indices of its columns. Let B be a subset of m indices from {1,…,n}. Denote by

AB{displaystyle A_{B}}

the square m-by-m matrix made of the m columns of

A{displaystyle A}

indexed by B. If

AB{displaystyle A_{B}}

is nonsingular, the columns indexed by B are a basis of the column space of

A{displaystyle A}

. In this case, we call B a basis of the LP.

Since the rank of

A{displaystyle A}

is m, it has at least one basis; since

A{displaystyle A}

has n columns, it has at most

(nm){displaystyle {binom {n}{m}}}

bases.

Basic feasible solution[edit]

Given a basis B, we say that a feasible solution

x{displaystyle mathbf {x} }

is a basic feasible solution with basis B if all its non-zero variables are indexed by B, that is, for all

jB:  xj=0{displaystyle jnot in B:~~x_{j}=0}

.

Properties[edit]

1. A BFS is determined only by the constraints of the LP (the matrix

A{displaystyle A}

and the vector

b{displaystyle mathbf {b} }

); it does not depend on the optimization objective.

2. By definition, a BFS has at most m non-zero variables and at least nm zero variables. A BFS can have less than m non-zero variables; in that case, it can have many different bases, all of which contain the indices of its non-zero variables.

3. A feasible solution

x{displaystyle mathbf {x} }

is basic if-and-only-if the columns of the matrix

AK{displaystyle A_{K}}

are linearly independent, where K is the set of indices of the non-zero elements of

x{displaystyle mathbf {x} }

.[1]: 45 

4. Each basis determines a unique BFS: for each basis B of m indices, there is at most one BFS

xB{displaystyle mathbf {x_{B}} }

with basis B. This is because

xB{displaystyle mathbf {x_{B}} }

must satisfy the constraint

ABxB=b{displaystyle A_{B}mathbf {x_{B}} =b}

, and by definition of basis the matrix

AB{displaystyle A_{B}}

is non-singular, so the constraint has a unique solution:

xB=AB1b{displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}

The opposite is not true: each BFS can come from many different bases. If the unique solution of

xB=AB1b{displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}

satisfies the non-negativity constraints

xB0{displaystyle mathbf {x_{B}} geq 0}

, then B is called a feasible basis.

5. If a linear program has an optimal solution (i.e., it has a feasible solution, and the set of feasible solutions is bounded), then it has an optimal BFS. This is a consequence of the Bauer maximum principle: the objective of a linear program is convex; the set of feasible solutions is convex (it is an intersection of hyperspaces); therefore the objective attains its maximum in an extreme point of the set of feasible solutions.

Since the number of BFS-s is finite and bounded by

(nm){displaystyle {binom {n}{m}}}

, an optimal solution to any LP can be found in finite time by just evaluating the objective function in all

(nm){displaystyle {binom {n}{m}}}

BFS-s. This is not the most efficient way to solve an LP; the simplex algorithm examines the BFS-s in a much more efficient way.

Examples[edit]

Consider a linear program with the following constraints:

x1+5x2+3x3+4x4+6x5=14x2+3x3+5x4+6x5=7i{1,,5}:xi0{displaystyle {begin{aligned}x_{1}+5x_{2}+3x_{3}+4x_{4}+6x_{5}&=14\x_{2}+3x_{3}+5x_{4}+6x_{5}&=7\forall iin {1,ldots ,5}:x_{i}&geq 0end{aligned}}}

The matrix A is:

A=(1534601356)     b=(14  7){displaystyle A={begin{pmatrix}1&5&3&4&6\0&1&3&5&6end{pmatrix}}~~~~~mathbf {b} =(14~~7)}

Here, m=2 and there are 10 subsets of 2 indices, however, not all of them are bases: the set {3,5} is not a basis since columns 3 and 5 are linearly dependent.

The set B={2,4} is a basis, since the matrix

AB=(5415){displaystyle A_{B}={begin{pmatrix}5&4\1&5end{pmatrix}}}

is non-singular.

The unique BFS corresponding to this basis is

xB=(0  2  0  1  0){displaystyle x_{B}=(0~~2~~0~~1~~0)}

.

Geometric interpretation[edit]

Elongated pentagonal orthocupolarotunda.png

The set of all feasible solutions is an intersection of hyperspaces. Therefore, it is a convex polyhedron. If it is bounded, then it is a convex polytope. Each BFS corresponds to a vertex of this polytope.[1]: 53–56 

Basic feasible solutions for the dual problem[edit]

As mentioned above, every basis B defines a unique basic feasible solution

xB=AB1b{displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}

. In a similar way, each basis defines a solution to the dual linear program:

minimize
subject to

The solution is

yB=ABT1c{displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}

.

Finding an optimal BFS[edit]

There are several methods for finding a BFS that is also optimal.

Using the simplex algorithm[edit]

In practice, the easiest way to find an optimal BFS is to use the simplex algorithm. It keeps, at each point of its execution, a “current basis” B (a subset of m out of n variables), a “current BFS”, and a “current tableau”. The tableau is a representation of the linear program where the basic variables are expressed in terms of the non-basic ones:[1]: 65 

where

xB{displaystyle x_{B}}

is the vector of m basic variables,

xN{displaystyle x_{N}}

is the vector of n non-basic variables, and

z{displaystyle z}

is the maximization objective. Since non-basic variables equal 0, the current BFS is

p{displaystyle p}

, and the current maximization objective is

z0{displaystyle z_{0}}

.

If all coefficients in

r{displaystyle r}

are negative, then

z0{displaystyle z_{0}}

is an optimal solution, since all variables (including all non-basic variables) must be at least 0, so the second line implies

zz0{displaystyle zleq z_{0}}

.

If some coefficients in

r{displaystyle r}

are positive, then it may be possible to increase the maximization target. For example, if

x5{displaystyle x_{5}}

is non-basic and its coefficient in

r{displaystyle r}

is positive, then increasing it above 0 may make

z{displaystyle z}

larger. If it is possible to do so without violating other constraints, then the increased variable becomes basic (it “enters the basis”), while some basic variable is decreased to 0 to keep the equality constraints and thus becomes non-basic (it “exits the basis”).

If this process is done carefully, then it is possible to guarantee that

z{displaystyle z}

increases until it reaches an optimal BFS.

Converting any optimal solution to an optimal BFS[edit]

In the worst case, the simplex algorithm may require exponentially many steps to complete. There are algorithms for solving an LP in weakly-polynomial time, such as the ellipsoid method; however, they usually return optimal solutions that are not basic.

However, Given any optimal solution to the LP, it is easy to find an optimal feasible solution that is also basic.[2]: see also “external links” below. 

Finding a basis that is both primal-optimal and dual-optimal[edit]

A basis B of the LP is called dual-optimal if the solution

yB=ABT1c{displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}

is an optimal solution to the dual linear program, that is, it minimizes

bTy{textstyle mathbf {b^{T}} mathbf {y} }

. In general, a primal-optimal basis is not necessarily dual-optimal, and a dual-optimal basis is not necessarily primal-optimal (in fact, the solution of a primal-optimal basis may even be unfeasible for the dual, and vice versa).

If both

xB=AB1b{displaystyle mathbf {x_{B}} ={A_{B}}^{-1}cdot b}

is an optimal BFS of the primal LP, and

yB=ABT1c{displaystyle mathbf {y_{B}} ={A_{B}^{T}}^{-1}cdot c}

is an optimal BFS of the dual LP, then the basis B is called PD-optimal. Every LP with an optimal solution has a PD-optimal basis, and it is found by the Simplex algorithm. However, its run-time is exponential in the worst case. Nimrod Megiddo proved the following theorems:[2]

  • There exists a strongly polynomial time algorithm that inputs an optimal solution to the primal LP and an optimal solution to the dual LP, and returns an optimal basis.
  • If there exists a strongly polynomial time algorithm that inputs an optimal solution to only the primal LP (or only the dual LP) and returns an optimal basis, then there exists a strongly-polynomial time algorithm for solving any linear program (the latter is a famous open problem).

Megiddo’s algorithms can be executed using a tableau, just like the simplex algorithm.

External links[edit]

References[edit]


after-content-x4