Quadratic unconstrained binary optimization – Wikipedia
Quadratic unconstrained binary optimization (QUBO), also known as unconstrained binary quadratic programming (UBQP), is a combinatorial optimization problem with a wide range of applications from finance and economics to machine learning.[1] QUBO is an NP-hard problem, and for many classical problems from theoretical computer science, like maximum cut, graph coloring and the partition problem, embeddings into QUBO have been formulated.[2][3]
Embeddings for machine learning models include support-vector machines, clustering and probabilistic graphical models.[4]
Moreover, due to its close connection to Ising models, QUBO constitutes a central problem class for adiabatic quantum computation, where it is solved through a physical process called quantum annealing.[5]
Definition
The set of binary vectors of a fixed length $n > 0$ is denoted by $\mathbb{B}^n$, where $\mathbb{B} = \{0, 1\}$. We are given a real-valued upper triangular matrix $Q \in \mathbb{R}^{n \times n}$, whose entries $Q_{ij}$ define a weight for each pair of indices $i, j \in \{1, \dots, n\}$. We can define a function $f_Q : \mathbb{B}^n \to \mathbb{R}$ that assigns a value to each binary vector through

$$f_Q(x) = x^\top Q x = \sum_{i=1}^{n} \sum_{j=i}^{n} Q_{ij} x_i x_j$$
Intuitively, the weight $Q_{ij}$ is added if both $x_i$ and $x_j$ are 1. When $i = j$, the value $Q_{ii}$ is added if $x_i = 1$, as $x_i x_i = x_i$ for all $x_i \in \mathbb{B}$.
The QUBO problem consists of finding a binary vector $x^*$ that is minimal with respect to $f_Q$, namely

$$x^* = \underset{x \in \mathbb{B}^n}{\arg\min}~f_Q(x)$$
In general, $x^*$ is not unique, meaning there may be a set of minimizing vectors with equal value w.r.t. $f_Q$. The complexity of QUBO arises from the number of candidate binary vectors to be evaluated, as $|\mathbb{B}^n| = 2^n$ grows exponentially in $n$.
Sometimes, QUBO is defined as the problem of maximizing $f_Q$, which is equivalent to minimizing $f_{-Q} = -f_Q$.
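The definition above can be illustrated with a brute-force search over all $2^n$ binary vectors, which is only feasible for small $n$; the function name and example matrix are illustrative:

```python
from itertools import product

import numpy as np

def solve_qubo_brute_force(Q):
    """Minimize f_Q(x) = x^T Q x over all binary vectors x of length n.

    Only feasible for small n, since the search space has 2^n candidates.
    """
    n = Q.shape[0]
    best_x, best_val = None, float("inf")
    for bits in product((0, 1), repeat=n):
        x = np.array(bits)
        # For an upper triangular Q, x @ Q @ x equals sum_{i<=j} Q_ij x_i x_j
        val = x @ Q @ x
        if val < best_val:
            best_x, best_val = x, val
    return best_x, best_val

# Example: a small upper triangular Q
Q = np.array([[-1.0,  2.0],
              [ 0.0, -1.0]])
x_star, f_star = solve_qubo_brute_force(Q)
# Here the minimum value is -1, attained by setting exactly one bit to 1
```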
Properties
- Multiplying the coefficients $Q_{ij}$ with a positive factor $\alpha > 0$ scales $f$ accordingly, leaving the optimum $x^*$ unchanged:
  $$f_{\alpha Q}(x) = \sum_{i \leq j} (\alpha Q_{ij}) x_i x_j = \alpha \sum_{i \leq j} Q_{ij} x_i x_j = \alpha f_Q(x)$$
- Flipping the sign of all coefficients flips the sign of $f$'s output, making $x^*$ the binary vector that maximizes $f_{-Q}$:
  $$f_{-Q}(x) = \sum_{i \leq j} (-Q_{ij}) x_i x_j = -\sum_{i \leq j} Q_{ij} x_i x_j = -f_Q(x)$$
- If all coefficients are positive, the optimum is trivially $x^* = (0, \dots, 0)$. Similarly, if all coefficients are negative, the optimum is $x^* = (1, \dots, 1)$.
- If $Q_{ij} = 0$ for all $i \neq j$, the bits can be optimized independently, and the corresponding QUBO problem is solvable in $\mathcal{O}(n)$ time, the optimal variable assignments $x_i^*$ simply being 1 if $Q_{ii} < 0$ and 0 otherwise.
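The diagonal special case from the last property can be sketched in a few lines; the function name is illustrative:

```python
import numpy as np

def solve_diagonal_qubo(Q):
    """Solve a QUBO whose off-diagonal entries are all zero.

    Each bit contributes Q_ii * x_i independently, so setting x_i = 1
    exactly when Q_ii < 0 minimizes f_Q in O(n) time.
    """
    diag = np.diag(Q)
    x_star = (diag < 0).astype(int)
    return x_star, float(diag[diag < 0].sum())

x_star, f_star = solve_diagonal_qubo(np.diag([-2.0, 3.0, -1.0]))
# x_star = [1, 0, 1], f_star = -3.0
```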
Applications
QUBO is a structurally simple, yet computationally hard optimization problem.
It can be used to encode a wide range of optimization problems from various scientific areas.[6]
Cluster Analysis
[Figure: Visual representation of a clustering problem with 20 points, comparing a bad and a good cluster assignment. Circles of the same color belong to the same cluster. Each circle can be understood as a binary variable in the corresponding QUBO problem.]
As an illustrative example of how QUBO can be used to encode an optimization problem, we consider the problem of cluster analysis.
Here, we are given a set of 20 points in 2D space, described by a matrix $D \in \mathbb{R}^{20 \times 2}$. We want to assign each point to one of two classes or clusters, such that points in the same cluster are similar to each other. For two clusters, we can assign a binary variable $x_i \in \mathbb{B}$ to the point corresponding to the $i$-th row in $D$, indicating whether it belongs to the first ($x_i = 0$) or second cluster ($x_i = 1$). Consequently, we have 20 binary variables, which form a binary vector $x \in \mathbb{B}^{20}$ that corresponds to a cluster assignment of all points (see figure).
One way to derive a clustering is to consider the pairwise distances between points. Given a cluster assignment $x$, the values $x_i x_j$ or $(1 - x_i)(1 - x_j)$ evaluate to 1 if points $i$ and $j$ are in the same cluster. Similarly, $x_i(1 - x_j)$ or $(1 - x_i)x_j$ evaluate to 1 if they are in different clusters. Let $d_{ij}$ denote the Euclidean distance between points $i$ and $j$. In order to define a cost function to minimize, we add the positive distance $d_{ij}$ when points $i$ and $j$ are in the same cluster, and subtract it when they are in different clusters. This way, an optimal solution tends to place points which are far apart into different clusters, and points that are close into the same cluster. The cost function thus comes down to

$$f(x) = \sum_{i<j} d_{ij}\left(x_i x_j + (1 - x_i)(1 - x_j)\right) - d_{ij}\left(x_i(1 - x_j) + (1 - x_i)x_j\right) = \sum_{i<j} \left[4 d_{ij} x_i x_j - 2 d_{ij} x_i - 2 d_{ij} x_j + d_{ij}\right]$$
From the second line, the QUBO parameters can easily be found by rearranging (the constant offset $\sum_{i<j} d_{ij}$ does not affect the optimum and can be dropped):

$$Q_{ij} = \begin{cases} 4 d_{ij} & \text{if } i \neq j \\ -2\left(\sum\limits_{k=1}^{i-1} d_{ki} + \sum\limits_{\ell=i+1}^{n} d_{i\ell}\right) & \text{if } i = j \end{cases}$$

Note the factor 2 on the diagonal: each pair $i < j$ contributes $-2 d_{ij}$ to the linear terms of both $x_i$ and $x_j$.
Using these parameters, the optimal QUBO solution will correspond to an optimal clustering w.r.t. the above cost function.
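The construction above can be sketched as follows, using a brute-force search over a smaller instance (6 points instead of 20, so that all $2^n$ assignments can be enumerated); all names and the synthetic data are illustrative:

```python
from itertools import product

import numpy as np

rng = np.random.default_rng(0)
# Two well-separated 2D blobs of 3 points each
points = np.vstack([rng.normal(0.0, 0.5, (3, 2)),
                    rng.normal(5.0, 0.5, (3, 2))])
n = len(points)

# Pairwise Euclidean distances d_ij
d = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)

# Upper triangular QUBO matrix from the clustering cost function
Q = np.zeros((n, n))
for i in range(n):
    for j in range(i + 1, n):
        Q[i, j] = 4 * d[i, j]
    # Linear terms collected on the diagonal, using x_i * x_i = x_i
    Q[i, i] = -2 * (d[:i, i].sum() + d[i, i + 1:].sum())

# Brute-force minimization of f_Q(x) = x^T Q x
best_x = min((np.array(b) for b in product((0, 1), repeat=n)),
             key=lambda x: x @ Q @ x)
# The optimum separates the two blobs into the two clusters
```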
Connection to Ising models
QUBO is very closely related and computationally equivalent to the Ising model, whose Hamiltonian function is defined as
$$H(\sigma) = -\sum_{\langle i~j\rangle} J_{ij} \sigma_i \sigma_j - \mu \sum_{j} h_j \sigma_j$$
with real-valued parameters $h_j$, $J_{ij}$ and $\mu$ for all $i, j$. The spin variables $\sigma_j$ are binary with values from $\{-1, +1\}$ instead of $\mathbb{B}$. Moreover, in the Ising model the variables are typically arranged in a lattice where only neighboring pairs of variables $\langle i~j\rangle$ can have non-zero coefficients. Applying the identity $\sigma_j \mapsto 2x_j - 1$ yields an equivalent QUBO problem:[2]

$$\begin{aligned} f(x) &= \sum_{\langle i~j\rangle} -J_{ij}(2x_i - 1)(2x_j - 1) - \sum_j \mu h_j (2x_j - 1) \\ &= \sum_{\langle i~j\rangle} \left[-4J_{ij} x_i x_j + 2J_{ij} x_i + 2J_{ij} x_j - J_{ij}\right] - \sum_j \left[2\mu h_j x_j - \mu h_j\right] && \text{using } x_j = x_j x_j \\ &= \sum_{i=1}^{n} \sum_{j=1}^{i} Q_{ij} x_i x_j + C \end{aligned}$$

where

$$\begin{aligned} Q_{ij} &= \begin{cases} -4J_{ij} & \text{if } i \neq j \\ \sum_{\langle k~i\rangle} 2J_{ki} + \sum_{\langle i~\ell\rangle} 2J_{i\ell} - 2\mu h_i & \text{if } i = j \end{cases} \\ C &= -\sum_{\langle i~j\rangle} J_{ij} + \sum_j \mu h_j \end{aligned}$$
As the constant $C$ does not change the position of the optimum $x^*$, it can be neglected during optimization and is only important for recovering the original Hamiltonian function value.
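The conversion can be checked numerically. The sketch below assumes a dense upper triangular coupling matrix $J$ (with all pairs interacting, rather than a lattice neighborhood); all names are illustrative:

```python
from itertools import product

import numpy as np

def ising_to_qubo(J, h, mu=1.0):
    """Convert an Ising model H(s) = -sum_{i<j} J_ij s_i s_j - mu * sum_j h_j s_j
    (spins s in {-1,+1}, J upper triangular) into a QUBO matrix Q and constant
    offset C such that H(2x - 1) = x^T Q x + C for all binary vectors x."""
    n = len(h)
    Q = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            Q[i, j] = -4 * J[i, j]
        # Linear terms go on the diagonal, since x_i * x_i = x_i
        Q[i, i] = 2 * J[i, i + 1:].sum() + 2 * J[:i, i].sum() - 2 * mu * h[i]
    C = -J[np.triu_indices(n, k=1)].sum() + mu * h.sum()
    return Q, C

# Sanity check on a random 4-spin instance: verify H(2x - 1) = f_Q(x) + C
rng = np.random.default_rng(1)
n = 4
J = np.triu(rng.normal(size=(n, n)), k=1)
h = rng.normal(size=n)
Q, C = ising_to_qubo(J, h)
for bits in product((0, 1), repeat=n):
    x = np.array(bits)
    s = 2 * x - 1
    H = -sum(J[i, j] * s[i] * s[j]
             for i in range(n) for j in range(i + 1, n)) - h @ s
    assert np.isclose(x @ Q @ x + C, H)
```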
References
- ^ Kochenberger, Gary; Hao, Jin-Kao (2014). “The unconstrained binary quadratic programming problem: a survey” (PDF). Journal of Combinatorial Optimization. 28: 58–81. doi:10.1007/s10878-014-9734-0. S2CID 16808394.
- ^ a b Glover, Fred; Kochenberger, Gary (2019). “A Tutorial on Formulating and Using QUBO Models”. arXiv:1811.11538 [cs.DS].
- ^ Lucas, Andrew (2014). “Ising formulations of many NP problems”. Frontiers in Physics. 2: 5. arXiv:1302.5843. Bibcode:2014FrP.....2....5L. doi:10.3389/fphy.2014.00005.
- ^ Mücke, Sascha; Piatkowski, Nico; Morik, Katharina (2019). “Learning Bit by Bit: Extracting the Essence of Machine Learning” (PDF). LWDA. S2CID 202760166. Archived from the original (PDF) on 2020-02-27.
- ^ Tom Simonite (8 May 2013). “D-Wave’s Quantum Computer Goes to the Races, Wins”. MIT Technology Review. Retrieved 12 May 2013.
- ^ Ratke, Daniel (2021-06-10). “List of QUBO formulations”. Retrieved 2022-12-16.
External links
- QUBO Benchmark (Benchmark of software packages for the exact solution of QUBOs; part of the well-known Mittelmann benchmark collection)