Shrinkage Fields (image restoration) – Wikipedia

Shrinkage fields is a random field-based machine learning technique that aims to perform high-quality image restoration (denoising and deblurring) with low computational overhead.

The restored image $x$ is predicted from a corrupted observation $y$ after training on a set of sample images $S$.

A shrinkage (mapping) function $f_{\pi_i}(v) = \sum_{j=1}^{M} \pi_{i,j} \exp\left(-\frac{\gamma}{2}\left(v - \mu_j\right)^2\right)$ is directly modeled as a linear combination of radial basis function kernels, where $\gamma$ is the shared precision parameter, $\mu$ denotes the (equidistant) kernel positions, and $M$ is the number of Gaussian kernels.
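The shrinkage function can be evaluated directly from its definition. The following is a minimal NumPy sketch, not a reference implementation: the function name `shrinkage`, the choice of five kernels on $[-1, 1]$, the weights, and the value of $\gamma$ are all illustrative assumptions rather than learned values.

```python
import numpy as np

def shrinkage(v, weights, centers, gamma):
    """Evaluate f(v) = sum_j weights[j] * exp(-gamma/2 * (v - centers[j])**2)."""
    v = np.asarray(v, dtype=float)
    # Broadcast v against the M kernel centers: shape v.shape + (M,)
    diff = v[..., np.newaxis] - centers
    kernels = np.exp(-0.5 * gamma * diff ** 2)
    # Weighted sum over the kernel axis gives an array shaped like v
    return kernels @ weights

# Hypothetical setup: M = 5 equidistant kernel positions on [-1, 1]
centers = np.linspace(-1.0, 1.0, 5)
weights = np.array([-0.5, -0.2, 0.0, 0.2, 0.5])  # illustrative, not learned
gamma = 4.0  # shared precision, chosen arbitrarily

out = shrinkage(np.array([-1.0, 0.0, 1.0]), weights, centers, gamma)
```

Because the illustrative weights are antisymmetric about zero, the resulting function is odd, so it maps a zero filter response to zero.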

Because the shrinkage function is directly modeled, the optimization procedure is reduced to a single quadratic minimization per iteration, denoted as the prediction of a shrinkage field

$g_{\Theta}(x) = \mathcal{F}^{-1}\left[\frac{\mathcal{F}\left(\lambda K^{T} y + \sum_{i=1}^{N} F_i^{T} f_{\pi_i}\left(F_i x\right)\right)}{\lambda \check{K}^{*} \circ \check{K} + \sum_{i=1}^{N} \check{F}_i^{*} \circ \check{F}_i}\right] = \Omega^{-1} \eta$

where $\mathcal{F}$ denotes the discrete Fourier transform, $F x$ is the 2D convolution $f \otimes x$ with point spread function filter $f$, $\check{F}$ is an optical transfer function defined as the discrete Fourier transform of $f$, and $\check{F}^{*}$ is the complex conjugate of $\check{F}$.
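The prediction formula above can be sketched with NumPy's FFT routines. This is a minimal illustration under stated assumptions, not the authors' implementation: circular boundary conditions are assumed, the helper names `psf2otf`, `rbf_shrinkage`, and `csf_stage` are hypothetical, and the filters and shrinkage weights are hand-picked rather than learned.

```python
import numpy as np

def psf2otf(psf, shape):
    """Zero-pad a small filter to `shape`, shift its center to index (0, 0),
    and return its 2D DFT (the optical transfer function)."""
    padded = np.zeros(shape)
    kh, kw = psf.shape
    padded[:kh, :kw] = psf
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    return np.fft.fft2(padded)

def rbf_shrinkage(v, weights, centers, gamma):
    """Linear combination of Gaussian RBF kernels, applied elementwise."""
    diff = v[..., np.newaxis] - centers
    return np.exp(-0.5 * gamma * diff ** 2) @ weights

def csf_stage(x, y, blur, filters, weights, centers, gamma, lam):
    """One prediction: the quadratic problem is solved in the Fourier domain."""
    K = psf2otf(blur, y.shape)
    numerator = lam * np.conj(K) * np.fft.fft2(y)    # DFT of lam * K^T y
    denominator = lam * np.abs(K) ** 2               # lam * conj(K) o K
    X = np.fft.fft2(x)
    for f, w in zip(filters, weights):
        F = psf2otf(f, y.shape)
        response = np.real(np.fft.ifft2(F * X))      # F_i x (circular convolution)
        shrunk = rbf_shrinkage(response, w, centers, gamma)
        numerator += np.conj(F) * np.fft.fft2(shrunk)  # DFT of F_i^T f(F_i x)
        denominator += np.abs(F) ** 2
    return np.real(np.fft.ifft2(numerator / denominator))

# Illustrative denoising setup (identity blur, two derivative filters)
rng = np.random.default_rng(0)
y = rng.random((16, 16))
blur = np.array([[1.0]])
filters = [np.array([[1.0, -1.0]]), np.array([[1.0], [-1.0]])]
centers = np.linspace(-1.0, 1.0, 5)
weights = [np.array([-0.5, -0.2, 0.0, 0.2, 0.5])] * 2
x1 = csf_stage(y, y, blur, filters, weights, centers, gamma=4.0, lam=1.0)
```

Since every operation is either elementwise or an FFT, the cost of one stage is dominated by the transforms, and the division is well defined whenever the denominator is nonzero (here the identity blur with a positive `lam` guarantees that).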

The estimate $\hat{x}_t$ is obtained as $\hat{x}_t = g_{\Theta_t}\left(\hat{x}_{t-1}\right)$ for each iteration $t$, with the initial case $\hat{x}_0 = y$. This forms a cascade of Gaussian conditional random fields (or cascade of shrinkage fields, CSF). Loss minimization is used to learn the model parameters $\Theta_t = \left\{\lambda_t, \pi_{ti}, f_{ti}\right\}_{i=1}^{N}$.
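The cascade recursion can be sketched independently of how each stage is implemented. In the following illustration, `run_cascade` and `make_toy_stage` are hypothetical names, and the toy stage is only a stand-in with its own per-stage parameter; it is not a real shrinkage field prediction.

```python
import numpy as np

def run_cascade(y, stages):
    """Apply x_t = g_t(x_{t-1}) for t = 1..T, starting from x_0 = y.
    Each stage also receives the observation y, as in the prediction formula."""
    x = y
    estimates = []
    for g in stages:
        x = g(x, y)
        estimates.append(x)
    return estimates

def make_toy_stage(lam):
    """Stand-in for g_{Theta_t}: blends the observation with a local average
    of the current estimate. NOT a shrinkage field, only a placeholder."""
    def stage(x, y):
        smooth = (x + np.roll(x, 1, 0) + np.roll(x, -1, 0)
                    + np.roll(x, 1, 1) + np.roll(x, -1, 1)) / 5.0
        return (lam * y + smooth) / (lam + 1.0)
    return stage

rng = np.random.default_rng(0)
y = rng.random((16, 16))
# Each stage carries its own parameters, mirroring Theta_t
stages = [make_toy_stage(lam) for lam in (2.0, 1.0, 0.5)]
estimates = run_cascade(y, stages)
```

Keeping every intermediate estimate is a design choice that suits stage-wise training, where the output of stage $t-1$ serves as the input when fitting stage $t$.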

The learning objective function is defined as $J\left(\Theta_t\right) = \sum_{s=1}^{S} l\left(\hat{x}_t^{(s)}; x_{gt}^{(s)}\right)$, where $l$ is a differentiable loss function which is greedily minimized using training data $\left\{x_{gt}^{(s)}, y^{(s)}, k^{(s)}\right\}_{s=1}^{S}$ and $\hat{x}_t^{(s)}$.
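Evaluating the objective amounts to summing a loss over the training triples. The sketch below only computes $J$ and does not perform the minimization. The squared-error loss is an assumption chosen for simplicity (the text only requires a differentiable loss), and `objective`, `squared_error`, and the synthetic training set are hypothetical.

```python
import numpy as np

def squared_error(x_hat, x_gt):
    """Assumed loss l: sum of squared differences (any differentiable loss works)."""
    return float(np.sum((x_hat - x_gt) ** 2))

def objective(predict, training_set, loss=squared_error):
    """J = sum over samples s of loss(prediction for sample s, ground truth s).
    `predict(y, k)` stands in for the current stage applied to observation y
    with blur kernel k."""
    return sum(loss(predict(y, k), x_gt) for x_gt, y, k in training_set)

# Synthetic training set: each observation is the ground truth plus 0.1
rng = np.random.default_rng(0)
kernel = np.array([[1.0]])  # identity blur kernel
training_set = []
for _ in range(3):
    x_gt = rng.random((8, 8))
    training_set.append((x_gt, x_gt + 0.1, kernel))

J_identity = objective(lambda y, k: y, training_set)        # leaves y untouched
J_corrected = objective(lambda y, k: y - 0.1, training_set)  # removes the offset
```

In a greedy scheme, such an objective would be minimized for one stage at a time, with earlier stages held fixed.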

Performance

Preliminary tests by the author suggest that $\text{RTF}_5$ [1] obtains slightly better denoising performance than $\text{CSF}_{7 \times 7}^{\{3,4,5\}}$, followed by $\text{CSF}_{5 \times 5}^{5}$, $\text{CSF}_{7 \times 7}^{2}$, $\text{CSF}_{5 \times 5}^{\{3,4\}}$, and BM3D.

BM3D denoising speed falls between that of $\text{CSF}_{5 \times 5}^{4}$ and $\text{CSF}_{7 \times 7}^{4}$, with RTF being an order of magnitude slower.

Advantages

Results are comparable to those obtained by BM3D (the reference in state-of-the-art denoising since its inception in 2007)
Minimal runtime compared to other high-performance methods (potentially applicable within embedded devices)
Parallelizable (e.g., a GPU implementation is possible)
Predictability: $O(D \log D)$ runtime, where $D$ is the number of pixels
Fast training even with CPU

Implementations

References

[1] Jancsary, Jeremy; Nowozin, Sebastian; Sharp, Toby; Rother, Carsten (10 April 2012). Regression Tree Fields – An Efficient, Non-parametric Approach to Image Labeling Problems. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR). Providence, RI, USA: IEEE Computer Society. doi:10.1109/CVPR.2012.6247950.
