Score-based Generative Model through SDE

1. Notations

The meaning of :

For scalar function , we know is the gradient.

For vector function , is the divergence, defined as the sum of the partial derivatives of each component:

For matrix function , its divergence is a vector, defined as:

or written in a more general form:

where is the th row of .

2. Score-based Diffusion through SDE (Unconditional)

SDE of forward process(noise):

SDE of reverse process(denoise):

probability flow ODE:

3. Training (Unconditional)

  • Sliced Score Matching:
  • Denoising Score Matching:

4. Score-based Diffusion through SDE (Conditional)

We can train a neural network to learn

We can also use some prior knowledge to directly determine

5. Special Case

If s.t. , then the reverse process can be written as:

6. Denoising Score Matching

Assume forward diffusion process can be written as , then

minimize

  • Equivalence of Epsilon Model and Score Model
  • Unconditional Score Matching

  • Conditional (Loss Guidance) Score Matching

7. VPSDE(Continuous DDPM) (Unconditional)

Because , we have:

SDE of forward process(noise):

SDE of reverse process(denoise):

参数对比

modelbetan_step
DDPM0.0001-0.021000
VPSDE0.1-201000

8. VESDE

离散情况下: