CVAE Objective and Loss Terms

Back to CVAE methodology

This note centralizes the loss-level notes that were already natural atomic units in the original methodology thread.

Core Objective

The base training picture is the conditional ELBO:

LELBO=Ezqξ(zx,y)[logpψ(yx,z)]βKL ⁣(qξ(zx,y)pϕ(zx)).\mathcal{L}_{\mathrm{ELBO}} = \mathbb{E}_{z \sim q_\xi(z \mid x, y)}[\log p_\psi(y \mid x, z)] - \beta\, \mathrm{KL}\!\left(q_\xi(z \mid x, y)\,\|\,p_\phi(z \mid x)\right).

The project’s methodology notes mainly study why this objective is under-specified in the entangled-initialized regime, and how to alter it so that zz carries strategy semantics rather than collapsing.

Atomic Loss Notes

How To Read This Cluster

Recommended order

Start with problem setup and model components. Then read token-weighted excess reconstruction and baseline-normalized reconstruction. Read inter-latent divergence and InfoVAE connections as the main exclusivity / information-flow add-ons.

Built with LogoFlowershow