CVAE Post-Training Methodology for Latent Strategy Disentanglement
CVAE Post-Training Methodology for Latent Strategy Disentanglement
This is the landing page for the core CVAE methodology thread. It keeps the methodology readable as one coherent cluster while surfacing the most reusable pieces as linked concept notes.
Core Structure
Related Loss Notes
- Baseline-normalized reconstruction
- Discrete latent VAEs
- Token-weighted excess reconstruction
- Inter-latent divergence
- InfoVAE connections
- Sequence-level divergence
- Synthetic sequences experiment
Summary
The central idea is to factor a frozen entangled conditional generator into:
- a strategy router ,
- a strategy-conditioned generator ,
- and a training-time inference model .
The methodology covers both discrete and continuous latent variants, multiple adaptation regimes, and a family of ELBO-adjacent interventions aimed at making the latent variable carry strategy information rather than collapsing.