Hi, thanks for your great work!
I saw causal models were on the roadmap.
For rCM, the DMD component can easily be replaced with Causal Forcing or Self Forcing to keep the rollout on manifold. However I was wondering what the planned approach was for adjusting the sCM loss component so the student can be conditioned on its own rollout? Will using a bidirectional teacher and a causal student conditioned on its own generated sequence be stable for sCM?
Is sCM too sensitive to conditioning mismatch, requiring one of these?
- a different method (sCT?)
- a teacher initialized on self-generated sequences (e.g. resampling forcing)
- perhaps for the sCM path the student and teacher don't need a self-generated sequence (teacher forced ground truth for both models)?
- something else?
Appreciate any insights.
Hi, thanks for your great work!
I saw causal models were on the roadmap.
For rCM, the DMD component can easily be replaced with Causal Forcing or Self Forcing to keep the rollout on manifold. However I was wondering what the planned approach was for adjusting the sCM loss component so the student can be conditioned on its own rollout? Will using a bidirectional teacher and a causal student conditioned on its own generated sequence be stable for sCM?
Is sCM too sensitive to conditioning mismatch, requiring one of these?
Appreciate any insights.