Chang ZENG
Importance Sampling
Adaptive Granularity Importance Sampling for Policy Optimization
A research note on adaptive granularity importance sampling for policy optimization, focusing on speech token dependencies and segment-level weighting strategies such as MASPO-Fixed, MASPO-LogProb, and MASPO-TokenVal.
Chang ZENG 曾畅 曾 暢 (ソウ チョウ)
Mar 11, 2026
9 min read
Research Notes
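Adaptive Granularity Importance Sampling for Policy Optimization (Chinese)
A research note on adaptive granularity importance sampling in policy optimization, focusing on the dependency structure of speech tokens and segmentation strategies such as MASPO-Fixed, MASPO-LogProb, and MASPO-TokenVal.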
Chang ZENG 曾畅 曾 暢 (ソウ チョウ)
Mar 11, 2026
3 min read
Research Notes