李书亮 currently works at Baidu, doing research on large-model inference and reinforcement learning.

latest posts
Mar 04, 2026 A Summary of Policy Gradient Algorithms
Feb 03, 2026 LoRA 1: A First Look at LoRA

selected publications