Archive

10 articles in total

2024

2024-08-23 inner structure of transformer models
2024-08-09 240809 - Rain in Beijing
2024-07-29 Notes on Pattern Recognition and Machine Learning
2024-07-13 Offline RLHF Methods That Humans Are About to Phase Out
2024-07-07 Mirror Descent (Bubeck 1-9)
2024-06-27 Lenticular Grating Imaging
2024-06-15 nonsense
2024-06-14 240613 - Scientists
2024-06-05 240605 - A Course I Really Like
2024-06-02 Inverse Reinforcement Learning