Lu Ma


About Me

I am a Master's student at Peking University, advised by Professor Bin Cui. My research interests include large language models, AI agents, and data scaling.

Before this, I obtained my bachelor’s degree in Computer Science from Renmin University of China. I have also interned at Tencent AI Lab and Meituan LongCat.

I am currently seeking summer internship opportunities. Please feel free to reach out if you are interested in collaboration.

Selected Projects

  1. Learning What Reinforcement Learning Can’t: Interleaved Online Fine-Tuning for Hardest Questions
    Lu Ma, Hao Liang, Meiyi Qiang, and 9 more authors
    ICLR
  2. Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model
    Yanhao Li†, Lu Ma†, Jiaran Zhang, and 3 more authors
    ACL
  3. DataFlex
    2025
  4. DataFlow
    2025

† denotes equal contribution