Portfolio

Here are my research and engineering projects. Most code and updates live on GitHub.

GitHub: @TheRoadQaQ

ReLIFT

A training method that interleaves reinforcement learning (RL) with online fine-tuning.

DataFlow (OpenDCAI)

LLM-based operators and pipelines for data preparation.

SGL (PKU-DAIR)

Scalable graph learning toolkit for large-scale graph datasets.

DataFlex (OpenDCAI)

A data-centric training framework that improves model performance by selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

Lu Ma
