Portfolio

Here are my research/engineering works. Most code and updates live on GitHub.

GitHub: @TheRoadQaQ


ReLIFT

Training method that interleaves RL with online fine-tuning.

SGL (PKU-DAIR)

Scalable graph learning toolkit for large-scale graph datasets.

DataFlex (OpenDCAI)

A data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.