CV
Summary
My research interests primarily focus on LLM Agents automation including both expanding the range of agent applications via agent itself and using agents to discover new Nobel Prize level laws/rules. I believe combining search with powerful LLM is the way to AGI. I have published several papers at prominent NLP conferences, including ICLR, ICML, EMNLP, CIKM, and NAACL.
Education
- M.S. in Dept. of Artificial Intelligence2023.9-2026.6Beijing University of Posts and Telecommunications (BUPT)
- B.S. in Dept. of Electronic Engineering2019.9-2023.6Beijing University of Posts and Telecommunications (BUPT)GPA: 93.18/100 (3.88/4.0)
Visiting Experience
- C3I Group2022.10 - 2024.8Tsinghua University
- GAIR2025.2 - NowSII & SJTU
Industry Experience
- Meituan, NLP Center2024.1 - 2025.3Algorithm InternResearch Intern on Agent and data synthesis.
Publications
- InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research2025ICLR 2026
- daVinci-Env: Open SWE Environment Synthesis at Scale2026working in progress
- daVinci-Dev: Agent-native Mid-training for Software Engineering2026ICML 2026 spotlight
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently2026working in progress
- AgentRefine: Enhancing Agent Generalization through Refinement Tuning2025ICLR 2025
- DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments2025EMNLP 2025 Main
- MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making2024EMNLP 2024 Main
- How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data2024EMNLP 2024 Main
- CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery2024ICLR 2025
- PreAct: Prediction Enhances Agent's Planning Ability2024Coling 2025
- Agencybench: Benchmarking the frontiers of autonomous agents in 1m-token real-world contexts2026ACL 2026
- On Large Language Models' Hallucination with Regard to Known Facts2024NAACL 2024 Main
- DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations2024NAACL 2024 Findings
- A multi-task semantic decomposition framework with task-specific pre-training for few-shot ner2023CIKM 2023
- A prototypical semantic decoupling method via joint contrastive learning for few-shot named entity recognition2023ICASSP 2023
- Revisit out-of-vocabulary problem for slot filling: A unified contrastive framework with multi-level data augmentationss2023ICASSP 2023
- Semi-supervised knowledge-grounded pre-training for task-oriented dialog systems2022SereTOD2022, EMNLP 2022 Workshop
Skills
Programming
- C++
- Python
Deep Learning Frameworks
- LLaMA-Factory
- Deepspeed
- ray
- verl
Languages
Chinese - Native
English
Honors & Awards
- Excellent First-class Scholarship for Master Students, BUPT2023, 2024
- the 1st Award on SereTOD Challenge 2022 track 22022
- National Scholarship(Top 1%)2021
- First Prize in The Chinese Mathematics Competitions2020, 2021, 2022
Service and leadership
Reviewer for
- ICLR 2025, ICLR 2026
- ICML 2026
- COLM 2026
- ACL ARR 2024 June, August, October, December
- ACL ARR 2025 February, May, October
- ACL ARR 2026 January, March