CV

Summary

My research interests primarily focus on LLM Agents automation including both expanding the range of agent applications via agent itself and using agents to discover new Nobel Prize level laws/rules. I believe combining search with powerful LLM is the way to AGI. I have published several papers at prominent NLP conferences, including ICLR, ICML, EMNLP, CIKM, and NAACL.

Education

  • M.S. in Dept. of Artificial Intelligence
    2023.9-2026.6
    Beijing University of Posts and Telecommunications (BUPT)
  • B.S. in Dept. of Electronic Engineering
    2019.9-2023.6
    Beijing University of Posts and Telecommunications (BUPT)
    GPA: 93.18/100 (3.88/4.0)

Visiting Experience

  • C3I Group
    2022.10 - 2024.8
    Tsinghua University
  • GAIR
    2025.2 - Now
    SII & SJTU

Industry Experience

  • Meituan, NLP Center
    2024.1 - 2025.3
    Algorithm Intern
    Research Intern on Agent and data synthesis.

Publications

  • InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
    2025
  • daVinci-Env: Open SWE Environment Synthesis at Scale
    2026
    working in progress
  • daVinci-Dev: Agent-native Mid-training for Software Engineering
    2026
    ICML 2026 spotlight
  • daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
    2026
    working in progress
  • AgentRefine: Enhancing Agent Generalization through Refinement Tuning
    2025
  • DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments
    2025
    EMNLP 2025 Main
  • MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making
    2024
    EMNLP 2024 Main
  • How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
    2024
    EMNLP 2024 Main
  • CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
    2024
  • PreAct: Prediction Enhances Agent's Planning Ability
    2024
    Coling 2025
  • Agencybench: Benchmarking the frontiers of autonomous agents in 1m-token real-world contexts
    2026
  • On Large Language Models' Hallucination with Regard to Known Facts
    2024
    NAACL 2024 Main
  • DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
    2024
    NAACL 2024 Findings
  • A multi-task semantic decomposition framework with task-specific pre-training for few-shot ner
    2023
  • A prototypical semantic decoupling method via joint contrastive learning for few-shot named entity recognition
    2023
    ICASSP 2023
  • Revisit out-of-vocabulary problem for slot filling: A unified contrastive framework with multi-level data augmentationss
    2023
    ICASSP 2023
  • Semi-supervised knowledge-grounded pre-training for task-oriented dialog systems
    2022
    SereTOD2022, EMNLP 2022 Workshop

Skills

Programming

  • C++
  • Python

Deep Learning Frameworks

  • LLaMA-Factory
  • Deepspeed
  • ray
  • verl

Languages

Chinese - Native
English

Honors & Awards

  • Excellent First-class Scholarship for Master Students, BUPT
    2023, 2024
  • the 1st Award on SereTOD Challenge 2022 track 2
    2022
  • National Scholarship(Top 1%)
    2021
  • First Prize in The Chinese Mathematics Competitions
    2020, 2021, 2022

Service and leadership

Reviewer for

  • ICLR 2025, ICLR 2026
  • ICML 2026
  • COLM 2026
  • ACL ARR 2024 June, August, October, December
  • ACL ARR 2025 February, May, October
  • ACL ARR 2026 January, March