CV

Dayuan Fu (傅大源)

fdy@bupt.edu.cn

https://fu-dayuan.github.io

Summary

My research focuses on automating LLM agents, both by expanding the range of agent applications through the agents themselves and by using agents to discover new laws and rules of Nobel Prize caliber. I believe that combining search with powerful LLMs is the way to AGI. I have published papers at prominent NLP and machine learning conferences, including ICLR, ICML, EMNLP, CIKM, and NAACL.

Education

M.S., Dept. of Artificial Intelligence
2023.9 - 2026.6
Beijing University of Posts and Telecommunications (BUPT)
B.S., Dept. of Electronic Engineering
2019.9 - 2023.6
Beijing University of Posts and Telecommunications (BUPT)
GPA: 93.18/100 (3.88/4.0)

Visiting Experience

C3I Group
2022.10 - 2024.8
Tsinghua University
GAIR
2025.2 - Present
SII & SJTU

Industry Experience

Meituan, NLP Center
2024.1 - 2025.3
Algorithm Intern
Research intern working on agents and data synthesis.

Publications

InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
2025
ICLR 2026
daVinci-Env: Open SWE Environment Synthesis at Scale
2026
Work in progress
daVinci-Dev: Agent-native Mid-training for Software Engineering
2026
ICML 2026 Spotlight
daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
2026
Work in progress
AgentRefine: Enhancing Agent Generalization through Refinement Tuning
2025
ICLR 2025
DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments
2025
EMNLP 2025 Main
MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making
2024
EMNLP 2024 Main
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
2024
EMNLP 2024 Main
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
2024
ICLR 2025
PreAct: Prediction Enhances Agent's Planning Ability
2024
COLING 2025
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
2026
ACL 2026
On Large Language Models' Hallucination with Regard to Known Facts
2024
NAACL 2024 Main
DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
2024
NAACL 2024 Findings
A Multi-Task Semantic Decomposition Framework with Task-Specific Pre-Training for Few-Shot NER
2023
CIKM 2023
A Prototypical Semantic Decoupling Method via Joint Contrastive Learning for Few-Shot Named Entity Recognition
2023
ICASSP 2023
Revisit Out-of-Vocabulary Problem for Slot Filling: A Unified Contrastive Framework with Multi-Level Data Augmentations
2023
ICASSP 2023
Semi-Supervised Knowledge-Grounded Pre-Training for Task-Oriented Dialog Systems
2022
SereTOD2022, EMNLP 2022 Workshop

Skills

Programming

C++
Python

Deep Learning Frameworks

LLaMA-Factory
DeepSpeed
Ray
verl

Languages

Chinese - Native

English

Honors & Awards

Excellent First-class Scholarship for Master Students, BUPT
2023, 2024
1st Award in the SereTOD Challenge 2022, Track 2
2022
National Scholarship (Top 1%)
2021
First Prize in the Chinese Mathematics Competitions
2020, 2021, 2022

Service and Leadership

Reviewer for

ICLR 2025, ICLR 2026
ICML 2026
COLM 2026
ACL ARR 2024 June, August, October, December
ACL ARR 2025 February, May, October
ACL ARR 2026 January, March
