CV
Summary
I am Dayuan Fu, a graduate student in the PRIS-NLP Group at Beijing University of Posts and Telecommunications (BUPT), supervised by Prof. Weiran Xu. I am currently visiting GAIR, and I previously visited the Tsinghua C3I group from 2022.10 to 2024.8. My research focuses on the reasoning, planning, and decision-making abilities of LLM agents and on their memory strategies, which can make LLMs more general (i.e., AGI). I have published several papers at prominent NLP conferences, including ICLR, EMNLP, CIKM, and NAACL.
Education
- M.S. in Dept. of Artificial Intelligence, Beijing University of Posts and Telecommunications (BUPT), 2023.9 - 2026.6
- B.S. in Dept. of Electronic Engineering, Beijing University of Posts and Telecommunications (BUPT), 2019.9 - 2023.6. GPA: 93.18/100 (3.88/4.0)
Visiting Experience
- C3I Group, Tsinghua University, 2022.10 - 2024.8
- GAIR, SII & SJTU, 2024.9 - Now
Industry Experience
- Meituan, NLP Center, Algorithm Intern, 2024.1 - 2025.3. Research intern on agents and data synthesis.
Publications
- DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments (2025)
- AgentRefine: Enhancing Agent Generalization through Refinement Tuning (2025). ICLR 2025
- MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making (2024). EMNLP 2024 Main
- How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data (2024). EMNLP 2024 Main
- CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery (2024). ICLR 2025
- PreAct: Prediction Enhances Agent's Planning Ability (2024). COLING 2025
- On Large Language Models' Hallucination with Regard to Known Facts (2024). NAACL 2024 Main
- DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations (2024). NAACL 2024 Findings
- A multi-task semantic decomposition framework with task-specific pre-training for few-shot NER (2023). CIKM 2023
- A prototypical semantic decoupling method via joint contrastive learning for few-shot named entity recognition (2023). ICASSP 2023
- Revisit out-of-vocabulary problem for slot filling: A unified contrastive framework with multi-level data augmentations (2023). ICASSP 2023
- Semi-supervised knowledge-grounded pre-training for task-oriented dialog systems (2022). SereTOD 2022, EMNLP 2022 Workshop
Skills
Programming
- C++
- Python
Deep Learning Frameworks
- LLaMA-Factory
- DeepSpeed
- Ray
- verl
Languages
Chinese - Native
English
Honors & Awards
- Excellent First-class Scholarship for Master Students, BUPT, 2023, 2024
- 1st Award in the SereTOD Challenge 2022, Track 2, 2022
- National Scholarship (Top 1%), 2021
- First Prize in the Chinese Mathematics Competitions, 2020, 2021, 2022