CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Published in ICLR 2025, 2024

本研究提出了CS-Bench,这是一个全面的计算机科学基准测试集,用于评估大语言模型在计算机科学领域的掌握程度。项目已开源并提供了完整的数据集:https://csbench.github.io/

Recommended citation: Song, X., Diao, M., Dong, G., Wang, Z., Fu, Y., Qiao, R., Wang, Z., Fu, D., Wu, H., Liang, B., Zeng, W., Wang, Y., GongQue, Z., Yu, J., Tan, Q., & Xu, W. (2025). CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery. In International Conference on Learning Representations (ICLR 2025).
Download Paper