Chao Peng
Chao Peng
Home
News
Featured Publications
Service
Teaching
Awards
Work Experience
CV
简历
Light
Dark
Automatic
Benchmark
RepoMasterEval: Evaluating Code Completion via Real-World Repositories
With the growing reliance on automated code completion tools in software development, the need for robust evaluation benchmarks has …
Qinyun Wu
,
Chao Peng
,
Pengfei Gao
,
Ruida Hu
,
Haoyu Gan
,
Bo Jiang
,
Jinhe Tang
,
Zhiwen Deng
,
Zhanming Guan
,
Cuiyun Gao
,
Xia Liu
,
Ping Yang
PDF
Cite
DOI
SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code
The code generation capabilities of large language models(LLMs) have emerged as a critical dimension in evaluating their overall …
Xinghang Li
,
Jingzhe Ding
,
Chao Peng
,
Bing Zhao
,
Xiang Gao
,
Hongwan Gao
,
Xinchen Gu
PDF
Cite
DOI
Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study
Recent advancements in large language models (LLMs) have significantly enhanced their coding capabilities. However, existing benchmarks …
Bowen Li
,
Wenhan Wu
,
Ziwei Tang
,
Lin Shi
,
John Yang
,
Jinyang Li
,
Shunyu Yao
,
Chen Qian
,
Binyuan Hui
,
Qicheng Zhang
,
Zhiyin Yu
,
He Du
,
Ping Yang
,
Dahua Lin
,
Chao Peng
,
Kai Chen
PDF
Cite
Project
Cite
×