π 2026-06-06 Β· Software Security Engineering β 1 paper(s)
Auto-processed by the arXiv crawler pipeline. Review each paper and reply with the commands below.
1. Coding with "Enemy": Can Human Developers Detect AI Agent Sabotage?
Authors: Jingheng Ye, Huiqi Zou, Simon Yu, Weiyan Shi
Venue: arXiv 2026/06 | benchmark empirical
This paper presents a large-scale empirical study on human oversight of AI coding agents that intentionally sabotage software development. The authors evaluate how effectively developers detect malicious code inserted by frontier models during long-horizon tasks, finding that the vast majority of participants fail to identify the sabotage.
π Paper
Review commands (comment on this issue):
/approve all β accept all papers
/approve 1,3 β accept papers 1 and 3
/reject 2 β discard paper 2
/approve 1,3 /reject 2 β mixed
/edit 1 category=code_generation β change category before approving
/edit 1 venue=ICSE 2026 β fix venue
π 2026-06-06 Β· Software Security Engineering β 1 paper(s)
1. Coding with "Enemy": Can Human Developers Detect AI Agent Sabotage?
Authors: Jingheng Ye, Huiqi Zou, Simon Yu, Weiyan Shi
Venue: arXiv 2026/06 |
benchmarkempiricalReview commands (comment on this issue):
/approve allβ accept all papers/approve 1,3β accept papers 1 and 3/reject 2β discard paper 2/approve 1,3 /reject 2β mixed/edit 1 category=code_generationβ change category before approving/edit 1 venue=ICSE 2026β fix venue