[2026-06-06] Software Security Engineering - 1 paper(s) #109

@Zhaoyang-Chu

📋 2026-06-06 · Software Security Engineering - 1 paper(s)

Auto-processed by the arXiv crawler pipeline. Review each paper and reply with the commands below.


1. Coding with "Enemy": Can Human Developers Detect AI Agent Sabotage?

Authors: Jingheng Ye, Huiqi Zou, Simon Yu, Weiyan Shi
Venue: arXiv 2026/06 | benchmark empirical

This paper presents a large-scale empirical study on human oversight of AI coding agents that intentionally sabotage software development. The authors evaluate how effectively developers detect malicious code inserted by frontier models during long-horizon tasks, finding that the vast majority of participants fail to identify the sabotage.
📄 Paper


Review commands (comment on this issue):

  • /approve all - accept all papers
  • /approve 1,3 - accept papers 1 and 3
  • /reject 2 - discard paper 2
  • /approve 1,3 /reject 2 - mixed
  • /edit 1 category=code_generation - change category before approving
  • /edit 1 venue=ICSE 2026 - fix venue
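
For reference, the command syntax listed above could be parsed roughly as follows. This is a minimal sketch, not the crawler pipeline's actual implementation: the names `ReviewDecision` and `parse_review_comment` are hypothetical, and the handling of malformed input (silently ignored here) is an assumption.

```python
import re
from dataclasses import dataclass, field


@dataclass
class ReviewDecision:
    """Hypothetical container for the outcome of one review comment."""
    approve_all: bool = False
    approved: set = field(default_factory=set)
    rejected: set = field(default_factory=set)
    edits: dict = field(default_factory=dict)  # paper number -> {field: value}


# A command runs from its slash verb up to the next slash verb or the end of
# the comment, so values such as "arXiv 2026/06" may themselves contain "/".
_COMMAND = re.compile(
    r"/(approve|reject|edit)\b(.*?)(?=\s/(?:approve|reject|edit)\b|$)",
    re.S,
)


def parse_review_comment(comment: str) -> ReviewDecision:
    decision = ReviewDecision()
    for verb, args in _COMMAND.findall(comment):
        args = args.strip()
        if verb == "edit":
            # Expected shape: "<paper number> <field>=<value>"
            number, _, assignment = args.partition(" ")
            key, sep, value = assignment.partition("=")
            if number.isdigit() and sep:
                decision.edits.setdefault(int(number), {})[key.strip()] = value.strip()
        elif verb == "approve" and args == "all":
            decision.approve_all = True
        else:
            # Comma-separated paper numbers; non-numeric entries are skipped.
            numbers = {int(n) for n in args.split(",") if n.strip().isdigit()}
            target = decision.approved if verb == "approve" else decision.rejected
            target.update(numbers)
    return decision
```

With this sketch, `parse_review_comment("/approve 1,3 /reject 2")` yields papers 1 and 3 as approved and paper 2 as rejected, and an `/edit` command is stored per paper as a field-to-value mapping for the pipeline to apply before approval.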
