Skip to content

[backfill:2025-10-15..2025-10-15] Terminal β€” 1 paper(s)Β #120

Description

@Zhaoyang-Chu

πŸ“‹ backfill:2025-10-15..2025-10-15 Β· Terminal β€” 1 paper(s)

Auto-processed by the arXiv crawler pipeline. Review each paper and reply with the commands below.


1. EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems

Authors: Yufei He, Juncheng Liu, Yue Liu, Yibo Li, Tri Cao, et al.
Venue: arXiv 2025/10 | benchmark

The paper introduces the Jericho Test-Time Learning (J-TTL) benchmark to evaluate how agents improve performance across consecutive episodes in text-based CLI environments. It proposes EvoTest, an evolutionary framework that uses an Evolver Agent to iteratively refine an Actor Agent's prompts and memory based on game transcripts.
πŸ“„ Paper


Review commands (comment on this issue):

  • /approve all β€” accept all papers
  • /approve 1,3 β€” accept papers 1 and 3
  • /reject 2 β€” discard paper 2
  • /approve 1,3 /reject 2 β€” mixed
  • /edit 1 category=code_generation β€” change category before approving
  • /edit 1 venue=ICSE 2026 β€” fix venue

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions