[backfill:2025-10-15..2025-10-15] Terminal — 1 paper(s) #120

## 📋 backfill:2025-10-15..2025-10-15 · Terminal — 1 paper(s)

> Auto-processed by the arXiv crawler pipeline. Review each paper and reply with the commands below.

---

### 1. EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems
**Authors:** Yufei He, Juncheng Liu, Yue Liu, Yibo Li, Tri Cao, et al.  
**Venue:** arXiv 2025/10 | `benchmark`  
> The paper introduces the Jericho Test-Time Learning (J-TTL) benchmark to evaluate how agents improve performance across consecutive episodes in text-based CLI environments. It proposes EvoTest, an evolutionary framework that uses an Evolver Agent to iteratively refine an Actor Agent's prompts and memory based on game transcripts.
[📄 Paper](https://arxiv.org/abs/2510.13220)

---

**Review commands** (comment on this issue):
- `/approve all` — accept all papers
- `/approve 1,3` — accept papers 1 and 3
- `/reject 2` — discard paper 2
- `/approve 1,3 /reject 2` — mixed
- `/edit 1 category=code_generation` — change category before approving
- `/edit 1 venue=ICSE 2026` — fix venue
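
The command syntax above could be parsed along the following lines. This is only a minimal sketch: the pipeline's real parser is not shown in this issue, so `ReviewDecision` and `parse_review_comment` are hypothetical names, and the handling of values containing spaces (as in `venue=ICSE 2026`) is an assumption inferred from the examples.

```python
import re
from dataclasses import dataclass, field


@dataclass
class ReviewDecision:
    """Hypothetical container for the outcome of one review comment."""
    approve_all: bool = False
    approved: set = field(default_factory=set)
    rejected: set = field(default_factory=set)
    edits: dict = field(default_factory=dict)  # paper number -> {field: value}


# Each command runs from "/verb" up to the next "/verb" or the end of the comment.
_COMMAND = re.compile(
    r"/(approve|reject|edit)\b\s*(.*?)(?=\s+/(?:approve|reject|edit)\b|$)",
    re.S,
)
# Assumed shape of an edit: "<paper number> <field>=<value>", value may contain spaces.
_EDIT = re.compile(r"(\d+)\s+(\w+)=(.+)", re.S)


def parse_review_comment(text: str) -> ReviewDecision:
    """Collect approve/reject/edit commands from a single issue comment."""
    decision = ReviewDecision()
    for match in _COMMAND.finditer(text):
        verb, args = match.group(1), match.group(2).strip()
        if verb == "approve" and args.split()[:1] == ["all"]:
            decision.approve_all = True
        elif verb == "approve":
            decision.approved.update(int(n) for n in re.findall(r"\d+", args))
        elif verb == "reject":
            decision.rejected.update(int(n) for n in re.findall(r"\d+", args))
        else:  # edit
            edit = _EDIT.match(args)
            if edit:
                number, key, value = edit.groups()
                decision.edits.setdefault(int(number), {})[key] = value.strip()
    return decision
```

For example, `parse_review_comment("/approve 1,3 /reject 2")` would yield papers 1 and 3 as approved and paper 2 as rejected, while an `/edit` command is recorded per paper so it can be applied before approval.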

