add error analysis guide by annabellscha · Pull Request #2892 · langfuse/langfuse-docs

annabellscha · 2026-05-05T14:49:38Z

Disclaimer: Experimental PR review

Greptile Summary

Adds a new cookbook guide on error analysis for LLM applications, covering open coding, failure taxonomy clustering, labeling, and deciding when to build evaluators versus fix prompts. The guide also registers the new page in meta.json.

A 360-line step-by-step guide is added, using a "Dad Tech Support" chatbot as a worked example through the full five-step error analysis process.
The guide integrates with the existing Langfuse annotation queue and scoring workflow and references the Claude Code Langfuse skill for an interactive walkthrough.

Confidence Score: 4/5

Safe to merge; the only flag is a draft note in the example data that should be cleaned up before the guide goes live.

The guide is well-structured and the process it describes is technically sound. The one thing worth fixing before publishing is the parenthetical in Step 4.2 that reveals the example bar chart is based on only 19 of 100 traces — readers may lose confidence in the example data if that note ships as-is.

content/guides/cookbook/error-analysis-llm-applications.mdx — specifically the Step 4.2 failure rates table and its incomplete-data caveat.

Important Files Changed

Filename	Overview
content/guides/cookbook/error-analysis-llm-applications.mdx	New 360-line guide walking through a full error analysis workflow; one draft-style note about incomplete example data remains in the published Step 4.2 results section.
content/guides/cookbook/meta.json	New guide entry added at the top of the pages list; change is straightforward and correct.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Choose what to annotate\nTrace vs. GENERATION observation] --> B[Select ~100 representative traces\nby latency, cost, tags, multi-turn]
    B --> C[Create annotation queue\nwith open_coding + pass_fail_assessment]
    C --> D[Open code first 30-50 traces\nFree-text observations, no pre-defined categories]
    D --> E{New failure types\nstill appearing?}
    E -- Yes --> D
    E -- No --> F[Cluster into 5-10 named failure categories\nSplit by root cause, merge by same root cause]
    F --> G[Create boolean score configs per category\nNew queue with all 10 score configs]
    G --> H[Label all 100 traces]
    H --> I[Compute failure rates\nLangfuse Dashboard - Scores widget]
    I --> J{For each category:\nCan we just fix it?}
    J -- Yes --> K[Prompt / tool / code fix]
    J -- No --> L{Worth building\nan evaluator?}
    L -- Yes --> M[LLM-as-judge or code-based check]
    L -- No --> N[Monitor / defer]
    K & M & N --> O[Re-run after next\nprompt rewrite, model switch, or incident]

Prompt To Fix All With AI

Fix the following 1 code review issue. Work through them one at a time, proposing concise fixes.

---

### Issue 1 of 1
content/guides/cookbook/error-analysis-llm-applications.mdx:275
**Incomplete-data caveat left in published guide**

The parenthetical "(from 19 labeled traces, rates will shift once all 100 are done)" reads as a draft note that was never removed. Readers following the guide will see a bar chart with rates derived from 19 of 100 traces and an explicit admission that the numbers are preliminary — undermining confidence in the example data. Either finish labeling all 100 traces before publishing, or replace this note with the final, stable numbers.

_{Reviews (1): Last reviewed commit: "add error analysis guide" | Re-trigger Greptile}

vercel · 2026-05-05T14:49:43Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
langfuse-docs	Ready	Preview, Comment	May 11, 2026 0:44am

github-actions · 2026-05-05T14:49:49Z

@claude review

review-notebook-app · 2026-05-05T18:31:49Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

claude

All prior feedback addressed (notebook source added, frontmatter complete, skill link fixed, illustrative-data caveat reworded).

Extended reasoning...

Overview

This documentation-only PR adds a new error-analysis cookbook guide. It includes a hand-written MDX, a corresponding source notebook in cookbook/, a _routes.json registration, a meta.json entry, and a small frameborder -> frameBorder JSX casing fix in a related blog post. No code paths, build scripts, or runtime behavior are changed.

Security risks

None. This is content-only — no auth, network, or data-handling code involved. External links go to langfuse.com pages (verified to exist) and to public references like hamel.dev.

Level of scrutiny

Light scrutiny appropriate for a docs-only change. The earlier higher-stakes risk (regen script silently deleting a hand-edited cookbook MDX with no notebook source) is now eliminated because a backing notebook was added and registered in cookbook/_routes.json.

Other factors

All four issues I raised on earlier revisions have been addressed in the latest commit: backing notebook present, frontmatter has category: Evaluation and sidebarTitle, the Claude Code skill link now points at a path that actually resolves (content/docs/api-and-data-platform/features/agent-skill.mdx), and Step 4.2's draft-style parenthetical has been reworded to frame the partial-sample numbers as illustrative. The Greptile bot's confidence is 4/5 and the only outstanding flag from external review is the same illustrative-data caveat that has now been reworded. Bug hunting system found no bugs.

Restores cookbook/error-analysis-llm-applications.ipynb and its _routes.json entry. Reverts lib/cookbook_route_mapping.ts to main — the showNotebookBanner field is no longer used. The standard "this is a Jupyter notebook" banner will show for this page. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

add error analysis guide

31add73

dosubot Bot added the size:L This PR changes 100-499 lines, ignoring generated files. label May 5, 2026

dosubot Bot added the documentation Improvements or additions to documentation label May 5, 2026

vercel Bot deployed to Preview May 5, 2026 14:51 View deployment

Merge branch 'main' into update-error-analysis-blogpost

e3ebd15

vercel Bot deployed to Preview May 5, 2026 14:59 View deployment

claude Bot reviewed May 5, 2026

View reviewed changes

Comment thread content/guides/cookbook/error-analysis-llm-applications.mdx

Comment thread content/guides/cookbook/error-analysis-llm-applications.mdx Outdated

Comment thread content/guides/cookbook/error-analysis-llm-applications.mdx

claude Bot reviewed May 5, 2026

View reviewed changes

Comment thread content/guides/cookbook/error-analysis-llm-applications.mdx Outdated

annabellscha added 3 commits May 5, 2026 20:16

add link to agent skill

91adfe8

mini fix

126bf6f

edits and file reatructuring

f730cfb

vercel Bot deployed to Preview May 5, 2026 18:35 View deployment

claude Bot reviewed May 5, 2026

View reviewed changes

annabellscha commented May 5, 2026

View reviewed changes

add images

46cb9eb

vercel Bot deployed to Preview May 5, 2026 19:23 View deployment

claude Bot reviewed May 5, 2026

View reviewed changes

Comment thread content/guides/cookbook/error-analysis-llm-applications.mdx

annabellscha added 2 commits May 5, 2026 21:34

edit prompt

2c63ef2

update in mdx

e4cc3da

vercel Bot deployed to Preview May 5, 2026 19:41 View deployment

annabellscha added 2 commits May 5, 2026 21:43

edit prompt

f9b2de6

swap image

c94d81a

vercel Bot deployed to Preview May 5, 2026 19:50 View deployment

Merge branch 'main' into update-error-analysis-blogpost

7540277

vercel Bot deployed to Preview May 8, 2026 08:09 View deployment

claude Bot reviewed May 8, 2026

View reviewed changes

Comment thread content/guides/cookbook/error-analysis-llm-applications.mdx Outdated

Comment thread content/guides/cookbook/error-analysis-llm-applications.mdx Outdated

add images and small edits

f7dafd8

vercel Bot had a problem deploying to Preview May 8, 2026 14:58 Failure

Merge branch 'main' into update-error-analysis-blogpost

59ed486

vercel Bot had a problem deploying to Preview May 10, 2026 07:13 Failure

claude Bot reviewed May 10, 2026

View reviewed changes

Comment thread cookbook/_routes.json

Comment thread cookbook/_routes.json

Comment thread lib/cookbook_route_mapping.ts Outdated

annabellscha and others added 2 commits May 11, 2026 10:52

Merge branch 'main' into update-error-analysis-blogpost

fdefacc

vercel Bot deployed to Preview May 11, 2026 08:57 View deployment