How to Compare Error States Across Releases

Releases should reduce problems, not hide them.

This article shows how to use Gasoline Agentic Devtools to compare error behavior between two runs, so you can answer one question with confidence:

“Did this release actually improve things?”

  • Release: A version of your app shipped to users.
  • Error state: The set of errors your app throws during a flow.
  • Regression: A new bug introduced by a change.

Without structured comparison, teams rely on feelings:

  • “It seems better.”
  • “I didn’t see anything this time.”

You need hard evidence.

Step-by-Step with Gasoline Agentic Devtools

// 1. Record the baseline on the previous release.
configure({what: "recording_start"})
// run key flow
configure({what: "recording_stop", recording_id: "rec-before"})

// 2. Record the same flow on the new release.
configure({what: "recording_start"})
// run same flow on new release
configure({what: "recording_stop", recording_id: "rec-after"})

// 3. Diff the two recordings and read the report.
configure({what: "log_diff", original_id: "rec-before", replay_id: "rec-after"})
observe({what: "log_diff_report", original_id: "rec-before", replay_id: "rec-after"})

// 4. Inspect failing network responses (status 400 and above).
observe({what: "network_bodies", status_min: 400, limit: 40})

When you read the diff report, check for these outcomes:

  • Error count went down (good).
  • Old critical errors disappeared (great).
  • New critical errors appeared (investigate now).
  • Same error moved earlier/later in flow (timing clue).
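
To make those four outcomes concrete, here is a minimal sketch of the comparison logic in plain JavaScript. It does not use Gasoline's actual report format: the `{step, severity, message}` record shape, the `compareErrorStates` function, and the sample errors are all assumptions made for illustration, so adapt the field names to whatever your diff report contains.

```javascript
// Hypothetical record shape: { step, severity, message }.
// This is an illustration, not the format Gasoline emits.
const errorKey = (e) => `${e.severity}:${e.message}`;

// Map each distinct error to the earliest flow step where it occurred.
function firstSteps(run) {
  const steps = new Map();
  for (const e of run) {
    const k = errorKey(e);
    if (!steps.has(k) || e.step < steps.get(k)) steps.set(k, e.step);
  }
  return steps;
}

function compareErrorStates(before, after) {
  const b = firstSteps(before);
  const a = firstSteps(after);
  return {
    // Negative means fewer errors in the new release.
    countDelta: after.length - before.length,
    // Present before, gone after.
    resolved: [...b.keys()].filter((k) => !a.has(k)),
    // Absent before, present after: investigate these first.
    introduced: [...a.keys()].filter((k) => !b.has(k)),
    // Present in both runs but first seen at a different step.
    moved: [...b.keys()]
      .filter((k) => a.has(k) && a.get(k) !== b.get(k))
      .map((k) => ({ error: k, from: b.get(k), to: a.get(k) })),
  };
}

// Made-up sample data for two runs of the same flow.
const before = [
  { step: 1, severity: "critical", message: "TypeError: cart is undefined" },
  { step: 3, severity: "warning", message: "Image failed to load" },
  { step: 4, severity: "critical", message: "POST /checkout 500" },
];
const after = [
  { step: 2, severity: "warning", message: "Image failed to load" },
  { step: 4, severity: "critical", message: "PaymentError: token missing" },
];

const report = compareErrorStates(before, after);
console.log(report);
```

In this sample the error count drops by one and two critical errors are resolved, but one new critical error appears. That is exactly the case where "it seems better" would be misleading.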

[Image Idea] “Before vs After” error heatmap by page and severity.

[Diagram Idea] Release comparison timeline with highlighted newly introduced errors.

Pick three critical flows and run this comparison on every release. Gasoline Agentic Devtools makes it lightweight enough to do routinely.
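
One way to keep the routine consistent is to write the steps down as data, one checklist per flow. The sketch below only builds a plain list of steps and does not call Gasoline itself. The `what` values and parameters are copied from the step-by-step section, while the `comparisonPlan` helper, the `tool`/`args`/`manual` wrapper fields, the flow names, and the `rec-<flow>-before` ID scheme are hypothetical conventions.

```javascript
// Builds an ordered checklist for one flow, mirroring the step-by-step calls.
// The ID naming scheme and the wrapper fields are assumptions, not Gasoline API.
function comparisonPlan(flow) {
  const beforeId = `rec-${flow}-before`;
  const afterId = `rec-${flow}-after`;
  const ids = { original_id: beforeId, replay_id: afterId };
  return {
    flow,
    steps: [
      { tool: "configure", args: { what: "recording_start" } },
      { manual: `run the ${flow} flow on the previous release` },
      { tool: "configure", args: { what: "recording_stop", recording_id: beforeId } },
      { tool: "configure", args: { what: "recording_start" } },
      { manual: `run the same ${flow} flow on the new release` },
      { tool: "configure", args: { what: "recording_stop", recording_id: afterId } },
      { tool: "configure", args: { what: "log_diff", ...ids } },
      { tool: "observe", args: { what: "log_diff_report", ...ids } },
      { tool: "observe", args: { what: "network_bodies", status_min: 400, limit: 40 } },
    ],
  };
}

// Hypothetical flow names; substitute your own three critical flows.
const plans = ["login", "checkout", "search"].map(comparisonPlan);

for (const plan of plans) {
  console.log(`${plan.flow}: ${plan.steps.length} steps`);
}
```

Giving each flow its own recording IDs keeps the before and after runs of different flows from being mixed up when you diff them.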