Stacktrace Reports

System documentation for ResInsight

Stacktrace Reports

Weekly ResInsight crash telemetry, deduplicated by call-stack signature and cross-linked to upstream issues on OPM/ResInsight.

Incoming CSVs — every raw CSV received, with row counts and a link to its weekly report.
Weekly reports — list of per-week analyses, newest first.
Analyzer — the Python script that groups raw stack traces into unique signatures.
Analyzer usage — full command-line reference for the script.
Top-frame grouper — reads a generated report and groups stacks by their first non-handler frame, so each unique signature only needs one OPM issue search.
Apply known issues — copies top-frame → OPM issue links from one or more prior reports into the current report, so stacks already linked in earlier weeks don’t need a fresh GitHub search.
Reorder closed — moves stacks linked to CLOSED issues into a ## Closed issues section at the bottom of the report. Idempotent.

Workflow

A new CSV lands every week. Processing it is seven steps:

Save the CSV. Copy the weekly export into stacktrace-reports/csv/ as YYYY-MM-DD-query_data.csv. Only the timestamp and rawstack columns are expected.

Run the analyzer in Markdown mode.

python stacktrace-reports/analyze_crashes.py \
    stacktrace-reports/csv/YYYY-MM-DD-query_data.csv \
    --format md \
    --output stacktrace-reports/reports/YYYY-MM-DD.md

Carry over known links from prior reports.
```
python stacktrace-reports/apply_known_issues.py \
    stacktrace-reports/reports/YYYY-MM-DD.md \
    --prior stacktrace-reports/reports/PRIOR1.md \
    --prior stacktrace-reports/reports/PRIOR2.md
```
For every stack whose top non-handler frame matches one already linked in a prior report, this fills in the same **OPM issue:** link. Existing links are left alone. The script’s output names how many stacks were filled (updated) and how many still need manual search (no_match).
Link the remaining stacks to upstream issues. For each unique top frame still showing **OPM issue:** none found, search OPM/ResInsight issues using that frame. To group stacks that share a signature, run
```
python stacktrace-reports/extract_top_frames.py stacktrace-reports/reports/YYYY-MM-DD.md
```
so each unique signature only needs one search. Example search:
```
gh search issues --repo OPM/ResInsight "RimFileSummaryCase createSummaryReaderInterfaceThreadSafe" --limit 5
```
Fetch the issue state at the same time:
```
gh issue view 13883 --repo OPM/ResInsight --json number,state
```
Replace **OPM issue:** none found with **OPM issue:** [#NNNN](https://github.com/OPM/ResInsight/issues/NNNN) — OPEN (or CLOSED) when a match is found; leave none found otherwise so gaps stay visible. The state is captured at link-time — rerun gh issue view for stale reports if the current state matters.
Move closed stacks to the bottom.
```
python stacktrace-reports/reorder_closed.py stacktrace-reports/reports/YYYY-MM-DD.md
```
Stacks linked to a CLOSED issue are gathered under a ## Closed issues section at the end, so open and unmatched stacks stay at the top. Stack numbers are preserved.
Update incoming-csvs.md. Append a row using the total-rows and unique-stacks numbers printed by analyze_crashes.py (first two lines of its text output). Row template:
```
| YYYY-MM-DD | [YYYY-MM-DD-query_data.csv](./csv/YYYY-MM-DD-query_data.csv) | <total> | <unique> | [YYYY-MM-DD](./reports/YYYY-MM-DD.md) |
```
Update index.md. Insert a new row at the top of the weekly-reports table with the same totals.

Commit the CSV, the generated MD, and the two updated index pages together on a feature branch.

Jekyll integration

The folder lives at the repo root (not as a Jekyll collection) so the folder name on disk matches the URL segment. Each Markdown file carries the standard layout: default front-matter used by pages in _docs/. CSV files are served as static assets by Jekyll.

Data scope

CSVs are committed to the repo so reports remain reproducible. Before committing, confirm the CSV contains only non-identifying columns (currently timestamp, rawstack). If future exports add user-identifying fields, strip them before committing.

Stacktrace Reports

Contents

Workflow

Jekyll integration

Data scope