PärPod Temp
PärPod Temp
PärPod Temp
01 — Framing: what problem the next tool actually solves
Episode 116m · May 28, 2026
Pär's next investigation tool isn't a knowledge graph or publishing system—it's a substrate for mining newspaper-grade stories from raw data, where the measure of success is whether it helps him ask better questions faster.

01 — Framing: what problem the next tool actually solves

Date: 2026-05-26 Status: ratified 2026-05-26 23:22 by Pär ("yes to all"), with the explicit caveat that gaps may surface as we close in on an actual spec — re-anchor if so. Codex review applied: C1 implementation-leak fixed, C6 added (media coverage), M1 fact-provenance promoted to hard constraint, /cowork demoted from hard constraint. Inputs: omega-review-1 SYNTHESIS, omega-review-2 SYNTHESIS-forks (rev 2), Pars_view.md Q1–Q15 + captured section Purpose: lock in the problem statement before any "what to build / what to borrow" conversation. Everything downstream — fork choice, repo shape, what survives from gruvor, when to start, what the MVP looks like — must be measurable against the criteria stated here.

Pär's review-4 instruction: front-load discussion. Claude over-pads early phases and useful items end up in never-reached later phases. This document exists to ratify the goal before any phase plan can be written.


1. One-paragraph statement

The next tool is Pär's investigative substrate — a single working environment that supports a series of long-form investigations (mining today, other topics later), each producing newspaper-grade articles. It earns its keep by aiding the dig in the research phase: helping Pär look in more directions, keep track of what he already has, and surface connections across investigations that he would not have found otherwise. It is not a publishing system, not a knowledge graph for its own sake, and not gruvor with the rough edges sanded. It is a sibling to arebladet2 — prepped for integration into that ecosystem but able to grow into the integration over time, not retrofitted later.


2. What it is

3. What it is not


4. The six load-bearing capabilities

These are the criteria every downstream architecture decision is measured against. Each is anchored in a specific gruvor finding or Pars_view.md statement, not abstracted from first principles.

C1 — Timeline of all things, with spatial filter by area (not point)

Pär (Q2): "having a timeline of all things … is load-bearing to this kind of writing. Next article will be about Aura and Häggån and it will be even more important as it will also include the full history of exploration in the Viken cluster, so we need some kind of location (and that being an area not an exact point) for me to walk through."

Anchored in: gruvor's /timeline + /kb-timeline validated as "vital" (omega-1 §4); the timeline capability named as the single most consequential feature to preserve (omega-2 §6 Q1). Spatial-area filter is the new requirement Aura/Häggån introduces — gruvor only had point-based filtering. How the timeline is implemented (typed temporal edges in a KG, date-faceted events in Datashare, or a simpler model) is the central Fork B vs F decision in Section 02 and is not pre-decided here.

C2 — Per-doc-type review UI for ingest

Pär (Q11): "We could do some kind of walkthrough skill, and present the data in a UI for me to review before implementation … processing — both with AI and programmatically — and a review UI is probably the best way to get through a lot of data."

Pär (Q12): "Document side by side to what the database will ingest. Claude processes one document type and creates script for that document type going forward (when possible), review certain number of programmatic process of a document to clear it to run on its own."

Anchored in: the 30-hand-coded-loaders debt (omega-2 §2.3 / A2 finding); the walkthrough method being overstated as canonical (omega-1 §4, Pars_view Q11). This is the structural fix to ingest sprawl.

C3 — Scoped curated doc browse with 1-sentence explainers

Pär (Q4): "Find a doc to check something specific … a map of what documents I have regarding something (with a 1-sentence explainer in the UI) would have helped … 'all Bolagsverket docs for this company' would be more useful."

Pär (Q7): "for the massive files like the prospectus, it would have been good to reference the original but split it up per section for easier access."

Anchored in: the crunch-time docs gap; the manual_dropins messiness; viewer never close to good enough. This is not a search engine, it's a knowledge garden — scoped by entity ("all docs for company X"), each with a one-line explainer, with massive PDFs split per section while backlinked to the original.

C4 — Cross-investigation discovery via a connected database

Pär (Q9): "Dream is a single tool with different article areas I work on, with a connected database — working on a completely different story could find a random connection to a mining thing, building my own database of useful things that survive for a long time, and suddenly there is something I would not have found otherwise."

Anchored in: this is the durable payoff. Each investigation produces entities, documents, events that survive past article ship. The substrate's value compounds over investigations the way gruvor's value did not (because gruvor was single-investigation by construction). Modular toward arebladet2 integration so chatarkiv + arebladetmail eventually feed the same connection-finding layer.

C5 — Media coverage and press articles as first-class entities

Pär (Q8): "The entire Viken area is covered in prospecting … the storyline is much longer, and involves a lot more people, and the media articles will be more important along the way."

Pars_view.md captured: "Media coverage / press articles as first-class entity — Aura/Häggån needs it; X92 didn't."

Anchored in: gruvor had a /media scaffold route with minimal traffic — not because media is unimportant but because the X92 story launched from a single company anomaly. Aura/Häggån is different: historical media coverage of the Viken cluster is source material for the investigation, not a nice-to-have. The substrate must store, link, and retrieve press articles as entities in the same space as regulatory docs and company filings.

C6 — Ingest is permissive, not topic-gated

Pär (Q7): "when I download documents, something might not be useful to this particular article, but I am thinking this might be something later, and should be able to ingest it logically without Claude saying this is not related."

Anchored in: gruvor's filtered-to-8-kommuner posture made future-broadening expensive. The substrate's ingest must accept "I don't know yet what this is for" as a valid intent, with a place for it to land that is not "the current investigation's folder."


5. Hard constraints

These are non-negotiable boundary conditions that constrain the architecture:


6. Non-goals and what is explicitly deferred

State these so downstream sessions don't drift into them:


7. The framing question Pär put on the table

Omega-2 §6 Q5: "Is gruvor an investigation tool, or a building hobby that produces investigations as a side effect?"

Pär's answer in Q1: "It is an investigative tool. The building hobby part is fuel to keep the actual work project going. No numbers."

That answer constrains the framing meaningfully: the substrate's success criterion is whether it produces investigations, not whether it is fun to build, even though the building has to be sustainable enough that Pär keeps wanting to work on it. Two concrete consequences:

  1. When the choice is "fun to build" vs "useful for the next investigation," "useful" wins. This is a guard against M3 (capacity-filling from a task list rather than need-filling from the next article).
  2. When the build process is the only available work (mining minutia is boring, no investigation is in flight), the right move may be to study an existing tool (OpenAleph, Datashare, Aleph Pro, somebody else's pipeline) rather than write new code. "Worth looking into what we can adapt from others rather than build from scratch" (Pars_view footnote in omega-2 §6 Q5).

8. Acceptance signal for this framing

This framing is good enough to start designing against if, reading it back, Pär can answer yes to all of the following:

If any answer is no, this document gets revised before 02-build-options.md is written. Front-loading discussion is the explicit purpose; downstream sessions read this as the contract.


9. Next planning sections (not started, listed so the sequence is visible)

Each later section depends on this framing being accepted; if Section 1 changes, downstream sections re-anchor.