PärPod Temp
PärPod Temp
PärPod Temp
OSS Investigation-Substrate Scan
Episode 115m · May 28, 2026
Swedish mining investigation platform needs entity deduplication engine: FollowTheMoney + Nomenklatura (both MIT) form the core schema and resolver, letting journalists cross-link suspects, companies, and assets across multiple investigations without AGPL entanglement.

OSS Investigation-Substrate Scan

Scanned 2026-05-26 for a Pär-grade journalist investigation substrate (successor to gruvor). Filter: permissive license preferred (MIT/BSD/Apache); AGPL OK as run-alongside; non-commercial / Do-No-Harm flagged as blockers for code lift. Already known and excluded: OpenAleph, Datashare, gitscrape.

Ranking is by lift-value (how directly we'd reuse code/schema in a new repo), not by general fame.

Tier 1 — Lift code directly

1. FollowTheMoney (alephdata/opensanctions fork)

2. Nomenklatura

3. Yente

4. vis-timeline

5. forensic-architecture/timemap

6. memorious

7. Label Studio

Tier 2 — Borrow patterns / vendor a subsystem

8. Aleph (alephdata/aleph)

9. Paperless-ngx

10. OpenCTI

11. INCEpTION

12. Hoover (liquidinvestigations/hoover-search)

13. doccano

14. CJWorkbench

Excluded after closer look

Suggested composition

Build a Python+SQLite (or Postgres) + Svelte substrate. Lift:

  1. followthemoney schema (extend with mining-domain types: Mineral, Förekomst, Provborrhål).
  2. nomenklatura Resolver for cross-investigation entity matching (C4).
  3. vis-timeline + Leaflet for C1, with bbox spatial filter wired directly.
  4. Label Studio config DSL (lift idea, not code) for per-doc-type review C2 — render in Svelte against an FtM-typed payload.
  5. memorious patterns (or a 200-line equivalent) for ingesting Årebladet + competitor RSS as Article FtM entities (C5).
  6. Optionally run yente alongside for the search/reconcile API once corpus passes ~10k entities.

Everything load-bearing is MIT or Apache-2.0. No AGPL in the lift path. Forensic-Architecture's timemap is the design reference; the rest is plumbing.

Sources