PärPod Temp
PärPod Temp
PärPod Temp
OSS Scan: Entity Database / Cross-Investigation Identity Tools
Episode 214m · May 28, 2026
Nomenklatura's "Judgement" store lets investigators flag when two entities are the same across separate cases—and those decisions survive every re-run, building a canonical identity map that connects mining permits to shell company networks.

OSS Scan: Entity Database / Cross-Investigation Identity Tools

Date: 2026-05-26 Capability target (C4): Durable entity identity across multiple investigations — so a non-mining story can discover a mining link via shared canonical entities. Constraints: Internal tool, never public-deployed. MIT/BSD/Apache preferred for code lift; AGPL acceptable as run-alongside service.


Tier 1 — FollowTheMoney ecosystem (the obvious spine)

1. FollowTheMoney (FtM)

2. Nomenklatura

3. OpenAleph

4. Zavod


Tier 2 — Entity resolution libraries (the matching brain)

5. Splink

6. dedupe

7. recordlinkage (J. de Bruin)


Tier 3 — Graph / knowledge stores for the entity layer

8. Kùzu

9. TerminusDB

10. FalkorDB


Tier 4 — Investigative platforms (whole-app, run-alongside)

11. Vertex Synapse

12. OpenCTI

13. Liquid Investigations


Tier 5 — Second-brain with entity-shape (filtered)

14. Anytype

15. Nordic Registry MCP Server (Sweden-specific)


Honest synthesis

The shortest path to C4 is FtM schema + Nomenklatura resolver + Kùzu (or Postgres) as graph store, with Splink for the heavy matching when you have >100k entities. Everything else on this list is either (a) UX inspiration, (b) a competing whole-app you'd run alongside not lift from, or (c) a source-of-data wrapper. The FtM ecosystem is genuinely the answer; the rest is decoration or alternatives if FtM's corruption-bias schema chafes.

One real risk: the FtM schema assumes anti-corruption framing. If Pär's mining investigations spawn lots of physical-world entities (claims, drill sites, parcels), you'll subclass Asset and may eventually fork. Plan for that fork at design time, not in year two.

Sources: