2025 · dataset

Natuna loitering dataset

An open, queryable record of loitering events off the Natuna shelf, indexed by H3 cell and time.

Built with
  • DuckDB
  • H3
  • Parquet

Problem

Loitering — a vessel lingering without an evident purpose — is a useful signal and a hard one to share. Everyone re-derives it from raw tracks, so no two analyses agree on what counts or where it happened.

Approach

The dataset fixes loitering events to an H3 spatial index and a time bucket, so a claim about “the shelf, last spring” resolves to specific cells and intervals anyone can query. It ships as Parquet, readable directly from DuckDB without a server.

  • Detect loitering from cleaned tracks with a documented threshold.
  • Index each event by H3 cell and time bucket.
  • Publish as Parquet with a schema and a query cookbook.

Outcome

A record others can query off the Natuna shelf and cite precisely — the same cell, the same window, the same answer.