Shovel · a software studio · est. 2024

We build autonomous
data products.
They run themselves.

A fleet of autonomous software agents working relentlessly to turn scattered public records, filings, and noise into structured data products.

24/7 · team-built · AI-powered data infrastructure
01 · Portfolio

Autonomous products,
shipping live.

All products →

Mining Incidents

live

Searchable research tool for US mining safety records.

~500k records · 1983–present · refreshed weekly
What it does
What it does

Every reportable mining incident MSHA has on file — fatal and non-fatal — searchable by mine, operator, year, severity, or a keyword in the investigator's narrative. Built for safety inspectors training crews, researchers chasing patterns, journalists chasing stories, and families trying to find a record. Updated weekly from MSHA's open datasets.

Why we built it

I took an MSHA training course taught by Kim Redding and walked out unsettled. Every incident was on the record, but the public couldn't really find it — the agency's own search makes the data hard to see. This is a small counterweight: the same public record, opened up. Credit for the spark belongs to Kim.

How it works
  1. Pull the latest MSHA Accidents/Injuries/Illnesses snapshot from msha.gov each Monday.
  2. Normalize the schema and join against the Mine and Operator reference tables so every row carries operator name, mine name, and state.
  3. Run the investigator's-narrative field through an LLM classifier for incident category and severity.
  4. Index the result into Postgres with full-text search on the narrative field.
  5. Publish to miningincidents.org; preserve the original MSHA report ID end-to-end so every record traces back to the source row.

Safety Incidents

live

Searchable research tool for US workplace safety records.

Every OSHA severe-injury report · 2015–present · refreshed daily
What it does
What it does

Every severe workplace injury OSHA has on file since 2015 — amputations, hospitalizations, and loss-of-eye events — searchable by employer, state, NAICS, or a keyword in the inspection narrative. Built for defense counsel, insurers, safety consultants, and journalists. Updated daily from OSHA's open Severe Injury Report feed.

Why we built it

OSHA's severe-injury reports are public, but the agency's own search makes them effectively invisible — the same gap Mining Incidents addresses for MSHA, applied to a much broader population.

How it works
  1. Pull the latest OSHA Severe Injury Report feed daily via the DOL Open Data API v4.
  2. Normalize the schema and resolve each row's primary NAICS code into a human-readable industry.
  3. Run the final-narrative field through an LLM classifier for body-part and event type.
  4. Index the result into Postgres with full-text search on the narrative field.
  5. Publish to safetyincidents.org; preserve the original OSHA incident ID end-to-end so every record traces back to the source row.

The Vault Report

live

Precious metals intelligence.

COMEX + LME ticks · social sentiment · refreshed daily
What it does
What it does

COMEX and LME pricing, on-chain inflows, and social sentiment — synthesized daily into briefings, charts, and a fully automated video pipeline.

Why we built it

Gold coverage is either Bloomberg terminals (expensive, institutional) or YouTube stackers (vibes). There's a gap between those two for someone who just wants the feed: prices, flows, what the internet is saying, every morning. That's the Vault Report.

How it works
  1. Poll COMEX (CME Group) and LME tick feeds at 1-second cadence for gold, silver, platinum, and palladium.
  2. Stream Twitter and Reddit posts about precious metals and score each post for polarity and conviction.
  3. Join price action against social sentiment and on-chain warehouse flows into a single daily briefing each morning UTC.
  4. Render the briefing into a chart pack and feed the chart pack into the automated video pipeline.
  5. Publish to thevaultreport.com and push the briefing to subscribers.
02 · Datasets

The ground truth
underneath every product.

Every product sits on top of a live dataset that Shovel collects, cleans, and keeps fresh. Below: what's growing in the pit.

MSHA Accidents & Fatalities

Refreshed weekly

Every injury and fatality reported to the Mine Safety and Health Administration — normalized, searchable, kept fresh.

Coverage · 1983 — present

OSHA Severe Injury Reports

Refreshed daily

Every severe workplace injury reported to OSHA since the 2015 reporting rule — amputations, hospitalizations, and loss-of-eye events — normalized and searchable.

Coverage · 2015 — present

Precious Metals Tick Feed

1-second polling

COMEX and LME feeds for gold, silver, platinum, and palladium — second-by-second.

Coverage · Live

Metals Social Sentiment

Streaming

Twitter and Reddit posts about precious metals, scored for polarity and conviction.

Coverage · 2023 — present
03 · Studio

The
factory
owner.

Shovel runs three live US public-records databases: miningincidents.org (~500,000 MSHA mining-incident records, 1983 to present, refreshed weekly); safetyincidents.org (every OSHA severe-injury report since the 2015 reporting rule, refreshed daily); and thevaultreport.com (daily COMEX and LME tick data plus social sentiment). We design the playbooks; a fleet of agents executes them — fetch the source feed, normalize the schema, classify the narrative, index, publish.

Not an agency. Not a consultancy. A small team running pipelines that fetch from MSHA, OSHA, CME Group, LME, Twitter, and Reddit every day, then put the cleaned result behind a public search box. Used by defense counsel, safety inspectors, journalists, researchers, and families.

About the studio →

Want to talk shop? Book a half-hour consult — pay what you want →

04 · FAQ

Questions
and answers.

Where can I search MSHA mining incident reports?

miningincidents.org — every reportable MSHA mining incident from 1983 to present (~500,000 records), searchable by mine, operator, year, severity, and a keyword in the investigator's narrative. Refreshed weekly from MSHA's open datasets.

Is there a public database of OSHA severe injury reports?

safetyincidents.org — every OSHA severe-injury report since the 2015 reporting rule (amputations, hospitalizations, and loss-of-eye events), searchable by employer, state, NAICS, and inspection narrative. Refreshed daily from OSHA's open Severe Injury Report feed.

Where can I read a daily precious-metals report?

thevaultreport.com — COMEX and LME tick data for gold, silver, platinum, and palladium, plus social sentiment scored for polarity and conviction. Refreshed daily into briefings, charts, and a video pipeline.

Can I license the underlying datasets?

Yes — the 2026 DB Snapshot is $5,000 for a one-time perpetual commercial LLM-training license covering the full portfolio (MSHA + OSHA + precious metals), delivered as CSV and Parquet. Details at byshovel.com/licensing.