skip to content

the hardhat protocol

a unified, machine-readable data standard for career scoring, apprenticeship matching, and workforce data interoperability — powered by an agentic pipeline that ingests labor market data, training programs, employer signals, and workforce sentiment, normalizes it against a proprietary career ontology, and publishes it as open infrastructure.

workforce data is broken

before GTFS, transit information was fragmented across thousands of incompatible systems. workforce data is in the same state today.

career intelligence is scattered across BLS wage tables, O*NET skill taxonomies, DOL apprenticeship databases, state agency portals, union local websites, employer job postings, training program catalogs, and PDF flyers. automation risk isn't tracked. application windows are invisible. program outcomes are unreported. no single system connects labor market signals to training pathways to employer demand.

the result: the u.s. needs 2 million more skilled workers by 2028. workers make career decisions with incomplete data. employers can't find skilled labor despite $1.8 trillion in infrastructure spending. workforce boards allocate billions without real-time supply-demand signals. and only 7% of job seekers even consider the trades. that's not a pipeline failure — it's a data infrastructure failure. the intelligence layer between workers, programs, employers, and government doesn't exist.

five layers of workforce data

the hardhat protocol structures workforce data into five interconnected layers — from career exploration to program outcomes. each layer is machine-readable, standardized, and open.

trades
350+ occupations
scored daily
programs
apprenticeships
& training
eligibility
requirements
& prerequisites
windows
application
periods
outcomes
completion
& placement

today, a worker researching a career has to visit 5–10 different websites, decode eligibility rules from PDFs, and guess when applications open. the protocol collapses that into a single, navigable data layer — the same way GTFS collapsed transit schedules into a single feed that powers Google Maps, Apple Maps, and thousands of apps.

current coverage

what GTFS did for transit

Google created GTFS in 2005. today, 10,000+ transit agencies publish data that powers apps used by billions. we're building the same thing for workforce.
GTFS (transit)
  • fragmented schedules across agencies
  • no standard format
  • Google created an open standard
  • 10,000+ agencies now publish data
  • powers Maps, Citymapper, Transit
hardhat protocol (workforce)
  • fragmented programs across agencies
  • no standard format
  • hardhat building an open standard
  • 27,000+ programs to be mapped
  • powers career navigation at scale

one protocol, one feedback loop

every decision one stakeholder makes creates a signal for the others. workers searching for trades tell programs where demand is. employer hiring patterns tell government where to fund training. the protocol is the connective layer that closes the loop.

workers
career navigation
explore trades, compare programs, check eligibility, set alerts, and apply — all in one place.
counselors
real-time intelligence
replace outdated binders with live program data, demand signals, and pathway recommendations.
programs
recruitment pipeline
reach workers already researching the trade. pre-qualified candidates instead of word-of-mouth.
employers
compliance + talent
find apprentices for jobsites. track utilization ratios. stay compliant with IRA and state mandates.
researchers
workforce analytics
structured, machine-readable data on apprenticeship programs, outcomes, and labor market signals.
government
fund & steer
workforce boards see supply, demand, and gaps — directing funding where it matters. real-time pipeline visibility from worker interest to completion.

free, open, machine-readable

the protocol is open-source under Apache 2.0. no API keys. no paywalls. no vendor lock-in.

workforce data should be a public good — the same way transit data became a public good after GTFS. the hardhat protocol is designed to be embedded, extended, and built upon by anyone: workforce boards, community colleges, nonprofits, state agencies, and private developers. every layer feeds a feedback loop between workers, employers, programs, and government — each stakeholder's actions creating signals for the others.

the worker-facing tools on hardhat.careers are the first application built on this protocol. they won't be the last.

explore the tools

see the protocol in action — career scores, apprenticeship data, and program navigation across 50 states

the survival index apprenticeship programs about hardhat