Skip to content
SDF file structure diagram

Smart Document Format

One file. Two layers. Zero re-keying. SDF combines PDF and structured JSON so your documents work for both humans and machines.

What is SDF?

SDF (Smart Document Format) is an open standard that solves the PDF data-extraction problem. Every .sdf file is a ZIP archive containing a human-readable PDF and a machine-readable JSON — no OCR, no manual re-entry.

The Problem

Documents travel as PDF. The receiving system must OCR the text and manually re-key the data — slow, error-prone, expensive.

The Solution

SDF bundles the structured JSON alongside the PDF. The receiving system reads data.json directly — zero extraction cost, zero errors.

Backward Compatible

Any PDF viewer opens the visual layer. SDF-unaware systems see a normal PDF. SDF-aware systems get the data too.

Start Building

Products

SDF — Smart Document Format

The open document format and its ecosystem: SDK, CLI, schema registry, server, integrations, and tooling. This is what you’re reading about now.

More products coming

Etapsky is actively building additional developer tools. More products will be documented here as they reach public availability.

Ecosystem

PackageVersionRegistryDescription
@etapsky/sdf-kit0.2.2npmCore TypeScript SDK
@etapsky/sdf-cli0.3.2npm · Homebrew · BinaryCommand-line tool
@etapsky/sdf-schema-registry0.1.1npmSchema versioning and migration
@etapsky/sdf-server-core0.1.6npmSelf-hosted server framework
@etapsky/cloud-sdk0.1.0npmapi.etapsky.com client
etapsky-sdf0.1.1PyPIPython SDK