codex-pdf
Structured PDF extraction API that turns complex files into consistent JSON.
Open source · API-first · PDF + plate / tooling preflight · detection-only
lintPDF inspects your files against 500+ checks — fonts, color spaces, images, transparency, overprint, page geometry, ink coverage, packaging dielines, embellishments, and barcodes — plus a 91-check PDF/X-4 conformance suite (ISO 15930-7). It also preflights plate & tooling files (TIFF / LEN separations), compares a plate set against the approved 1-up, re-validates after prep with a spec-drift diff, and scores production-readiness. It finds problems but never modifies your file: your originals stay byte-for-byte identical.
AGPL-3.0 · 500+ checks · PDF/X-1a · PDF/X-3 · PDF/X-4 · PDF/A · GWG 2022
How it works
lintPDF answers one question, headlessly: is this PDF print-ready, and if not, exactly what's wrong and where? It reports the findings — it never touches your file.
POST a PDF (or EPS, PostScript, TIFF, JPEG, PNG — converted internally) and pick a ruleset. One request to submit, one to fetch the report.
lintPDF runs 500+ checks across fonts, color, images, transparency, geometry, and packaging — plus the PDF/X-4 conformance suite. Detection-only: your file is never modified.
Findings come back as JSON, XML, or a white-labeled PDF report — each with page and location data so an operator sees exactly what's wrong and where.
Render findings in the viewer, mint share links, and fire webhooks when a report is ready. Import upstream PitStop / callas / Acrobat reports too.
Built for web-to-print platforms, packaging houses, and publishing workflows that demand precision without lock-in.
lintPDF reads the file and reports what's wrong; it never writes. Your originals stay byte-for-byte identical — no silent fixes, no re-distillation, no quietly-changed metadata. Zero risk of file damage.
Fonts, color spaces, images, transparency, overprint, page geometry, ink coverage (TAC), packaging dielines, and barcode grading — plus a dedicated 91-check PDF/X-4 conformance suite (ISO 15930-7). Every detail that matters for print.
Pre-built profiles for GWG Sheetfed, GWG Digital, PDF/X-1a, PDF/X-3, PDF/X-4, PDF/A, and packaging workflows. A conditional rule engine lets you build dynamic rulesets with the exact pass/fail logic you need.
Effective DPI is measured at each image's actual rendered size on the page — not the declared resolution. Catches images that look fine at 300 DPI but print blurry because they've been scaled up.
POST a file, GET the report as JSON, XML, or a white-labeled PDF. Already running PitStop, callas pdfToolbox, or Acrobat Preflight? Import their XML/JSON reports. Webhooks fire the instant a report is ready — no polling.
Preflight raster plate / tooling files (TIFF + LEN separations) over the same engine — resolution, minimum dot, registration, dimension agreement, and combined ink coverage. Compare a plate set against the approved 1-up, with or without AI assistance.
After prep, re-validate the file and diff it against the original to catch spec drift, generate deterministic notes-to-prepress annotations, and roll everything into a single production-readiness score — so you know when a job is truly ready to print.
AGPL-3.0 open source you can run on Docker or Railway, or use managed Print With Synergy hosting — the same preflight engine, managed and metered. Built on codex-powered extraction, so every check reads the same document facts.
Run preflight as a managed hosted add-on, or self-host the open source for free.
adds to the Codex à la carte base · or self-host free (AGPL-3.0)
Start with lintPDFOpen source · managed hosting
A toolkit of focused, standalone PDF utilities — extraction, preflight, viewing, assembly, imposition planning, and an asset store. Each one plugs into the prepress workflow you already run. Use the open source yourself, or let us host any single tool for you on host.withsynergy.io.
Structured PDF extraction API that turns complex files into consistent JSON.
Detection-only PDF preflight engine — 500+ checks plus the PDF/X-4 conformance suite.
Embeddable PDF viewer with separations, TAC, layers, and annotation overlays.
GWG 2022 conformance assay — benchmark a preflight engine against the spec.
Content-addressed digital-asset plane — versioned blobs, a presigned data plane, and on-prem agent recall.
The print-data integration hub — canonical jobs, orders, and customers kept in sync across your MIS, ERP, and prepress tools.