About PDF to Markdown
PDF to Markdown serializes a PDF into a single self-contained .md file — tables become GFM pipe tables, lists keep their markers and nesting, hyperlinks survive as `[text](url)`, images embed as base64 data URIs so the Markdown renders anywhere without needing a sidecar assets folder. Great for publishing a PDF report to a GitHub README, piping into a static-site generator, or round-tripping through a Markdown-aware note app.
Every word of extraction happens in your browser via the same semantic pipeline that powers our PDF to Word converter. Nothing uploads. Output is deterministic — the same PDF always produces the same Markdown, character for character.
Bold and italic runs inside paragraphs are preserved (`**bold**`, `*italic*`, `***both***`). Captions and pull quotes get rendered as blockquotes since GFM lacks a dedicated caption style. The first-line of the document uses `#` as an H1, and the PDF's title + author metadata (if present) prepend the output.
How it works
- Drop your PDFDrag a PDF onto the zone or browse for it. Up to 100 MB. Local.
- Conversion runs in your browserThe engine parses structure, merges soft-hyphens, detects tables, stitches cross-page continuations — same machinery used by the Word converter. Emits GFM Markdown at the end.
- Download the .mdOne self-contained file with embedded images. Paste into any Markdown editor, renderer, or docs platform.