{
    "content": [
        {
            "type": "text",
            "text": "# pdftohtml(1) (man)\n\n**Summary:** pdftohtml - program to convert PDF files into HTML, XML and PNG images\n\n**Synopsis:** pdftohtml [options] <PDF-file> [<HTML-file> <XML-file>]\n\n## Flags\n\n| Flag | Long | Arg | Description |\n|------|------|-----|-------------|\n| -h | — | — | Show summary of options. |\n| -f | — | <int> | first page to print |\n| -l | — | <int> | last page to print |\n| -q | — | — |  |\n| -v | — | — |  |\n| -p | — | — |  |\n| -c | — | — |  |\n| -s | — | — |  |\n| — | — | — | use data URLs instead of external images in HTML. No available in all platforms |\n| -i | — | — |  |\n| — | — | — | generate no frames. Not supported in complex output mode. |\n| — | — | — | use standard output |\n| — | — | — | zoom the PDF document (default 1.5) (1 means 72 DPI) |\n| — | — | — |  |\n| — | — | — | do not round coordinates (with XML output only) |\n| — | — | — | output text encoding name |\n| — | — | — | owner password (for encrypted files) |\n| — | — | — | user password (for encrypted files) |\n| — | — | — | force hidden text extraction |\n| — | — | — | not specified, -fmt png will be assumed |\n| — | — | — | do not merge paragraphs |\n| — | — | — |  |\n| — | — | — | adjust the word break threshold percent. Default is 10. Word break occurs when dis‐ tance between two adjacent character |\n| — | — | — | outputs the font name without any substitutions. |\n\n## See Also\n\n- pdfdetach(1)\n- pdffonts(1)\n- pdfimages(1)\n- pdfinfo(1)\n- pdftocairo(1)\n- pdftoppm(1)\n- pdftops(1)\n- pdftotext(1)\n- pdfseparate(1)\n- pdfsig(1)\n- pdfunite(1)\n- PDFTOHTML(1)\n\n## Section Outline\n\n- **NAME** (2 lines)\n- **SYNOPSIS** (2 lines)\n- **DESCRIPTION** (6 lines)\n- **OPTIONS** (2 lines) — 24 subsections\n  - -h, -help (2 lines)\n  - -f <int> (2 lines)\n  - -l <int> (2 lines)\n  - -q (1 lines)\n  - -v (1 lines)\n  - -p (1 lines)\n  - -c (1 lines)\n  - -s (1 lines)\n  - -dataurls (2 lines)\n  - -i (1 lines)\n  - -noframes (2 lines)\n  - -stdout (2 lines)\n  - -zoom <fp> (2 lines)\n  - -xml (1 lines)\n  - -noroundcoord (2 lines)\n  - -enc <string> (2 lines)\n  - -opw <string> (2 lines)\n  - -upw <string> (2 lines)\n  - -hidden (2 lines)\n  - -fmt (2 lines)\n  - -nomerge (2 lines)\n  - -nodrm (1 lines)\n  - -wbt <fp> (4 lines)\n  - -fontfullname (3 lines)\n- **AUTHOR** (6 lines)\n- **SEE ALSO** (6 lines)\n\n## Full Content\n\n### NAME\n\npdftohtml - program to convert PDF files into HTML, XML and PNG images\n\n### SYNOPSIS\n\npdftohtml [options] <PDF-file> [<HTML-file> <XML-file>]\n\n### DESCRIPTION\n\nThis  manual  page documents briefly the pdftohtml command.  This manual page was written for\nthe Debian GNU/Linux distribution because the original program does not have a manual page.\n\npdftohtml is a program that converts PDF documents into HTML. It generates its output in  the\ncurrent working directory.  If PDF-file is ´-', it reads the PDF file from stdin.\n\n### OPTIONS\n\nA summary of options are included below.\n\n#### -h, -help\n\nShow summary of options.\n\n#### -f <int>\n\nfirst page to print\n\n#### -l <int>\n\nlast page to print\n\n#### -q\n\n#### -v\n\n#### -p\n\n#### -c\n\n#### -s\n\n#### -dataurls\n\nuse data URLs instead of external images in HTML. No available in all platforms\n\n#### -i\n\n#### -noframes\n\ngenerate no frames. Not supported in complex output mode.\n\n#### -stdout\n\nuse standard output\n\n#### -zoom <fp>\n\nzoom the PDF document (default 1.5) (1 means 72 DPI)\n\n#### -xml\n\n#### -noroundcoord\n\ndo not round coordinates (with XML output only)\n\n#### -enc <string>\n\noutput text encoding name\n\n#### -opw <string>\n\nowner password (for encrypted files)\n\n#### -upw <string>\n\nuser password (for encrypted files)\n\n#### -hidden\n\nforce hidden text extraction\n\n#### -fmt\n\nnot specified, -fmt png will be assumed\n\n#### -nomerge\n\ndo not merge paragraphs\n\n#### -nodrm\n\n#### -wbt <fp>\n\nadjust the word break threshold percent. Default is 10.  Word break occurs  when  dis‐\ntance  between  two  adjacent  characters  is  greater  than this percent of character\nheight.\n\n#### -fontfullname\n\noutputs the font name without any substitutions.\n\n### AUTHOR\n\nPdftohtml was developed by Gueorgui Ovtcharov and Rainer Dorsch. It is based and  benefits  a\nlot from Derek Noonburg's xpdf package.\n\nThis  manual  page  was  written  by  Søren  Boll Overgaard <boll@debian.org>, for the Debian\nGNU/Linux system (but may be used by others).\n\n### SEE ALSO\n\npdfdetach(1), pdffonts(1), pdfimages(1), pdfinfo(1), pdftocairo(1), pdftoppm(1),  pdftops(1),\npdftotext(1) pdfseparate(1), pdfsig(1), pdfunite(1)\n\n\n\nPDFTOHTML(1)\n\n"
        }
    ],
    "structuredContent": {
        "command": "pdftohtml",
        "section": "1",
        "mode": "man",
        "summary": "pdftohtml - program to convert PDF files into HTML, XML and PNG images",
        "synopsis": "pdftohtml [options] <PDF-file> [<HTML-file> <XML-file>]",
        "flags": [
            {
                "flag": "-h",
                "long": null,
                "arg": null,
                "description": "Show summary of options."
            },
            {
                "flag": "-f",
                "long": null,
                "arg": "<int>",
                "description": "first page to print"
            },
            {
                "flag": "-l",
                "long": null,
                "arg": "<int>",
                "description": "last page to print"
            },
            {
                "flag": "-q",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "-v",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "-p",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "-c",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "-s",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "use data URLs instead of external images in HTML. No available in all platforms"
            },
            {
                "flag": "-i",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "generate no frames. Not supported in complex output mode."
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "use standard output"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "zoom the PDF document (default 1.5) (1 means 72 DPI)"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "do not round coordinates (with XML output only)"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "output text encoding name"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "owner password (for encrypted files)"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "user password (for encrypted files)"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "force hidden text extraction"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "not specified, -fmt png will be assumed"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "do not merge paragraphs"
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": ""
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "adjust the word break threshold percent. Default is 10. Word break occurs when dis‐ tance between two adjacent characters is greater than this percent of character height."
            },
            {
                "flag": "",
                "long": null,
                "arg": null,
                "description": "outputs the font name without any substitutions."
            }
        ],
        "examples": [],
        "see_also": [
            {
                "name": "pdfdetach",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdfdetach/1/json"
            },
            {
                "name": "pdffonts",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdffonts/1/json"
            },
            {
                "name": "pdfimages",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdfimages/1/json"
            },
            {
                "name": "pdfinfo",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdfinfo/1/json"
            },
            {
                "name": "pdftocairo",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdftocairo/1/json"
            },
            {
                "name": "pdftoppm",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdftoppm/1/json"
            },
            {
                "name": "pdftops",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdftops/1/json"
            },
            {
                "name": "pdftotext",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdftotext/1/json"
            },
            {
                "name": "pdfseparate",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdfseparate/1/json"
            },
            {
                "name": "pdfsig",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdfsig/1/json"
            },
            {
                "name": "pdfunite",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/pdfunite/1/json"
            },
            {
                "name": "PDFTOHTML",
                "section": "1",
                "url": "https://www.chedong.com/phpMan.php/man/PDFTOHTML/1/json"
            }
        ],
        "section_outline": [
            {
                "name": "NAME",
                "lines": 2,
                "subsections": []
            },
            {
                "name": "SYNOPSIS",
                "lines": 2,
                "subsections": []
            },
            {
                "name": "DESCRIPTION",
                "lines": 6,
                "subsections": []
            },
            {
                "name": "OPTIONS",
                "lines": 2,
                "subsections": [
                    {
                        "name": "-h, -help",
                        "lines": 2,
                        "flag": "-h"
                    },
                    {
                        "name": "-f <int>",
                        "lines": 2,
                        "flag": "-f",
                        "arg": "<int>"
                    },
                    {
                        "name": "-l <int>",
                        "lines": 2,
                        "flag": "-l",
                        "arg": "<int>"
                    },
                    {
                        "name": "-q",
                        "lines": 1,
                        "flag": "-q"
                    },
                    {
                        "name": "-v",
                        "lines": 1,
                        "flag": "-v"
                    },
                    {
                        "name": "-p",
                        "lines": 1,
                        "flag": "-p"
                    },
                    {
                        "name": "-c",
                        "lines": 1,
                        "flag": "-c"
                    },
                    {
                        "name": "-s",
                        "lines": 1,
                        "flag": "-s"
                    },
                    {
                        "name": "-dataurls",
                        "lines": 2
                    },
                    {
                        "name": "-i",
                        "lines": 1,
                        "flag": "-i"
                    },
                    {
                        "name": "-noframes",
                        "lines": 2
                    },
                    {
                        "name": "-stdout",
                        "lines": 2
                    },
                    {
                        "name": "-zoom <fp>",
                        "lines": 2
                    },
                    {
                        "name": "-xml",
                        "lines": 1
                    },
                    {
                        "name": "-noroundcoord",
                        "lines": 2
                    },
                    {
                        "name": "-enc <string>",
                        "lines": 2
                    },
                    {
                        "name": "-opw <string>",
                        "lines": 2
                    },
                    {
                        "name": "-upw <string>",
                        "lines": 2
                    },
                    {
                        "name": "-hidden",
                        "lines": 2
                    },
                    {
                        "name": "-fmt",
                        "lines": 2
                    },
                    {
                        "name": "-nomerge",
                        "lines": 2
                    },
                    {
                        "name": "-nodrm",
                        "lines": 1
                    },
                    {
                        "name": "-wbt <fp>",
                        "lines": 4
                    },
                    {
                        "name": "-fontfullname",
                        "lines": 3
                    }
                ]
            },
            {
                "name": "AUTHOR",
                "lines": 6,
                "subsections": []
            },
            {
                "name": "SEE ALSO",
                "lines": 6,
                "subsections": []
            }
        ]
    }
}