1. 程式人生 > >html2text:將 HTML 轉換為 Markdown 格式文字

html2text:將 HTML 轉換為 Markdown 格式文字

安裝:

pip install html2text
Option Description
–version Show program’s version number and exit
-h, –help Show this help message and exit
–ignore-links Don’t include any formatting for links
–escape-all Escape all special characters. Output is less readable, but avoids corner case formatting issues.
–reference-links Use reference links instead of links to create markdown
–mark-code Mark preformatted and code blocks with [code]…[/code]
>>> import html2text
>>>
>>> print(html2text.html2text("<p><strong>Zed's</strong> dead baby, <em>
Zed's</em> dead.</p>")) **Zed's** dead baby, _Zed's_ dead.
>>> import html2text
>>>
>>> h = html2text.HTML2Text()
>>> # Ignore converting links from HTML
>>> h.ignore_links = True
>>> print h.handle("<p>Hello, <a href='http://earth.google.com/'>world</a>!"
) Hello, world! >>> print(h.handle("<p>Hello, <a href='http://earth.google.com/'>world</a>!")) Hello, world! >>> # Don't Ignore links anymore, I like links >>> h.ignore_links = False >>> print(h.handle("<p>Hello, <a href='http://earth.google.com/'>world</a>!")) Hello, [world](http://earth.google.com/)!