Improved HTML Capturing with Org

If, like me, you use Org mode for your note taking and record keeping, you’ve probably found yourself occasionally needing to capture data from a Web page. You can already to that but many browsers don’t handle the conversion from HTML to text very well.

Alphapapa has an interesting solution in the org-protocol-capture-html package. The README.org page shows the result of capturing one of John Kitchin’s pages. It looks pretty nice and even turned a table into its Org equivalent.

I’m not sure how universal the solution is. Part of the requirements is a bookmarklet for the browser. Alphapapa gives examples for Firefox and Chrome but nothing for I.E. or Safari. It probably wouldn’t be very hard to get it working for those browsers too but I haven’t looked into the problem.

Sadly, there isn’t an ELPA package so you have to install it yourself. Of course, that isn’t hard but it does make it harder to keep things up to date. This package seems like a useful addition to Emacs if you often have a need to capture Web pages to Org mode.

This entry was posted in General and tagged , . Bookmark the permalink.