Custom rendering of beautifulsoup objects in IPython notebook and qtconsole
IPython-BeautifulSoup is an IPython extension for displaying BeautifulSoup HTML/XML objects as prettified and syntax highlighted HTML blocks in IPython notebook and qtconsole.
Syntax highlighting is accomplished with Pygments.
pip install "ipython-beautifulsoup[bs4]"
Installing IPython Notebook
See the IPython installation docs for details.
To install IPython notebook or qtconsole as well, append notebook and/or qtconsole to the extras specifier after “bs4” separated by a “,”, like this:
pip install "ipython-beautifulsoup[bs4,notebook,qtconsole]"
On Ubuntu LTS, if you want to install IPython notebook, you’ll need to do this before:
sudo apt-get install python-dev g++
(use python3-dev if you’re using Python 3)
For the qtconsole do this (if you do this in a virtualenv) (WARNING: it’s slow):
sudo apt-get install make cmake qt4-qmake libqt4-dev pip install pyside
In IPython notebook or qtconsole, run:
This will push a series of callables into your current context, as well as a monkey-patched BeautifulSoup and requests.
You can now use BeautifulSoup like you would if it was imported from the corresponding module.
There is great chances that you’ll want to configure the output by using configure_ipython_beautifulsoup, for example like this (just after the %load_ext):
configure_ipython_beautifulsoup(show_html=True, show_css=True, show_js=False)
To see configure_ipython_beautifulsoup documentation just do (in any interface of IPython):
This also loads a shortcut function called p (for Parse) defined as follows:
def p(url): if requests is not None: return BeautifulSoup(requests.get(url).contents) return BeautifulSoup(urlopen(url).read())
A note on security
By nature of including external HTML, JS, and CSS, this extension is inherently unsafe if you choose to render the html by setting show_html to True when calling ``configure_ipython_beautifulsoup``.
By default, <script>, <link>, and <style> tags are removed but this isn’t a 100% guarantee that this is secure if you choose to render the html, use at your own risk.
The safest option is to set all options of configure_ipython_beautifulsoup to False (the default).
In chronological order:
Don’t hesitate to add yourself.
- python 3 compatibility
- classifiers in the setup.py
- and a series of small fixes
- also: wheel!
- mainly the work of Matt Morrison, see https://github.com/Psycojoker/ipython-beautifulsoup/pull/5
- security improvement (configure_ipython_beautifulsoup)
- mostly the work of westurner, see https://github.com/Psycojoker/ipython-beautifulsoup/pull/3
- bug fixes
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|File Name & Checksum SHA256 Checksum Help||Version||File Type||Upload Date|
|ipython_beautifulsoup-0.3-py2.7.egg (4.6 kB) Copy SHA256 Checksum SHA256||2.7||Egg||Aug 26, 2014|
|ipython_beautifulsoup-0.3-py2.py3-none-any.whl (7.4 kB) Copy SHA256 Checksum SHA256||2.7||Wheel||Aug 26, 2014|
|ipython-beautifulsoup-0.3.tar.gz (4.9 kB) Copy SHA256 Checksum SHA256||–||Source||Aug 26, 2014|