Jupyter/Ipython notebooks

After writing it down a couple of weeks ago for Hacker News, here is the recap and some updates:

I am a computational biologist with a heavy emphasis on the data analysis. I did try Jupyter a couple of years ago and here are my concerns with it, compared to my usual flow (Pycharm + pure python + pickle to store results of heavy processing).

Extracting functions is harder
Your git commits become completely borked
Opening some data-heavy notebooks is neigh impossible once they have been shut down
Import of other modules you have in local is pretty non-trivial.
Refactoring is pretty hard
Sphinx for autodoc extraction is pretty much out of the picture
Non-deterministic re-runs – depending on the cell
execution order you can get very different results. That’s an issue
when you are coming back to your code a couple of months later and
try to figure what you did to get there.
Connecting to the ipython notebook, even from the environments like Pycharm is highly non-trivial, just as the mapping to the OS
filesystem
Hard to impossible to inspect the contents of the ipython notebook when it’s hosted on Github due to the encoding snafus

There are likely work-arounds for most of these problems, but the issue is that with my standard workflow they are non-issues to start with.

In my experience, Jupyter is pretty good if you rely only on existing libraries that you are piecing together, but once you need to do more involved development work, you are screwed.

One thought on “Jupyter/Ipython notebooks”

Felix says:

September 18, 2019 at 16:13

Hey, I’ve had the same problems with Jupyter. I wrote a library that, imo, dies quite a good job of substituting it: https://pypi.org/project/datasheet/

Increasing information density

Evolving ideas

One thought on “Jupyter/Ipython notebooks”

Leave a Reply Cancel reply