TL;DR
Notebooks are nice for exploratory knowledge evaluation (EDA), Proof of Ideas (PoCs). However they don’t scale nicely for long-term upkeep. Subsequently, for manufacturing techniques I really helpful IDE-based modular Python repositories.
Do you like pocket book vs Built-in Growth Atmosphere (IDE)-based Python repositories for knowledge science tasks?
I used to be requested this query a number of occasions in final 8 years.
Background:
I began programming on the college utilizing IDEs (dev cpp, Eclipse and so on.), then labored with Android Studio, Visible Studio and extra.
After I began engaged on knowledge science associated subjects throughout my grasp’s, I then began additionally utilizing Jupyter Notebooks. To be trustworthy, it felt like one thing is lacking (I assume debugging) and by no means actually preferred working with notebooks. And I at all times tried to modify to an IDE akin to Spyder (because of nice integration with anaconda at the moment).
However by the point, I began realizing it’s really not that unhealthy and may really be higher a device for a lot of use circumstances than IDEs particularly for knowledge science associated subjects.
Conclusion:
So, for knowledge science tasks, ought to we simply use notebooks in all places or have a modular structured code repositories with IDE assist?
I’d say, it relies on what you’re doing, your position, and what you’re attempting to realize.
In case you are doing exploratory knowledge evaluation (EDA), engaged on a proof of idea (PoC), must do fast iterations, mainly something that isn’t going to run in manufacturing, then you need to use notebooks. They’re intuitive, straightforward to iterate with, and permit for quick experimentation.
Nonetheless, in case your ETL or ML pipeline is supposed to run in manufacturing, you then want a structured, modular repository with correct IDE assist.
Notebooks merely don’t scale nicely for
- Lengthy-term upkeep
- Debugging
- Testing
- Model management
In my subsequent article, I’ll discuss how this may be achieved for the preferred for cloud platforms. Keep tuned!