Introduction

I attended the online conference EuroPython 2020 and recorded notes while watching the talks. Today, I reviewed the notes with the aim of consolidating the main lessons, grouping together similar talks, and recording key lessons.

Workflows

This seems to be a big theme, and sounds like the next big challenge for the data scientist profession.

GitLab Tools, William Arias

Parallel stream processing, Alejandro Saucedo

Example workflow for Translation, Shreya Khurana

Example from teikametrics, Chase Stevens

DVC, Hongjoo Lee

Data pipelines, Robson Junior

Example from DeepAR talk, Nicola Kuhaupt

NLPeasy, Philipp Thomann

Kedro, Tam-Sanh Nguyen

spaCy, Alexander Hendorf

diffprivlib, Naoise Holohan

Google’s ML APIs and AutoML, Laurent Picard

SimPy, Eran Friedman

Data Visualisation Landscape, Bence Arato

Binder, Sarah Gibson

IPython, Miki Tebeka

Analytical Functions in SQL, Brendan Tierney

Tricks and tools for efficiency/speed gains

Several talks were about this, so I thought it was worth grouping them together

concurrent.futures, Chin Hwee Ong

daal4py and SDC by Intel, Fedotova and Schlimbach

Tips and tricks for efficiency gains in Pandas, Ian Ozsvald

30 Rules for Deep Learning Performance, Siddha Ganju

Miscellaneous

Tips for Docker, Tania Allard.

Data Cleaning Checklist, Hui Zhang Chua

History of Jupyter Notebooks, William Horton

Neural Style Transfer and GANs, Anmol Krishan Sachdeva

Probabilistic forecasting with DeepAR, Nicolas Kuhaupt