Collection of useful data science, machine learning and data engineering topics with code examples linked to posts in my blog engineeringfordatascience.com
Articles about Python and data science best practices
| Title | Article | Repository |
|---|---|---|
| Which Python string formatting method should you be using in your data science project? | Blog | Link |
| Reproducible ML: Maybe you shouldn't be using Sklearn's train_test_split | Blog | Link |
| Setting up logging for Python projects | Blog | Link |
| Title | Article | Repository |
|---|---|---|
| Plotting subplots in a loop | Blog | Link |
| Making impactful charts using plt.suptitle | Blog | Link |
| Title | Article | Repository |
|---|---|---|
| (Part 1) Voilà! Interactive Python Dashboards Straight from your Jupyter Notebook | Blog | Link |
| (Part 2) Voilà! Optimising Python Dashboard Performance | Blog | Link |
| (Part 3) Voilà! Deploy your Jupyter Notebook Based Python Dashboard on Heroku | Blog | Link |
| Title | Article | Repository |
|---|---|---|
| SQL-like Window Functions in Pandas | Blog | Link |
| Title | Article | Repository |
|---|---|---|
| Unit testing in PySpark | Blog | Link |
| How to save the output of PySpark DataFrame 'show' to a variable | Blog | Link |
| How to save a PySpark dataframe to a single output file | Blog | Link |
| Title | Article | Repository |
|---|---|---|
| How to use fixtures as arguments in parametrize | Blog | Link |
| Tips for using PyTest | Blog | Link |
| How to mock sending SMTP emails using PyTest | Blog | Link |
| How to use allure-pytest and allure-pytest-bdd plugins in the same project | Blog | Link |
| Title | Article | Repository |
|---|---|---|
| Event Driven Data Validation with Google Cloud Functions and Great Expectations | Blog | Link |
| Google Search Console API with Python | Blog | Link |
| Title | Article | Repository |
|---|---|---|
| Export Spotify Playlist to CSV File with Python | Blog | Link |