Codebeez
Sharing knowledge

Blog

Practical insights, opinions and stories from the Beez about Python, cloud, data and everything in between.

Tooling 10 Oct 2024

uv as a poetry/pyenv/pipx replacement

How the recently expanded uv consolidates poetry, pyenv and pipx into a single fast Python toolchain.

by Pepijn
Kubernetes 19 Sept 2024

K8S Security

Why Docker and Kubernetes are not secure by default, and how to harden UID mapping, image layers, RBAC, network policies and secrets for FastAPI.

by Leonid Borodaev
Performance 7 Mar 2024

The memory footprint of your Python application

A tour of Python's memory model: how objects are allocated, stored and cleaned up, with practical tweaks to cut memory usage.

by Pepijn
Data 13 Feb 2024

Vector databasing with DuckDB on top of PGVector

An architecture integrating DuckDB with Postgres and PGVector for vector databasing, bridging the gap between OLTP and OLAP.

by Luuk van der Velden
DevOps 16 Nov 2023

Streamlining Azure Logic Apps Deployment with Managed Identity in Azure DevOps: An IaC Approach

How to deploy Azure Logic Apps securely with Managed Identities, Bicep and Azure DevOps YAML pipelines as code.

by Souhail Terrahi
API 27 Sept 2023

So you want to expose a database? An exploration of automated API generation

Hackathon write-up comparing automatic and AI-driven ways to generate a CRUD API from an existing PostgreSQL database.

by Stefan van der Weide
Conference 25 Jul 2023

EuroPython 2023 Trip Report

A first-hand report from EuroPython 2023 in Prague: keynotes, performance tricks, AI talks and key takeaways.

by Pepijn Bakker
Python 10 May 2023

Type hinting in modern Python: The Protocol class

How Python's Protocol class brings static type checking to duck-typed code without forcing explicit inheritance.

by Pepijn Bakker
Development 9 Mar 2023

A reproducible, terminal-first Python development box

How to build a reproducible, keyboard-first Python development box with Ansible, a terminal-based workflow and the right tooling.

by Pepijn Bakker
MLOps 11 Jan 2023

Secure MLOps with Databricks MLFlow

How to run multiple logical environments in one Databricks workspace and securely separate production models with an environment-aware MLFlow client.

by Luuk van der Velden
Conference 25 Nov 2022

Build Stuff 2022: Summaries

Notes and summaries from the most valuable talks at the Build Stuff 2022 conference in Vilnius.

by Vito Minheere
Conference 3 Nov 2022

Blipz on the radar 2022: summaries

Summaries of the Blipz 2022 talks: data mesh, sustainable IT, Polars, CDK, human-in-the-loop ML, OSINT and zero trust.

by Luuk van der Velden
Conference 13 Jul 2022

EuroPython 2022: Summaries of selected talks

Codebeez summaries of selected EuroPython 2022 talks, covering the JWST pipeline, error messages, PyArrow, asyncio debugging and clean architecture.

by Luuk van der Velden
Data 20 Apr 2022

The magical fusion between batch and streaming insights

How the Lambda architecture fuses batch and streaming insights into fresh data products, with our Azure implementation as illustration.

by Luuk van der Velden
CI/CD 9 Jun 2021

Elegant CICD with Databricks notebooks

How to unit and integration test Databricks notebooks on transient clusters and ship them as reproducible artifacts via Azure DevOps.

by Rik Jongerius & Luuk van der Velden
ML 1 Mar 2021

Single-node and distributed Deep Learning on Databricks

How single-node and multi-node Databricks clusters enable parallel and distributed Deep Learning beyond Spark, using Data Factory, Horovod and Petastorm.

by Luuk van der Velden & Rik Jongerius
AI 23 Nov 2020

The Netherlands, a worldwide AI knowledge hub

How the Netherlands could become a worldwide AI knowledge hub by leading in the ethical, societal application of high tech.

by Luuk van der Velden
MLOps 20 Aug 2020

MLOps: a tale of two Azure pipelines

How to combine Azure DevOps and AzureML pipelines into an end-to-end MLOps solution for continuous training and deployment.

by Luuk van der Velden & Rik Jongerius
Azure 24 Jun 2020

AzureML PyTorch GPU enabled compute target: Unifying remote and local environments

Configure an AzureML GPU compute target for PyTorch and reuse the same Docker image to unify local development and remote training.

by Luuk van der Velden
ML 2 Jun 2020

Microsoft and Python Machine Learning, a modern love story, Part 1

A development workflow for launching ML models in the cloud with AzureML while coding in VSCode devcontainers on WSL2.

by Luuk van der Velden