• Prototype, influence, build skills and deliver value January 17, 2025
  • I’ve learned that being a well-rounded data engineer goes beyond technical skills. It’s about how you approach problems, communicate with teams, and deliver outcomes. Here are four ideas that have helped me work better and make a greater impact, with some examples

    Read more »
  • Modern Data Engineer in 2025 January 3, 2025
  • 2024 was a busy year. A personal highlight is that I married my wife in the summer in Nice and had an amazing honeymoon in Santorini. I immersed myself in learning. I read and took notes on over 800 articles that spoke to me. They were on tech, science, data engineering, productivity, stoicism and many other topics. I simplified my workflows by cutting out complexity.

    Read more »
  • Configuration Data Formats in Data Engineering August 20, 2024
  • When I started out, I didn’t give much thought to how I stored my configurations. A JSON file here, a YAML file there, whatever came to my mind first. As projects grew more complex, the way I structured my configuration files began to matter a lot more.

    Read more »
  • Logging for Data Engineers August 9, 2024
  • After years of building data pipelines, from ones that lived in janky notebooks to ones that automate complex workflows in production environments, logging wasn’t something I paid a lot of attention to.

    Read more »
  • Unit Testing in Data Engineering April 21, 2024
  • I believe we can all agree that writing tests can feel as mundane as doing the laundry as most of us would probably rather be coding new features or learning something new. We would also agree that testing is undeniably a critical component of data engineering, setting individuals apart by the skill it demands.

    Read more »
  • Event-Driven Data Pipelines in AWS - Part 2 April 6, 2024
  • A data engineer doesn’t just code all day, they also design, plan, tinker, analyse, fix bugs, attend meetings and manage trade off of best principles all day. Being familiar with Terraform is a great skill to have to automate provisioning and management of infrastructure. As well as a cross-functional skill, it aligns data engineers with DevOps principles, fostering continuous integration, delivery, and deployment.

    Read more »
  • Event-Driven Data Pipelines in AWS - Part 1 March 25, 2024
  • In my early days as an analyst, I was always eager to automate tasks, often finding myself repeating lines of code without scalable solutions in sight. As I delved deeper into the field of data engineering, I started tinkering with cloud, where I found myself hooked on the possibilities.

    Read more »