PinnedPublished inThe StartupThe Modern Data Platform Design: a tool agnostic approachThe big data echosystem is still too big! There I said it. I still remember seeing this post years ago and as a young and upcoming data…Aug 7, 2022Aug 7, 2022
PinnedPublished inThe StartupPySpark on macOS: installation and useSpark is a very popular framework for data processing. It has slowly taken over the use of Hadoop for data analytics. In memory processing…Oct 21, 20194Oct 21, 20194
Published inThe StartupCI/CD Pipelines for Python Packages With Circle CI and CodeClimateIn software engineering, continuous deployment and integration is a growing trend consisting of frequently updating and releasing code via…Jan 24, 2021Jan 24, 2021
Docker: a PrimerDocker is an application allowing to manage Linux containers on top of an existing OS. It provides a virtualisation layer (the Docker…Mar 5, 2019Mar 5, 2019
How to setup an AWS EC2 instanceThe recent developement in data storage and processing have been motivated by the increasing amount and complexity of data available to…May 28, 2018May 28, 2018
Classification with OpenCV3 C++ (1/2)OpenCV is an open source C++ library focusing on computer vision launched in 1999 by Intel research. It is written in C++ but bindings in…May 7, 2018May 7, 2018
Setting up Xcode for C++ projectsWhen I first started using Xcode for my C++ projects I was a bit overwhelmed by the settings interface; sure I had used Xcode heavily for…Aug 14, 2017Aug 14, 2017
Une introduction à ArmadilloArmadillo est une librarie template C++ avec une interface très complete pour le calcul matriciel et l’algèbre linéaire. Elle permet de…Jul 9, 20171Jul 9, 20171