Category: Data Science

Data Science, PhD, R

Manage your Data Science Project in R

Reading time: 9 minutes

DVC is the Git equivalent for data and data science pipelines. Combined with Git and R tools such as packrat and RMarkdown, it lets you manage your whole data science project productively. This post is a tutorial on using these tools together.

Causality, Data Science, PhD, R

Spurious Independence: is it real?

Reading time: 14 minutes

First things first: Spurious Dependence

Depending on your background, you may have already heard of spurious dependence in one way or another. It goes by names such as spurious association and spurious dependence, the famous quote “correlation does not imply causation”, and other versions based on the same idea that you cannot necessarily say that …