Reading time: 9 minutes DVC is the git equivalent for data and data science pipelines. Along with git and some R technologies such as packrat and RMarkdown you can manage your whole data science project in a very productive way! This post is a tutorial on using all these tools to manage your Data Science project!
Category: Data Science
Spurious Independence: is it real?
Reading time: 14 minutes First things first: Spurious Dependence Depending on your background, you have already heard of spurious dependence in a way or another. It goes by the names of spurious association, spurious dependence, the famous quote “correlation does not imply causation” and also other versions based on the same idea that you can not say that necessarily …
How can I evaluate my model? Part I.
Reading time: 8 minutes Do you want to know more about terms such as recall, specificity, sensitivity, precision, f-score, accuracy, ROC, AUC? Come with me! 😉
Best links of the week #20
Reading time: < 1 minute More datasets for your analysis, more tips on tidyverse and R and the hitchhiker’s guide to feature extraction 🙂
Web Scraping, visualização de dados com R e os decretos do Bolsonaro
Reading time: 13 minutes Como o atual presidente do Brasil se compara em termos de número de decretos com seus predecessores?