Best links of the week #28

Best links of the week from 15th July to 21st July


  1. What are probabilistic graphical models, and why are they useful? at Quora.
  2. Google sister-company Verily is teaming with big pharma on clinical trials at CNBC.
  3. Colors of noise at John D. Cook Consulting.
  4. An Idiot’s Guide to Support Vector Machines (SVMs) by R. Berwick.
  5. Introduction to Support Vector Machines by Shivani Agarwal.
  6. The Intuition Behind Inverse Probability Weighting by Judea Pearl.
  7. Resampling/simulation methods: Monte Carlo, bootstrapping, jackknifing, cross-validation, randomization tests, and permutation tests at Cross Validated.
  8. Many articles about the data.table package for R.
  9. Cumulative Variable Importance for Random Forest (RF) Models at Rich Pauloo’s GitHub.
  10. Flipping a journal to open access will boost its citation performance – but to what degree varies by publisher, field, and rank at LSE Impact Blog.
  11. 35 Innovators Under 35: Making the software that lets powerful AI programs run more smoothly at MIT Technology Review.
  12. Top 40 R Programming Blogs and Websites To Follow in 2019 at Feedspot.
  13. Gene enrichment statistics with Hypergeometric and Fisher’s exact tests (part of Analyse statistique des données génomiques (ASG) course) at Denis Puthier’s course website.
  14. Open-source Version Control System for Machine Learning Projects (DVC).


  1. How to Set Up Data Science [in corporate environments]? by Philipp Diesinger at Data Science Central.
  2. SVMs in One Picture by Stephanie Glen at Data Science Central.
  3. Evaluation by journal name poisons science at Stephen Floor’s Medium account.
  4. Unexpected effects on scientific publishing after CRISPR-Cas9 editing in vivo at Stephen Floor’s Medium account.
  5. Should scientific publishing move to Github and friends? at Ed Hagen’s Blog.
  6. Text Parsing and Text Analysis of a Periodic Report (with R) at Tony ElHabr’s Blog.
  7. Making a Cheat Sheet with Rmarkdown at Tony ElHabr’s Blog.
  8. Why do we use arrow as an assignment operator? [in R] at Colin FAY’s Blog.
  9. Explain R environments like I’m five at Colin FAY’s Blog.
  10. A Crazy Thing Called {purrr} (Part 1, 2, 3, 4, 5 and 6) at Colin FAY’s Blog.
  11. Collecting tweets with R and {rtweet} at Colin FAY’s Blog.
  12. How to get Twitter data with rtweet in R at Storybench.
  13. Intro to rtweet: Collecting Twitter Data at rtweet.


  1. Data versioning in machine learning projects by Dmitry Petrov at PyData’s YouTube channel.

Scientific Articles

  1. Accelerating scientific publication in biology at PNAS.
  2. Potential Outcome and Directed Acyclic Graph Approaches to Causality: Relevance for Empirical Practice in Economics at

Positions available

  1. Junior Data Analyst at Criteo.
  2. Software Engineer (AI) at Facebook.
  3. Permanent research position on algorithms and software for graph and tensor computing at Huawei Technologies.
  4. Data Engineer at Keen Eye.
  5. Consultant Data Scientist at Datatorii.
  6. Data Scientist at Business & Decision.
  7. Research Scientist position on Human Augmented Sensing at Bell Labs-Research.
  8. Research Scientist position on Augmented Dynamic Networks at Bell Labs-Research.
  9. Ph.D. opportunity in Data Science at Scuola Normale Superiore.
  10. Ph.D. Studentship Artificial Intelligence for Future Society at the University of Southampton.
  11. Senior Data Scientist at Kreditech.
  12. Data Engineer at Kreditech.
  13. Research Fellow in Theoretical Neuroscience and Machine Learning at UCL.
  14. Postdoc Daytime and Bigmedilytics at the Eindhoven University of Technology.
  15. Research Associate in Next-Generation Artificial Intelligence at the University of Sheffield.
  16. Ph.D. Scholarship Positions in Software Engineering at the Chalmers University of Technology.