Best links of the week from 15th July to 21st July
Links
- What are probabilistic graphical models, and why are they useful? at Quora.
- Google sister-company Verily is teaming with big pharma on clinical trials at CNBC.
- Colors of noise at John D. Cook Consulting.
- An Idiot’s Guide to Support Vector Machines (SVMs) by R. Berwick.
- Introduction to Support Vector Machines by Shivani Agarwal.
- The Intuition Behind Inverse Probability Weighting by Judea Pearl.
- Resampling/simulation methods: Monte Carlo, bootstrapping, jackknifing, cross-validation, randomization tests, and permutation tests at Cross Validated.
- Many articles about the data.table package for R.
- Cumulative Variable Importance for Random Forest (RF) Models at Rich Pauloo’s GitHub.
- Flipping a journal to open access will boost its citation performance – but to what degree varies by publisher, field, and rank at LSE Impact Blog.
- 35 Innovators Under 35: Making the software that lets powerful AI programs run more smoothly at MIT Technology Review.
- Top 40 R Programming Blogs and Websites To Follow in 2019 at Feedspot.
- Gene enrichment statistics with Hypergeometric and Fisher’s exact tests (part of Analyse statistique des données génomiques (ASG) course) at Denis Puthier’s course website.
- Open-source Version Control System for Machine Learning Projects (DVC).
Blog/posts
- How to Set Up Data Science [in corporate environments]? by Philipp Diesinger at Data Science Central.
- SVMs in One Picture by Stephanie Glen at Data Science Central.
- Evaluation by journal name poisons science at Stephen Floor Medium account.
- Unexpected effects on scientific publishing after CRISPR-Cas9 editing in vivo at Stephen Floor Medium account.
- Should scientific publishing move to Github and friends? at Ed Hagen’s Blog.
- Text Parsing and Text Analysis of a Periodic Report (with R) at Tony ElHabr’s Blog.
- Making a Cheat Sheet with Rmarkdown at Tony ElHabr’s Blog.
- Why do we use arrow as an assignment operator? [in R] at Colin FAY’s Blog.
- Explain R environments like I’m five at Colin FAY’s Blog.
- A Crazy Thing Called {purrr} (Part 1, 2, 3, 4, 5 and 6) at Colin FAY’s Blog.
- Collecting tweets with R and {rtweet} at Colin FAY’s Blog.
- How to get Twitter data with rtweet in R at Storybench.
- Intro to rtweet: Collecting Twitter Data at rtweet.
Videos
- Data versioning in machine learning projects by Dmitry Petrov at PyData’s YouTube channel.
Scientific Articles
- Accelerating scientific publication in biology at PNAS.
- Potential Outcome and Directed Acyclic Graph Approaches to Causality: Relevance for Empirical Practice in Economics at arXiv.org.
Positions available
- Junior Data Analyst at Criteo.
- Software Engineer (AI) at Facebook.
- Permanent research position on algorithms and software for graph and tensor computing at Huawei Technologies.
- Data Engineer at Keen Eye.
- Consultant Data Scientist at Datatorii.
- Data Scientist at Business & Decision.
- Research Scientist position on Human Augmented Sensing at Bell Labs-Research.
- Research Scientist position on Augmented Dynamic Networks at Bell Labs-Research.
- Ph.D. opportunity in Data Science at Scuola Normale Superiore.
- Ph.D. Studentship Artificial Intelligence for Future Society at the University of Southampton.
- Senior Data Scientist at Kreditech.
- Data Engineer at Kreditech.
- Research Fellow in Theoretical Neuroscience and Machine Learning at UCL.
- Postdoc Daytime and Bigmedilytics at the Eindhoven University of Technology.
- Research Associate in Next-Generation Artificial Intelligence at the University of Sheffield.
- Ph.D. Scholarship Positions in Software Engineering at the Chalmers University of Technology.