Tag: statistics

Best links of the week #21

Reading time: 2 minutes

Best links of the week from 27th May to 2nd June

Source: Geek Hero Comic.

Links

  1. Samsung AI Can Turn a Single Portrait Into a Realistic Talking Head at PetaPixel.
  2. Let’s Encrypt (Free Certificate Authority) at MLAIT.
  3. Public data from the French government.
  4. Paris opens a data center to control its digital infrastructure.
  5. genderBR is an R package that predicts gender from Brazilian first names using data from the Instituto Brasileiro de Geografia e Estatística’s 2010 Census.
  6. Git Cherry Pick at Atlassian Git Tutorials.
  7. Refs and the Reflog at Atlassian Git Tutorials.
  8. Advanced Git log at Atlassian Git Tutorials.
  9. Merging vs. Rebasing at Atlassian Git Tutorials.
  10. Intro to Cherry Picking with Git at PreviousNext.
  11. What makes the data scientist the professional most sought after by HR departments? at StartSe.
  12. 8 essential skills for data scientists at CIO.

Best links of the week #20

Reading time: < 1 minute

Best links of the week from 20th May to 26th May

Links

  1. UN (United Nations) data.
  2. A curated list of 200+ blogs related to Data Science at CybrHome.
  3. 25 Excellent Machine Learning Open Datasets.
  4. Group Chats Are Making the Internet Fun Again at Intelligencer.
  5. Do anything with dplyr.
  6. Starting out with R at Credibly Curious.

Best links of the week #18

Reading time: 3 minutes

Best links of the week from 6th May to 12th May

Source: xkcd.

Links

  1. NextJournal, Seamless Data Science for Teams.
  2. An executive’s guide to AI.
  3. What should I use to serve R applications over the internet? at Brian Caffo’s YouTube channel. He talks about PlumbeR (PlumbeR book here).
  4. Will AI eat statistics? at Brian Caffo’s YouTube channel.
  5. A radical new neural network design could overcome big challenges in AI at MIT Technology Review.
  6. Urgent need for a government-led big data system, say industry experts at The Edge Markets.
  7. Top 10 Cities Across The Globe With The Highest Pay Packages For Data Scientists at Analytics India Magazine.
  8. What Nobody Tells You About Machine Learning at Forbes.
  9. How the data mining of failure could teach us the secrets of success at MIT Technology Review.
  10. How to hide from the AI surveillance state with a color printout at MIT Technology Review.
  11. Boosting (machine learning) at Wikipedia.
  12. Weak Learning, Boosting, and the AdaBoost algorithm at Jeremy Kun’s Blog.
  13. Weak vs. Strong Learning and the Adaboost Algorithm at Jenn Wortman Vaughan’s Website.
  14. What is a weak learner? at StackOverflow.
  15. AI is ready to radically transform software development at CIO.
  16. The budget of federal universities and institutes since 2000 at Nexo Jornal.
  17. The government against the universities, in data and analyses at Nexo Jornal.
  18. Is there any documented microevolution in humans over the last two hundred years? at Quora.

Best links of the week #16

Reading time: 2 minutes

Best links of the week from 22nd April to 28th April

You can check this comic here

Links

  1. Do more with R: drag-and-drop ggplot at InfoWorld.
  2. Apart from esquisse, the package mentioned in the link above, there is another one that allows you to drag-and-drop and plot your data: ggplotAssist.
  3. DreamRs is a French R consulting firm. On their website, they have made some Shiny apps built on real data publicly available, such as RATP traffic and a GitHub dashboard.
  4. VCs just invested $8 million into this startup that gave away its software for free because they noticed how much people loved it!
  5. Cheat sheets for several software tools and concepts related to Data Science at Asif Bhat’s GitHub.
  6. Data Science must-read articles, tutorials and useful links at Asif Bhat’s GitHub.
  7. Math required for Data Science at Asif Bhat’s GitHub.
  8. Quick overview of Statistics for Biologists (it’s useful for pretty much everybody, you don’t say no to an offer of knowledge :-).
  9. How can I show the intermediate steps of a long routine in R? at StackOverflow.
  10. ‘Friendly’ reviewers rate grant applications more highly at Nature.
  11. Calm down, everyone. Keeping dead pig cells alive is not ‘brain resuscitation’ at Los Angeles Times.
  12. Uber is publicly sharing some data!
  13. Need help choosing the right visualization method? From Data to Viz can help you!
  14. IBM releases Diversity in Faces, a dataset with over 1 million annotated images to help fight bias at Turing Tribe.
  15. By 2030, AI will contribute more than US$ 15.7 trillion to the global economy at Computer World.
  16. The extraordinary scientist who studied Einstein’s brain and revolutionized modern neuroscience at Época Negócios.
  17. TerraBrasilis, open-access public geographical data for environmental monitoring.

Best links of the week #15

Reading time: 2 minutes

Best links of the week from 15th April to 21st April

Links

  1. When it comes to clustering, depending on the algorithm used, one may have a hard time determining the appropriate k (number of clusters). Some algorithms do not require it, but for the ones that do, such as k-means, you should have a look at the elbow method to evaluate the appropriate k, or at the silhouette of the objects with respect to their clusters.
  2. Dunder Data is a professional training company dedicated to teaching data science and machine learning. It offers both paid and free online material.
  3. Software Carpentry, teaching basic lab skills for research computing.
  4. ROpenSci, transforming science through open data and software.
  5. mlmaisleve, quick and light concepts about Machine Learning.
  6. kite, Code Faster in Python with Line-of-Code Completions.
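
A minimal sketch of the elbow method mentioned in the clustering item above, assuming a toy two-dimensional dataset of three well-separated blobs. The helper names (`farthest_point_init`, `kmeans_inertia`, `elbow`) are hypothetical, the k-means is a bare-bones pure-Python version with deterministic seeding rather than a library implementation, and taking the largest second difference is only one of several heuristics for locating the bend. The silhouette approach is not covered here.

```python
# Elbow-method sketch on toy data (illustrative only, not a library API).

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def farthest_point_init(points, k):
    """Deterministic seeding: start at points[0], then repeatedly add the
    point that is farthest from the centres chosen so far."""
    centres = [points[0]]
    while len(centres) < k:
        centres.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centres))
        )
    return centres


def kmeans_inertia(points, k, max_iter=100):
    """Run Lloyd's algorithm and return the within-cluster sum of squares."""
    centres = farthest_point_init(points, k)
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centre.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centres[i]))
            clusters[nearest].append(p)
        # Update step: move each centre to the mean of its members
        # (an empty cluster keeps its previous centre).
        new_centres = [
            tuple(sum(coord) / len(members) for coord in zip(*members))
            if members else centres[i]
            for i, members in enumerate(clusters)
        ]
        if new_centres == centres:
            break
        centres = new_centres
    return sum(min(sq_dist(p, c) for c in centres) for p in points)


def elbow(inertias):
    """inertias[i] holds the inertia for k = i + 1. Return the k with the
    largest second difference, i.e. where the curve bends the most."""
    best_k, best_bend = None, float("-inf")
    for i in range(1, len(inertias) - 1):
        bend = inertias[i - 1] - 2 * inertias[i] + inertias[i + 1]
        if bend > best_bend:
            best_k, best_bend = i + 1, bend
    return best_k


# Toy data: three blobs of four points each (assumed for illustration).
offsets = [(-0.5, -0.5), (-0.5, 0.5), (0.5, -0.5), (0.5, 0.5)]
blob_centres = [(0.0, 0.0), (10.0, 0.0), (5.0, 10.0)]
points = [(cx + dx, cy + dy) for cx, cy in blob_centres for dx, dy in offsets]

inertias = [kmeans_inertia(points, k) for k in range(1, 7)]
best_k = elbow(inertias)
print(best_k)
```

On this toy data the inertia drops sharply up to k = 3 and barely changes afterwards, so the bend lands on the number of blobs. On real data the curve is usually smoother and the choice of k is more of a judgment call.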

Best links of the week #14

Reading time: 2 minutes

Best links of the week from 8th April to 14th April

Source: Business Broadway.

Links

  1. Many more images like the one above at Business Broadway.
  2. Websites with challenges and exercises at Gabriel Fonseca’s GitHub page.
  3. Support innovation in healthcare with Hacking Health! There are several chapters around the world, including several in Brazil and in France :-).
  4. What are some examples of “Correlation does not equal causation?” at Quora.
  5. Does no correlation imply no causality? at Cross Validated.
  6. PEARL VS RUBIN (GELMAN) at Dokyun Lee’s website.
  7. Virgilio, your new Mentor for Data Science E-Learning at Giacomo Ciarlini.
  8. A quick reference for data visualization.
  9. Dev Tube.
  10. Why do I need “Principal Component Analysis” or PCA in data mining? at Quora.
  11. Harvard launches 15 free Artificial Intelligence courses at Estagio Online.
  12. Harvard’s tests select your genes at Deviante.
  13. The biopsychosocial reality of violence at Deviante.

Best links of the week #13

Reading time: 2 minutes

Best links of the week from 1st April to 7th April


Links

  1. Feature Engineering presentation by HJ van Veen (Nubank Brasil).
  2. Winning Data Science Competitions presentation by Owen Zhang (Data Robot).
  3. Tips and tricks to win kaggle data science competitions by raddar.
  4. 2019 Best Data Science Bootcamps.
  5. Free open public domain football data (football.db) for Brazil here and here.
  6. A weekly email of useful links for people interested in building data platforms.
  7. Top GAN Research Papers Every Machine Learning Enthusiast Must Peruse at Analytics India Magazine.

Best links of the week #9

Reading time: < 1 minute

Best links of the week from 4th March to 10th March.

Links

  1. What are some of your favorite, but less well-known, packages for R? [1] [2] at the Statistics and Data Science subreddits.
  2. Why is it wrong to stop an A/B test before optimal sample size is reached? at Cross Validated (Stack Exchange).
  3. How do I calculate statistical power? at Effect Size FAQs.
  4. Personal website generator.
  5. From hard drive to over-heard drive: Boffins convert spinning rust into eavesdropping mic at The Register.
  6. List of Machine Learning / Deep Learning conferences in 2019 at Tryo Labs.
  7. We Use Less Information to Make Decisions Than We Think at Harvard Business Review.
  8. Apple CEO Tim Cook explains why you don’t need a college degree to be successful at Business Insider.
  9. Jordan Peterson’s 10-step process for stronger writing at Big Think.
  10. R package primer at Karl Broman’s website.
  11. Researchers Can Now Cheaply Turn Atmospheric CO2 Back Into Coal at IFLScience.
  12. Machine learning study plan with content in Portuguese at Italo José’s GitHub.
  13. Brazil in liberated data.
  14. Facial recognition helps arrest a criminal at the Salvador Carnival at Canal Tech.
  15. Knowing your own genome involves surprises and disappointments at Folha de São Paulo.
  16. What is the logic behind the lie detector? at Revista Questão de Ciência.
  17. Research that looks like medicine but is not at Revista Questão de Ciência.
  18. The distribution of people with doctorates across Brazil at Nexo Jornal.
  19. Programmers will make the path easier for attackers, researchers say at Mundo Hacker.

Best links of the week #8

Reading time: 2 minutes

Best links of the week from 25th February to 3rd March.

Links

  1. I don’t like notebooks at Jupyter Conference 2018 by Joel Grus.
  2. Twitter thread on Regression to the Mean Bias in a published paper at Andrew Althouse’s Twitter feed.
  3. The hipster effect: Why anti-conformists always end up looking the same at MIT Technology Review.
  4. An archive of datasets distributed with R.
  5. Beautiful, customizable, publication-ready model summaries in R (R Package) at Vincent Arel-Bundock’s GitHub account.
  6. Advanced R, a book by Hadley Wickham.
  7. R for Data Science, a book by Hadley Wickham.
  8. R packages, a book by Hadley Wickham.
  9. Data Visualization: A practical introduction, a book by Kieran Healy.
  10. Several free courses at Data Science Academy (with certificate) at Pelando.
  11. Studying abroad: learning the ropes with Anna Giselle Ribeiro at Deviante.
  12. Data from electoral polls in Brazil at Poder360.
  13. New IBGE portal compares economic and social statistics of 193 nations at Agência de Notícias IBGE.
  14. Foreign Trade Statistics (data visualization and raw data) at Ministério da Economia, Indústria, Comércio Exterior e Serviços.

Best links of the week #5

Reading time: < 1 minute

Best links of the week from 4th February to 10th February.

Links

  1. How to control another person’s arm with the power of your mind? at UOL.
  2. vidente is an R package I am currently writing to parse and analyze data from the Surveillance, Epidemiology and End Results (SEER) Program, which covers over 1/3 of the US population on cancer incidence and survival.
  3. Ciência de Dados com R is a book on Data Science using R at Instituto Brasileiro de Pesquisa e Análise de Dados.
  4. Data Science & Machine Learning Course at Ivanovitch Silva’s GitHub repository.
  5. The revenue of candidates for federal deputy in 2018 at Nexo Jornal.
  6. AI 100: The Artificial Intelligence Startups Redefining Industries at CB Insights.
  7. The open-source and crowd sourced conference website.
  8. Ranking of IT conferences.