Links & Resources

Materials Science/Research

Beware of plausible predictions of fantasy materials

An interesting Nature article by Alex Zunger discussing the importance of stability and synthesis of digitally discovered materials.

Science Communication

The misuse of colour in science communication

Excellent article highlighting ways for the scientific community to identify and prevent the misuse of colour in science.

My Manuscript Was “Rejected without Review” from Chemistry of Materials: A Lesson in Burying the Lede

Useful article discussing how to become effective at scientific communication

Computer Science/Programming

Big-O Notation

Excellent introduction on Big-O notation - a way to characterize an algorithm’s efficiency in terms of execution time

What Does It Take To Be An Expert At Python?

Very informative PyData talk from James Powell.

Python Programming Tutorials

Nice Collection of YouTube videos on Python programming for beginners.

Data Science/Visualization

Seeing Theory

A visual introduction to probability and statistics

Clarity and Aesthetics in Data Visualization: Guidelines

Excellent article discussing the clarity and aesthetics in data visualization

Effective Pandas

Very useful talk on Pandas by Matt Harrison

Dimension reduction

Visualizing Data Using t-SNE

Wonderful talk by Laurens van der Maaten on how t-SNE works

BioTuring Webinar | Using t SNE for Data Analysis: A Quick Introduction | Laurens van der Maaten

Another informative talk by Laurens van der Maaten on using t-SNE for data analysis

Understanding UMAP

A nice article focused on the theory of UMAP, how to use it effectively, and how its performance compares with t-SNE.

Machine Learning

The First Rule of Machine Learning: Start without Machine Learning

A very useful article discussing the use of ML while starting with a new project

MLU-EXPLAIN: Visual explanations of core machine learning concepts

Machine Learning University (MLU) is an education initiative from Amazon designed to teach machine learning theory and practical application. As part of that goal, MLU-Explain exists to teach important machine learning concepts through visual essays in a fun, informative, and accessible manner.

Random Forests Algorithm explained with a real-life example and some Python code

A very informative article on Random Forest

GAMMA FACET

FACET is an open source library for human-explainable AI. It combines sophisticated model inspection and model-based simulation to enable better explanations of your supervised machine learning models

CausalNex

CausalNex is a Python library that uses Bayesian Networks to combine machine learning and domain expertise for causal reasoning. You can use CausalNex to uncover structural relationships in your data, learn complex distributions, and observe the effect of potential interventions

Why do tree-based models still outperform deep learning on tabular data?

Yes, it is still the case …

How to Use t-SNE Effectively

An interactive playground for t-SNE dimensionality reduction to help readers develop an intuition for technique and where it is best applied.

Comparing quantiles at scale in online A/B-testing

Calculate bootstrap confidence intervals for difference-in-quantiles in A/B tests with hundreds of millions of observations

Career and Job Search

Hiring Data Scientists With Intention

Nice article on reducing bias in hiring process for a Data Scientist position

Five Steps to Writing a Great Resume

Great tips on resume and cover letter