Tales of Science & Data
Notable brain teasers, paradoxes and how to be careful with data
Data can be fun, and statistics is far from trivial. In fact, its results can be rather counterintuitive at times.
Code
Code for this section is in the pages and provided as notebooks here.
Content
The Monty Hall problem
Some statistical paradoxes
The Anscombe's quartet
Using aggregated metrics well
Last modified 2yr ago