Week 03 — Classification — Logistic Regression, k-NN, Naive Bayes

Fisher's 1936 iris dataset: the first formal classification algorithm. The descendants now classify spam, fraud, tumors, signals.

ML  ·  schedule  ·  Week 03 of 12 ·  ← 02 ·  04 →

Week 03 — Classification — Logistic Regression, k-NN, Naive Bayes

Fisher's 1936 iris dataset: the first formal classification algorithm. The descendants now classify spam, fraud, tumors, signals.

Lecture

Logistic regression as MLE under Bernoulli noise · linear and quadratic discriminant analysis · $k$-NN and the curse of dimensionality · Naive Bayes and the Sahami 1998 spam filter.

Read before the lecture

  • Hastie, Tibshirani, Friedman, chapter 4

Code lab

Lab 1 — Classification on a clinical dataset

Build a binary classifier predicting 30-day hospital readmission. Compare logistic regression, $k$-NN, Naive Bayes, and a calibration-aware variant.

Notebook: lab01-clinical-classification.ipynb  ·  Dataset: MIMIC-IV demo (1,000 patients, redistributed).


Reference text for this week: chapter 03 of the bilingual notes — EN PDF · FR PDF.