Chapter 5: Evaluating Hypotheses

General Approach For Deriving Confidence Intervals

  1. Identify the underlying population parameter p to be estimated, for example, errorD(h).
  2. Define the estimator Y (e.g. errorS(h)). It is desirable to choose a minimum variance, unbiased estimator.
  3. Determine the probability distribution DY that governs the estimator Y, including its mean and variance.
  4. Determine the N% confidence interval by finding thresholds L and U such that N% of the mass in the probability distribution DY falls between L and U.

Central Limit Theorem

Consider a set of independent, identically distributed random variables Y1 ... Yn governed by an arbitrary probability distribution with mean μ and finite variance σ2. Define the sample mean Yn = (1/n) * Σ Yi. As n approaches infinity, the distribution governing (Yn - μ) / (σ / sqrt(n)) approaches a normal distribution with zero mean and standard deviation 1.

Difference in Error of Two Hypotheses

Hypothesis Testing

Comparing Learning Algorithms

Practice Exercises

Valid XHTML 1.0!