跳转至

Feature Engineering & Regularization

Feature Engineering

Regularization

Overfitting

  • Definition: The problem of overfitting is when the model captures the noise in the training data instead of the underlying structure.

Regularization

  • Given objective function J(θ)J(\theta).
  • Goal is to find: θ^=arg minθJ(θ)+λr(θ)\hat{\theta} = \argmin_{\theta} J(\theta) + \lambda r(\theta).
  • Key idea: Define regularizer r(θ)r(\theta) s.t. we tradeoff between fitting the data and keeping the model simple.
  • Choose form of r(θ)r(\theta) based on the model complexity.
    • Example: qq-norm
θq=(m=1Mθmq)1q \|\boldsymbol{\theta}\|_q = \left( \sum_{m=1}^M |\theta_m|^q\right)^{\frac{1}{q}}

评论