Interpretable Machine Learning

  1. Manipulate each layer/neuron, and observe the change of network parameters/activations.

  2. Saliency map

  3. Adversarial attack

  4. Correlation

  5. Information gain/loss