Interpretability
How do neural networks 'think'?
Last updated
Was this helpful?
How do neural networks 'think'?
Last updated
Was this helpful?
Datasets have incorrect labels! How do you get 99% accuracy when 3% of test data is incorrect?
Artificial intelligence
Thinking like humans
Acting like humans
Thinking rationally
Acting rationally
Small changes in pixels.. or noise.. or specific designed patters can be detected with a high accuracy of being something a human wouldn't think they were.
Network is trained with backpropagation by computing a gradient.
Tells us how to change the weights so as to minimise loss
Can instead compute a gradient with respect to the input instead of weights.
Gradient descent to then modify the input to get the network to activate
Start with a random image, make adjustments until the class is outputted
One adversarial example showing the network doesn't do the same thing as humans is applying texture from one image to the shape of another image.
A human would be confused, wdo we classify a cat made out of clocks as?? But the network instead will activate certain neurons (that may be different per example)
Gradient - true sensitivity
Integrated gradient: interpolate over input (e.g. from black image to fully exposed)
Compute gradients of output with respect to weights for each image and sum it up
Adversarial image generation to train from noise to image that maximises neuron
Use regularises to 'guide' the maximisation process
Use priors (images) to change starting point of maximisation process
Reverse mapping through 'deconvolution'
Unpooling by tracking location of the winning unit in the pooling layer
Rectifications - only track positive activity
Deconvolve - get an inverse of convolution through a weight matrix transpose (assume weight vectors on neurons are orthogonal)
Reverse mapping through 'backpropagation'
Track 'responsibility' or 'relavance' of each input for activation
Relevance rules - e.g. only track positive activity
Deep Taylor Decomposition (DTD) - generalisation of LRP revelance rule to Taylor expansion around 'root point' determines relavance rule