Transforming Neural Network Visual Representations to Predict Human Judgments of Similarity

10/13/2020
by   Maria Attarian, et al.

Deep-learning vision models have shown intriguing similarities and differences with respect to human vision. We investigate how to bring machine visual representations into better alignment with human representations. Human representations are often inferred from behavioral evidence such as the selection of an image most similar to a query image. We find that with appropriate linear transformations of deep embeddings, we can improve prediction of human binary choice on a data set of bird images from a 72% baseline to 89% using 4096-dimensional representations; however, reducing the rank of these representations results in a loss of explanatory power. We hypothesized that the dilation transformation of representations explored in past research is too restrictive, and indeed we found that model explanatory power can be significantly improved with a more expressive linear transform. Most surprising and exciting, we found that, consistent with classic psychological literature, human similarity judgments are asymmetric: the similarity of X to Y is not necessarily equal to the similarity of Y to X, and allowing models to express this asymmetry improves explanatory power.
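
To make the distinction between a dilation and a more expressive linear transform concrete, here is a minimal sketch. It assumes similarity is scored with a bilinear form s(x, y) = xᵀWy over embeddings, where a dilation corresponds to a diagonal W and an asymmetric model to an unconstrained W. This formulation, the toy three-dimensional vectors, and the names `similarity` and `predict_choice` are illustrative assumptions; the abstract does not give the authors' exact parameterization or training procedure.

```python
import numpy as np


def similarity(x, y, W):
    """Bilinear similarity x^T W y (an assumed form, not taken from the paper)."""
    return float(x @ W @ y)


def predict_choice(query, cand_a, cand_b, W):
    """Return 0 if cand_a scores at least as similar to the query as cand_b, else 1."""
    return 0 if similarity(query, cand_a, W) >= similarity(query, cand_b, W) else 1


# Toy 3-dimensional embeddings; the abstract mentions 4096 dimensions.
x = np.array([1.0, 2.0, 0.0])
y = np.array([0.0, 1.0, 3.0])

# Dilation: a diagonal W rescales each embedding dimension independently.
W_dilation = np.diag([2.0, 1.0, 0.5])

# Unconstrained W: one off-diagonal weight makes dimension 0 of the first
# argument interact with dimension 1 of the second, but not the reverse.
W_asymmetric = W_dilation.copy()
W_asymmetric[0, 1] = 1.0

# A diagonal W always gives s(x, y) == s(y, x).
s_xy_dil = similarity(x, y, W_dilation)
s_yx_dil = similarity(y, x, W_dilation)

# A non-symmetric W can give s(x, y) != s(y, x).
s_xy_asym = similarity(x, y, W_asymmetric)
s_yx_asym = similarity(y, x, W_asymmetric)

# Binary choice: which of two candidates is predicted as closer to the query x?
choice = predict_choice(x, y, np.array([1.0, 0.0, 0.0]), W_asymmetric)
```

In this sketch the diagonal model cannot distinguish the similarity of X to Y from that of Y to X, whereas the unconstrained matrix can, which is the kind of added expressiveness the abstract describes. In practice W would be fitted to human choice data rather than set by hand.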
