Zero-Shot Learning by Mutual Information Estimation and Maximization.

Chenwei Tang, XueYang, Jiancheng Lv, ZhenanHe

January 2020

PDF

Image credit: Unsplash

Abstract

The key of zero-shot learning is to use the visual-semantic embedding to transfer the knowledge from seen classes to unseen classes. In this paper, we propose to build the visual-semantic embedding by maximizing the mutual information between visual features and corresponding attributes. Then, the mutual information between visual and semantic features can be utilized to guide the knowledge transfer from seen domain to unseen domain. Since we are primarily interested in maximizing mutual information, we introduce the noise-contrastive estimation to calculate lower-bound value of mutual information. Through the noise-contrastive estimation, we reformulate zero-shot learning as a binary classification problem, i.e., classifying the matching visual-semantic pairs (positive samples) and mismatching visual-semantic pairs (negative/noise samples). Experiments conducted on five datasets demonstrate that the proposed mutual information estimators outperforms current state-of-the-art methods both in conventional and generalized zero-shot learning settings.

Type

Journal article

Zero-Shot Learning by Mutual Information Estimation and Maximization.

Abstract

Chenwei Tang

Postdoc

Jiancheng Lv

Dean and professor of Computer Science of Sichuan University

Related