Supposed Maximum Mutual Information for Improving Generalization and Interpretation of Multi-Layered Neural Networks

Ryotaro Kamimura

Open Access

Published Online: Dec 31, 2018

Journal of Artificial Intelligence and Soft Computing Research

Volume 9 (2019): Issue 2 (April 2019)

Page range: 123 - 147

Received: Feb 06, 2018

Accepted: Aug 13, 2018

DOI: https://doi.org/10.2478/jaiscr-2018-0029

Keywords
mutual information, disentanglement, generalization, interpretation

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

This paper proposes a new information-theoretic method for maximizing mutual information between inputs and outputs. The importance of mutual information in neural networks is well known, but mutual information maximization has been difficult to implement in practice. In addition, mutual information has not been used extensively in neural networks, so its applicability has been very limited. To overcome this shortcoming, we present mutual information maximization in a greatly simplified form by supposing that mutual information is already maximized before learning, or at least at the beginning of learning. The method was applied to three data sets (the crab, wholesale, and human resources data sets) and examined in terms of generalization performance and connection weights. The results showed that, by disentangling connection weights, maximizing mutual information made it possible to interpret the relations between inputs and outputs explicitly.
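As a point of reference for the quantity named in the abstract, the sketch below computes mutual information from a small joint probability table using the standard discrete definition, I(X;Y) = Σ p(x,y) log₂ [p(x,y) / (p(x) p(y))]. It is only an illustration of the general concept and is not the method proposed in the paper; the function name `mutual_information` and the two toy tables are hypothetical and do not come from the article.

```python
import math


def mutual_information(joint):
    """Mutual information I(X;Y) in bits for a joint probability table.

    joint[i][j] is p(x_i, y_j); entries are assumed non-negative and to sum to 1.
    """
    px = [sum(row) for row in joint]          # marginal p(x)
    py = [sum(col) for col in zip(*joint)]    # marginal p(y)
    mi = 0.0
    for i, row in enumerate(joint):
        for j, pxy in enumerate(row):
            if pxy > 0.0:                     # the term 0 * log 0 is taken as 0
                mi += pxy * math.log2(pxy / (px[i] * py[j]))
    return mi


# Toy case 1 (hypothetical): each input determines exactly one output.
deterministic = [[0.5, 0.0],
                 [0.0, 0.5]]

# Toy case 2 (hypothetical): inputs and outputs are statistically independent.
independent = [[0.25, 0.25],
               [0.25, 0.25]]

print(mutual_information(deterministic))
print(mutual_information(independent))
```

In the first table the output is fully determined by the input, so mutual information reaches its maximum for two equally likely inputs; in the second, knowing the input says nothing about the output, so mutual information is zero.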

eISSN: 2083-2567
Language: English

Publication timeframe: 4 times per year
Journal Subjects: Computer Sciences, Artificial Intelligence, Databases and Data Mining

© 2018 Ryotaro Kamimura, published by Sciendo
