Skip to main content

A network that learns Strassen multiplication

Cornell Affiliated Author(s)

Author

V. Elser

Abstract

We study neural networks whose only non-linear components are multipliers, to test a new training rule in a context where the precise representation of data is paramount. These networks are challenged to discover the rules of matrix multiplication, given many examples. By limiting the number of multipliers, the network is forced to discover the Strassen multiplication rules. This is the mathematical equivalent of finding low rank decompositions of the n x n matrix multiplication tensor, Mn. We train these networks with the conservative learning rule, which makes minimal changes to the weights so as to give the correct output for each input at the time the input-output pair is received. Conservative learning needs a few thousand examples to find the rank 7 decomposition of M2, and 105 for the rank 23 decomposition of M3 (the lowest known). High precision is critical, especially for M3, to discriminate between true decompositions and "border approximations". ©2016 Veit Elser.

Date Published

Journal

Journal of Machine Learning Research

Volume

17

Number of Pages

1-13,

URL

https://www.scopus.com/inward/record.uri?eid=2-s2.0-84989180803&partnerID=40&md5=0c574a9e987d48db017a4e39de0674a0

Group (Lab)

Veit Elser Group

Download citation