Signatures of double descent in deep quantum models
ORAL
Abstract
Deep neural networks show amazing generalization properties despite being highly over-parameterized. They show a transition past an interpolation point where despite fitting every training data point perfectly, the generalization error reduces again as the number of parameters is increased. This violates the long-standing bias-variance trade-off phenomenon. We present thorough empirical evidence of this “double descent” phenomenon in deep quantum neural networks. We also present various tools to study this transition such as changes in bias and variance, Quantum Neural Tangent Kernel, and Fisher information. We also try to understand where quantum deep learning is feasible and propose efficient anzatses. This is the first step towards using deep quantum models to avoid barren plateaus and achieve fast convergences.
*We acknowledge support from the Natural Sciences and Engineering Research Council (NSERC), the Shared Hierarchical Academic Research Computing Network (SHARCNET), Compute Canada, and the Canadian Institute for Advanced Research (CIFAR) AI chair program. Resources used in preparing this research were provided, in part, by the Province of Ontario, the Government of Canada through CIFAR, and companies sponsoring the Vector Institute www.vectorinstitute.ai/#partners.
–
Presenters
-
Aroosa Ijaz
- Univeristy of Waterloo