Finding descriptors for material properties from billions of candidates via compressed sensing: accurate prediction of crystal structures and band gaps from only chemical composition

ORAL

Abstract

Identifying the key physical parameters (termed descriptor) determining the target material properties is a critical step toward material discovery and rational design. Thus far, systematic methods for the descriptor identification are not well established. In particular, it has been suggested that good descriptors should both yield an accurate prediction and be physically interpretable [L. M. Ghiringhelli,\emph{et al.}, PRL \textbf{114}, 105503 (2015)]. In this talk, we present a systematic scheme for descriptor identification based on sure independent screening [J. Fan and J. Lv, J. R. Statist. Soc. B {\bf 70}, 849 (2008)] and compressed sensing [E. Candès and M. B. Wakin, IEEE Signal Proc. Mag. {\bf 25}, 21 (2008)]. The scheme starts with automatic building of the ``feature spaces'', i.e. all offered candidate descriptors, and the feature space may contain billions of options. The employed combination of sure independent screening and compressed sensing provides an efficient scheme for identifying the best low-dimensional descriptor. The approach is demonstrated for the important problems of crystal-structure and band-gap prediction.

Authors

  • Runhai Ouyang

    • Fritz Haber Institute of the Max Planck Society
  • Emre Ahmetcik

    • Fritz Haber Institute of the Max Planck Society
  • Luca M. Ghiringhelli

    • Fritz Haber Institute of the Max Planck Society
  • Matthias Scheffler

    • Fritz Haber Institute of the Max Planck Society