Developing Databases for Polymer Informatics

Roselyne Tchoua; Zhi Hong; Debra Audus; Shrayesh Patel; Logan Ward; Kyle Chard; Juan De Pablo; Ian Foster

ORAL

Abstract

One significant barrier to the adoption of polymer informatics is the lack of large FAIR (Findable, Accessible, Interoperable, Reusable) databases. To overcome this barrier, we developed pipelines to harness the vast quantities of valuable experimental polymer data trapped in the literature. In our first effort, we developed the largest Flory-Huggins chi parameter database using crowdsourcing and found that the burden of reviewing papers could be lessened by training a classifier to identify promising articles. To further reduce human input, we turned to natural language processing software coupled with specially designed software modules to extract glass transition temperatures; ultimately, we extracted over 250 glass transition temperatures. All of the resulting data are freely available at the Polymer Property Predictor and Database website (http://pppdb.uchicago.edu). During this process, we found that identifying polymer names within the literature was a key problem, as polymers are referred to by common names, sample names, labels, etc., and we subsequently explored named entity recognition to tackle this problem. To further extend our databases, we are working on allowing them to accept user-submitted data.
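
As a rough illustration of the extraction step described in the abstract, the sketch below pulls glass transition temperature mentions out of a sentence with a regular expression. It is not the authors' pipeline, which relied on natural language processing software and specially designed modules; the pattern, the function name `extract_tg`, and the sample sentence are all hypothetical and chosen only to show the idea.

```python
import re

# Hypothetical pattern (an assumption, not the authors' method):
# a property mention, a short gap, a numeric value, and a unit.
TG_PATTERN = re.compile(
    r"(?:glass transition temperature|\bT_?g\b)"  # property mention
    r"[^.;\d-]{0,40}?"                            # short gap: no digits, no sentence break
    r"(-?\d+(?:\.\d+)?)\s*"                       # numeric value, optionally negative
    r"(°C|K)\b",                                  # unit as written in the text
    re.IGNORECASE,
)


def extract_tg(text):
    """Return (value, unit) pairs for each glass transition mention found in text."""
    return [(float(value), unit) for value, unit in TG_PATTERN.findall(text)]


if __name__ == "__main__":
    # Made-up sentence for demonstration only.
    sample = (
        "The glass transition temperature of polystyrene was 100 °C, "
        "while the Tg of sample B was 378.5 K."
    )
    for value, unit in extract_tg(sample):
        print(value, unit)
```

A pattern like this recovers only values stated close to the property name. It does not say which polymer a value belongs to, which is the naming problem the abstract says motivated named entity recognition.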

March 3, 2020, 12:51 PM – March 3, 2020, 1:03 PM

Presenters

Debra Audus
- National Institute of Standards and Technology, Gaithersburg, MD

Authors

Roselyne Tchoua
- DePaul University
Zhi Hong
- University of Chicago
Debra Audus
- National Institute of Standards and Technology, Gaithersburg, MD
Shrayesh Patel
- University of Chicago
Logan Ward
- University of Chicago
Kyle Chard
- University of Chicago
Juan De Pablo
- Pritzker School of Molecular Engineering, University of Chicago
- Institute for Molecular Engineering, University of Chicago
- Argonne National Laboratory
Ian Foster
- University of Chicago