TY - JOUR
T1 - Integrating multiple materials science projects in a single neural network
AU - Hatakeyama-Sato, Kan
AU - Oyaizu, Kenichi
N1 - Publisher Copyright:
© 2020, The Author(s).
PY - 2020/12
Y1 - 2020/12
N2 - In data-intensive science, machine learning plays a critical role in processing big data. However, the potential of machine learning has been limited in the field of materials science because of the difficulty in treating complex real-world information as a digital language. Here, we propose to use graph-shaped databases with a common format to describe almost any materials science experimental data digitally, including chemical structures, processes, properties, and natural languages. The graphs can express real world’s data with little information loss. In our approach, a single neural network treats the versatile materials science data collected from over ten projects, whereas traditional approaches require individual models to be prepared to process each individual database and property. The multitask learning of miscellaneous factors increases the prediction accuracy of parameters synergistically by acquiring broad knowledge in the field. The integration is beneficial for developing general prediction models and for solving inverse problems in materials science.
AB - In data-intensive science, machine learning plays a critical role in processing big data. However, the potential of machine learning has been limited in the field of materials science because of the difficulty in treating complex real-world information as a digital language. Here, we propose to use graph-shaped databases with a common format to describe almost any materials science experimental data digitally, including chemical structures, processes, properties, and natural languages. The graphs can express real world’s data with little information loss. In our approach, a single neural network treats the versatile materials science data collected from over ten projects, whereas traditional approaches require individual models to be prepared to process each individual database and property. The multitask learning of miscellaneous factors increases the prediction accuracy of parameters synergistically by acquiring broad knowledge in the field. The integration is beneficial for developing general prediction models and for solving inverse problems in materials science.
UR - http://www.scopus.com/inward/record.url?scp=85095557955&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095557955&partnerID=8YFLogxK
U2 - 10.1038/s43246-020-00052-8
DO - 10.1038/s43246-020-00052-8
M3 - Article
AN - SCOPUS:85095557955
SN - 2662-4443
VL - 1
JO - Communications Materials
JF - Communications Materials
IS - 1
M1 - 49
ER -