English

Electrocatalyst discovery through text mining and multi-objective optimization

Materials Science 2025-03-03 v1

Abstract

The discovery and optimization of high-performance materials is the basis for advancing energy conversion technologies. To understand composition-property relationships, all available data sources should be leveraged: experimental results, predictions from simulations, and latent knowledge from scientific texts. Among these three, text-based data sources are still not used to their full potential. We present an approach combining text mining, Word2Vec representations of materials and properties, and Pareto front analysis for the prediction of high-performance candidate materials for electrocatalysis in regions where other data sources are scarce or non-existent. Candidate compositions are evaluated on the basis of their similarity to the terms `conductivity' and `dielectric', which enables reaction-specific candidate composition predictions for oxygen reduction (ORR), hydrogen evolution (HER), and oxygen evolution (OER) reactions. This, combined with Pareto optimization, allows us to significantly reduce the pool of candidate compositions to high-performing compositions. Our predictions, which are purely based on text data, match the measured electrochemical activity very well.

Keywords

Cite

@article{arxiv.2502.20860,
  title  = {Electrocatalyst discovery through text mining and multi-objective optimization},
  author = {Lei Zhang and Markus Stricker},
  journal= {arXiv preprint arXiv:2502.20860},
  year   = {2025}
}

Comments

17 pages, 13 figures

R2 v1 2026-06-28T22:01:31.151Z