Text Mining for Understanding Material Synthesis


  • Rama K. Vasudevan, Center for Nanophase Materials Sciences (CNMS), Oak Ridge, TN
September 20, 2017 - 11:00am to 11:30am


The discovery of new materials is required to tackle many of the challenges facing the globe, from energy production to farming to transportation. However, the discovery rate of new advanced materials remains stubbornly low. Much information that could increase this rate already exists in the literature, in the form of thousands of papers, but it is scattered, uncollated, and unindexed. This talk will present a method that combines text mining, in the form of regular expressions, with crowdsourcing to yield a database linking growth conditions to functional properties for select oxides grown via pulsed laser deposition. Open-source tools carry out the text mining and automatically annotate the relevant information, while crowdsourced, user-led efforts sift through the annotated text, as well as figures, to compile database entries efficiently. We have produced a database with hundreds of entries that shows growth windows, trends, and outliers, that can serve as a template for analyzing the distribution of growth conditions, and that provides starting points for related compounds. Moreover, the database is a community-wide resource that is both dynamic and searchable and that can be mined in the same way as first-principles repositories. Such tools will form an integral part of the materials design schema in the coming decade.
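
As a rough illustration of the regular-expression annotation step described above, the following Python sketch tags growth-condition values in a sentence. It is not the speaker's actual tooling: the pattern set, the labels (`temperature`, `oxygen_pressure`, `laser_fluence`), the unit lists, and the `annotate` function are all assumptions chosen for this example, and real papers phrase these quantities in far more varied ways.

```python
import re

# Hypothetical patterns for a few pulsed-laser-deposition growth parameters.
# Each pattern captures a number followed by one of a small set of units.
NUMBER = r"(\d+(?:\.\d+)?)"
PATTERNS = {
    "temperature": re.compile(NUMBER + r"\s*(°C|K)\b"),
    "oxygen_pressure": re.compile(NUMBER + r"\s*(mTorr|Torr|mbar|Pa)\b"),
    "laser_fluence": re.compile(NUMBER + r"\s*(J/cm2)"),
}


def annotate(text):
    """Return (label, value, unit) tuples in order of appearance in text."""
    hits = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            value = float(match.group(1))
            unit = match.group(2)
            hits.append((match.start(), label, value, unit))
    hits.sort()  # order by position in the text
    return [(label, value, unit) for _, label, value, unit in hits]


if __name__ == "__main__":
    sentence = (
        "Films were grown at 700 °C in 100 mTorr of oxygen "
        "with a laser fluence of 1.5 J/cm2."
    )
    for label, value, unit in annotate(sentence):
        print(label, value, unit)
```

In a workflow like the one outlined in the abstract, automatically tagged values of this kind would still be checked by human contributors before being entered into the database.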

Center for Nanophase Materials Sciences Seminar Series

Center for Nanophase Materials Sciences


  • Central Laboratory and Office Building (SNS)
  • Building: 8600
  • Room: C-156

