14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon

Digital Discovery, 2023, 2,1233-1250
DOI: 10.1039/D3DD00113J, Perspective
Open Access Open Access
Creative Commons Licence  This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.
Kevin Maik Jablonka, Qianxiang Ai, Alexander Al-Feghali, Shruti Badhwar, Joshua D. Bocarsly, Andres M. Bran, Stefan Bringuier, L. Catherine Brinson, Kamal Choudhary, Defne Circi, Sam Cox, Wibe A. de Jong, Matthew L. Evans, Nicolas Gastellu, Jerome Genzling, María Victoria Gil, Ankur K. Gupta, Zhi Hong, Alishba Imran, Sabine Kruschwitz, Anne Labarre, Jakub Lála, Tao Liu, Steven Ma, Sauradeep Majumdar, Garrett W. Merz, Nicolas Moitessier, Elias Moubarak, Beatriz Mouriño, Brenden Pelkie, Michael Pieler, Mayk Caldas Ramos, Bojana Ranković, Samuel G. Rodriques, Jacob N. Sanders, Philippe Schwaller, Marcus Schwarting, Jiale Shi, Berend Smit, Ben E. Smith, Joren Van Herck, Christoph Völker, Logan Ward, Sean Warren, Benjamin Weiser, Sylvester Zhang, Xiaoqi Zhang, Ghezal Ahmad Zia, Aristana Scourtas, K. J. Schmidt, Ian Foster, Andrew D. White, Ben Blaiszik
We report the findings of a hackathon focused on exploring the diverse applications of large language models in molecular and materials science.
The content of this RSS Feed (c) The Royal Society of Chemistry

Evaluating the roughness of structure–property relationships using pretrained molecular representations

Digital Discovery, 2023, 2,1452-1460
DOI: 10.1039/D3DD00088E, Paper
Open Access Open Access
Creative Commons Licence  This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.
David E. Graff, Edward O. Pyzer-Knapp, Kirk E. Jordan, Eugene I. Shakhnovich, Connor W. Coley
Pretrained molecular representations are often thought to provide smooth, navigable latent spaces; analysis by ROGI-XD suggests they are no smoother than fixed descriptor/fingerprint representations.
The content of this RSS Feed (c) The Royal Society of Chemistry

Digital design and 3D printing of reactionware for on demand synthesis of high value probes

Digital Discovery, 2023, 2,1326-1333
DOI: 10.1039/D3DD00108C, Paper
Open Access Open Access
Przemyslaw Frei, Philip J. Kitson, Alexander X. Jones, Leroy Cronin
A new modular approach to 3D printed reactionware design is presented, and its effectiveness demonstrated in the synthesis of a number of structurally related, diazirine based, photoaffinity probes.
The content of this RSS Feed (c) The Royal Society of Chemistry

Site-Net: using global self-attention and real-space supercells to capture long-range interactions in crystal structures

Digital Discovery, 2023, 2,1297-1310
DOI: 10.1039/D3DD00005B, Paper
Open Access Open Access
Michael Moran, Michael W. Gaultois, Vladimir V. Gusev, Matthew J. Rosseinsky
Site-Net is a transformer architecture that models the periodic crystal structures of inorganic materials as a labelled point set of atoms and relies entirely on global self-attention and geometric information to guide learning.
The content of this RSS Feed (c) The Royal Society of Chemistry

Design of antimicrobial peptides containing non-proteinogenic amino acids using multi-objective Bayesian optimisation

Digital Discovery, 2023, 2,1347-1353
DOI: 10.1039/D3DD00090G, Paper
Open Access Open Access
Yuki Murakami, Shoichi Ishida, Yosuke Demizu, Kei Terayama
MODAN is a multi-objective Bayesian framework for automated design of antimicrobial peptides containing various non-proteinogenic amino acids and side-chain stapling.
The content of this RSS Feed (c) The Royal Society of Chemistry

Go with the flow: deep learning methods for autonomous viscosity estimations

Digital Discovery, 2023, 2,1540-1547
DOI: 10.1039/D3DD00109A, Paper
Open Access Open Access
Michael Walker, Gabriella Pizzuto, Hatem Fakhruldeen, Andrew I. Cooper
An autonomous viscosity estimation using a dexterous dual-armed collaborative robot and a three dimensional convolutional neural network model that strongly outperforms human abilities.
The content of this RSS Feed (c) The Royal Society of Chemistry

Automated routing of droplets for DNA storage on a digital microfluidics platform

Digital Discovery, 2023, 2,1436-1451
DOI: 10.1039/D3DD00083D, Paper
Open Access Open Access
Ajay Manicka, Andrew Stephan, Sriram Chari, Gemma Mendonsa, Peyton Okubo, John Stolzberg-Schray, Anil Reddy, Marc Riedel
Automated routing of droplets for DNA storage on an industrial-scale digital microfluidics platform.
The content of this RSS Feed (c) The Royal Society of Chemistry

Active learning for efficient navigation of multi-component gas adsorption landscapes in a MOF

Digital Discovery, 2023, 2,1506-1521
DOI: 10.1039/D3DD00106G, Paper
Open Access Open Access
Creative Commons Licence  This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.
Krishnendu Mukherjee, Etinosa Osaro, Yamil J. Colón
We present the development of an active learning framework to model multicomponent gas adsorption in metal–organic frameworks.
The content of this RSS Feed (c) The Royal Society of Chemistry

A rigorous uncertainty-aware quantification framework is essential for reproducible and replicable machine learning workflows

Digital Discovery, 2023, 2,1251-1258
DOI: 10.1039/D3DD00094J, Perspective
Open Access Open Access
Line Pouchard, Kristofer G. Reyes, Francis J. Alexander, Byung-Jun Yoon
The capability to replicate the predictions by machine learning (ML) or artificial intelligence (AI) models and the results in scientific workflows that incorporate such ML/AI predictions is driven by a variety of factors.
The content of this RSS Feed (c) The Royal Society of Chemistry

Towards the automated extraction of structural information from X-ray absorption spectra

Digital Discovery, 2023, 2,1461-1470
DOI: 10.1039/D3DD00101F, Paper
Open Access Open Access
Creative Commons Licence  This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.
Tudur David, Nik Khadijah Nik Aznan, Kathryn Garside, Thomas Penfold
A machine learning model capable of extracting structural information from XANES spectra is introduced. This approach, analogous to a Fourier transform of EXAFS spectra, can predict first coordination shell bond-lengths with a median error of 0.1 Å.
The content of this RSS Feed (c) The Royal Society of Chemistry