Introduction

Polymeric materials come in a variety of flavors and architectures, such as homopolymers, copolymers, polymer blends, and polymers with additives such as dopants, plasticizers, and organic/inorganic fillers.1,2,3,4 The extraordinary chemical and structural diversity offered by such materials leads to wide-ranging and attractive combinations of physical properties, impacting application spaces that span structural, electrical, packaging, chemical-separation, health-care, energy, and sustainable technologies.5,6,7,8,9,10,11,12,13,14,15,16,17

In an effort to simultaneously optimize multiple (correlated or uncorrelated) properties, the community has explored and developed polymer varieties beyond neat homopolymers, namely, copolymers, polymer blends, and polymers with additives. Finding optimal candidates possessing a predefined set of property attributes has largely been guided by experience, intuition, and trial-and-error approaches. An exhaustive search of the relevant chemical spaces is nontrivial given the vast expanse of the spaces. Over the last decade or so, polymer informatics approaches have attempted to aid this search process by offering data-driven machine learning (ML) models to rapidly predict the properties of new polymer formulations and to recommend candidate materials that may meet multi-property target requirements.18,19,20,21,22,23,24,25,26,27 Nevertheless, these polymer informatics efforts have thus far largely focused on neat homopolymers,28 although notable exceptions exist within recent attempts to address copolymer chemistries,29,30,31,32,33,34 polymers with dopants, and polymer composites.35

In this contribution, we lay the groundwork to handle neat homopolymers, copolymers, and polymer blends within one unified multitask neural network polymer informatics framework. For definiteness, we focus on thermal properties, namely, the glass-transition temperature (\(T_{\textrm{g}}\)), the melting temperature (\(T_{\textrm{m}}\)), and the degradation temperature (\(T_{\textrm{d}}\)). Our method combines a novel fingerprinting scheme for polymer blends with a unified fingerprinting approach for all three polymer types, and we fine-tuned an ML architecture to accurately predict these three thermal properties for all three types of polymers, including miscible and immiscible polymer blends. Furthermore, we developed a classification model to predict the miscibility of polymer blends. Whereas past efforts have mainly focused on homopolymers and copolymers, our framework handles all three polymer types and can predict these properties for any new homopolymer, copolymer, or polymer blend.

As portrayed in Figure 1a, homopolymers are a subset of copolymers, and copolymers are a subset of polymer blends. Homopolymers are defined by one monomer repeat unit and copolymers by multiple monomer units. Polymer blends are a physical mixture of two or more homopolymer(s) and/or copolymer(s). In this work, we assume that our copolymers are random (i.e., the multiple repeat units are distributed randomly along the polymer backbone); we make this assumption because we do not have data that specify the particular archetype of the copolymer. However, if the copolymer type is known, an ML architecture trained by Tao et al. can be used to predict thermal properties of block and gradient copolymers.36,37 Their framework considers the sequence of repeating units and relative composition of the monomers, which can be crucial for predicting thermal properties, whereas our approach only considers the relative composition of monomers. Polymer blends could be miscible or immiscible; our framework first predicts which category the polymer blend belongs to, and then, subsequently, predicts the appropriate number of critical temperatures (miscible polymer blends are characterized by one \(T_{\textrm{g}}\), one \(T_{\textrm{m}}\), and one \(T_{\textrm{d}}\), whereas immiscible two-phase polymer blends could display two \(T_{\textrm{g}}\), two \(T_{\textrm{m}}\), and one \(T_{\textrm{d}}\)).

Our machine learning procedure starts by first converting the chemical structure of homopolymers, copolymers, and polymer blends into numerical vectors called fingerprints; the chemical structure itself is specified using SMILES strings38 of the repeat units, the composition of the copolymers (if relevant), and the weight fraction of the polymer blend components (if relevant). These aspects are captured in Figure 1b. The fingerprints of the chemical structures, along with \(T_{\textrm{g}}\), \(T_{\textrm{m}}\), and \(T_{\textrm{d}}\) data are fed into our multitask neural network architecture,29 with the ultimate output being a trained model that can predict whether a polymer blend (if the queried case is a polymer blend) is miscible or not, followed by the thermal properties of the queried case. This new polymer informatics capability is able to predict \(T_{\textrm{g}}\), \(T_{\textrm{m}}\), and \(T_{\textrm{d}}\) with an RMSE of 15 K, 17 K, and 23 K, respectively. Needless to say, this framework can be extended to handle any other property class, as long as the requisite data for these properties are available.

Figure 1

(a) Evolution of our polymer informatics framework that incorporates predictors for homopolymer,20 homopolymer and copolymer,29 and the current work for homopolymer, copolymer, and polymer blend. (b) Fingerprint computation pipeline for the example of a poly(ethylene)-co-poly(propylene) and poly(vinyl chloride)-co-poly(vinylidene fluoride) blend. The monomers of poly(ethylene), poly(propylene), poly(vinyl chloride), and poly(vinylidene fluoride) are shown as PE, PP, PVC, and PVDF, respectively. Homopolymer fingerprints (\(\textbf{H}_{i}\)) for the homopolymers (i) are computed using the polymer genome fingerprinting framework.18 Copolymer fingerprints (\(\textbf{C}_{j}\)) for the copolymers (j) are the composition-weighted sum of the homopolymer fingerprints (\(\textbf{H}_{i}\)) with \(x_{ji}\) being the compositions of monomer i in copolymer j. Polymer blend fingerprints (\(\textbf{B}\)) are the composition-weighted harmonic mean of the copolymer fingerprints with M, N, and \(w_{j}\) representing the number of monomers in copolymer j, the number of polymers, and their weight fraction in the polymer blend, respectively. (c) The inference pipeline to predict thermal properties for a new polymer blend.

Table I The number of homopolymers, copolymers, and polymer blend data points for the glass-transition (\(T_{\textrm{g}}\)), melting (\(T_{\textrm{m}}\)), and degradation (\(T_{\textrm{d}}\)) temperatures.

Results and discussion

Data

The data set of homopolymer, copolymer, and polymer blend \(T_{\textrm{g}}\), \(T_{\textrm{m}}\), and \(T_{\textrm{d}}\) values used in this study was collected from multiple sources cited elsewhere,18,29,39,40,41 including from the PoLyInfo database.41 (The copyrights of this database are owned by the National Institute for Materials Science [NIMS].) In order to maintain consistency, we only use \(T_{\textrm{g}}\) and \(T_{\textrm{m}}\) data points measured using differential scanning calorimetry (DSC), and \(T_{\textrm{d}}\) values measured via thermogravimetric analysis (TGA).29 Each copolymer data point has two comonomers, and each polymer blend data point consists of two constituents (two homopolymers, a homopolymer–copolymer mixture, or two copolymers). We infer the polymer blend miscibility for each data point from the presence of one or two \(T_{\textrm{g}}\) values in the data set. This is also known as technological miscibility.42 If the polymer blend data point has only one \(T_{\textrm{g}}\) value, we consider it miscible; otherwise, it is classified as immiscible. A three-component selector vector is used to indicate the property (\(T_{\textrm{g}}\), \(T_{\textrm{m}}\), or \(T_{\textrm{d}}\)) and miscibility (miscible or immiscible) of the data point. The property (for homopolymers, copolymers, and polymer blends) is encoded in the selector vector by the position of the nonzero component as shown in Figure 2a. For polymer blends, the miscibility information is encoded by the appropriate component taking on values of 1 or 2 as shown in Figure 2a.
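The selector-vector encoding described above can be sketched in a few lines. This is a toy illustration only: the component ordering [\(T_{\textrm{g}}\), \(T_{\textrm{m}}\), \(T_{\textrm{d}}\)] and the use of 1/2 for miscible/immiscible blends are assumptions based on the description of Figure 2a, not the authors' exact implementation.

```python
# Toy sketch of the three-component selector vector.
# Assumed component order (not specified in the text): [Tg, Tm, Td].
# For blends, the nonzero entry is assumed to be 1 (miscible) or 2 (immiscible).
PROPERTY_INDEX = {"Tg": 0, "Tm": 1, "Td": 2}

def make_selector(prop, is_blend=False, miscible=True):
    """Return a 3-component selector vector for one data point."""
    selector = [0, 0, 0]
    if is_blend and not miscible:
        selector[PROPERTY_INDEX[prop]] = 2
    else:
        selector[PROPERTY_INDEX[prop]] = 1
    return selector
```

For example, a homopolymer \(T_{\textrm{g}}\) data point would be encoded as `[1, 0, 0]`, while a \(T_{\textrm{m}}\) value of an immiscible blend would be `[0, 2, 0]`.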

Figure 2

Machine learning workflow. (a) The multitask deep neural network-based classification model that predicts whether a polymer blend is miscible or immiscible and determines the miscibility-dependent number and components of the selector vectors needed for each thermal property. (b) The multitask deep neural network-based regression models to predict \(T_{\textrm{g}}\), \(T_{\textrm{m}}\), and \(T_{\textrm{d}}\) with fivefold cross-validation (CV). The inputs to this model are the polymer fingerprints and the selector vector. (c) The meta learner model. The inputs to this meta learner are the five property values from the fivefold CV models.

Figure 3

(a) Validation set performance metrics of the fivefold cross-validation (CV) classification models and meta learner (Meta) model for glass transition (\(T_{\textrm{g}}\)) of polymer blends. The performance metrics are P: precision, R: recall, A: accuracy, and F1: F1 score. The reported values for all of these metrics are averaged over the five CV models and the black error bars represent the standard deviation observed in the performance metrics across five CV models. (b) Test set root-mean-square error (RMSE) values of fivefold CV regression models and meta learner (Meta) model for glass-transition (\(T_{\textrm{g}}\)), melting (\(T_{\textrm{m}}\)), and thermal degradation (\(T_{\textrm{d}}\)) temperature of homopolymers (H), copolymers (C), and polymer blends (B). For the fivefold CV RMSE, the errors are averaged across the five CV models and the black error bars represent the standard deviation observed in the RMSE values of the five CV models.

Fingerprinting

The repeat units of the monomers of homopolymers, copolymers, and polymer blends in the data set are represented using simplified molecular-input line-entry system (SMILES) strings.38 We use stars [*] to denote the end points of the repeat unit. SMILES strings cannot directly be ingested by conventional ML models and require conversion to numerical vectors. This conversion is performed using a previously pioneered handcrafted fingerprinting scheme20 (see the “Methods” section) that has shown great performance for predicting properties of polymers in many previous works.18,19,20,23,28 For copolymers,29 we compute fingerprints as the composition-weighted sum of the homopolymer fingerprint vectors (\({\textbf{C}_{j}} = \sum _{i}^{N} {x_{ji} \textbf{H}_{i}}\)), as shown in Figure 1b. \(\textbf{H}_{i}\), \(x_{ji}\), and N denote the fingerprint vector of a homopolymer (i), the relative compositions of homopolymers in a copolymer (j), and the total number of comonomer components (in this work, \(N = 1,2\)), respectively. For polymer blends, we use the composition-weighted harmonic mean of the fingerprint vectors of the constituents in the polymer blend. To compute the polymer blend fingerprints, we use \(1/(\textbf{B}+1) = \sum _{j}^{M} w_{j}/ (\textbf{C}^{'}_{j}+1) =\sum _{j}^{M} w_{j}/(\sum _{i}^{N} x_{ji} \textbf{H}_{i}+1)\). Here, \(\textbf{B}\), \(w_{j}\), \(\textbf{C}^{'}_{j}\), and M represent the polymer blend fingerprint, the relative composition of the \(j^{\text {th}}\) constituent, the fingerprint vector of the \(j^{\text {th}}\) polymer blend constituent, and the number of constituents in the polymer blend, respectively. This equation resembles the mathematical form of the Fox equation.43 A scalar factor of one was added to the fingerprint vector components (and later subtracted) to avoid singularities caused by fingerprint components with the value of zero. The complete fingerprinting pipeline for homopolymers, copolymers, and polymer blends is shown in Figure 1b.
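The two mixing rules above can be made concrete with a minimal numpy sketch, under the interpretation that the scalar shift of one is applied inside the harmonic mean (i.e., the blend fingerprint plus one is the composition-weighted harmonic mean of the constituent fingerprints plus one). The function names are illustrative, not the authors' code.

```python
import numpy as np

def copolymer_fp(H, x):
    """Composition-weighted sum: C_j = sum_i x_ji * H_i.
    H: (N, d) array of homopolymer fingerprints; x: (N,) compositions summing to 1."""
    return np.asarray(x) @ np.asarray(H)

def blend_fp(C, w):
    """Composition-weighted harmonic mean with a +1 shift to avoid division
    by zero for fingerprint components equal to zero:
        1/(B + 1) = sum_j w_j / (C_j + 1)  =>  B = 1 / sum_j(w_j/(C_j+1)) - 1.
    C: (M, d) array of constituent fingerprints; w: (M,) weight fractions."""
    C = np.asarray(C, dtype=float)
    w = np.asarray(w, dtype=float)
    return 1.0 / np.sum(w[:, None] / (C + 1.0), axis=0) - 1.0
```

A useful sanity check of the harmonic-mean rule: a "blend" of two identical constituents recovers the constituent fingerprint exactly.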

Performance

ML model development involved fivefold cross-validation (CV) and a meta learner, as described in the “Methods” section. The averaged performance scores of the five CV and the meta learner classification models are illustrated in Figure 3a. Comparing the precision (P), recall (R), accuracy (A), and F1 score, we find that P is generally higher than R. The reason for this is the imbalance of miscible and immiscible polymer blends (ratio \(\approx 5/1\)) in our data set (see Table I). The performance metrics of the classification meta learner improve upon those of the fivefold cross-validation classification models for \(T_{\textrm{g}}\), as illustrated in Figure 3a. Figure 3b and Supplementary information Table S1 show the root-mean-square error (RMSE) values of the fivefold CV property-predictive regression models. The low RMSE values for \(T_{\textrm{g}}\), \(T_{\textrm{m}}\), and \(T_{\textrm{d}}\) of homopolymers, copolymers, and polymer blends provide confidence in the novel fingerprinting scheme for polymer blends and the usage of multitask models for this problem. All RMSE values for thermal properties are also of the same order of magnitude as experimental measurement errors. These RMSE values are also slightly better than those reported for homopolymers and copolymers in our past publications.21,28,29 For all types of polymers, the RMSEs of fivefold CV are lowest for \(T_{\textrm{g}}\), followed by \(T_{\textrm{m}}\), and then \(T_{\textrm{d}}\).
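The classification metrics compared above (P, R, A, F1) are standard, and a compact reference implementation clarifies why class imbalance pushes precision above recall: with a \(\approx 5/1\) miscible-to-immiscible ratio, false negatives on the minority class depress recall more readily than false positives depress precision. This sketch is a generic definition of the metrics, not the authors' evaluation code.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Precision, recall, accuracy, and F1 for binary labels
    (e.g., 1 = miscible, 0 = immiscible)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, accuracy, f1
```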

Figure 4

Meta learner parity plots for the test data set. The first three plots (a–c) are for homopolymers and copolymers represented as pink and cyan data points, respectively. The last three plots (d–f) are for polymer blends with one thermal property value and more than one thermal property value (immiscible polymer blends may show two \(T_{\textrm{g}}\) and two \(T_{\textrm{m}}\) values) represented as blue and lime data points, respectively. The distribution of data points for all properties is shown in the margins of each plot. RMSE, root-mean-square error; MAE, mean absolute error; R2, coefficient of determination; Ct, Count.

The parity plots of meta learner predictions for all the thermal properties on different polymer data sets are shown in Figure 4. These meta learner predictions are based on the 80% data set used to train the cross-validation models. The low overall RMSE values (including all the types of polymers) of 15 K, 17 K, and 23 K, and high R\(^2\) values of 0.98, 0.97, and 0.96 for \(T_{\textrm{g}}\), \(T_{\textrm{m}}\), and \(T_{\textrm{d}}\), respectively, signify high performance across the three distinct classes of polymers. Incorporating a hyperparameter-tuned meta learner on top of the cross-validation model further improves the performance of the multitask model as shown in Table S1. We use Monte Carlo dropout to quantify the uncertainty in our predictions and report it within a 95% confidence interval.44
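Monte Carlo dropout estimates uncertainty by keeping dropout active at inference time and sampling the stochastic network repeatedly; the spread of the samples yields a confidence interval. The sketch below captures that sampling loop with a noisy stand-in model in place of a dropout-enabled network (the stand-in and interval recipe are illustrative assumptions, not the authors' exact procedure).

```python
import numpy as np

rng = np.random.default_rng(0)

def mc_dropout_interval(predict_fn, x, n_samples=200, alpha=0.05):
    """Run a stochastic (dropout-enabled) predictor n_samples times and
    return the mean prediction with a (1 - alpha) confidence interval
    taken from the empirical percentiles of the samples."""
    samples = np.array([predict_fn(x) for _ in range(n_samples)])
    lower = np.percentile(samples, 100 * alpha / 2)
    upper = np.percentile(samples, 100 * (1 - alpha / 2))
    return samples.mean(), (lower, upper)

def noisy_model(x):
    """Stand-in for a network with dropout active at inference:
    a deterministic prediction plus stochastic perturbation."""
    return 2.0 * x + rng.normal(0.0, 0.1)
```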

Sample prediction

Model predictions and experimentally measured values across the whole composition range for two selected miscible (Figure 5a–b) and two immiscible polymer blends (Figure 5c–d) are illustrated in Figure 5. The smoothness of the predictions across the composition range indicates that the ML model learned a smooth mapping between the fingerprint space and polymer properties. The experimental data points are in close agreement with the predictions, except for a few points in Figure 5b that fall outside the shaded bands, which indicate the uncertainty of the predictions.

Figure 5

The predicted thermal property values and the experimentally measured thermal property values (solid circles) for (a) (i) poly(acrylonitrile-co-styrene) (ii) poly[(3-hydroxybutyric acid)-co-(3-ethyl-3-hydroxypropionic acid)] blend, (b) (i) poly(vinylidene fluoride) (ii) poly-tetrafluoroethylene-alt-ethylene blend, (c) (i) poly(propylene carbonate) (ii) poly[(vinyl alcohol)-co-(vinyl acetate)] blend, and (d) (i) poly(oxiranylmethyl methacrylate) (ii) poly(3-hydroxybutyric acid) blend.

Model validation

As a direct test of the generalizability of our trained meta learner model, we divided the entire data set into two parts: 90% of the data was used to train the fivefold cross-validation and meta learner models, while the remaining 10%, never seen by the model, was reserved to test both the classification and prediction models. The performance of the meta learner classification model on the testing data set is summarized in Table IIa, demonstrating promising performance for both thermal properties. Similarly, Table IIb presents a summary of the meta learner prediction model’s performance on the 10% testing data set. Additionally, the parity plots for all predictions can be found in Figure S2. Notably, the prediction performance for homopolymers and copolymers surpassed that for polymer blends on the testing data set, primarily because more data points are available for them, consistent with the findings from our fivefold cross-validation model. Performance also varied by thermal property, with the model demonstrating higher accuracy for \(T_{\textrm{g}}\) and \(T_{\textrm{m}}\) than for \(T_{\textrm{d}}\). Overall, the results suggest reasonable agreement between the predicted and measured values for all polymer classes and thermal properties, although additional data could further improve the model’s performance. To expand the data set for polymer blends, simulated \(T_{\textrm{g}}\) data can be employed; previous studies have used molecular dynamics simulations to generate \(T_{\textrm{g}}\) data for miscible blends.45,46 Furthermore, features encompassing processing parameters, morphological details, and experimental conditions can be utilized for all thermal properties to train more accurate models.

Table II The performance of the trained multitask (a) classification and (b) prediction meta learner model for the thermal properties on the testing data set.

Conclusion

In this study, we developed an ML framework capable of predicting properties of homopolymers, copolymers, and polymer blends simultaneously. Our framework expands the chemical domain for this property class and improves on previous works in both prediction accuracy and coverage.29 Powered by multitask predictors and a large data set of 23,018 thermal data points, this framework enables the prediction of polymer properties that fall in a broad technologically relevant class. Key to the success of the model building is a polymer blend fingerprinting scheme that extends and builds on the homopolymer and copolymer fingerprints we have used in the past.

There are several ways in which this work can be utilized to further expand the capabilities of polymer informatics in terms of accuracy and scope. Incorporating the impact of processing parameters and morphology in the case of polymer blends can lead to improvements in prediction accuracy and versatility. The conceptual ideas used in this work to set up the ML pipeline for thermal properties prediction for different polymer flavors can be extended to other properties (e.g., electronic and mechanical) when suitable data are available. Our approach can be adapted to different polymer classes, such as polymers with additives or polymer composites, by encoding features corresponding to the additive and fillers in the fingerprinting step.

Methods

Polymer genome fingerprints

Polymer SMILES strings (e.g., [*]CC[*] for polyethylene) are converted to numerical fingerprint vectors using a handcrafted fingerprinting scheme. For homopolymers/monomers, previous works have shown that effective fingerprinting involves using three hierarchical levels of descriptors that span different length scales.47,48 Starting from the atomic level, we determine the presence of predetermined atomic fragments or motifs, which are sequences of contiguous atoms.21 Moving up the hierarchy, we employ larger length-scale descriptors that utilize quantitative structure–property relationships (QSPRs), such as van der Waals surface area, topological polar surface area (TPSA), fraction of atoms in rings, and fraction of rotatable bonds.21 Finally, we use chain-level descriptors that characterize the physical structure of the polymer, including the shortest topological distance between rings, fraction of atoms in side chains, and length of the longest side chain.21 These three levels of descriptors enable us to capture diverse and critical features of the polymers that are important for their properties. The benefits of these descriptors are that they have sufficient chemico-structural information to describe a wide range of physical and chemical attributes that control various polymer properties, they can distinguish between two different monomers, and they are invariant to different specifications of the polymer SMILES strings of the same polymer.
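To make the three-level hierarchy concrete, the toy sketch below derives one feature per level directly from a polymer SMILES string. These features are deliberately simplified stand-ins: the real Polymer Genome scheme uses curated atomic fragments, QSPR descriptors such as TPSA computed with cheminformatics toolkits, and carefully defined chain-level descriptors, none of which are reproduced here.

```python
# Toy illustration of a three-level hierarchical fingerprint from a polymer
# SMILES string (e.g., "[*]CC[*]" for polyethylene). All features below are
# illustrative proxies, NOT the descriptors used in the actual scheme.
def toy_fingerprint(smiles):
    body = smiles.replace("[*]", "")          # strip end-point markers
    heavy = [c for c in body if c.isalpha()]  # crude heavy-atom proxy
    n = max(len(heavy), 1)
    return {
        # Level 1 (atomic): counts of simple atomic fragments.
        "count_C": body.count("C"),
        "count_F": body.count("F"),
        # Level 2 (QSPR-like): fraction of aromatic atoms, using the SMILES
        # convention that aromatic atoms are written in lowercase.
        "frac_aromatic": sum(c.islower() for c in body) / n,
        # Level 3 (chain-level): branch-opening count as a rough proxy for
        # side-chain descriptors.
        "n_branches": body.count("("),
    }
```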

Multitask models and meta learner

The polymer fingerprints along with the selector vector and thermal property values for homopolymers, copolymers, and polymer blends are used to train our predictive ML models. Before training, the thermal property values are scaled to [0, 1] using min-max normalization. For the classification models, polymer fingerprints, selector vectors, and the number of thermal property values for \(T_{\textrm{g}}\) and \(T_{\textrm{m}}\) of polymer blends are used for training. The ML architecture is shown in Figure 2a, where the multitask model is a concatenation-based conditioned multitask deep neural network. This multitask model is trained on 80% of the data set through fivefold cross-validation (CV) and the remaining 20% is utilized to train the meta learner. We utilized TensorFlow49 to implement all of our models. Adam optimization along with stochastic weight averaging is used for updating the weights of the network. The learning rate is initially set at \(10^{-3}\) and is changed in the later phases of training through the learning rate scheduler, along with early stopping to prevent overfitting. Hyperparameters of our ML model, which include the number of layers, the number of neurons in each layer, the initial learning rate, dropout rates, and the layer where the selector vector is concatenated, were tuned using the HyperBand algorithm implemented in KerasTuner.50 All values of the hyperparameters are listed in Table S2. The ML architecture and the hyperparameter tuning steps are the same for the prediction and classification models.
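The essential idea of a concatenation-based conditioned network is that the selector vector is appended to the activations of some (tuned) intermediate layer, so one set of shared weights serves all tasks. A minimal numpy forward-pass sketch, assuming a single hidden layer and concatenation immediately after it (the actual layer counts and concatenation point are tuned hyperparameters in the text):

```python
import numpy as np

def min_max_scale(y, y_min, y_max):
    """Scale property values to [0, 1] before training."""
    return (y - y_min) / (y_max - y_min)

def relu(z):
    return np.maximum(z, 0.0)

def conditioned_forward(fp, selector, W1, b1, W2, b2):
    """Concatenation-conditioned forward pass: the fingerprint passes
    through a hidden layer, then the selector vector is concatenated to
    the hidden activations before the output layer."""
    h = relu(fp @ W1 + b1)
    h_cond = np.concatenate([h, selector])  # task conditioning happens here
    return h_cond @ W2 + b2
```

Because the selector enters after the shared hidden layer, the same fingerprint produces different outputs for different tasks (e.g., \(T_{\textrm{g}}\) vs. \(T_{\textrm{m}}\)) while sharing most parameters.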

The predicted thermal property values from the fivefold CV models are used as inputs to the meta learner as shown in Figure 2c. The predictive meta learner is an ensemble model that predicts the final thermal property value from the five thermal property values of the five CV models.29 The meta learner for classifying miscibility is an ensemble model that predicts the final probabilities for miscibility of polymer blends. Both meta learners are trained on the remaining 20% of the data set that the cross-validation models have never seen. Similar to the multitask models, the hyperparameter optimization of the meta learner is achieved through the HyperBand algorithm implemented in the KerasTuner.50
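The stacking step above, mapping the five CV-model outputs to one final prediction, can be illustrated with least-squares stacking. The actual meta learner in this work is a hyperparameter-tuned neural network; the linear version below is a minimal stand-in that conveys the ensemble idea.

```python
import numpy as np

def fit_meta_learner(cv_preds, y_true):
    """Least-squares stacking: learn weights (and a bias) that map the
    five CV-model predictions to the final property value.
    cv_preds: (n_points, 5) array; y_true: (n_points,) array."""
    X = np.column_stack([cv_preds, np.ones(len(y_true))])  # append bias column
    coef, *_ = np.linalg.lstsq(X, y_true, rcond=None)
    return coef

def meta_predict(cv_preds, coef):
    X = np.column_stack([cv_preds, np.ones(len(cv_preds))])
    return X @ coef
```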