Open this publication in new window or tab >>Show others...
2022 (English)In: Journal of Chromatography A, ISSN 0021-9673, E-ISSN 1873-3778, Vol. 1666, article id 462867Article in journal (Refereed) Published
Abstract [en]
Structural elucidation of compounds detected with liquid chromatography coupled to high resolution mass spectrometry is a challenging and time-consuming step in the workflow of non-targeted analysis and often requires manual validation of the results. Retention time, alongside exact mass, isotope pattern, fragmentation spectra, and collision cross-section, is valuable information for ruling out unlikely structures and increasing the confidence in others. Different approaches to predict retention times have been used previously for reversed phase chromatography and hydrophilic interaction liquid chromatography (HILIC), but application is limited to a small set of mobile phases and gradient profiles. Here, we expand the toolbox available for retention time predictions by developing a random forest regression model for predicting retention times for four column types and twenty mobile phase systems. MultiConditionRT was built using a dataset containing 78 compounds analyzed with C18 reversed phase, mixed mode, HILIC, and biphenyl columns. In addition, different eluent compositions were used: both methanol and acetonitrile were combined with different aqueous phases with pH from 2.1 to 10.0 (formic acid, acetic acid, trifluoroacetic acid, formate, acetate, bicarbonate, and ammonia). The root mean square error (RMSE) of the test set predictions was 1.55 min for C18 reversed phase, 1.79 min for mixed-mode, 1.93 min for HILIC, and 1.56 min for biphenyl column. Additionally, MultiConditionRT can be applied to different gradient profiles with a general additive model-based calibration approach. The approach of MultiConditionRT was validated externally and internally with 356 and 151 compounds respectively, yielding an RMSE of 2.68 and 2.32 min. 324 and 84 of these compounds were not in the dataset used in the model development.
Keywords
High resolution mass spectrometry, Random forest regression, Gradient elution, Quantitative structure-retention relationship model
National Category
Chemical Sciences
Identifiers
urn:nbn:se:su:diva-202764 (URN)10.1016/j.chroma.2022.462867 (DOI)000756457600003 ()35139450 (PubMedID)
2022-03-112022-03-112024-08-21Bibliographically approved