Selecting initial values for iterative fitting of chromatographic peaks with exponentially modified gaussian function
Abstract
Various mathematical functions are used to describe shapes of chromatographic peaks. Some of these functions, such as exponentially modified gaussian, polynomially modified gaussian or parabolic variance gaussian are based on the normal distribution, and some are not. These functions have from 4 to 9 parameters that need to be iteratively optimized to fit the model function to the experimental data. Many of these functions are numerically unstable, therefore choosing an optimal initial guess of their parameters becomes crucial for successful fitting. The most commonly employed approaches are based on empirical equations relating the basic peak shape parameters (asymmetry value, width at 10% of peak height) and the parameters of the model function. Additionally, the algorithms for calculating the basic peak shape parameters are not thoroughly described in the literature.
Exponentially modified gaussian (EMG) was used as a model function in this work as it is a de facto standard in chromatography. Implementations of EMG in Python programming language libraries were listed. The numerical instability of SciPy implementation was investigated for symmetrical peaks and its probable causes were discussed. It was shown that Kalambet’s approach to calculating EMG (based on using several equations depending on the shape of the chromatographic peak) did not show such instability.
Approaches to calculate base peak parameters were discussed. Algorithms to find the apex coordinates, left and right halfwidths and width at selected peak height (10% to 50% of the peak maximum) were described.
It is widely known that the relation between the basic peak shape parameters and the parameters of the EMG function is not linear. The empirical equations that approximate these relations were suggested by Foley and Dorsey in the 1980s. We suggested using interpolation by splines instead. This approach significantly improved accuracy in estimating the model function parameters and allowed broadening the range of usable peak shape values. Splines can be calculated once and knots together with spline coefficients can be saved for future use.
In most of the articles and manuals width and halfwidths of the peak are calculated at 10% of the height. Alternative heights (10% to 50%) to calculate basic peaks parameters were tested. It was concluded that parameters of the EMG function can be calculated without significant difference in accuracy at different heights (from 10 to 30%) for noise-free peaks. For noisy data (S/N=100) 30-35% of the peak height can be considered as an alternative.
Downloads
References
Di Marco V.B., Bombi G.G. Math-ematical functions for the representation of chromatographic peaks. Journal of Chro-matography A. 2001; 931(1-2): 1-30.
Grushka Eli. Characterization of exponentially modified Gaussian peaks in chromatography. Anal. Chem. American Chemical Society, 1972; 44(11): 1733-1738.
Torres-Lapasió J.R., Baeza-Baeza J.J., García-Alvarez-Coque M.C. A Model for the Description, Simulation, and De-convolution of Skewed Chromatographic Peaks. Anal. Chem. 1997; 69(18): 3822-3831.
Baeza-Baeza J.J., Garcı́a-Alvarez-Coque M.C. Prediction of peak shape as a function of retention in reversed-phase liq-uid chromatography. Journal of Chroma-tography A. 2004; 1022(1-2): 17-24.
Caballero R.D., Garcı́a-Alvarez-Coque M.C., Baeza-Baeza J.J. Parabolic-Lorentzian modified Gaussian model for describing and deconvolving chromato-graphic peaks. Journal of Chromatography A. 2002; 954(1-2): 59-76.
Baeza-Baeza J.J., Ortiz-Bolsico C., García-Álvarez-Coque M.C. New ap-proaches based on modified Gaussian mod-els for the prediction of chromatographic peaks. Analytica Chimica Acta. 2013; 758: 36-44.
Li J. Development and Evaluation of Flexible Empirical Peak Functions for Processing Chromatographic Peaks. Anal. Chem. 1997; 69(21): 4452-4462.
Li J. Comparison of the capability of peak functions in describing real chro-matographic peaks. Journal of Chromatog-raphy A. 2002; 952(1-2): 63-70.
Pap T.L., Pápai Zs. Application of a new mathematical function for describing chromatographic peaks. Journal of Chro-matography A. 2001; 930(1): 53-60.
Pápai Z., L. Pap T. Determination of chromatographic peak parameters by non-linear curve fitting using statistical mo-ments. Analyst. Royal Society of Chemistry, 2002; 127(4): 494-498.
Foley J.P., Dorsey J.G. Equations for calculation of chromatographic figures of merit for ideal and skewed peaks. Anal. Chem. 1983; 55(4): 730-737.
Baeza-Baeza J.J., García-Alvarez-Coque M.C. Characterization of chromato-graphic peaks using the linearly modified Gaussian model. Comparison with the bi-Gaussian and the Foley and Dorsey ap-proaches. Journal of Chromatography A. 2017; 1515: 129-137.
Evaluating System Suitability - CE, GC, LC and A/D ChemStation - Revisions: A.03.0x-->A.08.0x.
Kalambet Y, Kozmin Y., Mikhai-lova K., Nagaev I., Tikhonov P. Recon-struction of chromatographic peaks using the exponentially modified Gaussian func-tion. J. Chemometrics. 2011; 25(7): 352-356.
Harris C.R., Harris C.R., Millman K.J., Van Der Walt S.J., Gommers R., Vir-tanen P., Cournapeau D., Wieser E., Taylor J., Berg S., Smith N.J., Kern R. Array pro-gramming with NumPy. Nature. Nature Publishing Group, 2020.; 585(7825): 357-362.
Virtanen P. et al. SciPy 1.0: funda-mental algorithms for scientific computing in Python. Nat Methods. Nature Publishing Group, 2020; 17(3): 261-272.
The pandas development team. pandas-dev/pandas: Pandas. Zenodo, 2024.
Hunter J.D. Matplotlib: A 2D Graphics Environment. Computing in Sci-ence & Engineering. 2007; 9(3): 90-95.
Waskom M.L. seaborn: statistical data visualization. Journal of Open Source Software. 2021; 6(60): 3021.
Granger B.E., Pérez F. Jupyter: Thinking and Storytelling With Code and Data. Computing in Science & Engineer-ing. 2021; 23(2): 7-14.
mkhrisanfov/peak-shape-calculation [Electronic resource]. URL: https://github.com/mkhrisanfov/peak-shape-calculation (accessed: 14.05.2024).
Zenkevich I.G., Makarov A.A., Pav-lovskii A.A. New approaches to the calcu-lation and interpretation of asymmetry fac-tors of chromatographic peaks. J Anal Chem. 2017; 72(7): 710-718.
Mallard W.G., Reed J. AMDIS – USER GUIDE. National Institute of Stand-ards and Technologies, 2019.
TensorFlow Developers. Tensor-Flow. Zenodo, 2024.
Samokhin A.S., Kalambet Yu.A. Opredelenie parametrov funkcii razvertki kvadrupol''nogo mass-spektrometra: 2. Analitika i kontrol'. 2018; 22(2): 168-176. (In Russ.)
Barber W.E., Carr P.W. Graphical method for obtaining retention time and number of theoretical plates from tailed chromatographic peaks. Anal. Chem. 1981; 53(12): 1939-1942.




