IR and terahertz spectroscopy and machine learning for medical and ecological applications
Yu. Kistenev*, V. Prishepa, V. Skiba1, V. Nikolaev, G. Rasponin, D. Makashev, A. Borisov
LMIML Laboratory, Tomsk State University, Tomsk, Russia
* yuk@iao.ru
IR and THz spectra are associated with "molecular fingerprints" because these spectral ranges contain useful information about vibration, rotation, intramolecular and intermolecular modes of large number of molecules [1]. This information allows conducting a sample qualitative and quantitative analysis. Terahertz time domain spectroscopy (THz-TDS) is based on measurement of electric field of transmitted through the sample femtosecond terahertz pulses 0,3]. THz-TDS has a spectral resolution of several GHz that is enough for solid or dried liquid samples analysis. THz high-resolution absorption spectroscopy has spectral resolution of several kHz that enough for gas mixture analysis. Symmetric molecules have no strong absorption lines in THz range, polar possess absorption in both spectral ranges. Therefore, spectral analysis of a sample by a THz spectrometer is complementary to IR spectrometry. But combination of THz and IR spectra means that data are presented in high dimension feature space. In this case, data are highly correlated. It makes inefficient data analysis using conventional methods like Multivariate Curve Resolution [4], Univariate Calibration 0. Machine learning methods are becoming the best tool to solve the discussed tasks.
Several applications of THz and IR spectroscopy combined with machine learning methods will be presented.
The work was conducted with the financial support of the Ministry of Science and Higher Education of Russia (Agreement No. 075-15-2024-557 dated 04/25/2024).
[1] O. Cherkasova, M. Konnikova, Yu. Kistenev, V. Vaks, J.-L. Coutaz, A. Shkurinov, Terahertz spectroscopy of biological molecules in solid, liquid, and gaseous states, In Molecular and Laser Spectroscopy; Gupta, V.P. Elsevier, Ch. 13, pp. 433-478 (2022) (Advances and Applications: V.3).
[2] M. Walther, et al, Collective vibrational modes in biological molecules investigated by terahertz time-domain spectroscopy, Biopolymers: Original Research on Biomolecules, V. 67 (4-5), 310-313 (2002).
[3] O. Smolyanskaya, N. Chernomyrdin, A. Konovko, K. Zaytsev, I. Ozheredov, O. Cherkasova, M. Nazarov, J.-P. Guillet, S. Kozlov, Y. Kistenev, et al, Terahertz biophotonics as a tool for studies of dielectric and spectral properties of biological tissues and liquids, Prog. Quantum Electron., 62, 1-77 (2018).
[4] A. de Juan and R. Tauler, Multivariate Curve Resolution: 50 years addressing the mixture analysis problem - A review, Analytica Chimica Acta, V. 1145, 59-78 (2021).
[5] P. Koscielniak and M. Wieczorek, Univariate analytical calibration methods and procedures. A review, Analytica Chimica Acta, V. 944, 14-28 (2016).