MULTICLASS CLASSIFICATION IN THE PROBLEM OF DIFFERENTIAL DIAGNOSIS OF VENOUS DISEASES BASED ON MICROWAVE RADIOMETRY DATA

Levshinskii Vladislav Viktorovich

udc 004.891.3

Vladislav V. Levshinskii

Multiclass Classification in the Problem of Differential Diagnosis of Venous Diseases Based on Microwave Radiometry Data

Abstract. This article is devoted to applying mathematical models in the differential diagnosis of venous diseases based on microwave radiometry data. A modified approach for transforming feature space in thermometric data is described. After constructing features, a multiclass classification problem is solved in several ways: by reducing to binary classification problems using "one versus rest" and "one versus one" methods and building a multivariate logistic regression model. The best classification model achieved an average balanced accuracy score of 0.574. A key feature of the approach is that classification result can be explained and justified in terms understandable to a diagnostician. This article presents the most significant patterns in thermometric data and the accuracy with which they can identify different classes of diseases.

Key words and phrases: microwave radiometry, mathematical modeling, feature construction, multiclass classification.

2010 Mathematics Subject Classification: 97M60; 68T30, 68T35

Introduction

Nowadays, it is incredibly relevant to develop intelligent systems based on applying various methods of artificial intelligence [1]. Such systems can help to interpret and analyze examination data and support decision making in medical diagnosis. Systems of the greatest interest are advisory systems that apply artificial intelligence methods and algorithms and contain mechanisms to explain the proposed solutions. The development of advisory systems requires using mathematical modeling, machine learning, and data analysis methods.

The reported study was funded by RFBR, project number 19-31-90153.

DO lY&Jj1

Microwave radiometry is a promising diagnostic method based on the examination of intrinsic electromagnetic radiation of human tissues in the microwave and infrared wavelength ranges. Its vital feature is absolute harmlessness to the patient. The method is successfully applied in various fields of medicine [2-4], in particular, in the early diagnosis and dynamic control of varicose diseases of the lower extremities [5] classified as "diseases of civilization," since the number of people suffering from them is estimated in billions.

Examination technique consists of consecutive measurement of internal and surface (skin) temperatures, registration of temperatures in numerical data, and subsequent examination data analysis. A specialist search for anomalies in thermometric data is a highly complex intellectual task requiring long training and many years of experience. Future development of intelligent systems will both improve the quality of diagnosis in general and also solve the problem of lack of narrow specialists, which will make possible the mass application of this method. Interpretation and formalization of expert knowledge and knowledge extraction from the data are critical stages in developing models for solving such problems.

During the last decade, the first studies have appeared on applying mathematical modeling, machine learning, and data analysis to diagnose varicose diseases of the lower extremities based on microwave radiometry data. The first models were based on Bayesian classifier [6]. Feature space consisted of temperature values, and the criteria used in making the diagnosis were incomprehensible to a diagnostician. All this created significant difficulties in justifying and explaining a diagnostic decision.

Statistical models have become prerequisites for the creation of effective models and algorithms that allow interpretation and justification of result [7,8]. Those models were applied to solve the binary classification problem Healthy /Sick.

In a related field, in the diagnosis of breast cancer based on microwave radiometry data, as a result of data mining, a significant number of patterns describing anomalies in the behavior of temperature fields have been revealed [9]. They are the basis for a model that allows to justify and explain a result not only in the diagnosis of breast cancer but also in the diagnosis of venous diseases [10,11].

The purpose of this study is to apply the model for dynamically describing a patient condition in the problem of differential diagnosis of venous diseases based on microwave radiometry data.

Of course, the problem of differential diagnosis is not considered for the first time. Previously, D.A. Vedenyapin and A.G. Losev applied neural

MW Left MW Right IR Left IR Right G Left G Right

28.5 30.0 31.5 0.0 1.5 3.0

Figure 1. Temperature fields of a patient whose left leg is affected by venous disease. MW is internal, IR is surface temperatures and G is internal temperature gradients. The data is interpolated using cubic splines.

networks [12] to solve it. There is a comparative review of that approach in the conclusion.

1. Data and methods

1.1. Microwave radiometry

Microwave radiometry is a biophysical non-invasive examination method consisting of the consecutive measurement of internal and surface temperatures at specific points and registration of temperatures in numerical data. A specialist analyzes examination data in thermograms or maps of temperature fields to detect temperature anomalies and conclude the state of health or the need for further examinations. A method is based on the fact that temperature anomalies precede structural changes.

As an example, Figure 1 shows maps of the internal and surface temperature fields of a patient whose left leg is affected by venous disease.

During the examination of the lower extremities, a specialist measures the internal and surface temperatures at 12 symmetrical points located along the back surface of both lower legs, according to Figure 2. Several measurements are being taken for a patient in different positions: lying on the stomach and standing up.

Figure 2. Sampling points on each leg (1-12). 1.2. Dataset

Dataset containing measurement data of the lower extremities of 146 patients (292 lower legs) is being analyzed. Each lower leg is labeled depending on the presence of a particular disease:

0 (Healthy) is measurement data of the legs without diseases, 36 lower legs (12.3%);

1 (Norm 2) is healthy lower legs of patients with venous disease on the other lower leg, 67 lower legs (22.9%);

2 (CVI) is lower legs with chronic venous insufficiency, 100 (34.2%);

3 (PTS) is post-thrombotic syndrome, 69 lower legs (23.6%);

4 (ADVT) is acute deep vein thrombosis, 20 (6.8%). Formally, the dataset can be represented as a matrix

(1) X = № t2 . t2 . . til . ti ,v = yi V2 ,Y = {1, 2 ,...,C} ,

if j. m l2 . m . Ln ym

where m is the number of objects in the dataset, n is the number of features, x1 = (t\,... ,tln) is the feature vector of object i, Y is the set of class labels and y G Y is a class label.

1.3. Feature construction

The process of feature constructing and building a model for dynamically describing condition of each lower leg consists of several steps.

Feature vector contains 48 values of internal and surface temperatures measured at certain points of the lower legs in the lying and standing positions. Measurement points are shown in Figure 2. At the first step, temperature data is being split into the following groups:

1. Internal temperatures, standing position

yi,mw,st _ (yi,mw,st yi,mw,st)

2. Internal temperatures, lying position

y i,mw,ly _ (y i,mw,ly yi,mw,ly )

3. Surface temperatures, standing position

yi,ir,st _ (yi,ir,st yi,ir,st)

4. Surface temperatures, lying position

yi,ir,ly _ (yi,ir,ly yi,ir,ly )

Here, superscript mw or ir indicates the range of temperatures (internal or surface), and superscript st or ly (standing or lying) indicates the patient position during the measurement. Subscript is a point number.

There is an additional special group called internal gradients. That group contains differences between internal and surface temperatures at the corresponding points. For example, gradients of internal temperatures measured in the lying position are represented as yi,g,ly _ (yhsJy yhg,ly)

Further, for every group of points and separately for pairs of groups, several valuable characteristics are calculated. These characteristics are presented in the form of hypotheses about the behavior of temperature fields and the corresponding generalized mathematical descriptions [9,10]:

1. Hypothesis about an insignificant temperature difference, according to which healthy lower legs are characterized by low values of the following functionals: 1.1. Temperature oscillation

(2) Fi(T) _ maxt - mint

ter ter

where T is a set of temperatures.

1.2. Temperature deviation

(3)

F2(T ) = STdev (T ) = \

£ (t - t )2

teT

|T |- 1 :

where T is the average value of temperatures in T, |T| is the number of temperatures in T. 1.3. Deviation of temperature values relative to the average

(4)

F3 (T) = max T - t teT

(5)

1.4. Deviation of internal gradients. The maximum and minimum values, (2), (3), (4), and the following Lp norms are used as measures of the spread of internal gradients:

f(t) = ||tll! , f(t) = ||t||2, Fe(T) = IT||TO ,

where

iit ip = (eitip)p,

yT

teT

max

teT

More specific:

1.4.1. Maximum difference between the internal temperatures of the lower leg and the average temperature, standing position

fi(xi ) = F3(Ti'mw'st) = max teT i,mw,

Ti,mw,st _t

1.4.2. The spread of internal temperatures of the lower leg measured in the lying position

/2^ ) = F2(T4

)

£ (t - ti

teT i,mw,iy

\iy )2

It i

1

1.4.3. Oscillation of the surface temperatures of the lower leg measured in the standing position

/3(x4) = Fi(T i'ir'st)

max t — min t

teTi,ir,st teti,ir,st

2. Hypothesis about the symmetry of temperature fields, according to which healthy lower legs are characterized by slight deviations of temperatures at the corresponding points (subregions), as well as slight differences in the values of the corresponding characteristics.

The following characteristics are used as symmetry measures:

(6)

F(TC,T„) = ||TC - Tp\

F(Tc, Tp) = ||TCH - ||Tp||,

where ||z|| is a functional, Tc — Tp is an element-wise difference, Tc is current, and Tp is paired group of temperatures. These characteristics require an additional step of data preprocessing, as well as the presence of a pair for every lower leg in the dataset. For example, during the preprocessing of lower legs data, if the left lower leg is being viewed at the moment, then current temperature group is internal or surface temperatures of the left lower leg, and paired group is internal or surface temperatures of the right lower leg.

For paired temperature groups, the calculated characteristics are basically defined under the previous hypothesis, e.g:

2.1. Maximum absolute value of temperature difference of the corresponding points

Fr(Tc,Tp) = Fe(Tc — Tp)

2.2. Difference between the minimum and maximum temperatures of the lower legs

Fg(Tc,Tp) = max t — min t

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

teTc teTp

2.3. Difference of standard deviations of lower leg temperatures (7) Fg(Tc,Tp) = F2(Tc) — F2 (Tp)

2.4. Difference of average values, etc.

Fio(Tc, Tp) = Tc — Tp

3. Hypothesis about the stability of temperature fields, according to which healthy lower legs are characterized by slight differences in temperatures measured in different positions.

Features of this group characterize the degree of similarity between temperature fields in different positions and are practically similar to features defined within the symmetry hypothesis. For

example:

3.1. Difference of average values of surface temperatures of the lower leg, measured in standing and lying positions

f (x*) = F10 (T i'ir'st T i'irjy ) = T iir'st — T i'ir'ly

3.2. Maximum absolute value of the difference between the internal temperature gradients of the lower leg, measured in standing and

lying positions

f5(xi) = F7(T i,a,st,Ti,g,iy) =

_t ig,1

4. Hypotheses related to the physiological structure of the lower legs [7,8]. The values of lateral-medial and axial gradients for different groups of temperatures are considered, as well as their differences for the corresponding groups of the right and left lower legs:

4.1. lateral-medial gradient

F11 (T) = LMG(T) = TXt - T~t,

where Text is a subgroup of temperatures of the external part of the lower leg (points 1, 4, 7, 10) and Tint is a subgroup of temperatures of the internal part of the lower leg (points 3, 6, 9, 12).

4.2. axial gradient

(8) F12 (T) = AG(T) = TOp - TbTt,

where Ttop is a subgroup of temperatures of the top part of the lower leg (points 1, 2, 3) and Tbot is a subgroup of temperatures of the bottom part of the lower leg (points 10, 11, 12).

Going back to the example in Figure 1, the following is observed:

1. Similarity of the internal and surface temperature fields of the right lower leg when measured standing or lying. Features of the form (6) can be applied for detection and description;

2. Similarity of the internal temperature fields of the right lower leg when measured standing and lying. Similar for surface temperatures. The same features of the form (6);

3. Asymmetry of the temperature fields of the right and left lower leg. Features of the form (6) are applied, including all other features, e.g.

(7).

4. Differences in the internal and surface temperature fields of the left lower leg when measured standing or lying. Similar to item 2;

5. Bell-shaped contours are observed in the left lower leg. Such data can be detected by using, for example, deviation measures, features of the form (5), and the axial gradient (8), etc.

1.4. Thermometric features

For every object in the dataset, the values of functions f are calculated and 128 new features are constructed. Further, by binarizing [13] the

obtained values, a set of thermometric features is constructed

(9) S = (¿1,

where s is the number of features.

A thermometric feature is a triplet ^ = (f, I, W), where I is an interval and W is a weight (informativeness of f on I), or a quantitative measure that determines how well a feature separates objects of one class from other classes. Thermometric feature is considered fulfilled (observed for the object xl) if f (x1) € I.

Statistical informativeness [13] was applied for calculating weights. In the case of several classes, it is defined as

nPl S~<PK

C D ...CD

(10) I(¿,X) = - ln Pl Cp Pk ,

Cm

where Ck is a binomial coefficient, Pi is the number of class i objects in sample X, pi is the number of class i objects, for which the feature ^ is observed, p = p1 + • • • + pK. This measure is fair enough and works well for small unbalanced datasets.

A key feature of thermometric features is interpretability, which makes it possible to form a conclusion about the state of an object based on the values of thermometric features. Vector (¿^ ¿2,..., ¿s) dynamically describes the condition of the object in the sample. Element of a vector with index j equals 1 if feature j is observed for the object xi, and 0 otherwise.

After all transformations, the matrix (1) takes the form of a binary matrix

>i(x1) ¿2 (x1) . . ¿s(x1)'

(11) X' = ¿i(x2) ¿2 (x2 ) . . ¿s (x2 )

_Mxm) ¿ 2 ( x m) . . ¿s(xm)_

and further the classification algorithms are constructed. Moreover, every feature from (11) can be described in a language understandable to a diagnostician.

As binarization result, a large number of thermometric features can be obtained, while many features do not provide new information in combination with each other, so here arises the problem of feature selection. To solve this problem, logistic regression with L1-regularization [14] is applied. The process of transforming the feature space is illustrated in Figure 3.

Classification algorithm is defined as a(x*) =

where

{i

1, if hW(x*) > 0.5, 0, otherwise

hw(x4) = g(Wo + ^ Wj j(x4)) j=i

is the sum of weights of thermometric features, Wj is a weight of feature jj, and

g(z) = i ,1 -z 1 + e z

is a sigmoid.

Together with thermometric features, logistic regression is a weighted feature voting algorithm. To justify and explain the classification result, it is sufficient to combine the descriptions of the object's features.

The following approaches for solving the multiclass classification problem are considered:

1. Logistic regression (LR), one versus rest (OvR). For every class in the dataset, a model that determines whether an object belongs to the selected class is built. The most confident model determines the result. In total, C classifiers are trained, C is the number of classes.

2. LR, one versus one (OvO). For every pair of classes in the dataset, a separate classification model is built. A majority vote determines the result. In total, classifiers are trained.

3. Multinomial logistic regression (MLR), which is a generalization of logistic regression for the case of several classes.

In addition to multiclass classification, a hierarchical approach is also considered. The first model is applied for solving binary classification problem: for separating Healthy class from others, which can be done quite effectively. And the second model is applied to clarify the class of disease.

Stratified nested cross-validation [15] is used to evaluate the efficiency of classification and to compare models with each other. The dataset is split into 9 blocks at the outer level and 8 blocks at the inner level. Balanced precision [16] is used as a performance metric. It is defined as the

Table 1. Classification performance

Metric

Accf,

LR, OvR LR, OvO MLR

W/J H w/b H w/b H

Avg 0.557 0.548 0.574 0.537 0.56 0.541 Std Dev 0.102 0.074 0.065 0.05 0.073 0.078

Avg 0.844 — 0.781 — 0.838

0.096

Recalh 8 —

Std Dev 0.136 - 0.162

Reca lb.

Reca lb

Avg 0.609 0.641 0.583 0.684 0.597 0.608

Std Dev 0.138 0.176 0.13 0.092 0.12 0.151

Avg 0.614 0.579 0.617 0.549 0.595 0.641

Std Dev 0.197 0.142 0.09 0.11 0.163 0.167

R ., Avg 0.45 0.576 0.535 0.561 0.519 0.519

a Std Dev 0.18 0.136 0.104 0.189 0.143 0.177

R „ Avg 0.271 0.396 0.354 0.354 0.25 0.396

KecaiLt Std Dev a333 a24g a227 0^227 0^204 0^24g

average value of recall for every class:

n

Recall;

Acc6 = ^

c '

¿=1

correct;

Recall;, = -,

tota I .;

where correct;, is the number of class i objects, which are classified as i, total.; is the total number of class i objects, C is the number of classes.

2. Results and discussion

Results are presented in Table 1. There the mark "w/o H" belongs to algorithms that are trained on the dataset without class Healthy, Avg is the average score, Std Dev is standard deviation. The highest balanced accuracy scores are achieved using LR, OvO in the dataset containing all classes, and using LR, OvR in the dataset without class 0 (Healthy).

LR, OvR is the best model for identifying class Healthy. It has an average accuracy of 0.844.

Variance of LR, OvO scores is less than that of other models. This model, on average, performs better than other models in identifying class 2, which characterizes venous diseases. However, other models have better performance in identifying the rest of classes.

Models built for the dataset without class 0 have higher classification accuracy for various diseases than the same algorithms for the full dataset. In that case, LR, OvR model is leading and more efficiently identifies

Table 2. Thermometric features

Feature W Ro Rl R2 R3 Ri

rpi,ir,st ■ T^st ,-0.288) 48.32 0.0 0.7 0.18 0.16 0.2

|| Twt - T^st (1.967,7.72) 44.91 0.0 0.75 0.64 0.62 0.85

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

_ Tip-,w,ly e (4.25, oo) 44.65 0.0 0.69 0.57 0.55 0.95

,Tvr,st e (_0.288,0.279) 44.44 1.0 0.25 0.48 0.39 0.15

|| rpi,rnw,ly _TgmwJy e (1.45, 3.85) 42.81 0.94 0.22 0.34 0.32 0.05

|| Twiy ^€(4.45,00) 42.61 0.17 0.85 0.64 0.9 0.85

||TWt H^ e (i.25,oo) 41.62 0.0 0.73 0.64 0.54 0.7

rpi,ir}y _ TWy e (0.221, oo) 40.39 0.11 0.04 0.38 0.54 0.7

y -'Lni'l J// _ Ti,mw,ly e(^o _0.096) 39.98 0.14 0.75 0.31 0.16 0.25

y ■■/.nill .^i _Tvnw,st ||iG [0,5.55) 39.93 1.0 0.49 0.67 0.62 0.05

class 3. In comparison with MLR, this model distinguishes class 1 better, class 2 worse, and class 3 with the same accuracy. In comparison with LR, OvO, this model identifies class 1 worse and better identifies all other classes. Class 4 is characterized by a significant variance of average accuracy estimates.

Table 2 shows examples of the most informative thermometric features, which are the basis for classification models. There W is informativeness, Ri is a proportion of class i objects that have a feature. All these features describe the symmetry of temperature fields of the lower legs.

Three features with the highest informativeness are not observed in Healthy class. They allow effective detection of the lower legs with diseases. Such features are the difference in means and the deviation of surface temperatures of the legs measured in standing position, as well as the deviation of internal temperatures measured in the lying position.

Almost all healthy lower legs are characterized by a small difference between average values of skin temperatures measured in the standing position and a small deviation of internal temperatures measured in both standing and lying positions.

In class 1, there is practically no high difference in average values of internal or surface temperatures measured both standing and lying. At the

same time, the difference between average values of surface temperatures in standing position is usually higher for them than for class 0.

Class 4 does not exhibit low deviance of internal temperature gradients measured in the lying position. A low deviation of internal temperatures, measured both standing and lying, is practically not observed.

These and other features are used in weighted voting classifiers. And the given features signal that different classes of diseases are characterized by high deviation of internal and surface temperatures of the lower legs.

Conclusion

The most effective universal algorithm for solving the task is LR, OvO. It has an average balanced accuracy of 0.574. However, when applying a hierarchy of classifiers and reducing the problem to a binary classification Healthy/ Sick with subsequent clarification of the disease, the best result can be achieved with LR, OvR. It has an average estimate of clarification of the disease class of 0.548.

Earlier, Vedenyapin and Losev [12] applied three two-layer neural networks in sequential order to solve the differential diagnosis problem. Every network separated one of the classes from all the others, and the rest were classified as Healthy. That approach has an accuracy of 0.59. A detailed comparison of results is not possible because evaluation methods and datasets are a bit different. Nevertheless, the presented approach has the following advantages over neural networks:

1. A possibility to justify and explain the classification result. Every thermometric feature can be interpreted;

2. Anamnesis data (indicators of edema, pain, skin changes) is not used. It is possible that adding anamnesis data to features space can significantly improve the performance of classification. However, this is of interest for further research.

Results show the applicability of the model for dynamically describing the patient's condition in differential diagnosis of venous diseases. The key feature of constructed algorithms is the possibility to justify and explain the diagnostic decision.

References

[1] Roadmap for the development of "Pass-thmugh" digital technology "Neu-rotechnologies and Artificial Intelligence" , Ministry of Digital Development,

MULTICLASS CLASSIFICATION IN THE PROBLEM OF DIFFERENTIAL DIAGNOSIS OF VENOUS DISEASES BASED ON MICROWAVE RADIOMETRY DATA Текст научной статьи по специальности «Компьютерные и информационные науки»

Аннотация научной статьи по компьютерным и информационным наукам, автор научной работы — Levshinskii Vladislav Viktorovich

Похожие темы научных работ по компьютерным и информационным наукам , автор научной работы — Levshinskii Vladislav Viktorovich

Текст научной работы на тему «MULTICLASS CLASSIFICATION IN THE PROBLEM OF DIFFERENTIAL DIAGNOSIS OF VENOUS DISEASES BASED ON MICROWAVE RADIOMETRY DATA»