Analysis of chi-squared divergence changes by filtering of stego images formed according to uniward embedding methods

Progonov D.O.

Visnyk N'l'UU KP1 Seriia Radiolekhnika tiadioaparatobuduummia, "2019, Iss. 76, pp. 72—76

УДК 004.[056.54:932.2]

Analysis of Chi-Squared Divergence Changes by Filtering of Stego Images Formed According to UNIWARD Embedding Methods

Progonov D. O.

National Technical University of Ukraine "Igor Sikorsky Kyiv Politechnic Institute" E-mail: progonov&gmaiL com

Counteraction to sensitive information leakage is topical task today. Special interest is taken on early detection of hidden (st.eganograpliic) information transferring by data transmission in communication systems. Message (stego data) embedding is provided by alteration of cover files, such as digital images, according to used st.eganograpliic algorithm. Reliable detection of formed stego images requires usage of targeted st.egdetect.or that needs a priori information about specific distortions (signatures) of cover due to data hiding. It makes detection systems vulnerable to zero-day attack usage by malefactors the previously unknown embedding algorithms. Therefore it is required development of universal (blind) st.egdet.ect.ors that, are capable to reliable revealing of stego images even in case of limited or absence information about, used embedding method. Creation of blind st.egdetect.or requires determination of cover image parameters that, are sensitive to any alteration caused by message hiding. As such parameters it. is proposed to use information-theoretic estimations (clii-square divergence) of pixels brightness distribution distortion due to stego data embedding. For amplification of these distortions it. is used image pre-processing with median and Wiener filters. The case of adaptive messages hiding in cover images according to UNIWARD methods is considered. It. is revealed that, usage of clii-square divergence allows reliably detection of small alteration of cover image even in case of low cover payload (less than 10 %). Different, character of clii-square divergence changes for filtered images by information hiding in spatial and JPEG domains allows determine type of used embedding domain.

Key words: st.eganalysis: adaptive embedding methods: UNIWARD algorithm: clii-squared divergence DOI: 10.20535/RADAP.2019.76.72-76

Introduction

Protection of confidential information is topical task today. Special interest is taken to counteraction of information leakage by data transmission in communication systems, snch as social networks [1]. Usage of steganographic methods allows hiding information into transmitted (cover) files, snch as digital images. It complicates revealing of unauthorized confidential information transmission with nsage of Data Leak Prevention systems (DLP-systems).

Ensuring of high detection accuracy (more than 90%) of modified (stego) images requires a priori information about specific alterations (signatures) of cover image caused by message hiding [1]. But in most real cases this information are limited or even absent, especially for advanced embedding algorithms. Therefore, it is needed development of universal (blind) stegdetectors that allows reliably detecting of stego images even in case of limitation the information about features of used embedding method.

1 Related works

One of the most widespread approaches to stego images detection is nsage of covers image models [1]. These models are taking into account cover image parameters, snch as [1]: parameters of pixels brightness distributions, correlation of adjacency pixels brightness, etc. Achieving of high detection accuracy requires merging several simple models into one rich model, for instance J—SRM [2], SRM [3] models. Despite of high effectiveness of mentioned cover image rich models, theirs practical applications are limited. It is caused by necessity of time-taking and computation intensive tuning of the enormous number of model's parameters (from 12,870 parameters for PSRM model to 35,263 parameters for J—SRM model) for achieving high detection accuracy.

The development of adaptive embedding methods, snch as UNIWARD [4], leads to considerable reduction of models-based stegdetectors performance. It is caused by minimization of cover parameters alterations by information hiding. For increasing detection accuracy

Analysis of chi-squarod divergence changes by filtering of stego images formed according to UNI WARD embedding methods 73

it was proposed detection methods that are based on applying of Artificial Neural Networks (ANN) [5]. The ANN is used for extraction features from analyzed images and further construction of cover image model. Nevertheless, duration of ANN-based stegdetectors tuning remains relatively long. Therefore it is needed development of fast and accurate universal (blind) stegdetectors that are effective even in case of absence a priori information about embedding method features.

For solving of mentioned task it was proposed to analyze alterations of cover pixels brightness distortions due to information hiding with usage of information-theoretic indices, such as chi-square divergence [6]. It allows decreasing duration of steg-detector tuning by preserving of detection accuracy. Practical application of proposed solution requires usage of initial (undistorted) cover images that are hard to obtain in real cases. For this limitation to be overcome we propose to use the calibrated cover image, obtained by suppression (filtering) of specific alterations caused by message hiding.

2 Task and challenges

Our purpose is analysis of chi-square divergence changes by stego image pre-processing with usage of median and Wiener filters in case of adaptive message hiding according to UNIWARD embedding method.

3 Adaptive methods for data embedding in digital images

For minimization of cover image distortions caused by information embedding there has been proposed adaptive embedding methods (AEM). Feature of AEM is representation of data embedding process as optimization task [7]:

F(C,S)= -ïï(vm) • D(yM) ^ min

with limitations

V M) = const,

\d(Vm) < De,

(1)

(2)

independent changes of cover image parameters. In this case, functions ^(vm) and D(um) can be represented with usage of Gibbs distribution [7]:

n(VM) = n

i=1

Vi^VM

D(VM) = Y.P( y^

(4)

where d - stego bits number; p(yi) - function for

th element of set yM- Variation of Gibbs distribution parameter A ( ) allows either improving robustness of formed stego image to steganalysis (A ^ or

increasing number of embedded stego bits (A ^ 0).

One of the most robust to steganalysis embedding methods is UNIWARD groups of algorithms. These methods are based on minimization of heuristically

defined distortion function p(^) [ ]:

£ ± £ , (i)

fi t=1t=i * + iwit\c )|

where F(C, S) - function for estimation of cover image C alterations % stego image S formation; y m ~ set of cover pixels brightness changes that is needed for message M hiding; Y - set of all the possible changes of cover pixels brightness; y m) _ estimation of probability a stego image formation by usage of y m D( y m) estimation of cover image parameters alterations duo to î/m applying; De - fixed level of cover parameters distortion by stego image formation.

The standard way to solving (1) is usage of assumption that embedding a single stego bit leads to

where C,S - grayscale cover and stego images with size M x N pixels; - wavelet coefficient on the

uv-ih. spatial position in the A;-th sub-band of first level of two-dimensional discrete wavelet transformation; a(a > 0) - stabilizing constant.

Usage of distortion function (5) by solving of optimization task (1) with limitations (2) allows creating the uniform approach to embedding methods construction. This approach can be adapted for message embedding in spatial (S-UNIWARD algorithm) as well as JPEG (J-UNIWARD algorithm) domains.

It is shown [6] that despite of high ''adaptability" of UNIWARD embedding methods to cover image, theirs usage leads to significant alteration of cover image pixel brightness distribution. These alterations can be detected by changes of information-theoretic indices, such as chi-squarod divergence, by taking of initial (undistorted) cover and stego images. In most practical cases such cover image is unavailable and can be only approximated (calibrated). Therefore, it is needed analysis of information-theoretic indices changes for initial and calibrated (filtered) analyzed images.

4 Analysis of divergences between pixel brightness distributions of filtered stego images

The well-known information-theoretic indices for estimation differences between distributions of cover Pc and stego Ps images pixels brightness are [ , ]:

74

Piogoriov D. O.

Kullback-Leibler divergences (Dkl)i Hellinger (Dh) and Bhattacharaya (Db) distance:

Dkl(Pc,Ps) = £ Pc(q) ■ log2(p^) ,

qeQ ^ s(q) '

D* (pG ,ps ) = ^

DB (Pc ,Ps )

- ln(l -D2H(Pc,PS)),

(6)

Je Pë(qj - ^p^))2, (7)

V

(8)

■^•S-UNIWARD embedding method O J-UNIWARD embedding method

Cover image payload, %

Fig. 1. Dependency of chi-squared divergence on cover image payload for undistorted covers and stego images formed according to S-UNIWARD (solid line) and J-UNIWARD (dashed line) adaptive embedding methods.

where Q = {0,1,..., 2fc — 1} - range of pixels brightness q',k — number of bits that is used for pixel's brightness encoding.

It is shown [6] that Kullback-Leibler divergence (6), Hellinger (7) and Bhattacharaya (8) distances are insensitive to weak alterations of cover pixels brightness distribution, caused by usage of AEM. For revealing these alterations it was proposed to use specific information-theoretic estimation, such as chi-squared divergence Dx2 [ ].

Chi-squared divergence Dx2 between distributions of cover Pc and stego Ps images pixels brightness can be estimated according to formula [8]:

It should be noted weak dependency of chi-squared divergence D^1 on cover image payload in case of usage UNIWARD embedding methods (Fig. 1). Therefore, detection accuracy for standard stegdetector [9], tuned with estimated chi-squared divergence D^ is relatively low - probability of misclassification is Pmc ~ 0.43.

For strengthening the changes due to message hiding it is represent an interest to pre-process analyzed image with usage of median and Wiener filters.

5 Experiments

DX2 (Pc ,Ps )

£

qeQ

(Pc(q) -Ps(q)Y Ps (q)

(9)

It should be noted that distance (9) is not symmetric - Dx2(Pc,Ps) = Dx2(Ps,Pc). Therefore, it is useful to take into account by digital images steganalysis estimation for cover (DC^ and stego (DS2) images as well as relative (D7X2l) chi-squared distances:

=DX2 (PS ,PC ),

Ds,2

Dx2 (Pc ,Ps ),

Df = /dSX2 .

(10)

(ii)

(12)

According to the results of [6], the relative chi-squared divergence Drxe2 (12) is more sensitive to weak alterations of cover image in comparison with DC2 ( ) and D'^ (H).

Typical values of divergence by analysis of undistorted cover and stego image, formed according to S-UNIWARD and J-UNWARD embedding methods analysis are represented on Fig. 1 [6].

Investigation was carried out with usage of pseudo randomly chosen 10,000 images from standard dataset MIRFLickr-25k [10]. Images were scaled to size 640x480 pixels with usage of Lanczos kernel and transformed to grayscale mode (8 bits color depth). Prepared images were saved in lossless JPEG format.

Stego images were formed according to S-UNIWARD and J-UNIWARD embedding methods [4]. Cover image payload was changed from 5% to 65% with step 5%..

Processing of images with median and Wiener filters were conducted in several steps. Firstly, analyzed image was divided into parts with usage of sliding windows W with size w x w(w G N0^d) pixels, where N0dd is set of odd numbers. Size of sliding window was chosen w = 7 according to recommendation [ ]. Then, for suppressing the edge effects (w+1)/2 points outside the image boundaries were symmetrically filled.

Image processing was started from left-upper corner and iteratively continuing by sliding window shift on 1 pixel from left to right. If sliding window achieved the last pixel in x row, window was moved to (x + 1) row.

For median filter, the initial (un-noised) value of central pixel brightness W™ was estimated as median of brightness distribution for pixels located within window.

For Wiener filter, it was used standard assumption that noise is additive white Gaussian (AWGN). Estimation initial brightness of sliding window's central

Analysis of chi-squared divergence changes by filtering of stego images formed according to UNIWARD embedding methods 75

pixels was calculatcd according to [12] :

w: = » +

(w(

w + 1)/2,(w+1)/2

where y = E [W] - expectation of pixels brightness; a2 = (E [W2] — y2) - variation of pixels brightness; E [•] - expectation operator; v2 = E [a2] - estimation of AWGN variance.

Estimation of chi-squared divergence D7^1 for filtered cover and stego images was carried out according to (12). Dependencies of divergence D7^ on cover image payload for processed initial (undistorted) cover as well as stego images for S-UNIWARD and JUNIWARD embedding methods arc represented at Fig. 2.

s \

\

_____

*- —1—^

- ^ k- „ -->

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

- 4 ■ —&

0 5 10 15 20 25 30 35 40 45 50 55 60 65

Cover image payload, %

(a)

: „

r ' f / / - -i r k.

/ / [ /

i i

0 5 10 15 20 25 30 35 40 45 50 55 60 Cover image payload, %

(b)

rel 2

Fig. 2. Dependencies of chi-squared divergence Dx on cover image payload for S-UNIWARD (a) and JUNIWARD (b) embedding methods. Results for median and Wiener filters arc marked as solid and dashed lines respectively.

It should be noted that different character of D7^1 changes by cover image payload variation for S-UNIWARD and J-UNIWARD embedding methods (Fig. 2). These differences can be used as distinctive features for revealing information about message embedding domain.

Also, there is revealed jump of D7^1 for stego images, even in case of low (less than 10%) cover image payload (Fig. 2). It allows using thresholding of D7^1 values for stego images detection. Application of tuned stcgdctcctor gives opportunity to significantly decrease probability of misclassification (Pmc ~ 0.17) in comparison with case of undistorted image analysis (Pmc - 0.43).

Conclusion

Based on the results the performed analysis, it, is shown that pre-processing (filtering) of analyzed images allows discerning differences between pixels brightness distributions of cover and stego images, formed according to advanced S-UNIWARD and J-UNIWARD embedding methods. Different character of chi-square divergence changes for stego images, formed according to S-UNIWARD and J-UNIWARD methods, can be used as distinctive features for revealing information about information embedding domain.

Taking into account these differences between chi-square divergence for cover and stego images allows up to 4 times reducing probability of misclassification (from for case of undistorted image analysis to ) even in case of low cover image payload (less than 10%).

References

[lj Fridrich J. (2009) Steganography in Digital Madia. DOI: 10.1017/cbo9781139192903

[2] Kodovsky J. and Fridrich J. (2012) Steganalysis of JPEG images using rich models. Madia Watermarking, Security, and Forensics 2012. DOI: 10.1117/12.907495

[3j Fridrich J. and Kodovsky J. (2012) Rich Models for Steganalysis of Digital Images. IEEE Transactions on Information Forcnsics and Security, Vol. 7, Iss. 3, pp. 868-882. DOI: 10.1109/tifs.2012.2190402

|4] Holub V., Fridrich J. and Denemark T. (2014) Universal distortion function for steganography in an arbitrary domain. EURASIP Journal on Information Security, Vol. 2014, Iss. 1. DOI: 10.1186/1687-417x-2014-1

[5] Davidson J., Bergman C. and Bartlett E. (2005) An artificial neural network for wavelet steganalysis. Mathematical Methods in Pattern and Image Analysis. DOI: 10.1117/12.615280

[6] Progonov D. (2018) Information-Theoretic Estimations of Cover Distortion by Adaptive Message Embedding. Information Theories and Applications, Vol. 25, No 1, pp. 47-62.

|7] Filler T. and Fridrich J. (2010) Gibbs Construction in Steganography. IEEE Transactions on Information Forensics and Security, Vol. 5, Iss. 4, pp. 705-720. DOI: 10.1109/tifs.2010.20 77629

|8] Bishop C. (2006) Pattern Recognition and Machine Learning, Springer-Verlag, 738 p.

[9] Kodovsky J., Fridrich J. and Holub V. (2012) Ensemble Classifiers for Steganalysis of Digital Media. IEEE Transactions on Information Forensics and Security, Vol. 7, Iss. 2, pp. 432-444. DOI: 10.1109/tifs.2011.2175919

[10J Huiskes M.J. and Lew M.S. (2008) The MIR flickr retrieval evaluation. Proceeding of the 1st ACM international conference on Multimedia information retrieval - MIR '08. DOI: 10.1145/1460096.1460104

[11] Avcibas I., Memon N. and Sankur B. (2003) Steganalysis using image quality metrics. IEEE Transactions on Image Processing, Vol. 12, Iss. 2, pp. 221-229. DOI: 10.1109/tip.2002.807363

[12] Gonzalez R.C and Woods R. E. (2007) Digital Image Processing, Prentice Hall, 976 p.

a2 -z/2

76

Progonov D. О.

Анал1з змш xi-квадрат вщсташ м!ж розподшами яскравост! шксел!в при фшьтраци стеганограм, сформованих згщно методу UNIWARD

Прогонов Д. О.

Протид1я несанкцюновашй передач! конф!денцшних даних е актуальною та важливою задачею сьогодш. Особлива увага придшяеться ранньому виявленню прихо-вано! (стеганограф1чпо1) передач! шформаци при обм!ш пов1домленнями в шформацшпо-комушкац1йпих системах. Приховаппя пов1домлепь (стегодапих) проводиться шляхом внесения змш до файл!в-контейнер!в, зокрема цифрових зображень. Забезпечення впсоко! ¡мов1рност! впявлення сформованих стеганограм потребуе застосу-вання спец1ал1зованпх стегодетектор!в, заснованпх на використанш апрюрнпх даних щодо використаного сте-ганограф!чного алгоритму. Це призводить до зниження ефектпвноста спстемп впявлення у випадку атаки нульо-вого дня (zero-day attack) - використання зловмисника-мп иопередньо нев!домих метод!в прихованням пов!дом-лень. Внаондок цього актуальною задачею е розробка ушверсальних (сл!пих) стегодетектор!в, здатних з ви-сокою точшстю впявлятп стеганограмп в умовах обме-женоста або нав!ть в1дсутност! апрюрних даних щодо використаного стеганограф!чного алгоритму. Виршен-ня дано! задач! потребуе впявлення та анал!зу слабких змш параметр!в зображення-контейнеру, обумовлених вбудовуваипям стегоданих. Для шдсилення даних змш в робот! запропоновано ироводити попередню обробку (ф!льтрагцю) доонджуваних зображень з використан-ням мед!анного та вшеровського фшьтр!в. Розглянуто випадок формування стеганограм з використанням но-в1тн1х адаптпвних метод!в UNIWARD. Показано, що по-иередня ф!льтрац1я стеганограм дозволяв виявити слаб-к! в!дмшноста в розподш значень яскравоста шксел!в зображень-контейнер!в та стеганограм, навиь у випадку малого заповнення контейнер!в стегоданими (менше 10%). Виявлено, що характер змш xi-квадрат в!дсташ м!ж розподшами значень яскравоста шксел!в зображень-контейнер!в та стеганограм суттево залежить в!д облает!

вбудовування стегоданих до контейнеру. Врахування даних змш при проведенш стегоанал!зу цифрових зображень дае можлив!сть визначати область приховання пов!домлень та, в1дпов1дно, обирати ефективш методи деструкци стеганограм.

Ключовг слова: стегоанал!з; адаптивш стегапографь чш методи; метод 1".\1\\".\ИI): хьквадрат в!дстань

Анализ изменений хи-квадрат расстояния между распределениями яркости пикселей при фильтрации стегано-грамм, сформированных согласно методу иМШ'АШЭ

Прогонов Д. А.

В работе исследованы изменения хи-квадрат расстояния между распределениями значений яркости пикселей изображений-контейнеров и стеганограмм при проведении фильтрации. Рассмотрен случай формирования стеганограмм с использованием адаптивных методов 1".\1\\".\НI). Показано, что использование медианного и винеровского фильтров дает возможность выявлять слабые различия в распределениях яркости пикселей изображений-контейнеров и стеганограмм, даже в случае малого заполнения контейнеров стего-данными (менее 10%). Выявлено, что характер изменений хи-квадрат расстояния между распределениями значений яркости пикселей исходных и обработанных стеганограмм существенно зависит от области встраивания стегоданных в изображение-контейнер. Анализ данных изменений при проведении стегоанализа цифровых изображений дает возможность определять область встраивания стегоданных в изображение-контейнер, что представляет интерес для выбора эффективных методов деструкции сформированных стеганограмм.

Ключевые слова: стегоанализ; адаптивные стегано-графические методы; метод 1".\1\\".\1Ш: хи-квадрат расстояние

Analysis of chi-squared divergence changes by filtering of stego images formed according to uniward embedding methods Текст научной статьи по специальности «Компьютерные и информационные науки»

Аннотация научной статьи по компьютерным и информационным наукам, автор научной работы — Progonov D.O.

Похожие темы научных работ по компьютерным и информационным наукам , автор научной работы — Progonov D.O.

Анализ изменений хи-квадрат расстояния между распределениями яркости пикселей при фильтрации стеганограмм, сформированных согласно методу UNIWARD

Текст научной работы на тему «Analysis of chi-squared divergence changes by filtering of stego images formed according to uniward embedding methods»