Научная статья на тему 'BAYESIAN NON-INFERIORITY TEST BETWEEN TWO BINOMIAL PROPORTIONS'

BAYESIAN NON-INFERIORITY TEST BETWEEN TWO BINOMIAL PROPORTIONS Текст научной статьи по специальности «Математика»

CC BY
25
8
i Надоели баннеры? Вы всегда можете отключить рекламу.
Область наук
Ключевые слова
Non‐inferiority test / Test of proportion / Bayesian Inference / Conjugate prior / Power

Аннотация научной статьи по математике, автор научной работы — W.B. Yahya, C.P. Ezenweke, O.R. Olaniran, I.A. Adeniyi, K. Jimoh

The paper aimed to propose a new Bayesian test method for establishing a non-inferiority measure between an active treatment (drug) and a new (cheaper) treatment using two independent binomial samples. A Bayesian test statistic was developed for testing non-inferiority between two independent binomial proportions. Conjugate Beta prior was assumed for the binomial proportions to elicit posterior from the same Beta family of distributions. The efficiency of this test method was established via power analysis and its ability to yield the nominal Type I error rate (alpha) in a detailed Monte-Carlo study. Results from this study showed that the proposed test method yielded higher powers and good estimates of the Type I error rate at the chosen sample sizes and varying non-inferiority margins (effect sizes). Thus, the new Bayesian test method is very efficient at detecting the significance of the non-inferiority margin between two independent binomial proportions when such is not negligible at all sample sizes. Further results showed that the size of the two population proportions being tested influences the power and the estimated nominal Type I error rate with an increase in power and a good estimate of Type I error rate achieved when both population proportions being tested are less than 0.5. It is therefore concluded that the new Bayesian test method can be employed whenever it is desirable to establish the existence of non-inferiority or otherwise between a pair of (clinical) treatments (drugs). All the simulations and analyses were performed with the R statistical package.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «BAYESIAN NON-INFERIORITY TEST BETWEEN TWO BINOMIAL PROPORTIONS»

BAYESIAN NON-INFERIORITY TEST ... Volume 19, September 2024

BAYESIAN NON-INFERIORITY TEST BETWEEN TWO BINOMIAL PROPORTIONS

W. B. Yahya1, C. P. Ezenweke2, O. R. Olaniran3, I. A. Adeniyi4, K. Jimoh5, R. B. Afolayan6, M. K. Garba7, I. Ahmed8

!'3'6'7Department of Statistics, University of Ilorin, Ilorin, Nigeria 1wbyahya@unilorin.edu.ng; 3rid4stat@yahoo.com; 6afolayan.rb@unilorin.edu.ng ; 7garba.mk@unilorin.edu.ng

2'4Department of Mathematics, Federal University Lokoja, Nigeria 2chinenye.ezenweke@fulokoja.edu.ng; 4isaac.adeniyi@fulokoja.edu.ng

5Department of Physical Sciences, Al-Hikmah University, Ilorin, Nigeria 5jimkaminsha@alhikmah. edu.ng

8Department of Statistics, Nasarawa State University, Keffi, Nigeria 8ibrahimloko@nsuk.edu.ng

Abstract

The paper aimed to propose a new Bayesian test method for establishing a non-inferiority measure between an active treatment (drug) and a new (cheaper) treatment using two independent binomial samples. A Bayesian test statistic was developed for testing non-inferiority between two independent binomial proportions. Conjugate Beta prior was assumed for the binomial proportions to elicit posterior from the same Beta family of distributions. The efficiency of this test method was established via power analysis and its ability to yield the nominal Type I error rate (alpha) in a detailed Monte-Carlo study. Results from this study showed that the proposed test method yielded higher powers and good estimates of the Type I error rate at the chosen sample sizes and varying non-inferiority margins (effect sizes). Thus, the new Bayesian test method is very efficient at detecting the significance of the non-inferiority margin between two independent binomial proportions when such is not negligible at all sample sizes. Further results showed that the size of the two population proportions being tested influences the power and the estimated nominal Type I error rate with an increase in power and a good estimate of Type I error rate achieved when both population proportions being tested are less than 0.5. It is therefore concluded that the new Bayesian test method can be employed whenever it is desirable to establish the existence of non-inferiority or otherwise between a pair of (clinical) treatments (drugs). All the simulations and analyses were performed with the R statistical package.

Keywords Non-inferiority test, Test of proportion, Bayesian Inference, Conjugate prior, Power

1. Introduction

In pharmacological and drug discovery studies, the need for the development of new drugs to compete with some of the existing ones for a particular ill-health condition is often desirable for many reasons. Clinicians and end-users may prefer a new drug (or treatment) if significant advantages can be derived from its use such as having fewer side effects, being less toxic, being relatively cheaper, being relatively more convenient in its formulation and administration (e.g. tablets instead of infusion or infusion/drug instead of surgery) and so on (Yahya et al. [1]). Therefore, it is always a welcome development that improved, better, and (possibly) cheaper drugs or treatments are discovered and introduced for the treatment of various forms of ailments.

In clinical trials, a new drug (or treatment) for a particular medical condition can only be accepted as being good if its performance is at least as good as an existing one (the active control). In other words, a new drug may be preferred if its performance is equivalent to or better than (not inferior to) the active control (Chen and Peace [2], Ng [3], Norleans [4]). This is the scenario that brought about the concept of non-inferiority between two drugs or treatments in clinical trials.

There are different methods for determining if a new drug (or treatment), often referred to as the test product, is as good as an existing one, most of which involve the use of statistical tools. Some of the procedures for determining if the performance of a new drug is not inferior to an existing one include superiority trials and non-inferiority trials (Lesaffre [5], Yahya and Jolayemi [6]).

A superiority trial is used to prove that a new drug is better than an existing one, while a non-inferiority trial is used to show that the new drug is not (much) worse than the existing one (Kawasaki et al. [7]). Many clinical trials that compare a test product with an active comparator are designed as non-inferiority trials (Ng [3], EMA [8]). This is attributed to the fact that a placebo (controlled trial) is considered unethical or impractical (Temple [9]) among other reasons. In this study, our focus is much on non-inferiority trials based on nominal outcomes from two independent proportions, and a new test method to handle such a situation is proposed.

The term 'non-inferiority trial' is commonly used to refer to a randomized clinical trial in which a new treatment is compared with a standard active treatment rather than a placebo or untreated control group (D'Agostino et al. [10], Pocock [11]). The non-inferiority trial is a trial to show how an experimental treatment is statistically and clinically not inferior to the active control, see D'Agostino et al. [10]. In clinical trials, non-inferiority tests are frequently used to demonstrate that the response for study drugs is not much worse than the response for reference drugs, see Kawasaki et al. [7], Kawasaki and Miyaoka [12].

1.1 Non-Inferiority Tests

A non-inferiority test is used to indicate whether the responses from new drugs are clinically not much worse than the response to active control. It is often conducted in clinical trials. Many clinical trials that compare a test product with an active comparator or control are designed as non-inferiority trials. A non-inferiority trial is sometimes stated as being to demonstrate that the test product is not inferior to the active control. However, only a superiority trial can demonstrate this. A non-inferiority trial aims to demonstrate that the test product is not worse than the active control by more than a pre-specified threshold, often called the non-inferiority margin, A, see Ng [3], EMA [8], D'Agostino et al. [10].

Non-inferiority studies are typically confirmatory trials that employ a randomized parallel-group design with an active control group. Some trials also include a placebo control group. The placebo is used to validate the study and to demonstrate the superiority of the test treatment to the

placebo (Garrett [13]). Non-inferiority methods are frequently used in more serious, acute, and sometimes life-threatening situations such as oncology and infectious diseases, see Garrett [13].

Without loss of generality, a non-inferiority test is used to indicate whether the responses from a new drug are clinically not much worse than the responses from the active control (an existing drug) as it is often conducted in clinical trials. Thus, a prior judgment is made that, for the new treatment to be of merit, it only needs to be as good as the active control regarding appropriate outcome measure(s) of response, see Pocock [11].

Non-inferiority tests are examples of directional (one-sided) tests. There are many forms of this test depending on the form of the response. The non-inferiority tests for two-sample designs in which the outcome is a continuous normal random variable were carried out using the two-sample t-test procedure and the analysis of variance (ANOVA) fc-sample F-test procedure when there are more than two samples (Yahya and Jolayemi [6]). When the response or outcome is binary, the non-inferiority test procedure, which is the main focus of this study, involves determining if the difference between two binomial proportions is beyond a specified non-inferiority margin A.

1.2 Non-Inferiority Test Between Two Independent Proportions

The responses from non-inferiority studies can be binary, for example, cure or no cure, alive or dead, cancerous or non-cancerous, and so on. When this is the case, the test product group and active control group are referred to as two independent binominal samples. The non-inferiority tests for this case are carried out using the tests for the differences between two proportions. For instance, suppose that the current drug (treatment) for a given disease works 80% of the time. Suppose also that this treatment is either too expensive or occasionally exhibits serious side effects or that its administration is considered quite tedious (e.g. surgery) and a new promising treatment or drug has been developed to the point where it can be tested. One of the questions that must be answered is whether the new treatment is as good as the current treatment. In other words, do at least 80% of treated subjects respond positively to the new treatment?

Due to the many benefits (being cheaper, having relatively fewer side effects, and being very easy to administer) of the new treatment, clinicians would be willing to adopt the new treatment even if it is slightly less effective than the current treatment by some small tolerable margin. In order words, it is of interest to determine how much less effective the new treatment can be relative to the existing one (the active control) for which the new treatment can still be confidently adopted. This is called the non-inferiority margin of the new drug compared to the existing one (Ng [3], Yahya and Jolayemi [12]). The non-inferior margin represents how much worse the new treatment can be compared with the standard treatment, yet still, be considered 'similar' or 'not worse' than the standard treatment, Leung et al. [14]. In essence, there is a performance level of the new treatment as compared to the existing active control treatment below or above which is no longer considered ignorable.

In the above example, suppose it was found that the performance rate of the new drug is 75% among the population of users and this was considered acceptable by the clinicians and other users due to its numerous attendant benefits over the existing treatment. This simply shows that the non-inferiority margin of the new drug is 5%, i.e. A = 5% and the new treatment is thus, considered noninferior to the existing treatment. Thus, the drug developers need to design an experiment to test the hypothesis that the response rate of the new drug (treatment) is at least 0.75. In other words, the test hypothesis problem here is to establish whether the non-inferiority margin of 5% is exceeded by the new drug or not.

Different test methods have been proposed for establishing non-inferiority between a pair of two independent population proportions if such exists. Most of these methods were developed through the frequentist techniques while a few had appeared within the Bayesian framework. In

W. B, Yahya, C. P. Ezenweke, O. R. Olaniran, I. A. Adeniyi, K. Jimoh, R. RT&A No 3 (79) B. Afolayan, M. K. Garba, I. Ahmed ' 4

BAYESIAN NON-INFERIORITY TEST ..._Volume I9, September 2024

this work, a new Bayesian test method for testing the difference between two population

proportions within the context of the non-inferiority formulation is proposed. In the development

of this new test method, the emphasis is more on the effect sizes rather than the p-values of the

statistical tests as opined by Leung et al. [14].

2. Methods

Let Xj be an independent Bernoulli random variable that indicates whether a clinician or patient prefers treatment/drug j with the associated probability of success nj, while 7 = 1 for the existing active treatment and j = 2 for the new treatment. Therefore, in a total of n,- end-users of treatment/drug j, the random variable that represents the number of end-users that

preferred treatment/drug j is distributed Binomial with parameters n,- and nj. That is,

n-

~ Birt(p.jji y). The sample estimate of Uj is obtained as fij = 1=1 11, j = 12 . For simplicity,

the terms 'treatment' and 'drug' shall be used interchangeably in the subsequent discussions to represent the existing or the new clinical treatment adopted or preferred by the end-users.

To establish the efficiency of the new drug relative to the existing drug within the context of non-inferiority, the hypothesis of non-inferiority between the two drugs in terms of their relative preferences by end-users is constructed around the two independent population proportions ^(for existing drug) and n2 (for new drug), and the non-inferiority margin A0 as follows:

Hp ! > n2 + A against Hjz 1 < n2 + A (1)

The inequality statement n1> n2 + A in the null hypothesis H0 simply indicates that the proportion (n2) of users that preferred the new drug is only up to the proportion (%) of those that preferred the existing drug by a margin of the non-inferiority parameter value A0.

Therefore, if the null hypothesis H0 is not rejected by the test, it simply shows that the new drug is not inferior to the existing drug an indication that the non-inferiority margin Aq is negligible. However, a rejection of the null hypothesis H0 in favour of the alternative hypothesis Ha is a strong indication that the new drug is superior (more preferred by users) to the existing drug. Hence, the non-inferiority margin A0, which has contributed significantly to the rejection of H0 is not negligible.

Without loss of generality, the hypothesis set in (1) can be re-expressed as follows (with the null hypothesis H0 indicating no difference between the treatment and the active control groups):

1 - n2 = A0 against Han 1 - n2 < A (2)

Some of the existing methods for testing the non-inferiority hypothesis set in (2) include the z-test with a pooled variance given by

01) (3)

where, fir is computed as fir = 1-1 '-

L r L n1+n 2

Another test statistic for hypothesis in (2) is the Wald statistic, Ward and Ahlquist [15] given by:

^ftu-w-) (4)

It has been indicated in some studies that the performance of the Wald statistic suffers when the sample size is small, Di-Caterina and Kosmidis [16]. However, Munzel and Hsuschke [17] have shown the framework of the non-inferiority test for ordered categorical data. When the number of categories is two, it can be regarded as a problem of the difference between two proportions. Hence, the Wald test statistic is derived by extending the method proposed by Munzel and Hsuschke [17] to the non-inferiority test for deriving the difference between two proportions.

In this paper, an attempt is made to extend the Wald-test procedures from a Bayesian perspective.

2.1. The Posterior Distribution of Bayesian Binomial Proportions

The posterior distribution p(n\X)is defined as:

= ¿fr/*)xPW (5)

Given that f(x) = n X(1 — n)1~x}c = 01 ; then, the likelihood distribution function L(ti/X) is defined by

L(jr/X)= nr=17r"(l- nY~xl -> L(n/X) = n 2*i(l - n)n~Zxi (6)

Therefore, the Maximum Likelihood Estimator (MLE) of n is given by

MLE (tt) = it = ^ (7)

n

Assuming a conjugate beta prior for the likelihood in order to ascertain the same class of the posterior, the prior distribution, p{n) is defined as;

a0-l(1_ -.bo-l

Pfr>= rcU.) (8)

Therefore, the posterior distribution p(n\X) is determined using (5) as

p№) =

v v na°~1(l - if)60"1

1 7Ta0_lfl — TT~)b0~i

Jo *Z*CL - X r(ap)x T(b p) r(a o + №

TT2xi(l - 7r)n"2xi X TT20"1^ - tt)60"1

p(n\X) = —-

J0 7r£*i(l - X na0~1(l - nyo-idn

Recall from the Beta function that,

r1na0+Zxi-l(1_ ^bo+n-Zxi-ld = rCao+Zxj-ljxrft p+n-Z, Xj-1) J0 v ' T(aa + ba+n-2) V '

W. B, Yahya, C. P. Ezenweke, O. R. Olaniran, I. A. Adeniyi, K. Jimoh, R. RT&A N0 3 (79) B. Afolayan, M. K. Garba, I. Ahmed ' BAYESIAN NON-INFERIORITY TEST ..._Volume I9, September 2024

Therefore, the prior distribution in (10) becomes;

v(n\X} = wao+I.xi-l(1 — ^bo+n-Z.Xi-1 x rCao+Zxj-ljxrft 0+n-£x;-l) (11)

FV 1 ' V ' r(a0 + 6o+n-2) V '

Thus,

p(n\X}~Beta (a0 + £ b 0 + « < ^ < 1 (12)

If we let a = a0 + £ b = b0 + n —£ in (11) and (12), the posterior mean, and

posterior variance, Var(n\x) can be obtained as;

E(n\x) = ^ (13) Var(n\x)= -^--(14)

2.2 Prior Elicitation

2.2.1 Choosing a Conjugate Prior by Matching Location and Scale Parameters

Given that the Beta(ab) family of distributions is the conjugate family for Binomial(njz) distribution, the posterior will be a member of the same family, with the parameters updated by simple rules. Then, we can find the posterior without integration. Note that, the beta distribution can have many shapes. Therefore, the prior chosen should correspond to one's belief. We suggest choosing a Beta(a0b 0) that matches one's prior belief about the mean (location) and standard deviation (scale).

Let n0 be the prior mean for the proportion and let ofi be the prior variance for the proportion. But, we know that the mean of Beta(ab) distribution is If this is set to equal to what the prior belief about the mean of the proportion is, we have;

= (16)

Also, the variance of the Betafab) distribution is-7--■ This shall be set to equal to the

prior belief about the variance for the proportion to have:

°o2 = ^ "I*" ^ (17)

(a0 + 60)2(ao + 6o + l)

Note that n0 = "" from (16), this implies that 1 — ^ = bo . Substituting these in (17) to

a0 + b0 a0+b0

have;

a02 = ^^ (18)

Solving equations (16) and (18) for a0 and b0 gives the Beta(a0b 0) prior parameters. Proof: Given that n~ Beta(ab) then,

E(n) = —; Var{n) =-^--(19)

v ' a+b v ' (a+6)2(a+6+l) v '

BAYESIAN NON-INFERIORITY TEST ... Volume 19, September 2024

Solving for a0 in terms of b0 and n0 from (16) gives

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

bgng

l-Ho

Substituting (20) in (18) and solving for b0 gives

„ n0(l — n^)

u0 ~ I-

l—^o ^

2_ _— "o)_

b0U0 + bg — bgUQ +1—1%

1— no 2_ — "o)

b0 + 1 — 75

1— no

o£= n 0(1 — n0)x ——---

»0 + 1 —

2__ — n0)2

bo+l—Tf,

o-o(bo+l —fh)=n o(l — n0)2

borf + 0d — a§n0 =n0(l— n0)2 b0o% + — n0)= n0(l — n0)2 b0o% = n0(l — no)2 — — n0) n0(l — n0)2 — — n0)

bo =-5-

Cn

Substituting b0 in (21) into (20) to have

(1 — nb)[n-0(l — n0)—

an = ■

1 —n^ — nb)[n-0(l — n0)— ff02]

(1 — no)ff02

(20)

"0 — „2 (22)

Therefore, the posterior mean, ft and posterior variance, o^ are determined as follows. We recall from (13) and (14) that

a

E(n\x) =-= ft

a+b

and

ab

(a + b)2(a+ b+ 1)

respectively.

a b 1

= --x--x- „„

' a+b a+b a+fo+1 *

Also, recall that a = a0 + £ xv b = b0 + n then, by proper substitution, we have that

_ ag+Zxi

a0 + ¿b + n a-o+^Xi b0+ n -2,Xi

a0 + + h^ + n -'Zxi a0 + X X4

0 (23)

a0+ bo + n-'Zxi a0+ bo + n-'Zxi a0 + £+ bo + n xt + 1

„2 ao+^Xi bo+n-^Xi 1

-;-x-;-x-;-; (24)

a0+ib+n a0+ib+n a0+ib+n+l

By substituting (21) and (22) in (23) for a0 and b0 the Bayesian posterior mean of binomial proportion n is determined as follows;

np[no(l- Up)- g02] | Vv a2 +LXi "0_

- ^o)- ffp] | (1 - flo)K(l - Up)- ff2] | n °0 °o

_- ^o)- g02] + _

n-0[n-0(l - n-0)- CT02] + nb(l - TT0)2 - q2(l - n0) + ff02n

„ _ - ^0)- gp2] + q^Z^i

[n-0(l - Ho) - Oo][nQ + (1 - TJo)] + ogn noK(l - no)- CT02] + Oq

7T —

^0(1 - ^o) + - 1)

(25)

Also, the Bayesian posterior variance tr^ of binomial proportion n is derived by substituting (21) and (22) in (24) as

£(1 - ^0

[n-0(l - no) - <$] + (fin + TT(1 - TT>T02

n-0(l - n;,) + no

(26)

where ft = —as given in (7).

2.3 Proposed Bayesian Non-inferiority Test of Two Independent Binomial population proportions

Following the above Bayesian estimates of mean and variance of a binomial proportion, the proposed Bayesian non-inferiority test statistic for testing the hypothesis set

as earlier stated in (2) is;

where

Hp 1 - TT2 = A0 against Hjr 1 - n2 < A,

7Ti - Tin - A,

Z B= 1 2 -~N(Ql) 07)

+

^0l[^0l(l- ^Ol)- ffoi] + 7T ! = -----5--(128)

^01(1- ^oi) + qii("i- i) _ ^02 [^02(1 - ^02)- O02] +

71 2 " ^02(1- ^02)+ 4(«2- i) C J

BAYESIAN NON-INFERIORITY TEST ... Volume 19, September 2024

- 2 = — TT1)X fful

, ftn (1 — 7T2)X ff02

<7- = -—-—-—--(31)

"2 ^02(1 — ^02) + ($2n2

and ftj = z-£hixijJ = % .

2.4 Decision Rule

The decision rule for the test function is to reject H0 if ZB < — Zi_a where Z1_a is the quantile of the standard normal distribution at Type I error rate a.

If the null hypothesis H0 is not rejected by the test function (27), it simply shows that the new drug is not inferior to the existing drug an indication that the non-inferiority margin Aq is negligible. However, a rejection of the null hypothesis H0 in favour of the alternative hypothesis Ha by the test is a strong indication that the new drug is superior (more preferred by users) to the existing drug. Hence, the non-inferiority margin A0, which has contributed significantly to the rejection of H0 is not negligible.

2.5 The Power and Type I Error Rate (a) of the Proposed Bayesian Non-inferiority Test

In computing the power and nominal Type I error rate of the proposed Bayesian Non-inferiority test method, we consider the hypothesis of the non-inferiority test stated in (2), its test statistic as well as its decision rule. The hypothesis set to test is

Hp ! — n2 = A0 against Hjr 1 — n2 < Aq with the proposed test statistic for the above test according to (27) is of the form

z° = StrNm (32)

where A0= fi1 — ft2 and SE{A0) is the standard error of A0. The decision rule for the test function as earlier stated is to reject H0 if ZB < — Zi_a where Z1_a is the quantile of the standard normal

distribution at Type I error rate a. Note that, 5£(A0) = ^Jaj^+a^.

2.5.1 Power Computation

The power of the statistical test of size a for testing the null hypothesis H0 against the alternative set Ha is the probability that the test rejects H0 when Ha is true. That is;

Power= P[TestfunctioriRejectsH 0\Hais true]

Therefore, given the above decision rule for testing the hypothesis (2) above at some specified Type I error rate a, the power of the test is defined by

Power = P[Zcal < — ZL_a\HJ.strue ]

W. B, Yahya, C. P. Ezenweke, O. R. Olaniran, I. A. Adeniyi, K. Jimoh, R. RT&A No 3 (79) B. Afolayan, M. K. Garba, I. Ahmed ' 4

BAYESIAN NON-INFERIORITY TEST ..._Volume I9, September 2024

^ Power = P < —ZL_a\ni—n2< A,] (33)

Suppose A1 is the true difference between the two proportions n1 and n2 for the alternative hypothesis Ha to be true, hence, the test hypothesis (2) can be re-expressed as

Htfi 1 — n2 = A0 against Hjr 1 — n2 = A1, with A1< A-The expression for power of the test in (33) then becomes;

Power = P[gg<—^ |AJ (34)

Thus, we have;

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

Power = ^< — i-a]

Power = P < — + -M

- Power = p[z<—

, Power = ®[—(35)

where $ is the cumulative distribution function of the normal density and SE(A 0) is the standard error of A0= 7T1 — ft2. From (30) and (31), 0) is computed by

SE(A0) =

n1(l — n1)xa21 tt2(1 — TT2)X ct022

^01(1— ^oi) + q?ini ^02(1— ^02) + q>2n2 Therefore, the power of the proposed Bayesian non-inferiority test is given by (35). 2.5.2 Nominal Type I Error Rate Computation

The Type I error rate of a statistical test a is the probability that the test function rejects the null hypothesis H0 when H0 is actually true. Let the actual size alpha level of the test be denoted by a. This can be computed by;

S = (36) where A0 is the null difference. Thus, we have that;

A0 — A)

a = P

< —A-a

SE(A0) = < —

LSE(A0) ^ a SE(Ao)J

a = p[z<—z_a+1^

•■■ a=M—(37)

Thus, the power function and the estimator of the significance level a for the proposed Bayesian test statistic considered in this paper are presented in Table 1.

BAYESIAN NON-INFERIORITY TEST ... Volume 19, September 2024

Table 1: The Power function (ZB-Power) and estimator of the Nominal Type I error rate a (ZB-Alpha (a))of the proposed Bayesian test for two independent Binomial Proportions.

Description Estimator

Zb-Alpha (a) A-a+Ao : A n1{ 1 — n{) x ct021 | tt2(1 — 7t2) x ct022

— + ffoinl ^02(1 — ^02) + ff022n2

Zb-Power —Z1_a+A1 + A 7^(1 — tti)xct021 | tt2(1 —tt2)xct022

— ^01) + ff02inl ' ^02(1 — ^02) + ff022n2

3. Simulation Study

3.1 Simulation Scheme

The data utilized for this work were simulated using the R statistical package (R Core Team [18]). Two independent binomial random variables X1 and X2 were generated at different sample sizes n = 2O;10,OO using varying probabilities of success n1 and n2 with varying effect size

A0 whererr 1 = n 2 + Ai- The Power (%) and the nominal size a (%) values of the proposed test statistic were computed using 10,000 iterations of the basic experiment. The implementation of the Bayesian methodology is in two phases:

1. Obtaining the Bayesian estimate of fi1 and n2 by updating the data using conjugate beta-prior.

2. Estimating the corresponding Power (%) and the nominal size a (%) values of the proposed Bayesian test statistic.

Recall that the non-inferiority hypothesis test is given as H0\ ni — n2 = A0 vs. Ha: n1 — n2 <

The section that follows presents the results for different values of n1 and n2 at various sample sizes.

4. Results

The results of the proposed non-inferiority test between two binomial proportions are presented in phases based on the different parameters combinations as provided under the simulation scheme.

Table 2 presents the estimated power of the proposed Bayesian test first at the effect size A0= 0.05 but under two different sizes of proportion pairs (n1 = 0.2,n 2 = 0.25 ) and (ni = 0.6, n2 = 0.65) with the associated Bayesian test statistics Zb1 and Zb2 respectively at varying sample sizes. Table 2 equally presents the results of the estimated powers of the proposed test at the effect size A0= 0.1 and under two different sizes of proportion pairs (n1 = 0.2,n 2 = 0.3 ) and (ni = 0.6, n2 = 0.7 ) with the associated Bayesian test statistics Zb3 and Zb4 respectively also at varying sample sizes. Expectedly, it can be observed that the power of the proposed Bayesian test method increases as the sample size increases.

BAYESIAN NON-INFERIORITY TEST ... Volume 19, September 2024

Table 2: The Power (in %) of the proposed Binomial Non-inferiority Test (ZB) at various sample sizes (n), with the following test parameters; % = 0.2, n 2 = 0.25, effect size A0= 0.05 for test ZB1; nt = 0.6, n 2 = 0.65 effect size A0= 0.05 for test ZB2; nt = 0.2, n 2 = 0.3, effect size A0= 0.1 for test ZB3; nt = 0.6, n 2 =0.7 effect size A0= 0.1 for

test ZB4.

Power of Bayesian Non-Inferiority Tests

Sample Size (n) Effect Size A0= 0.05 Effect Size A0= 0.1

(ni,n2) = (0.2, 0.25) (ni,n2) = (0.6, 0.65) {ni,n2) = (0.2, 0.3) {ni,n2) = (0.6, 0.7)

20 21.91 16.35 70.42 65.69

40 34.92 28.16 85.01 79.77

60 44.29 37.23 91.78 89.25

80 50.86 43.48 96.16 93.90

100 56.80 49.50 97.96 96.51

120 64.02 55.29 99.03 98.17

140 68.96 59.67 99.62 98.92

160 73.81 65.24 99.77 99.28

180 78.11 69.20 99.88 99.67

200 81.25 71.57 99.96 99.86

It can also be observed from the results in Table 2 that, the power of the test increases remarkably as the effect size A0 increases. All these results are presented in the line graphs in Fig 1 (left) in which the powers of the test were plotted against the various sample sizes under the two effect sizes chosen.

Power at various Sample Sizes Alpha at various Sample Sizes

//

O

CN I, = 0.2 i2 = 0.25

I, = 0.2 i2 = 0.3

I, = 0.6 i2 = 0.65

O - -■- I = 0.6 i2 = 0.7

i 50 1 100 1 ,50 1 200

Sample Size

E №

m ^

/

50

i, = 0.2 i2 = 0.25 i, = 0.2 i2= 0.3 i, = 0.6 i2 = 0.65 i, = 0.6 i2 = 0.7 Nominal Alpha

100

Sample Size

,50

200

Figure 1: The graphs of the estimated Powers (left) and nominal Type I error rate a (right) of the proposed Bayesian Non-inferiority test for two Independent Population Proportions at different sample sizes and varying effect sizes.

Regarding the ability of the proposed test at returning the 5% nominal Type I error rate a set for it, it can be observed from Table 3 that the test returns nominal alpha values that are quite close to the 5% alpha level set for it at all the effect sizes most especially at higher sample sizes. However, the test under-estimated this nominal 5% a level of the test as all the estimated nominal values are below the 5% line as shown in Fig. 1 (right).

BAYESIAN NON-INFERIORITY TEST ... Volume 19, September 2024

Table 3: The estimated nominal Type I error rate a (in %) of the Proposed Binomial Non-inferiority Test (ZB) at various sample sizes (n), with the following test parameters; n1 = 0.2, n 2 = 0.25, effect size A0 = 0.05 for test Zbi; n1 = 0.6,n 2 = 0.65 effect size A0 = 0.05 for test Zbi; % = 0.2,n 2 = 0.3, effect size A0= 0.1 for test Zb3; % = 0.6,

n2 = 0.7 effect size A0= 0.1 for test Zb4.

Estimated Nominal Type I Error Rate a (in %) of Bayesian Non-Inferiority Tests

Sample Size (n) Effect Size A0= 0.05 Effect Size A0= 0.1

Zbi (n1,n2) = (0.2, 0.25) (n1,n2) = (0.6, 0.65) (ni,n2) = (0.2, 0.3) (ni,n2) = (0.6, 0.7)

20 1.91 0.71 2.17 0.83

40 3.33 1.98 3.63 1.54

60 3.28 2.77 3.47 2.65

80 4.17 3.63 4.23 3.46

100 4.00 3.90 4.06 3.66

120 3.95 3.99 4.53 3.53

140 4.54 4.11 4.75 3.78

160 4.58 4.24 4.82 3.79

180 4.47 3.94 4.68 3.68

200 4.36 3.79 4.73 3.69

5. Discussion of Results

In this paper, the Bayesian test statistic for testing the difference of two binomial proportions under the non-inferiority condition was presented. The estimators of the power and significance level for the proposed Bayesian test were summarized in Table 1. Table 2 presents the empirical percentage power of the proposed test at varying sample sizes under the four scenarios considered while Table 3 presents the nominal Type I error rate of the test at various sample sizes under different effect sizes.

The Bayesian test statistic of non-inferiority Zbi examines a test condition with a low effect size (A0= 0.05) and relatively small sizes of the proportion pair (n1 = 0.2,n 2 = 0.25 ). Under this scenario, the proposed Bayesian test method requires an average sample size of about 150 observations to achieve an approximate 70% power as can be observed in Table 2 and Fig 1 (left).

Unlike the test statistic Zbi, the Bayesian test statistic Zb2 examines a test condition with a low effect size (A0= 0.05) but relatively large sizes of the proportion pair (ni = 0.6,n 2 = 0.65 ) in the hypothesis set to be tested. Here, the Bayesian test Zb2 requires more sample units in the neighborhood of 180 samples before it could achieve about 70% power. It can therefore be concluded that the size of the binomial proportion pair (n1,n 2) influences the size of the power of the test. Thus, the smaller the sizes of the two population proportions n1 and n2 being tested, the higher the power of the test.

The Bayesian test statistic Zb3 unlike Zb1, examines a test condition with a relatively large effect size (A0= 0.1) and smaller sizes of the proportion pair (n1 = 0.2,n 2 = 0.3). Here, the proposed Bayesian test method achieved a reasonable power of about 70% even at a very small sample size as low as 20 samples as can be observed from the results in Table 1 and clearly shown by the power plot in Fig 1 (left).

Finally, the Bayesian test statistic Zb4 in Table 2 tested a pair of relatively large binomial proportions (ni = 0.6,n 2 = 0.7) with the same effect size (A0= 0.1) as used in Zb3 test. Given these parameters settings, the Bayesian test Zb4 requires more samples to attain the same feat of about

70% power achieved by Zb3 at a relatively smaller sample size simply due to an increase in the sizes of the two proportions n1 and n2. In all cases considered, the power of the proposed Bayesian test increases as the sample size increases.

To further examine the goodness of the proposed test method, its ability to retain its size a of 5% set for it was evaluated. Table 3 presents the empirical percentage significance levels a returned by the test at varying sample sizes under the four parameters combinations considered. It could be observed from Table 3 that the empirical percentage significance levels a provided by the test are closer to the 5% nominal level set for it, especially at sample sizes 100 and above. Finally, all these results in Table 3 showed that the ability of the proposed Bayesian test method to commit the Type I error is lower than the 5% nominal level set for it. At all the sample sizes considered, the test under-estimates the 5% nominal level set for it.

In this work, an efficient Bayesian test method for testing non-inferiority between two independent binomial proportions is proposed. The goodness of the proposed test method was assessed based on the power and the empirical Type I error rates provided by the test across the various sample sizes considered. Results in Table 2 clearly showed that the proposed test is quite efficient at detecting the significance of the non-inferiority parameter value when such is not negligible in all sample sizes.

Without loss of generality, the various results from power analysis and analysis of nominal Type I error rates reported by the test are a clear indication that the proposed Bayesian test method is quite efficient and good for testing and establishing non-inferiority between two binomial population proportions. Besides the high power reported by the proposed test, the empirical levels of significance a estimated by the test method that was closer to the 5% nominal level set for the test in all the cases considered also confirmed the goodness of the proposed Bayesian test method. It is therefore recommended that the proposed Bayesian test method be employed whenever it is desirable to establish the existence of non-inferiority or otherwise between a pair of treatments in which the preference of users is of the essence.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

References

[1] Yahya, W. B., Olaniran, O. R., Garba, M. K., Banjoko, A. W., Dauda. K. A., and Olorede, K. O. (2016). A Test Procedure for Ordered Hypothesis of Population Proportions against a Control. Turkiye Klinikleri Journal of Biostatistics, 8(1):1-12. doi:10.5336/biostatic.2016-50196.

[2] Chen, D. G. and Peace, K. E. (2013). Clinical Trial Data Analysis using R. Chapman and Hall/CRC Press, UK. https://doi.org/10.1177/0962280211425588

[3] Ng T-H. (2014). Noninferiority Testing in Clinical Trials: Issues and Challenges. 1st Edition, Chapman and Hall/CRC, UK.

[4] Norleans, M. X. (2019). Statistical Methods for Clinical Trials. 1st Edition. Chapman and Hall/CRC Press, UK.

[5] Lesaffre, E. (2008). Superiority, Equivalence and Non-Inferiority Trials. Bulletin of the NYU Hospital for Joint Diseases, 66(2):150-154.

[6] Yahya, W. B. and Jolayemi, E. T. (2008). A note on the Test of Hypothesis with Ordered Alternatives. Scientific Annals of "Alexandru loan Cuza ", 54(1), 197-208.

[7] Kawasaki, Y., Zhang, F. and Miyaoka, E. (2010). Comparisons of Test Statistics for Non-inferiority Test for the Difference between Two Independent Binominal Proportions. American Journal of Biostatistics, 1(1): 23-31.

6. Conclusion

[8] European Medicines Agency (EMA) (2005). Pre-authorization Evaluation of Medicines for Human Use. Canary Wharf, London.Doc. Ref. EMEA/CPMP/EWP/2158/99.

[9] Temple, R. J. (2022). FDA Guidance on Non-Inferiority Trials General Issues, 2022. URL: www.fda.gov (Accessed on 30th March 2022).

[10] D'Agostino, R. B., Massaro, J. M. and Sullivan, L. M. (2003). Non-Inferiority trials. Design concepts and issues - the encounters of academic consultants in statistics. Statistics in Medicine, 22(2): 169 - 186. DQI:10.1002/sim.1425

[11] Pocock, J. S. (2003). The pros and cons of non-inferiority trials. Blackwell Publishing Fundamental & Clinical Pharmacology, 17:483 - 490.

[12] Kawasaki, Y. and Miyaoka, E. (2013). A Bayesian non-inferiority test for two independent binomial proportions. Pharmaceutical Statistics, 12(4): 201-206. https://doi.org/10.1002/pst.1571

[13] Garrett, A. (2003). Therapeutic equivalence; Fallacies and falsification. Statistics in Medicine, 22: 741 - 762.

[14] Leung, J. T., Barnes, S. L., Lo, S. T. and Leung, D. Y. (2020). Non-inferiority trials in cardiology: what clinicians need to know. Heart, 106: 99-104. doi:10.1136/heartjnl-2019-315772

[15] Ward, M. D. and Ahlquist, J. S. (2018). Maximum Likelihood for Social Science: Strategies for Analysis. Cambridge University Press. P. 36.

[16] Di-Caterina, C., Kosmidis, I. (2018). Location-adjusted Wald statistics for scalar parameters. Statistics (Methodology), 1-21. https://arxiv.org/pdf/1710.11217.pdf

[17] Munzel, U.and Hsuschke, D. (2003). A nonparametric test for proving non-inferiority in clinical trials with ordered categorical data. Pharma. Stat., 2:31 - 37. DOI: 10.1002/pst.17.

[18] R Core Team. (2023). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https:// www.R-project.org/.

i Надоели баннеры? Вы всегда можете отключить рекламу.