Научная статья на тему 'В дискретном случае усреднение не может быть состоятельным'

В дискретном случае усреднение не может быть состоятельным Текст научной статьи по специальности «Математика»

CC BY
35
9
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
УСРЕДНЕНИЕ / ОБРАБОТКА ОЦЕНОК / СОСТОЯТЕЛЬНОСТЬ / ДИСКРЕТНЫЙ СЛУЧАЙ / AVERAGING / PROCESSING ESTIMATES / CONSISTENCY / DISCRETE CASE

Аннотация научной статьи по математике, автор научной работы — Кошелева О., Крейнович В.

Когда мы имеем две оценки одной и той же величины, желательно объединить их в одну более точную оценку. В обычном случае непрерывных величин естественной идеей является вычисление среднего арифметического двух оценок. Если мы имеем четыре оценки, то мы можем разделить их на две пары, усреднить каждую пару, а затем усреднить полученные средние значения. Среднее арифметическое {\em состоятельно} в том смысле, что результат не зависит от того, как мы разделим исходные четыре оценки на две пары. Для дискретных величин (например, величин, описываемых целыми числами) среднее арифметическое двух целых чисел не всегда является целым числом. В этом случае нам нужно выбрать одно из двух целых чисел, ближайших к среднему. В этой статье мы показываем, что независимо от того, как мы выбираем (даже если мы допустим вероятностный выбор), полученное усреднение не может быть всегда состоятельным.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

In the Discrete Case, Averaging Cannot Be Consistent

When we have two estimates of the same quantity, it is desirable to combine them into a single more accurate estimate. In the usual case of continuous quantities, a natural idea is to take the arithmetic average of the two estimates. If we have four estimates, then we can divide them into two pairs, average each pair, and then average the resulting averages. Arithmetic average is {\em consistent} in the sense that the result does not depend on how we divide the original four estimates into two pairs. For discrete quantities e.g., quantities described by integers the arithmetic average of two integers is not always an integer. In this case, we need to select one of the two integers closest to the average. In this paper, we show that no matter how we select even if we allow probabilistic selection the resulting averaging cannot be always consistent.

Текст научной работы на тему «В дискретном случае усреднение не может быть состоятельным»

Mathematical Structures and Modeling 2018. N. 4(48). PP. 46-52

UDC 519.2 DOI: 10.25513/2222-8772.2018.4.46-52

IN THE DISCRETE CASE, AVERAGING CANNOT

BE CONSISTENT

Olga Kosheleva

Ph.D. (Phys.-Math.), Associate Professor, e-mail: olgak@utep.edu

Vladik Kreinovich Ph.D. (Phys.-Math.), Professor, e-mail: vladik@utep.edu

University of Texas at El Paso, El Paso, Texas 79968, USA

Abstract. When we have two estimates of the same quantity, it is desirable to combine them into a single more accurate estimate. In the usual case of continuous quantities, a natural idea is to take the arithmetic average of the two estimates. If we have four estimates, then we can divide them into two pairs, average each pair, and then average the resulting averages. Arithmetic average is consistent in the sense that the result does not depend on how we divide the original four estimates into two pairs. For discrete quantities — e.g., quantities described by integers — the arithmetic average of two integers is not always an integer. In this case, we need to select one of the two integers closest to the average. In this paper, we show that no matter how we select — even if we allow probabilistic selection — the resulting averaging cannot be always consistent.

Keywords: averaging, processing estimates, consistency, discrete case.

1. Formulation of the Problem

Need for averaging. In many practical situations, we have two (or more) estimates x1 and x2 of the same quantity x. In such situations, it is desirable to combine the two estimates and come up with a single — hopefully more accurate - estimate x1 * x2 of this quantity.

What operation * should be use? In geometric terms, the pair (x1,x2) can be naturally represented by a point in a 2-D plane. If the estimates were exact, we would have the exact same number x in both components of this pair, i.e., we would have the pair (x,x). It is therefore reasonable to look for the value x for which the corresponding pair (x,x) is the closest to the pair (x1,x2).

The distance between the 2-D points (x,x) and (x1,x2) is equal to

y/(x - Xi)2 + (x - x2)2. Minimizing this distance is equivalent to minimizing its square

(x — xi)2 + (x — x2)2.

Differentiating this expression with respect to x and equating the derivative to 0, we conclude that x = Xl + X2. Such averaging is indeed one of the main ways to combine two estimates; see, e.g., [1,5].

Averaging is consistent. If we have four estimates X\, X2, Xs, and x4, then a natural idea is:

• to divide them into two pairs; for example, we can divide into pairs (x1,x2) and (xs, x4);

• average values from each pair, coming up with combined estimates x1 * x2 and xs * x4, and

then average the resulting averages, combing up with the value

(xi * X2) * (xs * xA).

It is reasonable to require that the averaging operation is consistent in the sense that the result of this operation should not change if, on the first stage, we use a different division into two pairs, i.e., if

(xi * X2) * (X2 * X4) = (xi * Xs) * (X2 * X4).

What if the corresponding quantity is discrete? Some physical quantities — like electric change — are discrete, in the sense that they can take only values ..., — 2e, —e, 0, e, 2e,... proportional to some fixed value e. To make our discussion simpler, let us select this value e as a measurement unit. In this case, possible values of the quantity x are integers.

If xi and x2 have the same parity, i.e., if they are either both odd or both event,

then the arithmetic average x = Xi + X2 is also an integer. However, if one of

the estimates is even, and another is odd — e.g., if xi = 0 and xi = 1 — then the arithmetic average is no longer an integer. In this case, as one can easily see, we have two different integers x for which the square (x — xi)2 + (x — x2)2 of the distance is the smallest: the floor [x\ and the ceiling \x] of the corresponding fraction x. For example, for xi = 0 and x2 = 1, we have x = 0.5, so [x\ = 0 and \x] = 1.

Formulation of the problem. We would like to select, for very pair of integers (xi,x2), one of the two possible averages. A natural question is: can we select it in such a way that the resulting operation is consistent?

What we prove. In this paper, we prove that in the discrete case, averaging cannot be consistent.

2. Definitions and the Main Result

Definition 1. We say that an operation * : Z x Z ^ Z that maps pairs of integers into an integer is a discrete-case averaging if for every pair (xi,x2), the

result x = xi * x2 minimizes the sum (x — xi)2 + (x — x2)2:

(xi * x2 — xi)2 + (xi * x2 — x2)2 = min((x — xi)2 + (x — x2)2).

Definition 2. We say that a discrete-case averaging * is consistent if for every four integers xit x2, x3, and x4, we have

(Xi * X2) * (X2 * X4) = (Xi * X3) * (X2 * X4).

Proposition 1. No discrete-case averaging is consistent.

Proof. Let us assume that * is a consistent discrete-case averaging, and let us get a contradiction out of this assumption.

1°. By definition of a discrete-case averaging, the value 1 * 2 should be equal either to 1 or to 2. Let us show that in both cases, consistency is violated for some values x1, x2, x3, and x4.

2°. Let us first consider the case when 1 * 2=1. Let us prove that in this case, 2 * 3 = 2.

Indeed, by definition of a discrete-case averaging, we have 2 * 3 = 2 or 2 * 3 = 3. However, if 2 * 3 = 3, then, due to consistency, we have

(1 * 2) * (1 * 3) = (1 * 1) * (2 * 3).

We consider the case when 1 * 2 = 1; by definition, 1 * 3 = 2, thus the left-hand side of the above formula has the form (1 * 2) * (1 * 3) = 1 * 2, and we already know that 1 * 2 = 1.

On the other hand, if 2 * 3 = 3, then the right-hand side has the form

(1 * 1) * (2 * 3) = 1 * 3 = 2.

So, if 2 * 3 = 3, then the left-hand side and the right-hand side are different — and hence, the averaging * is not consistent. Since we assumed that * is consistent, this means that 2 * 3 cannot be equal to 3 — and thus, it must be equal to 2. Then, due to consistency, we should also have

(1 * 2) * (2 * 3) = (1 * 3) * (2 * 2).

Here, 1 * 2 = 1 and 2 * 3 = 2, so the left-hand side of this equality takes the form (1 * 2) * (2 * 3) = 1 * 2=1.

On the other hand, here 1 * 3 = 2 and 2 * 2 = 2, hence the right-hand side takes the form (1 * 3) * (2 * 2) = 2 * 2 = 2. So, the left-hand side and right-hand side are different — and thus, the averaging is not consistent.

3°. To complete the proof, let us consider the remaining case when 1 * 2 = 2. Let us prove that in this case, 0 * 1 = 1.

Indeed, by definition of a discrete-case averaging, we have 0* 1 = 0 or 0* 1 = 1. However, if 0 * 1 = 0, then, due to consistency, we have

(0 * 2) * (1 * 2) = (0 * 1) * (2 * 2).

We consider the case when 1 * 2 = 2; by definition, 0 * 2=1, thus the left-hand side of the above formula has the form (0 * 2) * (1 * 2) = 1 * 2, and we already know that 1 * 2 = 2.

On the other hand, if 0 * 1 = 0, then the right-hand side has the form

(0 * 1) * (2 * 2) = 0 * 2=1.

So, if 0 * 1 = 0, then the left-hand side and the right-hand side are different — and hence, the averaging * is not consistent. Since we assumed that * is consistent, this means that 0 * 1 cannot be equal to 0 — and thus, it must be equal to 1. Then, due to consistency, we should also have

(1 * 2) * (0 * 1) = (1 * 1) * (0 * 2).

Here, 1 * 2 = 2 and 0 * 1 = 1, so the left-hand side of this equality takes the form (1 * 2) * (0 * 1) = 2 * 1 = 2.

On the other hand, here 1 * 1 = 1 and 0 * 2 = 1, hence the right-hand side takes the form (1 * 1) * (0 * 2) = 1 * 1 = 1. So, the left-hand side and right-hand side are different — and thus, the averaging is not consistent.

The proposition is proven.

3. What If We Allow Probabilistic Averaging

Idea. The above result is about a deterministic averaging, when to every pair (xi,x2), we assign a single value xi * x2. For example, for xi = 0 and x2 = 1, we have two possible values x, for which the sum (x — xi)2 + (x — x2)2 is the smallest — namely, the values 0 and 1, and we pick one of these values.

But if 0 and 1 are equally good, why not select each of them with some probability, e.g., with probability 1/2 each? In this case, we get a probabilistic averaging, for which, for each xi and x2, the value xi * x2 is a random variable.

Natural question. Will the resulting probabilistic averaging be consistent — in the sense that for every xi, x2, x3, and x4, the random variables (xi * x2) * (x2 * x4) and (xi * x3) * (x2 * x4) have the same distribution?

What we prove. We prove that the answer is still "no" — but at least the above two random variables have the same mean.

Definition 3. By a probabilistic averaging, we mean an operation * that assigns, to every pair of integers (xi,x2), the following random variable:

• when the sum xi + x2 is even, the random variable xi * x2 is equal to

x =f Xi + X<2 with probability 1;

• when the sum xi + x2 is odd, the random variable is equal either to or to \x\, with some probability.

Definition 4. We say that a probabilistic averaging is consistent if for every Xi, X2, X3, and x4, the random variables

(xi * x2) * (x3 * x4) and (xi * x3) * (x2 * x4)

have the same distribution, where different * operations are assumed to be independent.

Definition 5. We say that a probabilistic averaging is weakly consistent if for every X]_, x2, x3, and x4, the random variables (xi * x2) * (x3 * x4) and (xi * x3) * (x2 * x4) have the same mean.

Proposition 2. No probabilistic averaging is consistent.

Proposition 3. There exists a probabilistic averaging which is weakly consistent.

Proof of Proposition 2. Let us assume that * is a consistent probabilistic averaging, and let us get a contradiction out of this assumption.

1°. Let us first prove that for all pairs (n,n + 1), the probability p of selecting n as n * (n + 1) is equal to either 0, or 0.5, or 1.

Indeed, by definition of consistency, we should have

(n * n) * ((n +1) * (n + 1)) = (n * (n + 1)) * (n * (n + 1)).

The left-hand side is equal to n * (n + 1) and is, thus, equal to n with probability p.

In the right-hand side, each of the two terms n * (n +1) is equal to n with probability p and to n +1 with the remaining probability 1 — p. Since different *-operations are assumed independent, we therefore have four possible cases:

• the first case is when both terms n * (n +1) are equal to n; the probability of this case is p • p = p2;

• the second case is when the first term is equal to n and the second term is equal to n +1; the probability of this case is equal to p • (1 — p);

• the third case is when the first term is equal to n + 1 and the second term is equal to n; the probability of this case is equal to (1 — p) • p;

• finally, the fourth case is when both terms n * (n + 1) are equal to n + 1; the probability of this case is (1 — p) • (1 — p) = (1 — p)2.

In the first case, the value (n * (n + 1)) * (n * (n + 1)) is always equal to n, and, as we recall, this case occurs with probability p2. In the second and third cases, the value n appears with probability p; thus, the overall probability of getting n in these cases is 2p • (1 — p) • p = 2p2 • (1 — p). In the fourth case, we always get n + 1. So, the overall probability of getting n is

p2 + 2p2 • (1 — p)= p2 + 2p2 — 2p3 = 3p2 — 2p3.

Since the operation * is consistent, the probability of getting n on both sides should be equal, so we must get p = 3p2 — 2p3. The first possibility to get this

equality is to have p = 0. If p = 0, then we can divide both sides by p and get 1 = 3p — 2 • 2p2, i.e., a quadratic equation 2p2 — 3p + 1 = 0, whose solutions are p = 0.5 and p = 1.

2°. From Part 1 of this proof, it follows that the probability p12 of getting 1 as a result of 1 * 2 is either 0, or 0.5, or 1. Let us first consider the case when p12 > 0. In this case, let us consider another consistency requirement:

(1 * 2) * (1 * 3) = (1 * 1) * (2 * 3).

In the left-hand side, 1 * 2 is equal to 1 with probability p12 > 0, and to 2 with the remaining probability 1 — p12. Here, 1 * 3 = 2, so (1 * 2) * (1 * 3) is equal to 1 * 2 with probability p12 and to 2 * 2 with probability 1 — p12. In the first case, we get 1 in p12 of the cases, so the overall probability that the left-hand side is 1 is equal to p^2.

In the right-hand side, 1 * 1 = 1, and 2 * 3 is equal to 2 with some probability p23 and to 3 with the remaining probability 1 — p23. Thus, the right-hand side is equal to 1 * 2 with probability p23 and to 1 * 3 = 2 with probability 1 — p23. In the first case, we get 1 in p12 of the cases, so the overall probability that the right-hand side is 1 is equal to p12 • p23.

Due to consistency, the probability that the left-hand side is 1 and that the right-hand side is 1 should be the same, so we get p12 = p12 • p23. Since p12 > 0, we can conclude that p12 = p23.

Now, let us consider yet another particular case of consistency:

(1 * 2) * (2 * 3) = (1 * 3) * (2 * 2).

The right-hand side is always equal to 2 * 2 = 2, while in the left-hand side, we have 1 * 2=1 with probability p12, 2 * 3 = 2 with probability p23 = p12 and thus, (1 * 2) * (2 * 3) = 1 * 2 with probability p212. Out of these cases, we get (1 * 2) * (2 * 3) = 1 with probability p12 • p12 > 0.

So, in the right-hand side, we never get 1, but in the left-hand side, we get 1 with positive probability — which contradicts to the consistency assumption.

3°. Thus, the case p12 > 0 is impossible, and so, we always have 1 * 2 = 2.

In this case, consistency implies that (1 * 2) * (0 * 2) = (0 * 1) * (2 * 2). Here, 1 * 2 = 2 and 0 * 2 = 1, and thus, the left-hand side is equal to 2 * 1 = 2.

The value 0 * 1 is equal to 0 with some probability p01 and to 1 with the remaining probability 1 — p01. Since 2 * 2 = 2, the right-hand side is equal to 0 * 2=1 with probability p01 and to 1 * 2 = 2 with probability 1 — p01. The left-hand side is always equal to 2, hence the right-hand side cannot be equal to 1, and so P01 = 0.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

Thus, we always have 0 * 1 = 1. In this case, we can use another particular case of consistency: (1 * 2) * (0 * 1) = (1 * 1) * (0 * 2). Here, since 1 * 2 = 2 and 0 * 1 = 1, the left-hand side is equal to 2 * 1 = 2, while the right-hand side is equal to 1 * 1 = 1 — a contradiction.

Thus, the proposition is proven.

Proof of Proposition 3. One can easily check that, as the desired probabilistic averaging, we can take the averaging in which, for each pair with non-integer x, we return both the floor and the ceiling of x with equal probability 1/2. In this case, the mean is simply the usual arithmetic average, and we know that the arithmetic average is consistent.

Acknowledgments

This work was supported in part by the US National Science Foundation grant HRD-1242122 (Cyber-ShARE Center of Excellence).

References

1. Rabinovich S.G. Measurement Errors and Uncertainty: Theory and Practice. Springer Verlag, New York, 2005.

2. Sheskin D.J. Handbook of Parametric and Nonparametric Statistical Procedures. Chapman and Hall / CRC, Boca Raton, Florida, 2011.

В ДИСКРЕТНОМ СЛУЧАЕ УСРЕДНЕНИЕ НЕ МОЖЕТ БЫТЬ

СОСТОЯТЕЛЬНЫМ

О. Кошелева

к.ф.-м.н., доцент, e-mail: olgak@utep.edu В. Крейнович

к.ф.-м.н., профессор, e-mail: vladik@utep.edu

Техасский университет в Эль Пасо, США

Аннотация. Когда мы имеем две оценки одной и той же величины, желательно объединить их в одну более точную оценку. В обычном случае непрерывных величин естественной идеей является вычисление среднего арифметического двух оценок. Если мы имеем четыре оценки, то мы можем разделить их на две пары, усреднить каждую пару, а затем усреднить полученные средние значения. Среднее арифметическое состоятельно в том смысле, что результат не зависит от того, как мы разделим исходные четыре оценки на две пары. Для дискретных величин (например, величин, описываемых целыми числами) среднее арифметическое двух целых чисел не всегда является целым числом. В этом случае нам нужно выбрать одно из двух целых чисел, ближайших к среднему. В этой статье мы показываем, что независимо от того, как мы выбираем (даже если мы допустим вероятностный выбор), полученное усреднение не может быть всегда состоятельным.

Ключевые слова: усреднение, обработка оценок, состоятельность, дискретный случай.

Дата поступления в редакцию: 13.10.2018

i Надоели баннеры? Вы всегда можете отключить рекламу.