Научная статья на тему 'A NATURAL CAUSALITY-MOTIVATED DESCRIPTIONOF LEARNING'

A NATURAL CAUSALITY-MOTIVATED DESCRIPTIONOF LEARNING Текст научной статьи по специальности «Философия, этика, религиоведение»

CC BY
14
8
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
TEACHING / LEARNING / RELATIVITY THEORY / CAUSALITY / METRIC / KINEMATICMETRIC

Аннотация научной статьи по философии, этике, религиоведению, автор научной работы — Kosheleva O., Kreinovich V.

Teaching is not easy. One of the main reasons why it is not easyis that the existing descriptions of the teaching process are not very precise -and thus, we cannot use the usual optimization techniques, techniques whichrequire a precise model of the corresponding phenomenon. It is thereforedesirable to come up with a precise description of the learning process. Tocome up with such a description, we notice that on the set of all possible statesof learning, there is a natural order 𝑠6𝑠′meaning that we can bring thestudent from the state 𝑠to the state 𝑠′. This relation is similar to the causalityrelation of relativity theory, where 𝑎6𝑏means that we can move from point𝑎to point 𝑏. In this paper, we use this analogy with relativity theory to comeup with the basics of such an order-based description of learning. We hopethat future studies of these basics will help to improve the teaching process.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «A NATURAL CAUSALITY-MOTIVATED DESCRIPTIONOF LEARNING»

Mathematical Structures and Modeling 2022. N. 1 (61). PP. 59-67

UDC 378:514.82 DOI 10.24147/2222-8772.2022.1 .59-67

A NATURAL CAUSALITY-MOTIVATED DESCRIPTION

OF LEARNING

O. Kosheleva

Ph.D. (Phys.-Math.), Associate Professor, e-mail: olgak@utep.edu

V. Kreinovich Ph.D. (Phys.-Math.), Professor, e-mail: vladik@utep.edu

University of Texas at El Paso, El Paso, USA

Abstract. Teaching is not easy. One of the main reasons why it is not easy is that the existing descriptions of the teaching process are not very precise -and thus, we cannot use the usual optimization techniques, techniques which require a precise model of the corresponding phenomenon. It is therefore desirable to come up with a precise description of the learning process. To come up with such a description, we notice that on the set of all possible states of learning, there is a natural order s ^ s' meaning that we can bring the student from the state s to the state s'. This relation is similar to the causality relation of relativity theory, where a ^ b means that we can move from point a to point b. In this paper, we use this analogy with relativity theory to come up with the basics of such an order-based description of learning. We hope that future studies of these basics will help to improve the teaching process.

Keywords: teaching, learning, relativity theory, causality, metric, kinematic metric.

1. Formulation of the Problem

How can we train instructors faster? Effective teaching is not easy. For most instructors, it takes several years to master teaching - and even after these years, no matter how experienced the instructor is, there is always room for significant improvement.

How can we speed up this process? How can we make sure that instructors learn the teaching skills as soon as possible - and not, as now, spend several years learning these skills?

This problem is not easy. One of the reasons why training instructors takes so long is that teaching is not a precise science.

Naive thinking is that since we want to achieve optimal teaching, why not use optimization techniques - that have been so successful in many other application areas? Unfortunately, this is not that easy: optimization techniques require that the problem is formulated in precise terms, and the teaching problem is far from such a description.

We need to describe teaching in precise terms. To be able to utilize the effectiveness of the existing optimization tools, it is therefore desirable to be able to come up with a formal description of the teaching process.

Our approach to this description. One of our areas of interest is foundations of relativity, where an ordering relation - namely, the causality relation - turned out to play a fundamental role; see, e.g., [1-3]. In view of this fact, a natural idea is to look for a description of teaching in terms of an ordering relation.

2. Towards a Natural Description of the Learning Process

What are states of student knowledge. Before we start analyzing what is the natural relation on the set S of all possible states of knowledge, we first need to find out what is a natural way to describe these states.

To fully characterize the student's knowledge of the class material, we need to describe this student's degree of knowledge in each topic. Usually, the knowledge of each topic is described by a grade, and grades are somewhat subjective. To avoid this subjectivity, we can use some objective (or at least inter-subjective) way to gauging this knowledge: e.g., by the number of hours that an average student would take to get to this level of knowledge.

Thus, at any given moment of time, the student's knowledge can be characterized by the values si,..., sn describing this student's knowledge of all n topics, or, in other words, by an n-dimensional point s = (s1,...,sn). In this case, S is simply a subset in the n-dimensional affine space.

To be more precise, since the number of hours is always non-negative, S is a quadrant of the n-dimensional affine space in which all the coordinates s¿ are non-negative.

What is a natural ordering relation between states of knowledge. Now that we have an idea of what is the set S of states of knowledge, we can start analyzing what is a natural relation between these states. To come up with such a description, let us use an analogy with causality. Causality relation a ^ b between two points (events) in space-time means that, in principle, we can go from the point a to the point b, i.e., an observer can first observe a and then observe b.

Similarly, we can define an order s ^ s' between two different states of knowledge as the possibility to go from the state s to the state s', i.e., the possibility that a student was first in the state s and at some future moment of time, the knowledge of this student is characterized by the state s'.

We assume that the skills the students learn are not forgotten during for the duration of the course - or, to be more precise, that, in the first approximation, we can ignore the effects of possible forgetting. Under this assumption, the student's level of knowledge in each topic cannot decrease, so we cannot have s¿ > s-. On the other hand, if s¿ ^ s- for all i, then we can move from the state s to the state s' by teaching the student additional material in each topic. Thus, a natura ordering relation on the set S of all states of the student knowledge is the coordinate-wise

order:

(s = (si, ...,sn) ^ s' = (si,..., <)) ^ (Si ^ s[ for all i). (1)

How much effort do we need to move the students from one state to another. The ultimate goal of teaching is to bring the student from the original state (0,..., 0) (in which the student does not yet have any knowledge of any of the class topics) to the desired state I = (l1,... ,£n), where l is the student's desired level of knowledge on the z-th topic.

Our goal is to bring the student there the fastest way. To find out which way is the fastest, we need to know, for every two states s ^ s', how much student effort (e.g., measured by hours) we need to get from the state s to the state s'. Let us denote this amount by d(s,s'). To be more precise, d(s,s') denote the smallest possible effort needed to get from the state s to the state s'.

Comment. In the idealized case when all topics are independent, the only way to go from state s to state s' is to teach the student additional material for each topic. Because of our selection of the way we measure the student's knowledge in each topic - by the number of hours needed to go from 0 to Si - the additional time needed for the student to improve his/her knowledge from level Si to the level sis to spend time s- — Si. In this case, the overall time needed to go from s to s' is equal to the sum of these times:

n

d(s,s' ) = Y,« — Si). (2)

i=1

In reality, topics are interdependent, so the knowledge of one topic helps to study another topic. For example, knowing basic physics helps students to better understand calculus - for example, the derivative can be naturally understood when a student realizes the velocity is the derivative of the coordinate. Similarly, knowing the derivative can help the student better understand the ideas of velocity and acceleration. Because of such mutual help, the overall time needed to go from s to s' can be smaller that the sum of the corresponding times:

n

d(s,s>) ^ — Si). (3)

i=1

What are natural properties of the function d(s,s'). Any transition requires some efforts: the only time when the effort is 0 is when s = s1. So, we have the following property:

d(s, s) = 0 and (d(s, s') = 0 ^ s = s'). (4)

If we can go from state s to s' by using the amount d(s,s') of resources, and then we can go from s' to s" by using the amount d(s', s") of resources, then one of the possible ways to go from the state s to the state s" is to go through s'. For this

two-stage transition, we spend the amount d(s,s') + d(s',s"). Thus, the smallest possible amount d(s,s") of resources needed to go from s to s" cannot exceed this sum. Thus, we have the usual triangle inequality:

d(s,s") ^ d(s,s') + d(s',s"). (5)

How is our function d(s,s') related to metric and to its space-time analogue - kinematic metric (as measured by proper time)? Properties (4) and (5)

resemble the usual properties of metric. The main difference is that in our case, the value d(s,s') is only defined when s ^ s'. From this viewpoint, this notion resembles kinematic metric r(a, b) - the proper time that an inertial particle would measure when it goes from event a to event b: this values is also only defined when a ^ b. However, kinematic metric is known to be the largest value of the proper time - not the smallest as in our case - and thus, instead of the triangle inequality, it satisfies the opposite ("anti-triangle") inequality

r(a, a") ^ r(a, a') + r(a', a").

From this viewpoint, the proposed model is intermediate between the regular metrics and the kinematic metrics.

Towards a formal general definition. Let us combine the above-described natural properties of this "learning" metric (we will call it l-metric, l for "learning") into the following definition:

Definition 1. Let (S, be a partially ordered set. By an l-metric we mean a function d(s,s') that is defined for all pairs s,s' e S for which s ^ s' and that satisfies the properties (4) and (5).

Challenge. Since l-metrics provide a natural description of learning, we believe that to enhance the learning process, it will be beneficial to study the properties of such l-metrics.

One such property is described in the next section.

3. Shift-Invariant Scale-Invariant l-Metrics on the Finite-Dimensional Affine Space with Component-Wise Order

Why shift-invariant and scale-invariant. In the study of casuality, a good approximation to real-life causality is provided by the Special Relativity Theory, in which:

• the causality relation is invariant with respect to shift a ^ b a + c ^ b + c and with respect to scalings: a ^ b A ■ a ^ A ■ b for all A > 0, and

• the kinematic metric is both shift- and scale-invariant: r(a,b) = r(a + c,b + c) and r(A ■ a,\ ■ b) = A ■ r(a, b) for all A > 0.

In our case, component-wise order is also clearly shift- and scale-invariant. It is therefor reasonable to consider shift- and scale-invariant l-metrics.

Definition 2. Let ( S, be an n-dimensional affine space with coordinate-wise order (1). We say that an l-metric is:

• shift-invariant if d(s, s') = d(s + s", s' + s") for all s, s', and s" , and

• scale-invariant if d(\ • s,X • s') = X • d(s, s') for all s, s', and X> 0.

Proposition 1. Let ( S, be an 'n-dimensional affine space with coordinate-wise order (1). The following two conditions are equivalent to each other for any function d(s, s') defined for all pairs s,s' e S for which s ^ s':

• d(s, s') is a shift- and scale-invariant l-metric;

( , ) has the form

d(s,s') = (s 1 — *i) • F ()

\s'i — Si s'i — SiJ

for some positive-valued convex function F(r2,..., rn).

Proof. Shift-invariance clearly implies that d(s,s') = d(0,s — s'), and scale-invariance, with X = i — i (when i > i) implies that

d(s, s') = d(0, s' — s) = (si — si) • d(0, (1, T'2,..., rn)),

where we denoted

def Sj Si

Ti = --.

i — i

So, if we denote

F(r2, ..., rn) d=f d(0, (1, r2,..., rn)),

we almost get the desired result - the only thing remaining to prove is that the triangle inequality for the original function d(s,s') is equivalent to convexity of the function F(r2,..., rn).

Indeed, due to shift-invariance, the triangle inequality gets the following equivalent form

d(0, s" — s) ^ d(0, s' — s) + d(0, s" — s'). So, if we denote a d=f s' — s and b d=f s" — s', we get the following equivalent form:

d(0,a + b) ^d(0,a) + d(0, b). (6)

Due to scale-invariance, we have

d(0, a) = ai • d(0, (1, A2,..., An)) = ai • F (A2, ...,An),

where we denoted

Similarly, we have where we denoted

and

where we denoted

. def ai Ai — . ai

d(0,b) — b-1 • F (Bi,...,Bn),

TJ def bi

Bi — T, bi

d(0,a + b) — (ai + bi) • F(Ci,...,Cn),

a

def ai + bi

ai + bi

Thus, the triangle inequality (6) takes the form

(ai + bi) ■ F (C2, ...,Cn) ^ ai ■ F (A2, ...,An) + bi ■ F (B2,..., Bn). Dividing both sides of this inequality by ai + bi, we get an equivalent inequality

F(C2,...,Cn) ^

a1

ai + bi

F (A2,...,An) +

bi

ai + bi

F (B2,...,Bn),

i.e., if we denote

a

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

def ai

ai + bi

the form

F (C2, ...,Cn) ^ a • F (A2, ...,An) + (l - a) • F (B2,..., Bn).

(7)

By definition of Ai and Bi, we have ai = ai ■ Ai and bi = bi ■ Bi. Substituting these expressions for ai and bi into the formula that defined Ci, we get

ai + bi _ ai • Ai + bi • Bi _ ai ai + bi ai + bi ai + bi

Ai +

bi

ai + bi

Bi — a • A, + (l - a) • B%.

Thus, the inequality (7) takes the equivalent form

F (a • A2 + (1 - a) • B2,...,a • An + (1 - a) • Bn) ^

a • F (A2, ...,An) + (1 - a) • F (B2, ...,Bn),

which is exactly the definition of convexity.

The equivalence between triangle inequality for the function d(s,s') and the convexity of the corresponding function F(r2,... ,rn) is thus proven, and so is the proposition.

Comment. The case when all topics are independent and the l-metric is described by the formula (2) corresponds to the convex function F(r2,... ,rn) = r2 +... + rn.

Acknowledgments

This work was supported in part by the National Science Foundation grants 1623190 (A Model of Change for Preparing a New Generation for Professional Practice in Computer Science), and HRD-1834620 and HRD-2034030 (CAHSI Includes), and by the AT&T Fellowship in Information Technology.

It was also supported by the program of the development of the Scientific-Educational Mathematical Center of Volga Federal District No. 075-02-2020-1478, and by a grant from the Hungarian National Research, Development and Innovation Office (NRDI).

References

1. Alexandrov A.D. On Lorentz transformations. Uspekhi Math. Nauk, 1950, vol. 5, no. 1, p. 187 (in Russian).

2. Alexandrov A.D. and Ovchinnikova V.V. Remarks on the foundations of Special Relativity. Leningrad University Vestnik, 1953, no. 11, pp. 94-110 (in Russian).

3. Zeeman E.C. Causality implies the Lorentz group. Journal of Mathematical Physics, 1964, vol. 5, no. 4, pp. 490-493.

МОТИВИРОВАННОЕ ЕСТЕСТВЕННОЙ ПРИЧИННОСТЬЮ ОПИСАНИЕ

ОБУЧЕНИЯ

О. Кошелева

к.ф.-м.н., доцент, e-mail: olgak@utep.edu В. Крейнович

к.ф.-м.н., профессор, e-mail: vladik@utep.edu

Техасский университет в Эль-Пасо, Эль-Пасо, США

Аннотация. Преподавать нелегко. Одна из основных причин, почему это нелегко, заключается в том, что существующие описания процесса обучения не очень точны, и поэтому мы не можем использовать обычные методы оптимизации, методы, которые требуют точной модели соответствующего явления. Поэтому желательно придумать точное описание процесса обучения. Чтобы придумать такое описание, заметим, что на множестве всех возможных состояний обучения существует естественный порядок s ^ s', означающий, что мы можем перевести ученика из состояния s в состояние s'. Это отношение похоже на отношение причинности теории относительности, где а ^ Ъ означает, что мы можем двигаться из точки а в точку Ь. В этой статье мы используем эту аналогию с теорией относительности, чтобы придумать основы такого упорядоченного описания обучения. Мы надеемся, что дальнейшее изучение этих основ поможет улучшить процесс обучения.

Ключевые слова: преподавание, обучение, теория относительности, причинность, метрика, кинематическая метрика.

Дата поступления в редакцию: 14.02.2022

i Надоели баннеры? Вы всегда можете отключить рекламу.