Как обучить искусственный интеллект медицине или размышления о новой роли стандартизации

Воробьев Павел Андреевич; Воробьев Андрей Павлович

DOI: 10.26347/1607-2502201807-08019-027

КАК ОБУЧИТЬ ИСКУССТВЕННЫЙ ИНТЕЛЛЕКТ МЕДИЦИНЕ ИЛИ РАЗМЫШЛЕНИЯ О НОВОЙ РОЛИ СТАНДАРТИЗАЦИИ

Рассматриваются проблемы глубокого обучения системам искусственного интеллекта на примере системы МеБЮазе. Анализируются проблемы, связанные с обучением с учителем экспертных систем, диагностические игры в выборе знаний врача, вопросы эвристического и дедуктивного подхода в диагностике. Делается заключение, что важный элемент системы стандартизации в здравоохранения — клинические рекомендации — могут стать основой для глубинного обучения интеллектуальных системы в медицине.

Ключевые слова: искусственный интеллект, глубокое обучение, экспертные системы, диагностические игры, обучение с учителем, эвристика, дедукция

Авторы заявляют об отсутствии возможных конфликтов интересов.

П.А. Воробьев1, А.П. Воробьев2'3

1 Московское городское научное общество терапевтов, Москва, Россия

2 ФГАОУ ВО Первый Московский государственный медицинский университет имени И.М. Сеченова Минздрава России (Сеченовский Университет), Москва, Россия

3 МТП Ньюдиамед, Москва, Россия

HOW TO TEACH MEDICINE AN ARTIFICIAL INTELLIGENCE, OR CONSIDERATIONS ABOUT A NEW ROLE OF STANDARDIZATION

Reviewed here is the challenge of the deep learning of artificial intelligence systems, with the MeDiCase system used as an example. The problems related to teaching expert systems with associative learning, diagnostic games with the choice of a doctor's knowledge, issues of the heuristic and deductive approach in diagnostics are analyzed. The conclusion is made that clinical guidelines as an important element of standardizing the healthcare system can play a fundamental role in deep learning of intelligence systems in medicine.

Keywords: artificial intelligence, deep learning, expert systems, diagnostic games, associative learning, heuristics, deduction

Authors declare lack of the possible conflicts of interests.

EA. Vorobiev1, A.P Vorobiev2'3

1 Moscow City Scientific Society of Primary Care Physicians, Moscow, Russia

2 Federal State Autonomous Educational Institution

of Higher Education «I.M. Sechenov First Moscow State Medical University» of the Ministry of Health of the Russian Federation (Sechenov University), Moscow, Russia.

3 Medical and Technology Company «Newdiamed», Moscow, Russia

Сегодня наблюдается огромный энтузиазм, связанный с обещаниями революционных побед искусственного интеллекта (ИИ) во многих областях науки и в быту, в том числе и в медицине. Большое число энтузиастов готовы вознести тему искусственного интеллекта на первое место на пьедестале развития медицины на ближайшее годы. Это новый виток романтического энтузиазма: подобное наблюдалось в 70—80-е годы, вновь — через 10—20 лет. Но в прежние годы ничего не произошло, считается, что в первую очередь из-за недостаточного развития технической базы. Но, скорее всего, большую роль (или — злую шутку) сыграла сама сущность ме-

дицины: до сих пор бытует мнение экспертов-медиков, что законы медицины отличаются от законов физики, химии и математики, что их нельзя описать графиками и формулами, математической моделью и просчитать на мощном процессоре.

В этих утверждениях есть, безусловно, доля истины, однако в медицине все построено на вероятностях и допущениях: нет практически ничего точного, чтобы не подвергалось сомнению и ревизии. Собственно, это — относительность — и является полем для развития ИИ: считается, что огромные статистические массивы и машинное обучение, которые являются основой для

развития ИИ, появление нейронных сетей приведет к выдающимся успехам за счет выявления невидимых человеческому взгляду закономерностей. Уже первые проекты, свидетельствуют, что нейронные сети могут работать как лучшие клиницисты. Но — лишь при решении определенных диагностических задач. При решении других задач, казалось бы, аналогичных или даже еще более простых, ИИ не работает.

В связи с этим логично попытаться ответить на важнейший вопрос: является ли современный уровень интереса к ИИ очередным периодом шумихи в циклах возбуждения, возникавших вокруг этой темы ранее? Или действительно появилось что-то новое, что изменяет ситуацию? Очевидно, что сегодня техническая мощность для реализации информационных технологий возросла многократно. Многое заставляет более внимательно воспринимать новые обещания применения ИИ: потенциальное влияние ИИ на здоровье может быть связано с социальными факторами и вызовами, которые могут на этот раз «сделать судьбу» ИИ в медицине. Синергич-ное влияние трех сил заставило общество определять новые ориентированные на здоровье подходы, которые могут быть достигнуты через ИИ:

1. Разочарование в устаревшей системе организации здравоохранения и высокие ожидания у потребителей медицинской помощи характерно для всего мира, а не только для нашей страны. Это разочарование различных фокус-групп общества связано с высокими затратами и невысоким качеством медицинской помощи. Как результат: коренное разрушение доверия открывает умы людей новым парадигмам, инструментам, услугам. Мы уже наблюдаем взрыв новой технологии контроля персонального здоровья через платформы смарт-устройств и интернет-взаимодействия. Появляется всеобъемлющее наблюдение, учет огромного числа факторов, которое определяет среду, в которой сейчас разрабатываются приложения ИИ [1].

2. Вездесущность сетевых устройств в нашем обществе, что связано со смартфонами и планшетами: инструменты на основе ИИ появляются в приложениях, ориентированных на здоровье, которые могут использоваться на карманных сетевых устройствах, мощность которых превосходит мощности имевшихся еще в недавнем прошлом «персональных компьютеров» [2].

3. Удобства услуг «на дому», предоставляемых через сетевых агрегаторов [3]: от банального «магазина на диване» произошло почти мгновенное переключение на самые разные системы закупок и продажи всего и вся, работу с мобильным банком, систему «Госуслуги» и т.д.

Таким образом, общество готово воспринять ИИ в медицине. Но не готовы врачи. Недоверие врачей к ИИ в медицине очевидно и строится на их, на самом деле, примитивном мнении об уникальности процессов постановки диагноза на основании... «интуиции». Врач чаще всего не задумается, ставя диагноз, — а как он это делает. Вместе с тем точность определения болезни опытным клиницистом, точность прогноза последующего течения заболевания нередко впечатляет пациентов. Известны врачи, ставящие диагноз за то время, пока пациент проходит расстояние от двери до рабочего стола терапевта.

Врача озарило? — Озарение носит название «эвристического» метода принятия решений или решения задач, от того самого восклицания «Эврика» с которым один, всем известный древний грек выскочил из ванны. Эвристика — это решение задач в условиях высокой степени неопределенности, к которым и относится медицина. Эвристические подходы противопоставляются формальным методам решения, опирающимся на алгоритмы и модели. Эвристические методы понимаются как любые методы, направленные на сокращение перебора, или как индуктивные методы решения задач. Индукция здесь обозначает, что обнаруженный, иногда незначительный факт, как катушка индуктивности запускает действие огромной, несоразмерной с первичной величиной индуктора с существенными результатами.

К сожалению, интуиция — дело далеко не точное, и она должна быть проверена современными доказательствами, основанными на методах статистики. Первичная диагностическая гипотеза, пришедшая в голову врачу, сразу обрастает огромным количеством лабораторных и инструментальных исследований, дополнительных консультаций. Это является следующим этапом диагностического поиска по эвристической или индуктивной методологии: после «озарения» идет подбор аргументов, которые подтверждают озарившую врача гипотезу. Делается это неосознанно, но может иметь слишком боль-

шие последствия. Нередко врач ищет аргументы в пользу своей мысли («за»), упорствует и отбрасывает все аргументы «против» как несущественные, мешающее правильному решению. Не говоря уже о «несущественных» признаках, не укладывающихся в обсуждаемую диагностическую гипотезу или не релевантные ей.

Использование эвристического подхода в диагностике сокращает время решения задачи по сравнению с методом полного ненаправленного перебора возможных альтернатив, но получаемые решения не являются, как правило, наилучшими. Применение эвристических (индуктивных) методов не всегда обеспечивает достижение поставленной цели. Именно здесь кроется суть спора между «физиками (а теперь — кибернетиками) и лириками» от медицины — наука, медицина или искусство. Если искусство, то тогда эвристические подходы оправданы, поскольку требует наименьших затрат времени, являясь рабочим инструментом клинического мышления, соединяющего в единый процесс мысль и действия. Если наука, — то озарению места немного.

Эвристические методы наиболее перспективны в условиях часто повторяющихся ситуаций, когда врач воспроизводит модели уже виденного им ранее. Если речь идет о частых заболеваниях, с которыми врач сталкивается ежедневно по несколько раз, — модель точна. Например, респираторные инфекции или артериальная гипертония. Или, вариант, модель некой болезни синтезирована врачом по результатам изучения литературы, при обучении. Поскольку ситуация частая, то и диагностическая гипотеза обычно будет правильной. Но небольшой процент «диагностических ошибок» есть всегда. Врач прогнозирует именно ту ситуацию, которая наиболее вероятна. Например, в зимне-весенний период молодой человек, пришедший на прием с очень высокой вероятностью болен респираторной вирусной инфекцией. Его даже смотреть не надо. Но очень редко, может быть 1 на 5—10 тыс. пациентов, к врачу с такими же жалобами придет больной острым лейкозом. И врач, используя эвристические подходы в диагностике, обязательно пропустит диагноз, что, скорее всего, закончится смертью больного. Но ведь в остальных тысячах и тысячах случаев он будет прав!

Эвристические подходы используются и в принятии коллективных решений. Они получили название «сократические беседы», исторически восходящие к Сократу. Эта методология состоит в задавании наводящих и уточняющих вопросов, приведении примеров и контрпримеров. При этом исходно равноценны все самые неожиданные предлагаемые решения, и они должны обсуждаться. Коллективный метод решения трудных проблем известен как «мозговой штурм» и, на самом деле, используется при проведении консилиума в составе нескольких специалистов или на профессорских обходах.

Очевидно, что эвристический подход к диагностике не должен рассматриваться в рамках обучения ИИ.

Дедукция — антипод эвристике — метод решения задач, основанный на выведении (дедукций) заключений из предпосылок, истинностное значение (вес) которых неизвестно, на тщательном переборе всех имеющихся фактов с последующим их сведением на основе правил в диагностическую гипотезу. Нужно тщательное изучение деталей, особенно тех, которые не вписываются в формируемую модель. Врач, понимающий необходимость дедуктивной проверки диагностической гипотезы, становится блестящим диагностом. Понимание врачами дедуктивного подхода в основном базируется не на научных данных о правилах принятия решений, а на интуитивных вещах, часто имеющих отношение к характерологическим особенностям врача, склонности его к тщательности и аккуратности.

Структура дедуктивного рассуждения обусловлена познавательной деятельностью человека с использованием эвристически-индукционных подходов, истолкования эмпирического материала, его классификации и систематизации, создания моделей и правил на основе причинно-следственных связей. Иначе говоря: эвристические, индуктивные методы отталкиваются от симптома и развивают древо решений от «главной жалобы», тогда как дедуктивные способы решения начинаются с результата (предполагаемого или имеющегося), перебирают максимально возможное число признаков с использованием формул решения, формируя набор версий, отбрасывая не жизнеспособные. Такими образом, дедукция и индукция неразрывно связаны между собой.

Модель организации дедуктивной диагностической системы включает:

1. Создание исходного свода терминов и утверждений. Нужно определить понятие «формулы». Формулой называют правила, по кото -рым дедуктивная система будет преобразовывать объекты. Для этого создается «алфавит», описание, идентифицирующее объекты.

2. Вторым этапом создаются правила построения формул: алгоритм построения из символов алфавита формул.

3. Совокупность утверждений (заключений, диагностических гипотез), получаемых из идентифицированных объектов путем применения формул.

При построении логической цепочки дедуктивной диагностической системы можно использовать иерархию гипотез, степень абстрактности и общности которых увеличивается по мере удаления от эмпирического базиса — окончательного диагноза. При этом на вершине такой пирамиды будут располагаться гипотезы, имеющие наиболее общий характер и обладающие наибольшей логической силой (например, боль в области сердца у мужчин в возрасте старше 50 лет вероятно связаны с ишемией миокарда). Из них, как из предпосылок, будут выводиться гипотезы более низкого — 2- и 3-го уровней (связь болевого синдрома с нагрузкой и влияние нитроглицерина). В основании пирамиды будут находиться гипотезы, которые можно сопоставить с эмпирическими данными или, в нашем случае, — с верифицированными диагнозами (стенокардия — боль в прекардиальной области, связанная с нагрузкой и быстро купирующаяся приемом нитроглицерина). Логическая диагностическая последовательность при этом будет носить название «древо решения»: от общего (вершина пирамиды) к частному (ее основание).

Сегодня все чаще и в диагностической теме употребляются термины медицины, основанной на доказательствах (МОД). В каждом доказательстве существует три уровня: тезис, аргументы (обоснования), демонстрация. Тезис — это суждение, истинность и приятие которого устанавливается в доказательстве, аргументы — суждения, из которых выводится тезис (алфавит), демонстрация — логическая форма связи (формула) первых двух элементов, обусловливающая необходимость выведения одного из другого, тезиса

из аргумента. Существуют правила в поисках доказательств.

Доказательство определением: четкое определение ключевых категорий, которые не вызывают сомнений относительно их адекватности реальным явлениям и практическому опыту. Например: быстрый эффект нитроглицерина характерен для стенокардии.

Доказательство от обратного: выдвигаются обоснования, демонстрирующие абсурдность противоположного доказываемому тезису. Сюда можно отнести и доказательство по принципу приведения к абсурду.

Доказательство, построенное на анализе свойств исследуемого объекта. Этот подход чаще других наблюдается в клинической практике: перебирая те или иные анализы, мы создаем диагностическую гипотезу. Доказательство на основе классификации факторов, позволяют установить свойства объекта.

Аксиоматическое доказательство основано на бесспорных, понятных и принятых положениях, от которых и строится доказательство. Здесь в медицине возникает огромная проблема «мнений». Этому противостоит фактологическое доказательство, в котором главную роль играют факты. Однако факты подвержены интерпретации, что снижает значение и этого подхода.

Доказательство на основе рабочей гипотезы или концепции (гипотетическое, концептуальное доказательство). К таким доказательствам скорее всего можно отнести эвристические подходы.

В экспериментальном доказательстве главная опора — эксперимент, который в современной медицине поставлен во главу угла того, что мы называем МОД. Возникает вопрос: если МОД — экспериментальное доказательство, то возможно ли доказать что-либо в медицине, используя другие методы доказательства? Да. Подавляющее большинство тезисов в медицине не имеет экспериментальных аргументов. Они просто никогда не подвергались экспериментальным проверкам, а сложились исторически. И, нередко, при объективных проверках оказываются ложными.

Вот лишь один такой пример. ЭКГ для диагностики инфаркта миокарда считается золотым стандартом, хотя ее статистические оценки — чувствительность и специфичность — крайне низки — от 30 до 40% [4]. Так, в исследовании

E. Trägardh показано, что чувствительность для выявления острого инфаркта миокарда с подъемом сегмента ST при лабораторной верификации диагноза у 479 пациентов составила 28% для ЭКГ с 12 отведениями, 33% для ЭКГ с 16 отведениями и 37% для ЭКГ с 24 отведениями. Специфичность составила 97, 93 и 95% соответственно. При таких показателях никакая новая диагностическая технология не получает сейчас право на жизнь [5]. О.И. Ларичев в эксперименте установил, что диагностика по ЭКГ улучшается в случае, если врач использует данные машинной обработки результатов тестирования лишь как подсказку и самостоятельно выносит решения относительно наблюдаемых изменений кривой: «машинный диагноз» был хуже врачебного [6].

Врачи учатся не на результатах статистики и не по результатам статистики собирают образ болезни. Не на основании статистики выработаны те или иные методы диагностики и лечения. Хотя на современном уровне они и оцениваются статистическими методами, но не всегда за ними остается последнее слово. Обучение врача идет не на примерах, а на обобщениях. Мы создаем образ болезни, образ больного человека. Простым перебором данных нельзя создать обучающий материал. Почему же возникают наивные предположения, что такая возможность существует: просто надо набрать побольше данных (Big Data), а их осмысление придет само собой.

Процесс обучения ИИ можно и, наверное, нужно, уподобить обучению растущего человека: сначала младенец фиксирует взор и начинает различать людей, вещи, затем он идентифицирует вещи и людей, начинает понимать речь, осваивает ползание и ходьбу и, наконец, начинает говорить. С этого момента начинается его предметное образование, которое одновременно двигается по двум направлениям — абстрактному (все, что вижу и слышу) и целенаправленному (грамматика, арифметика, физика и т.д.). С позиции ИИ кажется не нужным промежуточное целенаправленное обучение отвлеченным вопросам (биология, физика, химия) и уж тем более — не нужно абстрактное мышление. Это, на наш взгляд — глубочайшая ошибка.

В своем отчете JASON [7] отмечает, что ИИ должен в процессе обучения в качестве установленного стандарта медицинской помощи использовать надежную практику рецензируемых науч-

ных исследований и разработок, что может обеспечить защиту ИИ от ложных или не одобренных экспертами алгоритмов. Иными словами, в привычной нам фразеологии: необходимо использование для обучения ИИ результатов, полученных в ходе исследований — МОД. Ни о каких самостоятельных построениях и поисках закономерностей речь в данном случае не идет. Обучение ИИ, как и врача — из рук в руки с применением достоверных фактов.

Если внедряется новая технология диагностики с помощью ИИ, то она должна пройти исследования на чувствительность и специфичность, как это установлено для иных диагностических процедур и соответствующим образом валиди-рована по результатам тестов. С позиции обучения — это переобучение ИИ или... повышение квалификации. Последнее, как известно, процесс непрерывный, длящийся всю профессиональную жизнь.

На протяжении последних десятилетий в ИИ развивалось такое направление, как экспертные системы. Мы не случайно упоминаем этот подход, так как он критически важен в обсуждаемой нами проблеме обучения ИИ. Строго говоря, экспертные системы — разнородное понятие, не имеющие даже общего определения. Так, Вики-педия называет экспертной системой компьютерную систему, способную частично заменить специалиста-эксперта в разрешении проблемной ситуации [8]. Другое, но похожее определение находим на портале [9]: экспертные системы — это направление исследований в области искусственного интеллекта по созданию вычислительных систем, умеющих принимать решения, схожие с решениями экспертов в заданной предметной области.

В создании экспертных систем большую роль играют знания опытных специалистов. Экспертное знание — это теоретическое понимание проблемы и практические навыки ее решения, эффективность которых подтверждена практической деятельностью эксперта. Другими словами, экспертные системы создавались отдельными специалистами на основании собственных знаний и умений.

Наивные представления математиков-разработчиков экспертных систем состояли в том, что правильно выбранный эксперт и удачная формализация его знаний позволяют наделить экспертную систему уникальными и ценными знаниями.

Врач, считали они, хорошо диагностирует болезни и эффективно назначает лечение не потому, что он обладает некими врожденными способностями, а потому что имеет качественное медицинское образование и большой опыт в лечении своих пациентов. Ценность экспертной системы как законченного продукта определяется качеством созданной базы знаний. При разработке экспертных систем не предполагалось найти консенсус между различными мнениями и подходами. Более того, создателей обычно не интересовало, насколько адекватно эксперт понимает и воспроизводит свои «знания и умения». Эти ошибки первичного этапа — обучения экспертных систем — привели к тому, что за почти полвека своего существования ни одна из них так и не оказала какого-либо значимого влияния на медицинскую практику.

Вспомним систему MYCIN. В практическом здравоохранении она тоже не нашла применения, но до сих пор упоминается, хотя создана была в 1973—1974 гг. силами специалистов Стэнд-фордского университета [10]. Она включала базу знаний из примерно 600 правил, нацеленных на диагностику менингита, сепсиса и эмпирический подбор антибиотика. Программа задавала врачу большое число простых вопросов с ответами «да» или «нет». В результате, выдавала список возможных возбудителей с отметкой вероятности диагнозов заболеваний и их обоснование. Система предлагала адекватную терапию в 69% случаев — чаще, чем эксперты. Оказалось, что не все внешние эксперты были согласны со сделанным выбором, неясно было, что делать с этической точки зрения, если предложения программы были неверными. Немаловажно было и то, что на ответ на все вопросы системы врач должен был тратить не менее 30 мин. Врач работает существенно быстрее, при все-таки схожей эффективности в отношении эмпирического выбора антибактериальной терапии. Кроме того, в этот момент стало появляться огромное число новых антибиотиков, их нужно было вновь и вновь вносить в систему, устанавливать степень вероятности эффективности. Но система обучения системы MYCIN не была отработана. Эксперты заключают, что главной трудностью, с ко -торой столкнулись во время разработки MYCIN и последующих экспертных систем, было «извлечение» знаний из опыта людей-экспертов для формирования базы правил [11].

Обучение экспертных систем знаниям является большой проблемой и связано, чаще всего, со сложностью формализации эвристических знаний экспертов. Вопросами извлечения знаний у экспертов-врачей занималась с начала 70-х гг. ХХ века большая группа кибернетиков и врачей под общим руководством И.М. Гельфанда [12]. Ими была разработана и апробирована методология так называемых диагностических игр, в процессе которых формализовались критерии диагностики и прогноза заболевания, создавался адекватный язык, формулы или решающие правила. Эта работа, несмотря на ее актуальность и достаточно большое число публикаций, не оказала значимого влияния на развитие ИИ, хотя именно этот подход был и остается абсолютно революционным в выявлении профессиональных знаний врача, считающего свою работу искусством и интуицией.

Экспертные системы не могут предоставить осмысленные объяснения своих рассуждений, как это делает эксперт: экспертные системы описывают последовательность шагов в процессе поиска решения лишь в контексте, заложенным экспертом.

Экспертные системы не способны к самообучению: чтобы поддерживать экспертные системы в актуальном состоянии необходимо постоянное вмешательство в базу знаний инженеров знаний — учителей. Если экспертные системы, лишаются поддержки разработчиков, учителей, они быстро теряют свою актуальность.

В последние десятилетия основным инструментом доведения результатов клинических исследований до практиков является система клинических рекомендаций (КР). В России КР являются элементом системы стандартизации в здравоохранении [13]. В идеале в КР должны попадать медицинские технологии, прошедшие всестороннюю оценку с точки зрения выявления доказательств эффективности. Оценку медицинских технологий грубо можно разделить на 2 этапа: а) поиск доказательств и их ранжирование по степени качества исследований и б) обсуждение приемлемости для практического здравоохранения с точки зрения экспертов. Таким образом, КР являются квинтэссенцией понимания экспертами результатов клинических исследований.

Следовательно, основные источники знаний для обучения ИИ можно определить следующим

образом: а) формализованные знания врачей, выявленные с помощью диагностических игр и б) клинические рекомендации или иные стандартизованные ДОКУМЕНТЫ, созданные на основе осмысления результатов научных экспериментов и консенсуса специалистов.

В 2010 г. область ИИ была потрясена неожиданными успехами много лет разрабатываемой технологии многослойных нейронных сетей (NNs). Качественно новые решения произошли в результате синергичного влияния двух эволюционных изменений: а) быстрые аппаратные графические процессоры позволяют тренировать намного больше и особенно глубже (многослой-ность сетей) и б) большие маркированные наборы данных (изображения, веб-запросы, социальные сети и т.д.) могут быть использованы в качестве учебных «пособий». Эта комбинация породила парадигму глубокого обучения (Deep Learning, DL) в глубоких нейронных сетях (DNN), особенно с архитектурой, называемой «свертор-ные нейронные сети» (CNN)». Глубокое обучение использует многослойную систему фильтров для извлечения признаков, характеристик и создания формул. Каждый последующий слой получает на входе выходные данные предыдущего слоя. Напоминает это обучение студента-медика: общая патология с морфологией и патофизиологией, пропедевтика, факультетская и госпитальная терапия, интернатура, ординатура, непрерывное медицинское образование. При этом анализируемые признаки организованы в иерархическую систему, признаки верхнего уровня являются производными от признаков нижнего уровня. На каждом этапе предыдущие знания уточняются, юстируются, шлифуются в направлении от абстрактных общих представлений до конкретных заболеваний во всей их красе.

Система глубокого обучения может сочетать алгоритмы обучения с учителем и без учителя, при этом «анализом образца» называется обучение без учителя (самообучение), а «классификацией» — обучение с учителем. Надо напомнить, что классификации бывают иерархические, когда классифицируемые признаки находятся в четкой связи друг с другом (например, заболевания сердца, нарушения ритма и пароксизмальная фибрилляция предсердий), фасетные, когда определяющие классификационные признаки могут быть не связаны между собой (например, в

МКБ-11 пересмотра [14], которая в ближайшие годы вступит в действие наряду с разделами по системам органов есть раздел, связанный с диагнозами в традиционной медицине) и смешанные (на самом деле МКБ-11 — это типичный пример смешанной классификации, где встречают иерархические и фасеточные разделы).

Обучение с учителем подразумевает обучение на примерах, состоящих из пары «признак — результат». Подразумевается, что связь между этими параметрами априори не известна, но такие пары представляют из себя некоторую выборку, на которой, собственно, и проводится обучение. Задачей обучения с учителем является не столько поиск зависимости (причинно-следственные связи, этиология, патогенез, саногенез и т.д.), сколько отработка вероятности результата при наличии признака. Для повышения точности прогноза может быть использован эксперимент с обратной связью, когда точность прогноза постепенно увеличивается на фоне увеличения выборки. При этом аккумулятором получаемой информации может выступать как машина, так и человек — учитель.

В задачу нашей работы не входит подробное описание математического аппарата процессов глубокого обучения. Тем не менее, данный подход очень созвучен тому, что использовалось в медицине задолго до появления нейронных сетей. Приведем несколько примеров на основе создаваемой системы искусственного интеллекта МеБЮа8е. Система состоит из нескольких «слоев», каждый из которых подвергался в процессе создания «обучению»:

1. Опрос и маршрутизация пациента.

2. Синдромальная диагностика.

3. Вероятностное прогнозирование и дифференциальный диагноз.

4. Мониторинг заболевание, вторичная профилактика, терапия, реабилитация, оценка эффективности терапии (реабилитации).

5. Автоматизированный подбор лечебных и реабилитационных мероприятий.

На первом этапе (слое) решались следующие задачи: идентификация участников системы, защита персональных данных, интеграция со сторонними информационными системами, автоматизированное проведение опроса пациента, автоматизированное принятие решений о необходимости экстренного обращения за медицинс-

кой помощью, валидация вопросника, выбор математической модели системы (описание базовой платформы).

Следующий этап нацелен на формирование диагностических гипотез и их обоснований, вали-дацию диагностических гипотез. Этот этап обучения, например, потребовал предварительного анализа нормативно-правовой базы относительно периодических осмотров и диспансеризации в России, работы с широким пулом экспертов.

Третий слой должен характеризовать этап оценки и представление данных врачу о вероятности диагностической гипотезы в привязке к ее обоснованию, выделение групп диагнозов, требующих дифференциальной диагностики в ходе опроса конкретного пациента, сопоставление вероятности диагнозов в группе с формированием плана дополнительного обследования, формализации результатов дополнительных обследований и использование их в дифференциальной диагностике, оптимизации выбора технологий терапии, профилактики и реабилитации врачом (система поддержки принятия решений - СППР).

Очередной слой — это система телемедицинской оценки динамики здоровья пациента при определенном заболевании, формирование критериев и технологий оценки эффективности лечебных мероприятий и реабилитации для каждого заболевания, корректировка терапии врачом с использованием инструментов автоматизированной обработки данных (СППР) — (включающей клинические рекомендации, алгоритмы, справочники).

Новый уровень — это постепенный переход к автоматизированному ответственному самолечению.

Таким образом, мы уверены, что стандартизация источников, подходов и методов глубокого обучения интеллектуальных систем в здравоохранении окажет большое влияние на дальнейшее развитие телемедицинских технологий в ближайшее время.

ЛИТЕРАТУРА

1. Davis K., Stremikis K., Schoen C. and D. Squires, Mirror, Mirror on the Wall, 2014 Update: How the U.S. Health Care System Compares Internationally, The Commonwealth Fund, June 2014.

2. Record shares of Americans now own smartphones, have home broadband, A. SMITH, Pew Research Center 2017. Доступно по: http://www.pewresearch.org/fact-tank/2017/ 01/12/evolution-of-technology/. Ссылка активна на 01.07.2018.

3. Mobile Fact Sheet, Pew Research Center 2018. Доступно по: http://www.pewinternet.org/fact-sheet/mobile/. Ссылка активна на 01.07.2018.

4. База знаний по молекулярной и общей биологии человека (HUMBIO), ЭКГ при патологии: инфаркт миокарда. Доступно по: http://humbio.ru/humbio/har3/ 003eb22f.htm. Ссылка активна на 01.07.2018.

5. Detection of acute myocardial infarction using the 12-lead ECG plus inverted leads versus the 16-lead ECG (with additional posterior and right-sided chest electrodes). T^rdh E1, Claesson M, Wagner GS, Zhou S, Pahlm O. DOI: 10.1111/j. 1475-097X.2007.00761 .x PMID: 1794465.

6. Ларичев О.И., Мошкович Е.М. Качественные методы принятия решений. Вербальный анализ решений. М.: Наука. Изд. Фирма «Физ. мат. лит», 1996, 207 с.

7. JASON 2017, Perspectives on Research in Artificial Intelligence and Artificial General Intelligence Relevant to DoD. JSR-16-Task-003.

8. Экспертная оценка. Википедия. Доступно по: https:// ru.wikipedia.org/wiki/Экспертная_система. Ссылка активна на 01.07.2018.

9. Экспертные системы. Доступно по: http://www.aipor-tal.ru/articles/expert-systems/expert-systems.html. Ссылка активна на 01.07.2018.

10. Expert Systems in Business and Finance: Issues and Applications (John Wiley Series in Information Systems) John Wiley 390 pages 1993.

11. John McCarthy. Some expert system need common sense. Доступно по: http://www-formal.stanford.edu/jmc/some-need/someneed.html. Ссылка активна на 01.07.2018.

12. Гельфанд И.М., Розенфельд Б.И., Шифрин М.А. Очерки о совместной работе математиков и врачей. М., Наука, 1989. 272 с.

13. ГОСТ Р 56034—2014 Клинические рекомендации (протоколы лечения). Общие положения. Доступно по: http://base.garant.ru/71003526/. Ссылка активна на 01.07.2018.

14. Международная классификация болезней (МКБ-11). Доступно по: http://icd11.ru. Ссылка активна на 01.07.2018.

Поступила 22.06.2018 Принята к опубликованию 25.06.2018

REFERENCES

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

1. Davis K., Stremikis K., Schoen C. and D. Squires, Mirror, Mirror on the Wall, 2014 Update: How the U.S. Health Care System Compares Internationally, The Commonwealth Fund, June 2014.

2. Record shares of Americans now own smartphones, have home broadband, A. SMITH, Pew Research Center 2017. Available at: http://www.pewresearch.org/fact-tank/2017/ 01/12/evolution-of-technology/. Accessed on 01.07.2018.

3. Mobile Fact Sheet, Pew Research Center 2018. Available at: http://www.pewinternet.org/fact-sheet/mobile/. Accessed on 01.07.2018.

4. Baza znanij po molekulyarnoj i obshchej biologii chelove-ka (HUMBIO), EHKG pri patologii: infarkt miokarda. [Knowledge base on molecular and General human biology (HUMBIO), ECG in pathology: myocardial infarction]. Available at: http://humbio.ru/humbio/har3/003eb22f.htm. Accessed on 01.07.2018. (In Russian).

5. Detection of acute myocardial infarction using the 12-lead ECG plus inverted leads versus the 16-lead ECG (with additional posterior and right-sided chest electrodes). Trggerdh E1, Claesson M, Wagner GS, Zhou S, Pahlm O. DOI: 10.1111/j. 1475-097X.2007.00761 .x PMID: 1794465.

6. Larichev O.I., Moshkovich E.M. Kachestvennye metody prinyatiya reshenij. Verbal'nyj analiz reshenij [Quality methods in decision-making. Verbal Analysis of Solutions]. M.: Nauka. Izd. Firma «Fiz. mat. lit» [M.: Science. Physical and Mathematical Literature Publishers]. 1996, 207 page. (In Russian).

7. JASON 2017, Perspectives on Research in Artificial Intelligence and Artificial General Intelligence Relevant to DoD. JSR-16-Task-003.

8. Ehkspertnaya ocenka. Vikipediya [Expert Assessment. Wikipedia]. Available at: https://en.wikipedia.org/wiki/ Expert_system. Accessed on 01.07.2018 (In Russian).

9. Ehkspertnye sistemy [Expert systems]. Available at: http:// www.aiportal.ru/articles/expert-systems/expert-sys-tems.html. Accessed on 01.07.2018. (In Russian).

10. Expert Systems in Business and Finance: Issues and Applications (John Wiley Series in Information Systems) John Wiley 390 pages 1993.

11. John McCarthy. Some expert system need common sense. Available at: http://www-formal.stanford.edu/jmc/some-need/someneed.html. Accessed on 01.07.2018.

12. Gelfand I.M., Rosenfeld B.I., Shifrin M.A. Ocherki o sovmestnoj rabote matematikov i vrachej [Essays on Mathematicians' and Physicians' Joint Work]. M., Nauka [Moscow, Science]. 1989, 272 pages. (In Russian).

13. GOST R 56034—2014 Klinicheskie rekomendacii (pro-tokoly lecheniya). Obshchie polozheniya [GOST R 56034— 2014 Clinical Recommendations (treatment protocols) General Provisions]. Available at: http://base.garant.ru/ 71003526/. Accessed on 01.07.2018 (In Russian).

14. Mezhdunarodnaya klassifikaciya boleznej (MKB-11) [International Disease Classification (IDC-11)]. Available at: http://icd11.ru. Accessed on 01.07.2018 (In Russian).

Received on 22.06.2018 Accepted for publication on 25.06.2018

Сведения об авторах:

Воробьев Павел Андреевич — д-р мед. наук, профессор, Председатель Правления МГНОТ. 119048, Москва, Хамовничес-кий вал. Тел.: +7 (495) 225 83 74

Воробьев Андрей Павлович — аспирант ФГАОУ ВО ПМГМУ им. И.М. Сеченова Минздрава России (Сеченовский Универ -ситет); Президент МТП Ньюдиамед. 119991, Москва, ул. Большая Пироговская, дом 2, стр. 4. E-mail: [email protected]

About the authors:

Vorobiev Pavel A. — Doctor of Medical Sciences, professor, head of Moscow City Scientific Society of Primary Care Physicians, Khamovnicheskii val, Moscow, Russia 119048. Phn: + 7 (495) 225 83 74

Vorobiev Andrey P. — postgraduate student of the Federal State Autonomous Educational Institution of Higher Education «I.M. Sechenov First Moscow State Medical University» of the Ministry of Health of the Russian Federation (Sechenov University); President of the Medical and Technology Company «Newdiamed», 2, Bolshaya Pirogovskaya str., bldg 4, Moscow, Russia 119991. E-mail: [email protected]

HOW TO TEACH MEDICINE AN ARTIFICIAL INTELLIGENCE, OR CONSIDERATIONS ABOUT A NEW ROLE OF STANDARDIZATION

Reviewed here is the challenge of the deep learning of artificial intelligence systems, with the MeDiCase system used as an example. The problems related to teaching expert systems with associative learning, diagnostic games with the choice of a doctor's knowledge, issues of the heuristic and deductive approach in diagnostics are analyzed. The conclusion is made that clinical guidelines as an important element of standardizing the healthcare system can play a fundamental role in deep learning of intelligence systems in medicine.

Keywords: artificial intelligence, deep learning, expert systems, diagnostic games, associative learning, heuristics, deduction

Authors declare lack of the possible conflicts of interests.

Today we can observe a huge enthusiasm related to the promises of revolutionary advances of Artificial Intelligence (AI) in scientific and domestic field, including its use in medicine. A vast amount of enthusiasts are ready to raise the topic of artificial intelligence to the top priority of development of medicine in the near future. This is a new turn of romantic enthusiasm: the same picture was seen in 70—80s, again in the next 10—20 years. But in previous years nothing happened — and it is believed that, first of all, it was due to a lack of development of the technical base. But most likely, it was the very essence of medicine that played a key role (or — a cruel joke): it is still widely stated by medical experts that laws of medicine are different from laws of physics, chemistry and pharmacy, and it is impossible to describe them with graphs and formulas, mathematic models or calculate them using powerful processor.

Of course, there is some truth in these statements; but everything in medicine is based on probabilities and assumptions: there is almost nothing as accurate as not to be questioned and revised. Actually, this relativity is exactly the field of the AI development: it is believed that vast statistical arrays and machine learning which are the basis of AI development, the emergence of neural networks will lead to the outstanding achievements because of the patterns invisible for humans will be uncovered. Neural networks can work like the best clinicians, as evidenced by the first launched projects. But — only for solving certain range of diagnostic challenges. For the other challenges that look similar or even more simple as those above-mentioned, the AI does not work.

DOI: 10.26347/1607-2502201807-08028-034

P.A. Vorobiev1, A.P Vorobiev2'3

1 Moscow City Scientific Society of Primary Care Physicians, Moscow, Russia

2 Federal State Autonomous Educational Institution of Higher Education «I.M. Sechenov First Moscow State Medical University» of the Ministry of Health of the Russian Federation (Sechenov University), Moscow, Russia.

3 Medical and Technology Company «Newdiamed», Moscow, Russia

In this regard, it is logical to try to answer the most important question: is the current level of interest to AI just another period of sensation in excitation cycles that occurred earlier around this issue ? Or has something new appeared really, that makes the situation different? It is evident that today we have multiple times increased technical capacity for the implementation of information technologies. A lot of things make you look more closely at new promises of AI application: the potential impact of AI on health can be related to the social factors and challenges that can, this time, «make destiny» of AI in medicine. Synergistic effect of these three forces made the society establish the new health-oriented approaches that can be achieved through AI:

1) Disappointment in an outdated health management and high expectations of health care consumers is typical for the whole world, and not just for our country. This disappointment of different society focus groups is related to high costs and not so high quality of health care. As a result: The fundamental erosion of credibility opens the minds of people to new paradigms, tools, and services. We already witness the explosion of new technologies of the personal health control via the smart device and web networking platforms. The comprehensive surveillance, the accounting for numerous factors, which determines the environment where AI applications are being developed, emerges [1].

2) The omnipresence of network devices in our society, due to the prevalence of smartphones and tablet PCs: AI-based tools appear in the health-focused applications that can be used in pocket net-

work appliances with capacity exceeding that of personal computers available in the recent future [2].

3) Convenience of at-home services provided via network aggregators [3]: the trivial home shopping transformed almost instantly to diverse systems of purchase and sale of everything, dealings with the mobile bank, the State Services system etc.

Therefore, the society is ready to AI introduction in medicine. But neither are the physicians. The physicians' mistrust in AI introduction in medicine is obvious and is rooted in their actually primitive opinion on the uniqueness of the intuition-based diagnosis. When establishing a diagnosis, a physician does not often think about how he is doing that. However, the precise disease identification by an experienced clinical physician, the precise forecast of subsequent disease course often impress the patients. Some physicians establish the diagnosis while a patient walks the distance from the door to the GP's table.

Did it dawn on the physician? — The insight is called the heuristic decision-making or task-solving method, derived from that very Eureka! exclaimed by the ancient Greek whom everybody knows when jumping out of the bath. Heuristics is the problem solving in a highly uncertain environment, including medicine. Heuristic approaches are opposed to formal solution methods that rely on algorithms and models. Heuristic methods mean any methods aimed at reducing the search or inductive problem solving methods. Induction means that a detected, sometimes insignificant fact, like an inductor, triggers an enormous action that is incommensurate with the primary value of the inductor, with substantial effects.

Unfortunately, intuition is far from exact, and it should be validated by contemporary evidence based on statistical methods. The primary diagnostic hypothesis that crossed the doctor's mind accumulates an array of laboratory and instrumental studies and additional consultations immediately. This is the next step in the diagnostic search for heuristic or inductive methodology: the insight is followed by the search for arguments that support the hypothesis that dawned on the physician. Though unconscious, this search can have far-reaching consequences. A physician often looks for arguments in favor of his thoughts, persists and rejects all «counter» arguments as irrelevant, interfering with the correct decision. Letting alone the «unessential» signs not fit-

ting into, or not relevant to, the diagnostic hypothesis under discussion.

The heuristic approach in diagnostics shortens the problem-solving time as compared with the method of complete, non-targeted search for possible options; but the obtained solutions are not normally the best ones. Heuristic (inductive) methods do not always help achieve the goal. It is here that the essence of the dispute between so-called physicists (and now, cybernetics) and lyricists of medicine lies: is medicine a science or an art? If it is art then the heuristic approaches in it are justified because it is the least time-consuming, being a working tool of clinical thinking that connects thought and action into a single process. If it is science, then there is not much room for insight.

Heuristic methods are the most promising in recurrent situations when a physician reproduces models of what he has already seen before. If it concerns recurrent diseases a physician comes across several times a day, the model can be rather precise. For instance, respiratory infections or arterial hypertension. Or else an option or a model of a certain disease is synthesized by a physician based on the studies of reference literature, during studies. As it is a frequent situation, then the diagnostic hypothesis is usually correct. But there is always a small percentage of «diagnostic mistakes». The physician always forecasts the most probable scenario. For instance, in the winter and spring season, a young man who came to a physician is very likely to suffer from a viral respiratory infection. It's not even necessary to examine him. But a patient with acute leukemia may come to a physician with the same complaints — though very

rarely, maybe one in 5,000 to 10,000 patients, And

a physician who makes use of heuristic approaches to diagnosis may establish a wrong diagnosis, which is likely to lead to the patient's death. But he will be right in the remaining thousands and thousands of cases!

Heuristic approaches are used in collective decision-making. They were called «Socratic conversations», historically dating back to Socrates. This methodology consists in posing inducing and specifying questions, giving examples and counter-examples. The most unexpected proposed solutions are originally tantamount and should be discussed. A collective method of addressing difficult problems is known as brainstorm and is actually used in

a multidisciplinary team meeting comprising several professionals or in medical rounds.

The heuristic approach to diagnosis should not, obviously, be considered as part of AI training.

Deduction, as opposed to heuristics, is a task-solving method that relies on deducing the conclusions from the prerequisites with unknown value (weight), on thorough search for all available facts, with their subsequent rule-based narrowing-down to a diagnostic hypothesis. We need careful investigation of details, especially those not fitting into the model under formation. The doctor who understands the need for deductivity check of the diagnostic hypothesis, becomes an excellent diagnostician. Understanding the deductive approach by physicians is generally based on intuitive things (often related to the characterological traits of the doctor, his tendency to thoroughness and accuracy) rather than scientific data and decision rules.

The structure of deductive reasoning is caused by the cognitive activity of a person using heuristic-inductive approaches, interpretations of empiric material, his/her classification and taxonomy, creation of models and rules based on cause-effect relationships. In other words heuristic, inductive methods originate from a symptom and develop the decision tree from «the main complaint», whereas deductive solutions originate from the result (presumed or present), sort out the maximum possible number of signs using decision formulas and form a set of versions, discarding unviable ones. Thus, deduction and induction are closely connected.

The deductive diagnostic system model includes:

1) Creation of the initial set of terms and statements. The formula notion should be defined. The formula is the rules, based on which the deductive system will transform objects. For this purpose, the alphabet, i.e. the description that identifies the objects, is created.

2) The formula-building rules are created in the second stage: the algorithm for building-up formulas from alphabetic symbols.

3) A set of statements (conclusions, diagnostic hypotheses) derived from the identified objects by applying formulas.

When constructing a logical chain of a deductive diagnostic system, one can use the hierarchy of hypotheses with increasing degree of abstraction and generality progressively increasing when moving away from the empirical basis, i.e. the final diagno-

sis. Therefore, the top of this pyramid will include the most general hypotheses with the maximum logic strength (for example, chest pain in a male over the age of 50 is probably related to myocardial ischemia). The hypotheses of lower, 2nd and 3rd levels (connection between the pain syndrome and the load and nitroglycerine effects), will be derived from them as from the prerequisites. The hypotheses that can be compared with empirical data or, in our case, with verified diagnoses (angina pectoris — pains in pericardium region, which are associated with physical exercise and are quickly eliminated by nitroglycerine administration) will lie in the pyramid's foundation. The logical diagnostic sequence in this case will be called the «decision-making tree»: from the general (the pyramid's top) to the particular (its foundation).

Nowadays, the evidenced-based medicine (EBM) terms are more frequently used in the diagnostic topic, too. Each evidence has three levels: thesis, arguments (substantiation), demonstration. The thesis is a judgment, the truth and acceptance of which is established in proof; arguments are judgments a thesis is derived from (alphabet); demonstration is a logical form of connection (formula) of the first two elements, which determines the need to derive one from the other, a thesis from an argument. There are rules in the search for proofs.

Proof by definition: a precise definition of the key categories that do not give rise to doubts as to their adequacy to real phenomena and hands-on experience e.g.: nitroglycerine produces quick effect in angina pectoris.

Proof from the contrary: the substantiations that demonstrate the absurdity of what is contrary to the thesis being proved are put forward. The proof based in the principle of leading to absurdity can be included here.

Proof based on the analysis of properties of the object being studied. This approach is seen more frequently than others in clinical practice: by comparing some particular analyses we create a diagnostic hypothesis. Proof based on factor classification enables to establish the object's properties.

An axiomatic proof is based on indisputable, understandable and accepted provisions the proof is based on. A huge problem of «opinions» arises in medicine here. This problem is opposed by the factual proof where facts are the most important thing.

However, facts depend on interpretation, which reduces the significance of this approach.

Working hypothesis or concept-based proof (hypothetic, conceptual proof). Heuristic approaches are likely to be regarded as such proofs.

The main support in the experimental proof is the experiment the contemporary medicine focuses on, which we call EBM. The question arises: if EBM is experimental proof, is it possible to prove anything in medicine by using other proving methods? Yes, it is. The overwhelming majority of theses in medicine is not supported with experimental arguments. They have taken shape historically and have never been subjected to experimental proofs. And they often prove to be false, in case of unbiased checks.

Here is just one such example. ECG is a golden standard for a myocardial infarction diagnosis, though its statistical estimates, such as sensitivity and specificity, are extremely low and range from 30% to 40% [4]. For instance, E.Tragardh showed in his studies that the sensitivity for acute myocardial infarction detection, with ST segment upsurge in case of diagnosis laboratory verification in 479 patients came to 28% for ECG with 12 derivations, and 33% for ECG with 16 derivations and 37% for ECG with 24 derivations. Specificity came to 97%, 93%, and 95%, respectively. With these indicators, no new diagnostic technology will get the right to life [5]. O.I. Larichev in his experiment established that ECG-based diagnosis improves if a physician uses the computer processing of test findings as a prompt and makes decisions on the observed changes in the curve on his/her own: the «computer diagnosis» was worse than that of a physician [6]

Physicians neither learn nor compile the disease image from the statistics findings. Some particular diagnosis and treatment methods were elaborated other than on statistics. Even though they are estimated using statistic methods at the current level, but they are not always the most influential ones. A physician is trained on generalizations rather than on examples. We create the image of a disease, of a sick person. Training documents cannot be created by simple data search. Why do the naive assumptions of such possibility arise? one should simply collect as much data as possible (Big Data) and will realize them organically.

AI training process can and, perhaps, should be compared with training of a growing-up person: at first, the baby fixes its vision and begins distinguish-

ing persons and things; they it identifies things and persons, begins to understand speech; learns to crawl and to walk and, finally, begins speaking. Since then, its substantive education starts and moves in two directions simultaneously: the abstract one (everything I can see and hear) and the targeted one (grammer, arithmetics, physics, etc.) From AI point of view, the intermediate targeted training in abstract matters (biology, physics, chemistry) seems to be unnecessary — and abstract thinking seems even less so. It is the gravest mistake, in our opinion.

Jason [7] notes in his report that AI should make use of the reliable practice of reviewable scientific research and development during training as a well-established medical care standard, which may protect AI from false algorithms or those not approved by experts. In other words, in the phraseology we are accustomed to: The findings derived during EBM research should be used for AI training. In this case, we are not speaking about any independent formulations and searches for laws. AI training like physician traning — from hands to hands, using reliable facts.

If a new AI-based diagnostic technology is introduced, it should be tested for sensitivity and specificity, as established for other diagnostic procedures, and duly validated based on the test findings. From the training standpoint, it is re-training in AI or... further training. The latter is known to be a continuous process that lasts the entire professional life.

Such area as expert systems has developed in AI during recent decades. It is for a good reason that we mention this approach because it is critical for the AI training problem we are discussing. Strictly speaking, expert systems are a heterogeneous notion not even having a common definition. For instance, Wikipedia calls a computer system that is capable of substituting an expert in addressing a problem the expert system [8] Another but similar definition is available in the portal [9]: expert systemsis the area of artificial intelligence studies to create computing systems that are capable of making similar decisions to those made by experts in this subject area.

Experienced specialists' knowledge is significant in creation of expert systems. Expert knowledge is a theoretical understanding of the problem and practical skills in its resolution, the efficiency of which is validated by an expert's practical activities. In other words, the expert systems were created by individual

professionals based on their own knowledge and skills.

Naive perceptions of the mathematicians who developed the expert systems were that a correctly chosen expert and successful formalization of his/her knowledge allow to include unique and valuable knowledge to the expert system. They believed a physician is good at establishing diagnoses and at prescribing treatment because he/she has earned quality medical education and extensive experience in treating his/her patients, and not because of some congenital abilities. The value of an expert system as of a finished product is determined by the quality of the created knowledge base. The expert system development was not intended to finding the consensus between different opinions and approaches. Moreover, the developers were rarely interested in how adequately the expert understood and reproduced his «knowledge and skills». These errors of the primary stage, teaching the expert systems, resulted in the lack of any significant influence by any of the systems on medical practice during almost half a century in existence.

Let's remember MYCIN system. It was not implemented in practical healthcare but is still mentioned, even though it was created as late as in 1973/1947 by the Stanford University's professionals [10]. It included the knowledge base of approx. 600 rules intended to diagnose meningitis, sepsis and to select an antibiotic empirically. The system asked a lot of simple questions requiring positive or negative answers to a physician. As a result, the system yielded a list of possible infectious agents, noting the probability of disease diagnoses and their substantiation. The system proposed adequate treatment in 69% cases: more frequently than experts. Not all external experts proved to agree with the chose made; what to do if the system's proposals were wrong was unclear, in ethical terms. Of equal importance was that a physician was supposed to spend not less than 30 minutes to answer all questions posed by the system. A physician works much quicker where the efficiency of empirical choice of anti-bacterial treatment is similar. Moreover, numerous new antibiotics have emerged; they had to be re-entered into the system, and the efficiency probability was to be established. But MYCIN training system needed improvements. Experts believe that «extraction» of knowledge from experts' experience for shaping the base of rules was the key difficulty they

came across in elaborating MYCIN and subsequent expert systems [11].

Knowledge training of expert systems is challenging, largely due to the difficulties with formalization of experts' heuristic knowledge. Knowledge extraction from expert physicians has been studied by a large group of cibernetics and physicians under the overall guidance of I.M. Gelfand since the 1970's [12]. They have elaborated and proved the methodology of so-called diagnostic games that formalized the disease diagnosis and forecast criteria, created an adequate language, formulas or decisive rules. This work, though relevant and entailing rather numerous publications, did not have any significant impact on AI development; even though this approach used to be and is 100% revolutionary in identification of professional knowledge of a physician whoregards his/her work as art and intuition.

Expert systems cannot provide well-substantiated explanations of their discussions, as an expert can: expert systems describe the sequence of steps in the search for a decision in the context laid down by an expert.

Expert systems are unable of self-training: updating the expert systems takes constant involvement of the knowledge engineers, tutors, into the knowledge base. If the expert systems are devoid of support by developers and tutors, they become obsolete quickly.

The clinical recommendation (CR) system has become the tool for communicating the clinical trial findings to practicians. CR system is an element of the healthcare standardization system in Russia [13]. In theory, the medical technologies that underwent the comprehensive assessment, in terms of proof efficiency determination, should get into CR system. The medical technologies can be assessed into 2 stages: (a) search for proofs and their ranking by the study quality degree, and (b) discussion of their suitability for practical healthcare from the experts' point of view. Thus, CR system is a distillation of the experts' understanding of clinical trial findings.

Consequently, the main knowledge sources for AI training may be determined as follows: (a) physicians' formalized knowledge found using diagnostic games, and (b) clinical recommendations or other standardized DOCUMENTS created by interpreting the scientific experiment findings and the specialists' consensus.

In 2010, AI sector was shaken by the unexpected progress in the multi-layer neuron networks (NNs)

that had been developed for several years already then. The entirely new solutions resulted from the synergy of two evolutionary changes: (a) quick hardware-based graphic processor enable to train much more and, in particular, deeper (multi-layer networks), and (b) large labelled data sets (images, web inquiries, social media etc.) can be used as tutorials. This combination engendered the paradigm of deep leaning (DL) in deep neuron networks (DNN), in particular, with the architecture called convolutional neuron networks (CNN). Deep Learning makes use of a multi-layer system of filters to extract features, parameters and derive formulas. Each subsequent layer receives the previous layer output at input. It resembles a medical student's training: general pathology with morphology and pathophysiology, pro-pedeutics, faculty and hospital treatment, internship, residency training, continuous medical education. The attributes being analyzed are arranged as an hierarchical system; the upper-level attributes are derived from the lower-level ones. At each stage, previous knowledge is specified, adjusted, fine tuned in the direction from abstract general ideas to some particular diseases, at their best.

The deep learning system can combine the algorithms of training with and without a tutor; with the «sample analysis» being training without a tutor (self-training), and the classification, training with a tutor. We should remind that classification can be hierarchic, when the attributes being classified are clearly connected with each other (e.g. heart diseases, rhythm disorder and paroxismal atrium fibrillation); facet-based, when the determining classification attributes are not necessarily inter-connected (e.g. IDC-11 of the version [14] that will take effect in the near future contains a section relating to the conventional medicine diagnoses, in addition to the sections devoted on organ systems), and mixed ones (in fact, IDC-11 is a typical example of a mixed classification where hierarchical and facet-based sections are come across).

A training with a tutor means training on examples consisting of the «attribute — result» pair. The association between these parameters is assumed to be unknown in the first place, but such pairs represent a certain sampling on which training is based. Training with tutor is intended not only to look for the dependence (causal relationships, genesis, pathogenic mechanisms, sanogenesis etc.) but rather to rework the outcome probability if the attribute ex-

ists. To improve the forecast precision, experiment with feedback can be used, when the forecast precision increases gradually as the sampling expands. The information can be accumulated by both a computer and a person, the tutor.

A detailed description of the mathematical tools of deep learning processes is beyond the scope of our work. Nonetheless, this approach is in harmony with the one that has been used in medicine long before the neuron networks appeared. Let's give several examples based on MeDiCase, an AI-based system being created. The system comprises several 'layers», each undergoing «training» during the creation process:

1) Patient inquiry and routing.

2) Syndrome diagnosis.

3) Probability forecasting and differential diagnosis.

4) Disease monitoring, secondary prevention, treatment, resuscitation, treatment/resuscitation efficiency assessment.

5) Automatic selection of treatment and resuscitation efforts.

At the first stage (layer), the following objectives were met: identification of the system participants; personal data protection; integration with third parties' information systems; automated inquiry of a patient; automated decision-making on the need for emergency medical care; questionnaire validation; selection of the mathematical model of the system (base platform description).

The next stage is intended to shape diagnostic hypotheses and their substantiations, and validation of diagnostic hypotheses. For instance, this training stage necessitated a preliminary analysis of the legal framework on regular examinations and annual physical exam, dealings with a broad pool of experts.

The third layer is supposed to describe the stage of assessment and presentation to a physician of the data on the diagnostic hypothesis probability as linked to its substantiation; singling out of the diagnosis groups that require differential diagnosis during the inquiry of some particular patient; comparison of the diagnosis probability in the group with drafting of the additional examination plan, documentation of additional examination findings and their use for differential diagnosis, refinement of the physician's selection of treatment, prevention and resuscitation technologies (decision-making support system, DMSS).

The next layer is the system of telemedical health assessment for patients with certain disease, formation of criteria and methods for effectiveness evaluation of treatment and rehabilitation for different diseases, therapy adjustment by physician using automated data processing tools (ADPT) — (including clinical guidelines, algorithms and reference books).

The new level is a gradual transition to automated, responsible self-treatment.

Thus, we are sure that the standardization of sources of, approaches to and methods of deep learning of intellectual systems in healthcare will have strong implications for future development of the telemedical technologies in the near future.

REFERENCES

1. Davis K., Stremikis K., Schoen C. and D. Squires, Mirror, Mirror on the Wall, 2014 Update: How the U.S. Health Care System Compares Internationally, The Commonwealth Fund, June 2014.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

2. Record shares of Americans now own smartphones, have home broadband, A. SMITH, Pew Research Center 2017. Available at: http://www.pewresearch.org/fact-tank/2017/ 01/12/evolution-of-technology/. Accessed on 01.07.2018.

3. Mobile Fact Sheet, Pew Research Center 2018. Available at: http://www.pewinternet.org/fact-sheet/mobile/. Accessed on 01.07.2018.

4. Baza znanij po molekulyarnoj i obshchej biologii chelove-ka (HUMBIO), EHKG pri patologii: infarkt miokarda. [Knowledge base on molecular and General human biology (HUMBIO), ECG in pathology: myocardial infarction].

Available at: http://humbio.ru/humbio/har3/003eb22f.htm. Accessed on 01.07.2018. (In Russian).

5. Detection of acute myocardial infarction using the 12-lead ECG plus inverted leads versus the 16-lead ECG (with additional posterior and right-sided chest electrodes). Trggerdh E1, Claesson M, Wagner GS, Zhou S, Pahlm O. DOI: 10.1111/j. 1475-097X.2007.00761 .x PMID: 1794465.

6. Larichev O.I., Moshkovich E.M. Quality methods in decision-making. Verbal Analysis of Solutions. Moscow: Science. Physical and Mathematical Literature Publishers, 1996, 207 page.

7. JASON 2017, Perspectives on Research in Artificial Intelligence and Artificial General Intelligence Relevant to DoD. JSR-16-Task-003.

8. Expert Assessment. Wikipedia. Available at: https:// en.wikipedia.org/wiki/Expert_system. Accessed on 01.07.2018.

9. Expert systems. Available at: http://www.aiportal.ru/arti-cles/expert-systems/expert-systems.html. Accessed on 01.07.2018.

10. Expert Systems in Business and Finance: Issues and Applications (John Wiley Series in Information Systems) John Wiley 390 pages 1993.

11. John McCarthy. Some expert system need common sense. Available at: http://www-formal.stanford.edu/jmc/some-need/someneed.html. Accessed on 01.07.2018.

12. Gelfand I.M., Rosenfeld B.I., Shifrin M.A. Essays on Mathematicians' and Physicians' Joint Work. Moscow, Science, 1989, 272 pages

13. GOST R 56034—2014 Clinical Recommendations (treatment protocols) General Provisions Available at: http:// base.garant.ru/71003526/. Accessed on 01.07.2018. 14.International Disease Classification (IDC-11). Available at: http://icd11.ru. Accessed on 01.07.2018.

Received on 22.06.2018 Accepted for publication on 25.06.2018

About the authors:

Vorobiev Pavel Andreevich — Doctor of Medical Sciences, professor, head of Moscow City Scientific Society of Primary Care Physicians, Khamovnicheskii val, Moscow, Russia 119048. Phn: +7 (495) 225 83 74

Vorobiev Andrey Pavlovich — postgraduate student of the Federal State Autonomous Educational Institution of Higher Education «I.M. Sechenov First Moscow State Medical University» of the Ministry of Health of the Russian Federation (Sechenov University); President of the Medical and Technology Company «Newdiamed», 2, Bolshaya Pirogovskaya str., bldg 4, Moscow, Russia 119991. E-mail: [email protected]

Аннотация научной статьи по компьютерным и информационным наукам, автор научной работы — Воробьев Павел Андреевич, Воробьев Андрей Павлович

Похожие темы научных работ по компьютерным и информационным наукам , автор научной работы — Воробьев Павел Андреевич, Воробьев Андрей Павлович

HOW TO TEACH MEDICINE AN ARTIFICIAL INTELLIGENCE, OR CONSIDERATIONS ABOUT A NEW ROLE OF STANDARDIZATION

Текст научной работы на тему «Как обучить искусственный интеллект медицине или размышления о новой роли стандартизации»