RU2412463C2

RU2412463C2 - Mobile communication terminal and menu navigation method for said terminal

Info

Publication number: RU2412463C2
Application number: RU2008126780/08A
Authority: RU
Inventors: Дзонг-Кеун ЙОУН (KR); Дзонг-Кеун ЙОУН; Дае-Сунг ДЗУНГ (KR); Дае-Сунг ДЗУНГ; Дзае-Хоон Ю (KR); Дзае-Хоон Ю; Тае-Дзун КИМ (KR); Тае-Дзун КИМ; Дзае-Мин ДЗОХ (KR); Дзае-Мин ДЗОХ; Дзае-До КВАК (KR); Дзае-До КВАК; Дзонг-Хо Шин (KR); Дзонг-Хо Шин
Original assignee: ЭлДжи ЭЛЕКТРОНИКС ИНК.
Priority date: 2008-04-08
Filing date: 2008-07-01
Publication date: 2011-02-20
Also published as: RU2008126780A

Abstract

FIELD: information technology.

SUBSTANCE: mobile communication terminal has an input apparatus configured to receive input data for activating speech recognition functions in the mobile communication terminal, and memory configured to store a plurality of areas associated with the menu and with operating modes of the mobile communication terminal. Said terminal also includes a computer which is configured to access specific areas from the plurality of areas in the memory, based on received input data for activating speech recognition functions, recognise user speech based on the language model and the acoustic model of the area which has been accessed, and determine at least one menu and one operating mode for the mobile communication terminal based on the specific area which has been accessed, and the recognised user speech.

EFFECT: improved user speech recognition.

26 cl, 22 dwg

Description

Настоящее изобретение относится к терминалу мобильной связи и к соответствующему способу, способному обеспечивать увеличение процента правильно распознанных слов при распознавании речи путем задания области для распознавания речи в виде информации, связанной с конкретными меню или услугами.The present invention relates to a mobile communication terminal and to a corresponding method capable of providing an increase in the percentage of correctly recognized words in speech recognition by defining an area for speech recognition in the form of information associated with specific menus or services.

Уровень техникиState of the art

В настоящее время терминалы мобильной связи предоставляют множество дополнительных услуг помимо базовых услуг телефонной связи. Например, в настоящее время абонент может осуществлять доступ к сети Интернет, играть в игры, смотреть видеофильмы, слушать музыку, производить фото- и видеосъемку, запись звуковых файлов и т.д. В настоящее время терминалы мобильной связи также обеспечивают прием вещательных программ, поэтому абонент может смотреть телевизионные шоу, спортивные программы, видеофильмы и т.д.Currently, mobile communication terminals provide many additional services in addition to basic telephone services. For example, at present, a subscriber can access the Internet, play games, watch videos, listen to music, take photos and videos, record audio files, etc. Currently, mobile communication terminals also receive broadcast programs, so the subscriber can watch television shows, sports programs, videos, etc.

Кроме того, поскольку произошло существенное расширение функций, содержащихся в терминалах мобильной связи, то интерфейсы пользователя также стали более сложными. Например, интерфейсы пользователя теперь содержат сенсорные экраны, предоставляющие пользователю возможность прикасаться к экрану и выбирать конкретный элемент или пункт меню. Терминалы мобильной связи также содержат очень ограниченные функции распознавания речи, которые предоставляют пользователю возможность выполнять элементарные функции. Однако, частота появления ошибок при определении смысла речевой команды пользователя является слишком высокой, и поэтому пользователи, как правило, не используют ограниченные функциональные возможности распознавания речи, имеющиеся в терминале.In addition, since there has been a significant expansion of the functions contained in the mobile communication terminals, the user interfaces have also become more complex. For example, user interfaces now include touch screens, giving the user the ability to touch the screen and select a specific item or menu item. Mobile communication terminals also contain very limited speech recognition functions that provide the user with the ability to perform basic functions. However, the frequency of occurrence of errors in determining the meaning of a user's speech command is too high, and therefore users, as a rule, do not use the limited speech recognition functionality available in the terminal.

РАСКРЫТИЕ ИЗОБРЕТЕНИЯSUMMARY OF THE INVENTION

Соответственно, одной из задач настоящего изобретения является решение вышеупомянутых и других проблем.Accordingly, one of the objectives of the present invention is to solve the above and other problems.

Другой задачей настоящего изобретения является создание терминала мобильной связи и соответствующего способа, обеспечивающего управление меню, связанными с конкретными его функциями или услугами, путем распознавания смысла речевой команды на основании контекста и содержания.Another objective of the present invention is to provide a mobile communication terminal and a corresponding method for managing menus associated with its specific functions or services by recognizing the meaning of a speech command based on context and content.

Еще одной задачей настоящего изобретения является создание терминала мобильной связи и соответствующего способа, обеспечивающего значительное увеличение процента правильно распознанных слов при распознавании речи, путем задания области для распознавания речи в виде области, связанной с конкретными меню или услугам.Another objective of the present invention is to provide a mobile communication terminal and a corresponding method that provides a significant increase in the percentage of correctly recognized words in speech recognition by defining an area for speech recognition in the form of an area associated with specific menus or services.

Еще одной задачей настоящего изобретения является создание терминала мобильной связи и соответствующего способа управления меню, связанными с конкретными функциями или услугами, путем применения одного или большего количества имеющихся в нем интерфейсов пользователя (ИП), при этом, функцию распознавания речи приводят в действие таким образом, что обнаруживают манипуляции пользователя.Another objective of the present invention is to provide a mobile communication terminal and a corresponding method for managing menus associated with specific functions or services by applying one or more user interfaces (UIs) therein, while the speech recognition function is activated in this way what user manipulation reveals.

Еще одной задачей настоящего изобретения является создание терминала мобильной связи и соответствующего способа, обеспечивающего возможность управления меню, связанными с конкретными функциями или услугами, даже неопытным пользователем посредством его или ее речевой команды, путем предоставления справочной информации о вводе речевой команды в соответствии с рабочим состоянием или с режимом работы этого терминала мобильной связи.Another objective of the present invention is the creation of a mobile communication terminal and a corresponding method that provides the ability to control menus associated with specific functions or services, even by an inexperienced user, by his or her voice command, by providing reference information about entering a voice command in accordance with the operating state or with the operating mode of this mobile communication terminal.

Для достижения этих и других преимуществ и в соответствии с целью настоящего изобретения, осуществленной и описанной здесь в общих чертах, согласно одному из объектов настоящего изобретения, в нем предложен терминал мобильной связи, содержащий устройство ввода, выполненное таким образом, что оно способно принимать введенные данные для активации функции распознавания речи в терминале мобильной связи, запоминающее устройство, выполненное таким образом, что оно способно запоминать множество областей, связанных с меню и с режимами работы терминала мобильной связи, и контроллер, сконфигурированный таким образом, что он способен осуществлять доступ к конкретной области из множества областей, содержащихся в запоминающем устройстве, на основании принятых введенных данных для активации функции распознавания речи, распознавать речь пользователя на основании модели языка и акустической модели области, к которой осуществлен доступ, и определять, по меньшей мере, одно меню и один режим работы терминала мобильной связи на основании конкретной области, к которой осуществлен доступ, и распознанной речи пользователя.To achieve these and other advantages, and in accordance with the purpose of the present invention, implemented and described here in General terms, according to one of the objects of the present invention, it proposes a mobile communication terminal containing an input device made in such a way that it is able to receive entered data for activating a speech recognition function in a mobile communication terminal, a storage device configured in such a way that it is able to memorize a plurality of areas related to menus and operating modes a mobile communication terminal, and a controller configured in such a way that it is able to access a specific area from a plurality of areas contained in the storage device, based on the received input data for activating a speech recognition function, to recognize a user's speech based on a language model and an acoustic area model accessed and determine at least one menu and one operating mode of the mobile communication terminal based on the specific area to which access is made n, and recognizes the user's speech.

Согласно другому объекту настоящего изобретения в нем предложен способ управления терминалом мобильной связи. Способ содержит следующие операции: принимают введенные данные для активации функции распознавания речи в терминале мобильной связи, осуществляют доступ к конкретной области из запомненного множества областей, содержащихся в запоминающем устройстве терминала мобильной связи, на основании принятых введенных данных для активации функции распознавания речи, распознают речь пользователя на основании модели языка и акустической модели области, к которой осуществлен доступ, и производят вывод, по меньшей мере, одного меню и режима работы терминала мобильной связи на основании конкретной области, к которой осуществлен доступ, и распознанной речи пользователя.According to another aspect of the present invention, there is provided a method for controlling a mobile communication terminal. The method comprises the following operations: receiving input data for activating a speech recognition function in a mobile communication terminal, accessing a specific area from a stored set of areas contained in a storage device of a mobile communication terminal, based on received input data for activating a speech recognition function, recognizing a user's speech based on the language model and the acoustic model of the accessed area, and at least one menu and an operating mode are output mobile erminala based on a particular field to which accessed, and recognizes the user's speech.

Дальнейший объем применимости настоящего изобретения станет очевидным из приведенного ниже подробного описания. Однако следует понимать, что подробное описание и конкретные примеры, несмотря на указание предпочтительных вариантов осуществления настоящего изобретения, приведены исключительно в иллюстративных целях, так как для специалистов в данной области техники из этого подробного описания станет очевидной возможность различных изменений и модификаций, не выходящих за пределы сущности и объема патентных притязаний настоящего изобретения.A further scope of applicability of the present invention will become apparent from the following detailed description. However, it should be understood that the detailed description and specific examples, despite indicating preferred embodiments of the present invention, are provided for illustrative purposes only, as for those skilled in the art from this detailed description, the possibility of various changes and modifications without departing from the scope will be apparent. the nature and scope of patent claims of the present invention.

КРАТКОЕ ОПИСАНИЕ ЧЕРТЕЖЕЙBRIEF DESCRIPTION OF THE DRAWINGS

Настоящее изобретение станет более понятным из приведенного ниже подробного описания и из сопроводительных чертежей, которые приведены исключительно в иллюстративных целях и, следовательно, не ограничивают настоящее изобретение, и на которых изображено следующее:The present invention will become more apparent from the following detailed description and from the accompanying drawings, which are given for illustrative purposes only and, therefore, do not limit the present invention, and which depict the following:

на Фиг. 1 изображена блок-схема терминала мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 1 is a block diagram of a mobile communication terminal according to an embodiment of the present invention;

на Фиг. 2 на виде спереди в перспективе изображен терминал мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 2 is a front perspective view illustrating a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 3 на виде сзади в перспективе изображен терминал мобильной связи, показанный на Фиг. 2;in FIG. 3 is a rear perspective view illustrating the mobile communication terminal shown in FIG. 2;

на Фиг. 4 изображен общий вид системы связи, способной работать с терминалом мобильной связи из настоящего изобретения;in FIG. 4 is a perspective view of a communication system capable of operating with a mobile communication terminal of the present invention;

на Фиг. 5 изображена схема последовательности операций, на которой проиллюстрирован способ управления меню для терминала мобильной связи посредством речевой команды согласно одному из вариантов осуществления настоящего изобретения;in FIG. 5 is a flowchart illustrating a menu control method for a mobile communication terminal by means of a voice command according to one embodiment of the present invention;

на Фиг. 6A в общем виде показан способ активации функции распознавания речи для терминала мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 6A generally shows a method for activating a speech recognition function for a mobile communication terminal according to one embodiment of the present invention;

на чертежах Фиг. 6B и Фиг. 6C в общем виде показан способ вывода справочной информации в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in the drawings of FIG. 6B and FIG. 6C is a perspective view illustrating a method for outputting help information in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 7A изображена схема последовательности операций, на которой показан способ распознавания речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 7A is a flowchart showing a method for recognizing a voice command in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 7Б в общем виде показан способ распознавания речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 7B is a perspective view illustrating a voice command recognition method in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 8 в общем виде показан способ отображения меню в соответствии с процентом правильно распознанных слов при распознавании речи в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 8 is a perspective view illustrating a menu display method in accordance with the percentage of correctly recognized words in speech recognition in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 9 в общем виде показан способ распознавания речевой команды в терминале мобильной связи согласно другому варианту осуществления настоящего изобретения;in FIG. 9 is a perspective view illustrating a voice command recognition method in a mobile communication terminal according to another embodiment of the present invention;

на Фиг. 10 в общем виде показана конфигурация баз данных, используемых в качестве опорной информации для распознавания речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 10 generally shows a configuration of databases used as reference information for recognizing a voice command in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 11 в общем виде показано состояние, в котором выполняют функцию распознавания речи в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 11 is a general view of a state in which a voice recognition function is performed in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 12 в общем виде показан способ обработки подкоманд, связанных с конкретным меню, посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 12 is a perspective view illustrating a method for processing subcommands associated with a particular menu by means of a voice command in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 13 в общем виде показан способ поиска по схеме метрополитена посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 13 shows, in a general view, a metro search method using a voice command in a mobile communication terminal according to an embodiment of the present invention;

на Фиг. 14 в общем виде показан способ воспроизведения файлов с мультимедийным содержимым посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 14 is a perspective view illustrating a method of reproducing multimedia files by means of a voice command in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 15 в общем виде показан способ передачи сообщения электронной почты посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 15 is a perspective view illustrating a method for transmitting an e-mail message using a voice command in a mobile communication terminal according to an embodiment of the present invention;

на Фиг. 16 в общем виде показан способ выполнения телефонного вызова посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 16 is a perspective view illustrating a method of making a telephone call using a voice command in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 17 в общем виде показан способ использования информации из телефонной книги посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения;in FIG. 17 shows, in a general view, a method for using information from a telephone book by means of a voice command in a mobile communication terminal according to one embodiment of the present invention;

на Фиг. 18 в общем виде показан способ изменения выводимого на экран фонового изображения посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения; иin FIG. 18 shows, in a general view, a method for changing a displayed background image by a voice command in a mobile communication terminal according to one embodiment of the present invention; and

на Фиг. 19 в общем виде показан способ воспроизведения файлов с мультимедийным содержимым посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения.in FIG. 19 is a perspective view illustrating a method for reproducing multimedia files by means of a voice command in a mobile communication terminal according to one embodiment of the present invention.

ПОДРОБНОЕ ОПИСАНИЕ ИЗОБРЕТЕНИЯDETAILED DESCRIPTION OF THE INVENTION

Теперь будет приведено подробное описание предпочтительных вариантов осуществления настоящего изобретения, примеры которых проиллюстрированы на сопроводительных чертежах.A detailed description will now be made of preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings.

На Фиг. 1 изображена блок-схема терминала 100 мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Как показано на чертеже, терминал 100 мобильной связи содержит блок 110 беспроводной связи, содержащий один или большее количество компонентов, который позволяет поддерживать беспроводную связь между терминалом 100 мобильной связи и системой или сетью беспроводной связи, в пределах которой расположен терминал мобильной связи.In FIG. 1 is a block diagram of a mobile communication terminal 100 according to one embodiment of the present invention. As shown in the drawing, the mobile communication terminal 100 comprises a wireless communication unit 110 containing one or more components that allows wireless communication between the mobile communication terminal 100 and the wireless communication system or network within which the mobile communication terminal is located.

Например, блок 110 беспроводной связи содержит модуль 111 приема широковещательных передач, который принимает сигнал широковещательной передачи, и/или информацию, связанную с широковещательной передачей, из внешнего объекта, осуществляющего управление широковещательными передачами, по широковещательному каналу. Широковещательным каналом может являться, в том числе, спутниковой канал и наземный канал.For example, the wireless communication unit 110 comprises a broadcast reception module 111 that receives a broadcast signal and / or information related to a broadcast from an external broadcast control entity via a broadcast channel. A broadcast channel may include, but is not limited to, a satellite channel and a terrestrial channel.

Кроме того, термин "объект, осуществляющий управление широковещательными передачами" обычно относится к системе, которая производит передачу широковещательного сигнала и/или информации, связанной с широковещательной передачей. Примером информации, связанной с широковещательной передачей, является, в том числе, информация, связанная с широковещательным каналом, с широковещательной программой, с поставщиком услуг вещания и т.д. Например, информация, связанная с широковещательной передачей, может содержать электронную программу передач (EPG) для системы цифрового мультимедийного вещания (DMB) и электронный справочник услуг (ESG) системы цифрового телевещания для портативных устройств (DVB-H).In addition, the term "broadcast control entity" generally refers to a system that transmits a broadcast signal and / or broadcast related information. An example of information related to broadcast transmission is, inter alia, information related to the broadcast channel, the broadcast program, the broadcast service provider, etc. For example, broadcast related information may include an electronic program guide (EPG) for a digital multimedia broadcasting system (DMB) and an electronic service guide (ESG) of a digital broadcasting system for portable devices (DVB-H).

Кроме того, сигнал широковещательной передачи может быть реализован, помимо прочего, в виде сигнала телевизионного вещания, сигнала радиовещания и сигнала широковещательной передачи данных. Кроме того, сигнал широковещательной передачи может содержать сигнал широковещательной передачи, объединенный с сигналом телевизионного вещания или с радиовещательным сигналом.In addition, the broadcast signal may be implemented, inter alia, in the form of a television broadcast signal, a broadcast signal, and a data broadcast signal. In addition, the broadcast signal may comprise a broadcast signal combined with a television broadcast signal or a broadcast signal.

Модуль 111 приема широковещательных передач также выполнен таким образом, что принимает сигналы широковещательной передачи, переданные из систем вещания различных типов. Такими системами вещания являются, в том числе, например, наземная система цифрового мультимедийного вещания (DMB-T), спутниковая система цифрового мультимедийного вещания (DMB-S), система цифрового телевещания для портативных устройств (DVB-H), система широковещательной передачи данных, известная как система передачи мультимедийной информации только по линии прямой связи (MediaFLO®) и наземная система цифрового вещания с комплексными услугами (ISDB-T). Также возможен прием многоадресных сигналов. Кроме того, данные, принятые модулем 111 приема широковещательных передач, могут быть запомнены в соответствующем устройстве, например в запоминающем устройстве 160.The broadcast reception module 111 is also configured to receive broadcast signals transmitted from various types of broadcast systems. Such broadcasting systems are, for example, a terrestrial digital multimedia broadcasting system (DMB-T), a satellite digital multimedia broadcasting system (DMB-S), a digital broadcasting system for portable devices (DVB-H), a broadcasting system, Known as Direct Link Only Multimedia Information System (MediaFLO®) and Integrated Services Terrestrial Digital Broadcasting System (ISDB-T). Multicast reception is also possible. In addition, the data received by the broadcast reception module 111 may be stored in a corresponding device, for example, a storage device 160.

Блок 110 беспроводной связи также содержит модуль 112 мобильной связи, который производит передачу радиосигналов в один или большее количество сетевых объектов (которыми являются, например, базовая станция, узел B (Node B))/прием радиосигналов из них. Этими сигналами могут являться, в том числе, звуковые сигналы, видеосигналы, мультимедийные сигналы, передаваемые сигналы управления, и данные.The wireless communication unit 110 also includes a mobile communication module 112, which transmits radio signals to one or more network entities (which, for example, are a base station, node B (Node B)) / receive radio signals from them. These signals may include, but are not limited to, audio signals, video signals, multimedia signals, transmitted control signals, and data.

Он также содержит модуль 113 беспроводного доступа к сети Интернет, который обеспечивает поддержку доступа к сети Интернет для терминала мобильной связи. Соединение модуля 113 с терминалом может быть внутренним или внешним. Блок 110 беспроводной связи также содержит модуль 114 ближней связи, который, соответственно, облегчает ближнюю связь. Технологиями, пригодными для реализации этого модуля, являются, в том числе, следующие: технология радиочастотной идентификации (RFID), технология передачи данных через инфракрасный порт согласно стандарту IrDA, утвержденному Ассоциацией по средствам передачи данных в инфракрасном диапазоне, технология ультраширокополосной радиосвязи (UWB), а также такие сетевые технологии, которые обычно именуют технологией Bluetooth и технологией ZigBee.It also includes an Internet wireless access module 113, which provides Internet access support for a mobile communication terminal. The connection of the module 113 with the terminal may be internal or external. The wireless communication unit 110 also includes a short-range communication module 114, which accordingly facilitates short-range communication. Technologies suitable for implementing this module include, but are not limited to: radio frequency identification (RFID) technology, infrared data transmission technology in accordance with the IrDA standard, approved by the Infrared Data Communication Association, ultra wideband radio communication (UWB) technology, as well as network technologies commonly referred to as Bluetooth technology and ZigBee technology.

Блок 110 беспроводной связи также содержит модуль 115 определения местоположения, и он распознает местоположение терминала 100 мобильной связи или получает сведения о его местоположении иным способом. Модуль 115 определения местоположения может быть реализован с использованием компонентов Глобальной системы определения местоположения (GPS), которые взаимодействуют с соответствующими спутниками, с компонентами сети и с их комбинациями.The wireless communication unit 110 also includes a positioning module 115, and it recognizes the location of the mobile communication terminal 100 or otherwise obtains its location information. The positioning module 115 may be implemented using Global Positioning System (GPS) components that communicate with respective satellites, network components, and combinations thereof.

Кроме того, как показано на Фиг. 1, терминал 100 мобильной связи также содержит блок 120 ввода звуковых/видео (A/V) сигналов, который обеспечивает подачу звуковых или видеосигналов в терминал 100 мобильной связи. Как показано на чертеже, блок 120 ввода звуковых/видео (A/V) сигналов содержит камеру 121 и микрофон 122. Камера 121 получает и обрабатывает кадры изображений фотографических снимков или видеофильмов.In addition, as shown in FIG. 1, the mobile communication terminal 100 also comprises an audio / video (A / V) signal input unit 120 that provides audio or video signals to the mobile communication terminal 100. As shown in the drawing, the audio / video (A / V) input unit 120 includes a camera 121 and a microphone 122. The camera 121 receives and processes image frames of photographs or videos.

Кроме того, когда портативное устройство находится в конкретном режиме, например в режиме телефонного разговора, в режиме записи и в режиме распознавания речи, то микрофон 122 принимает внешний звуковой сигнал. Затем принятый звуковой сигнал обрабатывают и преобразовывают в цифровые данные. Портативное устройство и, в частности, блок 120 ввода звуковых/видео (A/V) сигналов, также содержит алгоритмы удаления различных шумов, предназначенные для удаления шума, сгенерированного в ходе приема внешнего звукового сигнала. Кроме того, данные, сгенерированные блоком 120 ввода звуковых/видео (A/V) сигналов, могут быть сохранены в запоминающем устройстве 160, могут быть использованы блоком 150 вывода или могут быть переданы посредством одного или большего количества модулей блока 110 связи. При желании могут быть использованы два или большее количество микрофонов и/или две или большее количество камер.In addition, when the portable device is in a particular mode, for example, in a telephone conversation mode, a recording mode, and a speech recognition mode, the microphone 122 receives an external audio signal. Then, the received audio signal is processed and converted into digital data. The portable device, and in particular the audio / video (A / V) signal input unit 120, also comprises various noise removal algorithms designed to remove noise generated during the reception of an external audio signal. In addition, the data generated by the audio / video (A / V) input unit 120 may be stored in the storage device 160, may be used by the output unit 150, or may be transmitted by one or more modules of the communication unit 110. If desired, two or more microphones and / or two or more cameras can be used.

Терминал 100 мобильной связи также содержит устройство 130 ввода пользовательских данных, которое осуществляет генерацию входных данных в ответ на манипуляции пользователя с соответствующим устройством или с соответствующими устройствами ввода. Примерами таких устройств являются, в том числе, кнопочная панель, переключатель с эластичной клавишей куполообразной формы (dome switch), сенсорная панель (например, основанная на изменении электрической емкости при приложении статического давления), колесико управления (jog wheel) и джойстик-переключатель (jog switch). Конкретным примером является следующий: устройство 130 ввода пользовательских данных выполнено в виде сенсорной панели, взаимодействующей с дисплеем с сенсорным экраном, более подробное описание которого приведено ниже.The mobile communication terminal 100 also comprises a user data input device 130 that generates input data in response to user manipulation of a corresponding device or corresponding input devices. Examples of such devices are, for example, a keypad, a switch with an elastic dome-shaped key (dome switch), a touchpad (for example, based on a change in electric capacitance when static pressure is applied), a control wheel (jog wheel) and a joystick switch ( jog switch). A specific example is as follows: the user data input device 130 is configured as a touch panel cooperating with a touch screen display, which is described in more detail below.

Терминал 100 мобильной связи также содержит датчик 140, и этот датчик 140 обеспечивает измерения состояния различных аспектов работы терминала 100 мобильной связи. Например, датчик 140 может регистрировать, находится ли терминал 100 мобильной связи в открытом/закрытом состоянии, относительное расположение компонентов (например, дисплея и кнопочной панели) терминала 100 мобильной связи, изменение местоположения терминала 100 мобильной связи или компонента терминала 100 мобильной связи, наличие или отсутствие контакта пользователя с терминалом 100 мобильной связи, ориентацию или ускорение/замедление перемещения терминала 100 мобильной связи и т.д.The mobile communication terminal 100 also includes a sensor 140, and this sensor 140 provides status measurements of various aspects of the operation of the mobile communication terminal 100. For example, the sensor 140 may detect whether the mobile communication terminal 100 is in an open / closed state, the relative location of the components (eg, display and keypad) of the mobile communication terminal 100, the location of the mobile terminal 100 or a component of the mobile communication terminal 100, the presence or lack of user contact with the mobile communication terminal 100, orientation or acceleration / deceleration of movement of the mobile communication terminal 100, etc.

Например, когда терминалом 100 мобильной связи является терминал мобильной связи сдвижного типа, датчик 140 может регистрировать, является ли сдвижная часть терминала 100 мобильной связи открытой или закрытой. Другими примерами являются, в том числе, следующие: датчик 140 регистрирует наличие или отсутствие электропитания, обеспечиваемого источником 190 электропитания, наличие или отсутствие связи или иного соединения между блоком 170 интерфейса и внешним устройством, и т.д.For example, when the mobile communication terminal 100 is a movable type mobile communication terminal, the sensor 140 may detect whether the movable portion of the mobile communication terminal 100 is open or closed. Other examples are, inter alia, the following: the sensor 140 detects the presence or absence of power provided by the power source 190, the presence or absence of a connection or other connection between the interface unit 170 and an external device, etc.

Кроме того, блок 170 интерфейса часто реализован таким образом, что обеспечивает связь терминала 100 мобильной связи с внешними устройствами. Типичными внешними устройствами являются, в том числе, проводные/беспроводные головные телефоны, внешние зарядные устройства, источники электропитания, устройства хранения данных, выполненные таким образом, что обеспечивают хранение данных (например, звуковых данных, видеоинформации, изображений и т.д.), наушники и микрофоны. Кроме того, блок 170 интерфейса может быть сконфигурирован с использованием проводного/беспроводного порта передачи данных, разъема для плат (например, для соединения с платой памяти, с платой модуля идентификации абонента (SIM-картой), с платой модуля идентификации абонента (UIM), с платой сменного модуля идентификации абонента (RUIM) и т.д.), портов ввода/вывода звукового сигнала и портов ввода/вывода видеосигнала.In addition, the interface unit 170 is often implemented in such a way that the mobile terminal 100 is in communication with external devices. Typical external devices include, but are not limited to, wired / cordless head phones, external chargers, power supplies, data storage devices designed to store data (e.g., audio data, video, images, etc.), headphones and microphones. In addition, the interface unit 170 can be configured using a wired / wireless data port, a card slot (for example, to connect to a memory card, to a subscriber identity module (SIM) card, to a subscriber identity module (UIM) card, with a plug-in Subscriber Identity Module (RUIM) card, etc.), audio input / output ports, and video input / output ports.

Блок 150 вывода обычно содержит различные компоненты, которые обеспечивают вывод требуемых сигналов в терминале 100 мобильной связи. Терминал 100 мобильной связи также содержит дисплей 151, который обеспечивает визуальное отображение информации, связанной с терминалом 100 мобильной связи. Например, если терминал 100 мобильной связи работает в режиме телефонного разговора, то дисплей 151 обычно обеспечивает интерфейс пользователя или графический интерфейс пользователя, содержащий информацию, связанную с заказом, проведением и завершением телефонного разговора. В качестве другого примера, если терминал 100 мобильной связи находится в режиме видеотелефонного разговора или в режиме фотографирования, то дисплей 151 может дополнительно или в альтернативном варианте выводить на экран дисплея изображения, связанные с этими режимами.The output unit 150 typically contains various components that provide the output of the desired signals in the mobile communication terminal 100. The mobile communication terminal 100 also includes a display 151 that provides a visual display of information associated with the mobile communication terminal 100. For example, if the mobile communication terminal 100 is in a telephone conversation mode, then the display 151 typically provides a user interface or graphical user interface containing information related to ordering, conducting and ending the telephone conversation. As another example, if the mobile communication terminal 100 is in a videophone conversation mode or in a photographing mode, the display 151 may additionally or alternatively display images associated with these modes on the display screen.

Кроме того, дисплей 151 также предпочтительно содержит сенсорный экран, работающий во взаимодействии с устройством ввода, например сенсорную панель. Такая конфигурация позволяет дисплею 151 функционировать как в качестве устройства вывода, так и в качестве устройства ввода. Кроме того, дисплей 151 может быть реализован с использованием, в том числе, таких технологий отображения информации, как, например, жидкокристаллический дисплей (LCD), жидкокристаллический дисплей на тонкопленочных транзисторах (TFT-LCD), дисплей на органических светоизлучающих диодах (OLED), гибкий дисплей и трехмерный дисплей.In addition, the display 151 also preferably includes a touch screen that works in conjunction with an input device, such as a touch panel. This configuration allows the display 151 to function both as an output device and as an input device. In addition, the display 151 can be implemented using, inter alia, information display technologies such as, for example, a liquid crystal display (LCD), a thin-film transistor liquid crystal display (TFT-LCD), an organic light emitting diode (OLED) display, flexible display and three-dimensional display.

Терминал 100 мобильной связи также может содержать один или большее количество таких дисплеев. Примером варианта осуществления с двумя дисплеями является следующий: один дисплей выполнен таким образом, что он является внутренним дисплеем (который виден тогда, когда терминал находится в открытом положении), а второй дисплей выполнен таким образом, что он является внешним дисплеем (который виден в обоих положениях: в открытом положении и в закрытом положении).The mobile communication terminal 100 may also contain one or more such displays. An example of an embodiment with two displays is as follows: one display is designed to be an internal display (which is visible when the terminal is in the open position), and the second display is designed to be an external display (which is visible in both positions: in the open position and in the closed position).

Кроме того, на Фиг.1 показано устройство 150 вывода, содержащее модуль 152 вывода звука, который обеспечивает вывод требуемых звуковых сигналов в терминале 100 мобильной связи. Модуль 152 вывода звука может быть реализован с использованием одного или большего количества громкоговорителей, зуммеров, других устройств создания звука и комбинаций этих устройств. Кроме того, модуль 152 вывода звука функционирует в различных режимах, в том числе в режиме приема телефонного вызова, в режиме выполнения телефонного вызова, в режиме записи, в режиме распознавания речи и в режиме приема широковещательных передач. Во время работы модуль 152 вывода звука выводит звуковые сигналы, связанные с конкретной функцией (например, с принятым телефонным вызовом, с принятым сообщением и с ошибками).In addition, FIG. 1 shows an output device 150 comprising an audio output module 152 that provides output of desired audio signals to a mobile communication terminal 100. Sound output module 152 may be implemented using one or more speakers, buzzers, other sound generating devices, and combinations of these devices. In addition, the audio output module 152 operates in various modes, including in a telephone call reception mode, in a telephone call execution mode, in a recording mode, in a speech recognition mode, and in a broadcast reception mode. During operation, the audio output module 152 outputs audio signals associated with a particular function (for example, with a received phone call, with a received message, and with errors).

Кроме того, на чертеже показано, что блок 150 вывода дополнительно содержит сигнальное устройство 153, которое используют для сигнализации о возникновении конкретного события, связанного с терминалом 100 мобильной связи, или для распознавания факта его возникновения иным способом. Событиями, о которых сигнализируют, являются, в том числе, принятый вызов, принятое сообщение и принятые данные, введенные пользователем. Примером такого выходного эффекта является, в том числе, создание осязательных ощущений (например, вибрации) для пользователя. Например, сигнальное устройство 153 может быть выполнено таким образом, что вибрирует в ответ на прием телефонного вызова или сообщения терминалом 100 мобильной связи.In addition, the drawing shows that the output unit 150 further comprises a signaling device 153, which is used to signal the occurrence of a specific event associated with the mobile communication terminal 100, or to recognize the fact of its occurrence in another way. The events signaled are, inter alia, the received call, the received message and the received data entered by the user. An example of such an output effect is, inter alia, the creation of tactile sensations (for example, vibration) for the user. For example, the signal device 153 may be configured to vibrate in response to receiving a telephone call or message by the mobile communication terminal 100.

В качестве другого примера, сигнальное устройство 153 создает вибрацию в ответ на прием введенных пользователем данных в терминале 100 мобильной связи, обеспечивая, таким образом, механизм осязательной обратной связи. Кроме того, различные выходные эффекты, созданные компонентами устройства 150 вывода, могут быть реализованы по отдельности, или же такой выходной эффект может быть реализован с использованием любой комбинации этих компонентов.As another example, the signal device 153 vibrates in response to receiving user input at the mobile communication terminal 100, thereby providing a tactile feedback mechanism. In addition, the various output effects created by the components of the output device 150 may be implemented separately, or such an output effect may be implemented using any combination of these components.

Кроме того, для хранения различных типов данных для обеспечения требуемой обработки, требуемого управления и хранения данных в терминале 100 мобильной связи используют запоминающее устройство 160. Примерами таких данных являются, в том числе, программные команды для прикладных программ, работающих в терминале 100 мобильной связи, предыстория телефонных вызовов, данные о контактах, данные телефонной книги, сообщения, изображения, видеоинформация и т.д.In addition, a storage device 160 is used to store various types of data to provide the required processing, required control, and data storage in the mobile communication terminal 100. Examples of such data are, inter alia, program instructions for application programs operating in the mobile communication terminal 100, telephone call history, contact data, phone book data, messages, images, video information, etc.

Кроме того, запоминающее устройство 160, показанное на Фиг. 1, может быть реализовано с использованием подходящих энергозависимых и энергонезависимых запоминающих устройств или устройств хранения данных любого типа, которыми являются, в том числе, оперативное запоминающее устройство (ОЗУ), статическое оперативное запоминающее устройство (статическое ОЗУ), электрически стираемое программируемое постоянное запоминающее устройство (ЭСППЗУ), стираемое программируемое постоянное запоминающее устройство (СППЗУ), программируемое постоянное запоминающее устройство (ППЗУ), постоянное запоминающее устройство (ПЗУ), магнитное запоминающее устройство, флэш-память, магнитный или оптический диск, запоминающее устройство на картах памяти или иное подобное запоминающее устройство или устройство для хранения данных, или с использованием комбинации этих устройств.In addition, the storage device 160 shown in FIG. 1 can be implemented using suitable volatile and non-volatile memory devices or storage devices of any type, which are, including random access memory (RAM), static random access memory (static RAM), electrically erasable programmable read-only memory ( EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (EPROM), read-only memory troystvo (ROM), magnetic memory, flash memory, magnetic or optical disk, memory device on memory cards or other similar memory or data storage device, or a combination of these devices.

Терминал 100 также содержит контроллер 180, который обычно управляет функционированием всего терминала 100 мобильной связи. Например, контроллер 180 осуществляет управление и обработку, которые связаны с телефонными вызовами в режиме речевой связи, с передачей данных, с обменом мгновенными сообщениями, с вызовами в режиме видеотелефонной связи, с операциями камеры и с операциями записи. Как показано на Фиг. 1, контроллер 180 может также содержать мультимедийный модуль 181, предназначенный для обеспечения функций воспроизведения мультимедийной информации. Мультимедийный модуль 181 может быть выполнен в виде части контроллера 180 или может быть реализован в виде отдельного компонента.The terminal 100 also includes a controller 180, which typically controls the operation of the entire mobile communication terminal 100. For example, the controller 180 performs control and processing that are associated with telephone calls in voice mode, with data transfer, with instant messaging, with calls in video telephony mode, with camera operations and recording operations. As shown in FIG. 1, the controller 180 may also comprise a multimedia module 181 designed to provide multimedia information playback functions. The multimedia module 181 may be implemented as part of the controller 180, or may be implemented as a separate component.

Кроме того, источник 190 электропитания обеспечивает энергию, используемую различными компонентами портативного устройства. Предоставленное электропитание может быть внутренним электропитанием, внешним электропитанием или их комбинациями.In addition, the power source 190 provides energy used by various components of the portable device. The power provided may be internal power, external power, or combinations thereof.

На следующем чертеже Фиг.2 терминал 100 мобильной связи согласно одному из вариантов осуществления настоящего изобретения показан на виде спереди-сбоку. Как показано на Фиг. 2, терминал 100 мобильной связи содержит первый корпус 200 выполненный таким образом, что может скользить по направляющим во взаимодействии со вторым корпусом 205. Устройство 130 ввода пользовательских данных, описанное со ссылкой на Фиг.1, может содержать первое устройство ввода данных, например функциональные кнопки 210, второе устройство ввода данных, например, кнопочную панель 215, и третье устройство ввода данных, например, боковые клавиши 245.In the following drawing of FIG. 2, a mobile communication terminal 100 according to one embodiment of the present invention is shown in a front-side view. As shown in FIG. 2, the mobile communication terminal 100 comprises a first housing 200 configured so that it can slide along the rails in cooperation with the second housing 205. The user data input device 130 described with reference to FIG. 1 may include a first data input device, for example, function buttons 210, a second data input device, for example, a keypad 215, and a third data input device, for example, side keys 245.

Функциональные кнопки 210 связаны с первым корпусом 200, а кнопочная панель 215 связана со вторым корпусом 205. Кнопочная панель 215 содержит различные клавиши (например, цифр, букв и символов), предоставляющие пользователю возможность сделать телефонный вызов, подготовить текстовое или мультимедийное сообщение и управлять терминалом 100 мобильной связи каким-либо иным способом.Function buttons 210 are associated with the first housing 200, and the keypad 215 is associated with the second housing 205. The keypad 215 contains various keys (eg, numbers, letters, and symbols) allowing the user to make a phone call, prepare a text or multimedia message, and control the terminal 100 mobile communications in any other way.

Кроме того, первый корпус 200 может быть перемещен путем скольжения относительно второго корпуса 205 между открытым положением и закрытым положением. В закрытом положении первый корпус 200 расположен поверх второго корпуса 205 таким образом, что кнопочная панель 215 в значительной степени или полностью закрыта первым корпусом 200. В открытом положении возможен доступ пользователя к кнопочной панели 215, а также к дисплею 151 и к функциональным кнопкам 210,. Функциональные кнопки 210 удобны для ввода пользователем таких команд, как, например, команды "пуск", "стоп" и "прокрутка".In addition, the first housing 200 can be moved by sliding relative to the second housing 205 between the open position and the closed position. In the closed position, the first housing 200 is located on top of the second housing 205 so that the keypad 215 is substantially or completely closed by the first housing 200. In the open position, the user can access the keypad 215, as well as the display 151 and the function buttons 210, . Function buttons 210 are convenient for user input of commands such as, for example, start, stop, and scroll commands.

Кроме того, терминал 100 мобильной связи способен работать либо в режиме ожидания (в котором он, например, способен принять телефонный вызов или сообщение, принимать служебные сигналы управления сетью связи и реагировать на них), либо в режиме активного телефонного вызова. Как правило, терминал 100 мобильной связи функционирует в режиме ожидания тогда, когда он находится в закрытом положении, и в активном режиме тогда, когда он находится в открытом положении. Однако конфигурация режима может быть изменена при необходимости или по желанию.In addition, the mobile communication terminal 100 is capable of operating either in standby mode (in which it, for example, is capable of receiving a telephone call or message, receiving communication network control signals and responding to them) or in an active telephone call mode. Typically, the mobile communication terminal 100 operates in the standby mode when it is in the closed position, and in the active mode when it is in the open position. However, the configuration of the mode can be changed if necessary or as desired.

Кроме того, первый корпус 200 сформирован из первого кожуха 220 и второго кожуха 225, и второй корпус 205 сформирован из первого кожуха 230 и второго кожуха 235. Первые и вторые кожухи предпочтительно выполнены из соответствующего материала, снабженного ребрами жесткости, например, из пластмассы, полученной способом литья под давлением, или сформованы с использованием металла, например нержавеющей стали (STS) и титана (Ti).In addition, the first casing 200 is formed from the first casing 220 and the second casing 225, and the second casing 205 is formed from the first casing 230 and the second casing 235. The first and second casings are preferably made of a suitable material provided with stiffeners, for example, plastic obtained injection molding, or molded using metal, such as stainless steel (STS) and titanium (Ti).

При желании может быть предусмотрено наличие одного или большего количества промежуточных кожухов, расположенных между первым и вторым кожухами одного из корпусов: первого корпуса 200 и второго корпуса 205, или обоих этих корпусов. Кроме того, первый корпус 200 и второй корпус 205 имеют такой размер, что вмещают в себя электронные компоненты, используемые для обеспечения функционирования терминала 100 мобильной связи.If desired, one or more intermediate casings located between the first and second casings of one of the housings may be provided: the first housing 200 and the second housing 205, or both of these housings. In addition, the first housing 200 and the second housing 205 are sized to accommodate electronic components used to support the operation of the mobile communication terminal 100.

Первый корпус 200 также содержит камеру 121 и устройство 152 вывода звука, которое выполнено в виде громкоговорителя, расположенного надлежащим образом относительно дисплея 151. Камера 121 также может быть сконструирована таким образом, что по выбору она может быть установлена в заданное положение (например, повернута, повернута на шарнире и т.д.) относительно первого корпуса 200.The first housing 200 also includes a camera 121 and a sound output device 152, which is designed as a loudspeaker appropriately positioned relative to the display 151. The camera 121 can also be designed so that it can optionally be set to a predetermined position (for example, rotated, swiveled, etc.) relative to the first housing 200.

Кроме того, рядом с нижней стороной дисплея 151 расположены функциональные кнопки 210. Как изложено выше, дисплей 151 может быть реализован в виде жидкокристаллического дисплея, ЖКД (LCD), или в виде дисплея на органических светоизлучающих диодах (OLED). Дисплей 151 также может быть выполнен в виде сенсорного экрана, снабженного расположенной под ним сенсорной панелью, которая осуществляет генерацию сигналов в ответ на прикосновение пользователя (например, пальцем, ручкой-стилусом и т.д.) к сенсорному экрану.In addition, function buttons 210 are located adjacent to the lower side of the display 151. As described above, the display 151 may be implemented as a liquid crystal display (LCD), or as an organic light emitting diode (OLED) display. The display 151 can also be made in the form of a touch screen equipped with a touch panel located beneath it, which generates signals in response to the user's touch (for example, with a finger, stylus pen, etc.) to the touch screen.

Второй корпус 205 также содержит микрофон 122, расположенный рядом с кнопочной панелью 215, и боковые клавиши 245, которые являются одним из типов устройств ввода пользовательских данных, расположенные вдоль боковой стороны второго корпуса 205. В предпочтительном варианте боковые клавиши 245 выполнены в виде клавиш быстрого вызова функций ("горячих клавиш") таким образом, что боковые клавиши 245 связаны с конкретной функцией терминала 100 мобильной связи. Как показано на чертеже, рядом с боковыми клавишами 245 расположен блок 170 интерфейса, а в нижней части второго корпуса 205 расположен источник 190 электропитания в виде аккумулятора.The second body 205 also includes a microphone 122 located adjacent to the keypad 215, and side keys 245, which are one type of user input device, located along the side of the second body 205. In a preferred embodiment, the side keys 245 are in the form of shortcut keys functions (hot keys) so that the side keys 245 are associated with a specific function of the mobile communication terminal 100. As shown in the drawing, an interface unit 170 is located adjacent to the side keys 245, and an electric power source 190 in the form of a battery is located at the bottom of the second housing 205.

На Фиг.3 терминал 100 мобильной связи, показанный на Фиг. 2. изображен на виде сзади-сбоку. Как показано на Фиг. 3, второй корпус 205 содержит камеру 121 и связанные с ней лампу-вспышку 250 и зеркало 255. Лампа-вспышка 250 функционирует вместе с камерой 121 второго корпуса 205, а зеркало 255 является полезным для содействия надлежащему расположению камеры 121 в режиме автопортрета пользователем. Кроме того, камера 121 второго корпуса 205 обращена в направлении, противоположном тому направлению, в котором обращена камера 121 первого корпуса 200, показанная на Фиг. 2.3, the mobile communication terminal 100 shown in FIG. 2. depicted in rear-side view. As shown in FIG. 3, the second housing 205 comprises a camera 121 and its associated flash lamp 250 and a mirror 255. The flash lamp 250 functions in conjunction with the camera 121 of the second housing 205, and the mirror 255 is useful for assisting the user to properly position the camera 121 in self portrait mode. In addition, the camera 121 of the second body 205 faces in a direction opposite to that in which the camera 121 of the first body 200 shown in FIG. 2.

Кроме того, каждая из камер 121 первого и второго корпусов 200 и 205 может иметь одинаковые или различные функциональные возможности. Например, в одном из вариантов осуществления изобретения камера 121 первого корпуса 200 работает с относительно более низкой разрешающей способностью, чем камера 121 второго корпуса 205. Такое устройство работает хорошо, например, во время телефонной связи в режиме видеоконференции, когда пропускная способность линии обратной связи может быть ограниченной. Кроме того, относительно более высокая разрешающая способность камеры 121 второго корпуса 205 (Фиг. 3) является полезной для получения более высококачественных изображений для их использования впоследствии.In addition, each of the chambers 121 of the first and second bodies 200 and 205 may have the same or different functionalities. For example, in one embodiment, the camera 121 of the first body 200 operates at a relatively lower resolution than the camera 121 of the second body 205. Such a device works well, for example, during telephone conferencing in a video conferencing mode, when the bandwidth of the feedback line can to be limited. In addition, the relatively higher resolution of the camera 121 of the second body 205 (FIG. 3) is useful for obtaining higher quality images for later use.

Второй корпус 205 также содержит модуль 152 вывода звука, который выполнен в виде громкоговорителя, и который расположен в верхней части второго корпуса 205. Модули выхода звукового сигнала первого и второго корпусов 200 и 205 могут также взаимодействовать друг с другом для обеспечения вывода стереозвука. Кроме того, любой из этих модулей вывода звука или оба из них могут быть выполнены таким образом, что функционируют в качестве устройства громкоговорящей связи.The second housing 205 also includes a sound output module 152, which is configured as a loudspeaker, and which is located at the top of the second housing 205. The audio output modules of the first and second housings 200 and 205 can also interact with each other to provide stereo audio output. In addition, either of these sound output modules, or both of them, may be configured in such a way that they function as a speakerphone.

Терминал 100 также содержит антенну 260 для приема сигнала широковещательной передачи, расположенную на верхнем торце второго корпуса 205. Антенна 260 функционирует во взаимодействии с модулем 111 приема широковещательных передач (показанным на Фиг. 1). При желании антенна 260 может быть установлена или выполнена таким образом, что способна втягиваться во второй корпус 205. Кроме того, задняя сторона первого корпуса 200 содержит модуль 265 скольжения по направляющим, который соединен с соответствующим модулем скольжения по направляющим, расположенным на передней стороне второго корпуса 205, с возможностью его перемещения путем скольжения.The terminal 100 also includes an antenna 260 for receiving a broadcast signal located at the upper end of the second housing 205. The antenna 260 operates in conjunction with the broadcast reception module 111 (shown in FIG. 1). If desired, the antenna 260 can be mounted or configured in such a way that it can be pulled into the second housing 205. In addition, the rear side of the first housing 200 includes a rail sliding module 265 that is connected to a corresponding rail sliding module located on the front side of the second housing 205, with the possibility of its movement by sliding.

Кроме того, проиллюстрированная компоновка различных компонентов первого и второго корпусов 200 и 205 может быть видоизменена при необходимости или по желанию. В общем случае, в альтернативном варианте некоторые или все компоненты одного корпуса могут быть реализованы таким образом, что они расположены в другом корпусе. Кроме того, местоположение и относительное расположение таких компонентов может быть таким, что их места расположения отличаются от тех, которые показаны на представленных чертежах.In addition, the illustrated arrangement of the various components of the first and second housings 200 and 205 may be modified as necessary or as desired. In the General case, in an alternative embodiment, some or all of the components of one housing can be implemented in such a way that they are located in another housing. In addition, the location and relative position of such components may be such that their locations differ from those shown in the drawings.

Кроме того, терминал 100 мобильной связи, показанный на чертежах Фиг. 1-3, может быть выполнен таким образом, что работает в системе связи, осуществляющей передачу данных посредством кадров или пакетов, в том числе, не только в системах беспроводной связи, но и в системах проводной связи, и в системах спутниковой связи. В таких системах связи используют различные интерфейсы радиосвязи и/или физические уровни.In addition, the mobile communication terminal 100 shown in the drawings of FIG. 1-3, can be performed in such a way that it works in a communication system that transfers data through frames or packets, including not only in wireless communication systems, but also in wired communication systems, and in satellite communication systems. Such communication systems use various radio interfaces and / or physical layers.

Примерами таких интерфейсов радиосвязи, используемых в системах связи, являются, в том числе, например, следующие: множественный доступ с частотным разделением, МДЧР (FDMA), множественный доступ с временным разделением, МДВР (TDMA), множественный доступ с кодовым разделением, МДКР (CDMA), и интерфейс радиосвязи Универсальной системы мобильной связи (UMTS), стандарт интерфейса радиосвязи в рамках программы долгосрочного развития (LTE) Универсальной системы мобильной связи (UMTS) и интерфейс радиосвязи Глобальной системы мобильной связи (GSM). В качестве только лишь примера, не являющегося ограничивающим признаком, приведенное ниже описание относится к системе связи МДКР, но идея этого изобретения равным образом применима и к системам других типов.Examples of such radio interfaces used in communication systems are, for example, the following: frequency division multiple access, FDMA, time division multiple access, TDMA, code division multiple access, CDMA ( CDMA), and the radio communication interface of the Universal Mobile Telecommunications System (UMTS), the radio communication standard for the long-term development program (LTE) of the Universal Mobile Telecommunications System (UMTS) and the radio communication interface of the Global Mobile Telecommunications System (GSM). As a merely non-limiting example, the following description refers to a CDMA communication system, but the idea of this invention is equally applicable to other types of systems.

На следующем чертеже Фиг. 4 проиллюстрирована система беспроводной связи МДКР, содержащая множество терминалов 100 мобильной связи, множество базовых станций (БС) 270, множество контроллеров 275 базовых станций (КБС) и коммутационный центр 280 мобильной связи (КЦМС).In the following drawing, FIG. 4 illustrates a CDMA wireless communication system comprising a plurality of mobile communication terminals 100, a plurality of base stations (BS) 270, a plurality of base station controllers (BSC) 275, and a mobile switching center 280 (MSC).

КЦМС 280 сконфигурирован таким образом, что соединен с помощью интерфейса с коммутируемой телефонной сетью 290 общего пользования (КТСОП), и КЦМС 280 также таким образом, что соединен с помощью интерфейса с контроллерами 275 базовых станций (КБС). Кроме того, контроллеры 275 базовых станций (КБС) связаны с базовыми станциями 270 через линии обратной связи. Кроме того, линии обратной связи могут быть сконфигурированы в соответствии с любым из нескольких интерфейсов, которыми являются, в том числе, например, интерфейс линии E1/T1, интерфейс линии асинхронного режима передачи (ATM), интерфейс протокола сети Интернет (IP), интерфейс протокола соединения "точка-точка" (PPP), интерфейс протокола ретрянсляции кадров (Frame Relay), интерфейс высокоскоростной цифровой абонентской линии (HDSL), ассиметричной цифровой абонентской линии (ADSL) или цифровых абонентских линий по технологии xDSL. Кроме того, система может содержать более двух контроллеров 275 базовых станций (КБС).MSC 280 is configured in such a way that it is connected via an interface to the public switched telephone network 290 (PSTN), and MSC 280 is also such that it is connected via an interface to controllers 275 base stations (BSC). In addition, base station controllers (BSCs) 275 are connected to base stations 270 via feedback links. In addition, the feedback lines can be configured in accordance with any of several interfaces, which, for example, include the E1 / T1 line interface, the asynchronous transfer mode (ATM) line interface, the Internet Protocol (IP) interface, the interface point-to-point connection protocol (PPP), frame relay protocol interface (Frame Relay), high-speed digital subscriber line (HDSL) interface, asymmetric digital subscriber line (ADSL) or digital subscriber lines using xDSL technology. In addition, the system may contain more than two controllers 275 base stations (BSC).

К тому же, каждая базовая станция 270 может содержать один или большее количество секторов, при этом каждый сектор содержит всенаправленную антенну или антенну, направленную в конкретном направлении по радиусу от базовой станции 270. В альтернативном варианте каждый сектор может содержать две антенны для приема с разнесением. Кроме того, каждая базовая станция 270 может быть выполнена таким образом, что обеспечивает поддержку множества распределений предоставленных частот, причем каждое распределение предоставленных частот имеет конкретную ширину спектра (например, 1,25 МГц, 5 МГц).In addition, each base station 270 may contain one or more sectors, with each sector containing an omnidirectional antenna or an antenna directed in a specific direction along the radius from the base station 270. Alternatively, each sector may contain two antennas for diversity reception . In addition, each base station 270 may be configured to support multiple allocations of the provided frequencies, with each distribution of the provided frequencies having a specific spectrum width (e.g., 1.25 MHz, 5 MHz).

Пересечение сектора и предоставленных частот может именоваться каналом МДКР. Базовые станции 270 также могут именоваться подсистемами приемопередатчика базовой станции, ППБС (BTS). В некоторых случаях термин "базовая станция" может использоваться таким образом, что он относится к совокупности из КБС 275 и одной или большего количества базовых станций 270.The intersection of the sector and the frequencies provided may be referred to as a CDMA channel. Base stations 270 may also be referred to as base station transceiver subsystems, BTSs. In some cases, the term “base station” may be used in such a way that it refers to a combination of BSC 275 and one or more base stations 270.

Базовые станции 270 также могут именоваться "узлами сети сотовой связи". В альтернативном варианте узлами сети сотовой связи именоваться отдельные сектора конкретной базовой станции 270 могут. Кроме того, показан передатчик 295 наземной системы цифрового мультимедийного вещания (DMB), осуществляющий широковещательную передачу в терминалы 100 мобильной связи, работающие в системе.Base stations 270 may also be referred to as "cellular network nodes." Alternatively, the nodes of a cellular network can be referred to as individual sectors of a particular base station 270 may. In addition, a transmitter 295 of a terrestrial digital multimedia broadcasting (DMB) system is shown broadcasting to mobile communication terminals 100 operating in the system.

Кроме того, модуль 111 приема широковещательных передач (показанный на Фиг.1) терминала 100 мобильной связи обычно выполнен таким образом, что способен принимать сигналы широковещательной передачи, переданные передатчиком 295 системы цифрового мультимедийного вещания (DMB). Как изложено выше, аналогичные устройства могут быть реализованы для других типов широковещательной и многоадресной передачи сигналов.In addition, the broadcast reception module 111 (shown in FIG. 1) of the mobile communication terminal 100 is typically configured to receive broadcast signals transmitted by the transmitter 295 of the digital multimedia broadcasting system (DMB). As described above, similar devices can be implemented for other types of broadcast and multicast signaling.

Кроме того, на Фиг.4 проиллюстрировано несколько спутников 300 Глобальной системы определения местоположения (GPS). Эти спутники облегчают определение местоположения некоторых или всех терминалов 100 мобильной связи. На Фиг. 4 показаны два спутника, но информация о местоположении может быть получена посредством большего или меньшего количества спутников.In addition, FIG. 4 illustrates several Global Positioning System (GPS) satellites 300. These satellites facilitate the location of some or all of the mobile communication terminals 100. In FIG. 4 shows two satellites, but location information can be obtained by more or less satellites.

Кроме того, модуль 115 определения местоположения (показанный на Фиг. 1) терминала 100 мобильной связи обычно выполнен таким образом, что взаимодействует со спутниками 300 для получения желательной информации о местоположении. Однако в альтернативном варианте может быть реализована техника определения местоположения иных типов, например такая техника определения местоположения, которая может использоваться в дополнение к технологии определения местоположения посредством Глобальной системы определения местоположения (GPS) или вместо нее. В альтернативном варианте или в дополнение к этому некоторые или все спутники 300 Глобальной системы определения местоположения (GPS) могут быть устроены таким образом, что обеспечивают передачу программ цифрового мультимедийного вещания (DMB) со спутников.In addition, the location module 115 (shown in FIG. 1) of the mobile communication terminal 100 is typically configured to communicate with satellites 300 to obtain desired location information. However, in an alternative embodiment, other types of positioning techniques may be implemented, such as a positioning technique that can be used in addition to, or instead of using, a Global Positioning System (GPS). Alternatively, or in addition to this, some or all of the satellites 300 of the Global Positioning System (GPS) can be arranged to transmit digital multimedia broadcasting (DMB) programs from satellites.

Кроме того, во время обычной работы системы беспроводной связи базовые станции 270 принимают наборы сигналов, передаваемых по линиям обратной связи из различных терминалов 100 мобильной связи. Терминалы 100 мобильной связи участвуют в телефонных вызовах, в обмене сообщениями и в других видах связи.In addition, during normal operation of the wireless communication system, base stations 270 receive sets of signals transmitted on the feedback lines from various mobile communication terminals 100. Mobile communication terminals 100 participate in telephone calls, messaging, and other forms of communication.

Кроме того, каждый сигнал, переданный по линии обратной связи, который принят конкретной базовой станцией 270 обрабатывают в этой же базовой станции 270 и результирующие данные пересылают в соответствующий КБС 275. КБС 275 обеспечивает распределение ресурсов телефонного вызова и функциональные возможности управления мобильностью, в том числе, "мягкими" эстафетными передачами управления связью между базовыми станциями 270.In addition, each feedback signal received by a particular base station 270 is processed at the same base station 270 and the resulting data is sent to the corresponding BSC 275. The BSC 275 provides a telephone call resource allocation and mobility management functionality, including , soft handoffs of communication control between base stations 270.

Кроме того, контроллеры 275 базовых станций (КБС) также направляют принятые данные в КЦМС 280, который обеспечивает дополнительные услуги маршрутизации для связи с КТСОП 290. Аналогичным образом, КТСОП служит средством связи с КЦМС 280, а КЦМС 280 служит средством связи с контроллерами 275 базовых станций (КБС). Контроллеры 275 базовых станций (КБС) также управляют базовыми станциями 270 для обеспечения передачи наборов сигналов по линии прямой связи в терминалы 100 мобильной связи.In addition, the base station controllers 275 (BSC) also send the received data to the MSC 280, which provides additional routing services for communication with the PSTN 290. Similarly, the PSTN serves as a communication tool with the MSC 280, and the MSC 280 serves as a communication tool with 275 base controllers stations (BSC). Base station controllers (BSCs) 275 also control base stations 270 to provide for transmitting sets of signals over a direct link to mobile communication terminals 100.

В приведенном ниже описании дано объяснение способа управления, применимого для терминала 100 мобильной связи, сконфигурированного описанным выше способом, в различных вариантах осуществления изобретения. Однако описанные ниже варианты осуществления изобретения могут быть реализованы независимо или посредством их комбинаций. В дополнение к этому, в приведенном ниже описании предполагают, что дисплей 151 содержит сенсорный экран. Кроме того, сенсорный экран или его экран обозначен ниже номером позиции '400'.In the description below, an explanation is given of a control method applicable to a mobile communication terminal 100 configured in the manner described above in various embodiments of the invention. However, the embodiments described below may be implemented independently or by combinations thereof. In addition, the description below assumes that the display 151 includes a touch screen. In addition, the touch screen or its screen is indicated below by the position number '400'.

В одном из вариантов осуществления настоящего изобретения терминал мобильной связи задает область (или диапазон информационного поиска) баз данных, используемую в качестве опорной информации для распознавания речевой команды в виде области, связанной с конкретным меню или с конкретными услугами. Соответственно, увеличивается процент правильно распознанных слов для речевой команды и уменьшается общий объем ресурсов, используемых терминалом мобильной связи.In one embodiment of the present invention, a mobile communication terminal defines an area (or information retrieval range) of databases used as reference information for recognizing a voice command in the form of an area associated with a particular menu or with specific services. Accordingly, the percentage of correctly recognized words for the speech command increases and the total amount of resources used by the mobile communication terminal decreases.

Кроме того, область базы данных, используемая в качестве опорной информации для распознавания речевой команды, может быть задана посредством меню установки режима работы терминала мобильной связи. К тому же, после того, как активирована функция распознавания речи, заданная область может быть применена автоматически.In addition, the database area used as reference information for recognizing a speech command can be set via the operation mode setting menu of the mobile communication terminal. In addition, after the speech recognition function is activated, the predetermined area can be applied automatically.

В приведенном ниже описании предполагают, что заранее заданная область базы данных для распознавания речевой команды содержит информацию, связанную с меню, которое отображают на дисплее 151 в текущий момент времени, или информацию, связанную с подменю одного из меню.In the description below, it is assumed that the predetermined area of the database for recognizing a speech command contains information related to the menu that is currently displayed on the display 151, or information associated with a submenu of one of the menus.

На следующем чертеже Фиг. 5 показана схема последовательности операций способа управления меню для терминала мобильной связи посредством речевой команды согласно одному из вариантов осуществления настоящего изобретения. В приведенном ниже описании также ссылаются на Фиг. 1. Как показано на Фиг. 5, контроллер 180 определяет, была ли активирована функция распознавания речи (операция S101).In the following drawing, FIG. 5 is a flowchart of a menu control method for a mobile communication terminal via a voice command according to one embodiment of the present invention. The following description also refers to FIG. 1. As shown in FIG. 5, the controller 180 determines whether the speech recognition function has been activated (operation S101).

Затем функция распознавания речи может быть активирована пользователем, выбирающим аппаратные клавиши на терминале мобильной связи, или сенсорные клавиши, отображенные на модуле 151 дисплея. Пользователь также может активировать функцию распознавания речи путем манипуляций с конкретными меню, отображенными на дисплее 151. Функция распознавания речи также может быть активирована пользователем, создающим конкретные звуки или звуковые эффекты, посредством радиосигналов ближнего или дальнего действия или посредством информации, исходящей от тела пользователя, например, посредством жеста рукой или телодвижения.Then, the speech recognition function can be activated by a user selecting hardware keys on a mobile communication terminal or touch keys displayed on the display unit 151. The user can also activate the speech recognition function by manipulating the specific menus displayed on the display 151. The speech recognition function can also be activated by the user creating specific sounds or sound effects, via short-range or long-range radio signals or through information coming from the user's body, for example by hand gesture or body movement.

В более подробном изложении конкретными звуками или звуковыми эффектами могут являться, в том числе, звуки удара, уровень громкости которых превышает определенный уровень. Кроме того, конкретные звуки или звуковые эффекты могут быть зарегистрированы с использованием алгоритма распознавания уровня громкости. В дополнение к этому, в предпочтительном варианте алгоритм распознавания уровня громкости является более простым, чем алгоритм распознавания речи и, следовательно, потребляет меньший объем ресурсов терминала мобильной связи. К тому же, алгоритм (или схема) распознавания уровня громкости может быть реализована отдельно от алгоритма или схемы распознавания речи, или он может быть реализован таким образом, что задает некоторые функции алгоритма распознавания речи.More specifically, specific sounds or sound effects may include, but not limited to, shock sounds whose volume level exceeds a certain level. In addition, specific sounds or sound effects can be registered using a volume recognition algorithm. In addition to this, in a preferred embodiment, the volume recognition algorithm is simpler than the speech recognition algorithm and, therefore, consumes less resources of the mobile communication terminal. In addition, the volume level recognition algorithm (or circuit) may be implemented separately from the speech recognition algorithm or circuit, or it may be implemented in such a way that defines some functions of the speech recognition algorithm.

Кроме того, радиосигналы могут быть приняты посредством блока 110 беспроводной связи, а сведения о жестах рукой или о телодвижениях пользователя могут быть получены посредством датчика 140. Таким образом, в одном из вариантов осуществления из настоящего изобретения блок 110 беспроводной связи, устройство 130 ввода пользовательских данных и датчик 140 могут именоваться устройством ввода сигнала. Кроме того, функция распознавания речи также может быть завершена аналогичным способом.In addition, radio signals can be received by the wireless communication unit 110, and information about the hand gestures or the user's movements can be obtained by the sensor 140. Thus, in one embodiment of the present invention, the wireless communication unit 110, the user data input device 130 and sensor 140 may be referred to as a signal input device. In addition, the speech recognition function can also be completed in a similar manner.

Физическая активация функции распознавания речи пользователем является особенно целесообразной потому, что пользователь является более осведомленным о том, что он собирается использовать речевые команды для управления терминалом. То есть, поскольку пользователь сначала должен выполнить физическую манипуляцию с терминалом, то он или она интуитивно осознают, что они собираются ввести речевую команду или инструкцию в терминал и, следовательно, говорят более отчетливо или более медленно, чтобы тем самым активировать конкретную функцию. Таким образом, поскольку пользователь говорит более отчетливо или более медленно, то, например, возрастает вероятность точного распознавания речевых команд. То есть, в одном из вариантов осуществления настоящего изобретения активацию функции распознавания речи выполняют посредством физической манипуляции с клавишей на терминале, а не путем активации функции распознавания речи, говоря в терминал.The physical activation of a user’s speech recognition function is especially useful because the user is more aware that he / she is going to use voice commands to control the terminal. That is, since the user must first perform physical manipulation of the terminal, he or she intuitively realizes that they are going to enter a speech command or instruction into the terminal and, therefore, speak more distinctly or more slowly, thereby activating a specific function. Thus, since the user speaks more clearly or more slowly, then, for example, the likelihood of accurate recognition of voice commands increases. That is, in one embodiment of the present invention, the activation of the speech recognition function is performed by physically manipulating a key on the terminal, and not by activating the speech recognition function, speaking to the terminal.

Затем контроллер 180 может начать или завершить активацию функции распознавания речи на основании того, какое количество раз пользователь прикоснулся к конкретной клавише или к конкретному участку сенсорного экрана, как долго пользователь прикасается к конкретной клавише или к конкретному участку сенсорного экрана, и т.д. Пользователь также может задать то, каким образом контроллер 180 должен активировать функцию распознавания речи с использованием надлежащего пункта меню, что обеспечивается настоящим изобретением. Например, пользователь может выбрать на терминале тот пункт меню, который содержит: 1) установку активации распознавания речи на основании того количества раз, равного X, которое была выбрана клавиша речевого управления, 2) установку активации распознавания речи на основании количества времени, равного X, в течение которого была выбрана клавиша речевого управления, 3) установку активации распознавания речи тогда, когда выбраны клавиши X и Y, и т.д. Затем пользователь может ввести значения X и Y для задания того, каким образом контроллер 180 определяет факт активации функции речевой активации распознавания, с возможностью изменения этих параметров. Таким образом, согласно одному из вариантов осуществления настоящего изобретения пользователь активно задействует функцию речевого управления, имеющуюся в его собственном терминале мобильной связи, что увеличивает вероятность определения контроллером 180 правильной функции, соответствующей речевой команде пользователя и что предоставляет пользователю возможность приспособить функцию речевого управления в соответствии с его или ее потребностями.Then, the controller 180 can start or complete the activation of the speech recognition function based on how many times the user has touched a particular key or a specific area of the touch screen, how long the user has touched a specific key or a specific area of the touch screen, etc. The user can also specify how the controller 180 should activate the speech recognition function using the appropriate menu item, which is provided by the present invention. For example, the user can select on the terminal the menu item that contains: 1) the installation of speech recognition activation based on the number of times equal to X that the speech control key was selected, 2) the installation of speech recognition activation based on the amount of time equal to X, during which the speech control key was selected, 3) the installation of speech recognition activation when the X and Y keys are selected, etc. The user can then enter the X and Y values to specify how the controller 180 determines whether the speech recognition activation function has been activated, with the possibility of changing these parameters. Thus, according to one embodiment of the present invention, the user actively activates the voice control function available in his own mobile communication terminal, which increases the likelihood that the controller 180 determines the correct function corresponding to the user's speech command and that allows the user to adapt the voice control function in accordance with his or her needs.

Контроллер 180 также может сохранять активированное состояние функции распознавания речи в течение времени прикосновения к заданной клавише (к заданным клавишам) или в течение времени нажатия на нее (на них) и прекращать выполнение функции распознавания речи, когда заданная клавиша отпущена (заданные клавиши отпущены). В альтернативном варианте контроллер 180 может сохранять активацию функции распознавания речи в течение заранее заданного промежутка времени после прикосновения к заданной клавише (к заданным клавишам) или нажатия на нее (на них) и прекращать или завершать выполнение функции распознавания речи по истечении заранее заданного промежутка времени. В еще одном варианте осуществления изобретения контроллер 180 может сохранять принятые речевые команды в запоминающем устройстве 160 в течение того времени, когда функция распознавания речи поддерживается в активированном состоянии.The controller 180 can also save the activated state of the speech recognition function during the time it touches a given key (to specified keys) or for the duration of pressing it (on them) and stop the speech recognition function when the specified key is released (specified keys are released). Alternatively, the controller 180 may retain activation of the speech recognition function for a predetermined period of time after touching or pressing a predetermined key (s), and stopping or completing the speech recognition function after a predetermined period of time. In yet another embodiment of the invention, the controller 180 may store the received speech commands in the memory 160 during the time that the speech recognition function is maintained in an activated state.

Кроме того, как показано на Фиг. 5, область базы данных, используемую в качестве опорной информации для распознавания смысла речевой команды, указывают как информацию, связанную с конкретными функциями или меню в терминале (операция S102). Например, заданная область базы данных может быть указана в информации, связанной с меню, отображаемым на дисплее 151 в текущий момент времени, или в информации, связанной с подменю одного из отображенных меню. Кроме того, поскольку задана область базы данных, то увеличивается процент правильно распознанных слов для введенной речевой команды. Примерами областей являются, в том числе, следующие: область электронной почты, область принятых телефонных вызовов, область мультимедийной информации и т.д.In addition, as shown in FIG. 5, the database area used as reference information for recognizing the meaning of the speech command is indicated as information associated with specific functions or menus in the terminal (operation S102). For example, a given area of the database may be indicated in information related to the menu displayed on the display 151 at the current time, or in information related to a submenu of one of the displayed menus. In addition, since the database area is specified, the percentage of correctly recognized words for the entered speech command increases. Examples of areas are, but are not limited to, the following: email area, area of received phone calls, area of multimedia information, etc.

Информация, связанная с подменю, также может быть сконфигурирована в виде данных, находящихся в базе данных. Например, информация может быть сконфигурирована в виде ключевого слова, и множество фрагментов информации может соответствовать одной функции или меню. Кроме того, база данных может представлять собой множество баз данных, соответствующих характерным особенностям информации, и она может храниться в запоминающем устройстве 160.Information associated with the submenu can also be configured as data in the database. For example, information may be configured as a keyword, and a plurality of pieces of information may correspond to a single function or menu. In addition, the database may be a plurality of databases corresponding to the characteristic features of the information, and it may be stored in the storage device 160.

Кроме того, информация в базе (в базах) данных может быть целесообразным образом скорректирована или обновлена посредством процесса обучения. Каждая область соответствующих баз данных также может быть задана в виде области, связанной с функциями или с меню, выводимыми в текущий момент времени, для увеличения процента правильно распознанных слов речевой команды. Эта область также может быть изменена при дальнейшем прохождении по пунктам меню.In addition, the information in the database (s) of the data can be expediently adjusted or updated through the learning process. Each area of the respective databases can also be defined as an area associated with functions or with menus that are displayed at the current time, to increase the percentage of correctly recognized words of the speech command. This area can also be changed with further passage through the menu items.

После того, как была активирована функция распознавания речи (результатом операции S101 является "ДА"), и была задана область (операция S102), контроллер 180 определяет, введена ли пользователем речевая команда (операция S103). В том случае, когда контроллер 180 определяет, что пользователем была введена речевая команда (результатом операции S103 является "ДА"), контроллер 180 анализирует контекст и содержание речевой команды или инструкции, введенной через микрофон 122, на основании конкретной базы данных, выясняя, тем самым, смысл речевой команды (операция S104).After the speech recognition function has been activated (the result of operation S101 is “YES”) and the area has been set (operation S102), the controller 180 determines whether the user has entered the voice command (operation S103). In the case where the controller 180 determines that the user entered the voice command (the result of operation S103 is “YES”), the controller 180 analyzes the context and content of the voice command or instruction entered through the microphone 122 based on a specific database, finding out in fact, the meaning of the speech command (operation S104).

Кроме того, контроллер 180 может определять смысл речевой инструкции или команды на основании модели языка и акустической модели области, к которой осуществляют доступ. В более подробном изложении модель языка связана непосредственно с самими словам, а акустическая модель соответствует тому, каким образом произнесены слова (например, частотным составляющим произносимых слов или фраз). Используя модель языка и акустическую модель совместно с конкретной областью и со сведениями о режиме работы терминала 100 мобильной связи, контроллер 180 может эффективно определять смысл введенных речевых инструкций или команд.In addition, the controller 180 may determine the meaning of the speech instruction or command based on the language model and the acoustic model of the accessed area. In a more detailed presentation, the language model is directly related to the words themselves, and the acoustic model corresponds to how the words are spoken (for example, the frequency components of the spoken words or phrases). Using the language model and the acoustic model in conjunction with a specific area and with information about the operation mode of the mobile communication terminal 100, the controller 180 can effectively determine the meaning of the entered voice instructions or commands.

Кроме того, контроллер 180 может незамедлительно начать выполнение процедуры выяснения смысла введенной речевой команды, когда пользователь прекращает активацию функции распознавания речи, в том случае, когда контроллер 180 сохраняет введенную речевую команду в запоминающем устройстве 160, или может выполнять функцию речевого управления одновременно с вводом речевой команды.In addition, the controller 180 can immediately begin the process of ascertaining the meaning of the entered speech command when the user stops activating the speech recognition function, in the event that the controller 180 stores the entered speech command in the memory 160, or can perform the voice control function simultaneously with the input of the speech teams.

Затем, если речевая команда не была введена полностью (результатом операции S103 является "НЕТ"), то контроллер 180 может по-прежнему выполнять иные функции. Например, если пользователь выполняет иное действие путем прикосновения к пункту меню на экране и т.д. или нажимает клавишу на терминале (результатом операции S109 является "ДА"), то контроллер 180 выполняет соответствующую выбранную функцию (операция S110).Then, if the voice command has not been entered completely (the result of operation S103 is “NO”), then the controller 180 may still perform other functions. For example, if the user performs a different action by touching a menu item on the screen, etc. or presses a key on the terminal (the result of operation S109 is “YES”), then the controller 180 performs the corresponding selected function (operation S110).

Затем, после того, как при операции S104 контроллером 180 определен смысл введенной речевой команды, контроллер 180 выводит результирующее значение смысла команды (операция S105). То есть, результирующее значение может содержать управляющие сигналы для реализации меню, связанных с функциями или услугами, которые соответствуют определенному смыслу, для управления конкретными компонентами терминала мобильной связи и т.д. Результирующее значение также может содержать данные для визуального отображения информации, связанной с распознанной речевой командой.Then, after the meaning of the inputted voice command is determined by the controller 180 in step S104, the controller 180 outputs the resultant meaning of the command (step S105). That is, the resulting value may contain control signals for implementing menus related to functions or services that correspond to a specific meaning, for controlling specific components of a mobile communication terminal, etc. The resulting value may also contain data for visually displaying information associated with the recognized speech command.

Контроллер может также выдать запрос о том, чтобы пользователь подтвердил, что выведенное результирующее значение является точным (операция S106). Например, в том случае, когда процент правильно распознанных слов в речевой команде является низким, или когда определено, что она имеет множество смыслов, контроллер 180 может вывести множество меню, связанных с соответствующими смыслами, а затем реализовать то меню, которое выбрано пользователем (операция S107). Контроллер 180 может также выдать пользователю запрос о том, следует ли реализовать конкретное меню, которому соответствует высокий процент правильно распознанных слов, а затем выполнить или отобразить на дисплее соответствующую функцию или соответствующее меню по выбору пользователя или согласно его ответу.The controller may also request the user to confirm that the output result value is accurate (step S106). For example, in the case when the percentage of correctly recognized words in the speech command is low, or when it is determined that it has many meanings, the controller 180 can display many menus associated with the corresponding meanings, and then implement the menu that is selected by the user (operation S107). The controller 180 may also prompt the user whether to implement a particular menu that corresponds to a high percentage of correctly recognized words, and then execute or display the corresponding function or the corresponding menu of the user's choice or according to his answer.

Кроме того, контроллер 180 также может вывести речевое сообщение, прося пользователя выбрать конкретное меню или пункт меню, например, следующее речевое сообщение: "Do you want to execute a photo album menu? Reply with Yes or No" ("Желаете ли Вы реализовать меню "фотоальбом"? Ответьте "да" или "нет""). Затем контроллер 180 выполняет или не выполняет функцию, соответствующую конкретному меню или пункту меню на основании ответа пользователя. Если пользователь не ответил в течение конкретного промежутка времени (равного, например, пяти секундам), то контроллер 180 может также незамедлительно реализовать конкретное меню или пункт меню. То есть, если ответ от пользователя не получен, то контроллер 180 может автоматически выполнить функцию или реализовать меню, полагая, что отсутствие ответа является положительным ответом.In addition, the controller 180 can also output a voice message asking the user to select a specific menu or menu item, for example, the following voice message: " Do you want to execute a photo album menu? Reply with Yes or No ""photo album? Answer yes or no"). Then, the controller 180 performs or does not perform a function corresponding to a particular menu or menu item based on the user's response. If the user does not respond within a specific period of time (equal to, for example, five seconds), then the controller 180 can also immediately implement a specific menu or menu item. That is, if no response has been received from the user, then the controller 180 may automatically execute the function or implement the menu, assuming that the absence of a response is a positive response.

Кроме того, пользователь может ответить на вопрос из контроллера 180 с использованием его или ее голоса (например "да" или "нет") или посредством других устройств ввода, например аппаратных или программных клавиш, сенсорной панели и т.д. Кроме того, если при операции S106 от пользователя получен отрицательный ответ (результатом операции S106 является "НЕТ"), то есть, если смысл речевой команды точно не определен, то контроллер 180, может выполнить дополнительную операцию обработки ошибок (операция S108).In addition, the user can answer a question from the controller 180 using his or her voice (for example, “yes” or “no”) or through other input devices, such as hardware or soft keys, touch panel, etc. In addition, if a negative response is received from the user in operation S106 (the result of operation S106 is “NO”), that is, if the meaning of the speech command is not precisely determined, then the controller 180 may perform an additional error processing operation (operation S108).

То есть, операция обработки ошибок может быть выполнена путем повторного приема введенной речевой команды или может быть выполнена путем отображения множества меню, для которых процент правильно распознанных слов превышает определенный уровень или множество меню, которые могут считаться имеющими аналогичный смысл. Затем пользователь может выбрать одно из множества меню. К тому же, когда количество функций или меню, для которых процент правильно распознанных слов превышает определенный уровень, является меньшим, чем заранее заданное количество (равное, например, двум), контроллер 180, может автоматически выполнить соответствующую функцию или реализовать соответствующее меню.That is, an error processing operation can be performed by repeatedly receiving an entered speech command, or it can be performed by displaying a plurality of menus for which the percentage of correctly recognized words exceeds a certain level or a plurality of menus that can be considered to have similar meaning. Then the user can select one of the many menus. Moreover, when the number of functions or menus for which the percentage of correctly recognized words exceeds a certain level is less than a predetermined number (equal to, for example, two), the controller 180 can automatically perform the corresponding function or implement the corresponding menu.

На следующем чертеже Фиг. 6A в общем виде показан способ активации функции распознавания речи для терминала мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Как показано на чертеже, пользователь может активировать функцию распознавания речи на экране 410 дисплея путем прикосновения к виртуальной клавише 411. Пользователь может также завершить функцию распознавания речи путем отпускания виртуальной клавиши 411. В более подробном изложении пользователь может активировать функцию распознавания речи путем прикосновения к виртуальной клавише 411 и продолжительного прикосновения к виртуальной клавише 411 или к аппаратной клавише 412 до тех пор, пока речевая команда не будет завершена. То есть, пользователь может отпустить виртуальную клавишу 411 или аппаратную клавишу 412 по завершении речевой команды. Таким образом, контроллер 180 является осведомленным о том, когда следует ввести речевую команду, и о том, когда речевая команда была завершена. Как изложено выше, поскольку пользователь непосредственно вовлечен в определение этого, то точность интерпретации введенной речевой команды повышается.In the following drawing, FIG. 6A is a perspective view illustrating a method for activating a speech recognition function for a mobile communication terminal according to one embodiment of the present invention. As shown in the drawing, the user can activate the speech recognition function on the display screen 410 by touching the virtual key 411. The user can also end the speech recognition function by releasing the virtual key 411. In more detail, the user can activate the speech recognition function by touching the virtual key 411 and a continuous touch on the virtual key 411 or on the hardware key 412 until the speech command is completed. That is, the user can release the virtual key 411 or the hardware key 412 upon completion of the speech command. Thus, the controller 180 is aware of when to enter the voice command and when the voice command was completed. As stated above, since the user is directly involved in determining this, the accuracy of the interpretation of the entered speech command is improved.

Контроллер 180 также может быть выполнен таким образом, что распознает начало функции речевого управления, когда пользователь впервые прикасается к виртуальной клавише 411, и затем распознает тот факт, что речевая команда была завершена, например, когда пользователь прикасается к виртуальной клавиши 411 во второй раз. Также возможны и другие способы выбора. Кроме того, как показано на экране 410 дисплея на Фиг. 6A, вместо того, чтобы использовать виртуальную клавишу 411, включение и отключение речевого управления могут быть выполнены путем манипуляций с аппаратной клавишей 412 на терминале.The controller 180 may also be configured in such a way that it recognizes the start of the voice control function when the user first touches the virtual key 411, and then recognizes the fact that the voice command has been completed, for example, when the user touches the virtual key 411 a second time. Other choices are also possible. Furthermore, as shown on the display screen 410 of FIG. 6A, instead of using the virtual key 411, enabling and disabling voice control can be performed by manipulating the hardware key 412 on the terminal.

Кроме того, виртуальная клавиша 411, показанная на экране 410 дисплея, может представлять собой одиночную виртуальную клавишу, которую пользователь нажимает или отпускает для включения/отключения функции распознавания речи, или может представлять собой кнопку вызова меню, которая при ее выборе создает перечень меню (например: "1. Включить речевое управление и 2. Отключить речевое управление"). Виртуальная клавиша 411 также может отображаться на дисплее, например, во время режима ожидания.In addition, the virtual key 411 shown on the display screen 410 may be a single virtual key that the user presses or releases to enable / disable the speech recognition function, or may be a menu call button that, when selected, creates a menu list (e.g. : "1. Turn on voice control and 2. Turn off voice control"). A virtual key 411 may also be displayed, for example, during standby mode.

В другом примере, и как показано на экране 420 дисплея, пользователь может также включать и отключать функцию распознавания речи путем прикосновения к произвольному месту экрана. Экран 430 дисплея иллюстрирует еще один пример, в котором пользователь включает и отключает функцию распознавания речи путем создания конкретных звуков или звуковых эффектов, уровень громкости которых превышает конкретный уровень. Например, для создания такого звука удара пользователь может хлопнуть в ладоши.In another example, and as shown on the display screen 420, the user can also enable or disable the speech recognition function by touching an arbitrary location on the screen. The display screen 430 illustrates another example in which the user enables and disables the speech recognition function by creating specific sounds or sound effects whose volume level exceeds a specific level. For example, to create such a punch sound, the user can clap his hands.

Таким образом, согласно одному из вариантов осуществления настоящего изобретения функция распознавания речи может быть реализована в двух режимах. Например, функция распознавания речи может быть реализована в первом режиме для обнаружения конкретных звуковых или звуковых эффектов, уровень громкости которых превышает определенный уровень, и во втором режиме для распознавания речевой команды и для определения смысла речевой команды. Если в первом режиме уровень громкости звуков или звуковых эффектов превышает определенный уровень, то активируют второй режим, чтобы посредством него распознать речевую команду.Thus, according to one embodiment of the present invention, the speech recognition function can be implemented in two modes. For example, the speech recognition function can be implemented in the first mode for detecting specific sound or sound effects whose volume level exceeds a certain level, and in the second mode for recognizing a speech command and determining the meaning of a speech command. If in the first mode the volume level of sounds or sound effects exceeds a certain level, then the second mode is activated in order to recognize the speech command through it.

Экран 440 дисплея иллюстрирует еще один способ включения и отключения пользователем функции распознавания речи. В этом примере контроллер 180 выполнен таким образом, что интерпретирует движения тела пользователя для включения и отключения функции речевого управления. Например, и как показано на экране 440 дисплея, контроллер 180 может быть выполнен таким образом, что интерпретирует движение руки пользователя к дисплею как команду активирования функции распознавания речи и движение руки пользователя от дисплея как команду завершения функции речевого управления. Для включения и отключения функции распознавания речи также могут использоваться радиосигналы ближнего или дальнего действия.The display screen 440 illustrates yet another method for enabling and disabling a speech recognition function by a user. In this example, the controller 180 is configured to interpret the movements of the user's body to enable or disable the speech control function. For example, and as shown on the display screen 440, the controller 180 may be configured to interpret the movement of the user's hand toward the display as a command to activate the speech recognition function and the movement of the user's hand from the display as a command to complete the voice control function. To enable or disable the speech recognition function, short-range or long-range radio signals can also be used.

Таким образом, согласно одному из вариантов осуществления настоящего изобретения, поскольку функцию речевого управления включают и отключают, то функцию распознавания речи выполняют с перерывами. То есть, когда функция распознавания речи непрерывно поддерживается в активированном состоянии, то объем потребляемых ресурсов в терминале мобильной связи возрастает по сравнению с вариантом осуществления настоящего изобретения.Thus, according to one embodiment of the present invention, since the voice control function is turned on and off, the speech recognition function is performed intermittently. That is, when the speech recognition function is continuously maintained in an activated state, the amount of resources consumed in the mobile communication terminal increases compared to an embodiment of the present invention.

Кроме того, как изложено выше со ссылкой на Фиг. 5, когда функция распознавания речи активирована, то контроллер 180 задает область конкретной базы данных, используемую в качестве опорной информации для распознавания речевой команды, в виде области, связанной с перечнем меню на дисплее 151. Затем, если из перечня меню выбрано или реализовано конкретное меню, то область базы данных может быть определена в виде информации, связанной с выбранным меню или подменю конкретного меню.In addition, as described above with reference to FIG. 5, when the speech recognition function is activated, the controller 180 sets the region of the specific database used as reference information for recognizing the speech command in the form of the region associated with the menu list on the display 151. Then, if a specific menu is selected or implemented from the menu list , then the database area can be defined as information associated with the selected menu or submenu of a particular menu.

Кроме того, когда конкретное меню выбрано или реализовано посредством речевой команды или сенсорного ввода данных, то контроллер 180 может вывести справочную информацию, связанную с подменю конкретного меню, в виде речевого сообщения или в виде всплывающих окон или подсказок. Например, как показано на Фиг. 6Б, когда пользователь выбирает меню "мультимедийная информация" путем операции прикосновения или речевой команды, контроллер 180 отображает на дисплее информацию, связанную с подменю (которыми являются, например, следующие пункты меню: "широковещательная передача", "камера", "средство просмотра текста", "игра" и т.д.) меню "мультимедийная информация", в качестве справочной информации 441 в виде всплывающей подсказки. В альтернативном варианте контроллер 180 может вывести речевой сигнал 442, содержащий справочную информацию. Затем пользователь может выбрать один из пунктов справочной информации с использованием речевой команды или посредством операции прикосновения.In addition, when a particular menu is selected or implemented by means of a speech command or touch input, the controller 180 may output help information associated with a submenu of a particular menu in the form of a voice message or in the form of pop-up windows or prompts. For example, as shown in FIG. 6B, when the user selects the “multimedia information” menu by a touch operation or a speech command, the controller 180 displays on the display information associated with the submenu (which, for example, are the following menu items: “broadcast”, “camera”, “text viewer "," game ", etc.) of the" multimedia information "menu, as reference information 441 in the form of a tooltip. Alternatively, the controller 180 may output a speech signal 442 containing reference information. Then, the user can select one of the items of reference information using a speech command or through a touch operation.

На Фиг.6В проиллюстрирован вариант осуществления изобретения, в котором пользователь выбирает пункт меню с использованием движений его или ее тела (которым в этом примере являются жест рукой пользователя). В более подробном изложении, когда пользователь придвигает его или ее палец ближе к пункту меню 443, контроллер 180 выводит на экран дисплея подменю 444, связанные с меню 443. Контроллер 180 может распознавать информацию о движении тела пользователя, например, посредством датчика 140. Кроме того, выведенная на экран дисплея справочная информация может быть отображена таким образом, что имеет прозрачность или яркость, регулируемую в соответствии с расстоянием до пользователя. То есть, по мере приближения руки пользователя элементы, выведенные на экран дисплея, могут быть дополнительно выделены яркостью.FIG. 6B illustrates an embodiment of the invention in which the user selects a menu item using the movements of his or her body (which in this example is the user's hand gesture). In more detail, when a user moves his or her finger closer to menu item 443, the controller 180 displays submenus 444 associated with the menu 443. The controller 180 can recognize information about the movement of the user's body, for example, through the sensor 140. In addition The help information displayed on the screen can be displayed in such a way that it has transparency or brightness, adjustable in accordance with the distance to the user. That is, as the user's hand approaches, the elements displayed on the display screen can be additionally highlighted.

Как изложено выше, контроллер 180 может быть выполнен таким образом, что определяет факт включения и отключения функции распознавания речи на основании множества различных способов. Например, пользователь может выбрать/привести в действие виртуальные или аппаратные клавиши, прикоснуться к сенсорному экрану в произвольном месте и т.д. Контроллер 180 также может сохранять активацию функции распознавания речи в течение заранее заданного промежутка времени, а затем автоматически завершить активизацию в конце заранее заданного промежутка времени. Кроме того, контроллер 180 может сохранять активацию только лишь в течение времени нажатия на конкретную клавишу или выполнения операции прикосновения, а затем автоматически завершает активацию при отпускании этого средства ввода данных. Контроллер 180 может также завершать процесс активации, когда речевая команда больше не поступает на вход в течение определенного количества времени.As described above, the controller 180 may be configured in such a way that determines whether the speech recognition function is turned on and off based on a variety of different methods. For example, the user can select / activate virtual or hardware keys, touch the touch screen in any place, etc. The controller 180 may also maintain activation of the speech recognition function for a predetermined period of time, and then automatically complete the activation at the end of the predetermined period of time. In addition, the controller 180 can maintain activation only for the duration of pressing a particular key or performing a touch operation, and then automatically completes the activation when releasing this data input means. The controller 180 may also complete the activation process when the speech command no longer arrives at the input within a certain amount of time.

На следующем чертеже Фиг. 7A показана схема последовательности операций способа распознавания речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Со ссылкой на Фиг. 7A, когда функция распознавания речи активирована, то контроллер 180 задает область базы данных, которая может использоваться в качестве опорной информации для распознавания речевой команды, в виде области, связанной с меню, отображенному на дисплее 151, или к подменю этого меню (операция S201). Пользователь также вводит речевую команду (операция S202) либо с использованием точного наименования меню, либо с использованием естественного языка (например, разговорного английского языка).In the following drawing, FIG. 7A is a flowchart of a voice command recognition method in a mobile communication terminal according to one embodiment of the present invention. With reference to FIG. 7A, when the speech recognition function is activated, the controller 180 sets the database area, which can be used as reference information for recognizing the speech command, in the form of the area associated with the menu displayed on the display 151, or to a submenu of this menu (operation S201) . The user also enters a speech command (operation S202) either using the exact menu name or using natural language (for example, spoken English).

Затем контроллер 180 сохраняет входную речевую команду в запоминающем устройстве 160 (операция S203). Кроме того, когда речевая команда введена таким образом, что соответствует заданной области, контроллер 180 анализирует контекст и содержание речевой команды на основании заданной области с использованием алгоритма распознавания речи. Кроме того, речевая команда может быть преобразована в текстовую информацию для ее анализа (операция S204), а затем запомнена в конкретной базе данных в запоминающем устройстве 160. Однако операция преобразования речевой команды в текстовую информацию может быть опущена.Then, the controller 180 stores the input speech command in the storage device 160 (operation S203). Furthermore, when the speech command is inputted in such a way that it corresponds to a predetermined area, the controller 180 analyzes the context and contents of the speech command based on the predetermined area using a speech recognition algorithm. In addition, the speech command can be converted into text information for analysis (step S204), and then stored in a specific database in the storage device 160. However, the operation of converting the speech command into text information can be omitted.

Затем для анализа контекста и содержания речевой команды контроллер 180 обнаруживает характерное слово или ключевое слово речевой команды (операция S205). На основании обнаруженных слов или ключевых слов контроллер 180 анализирует контекст и содержание речевой команды и определяет или выясняет смысл речевой команды путем обращения к информации, хранящейся в конкретной базе данных (операция S206).Then, to analyze the context and contents of the speech command, the controller 180 detects a characteristic word or keyword of the speech command (operation S205). Based on the detected words or keywords, the controller 180 analyzes the context and content of the speech command and determines or ascertains the meaning of the speech command by accessing information stored in a specific database (operation S206).

Кроме того, как изложено выше, база данных, используемая в качестве опорной информации, содержит заданную область, и выполняют функции или реализуют меню, соответствующие смыслу речевой команды, выясненному на основании базы данных (операция S207). Кроме того, поскольку базу данных для распознавания речи задают для каждой информации, связанной с конкретным меню, то увеличивается процент правильно распознанных слов и скорость распознавания речевой команды, а объем ресурсов, используемый в терминале, уменьшается. Кроме того, процент правильно распознанных слов указывает степень соответствия с наименованием, заранее заданным для конкретного меню.In addition, as described above, the database used as reference information contains a predetermined area, and perform functions or implement menus that correspond to the meaning of the speech command found on the basis of the database (operation S207). In addition, since the speech recognition database is set for each information associated with a particular menu, the percentage of correctly recognized words and the recognition speed of the speech command are increased, and the amount of resources used in the terminal is reduced. In addition, the percentage of correctly recognized words indicates the degree of correspondence with the name predefined for a particular menu.

О проценте правильно распознанных слов для введенной речевой команды также можно судить по объему информации, связанной с конкретными функциями или меню, которые вызывают посредством речевой команды. Следовательно, процент правильно распознанных слов для введенной речевой команды увеличивается в том случае, когда информация в точности соответствует конкретной функции или меню (например, наименованию меню), содержащейся в речевой команде.The percentage of correctly recognized words for the entered speech command can also be judged by the amount of information associated with specific functions or menus that are invoked through the speech command. Consequently, the percentage of correctly recognized words for the entered speech command increases when the information exactly corresponds to the specific function or menu (for example, the name of the menu) contained in the speech command.

В более подробном изложении, на Фиг. 7Б в общем виде показан способ распознавания речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Как показано на Фиг. 7Б, пользователь вводит на естественном языке речевую команду, состоящую из шести слов: "I want to see my pictures" ("Я хочу увидеть мои изображения"). В этом примере о проценте правильно распознанных слов можно судить на основании количества значащих слов (которыми являются, например, "see" ("увидеть"), "pictures" ("изображения")), связанных с конкретным меню (например, с меню "фотоальбом"). Кроме того, контроллер 180 может определять, являются ли слова, содержащиеся в речевой команде, значащими словами, связанными с конкретной функцией или с конкретным меню, на основании информации, хранящейся в базе данных. Например, незначащими словами, содержащимися в речевой команде естественного языка, которые не связаны с конкретным меню, могут являться следующие: подлежащее "I" ("я"), предлог "to" и притяжательное местоимение "my" ("мои").In more detail, FIG. 7B is a perspective view illustrating a method for recognizing a voice command in a mobile communication terminal according to one embodiment of the present invention. As shown in FIG. 7B, the user enters in a natural language a speech command consisting of six words: " I want to see my pictures "(" I want to see my pictures "). In this example, the percentage of correctly recognized words can be judged based on the number of significant words (which are, for example, "see" ( "see"), "pictures" ( "Image")) associated with a particular menu (for example, menu " photo album "). In addition, the controller 180 may determine whether the words contained in the speech command are meaningful words associated with a particular function or a specific menu based on information stored in the database. For example, insignificant words contained in a natural language speech command that are not related to a particular menu may include the following: the subject " I "("I"), the preposition "to " and the possessive pronoun " my "("mine").

К тому же, естественным языком является язык, обычно используемый людьми, и его концепция является противоположной концепции искусственного языка. Кроме того, обработка данных естественного языка может быть осуществлена с использованием алгоритма обработки естественного языка. Естественный язык может содержать или может не содержать точное наименование, связанное с конкретным меню, что иногда вызывает трудности при совершенно точном распознавании речевой команды. Следовательно, согласно варианту осуществления настоящего изобретения, когда процент правильно распознанных слов речевой команды превышает определенный уровень (равный, например, 80%), то контроллер 180 принимает решение о том, что распознавание является точным.In addition, natural language is the language commonly used by people, and its concept is the opposite of the concept of artificial language. In addition, natural language data processing can be carried out using a natural language processing algorithm. Natural language may or may not contain the exact name associated with a particular menu, which sometimes makes it difficult to accurately recognize a speech command. Therefore, according to an embodiment of the present invention, when the percentage of correctly recognized words of the speech command exceeds a certain level (equal to, for example, 80%), then the controller 180 decides that the recognition is accurate.

Кроме того, когда контроллер 180 полагает, что множество меню имеют аналогичный смысл, то контроллер 180 выводит на экран дисплея множество меню, и пользователь может выбрать одно из выведенных на экран дисплея меню для выполнения его функций. Кроме того, меню, которому соответствует относительно более высокий процент правильно распознанных слов, может быть отображено первым по порядку или может быть отображено более отчетливо по сравнению с другими меню.In addition, when the controller 180 believes that the plurality of menus have the same meaning, the controller 180 displays a plurality of menus on the display screen, and the user can select one of the displayed menu menus to perform its functions. In addition, a menu that corresponds to a relatively higher percentage of correctly recognized words can be displayed first in order or can be displayed more distinctly than other menus.

Например, на Фиг. 8 в общем виде показан способ отображения меню в соответствии с процентом правильно распознанных слов при распознавании речи в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Как показано на Фиг. 8, пиктограмму меню, которому соответствует более высокий процент правильно распознанных слов, отображают в центральной части экрана 510 дисплея, или же она может быть отображена имеющей больший размер или более темный цвет, как показано на экране 520 дисплея. Пиктограмма меню, которому соответствует более высокий процент правильно распознанных слов, может быть также отображена первой, а далее следуют, по порядку или последовательно, меню с более низкими процентами правильно распознанных слов.For example, in FIG. 8, a general view illustrates a method for displaying a menu in accordance with the percentage of correctly recognized words in speech recognition in a mobile communication terminal according to one embodiment of the present invention. As shown in FIG. 8, a menu icon corresponding to a higher percentage of correctly recognized words is displayed in the center of the display screen 510, or it can be displayed with a larger size or darker color, as shown on the display screen 520. The menu icon, which corresponds to a higher percentage of correctly recognized words, can also be displayed first, and then follow, in order or sequentially, a menu with lower percentages of correctly recognized words.

Кроме того, контроллер 180 может отчетливо отображать множество меню путем изменения, по меньшей мере, одного из следующих параметров: размера, положения, цвета, яркости меню, или путем выделения их яркостью в порядке более высокого процента правильно распознанных слов. Прозрачность меню также может быть надлежащим образом изменена или отрегулирована.In addition, the controller 180 can clearly display a plurality of menus by changing at least one of the following parameters: size, position, color, brightness of the menu, or by highlighting them with brightness in the order of a higher percentage of correctly recognized words. The transparency of the menu can also be appropriately changed or adjusted.

Кроме того, как показано в нижней части Фиг.8, меню, имеющее более высокую частоту его выбора пользователем, может быть скорректировано или задано таким образом, чтобы оно соответствовало проценту правильно распознанных слов. То есть, контроллер 180 запоминает предысторию выборов, сделанных пользователем, (операция S301) и выполняет процесс обучения (операция S302), чтобы посредством него скорректировать конкретный процент правильно распознанных слов для того пункта меню, который пользователь выбирал более часто, чем другие пункты меню (операция S303). Таким образом, к проценту правильно распознанных слов для меню могут быть применены данные о том количестве раз, которое часто используемое меню выбиралось пользователем. Следовательно, речевая команда, введенная одним и тем же или аналогичным образом по произношению или по содержанию, может иметь различный процент правильно распознанных слов в соответствии с тем, сколько раз пользователем выбрано конкретное меню.In addition, as shown in the lower part of Fig. 8, a menu having a higher frequency of its selection by the user can be adjusted or set so that it corresponds to the percentage of correctly recognized words. That is, the controller 180 remembers the history of the choices made by the user (operation S301) and performs the learning process (operation S302) so that it can correct a specific percentage of correctly recognized words for that menu item that the user selects more often than other menu items ( operation S303). Thus, data on the number of times that a frequently used menu was selected by the user can be applied to the percentage of correctly recognized words for the menu. Consequently, a speech command entered in the same or similar way in pronunciation or content can have a different percentage of correctly recognized words in accordance with how many times the user selects a particular menu.

Кроме того, контроллер 180 может также запоминать момент времени, в который пользователь выполняет конкретные функции. Например, пользователь может проверить сообщения электронной почты или пропущенные сообщения каждый раз после того, когда он проснулся, по дням недели с понедельника по пятницу. Эта информация о времени также может быть использована для повышения процента правильно распознанных слов. Сведения о режиме работы терминала (например, режим ожидания и т.д.) также могут быть использованы для повышения процента правильно распознанных слов. Например, пользователь может проверять сообщения электронной почты или пропущенные сообщения при первом включении его терминала мобильной связи, при открывании терминала из закрытого положения и т.д.In addition, the controller 180 may also memorize the point in time at which the user performs specific functions. For example, a user can check emails or missed messages each time he wakes up, on the days of the week from Monday to Friday. This time information can also be used to increase the percentage of correctly recognized words. Information about the operating mode of the terminal (for example, standby mode, etc.) can also be used to increase the percentage of correctly recognized words. For example, a user can check e-mail messages or missed messages when they first turn on their mobile terminal, when opening the terminal from a closed position, etc.

На следующем чертеже Фиг.9 в общем виде показан способ распознавания речевой команды в терминале мобильной связи согласно другому варианту осуществления настоящего изобретения. Как показано на Фиг. 9, пользователь активирует функцию распознавания речи и вводит речевую команду "I want to see my pictures" ("я хочу увидеть мои изображения"). Затем контроллер 180 определяет область базы данных для распознавания речевой команды в виде области, связанной с подменю, отображенными на дисплее. Затем контроллер 180 интерпретирует речевую команду (операция S401) и в этом примере выводит на экран дисплея множество меню, для которых вероятность превышает конкретное значение (равное, например, 80%) (операция S402). Как показано на экране 610 дисплея на Фиг. 9, контроллер отображает четыре меню "мультимедийная информация".The following drawing, Fig. 9 in General view shows a method for recognizing a voice command in a mobile communication terminal according to another embodiment of the present invention. As shown in FIG. 9, the user activates the speech recognition function and enters the voice command “ I want to see my pictures ”. Then, the controller 180 determines the database area for recognizing the speech command in the form of the area associated with the submenus displayed on the display. Then, the controller 180 interprets the speech command (operation S401) and in this example displays a plurality of menus for which the probability exceeds a specific value (equal to, for example, 80%) (operation S402). As shown on the display screen 610 in FIG. 9, the controller displays four “multimedia information” menus.

Контроллер 180 также отчетливо отображает меню, имеющее самую высокую вероятность (которым в этом примере является, например, пункт 621 меню "фотоальбом"). Затем пользователь может выбрать любое из отображенных меню для выполнения функции, соответствующей выбранному меню. В примере, показанном на Фиг. 9, пользователь выбирает пункт 621 меню "фотоальбом", и контроллер 180 выводит на экран дисплея изображения из выбранного фотоальбома, что показано на экране 620 дисплея.The controller 180 also clearly displays the menu having the highest probability (which in this example is, for example, item 621 of the "photo album" menu). Then the user can select any of the displayed menus to perform the function corresponding to the selected menu. In the example shown in FIG. 9, the user selects “photo album” menu item 621, and the controller 180 displays images from the selected photo album on the display screen, as shown on the display screen 620.

К тому же, при операции S402, показанной в нижней части Фиг. 9, контроллер 180 может также сразу же выполнить функцию в том случае, когда определено, что только одно меню превышает заранее заданный вероятностный показатель. То есть, в том случае, когда определено, что пункт 621 меню "фотоальбом" является единственным меню, имеющим более высокий процент правильно распознанных слов или более высокую вероятность, чем заранее заданное пороговое значение, контроллер 180 сразу же выводит на экран дисплея изображения из фотоальбома так, как показано на экране 620 дисплея, без необходимости выбора пользователем пункта 621 меню "фотоальбом". Кроме того, даже несмотря на то, что меню имеет точное наименование, например, "фотоальбом", запоминающее устройство 160 может запоминать множество данных, связанных с меню, таких как, например, "фотография, изображение, альбом".Moreover, in operation S402 shown at the bottom of FIG. 9, the controller 180 can also immediately execute a function when it is determined that only one menu exceeds a predetermined probability indicator. That is, when it is determined that item 621 of the photo album menu is the only menu having a higher percentage of correctly recognized words or a higher probability than a predetermined threshold value, the controller 180 immediately displays images from the photo album on the display screen as shown on the display screen 620, without the need for the user to select item 621 of the "photo album" menu. In addition, even though the menu has an exact name, for example, “photo album”, the storage device 160 may store a lot of data related to the menu, such as, for example, “photo, image, album”.

Кроме того, как изложено выше со ссылкой на Фиг.6Б, контроллер 180 также может выводить справочную информацию для пользователя при выборе или выполнении конкретного меню посредством речевой команды или сенсорного ввода данных в соответствии с состоянием или режимом работы (например, режимом указания функции распознавания речи). Кроме того, пользователь может установить режим работы для вывода справочной информации с использованием соответствующих пунктов меню, предусмотренных в меню "установка режима работы". Соответственно, пользователь может работать с терминалом из настоящего изобретения, не нуждаясь в высоком уровне квалификации или не имея его. То есть, многие люди старшего возраста могут не иметь достаточного опыта в управлении множеством различных меню, предусмотренных в терминале. Однако в терминале из настоящего изобретения пользователь, который, как правило, не знаком со сложностями интерфейсов пользователя, предусмотренных в терминале, может легко работать с терминалом мобильной связи.In addition, as described above with reference to Fig.6B, the controller 180 can also display help information for the user when selecting or executing a specific menu through a speech command or touch data input in accordance with the state or mode of operation (for example, the mode of indicating the speech recognition function ) In addition, the user can set the operation mode for displaying reference information using the corresponding menu items provided in the "setting operation mode" menu. Accordingly, the user can work with the terminal of the present invention without the need for a high level of qualification or without it. That is, many older people may not have enough experience in managing the many different menus provided in the terminal. However, in the terminal of the present invention, a user who, as a rule, is not familiar with the complexities of the user interfaces provided in the terminal, can easily work with a mobile communication terminal.

Кроме того, когда контроллер 180 распознает, что речевая команда имеет множество смыслов (то есть, когда речевая команда на естественном языке не содержит точного наименования меню, например, когда меню содержится в категории "мультимедийная информация", но не имеет точного наименования из числа слов "камера", "фотоальбом" и "видео"), контроллер 180 выводит на экран дисплея множество меню, для которых процент правильно распознанных слов является большим, чем определенное значение (равное, например, 80%).In addition, when the controller 180 recognizes that the speech command has many meanings (that is, when the natural language speech command does not contain an exact menu name, for example, when the menu is contained in the "multimedia information" category but does not have an exact word name “camera”, “photo album” and “video”), the controller 180 displays a plurality of menus on the screen for which the percentage of correctly recognized words is greater than a certain value (equal, for example, 80%).

На следующем чертеже Фиг. 10 в общем виде показано множество баз данных, используемых контроллером 180 для распознавания речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. В этом варианте осуществления изобретения в этих базах данных хранят информацию, которую контроллер 180 использует для выяснения смысла речевой команды, и ими может являться любое количество баз данных в соответствии с характерными особенностями информации. Кроме того, соответствующие базы данных, сконфигурированные в соответствии с характерными особенностями информации, могут быть обновлены посредством непрерывного процесса обучения под управлением контроллера 180.In the following drawing, FIG. 10 generally shows a plurality of databases used by the controller 180 to recognize a voice command in a mobile communication terminal according to one embodiment of the present invention. In this embodiment of the invention, these databases store information that the controller 180 uses to determine the meaning of the speech command, and they can be any number of databases in accordance with the characteristic features of the information. In addition, the respective databases configured in accordance with the characteristic features of the information can be updated through a continuous learning process under the control of the controller 180.

Например, в процессе обучения предпринимают попытку сопоставления речи пользователя с соответствующим словом. Например, когда корейское слово "Saeng-il" (обозначающее "день рождения"), произнесенное пользователем, неправильно понято как слово "Saeng-hwal" (обозначающее "жизнь"), то пользователь исправляет это слово на "Saeng-il". Соответственно, то же самое произношение, которое будет впоследствии введено пользователем, должно быть распознано как "Saeng-il".For example, in the learning process, an attempt is made to compare the user's speech with the corresponding word. For example, when the Korean word "Saeng-il" (meaning "birthday") spoken by the user is not correctly understood as the word "Saeng-hwal" (meaning "life"), the user corrects this word to "Saeng-il". Accordingly, the same pronunciation that will subsequently be entered by the user must be recognized as “Saeng-il”.

Как показано на Фиг.10, соответствующие базы данных согласно характерным особенностям информации содержат первую базу 161 данных, вторую базу 162 данных, третью базу 163 данных, и четвертую базу 164 данных. В этом варианте осуществления изобретения в первой базе 161 данных хранят речевую информацию для распознавания речи, введенной через микрофон, в единицах фонем или слогов, или морфем. Во второй базе 162 данных хранят информацию (например, о грамматике, о точности произношения, о структуре предложения и т.д.) для выяснения смысла речевой команды в целом на основании распознанной речевой информации. В третьей базе 163 данных хранят информацию, связанную с меню для функций или услуг терминала мобильной связи, а в четвертой базе 164 данных хранят сообщение или речевую информацию, которая должна быть выведена из терминала мобильной связи для того, чтобы получить подтверждение пользователя о выясненном смысле речевой команды.As shown in FIG. 10, respective databases according to characteristic features of the information comprise a first database 161, a second database 162, a third database 163, and a fourth database 164. In this embodiment, the first database 161 stores speech information for recognizing speech inputted through a microphone in units of phonemes or syllables, or morphemes. In the second database 162 data is stored information (for example, about grammar, pronunciation accuracy, sentence structure, etc.) to clarify the meaning of the speech command as a whole based on the recognized speech information. The third database 163 stores information related to the menu for the functions or services of the mobile communication terminal, and the fourth database 164 stores the message or voice information that must be output from the mobile terminal in order to obtain user confirmation of the clarified sense of speech teams.

Кроме того, третья база 163 данных может быть определена как информация, связанная с конкретной категорией меню, в соответствии с областью, заранее заданной для распознавания речевой команды. К тому же, в соответствующей базе данных может храниться звуковая информация (информация о произношении) и фонемы, слоги, морфемы, слова, ключевые слова или предложения, соответствующие информации о произношении. Соответственно, контроллер 180 может определять или выяснять смысл речевой команды с использованием, по меньшей мере, одной из множества баз 161-164 данных, и выполнять меню, связанные с теми функциями или услугами, которые соответствуют выясненному смыслу речевой команды.In addition, the third database 163 can be defined as information associated with a particular menu category, in accordance with an area predefined for recognizing a speech command. In addition, sound information (pronunciation information) and phonemes, syllables, morphemes, words, keywords or sentences corresponding to pronunciation information can be stored in the appropriate database. Accordingly, the controller 180 can determine or ascertain the meaning of the speech command using at least one of the plurality of data bases 161-164, and execute menus associated with those functions or services that correspond to the clarified meaning of the speech command.

На следующем чертеже Фиг.11 в общем виде показано состояние, в котором выполняют функцию распознавания речи в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Как показано на чертеже, когда контроллер 180 выполняет функцию распознавания речи, то контроллер 180 отображает на дисплее особый указатель или пиктограмму 500, которая сообщает пользователю о том, что выполняется функция распознавания речи. Контроллер 180 также может вывести звук или сообщение для уведомления пользователя о том, что выполняется функция распознавания речи.The following drawing of FIG. 11 shows in a general view a state in which a voice recognition function is performed in a mobile communication terminal according to one embodiment of the present invention. As shown in the drawing, when the controller 180 performs the speech recognition function, the controller 180 displays a special indicator or icon 500 on the display that informs the user that the speech recognition function is being performed. The controller 180 may also output a sound or message to notify the user that a speech recognition function is being performed.

Кроме того, описанные выше варианты осуществления изобретения относятся к распознаванию речевой команды пользователя. Однако настоящее изобретение также применимо и для того случая, когда пользователь выполняет дополнительную функцию ввода данных во время распознавания речевой команды. Например, функции распознавания речи и сенсорного ввода данных, распознавания речи и ввода данных посредством клавиш, или распознавания речи и сенсорного ввода данных/ввода данных посредством клавиш могут выполняться одновременно.In addition, the above-described embodiments of the invention relate to the recognition of a user's voice command. However, the present invention is also applicable to the case when the user performs an additional function of data input during recognition of a speech command. For example, the functions of speech recognition and touch data input, speech recognition and data input via keys, or speech recognition and touch data input / data input by keys can be performed simultaneously.

Кроме того, контроллер 180 может не допускать выполнения функции распознавания речи в конкретном режиме или меню, или в конкретном режиме работы. Кроме того, звуковая информация (например, звуковое уведомление или инструктивная информация) или видеоинформация (например, индикатор 500 на Фиг. 11), указывающая, что в текущий момент применяется функция распознавания речи, может быть воспроизведена в режиме распознавания речи, в меню или в указателе режима работы. К тому же, информация о том, что в текущий момент применяется функция распознавания речи, может быть предоставлена пользователю путем вывода справочной информации.In addition, the controller 180 may not allow the speech recognition function to execute in a particular mode or menu, or in a particular mode of operation. In addition, audio information (eg, audio notification or instructional information) or video information (eg, indicator 500 in FIG. 11) indicating that a speech recognition function is currently being applied can be played back in a speech recognition mode, in a menu or in operating mode indicator. In addition, information that the speech recognition function is currently being applied can be provided to the user by displaying reference information.

На Фиг.12 в общем виде показан способ обработки подкоманд, связанных с конкретным меню, в терминале мобильной связи посредством речевой команды согласно одному из вариантов осуществления настоящего изобретения. В этом варианте осуществления изобретения предполагают, что функция распознавания речи уже активирована пользователем.12, a general view illustrates a method for processing subcommands associated with a particular menu in a mobile communication terminal by means of a voice command according to one embodiment of the present invention. In this embodiment of the invention, it is assumed that the speech recognition function is already activated by the user.

Затем, как показано в левой части Фиг. 12, пользователь прикасается к пиктограмме "будильник/календарь", и контроллер 180 выводит на экран дисплея всплывающее справочное меню, в котором перечислены имеющиеся функции (например: 1) будильник, 2) календарный план, 3) перечень запланированных дел (To Do) и 4) памятка). Затем пользователь вводит речевую команду "перечень запланированных дел", и контроллер 180 интерпретирует смысл речевой команды и отображает множество меню, которые определены как соответствующие речевой команде, что показано на экране 611 дисплея.Then, as shown on the left side of FIG. 12, the user touches the alarm / calendar icon, and the controller 180 displays a pop-up help menu listing available functions (for example: 1) an alarm clock, 2) a schedule, 3) a list of scheduled tasks (To Do) and 4) memo). Then, the user enters the “list of scheduled tasks” speech command, and the controller 180 interprets the meaning of the speech command and displays a plurality of menus that are defined as corresponding to the speech command, as shown on the display screen 611.

То есть, как показано на экране 611 дисплея, контроллер 180 отображает на дисплее четыре события, связанных с функцией "перечень запланированных дел". Затем пользователь вводит речевую команду "выбрать номер 2", и контроллер 180 выбирает вариант номер 2 (Встреча 1). Затем пользователь вводит речевую команду "я хочу удалить это". Затем контроллер 180 отображает на дисплее всплывающее меню 613, в котором пользователя просят подтвердить удаление записи путем ответа "да" или "нет". Затем пользователь вводит речевую команду "да", и контроллер 180 затем удаляет запись, что показано на экране 616 дисплея из Фиг. 12.That is, as shown on the display screen 611, the controller 180 displays on the display four events related to the Scheduled To Do List function. Then the user enters the voice command "select number 2", and the controller 180 selects option number 2 (Meeting 1). Then the user enters the speech command "I want to delete this." Then, the controller 180 displays a pop-up menu 613, in which the user is asked to confirm the deletion of the record by the answer “yes” or “no”. Then, the user enters the yes command, and the controller 180 then deletes the recording, as shown on the display screen 616 of FIG. 12.

Кроме того, если ответ от пользователя не поступил, то контроллер 180 может автоматически выполнить подкоманды, расценивая отсутствие ответа как положительный ответ. Контроллер 180 также выводит речевую команду 615, уведомляя пользователя о том, что элемент данных был удален. К тому же, вместо выбора первого меню "будильник/календарь" путем прикосновения к меню пользователь мог бы вместо этого выдать другую речевую команду. К тому же, когда пользователь первый раз выбирает пиктограмму "будильник/календарь", то контроллер 180 может выдать речевое сообщение 617 для уведомления пользователя о том, что будет выполняться соответствующая задача.In addition, if no response has been received from the user, then the controller 180 may automatically execute the subcommands, regarding the absence of a response as a positive response. The controller 180 also outputs a speech command 615, notifying the user that the data item has been deleted. In addition, instead of selecting the first menu "alarm / calendar" by touching the menu, the user could instead issue another voice command. In addition, when the user first selects the alarm / calendar icon, the controller 180 may issue a voice message 617 to notify the user that the corresponding task will be performed.

Кроме того, как изложено выше, когда выполняется конкретное меню, то контроллер 180 задает область базы данных, используемую в качестве опорной информации для распознавания речевой команды, в виде области, связанной с выполняемым меню. То есть, эта область содержит информацию, связанную с подменю конкретного меню, или информацию, связанную с подкомандами, которые могут быть выполнены из конкретного меню.In addition, as described above, when a specific menu is executed, the controller 180 sets the database area used as reference information for recognizing a speech command in the form of an area associated with the menu being executed. That is, this area contains information associated with a submenu of a particular menu, or information associated with subcommands that can be executed from a specific menu.

На следующем чертеже Фиг. 13 в общем виде показан способ поиска по схеме метрополитена посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. В этом примере снова предполагают, что функция распознавания речи уже была активирована пользователем. Кроме того, также предполагают, что контроллер 180 реализует конкретное меню, связанное с отображением на дисплее схемы метрополитена, на основании речевой команды пользователя или манипуляции с использованием других устройств ввода.In the following drawing, FIG. 13 shows, in general terms, a metro search method using a voice command in a mobile communication terminal according to an embodiment of the present invention. This example again assumes that the speech recognition function has already been activated by the user. In addition, it is also contemplated that controller 180 implements a specific menu related to displaying a subway circuit based on a user's voice command or manipulation using other input devices.

То есть, контроллер 180 выводит на экран дисплея схему метрополитена, что показано на экране 621 дисплея. Как изложено выше, когда выполняется конкретное меню, контроллер 180 может задать область базы данных, используемую в качестве опорной информации, для распознавания речевой команды в виде области, связанной с выполняемым меню (например, названия станций метро, информацию о расстоянии (времени) между каждыми станциями). Кроме того, эта область содержит информацию, связанную с подменю конкретного меню, или информацию, связанную с подкомандами, которые могут быть выполнены из конкретного меню.That is, the controller 180 displays a subway circuit on the display screen, as shown on the display screen 621. As described above, when a specific menu is executed, the controller 180 can set the area of the database used as reference information for recognizing the voice command in the form of the area associated with the menu (for example, names of metro stations, information about the distance (time) between each stations). In addition, this area contains information associated with a submenu of a particular menu, or information associated with subcommands that can be executed from a specific menu.

Затем контроллер 180 выдает речевую команду 626 с просьбой о том, чтобы пользователь ввел данные о начальной станции и о конечной станции. Затем пользователь выбирает две станции на экране 621 дисплея. То есть, контроллер 180 получает сведения о двух станциях 622 и 623 из схемы метрополитена, выведенной на экран дисплея, для которых пользователь желает знать количество времени, затрачиваемое на поездку между этими двумя станциями. Пользователь может выбрать две станции с использованием речевой команды тогда, когда терминал дает соответствующую подсказку (то есть, "произнесите название начальной и конечной станций), или путем прикосновения к обозначениям двух станций 622 и 623 на экране. Также возможны и другие способы выбора двух станций. После того, как пользователем выбраны две станции, контроллер 180 выводит через громкоговоритель речевое сообщение 624, содержащее две выбранные станции (то есть, были выбраны следующие станции: станция ISU и станция Seoul station). К тому же, вместо вывода речевого сообщения контроллер 180 может вместо этого вывести на экран дисплея всплывающие окна с запрошенной или введенной информацией.Then, the controller 180 issues a voice command 626 asking the user to enter information about the start station and the end station. Then, the user selects two stations on the display screen 621. That is, the controller 180 receives information about the two stations 622 and 623 from the subway circuitry displayed on the screen, for which the user wants to know the amount of time spent traveling between the two stations. The user can select two stations using a speech command when the terminal gives the appropriate prompt (that is, "say the name of the start and end stations), or by touching the designations of the two stations 622 and 623 on the screen. Other ways to select two stations are also possible. After the two stations have been selected by the user, the controller 180 outputs through the loudspeaker a voice message 624 containing the two selected stations (that is, the following stations were selected: ISU station and Seoul station). hundred and output a voice message, the controller 180 may instead be displayed on the display screen pop-ups requested or entered information.

Кроме того, когда выбраны две станции, то контроллер 180 также может вывести справочную информацию. Например, как показано на экране 621 дисплея на Фиг. 13, контроллер выводит на экран дисплея всплывающее окно справочной информации, в котором перечислены названия станций и цвет линии метрополитена. Затем пользователь выдает запрос на получение информации о количестве времени, которое займет поездка между двумя выбранными станциями. Пользователь может выдать запрос на получение этой информации путем ввода следующей речевой команды; "I want to know how long it will take from ISU to Seoul Station" ("Я хочу знать, сколько времени займет поездка от станции ISU до станции Seoul Station").In addition, when two stations are selected, the controller 180 can also output help information. For example, as shown on the display screen 621 of FIG. 13, the controller displays a pop-up help information window listing station names and the color of the subway line. The user then issues a request for information about the amount of time that the trip between the two selected stations will take. The user can request this information by entering the next voice command; " I want to know how long it will take from ISU to Seoul Station "(" I want to know how long it will take to travel from ISU to Seoul Station").

Затем контроллер 180 обнаруживает значащие слова (например, "how long" ("сколько времени"), "take" ("займет"), "ISU", "Seoul Station"), связанные с обработкой информации о схеме метрополитена в заданной области для анализа контекста и содержания речевой команды. На основании проанализированной информации контроллер 180 определяет, что смысл речевой команды состоит в том, что выдан запрос на получение информации о времени поездки между двумя станциями метрополитена: станцией ISU и станцией Seoul Station.Then, the controller 180 detects significant words (for example, “ how long ”, “ take ”, “ ISU ”, “ Seoul Station ”) associated with processing information about the subway in a given area for analysis of the context and content of the speech team. Based on the analyzed information, the controller 180 determines that the meaning of the voice command is that a request has been issued to receive information about the travel time between two metro stations: ISU station and Seoul Station.

Кроме того, когда контроллер 180 выясняет смысл речевых команд, контроллер 180 может сначала выдать запрос на подтверждение пользователем того, является ли предполагаемый смысл речевой команды точным. Затем контроллер 180 отображает две станции на схеме метрополитена вместе с данными о расстоянии (или времени) между двумя станциями, о количестве остановок между двумя станциями и т.д. и выводит речевое сообщение 627, информирующее пользователя о результате, которое показано на экране 625 дисплея на Фиг. 13. Кроме того, как описано выше, если пользователь не отвечает на просьбу о подтверждении в течение конкретного промежутка времени, то контроллер 180 может интерпретировать это как положительный ответ и предоставить результаты запрошенной услуги.In addition, when the controller 180 determines the meaning of the speech commands, the controller 180 may first issue a request for confirmation by the user whether the intended meaning of the speech command is accurate. Then, the controller 180 displays two stations in the metro scheme along with data on the distance (or time) between the two stations, the number of stops between the two stations, etc. and outputs a voice message 627 informing the user of the result, which is shown on the display screen 625 in FIG. 13. In addition, as described above, if the user does not respond to the request for confirmation within a specific period of time, then the controller 180 can interpret this as a positive response and provide the results of the requested service.

На следующем чертеже Фиг. 14 в общем виде показан способ воспроизведения файлов с мультимедийным содержимым посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Кроме того, в приведенном ниже описании предполагают, что сигнал управления активацией уже введен пользователем, и что контроллер 180 начинает активацию функции распознавания речи. Также предполагают, что контроллером 180 было реализовано конкретное меню, связанное с меню "воспроизведение мультимедийной информации", путем приема введенной речевой команды или данных о манипуляциях пользователя с использованием других устройств ввода.In the following drawing, FIG. 14 is a perspective view illustrating a method for reproducing multimedia files by means of a voice command in a mobile communication terminal according to one embodiment of the present invention. In addition, the description below assumes that the activation control signal has already been entered by the user, and that the controller 180 starts activating the speech recognition function. It is also contemplated that the controller 180 has implemented a specific menu associated with the “multimedia information playback” menu by receiving an inputted voice command or user manipulation data using other input devices.

То есть, как показано на экране 631 дисплея, контроллер 180 отображает на дисплее перечень песен, которые пользователь может выбрать для воспроизведения. Таким образом, в настоящем изобретении поиск желательного для пользователя файла с мультимедийным содержимым для его воспроизведения может быть осуществлен непосредственно посредством речевой команды. В более подробном изложении, после того, как реализовано меню "воспроизведение мультимедийной информации", контроллер 180 задает область базы данных, используемой в качестве опорной информации для распознавания речевой команды, в виде области, связанной с реализованными меню.That is, as shown on the display screen 631, the controller 180 displays on the display a list of songs that the user can select to play. Thus, in the present invention, the search for the desired file for the user with multimedia content to play it can be carried out directly through a speech command. In a more detailed description, after the "multimedia information playback" menu is implemented, the controller 180 sets the area of the database used as reference information for recognizing the speech command in the form of an area associated with the implemented menus.

Как изложено выше, эта область содержит информацию, связанную с подменю, содержащимися в меню "воспроизведение мультимедийной информации", информацию, связанную с подкомандами, которые могут быть выполнены из меню "воспроизведение мультимедийной информации", или информацию, связанную с файлом с мультимедийным содержимым (например, имена файлов, время воспроизведения, сведения о владельце авторских прав и т.д.).As described above, this area contains information related to the submenus contained in the “multimedia information playback” menu, information associated with subcommands that can be executed from the “multimedia information playback” menu, or information associated with the multimedia content file ( for example, file names, playing time, copyright holder information, etc.).

Кроме того, контроллер 180, получив введенную речевую команду или данные о манипуляциях пользователя с использованием других устройств ввода, может отображать на дисплее список файлов с мультимедийным содержимым. В примере, показанном на Фиг. 14, пользователь вводит свою речевую команду на естественном языке (например, "Let's play this song" ("Давайте проиграем эту песню") в том состоянии, когда из списка файлов выбран один файл, что показано на экране 631 дисплея.In addition, the controller 180, having received an inputted speech command or user manipulation data using other input devices, may display a list of multimedia files. In the example shown in FIG. 14, the user enters his speech command in natural language (for example, “ Let's play this song”) in the state when one file is selected from the file list, which is shown on the display screen 631.

После того, как введена речевая команда, контроллер 180 обнаруживает значащие слова (например, "play" ("проиграем"), "this song" ("эту песню")), связанные с подменю или с подкомандами для обработки выбранного меню в пределах заданной области. Кроме того, контроллер 180 выясняет смысл речевой команды путем анализа обнаруженных слов, всего контекста и содержания речевой команды.After the speech command is entered, the controller 180 detects significant words (eg, “ play ”, “ this song ”) associated with a submenu or subcommands to process the selected menu within a given area. In addition, the controller 180 determines the meaning of the speech command by analyzing the detected words, the entire context and content of the speech command.

После того, как выяснен смысл речевой команды, контроллер 180 получает подтверждение пользователя о том, является ли выясненный смысл речевой команды точным. Например, как показано на Фиг, 13, контроллер 180 выводит на экран дисплея всплывающее окно 633, прося пользователя сказать "да" или "нет" относительно воспроизведения выбранной песни. Контроллер также может вывести речевое сообщение 632, спрашивая пользователя о том, является ли песня номер 2 той песней, которую следует проиграть. Затем пользователь может сказать "да", и контроллер 180 выводит выбранную песню, что показано на экране 634 дисплея.Once the meaning of the speech command has been clarified, the controller 180 receives user confirmation of whether the clarified meaning of the speech command is accurate. For example, as shown in FIG. 13, the controller 180 displays a pop-up window 633, asking the user to say “yes” or “no” regarding the playback of the selected song. The controller can also output a voice message 632, asking the user whether the song number 2 is the song to be played. The user can then say yes, and the controller 180 outputs the selected song, as shown on the display screen 634.

В альтернативном варианте контроллер 180 может автоматически проиграть выбранную песню, не прося пользователя подтвердить выбор. Пользователь также может использовать соответствующие пункты меню для установки по умолчанию того варианта, когда контроллер 180 запрашивает подтверждение о выбранной задаче или не запрашивает это подтверждение. Кроме того, если ответ от пользователя не поступил, то контроллер 180 может автоматически выполнять речевую команду, смысл которой был выяснен, расценивая такую реакцию как положительный ответ.Alternatively, the controller 180 may automatically play the selected song without asking the user to confirm the selection. The user can also use the appropriate menu items to set the default when the controller 180 asks for confirmation of the selected task or does not request this confirmation. In addition, if no response was received from the user, then the controller 180 can automatically execute a speech command, the meaning of which was clarified, regarding such a reaction as a positive response.

Следовательно, в этом варианте осуществления изобретения выбирают файл, подлежащий воспроизведению, и команду воспроизведения выбранного файла вводят посредством речевой команды. Однако когда пользователю уже известно имя файла, то имя файла может быть непосредственно введено из меню более высокого уровня посредством речевой команды.Therefore, in this embodiment of the invention, the file to be reproduced is selected, and the playback command of the selected file is entered through the speech command. However, when the user already knows the file name, the file name can be directly entered from a higher level menu by means of a speech command.

На следующем чертеже Фиг. 15 в общем виде показан способ передачи сообщения электронной почты или текстового сообщения посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. И вновь, в описании этого варианта осуществления изобретения предполагают, что сигнал управления активацией уже был введен, что контроллер 180 начал активацию функции распознавания речи, и что контроллер 180, получив введенную речевую команду или данные о манипуляциях пользователя с использованием других устройств ввода, реализовал конкретное меню (например, меню "передача/прием почты/сообщений").In the following drawing, FIG. 15 is a perspective view illustrating a method for transmitting an e-mail or text message by means of a voice command in a mobile communication terminal according to one embodiment of the present invention. Again, in the description of this embodiment of the invention, it is assumed that the activation control signal has already been entered, that the controller 180 has started the activation of the speech recognition function, and that the controller 180, having received the entered speech command or user manipulation data using other input devices, has implemented a specific menu (for example, the menu "send / receive mail / messages").

В более подробном изложении, после того, как реализовано меню "передача/прием почты (или сообщений)", контроллер 180 задает область базы данных, используемую в качестве опорной информации для распознавания речевой команды, в виде области, связанной с реализованными меню. Эта область содержит информацию, связанную с подменю, содержащимися в меню "передача/прием почты/сообщений", информацию, связанную с подкомандами, которые могут быть выполнены из меню "передача/прием почты/сообщений", информацию, связанную с переданными/принятыми почтовыми письмами/сообщениями (например, о передатчике, о приемнике, о времени передачи/приема, о заголовке и т.д.).In more detail, after the "send / receive mail (or message)" menu is implemented, the controller 180 sets the database area used as reference information for recognizing the speech command in the form of an area associated with the implemented menu. This area contains information related to the submenus contained in the "send / receive mail / messages" menu, information related to subcommands that can be executed from the "send / receive mail / messages" menu, information related to sent / received mail letters / messages (for example, about the transmitter, about the receiver, about the transmission / reception time, about the header, etc.).

Контроллер 180, получив введенную речевую команду или данные о манипуляциях пользователя с использованием других устройств ввода, также отображает на дисплее перечень переданной/принятой почты (переданных/принятых сообщений). Как показано на экране 641 дисплея, пользователь вводит речевую команду "I want to reply" ("я хочу послать ответ"). Затем контроллер 180 отображает на дисплее принятые сообщения, на которые пользователь может ответить, что показано на экране 645 дисплея. В этом примере пользователь использует свой естественный язык (например, "Reply to this message" ("Ответить на это сообщение")) в том состоянии, когда одно почтовое письмо/сообщение выбрано из перечня почтовой корреспонденции/сообщений, что показано на экране 645 дисплея.The controller 180, having received an entered speech command or data about user manipulations using other input devices, also displays a list of transmitted / received mail (transmitted / received messages) on the display. As shown on the screen 641 of the display, the user enters the speech command " I want to reply "("I want to send a response"). The controller 180 then displays received messages that the user can respond to, as shown on the display screen 645. In this example, the user uses his natural language (for example, " Reply to this message ") in the state when one mail letter / message is selected from the list of mail correspondence / messages, which is shown on screen 645 of the display .

Кроме того, после того, как введена речевая команда, контроллер 180 обнаруживает значащие слова (например, "reply" ("ответить"), "this message" ("это сообщение")), связанные с обработкой ответа для выбранного почтового письма/сообщения в пределах области. Затем контроллер 180 выясняет смысл речевой команды (реализация меню "ответ на почту/сообщение") путем анализа обнаруженных слов и всего контекста и содержания речевой команды.In addition, after the speech command is entered, the controller 180 detects significant words (eg, “ reply ”, “ this message ”) associated with processing the response for the selected mail message / message within the area. Then, the controller 180 ascertains the meaning of the speech command (implementation of the "reply to mail / message" menu) by analyzing the detected words and the entire context and content of the speech command.

После того, как выяснен смысл речевой команды, контроллер 180 может получить подтверждение пользователя о том, является ли выясненный смысл речевой команды точным. Например, для получения подтверждения от пользователя может быть выведено речевое сообщение 642, или может быть выведено текстовое сообщение 643. Когда выведено сообщение для получения подтверждения от пользователя, пользователь может ответить в речевой форме или посредством иных устройств ввода. При отсутствии ответа от пользователя контроллер 180 может автоматически выполнить функции, соответствующие выясненному смыслу, расценивая эту реакцию как положительный ответ. Затем, когда реализовано меню "ответ на почту/сообщение", контроллер 180 автоматически вводит адрес/телефонный номер выбранного адресата в окне 644 записи почтового письма/сообщения.After the meaning of the speech command has been clarified, the controller 180 may receive user confirmation of whether the clarified meaning of the speech command is accurate. For example, to receive confirmation from the user, a voice message 642 may be output, or a text message 643 may be output. When a message is displayed to receive confirmation from the user, the user can respond in voice form or through other input devices. In the absence of a response from the user, the controller 180 can automatically perform the functions corresponding to the clarified meaning, regarding this reaction as a positive response. Then, when the "reply to mail / message" menu is implemented, the controller 180 automatically enters the address / telephone number of the selected recipient in the mail message / message recording window 644.

Следовательно, в этом варианте осуществления изобретения сначала выбирают почтовое письмо/сообщение, на которое нужно ответить, и команду "ответить" для выбранной почтового письма/сообщения вводят с использованием речевой команды. Однако когда пользователю известна информация об адресате, ответ на почтовое письмо/сообщение, посылаемый адресату, может быть непосредственно введен посредством речевой команды.Therefore, in this embodiment of the invention, the mail letter / message to be answered is first selected, and the reply command for the selected mail letter / message is entered using the voice command. However, when the user knows the recipient information, the response to the mail letter / message sent to the recipient can be directly entered by means of a speech command.

Кроме того, вариант осуществления изобретения, показанный на Фиг. 15 может быть видоизменен таким образом, чтобы он соответствовал передаче текстовых сообщений. В более подробном изложении, контроллер 180 содержит программное обеспечение для преобразования речи пользователя в текст, поэтому пользователь может сказать в терминал то, что он или она хочет сказать, и контроллер 180 выполняет преобразование введенной речи в текстовое сообщение. Контроллер также 180 может отображать на дисплее преобразованный текст для пользователя, поэтому пользователь может подтвердить, что это преобразование является приемлемым. Затем пользователь может выдать запрос, чтобы терминал послал текстовое сообщение желательному абоненту.In addition, the embodiment of the invention shown in FIG. 15 may be modified to suit text messaging. In more detail, the controller 180 contains software for converting the user's speech into text, so the user can tell the terminal what he or she wants to say, and the controller 180 converts the inputted speech into a text message. The controller also 180 can display the converted text for the user on the display, so the user can confirm that the conversion is acceptable. The user can then request that the terminal send a text message to the desired subscriber.

Этот видоизмененный вариант осуществления изобретения является особенно целесообразным потому, что ввод текстового сообщения вручную является очень трудоемким и утомительным процессом. Многие пользователи желают послать текстовое сообщение вместо того, чтобы звонить человеку по телефону, по нескольким различным причинами, но не хотят проходить через трудоемкий процесс выбора множества кнопок вручную для того, чтобы послать одно текстовое сообщение. Видоизмененный вариант осуществления настоящего изобретения предоставляет пользователю возможность вводить желательную текстовую информацию с использованием его голоса, а затем посылать текстовое сообщение желательному абоненту.This modified embodiment of the invention is particularly useful because manually entering a text message is a very laborious and tedious process. Many users want to send a text message instead of calling a person on the phone for several different reasons, but they do not want to go through the time-consuming process of selecting multiple buttons manually in order to send a single text message. A modified embodiment of the present invention enables the user to enter desired text information using his voice, and then send a text message to the desired subscriber.

На Фиг.16 в общем виде показан способ выполнения телефонного вызова посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Подобно вышеописанным вариантам осуществления изобретения в этом варианте осуществления изобретения также предполагают, что сигнал управления активацией уже был введен пользователем, что контроллер 180 уже активировал функцию распознавания речи, и что контроллер 180 уже реализовал конкретное меню (например, телефонную книгу или меню с перечнем последних принятых телефонных вызовов), связанное с телефонными вызовами, путем приема введенной речевой команды или данных о манипуляциях пользователя с использованием других устройств ввода.On Fig in General terms shows a method of making a phone call through a voice command in a mobile communication terminal according to one embodiment of the present invention. Similar to the above-described embodiments of the invention, it is also assumed in this embodiment that the activation control signal has already been entered by the user, that the controller 180 has already activated the speech recognition function, and that the controller 180 has already implemented a specific menu (for example, a phone book or a menu with a list of the last received phone calls) associated with phone calls by receiving an entered voice command or data about user manipulations using other input devices.

После того как было реализовано меню, связанное с телефонными вызовами, контроллер 180 задает область базы данных, используемую в качестве опорной информации для распознавания речевой команды, в виде области, связанной с телефонными вызовами. Кроме того, эта область содержит информацию, связанную с исходящими телефонными вызовами, с входящими телефонными вызовами, с пропущенными телефонными вызовами и т.д., и информацию, связанную с каждым телефоном (например, о времени исходящего вызова, о времени входящего вызова, о передатчике, о приемнике, о длительности телефонного звонка, о частоте телефонных вызовов и т.д.).After the menu related to telephone calls has been implemented, the controller 180 sets the area of the database used as reference information for recognizing a voice command in the form of an area associated with telephone calls. In addition, this area contains information related to outgoing phone calls, incoming phone calls, missed phone calls, etc., and information associated with each phone (for example, about the time of an outgoing call, about the time of an incoming call, about transmitter, about the receiver, about the duration of a phone call, about the frequency of phone calls, etc.).

Кроме того, контроллер 180, получив введенную речевую команду или данные о манипуляциях пользователя с использованием других устройств ввода, отображает на дисплее перечень телефонных звонков. То есть, пользователь использует его или ее естественный язык для ввода речевой команды (например, "I want to see the received phone calls" ("Я хочу посмотреть перечень принятых телефонных вызовов")), как показано на экране 711 дисплея.In addition, the controller 180, having received an entered speech command or data about user manipulations using other input devices, displays a list of telephone calls. That is, the user uses his or her natural language to enter a voice command (for example, " I want to see the received phone calls "), as shown on the display screen 711.

После того, как введена речевая команда, контроллер 180 обнаруживает значащие слова (например, "see" ("посмотреть"), "received" ("принятых"), "phone" ("телефонных"), "calls" ("вызовов")), связанные с телефонными вызовами, в области, и выясняет, что речевая команда имеет следующий смысл: "вывести данные о принятых телефонных вызовах", путем анализа обнаруженных слов и всего контекста и содержания речевой команды. После того, как выяснен смысл речевой команды, контроллер 180 выводит перечень принятых телефонных вызовов, показанный на экране 712 дисплея.After the speech command is entered, the controller 180 detects meaningful words (e.g., " see ", " received "("received")," phone "("telephone"),"calls"("calls" )) associated with phone calls in the area and finds out that the voice command has the following meaning: "display data on received phone calls" by analyzing the words found and the entire context and content of the speech command. After the meaning of the speech command has been clarified, the controller 180 displays a list of received phone calls shown on the display screen 712.

Кроме того, затем пользователь вводит речевую команду, "Call this person" ("позвонить этому человеку") в том состоянии, когда элемент данных выбран из выведенного перечня. В результате этого контроллер 180 выясняет, что речевая команда имеет следующий смысл: "позвонить противоположному абоненту выбранного принятого вызова". Затем контроллер 180 получает подтверждение пользователя о том, является ли выясненный смысл речевой команды точным. То есть, контроллер 180 может вывести речевое сообщение 713 или текстовое сообщение 715.In addition, the user then enters a voice command, “ Call this person ”, in the state when the data item is selected from the displayed list. As a result of this, the controller 180 finds out that the speech command has the following meaning: "call the opposite party of the selected received call." Then, the controller 180 receives user confirmation of whether the clarified meaning of the speech command is accurate. That is, the controller 180 may output a voice message 713 or a text message 715.

Пользователь также может ответить голосом или при помощи других устройств ввода. Как упомянуто выше, если ответ от пользователя не поступил, то контроллер 180 может автоматически выполнить функцию, соответствующую выясненному смыслу, расценивая такую реакцию как положительный ответ. Контроллер 180 также выводит сообщение 714, указывающее, что устанавливается телефонное соединение.The user can also respond with voice or other input devices. As mentioned above, if no response has been received from the user, then the controller 180 can automatically perform a function corresponding to the clarified meaning, regarding such a reaction as a positive response. The controller 180 also displays a message 714 indicating that a telephone connection is being established.

Таким образом, в этом варианте осуществления изобретения из перечня телефонных звонков выбирают адресата и вводят команду "позвонить выбранному адресату" посредством речевой команды. Однако когда пользователь уже знает информацию об адресате, то телефонный звонок этому человеку может быть непосредственно выполнен посредством речевой команды.Thus, in this embodiment of the invention, a destination is selected from the list of telephone calls and a “call to selected destination” command is entered using a voice command. However, when the user already knows the information about the addressee, a phone call to this person can be directly made through a speech command.

На следующем чертеже Фиг.17 в общем виде показан способ использования информации из телефонной книги посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. В описании этого способа сделаны те же самые предположения, которые были описаны выше в других вариантах осуществления изобретения. То есть, предполагают, что после ввода сигнала управления активацией контроллер 180 начинает активацию функции распознавания речи, и что контроллер 180, получив введенную речевую команду или данные о манипуляциях пользователя с использованием других устройств ввода, выбирает или реализует конкретное меню (например, меню "телефонная книга"), показанное на экране 720 дисплея.The following drawing Fig.17 in General terms shows a method of using information from the phone book through a voice command in a mobile communication terminal according to one embodiment of the present invention. In the description of this method, the same assumptions are made that were described above in other embodiments of the invention. That is, it is assumed that after the activation control signal is input, the controller 180 starts the activation of the speech recognition function, and that the controller 180, having received an entered speech command or user manipulation data using other input devices, selects or implements a specific menu (for example, the telephone menu book ") shown on the display screen 720.

После того, как реализовано меню "телефонная книга", контроллер 180 задает область базы данных, используемую в качестве опорной информации для распознавания речевой команды, в виде области, связанной с подменю, содержащимися в меню "телефонная книга", или с подкомандами, которые могут быть выполнены из меню "телефонная книга". Кроме того, эту область задают таким образом, чтобы повысить процент правильно распознанных слов, но это может не являться обязательным условием.After the phonebook menu is implemented, the controller 180 sets the database area used as reference information for recognizing the voice command in the form of an area associated with submenus contained in the phonebook menu or with subcommands that can be executed from the "phone book" menu. In addition, this area is set in such a way as to increase the percentage of correctly recognized words, but this may not be a prerequisite.

Кроме того, пользователь вводит речевую команду на его естественном языке (например, "Edit James" ("отредактировать данные о Джеймсе"), "Add James" ("добавить данные о Джеймсе"), "Find James" ("найти данные о Джеймсе"), "Call James" ("позвонить Джеймсу"), "I want to send a Message to James" ("я хочу послать сообщение Джеймсу") в режиме ожидания или в том состоянии, когда произведен выбор меню, связанного с телефонной книгой. После того, как введена речевая команда, контроллер 180 обнаруживает значащие слова (например, "Edit" ("отредактировать"), "Add" ("добавить"), "Find" ("найти"), "Call" ("позвонить"), "Send Message" ("послать сообщение"), "James" ("Джеймс")), связанные с телефонными вызовами, в пределах области, и выясняет смысл каждой из речевых команд путем анализа обнаруженных слов и всего контекста и содержания речевых команд.In addition, the user enters a speech command in his natural language (for example, " Edit James "("edit James data"), " Add James "("add James data"), " Find James "("find James data ")," Call James "," I want to send a Message to James "in standby mode or when the menu associated with the phone book is selected After the speech command is entered, the controller 180 detects meaningful words (for example, “ Edit ”, “ Add ”, “ Find ”, “ Call ” it ")," Send Message "( " send a message ")," James "(" James ")) associated with telephone calls within the region, and finds out the meaning of each of the voice commands by analyzing the detected words and the entire context and content speech commands.

После того, как выяснен смысл каждой из речевых команд, контроллер 180 выполняет функции или реализует меню, соответствующие надлежащим речевым командам, которые показаны на экранах 722-724 дисплея. Кроме того, перед их выполнением контроллер 180 может получить подтверждение пользователя о том, является ли выясненные смыслы речевых команд точными. Как изложено выше, для получения подтверждения от пользователя может быть выведено речевое сообщение или текстовое сообщение.After the meaning of each of the speech commands has been clarified, the controller 180 performs functions or implements menus corresponding to the appropriate speech commands, which are shown on the display screens 722-724. In addition, before executing them, the controller 180 may receive user confirmation of whether the clarified meanings of the speech commands are accurate. As described above, a voice message or text message can be displayed to receive confirmation from the user.

Кроме того, когда выведено сообщение для получения подтверждения от пользователя, то пользователь может ответить голосом или посредством других устройств ввода. Если ответ от пользователя не поступил, то контроллер 180 может автоматически выполнить функции, соответствующие выясненным смыслам, расценивая такую реакцию как положительный ответ.In addition, when a message is displayed to receive confirmation from the user, the user can respond by voice or other input devices. If no response has been received from the user, then the controller 180 can automatically perform the functions corresponding to the clarified meanings, regarding such a reaction as a positive response.

На следующем чертеже Фиг. 18 в общем виде показан способ изменения выводимого на экран фонового изображения посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. И вновь, в описании этого способа предполагают, что после того, как был введен сигнал управления активацией, контроллер 180, получив введенную речевую команду или данные о манипуляциях пользователя с использованием других устройств ввода, начинает активацию функции распознавания речи и реализует конкретное меню (например, меню "фотоальбом").In the following drawing, FIG. 18 is a perspective view illustrating a method for changing a displayed background image by a voice command in a mobile communication terminal according to one embodiment of the present invention. Again, in the description of this method, it is assumed that after the activation control signal has been input, the controller 180, having received the entered speech command or user manipulation data using other input devices, starts the activation of the speech recognition function and implements a specific menu (for example, menu "photo album").

Меню "фотоальбом" может быть реализовано путем ввода речевой команды или посредством многоэтапного подменю с использованием других устройств ввода. К тому же, меню "фотоальбом" может быть непосредственно реализовано посредством речевой команды на естественном языке (которой является, например, следующая команда: "I want to see my photo album" ("я хочу увидеть мой фотоальбом")), что показано на экране 731 дисплея. В соответствии с выясненным смыслом речевой команды, контроллер 180 выводит перечень фотографий путем реализации меню "фотоальбом", показанного на экране 732 дисплея. Затем контроллер 180 получает одну фотографию, выбранную из выведенного перечня фотографий, содержащихся в фотоальбоме.The menu "photo album" can be implemented by entering a speech command or through a multi-stage submenu using other input devices. In addition, the menu "photo album" can be directly implemented by means of a speech command in natural language (which, for example, is the following command: " I want to see my photo album "), as shown in display screen 731. In accordance with the clarified meaning of the speech command, the controller 180 displays a list of photos by implementing the menu "photo album" shown on the screen 732 of the display. Then, the controller 180 receives one photograph selected from the displayed list of photographs contained in the photo album.

Если в этом состоянии пользователем введена речевая команда (например, "Change my wall paper using this picture" ("измените у меня фоновое изображение с использованием этого изображения"), то контроллер 180 обнаруживает значащую информацию (например, слова "change" ("измените"), "wall paper" ("фоновое изображение")), связанную с подменю или с подкомандами реализованного меню. Затем контроллер 180 выясняет смысл речевой команды путем анализа обнаруженных слов и всего контекста и содержания речевой команды. То есть, контроллер 180 выясняет, что смыслом речевой команды является следующий: "заменить выведенное на экран фоновое изображение выбранной фотографией".If a voice command is entered by the user in this state (for example, “Change my wall paper using this picture”), then the controller 180 detects meaningful information (for example, the words “ change ” (“change ")," wall paper "(" background image ")) associated with the submenu or subcommands of the implemented menu. Then, the controller 180 determines the meaning of the speech command by analyzing the detected words and the entire context and content of the speech command. That is, the controller 180 finds out that the meaning of the speech command is Xia is: "Replace the displayed background image of the selected photo."

После того, как выяснен смысл речевой команды, контроллер 180 выводит на экран дисплея фоновое изображение, соответствующее выбранной фотографии, и получает подтверждение пользователя о том, является ли выясненный смысл речевой команды точным. При этом для получения подтверждения от пользователя может быть выведено речевое сообщение 733, или может быть выведено текстовое сообщение 734. Речевая команда, смысл которой выяснен, также может быть выполнена непосредственно без подтверждения пользователя в соответствии с высоким процентом правильно распознанных слов или с заранее заданным меню "установка режима работы".After the meaning of the speech command has been clarified, the controller 180 displays a background image corresponding to the selected photo on the display screen and receives user confirmation of whether the clarified meaning of the speech command is accurate. In this case, to receive confirmation from the user, a voice message 733 can be displayed, or a text message 734 can be displayed. A speech command, the meaning of which is clarified, can also be executed directly without user confirmation in accordance with a high percentage of correctly recognized words or with a predefined menu "setting the operating mode".

Когда выведено сообщение для получения подтверждения от пользователя, пользователь может ответить голосом или посредством других устройства ввода. Если ответ от пользователя не поступил, то контроллер 180 может автоматически выполнить функцию, соответствующую выясненному смыслу речевой команды, расценивая такую реакцию как положительный ответ.When a message is displayed to receive confirmation from the user, the user can respond with voice or other input devices. If no response has been received from the user, then the controller 180 may automatically perform a function corresponding to the clarified meaning of the speech command, regarding such a response as a positive response.

Для изменения фонового изображения меню "фотоальбом" не обязательно должно быть реализовано вначале, как показано в данном варианте осуществления изобретения. Вместо этого поиск желательной фотографии пользователя для изменения фонового изображения может быть произведен после того, как реализовано меню "фоновое изображение".To change the background image, the “photo album” menu does not have to be implemented first, as shown in this embodiment of the invention. Instead, a search for a desired user photo to change the background image can be performed after the background image menu is implemented.

На Фиг.19 в общем виде показан способ воспроизведения файлов с мультимедийным содержимым посредством речевой команды в терминале мобильной связи согласно одному из вариантов осуществления настоящего изобретения. Подобно описанным выше вариантам осуществления изобретения в этом описании предполагают, что после того, как введен сигнал управления активацией, контроллер 180, получив введенную речевую команду или данные о манипуляциях пользователя с использованием других устройств ввода, начинает активацию функции распознавания речи и реализует конкретное меню (например, меню "воспроизведение мультимедийной информации").On Fig in General terms shows a method of reproducing files with multimedia content through a speech command in a mobile communication terminal according to one of the embodiments of the present invention. Similar to the above-described embodiments of the invention, it is assumed in this description that, after the activation control signal is input, the controller 180, having received the inputted speech command or user manipulation data using other input devices, starts activating the speech recognition function and implements a specific menu (e.g. , menu "playback of multimedia information").

Для воспроизведения файла с мультимедийным содержимым пользователем реализуют специальное меню, выбирают одно из подменю специального меню для отображения на дисплее списка файлов, и из этого списка файлов выбирают один файл для его воспроизведения. Однако в настоящем изобретении поиск желательного для пользователя файла с мультимедийным содержимым для его воспроизведения может быть осуществлен непосредственно при помощи речевой команды.To play a file with multimedia contents, the user implements a special menu, selects one of the submenus of the special menu to display a list of files, and select one file from this list of files to play it. However, in the present invention, the search for a user-desired file with multimedia content for playback can be carried out directly using a speech command.

Например, если конкретная речевая команда (например, "Move to the Beatles album" ("перейти к альбому группы Битлз") введена после того, как была активирована функция распознавания речи, то контроллер 180 выясняет смысл речевой команды путем анализа всего контекста и содержания речевой команды, что показано на экране 741 дисплея. На основании проанализированной информации контроллер 180 выполняет конкретные функции или реализует конкретные меню или отображает список файлов путем перемещения в конкретную папку с файлами, что показано на экране 742 дисплея.For example, if a specific speech command (for example, “ Move to the Beatles album ”) is entered after the speech recognition function has been activated, then the controller 180 determines the meaning of the speech command by analyzing the entire context and content of the speech commands, which is shown on the display screen 741. Based on the analyzed information, the controller 180 performs specific functions or implements specific menus or displays a list of files by moving to a specific file folder, as shown on the display screen 742.

Когда речевая команда (например, "Play this song" ("проиграйте эту песню") или "Play number 3" ("проиграйте песню номер 3")) введена после того, как из списка файлов выбран один файл, то контроллер 180 выясняет смысл речевой команды путем анализа всего контекста и содержания речевой команды. Кроме того, функции или меню, соответствующие смыслу речевой команды, могут быть выполнены непосредственно в соответствии с высоким процентом правильно распознанных слов или с заранее заданным меню "установка режима работы".When a speech command (for example, "Play this song" or "Play number 3") is entered after one file is selected from the list of files, then the controller 180 finds out the meaning speech command by analyzing the entire context and content of the speech command. In addition, functions or menus that correspond to the meaning of the speech command can be performed directly in accordance with a high percentage of correctly recognized words or with a predetermined menu "setting the operating mode".

После того, как выяснен смысл речевой команды, контроллер 180 получает подтверждение пользователя о том, является ли выясненный смысл речевой команды точным. Здесь, для получения подтверждения от пользователя может быть выведено текстовое сообщение или речевое сообщение 743. Когда выведено сообщение для получения подтверждения от пользователя, пользователь может ответить на него голосом или посредством других устройств ввода. Если ответ от пользователя не поступил, то контроллер 180 может автоматически выполнить функцию выясненной речевой команды, расценивая такую реакцию как положительный ответ. Затем контроллер 18 выполняет или воспроизводит выбранную песню, что показано на экране 744 дисплея.Once the meaning of the speech command has been clarified, the controller 180 receives user confirmation of whether the clarified meaning of the speech command is accurate. Here, to receive confirmation from the user, a text message or voice message 743 can be displayed. When a message is displayed to receive confirmation from the user, the user can respond to it with voice or other input devices. If no response has been received from the user, then the controller 180 may automatically perform the function of the clarified speech command, regarding such a reaction as a positive response. Then, the controller 18 executes or plays the selected song, as shown on the display screen 744.

Таким образом, в этом варианте осуществления изобретения выбирают файл, подлежащий воспроизведению, и вводят команду воспроизведения выбранного файла посредством речевой команды. Однако когда пользователю известно имя файла, то это имя файла может быть непосредственно введено из меню более высокого уровня голосом для его воспроизведения.Thus, in this embodiment of the invention, the file to be played is selected, and a command to play the selected file by the voice command is entered. However, when the user knows the file name, this file name can be directly entered from a higher level menu by voice to play it.

Следовательно, согласно вариантам осуществления настоящего изобретения, в том состоянии, когда активирована функция распознавания речи, введенную речевую команду преобразуют в особый вид, и контекст и ее содержание сравнивают с базой данных, заданной в области, используемой в качестве опорной информации. Кроме того, результирующее значение, соответствующее выясненному смыслу речевой команды, выводят в конкретный компонент терминала мобильной связи.Therefore, according to embodiments of the present invention, in a state where the speech recognition function is activated, the inputted speech command is converted into a special form, and the context and its contents are compared with the database defined in the area used as reference information. In addition, the resulting value corresponding to the clarified meaning of the speech command is output to a specific component of the mobile communication terminal.

Терминал мобильной связи из настоящего изобретения может управлять меню, связанными с конкретными его функциями или услугами, путем выяснения смысла введенной речевой команды на основании контекста и содержания. Кроме того, терминал мобильной связи из настоящего изобретения может обеспечивать увеличение процента правильно распознанных слов при распознавании речи путем задания области для распознавания речи в виде области, связанной с конкретным меню или с конкретными услугами, в соответствии с его рабочим состоянием или режимом работы.The mobile communication terminal of the present invention can manage menus associated with its specific functions or services by ascertaining the meaning of the entered speech command based on context and content. In addition, the mobile communication terminal of the present invention can provide an increase in the percentage of correctly recognized words in speech recognition by defining an area for speech recognition in the form of an area associated with a particular menu or with specific services, in accordance with its operating state or mode of operation.

К тому же, терминал мобильной связи из настоящего изобретения может одновременно выбирать или реализовывать меню, связанные с конкретными функциями или услугами, путем применения одного или большего количества имеющихся в нем интерфейсов пользователя (ИП), даже в то время, когда функция распознавания речи активирована, для обнаружения манипуляций пользователя. Кроме того, терминал мобильной связи из настоящего изобретения может управлять меню, связанными с конкретными функциями или услугами, посредством речевой команды вне зависимости от квалификации пользователя, путем предоставления справочной информации о вводе речевой команды в соответствии с рабочим состоянием или режимом работы терминала.In addition, the mobile communication terminal of the present invention can simultaneously select or implement menus associated with specific functions or services by applying one or more of its user interfaces (UIs), even while the speech recognition function is activated, to detect user manipulation. In addition, the mobile communication terminal of the present invention can manage the menus associated with specific functions or services through a voice command, regardless of the user's qualifications, by providing reference information about entering a voice command in accordance with the operating state or mode of operation of the terminal.

Кроме того, множество областей может содержать, по меньшей мере, две области из области, связанной с электронной почтой, которые соответствуют сообщениям электронной почты, посланным из терминала мобильной связи и принятым в терминале мобильной связи, область запланированных заданий, которая соответствует запланированным событиям, назначенным в терминале мобильной связи, область контактов, соответствующая сведениям о контактах, имеющимся в терминале мобильной связи, область телефонной книги, соответствующая телефонным номерам, хранящимся в терминале мобильной связи, область карт, соответствующая информации о картах, предоставляемой терминалом мобильной связи, область фотографий, соответствующая фотографиям, хранящимся в терминале мобильной связи, область сообщений, соответствующая сообщениям, посланным из терминала мобильной связи и принятым в терминале мобильной связи, область мультимедийной информации, соответствующая мультимедийным функциям, выполняемым в терминале мобильной связи, область внешних устройств, соответствующая внешним устройствам, к которым может быть подключен терминал мобильной связи, область предыстории телефонных вызовов, соответствующая телефонным вызовам, переданным из терминала мобильной связи и принятым в терминале мобильной связи, и область установочных параметров, соответствующая функциям установки параметров, выполняемым в терминале мобильной связи.In addition, the plurality of areas may comprise at least two areas from an email related area that correspond to email messages sent from the mobile communication terminal and received at the mobile communication terminal, a scheduled task area that corresponds to scheduled events assigned in the mobile communication terminal, the contact area corresponding to the contact information available in the mobile communication terminal, the phone book area corresponding to the telephone numbers, storing the area of cards corresponding to the card information provided by the mobile communication terminal, the area of photographs corresponding to the photos stored in the mobile communication terminal, the message area corresponding to messages sent from the mobile communication terminal and received in the mobile communication terminal, region multimedia information corresponding to multimedia functions performed in a mobile communication terminal, an area of external devices corresponding to external devices to which A mobile terminal may be connected, a telephone call history area corresponding to telephone calls transmitted from the mobile communication terminal and received at the mobile communication terminal, and a setting area corresponding to parameter setting functions performed in the mobile communication terminal.

Кроме того, производителем терминала мобильной связи или пользователем терминала мобильной связи может быть установлено заранее заданное пороговое значение процента правильно распознанных слов.In addition, a predetermined threshold value for the percentage of correctly recognized words can be set by the manufacturer of the mobile communication terminal or by the user of the mobile communication terminal.

Кроме того, описанные выше различные варианты осуществления изобретения могут быть реализованы в считываемом посредством компьютера носителе информации с использованием, например, компьютерного программного обеспечения, аппаратных средств или какой-либо их комбинации. Для варианта аппаратной реализации описанные выше варианты осуществления изобретения могут быть реализованы в одной или в большем количестве специализированных интегральных микросхем (ASIC), процессоров для цифровой обработки сигналов (DSP), устройств цифровой обработки сигналов (DSPD), программируемых логических устройств (PLD), программируемых пользователем вентильных матриц (FPGA), процессоров, контроллеров, микроконтроллеров, микропроцессоров, других электронных устройств, предназначенных для выполнения описанных здесь функций, или в выборочной комбинации этих устройств.In addition, the various embodiments described above can be implemented in a computer-readable storage medium using, for example, computer software, hardware, or some combination thereof. For a hardware implementation, the above-described embodiments of the invention can be implemented in one or more specialized integrated circuits (ASICs), processors for digital signal processing (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), programmable the user of gate arrays (FPGAs), processors, controllers, microcontrollers, microprocessors, other electronic devices designed to perform the functions described here, or a juxtaposed combination of these devices.

Для варианта программной реализации описанные здесь варианты осуществления изобретения могут быть реализованы посредством отдельных программных модулей, например процедур и функций, каждая из которых выполняют одну или большее количество описанных здесь функций и операций. Программные коды могут быть реализованы посредством прикладной программы, написанной на любом подходящем языке программирования, и могут быть запомнены в запоминающем устройстве (например, в запоминающем устройстве 160) и выполнены контроллером или процессором (например, контроллером 180).For a software implementation option, embodiments of the invention described herein may be implemented by separate software modules, for example, procedures and functions, each of which performs one or more of the functions and operations described herein. Software codes may be implemented by an application program written in any suitable programming language, and may be stored in a storage device (e.g., storage device 160) and executed by a controller or processor (e.g., controller 180).

Кроме того, терминал 100 мобильной связи может быть реализован во множестве различных конфигураций. Примерами таких конфигураций являются, в том числе, следующие: конструкция складывающегося типа, конструкция сдвижного типа, конструкция прямоугольного типа, конструкция поворотного типа конструкция шарнирного типа и их комбинации.In addition, the mobile communication terminal 100 may be implemented in many different configurations. Examples of such configurations are, inter alia, the following: a folding type structure, a sliding type structure, a rectangular type structure, a rotary type structure, a hinge type structure, and combinations thereof.

Для специалистов в данной области техники очевидно, что в настоящем изобретении могут быть выполнены различные его модификации и изменения, не выходя за пределы сущности или объема патентных притязаний изобретения. Следовательно, подразумевают, что настоящее изобретение охватывает собой модификации и изменения этого изобретения при условии, что они подпадают под прилагаемую формулу изобретения и ее эквиваленты.For specialists in the art it is obvious that in the present invention can be made of various modifications and changes without going beyond the essence or scope of patent claims of the invention. Therefore, it is intended that the present invention cover the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents.

Claims

1. A mobile communication terminal, comprising:
an input device configured in such a way that it is capable of receiving input data to activate a speech recognition function in a mobile communication terminal;
a storage device configured in such a way that it is capable of storing a plurality of areas related to menus and operating modes of the mobile communication terminal; and
a controller configured in such a way that it is capable of defining a specific area for recognizing voice commands from a plurality of areas stored in the memory when the speech recognition function is activated, to analyze the context and information content of the voice command in a specific area, when the voice command is entered, recognize the voice command based on the language model and the acoustic model of the specific area that is accessed, and determine at least one menu and one operating mode of the term mobile Nala, based on a specific area, the accessed and recognizes the user's speech,
however, the controller is also designed so that it is able to determine a specific area in the information related to the currently displayed menu, or information related to the submenu of a particular menu selected from these menus.

2. The mobile communications terminal of claim 1, wherein the particular menu or mode of operation is at least one of the following: a menu or a mode for working with multimedia information, a menu or contact mode, a menu or a messaging mode, a menu or a sound mode , menu or organizer mode, menu or screen mode, menu or utility mode, menu or camera mode, and menu or parameter setting mode.

3. The mobile communication terminal according to claim 1, in which the controller is also configured in such a way that determines the percentage of correctly recognized words so that a certain menu and operation mode exactly match the entered speech of the user.

4. The mobile communication terminal according to claim 3, also containing a display made in such a way that provides a visual display of information,
at the same time, the controller is also configured in such a way that it displays all the menus and operating modes of the mobile communication terminal based on the specific area that was accessed and the user's recognized speech, for which it was determined that the percentage of correctly recognized words in it exceeds in advance set threshold value.

5. The mobile communication terminal according to claim 1, in which the input device is also designed in such a way that it is able to receive a voice input command to select one of the displayed menus and operating modes, and the controller recognizes the entered voice command and displays information requesting that whether the recognized inputted speech command is accurate.

6. The mobile communication terminal according to claim 4, in which the controller is also configured in such a way that it displays all the mentioned menus and operating modes of the mobile communication terminal based on the specific area to which access is made and the user's recognized speech, where the percentage of correctly recognized words exceeds a predetermined threshold value, in order from a higher percentage of correctly recognized words to a lower percentage of correctly recognized words.

7. The mobile communication terminal according to claim 4, in which a predetermined threshold value is set by the manufacturer of the mobile communication terminal or the user of the mobile communication terminal.

8. The mobile communication terminal according to claim 4, in which the controller is also configured in such a way that it displays on the display that menu or the operating mode that corresponds to the highest percentage of correctly recognized words by adjusting at least one of the size , position, color, brightness and backlight of this menu or operating mode.

9. The mobile communication terminal according to claim 3, in which the controller is also configured in such a way that determines how many times a particular menu or a particular operating mode has been selected in the terminal, and adjusts the percentage of correctly recognized words for a particular menu or operating mode based on of this certain number of times that a particular menu or a particular operating mode was previously selected.

10. The mobile communication terminal according to claim 1, in which the input device contains at least one of the following means: 1) a touch virtual key that is touched to activate the speech recognition function, 2) a hardware key that is pressed or with which perform manipulations to activate the speech recognition function, 3) an arbitrary place on the touch screen contained in the input device that is touched to activate the speech recognition function, 4) the beat sound that is entered to activate the speech recognition function, 5) p diosignal short-range or long-range radio signal, and 6) a signal from a user with information about his gestures.

11. The mobile communication terminal according to claim 1, also containing:
the first database, configured in such a way that it stores voice information or pronunciation information used by the controller to recognize the entered speech of the user;
the second database, configured in such a way that it stores information about words, keywords or sentences used by the controller to recognize the entered speech of the user;
a third database, configured in such a way that it stores information related to the functions or menu of the mobile communication terminal; and
the fourth database, configured in such a way that it stores reference information that must be output to inform the user that the controller is trying to determine the meaning of the entered speech of the user.

12. The mobile communications terminal of claim 1, wherein the controller is also configured to output audio or video information indicating that the speech recognition function is in an activated state.

13. The mobile communications terminal of claim 1, wherein the plurality of regions comprises at least two regions from an email related area that corresponds to email messages sent from the mobile communication terminal and received at the mobile communication terminal, scheduled areas tasks corresponding to the scheduled events assigned in the mobile communication terminal, the contact area corresponding to the contact information available in the mobile communication terminal, the phone book area corresponding to the telephone numbers stored in the mobile communication terminal, the card area corresponding to the card information provided by the mobile communication terminal, the photo area corresponding to the photos stored in the mobile communication terminal, the message area corresponding to messages sent from the mobile communication terminal and received in the mobile terminal communication, the field of multimedia information corresponding to multimedia functions performed in the mobile communication terminal, the field of external devices corresponding to the external devices which can be connected to the mobile communication terminal, the history field of telephone calls the respective telephone calls transmitted from the mobile communication terminal and received in the mobile communication terminal and the area setting parameters corresponding to the parameter setting functions performed on the mobile terminal.

14. A method for controlling a mobile communication terminal, comprising the steps of:
activate the speech recognition function in the mobile communication terminal in accordance with the entered data;
determining a specific area for recognizing voice commands from a plurality of areas stored in the storage device when the speech recognition function is activated;
accessing a specific area when a voice command is entered;
recognize the entered speech command based on the language model and
an acoustic model of a specific area that is accessed by analyzing the context and information content of the speech command in a given specific area; and
at least one menu and an operating mode of the mobile communication terminal are derived based on the specific area to which the access is made and the recognized speech of the user,
however, a specific area includes information related to the currently displayed menu, or information related to a submenu of a particular menu selected from these menus.

15. The method according to 14, in which a particular menu or mode of operation is at least one of the following: a menu or a mode for working with multimedia information, a menu or a mode of contacts, a menu or a mode of messaging, a menu or a mode of sounds, a menu or organizer mode, menu or screen mode, menu or utility mode, menu or camera mode, and menu or parameter setting mode.

16. The method according to 14, also containing stages in which:
displaying help information related to a submenu of a particular menu in the form of at least one of: a voice message displayed on a window screen and a tooltip when a particular menu is selected by means of a speech command;
determining at least one menu and one operating mode of the mobile communication terminal based on the specific area to which access is made and the recognized speech of the user; and
determine the percentage of correctly recognized words so that this particular menu and this particular mode of operation exactly match the input speech.

17. The method according to clause 16, also containing a stage on which:
display on the display screen of the mobile communication terminal all the menus and operating modes of the mobile communication terminal based on the specific area to which access is made and the recognized speech of the user, for which it was determined that the percentage of correctly recognized words in it exceeds a predetermined threshold value.

18. The method according to 17, also containing stages in which:
accept a voice input command to select one of the displayed menus and operating modes;
recognize the entered speech command; and
outputting information requesting whether the recognized inputted voice command is accurate.

19. The method according to 17, also containing a stage in which:
all the mentioned menus and operating modes of the mobile communication terminal are displayed on the basis of the particular area to which the access is made and the user's recognized speech, where the percentage of correctly recognized words exceeds a predetermined threshold value, in order from a higher percentage of correctly recognized words to more low percentage of correctly recognized words.

20. The method of claim 17, wherein the predetermined threshold value is set by the manufacturer of the mobile communication terminal or by the user of the mobile communication terminal.

21. The method according to 17, also containing a stage in which: the menu or the operating mode that corresponds to the highest percentage of correctly recognized words is displayed differently by adjusting at least one of the size, position, color , brightness and backlight of this menu or operating mode.

22. The method according to clause 16, also containing stages in which:
determine how many times a particular menu or a particular operating mode in the terminal was previously selected, and correct the percentage of correctly recognized words for a particular menu or operating mode based on that specific number of times that a particular menu or a specific operating mode was previously selected.

23. The method of claim 14, wherein the receiving step comprises applying at least one of the following means: 1) a touch virtual key that is touched to activate the speech recognition function, 2) a hardware key that is pressed or made manipulations to activate the speech recognition function, 3) an arbitrary place on the touch screen contained in the input device that is touched to activate the speech recognition function, 4) the shock sound that is entered to activate the speech recognition function, 5) radio signal ala short-range or long-range radio signal, and 6) signal from the user with information about his gestures.

24. The method of claim 14, further comprising the steps of:
storing speech information or pronunciation information in the first database used to recognize the inputted speech of the user;
store in the second database information about words, keywords or sentences used to recognize the entered speech of the user;
storing in a third database information related to the functions or menu of the mobile communication terminal; and save in the fourth database reference information that is to be output to inform the user that at the current time determine the meaning of the entered speech of the user.

25. The method of claim 14, further comprising: outputting audio or video information indicating that the speech recognition function is in an activated state.

26. The method of claim 14, wherein the plurality of areas comprises at least two areas from an email related area that corresponds to email messages sent from the mobile communication terminal and received at the mobile communication terminal, the scheduled task area, which corresponds to the scheduled events assigned in the mobile communication terminal, the contact area corresponding to the contact information available in the mobile communication terminal, the phone book area corresponding to the telephone numbers stored in the mobile communication terminal, a map area corresponding to card information provided by the mobile communication terminal, a photo area corresponding to photographs stored in the mobile communication terminal, a message area corresponding to messages sent from the mobile communication terminal and received at the mobile communication terminal, an area of multimedia information corresponding to multimedia functions performed in a mobile communication terminal, an area of external devices corresponding to external devices, to which may be connected to a mobile communication terminal, a telephone call history area corresponding to telephone calls transmitted from the mobile communication terminal and received at the mobile communication terminal, and a setting area corresponding to parameter setting functions performed in the mobile communication terminal.