SI22823A - Process and device for intelligent access control - Google Patents

Process and device for intelligent access control Download PDF

Info

Publication number
SI22823A
SI22823A SI200800141A SI200800141A SI22823A SI 22823 A SI22823 A SI 22823A SI 200800141 A SI200800141 A SI 200800141A SI 200800141 A SI200800141 A SI 200800141A SI 22823 A SI22823 A SI 22823A
Authority
SI
Slovenia
Prior art keywords
module
voice
telephone
speech
command
Prior art date
Application number
SI200800141A
Other languages
Slovenian (sl)
Inventor
TomaĹľ Rotovnik
Bojan Kotnik
Zdravko KaÄŤiÄŤ
Original Assignee
Univerza v Mariboru Fakulteta za elektrotehniko, računalništvo in informatiko
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univerza v Mariboru Fakulteta za elektrotehniko, računalništvo in informatiko filed Critical Univerza v Mariboru Fakulteta za elektrotehniko, računalništvo in informatiko
Priority to SI200800141A priority Critical patent/SI22823A/en
Publication of SI22823A publication Critical patent/SI22823A/en

Links

Abstract

Subject of the invention is a process and device for intelligent physical access control for protected locations by a system for multi-sensor biometric and behaviouristic identity verification based on a device for the integration of sensor data. The process for physical access control according to the invention is performed via a multisensor data acquisition on the behaviour of users during the time directly, during and after registration at the access terminal, adaptable integration of any number of sensors in a single device, storing the acquired data in a database and a two-level intelligent processing of data with machine learning. The first level of data processing consists of several adaptable modules for learning, among them is also learning based on time differences between the individual events. The second level of data processing is meta learning, which is performed based on information from the first level of learning. Regarding the adjustable parameters which originate from the security requirements and the acquired knowledge intelligent deduction on potential deviant behaviour of the controlled persons is performed.

Description

Sistem in postopek za govorno vodeno telefonsko komunikacijoSystem and procedure for voice-guided telephone communication

Izum sodi v področje sistemov in postopkov za vzpostavljanje telefonske komunikacije s pomočjo govora, natančneje v področje pripomočkov osebam s posebnimi potrebami, ki zaradi motoričnih ovir ne morejo uporabljati normalnih telefonskih aparatov oz. na področje digitalnega procesiranja signalov in področje govornih vmesnikov v telekomunikacijah.The invention belongs to the field of systems and procedures for establishing telephone communication by means of speech, more specifically to the field of accessories for persons with special needs who cannot use normal telephone sets or because of motor impediments. to the field of digital signal processing and the field of voice interfaces in telecommunications.

Tehnični problemA technical problem

Tehnični problem, ki ga rešuje sistem za govorno vodeno telefonsko komunikacijo po izumu, je zasnova elektronskega sistema, ki omogoča uporabniku in zlasti uporabniku s posebnimi potrebami glasovno (govorno) vzpostavljanje in prekinjanje telefonske komunikacije, pri čemer je uporabniku omogočeno nemoteno govorno komuniciranje s svojo okolico, brez nehotenega sprožanja delovanja sistema. Rešitev mora omogočati uporabniku s posebnimi potrebami, da s pomočjo govora kadarkoli aktivira govorno vodeni telefon, izbira ustrezno telefonsko številko, izvede klic, deaktivira govorno vodeni telefon ter nato nadaljuje z govorno komunikacijo s svojo okolico, kot to po zaključku telefonskega pogovora počno uporabniki običajnega telefona. Problem, ki ga pričujoči izum rešuje je oblikovanje algoritma, ki bo s pomočjo digitalnega procesiranja signalov aktiviral posamezne dele sistema govorno vodene telefonske komunikacije za vzpostavljanje telefonske linije, izbiranje klicane številke, opravljanje pogovora in prekinjanje telefonske linije s klicano številko. Naloga in cilj izuma je izpopolniti postopek komuniciranja s pomočjo telefona tako, da se odpravi potreba po taktilni komunikaciji, da je celotni postopek od vzpostavitve do prekinitve telefonske povezave možno izvesti samo s pomočjo govora in s pomočjo računalniškega aktiviranja posameznih funkcij obstoječih procesorsko krmiljenih naprav kot so računalnik, mobilni telefon, igralna konzola, in podobne naprave.A technical problem solved by the voice-guided telephone communication system of the invention is the design of an electronic system that enables the user, and especially the user with special needs, to voice (voice) establish and interrupt telephone communications, allowing the user to communicate smoothly with their surroundings , without inadvertently triggering the system. The solution should allow a user with a disability to activate the voice-guided telephone at any time by means of speech, select the appropriate telephone number, make a call, deactivate the voice-guided telephone, and then continue voice communication with his or her surroundings, as usual telephone users do after the telephone conversation . A problem solved by the present invention is the design of an algorithm that will activate, by means of digital signal processing, individual parts of a voice-guided telephone communication system to establish a telephone line, dial a dialed number, make a conversation, and interrupt a telephone line with a dialed number. The object and object of the invention is to perfect the process of communication by telephone by eliminating the need for tactile communication, so that the entire process from the establishment to the disconnection of the telephone connection can be performed only by voice and by computer activation of certain functions of existing processor-controlled devices such as computer, mobile phone, game console, and similar devices.

Govorna komunikacija s pomočjo telefona je danes nepogrešljiv način komunikacije. Poleg osnovne ideje prenosa sporočila na daljavo, je pomembna tudi sama vsebina sporočila. Na splošno bi lahko rekli, da uporaba telefona izboljšuje komunikacijske zmožnosti ljudi. V tej množici ljudi so tudi ljudje s posebnimi potrebami, ki zaradi bolezenskega stanja ali drugih vzrokov ne morejo uporabljati običajnega telefona na enak način, kot drugi ljudje. Težava največkrat nastopi v tisti fazi uporabe telefona, ko mora uporabnik s pomočjo taktilne komunikacije, to je, z uporabo rok, izbrati ustrezno telefonsko številko ter izvesti telefonski klic.Voice-over-telephone communication is an indispensable method of communication today. In addition to the basic idea of transmitting a message remotely, the content of the message itself is important. Generally speaking, using a phone improves people's communication skills. This crowd also includes people with disabilities who, due to a medical condition or other causes, cannot use a regular telephone in the same way as other people. The problem most often occurs at the stage of using the phone, when the user has to use the tactile communication, that is, using his hands, to select the appropriate phone number and make a phone call.

Razpoznavalnik govora razpoznava vhodni govorni signal. V splošnem nastopi težava pri razpoznavanju govora, ko uporabnik v dani situacij ne želi posredovati sporočila razpoznavalniku ampak tretji osebi. V takšnem primeru je potrebno s pomočjo stikala, uporaba taktilne komunikacije, preprečiti neželeno proženje razpoznavalnika govora ter napačnega odziva sistema.The speech recognizer recognizes the input speech signal. Generally speaking, there is a problem with speech recognition when, in a given situation, the user does not want to forward the message to the recognizer but to a third party. In such a case, the use of a switch, the use of tactile communication, should prevent unwanted triggering of the speech recognizer and the wrong response of the system.

Predstavljeni izum je izveden tako, da omogoča uporabniku ob sami uporabi govorno vodene komunikacije telefona tudi vodenje vsakdanjega pogovora brez uporabe stikala oz. taktilne komunikacije, ki bi omogočal prekinitev zajema govornega signala.The present invention is implemented in such a way that, while using the voice-guided communication of the telephone, the user can also conduct daily conversation without using a switch or switch. tactile communication that would allow interruption of speech signal acquisition.

Stanje tehnikeThe state of the art

Znanih je kar nekaj rešitev na temo govorno upravljanih telefonov. Ena izmed prvih rešitev je predstavljena v patentu US 5.007.081. V omenjenem patentnem spisu je predstavljen govorno vodeni telefon, vgrajen v ohišje običajnega namiznega telefona s tipkovnico, prikazovalnikom in telefonsko slušalko. Telefon je predviden za priključitev na analogno javno telefonsko omrežje (PSTN, ang. »Public Switched Telephone Network«). Prikazovalnik služi za kontrolo razpoznanega govornega ukaza in omogoča uporabniku, da s pritiskom na ustrezno tipko prepreči izvajanje nepravilno razpoznanega govornega ukaza. Upravljanje omenjenega govorno vodenega telefona zahteva torej tudi taktilno komunikacijo z napravo. Postopek avtomatskega razpoznavanja govora v omenjeni rešitvi temelji na postopku dinamičnega časovnega sledenja (DTW, ang. »Dynamic Time VVarping«) in je odvisen od govorca. To pomeni, da mora vsak uporabnik naučiti sistem s svojim naborom govornih ukazov pred samo uporabo sistema.There are quite a few solutions on the subject of voice-controlled phones. One of the first solutions is disclosed in US Patent 5,007,081. The aforementioned patent file discloses a voice-guided telephone embedded in the case of a conventional desktop telephone with a keyboard, display, and handset. The phone is intended to be connected to an analogue public Switched Telephone Network (PSTN). The display is used to control a recognized voice command and allows the user to prevent an incorrectly recognized voice command by pressing the appropriate key. The operation of said voice-guided telephone also requires tactile communication with the device. The automatic speech recognition process in this solution is based on the Dynamic Time VVarping (DTW) process and depends on the speaker. This means that each user must learn the system with its set of voice commands before just using the system.

Predlagana rešitev se od zgoraj opisanega patenta US 5.007.081 razlikuje po tem, da ne zahteva nobene taktilne komunikacije, ne potrebuje nobenega predhodnega učenja govornih ukazov, ni od govorca odvisna, sistem avtomatskega razpoznavanja govora pa temelji na prikritih modelih Markova. Poglavitna razlika predlagane rešitve od patenta US 5.007.081 je v tem, da predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oz. klicane osebe. Predlagana rešitev prav tako ni omejena le na analogno telefonsko omrežje, temveč lahko deluje v poljubnem digitalnem telefonskem omrežju (ISDN, VolP).The proposed solution differs from US Patent No. 5,007,081 described above in that it requires no tactile communication, requires no prior learning of voice commands, is independent of the speaker, and the automatic speech recognition system is based on covert Markov models. The main difference between the proposed solution and the patent of US 5,007,081 is that the proposed solution enables the voice activation of the telephone dialing or telephone dialing. called persons. The proposed solution is not only limited to the analogue telephone network, but can operate on any digital telephone network (ISDN, VolP).

Naslednja znana rešitev je predstavljena v patentnem spisu US 5.452.340. Gre za izboljšavo prej omenjene rešitve, saj omogoča govorno dodajanje klicane osebe in pripadajoče številke v imenik telefona kar med postopkom vzpostavitve telefonskega klica. Tudi v tem primeru je za krmiljenje uporabljena tako govorna, vnos govornih ukazov, kot tudi taktilna komunikacija pri potrjevanju pravilnosti govornih ukazov.Another known solution is disclosed in U.S. Patent No. 5,452,340. It is an improvement of the aforementioned solution, since it allows voice dialing of the called person and the corresponding number in the phone book during the process of making a phone call. Here again, both voice, voice command input and tactile communication are used to control the validity of voice commands.

Predlagana rešitev se od zgoraj opisanega patenta US 5.452.340 razlikuje po tem, da za potrjevanje ukazov ne zahteva nobene taktilne komunikacije. Poglavitna razlika predlagane rešitve od patenta US 5.452.340 je v tem, da predlagana rešitev omogoča govorno aktiviranje pričetka izbiranja telefonske številke oziroma klicane osebe.The proposed solution differs from US Pat. No. 5,452,340 described above in that no tactile communication is required to confirm commands. The main difference between the proposed solution and the patent of US 5,452,340 is that the proposed solution enables the voice dialing of the telephone number or the called person to be activated.

V patentu US 5.483.579 je predstavljena naprava, ki omogoča govorno krmiljeno klicanje in jo je možno uporabiti v kombinaciji z več klasičnimi telefonskimi aparati na isti telefonski liniji.U.S. Pat. No. 5,483,579 introduces a device that enables voice-controlled dialing and can be used in combination with several classic telephone sets on the same telephone line.

Predlagana rešitev se od zgoraj opisanega patenta US 5.483.579 razlikuje po tem, da se ne uporablja kot nadgradnja klasičnih telefonskih aparatov, ampak deluje kot samostojna govorno vodena komunikacijska naprava. Predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe, medtem ko je v omenjenem patentu govorno izbiranje aktivirano šele z dvigom slušalke in ni popolnoma prostoročno.The proposed solution differs from the above-described US Pat. No. 5,483,579 in that it is not used as an upgrade of classic telephone sets but acts as a standalone voice-guided communication device. The proposed solution enables voice activation of the dialing start of the telephone number or the person called, while in the aforementioned patent, voice dialing is only activated by lifting the handset and is not completely hands free.

Primer govorno krmiljenega telefona brez vsakršne uporabe taktilne komunikacije je predstavljen v patentnem spisu US 6.167.251. V patentu je opisan sistem mobilnega celičnega telefona (lahko GSM ali CDMA), ki ne vsebuje nobene tipke oz. tipkovnice. Tako govorni ukazi, kot tudi telefonski pogovori se prenašajo brezžično preko bazne postaje v centralo. Strežnik v centrali omogoča razpoznavanje govornih ukazov, vzpostavljanje telefonskih povezav, govorno krmiljeno urejanje imenikov itd.An example of a voice-controlled telephone without any use of tactile communication is presented in U.S. Patent No. 6,167,251. The patent describes a mobile cellular telephone system (GSM or CDMA) that does not contain any key or key. keyboards. Both voice commands and telephone conversations are transmitted wirelessly through the base station to the exchange. The PBX server enables voice command recognition, telephone connections, voice-controlled directories editing, etc.

Predlagana rešitev se od zgoraj opisanega patenta US 6.167.251 razlikuje po tem, da ne vsebuje oddaljene centralne enote, ki bi opravljala postopek avtomatskega razpoznavanja govora, ampak je postopek razpoznavanja vgrajen neposredno v sistem govorno vodene komunikacijske naprave. Prav tako predlagana rešitev ni omejena na sisteme mobilne celične telefonije točno določenega operaterja, ki bi podpiral govorno vodene storitve. Poglavitna razlika predlagane rešitve od patenta US 6.167.251 je v tem, da predlagana rešitev omogoča govorno aktiviranje pričetka izbiranja telefonske številke oziroma klicane osebe. Rešitev v tem patentu sicer ne vsebuje klasične tipkovnice ali številčnice, vendar pa vsebuje tipko za vklop in izklop telefona, s čemer dobi center znak za začetek procesiranja govornega signala. Uporabnik izgovori telefonsko številko šele po sporočilu, ki mu ga posreduje strežnik.The proposed solution differs from US Pat. No. 6,167,251 above in that it does not contain a remote central unit that would perform the automatic speech recognition process, but that the recognition process is integrated directly into the voice-guided communication device system. Likewise, the proposed solution is not limited to mobile cellular telephony systems from a specific carrier that supports voice-guided services. The main difference between the proposed solution and the patent of US 6,167,251 is that the proposed solution enables the voice dialing of the telephone number or the called person to be activated. Although the solution in this patent does not include a classic keyboard or dial, it does have a key to turn the phone on and off, giving the center a signal to start processing the voice signal. The user speaks the telephone number only after a message is sent to the server.

Patent US 2002/0097843A1 opisuje posebno vgradno napravo, ki omogoča govorno podporo za poljubni telefonski aparat. Sistem za razpoznavanje je odvisen od govorca, ki mora sistem najprej naučiti na detekcijo krmilnih besed.US 2002 / 0097843A1 describes a special built-in device that provides voice support for any telephone set. The recognition system depends on the speaker, who must first learn the system to detect the control words.

Predlagana rešitev se od zgoraj opisanega patenta US 2002/0097843A1 razlikuje po tem, da ne potrebuje nobenega predhodnega učenja govornih ukazov in ni odvisna od govorca. Prav tako se predlagana rešitev ne uporablja kot vgradna naprava za klasične telefonske aparate, ampak deluje kot samostojna govorno vodena komunikacijska naprava. Poglavitna razlika predlagane rešitve od prej opisane je v tem, da predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe.The proposed solution differs from the above patent US 2002 / 0097843A1 in that it does not require any prior learning of spoken commands and is independent of the speaker. Also, the proposed solution is not used as a built-in device for classic telephones, but acts as a standalone voice-guided communication device. The main difference between the proposed solution and the one described above is that the proposed solution enables the voice dialing of the telephone or the called person to be dialed.

V patentnem spisu US 5.799.065 je opisana naprava z nazivom »virtualni telefonski operater« za govorno vodeno vzpostavljanje klicev brez uporabe taktilne komunikacije. Sistem za avtomatsko razpoznavanje govora s tremi prehodi temelji na prikritih modelih Markova (HMM, ang. »Hidden Markov Models«) z uporabo PLPRASTA (ang. »Perceptual Linear Predictive Coding - Relative Spectral«) značilk. Sistem je sposoben procesirati tekoči govor, vključen pa je v telefonsko centralo.U.S. Pat. No. 5,799,065 describes a device called "virtual telephone operator" for voice-guided calling without the use of tactile communication. The three-pass automatic speech recognition system is based on Hidden Markov Models (HMM) using PLPRAST (Perceptual Linear Predictive Coding - Relative Spectral) features. The system is capable of processing current speech and is integrated into the telephone exchange.

Predlagana rešitev se od zgoraj opisanega patenta US 5.799.065 razlikuje po tem, da sistem za procesiranje govornega signala oz. za avtomatsko razpoznavanje govora ni vključen v telefonsko centralo, temveč se uporablja kot samostojna govorno vodena komunikacijska naprava. Poglavitna razlika predlagane rešitve od patenta US 5.799.065 je v tem, da predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe.The proposed solution differs from the above-described U.S. Pat. No. 5,799,065 in that the voice signal processing system, or it is not included in the telephone exchange for automatic speech recognition, but is used as a standalone voice-guided communication device. The main difference between the proposed solution and the patent of US 5,799,065 is that the proposed solution enables the voice dialing of the telephone or the called person to be dialed.

Opis rešitve tehničnega problemaDescription of solution to a technical problem

Bistvo sistema za govorno vodeno telefonsko komunikacijo po izumu je v tem, da naprava vsebuje mikrofon, slušalke, govorno procesorsko napravo na kateri se izvaja razpoznavanje govora ali oddaljeni mikrofon, zvočnik, prilagojeno govorno procesorsko napravo na kateri se poleg razpoznavanja govora izvaja dodatno predprocesiranje govora in nadzorno procesorsko napravo na kateri teče aplikacija vzpostavitve telefonske povezave ter prenosa govora. Bistvo izuma je v algoritmu s katerim se vpišejo vsi za uporabnika značilni govorni podatki v procesorsko napravo za razpoznavo govora, ki krmili nadzorno procesorsko napravo katere algoritem omogoča vzpostavitev telefonske povezave ter prenosa govora. Algoritem omogoča izvedbo klica s pomočjo govora brez kakršnegakoli drugega posredovanja uporabnika. Aplikacije algoritmov govorne procesorske naprave, prilagojene govorne procesorske naprave in nadzorne procesorske naprave se lahko realizirajo na računalniku, digitalnem komunikacijskem terminalu, igralno konzolo, osebni organizator in podobno.The essence of a voice-guided telephone communication system according to the invention is that the device comprises a microphone, a headset, a speech processing device on which speech recognition is performed or a remote microphone, a speaker, a customized voice processing device on which, in addition to speech recognition, additional preprocessing of speech is performed, and a control processor device running the application of establishing a telephone connection and voice transmission. The essence of the invention is an algorithm whereby all user-specific voice data is entered into a speech recognition processor that controls a controlling processor device whose algorithm enables telephone connection and speech transmission. The algorithm enables voice-rendered calls without any other user intervention. Applications of voice processor algorithms, custom voice processor devices, and control processor devices may be implemented on a computer, digital communication terminal, game console, personal organizer, and the like.

Predmet izuma je tudi postopek za govorno vodeno telefonsko komunikacijo, ki je izveden tako, da omogoča izvedbo klica s pomočjo govora, brez kakršnegakoli drugega posredovanja uporabnika.The subject of the invention is also a method for voice-guided telephone communication, which is implemented in such a way as to enable the call to be made by voice without any other intervention of the user.

Sistem za govorno vodeno telefonsko komunikacijo po izumu bo podrobneje opisan s pomočjo slik, ki kažejo:The voice-guided telephone communication system of the invention will be described in greater detail by means of pictures showing:

Slika 1 - Blokovno shemo sistema govorno vodene telefonske komunikacije Slika 2 - Sistem in postopek za govorno vodene telefonske komunikacije Slika 3 - Sistem in postopek za govorno vodene telefonske komunikacije po izvedbenem primeru IFigure 1 - Block diagram of the voice-guided telephone communication system Figure 2 - System and procedure for voice-guided telephone communications Figure 3 - System and procedure for voice-guided telephone communications after embodiment I

Predmet izuma je postopek za govorno vodeno telefonsko komunikacijo, ki je izveden tako, da s prav tako v izumu opisanem sistemu za to komunikacijo omogoča izvedbo klica s pomočjo govora, brez kakršnegakoli drugega posredovanja uporabnika. Uporabnik z uporabo takšnega sistema s ' ponločjo govora kadarkoli aktivira sistem za govorno vodeno telefonsko komunikacijo, izbira ustrezno telefonsko številko, izvede klic, deaktivira sistem ter nato nadaljuje z govorno komunikacijo s svojo okolico, kot to po zaključku telefonskega pogovora počno uporabniki običajnega telefona.The subject of the invention is a method for voice-guided telephone communication, which is implemented in such a way that the system described for the communication described in the invention also enables a call to be made by voice without any other intervention of the user. Using such a system of 'voice conversations', the user activates the voice-guided telephone communication system at any time, dials the appropriate telephone number, makes a call, deactivates the system, and then resumes voice communication with his or her surroundings, as usual telephone users do after the telephone conversation has ended.

Predstavljeni izum je izveden tako, da omogoča uporabniku ob sami uporabi govorno vodene komunikacije telefona tudi vodenje vsakdanjega pogovora brez uporabe stikala, taktilne komunikacije, ki bi omogočal prekinitev zajema govornega signala.The present invention is designed to allow the user, while using the voice-guided communication of the telephone itself, to conduct daily conversation without the use of a switch, tactile communication that would allow interruption of speech signal acquisition.

V tem izumu opisan sistem omogoča s pomočjo govorne tehnologije tudi ljudem s posebnimi potrebami uporabo telefona na način, ki je enakovreden načinu uporabe, kot ga uporabljajo drugi ljudje. Uporabnik lahko z uporabo govorno vodenega telefona s pomočjo govora kadarkoli aktivira govorno vodeni telefon, izbira ustrezno telefonsko številko, izvede klic, deaktivira govorno vodeni telefon ter nato nadaljuje z govorno komunikacijo s svojo okolico, kot to po zaključku telefonskega pogovora počno uporabniki običajnega telefona.In the present invention, the system described also makes it possible for people with disabilities to use the telephone in a manner equivalent to that used by other people through speech technology. Using a voice-guided telephone, the user can activate the voice-guided telephone at any time, select the appropriate telephone number, make a call, deactivate the voice-guided telephone, and then resume voice communication with his or her surroundings, as users of a regular telephone do after completing a telephone conversation.

Sistem za govorno vodeno telefonsko komunikacijo je sestavljen iz govorne procesorske naprave 1.1, nadzorne procesorske naprave 1.4, slušalk 1.3 ter mikrofona 1.2. Na govorni procesorski napravi 1.1 se izvaja algoritem za razpoznavo govora, ki vhodni govorni signal pretvarja v tekstovno obliko. Rezultat razpoznave v tekstovni obliki se prenese v nadzorno procesorsko napravo 1.4. Nadzorna procesorska naprava 1.4 obdela ukaz prejet od govorne procesorske naprave 1.1 in izvede ustrezno akcijo. Akcija je lahko omogočitev telefonske povezave, izbira telefonske številke ali imena pridruženega izbrani telefonski številki, vzpostavitev telefonske povezave, sprejem dohodnega klica, zavrnitev telefonskega klica ali prekinitev telefonske povezave. Slušalke 1.3 in mikrofon 1.2 so lahko žično ali brezžično povezane z govorno procesorsko napravo 1.1 za razpoznavo govora. Ob vzpostavitvi telefonske povezave uporabnik komunicira z drugim uporabnikom preko mikrofona 1.2 ter slušalk 1.3.The voice-guided telephone communication system consists of a voice processor 1.1, a control processor 1.4, a handset 1.3 and a microphone 1.2. Speech processor 1.1 implements a speech recognition algorithm that converts the input speech signal to text format. The result of the recognition in text format is transferred to the control processor 1.4. The control processor 1.4 processes the command received from the speech processor 1.1 and performs the appropriate action. An action can be to enable a phone connection, select a phone number or name associated with the selected phone number, make a phone connection, answer an incoming call, reject a phone call, or end the phone connection. The headset 1.3 and microphone 1.2 can be wired or wirelessly connected to a speech processor 1.1 for speech recognition. When establishing a telephone connection, the user communicates with another user via microphone 1.2 and handset 1.3.

V izvedbenem primeru I sistema za govorno vodeno telefonsko komunikacijo je sistem sestavljen iz prilagojene govorne procesorske naprave 1.5, nadzorne procesorske naprave 1.4, zvočnika 1.6 ter oddaljenega mikrofona 1.7. Na prilagojeni govorni procesorski napravi 1.5 se poleg algoritma za razpoznavo govora izvaja še algoritem za dodatno predprocesiranje govornega signala, ki zmanjšuje odmeve, ki so posledica prostoročnega telefoniranja. Rezultat razpoznave v tekstovni obliki se enako kot pri prvi izvedbi prenese v nadzorno procesorsko napravo 1.4. Nadzorna procesorska naprava 1.4 obdela ukaz prejet od prilagojene govorne procesorske naprave 1.5 in izvede ustrezno akcijo za omogočitev telefonske povezave, izbira telefonske številke ali imena pridruženega izbrani telefonski številki, vzpostavitev telefonske povezave, sprejem dohodnega klica, zavrnitev telefonskega klica ali prekinitev telefonske povezave. Zvočnik 1.6 in oddaljeni mikrofon 1.7 so lahko enako kot v pri prvi izvedbi žično ali brezžično povezane s prilagojeno govorno procesorsko napravo 1.5 za predprocesiranje in razpoznavo govora. Ob vzpostavitvi telefonske povezave uporabnik komunicira z drugim uporabnikom preko oddaljenega mikrofonaIn embodiment I of the voice-guided telephone communication system, the system consists of a custom voice processor 1.5, a control processor 1.4, a speaker 1.6, and a remote microphone 1.7. On the custom voice processor 1.5, in addition to the speech recognition algorithm, an algorithm for additional preprocessing of the speech signal is implemented, which reduces the echoes that result from hands-free calling. The result of the recognition in text format is transferred to the control processor 1.4 in the same way as in the first version. The processor processor 1.4 processes the command received from the personalized voice processor 1.5 and performs the appropriate action to enable a telephone connection, select a telephone number or name associated with the selected telephone number, establish a telephone connection, answer an incoming call, reject a telephone call, or disconnect a telephone connection. Loudspeaker 1.6 and Remote Microphone 1.7 can be wired or wirelessly connected, as in the first embodiment, to a customized speech processor 1.5 for preprocessing and speech recognition. When establishing a telephone connection, the user communicates with another user via a remote microphone

1.7 ter zvočnika 1.6.1.7 and the speaker 1.6.

Govorno procesorsko napravo 1.1, prilagojeno govorno procesorsko napravo 1.5 in nadzorno procesorsko napravo 1.4 lahko predstavlja digitalni računalnik, prenosni računalnik, digitalni komunikacijski terminal (mobilna komunikacijska naprava - na primer mobilni telefon), igralna konzola, digitalni osebni organizator, pri tem pa uporaba izuma ni omejena le na omenjene naprave.Voice processor 1.1, customized voice processor 1.5 and control processor 1.4 may be represented by a digital computer, a laptop computer, a digital communication terminal (mobile communication device - for example a mobile phone), a game console, a digital personal organizer, without the use of the invention. limited to the aforementioned devices only.

Postopek izvedbe akcije omogočitve, izbire telefonske številke, vzpostavitve, sprejema in zavrnitve telefonskega klica govorno vodene telefonske komunikacije je podan na sliki 2. Uporabnik sistema po izumu sproži preko vnaprej določene ključne besede virtualno tipko START, ki je vezana preko modulov S2.1, S2.2, S2.3 na modul S2.4 in predstavlja govorno procesorsko napravo 1.1. Modul S2.1 preverja ali je bil detektiran govor. Modul S2.2 določi značilke govornega signala in modul S2.3 določi izgovorjeno besedo. Modul S2.4 določi zanesljivost razpoznane besede. Če je v modulu S2.1 na vprašanje, ali je bil detektiran govor, odgovor NO se postopek ponovno vrne na tipko START. Ko je odgovor na izhodu modula S2.1 YES poteka postopek naprej preko modulov S2.2, S2.3 in S2.4 do modula S2.5, ki preverja ali je razpoznana beseda dovolj zanesljiva. Če je odgovor NO, se postopek vrača na virtualno tipko START. Ko je odgovor YES se proces nadaljuje v modulu S2.6, ki preverja ali je razpoznan ukaz za omogočitev telefonske povezave. Če je odgovorThe process of performing an enabling, selecting a telephone number, establishing, receiving and rejecting a telephone call by voice-guided telephone communication is given in Figure 2. The system user of the invention, via a predefined keyword, triggers a virtual START key, which is bound via modules S2.1, S2 .2, S2.3 to module S2.4, and represents the voice processor device 1.1. Module S2.1 checks whether speech has been detected. Module S2.2 defines the characteristics of the speech signal and module S2.3 defines the spoken word. Module S2.4 determines the reliability of a recognized word. If there is a question in module S2.1 whether a speech has been detected, the NO will return to the START key again. When the answer is at the output of module S2.1 YES, the process proceeds through modules S2.2, S2.3 and S2.4 to module S2.5, which verifies that the recognized word is sufficiently reliable. If NO, the process returns to the virtual START key. When the answer is YES, the process continues in module S2.6, which checks whether the command to enable telephone connection is recognized. If yes

YES se proces nadaljuje na modulu S2.14. V kolikor je odgovor NO se proces nadaljuje v modulu S2.7, ki preverja ali je razpoznan ukaz za vzpostavitev telefonske povezave. Če je odgovor YES se proces nadaljuje v modulu S2.13 in modulu S2.14. Če je odgovor v modulu S2.7 NO se proces nadaljuje v modulu S2.8, ki preverja ali je razpoznan ukaz za ponoven vnos telefonske številke. Če je odgovor YES se proces nadaljuje do modula S2.14, katerega izhod je povezan s tipko START. Če je na modulu S2.8 odgovor No se proces nadaljuje v modulu S2.9, ki preverja ali je razpoznan ukaz za zavrnitev oz. prekinitev telefonske povezave. Ko je odgovor YES se proces nadaljuje v modulu S2.15, katerega izhod je vezan na tipko START. Ko je v modulu S2.9 odgovor NO se proces nadaljuje v modulu S2.10, ki preverja ali je razpoznan ukaz za sprejem telefonskega pogovora. Če je odgovor YES se proces nadaljuje preko modula S2.11 ki vzpostavi telefonski pogovor. Če je v modulu S2.10 odgovor NO se proces nadaljuje preko modula S2.12, ki omogoči prikaz izbrane telefonske številke. Izhod modula S2.12 je vezan na modul S2.14 za Izbiro ustrezne slovnice. Modul S2.13 ima ukaz - Izvedi klic. Modul S2.15 ima ukaz - Prekini telefonski pogovor. Moduli S2.5 do S2.15 izvajajo funkcije nadzorne procesorske naprave 1.4.The YES process continues on module S2.14. If NO, the process continues in module S2.7, which checks whether the command to establish a telephone connection is recognized. If YES, the process continues in module S2.13 and module S2.14. If the answer is in module S2.7 NO, the process is continued in module S2.8, which checks whether the command to re-enter the telephone number is recognized. If YES, the process proceeds to module S2.14, the output of which is connected to the START key. If the answer to No is S2.8, then the process continues in module S2.9, which checks whether the rejection or rejection command is recognized. interruption of telephone connection. When the answer is YES, the process resumes in module S2.15, whose output is bound to the START key. When NO is answered in Module S2.9, the process continues in Module S2.10, which checks if a command to receive a telephone conversation is recognized. If YES, the process is continued via the S2.11 module which establishes a telephone conversation. If NO is answered in module S2.10, the process is continued via module S2.12, which allows the selected telephone number to be displayed. The output of Module S2.12 is linked to Module S2.14 to select the appropriate grammar. Module S2.13 has the command - Make a call. Module S2.15 has the command - End telephone conversation. Modules S2.5 to S2.15 perform the functions of a control processor 1.4.

Postopek izvedbe akcije omogočitve, izbire telefonske številke, vzpostavitve, sprejema in zavrnitve telefonskega klica govorno vodene telefonske komunikacije po izvedbenem primeru I je podan na sliki 3. Uporabnik sistema po izumu sproži preko vnaprej določene ključne besede virtualno tipko START, ki je vezana preko modulovThe process of executing an enabling, selecting a telephone number, establishing, receiving, and rejecting a telephone call of a voice-guided telephone communication according to embodiment I is shown in Figure 3. The system user according to the invention triggers via a predetermined keyword a virtual START key which is bound via modules

S3.1, S3.2, S3.3, S3.4 na modul S3.5 in predstavlja prilagojeno govorno procesorsko napravo 1.5. Modul S3.1 preverja ali je bil detektiran govor. Modul S3.2 zajeti govorni signal dodatno predprocesira z namenom zmanjšanja odmevov. Modul S3.3 določi značilke govornega signala in modul S3.4 določi izgovorjeno besedo. Modul S3.5 določi zanesljivost razpoznane besede. Če je v modulu S3.1 na vprašanje, ali je bil detektiran govor, odgovor NO se postopek ponovno vrne na tipko START. Ko je odgovor na izhodu modula S3.1 YES poteka postopek naprej preko modulov S3.2, S3.3, S3.4 in S3.5 do modula S3.6, ki preverja ali je razpoznana beseda dovolj zanesljiva. Če je odgovor NO, se postopek vrača na virtualno tipko START. Ko je odgovor YES se proces nadaljuje v modulu S3.7, ki preverja ali je razpoznan ukaz za omogočitev telefonske povezave. Če je odgovor YES se proces nadaljuje na modulu S3.15. V kolikor je odgovor NO se proces nadaljuje v modulu S3.8, ki preverja ali je razpoznan ukaz za vzpostavitev telefonske povezave. Če je odgovor YES se proces nadaljuje v modulu S3.14 in modulu S3.15. Če je odgovor v modulu S3.8 NO se proces nadaljuje v modulu S3.9, ki preverja ali je razpoznan ukaz za ponoven vnos telefonske številke. Če je odgovor YES se proces nadaljuje do modulaS3.1, S3.2, S3.3, S3.4 to module S3.5 and is a customized voice processor 1.5. Module S3.1 checks whether speech has been detected. Module S3.2 additionally preprocesses the captured voice signal in order to reduce echoes. Module S3.3 defines the characteristics of the speech signal and module S3.4 defines the spoken word. Module S3.5 determines the reliability of the recognized word. If there is a question in module S3.1 whether a speech has been detected, the NO will return to the START key again. When the answer is at the output of module S3.1 YES, the process proceeds through modules S3.2, S3.3, S3.4 and S3.5 to module S3.6, which verifies that the recognized word is sufficiently reliable. If NO, the process returns to the virtual START key. When YES is answered, the process continues in module S3.7, which checks whether the command to enable telephone connection is recognized. If YES, the process continues on module S3.15. If NO, the process continues in module S3.8, which checks whether the command to establish a telephone connection is recognized. If YES, the process continues in module S3.14 and module S3.15. If the answer is in Module S3.8 NO, the process is continued in Module S3.9, which checks whether the command to re-enter the telephone number is recognized. If YES, the process proceeds to the module

53.15, katerega izhod je povezan s tipko START. Če je na modulu S3.9 odgovor No se proces nadaljuje v modulu S3.10, ki preverja ali je razpoznan ukaz za zavrnitev oz. prekinitev telefonske povezave. Ko je odgovor YES se proces nadaljuje v modulu53.15, the output of which is connected to the START key. If Module S3.9 answers No, the process continues in Module S3.10, which verifies that the reject or reject command is recognized. interruption of telephone connection. When the answer is YES, the process continues in the module

53.16, katerega izhod je vezan na tipko START. Ko je v modulu S3.10 odgovor NO se proces nadaljuje v modulu S3.11, ki preverja ali je razpoznan ukaz za sprejem telefonskega pogovora. Če je odgovor YES se proces nadaljuje preko modula S3.12 ki vzpostavi telefonski pogovor. Če je v modulu S3.11 odgovor NO se proces nadaljuje preko modula S3.13, ki omogoči prikaz izbrane telefonske številke. Izhod modula S3.13 je vezan na modul S3.15 za Izbiro ustrezne slovnice. Modul S3.14 ima ukaz - Izvedi klic. Modul S3.16 ima ukaz - Prekini telefonski pogovor. Moduli S3.6 do S3.16 izvajajo funkcije nadzorne procesorske naprave 1.4.53.16, the output of which is bound to the START key. When NO is answered in Module S3.10, the process continues in Module S3.11, which checks whether the command to receive a telephone conversation is recognized. If YES, the process is continued through module S3.12, which establishes a telephone conversation. If NO is answered in module S3.11, the process is continued via module S3.13, which allows the selected telephone number to be displayed. The output of Module S3.13 is linked to Module S3.15 to select the appropriate grammar. Module S3.14 has the command - Make a call. Module S3.16 has the command - End telephone conversation. Modules S3.6 to S3.16 perform the functions of a control processor 1.4.

Uporabnik ima aktiviran mikrofon 1.2 in slušalke 1.3 ali oddaljeni mikrofon 1.7 in zvočnik 1.6 ter vodi normalno komunikacijo s svojo okolico. V trenutku, ko želi izvesti telefonski pogovor s pomočjo v tem opisu predstavljenega izuma izgovori ključno besedo ali ustrezno zaporedje ključnih besed za omogočitev telefonske povezave. Po pravilno razpoznani ključni besedi uporabnik izgovori želeno telefonsko številko ali ime osebe, ki jo želi poklicati in kateri je v sistemu pridružena ustrezna telefonska številka. Če je razpoznana napačna telefonska številka jo uporabnik lahko ponovi, drugače pa sproži klic z izgovarjavo ključne besede za vzpostavitev povezave. Med samo aktivno telefonsko povezavo jo lahko uporabnik kadarkoli zaključi z izgovarjavo ključne besede oziroma izbranega zaporedja ključnih besed za prekinitev telefonske povezave. Z izgovarjavo ključnih besed za sprejem telefonskega pogovora ali za zavrnitev telefonske povezave ima uporabnik popoln nadzor nad telefonsko komunikacijo. Uporabnik lahko sam določi ključne besede za vzpostavitev in prekinitev telefonske povezave.The user has an activated microphone 1.2 and headset 1.3 or remote microphone 1.7 and speaker 1.6 and guides normal communication with their surroundings. As soon as he wishes to make a telephone conversation, he utters a keyword or an appropriate sequence of keywords to enable a telephone connection using the invention described herein. After a correctly recognized keyword, the user speaks the desired phone number or the name of the person he wants to call and to which the corresponding phone number is associated in the system. If the wrong phone number is identified, the user can repeat it, otherwise it will trigger a call by saying the keyword to connect. During an active telephone connection only, the user can terminate it at any time by uttering a keyword or a selected sequence of keywords to end the telephone connection. By saying keywords to receive a telephone conversation or to reject a telephone connection, the user has complete control over the telephone communication. It is up to the user to determine the keywords to establish and disconnect the telephone connection.

V tej patentni prijavi opisana rešitev se razlikuje od obstoječih rešitev v naslednjih točkah:The solution described in this patent application differs from the existing solutions in the following points:

Omogočitev govornega aktiviranja začetka izbiranja telefonske številke oziroma klicane osebe.Enable voice activation to start dialing a phone number or a person called.

Uporaba sodobnih pristopov v postopku avtomatskega procesiranja in razpoznavanja govora omogoča procesiranje tekočega govora in razpoznavo govornih ukazov tudi med samim telefonskim pogovorom, brez poseganja v zagon in ustavitev razpoznavanja. Takšen pristop pa rezultira v vodenju vsakdanjega pogovora brez uporabe stikala (taktilne komunikacije), ki bi omogočala prekinitev zajema govornega signala za pravilno delovanje sistema razpoznavanja govora.The use of modern approaches in the process of automatic processing and speech recognition enables the processing of current speech and the recognition of voice commands even during the telephone conversation itself, without prejudice to starting and stopping the recognition. Such an approach, however, results in the conduct of everyday conversation without the use of a switch (tactile communication) that would allow the interruption of speech signal acquisition for the proper functioning of the speech recognition system.

Delovanje v sklopu sodobnih telefonskih tehnologij Vol P (ang. »Voice overOperating in the modern voice technologies of Vol P (Voice over

Internet Protocol«) ali katerekoli digitalne telefonske povezave, ki prenaša govorne pakete.Internet Protocol ") or any digital telephone connection that transmits voice packets.

Za delovanje ni potrebna nobena specifična strojna oprema.No specific hardware is required to operate.

Razpoznavanje je neodvisno od govorca in ni potrebno predhodno učenje na posameznega govorca.Recognition is independent of the speaker and no prior learning is required on the individual speaker.

- Razpoznavalni sistem s klicno logiko je integriran lokalno, kar predstavlja prihranek pri obremenitvi telefonske linije (potrebna samo pri dejanski vzpostavitvi klica).- Call Logic Recognition is integrated locally, saving on phone line load (only required when actually making a call).

Claims (10)

1. Sistem za govorno vodeno telefonsko komunikacijo, ki je lahko apliciran s pomočjo digitalnega računalnika, prenosnega računalnika, digitalnega komunikacijskega terminala na primer mobilni telefon, igralno konzolo, digitalnega osebnega organizatorja ali drugih podobnih elektronskih naprav, označen s tem, da uporabnik s pomočjo govora brez uporabe taktilne komunikacije izvede telefonsko komunikacijo s sistemom, ki ga sestavljajo govorna procesorska naprava (1.1) povezana z nadzorno procesorsko napravo (1.4), mikrofon (1.2), ki je povezan na govorno procesorsko napravo (1.1) , in slušalke (1.3), ki so povezane na govorno procesorsko napravo (1.1).1. Voice-guided telephone communication system, which may be applied by means of a digital computer, a laptop computer, a digital communication terminal such as a mobile phone, a game console, a digital personal organizer or other similar electronic devices, characterized in that the user is using speech without the use of tactile communication, carry out telephone communication with a system consisting of a speech processing device (1.1) connected to a control processing device (1.4), a microphone (1.2) connected to a speech processing device (1.1), and headphones (1.3), which are connected to the speech processing device (1.1). 2. Sistem za govorno vodeno telefonsko komunikacijo po zahtevku 1, označen s tem, da sistem sestavljajo prilagojena govorna procesorska naprava (1.5) povezana z nadzorno procesorsko napravo (1.4), oddaljeni mikrofon (1.7), ki je povezan na prilagojeno govorno procesorsko napravo (1.5), in zvočnik (1.6), ki je prav tako povezan na prilagojeno govorno procesorsko napravo (1.5).Voice-guided telephone communication system according to claim 1, characterized in that the system consists of a custom voice processor (1.5) connected to a control processor (1.4), a remote microphone (1.7) connected to a custom voice processor ( 1.5), and a loudspeaker (1.6), which is also connected to a custom voice processor (1.5). 3. Sistem za govorno vodeno telefonsko komunikacijo po zahtevku 1, označen s tem, da omogoča uporabniku govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe tako, da sproži preko vnaprej določene ključne besede virtualno tipko START, ki je vezana preko modulov (S2.1, S2.2, S2.3) na modul (S2.4) in predstavlja govorno procesorsko napravo (1.1), da modul (52.1) preverja ali je bil detektiran govor, da modul (S2.2) določi značilke govornega signala in modul (S2.3) določi izgovorjeno besedo, da modul (S2.4) določi zanesljivost razpoznane besede, da v prieru da je v modulu (S2.1) na vprašanje, ali je bil detektiran govor, odgovor NO se postopek ponovno vrne na tipko START, da v primeru ko je odgovor na izhodu modula (S2.1) YES poteka postopek naprej preko modulov (S2.2, S2.3 in S2.4) do modula (S2.5), ki preverja ali je razpoznana beseda dovolj zanesljiva, da se v primeru ko je odgovor NO, postopek vrača na virtualno tipko START, da se v primeru ko je odgovor YES proces nadaljuje v modulu (S2.6), ki preverja ali je razpoznan ukaz za omogočitev telefonske povezave, Da se v primeru ko je odgovor YES proces nadaljuje na modulu (S2.14), da se v primeru ko je odgovor NO proces nadaljuje v modulu (S2.7), ki preverja ali je razpoznan ukaz za vzpostavitev telefonske povezave, da se v primeru da je odgovor YES proces nadaljuje v modulu (S2.13) in modulu (S2.14), da se v primeru ko je v modulu (S2.7) odgovor NO proces nadaljuje v modulu (S2.8), ki preverja ali je razpoznan ukaz za ponoven vnos telefonske številke, da se v primeru ko je odgovor YES proces nadaljuje do modula (S2.14), katerega izhod je povezan s tipko START, da se v primeru, ko je na modulu (S2.8) odgovor No proces nadaljuje v modulu (S2.9), ki preverja ali je razpoznan ukaz za zavrnitev oz. prekinitev telefonske povezave, da se v primeru, ko je odgovor YES proces nadaljuje v modulu (S2.15), katerega izhod je vezan na tipko START, da se v primeru, ko je v modulu (S2.9) odgovor NO proces nadaljuje v modulu (S2.10), ki preverja ali je razpoznan ukaz za sprejem telefonskega pogovora, da se v primeru, ko je odgovor YES proces nadaljuje preko modula (S2.11), ki vzpostavi telefonski pogovor; da se v primeru, ko je v modulu (S2.10) odgovor NO proces nadaljuje preko modula (S2.12), ki omogoči prikaz izbrane telefonske številke, da je izhod modula (S2.12) vezan na modul (S2.14) za Izbiro ustrezne slovnice, da modul (S2.13 ) izvede ukaz - Izvedi klic; da modul (S2.15) izvrši ukaz - Prekini telefonski pogovor, da moduli (S2.5 do S2.15) izvajajo funkcije nadzorne procesorske naprave (1.4).3. Voice-guided telephone communication system according to claim 1, characterized in that it allows the user to activate voice dialing of the telephone number or the called person by voice triggering via a predetermined keyword a virtual START key connected via modules (S2.1 , S2.2, S2.3) to the module (S2.4) and represents a speech processing device (1.1) for the module (52.1) to check whether speech has been detected, for the module (S2.2) to determine the characteristics of the speech signal and module (S2.3) determines the spoken word that module (S2.4) determines the reliability of the recognized word, that if module (S2.1) answers the question whether speech has been detected, the answer NO returns to the key again START that in the case when the answer is at the output of module (S2.1) YES proceeds through the modules (S2.2, S2.3 and S2.4) to the module (S2.5), which checks that the word is recognized enough it is certain that in the case when the answer is NO, the process returns to the virtual START key; the YES response proceeds in the module (S2.6), which checks whether the command to enable telephone connection is recognized, that if the YES response process proceeds on the module (S2.14), that if the answer is NO process proceeds in module (S2.7), which checks whether the command to establish a telephone connection is recognized, so that in the event that the answer is YES, the process continues in module (S2.13) and module (S2.14), so that when the module NO (S2.7) has the NO process resumed in the module (S2.8), which checks whether the command to re-enter the telephone number is recognized, so that if the YES process is continued to the module (S2.14), the output of which is connected to the START key, so that, if the module (S2.8) has the No response, the process continues in the module (S2.9), which checks whether the reject or execute command is recognized. interruption of the telephone connection to continue in the module (S2.15) whose output is linked to the START key in the case when the answer is YES, that in the module (S2.9) the NO process will continue in a module (S2.10) that checks whether a command to receive a telephone conversation is recognized so that in the event that the YES response is continued through the module (S2.11) that initiates a telephone conversation; that if the module NO (S2.10) has NO response, the process is continued via the module (S2.12), which allows the selected telephone number to be displayed, that the output of the module (S2.12) is tied to the module (S2.14) for Selecting the appropriate grammar for the module (S2.13) to execute the command - Make a call; for the module (S2.15) to execute the command - End the telephone conversation for the modules (S2.5 to S2.15) to perform the functions of the control processor device (1.4). 4. Sistem za govorno vodeno telefonsko komunikacijo po zahtevku 1 in 2, označen s tem, da omogoča uporabniku govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe tako, da sproži preko vnaprej določene ključne besede virtualno tipko START, ki je vezana preko modulov (S3.1, S3.2, S3.3, S3.4) na modul (S3.5) in predstavlja prilagojeno govorno procesorsko napravo (1.5), da modul (S3.1) preverja ali je bil detektiran govor, da modul (S3.2) zajeti govorni signal dodatno predprocesira z namenom zmanjšanja odmevov, da modul (S3.3) določi značilke govornega signala in modul (S3.4) določi izgovorjeno besedo, da modul (S3.5) določi zanesljivost razpoznane besede; da se, če je odgovor NO v modulu (S3.1) na vprašanje, ali je bil detektiran govor, postopek ponovno vrne na tipko START, da se, če je odgovor YES na izhodu modula (S3.1), postopek izvaja naprej preko modulov (S3.2, S3.3, S3.4 in S3.5) do modula (S3.6), ki preverja ali je razpoznana beseda dovolj zanesljiva; da se v primeru odgovora NO, postopek vrača na virtualno tipko START, da se v primeru odgovora YES proces nadaijuje v modulu (S3.7), ki preverja ali je razpoznan ukaz za omogočitev telefonske povezave, da se v primeru odgovor YES proces nadaljuje na modulu (S3.15), da se v primeru odgovora NO proces nadaljuje v modulu (S3.8), ki preverja ali je razpoznan ukaz za vzpostavitev telefonske povezave, da se v primeru odgovora YES proces nadaljuje v modulu (S3.14) in modulu (S3.15), da se v primeru odgovora NO v modulu (S3.8) proces nadaljuje v modulu (S3.9), ki preverja ali je razpoznan ukaz za ponoven vnos telefonske številke, da se v primeru odgovora YES proces nadaljuje do modula (S3.15), katerega izhod je povezan s tipko START, da se v primeru odgovora NO modula (S3.9) proces nadaljuje v modulu (S3.10), ki preverja ali je razpoznan ukaz za zavrnitev oz. prekinitev telefonske povezave, da je v primeru odgovora YES proces nadaljuje v modulu (S3.16), katerega izhod je vezan na tipko START, da se v primeru odgovora NO v modulu S3.10 proces nadaljuje v modulu (S3.11), ki preverja ali je razpoznan ukaz za sprejem telefonskega pogovora, da se v primeru odgovora YES proces nadaljuje preko modula (S3.12), ki vzpostavi telefonski pogovor, da se v primeru odgovora NO v modulu (S3.11) proces nadaljuje preko modula (S3.13), ki omogoči prikaz izbrane telefonske številke, da je izhod modula (S3.13) vezan na modul (S3.15) za Izbiro ustrezne slovnice, da ima modul (S3.14) ukaz - Izvedi klic, da ima modul (S3.16) ukaz Prekini telefonski pogovor, da moduli (S3.6 do S3.16) izvajajo funkcije nadzorne procesorske naprave (1.4).Voice-guided telephone communication system according to Claims 1 and 2, characterized in that it allows the user to activate voice dialing of the telephone number or the called person by voice triggering via a predefined keyword a virtual START key connected via modules (S3 .1, S3.2, S3.3, S3.4) per module (S3.5) and represents a custom voice processor (1.5) for the module (S3.1) to verify that speech has been detected that the module (S3 .2) the captured speech signal is further preprocessed in order to reduce echoes, so that the module (S3.3) determines the characteristics of the speech signal and the module (S3.4) determines the spoken word in order for the module (S3.5) to determine the reliability of the recognized word; that if the answer is NO in module (S3.1) to the question whether speech has been detected, then the procedure returns to the START key, so that if the answer is YES at the output of module (S3.1), the process is carried on via modules (S3.2, S3.3, S3.4 and S3.5) to a module (S3.6) that verifies that the recognized word is sufficiently reliable; that in the case of NO, the process returns to the virtual START key, so that in the case of a YES response, the process resumes in a module (S3.7) that checks if a command is provided to enable a telephone connection to continue in the case of a YES response to Module (S3.15) that in the case of NO, the process continues in Module (S3.8), which checks whether the command to establish a telephone connection is recognized, that in the case of YES, the process continues in Module (S3.14), and module (S3.15) that in the case of NO in module (S3.8), the process continues in module (S3.9), which checks whether the command to re-enter the telephone number is recognized, so that in the case of YES the process continues to the module (S3.15), the output of which is connected to the START key, so that, in the case of NO module response (S3.9), the process continues in the module (S3.10), which checks whether the rejection command or. interruption of telephone connection that in case of YES answer the process continues in module (S3.16), whose output is bound to START button, that in case of NO answer in module S3.10 the process continues in module (S3.11) which checks whether the command to receive a telephone conversation is recognized so that in the case of a YES response, the process continues via the module (S3.12), which establishes a telephone conversation, that in the case of a NO response in the module (S3.11), the process continues through the module (S3 .13) that allows the selected telephone number to be displayed that the module output (S3.13) is bound to the module (S3.15) for Selecting the appropriate grammar that the module (S3.14) has a command - Make a call that the module ( S3.16) command End the telephone conversation that the modules (S3.6 to S3.16) perform the functions of the control processor device (1.4). 5. Sistem za govorno vodeno telefonsko komunikacijo po zahtevku 1, označen s tem, da sistem vsebuje govorno procesorsko napravo (1.1), s slušalkami (1.3) in mikrofonom (1.2), ki vključuje sistem avtomatskega razpoznavanja govora.5. Voice-guided telephone communication system according to claim 1, characterized in that the system comprises a speech processing device (1.1), with a headset (1.3) and a microphone (1.2), which includes an automatic speech recognition system. 6. Sistem za govorno vodeno telefonsko komunikacijo po zahtevku 2, označen s tem, da sistem vsebuje prilagojeno govorno procesorsko napravo (1.5), z zvočnikom (1.6) in oddaljenim mikrofonom (1.7), ki vključuje sistem avtomatskega razpoznavanja govora.Voice-guided telephone communication system according to claim 2, characterized in that the system comprises a customized voice processing device (1.5), with a loudspeaker (1.6) and a remote microphone (1.7), which includes an automatic speech recognition system. 7. Sistem za govorno vodeno telefonsko komunikacijo po zahtevku 2, označen s tem, da sistem vsebuje prilagojeno govorno procesorsko napravo (1.5), z zvočnikom (1.6) in oddaljenim mikrofonom (1.7), ki vključuje sistem dodatnega predprocesiranja govora za zmanjševanje odmevov.7. Voice-guided telephone communication system according to claim 2, characterized in that the system comprises a custom voice-processing device (1.5), with a loudspeaker (1.6) and a remote microphone (1.7), which includes an additional preprocessing system for speech reduction. 8. Sistem za govorno vodeno telefonsko komunikacijo po zahtevkih 1 in 2, označen s tem, da sistem vsebuje nadzorno procesorsko napravo (1.4), ki vključuje sistem za vzpostavitev telefonske povezave preko sodobnih telefonskih tehnologij VolP ali katerekoli digitalne telefonske povezave, ki8. Voice-guided telephone communication system according to claims 1 and 2, characterized in that the system comprises a control processor device (1.4) which includes a system for establishing a telephone connection via modern VolP telephone technologies or any digital telephone connection which 10 prenaša govorne pakete.10 carries voice packets. 9. Sistem za govorno vodeno telefonsko komunikacijo po zahtevkih 1 do 8, označen s tem, da govorna procesorska naprava (1.1) in prilagojena govorna procesorska naprava (1.5) razpoznavajo ključne besede.9. Voice-guided telephone communication system according to claims 1 to 8, characterized in that the speech processing device (1.1) and the adapted voice processing device (1.5) recognize the keywords. 10. Postopek govorno vodene telefonske komunikacije po zahtevkih 1 do 9, označen s tem, da govorna procesorska naprava (1.1) in prilagojena govorna procesorska naprava (1.5) ovrednotijo pravilnost razpoznanih hipotez.10. The method of voice-guided telephone communication according to claims 1 to 9, characterized in that the voice processing device (1.1) and the adapted voice processing device (1.5) evaluate the correctness of the recognized hypotheses. 20 11. Postopek govorno vodene telefonske komunikacije po zahtevkih 1 do 10, označen s tem, da govorna procesorska naprava (1.1) in prilagojena govorna procesorska naprava (1.5) omogočajo sprotno izbiro razpoznavalnih slovnic za zagotovitev različnih točk dialoga.20. A method of voice-guided telephone communication according to claims 1 to 10, characterized in that the speech processing device (1.1) and the adapted speech processing device (1.5) allow the selection of recognition grammars to be selected on an ongoing basis to provide different points of dialogue.
SI200800141A 2008-05-30 2008-05-30 Process and device for intelligent access control SI22823A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
SI200800141A SI22823A (en) 2008-05-30 2008-05-30 Process and device for intelligent access control

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SI200800141A SI22823A (en) 2008-05-30 2008-05-30 Process and device for intelligent access control

Publications (1)

Publication Number Publication Date
SI22823A true SI22823A (en) 2009-12-31

Family

ID=41462296

Family Applications (1)

Application Number Title Priority Date Filing Date
SI200800141A SI22823A (en) 2008-05-30 2008-05-30 Process and device for intelligent access control

Country Status (1)

Country Link
SI (1) SI22823A (en)

Similar Documents

Publication Publication Date Title
EP3577646B1 (en) Handling calls on a shared speech-enabled device
US5594784A (en) Apparatus and method for transparent telephony utilizing speech-based signaling for initiating and handling calls
US6744860B1 (en) Methods and apparatus for initiating a voice-dialing operation
EP1170932B1 (en) Audible identification of caller and callee for mobile communication device
US6563911B2 (en) Speech enabled, automatic telephone dialer using names, including seamless interface with computer-based address book programs
JPS5939154A (en) Telephone set
EP1511277A1 (en) Method for answering an incoming event with a phone device, and adapted phone device
US7471776B2 (en) System and method for communication with an interactive voice response system
KR100664105B1 (en) Voice understanding method for hand-held terminal
JP6090027B2 (en) Voice command compatible information terminal with specific sound
US20070286395A1 (en) Intelligent Multimedia Dial Tone
EP1690358A2 (en) Enhanced telecommunication system
SI22823A (en) Process and device for intelligent access control
JP2015023485A5 (en)
US20160219153A1 (en) Method for Providing Personalized Voicemails
KR20050077989A (en) Method for executing mobile phone's specific function in driving mode
AU756212B2 (en) Method for establishing telephone calls
JP5143062B2 (en) Method for determining illegal call from malicious third party and automatic telephone answering device
JPH11127239A (en) Voice response telephone set
JPS61157053A (en) Telephone set
KR100788652B1 (en) Apparatus and method for dialing auto sound
KR20050102743A (en) Short message transmission method using speech recognition of mobile phone
KR20000002265A (en) Selective call receiving phone
KR20030039039A (en) Caller recognizing apparatus and method for telephone by voice recognition
KR20040001318A (en) Remote control method using voice recognition of mobile telecommunication terminal equipment

Legal Events

Date Code Title Description
OO00 Grant of patent

Effective date: 20100121

KO00 Lapse of patent

Effective date: 20180531