SI22823A - Process and device for intelligent access control - Google Patents
Process and device for intelligent access control Download PDFInfo
- Publication number
- SI22823A SI22823A SI200800141A SI200800141A SI22823A SI 22823 A SI22823 A SI 22823A SI 200800141 A SI200800141 A SI 200800141A SI 200800141 A SI200800141 A SI 200800141A SI 22823 A SI22823 A SI 22823A
- Authority
- SI
- Slovenia
- Prior art keywords
- module
- voice
- telephone
- speech
- command
- Prior art date
Links
Abstract
Description
Sistem in postopek za govorno vodeno telefonsko komunikacijoSystem and procedure for voice-guided telephone communication
Izum sodi v področje sistemov in postopkov za vzpostavljanje telefonske komunikacije s pomočjo govora, natančneje v področje pripomočkov osebam s posebnimi potrebami, ki zaradi motoričnih ovir ne morejo uporabljati normalnih telefonskih aparatov oz. na področje digitalnega procesiranja signalov in področje govornih vmesnikov v telekomunikacijah.The invention belongs to the field of systems and procedures for establishing telephone communication by means of speech, more specifically to the field of accessories for persons with special needs who cannot use normal telephone sets or because of motor impediments. to the field of digital signal processing and the field of voice interfaces in telecommunications.
Tehnični problemA technical problem
Tehnični problem, ki ga rešuje sistem za govorno vodeno telefonsko komunikacijo po izumu, je zasnova elektronskega sistema, ki omogoča uporabniku in zlasti uporabniku s posebnimi potrebami glasovno (govorno) vzpostavljanje in prekinjanje telefonske komunikacije, pri čemer je uporabniku omogočeno nemoteno govorno komuniciranje s svojo okolico, brez nehotenega sprožanja delovanja sistema. Rešitev mora omogočati uporabniku s posebnimi potrebami, da s pomočjo govora kadarkoli aktivira govorno vodeni telefon, izbira ustrezno telefonsko številko, izvede klic, deaktivira govorno vodeni telefon ter nato nadaljuje z govorno komunikacijo s svojo okolico, kot to po zaključku telefonskega pogovora počno uporabniki običajnega telefona. Problem, ki ga pričujoči izum rešuje je oblikovanje algoritma, ki bo s pomočjo digitalnega procesiranja signalov aktiviral posamezne dele sistema govorno vodene telefonske komunikacije za vzpostavljanje telefonske linije, izbiranje klicane številke, opravljanje pogovora in prekinjanje telefonske linije s klicano številko. Naloga in cilj izuma je izpopolniti postopek komuniciranja s pomočjo telefona tako, da se odpravi potreba po taktilni komunikaciji, da je celotni postopek od vzpostavitve do prekinitve telefonske povezave možno izvesti samo s pomočjo govora in s pomočjo računalniškega aktiviranja posameznih funkcij obstoječih procesorsko krmiljenih naprav kot so računalnik, mobilni telefon, igralna konzola, in podobne naprave.A technical problem solved by the voice-guided telephone communication system of the invention is the design of an electronic system that enables the user, and especially the user with special needs, to voice (voice) establish and interrupt telephone communications, allowing the user to communicate smoothly with their surroundings , without inadvertently triggering the system. The solution should allow a user with a disability to activate the voice-guided telephone at any time by means of speech, select the appropriate telephone number, make a call, deactivate the voice-guided telephone, and then continue voice communication with his or her surroundings, as usual telephone users do after the telephone conversation . A problem solved by the present invention is the design of an algorithm that will activate, by means of digital signal processing, individual parts of a voice-guided telephone communication system to establish a telephone line, dial a dialed number, make a conversation, and interrupt a telephone line with a dialed number. The object and object of the invention is to perfect the process of communication by telephone by eliminating the need for tactile communication, so that the entire process from the establishment to the disconnection of the telephone connection can be performed only by voice and by computer activation of certain functions of existing processor-controlled devices such as computer, mobile phone, game console, and similar devices.
Govorna komunikacija s pomočjo telefona je danes nepogrešljiv način komunikacije. Poleg osnovne ideje prenosa sporočila na daljavo, je pomembna tudi sama vsebina sporočila. Na splošno bi lahko rekli, da uporaba telefona izboljšuje komunikacijske zmožnosti ljudi. V tej množici ljudi so tudi ljudje s posebnimi potrebami, ki zaradi bolezenskega stanja ali drugih vzrokov ne morejo uporabljati običajnega telefona na enak način, kot drugi ljudje. Težava največkrat nastopi v tisti fazi uporabe telefona, ko mora uporabnik s pomočjo taktilne komunikacije, to je, z uporabo rok, izbrati ustrezno telefonsko številko ter izvesti telefonski klic.Voice-over-telephone communication is an indispensable method of communication today. In addition to the basic idea of transmitting a message remotely, the content of the message itself is important. Generally speaking, using a phone improves people's communication skills. This crowd also includes people with disabilities who, due to a medical condition or other causes, cannot use a regular telephone in the same way as other people. The problem most often occurs at the stage of using the phone, when the user has to use the tactile communication, that is, using his hands, to select the appropriate phone number and make a phone call.
Razpoznavalnik govora razpoznava vhodni govorni signal. V splošnem nastopi težava pri razpoznavanju govora, ko uporabnik v dani situacij ne želi posredovati sporočila razpoznavalniku ampak tretji osebi. V takšnem primeru je potrebno s pomočjo stikala, uporaba taktilne komunikacije, preprečiti neželeno proženje razpoznavalnika govora ter napačnega odziva sistema.The speech recognizer recognizes the input speech signal. Generally speaking, there is a problem with speech recognition when, in a given situation, the user does not want to forward the message to the recognizer but to a third party. In such a case, the use of a switch, the use of tactile communication, should prevent unwanted triggering of the speech recognizer and the wrong response of the system.
Predstavljeni izum je izveden tako, da omogoča uporabniku ob sami uporabi govorno vodene komunikacije telefona tudi vodenje vsakdanjega pogovora brez uporabe stikala oz. taktilne komunikacije, ki bi omogočal prekinitev zajema govornega signala.The present invention is implemented in such a way that, while using the voice-guided communication of the telephone, the user can also conduct daily conversation without using a switch or switch. tactile communication that would allow interruption of speech signal acquisition.
Stanje tehnikeThe state of the art
Znanih je kar nekaj rešitev na temo govorno upravljanih telefonov. Ena izmed prvih rešitev je predstavljena v patentu US 5.007.081. V omenjenem patentnem spisu je predstavljen govorno vodeni telefon, vgrajen v ohišje običajnega namiznega telefona s tipkovnico, prikazovalnikom in telefonsko slušalko. Telefon je predviden za priključitev na analogno javno telefonsko omrežje (PSTN, ang. »Public Switched Telephone Network«). Prikazovalnik služi za kontrolo razpoznanega govornega ukaza in omogoča uporabniku, da s pritiskom na ustrezno tipko prepreči izvajanje nepravilno razpoznanega govornega ukaza. Upravljanje omenjenega govorno vodenega telefona zahteva torej tudi taktilno komunikacijo z napravo. Postopek avtomatskega razpoznavanja govora v omenjeni rešitvi temelji na postopku dinamičnega časovnega sledenja (DTW, ang. »Dynamic Time VVarping«) in je odvisen od govorca. To pomeni, da mora vsak uporabnik naučiti sistem s svojim naborom govornih ukazov pred samo uporabo sistema.There are quite a few solutions on the subject of voice-controlled phones. One of the first solutions is disclosed in US Patent 5,007,081. The aforementioned patent file discloses a voice-guided telephone embedded in the case of a conventional desktop telephone with a keyboard, display, and handset. The phone is intended to be connected to an analogue public Switched Telephone Network (PSTN). The display is used to control a recognized voice command and allows the user to prevent an incorrectly recognized voice command by pressing the appropriate key. The operation of said voice-guided telephone also requires tactile communication with the device. The automatic speech recognition process in this solution is based on the Dynamic Time VVarping (DTW) process and depends on the speaker. This means that each user must learn the system with its set of voice commands before just using the system.
Predlagana rešitev se od zgoraj opisanega patenta US 5.007.081 razlikuje po tem, da ne zahteva nobene taktilne komunikacije, ne potrebuje nobenega predhodnega učenja govornih ukazov, ni od govorca odvisna, sistem avtomatskega razpoznavanja govora pa temelji na prikritih modelih Markova. Poglavitna razlika predlagane rešitve od patenta US 5.007.081 je v tem, da predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oz. klicane osebe. Predlagana rešitev prav tako ni omejena le na analogno telefonsko omrežje, temveč lahko deluje v poljubnem digitalnem telefonskem omrežju (ISDN, VolP).The proposed solution differs from US Patent No. 5,007,081 described above in that it requires no tactile communication, requires no prior learning of voice commands, is independent of the speaker, and the automatic speech recognition system is based on covert Markov models. The main difference between the proposed solution and the patent of US 5,007,081 is that the proposed solution enables the voice activation of the telephone dialing or telephone dialing. called persons. The proposed solution is not only limited to the analogue telephone network, but can operate on any digital telephone network (ISDN, VolP).
Naslednja znana rešitev je predstavljena v patentnem spisu US 5.452.340. Gre za izboljšavo prej omenjene rešitve, saj omogoča govorno dodajanje klicane osebe in pripadajoče številke v imenik telefona kar med postopkom vzpostavitve telefonskega klica. Tudi v tem primeru je za krmiljenje uporabljena tako govorna, vnos govornih ukazov, kot tudi taktilna komunikacija pri potrjevanju pravilnosti govornih ukazov.Another known solution is disclosed in U.S. Patent No. 5,452,340. It is an improvement of the aforementioned solution, since it allows voice dialing of the called person and the corresponding number in the phone book during the process of making a phone call. Here again, both voice, voice command input and tactile communication are used to control the validity of voice commands.
Predlagana rešitev se od zgoraj opisanega patenta US 5.452.340 razlikuje po tem, da za potrjevanje ukazov ne zahteva nobene taktilne komunikacije. Poglavitna razlika predlagane rešitve od patenta US 5.452.340 je v tem, da predlagana rešitev omogoča govorno aktiviranje pričetka izbiranja telefonske številke oziroma klicane osebe.The proposed solution differs from US Pat. No. 5,452,340 described above in that no tactile communication is required to confirm commands. The main difference between the proposed solution and the patent of US 5,452,340 is that the proposed solution enables the voice dialing of the telephone number or the called person to be activated.
V patentu US 5.483.579 je predstavljena naprava, ki omogoča govorno krmiljeno klicanje in jo je možno uporabiti v kombinaciji z več klasičnimi telefonskimi aparati na isti telefonski liniji.U.S. Pat. No. 5,483,579 introduces a device that enables voice-controlled dialing and can be used in combination with several classic telephone sets on the same telephone line.
Predlagana rešitev se od zgoraj opisanega patenta US 5.483.579 razlikuje po tem, da se ne uporablja kot nadgradnja klasičnih telefonskih aparatov, ampak deluje kot samostojna govorno vodena komunikacijska naprava. Predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe, medtem ko je v omenjenem patentu govorno izbiranje aktivirano šele z dvigom slušalke in ni popolnoma prostoročno.The proposed solution differs from the above-described US Pat. No. 5,483,579 in that it is not used as an upgrade of classic telephone sets but acts as a standalone voice-guided communication device. The proposed solution enables voice activation of the dialing start of the telephone number or the person called, while in the aforementioned patent, voice dialing is only activated by lifting the handset and is not completely hands free.
Primer govorno krmiljenega telefona brez vsakršne uporabe taktilne komunikacije je predstavljen v patentnem spisu US 6.167.251. V patentu je opisan sistem mobilnega celičnega telefona (lahko GSM ali CDMA), ki ne vsebuje nobene tipke oz. tipkovnice. Tako govorni ukazi, kot tudi telefonski pogovori se prenašajo brezžično preko bazne postaje v centralo. Strežnik v centrali omogoča razpoznavanje govornih ukazov, vzpostavljanje telefonskih povezav, govorno krmiljeno urejanje imenikov itd.An example of a voice-controlled telephone without any use of tactile communication is presented in U.S. Patent No. 6,167,251. The patent describes a mobile cellular telephone system (GSM or CDMA) that does not contain any key or key. keyboards. Both voice commands and telephone conversations are transmitted wirelessly through the base station to the exchange. The PBX server enables voice command recognition, telephone connections, voice-controlled directories editing, etc.
Predlagana rešitev se od zgoraj opisanega patenta US 6.167.251 razlikuje po tem, da ne vsebuje oddaljene centralne enote, ki bi opravljala postopek avtomatskega razpoznavanja govora, ampak je postopek razpoznavanja vgrajen neposredno v sistem govorno vodene komunikacijske naprave. Prav tako predlagana rešitev ni omejena na sisteme mobilne celične telefonije točno določenega operaterja, ki bi podpiral govorno vodene storitve. Poglavitna razlika predlagane rešitve od patenta US 6.167.251 je v tem, da predlagana rešitev omogoča govorno aktiviranje pričetka izbiranja telefonske številke oziroma klicane osebe. Rešitev v tem patentu sicer ne vsebuje klasične tipkovnice ali številčnice, vendar pa vsebuje tipko za vklop in izklop telefona, s čemer dobi center znak za začetek procesiranja govornega signala. Uporabnik izgovori telefonsko številko šele po sporočilu, ki mu ga posreduje strežnik.The proposed solution differs from US Pat. No. 6,167,251 above in that it does not contain a remote central unit that would perform the automatic speech recognition process, but that the recognition process is integrated directly into the voice-guided communication device system. Likewise, the proposed solution is not limited to mobile cellular telephony systems from a specific carrier that supports voice-guided services. The main difference between the proposed solution and the patent of US 6,167,251 is that the proposed solution enables the voice dialing of the telephone number or the called person to be activated. Although the solution in this patent does not include a classic keyboard or dial, it does have a key to turn the phone on and off, giving the center a signal to start processing the voice signal. The user speaks the telephone number only after a message is sent to the server.
Patent US 2002/0097843A1 opisuje posebno vgradno napravo, ki omogoča govorno podporo za poljubni telefonski aparat. Sistem za razpoznavanje je odvisen od govorca, ki mora sistem najprej naučiti na detekcijo krmilnih besed.US 2002 / 0097843A1 describes a special built-in device that provides voice support for any telephone set. The recognition system depends on the speaker, who must first learn the system to detect the control words.
Predlagana rešitev se od zgoraj opisanega patenta US 2002/0097843A1 razlikuje po tem, da ne potrebuje nobenega predhodnega učenja govornih ukazov in ni odvisna od govorca. Prav tako se predlagana rešitev ne uporablja kot vgradna naprava za klasične telefonske aparate, ampak deluje kot samostojna govorno vodena komunikacijska naprava. Poglavitna razlika predlagane rešitve od prej opisane je v tem, da predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe.The proposed solution differs from the above patent US 2002 / 0097843A1 in that it does not require any prior learning of spoken commands and is independent of the speaker. Also, the proposed solution is not used as a built-in device for classic telephones, but acts as a standalone voice-guided communication device. The main difference between the proposed solution and the one described above is that the proposed solution enables the voice dialing of the telephone or the called person to be dialed.
V patentnem spisu US 5.799.065 je opisana naprava z nazivom »virtualni telefonski operater« za govorno vodeno vzpostavljanje klicev brez uporabe taktilne komunikacije. Sistem za avtomatsko razpoznavanje govora s tremi prehodi temelji na prikritih modelih Markova (HMM, ang. »Hidden Markov Models«) z uporabo PLPRASTA (ang. »Perceptual Linear Predictive Coding - Relative Spectral«) značilk. Sistem je sposoben procesirati tekoči govor, vključen pa je v telefonsko centralo.U.S. Pat. No. 5,799,065 describes a device called "virtual telephone operator" for voice-guided calling without the use of tactile communication. The three-pass automatic speech recognition system is based on Hidden Markov Models (HMM) using PLPRAST (Perceptual Linear Predictive Coding - Relative Spectral) features. The system is capable of processing current speech and is integrated into the telephone exchange.
Predlagana rešitev se od zgoraj opisanega patenta US 5.799.065 razlikuje po tem, da sistem za procesiranje govornega signala oz. za avtomatsko razpoznavanje govora ni vključen v telefonsko centralo, temveč se uporablja kot samostojna govorno vodena komunikacijska naprava. Poglavitna razlika predlagane rešitve od patenta US 5.799.065 je v tem, da predlagana rešitev omogoča govorno aktiviranje začetka izbiranja telefonske številke oziroma klicane osebe.The proposed solution differs from the above-described U.S. Pat. No. 5,799,065 in that the voice signal processing system, or it is not included in the telephone exchange for automatic speech recognition, but is used as a standalone voice-guided communication device. The main difference between the proposed solution and the patent of US 5,799,065 is that the proposed solution enables the voice dialing of the telephone or the called person to be dialed.
Opis rešitve tehničnega problemaDescription of solution to a technical problem
Bistvo sistema za govorno vodeno telefonsko komunikacijo po izumu je v tem, da naprava vsebuje mikrofon, slušalke, govorno procesorsko napravo na kateri se izvaja razpoznavanje govora ali oddaljeni mikrofon, zvočnik, prilagojeno govorno procesorsko napravo na kateri se poleg razpoznavanja govora izvaja dodatno predprocesiranje govora in nadzorno procesorsko napravo na kateri teče aplikacija vzpostavitve telefonske povezave ter prenosa govora. Bistvo izuma je v algoritmu s katerim se vpišejo vsi za uporabnika značilni govorni podatki v procesorsko napravo za razpoznavo govora, ki krmili nadzorno procesorsko napravo katere algoritem omogoča vzpostavitev telefonske povezave ter prenosa govora. Algoritem omogoča izvedbo klica s pomočjo govora brez kakršnegakoli drugega posredovanja uporabnika. Aplikacije algoritmov govorne procesorske naprave, prilagojene govorne procesorske naprave in nadzorne procesorske naprave se lahko realizirajo na računalniku, digitalnem komunikacijskem terminalu, igralno konzolo, osebni organizator in podobno.The essence of a voice-guided telephone communication system according to the invention is that the device comprises a microphone, a headset, a speech processing device on which speech recognition is performed or a remote microphone, a speaker, a customized voice processing device on which, in addition to speech recognition, additional preprocessing of speech is performed, and a control processor device running the application of establishing a telephone connection and voice transmission. The essence of the invention is an algorithm whereby all user-specific voice data is entered into a speech recognition processor that controls a controlling processor device whose algorithm enables telephone connection and speech transmission. The algorithm enables voice-rendered calls without any other user intervention. Applications of voice processor algorithms, custom voice processor devices, and control processor devices may be implemented on a computer, digital communication terminal, game console, personal organizer, and the like.
Predmet izuma je tudi postopek za govorno vodeno telefonsko komunikacijo, ki je izveden tako, da omogoča izvedbo klica s pomočjo govora, brez kakršnegakoli drugega posredovanja uporabnika.The subject of the invention is also a method for voice-guided telephone communication, which is implemented in such a way as to enable the call to be made by voice without any other intervention of the user.
Sistem za govorno vodeno telefonsko komunikacijo po izumu bo podrobneje opisan s pomočjo slik, ki kažejo:The voice-guided telephone communication system of the invention will be described in greater detail by means of pictures showing:
Slika 1 - Blokovno shemo sistema govorno vodene telefonske komunikacije Slika 2 - Sistem in postopek za govorno vodene telefonske komunikacije Slika 3 - Sistem in postopek za govorno vodene telefonske komunikacije po izvedbenem primeru IFigure 1 - Block diagram of the voice-guided telephone communication system Figure 2 - System and procedure for voice-guided telephone communications Figure 3 - System and procedure for voice-guided telephone communications after embodiment I
Predmet izuma je postopek za govorno vodeno telefonsko komunikacijo, ki je izveden tako, da s prav tako v izumu opisanem sistemu za to komunikacijo omogoča izvedbo klica s pomočjo govora, brez kakršnegakoli drugega posredovanja uporabnika. Uporabnik z uporabo takšnega sistema s ' ponločjo govora kadarkoli aktivira sistem za govorno vodeno telefonsko komunikacijo, izbira ustrezno telefonsko številko, izvede klic, deaktivira sistem ter nato nadaljuje z govorno komunikacijo s svojo okolico, kot to po zaključku telefonskega pogovora počno uporabniki običajnega telefona.The subject of the invention is a method for voice-guided telephone communication, which is implemented in such a way that the system described for the communication described in the invention also enables a call to be made by voice without any other intervention of the user. Using such a system of 'voice conversations', the user activates the voice-guided telephone communication system at any time, dials the appropriate telephone number, makes a call, deactivates the system, and then resumes voice communication with his or her surroundings, as usual telephone users do after the telephone conversation has ended.
Predstavljeni izum je izveden tako, da omogoča uporabniku ob sami uporabi govorno vodene komunikacije telefona tudi vodenje vsakdanjega pogovora brez uporabe stikala, taktilne komunikacije, ki bi omogočal prekinitev zajema govornega signala.The present invention is designed to allow the user, while using the voice-guided communication of the telephone itself, to conduct daily conversation without the use of a switch, tactile communication that would allow interruption of speech signal acquisition.
V tem izumu opisan sistem omogoča s pomočjo govorne tehnologije tudi ljudem s posebnimi potrebami uporabo telefona na način, ki je enakovreden načinu uporabe, kot ga uporabljajo drugi ljudje. Uporabnik lahko z uporabo govorno vodenega telefona s pomočjo govora kadarkoli aktivira govorno vodeni telefon, izbira ustrezno telefonsko številko, izvede klic, deaktivira govorno vodeni telefon ter nato nadaljuje z govorno komunikacijo s svojo okolico, kot to po zaključku telefonskega pogovora počno uporabniki običajnega telefona.In the present invention, the system described also makes it possible for people with disabilities to use the telephone in a manner equivalent to that used by other people through speech technology. Using a voice-guided telephone, the user can activate the voice-guided telephone at any time, select the appropriate telephone number, make a call, deactivate the voice-guided telephone, and then resume voice communication with his or her surroundings, as users of a regular telephone do after completing a telephone conversation.
Sistem za govorno vodeno telefonsko komunikacijo je sestavljen iz govorne procesorske naprave 1.1, nadzorne procesorske naprave 1.4, slušalk 1.3 ter mikrofona 1.2. Na govorni procesorski napravi 1.1 se izvaja algoritem za razpoznavo govora, ki vhodni govorni signal pretvarja v tekstovno obliko. Rezultat razpoznave v tekstovni obliki se prenese v nadzorno procesorsko napravo 1.4. Nadzorna procesorska naprava 1.4 obdela ukaz prejet od govorne procesorske naprave 1.1 in izvede ustrezno akcijo. Akcija je lahko omogočitev telefonske povezave, izbira telefonske številke ali imena pridruženega izbrani telefonski številki, vzpostavitev telefonske povezave, sprejem dohodnega klica, zavrnitev telefonskega klica ali prekinitev telefonske povezave. Slušalke 1.3 in mikrofon 1.2 so lahko žično ali brezžično povezane z govorno procesorsko napravo 1.1 za razpoznavo govora. Ob vzpostavitvi telefonske povezave uporabnik komunicira z drugim uporabnikom preko mikrofona 1.2 ter slušalk 1.3.The voice-guided telephone communication system consists of a voice processor 1.1, a control processor 1.4, a handset 1.3 and a microphone 1.2. Speech processor 1.1 implements a speech recognition algorithm that converts the input speech signal to text format. The result of the recognition in text format is transferred to the control processor 1.4. The control processor 1.4 processes the command received from the speech processor 1.1 and performs the appropriate action. An action can be to enable a phone connection, select a phone number or name associated with the selected phone number, make a phone connection, answer an incoming call, reject a phone call, or end the phone connection. The headset 1.3 and microphone 1.2 can be wired or wirelessly connected to a speech processor 1.1 for speech recognition. When establishing a telephone connection, the user communicates with another user via microphone 1.2 and handset 1.3.
V izvedbenem primeru I sistema za govorno vodeno telefonsko komunikacijo je sistem sestavljen iz prilagojene govorne procesorske naprave 1.5, nadzorne procesorske naprave 1.4, zvočnika 1.6 ter oddaljenega mikrofona 1.7. Na prilagojeni govorni procesorski napravi 1.5 se poleg algoritma za razpoznavo govora izvaja še algoritem za dodatno predprocesiranje govornega signala, ki zmanjšuje odmeve, ki so posledica prostoročnega telefoniranja. Rezultat razpoznave v tekstovni obliki se enako kot pri prvi izvedbi prenese v nadzorno procesorsko napravo 1.4. Nadzorna procesorska naprava 1.4 obdela ukaz prejet od prilagojene govorne procesorske naprave 1.5 in izvede ustrezno akcijo za omogočitev telefonske povezave, izbira telefonske številke ali imena pridruženega izbrani telefonski številki, vzpostavitev telefonske povezave, sprejem dohodnega klica, zavrnitev telefonskega klica ali prekinitev telefonske povezave. Zvočnik 1.6 in oddaljeni mikrofon 1.7 so lahko enako kot v pri prvi izvedbi žično ali brezžično povezane s prilagojeno govorno procesorsko napravo 1.5 za predprocesiranje in razpoznavo govora. Ob vzpostavitvi telefonske povezave uporabnik komunicira z drugim uporabnikom preko oddaljenega mikrofonaIn embodiment I of the voice-guided telephone communication system, the system consists of a custom voice processor 1.5, a control processor 1.4, a speaker 1.6, and a remote microphone 1.7. On the custom voice processor 1.5, in addition to the speech recognition algorithm, an algorithm for additional preprocessing of the speech signal is implemented, which reduces the echoes that result from hands-free calling. The result of the recognition in text format is transferred to the control processor 1.4 in the same way as in the first version. The processor processor 1.4 processes the command received from the personalized voice processor 1.5 and performs the appropriate action to enable a telephone connection, select a telephone number or name associated with the selected telephone number, establish a telephone connection, answer an incoming call, reject a telephone call, or disconnect a telephone connection. Loudspeaker 1.6 and Remote Microphone 1.7 can be wired or wirelessly connected, as in the first embodiment, to a customized speech processor 1.5 for preprocessing and speech recognition. When establishing a telephone connection, the user communicates with another user via a remote microphone
1.7 ter zvočnika 1.6.1.7 and the speaker 1.6.
Govorno procesorsko napravo 1.1, prilagojeno govorno procesorsko napravo 1.5 in nadzorno procesorsko napravo 1.4 lahko predstavlja digitalni računalnik, prenosni računalnik, digitalni komunikacijski terminal (mobilna komunikacijska naprava - na primer mobilni telefon), igralna konzola, digitalni osebni organizator, pri tem pa uporaba izuma ni omejena le na omenjene naprave.Voice processor 1.1, customized voice processor 1.5 and control processor 1.4 may be represented by a digital computer, a laptop computer, a digital communication terminal (mobile communication device - for example a mobile phone), a game console, a digital personal organizer, without the use of the invention. limited to the aforementioned devices only.
Postopek izvedbe akcije omogočitve, izbire telefonske številke, vzpostavitve, sprejema in zavrnitve telefonskega klica govorno vodene telefonske komunikacije je podan na sliki 2. Uporabnik sistema po izumu sproži preko vnaprej določene ključne besede virtualno tipko START, ki je vezana preko modulov S2.1, S2.2, S2.3 na modul S2.4 in predstavlja govorno procesorsko napravo 1.1. Modul S2.1 preverja ali je bil detektiran govor. Modul S2.2 določi značilke govornega signala in modul S2.3 določi izgovorjeno besedo. Modul S2.4 določi zanesljivost razpoznane besede. Če je v modulu S2.1 na vprašanje, ali je bil detektiran govor, odgovor NO se postopek ponovno vrne na tipko START. Ko je odgovor na izhodu modula S2.1 YES poteka postopek naprej preko modulov S2.2, S2.3 in S2.4 do modula S2.5, ki preverja ali je razpoznana beseda dovolj zanesljiva. Če je odgovor NO, se postopek vrača na virtualno tipko START. Ko je odgovor YES se proces nadaljuje v modulu S2.6, ki preverja ali je razpoznan ukaz za omogočitev telefonske povezave. Če je odgovorThe process of performing an enabling, selecting a telephone number, establishing, receiving and rejecting a telephone call by voice-guided telephone communication is given in Figure 2. The system user of the invention, via a predefined keyword, triggers a virtual START key, which is bound via modules S2.1, S2 .2, S2.3 to module S2.4, and represents the voice processor device 1.1. Module S2.1 checks whether speech has been detected. Module S2.2 defines the characteristics of the speech signal and module S2.3 defines the spoken word. Module S2.4 determines the reliability of a recognized word. If there is a question in module S2.1 whether a speech has been detected, the NO will return to the START key again. When the answer is at the output of module S2.1 YES, the process proceeds through modules S2.2, S2.3 and S2.4 to module S2.5, which verifies that the recognized word is sufficiently reliable. If NO, the process returns to the virtual START key. When the answer is YES, the process continues in module S2.6, which checks whether the command to enable telephone connection is recognized. If yes
YES se proces nadaljuje na modulu S2.14. V kolikor je odgovor NO se proces nadaljuje v modulu S2.7, ki preverja ali je razpoznan ukaz za vzpostavitev telefonske povezave. Če je odgovor YES se proces nadaljuje v modulu S2.13 in modulu S2.14. Če je odgovor v modulu S2.7 NO se proces nadaljuje v modulu S2.8, ki preverja ali je razpoznan ukaz za ponoven vnos telefonske številke. Če je odgovor YES se proces nadaljuje do modula S2.14, katerega izhod je povezan s tipko START. Če je na modulu S2.8 odgovor No se proces nadaljuje v modulu S2.9, ki preverja ali je razpoznan ukaz za zavrnitev oz. prekinitev telefonske povezave. Ko je odgovor YES se proces nadaljuje v modulu S2.15, katerega izhod je vezan na tipko START. Ko je v modulu S2.9 odgovor NO se proces nadaljuje v modulu S2.10, ki preverja ali je razpoznan ukaz za sprejem telefonskega pogovora. Če je odgovor YES se proces nadaljuje preko modula S2.11 ki vzpostavi telefonski pogovor. Če je v modulu S2.10 odgovor NO se proces nadaljuje preko modula S2.12, ki omogoči prikaz izbrane telefonske številke. Izhod modula S2.12 je vezan na modul S2.14 za Izbiro ustrezne slovnice. Modul S2.13 ima ukaz - Izvedi klic. Modul S2.15 ima ukaz - Prekini telefonski pogovor. Moduli S2.5 do S2.15 izvajajo funkcije nadzorne procesorske naprave 1.4.The YES process continues on module S2.14. If NO, the process continues in module S2.7, which checks whether the command to establish a telephone connection is recognized. If YES, the process continues in module S2.13 and module S2.14. If the answer is in module S2.7 NO, the process is continued in module S2.8, which checks whether the command to re-enter the telephone number is recognized. If YES, the process proceeds to module S2.14, the output of which is connected to the START key. If the answer to No is S2.8, then the process continues in module S2.9, which checks whether the rejection or rejection command is recognized. interruption of telephone connection. When the answer is YES, the process resumes in module S2.15, whose output is bound to the START key. When NO is answered in Module S2.9, the process continues in Module S2.10, which checks if a command to receive a telephone conversation is recognized. If YES, the process is continued via the S2.11 module which establishes a telephone conversation. If NO is answered in module S2.10, the process is continued via module S2.12, which allows the selected telephone number to be displayed. The output of Module S2.12 is linked to Module S2.14 to select the appropriate grammar. Module S2.13 has the command - Make a call. Module S2.15 has the command - End telephone conversation. Modules S2.5 to S2.15 perform the functions of a control processor 1.4.
Postopek izvedbe akcije omogočitve, izbire telefonske številke, vzpostavitve, sprejema in zavrnitve telefonskega klica govorno vodene telefonske komunikacije po izvedbenem primeru I je podan na sliki 3. Uporabnik sistema po izumu sproži preko vnaprej določene ključne besede virtualno tipko START, ki je vezana preko modulovThe process of executing an enabling, selecting a telephone number, establishing, receiving, and rejecting a telephone call of a voice-guided telephone communication according to embodiment I is shown in Figure 3. The system user according to the invention triggers via a predetermined keyword a virtual START key which is bound via modules
S3.1, S3.2, S3.3, S3.4 na modul S3.5 in predstavlja prilagojeno govorno procesorsko napravo 1.5. Modul S3.1 preverja ali je bil detektiran govor. Modul S3.2 zajeti govorni signal dodatno predprocesira z namenom zmanjšanja odmevov. Modul S3.3 določi značilke govornega signala in modul S3.4 določi izgovorjeno besedo. Modul S3.5 določi zanesljivost razpoznane besede. Če je v modulu S3.1 na vprašanje, ali je bil detektiran govor, odgovor NO se postopek ponovno vrne na tipko START. Ko je odgovor na izhodu modula S3.1 YES poteka postopek naprej preko modulov S3.2, S3.3, S3.4 in S3.5 do modula S3.6, ki preverja ali je razpoznana beseda dovolj zanesljiva. Če je odgovor NO, se postopek vrača na virtualno tipko START. Ko je odgovor YES se proces nadaljuje v modulu S3.7, ki preverja ali je razpoznan ukaz za omogočitev telefonske povezave. Če je odgovor YES se proces nadaljuje na modulu S3.15. V kolikor je odgovor NO se proces nadaljuje v modulu S3.8, ki preverja ali je razpoznan ukaz za vzpostavitev telefonske povezave. Če je odgovor YES se proces nadaljuje v modulu S3.14 in modulu S3.15. Če je odgovor v modulu S3.8 NO se proces nadaljuje v modulu S3.9, ki preverja ali je razpoznan ukaz za ponoven vnos telefonske številke. Če je odgovor YES se proces nadaljuje do modulaS3.1, S3.2, S3.3, S3.4 to module S3.5 and is a customized voice processor 1.5. Module S3.1 checks whether speech has been detected. Module S3.2 additionally preprocesses the captured voice signal in order to reduce echoes. Module S3.3 defines the characteristics of the speech signal and module S3.4 defines the spoken word. Module S3.5 determines the reliability of the recognized word. If there is a question in module S3.1 whether a speech has been detected, the NO will return to the START key again. When the answer is at the output of module S3.1 YES, the process proceeds through modules S3.2, S3.3, S3.4 and S3.5 to module S3.6, which verifies that the recognized word is sufficiently reliable. If NO, the process returns to the virtual START key. When YES is answered, the process continues in module S3.7, which checks whether the command to enable telephone connection is recognized. If YES, the process continues on module S3.15. If NO, the process continues in module S3.8, which checks whether the command to establish a telephone connection is recognized. If YES, the process continues in module S3.14 and module S3.15. If the answer is in Module S3.8 NO, the process is continued in Module S3.9, which checks whether the command to re-enter the telephone number is recognized. If YES, the process proceeds to the module
53.15, katerega izhod je povezan s tipko START. Če je na modulu S3.9 odgovor No se proces nadaljuje v modulu S3.10, ki preverja ali je razpoznan ukaz za zavrnitev oz. prekinitev telefonske povezave. Ko je odgovor YES se proces nadaljuje v modulu53.15, the output of which is connected to the START key. If Module S3.9 answers No, the process continues in Module S3.10, which verifies that the reject or reject command is recognized. interruption of telephone connection. When the answer is YES, the process continues in the module
53.16, katerega izhod je vezan na tipko START. Ko je v modulu S3.10 odgovor NO se proces nadaljuje v modulu S3.11, ki preverja ali je razpoznan ukaz za sprejem telefonskega pogovora. Če je odgovor YES se proces nadaljuje preko modula S3.12 ki vzpostavi telefonski pogovor. Če je v modulu S3.11 odgovor NO se proces nadaljuje preko modula S3.13, ki omogoči prikaz izbrane telefonske številke. Izhod modula S3.13 je vezan na modul S3.15 za Izbiro ustrezne slovnice. Modul S3.14 ima ukaz - Izvedi klic. Modul S3.16 ima ukaz - Prekini telefonski pogovor. Moduli S3.6 do S3.16 izvajajo funkcije nadzorne procesorske naprave 1.4.53.16, the output of which is bound to the START key. When NO is answered in Module S3.10, the process continues in Module S3.11, which checks whether the command to receive a telephone conversation is recognized. If YES, the process is continued through module S3.12, which establishes a telephone conversation. If NO is answered in module S3.11, the process is continued via module S3.13, which allows the selected telephone number to be displayed. The output of Module S3.13 is linked to Module S3.15 to select the appropriate grammar. Module S3.14 has the command - Make a call. Module S3.16 has the command - End telephone conversation. Modules S3.6 to S3.16 perform the functions of a control processor 1.4.
Uporabnik ima aktiviran mikrofon 1.2 in slušalke 1.3 ali oddaljeni mikrofon 1.7 in zvočnik 1.6 ter vodi normalno komunikacijo s svojo okolico. V trenutku, ko želi izvesti telefonski pogovor s pomočjo v tem opisu predstavljenega izuma izgovori ključno besedo ali ustrezno zaporedje ključnih besed za omogočitev telefonske povezave. Po pravilno razpoznani ključni besedi uporabnik izgovori želeno telefonsko številko ali ime osebe, ki jo želi poklicati in kateri je v sistemu pridružena ustrezna telefonska številka. Če je razpoznana napačna telefonska številka jo uporabnik lahko ponovi, drugače pa sproži klic z izgovarjavo ključne besede za vzpostavitev povezave. Med samo aktivno telefonsko povezavo jo lahko uporabnik kadarkoli zaključi z izgovarjavo ključne besede oziroma izbranega zaporedja ključnih besed za prekinitev telefonske povezave. Z izgovarjavo ključnih besed za sprejem telefonskega pogovora ali za zavrnitev telefonske povezave ima uporabnik popoln nadzor nad telefonsko komunikacijo. Uporabnik lahko sam določi ključne besede za vzpostavitev in prekinitev telefonske povezave.The user has an activated microphone 1.2 and headset 1.3 or remote microphone 1.7 and speaker 1.6 and guides normal communication with their surroundings. As soon as he wishes to make a telephone conversation, he utters a keyword or an appropriate sequence of keywords to enable a telephone connection using the invention described herein. After a correctly recognized keyword, the user speaks the desired phone number or the name of the person he wants to call and to which the corresponding phone number is associated in the system. If the wrong phone number is identified, the user can repeat it, otherwise it will trigger a call by saying the keyword to connect. During an active telephone connection only, the user can terminate it at any time by uttering a keyword or a selected sequence of keywords to end the telephone connection. By saying keywords to receive a telephone conversation or to reject a telephone connection, the user has complete control over the telephone communication. It is up to the user to determine the keywords to establish and disconnect the telephone connection.
V tej patentni prijavi opisana rešitev se razlikuje od obstoječih rešitev v naslednjih točkah:The solution described in this patent application differs from the existing solutions in the following points:
Omogočitev govornega aktiviranja začetka izbiranja telefonske številke oziroma klicane osebe.Enable voice activation to start dialing a phone number or a person called.
Uporaba sodobnih pristopov v postopku avtomatskega procesiranja in razpoznavanja govora omogoča procesiranje tekočega govora in razpoznavo govornih ukazov tudi med samim telefonskim pogovorom, brez poseganja v zagon in ustavitev razpoznavanja. Takšen pristop pa rezultira v vodenju vsakdanjega pogovora brez uporabe stikala (taktilne komunikacije), ki bi omogočala prekinitev zajema govornega signala za pravilno delovanje sistema razpoznavanja govora.The use of modern approaches in the process of automatic processing and speech recognition enables the processing of current speech and the recognition of voice commands even during the telephone conversation itself, without prejudice to starting and stopping the recognition. Such an approach, however, results in the conduct of everyday conversation without the use of a switch (tactile communication) that would allow the interruption of speech signal acquisition for the proper functioning of the speech recognition system.
Delovanje v sklopu sodobnih telefonskih tehnologij Vol P (ang. »Voice overOperating in the modern voice technologies of Vol P (Voice over
Internet Protocol«) ali katerekoli digitalne telefonske povezave, ki prenaša govorne pakete.Internet Protocol ") or any digital telephone connection that transmits voice packets.
Za delovanje ni potrebna nobena specifična strojna oprema.No specific hardware is required to operate.
Razpoznavanje je neodvisno od govorca in ni potrebno predhodno učenje na posameznega govorca.Recognition is independent of the speaker and no prior learning is required on the individual speaker.
- Razpoznavalni sistem s klicno logiko je integriran lokalno, kar predstavlja prihranek pri obremenitvi telefonske linije (potrebna samo pri dejanski vzpostavitvi klica).- Call Logic Recognition is integrated locally, saving on phone line load (only required when actually making a call).
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SI200800141A SI22823A (en) | 2008-05-30 | 2008-05-30 | Process and device for intelligent access control |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SI200800141A SI22823A (en) | 2008-05-30 | 2008-05-30 | Process and device for intelligent access control |
Publications (1)
Publication Number | Publication Date |
---|---|
SI22823A true SI22823A (en) | 2009-12-31 |
Family
ID=41462296
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SI200800141A SI22823A (en) | 2008-05-30 | 2008-05-30 | Process and device for intelligent access control |
Country Status (1)
Country | Link |
---|---|
SI (1) | SI22823A (en) |
-
2008
- 2008-05-30 SI SI200800141A patent/SI22823A/en not_active IP Right Cessation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3577646B1 (en) | Handling calls on a shared speech-enabled device | |
US5594784A (en) | Apparatus and method for transparent telephony utilizing speech-based signaling for initiating and handling calls | |
US6744860B1 (en) | Methods and apparatus for initiating a voice-dialing operation | |
EP1170932B1 (en) | Audible identification of caller and callee for mobile communication device | |
US6563911B2 (en) | Speech enabled, automatic telephone dialer using names, including seamless interface with computer-based address book programs | |
JPS5939154A (en) | Telephone set | |
EP1511277A1 (en) | Method for answering an incoming event with a phone device, and adapted phone device | |
US7471776B2 (en) | System and method for communication with an interactive voice response system | |
KR100664105B1 (en) | Voice understanding method for hand-held terminal | |
JP6090027B2 (en) | Voice command compatible information terminal with specific sound | |
US20070286395A1 (en) | Intelligent Multimedia Dial Tone | |
EP1690358A2 (en) | Enhanced telecommunication system | |
SI22823A (en) | Process and device for intelligent access control | |
JP2015023485A5 (en) | ||
US20160219153A1 (en) | Method for Providing Personalized Voicemails | |
KR20050077989A (en) | Method for executing mobile phone's specific function in driving mode | |
AU756212B2 (en) | Method for establishing telephone calls | |
JP5143062B2 (en) | Method for determining illegal call from malicious third party and automatic telephone answering device | |
JPH11127239A (en) | Voice response telephone set | |
JPS61157053A (en) | Telephone set | |
KR100788652B1 (en) | Apparatus and method for dialing auto sound | |
KR20050102743A (en) | Short message transmission method using speech recognition of mobile phone | |
KR20000002265A (en) | Selective call receiving phone | |
KR20030039039A (en) | Caller recognizing apparatus and method for telephone by voice recognition | |
KR20040001318A (en) | Remote control method using voice recognition of mobile telecommunication terminal equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
OO00 | Grant of patent |
Effective date: 20100121 |
|
KO00 | Lapse of patent |
Effective date: 20180531 |