KR100819928B1 - Apparatus for speech recognition of wireless terminal and method of thereof - Google Patents

Apparatus for speech recognition of wireless terminal and method of thereof Download PDF

Info

Publication number
KR100819928B1
KR100819928B1 KR1020070040652A KR20070040652A KR100819928B1 KR 100819928 B1 KR100819928 B1 KR 100819928B1 KR 1020070040652 A KR1020070040652 A KR 1020070040652A KR 20070040652 A KR20070040652 A KR 20070040652A KR 100819928 B1 KR100819928 B1 KR 100819928B1
Authority
KR
South Korea
Prior art keywords
voice
unit
command
words
recognition
Prior art date
Application number
KR1020070040652A
Other languages
Korean (ko)
Inventor
김세윤
이윤수
Original Assignee
(주)부성큐
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by (주)부성큐 filed Critical (주)부성큐
Priority to KR1020070040652A priority Critical patent/KR100819928B1/en
Application granted granted Critical
Publication of KR100819928B1 publication Critical patent/KR100819928B1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers; Analogous equipment at exchanges
    • H04M1/26Devices for signalling identity of wanted subscriber
    • H04M1/27Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/271Devices whereby a plurality of signals may be stored simultaneously controlled by voice recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers; Analogous equipment at exchanges
    • H04M1/72Substation extension arrangements; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selecting
    • H04M1/725Cordless telephones
    • H04M1/72519Portable communication terminals with improved user interface to control a main telephone operation mode or to indicate the communication status
    • H04M1/72522With means for supporting locally a plurality of applications to increase the functionality
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means

Abstract

A voice recognition device of a portable terminal and a method thereof are provided to increase a voice recognition rate of the terminal connected to a wireless network, thus a user can receive stock information, weather, news, a lot of daily information, and contents services by inputting voices without inputting buttons as well as confirm received messages(mail). A voice recognition device(200) comprises as follows. A word combiner(210) extracts a voice section by detecting starting and ending points of an inputted voice, and combines phonemes and syllables detected from the voice section to form the combined phonemes and syllables into words. A word recognizer(220) recognizes the combined words to configure the words as sentences. A voice recognizer(230) recognizes the configured sentences as voice commands. A code converter(240) converts the recognized voice commands into control codes by applying a Korean alphabet standard code table stored in a memory unit(190).

Description

Speech recognition device of mobile terminal and its method {APPARATUS FOR SPEECH RECOGNITION OF WIRELESS TERMINAL AND METHOD OF THEREOF}

1 is a diagram illustrating a voice recognition apparatus of a portable terminal according to an embodiment of the present invention.

FIG. 2 is a diagram illustrating a detailed configuration of a voice recognition unit and a code conversion unit shown in FIG. 1.

3 is a flowchart illustrating a voice recognition process of a mobile terminal according to an embodiment of the present invention.

4 is a flowchart illustrating a process of recognizing a voice command in a process of recognizing a voice of a mobile terminal according to an embodiment of the present invention.

<Explanation of symbols for the main parts of the drawings>

110: key input unit 120: audio processing unit

130: control unit 140: variation demodulation unit

150: transceiver 160: video input unit

170: image processor 200: voice recognition device

210: word combination unit 220: word recognition unit

230: speech recognition unit 240: code conversion unit

The present invention relates to a voice recognition device of a mobile terminal, and more particularly, to increase the voice recognition rate of a mobile terminal connected to a wireless network, to execute various operations through input of a voice command, and to convert the recognized voice command into a control code. Then, the present invention relates to a voice recognition device and method for transmitting a service center through a wireless network so that various information services can be provided as input of a voice command without input of a key button.

The mobile terminal, which is rapidly spreading, provides not only a unique voice call service but also a data transmission service, additional services such as mail, securities, news, weather, and living information, and a multimedia communication service that provides a video call service while looking at the other party's face. It is positioning itself as a device.

The portable terminal has a large-capacity memory capable of storing MP3 files, photo files, video files, and various data files received, and has a voice recognition function for convenience of use.

The speech recognition function recognizes or understands the user's voice by analyzing the user's voice.It converts the human voice having a specific frequency into an electric signal by changing the shape of the mouth and the position of the tongue depending on the pronunciation, and then converts the voice's frequency characteristics. It is a technique to recognize the pronunciation by extracting.

Such a voice recognition function is applied to various fields such as dialing of a phone, toy control, language learning, control of home appliances, and the like, and a portable terminal provides only dialing through voice recognition of a user.

Voice dialing is a function of automatically dialing a phone number set in a recognized voice after recognizing a voice when a predetermined word is input by voice. It is used when a user is inconvenient to use his / her hand due to other tasks such as driving. .

This voice dialing provides automatic dialing by simply setting a few phone numbers to a specific word and saving them, and then speaking the word with a voice. Therefore, voice recognition dialing is possible for only a few stored phone numbers. There is a problem in that voice recognition is not provided for other unregistered phone numbers.

In addition, there is a limit in the number of phone numbers that can register voice dialing due to the memory capacity, so that the effectiveness of voice dialing is not large.

In addition, the voice recognition technology is very poor in the ambient noise and the current technology is not yet able to guarantee the 100% recognition success rate, so frequent errors occur in the work performed by the voice recognition.

In order to reduce the error rate of such a task, the user may be asked to confirm the result of the speech recognition, or a list of a plurality of alternatives may be presented to the user according to the result of the speech recognition. The method of determining the recognition word is used.

The voice recognition according to the user's confirmation or the voice recognition according to the user's selection of the proposed alternative does not provide complete voice recognition of the portable terminal itself, and there is a problem that the user's selection must be made at all times.

In addition, with the development of communication services, Internet access is provided to mobile terminals so that users can search web sites, search contents, e-mail, stock trading, games, etc. The technology has a low recognition rate so that it is difficult to provide the above various services through the Internet using voice in a mobile environment.

The present invention has been invented to solve the above problems, the object of which is to increase the voice recognition rate of the mobile terminal connected to the wireless network to check the received message (mail) and the stock information, weather, news, various Life information, content services to be provided by the input of voice without button input.

In addition, another object of the present invention is to perform the overall operation of the mobile terminal by the recognition of the voice command, to check the received message (mail) and to edit the message (mail) to be transmitted and to send the edited message (mail) To be executed as the input of a voice command.

In addition, another object of the present invention is to recognize the user's voice command is converted to a control code and then transmitted to the Internet network to request the required information service, the state that does not involve the input of the key button according to the various information services accordingly Is to be provided by.

In the portable terminal, a voice recognition apparatus for a portable terminal according to a feature of the present invention for achieving the above object,

A key input unit including a plurality of keys and function keys for inputting numbers and characters;

An audio processor converting the analog voice signal input into the microphone into a digital voice signal and converting the digital voice signal provided from the controller into an analog voice signal and outputting the analog voice signal to a speaker;

A modulation / demodulator for encoding and decoding voice signals and data packets transmitted and received through a wireless network;

A transceiver for connecting to a wireless network through an antenna, up-converting and harmonic-amplifying the encoded voice signals and data packets to be transmitted to the wireless network, and performing low-noise amplification and frequency down-conversion of the signal received from the wireless network;

An image input unit which inputs a surrounding image and converts the image into a digital signal through a built-in DSP;

An image processor including one or more image codecs among a JPEG codec, an MPEG codec, and a wavelet codec and processing an image signal applied from an image input unit in units of frames, and outputting the image signal according to characteristics of a display unit and a display standard;

It includes a display unit for displaying the image of the frame unit applied by the image processing unit and the message (mail), content, news, weather, life information data that is applied by the control unit in a text or text,

A voice recognition device that detects a start point and an end point of a voice from a user's voice input into a microphone, extracts a voice section, forms a word by combining phonemes and syllables of the voice section, and recognizes a sentence composed of a combination of words as a voice command;

A standard code table for converting an operation program and a voice recognition command of the portable terminal into a control code, a memory unit for storing a data packet generated during an operation of the portable terminal;

Controls the overall operation of the mobile terminal according to the set operation program, and accesses the corresponding information according to the recognition result of the voice command in the voice recognition mode to provide voice transmission and display, or to request and receive a service corresponding to the wireless network. The apparatus further includes a display unit and a controller for transmitting information to the voice.

In addition, the voice recognition method of the mobile terminal according to an aspect of the present invention comprises the steps of: (a) activating the voice recognition mode after initializing the system when the voice input is detected in the standby state of the mobile terminal;

(b) combining words of a microphone input into words to generate words, and analyzing word attributes and dependencies between words to determine the meaning of words;

(c) generating a sentence by combining the words whose meaning is identified in the step (b);

(d) recognizing a sentence generated by the combination of words in the step (c) as a voice command and analyzing the actual meaning of the voice command by applying a set Korean standard code table;

(e) generating a file of the voice command whose meaning is analyzed in step (d) and converting the file into a control code;

(f) accessing information corresponding to the voice command recognized by the operation of the operation program according to the control code converted in the step (e), transmitting the information through the speaker, and simultaneously displaying the information through the display unit.

DETAILED DESCRIPTION Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art may easily implement the present invention.

As those skilled in the art would realize, the described embodiments may be modified in various different ways, all without departing from the spirit or scope of the present invention.

In the drawings, parts irrelevant to the description are omitted for simplicity of explanation, and like reference numerals designate like parts throughout the specification.

In addition, when a part is said to "include" a certain component, this means that it may further include other components, except to exclude other components unless otherwise stated.

Now, a voice recognition apparatus and method for a portable terminal according to an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

1 is a diagram illustrating a voice recognition apparatus of a portable terminal according to an embodiment of the present invention.

As shown, the present invention, the key input unit 110, the audio processor 120, the controller 130, the modulation and demodulation unit 140, the transceiver 150, the image input unit 160, the image processor 170, the display unit 180, a memory unit 190, and a voice recognition device 200.

The key input unit 110 includes a plurality of keys for inputting numbers and letters and a function key for setting a specific function in use of the mobile terminal, and the function key includes a function key for entering the mobile terminal into a voice recognition mode. It may be further included.

The audio processor 120 includes a data codec for processing packet data and an audio codec for processing an audio signal such as voice. The audio processor 120 converts an analog voice signal input by a microphone into a digital signal through an audio codec. The controller 130 can recognize the signal, and converts the digital voice signal provided from the controller 130 into an analog voice signal and transmits the same through the speaker Spk.

In addition, when a data packet such as a message (mail) received through a wireless network is a data packet for providing information to a user, the speaker (Spk) by converting the data packet provided from the controller 130 into an analog signal through a data codec Provided with voice guidance through.

The controller 110 controls the overall operation of the portable terminal according to the set operation program, and enters the voice recognition mode by recognizing the voice command of the user or by using a function key included in the key input unit 110 to recognize the voice command. According to the received message (mail) through the display unit 180, and transmits the voice through the speaker (Spk) as necessary, requests for the services required for the wireless network, and accordingly received stock information, news, weather , And receives the living information and the like through the display unit 180 and at the same time transmits the voice through the speaker (Spk).

The modulation and demodulation unit 140 encodes a voice signal and a data packet transmitted to the wireless network to the transceiver unit 150, decodes the voice signal and data packet received through the transceiver unit 150 to the controller 130. to provide.

The transceiver 150 is connected to the wireless network through the antenna ANT, and up-converts and harmonic-amplifies the frequencies of the voice signal and data packet encoded by the modulator 140, and transmits the same through the antenna ANT. The low-noise amplification and the frequency down-conversion of the signal received from the network via the antenna ANT is provided to the demodulation unit 140.

The image input unit 160 is, for example, a CCD image pickup device or a camera. The image input unit 160 inputs an image of a subject, such as a peripheral object or a person, according to a control signal applied from the controller 130, and has a DSP (Digital) in which an analog image signal is input. Signal Processor) converts to a digital signal.

The image processor 170 processes the image signal applied from the image input unit 160 in units of frames according to the control signal of the controller 130, and converts the image signal of the frame unit into the characteristics and display standards of the display unit 180. Output to fit.

The image processing unit 170 may include any one or more image codecs of a JPEG codec, an MPEG codec, or a wavelet codec, and compress or compress the image data of frame units displayed on the display unit 180 in a set manner. Execute the function to restore the data.

The display unit 180 displays an image in a frame unit applied by the image processing unit 170, and displays data such as a message (mail), content information, news, weather, living information, etc., which is applied by the controller 130. Display in graph format.

The display unit 180 may be implemented as a touch screen to operate as an input unit instead of the key input unit 110.

The memory unit 190 stores a program for operating the mobile terminal, data for voice command recognition, a Korean standard code table for converting the recognized voice command into a control code, and a data packet generated during the operation of the mobile terminal.

The voice recognition device 200 detects a voice point from a voice of a user input through a microphone, extracts a voice section, forms a word by combining phonemes and syllables detected from the voice section, and recognizes the word. A sentence composed of a combination of recognized words is recognized as a voice command, and the recognized voice command is converted into a control code by applying a Korean standard code table stored in the memory 190.

The speech recognition apparatus 200 may include a word combiner 210 configured to form a word by combining phonemes and syllables detected in the extracted voice interval, a word recognizer 220 configured to recognize a combined word and form a sentence. The speech recognition unit 230 recognizes a sentence composed of a combination of words and recognizes it as a voice command, and a code conversion unit converting the recognized voice command into a control code by applying a Korean standard code table stored in the memory 190. 240.

The speech recognition unit 230 and the code conversion unit 240 will be described in more detail with reference to FIG. 2 as follows.

As shown, the speech recognizer 230 includes a parser 231 and a parser 232, and the code converter 240 includes a parser 241, a parser 242, and a syllable converter 243. And file generator 244.

The parser 231 included in the speech recognizer 230 analyzes the input voice to identify attributes and then analyzes the dependency relationship between words to generate sentences of the voice command.

The parser 232 included in the speech recognizer 230 analyzes the actual meaning of the command by applying the Hangul standard code table stored in the memory 190 to the sentence of the generated voice command.

The parser 241 included in the code converter 240 analyzes the actual meaning of the command in the speech recognizer 230 to identify attributes such as noun phrases of the applied voice command and analyzes the dependency relations between words.

The syntax interpreter 242 included in the code converter 240 may apply a Korean standard code table stored in the memory unit 190 to the voice command analyzed by the parser 241 to determine the actual meaning of the command.

The syllable converter 243 included in the code converter 240 syllable converts the voice command whose meaning is known.

The file generator 244 included in the code converter 240 generates and outputs a syllable-converted voice command word as a file.

The voice command recognition and the operation thereof of the voice recognition apparatus of the mobile terminal according to the embodiment of the present invention including the above functions will be described.

Since operations on a voice call or video input, transmission and reception of a message (mail), and reception of various contents and information according to key input in the portable terminal are the same as or similar to those of a typical portable terminal, detailed description thereof will be omitted.

Since the present invention recognizes a voice command and performs an operation according to it, this will be described with reference to FIGS. 3 and 4.

In the standby state in which the portable terminal maintains power on (S101), the controller 130 determines whether a voice command of a user input through the microphone Mic is detected (S102).

When the user inputs a specific voice command to the microphone (Mic), the audio processor 120 converts the analog voice signal of the user to a digital signal through the audio codec to provide to the controller 130, the controller 130 It is possible to determine whether or not a voice command is input in a state of waiting for operation.

If it is determined in S102 that an input of a specific voice command is detected, it is determined as an entry request of the voice recognition mode to initialize the system (S103), and the voice conversion mode is activated (S104).

In the above, the voice recognition mode is entered into the input of a specific voice command in the standby state, but the present invention is not limited thereto, and the function of entering the voice recognition mode through the input of a specific key provided in the key input unit 110 is also described. It is included in the scope of the invention.

When the voice conversion mode is activated in S104, the word combination unit 210 included in the speech recognition apparatus 200 detects a start point and an end point of the voice from a user's voice signal applied through the controller 130 to extract a voice section. Then, the phoneme and syllables detected in the speech section are combined to form a word, and the word recognition unit 220 recognizes the combined word and configures the sentence (S105) (S106).

Thereafter, the speech recognizer 230 analyzes and interprets the dependency of each word in a sentence composed of a combination of words (S107) to recognize a voice command (S108).

A recognition procedure of the voice command will be described with reference to FIG. 4.

As a result of dependency analysis and analysis of each word constituting the sentence, it is determined whether the word is composed of a word defined in advance so as to be recognized as a voice command (S201) (S202).

As a result of the determination, if it is composed of the words defined in the dictionary, each syllable constituting the word is examined (S203), and the Korean standard code table stored in the memory unit 190 is searched (S204) and a matching code exists. It is determined whether or not (S205).

If there is a matching code as a result of the determination in S205, the corresponding matching code is applied and recognized as a voice command (S206) (S207).

When the voice command is recognized through the above procedure, the voice command is generated as a voice command file (S109), and the voice command recognized by applying the Korean standard code table stored in the memory unit 190 through the code converter 240 is applied. Is converted into a control code and applied to the control unit 130 (S110).

Therefore, the controller 130 executes the operation command according to the voice recognition command applied as the control code from the voice recognition apparatus 200 to execute the recognized command (S111), and the execution result is transmitted through the audio processor 120. It converts into an analog voice signal and transmits it through the speaker Spk and simultaneously displays it on the display unit 180 (S112).

For example, if the recognized voice command is output of the received message (mail), the controller 130 accesses the message (mail) which is requested to be output from the data packet stored in the memory 190 and displays the display unit 180. )

In addition, if necessary, a message received through an audio codec included in the audio processor 120 is converted into an analog voice signal and then transmitted through a speaker Spk.

If the voice command recognized above is a service request of stock information, news, weather, living information, and various contents from the wireless internet network, the wireless internet network is connected through the transceiver 150 according to the control code of the command.

Thereafter, a request for a voice-recognized service is transmitted to a corresponding web server, and a data packet of a service provided accordingly is received and displayed to the user through the display unit 180.

Then, if necessary, the audio codec of the audio processor 120 converts the voice and then outputs it through the speaker Spk.

In addition, the message (mail) is edited as an input of a voice command and transmitted to the counterpart, and the message received from the counterpart is displayed on the display unit 180 or converted into voice and transmitted through the speaker Spk.

The present invention can be applied to a mode in which an environment such as a mobile communication service company, an Internet service company, a content provider, and the like can be combined with voice recognition synthesis technology.

The embodiments of the present invention described above are not implemented only through the apparatus and the method, but may be implemented through a program for realizing a function corresponding to the configuration of the embodiment of the present invention or a recording medium on which the program is recorded. Implementation may be easily implemented by those skilled in the art from the description of the above-described embodiments.

Although the embodiments of the present invention have been described in detail above, the scope of the present invention is not limited thereto, and various modifications and improvements of those skilled in the art using the basic concepts of the present invention defined in the following claims are also provided. It belongs to the scope of rights.

According to the above-described configuration, the present invention provides a message (mail), various life information, and personal schedule data through the recognition of voice commands and conversion of control codes in a wireless network environment of Wibro, WCDMA, and HSPA (HSDPA + HSUPA). By applying the speaker technology to convert the video and voice to provide services, video telephony, video multimedia services, bulletin boards, newspaper articles, product advertising, You will be provided with all the information posted on the Internet, including posts, economics, entertainment and my information.

Claims (9)

  1. A key input unit including a plurality of keys and function keys for inputting numbers and characters; An audio processor converting the analog voice signal input into the microphone into a digital voice signal and converting the digital voice signal provided from the controller into an analog voice signal and outputting the analog voice signal to a speaker; A modulation / demodulator for encoding and decoding voice signals and data packets transmitted and received through a wireless network; A transceiver for connecting to a wireless network through an antenna, up-converting and harmonic-amplifying the encoded voice signals and data packets to be transmitted to the wireless network, and performing low-noise amplification and frequency down-conversion of the signal received from the wireless network; An image input unit which inputs a surrounding image and converts the image into a digital signal through a built-in DSP; An image processor including one or more image codecs among a JPEG codec, an MPEG codec, and a wavelet codec and processing an image signal applied from an image input unit in units of frames, and outputting the image signal according to characteristics of a display unit and a display standard; It includes a display unit for displaying the image of the frame unit applied by the image processing unit and the message (mail), content, news, weather, life information data that is applied by the control unit in a text or text,
    Speech recognition is performed by detecting the start and end points of the voice from the user's voice input into the microphone, extracting the speech section, forming a word by combining the phonemes and syllables extracted from the speech section, and recognizing a sentence composed of the combination of words as a voice command. Device; A standard code table for converting an operation program and a voice recognition command of the portable terminal into a control code, a memory unit for storing a data packet generated during an operation of the portable terminal; Controls the overall operation of the mobile terminal according to the set operation program, and accesses the corresponding information according to the recognition result of the voice command in the voice recognition mode to provide voice transmission and display, or to request and receive a service corresponding to the wireless network. In the voice recognition device of a portable terminal further comprising a control unit for transmitting information to the display unit and the voice,
    The speech recognition apparatus includes: a word combination unit configured to detect a start point and an end point of an input speech, extract a speech section, and form a word by combining phonemes and syllables detected in the speech section;
    A word recognition unit recognizing the combined words and constructing sentences;
    A voice recognition unit recognizing a sentence composed of a combination of words as a voice command;
    And a code conversion unit for converting the recognized voice command into a control code by applying a Hangul standard code table stored in a memory unit.
  2. delete
  3. delete
  4. delete
  5. The method of claim 1,
    The speech recognition unit parses the input voice to identify attributes and analyze the dependency between words to generate a sentence of the speech command sentence;
    And a parser that analyzes the actual meaning of the command by applying a Hangul standard code table stored in a memory unit to a sentence of the voice command generated by the parser.
  6. The method of claim 1,
    The code conversion unit parses the syntax of the speech recognition command to analyze the actual meaning of the command in the speech recognition unit and analyzes the dependencies between words;
    A syntax interpreter configured to determine a practical meaning of the command by applying a Korean standard code table to the analyzed voice command;
    A syllable converter for syllable conversion of a voice command whose meaning is grasped by the parser;
    And a file generator for generating the syllable-converted voice command into a file.
  7. (a) initializing the system and activating a voice recognition mode when a voice input is detected in a standby state of the portable terminal;
    (b) combining words of a microphone input into words to generate words, and analyzing word attributes and dependencies between words to determine the meaning of words;
    (c) generating a sentence by combining the words whose meaning is identified in the step (b);
    (d) recognizing a sentence generated by the combination of words in the step (c) as a voice command and analyzing the actual meaning of the voice command by applying a set Korean standard code table;
    (e) generating a file of the voice command whose meaning is analyzed in step (d) and converting the file into a control code;
    (f) accessing information corresponding to the voice command recognized by the operation of the operation program according to the control code converted in step (e), transmitting the signal through a speaker, and simultaneously displaying the information on the display unit. Speech recognition method.
  8. The method of claim 7, wherein
    The information matching the voice command recognized in the step (e) is the display and listening of the received and stored message (mail), the service request of the securities, news, weather, living information, content information from the wireless network, received according to the service request A voice recognition method of a portable terminal further comprising displaying and transmitting the voice information.
  9. The method of claim 7, wherein
    In the voice recognition mode of the step (a), the voice recognition method of a mobile terminal further comprising providing a voice (edit) of the message (mail) and the output of the received message (mail).
KR1020070040652A 2007-04-26 2007-04-26 Apparatus for speech recognition of wireless terminal and method of thereof KR100819928B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020070040652A KR100819928B1 (en) 2007-04-26 2007-04-26 Apparatus for speech recognition of wireless terminal and method of thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020070040652A KR100819928B1 (en) 2007-04-26 2007-04-26 Apparatus for speech recognition of wireless terminal and method of thereof

Publications (1)

Publication Number Publication Date
KR100819928B1 true KR100819928B1 (en) 2008-04-08

Family

ID=39533957

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020070040652A KR100819928B1 (en) 2007-04-26 2007-04-26 Apparatus for speech recognition of wireless terminal and method of thereof

Country Status (1)

Country Link
KR (1) KR100819928B1 (en)

Cited By (76)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101352695B1 (en) 2012-02-24 2014-01-17 주식회사 비엔에스웍스 Method for Displaying Contents by using Sound
WO2015005927A1 (en) * 2013-07-11 2015-01-15 Intel Corporation Device wake and speaker verification using the same audio input
KR101642918B1 (en) * 2015-08-03 2016-07-27 서치콘주식회사 Method for controlling network connection using codename protocol, network connection control server performing the same, and storage medium storing the same
KR20160127911A (en) * 2015-04-28 2016-11-07 주식회사 디오티스 Method for Providing Phone Banking based on Sentence Structure Recognition by using Linkage of Different Network
US9865248B2 (en) 2008-04-05 2018-01-09 Apple Inc. Intelligent text-to-speech conversion
KR101834624B1 (en) * 2013-06-08 2018-03-05 애플 인크. Automatically adapting user interfaces for hands-free interaction
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9966060B2 (en) 2013-06-07 2018-05-08 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
US9986419B2 (en) 2014-09-30 2018-05-29 Apple Inc. Social reminders
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10049675B2 (en) 2010-02-25 2018-08-14 Apple Inc. User profiling for voice input processing
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US10083690B2 (en) 2014-05-30 2018-09-25 Apple Inc. Better resolution when referencing to concepts
US10108612B2 (en) 2008-07-31 2018-10-23 Apple Inc. Mobile device having human language translation capability with positional feedback
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10303715B2 (en) 2017-05-16 2019-05-28 Apple Inc. Intelligent automated assistant for media exploration
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10311871B2 (en) 2015-03-08 2019-06-04 Apple Inc. Competing devices responding to voice triggers
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
US10332518B2 (en) 2017-05-09 2019-06-25 Apple Inc. User interface for correcting recognition errors
US10354652B2 (en) 2015-12-02 2019-07-16 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10381016B2 (en) 2008-01-03 2019-08-13 Apple Inc. Methods and apparatus for altering audio output signals
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US10417344B2 (en) 2014-05-30 2019-09-17 Apple Inc. Exemplar-based natural language processing
US10417405B2 (en) 2011-03-21 2019-09-17 Apple Inc. Device access using voice authentication
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10431204B2 (en) 2014-09-11 2019-10-01 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10438595B2 (en) 2014-09-30 2019-10-08 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10453443B2 (en) 2014-09-30 2019-10-22 Apple Inc. Providing an indication of the suitability of speech recognition
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10497365B2 (en) 2014-05-30 2019-12-03 Apple Inc. Multi-command single utterance input method
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification
US10529332B2 (en) 2015-03-08 2020-01-07 Apple Inc. Virtual assistant activation
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
US10636424B2 (en) 2017-11-30 2020-04-28 Apple Inc. Multi-turn canned dialog
US10643611B2 (en) 2008-10-02 2020-05-05 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US10657961B2 (en) 2013-06-08 2020-05-19 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US10657328B2 (en) 2017-06-02 2020-05-19 Apple Inc. Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US10684703B2 (en) 2018-06-01 2020-06-16 Apple Inc. Attention aware virtual assistant dismissal
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10699717B2 (en) 2014-05-30 2020-06-30 Apple Inc. Intelligent assistant for home automation
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10706841B2 (en) 2010-01-18 2020-07-07 Apple Inc. Task flow identification based on user intent
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
US10733982B2 (en) 2018-01-08 2020-08-04 Apple Inc. Multi-directional dialog
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10755051B2 (en) 2017-09-29 2020-08-25 Apple Inc. Rule-based natural language processing
US10755703B2 (en) 2017-05-11 2020-08-25 Apple Inc. Offline personal assistant
US10769385B2 (en) 2013-06-09 2020-09-08 Apple Inc. System and method for inferring user intent from speech inputs
US10791176B2 (en) 2017-05-12 2020-09-29 Apple Inc. Synchronization and task delegation of a digital assistant
US10789945B2 (en) 2017-05-12 2020-09-29 Apple Inc. Low-latency intelligent automated assistant
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
US10795541B2 (en) 2009-06-05 2020-10-06 Apple Inc. Intelligent organization of tasks items
US10810274B2 (en) 2017-05-15 2020-10-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20050122604A (en) * 2004-06-25 2005-12-29 삼성전자주식회사 Method for initiating voice recognition in wireless terminal

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20050122604A (en) * 2004-06-25 2005-12-29 삼성전자주식회사 Method for initiating voice recognition in wireless terminal

Cited By (87)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
US10381016B2 (en) 2008-01-03 2019-08-13 Apple Inc. Methods and apparatus for altering audio output signals
US9865248B2 (en) 2008-04-05 2018-01-09 Apple Inc. Intelligent text-to-speech conversion
US10108612B2 (en) 2008-07-31 2018-10-23 Apple Inc. Mobile device having human language translation capability with positional feedback
US10643611B2 (en) 2008-10-02 2020-05-05 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US10795541B2 (en) 2009-06-05 2020-10-06 Apple Inc. Intelligent organization of tasks items
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
US10706841B2 (en) 2010-01-18 2020-07-07 Apple Inc. Task flow identification based on user intent
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10049675B2 (en) 2010-02-25 2018-08-14 Apple Inc. User profiling for voice input processing
US10692504B2 (en) 2010-02-25 2020-06-23 Apple Inc. User profiling for voice input processing
US10417405B2 (en) 2011-03-21 2019-09-17 Apple Inc. Device access using voice authentication
KR101352695B1 (en) 2012-02-24 2014-01-17 주식회사 비엔에스웍스 Method for Displaying Contents by using Sound
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
US9966060B2 (en) 2013-06-07 2018-05-08 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
KR101834624B1 (en) * 2013-06-08 2018-03-05 애플 인크. Automatically adapting user interfaces for hands-free interaction
US10657961B2 (en) 2013-06-08 2020-05-19 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US10769385B2 (en) 2013-06-09 2020-09-08 Apple Inc. System and method for inferring user intent from speech inputs
WO2015005927A1 (en) * 2013-07-11 2015-01-15 Intel Corporation Device wake and speaker verification using the same audio input
US9445209B2 (en) 2013-07-11 2016-09-13 Intel Corporation Mechanism and apparatus for seamless voice wake and speaker verification
US9852731B2 (en) 2013-07-11 2017-12-26 Intel Corporation Mechanism and apparatus for seamless voice wake and speaker verification
US10657966B2 (en) 2014-05-30 2020-05-19 Apple Inc. Better resolution when referencing to concepts
US10083690B2 (en) 2014-05-30 2018-09-25 Apple Inc. Better resolution when referencing to concepts
US10699717B2 (en) 2014-05-30 2020-06-30 Apple Inc. Intelligent assistant for home automation
US10714095B2 (en) 2014-05-30 2020-07-14 Apple Inc. Intelligent assistant for home automation
US10497365B2 (en) 2014-05-30 2019-12-03 Apple Inc. Multi-command single utterance input method
US10417344B2 (en) 2014-05-30 2019-09-17 Apple Inc. Exemplar-based natural language processing
US10431204B2 (en) 2014-09-11 2019-10-01 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10453443B2 (en) 2014-09-30 2019-10-22 Apple Inc. Providing an indication of the suitability of speech recognition
US9986419B2 (en) 2014-09-30 2018-05-29 Apple Inc. Social reminders
US10390213B2 (en) 2014-09-30 2019-08-20 Apple Inc. Social reminders
US10438595B2 (en) 2014-09-30 2019-10-08 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US10311871B2 (en) 2015-03-08 2019-06-04 Apple Inc. Competing devices responding to voice triggers
US10529332B2 (en) 2015-03-08 2020-01-07 Apple Inc. Virtual assistant activation
KR101707086B1 (en) * 2015-04-28 2017-02-15 주식회사 디오티스 Method for Providing Phone Banking based on Sentence Structure Recognition by using Linkage of Different Network
KR20160127911A (en) * 2015-04-28 2016-11-07 주식회사 디오티스 Method for Providing Phone Banking based on Sentence Structure Recognition by using Linkage of Different Network
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
KR101642918B1 (en) * 2015-08-03 2016-07-27 서치콘주식회사 Method for controlling network connection using codename protocol, network connection control server performing the same, and storage medium storing the same
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10354652B2 (en) 2015-12-02 2019-07-16 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10580409B2 (en) 2016-06-11 2020-03-03 Apple Inc. Application integration with a digital assistant
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
US10553215B2 (en) 2016-09-23 2020-02-04 Apple Inc. Intelligent automated assistant
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10332518B2 (en) 2017-05-09 2019-06-25 Apple Inc. User interface for correcting recognition errors
US10755703B2 (en) 2017-05-11 2020-08-25 Apple Inc. Offline personal assistant
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US10847142B2 (en) 2017-05-11 2020-11-24 Apple Inc. Maintaining privacy of personal information
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
US10789945B2 (en) 2017-05-12 2020-09-29 Apple Inc. Low-latency intelligent automated assistant
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US10791176B2 (en) 2017-05-12 2020-09-29 Apple Inc. Synchronization and task delegation of a digital assistant
US10810274B2 (en) 2017-05-15 2020-10-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10303715B2 (en) 2017-05-16 2019-05-28 Apple Inc. Intelligent automated assistant for media exploration
US10657328B2 (en) 2017-06-02 2020-05-19 Apple Inc. Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10755051B2 (en) 2017-09-29 2020-08-25 Apple Inc. Rule-based natural language processing
US10636424B2 (en) 2017-11-30 2020-04-28 Apple Inc. Multi-turn canned dialog
US10733982B2 (en) 2018-01-08 2020-08-04 Apple Inc. Multi-directional dialog
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10684703B2 (en) 2018-06-01 2020-06-16 Apple Inc. Attention aware virtual assistant dismissal
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance
US10504518B1 (en) 2018-06-03 2019-12-10 Apple Inc. Accelerated task performance

Similar Documents

Publication Publication Date Title
US20180018544A1 (en) Translation and display of text in picture
US8812325B2 (en) Use of multiple speech recognition software instances
US8244540B2 (en) System and method for providing a textual representation of an audio message to a mobile device
JP3728177B2 (en) Audio processing system, apparatus, method, and storage medium
US8265933B2 (en) Speech recognition system for providing voice recognition services using a conversational language model
JP4135307B2 (en) Voice interpretation service method and voice interpretation server
US8328089B2 (en) Hands free contact database information entry at a communication device
EP1603291B1 (en) Information transmission system and information transmission method
US7224989B2 (en) Communication terminal having a predictive text editor application
US6701162B1 (en) Portable electronic telecommunication device having capabilities for the hearing-impaired
JP4768969B2 (en) Understanding synchronization semantic objects for advanced interactive interfaces
US7409349B2 (en) Servers for web enabled speech recognition
US8655659B2 (en) Personalized text-to-speech synthesis and personalized speech feature extraction
CA2484246C (en) Sequential multimodal input
CN201440733U (en) Mobile speech communication terminal suitable for person with language barrier
KR100735663B1 (en) Method for batch processing of command using pattern recognition of panel input in portable communication terminal
US7421390B2 (en) Method and system for voice control of software applications
JP4768970B2 (en) Understanding synchronous semantic objects implemented with voice application language tags
US6263202B1 (en) Communication system and wireless communication terminal device used therein
CN102543071B (en) Voice recognition system and method used for mobile equipment
US9183843B2 (en) Configurable speech recognition system using multiple recognizers
KR101098716B1 (en) Combing use of a stepwise markup language and an object oriented development tool
US7506022B2 (en) Web enabled recognition architecture
US6816837B1 (en) Voice macros for scanner control
US7962344B2 (en) Depicting a speech user interface via graphical elements

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20110331

Year of fee payment: 4

LAPS Lapse due to unpaid annual fee