CN1306470C - Voice identification method of portable terminal apparatus - Google Patents

Voice identification method of portable terminal apparatus Download PDF

Info

Publication number
CN1306470C
CN1306470C CNB2004100571714A CN200410057171A CN1306470C CN 1306470 C CN1306470 C CN 1306470C CN B2004100571714 A CNB2004100571714 A CN B2004100571714A CN 200410057171 A CN200410057171 A CN 200410057171A CN 1306470 C CN1306470 C CN 1306470C
Authority
CN
China
Prior art keywords
voice
speech
user
identification
processes portion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2004100571714A
Other languages
Chinese (zh)
Other versions
CN1624764A (en
Inventor
柳在滢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur LG Digital Mobile Communications Co Ltd
Original Assignee
LG Electronics China Research and Development Center Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics China Research and Development Center Co Ltd filed Critical LG Electronics China Research and Development Center Co Ltd
Publication of CN1624764A publication Critical patent/CN1624764A/en
Application granted granted Critical
Publication of CN1306470C publication Critical patent/CN1306470C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/26Devices for calling a subscriber
    • H04M1/27Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/271Devices whereby a plurality of signals may be stored simultaneously controlled by voice recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Abstract

The invention relates to a voice identification method of portable terminal apparatus. The object of the invention is to perform the conforming step to prevent maloperation resulted from false identification when the portable terminal apparatus recognizes voice command. The method comprises the following steps: step1, speech processing section affirming whether voice command is present; step2, comparing voice command with stored standard speech pattern to recognizing the correlative voice in speech processing section; step3, the recognized voice being diaplayed on the liquid crystal screen by words or outputing voice towards speaker to hint; step4, user validating whether the speech recognition is right to execute the recognized voice command.

Description

The audio recognition method of mobile terminal device
Technical field
The present invention relates to a kind of technology of mobile terminal device, particularly a kind of audio recognition method that possesses the mobile terminal device of speech identifying function.
Background technology
At present, mobile terminal device, especially mobile communication terminal (hereinafter to be referred as mobile phone) because of it has the removable function and the portable small and exquisite outward appearance of carrying out wireless communication whenever and wherever possible, are accepted by increasing people gradually and are used.
, along with the live complicated variation of mode of modern, mobile phone not only is used for the transmission of telephonic communication or information, also requires it to have easier function simultaneously in operation.
But common mobile phone must could dial by button, and therefore if when needing to make a phone call in driving, the hidden danger that can bring out large-scale traffic hazard with regard to existence is so occurred possessing the mobile phone of speech identifying function on the market.
And the mobile phone that possesses speech identifying function need not user key-press, also can discern its phonetic order, and carries out corresponding function.
In the existing speech recognition technology, it is 2000-57401 (title: mobile phone phonetic is discerned the wireless hands-free device) and 2000-81084 (title: patented technology voice identification telephone set) that Republic of Korea's number of patent application is arranged.
Possess speech engine in this class technical pattern, the specific function commands that in a single day user will carry out with phonetic entry, specific action will be called or finish to mobile terminal device.
, even if the function of speech recognition machine increases, its phonetic recognization rate does not reach 100% yet, and when therefore utilizing speech engine to converse, also there is the inconvenience of bringing to the user because of maloperation in the technical limitation that still exists in the past.
That is, when utilizing speech identifying function to send phonetic order,, will take place as calling out the maloperation of other telephone numbers if the mobile phone mistake is discerned this phonetic order in the past.When utilizing the mobile terminal device dialing that is equipped with the non-user-specific speech engine in the past, in case the user can not assign instruction exactly or think that environment is more noisy because of speak with a lisp, simultaneously when if " digit dialling (digit dial) " or " name dialing (name dial) " instruction is arranged, speech engine will be discerned its instruction by mistake, might the beyond thought phone of connecting subscribers participating.
Particularly, Republic of Korea's number of patent application is the technology of 2000-57401, needs to be equipped with relevant hands-free device and microphone, so also exist the problem that cost increases.
Summary of the invention
The objective of the invention is to: in order to improve the problem that prior art exists in practical operation, a kind of audio recognition method of mobile terminal device is provided, make it when recognizing voice is instructed, carry out, thereby can prevent the maloperation that the identification of voice mistake causes its step of being confirmed.
In order to realize described purpose, the audio recognition method operation steps of mobile terminal device of the present invention is made up of following step, step 1, and speech processes portion confirms whether to have phonetic order; Step 2, in described speech processes portion, described phonetic order compares with the received pronunciation pattern that has stored wherein, and the identification related voice; Whether successfully step 3 with the voice of literal or the described identification of voice suggestion, confirms identification; Step 4 if confirm that through the user described speech recognition is correct, is carried out its recognizing voice instruction.Wherein said method comprises following concrete steps:
Behind the interface of a plurality of instructions of LCDs 120 output, when the user sent the voice signal of dialing, the related voice signal will be imported microphone 150, and was delivered to speech processes portion 130 and handles;
Become audio digital signals via the voice signal of microphone 150 input through the treatment conversion of speech processes portion 130, and compare with the received pronunciation pattern, the identification related voice confirms that simultaneously its voice are " dialing ";
The requirement of input telephone number is pointed out to user speech by loudspeaker 160;
When the user uses the phonetic entry telephone number, compare the identification related voice once more with the received pronunciation pattern;
Speech processes portion 130 through the voice delivery of identification to control part 110, and be presented on the LCDs 120, simultaneously with loudspeaker 160 outputs " needing to dial? " voice suggestion, confirm that with this whether related voice instruction discern successful execution;
When the user to microphone 150 input "Yes" voice, speech processes portion 130 confirms that it is correct, and phone numbers associated is delivered to RF handling part 140 by speech recognition, finishes dialing; Otherwise to microphone 150 input "No" voice, speech processes portion 130 confirms that current speech is identified as mistake, and removes its speech recognition as the user.
When speech recognition is failed, comprise in its implementation, invite the step of user input voice instruction once more.
Beneficial effect of the present invention is: the present invention allows the user confirm the correctness that instructs calling out (call) before, and the probability of the identification that therefore can reduce greatly to make a mistake has simultaneously and is convenient to the effect of user with the phonetic order dialing.Speech engine is installed on mobile terminal device, and before carrying out phonetic order, allows the user confirm, therefore can prevent the maloperation that mistake identification causes, bring advantage to the user.
Description of drawings
Fig. 1 is and the corresponding mobile phone block scheme of the embodiment of the invention;
Fig. 2 is LCDs (LCD) synoptic diagram of carrying out phonetic order in the corresponding embodiment of the invention;
Fig. 3 is the operational flowchart that sends the phonetic order identification step according to the embodiment of the invention.
Main identification division explanation in the accompanying drawing:
110: control part 120: LCDs (LCD)
130: speech processes portion 140: radio frequency (RF) handling part
150: microphone 160: loudspeaker
Embodiment
Below in conjunction with accompanying drawing, the present invention will be described in detail.
In embodiments of the present invention, the dial-up operation process of utilizing speech recognition is illustrated.
Accompanying drawing 1 is and the corresponding mobile phone block scheme of the embodiment of the invention.As shown in the drawing, it structurally is made up of following components: carry out wireless telecommunications with wireless network and the RF handling part 140 that is provided with; Voice and received pronunciation pattern by microphone 150 inputs compare, and discern, in case confirm its speech recognition success, just phone numbers associated are delivered to the speech processes portion 130 of described RF handling part 140; For recognizing voice, control described speech processes portion 130, and its voice identification result is presented at the control part 110 of LCDs 120.
Described speech processes portion 130 is made up of speech pattern memory function and speech pattern recognition function.
The work and the action effect that regard to the embodiment of the invention of forming as mentioned above down describe.
Accompanying drawing 2 is LCDs (LCD) synoptic diagram of carrying out phonetic order in the corresponding embodiment of the invention; Accompanying drawing 3 is the operational flowcharts that send the phonetic order identification step according to the embodiment of the invention.
That is, embodiments of the invention, be made up of following step: speech processes portion 130 confirms whether to have phonetic order; In described speech processes portion 130, described phonetic order compares with the received pronunciation pattern that has stored, and the identification related voice; Described recognizing voice, be presented at LCDs 120 or point out to loudspeaker output voice with literal; Confirm that through the user described speech recognition is correct, carry out its recognizing voice instruction.Below, it is described in detail.
At first, behind the interface of a plurality of instructions of LCDs 120 outputs, in case the user sends the voice signal of dialing once more, the related voice signal will be imported microphone 150, and is delivered to speech processes portion 130 by control part 110.
At this moment, voice signal by microphone 150 inputs, processing such as the filtering of process speech processes portion 130, sampling, after converting audio digital signals to, compare with the received pronunciation pattern, and the identification related voice, confirm that simultaneously its voice are " dialing ", will pass through loudspeaker 160 to the requirement of input telephone number to user prompt.When the user uses the phonetic entry telephone number, compare the identification related voice once more with the received pronunciation pattern.
Secondly, speech processes portion 130 arrives control part 110 to the voice delivery through identification, and be presented on the LCDs 120, show " needing dialing? " simultaneously information, and with loudspeaker 160 output " needing dialing? " voice suggestion, confirm that with this whether related voice instruction discern successful execution.
Then, the user is to microphone 150 input "Yes" voice, and speech processes portion 130 confirms that it is correct, and phone numbers associated is delivered to RF handling part 140 by speech recognition, finishes dialing.
Otherwise to microphone 150 input "No" voice, speech processes portion 130 confirms that current speech is identified as mistake, and removes its speech recognition, imports the prompting of phonetic order once more to loudspeaker 160 outputs as the user.
In addition, though a phonetic order step to relevant dialing is illustrated in described description, steps such as dialing or voice messaging transmission also can be finished with same step once more.
Simultaneously, though in described description, the step of finishing speech recognition in speech processes portion 130 is illustrated.But described step further comprises: speech processes portion 130 is for the voice signal by microphone 150 input, after carrying out filtering, sampling etc. and handling, converts audio digital signals to, and finishes its speech recognition steps by control part 110.In this case, described control part 110 need possess speech pattern memory function and speech pattern recognition function.
Above embodiment only is used to illustrate the present invention, but not is used to limit the present invention.

Claims (2)

1. the audio recognition method of a mobile terminal device is characterized in that: confirm whether to have phonetic order; Described phonetic order compares with the received pronunciation pattern that has stored, and the identification related voice; Point out the voice of described identification with literal or voice mode, confirm whether successfully identification; Through confirming that if described speech recognition is correct, carry out its recognizing voice instruction; Wherein, described method comprises following concrete steps:
Behind the interface of a plurality of instructions of LCDs (120) output, when the user sent the voice signal of dialing, the related voice signal will be imported microphone (150), and was delivered to speech processes portion (130) and handles;
Become audio digital signals via the voice signal of microphone (150) input through the treatment conversion of speech processes portion (130), and compare with the received pronunciation pattern, the identification related voice confirms that simultaneously its voice are " dialing ";
The requirement of input telephone number is pointed out to user speech by loudspeaker (160);
When the user uses the phonetic entry telephone number, compare the identification related voice once more with the received pronunciation pattern;
Speech processes portion (130) arrives control part (110) to the voice delivery through identification, and be presented on the LCDs (120), use that simultaneously loudspeaker (160) is exported " needing dialing? " voice suggestion, confirm that with this whether related voice instruction discern successful execution;
When the user to microphone (150) input "Yes" voice, speech processes portion (130) confirms that by speech recognition it is correct, and phone numbers associated is delivered to RF handling part (140), finishes dialing; Otherwise to microphone (150) input "No" voice, speech processes portion (130) confirms that current speech is identified as mistake, and removes its speech recognition as the user.
2. the audio recognition method of mobile terminal device according to claim 1 is characterized in that: when speech recognition is failed, comprise in its implementation, invite the step of user input voice instruction once more.
CNB2004100571714A 2003-12-04 2004-08-27 Voice identification method of portable terminal apparatus Expired - Fee Related CN1306470C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR1020030087628 2003-12-04
KR10-2003-0087628 2003-12-04
KR1020030087628A KR100664105B1 (en) 2003-12-04 2003-12-04 Voice understanding method for hand-held terminal

Publications (2)

Publication Number Publication Date
CN1624764A CN1624764A (en) 2005-06-08
CN1306470C true CN1306470C (en) 2007-03-21

Family

ID=34793147

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100571714A Expired - Fee Related CN1306470C (en) 2003-12-04 2004-08-27 Voice identification method of portable terminal apparatus

Country Status (2)

Country Link
KR (1) KR100664105B1 (en)
CN (1) CN1306470C (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101405941B1 (en) * 2007-10-05 2014-06-27 엘지전자 주식회사 Communication terminal and method for outputting audio signal therein
KR101889836B1 (en) * 2012-02-24 2018-08-20 삼성전자주식회사 Method and apparatus for cotrolling lock/unlock state of terminal through voice recognition
CN103458090A (en) * 2012-05-28 2013-12-18 百度在线网络技术(北京)有限公司 Mobile terminal control method and mobile terminal control device
CN102945120B (en) * 2012-11-27 2015-09-02 南京恒知讯科技有限公司 A kind of based on the human-computer interaction auxiliary system in children's application and exchange method
CN105895093A (en) * 2015-11-02 2016-08-24 乐视致新电子科技(天津)有限公司 Voice information processing method and device
CN105632497A (en) * 2016-01-06 2016-06-01 昆山龙腾光电有限公司 Voice output method, voice output system
CN106231053A (en) * 2016-08-12 2016-12-14 柳州鹏达科技有限责任公司 Mobile phone based on speech recognition
CN107147672A (en) * 2017-06-19 2017-09-08 广州市讯飞樽鸿信息技术有限公司 A kind of verification method of speech recognition

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4959850A (en) * 1987-05-29 1990-09-25 Kabushiki Kaisha Toshiba Radio telephone apparatus
CN1258183A (en) * 1998-11-26 2000-06-28 日本电气株式会社 Mobile terminal having speech identifying function
US6185536B1 (en) * 1998-03-04 2001-02-06 Motorola, Inc. System and method for establishing a communication link using user-specific voice data parameters as a user discriminator
US6260012B1 (en) * 1998-02-27 2001-07-10 Samsung Electronics Co., Ltd Mobile phone having speaker dependent voice recognition method and apparatus
CN1346566A (en) * 1999-02-08 2002-04-24 高通股份有限公司 Voice recognition user interface for telephone handsets
WO2002077975A1 (en) * 2001-03-27 2002-10-03 Koninklijke Philips Electronics N.V. Method to select and send text messages with a mobile
JP2003233390A (en) * 2002-02-07 2003-08-22 Ricoh Co Ltd Information terminal equipment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4959850A (en) * 1987-05-29 1990-09-25 Kabushiki Kaisha Toshiba Radio telephone apparatus
US6260012B1 (en) * 1998-02-27 2001-07-10 Samsung Electronics Co., Ltd Mobile phone having speaker dependent voice recognition method and apparatus
US6185536B1 (en) * 1998-03-04 2001-02-06 Motorola, Inc. System and method for establishing a communication link using user-specific voice data parameters as a user discriminator
CN1258183A (en) * 1998-11-26 2000-06-28 日本电气株式会社 Mobile terminal having speech identifying function
CN1346566A (en) * 1999-02-08 2002-04-24 高通股份有限公司 Voice recognition user interface for telephone handsets
WO2002077975A1 (en) * 2001-03-27 2002-10-03 Koninklijke Philips Electronics N.V. Method to select and send text messages with a mobile
JP2003233390A (en) * 2002-02-07 2003-08-22 Ricoh Co Ltd Information terminal equipment

Also Published As

Publication number Publication date
KR20050054275A (en) 2005-06-10
KR100664105B1 (en) 2007-01-04
CN1624764A (en) 2005-06-08

Similar Documents

Publication Publication Date Title
CN103747129B (en) A kind of Bluetooth system unlocking with vocal print and wake up
US8311584B2 (en) Hands-free system and method for retrieving and processing phonebook information from a wireless phone in a vehicle
EP0746129A2 (en) Method and apparatus for controlling a telephone with voice commands
CN1306470C (en) Voice identification method of portable terminal apparatus
CN104092829A (en) Voice calling method and access gateway based on voice recognition
CN1369165A (en) Apparatus and method of controlling voice controlling operation
EP1170932B1 (en) Audible identification of caller and callee for mobile communication device
CN103929523A (en) Method and mobile terminal for intelligently processing incoming call
CN100452863C (en) Method and device for controlling visual telephone
CN1316863A (en) Method and system for operating mobile phone using voice recognition
CN1615637A (en) Portable telephone
US6256611B1 (en) Controlling a telecommunication service and a terminal
CN1489368A (en) Method and system for integrating telephones in instant communication tools
US20100173613A1 (en) Method for updating phonebook and portable terminal adapted thereto
CN1165889C (en) Method and system for voice dialling
CN110493751A (en) Method and car-mounted electronic device based on speech recognition technology making and receiving calls
CN103679898A (en) Residential security entrance guard system capable of timely starting entrance guard function
CN105007365A (en) Method and apparatus for dialing extension number
CN1893701A (en) Mobile communication terminal and method for morse signal and analysis and conversion
CN1968300A (en) Method for implementing auto dialing using blue teeth earphone
CN1809098A (en) Apparatus and method of preventing interference to incoming calls
CN1753423A (en) Method for prompting calling error of mobile communication terminal
CN1946101A (en) Method and device for realizing mobile terminal audio signal self adaption
CN1155870C (en) External data input device and its phonetic control input method
CN100342356C (en) Mobile communication terminal and its control method having on-line banking function

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: LANGCHAO LEJIN DIGITAL MOBILE COMMUNICATION CO., L

Free format text: FORMER OWNER: LG ELECTRONIC (CHINA) RESEARCH + DEVELOPMENT CENTRE CO., LTD.

Effective date: 20120309

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100102 CHAOYANG, BEIJING TO: 264006 YANTAI, SHANDONG PROVINCE

TR01 Transfer of patent right

Effective date of registration: 20120309

Address after: 264006 No. 228 Changjiang Road, Yantai Economic Development Zone, Shandong, China

Patentee after: Langchao Lejin Digital Mobile Communication Co., Ltd.

Address before: Two Beijing 100102 Chaoyang District city in Wangjing Lize Park No. 203 Petrova building block B

Patentee before: LG Electronic (China) Research and Development Center Co., Ltd.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20070321

Termination date: 20140827

EXPY Termination of patent right or utility model