CN102520792A - Voice-type interaction method for network browser - Google Patents

Voice-type interaction method for network browser Download PDF

Info

Publication number
CN102520792A
CN102520792A CN2011103887723A CN201110388772A CN102520792A CN 102520792 A CN102520792 A CN 102520792A CN 2011103887723 A CN2011103887723 A CN 2011103887723A CN 201110388772 A CN201110388772 A CN 201110388772A CN 102520792 A CN102520792 A CN 102520792A
Authority
CN
China
Prior art keywords
browser
information
control
client
control command
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011103887723A
Other languages
Chinese (zh)
Inventor
林云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
JIANGSU QIYIDIAN NETWORKS CO Ltd
Original Assignee
JIANGSU QIYIDIAN NETWORKS CO Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JIANGSU QIYIDIAN NETWORKS CO Ltd filed Critical JIANGSU QIYIDIAN NETWORKS CO Ltd
Priority to CN2011103887723A priority Critical patent/CN102520792A/en
Publication of CN102520792A publication Critical patent/CN102520792A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a voice-type interaction method for a network browser, which comprises the following steps that: 1) a voice identification engine is established on a server; 2) after a client opens a network browser, voice of a user is collected through a microphone, voice characteristic information in the user voice is extracted and collected, and the voice characteristic information is transmitted to the server; 3) the server receives the voice characteristic information transmitted by the client, calls the voice identification engine to convert the voice characteristic information to a browser control order and transmits the browser control order to the client; and 4) the client receives the browser control order transmitted by the server and executes the browser control order to realize the interaction with the network browser. The voice-type interaction method has advantages that: the network function of the browser can be adequately used for realizing the calling of the voice identification engine on the server and for realizing the voice-type interaction with the network browser, the user experience is good, and the simplicity and convenience in use can be realized.

Description

The speech type exchange method that is used for web browser
Technical field
The present invention relates to field of human-computer interaction, be specifically related to a kind of speech type exchange method that is used for web browser.
Background technology
The The Research of Speech Recognition of China originates in 1958, by 10 vowels of Chinese Academy of Sciences's vacuum tube circuit that acoustics utilizes identification.Just discerned until 1973 by Chinese Academy of Sciences's computer speech that acoustics begins.Because the restriction of prevailing condition, the The Research of Speech Recognition work of China is in slow development stages always.Get into after the eighties, along with Computer Applied Technology is popularized and the further developing of application and digital signal technique in China gradually, domestic many units have possessed the pacing items of research voice technology.Meanwhile, speech recognition technology becomes the focus of research heavily again after having passed through silence for many years in the world, and development rapidly.Just under this form, domestic many units put into one after another in this research work and go.In March, 1986 China's development in Hi-Tech plan (863 Program) starts, and speech recognition is classified as research topic specially as an important component part of intelligent computer systems research.Under the support of 863 Program, China has begun the research of organized speech recognition technology, and has determined every special meeting of holding a speech recognition at a distance from 2 years.From then on the speech recognition technology of China has got into a unprecedented developing stage.Especially along with the most in the last few years, national and various commercial undertakings are to the attention of speech recognition, and speech recognition technology is mature on the whole at present, and in commercial application, obtained using widely.
Web browser has become the main entrance of operating system and types of applications platform at present; Become one of application software main in the operating system gradually, the user experience that therefore how to improve web browser has become web browser and has attracted one of main means of user.And web browser is particularly useful for speech recognition technology comparatively speaking because content identified is single relatively.
Summary of the invention
The technical matters that the present invention will solve provides that the speech type that calls, realizes web browser that a kind of network function that can make full use of browser itself realizes the service end speech recognition engine is mutual, user experience good, the speech type exchange method that is used for web browser easy to use.
In order to solve the problems of the technologies described above, the technical scheme that the present invention adopts is:
A kind of speech type exchange method that is used for web browser, implementation step is following:
1) service end is set up speech recognition engine;
2) client is gathered user speech through microphone after opening web browser, extracts the phonetic feature information in the user speech that collects, and said phonetic feature information is sent to service end;
3) said service end receives the phonetic feature information that client is sent, and calling speech recognition engine is the browser control command with the phonetic feature information translation, and said browser control command is sent to client;
4) client receives the browser control command that said service end is sent, and carries out the mutual of said browser control command realization and web browser.
Further improvement as technique scheme:
The server calls speech recognition engine is that the concrete steps of browser control command comprise with the phonetic feature information translation in the said step 3): calling speech recognition engine is Word message with the phonetic feature information translation; Said Word message is divided into control model information and control command information; Three kinds of the location input of said control model packets of information purse rope, current page and label control, browser program controls, said control command information comprises the shortcut that is used for correspondence under said control model information.
The concrete steps of the said browser control command of client executing comprise in the said step 4): client reads the control model information of browser control command; If control model information is the network address input; Then, comprise the key-press event of shortcut then to operating system transmitting control commands information with the address input field of the current focus fixer network browser of operating system; If control model information is the control of current page and label,, comprise the key-press event of shortcut then to operating system transmitting control commands information then with the page or the label of the current focus fixer network browser of operating system; If control model information is browser program control,, comprise the key-press event of shortcut then to operating system transmitting control commands information then with the window of the current focus fixer network browser of operating system.
If said client reads the failure of control model information when reading the control model information of browser control command, then current Shipping Options Page or the current page with web browser navigates to preset network address.
The present invention has following advantage:
The present invention sets up speech recognition engine, client after opening web browser through service end; Gather user speech through microphone; Phonetic feature information in the user speech that extraction collects; And phonetic feature information is sent to service end, service end receive the phonetic feature information that client is sent; Calling speech recognition engine is the browser control command with the phonetic feature information translation; And the browser control command is sent to client, client receive the browser control command that service end is sent, and carry out the browser control command and realize mutual with web browser, can make full use of the calling of network function realization service end speech recognition engine of browser itself; And speech recognition engine is arranged on service end and can makes things convenient at any time and upgrade speech recognition engine and client need not any change and can improve speech recognition performance, has good, the easy to use advantage of user experience.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art; To do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below; Obviously, the accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the main schematic flow sheet of the embodiment of the invention.
Embodiment
Below in conjunction with accompanying drawing the preferred embodiments of the present invention are set forth in detail, thereby protection scope of the present invention is made more explicit defining so that advantage of the present invention and characteristic can be easier to it will be appreciated by those skilled in the art that.
As shown in Figure 1, the implementation step of speech type exchange method that present embodiment is used for web browser is following:
1) service end is set up speech recognition engine;
2) client is gathered user speech through microphone after opening web browser, extracts the phonetic feature information in the user speech that collects, and phonetic feature information is sent to service end;
3) service end receives the phonetic feature information that client is sent, and calling speech recognition engine is the browser control command with the phonetic feature information translation, and the browser control command is sent to client;
4) client receives the browser control command that service end is sent, and carries out the mutual of realization of browser control command and web browser.
The server calls speech recognition engine is that the concrete steps of browser control command comprise with the phonetic feature information translation in the present embodiment step 3): calling speech recognition engine is Word message with the phonetic feature information translation; Word message is divided into control model information and control command information; Three kinds of the location input of control model packets of information purse rope, current page and label control, browser program controls, control command information comprises the shortcut that is used for correspondence under control model information.
The concrete steps of client executing browser control command comprise in the present embodiment step 4): client reads the control model information of browser control command; If control model information is the network address input; Then, comprise the key-press event of shortcut then to operating system transmitting control commands information with the address input field of the current focus fixer network browser of operating system; If control model information is the control of current page and label,, comprise the key-press event of shortcut then to operating system transmitting control commands information then with the page or the label of the current focus fixer network browser of operating system; If control model information is browser program control,, comprise the key-press event of shortcut then to operating system transmitting control commands information then with the window of the current focus fixer network browser of operating system.
If the present embodiment client reads the failure of control model information when reading the control model information of browser control command, then current Shipping Options Page or the current page with web browser navigates to preset network address.
 
The above only is a preferred implementation of the present invention, and protection scope of the present invention also not only is confined to the foregoing description, and all technical schemes that belongs under the thinking of the present invention all belong to protection scope of the present invention.Should be pointed out that for those skilled in the art in the some improvement and the retouching that do not break away under the principle of the invention prerequisite, these improvement and retouching also should be regarded as protection scope of the present invention.

Claims (4)

1. speech type exchange method that is used for web browser is characterized in that implementation step is following:
1) service end is set up speech recognition engine;
2) client is gathered user speech through microphone after opening web browser, extracts the phonetic feature information in the user speech that collects, and said phonetic feature information is sent to service end;
3) said service end receives the phonetic feature information that client is sent, and calling speech recognition engine is the browser control command with the phonetic feature information translation, and said browser control command is sent to client;
4) client receives the browser control command that said service end is sent, and carries out the mutual of said browser control command realization and web browser.
2. the speech type exchange method that is used for web browser according to claim 1; It is characterized in that: the server calls speech recognition engine is that the concrete steps of browser control command comprise with the phonetic feature information translation in the said step 3): calling speech recognition engine is Word message with the phonetic feature information translation; Said Word message is divided into control model information and control command information; Three kinds of the location input of said control model packets of information purse rope, current page and label control, browser program controls, said control command information comprises the shortcut that is used for correspondence under said control model information.
3. the speech type exchange method that is used for web browser according to claim 2; It is characterized in that: the concrete steps of the said browser control command of client executing comprise in the said step 4): client reads the control model information of browser control command; If control model information is the network address input; Then, comprise the key-press event of shortcut then to operating system transmitting control commands information with the address input field of the current focus fixer network browser of operating system; If control model information is the control of current page and label,, comprise the key-press event of shortcut then to operating system transmitting control commands information then with the page or the label of the current focus fixer network browser of operating system; If control model information is browser program control,, comprise the key-press event of shortcut then to operating system transmitting control commands information then with the window of the current focus fixer network browser of operating system.
4. the speech type exchange method that is used for web browser according to claim 3; It is characterized in that: if said client reads the failure of control model information when reading the control model information of browser control command, then current Shipping Options Page or the current page with web browser navigates to preset network address.
CN2011103887723A 2011-11-30 2011-11-30 Voice-type interaction method for network browser Pending CN102520792A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011103887723A CN102520792A (en) 2011-11-30 2011-11-30 Voice-type interaction method for network browser

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011103887723A CN102520792A (en) 2011-11-30 2011-11-30 Voice-type interaction method for network browser

Publications (1)

Publication Number Publication Date
CN102520792A true CN102520792A (en) 2012-06-27

Family

ID=46291744

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011103887723A Pending CN102520792A (en) 2011-11-30 2011-11-30 Voice-type interaction method for network browser

Country Status (1)

Country Link
CN (1) CN102520792A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102830915A (en) * 2012-08-02 2012-12-19 聚熵信息技术(上海)有限公司 Semanteme input control system and method
CN102843598A (en) * 2012-09-18 2012-12-26 四川长虹电器股份有限公司 Browser interaction method for smart television
CN103020165A (en) * 2012-11-26 2013-04-03 北京奇虎科技有限公司 Browser capable of performing voice recognition processing and processing method
CN103442130A (en) * 2013-04-10 2013-12-11 威盛电子股份有限公司 Voice control method, mobile terminal device and voice control system
CN104123085A (en) * 2014-01-14 2014-10-29 腾讯科技(深圳)有限公司 Method and device for having access to multimedia interaction website through voice
CN107885481A (en) * 2017-10-26 2018-04-06 中国地质大学(武汉) The page sound control method and voice browser of a kind of browser of mobile terminal
CN108777808A (en) * 2018-06-04 2018-11-09 深圳Tcl数字技术有限公司 Text-to-speech method, display terminal and storage medium based on display terminal
CN109994110A (en) * 2018-12-06 2019-07-09 平安科技(深圳)有限公司 Audio recognition method, device based on artificial intelligence, computer equipment
CN113921016A (en) * 2021-10-15 2022-01-11 阿波罗智联(北京)科技有限公司 Voice processing method, device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1606772A (en) * 2002-04-10 2005-04-13 三菱电机株式会社 Method for distributed automatic speech recognition and distributed automatic speech recognition system
CN201402461Y (en) * 2008-12-30 2010-02-10 同济大学第一附属中学 Sound-control web-page browser and voice control module thereof
CN101916266A (en) * 2010-07-30 2010-12-15 优视科技有限公司 Voice control web page browsing method and device based on mobile terminal

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1606772A (en) * 2002-04-10 2005-04-13 三菱电机株式会社 Method for distributed automatic speech recognition and distributed automatic speech recognition system
CN201402461Y (en) * 2008-12-30 2010-02-10 同济大学第一附属中学 Sound-control web-page browser and voice control module thereof
CN101916266A (en) * 2010-07-30 2010-12-15 优视科技有限公司 Voice control web page browsing method and device based on mobile terminal

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102830915A (en) * 2012-08-02 2012-12-19 聚熵信息技术(上海)有限公司 Semanteme input control system and method
CN102843598A (en) * 2012-09-18 2012-12-26 四川长虹电器股份有限公司 Browser interaction method for smart television
CN103020165A (en) * 2012-11-26 2013-04-03 北京奇虎科技有限公司 Browser capable of performing voice recognition processing and processing method
CN103020165B (en) * 2012-11-26 2016-06-22 北京奇虎科技有限公司 Browser and the processing method of voice recognition processing can be carried out
CN103442130A (en) * 2013-04-10 2013-12-11 威盛电子股份有限公司 Voice control method, mobile terminal device and voice control system
CN104123085B (en) * 2014-01-14 2015-08-12 腾讯科技(深圳)有限公司 By the method and apparatus of voice access multimedia interaction website
WO2015106688A1 (en) * 2014-01-14 2015-07-23 Tencent Technology (Shenzhen) Company Limited Method and apparatus for voice access to multimedia interactive website
CN104123085A (en) * 2014-01-14 2014-10-29 腾讯科技(深圳)有限公司 Method and device for having access to multimedia interaction website through voice
US10936280B2 (en) 2014-01-14 2021-03-02 Tencent Technology (Shenzhen) Company Limited Method and apparatus for accessing multimedia interactive website by determining quantity of characters in voice spectrum
CN107885481A (en) * 2017-10-26 2018-04-06 中国地质大学(武汉) The page sound control method and voice browser of a kind of browser of mobile terminal
CN108777808A (en) * 2018-06-04 2018-11-09 深圳Tcl数字技术有限公司 Text-to-speech method, display terminal and storage medium based on display terminal
CN108777808B (en) * 2018-06-04 2021-01-12 深圳Tcl数字技术有限公司 Text-to-speech method based on display terminal, display terminal and storage medium
CN109994110A (en) * 2018-12-06 2019-07-09 平安科技(深圳)有限公司 Audio recognition method, device based on artificial intelligence, computer equipment
CN113921016A (en) * 2021-10-15 2022-01-11 阿波罗智联(北京)科技有限公司 Voice processing method, device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN102520792A (en) Voice-type interaction method for network browser
CN104050966B (en) The voice interactive method of terminal device and the terminal device for using this method
CN107277272A (en) A kind of bluetooth equipment voice interactive method and system based on software APP
CN106847274B (en) Man-machine interaction method and device for intelligent robot
WO2014176894A1 (en) Voice processing method and terminal
WO2017128775A1 (en) Voice control system, voice processing method and terminal device
CN103346902B (en) The method and system of data acquisition scheduling
CN102790727A (en) Method and system for dynamically pushing personal labels of users
CN103365836A (en) Natural language utilized distributed intelligent interaction achieving method and system thereof
CN104615670A (en) Method for supporting multiple rendering engines in android browser and browser
CN101957771A (en) Method and device for installing mobile software for multiple mobile phones simultaneously
CN102811288A (en) Method and device for recording call information
CN101764898B (en) Shortcut dialling method, client and system
CN103888297A (en) Interchanger network management method and system
CN109683715A (en) A kind of VR apparatus control method, device and computer readable storage medium
CN109151564A (en) Apparatus control method and device based on microphone
CN110401939B (en) Low-power consumption bluetooth controller link layer device
CN110335599B (en) Voice control method, system, equipment and computer readable storage medium
CN103996400A (en) Speech recognition method
CN103067464B (en) Intelligent terminal method for remote controlling computer and system
CN106486111B (en) Multi-TTS engine output speech speed adjusting method and system based on intelligent robot
CN103499972B (en) A kind of many scenes of all-purpose robot control method based on containment type hierarchical structure
CN106197394A (en) Air navigation aid and device
CN102014199A (en) Information display method and terminal
CN104301500A (en) Terminal control method and device and terminal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120627