CN107885481A

CN107885481A - The page sound control method and voice browser of a kind of browser of mobile terminal

Info

Publication number: CN107885481A
Application number: CN201711021099.3A
Authority: CN
Inventors: 李指明
Original assignee: China University of Geosciences
Current assignee: China University of Geosciences
Priority date: 2017-10-26
Filing date: 2017-10-26
Publication date: 2018-04-06

Abstract

The invention discloses the page sound control method and voice browser of a kind of browser of mobile terminal, is related to a kind of upper and lower rolling of Voice command browser page, advance the control method of basic operation and a kind of voice browser such as loading and retrogressing loading.The word content in the voice of user's input is identified first, afterwards by the page basic operation of the language and characters content response browser identified.Research/development platform is JDK 8.0+Eclipse 4.7.0+Android SDK 4.0.3, takes the speech recognition development kit by means of offers such as the winged, Googles of news of University of Science and Technology, the identification to user speech is realized on Android mobile intelligent terminals.The present invention can not operate the special population of browser for finger defect, and both hands are busy in operating, it is also necessary to operate the operating personnel of Other Instruments, instrument, there is provided operate the facility of electronic product, improve the quality of living and operating efficiency.Present invention can apply to mobile terminal intelligent sound control field.

Description

The page sound control method and voice browser of a kind of browser of mobile terminal

Technical field

The present invention relates to speech recognition and control, category Android intelligent controls APP research and development field, and in particular to Yi Zhongyi The page sound control method and voice browser of dynamic terminal browser.

Background technology

Speech recognition is a cross discipline.In the late two decades, speech recognition technology obtains marked improvement, starts from experiment Move towards market in room.It is contemplated that in coming 10 years, speech recognition technology will enter industry, household electrical appliances, communication, automotive electronics, doctor The every field such as treatment, home services, consumption electronic product.Field involved by speech recognition technology includes：Signal transacting, pattern Identification, probability theory and information theory, sound generating mechanism and hearing mechanism, artificial intelligence etc..

Browser refers to that web page server or HTML (the Hypertext Markup of file system can be shown Language, HTML) file content, and allow a kind of software of user and these file interactions.Web browser Mainly interacted simultaneously with web page server by HTTP (Hypertexttransfer protocol, hypertext transfer protocol) agreement Webpage is obtained, these webpages are referred to by URL (Uniform/Universal Resource Locator, URL) Fixed, file format is usually HTML, and by MIME (MultipurposeInternet Mail Extensions, Multi-function mutual Networking mail expands service) indicated in http protocol.

Current voice browser, such as Baidu and Google's browser, generally provide and the word of webpage is converted into voice certainly The dynamic speech identifying function read aloud with voice conversion text data, not using the basic operation of Voice command browser, as above The conventional basic browser operation such as lower slider, prevpage and the next page.

The content of the invention

The technical problems to be solved by the invention are, language is utilized for no existing for above-mentioned current voice browser The problem of page basic operation of sound control browser, the invention provides a kind of page Voice command of browser of mobile terminal Method and voice browser solve the above problems.

To achieve the above object, concrete scheme is as follows by the present invention.

The page sound control method of a kind of browser of mobile terminal, it is characterised in that comprise the following steps：

S1, the voice of collection user's input；

S2, the speech language classification to identification user's input judge, Selection and call and the user language classification The speech database that matches identifies voice；

Word content in the voice that S3, identification user input；

S4, the language and characters content identified is judged, if being present in speech database and the word content pair The instruction answered, then browser is controlled to perform corresponding instruction, if being instructed in the absence of corresponding, this is not responding to what this was identified Word content；

S5, pass through the page basic operation of the language and characters content response browser identified.

Further, the speech database is used for stored voice message, by inputting language and characters content and voice number Identification is made to input language and characters content after being contrasted according to the voice messaging in storehouse.

Further, the page basic operation of the browser includes upper and lower rolling, advancing loads and retreat loading.

Further, the speech SDK of carrying is the speech SDK that University of Science and Technology's news fly and Google provides.

Further, the mobile terminal is the mobile terminal based on android system, and it is JDK 8.0+ to realize platform Eclipse 4.7.0+Android SDK 4.0.3。

Also a kind of voice browser, it is characterised in that comprising with lower module：

Voice acquisition module：For gathering the voice of user's input；

Sound identification module：For judging the speech language classification of identification user's input, Selection and call and institute Speech database that user language classification matches is stated to identify voice；

Instruct judge module：For judging the language and characters content identified；

Instruct respond module：Page basic operation for the language and characters content response browser by identifying；

Speech database：For stored voice message, by inputting language and characters content and the voice in speech database Identification is made to input language and characters content after information contrast.

Further, mobile terminal of the voice browser based on android system, development platform are JDK 8.0+ Eclipse 4.7.0+Android SDK 4.0.3。

Further, it is browser window above the interface of the voice browser, knows below for URL input frames and voice Other button.

Further, URL address input fields are clicked on, soft keyboard will be derived automatically from, for keying in network address.

Further, the speech recognition button is used for the speech identifying function for opening browser.

Brief description of the drawings

Fig. 1 is Voice command algorithm flow chart in the present invention；

Fig. 2 is activation software disk interface in the present invention；

Fig. 3 is to input web site interface in the present invention；

Fig. 4 completes interface for loading in the present invention；

Fig. 5 is to identify startup interface in the present invention；

Fig. 6 completes interface for " Down " instruction identification in the present invention；

Fig. 7 completes interface for " Up " instruction identification in the present invention；

Fig. 8 completes interface for " Backward " instruction identification in the present invention；

Fig. 9 completes interface for " Forward " instruction identification in the present invention；

Figure 10 respectively forms module relation diagram for a kind of voice browser in the present invention.

Embodiment

In order to which technical characteristic, purpose and the effect of the present invention is more clearly understood, now compares accompanying drawing and describe in detail The embodiment of the present invention, the given examples are served only to explain the present invention, is not intended to limit the scope of the present invention.

As shown in figure 1, a kind of page sound control method of browser of mobile terminal, is achieved by the steps of to browsing The page basic operation control of device：

S1, the voice of collection user's input；

S2, the speech language classification to identification user's input judge, Selection and call and the user language classification The speech database to match identifies voice, and speech database is used for stored voice message, by inputting language and characters content With making identification to input language and characters content after the voice messaging contrast in speech database；

Word content in the voice that S3, identification user input；

The mobile terminal of application is the mobile terminal based on android system, and it is JDK 8.0+ to realize Platform Designing Eclipse 4.7.0+Android SDK 4.0.3.The speech SDK of carrying is that the voice that University of Science and Technology's news fly and Google provides is opened Give out a contract for a project.The word content in the voice of identification user's input can be realized and the language and characters content response by identifying browses The function of the page basic operation of device.The specifically used mode of voice browser is as follows.

First, network address is inputted

1st, URL address input fields are clicked on, soft keyboard, such as Fig. 2 will be derived automatically from using behind interface into voice browser It is shown.

2nd, example network address http is keyed in://www.baidu.com, as shown in Figure 3.

3rd, soft keyboard is packed up, GO network address index buttons is clicked, click event monitor corresponding to triggering retrieval, opens webpage http://www.baidu.com, as shown in Figure 4.

2nd, voice command performs case demonstration

The present embodiment will be with language and characters content " Down ", " Up ", " Backward ", " Forward " conduct demonstrations.

1st, ASR_GO speech recognition buttons are clicked, the click event monitor of speech recognition corresponding to triggering, now backstage Available speech database is opened, as shown in Figure 5：

2nd, dictated voice " Down. " under relatively quiet environment, will be shown after the completion of identification in the form of the ejection by Toast The phonetic order that user is triggered last moment, and corresponding action is completed, as user " Down. " instructs institute as shown in Figure 6 The downslide bottom set action of triggering.

3rd, dictated voice " Up " under relatively quiet environment, use will be shown in the form of the ejection by Toast after the completion of identification The phonetic order that family last moment is triggered, and corresponding action is completed, as user " Up " instructs what is triggered as shown in Figure 7 Upper sliding top set action.

4th, dictated voice " Backward " under relatively quiet environment, by the form of the ejection by Toast after the completion of identification The phonetic order that is triggered last moment of display user, and complete corresponding action, as shown in Figure 8 as user " Backward " The triggered retrogressing loading action of instruction.

5th, dictated voice " Forward " under relatively quiet environment, will show after the completion of identification in the form of the ejection by Toast Show the phonetic order that user is triggered last moment, and complete corresponding action, as user " Forward " refers to as shown in Figure 9 The triggered advance loading action of order.

Figure 10 is referred to, each composition module relationship that above-mentioned voice browser includes is as shown in the figure：

Voice acquisition module：For gathering the voice of user's input；

The purpose of the present invention is that the special population of browser can not be operated for finger defect, and both hands are busy in operating, Also need to operation Other Instruments, (such as hospital doctor both hands think controller to the operating personnel of instrument simultaneously just in operation The image amplification and moving operation that table is shown, at this moment can be with Voice command instrument with regard to that can provide great convenience for doctor), there is provided The facility of electronic product is operated, is improved the quality of living and operating efficiency.Meanwhile the configuration of the voice browser is slim and graceful, optimizes Incompatibility problem present in general small-sized browser, while also solve situations such as error sudden strain of a muscle is moved back.Mistake of this project in test Cheng Zhong, for voice successful recognition rate up to 98%, the response time up to 70 milliseconds, can meet the primary demand of user.

The present invention is not only limited to above-mentioned embodiment, and persons skilled in the art are according to disclosed by the invention interior Hold, other a variety of embodiments can be used to implement the present invention, therefore, every design structure and think of using the present invention Road, some simple designs for changing or changing are done, both fall within the scope of protection of the invention.

Claims

1. the page sound control method of a kind of browser of mobile terminal, it is characterised in that comprise the following steps：

S1, the voice of collection user's input；

S2, to identification user input speech language classification judge, Selection and call and the user language classification phase The speech database matched somebody with somebody identifies voice；

Word content in the voice that S3, identification user input；

S4, the language and characters content identified is judged, if being present in speech database corresponding with the word content Instruction, then browser is controlled to perform corresponding instruction, if being instructed in the absence of corresponding, this is not responding to the word identified Content；

2. the page sound control method of a kind of browser of mobile terminal according to claim 1, it is characterised in that described Speech database is used for stored voice message, after inputting the voice messaging contrast in language and characters content and speech database Identification is made to input language and characters content.

3. the page sound control method of a kind of browser of mobile terminal according to claim 1, it is characterised in that described The page basic operation of browser includes upper and lower rolling, advance loading and retrogressing loading.

4. a kind of page sound control method of described browser of mobile terminal according to claim 1, its feature exist In the speech SDK that the speech SDK of carrying flies for University of Science and Technology's news and Google provides.

5. the page sound control method of a kind of browser of mobile terminal according to claim 1, it is characterised in that described Mobile terminal is the mobile terminal based on android system, and it is JDK 8.0+Eclipse 4.7.0+Android to realize platform SDK 4.0.3。

6. a kind of voice browser, it is characterised in that comprising with lower module：

Voice acquisition module：For gathering the voice of user's input；

Sound identification module：For judging the speech language classification of identification user's input, Selection and call and the use Speech database that family language category matches identifies voice；

Speech database：For stored voice message, by inputting language and characters content and the voice messaging in speech database Identification is made to input language and characters content after contrast.

7. a kind of voice browser according to claim 6, it is characterised in that the voice browser is based on Android systems The mobile terminal of system, development platform are JDK 8.0+Eclipse 4.7.0+Android SDK 4.0.3.

A kind of 8. described mobile terminal sound browser according to claim 6, it is characterised in that the voice browse It is browser window above the interface of device, is URL input frames and speech recognition button below.

9. described URL input frames according to claim 6, it is characterised in that URL address input fields are clicked on, will be automatic Soft keyboard is exported, for keying in network address.

10. described speech recognition button according to claim 6, it is characterised in that the speech recognition button is used for Open the speech identifying function of browser.