CN107885481A - The page sound control method and voice browser of a kind of browser of mobile terminal - Google Patents

The page sound control method and voice browser of a kind of browser of mobile terminal Download PDF

Info

Publication number
CN107885481A
CN107885481A CN201711021099.3A CN201711021099A CN107885481A CN 107885481 A CN107885481 A CN 107885481A CN 201711021099 A CN201711021099 A CN 201711021099A CN 107885481 A CN107885481 A CN 107885481A
Authority
CN
China
Prior art keywords
browser
voice
language
mobile terminal
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711021099.3A
Other languages
Chinese (zh)
Inventor
李指明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Geosciences
Original Assignee
China University of Geosciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Geosciences filed Critical China University of Geosciences
Priority to CN201711021099.3A priority Critical patent/CN107885481A/en
Publication of CN107885481A publication Critical patent/CN107885481A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Abstract

The invention discloses the page sound control method and voice browser of a kind of browser of mobile terminal, is related to a kind of upper and lower rolling of Voice command browser page, advance the control method of basic operation and a kind of voice browser such as loading and retrogressing loading.The word content in the voice of user's input is identified first, afterwards by the page basic operation of the language and characters content response browser identified.Research/development platform is JDK 8.0+Eclipse 4.7.0+Android SDK 4.0.3, takes the speech recognition development kit by means of offers such as the winged, Googles of news of University of Science and Technology, the identification to user speech is realized on Android mobile intelligent terminals.The present invention can not operate the special population of browser for finger defect, and both hands are busy in operating, it is also necessary to operate the operating personnel of Other Instruments, instrument, there is provided operate the facility of electronic product, improve the quality of living and operating efficiency.Present invention can apply to mobile terminal intelligent sound control field.

Description

The page sound control method and voice browser of a kind of browser of mobile terminal
Technical field
The present invention relates to speech recognition and control, category Android intelligent controls APP research and development field, and in particular to Yi Zhongyi The page sound control method and voice browser of dynamic terminal browser.
Background technology
Speech recognition is a cross discipline.In the late two decades, speech recognition technology obtains marked improvement, starts from experiment Move towards market in room.It is contemplated that in coming 10 years, speech recognition technology will enter industry, household electrical appliances, communication, automotive electronics, doctor The every field such as treatment, home services, consumption electronic product.Field involved by speech recognition technology includes:Signal transacting, pattern Identification, probability theory and information theory, sound generating mechanism and hearing mechanism, artificial intelligence etc..
Browser refers to that web page server or HTML (the Hypertext Markup of file system can be shown Language, HTML) file content, and allow a kind of software of user and these file interactions.Web browser Mainly interacted simultaneously with web page server by HTTP (Hypertexttransfer protocol, hypertext transfer protocol) agreement Webpage is obtained, these webpages are referred to by URL (Uniform/Universal Resource Locator, URL) Fixed, file format is usually HTML, and by MIME (MultipurposeInternet Mail Extensions, Multi-function mutual Networking mail expands service) indicated in http protocol.
Current voice browser, such as Baidu and Google's browser, generally provide and the word of webpage is converted into voice certainly The dynamic speech identifying function read aloud with voice conversion text data, not using the basic operation of Voice command browser, as above The conventional basic browser operation such as lower slider, prevpage and the next page.
The content of the invention
The technical problems to be solved by the invention are, language is utilized for no existing for above-mentioned current voice browser The problem of page basic operation of sound control browser, the invention provides a kind of page Voice command of browser of mobile terminal Method and voice browser solve the above problems.
To achieve the above object, concrete scheme is as follows by the present invention.
The page sound control method of a kind of browser of mobile terminal, it is characterised in that comprise the following steps:
S1, the voice of collection user's input;
S2, the speech language classification to identification user's input judge, Selection and call and the user language classification The speech database that matches identifies voice;
Word content in the voice that S3, identification user input;
S4, the language and characters content identified is judged, if being present in speech database and the word content pair The instruction answered, then browser is controlled to perform corresponding instruction, if being instructed in the absence of corresponding, this is not responding to what this was identified Word content;
S5, pass through the page basic operation of the language and characters content response browser identified.
Further, the speech database is used for stored voice message, by inputting language and characters content and voice number Identification is made to input language and characters content after being contrasted according to the voice messaging in storehouse.
Further, the page basic operation of the browser includes upper and lower rolling, advancing loads and retreat loading.
Further, the speech SDK of carrying is the speech SDK that University of Science and Technology's news fly and Google provides.
Further, the mobile terminal is the mobile terminal based on android system, and it is JDK 8.0+ to realize platform Eclipse 4.7.0+Android SDK 4.0.3。
Also a kind of voice browser, it is characterised in that comprising with lower module:
Voice acquisition module:For gathering the voice of user's input;
Sound identification module:For judging the speech language classification of identification user's input, Selection and call and institute Speech database that user language classification matches is stated to identify voice;
Instruct judge module:For judging the language and characters content identified;
Instruct respond module:Page basic operation for the language and characters content response browser by identifying;
Speech database:For stored voice message, by inputting language and characters content and the voice in speech database Identification is made to input language and characters content after information contrast.
Further, mobile terminal of the voice browser based on android system, development platform are JDK 8.0+ Eclipse 4.7.0+Android SDK 4.0.3。
Further, it is browser window above the interface of the voice browser, knows below for URL input frames and voice Other button.
Further, URL address input fields are clicked on, soft keyboard will be derived automatically from, for keying in network address.
Further, the speech recognition button is used for the speech identifying function for opening browser.
Brief description of the drawings
Fig. 1 is Voice command algorithm flow chart in the present invention;
Fig. 2 is activation software disk interface in the present invention;
Fig. 3 is to input web site interface in the present invention;
Fig. 4 completes interface for loading in the present invention;
Fig. 5 is to identify startup interface in the present invention;
Fig. 6 completes interface for " Down " instruction identification in the present invention;
Fig. 7 completes interface for " Up " instruction identification in the present invention;
Fig. 8 completes interface for " Backward " instruction identification in the present invention;
Fig. 9 completes interface for " Forward " instruction identification in the present invention;
Figure 10 respectively forms module relation diagram for a kind of voice browser in the present invention.
Embodiment
In order to which technical characteristic, purpose and the effect of the present invention is more clearly understood, now compares accompanying drawing and describe in detail The embodiment of the present invention, the given examples are served only to explain the present invention, is not intended to limit the scope of the present invention.
As shown in figure 1, a kind of page sound control method of browser of mobile terminal, is achieved by the steps of to browsing The page basic operation control of device:
S1, the voice of collection user's input;
S2, the speech language classification to identification user's input judge, Selection and call and the user language classification The speech database to match identifies voice, and speech database is used for stored voice message, by inputting language and characters content With making identification to input language and characters content after the voice messaging contrast in speech database;
Word content in the voice that S3, identification user input;
S4, the language and characters content identified is judged, if being present in speech database and the word content pair The instruction answered, then browser is controlled to perform corresponding instruction, if being instructed in the absence of corresponding, this is not responding to what this was identified Word content;
S5, pass through the page basic operation of the language and characters content response browser identified.
The mobile terminal of application is the mobile terminal based on android system, and it is JDK 8.0+ to realize Platform Designing Eclipse 4.7.0+Android SDK 4.0.3.The speech SDK of carrying is that the voice that University of Science and Technology's news fly and Google provides is opened Give out a contract for a project.The word content in the voice of identification user's input can be realized and the language and characters content response by identifying browses The function of the page basic operation of device.The specifically used mode of voice browser is as follows.
First, network address is inputted
1st, URL address input fields are clicked on, soft keyboard, such as Fig. 2 will be derived automatically from using behind interface into voice browser It is shown.
2nd, example network address http is keyed in://www.baidu.com, as shown in Figure 3.
3rd, soft keyboard is packed up, GO network address index buttons is clicked, click event monitor corresponding to triggering retrieval, opens webpage http://www.baidu.com, as shown in Figure 4.
2nd, voice command performs case demonstration
The present embodiment will be with language and characters content " Down ", " Up ", " Backward ", " Forward " conduct demonstrations.
1st, ASR_GO speech recognition buttons are clicked, the click event monitor of speech recognition corresponding to triggering, now backstage Available speech database is opened, as shown in Figure 5:
2nd, dictated voice " Down. " under relatively quiet environment, will be shown after the completion of identification in the form of the ejection by Toast The phonetic order that user is triggered last moment, and corresponding action is completed, as user " Down. " instructs institute as shown in Figure 6 The downslide bottom set action of triggering.
3rd, dictated voice " Up " under relatively quiet environment, use will be shown in the form of the ejection by Toast after the completion of identification The phonetic order that family last moment is triggered, and corresponding action is completed, as user " Up " instructs what is triggered as shown in Figure 7 Upper sliding top set action.
4th, dictated voice " Backward " under relatively quiet environment, by the form of the ejection by Toast after the completion of identification The phonetic order that is triggered last moment of display user, and complete corresponding action, as shown in Figure 8 as user " Backward " The triggered retrogressing loading action of instruction.
5th, dictated voice " Forward " under relatively quiet environment, will show after the completion of identification in the form of the ejection by Toast Show the phonetic order that user is triggered last moment, and complete corresponding action, as user " Forward " refers to as shown in Figure 9 The triggered advance loading action of order.
Figure 10 is referred to, each composition module relationship that above-mentioned voice browser includes is as shown in the figure:
Voice acquisition module:For gathering the voice of user's input;
Sound identification module:For judging the speech language classification of identification user's input, Selection and call and institute Speech database that user language classification matches is stated to identify voice;
Instruct judge module:For judging the language and characters content identified;
Instruct respond module:Page basic operation for the language and characters content response browser by identifying;
Speech database:For stored voice message, by inputting language and characters content and the voice in speech database Identification is made to input language and characters content after information contrast.
The purpose of the present invention is that the special population of browser can not be operated for finger defect, and both hands are busy in operating, Also need to operation Other Instruments, (such as hospital doctor both hands think controller to the operating personnel of instrument simultaneously just in operation The image amplification and moving operation that table is shown, at this moment can be with Voice command instrument with regard to that can provide great convenience for doctor), there is provided The facility of electronic product is operated, is improved the quality of living and operating efficiency.Meanwhile the configuration of the voice browser is slim and graceful, optimizes Incompatibility problem present in general small-sized browser, while also solve situations such as error sudden strain of a muscle is moved back.Mistake of this project in test Cheng Zhong, for voice successful recognition rate up to 98%, the response time up to 70 milliseconds, can meet the primary demand of user.
The present invention is not only limited to above-mentioned embodiment, and persons skilled in the art are according to disclosed by the invention interior Hold, other a variety of embodiments can be used to implement the present invention, therefore, every design structure and think of using the present invention Road, some simple designs for changing or changing are done, both fall within the scope of protection of the invention.

Claims (10)

1. the page sound control method of a kind of browser of mobile terminal, it is characterised in that comprise the following steps:
S1, the voice of collection user's input;
S2, to identification user input speech language classification judge, Selection and call and the user language classification phase The speech database matched somebody with somebody identifies voice;
Word content in the voice that S3, identification user input;
S4, the language and characters content identified is judged, if being present in speech database corresponding with the word content Instruction, then browser is controlled to perform corresponding instruction, if being instructed in the absence of corresponding, this is not responding to the word identified Content;
S5, pass through the page basic operation of the language and characters content response browser identified.
2. the page sound control method of a kind of browser of mobile terminal according to claim 1, it is characterised in that described Speech database is used for stored voice message, after inputting the voice messaging contrast in language and characters content and speech database Identification is made to input language and characters content.
3. the page sound control method of a kind of browser of mobile terminal according to claim 1, it is characterised in that described The page basic operation of browser includes upper and lower rolling, advance loading and retrogressing loading.
4. a kind of page sound control method of described browser of mobile terminal according to claim 1, its feature exist In the speech SDK that the speech SDK of carrying flies for University of Science and Technology's news and Google provides.
5. the page sound control method of a kind of browser of mobile terminal according to claim 1, it is characterised in that described Mobile terminal is the mobile terminal based on android system, and it is JDK 8.0+Eclipse 4.7.0+Android to realize platform SDK 4.0.3。
6. a kind of voice browser, it is characterised in that comprising with lower module:
Voice acquisition module:For gathering the voice of user's input;
Sound identification module:For judging the speech language classification of identification user's input, Selection and call and the use Speech database that family language category matches identifies voice;
Instruct judge module:For judging the language and characters content identified;
Instruct respond module:Page basic operation for the language and characters content response browser by identifying;
Speech database:For stored voice message, by inputting language and characters content and the voice messaging in speech database Identification is made to input language and characters content after contrast.
7. a kind of voice browser according to claim 6, it is characterised in that the voice browser is based on Android systems The mobile terminal of system, development platform are JDK 8.0+Eclipse 4.7.0+Android SDK 4.0.3.
A kind of 8. described mobile terminal sound browser according to claim 6, it is characterised in that the voice browse It is browser window above the interface of device, is URL input frames and speech recognition button below.
9. described URL input frames according to claim 6, it is characterised in that URL address input fields are clicked on, will be automatic Soft keyboard is exported, for keying in network address.
10. described speech recognition button according to claim 6, it is characterised in that the speech recognition button is used for Open the speech identifying function of browser.
CN201711021099.3A 2017-10-26 2017-10-26 The page sound control method and voice browser of a kind of browser of mobile terminal Pending CN107885481A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711021099.3A CN107885481A (en) 2017-10-26 2017-10-26 The page sound control method and voice browser of a kind of browser of mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711021099.3A CN107885481A (en) 2017-10-26 2017-10-26 The page sound control method and voice browser of a kind of browser of mobile terminal

Publications (1)

Publication Number Publication Date
CN107885481A true CN107885481A (en) 2018-04-06

Family

ID=61782636

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711021099.3A Pending CN107885481A (en) 2017-10-26 2017-10-26 The page sound control method and voice browser of a kind of browser of mobile terminal

Country Status (1)

Country Link
CN (1) CN107885481A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113053384A (en) * 2021-04-20 2021-06-29 五八到家有限公司 APP voice control method and system and computer equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100237385B1 (en) * 1997-08-05 2000-01-15 정선종 The Implementation Method of Speech Recognizer on the Web Browser
CN201402461Y (en) * 2008-12-30 2010-02-10 同济大学第一附属中学 Sound-control web-page browser and voice control module thereof
CN102520792A (en) * 2011-11-30 2012-06-27 江苏奇异点网络有限公司 Voice-type interaction method for network browser
CN103577444A (en) * 2012-07-30 2014-02-12 腾讯科技(深圳)有限公司 Browser control method and system
CN106228974A (en) * 2016-08-19 2016-12-14 镇江惠通电子有限公司 Control method based on speech recognition, Apparatus and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100237385B1 (en) * 1997-08-05 2000-01-15 정선종 The Implementation Method of Speech Recognizer on the Web Browser
CN201402461Y (en) * 2008-12-30 2010-02-10 同济大学第一附属中学 Sound-control web-page browser and voice control module thereof
CN102520792A (en) * 2011-11-30 2012-06-27 江苏奇异点网络有限公司 Voice-type interaction method for network browser
CN103577444A (en) * 2012-07-30 2014-02-12 腾讯科技(深圳)有限公司 Browser control method and system
CN106228974A (en) * 2016-08-19 2016-12-14 镇江惠通电子有限公司 Control method based on speech recognition, Apparatus and system

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113053384A (en) * 2021-04-20 2021-06-29 五八到家有限公司 APP voice control method and system and computer equipment

Similar Documents

Publication Publication Date Title
EP3557406B1 (en) Device and method for performing functions
CN100478923C (en) System and method for concurrent multimodal communication session persistence
JP6305033B2 (en) Method and system for providing a multi-user messenger service
US20140088970A1 (en) Method and device for user interface
CN103973771B (en) Method and system for sharing part Web page
CN107992587A (en) A kind of voice interactive method of browser, device, terminal and storage medium
CN110297679A (en) For providing the equipment, method and graphic user interface of audiovisual feedback
CN101291336A (en) System and method for concurrent multimodal communication
KR20170014353A (en) Apparatus and method for screen navigation based on voice
KR20090090613A (en) System and method for multimodal conversational mode image management
CN106527945A (en) Text information extracting method and device
CN109448709A (en) A kind of terminal throws the control method and terminal of screen
CN107479400A (en) Control method, device, home appliance and the readable storage medium storing program for executing of home appliance
CN106775647A (en) A kind of control method of mobile terminal, control device and mobile terminal
CN107479783A (en) A kind of picture upload method and terminal
CN110047484A (en) A kind of speech recognition exchange method, system, equipment and storage medium
CN110389807A (en) A kind of interface interpretation method, device, electronic equipment and storage medium
CN108829686A (en) Translation information display methods, device, equipment and storage medium
KR102367132B1 (en) Device and method for performing functions
CN103970839A (en) Method for controlling webpage browsing through voice
Barlott et al. Increasing participation in the information society by people with disabilities and their families in lower-income countries using mainstream technologies
CN109817204A (en) Voice interactive method and device, electronic equipment, readable storage medium storing program for executing
EP2590393A1 (en) Service server device, service provision method, and service provision program
CN107885481A (en) The page sound control method and voice browser of a kind of browser of mobile terminal
CN116701811B (en) Webpage processing method, device, equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180406

WD01 Invention patent application deemed withdrawn after publication