CN106683675A - Control method and voice operating system - Google Patents
Control method and voice operating system Download PDFInfo
- Publication number
- CN106683675A CN106683675A CN201710063092.1A CN201710063092A CN106683675A CN 106683675 A CN106683675 A CN 106683675A CN 201710063092 A CN201710063092 A CN 201710063092A CN 106683675 A CN106683675 A CN 106683675A
- Authority
- CN
- China
- Prior art keywords
- keyword
- word
- voice
- operating system
- control method
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/0482—Interaction with lists of selectable items, e.g. menus
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04842—Selection of displayed objects or displayed text elements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Abstract
The invention provides a method for achieving voice operation on an operating system, which is based on semantic keywords and representative keywords, wherein the representative keywords are particularly suitable for utilizing colors. Corresponding words or figures are marked on corresponding elements. And the location and selection operation of the different elements in the window is carried out by utilizing the representative keywords which replace a mouse. A voice operating system and an application program based on the method are further disclosed.
Description
Technical field
The present invention proposes a kind of method of the full voice of simple realization on an operating system operation, and based on the method
Operating system and software.
Background technology
With the development and perfection of speech recognition technology, speech recognition gradually enters into the life of people, or even starts giving birth to
Leading position is occupied in work, however, speech recognition technology still faces many difficult problems, the foot that it incorporates people's life is hindered
Step, so far all neither one can carry out completely the operating system of all operations, the Brilliant Eyes several years ago cut a dash with voice
Although mirror can realize many operations with voice, also there are many operations to need just to be completed with reference to gesture.
The content of the invention
The present invention propose it is a kind of can full voice operation control method and based on the voice operating system of the method, application
Program.
The technical solution adopted in the present invention is:All the elements of the present invention are all based on by semantic keywords and represent word key
The voice operating method that word is constituted, by semantic keywords prompt operation is carried out, and is represented word keyword and be substantially carried out positioning action.
The invention has the beneficial effects as follows:One is that the operating system can be to be operated using full voice mode, such TV
Class inconvenience is mobile and operating distance electronic product farther out is more convenient to operate, and it is intelligent to be more beneficial for it;Two is available
The modes of operation such as touch screen, mouse, keyboard are aided with voice operating makes the operation more convenient and efficient of electronic product;Three are advantageous for
The fusion unification of various families or personal consumption electronic product operating system, is conducive to interconnecting convenient for people's lives;
Four are advantageous for individual privacy, office voice, and personal aspect in public may using the semantic Voice command of normal tape
Cause individual privacy preference etc. to reveal, be equally likely to cause other people puzzlement, but if that is just big using colour coding Voice command
Make a world of difference, be also in office in the same manner, when especially people is more, using colour coding Voice command it is more convenient, also more like
In office, so office also should be more efficiently;Five is that the method that highlights of the present invention is more recreational, and left-hand seat more holds
Easily;Six is the lifting that the reduction of required speech recognition content can undoubtedly bring recognition accuracy, reduces various system resources and takes,
So bring less cost to include the input that personal less economy is paid and all sectors of society is less.
Specific embodiment
The control method of voice operating system described in the claims in the present invention 1, it is based on semantic keywords and represents word pass
Key word is controlled, and to substitute Mus table, touch screen different elements or all elements in form are carried out(It is referred to as control in specialty, but goes back
Including some less elements in control, the such as each option in multiselect frame control)Selected operation, single hit and dblclick and right button it is quick
Menu operation, other menus etc. are operated, should explain, the full voice operating system which is realized, only include daily exhausted
Most reference, but do not include motion track, the interior more specifically certain point etc. of inseparable control element for needing to record mouse
The operation that substituted with colour coding of inconvenience, the main application of this generic operation has drawing, existing game etc., therefore in view of full voice
The limitation of operating system, the present invention should not necessarily be limited to only can voice operating electronic product operating system, and should can also lead
In being applied to the system of the voice dual-purpose such as mouse, touch screen, keyboard.Keyword is not limited only to a word in above-mentioned keyword,
Can also be multiple words, semantic keywords are all used for the behaviour of the semanteme correlation for performing the keyword as common sound control product
Make, such as voice " opening my computer " " stickup " " deletion ", and represent word keyword, such as numeral, color, Chinese era sequence
Row, familiar things etc., the element in form individually or is in combination referred to their title, and on each element this is identified with
The respective markers word or figure of the representative word keyword of element, such as:If certain element represents word keyword as " Fructus Mali pumilae ", at this
There should be an image Fructus Mali pumilae labelling on element;If certain element colour coding represents word keyword as " redness ", should have on the element
One little red filled circle marker;If the colour coding keyword of certain element is " red blue white ", one is sequentially identified on the element
Individual little red Filled Rectangle, blue Filled Rectangle and white Filled Rectangle.Because color represents word advantage a lot, such as long distance
From easily distinguishing, the sign of form interior element takes up room can be less, and have much just be able to can be represented with individual character
Color, deals with more convenient, therefore hereafter only illustrates using color as word keyword is represented.
Common color is such as:The purple white grey black palm fibre metal powder of the yellowish green ultramarine of reddish orange, in addition with some individual characters generation is can be used to
The color of table difference depth degree is such as:Red, black, lead, Zhu, green, grey, tender, profound etc., here does not continue to enumerate, and these are basic
Enough routine use, totally 22 colour coding keywords listed above, 22 totally 484 kinds of combinations are multiplied by if combination of two for 22, if three
Individual one group is that trichroism coding is then multiplied by 22 totally 10648 kinds of combinations for 484, it is seen that with single double-colored unit that can be met in being normally applied
Prime number amount, and with it is trichroism carry out big scale of construction text editing i.e. number of words be more when start word for word to carry out colour coding coding from first character to refer to
In generation, can be encoded with paging when not enough, the application of other big amount of element can also be carried out in the same manner, when such as amplifying certain region, use one
Individual special amplification window, is fabricated to coordinate system and screen is divided into into some little regular regions with colour coding, and on each region
Identify corresponding colour coding labelling, voice selecting region;For another example electrical form, equally can be entered using trichroism to each cell
Row coding.Each application can choose when using single double-colored mark, when use trichroism mark, Huo Zhegeng according to own situation
The method of polychromatic combination is identified to all elements in form, it is possible to arrange the global language for enabling or locally enabling in addition
Adopted keyword(Or represent word keyword)Carry out quick voice operating.
Generally represent word keyword only effective to current active window or current active program, naturally it is also possible to pre-
Some are stayed to represent the operation that word keyword carries out cross-window or system is global, program is global.
With regard to the voice operating system described in the claims in the present invention 3, help because current many systems are all proposed voice
Handss, diversified service is pushed by the server of distal end to user, and the voice assistant of some systems can also carry out
Locally applied content, but the support that application is carried to system is substantially, lack the extensive support to applying, trace it to its cause also
The restriction of current speech technology of identification level of development, however the voice operating system based on operational approach of the present invention if
To represent word keyword whole system operatios and whole visual elements are carried out by coding and referred to, while to the conventional operation of minority with language
Adopted keyword is substituted, and is so greatly lowered required level of technical sophistication, such as speech recognition core in the market
In piece, ability is most front can be identified to the 50 of unspecified person groups of words, and above listed only 22 color codes are just
The extensive covering to system all operations can be met, residue also has 28 groups of words to be available for system or application assigned to use,
Along with many times colour coding keyword is that comparison is had more than needed, it is also possible to some prompt operations are set to by it, so as to use
Family possesses abundant selectivity.Based on such theory, all of Voice command keyword, including the global keyword for enabling and
The keyword that local enables, and function, using method of the keyword in each application, all should be managed by systematic unity and be divided
Match somebody with somebody, and all should be recorded that and manage in same file or file system, program, registration table, data base, application program or be
System can as needed increase, delete or change keyword and function, the using method of the keyword, or referred to as all answer
Same voice-controlled operations specification should be all followed with program including system.
With regard to the voice operating system described in claim 4, reason represents the particularity of the voice system of word composition, its language
Sound control command is not easy to be remembered, therefore should have a voiced keyword navigation sidebar, and full frame at any time or non-full screen display can
The related command of selected element in voice command, including global command, local command and form.And these available orders
Be required for inquiry system file just can obtain, i.e., the unification described in claim 2 record all Voice command keywords and its
Function, the file of using method or file system, program, registration table, data base.
With regard to the language of the voice operating system described in claim 5, the system being typically identified with isolated word, word and word
Generally require interval a bit of time between sound input, this short time may make the uneasy agitation of user, therefore this
The bright interaction that increased with user, more adds a recreational, i.e., often identifying form after one represents word keyword
The mark content that respective identification Duan Yuyi of the representative word mark of interior all elements is identified is matched, and will be matched correct
The representative word mark of element, highlights one labelling of a labelling, or a whole labelling is integrally highlighted, i.e., with
The representative word mark that family phonetic entry represents word keyword and is not yet input into the element that will be possible to match when finishing is dashed forward
Go out to show.For example, certain element represents word keyword for " red green " in form, and user's phonetic entry successively, user speech is input into
After " red ", system identification goes out " red " and all first colour codings in form is designated into the first red colour coding mark of red element
Remember row into highlight, such user reduces unnecessary trouble it can be identified that system does not recognize mistake, and user is again
Phonetic entry " green ", system identification goes out " green " and is matched by red green mark of front two, to be grasped so as to match user
The object of work simultaneously highlights the object.
With regard to the Voice command application program described in claim 5, due to above-described voice operating system it is mainly special
It is that the element in form has been carried out representing word to encode and do respective markers to levy, and the method is simply easily realized, or even many applications
It is as crucial in the title that required voice-operated element is occurred in program code is substituted for voice as long as changing less
Word, in respective element subscript respective markers are known, and are added voice driven program and are installed speech recognition hardware module(Or such as mobile phone
On using offline speech recognition)Voice command thus substantially can be just completed, therefore generation is used using interior variety classes element
Literary name keyword has carried out the application program of the operation that coding and voice carry out the element should belong to scope of the invention.
Claims (6)
1. a kind of control method of voice operating system, is controlled using the semantic keyword of band, be it is characterized in that:Also there is generation
Literary name keyword, Unified coding is carried out with word keyword is represented to the different types of element in form, and in respective element
Do respective markers, with using represent word keyword carry out visual elements in form voice select operation.
2., based on the control method described in claim 1, it is characterized in that:Described represents word keyword to represent the word of color.
3. the voice operating system based on control method described in claim 1,2, is characterized in that:All application packages in system
Include system and all follow same voice-controlled operations specification, i.e., all Voice command keywords, including the global keyword for enabling
The keyword enabled with local, and function, using method of the keyword in each application, are all recorded and manage in same text
In part or file system, program, registration table, data base, application program and system can as needed increase, delete or change pass
The function of key word and the keyword, using method.
4. the voice operating system based on claim 3, is characterized in that:The system has a voiced keyword navigation side
Hurdle, the full frame at any time or available voice command of non-full screen display, including institute's chosen elements in global command, local command and form
Associative operation order.
5. the voice operating system of claim 1,2 is based on, and its system can be identified based on isolated word, and keyword is two words
And two it is more than word when, be identified according to the sequencing of word, it is characterized in that:Above-mentioned keyword is often known to represent word keyword
Do not go out one represent word keyword will in form respective identification Duan Yuyi of the representative word mark of all elements identify it is complete
Portion's mark content is matched, and will match the representative word mark of correct element, highlights one labelling of a labelling,
Or integrally highlight a whole labelling, i.e., will be all when user speech input represents word keyword and is not yet input into and finishes
The representative word mark of the element that may be matched is highlighted.
6. based on control method described in claim 1,2 can Voice command application program, it is characterized in that:Using interior variety classes
Element using representing, word keyword has carried out coding and voice carries out the operation of the element.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710063092.1A CN106683675A (en) | 2017-02-08 | 2017-02-08 | Control method and voice operating system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710063092.1A CN106683675A (en) | 2017-02-08 | 2017-02-08 | Control method and voice operating system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106683675A true CN106683675A (en) | 2017-05-17 |
Family
ID=58860267
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710063092.1A Pending CN106683675A (en) | 2017-02-08 | 2017-02-08 | Control method and voice operating system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106683675A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108172222A (en) * | 2017-12-08 | 2018-06-15 | 石化盈科信息技术有限责任公司 | A kind of workbench voice control measures and procedures for the examination and approval and system |
CN108769638A (en) * | 2018-07-25 | 2018-11-06 | 京东方科技集团股份有限公司 | A kind of control method of projection, device, projection device and storage medium |
CN110007826A (en) * | 2019-04-12 | 2019-07-12 | 深圳市语芯维电子有限公司 | The mobile method and apparatus of voice control cursor |
CN110691160A (en) * | 2018-07-04 | 2020-01-14 | 青岛海信移动通信技术股份有限公司 | Voice control method and device and mobile phone |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101557651A (en) * | 2008-04-08 | 2009-10-14 | Lg电子株式会社 | Mobile terminal and menu control method thereof |
CN101557432A (en) * | 2008-04-08 | 2009-10-14 | Lg电子株式会社 | Mobile terminal and menu control method thereof |
CN103869931A (en) * | 2012-12-10 | 2014-06-18 | 三星电子(中国)研发中心 | Method and device for controlling user interface through voice |
CN103970839A (en) * | 2014-04-24 | 2014-08-06 | 四川长虹电器股份有限公司 | Method for controlling webpage browsing through voice |
CN104461346A (en) * | 2014-10-20 | 2015-03-25 | 天闻数媒科技(北京)有限公司 | Method and device for visually impaired people to touch screen and intelligent touch screen mobile terminal |
CN104965596A (en) * | 2015-07-24 | 2015-10-07 | 上海宝宏软件有限公司 | Voice control system |
CN105513594A (en) * | 2015-11-26 | 2016-04-20 | 许传平 | Voice control system |
-
2017
- 2017-02-08 CN CN201710063092.1A patent/CN106683675A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101557651A (en) * | 2008-04-08 | 2009-10-14 | Lg电子株式会社 | Mobile terminal and menu control method thereof |
CN101557432A (en) * | 2008-04-08 | 2009-10-14 | Lg电子株式会社 | Mobile terminal and menu control method thereof |
CN103869931A (en) * | 2012-12-10 | 2014-06-18 | 三星电子(中国)研发中心 | Method and device for controlling user interface through voice |
CN103970839A (en) * | 2014-04-24 | 2014-08-06 | 四川长虹电器股份有限公司 | Method for controlling webpage browsing through voice |
CN104461346A (en) * | 2014-10-20 | 2015-03-25 | 天闻数媒科技(北京)有限公司 | Method and device for visually impaired people to touch screen and intelligent touch screen mobile terminal |
CN104965596A (en) * | 2015-07-24 | 2015-10-07 | 上海宝宏软件有限公司 | Voice control system |
CN105513594A (en) * | 2015-11-26 | 2016-04-20 | 许传平 | Voice control system |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108172222A (en) * | 2017-12-08 | 2018-06-15 | 石化盈科信息技术有限责任公司 | A kind of workbench voice control measures and procedures for the examination and approval and system |
CN110691160A (en) * | 2018-07-04 | 2020-01-14 | 青岛海信移动通信技术股份有限公司 | Voice control method and device and mobile phone |
CN108769638A (en) * | 2018-07-25 | 2018-11-06 | 京东方科技集团股份有限公司 | A kind of control method of projection, device, projection device and storage medium |
CN110007826A (en) * | 2019-04-12 | 2019-07-12 | 深圳市语芯维电子有限公司 | The mobile method and apparatus of voice control cursor |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1647023B (en) | Voice-controlled data entry | |
CN100409173C (en) | Voice-controlled user interfaces | |
US7752563B2 (en) | Enabling a user to select multiple objects in a document | |
CN106683675A (en) | Control method and voice operating system | |
US6122606A (en) | System and method for enhancing human communications | |
RU2650029C2 (en) | Method and apparatus for controlling application by handwriting image recognition | |
US7389236B2 (en) | Navigation and data entry for open interaction elements | |
CN108829435A (en) | A kind of image labeling method and general image annotation tool | |
US20020077832A1 (en) | Computer based integrated text/graphic document analysis | |
CN109614845A (en) | Manage real-time handwriting recognition | |
WO2002033578A2 (en) | Dynamically displaying current status of tasks | |
CN104424180B (en) | Text entry method and equipment | |
EP1946227A1 (en) | Method and system for entering and entrieving content from an electronic diary | |
KR20090058409A (en) | Method and system for providing and using editable personal dictionary | |
EP1445707B1 (en) | System and method for checking and resolving publication design problems | |
Ghidini et al. | Developing apps for visually impaired people: Lessons learned from practice | |
CN105100372B (en) | Minutes method and mobile terminal | |
CN113901186A (en) | Telephone recording marking method, device, equipment and storage medium | |
JPH0991380A (en) | Device and method for information, and storage medium | |
CN1629934A (en) | Building and using method of virtual speech keyboard for interactive control | |
CN111596883B (en) | Data visualization system supporting voice recognition and somatosensory operation remote control | |
CN109410939B (en) | Universal data maintenance method based on voice instruction set | |
Noh | Ralph Ellison's Computer Memory | |
CN106126093A (en) | A kind of input method based on dummy keyboard and system | |
CN117149027A (en) | Wallpaper generation method and device, electronic equipment and readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170517 |