CN106356059A - Voice control method, device and projector - Google Patents

Voice control method, device and projector Download PDF

Info

Publication number
CN106356059A
CN106356059A CN201510424654.1A CN201510424654A CN106356059A CN 106356059 A CN106356059 A CN 106356059A CN 201510424654 A CN201510424654 A CN 201510424654A CN 106356059 A CN106356059 A CN 106356059A
Authority
CN
China
Prior art keywords
phonetic order
speech recognition
instruction
state
projector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510424654.1A
Other languages
Chinese (zh)
Inventor
朱渊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201510424654.1A priority Critical patent/CN106356059A/en
Priority to PCT/CN2016/090170 priority patent/WO2017012511A1/en
Publication of CN106356059A publication Critical patent/CN106356059A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/04Programme control other than numerical control, i.e. in sequence controllers or logic controllers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

The invention provides a voice control method, a device and a projector. The method includes the steps: determining that the projector enters a voice recognition state; receiving an inputted voice instruction; executing operation corresponding to the voice instruction according to the received voice instruction. The voice recognition state is a state for executing the operation according to the voice instruction. The problem of poor user experience due to complex manual operation of the projector in related technology is solved, operation complexity of the projector is reduced, and user experience is improved.

Description

Sound control method, device and projector apparatus
Technical field
The present invention relates to the communications field, in particular to a kind of sound control method, device and projector apparatus.
Background technology
Projector, also known as scialyscope, is a kind of equipment that can be by image or VIDEO PROJECTION on curtain, can be by not The same computer of interface together, video disc (video compact disc, referred to as vcd), digital video disk (digital Video disc, referred to as dvd), game machine etc. be connected, play corresponding video signal, projector is extensively applied In family, office, school and public place of entertainment, according to the difference of applied environment, projector can be divided into several classes as follows: Home theater type, portable business type projector, educational conference type projector, main flow Engineering-type projector, professional theater type Projector, measuring projector.
These projectors have the characteristics that one common it is simply that operate these projectors when, need manual controller operate, And manual operation can cause the problem of complex operation, thus leading to poor user experience, lacking interest.
For complex operation during manual operation projector present in correlation technique, lead to the problem of poor user experience, at present Effective solution is not yet proposed.
Content of the invention
The invention provides a kind of sound control method, device and projector apparatus, at least to solve to exist in correlation technique Manual operation projector when complex operation, lead to the problem of poor user experience.
According to an aspect of the invention, it is provided a kind of sound control method, comprising: determine that projector apparatus enter language Sound identifies state, and wherein, described speech recognition state is the state according to phonetic order execution operation;The language of receives input Sound instructs;Described phonetic order according to receiving executes operation corresponding with described phonetic order.
Optionally it is determined that projector apparatus enter speech recognition state comprises determining that described projector apparatus are called out by reception The mode of awake instruction, enters described speech recognition state, and wherein, described wake-up instruction includes at least one of: makes a reservation for The touching signals of track, voice signal, push button signalling.
Alternatively, included according to the described phonetic order execution operation corresponding with described phonetic order receiving: judge whether It is previously stored with the instruction mated with described phonetic order;In the case of being to be in judged result, execution is referred to described voice Make corresponding operation.
Alternatively, before according to the described phonetic order execution operation corresponding with described phonetic order receiving, also include: Obtain the Apply Names of the file name of file prestoring and/or preassembled application;Store described file name And/or described Apply Names, wherein, described file name is used for being called and described file name according to described phonetic order Corresponding file, described Apply Names is used for calling application corresponding with described Apply Names according to described phonetic order.
Alternatively, described projector apparatus are supported to receive described phonetic order by ancillary equipment, and wherein, described periphery sets Standby inclusion at least one of: wired earphone, bluetooth earphone.
According to a further aspect in the invention, there is provided a kind of phonetic controller, comprising: determining module, throw for determining Shadow instrument equipment enters speech recognition state, and wherein, described speech recognition state is the state according to phonetic order execution operation; Receiver module, for the phonetic order of receives input;Performing module, for according to the described phonetic order execution receiving with The corresponding operation of described phonetic order.
Alternatively, described determining module comprises determining that unit, for determine described projector apparatus by receive wake-up refer to The mode of order, enters described speech recognition state, and wherein, described wake-up instruction includes at least one of: desired trajectory Touching signals, voice signal, push button signalling.
Alternatively, described performing module includes: judging unit, is used for judging whether to be previously stored with and described phonetic order The instruction of coupling;Performance element, for the judged result in described judging unit for, in the case of being, executing and institute's predicate Sound instructs corresponding operation.
Alternatively, described device also includes: acquisition module, for obtain the file name of file prestoring and/or The Apply Names of preassembled application;Memory module, for storing described file name and/or described Apply Names, Wherein, described file name is used for calling file corresponding with described file name, described application according to described phonetic order Title is used for calling application corresponding with described Apply Names according to described phonetic order.
Alternatively, described projector apparatus are supported to receive described phonetic order by ancillary equipment, and wherein, described periphery sets Standby inclusion at least one of: wired earphone, bluetooth earphone.
According to a further aspect in the invention, there is provided a kind of projector apparatus, described equipment at least includes: low-power consumption wakes up Chip, speech engine and normal stream assembly, wherein, described low-power consumption wakes up chip and is used for entering voice according to wake-up instruction Identification state, wherein, described speech recognition state is the state according to phonetic order execution operation;Described speech engine is used Phonetic order in receives input;Described normal stream assembly is used for according to the described phonetic order execution receiving and described voice Instruct corresponding operation.
By the present invention, using determining projector apparatus entrance speech recognition state, wherein, described speech recognition state is State according to phonetic order execution operation;The phonetic order of receives input;According to the described phonetic order execution receiving with The corresponding operation of described phonetic order, complex operation when solving manual operation projector present in correlation technique, lead to Poor user experience, and then reached reduction projector operation complexity, improve the effect of Consumer's Experience.
Brief description
Accompanying drawing described herein is used for providing a further understanding of the present invention, constitutes the part of the application, the present invention Schematic description and description be used for explaining the present invention, do not constitute inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 is the flow chart of sound control method according to embodiments of the present invention;
Fig. 2 is the structured flowchart of phonetic controller according to embodiments of the present invention;
Fig. 3 is the structured flowchart of determining module 22 in phonetic controller according to embodiments of the present invention;
Fig. 4 is the structured flowchart of performing module 26 in phonetic controller according to embodiments of the present invention;
Fig. 5 is the preferred structure block diagram of phonetic controller according to embodiments of the present invention;
Fig. 6 is the structured flowchart of voice control projection instrument system according to embodiments of the present invention;
Fig. 7 is that the low-power consumption of voice control projection instrument system according to embodiments of the present invention wakes up flow chart;
Fig. 8 is the working state figure of voice control projection instrument system according to embodiments of the present invention.
Specific embodiment
To describe the present invention in detail below with reference to accompanying drawing and in conjunction with the embodiments.It should be noted that in the feelings do not conflicted Under condition, the embodiment in the application and the feature in embodiment can be mutually combined.
It should be noted that term " first " in description and claims of this specification and above-mentioned accompanying drawing, " second " Etc. being for distinguishing similar object, without for describing specific order or precedence.
Provide a kind of sound control method in the present embodiment, Fig. 1 is sound control method according to embodiments of the present invention Flow chart, as shown in figure 1, this flow process comprises the steps:
Step s102, determines that projector apparatus enter speech recognition state, wherein, this speech recognition state is according to language The state of sound instruction execution operation;
Step s104, the phonetic order of receives input;
Step s106, executes operation corresponding with above-mentioned phonetic order according to the phonetic order receiving.
By above-mentioned steps, when operating projector apparatus, projector apparatus can be operated by phonetic order, thus can To avoid manual tedious steps, complex operation when solving manual operation projector present in correlation technique, lead Cause poor user experience, and then reached reduction projector operation complexity, improve the effect of Consumer's Experience.
In an optional embodiment, determine that projector apparatus enter speech recognition state and comprise determining that this projector sets By way of the standby wake-up instruction by reception, enter above-mentioned speech recognition state, wherein, below this wake-up instruction inclusion at least One of: the touching signals of desired trajectory, voice signal, push button signalling.
In an optional embodiment, the above-mentioned phonetic order execution operation corresponding with phonetic order according to receiving includes: Judge whether to be previously stored with the instruction mated with above-mentioned phonetic order;In the case of being to be in judged result, execute and be somebody's turn to do The corresponding operation of phonetic order.Wherein, if being not stored in the instruction of above-mentioned phonetic order coupling, one can be fed back Information, the such as prompting of " this instruction of None- identified ".
In an optional embodiment, before the above-mentioned phonetic order execution operation corresponding with phonetic order receiving, Also include: obtain the Apply Names of the file name of file prestoring and/or preassembled application;Storage this article Part title and/or Apply Names, wherein, this document title is used for calling literary composition corresponding with file name according to phonetic order Part, this Apply Names is used for calling application corresponding with Apply Names according to phonetic order.Store above-mentioned file name and answer To easily corresponding file and application be called according to phonetic order with the purpose of title, when store new file or The Apply Names of the file name of file of this new storage and the application of this new installation after being mounted with new application, can be stored.
In an optional embodiment, above-mentioned projector apparatus are supported to receive above-mentioned phonetic order by ancillary equipment, its In, this ancillary equipment includes at least one of: wired earphone, bluetooth earphone.
Through the above description of the embodiments, those skilled in the art can be understood that according to above-described embodiment Method can realize by the mode of software plus necessary general hardware platform naturally it is also possible to pass through hardware, but a lot In the case of the former is more preferably embodiment.Based on such understanding, technical scheme is substantially in other words to existing Have what technology contributed partly can embody in the form of software product, this computer software product is stored in one In storage medium (as rom/ram, magnetic disc, CD), including some instructions with so that a station terminal equipment (can To be mobile phone, computer, server, or the network equipment etc.) method described in execution each embodiment of the present invention.
Additionally provide a kind of phonetic controller in the present embodiment, this device is used for realizing above-described embodiment and is preferable to carry out Mode, had carried out repeating no more of explanation.As used below, predetermined function can be realized in term " module " Software and/or hardware combination.Although the device described by following examples preferably to be realized with software, firmly Part, or the realization of the combination of software and hardware is also may and to be contemplated.
Fig. 2 is the structured flowchart of phonetic controller according to embodiments of the present invention, as shown in Fig. 2 this device includes really Cover half block 22, receiver module 24 and performing module 26, illustrate to this device below.
Determining module 22, for determining projector apparatus entrance speech recognition state, wherein, this speech recognition state is root State according to phonetic order execution operation;Receiver module 24, connects to above-mentioned determining module 22, for receives input Phonetic order;Performing module 26, connects to above-mentioned receiver module 24, for according to the phonetic order execution receiving with State the corresponding operation of phonetic order.
Fig. 3 is the structured flowchart of determining module 22 in phonetic controller according to embodiments of the present invention, as shown in figure 3, This determining module 22 includes determining unit 32, below this determining module 22 is illustrated.
Determining unit 32, for determining projector apparatus by way of receiving and waking up instruction, entrance speech recognition state, Wherein, this wake-up instruction includes at least one of: the touching signals of desired trajectory, voice signal, push button signalling.
Fig. 4 is the structured flowchart of performing module 26 in phonetic controller according to embodiments of the present invention, as shown in figure 4, This performing module 26 includes judging unit 42 and performance element 44, below this performing module 26 is illustrated:
Judging unit 42, for judging whether to be previously stored with the instruction mated with above-mentioned phonetic order;Performance element 44, Connect to above-mentioned judging unit 42, for the judged result in above-mentioned judging unit 42 in the case of being, executing and being somebody's turn to do The corresponding operation of phonetic order.
Fig. 5 is the preferred structure block diagram of phonetic controller according to embodiments of the present invention, as shown in figure 5, this device removes Outside including all modules shown in Fig. 2, also include acquisition module 52 and memory module 54, below this device is said Bright:
Acquisition module 52, for obtaining the file name of file prestoring and/or the application name of preassembled application Claim;Memory module 54, connects to above-mentioned acquisition module 52 and above-mentioned performing module 26, for storing above-mentioned file name And/or above-mentioned Apply Names, wherein, this document title is used for calling file corresponding with file name according to phonetic order, This Apply Names is used for calling application corresponding with Apply Names according to phonetic order.
Alternatively, above-mentioned projector apparatus are supported to receive phonetic order, wherein, this ancillary equipment bag by ancillary equipment Include at least one of: wired earphone, bluetooth earphone.
According to a further aspect in the invention, a kind of projector apparatus are additionally provided, this equipment at least includes: low-power consumption wakes up Chip, speech engine and normal stream assembly, wherein, this low-power consumption wakes up chip and is used for being known according to wake-up instruction entrance voice Other state, wherein, this speech recognition state is the state according to phonetic order execution operation;This speech engine is used for receiving The phonetic order of input;This normal stream assembly is used for executing operation corresponding with this phonetic order according to the phonetic order receiving. Wherein, above-mentioned low-power consumption wakes up chip and can connect with speech engine, and this speech engine can connect with normal stream assembly, Low-power consumption wakes up and can connect it is also possible to be not connected between chip and normal stream assembly.
In embodiments of the present invention, involved technology can comprise the following aspects:
1st, speech recognition technology:
Speech recognition technology, as current hot technology, has penetrated into every field, opens from " keyboard mutuality ", " touches The interactive mode of " interactive voice " is arrived in control interaction ", is that people's liberation both hands bring possibility with improving efficiency.
Speech recognition technology is also referred to as automatic speech recognition (automatic speech recognition, referred to as asr), Its target be by the vocabulary Content Transformation in the voice of the mankind be computer-readable input, such as button, binary coding Or character string.Different from Speaker Identification (speaker recognition) and speaker verification, the latter attempts identification Or confirm to send speaker rather than the vocabulary content included in it of voice.Speech recognition technology is exactly to allow machine to pass through to know Other and understanding process is changed into the high-tech of corresponding text or order voice signal.Speech recognition technology mainly includes spy Levy extractive technique, pattern match criterion and three aspects of model training technology.
Different according to the object of identification, voice recognition tasks substantially can be divided into 3 classes, i.e. isolated word recognition (isolated word Recognition), key word identification (or claiming keyword spotting, keyword spotting) and continuous speech recognition.Wherein, The task of isolated word recognition is the previously known isolated word of identification, such as " start ", " shutdown " etc.;Continuous speech recognition Task be then to identify arbitrary continuous speech, a such as sentence or one section of word;Keyword detection pin in continuous speech stream To be continuous speech, but it and the whole word of nonrecognition, and simply detect that known some key words wherein occur, As detection " computer ", " world " this two words in one section of word.
The speech recognition of isolated word can be adopted in embodiments of the present invention, the phonetic order supported will be needed to edit in advance Become grammar file, have engine compiling to generate corresponding identification range.User only supports to pre-define in grammer when using Instruction.
2nd, low-power consumption wakes up:
Low power consumption digital signal processor (digital signal processor, referred to as dsp) voice awakening technology refers to (i.e. central processing unit (central after terminal (e.g., mobile phone) radio access point (access point, referred to as ap) dormancy Processing unit, referred to as cpu) quit work), rely on the distinctive processing unit of dsp, and by specific Triggering mode, can reach wake-up cpu so that the technology of its rearming.It is to be conceived to completely to solve Put the speech control scene of both hands, on the basis of reaching maximum economize on electricity in cell phone system resting state, exploitation is to mobile phone language The technical operation that sound wakes up.The development of this research can be opened up one kind for mobile phone operation and completely be used " voice+listen Feel reaction " the input operation premise of replacement " finger+vision touch-control ", thus reaching the man-machine of completely voice-intelligent Interactive experience.
3rd, barge:
Barge refers to carry out a specific human voices technology of identification of speech recognition under stationary background sound.There is this work( Can, could talk using after just need not waiting " ticking " sound during speech recognition system, but can be beaten with voice at any time Disconnected prompt tone, is directly entered speech recognition (this process is referred to as barge-in).
The key of barge is speech terminals detection function, and the purpose of end-point detection is under complicated applied environment Tell voice signal and non-speech audio in signal stream, and determine beginning and the end of voice signal.General signal All there is certain background sound in stream, and the model of speech recognition is all based on voice signal training, voice signal and voice It is just meaningful that model carries out pattern match.Therefore detect from signal stream voice signal be speech recognition necessary pre- from Reason process.
In detail, end-point detection has two processes:
A) feature based on voice signal, with the parameters such as energy, zero-crossing rate, business (entropy), pitch (pitch) with And their derivative parameter, to judge the speech/non-speech signal in signal stream.
B), after voice signal is detected in signal stream, judge it is whether beginning or the end point of sentence herein.In commercial language In system for electrical teaching, it is easier to make in sentence, there is pause (non-voice) due to the changeable background of signal and natural dialogue pattern, special It is not always to have silence gap before outburst initial consonant.Therefore, the judgement of this beginning/end is particularly important.
In addition the purpose of end-point detection also resides in:
A) reduce the data processing amount of evaluator: the computing load of working transmission and evaluator can be reduced in a large number, for The Real time identification of voice dialogue plays an important role.
B) refuse the signal of non-voice: the identification to non-speech audio is not only a kind of wasting of resources, and be possible to change The state of dialogue, causes the puzzlement to user.
C) in the system needing to interrupt (barge-in) function, the starting point of voice is necessary.Find in end-point detection During the starting point of voice, system will stop the broadcasting of prompt tone.Complete to interrupt function.
The technical scheme of this system is as follows:
During device sleeps, user passes through to wake up wake instruction projector, enters speech recognition state, and this wake-up instruction is supported Self-defined recording is trained.
Wherein, user also can wake up projector manually, such as presses wake-up device by home bond distance, enters speech recognition shape State.
Immediately, user can say preset any phonetic order, tells projector next step needs what does.As: beat Open projection, close projection, play * * * *, (wherein * * * is video file name, ppt document name or installation to open * * * Application name etc.).Automatically this title can be loaded into and can say grammer as long as file copies projector memorizer to, application As long as the system that is installed to also can be automatically loaded into can say grammer.
Wherein, the projector not preset instruction when user input, projector can point out user input mistake, enters again Input instruction flow process.
When video commences play out, user can be by barge technology whole process Voice command video playback, you can in video Whenever phonetic order is inputted during broadcasting.User can say video control instruction such as: heighten volume, turn down volume, temporarily Stop, continue to play, exit broadcasting etc..
When ppt starts to demonstrate, user can be play by barge technology whole process Voice command ppt, you can in ppt Whenever phonetic order is inputted during demonstration.User can say ppt control instruction such as: page up, lower one page, homepage, Endpage, exit full frame, played in full screen etc..
Support ancillary equipment Voice command projector.Ancillary equipment such as wired earphone, bluetooth earphone, after connecting projector, Ancillary equipment can control projector as voice-input device.Pass through indigo plant as user may stand in the place farther out from projector Tooth earphone voice control projection instrument.
Whole flow process projector has user interface (user interface, referred to as ui) prompting on projecting apparatus screen, Have voice simultaneously or prompt tone tells when user starts input instruction, end of input, input error etc..
Below with reference to accompanying drawing to the embodiment of the present invention scheme more illustrated in detail.
Fig. 6 is the structured flowchart of voice control projection instrument system according to embodiments of the present invention, as shown in Figure 6.This system is main Be made up of 3 parts, including low-power consumption wake up chip module (corresponding to the low-power wakeup dsp chip in Fig. 6, Wake up chip with above-mentioned low-power consumption), identification and report engine modules (corresponding to the voice engine in Fig. 6, ibid The speech engine stated) and normal stream assembly module (corresponding to the standard flow component in Fig. 6, with above-mentioned Normal stream assembly).The major function of each module is as follows:
Low-power consumption wakes up chip module, belongs to hardware device, for monitoring the wake operation of user in projector dormancy; Identification and report engine modules, are the nucleus modules that speech recognition and voice are reported, and are responsible for the audio frequency collected is known Not, and voice synthesized broadcast content;Normal stream assembly module, is used for realizing each concrete function point, such as video playback language Sound control system, opens application Voice command, and each function point exists in the form of streaming, has the life cycle of oneself.
Fig. 7 is that the low-power consumption of voice control projection instrument system according to embodiments of the present invention wakes up flow chart, as shown in fig. 7, should Flow process comprises the steps:
Step s702, user input wakes up word;
Step s704, low-power consumption wakes up chip and persistently monitors user speech input in projector dormancy;
Step s706, when the phonetic entry of user is consistent in the wake-up word of preset training, low-power consumption wakes up chip and wakes up Cpu, and report wake events to driving layer;
Step s708, subsequent ccf layer notifies application layer by way of message;
Step s710, application layer has adjusted speech recognition flow process;
Step s712, terminates.
It is to liberate user's both hands completely that this low-power consumption wakes up chip, makes Voice command flow process become close loop maneuver and is possibly realized. In view of low-power consumption wakes up chip belongs to hardware configuration, cannot configure in some projector types, so the system is in low configuration This module of cutting is supported on projector, user can by other means, such as ancillary equipment, projector button is waking up.
Fig. 8 is the working state figure of voice control projection instrument system according to embodiments of the present invention, illustrates with reference to Fig. 8:
After equipment initialization completes and is waken up, equipment enters recording state, waits user input phonetic order.User Now there are two kinds of possible operations: one is not have sounding, and identification process time-out terminates;One is to have sounding to be projected instrument typing, Hence into follow-up identification state.After entering identification state, if recognizing user to have said correct instruction, Jiu Huifen It is dealt into corresponding normal stream assembly to be processed;If unrecognizable instruction, suggest that user input mistake, again Input or exit.
Wherein recording interrupts is a kind of specific identification mode under sound in stationary background.As Voice command during video playback. Lasting open detection user speech of now recording inputs and is directed to stationary background sound de-noising.If detecting dynamic with preset Instruct consistent phonetic entry, engine can return recognition result and inform that standard package stream carries out corresponding operating.Continue inspection simultaneously Survey phonetic entry next time, recording interrupts and will not stop exiting video playback until user.
In the embodiment of the present invention, loaded down with trivial details for projector apparatus manual operation, poor user experience, lack interesting problem, Voice control projection instrument system is proposed to solve this problem.This system can be waken up by sound with the use of family by hardware and software Projector simultaneously sends sound instruction.Whole flow process enables close loop maneuver, and that is, whole link is all completed by acoustic control, is not required to Wanting manual operation, thus having liberated the both hands of user, greatly strengthen service efficiency and the interest of projector.This system Support cutting, can clipping function and hardware configuration as needed.
It should be noted that above-mentioned modules can be by software or hardware to realize, for the latter, Ke Yitong Cross in the following manner to realize, but not limited to this: above-mentioned module is respectively positioned in same processor;Or, above-mentioned module position respectively In multiple processors.
Embodiments of the invention additionally provide a kind of storage medium.Alternatively, in the present embodiment, above-mentioned storage medium can To be arranged to store for executing the program code of following steps:
S1, determines that projector apparatus enter speech recognition state, wherein, this speech recognition state is to hold according to phonetic order The state of row operation;
S2, the phonetic order of receives input;
S3, executes operation corresponding with above-mentioned phonetic order according to the phonetic order receiving.
Alternatively, in the present embodiment, above-mentioned storage medium can include but is not limited to: u disk, read only memory (read-only memory, referred to as rom), random access memory (random access memory, referred to as For ram), portable hard drive, magnetic disc or CD etc. are various can be with the medium of store program codes.
Alternatively, in the present embodiment, processor is according to the program-code execution step s1-s3 of storage in storage medium.
Alternatively, the specific example in the present embodiment may be referred to showing described in above-described embodiment and optional embodiment Example, the present embodiment will not be described here.
Obviously, those skilled in the art should be understood that each module of the above-mentioned present invention or each step can be with general Realizing, they can concentrate on single computing device computing device, or be distributed in multiple computing devices and formed Network on, alternatively, they can be realized with the executable program code of computing device, it is thus possible to by they Storage to be executed by computing device in the storage device, and in some cases, can be to hold different from order herein The shown or described step of row, or they are fabricated to respectively each integrated circuit modules, or will be many in them Individual module or step are fabricated to single integrated circuit module to realize.So, the present invention is not restricted to any specific hardware Combine with software.
The foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, for the technology of this area For personnel, the present invention can have various modifications and variations.All within the spirit and principles in the present invention, made any Modification, equivalent, improvement etc., should be included within the scope of the present invention.

Claims (11)

1. a kind of sound control method is it is characterised in that include:
Determine that projector apparatus enter speech recognition state, wherein, described speech recognition state is according to phonetic order The state of execution operation;
The phonetic order of receives input;
Described phonetic order according to receiving executes operation corresponding with described phonetic order.
2. method according to claim 1 is it is characterised in that determine that projector apparatus enter speech recognition state and include: Determine that described projector apparatus, by way of receiving and waking up instruction, enter described speech recognition state, wherein, institute State wake-up and instruct and include at least one of:
The touching signals of desired trajectory, voice signal, push button signalling.
3. method according to claim 1 is it is characterised in that execute and institute's predicate according to the described phonetic order receiving The corresponding operation of sound instruction includes:
Judge whether to be previously stored with the instruction mated with described phonetic order;
In the case of being to be in judged result, execute operation corresponding with described phonetic order.
4. method according to claim 1 is it is characterised in that execute and institute's predicate according to the described phonetic order receiving Before the corresponding operation of sound instruction, also include:
Obtain the Apply Names of the file name of file prestoring and/or preassembled application;
Store described file name and/or described Apply Names, wherein, described file name is used for according to institute's predicate Sound instruction calls file corresponding with described file name, described Apply Names is used for being called according to described phonetic order Application corresponding with described Apply Names.
5. method according to any one of claim 1 to 4 is it is characterised in that described projector apparatus are supported to pass through Ancillary equipment receives described phonetic order, and wherein, described ancillary equipment includes at least one of: wired earphone, Bluetooth earphone.
6. a kind of phonetic controller is it is characterised in that include:
Determining module, for determining projector apparatus entrance speech recognition state, wherein, described speech recognition state It is the state according to phonetic order execution operation;
Receiver module, for the phonetic order of receives input;
Performing module, for executing operation corresponding with described phonetic order according to the described phonetic order receiving.
7. device according to claim 6 is it is characterised in that described determining module comprises determining that unit, for true Fixed described projector apparatus, by way of receiving and waking up instruction, enter described speech recognition state, wherein, described Wake up to instruct and include at least one of:
The touching signals of desired trajectory, voice signal, push button signalling.
8. device according to claim 6 is it is characterised in that described performing module includes:
Judging unit, for judging whether to be previously stored with the instruction mated with described phonetic order;
Performance element, for the judged result in described judging unit for, in the case of being, execution is referred to described voice Make corresponding operation.
9. device according to claim 6 is it is characterised in that also include:
Acquisition module, for obtaining the file name of file prestoring and/or the application of preassembled application Title;
Memory module, for storing described file name and/or described Apply Names, wherein, described file name For calling file corresponding with described file name according to described phonetic order, described Apply Names is used for according to institute The application corresponding with described Apply Names of predicate sound instruction calls.
10. the device according to any one of claim 6 to 9 is it is characterised in that described projector apparatus are supported to pass through Ancillary equipment receives described phonetic order, and wherein, described ancillary equipment includes at least one of: wired earphone, Bluetooth earphone.
A kind of 11. projector apparatus are it is characterised in that at least include: low-power consumption wakes up chip, speech engine and normal stream group Part, wherein,
Described low-power consumption wakes up chip and is used for according to waking up instruction entrance speech recognition state, and wherein, described voice is known Other state is the state according to phonetic order execution operation;
Described speech engine is used for the phonetic order of receives input;
The described phonetic order that described normal stream assembly is used for according to receiving executes behaviour corresponding with described phonetic order Make.
CN201510424654.1A 2015-07-17 2015-07-17 Voice control method, device and projector Pending CN106356059A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201510424654.1A CN106356059A (en) 2015-07-17 2015-07-17 Voice control method, device and projector
PCT/CN2016/090170 WO2017012511A1 (en) 2015-07-17 2016-07-15 Voice control method and device, and projector apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510424654.1A CN106356059A (en) 2015-07-17 2015-07-17 Voice control method, device and projector

Publications (1)

Publication Number Publication Date
CN106356059A true CN106356059A (en) 2017-01-25

Family

ID=57833698

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510424654.1A Pending CN106356059A (en) 2015-07-17 2015-07-17 Voice control method, device and projector

Country Status (2)

Country Link
CN (1) CN106356059A (en)
WO (1) WO2017012511A1 (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106847285A (en) * 2017-03-31 2017-06-13 上海思依暄机器人科技股份有限公司 A kind of robot and its audio recognition method
CN107180631A (en) * 2017-05-24 2017-09-19 刘平舟 A kind of voice interactive method and device
CN107680592A (en) * 2017-09-30 2018-02-09 惠州Tcl移动通信有限公司 A kind of mobile terminal sound recognition methods and mobile terminal and storage medium
CN107920240A (en) * 2017-12-27 2018-04-17 兴天通讯技术有限公司 A kind of smart projector of achievable speech control
CN108566634A (en) * 2018-03-30 2018-09-21 深圳市冠旭电子股份有限公司 Reduce method, apparatus and Baffle Box of Bluetooth that Baffle Box of Bluetooth continuously wakes up delay
WO2018176387A1 (en) * 2017-03-31 2018-10-04 深圳市红昌机电设备有限公司 Voice control method and system for winding-type coil winder
CN108920128A (en) * 2018-07-12 2018-11-30 苏州思必驰信息科技有限公司 The operating method and system of PowerPoint
WO2019015435A1 (en) * 2017-07-19 2019-01-24 腾讯科技(深圳)有限公司 Speech recognition method and apparatus, and storage medium
CN109375460A (en) * 2018-12-27 2019-02-22 成都市极米科技有限公司 The control method and smart projector of smart projector
WO2019153999A1 (en) * 2018-02-09 2019-08-15 广景视睿科技(深圳)有限公司 Voice control-based dynamic projection method, apparatus, and system
CN110322873A (en) * 2019-07-02 2019-10-11 百度在线网络技术(北京)有限公司 Voice technical ability exits method, apparatus, equipment and storage medium
CN110505431A (en) * 2018-05-17 2019-11-26 视联动力信息技术股份有限公司 A kind of control method and device of terminal
CN110517697A (en) * 2019-08-20 2019-11-29 中信银行股份有限公司 Prompt tone intelligence cutting-off device for interactive voice response
CN110992960A (en) * 2019-12-18 2020-04-10 Oppo广东移动通信有限公司 Control method, control device, electronic equipment and storage medium
CN111467198A (en) * 2020-04-28 2020-07-31 北京光彩明天儿童眼科医院有限公司 Eyesight improving and consciousness restoring instrument
CN112530430A (en) * 2020-11-30 2021-03-19 北京百度网讯科技有限公司 Vehicle-mounted operating system control method and device, earphone, terminal and storage medium
CN113127105A (en) * 2021-03-18 2021-07-16 福建马恒达信息科技有限公司 Excel automatic voice tool calling method
CN113157350A (en) * 2021-03-18 2021-07-23 福建马恒达信息科技有限公司 Office auxiliary system and method based on voice recognition
CN113160806A (en) * 2020-01-07 2021-07-23 京东方科技集团股份有限公司 Projection system and control method thereof
CN113763944A (en) * 2020-09-29 2021-12-07 浙江思考者科技有限公司 AI video cloud interactive system based on simulation person logic knowledge base
CN114097660A (en) * 2021-11-08 2022-03-01 广州回味源蛋类食品有限公司 Duck egg screening device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110908718A (en) * 2018-09-14 2020-03-24 上海擎感智能科技有限公司 Face recognition activated voice navigation method, system, storage medium and equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101648077A (en) * 2008-08-11 2010-02-17 巍世科技有限公司 Voice command game control device and method thereof
CN103885350A (en) * 2014-03-19 2014-06-25 四川长虹电器股份有限公司 Method and device for voice control over household appliances
CN103971683A (en) * 2013-01-24 2014-08-06 上海果壳电子有限公司 Voice control method and system and handheld device
CN104599669A (en) * 2014-12-31 2015-05-06 乐视致新电子科技(天津)有限公司 Voice control method and device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0986808B1 (en) * 1997-06-06 2002-02-20 BSH Bosch und Siemens Hausgeräte GmbH Household appliance, specially an electrically operated household appliance
CN101740028A (en) * 2009-11-20 2010-06-16 四川长虹电器股份有限公司 Voice control system of household appliance
CN104216351B (en) * 2014-02-10 2017-09-29 美的集团股份有限公司 Household electrical appliance sound control method and system
CN104538030A (en) * 2014-12-11 2015-04-22 科大讯飞股份有限公司 Control system and method for controlling household appliances through voice

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101648077A (en) * 2008-08-11 2010-02-17 巍世科技有限公司 Voice command game control device and method thereof
CN103971683A (en) * 2013-01-24 2014-08-06 上海果壳电子有限公司 Voice control method and system and handheld device
CN103885350A (en) * 2014-03-19 2014-06-25 四川长虹电器股份有限公司 Method and device for voice control over household appliances
CN104599669A (en) * 2014-12-31 2015-05-06 乐视致新电子科技(天津)有限公司 Voice control method and device

Cited By (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106847285B (en) * 2017-03-31 2020-05-05 上海思依暄机器人科技股份有限公司 Robot and voice recognition method thereof
WO2018176387A1 (en) * 2017-03-31 2018-10-04 深圳市红昌机电设备有限公司 Voice control method and system for winding-type coil winder
CN106847285A (en) * 2017-03-31 2017-06-13 上海思依暄机器人科技股份有限公司 A kind of robot and its audio recognition method
CN107180631A (en) * 2017-05-24 2017-09-19 刘平舟 A kind of voice interactive method and device
US11244672B2 (en) 2017-07-19 2022-02-08 Tencent Technology (Shenzhen) Company Limited Speech recognition method and apparatus, and storage medium
WO2019015435A1 (en) * 2017-07-19 2019-01-24 腾讯科技(深圳)有限公司 Speech recognition method and apparatus, and storage medium
CN107680592B (en) * 2017-09-30 2020-09-22 惠州Tcl移动通信有限公司 Mobile terminal voice recognition method, mobile terminal and storage medium
CN107680592A (en) * 2017-09-30 2018-02-09 惠州Tcl移动通信有限公司 A kind of mobile terminal sound recognition methods and mobile terminal and storage medium
CN107920240A (en) * 2017-12-27 2018-04-17 兴天通讯技术有限公司 A kind of smart projector of achievable speech control
WO2019153999A1 (en) * 2018-02-09 2019-08-15 广景视睿科技(深圳)有限公司 Voice control-based dynamic projection method, apparatus, and system
CN108566634B (en) * 2018-03-30 2021-06-25 深圳市冠旭电子股份有限公司 Method and device for reducing continuous awakening delay of Bluetooth sound box and Bluetooth sound box
CN108566634A (en) * 2018-03-30 2018-09-21 深圳市冠旭电子股份有限公司 Reduce method, apparatus and Baffle Box of Bluetooth that Baffle Box of Bluetooth continuously wakes up delay
CN110505431A (en) * 2018-05-17 2019-11-26 视联动力信息技术股份有限公司 A kind of control method and device of terminal
CN108920128A (en) * 2018-07-12 2018-11-30 苏州思必驰信息科技有限公司 The operating method and system of PowerPoint
CN108920128B (en) * 2018-07-12 2021-10-08 思必驰科技股份有限公司 Operation method and system of presentation
CN109375460A (en) * 2018-12-27 2019-02-22 成都市极米科技有限公司 The control method and smart projector of smart projector
CN109375460B (en) * 2018-12-27 2021-03-23 成都极米科技股份有限公司 Control method of intelligent projector and intelligent projector
CN110322873B (en) * 2019-07-02 2022-03-01 百度在线网络技术(北京)有限公司 Voice skill quitting method, device, equipment and storage medium
US11580974B2 (en) 2019-07-02 2023-02-14 Baidu Online Network Technology (Beijing) Co., Ltd. Method for exiting a voice skill, apparatus, device and storage medium
CN110322873A (en) * 2019-07-02 2019-10-11 百度在线网络技术(北京)有限公司 Voice technical ability exits method, apparatus, equipment and storage medium
CN110517697A (en) * 2019-08-20 2019-11-29 中信银行股份有限公司 Prompt tone intelligence cutting-off device for interactive voice response
CN110992960A (en) * 2019-12-18 2020-04-10 Oppo广东移动通信有限公司 Control method, control device, electronic equipment and storage medium
CN113160806A (en) * 2020-01-07 2021-07-23 京东方科技集团股份有限公司 Projection system and control method thereof
CN111467198A (en) * 2020-04-28 2020-07-31 北京光彩明天儿童眼科医院有限公司 Eyesight improving and consciousness restoring instrument
CN111467198B (en) * 2020-04-28 2022-12-09 天赋光彩医疗科技(苏州)有限公司 Eyesight improving and consciousness restoring instrument
CN113763944A (en) * 2020-09-29 2021-12-07 浙江思考者科技有限公司 AI video cloud interactive system based on simulation person logic knowledge base
CN112530430A (en) * 2020-11-30 2021-03-19 北京百度网讯科技有限公司 Vehicle-mounted operating system control method and device, earphone, terminal and storage medium
CN113127105A (en) * 2021-03-18 2021-07-16 福建马恒达信息科技有限公司 Excel automatic voice tool calling method
CN113157350B (en) * 2021-03-18 2022-06-07 福建马恒达信息科技有限公司 Office auxiliary system and method based on voice recognition
CN113127105B (en) * 2021-03-18 2022-06-10 福建马恒达信息科技有限公司 Excel automatic voice tool calling method
CN113157350A (en) * 2021-03-18 2021-07-23 福建马恒达信息科技有限公司 Office auxiliary system and method based on voice recognition
CN114097660A (en) * 2021-11-08 2022-03-01 广州回味源蛋类食品有限公司 Duck egg screening device

Also Published As

Publication number Publication date
WO2017012511A1 (en) 2017-01-26

Similar Documents

Publication Publication Date Title
CN106356059A (en) Voice control method, device and projector
CN108470034B (en) A kind of smart machine service providing method and system
CN103021409B (en) A kind of vice activation camera system
EP2842125B1 (en) Embedded system for construction of small footprint speech recognition with user-definable constraints
CN108520743A (en) Sound control method, smart machine and the computer-readable medium of smart machine
CN109493849A (en) Voice awakening method, device and electronic equipment
CN110914828B (en) Speech translation method and device
JP2019117623A (en) Voice dialogue method, apparatus, device and storage medium
CN112201246B (en) Intelligent control method and device based on voice, electronic equipment and storage medium
CN107210040A (en) The operating method of phonetic function and the electronic equipment for supporting this method
CN109166575A (en) Exchange method, device, smart machine and the storage medium of smart machine
CN107018228B (en) Voice control system, voice processing method and terminal equipment
CN109246473B (en) Voice interaction method and terminal system of personalized video bullet screen based on voiceprint recognition
CN107180631A (en) A kind of voice interactive method and device
CN109637548A (en) Voice interactive method and device based on Application on Voiceprint Recognition
CN109243462A (en) A kind of voice awakening method and device
CN109545207A (en) A kind of voice awakening method and device
JP6783339B2 (en) Methods and devices for processing audio
CN105719647A (en) Background Speech Recognition Assistant Using Speaker Verification
US11862153B1 (en) System for recognizing and responding to environmental noises
CN109240107A (en) A kind of control method of electrical equipment, device, electrical equipment and medium
CN103533519A (en) Short message broadcasting method and system
CN109192208A (en) A kind of control method of electrical equipment, system, device, equipment and medium
CN109360567A (en) The customizable method and apparatus waken up
CN106601242A (en) Executing method and device of operation event and terminal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170125

RJ01 Rejection of invention patent application after publication