WO2019114255A1 - Voice collection method, remote controller, and computer-readable storage medium - Google Patents

Voice collection method, remote controller, and computer-readable storage medium

Info

Publication number
WO2019114255A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
recording
remote controller
pressing
stop
Prior art date
Application number
PCT/CN2018/093746
Other languages
English (en)
Chinese (zh)
Inventor
林大煜
Original Assignee
深圳Tcl新技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳Tcl新技术有限公司
Publication of WO2019114255A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204 User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206 User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42222 Additional components integrated in the remote control device, e.g. timer, speaker, sensors for detecting position, direction or movement of the remote control, microphone or battery charging device
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204 User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206 User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42221 Transmission circuitry, e.g. infrared [IR] or radio frequency [RF]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Definitions

  • The present application relates to the field of voice recognition technology, and in particular to a voice collection method, a remote controller, and a computer-readable storage medium.
  • Existing voice collection methods fall mainly into two types: in the first, pressing the voice button starts recording, and recording stops automatically once no sound is input; in the second, pressing the voice button starts recording, and recording stops when the user releases the voice button.
  • Different devices use different voice collection modes.
  • The main purpose of the present application is to provide a voice collection method, a remote controller, and a computer-readable storage medium, which are intended to solve the technical problem that, because existing devices adopt different voice collection modes, the voice collection function cannot be used accurately.
  • The present application provides a voice collection method, which includes the following steps:
  • when a pressing signal of the voice key is received, controlling the remote controller to start recording;
  • monitoring the pressing duration of the voice key, and determining a stop condition of the recording according to the pressing duration;
  • when the stop condition is satisfied, controlling the remote controller to stop recording.
  • The step of determining a stop condition of the recording according to the pressing duration includes:
  • when the pressing duration is greater than the preset duration, determining that the stop condition is receipt of a release signal of the voice key, wherein the remote controller is controlled to stop recording when the release signal of the voice key is received.
  • The method further includes:
  • when the pressing duration is less than or equal to the preset duration, determining that the stop condition is that no voice signal is received within a first preset time interval, wherein the remote controller is controlled to stop recording when no voice signal is received within the first preset time interval.
  • the method further includes:
  • the remote controller is controlled to stop recording when the release signal of the voice key is received.
  • the method further includes:
  • the step of controlling the remote controller to start recording is executed.
  • the method further includes:
  • the control command is sent to the television.
  • The present application further provides a remote controller, where the remote controller includes a memory, a processor, and a voice collection program stored on the memory and operable on the processor; when executed by the processor, the voice collection program implements the steps of any of the voice collection methods above.
  • The present application further provides a computer-readable storage medium, where the computer-readable storage medium stores a voice collection program; when executed by a processor, the voice collection program implements the steps of any of the voice collection methods above.
  • The voice collection method, remote controller, and computer-readable storage medium provided by the embodiments of the present application control the remote controller to start recording when the pressing signal of the voice button is received, monitor the pressing duration of the voice button, and use the pressing duration to determine whether the voice collection mode is long-press collection or short-press collection.
  • The two voice collection modes end collection under different conditions, so the stop condition of the recording can be determined according to the pressing duration, and the remote controller is controlled to stop recording when that stop condition is satisfied.
  • FIG. 1 is a schematic structural diagram of a terminal in a hardware operating environment involved in a solution according to an embodiment of the present application;
  • FIG. 2 is a schematic flowchart of a first embodiment of a voice collection method according to the present application;
  • FIG. 3 is a schematic flowchart of a second embodiment of a voice collection method according to the present application;
  • FIG. 4 is a schematic flowchart of a third embodiment of a voice collection method according to the present application.
  • when a pressing signal of the voice key is received, controlling the remote controller to start recording;
  • monitoring the pressing duration of the voice key, and determining a stop condition of the recording according to the pressing duration;
  • when the stop condition is satisfied, controlling the remote controller to stop recording.
  • The present application provides a solution that controls the remote controller to start recording when the pressing signal of the voice button is received, monitors the pressing duration of the voice button, and uses the pressing duration to determine whether the voice collection mode is long-press collection or short-press collection. The two voice collection modes end collection under different conditions, so the stop condition of the recording can be determined according to the pressing duration, and the remote controller is controlled to stop recording when that stop condition is satisfied.
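
The mode selection described above can be sketched in Python as follows. This is a minimal illustration, not the patent's implementation: the names `StopCondition` and `choose_stop_condition` and the 0.5 s threshold are assumptions made for the example, since the source gives no concrete value for the preset duration.

```python
from enum import Enum


class StopCondition(Enum):
    # Long press: recording stops when the voice key is released.
    KEY_RELEASE = "key_release"
    # Short press: recording stops when no voice arrives within the
    # first preset time interval.
    SILENCE_TIMEOUT = "silence_timeout"


# Hypothetical threshold; the source does not state a value.
PRESET_DURATION_S = 0.5


def choose_stop_condition(press_duration_s: float,
                          preset_duration_s: float = PRESET_DURATION_S) -> StopCondition:
    """Pick the recording stop condition from the pressing duration."""
    if press_duration_s > preset_duration_s:
        return StopCondition.KEY_RELEASE
    # "Less than or equal to the preset duration" is the short-press case.
    return StopCondition.SILENCE_TIMEOUT
```

A press held for 1.2 s would select `KEY_RELEASE`, while a 0.2 s tap, or a press of exactly the preset duration, would select `SILENCE_TIMEOUT`.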
  • FIG. 1 is a schematic structural diagram of a terminal in a hardware operating environment involved in an embodiment of the present application.
  • the terminal in the embodiment of the present application may be a remote controller, or may be a portable terminal device with a voice collection function such as a PC, a smart phone, a tablet computer, an e-book reader, or a portable computer.
  • The terminal may include a processor 1001 (such as a CPU), an infrared emitter 1003, a crystal oscillator circuit 1004, a key matrix 1005, a memory 1006, and a communication bus 1002.
  • The communication bus 1002 is used to implement connection and communication between these components.
  • The infrared emitter 1003 can include a single-beam infrared emitter or a multi-beam infrared emitter.
  • The crystal oscillator circuit 1004 can be a common crystal oscillator, a temperature-compensated crystal oscillator, a voltage-controlled crystal oscillator, or a temperature-controlled crystal oscillator.
  • The key matrix 1005 can be a key matrix composed of any number of rows and any number of columns.
  • The memory 1006 can be a high-speed RAM or a non-volatile memory, such as disk storage.
  • The terminal structure shown in FIG. 1 does not limit the device, which may include more or fewer components than illustrated, combine some components, or arrange the components differently.
  • The memory 1006, as a computer storage medium, may include an operating system, an infrared transmitting module, a crystal oscillator circuit module, a key matrix module, and an interrupt response program.
  • The infrared emitter 1003 is mainly used for transmitting infrared pulse signals to a television terminal, sending different control commands by transmitting infrared pulse signals of different frequencies;
  • the crystal oscillator circuit 1004 is mainly used for controlling the transmission of the infrared emitter;
  • the key matrix 1005 is mainly used for connecting with the crystal oscillator circuit and controlling the crystal oscillation frequency, so that the infrared emitter emits pulse signals of different frequencies;
  • and the processor 1001 can be used for calling the voice collection program stored in the memory 1006 and performing the following operations:
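  • when a pressing signal of the voice key is received, controlling the remote controller to start recording;
  • monitoring the pressing duration of the voice key, and determining a stop condition of the recording according to the pressing duration;
  • when the stop condition is satisfied, controlling the remote controller to stop recording.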
  • The processor 1001 can call the voice collection program stored in the memory 1006 and also perform the following operation:
  • when the pressing duration is greater than the preset duration, determining that the stop condition is receipt of a release signal of the voice key, wherein the remote controller is controlled to stop recording when the release signal of the voice key is received.
  • The processor 1001 can call the voice collection program stored in the memory 1006 and also perform the following operation:
  • when the pressing duration is less than or equal to the preset duration, determining that the stop condition is that no voice signal is received within a first preset time interval, wherein the remote controller is controlled to stop recording when no voice signal is received within the first preset time interval.
  • The processor 1001 can call the voice collection program stored in the memory 1006 and also perform the following operation:
  • the remote controller is controlled to stop recording when the release signal of the voice key is received.
  • The processor 1001 can call the voice collection program stored in the memory 1006 and also perform the following operation:
  • the step of controlling the remote controller to start recording is executed.
  • The processor 1001 can call the voice collection program stored in the memory 1006 and also perform the following operation:
  • the control command is sent to the television.
  • the voice collection method includes:
  • Step S10: when the pressing signal of the voice key is received, the remote controller is controlled to start recording.
  • A voice control device is provided with a control button for controlling the voice collection program, often referred to as a voice button.
  • The voice button sends a pressing signal to the controller, and when the controller receives the pressing signal, it controls the remote controller to start recording for voice collection.
  • Step S20: monitoring the pressing duration of the voice key, and determining the stop condition of the recording according to the pressing duration.
  • Step S30: when the stop condition is satisfied, the remote controller is controlled to stop recording.
  • Timing starts when the pressing signal of the voice key is received and stops when the release signal of the voice key is received, which gives the pressing duration of the voice key.
  • When the timed value exceeds the preset duration and the release signal of the voice key has not been received, it is determined that the pressing duration of the voice key is greater than the preset duration. Because the recording stop conditions differ between voice collection modes, the voice collection mode can be determined from the pressing duration of the voice key, and the recording stop condition is then determined from that voice collection mode.
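
The timing logic above can be sketched as a small Python class. This is an illustrative sketch under assumptions: the class name `PressTimer`, its methods, and the use of caller-supplied timestamps in seconds are all hypothetical and chosen so the example is deterministic; the source does not describe a concrete timer API.

```python
from typing import Optional


class PressTimer:
    """Tracks how long the voice key has been held (hypothetical helper)."""

    def __init__(self, preset_duration_s: float) -> None:
        self.preset_duration_s = preset_duration_s
        self.pressed_at: Optional[float] = None
        self.released_at: Optional[float] = None

    def on_press(self, now_s: float) -> None:
        # Timing starts when the pressing signal is received.
        self.pressed_at = now_s
        self.released_at = None

    def on_release(self, now_s: float) -> None:
        # Timing stops when the release signal is received.
        self.released_at = now_s

    def duration(self, now_s: float) -> float:
        """Pressing duration so far, frozen once the key is released."""
        if self.pressed_at is None:
            raise ValueError("voice key has not been pressed")
        end = self.released_at if self.released_at is not None else now_s
        return end - self.pressed_at

    def is_long_press(self, now_s: float) -> bool:
        # True as soon as the timed value exceeds the preset duration,
        # even if no release signal has arrived yet.
        return self.duration(now_s) > self.preset_duration_s
```

Because `is_long_press` works before the key is released, the long-press mode can be recognised while the user is still holding the key, which matches the case where the timed value exceeds the preset duration and no release signal has been received.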
  • the remote controller is controlled to stop recording.
  • the control remote controller stops recording.
  • The stop condition may also be determined to be receipt of a pressing signal of a stop key; that is, this applies when the remote controller is provided with a stop key and the voice command is recorded with a short press.
  • The remote controller is controlled to stop recording when the stop key is pressed.
  • The stop condition of the recording is determined to be receipt of the release signal of the voice key, and it is also determined whether an audio signal is received within a preset time interval. If no audio signal is received within the preset time interval, it can be determined that the voice button of the remote controller is in an erroneous state. Therefore, when no audio signal is received within the preset time interval, the remote controller is controlled to stop recording; this adds an alternative stop condition that prevents erroneous collection of voice data.
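
The combined stop check for the long-press mode (key release, plus the alternative no-audio condition) might look like the following Python sketch. The function name, the string labels, and the parameter names are assumptions made for illustration; the source does not name the preset interval's value or define such a function.

```python
from typing import Optional


def long_press_stop_reason(key_released: bool,
                           seconds_since_audio: float,
                           audio_timeout_s: float) -> Optional[str]:
    """Return why a long-press recording should stop, or None to keep recording."""
    if key_released:
        # Primary stop condition: the release signal of the voice key.
        return "key_released"
    if seconds_since_audio >= audio_timeout_s:
        # Alternative stop condition: no audio within the preset interval,
        # which suggests the key is being held down by mistake.
        return "no_audio"
    return None
```

With a hypothetical 3 s timeout, a key that is still held after 3.5 s of silence yields `"no_audio"`, so recording stops even though no release signal was received.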
  • When the pressing signal of the voice key is received, the remote controller is controlled to start recording and the pressing duration of the voice key is monitored; the pressing duration is used to determine whether the voice collection mode is long-press collection or short-press collection.
  • The two voice collection modes end collection under different conditions, so the stop condition of the recording can be determined according to the pressing duration, and the remote controller can be controlled to stop recording when that stop condition is satisfied.
  • a second embodiment of the voice collection method of the present application is provided. Based on the first embodiment, after the step S30, the method further includes:
  • Step S40: determining whether the time interval between receipt of the pressing signal of the voice key and the last time recording stopped is greater than a third preset time interval.
  • If so, step S10 is executed, that is, the remote controller is controlled to start recording.
  • When the time interval between the end of the last recording and the start of the next recording is short, the cause is usually that the user has pressed the voice button by accident. For example, when the voice button is pressed by accident, the remote controller receives no audio signal within a certain period and stops recording; because the voice button is still held down, the pressing signal may be sent to the controller again, and the controller would start voice collection repeatedly.
  • To prevent this, a check can be added on whether the time interval between receipt of the pressing signal of the voice button and the last time recording stopped is greater than the preset time interval.
  • When that time interval is greater than the third preset time interval, the remote controller is controlled to start recording; when it is less than or equal to the third preset time interval, no recording is performed. This reduces the handling of erroneous collection, reduces wear on the remote controller, and prolongs its service life.
  • Before the remote controller is controlled to start recording, it is determined whether the time interval between the currently received pressing signal of the voice key and the previous recording stop is greater than the third preset time interval.
  • Only when that time interval is greater than the third preset time interval is the remote controller controlled to start recording.
  • This prevents the voice collection program from being run repeatedly when the voice key is pressed by accident, reduces the handling of erroneous collection, reduces wear on the remote controller, and prolongs its service life.
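
The step S40 guard can be sketched in Python as a single predicate. The function name `may_start_recording` and the convention that `None` means "no previous recording" are assumptions for this example; the source only specifies the comparison against the third preset time interval.

```python
from typing import Optional


def may_start_recording(press_time_s: float,
                        last_stop_time_s: Optional[float],
                        third_interval_s: float) -> bool:
    """Decide whether a new pressing signal is allowed to start a recording."""
    if last_stop_time_s is None:
        # Assumption: with no earlier recording there is nothing to compare against.
        return True
    # Start only when strictly more than the third preset interval has passed
    # since recording last stopped; "less than or equal" is rejected.
    return (press_time_s - last_stop_time_s) > third_interval_s
```

For a hypothetical 1 s interval, a press arriving 0.5 s after the last stop is ignored, while one arriving 2 s after it starts a new recording.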
  • a third embodiment of the voice collection method of the present application is provided. Based on the first embodiment, after the step S30, the method further includes:
  • Step S50: acquiring the recorded data, and generating a corresponding control instruction according to the recorded data.
  • The sound signal collected by the recording is pre-processed, for example by noise removal, to generate a voice control command.
  • The control instruction may be generated by converting the collected voice signal into a voice pulse sequence, filtering interference signals out of the voice pulse sequence, extracting a voice feature vector, and comparing the extracted voice feature vector with the voice models in a voice template library pre-stored in the remote controller, to obtain the control command corresponding to the currently collected voice signal.
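
The comparison against the voice template library can be illustrated with the Python sketch below. The source does not say how feature vectors are compared, so cosine similarity, the 0.8 acceptance threshold, and the names `cosine_similarity` and `match_command` are all assumptions chosen for the example, not the method the patent prescribes.

```python
import math
from typing import Dict, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_command(features: Sequence[float],
                  templates: Dict[str, Sequence[float]],
                  min_similarity: float = 0.8) -> Optional[str]:
    """Return the command whose template is most similar to the features.

    Returns None when no template reaches min_similarity (assumed threshold).
    """
    best_command: Optional[str] = None
    best_score = min_similarity
    for command, template in templates.items():
        score = cosine_similarity(features, template)
        if score >= best_score:
            best_command, best_score = command, score
    return best_command
```

With a hypothetical library `{"volume_up": [1.0, 0.0, 0.0], "play": [0.0, 1.0, 0.0]}`, the feature vector `[0.9, 0.1, 0.0]` is closest to `volume_up`, while `[0.0, 0.0, 1.0]` matches nothing and yields `None`.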
  • Step S60: sending the control command to the television.
  • The control command generated from the collected voice signal is sent to the television corresponding to the remote controller, so that the television performs a function operation, such as video playback, according to the voice signal, and the voice collection is thereby responded to. Because the voice collection operation is simple and fast, it saves time in the transmission of subsequent control commands, which reflects the efficiency and convenience of voice control.
  • The corresponding control command is generated from the collected voice signal and sent to the television paired with the remote controller, so that the television responds to the control command and performs the function operation corresponding to the voice signal.
  • The embodiment of the present application further provides a remote controller, where the remote controller includes a memory, a processor, and a voice collection program stored on the memory and operable on the processor; when executed by the processor, the voice collection program implements the steps of the voice collection method described in the embodiments above.
  • The embodiment of the present application further provides a computer-readable storage medium, where the computer-readable storage medium stores a voice collection program; when executed by a processor, the voice collection program implements the steps of the voice collection method described in the foregoing embodiments.
  • The technical solution of the present application, in essence or in the part that contributes over the prior art, may be embodied in the form of a software product stored in a storage medium (such as a ROM/RAM, a magnetic disk, or an optical disk as described above) and including a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in the various embodiments of the present application.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Selective Calling Equipment (AREA)

Abstract

The present invention relates to a voice collection method comprising the steps of: when a pressing signal of a voice key is received, controlling a remote controller to start recording; monitoring the duration for which the voice key is pressed, and determining a stop condition for the recording according to that pressing duration; and, when the stop condition is satisfied, controlling the remote controller to stop recording. The present invention further relates to a remote controller and a computer-readable storage medium.
PCT/CN2018/093746 2017-12-14 2018-06-29 Voice collection method, remote controller, and computer-readable storage medium WO2019114255A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711346326.XA CN108040272A (zh) 2017-12-14 2017-12-14 语音采集方法、遥控器及计算机可读存储介质
CN201711346326.X 2017-12-14

Publications (1)

Publication Number Publication Date
WO2019114255A1 (fr) 2019-06-20

Family

ID=62103157

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/093746 WO2019114255A1 (fr) 2017-12-14 2018-06-29 Voice collection method, remote controller, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN108040272A (fr)
WO (1) WO2019114255A1 (fr)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108040272A (zh) * 2017-12-14 2018-05-15 深圳Tcl新技术有限公司 语音采集方法、遥控器及计算机可读存储介质
CN109743606A (zh) * 2018-12-29 2019-05-10 深圳Tcl数字技术有限公司 语音输入方法、语音输入装置及存储介质
CN110444232B (zh) * 2019-07-31 2021-06-01 国金黄金股份有限公司 音箱的录音控制方法及装置、存储介质和处理器
CN114120980A (zh) * 2021-10-21 2022-03-01 北京电子工程总体研究所 一种显控台操控系统和方法

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001075570A (ja) * 1999-09-03 2001-03-23 Roland Corp オーディオ波形の記録装置
CN103002136A (zh) * 2012-11-20 2013-03-27 广东欧珀移动通信有限公司 一种移动终端及其录音方法、系统
CN104347072A (zh) * 2013-08-02 2015-02-11 广东美的制冷设备有限公司 遥控器控制的方法、装置和遥控器
CN104780263A (zh) * 2015-03-10 2015-07-15 广东小天才科技有限公司 一种语音断点延长判断的方法及装置
CN105635778A (zh) * 2015-12-29 2016-06-01 康佳集团股份有限公司 一种智能电视的语音交互方法及系统
CN108040272A (zh) * 2017-12-14 2018-05-15 深圳Tcl新技术有限公司 语音采集方法、遥控器及计算机可读存储介质

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105446489B (zh) * 2015-12-08 2017-09-22 广州神马移动信息科技有限公司 语音双模控制方法、装置及用户终端

Also Published As

Publication number Publication date
CN108040272A (zh) 2018-05-15

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18887981

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18887981

Country of ref document: EP

Kind code of ref document: A1