CN106571137A - Terminal voice dotting control device and method - Google Patents

Terminal voice dotting control device and method Download PDF

Info

Publication number
CN106571137A
CN106571137A CN201610977756.0A CN201610977756A CN106571137A CN 106571137 A CN106571137 A CN 106571137A CN 201610977756 A CN201610977756 A CN 201610977756A CN 106571137 A CN106571137 A CN 106571137A
Authority
CN
China
Prior art keywords
ready
voice data
keyword
mark
identification point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610977756.0A
Other languages
Chinese (zh)
Inventor
陈文智
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nubia Technology Co Ltd
Original Assignee
Nubia Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nubia Technology Co Ltd filed Critical Nubia Technology Co Ltd
Priority to CN201610977756.0A priority Critical patent/CN106571137A/en
Publication of CN106571137A publication Critical patent/CN106571137A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C7/00Arrangements for writing information into, or reading information out from, a digital store
    • G11C7/16Storage of analogue signals in digital stores using an arrangement comprising analogue/digital [A/D] converters, digital memories and digital/analogue [D/A] converters 

Abstract

The invention discloses a terminal voice dotting control device and method. The terminal voice dotting control method comprises the steps of acquiring at least one piece of keyword information in advance, receiving audio data, performing keyword identification on the audio data according to a time axis sequence of the audio data, determining positioning information of an identification point according to a preset positioning rule, and performing dotting marking at a position corresponding to the positioning information, thereby realizing automatic identification and dotting marking on the audio data according to the keyword information, avoiding a phenomenon of mistaken or missed dotting caused by a man-made objective factor, improving the positioning accuracy of dotting marking and the integrity of the data content; and finally storing the dotting marked audio data, thereby facilitating the staff to manage the content of the audio data, performing management according to the keyword information, and greatly improving the work efficiency of the staff and the experience effect of users.

Description

A kind of terminal speech gets control device and its method ready
Technical field
The present invention relates to data processing field, more particularly, it relates to a kind of terminal speech gets control device and its side ready Method.
Background technology
With developing rapidly for information age, the importance of information input/output function has been added in the electronic device By force, people can be recorded by mobile phone or sound pick-up outfit, facilitate record information, for example, when attending a lecture, many times be recorded Time it is long, certain section of content is wanted to find to afterwards and is just bothered very much.
At present in Recording Process, it is possible to use get function ready manually and recording is marked, can listen while recording Sound, for important content is got manually ready record, but, manual mode is got ready and is easy to mistakes and omissions occur, particularly exists In the case of notice high concentration, user can forget to get mark ready, which results in and mark content imperfect, in user afterwards Playback lookup is needed for again, carries out bringing larger workload when conference content is recorded to civilian, reduce operating efficiency, So that the experience of user is not good.
The content of the invention
The present invention provides a kind of terminal speech and gets control device and its method ready, to solve existing to get recording ready manually During mark, the mark mistakes and omissions of content easily occur causes the imperfect of information, and the problem that operating efficiency is low
To solve above-mentioned technical problem, the present invention provides a kind of terminal speech and gets control device ready, and described device includes:
Acquisition module, for obtaining at least one keyword message;
Receiver module, for receiving voice data;
Locating module, for carrying out keyword knowledge to the voice data according to the time shaft of voice data order Not, the location information of identification point is determined according to default locating rule;
Logging modle, for getting ready the voice data on the corresponding position of the location information of the identification point Mark;
Preserving module, for preserving through the voice data for getting mark ready.
Further, the locating module is used to determine the actual position information of identification point;In obtaining default locating rule Positioning advance parameters;The position letter of the positioning advance parameters before by the actual position information of the identification point Breath, is defined as the location information of identification point.
Further, the logging modle is used to determine the corresponding keyword message of location information of the identification point;Root According to default keyword message and the mapping relations for getting type ready, lookup is corresponding with the keyword message to get mark ready Type;The voice data is carried out getting mark process ready according to the type of getting ready.
Further, described device also includes catalog generation module, for according to the keyword message, and the pass The location information of the corresponding identification point of key word information generates the nodal directory of the voice data.
Further, the catalog generation module is additionally operable to determine the corresponding voice mark of location information of each identification point; Voice mark correspondence is added in the voice data nodal directory.
Further, the invention provides a kind of terminal speech gets control method ready, methods described includes:
Obtain at least one keyword message;
Receive voice data;
Keyword recognition is carried out to the voice data according to the time shaft order of the voice data, according to default fixed Position rule determines the location information of identification point;
The voice data is carried out on the corresponding position of location information of the identification point get mark ready;
Preserve through the voice data for getting mark ready.
Further, it is described to determine that the location information of identification point includes according to default locating rule:
Determine the actual position information of identification point;
Obtain the positioning advance parameters in default locating rule;
The positional information of the positioning advance parameters before by the actual position information of the identification point, is defined as knowing The location information of other point.
Further, it is described on the corresponding position of the identification dot position information voice data to be carried out to get mark ready Note includes:
Determine the corresponding keyword message of location information of the identification point;
According to default keyword message and the mapping relations for getting type ready, search corresponding with the keyword message Get type ready;
The voice data is carried out getting mark process ready according to the type of getting ready.
Further, methods described also includes:According to the keyword message, and the corresponding knowledge of the keyword message The location information of other point generates the nodal directory of the voice data.
Further, methods described also includes:Determine the corresponding voice mark of location information of each identification point;By the people Tone mark is known correspondence and is added in the voice data nodal directory.
The invention has the beneficial effects as follows:
The terminal speech that the present invention is provided gets control device and its method ready, and the method is closed by obtaining at least one in advance Key word information, receives voice data, and keyword recognition is carried out to the voice data according to the time shaft order of the voice data, presses Determine the location information of identification point according to default locating rule, and carry out getting mark ready on the corresponding position of location information, from And to realize and automatic identification is carried out to voice data by keyword message get mark ready, it is to avoid beating occurs in artificial objective factor The phenomenon of point mistakes and omissions, improves the accuracy for getting mark positioning ready, and the integrality of data content;After mark will finally be got ready Voice data preserved, facilitate arrangement of the staff to audio data content, arranged according to keyword message, Substantially increase the operating efficiency of staff, and Consumer's Experience effect.
Description of the drawings
Below in conjunction with drawings and Examples, the invention will be further described, in accompanying drawing:
Fig. 1 is the hardware architecture diagram for realizing the optional mobile terminal of each embodiment one of the invention.
Fig. 2 gets the structured flowchart of control device ready for the terminal speech that first embodiment of the invention is provided.
Fig. 3 gets another structured flowchart of control device ready for the terminal speech that second embodiment of the invention is provided.
Fig. 4 gets control method basic flow sheet ready for the terminal speech that third embodiment of the invention is provided.
Fig. 5 gets the flow chart of control method ready for the terminal speech that fourth embodiment of the invention is provided.
Fig. 6 be the present embodiments relate to a kind of schematic diagram for getting ready of keyword message.
Fig. 7 be the present embodiments relate to another kind of schematic diagram got ready of keyword message.
Fig. 8 be the present embodiments relate to keyword message and figure mapping table.
Specific embodiment
It should be appreciated that specific embodiment described herein is not intended to limit the present invention only to explain the present invention.
The mobile terminal of each embodiment of the invention is realized referring now to Description of Drawings.In follow-up description, use For represent element such as " module ", " part " or " unit " suffix only for be conducive to the present invention explanation, itself Not specific meaning.Therefore, " module " can be used mixedly with " part ".
Mobile terminal can be implemented in a variety of manners.For example, the terminal described in the present invention can include such as moving Phone, smart phone, notebook computer, digit broadcasting receiver, PDA (personal digital assistant), PAD (panel computer), PMP The mobile terminal of (portable media player), guider etc. and such as numeral TV, desktop computer etc. are consolidated Determine terminal.Hereinafter it is assumed that terminal is mobile terminal, however, it will be understood by those skilled in the art that, except being used in particular for movement Outside the element of purpose, construction according to the embodiment of the present invention can also apply to the terminal of fixed type.
Apply in example in the present invention, it can be a kind of embedded data with mobile terminal that the terminal speech gets control device ready Processing unit, by the A/V input blocks on mobile terminal voice data is obtained, and terminal speech gets control device ready according to default Keyword message crucial identification is carried out to voice data, after recognition, the corresponding position of keyword is carried out getting mark ready, so The voice data that indicated will be got ready afterwards to be preserved, can need to carry out the content in voice data in follow-up work personnel whole During reason, directly taxonomic revision can be carried out according to getting mark ready.
Fig. 1 is the hardware architecture diagram for realizing the optional mobile terminal of each embodiment one of the invention.
Mobile terminal 1 00 can include wireless communication unit 110, A/V (audio/video) input block 120, user input Unit 130, output unit 140, memory 150, controller 160 and power subsystem 170 etc..Fig. 1 is shown with various groups The mobile terminal of part, it should be understood that be not required for implementing all components for illustrating, can alternatively implement more or more Few component, will be discussed in more detail below the element of mobile terminal.
Wireless communication unit 110 generally includes one or more assemblies, and it allows mobile terminal 1 00 and wireless communication system Or the radio communication between network.For example, wireless communication unit can include mobile communication module 111, wireless Internet mould At least one of block 112, short range communication module 113.
Mobile communication module 111 sends radio signals to base station (for example, access point etc.), exterior terminal and clothes Business at least one of device and/or receive from it radio signal.Such radio signal can include voice call signal, Video calling signal or the various types of data for sending and/or receiving according to text and/or Multimedia Message.
Wireless Internet module 112 supports the Wi-Fi (Wireless Internet Access) of mobile terminal.The module can be internally or externally It is couple to terminal.Wi-Fi (Wireless Internet Access) technology involved by the module can include WLAN (WLAN) (Wi-Fi), Wibro (WiMAX), Wimax (worldwide interoperability for microwave accesses), HSDPA (high-speed downlink packet access) etc..
Short range communication module 113 is the module for supporting junction service.Some examples of short-range communication technology include indigo plant Tooth TM, RF identification (RFID), Infrared Data Association (IrDA), ultra broadband (UWB), purple honeybee TM etc..
A/V input blocks 120 are used to receive audio or video signal.A/V input blocks 120 can include the He of camera 121 Microphone 122, the static images that 121 pairs, camera is obtained in Video Capture pattern or image capture mode by image capture apparatus Or the view data of video is processed.Picture frame after process may be displayed on display module 151.Jing cameras 121 are processed Picture frame afterwards can be stored in memory 160 (or other storage mediums) or carry out sending out via wireless communication unit 110 Send, two or more cameras 121 can be provided according to the construction of mobile terminal.
Microphone 122 can be in telephone calling model, logging mode, speech recognition mode etc. operational mode via wheat Gram wind receives sound (voice data), and can be voice data by such acoustic processing.Audio frequency (voice) after process Data can be converted in the case of telephone calling model can be sent to mobile communication base station via mobile communication module 111 Form is exported.Microphone 122 can implement various types of noises and eliminate (or suppression) algorithm to eliminate (or suppression) in reception With the noise or interference produced during transmission audio signal.
User input unit 130 can generate key input data to control each of mobile terminal according to the order of user input Plant operation.User input unit 130 allows the various types of information of user input, and can include keyboard, metal dome, touch Plate (for example, detection is due to the sensitive component of the change of touched and caused resistance, pressure, electric capacity etc.), roller, rocking bar etc. Deng.Especially, when touch pad is superimposed upon in the form of layer on display module 141, touch-screen can be formed.
Output unit 140 can include display module 141, dio Output Modules 142, alarm modules 143 etc..
Display module 141 may be displayed on the information processed in mobile terminal 1 00.For example, when mobile terminal 1 00 is in electricity During words call mode, display module 141 can show and converse or other communicate (for example, text messaging, multimedia files Download etc.) related user interface (UI) or graphic user interface (GUI).When mobile terminal 1 00 is in video calling pattern Or during image capture mode, display module 141 can show the image of capture and/or the image of reception, illustrate video or figure UI or GUI of picture and correlation function etc..
Meanwhile, when the display module 141 and touch pad touch-screen with formation superposed on one another in the form of layer, display module 141 can serve as input unit and output device.Display module 141 can include liquid crystal display (LCD), thin film transistor (TFT) In LCD (TFT-LCD), Organic Light Emitting Diode (OLED) display, flexible display, three-dimensional (3D) display etc. at least It is a kind of.Some in these displays may be constructed such that transparence to allow user from outside viewing, and this is properly termed as transparent Display, typical transparent display can be, for example, TOLED (transparent organic light emitting diode) display etc..According to specific The embodiment wanted, mobile terminal 1 00 can include two or more display modules (or other display devices), for example, move Dynamic terminal can include outside display module (not shown) and internal display module (not shown).Touch-screen can be used for detection and touch Input pressure and touch input position and touch input area.
Dio Output Modules 142 can mobile terminal in call signal reception pattern, call mode, logging mode, It is that wireless communication unit 110 is received or in memory 150 when under the isotypes such as speech recognition mode, broadcast reception mode The voice data transducing audio signal of middle storage and it is output as sound.And, dio Output Modules 142 can be provided and movement The audio output (for example, call signal receives sound, message sink sound etc.) of the specific function correlation that terminal 100 is performed. Dio Output Modules 142 can include loudspeaker, buzzer etc..
Alarm modules 143 can provide output so that event is notified to mobile terminal 1 00.Typical event can be with Including calling reception, message sink, key signals input, touch input etc..In addition to audio or video is exported, alarm modules 143 can in a different manner provide output with the generation of notification event.For example, alarm modules 143 can be in the form of vibrating Output is provided, when calling, message or some other entrance communication (incoming communication) are received, alarm mould Block 143 can provide tactile output (that is, vibrating) to notify to user.By providing such tactile output, even if When the mobile phone of user is in the pocket of user, user also can recognize that the generation of various events.Alarm modules 143 The output of the generation of notification event can be provided via display module 141 or dio Output Modules 142.
Memory 150 can store software program for the process and control operation performed by controller 160 etc., Huo Zheke With the data (for example, telephone directory, message, still image, video etc.) for temporarily storing own Jing outputs or will export.And And, memory 150 can be storing the vibration of various modes with regard to exporting when touching and being applied to touch-screen and audio signal Data.
Memory 150 can include the storage medium of at least one type, and the storage medium includes flash memory, hard disk, many Media card, card-type memory (for example, SD or DX memories etc.), random access storage device (RAM), static random-access storage Device (SRAM), read-only storage (ROM), Electrically Erasable Read Only Memory (EEPROM), programmable read only memory (PROM), magnetic storage, disk, CD etc..And, mobile terminal 1 00 can perform memory with by network connection The network storage device cooperation of 150 store function.
The overall operation of the generally control mobile terminal of controller 160.For example, controller 160 is performed and voice call, data The related control of communication, video calling etc. and process.In addition, controller 160 can be included for reproducing (or playback) many matchmakers The multi-media module of volume data, multi-media module can be constructed in controller 160, or is so structured that and controller 160 Separate.The handwriting input for performing on the touchscreen or picture can be drawn defeated by controller 160 with execution pattern identifying processing Enter to be identified as character or image.
Power subsystem 170 receives external power or internal power under the control of controller 160 and provides operation each unit Appropriate electric power needed for part and component.
Various embodiments described herein can be with using such as computer software, hardware or its any combination of calculating Machine computer-readable recording medium is implementing.For hardware is implemented, embodiment described herein can be by using application-specific IC (ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), scene can Programming gate array (FPGA), processor, controller, microcontroller, microprocessor, it is designed to perform function described herein Implementing, in some cases, such embodiment can be implemented at least one in electronic unit in controller 160. For software is implemented, the embodiment of such as process or function can with allow to perform the single of at least one function or operation Software module is implementing.Software code can be come by the software application (or program) write with any appropriate programming language Implement, software code can be stored in memory 150 and be performed by controller 160.
So far, own Jing describes mobile terminal according to its function.Below, for the sake of brevity, will description such as folded form, Slide type mobile terminal in various types of mobile terminals of board-type, oscillating-type, slide type mobile terminal etc. is used as showing Example.Therefore, the present invention can be applied to any kind of mobile terminal, and be not limited to slide type mobile terminal.
Based on above-mentioned mobile terminal hardware configuration, the terminal speech for proposing the present invention gets control method each embodiment ready.
It is described in detail below by way of specific embodiment.
First embodiment:
With reference to Fig. 2, Fig. 2 gets the structured flowchart of control device ready for the terminal speech that first embodiment of the invention is provided.This On the basis of hardware configuration of the embodiment based on the mobile terminal of above-mentioned offer, the terminal speech of proposition gets control device ready, uses Mark in the automatically dotting to voice data, realize the classified finishing of content in voice data, the terminal speech gets control dress ready Putting 2 includes:Acquisition module 21, receiver module 22, locating module 23, logging modle 24 and preserving module 24, wherein:
The acquisition module 21 be used for obtain at least one keyword message, the keyword message include text message and Voice messaging;
Specifically, the keyword message is configured according to the related content of voice data, and for example, the voice data is To recruit a session recording being the theme, then, it is that terminal speech is beaten according to the theme of meeting before session recording is carried out Point control device arranges keyword message, and the keyword message is the word related to recruitment, in Recording Process, according to setting Keyword message carries out content and gets mark ready to recording.
When the keyword message is text message, the acquisition module 21 is carried from the text message of user input Take out keyword message.When the keyword message is voice messaging, the acquisition module 21 needs to pass through voice messaging Speech identifying function is converted into text message, then again so as to extracting corresponding keyword message in text message.
The receiver module 22 is used to receive voice data, and the voice data can be recording file or the stream in recording Media.
In the present embodiment, the function that the acquisition module 21 is realized, can specifically pass through the user input list in Fig. 1 Realizing, the content that user records as needed be input into keyword in user input unit 130 and believes for unit 130 and controller 160 Breath, specifically can be arranged, it is also possible to arranged by way of voice, then controller 160 by way of input through keyboard word Participle is carried out to the keyword message of user input, corresponding identification keyword is obtained.
In the present embodiment, when voice data is received, can pass through wired or wirelessly obtain, it might even be possible to It is that terminal is obtained by the live pick-up of A/V input blocks 120, when receiver module 22 is wirelessly to obtain audio frequency number According to, can be obtained by terminal wireless communication unit 110, for example, downloaded from the Internet by internet by mobile communication module 111 The audio file related to keyword message, such as download a lecture audio file from the Internet;Further, can also be logical Cross wireless Internet module 112 and receive the audio file sended over from other terminals.
The locating module 23 is used to carry out key to the voice data according to the time shaft order of the voice data Word identification, according to default locating rule the location information of identification point is determined;
In the present embodiment, the default locating rule is the place for getting mark ready before the position of keyword is recognized Reason rule, such as, the recording file of a lecture occurs in that keyword " recruitment " on the time point of the 1st minute, then getting mark ready Clock and got ready according to default locating rule, the locating rule is to carry out on the position of first 5 seconds of " recruitment " keyword time point Get ready, the positional information that is to say the identification point of final determination is 55 seconds positions.
Further, in order to ensure finally to get the integrality for marking the data for obtaining ready, locating module 23 is getting mark ready When, need the recording according to voice data or file, reproduction time axle order to be got ready.
Logging modle 24 is used on the corresponding position of location information of the identification point that the voice data to be carried out to beat Point mark;
In the present embodiment, logging modle 24, specifically can be by mark figure of different shapes when carrying out getting mark ready Shape needs the corresponding pass set up between keyword message and figure being got ready, but before adopting and graphically getting ready It is table.
As shown in fig. 6, keyword message gets schematic diagram ready, it is assumed that the voice data is the session recording with regard to recruiting, Its keyword message includes:Campus recruiting, educational background, specialty, wages, job site etc. are learned, in Recording Process, terminal passes through Speech identifying function, conversion is identified to the spoken voice in meeting, and the identification of keyword message is carried out after conversion, works as identification During to default keyword, then logging modle 24 is marked on corresponding record length point, concrete as shown in fig. 6, working as The time point t1 moment has recognized keyword " campus recruiting ", and at the time point t2 moment keyword " learning specialty " is recognized, this When terminal control logging modle 24 mark corresponding keyword message on t1 and t2 time points respectively.
In the present embodiment, logging modle 24 can also be marked when mark is got ready by figure, as shown in fig. 7, Its mark mode is identical with Fig. 6, but logging modle 24 also needs to carry out the inquiry of figure in mark money, particular by looking into Ask mapping table to realize, its mapping table includes keyword message and corresponding graphical information, as shown in Figure 8.For example, Logging modle 24 arrives Fig. 8 after the time point t1 moment has recognized keyword " campus recruiting " according to keyword " campus recruiting " In mapping table in inquire about, the corresponding figure of the keyword is obtained for triangle, according to the triangle on t1 time points Triangle symbol on mark, the t2 moment recognizes keyword " learning specialty ", and corresponding figure is asterisk.
Preserving module 25 is used to preserve through the voice data for getting mark ready.
In the present embodiment, when the locating module 23 determines the location information of identification point according to default locating rule, Positioned particular by the following manner:
Keyword recognition is carried out to the voice data according to keyword message, and determine identification point in voice data Actual position information, obtain the positioning advance parameters in default locating rule, according to the actual position information and Positioning advance parameters determine the location information of the identification point.
Specifically, settings of the positioning advance parameters can improve the degree of accuracy and completely of the voice data after mark Property.
In this example, it is assumed that Xiao Ming needs to record the speech in meeting for leader, Session Topic is recruitment, and he needs With the speech process led under sound recordings, and need to carry out arrangement output afterwards.Its processing procedure is specially:
First, recruited according to the theme of meeting, multiple key word informations be set to sound pick-up outfit, the key word information it is defeated It can be input into keyword audio frequency the input of the input keyboard of sound pick-up outfit, or by way of recording to enter, then Keyword audio frequency is converted into into text message;
Then, start to record meeting, during recording, sound pick-up outfit is according to according to the key for pre-setting Word information carries out the identification of keyword to recording, there is the keyword when recognizing, then the corresponding keyword in recording file Carry out getting mark process ready on position, and using the positional information got ready as location information.
Finally, will carry and get the recording file of mark ready and stored, Xiao Ming after the conference, according on recording file Getting mark ready carries out the classified finishing of conference content.
In the present embodiment, by when getting mark ready voice data being arranged, in order to be able to improve operating efficiency, can be with Return the multiple staff of distribution to be processed, everyone correspondence arranges the content of a class keywords information, therefore, in order to more square Just staff divides key word information and is arranged, and the terminal speech that the present embodiment is provided gets control device ready also to be included for each Keyword message setting is corresponding to get type ready, sets up keyword message and gets the corresponding relation of type ready, in note Record module 24 is in the corresponding positional information of identification point when getting mark ready, according to the keyword and keyword of identification point with get ready What the corresponding relation of type determined the identification point gets type of sign ready, gets ready on the identification point according to type is got ready Mark process.This gets type ready including triangle, circle, five-pointed star etc. various shapes, and the shape that do not swell represents one Keyword, described to get the color that type can also be different ready, each keyword corresponds to a kind of color.
In the present embodiment, carry out getting mark ready according to the location information of the keyword message for identifying in logging modle 24 After note process, also including directory listing is generated, the directory listing is specifically believed according to the keyword message, and keyword The voice data nodal directory that the location information of corresponding identification point is generated.
Specifically, in one section of session recording file, the identification of each keyword message is not only occur once, can Can occur multiple, then after the completion of logging modle 24 gets mark ready, one is generated with keyword message according to keyword message Catalogue for index is together stored in recording file, when staff carries out recording file arrangement, is directly looked into according to catalogue Ask and arrange, substantially increase operating efficiency.
It is not that one-man speaks in meeting or lecture, many individuals may be related to and be entered in different periods The discussion of row difference topic, at this moment will cause confusion in the arrangement that content is carried out according to keyword, therefore, the present embodiment is carried For terminal speech get ready control device also include according to voice identification get ready mark sort out, specifically, first determine each knowledge The corresponding voice mark of other point location information;Voice mark correspondence is added in the voice data nodal directory.
In the present embodiment, above-mentioned terminal speech gets the function that each module is realized in control device ready, specifically can be with Realized by the hardware of the mobile terminal 1 00 provided in Fig. 1, the key that user is arranged is obtained by user input unit 130 Word information, A/V input blocks 120 are received to be needed to carry out getting the audio file of mark ready, optionally, directly by microphone scene Recording is obtained, or from extraneous acquisition;User input unit 130 and A/V input blocks 120 are respectively by the keyword for obtaining Information and audio file are sent to controller 160, and controller 160 carries out keyword knowledge according to keyword message to audio file Not, determine each keyword corresponding positional information in audio file, and carry out getting mark ready according to corresponding type of getting ready Note process, exports one with the audio file for getting mark ready, in being stored in memory 150.
The terminal speech that the present embodiment is provided gets control device ready, and by acquisition module at least one keyword letter is obtained Breath, receiver module receives voice data, and locating module enters according to the time shaft order of the voice data to the voice data Row keyword recognition, according to default locating rule the location information of identification point is determined, logging modle is determined the identification point Position information carries out getting mark ready on corresponding position to the voice data, it is achieved thereby that by keyword message to audio frequency number Mark is got ready according to automatic identification is carried out, it is to avoid artificial objective factor occurs getting the phenomenon of mistakes and omissions ready, improve that to get mark ready fixed The accuracy of position, and the integrality of data content;After the completion of getting mark ready, by preserving module by the audio frequency for getting mark ready Data are preserved, and facilitate arrangement of the staff to audio data content, are arranged according to keyword message, are carried significantly The high operating efficiency of staff, and Consumer's Experience effect.
Second embodiment:
With reference to Fig. 3, Fig. 3 gets another structural frames of control device ready for the terminal speech that second embodiment of the invention is provided Figure.The terminal speech is got control device ready and is specifically included:Acquisition module 21, receiver module 22, locating module 23, logging modle 24 With preserving module 24;
The acquisition module 21 be used for obtain at least one keyword message, the keyword message include text message and Voice messaging.
The receiver module 22 is used to receive voice data, and the voice data can be recording file or the stream in recording Media.
The locating module 23 is used to carry out key to the voice data according to the time shaft order of the voice data Word identification, according to default locating rule the location information of identification point is determined.
In the present embodiment, the default locating rule is to get mark ready before the position of keyword is recognized, than Such as, the recording file of a lecture occurs in that keyword " recruitment " on the time point of the 1st minute, then when mark is got ready according to pre- If locating rule get ready, the locating rule that is to say to be got ready on the position of first 5 seconds of " recruitment " keyword time point The positional information of the final identification point for determining is 55 seconds positions.
Logging modle 24 is used on the corresponding position of location information of the identification point that the voice data to be carried out to beat Point mark.
Preserving module 25 is used to preserve through the voice data for getting mark ready.
In the present embodiment, when the keyword message is text message, the acquisition module 21 is from user input In text message, keyword message is extracted.When the keyword message be voice messaging when, the acquisition module 21 need by Voice messaging is converted into text message by speech identifying function, then again so as to extracting corresponding pass in text message Key word information.
In order to ensure finally to get ready the integrality of the data that mark is obtained, locating module 23 needs to press when mark is got ready Recording, reproduction time axle order according to voice data or file is got ready.
In the present embodiment, when the locating module 23 determines the location information of identification point according to default locating rule, Positioned particular by the following manner:
Keyword recognition is carried out to the voice data according to keyword message, and determine identification point in voice data Actual position information, obtain the positioning advance parameters in default locating rule, according to the actual position information and Positioning advance parameters determine the location information of the identification point.
In this example, it is assumed that Xiao Ming needs to record the speech in meeting for leader, Session Topic is recruitment, and he needs With the speech process led under sound recordings, and need to carry out arrangement output afterwards.Its processing procedure is specially:
First, recruited according to the theme of meeting, multiple key word informations be set to sound pick-up outfit, the key word information it is defeated It can be input into keyword audio frequency the input of the input keyboard of sound pick-up outfit, or by way of recording to enter, then Keyword audio frequency is converted into into text message;
Then, start to record meeting, during recording, sound pick-up outfit is according to according to the key for pre-setting Word information carries out the identification of keyword to recording, there is the keyword when recognizing, then the corresponding keyword in recording file Carry out getting mark process ready on position, and using the positional information got ready as location information.
Finally, will carry and get the recording file of mark ready and stored, Xiao Ming after the conference, according on recording file Getting mark ready carries out the classified finishing of conference content.
In the present embodiment, the logging modle 24 is recognized on the corresponding position of dot position information to the audio frequency described When data carry out getting mark ready, specifically include:
Determine the corresponding keyword message of the identification point location information;According to default keyword message with get mark ready The mapping relations of type, the lookup keyword message is corresponding to get type ready;The voice data is beaten according to described Point type carries out getting mark process ready.
This gets type ready including triangle, circle, five-pointed star etc. various shapes, and each shape represents a pass Keyword, described to get the color that type can also be different ready, each keyword corresponds to a kind of color.
As shown in Figure 6,7, be it is provided in an embodiment of the present invention mark schematic diagram is got ready to voice data, as shown in fig. 6, When keyword " campus recruiting " has been recognized at the time point t1 moment, recognize keyword at the time point t2 moment and " learn specially Industry ", at this moment terminal control logging modle 24 mark corresponding keyword message on t1 and t2 time points respectively.Optionally, also Can graphically be marked, as shown in Figure 7.
In the present embodiment, for the treatment effeciency to audio data content for further improving, the terminal speech is beaten Point control device 2 is additionally provided with catalog generation module, for according to the keyword message, and keyword message correspondence The location information of identification point generate the nodal directory of the voice data.
Specifically, in one section of session recording file, the identification of each keyword message is not only occur once, can Can occur multiple, then after the completion of logging modle 24 gets mark ready, one is generated with keyword message according to keyword message Catalogue for index is together stored in recording file, when staff carries out recording file arrangement, is directly looked into according to catalogue Ask and arrange, substantially increase operating efficiency.
Further, the catalog generation module is additionally operable to determine the corresponding voice mark of location information of each identification point; Voice mark correspondence is added in the voice data nodal directory.
The terminal speech that the present embodiment is provided gets control device ready, and by acquisition module at least one keyword letter is obtained Breath, receiver module receives voice data, and locating module enters according to the time shaft order of the voice data to the voice data Row keyword recognition, according to default locating rule the location information of identification point is determined, logging modle is determined the identification point Position information carries out getting mark ready on corresponding position to the voice data, it is achieved thereby that by keyword message to audio frequency number According to automatic identification is carried out get mark ready, it is to avoid artificial objective factor occurs getting the phenomenon of mistakes and omissions ready, can more detailed record The key content position of lower recording.
Further, in order to convenient staff arranges according to mark is got ready to the content of voice data, remember Record module also gets type ready for the setting of each keyword message is corresponding, and by catalog generation module according to the key Word information, and the voice data nodal directory that the location information of the corresponding identification point of keyword letter is generated, not only solve existing When getting mark ready manually to recording, the mark mistakes and omissions of content easily occur causes the incomplete problem of information, also more detailed Thin records the key content position of recording, and to getting the classification of mark ready, substantially increases operating efficiency.
3rd embodiment:
With reference to Fig. 4, Fig. 4 gets control method basic flow sheet ready for the terminal speech that third embodiment of the invention is provided.Should Method is that the voice proposed on the hardware foundation of the mobile terminal of above-mentioned offer gets control method ready, and the method is specifically included:
S401, obtains at least one keyword message.
In this step, the keyword message includes text message and voice messaging, when the keyword message is text During this information, the acquisition module 21 extracts keyword message from the text message of user input.When keyword letter Cease for voice messaging when, the acquisition module 21 needs for voice messaging to be converted into text envelope by speech identifying function Breath, then again so as to extracting corresponding keyword message in text message.
S402, receives voice data, and the voice data can be recording file or the Streaming Media in recording.
Specifically, the step can pass through wired or wirelessly obtain, it might even be possible to be that terminal is input into by A/V The live pick-up of unit 120 is obtained, when receiver module 22 is wirelessly to obtain voice data, can be logical by terminal wireless Letter unit 110 is obtained, and for example, is downloaded from the Internet by internet by mobile communication module 111 related to keyword message Audio file, such as download a lecture audio file from the Internet;Further, can also be by wireless Internet module 112 receive the audio file sended over from other terminals.
S403, carries out keyword recognition, according to default according to the time shaft order of the voice data to the voice data Locating rule determines the location information of identification point.
In this step, the default locating rule refers to get mark ready before the position of keyword is recognized Rule is processed, such as, the recording file of a lecture occurs in that keyword " recruitment " on the time point of the 1st minute, then getting ready Get ready according to default locating rule during mark, the locating rule is enterprising for the position of first 5 seconds of " recruitment " keyword time point Row is got ready, and the positional information that is to say the identification point of final determination is 55 seconds positions.
S404, carries out getting mark ready on the corresponding position of identification point location information to the voice data.
S405, preserves through the voice data after getting mark ready.
In the present embodiment, it is described identification point is determined according to default locating rule location information when, particular by In the following manner is positioned:
Keyword recognition is carried out to the voice data according to keyword message, and determine identification point in voice data Actual position information;
Obtain the positioning advance parameters in default locating rule;
The location information of the identification point is determined according to the actual position information and positioning advance parameters.
Assume that Xiao Ming needs to record the speech in meeting for leader, Session Topic is recruitment, and he is needed with sound recordings The speech process of leader, and need to carry out arrangement output afterwards.Its processing procedure is specially:
First, recruited according to the theme of meeting, multiple key word informations are arranged to sound pick-up outfit, such as:Recruitment, wages, bar Part.The input of the key word information can be input into, or by way of recording the input keyboard of sound pick-up outfit Input keyword audio frequency, is then converted into text message by keyword audio frequency;
Then, start to record meeting, during recording, sound pick-up outfit is according to according to the key for pre-setting Word information carries out the identification of keyword to recording, there is the keyword when recognizing, then the corresponding keyword in recording file Carry out getting mark process ready on position, and using the positional information got ready as location information.
Finally, will carry and get the recording file of mark ready and stored, Xiao Ming after the conference, according on recording file Getting mark ready carries out the classified finishing of conference content.
Arranged in order to convenient staff divides key word information, the terminal speech that the present embodiment is provided is got ready Control method also includes getting type ready for the setting of each keyword message is corresponding, sets up keyword message and gets marking class ready The corresponding relation of type, specifically being determined with the corresponding relation for getting type ready according to the keyword and keyword of identification point should Identification point gets type of sign ready, processes according to getting type ready and getting mark ready on the identification point, and this gets type ready Including triangle, circle, five-pointed star etc. various shapes, the shape that do not swell represents a keyword, described to get type ready Different colors is can also be, each keyword corresponds to a kind of color.
After being carried out getting ready the step of mark is processed according to the location information of the keyword message for identifying, also wrap Include:Directory listing is generated, the directory listing specifically believes corresponding identification point according to the keyword message, and keyword The voice data nodal directory that location information is generated.
For example, user is input in multiple keyword A, B, C, D ... recording whenever identifying these keywords before recording, Will get off as nodes records, a series of recording node catalogue of A, B, C, D ... be automatically generated after the completion of recording and allows user Inquiry.
As shown in fig. 6, keyword message gets schematic diagram ready, it is assumed that the voice data is the session recording with regard to recruiting, Its keyword message includes:Campus recruiting, educational background, specialty, wages, job site etc. are learned, in Recording Process, when in the time The point t1 moment has recognized keyword " campus recruiting ", and at the time point t2 moment keyword " learning specialty " is recognized, at this moment eventually End control logging modle 24 marks respectively corresponding keyword message on t1 and t2 time points.Optionally, can be with figure Mode be marked, it is concrete as shown in Figure 7.
It is not that one-man speaks in meeting or lecture, many individuals may be related to and be entered in different periods The discussion of row difference topic, at this moment will cause confusion in the arrangement that content is carried out according to keyword, therefore, the present embodiment is carried For terminal speech get ready control device also include according to voice identification get ready mark sort out, specifically, first determine each knowledge The corresponding voice mark of other point location information;Voice mark correspondence is added in the voice data nodal directory.
The terminal speech that the present embodiment is provided gets control method ready, by obtaining at least one keyword message, receives sound Frequency evidence, carries out keyword recognition, according to default fixed according to the time shaft order of the voice data to the voice data Position rule determines the location information of identification point, and the voice data is carried out on the corresponding position of the identification point location information Get mark ready, preserve through it is described get mark ready after voice data, solve it is existing when getting mark ready manually to recording, easily The mark mistakes and omissions of content occur causes the incomplete problem of information.Further realize by keyword message to audio frequency number Mark is got ready according to automatic identification is carried out, automatic identification is carried out by keyword message and gets mark ready, not only increase and get mark ready The accuracy of positioning, also facilitates arrangement of the staff to audio data content, and the operating efficiency of staff is improve greatly.
Fourth embodiment:
With reference to Fig. 5, Fig. 5 gets another flow chart of control method ready for the terminal speech that fourth embodiment of the invention is provided. The present embodiment is got control method ready and is further described so that mobile phone carries out session recording as an example to terminal speech.
The process step that the terminal speech that the present embodiment is provided gets control method ready is specific as follows:
S501, obtains the theme of meeting, and the specific Session Topic is recruitment meeting.
S502, arranges keyword message on mobile phone, and the keyword message includes recruitment, wages, academic condition, work Jing Test etc..
S503, starts recording.
S504, according to whether having corresponding keyword message in the audio frequency that the identification of default keyword message is recorded to, has Body, according to the keyword of recruitment, wages, academic condition, working experience etc., in identification recording file voice whether have recruitment, Wages, academic condition, the printed words of working experience occur.
S505, if recognizing corresponding keyword message, determines corresponding positional information.
In this step, it is determined that during particular location of the keyword message in recording file, with specific reference to default fixed Position rule is determining.Optionally, the locating rule is to be got ready on the position of first 5 seconds of " recruitment " keyword time point, when Detect in recording file and occur in that keyword " recruitment " on the time point of the 1st minute, extraction " recruitment " is in recording file Actual position information be the time point of 60 seconds, then final positional information is 55 seconds positions.
S506, carries out getting mark ready according to the final location information recording file for determining.
S507, the voice data got ready after mark is preserved.
Before step S507, also include:It is corresponding according to the mark of getting ready of the keyword message, and keyword letter The voice data nodal directory that the location information of identification point is generated.
When there is many people's sound in the recording file, also include:Determine the corresponding people of each identification point location information Tone mark is known;Voice mark correspondence is added in the voice data nodal directory.
In sum, the terminal speech that the present invention is provided gets control device and its method ready, and the method is by acquisition in advance At least one keyword message, receives voice data, and the voice data is closed according to the time shaft order of the voice data The identification of key word, according to default locating rule the location information of identification point is determined, and is carried out on the corresponding position of location information Get mark ready, solve existing when getting mark ready manually to recording, the mark mistakes and omissions of content easily occur causes information imperfect Problem, to realize carry out automatic identification and get mark ready by keyword message to voice data, it is to avoid artificial objective factor The phenomenon of mistakes and omissions is got in appearance ready, improves the accuracy for getting mark positioning ready;The voice data got ready after mark is preserved, Arrangement of the staff to audio data content is facilitated, is arranged according to keyword message, substantially increase staff Operating efficiency, and Consumer's Experience effect.
It should be noted that herein, term " including ", "comprising" or its any other variant are intended to non-row His property is included, so that a series of process, method, article or device including key elements not only include those key elements, and And also include other key elements being not expressly set out, or also include for this process, method, article or device institute inherently Key element.In the absence of more restrictions, the key element for being limited by sentence "including a ...", it is not excluded that including being somebody's turn to do Also there is other identical element in the process of key element, method, article or device.
The embodiments of the present invention are for illustration only, do not represent the quality of embodiment.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can add the mode of required general hardware platform to realize by software, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on such understanding, technical scheme is substantially done to prior art in other words Going out the part of contribution can be embodied in the form of software product, and the computer software product is stored in a storage medium In (such as ROM/RAM, magnetic disc, CD), including some instructions are used so that a station terminal equipment (can be mobile phone, computer takes Business device, air-conditioner, or network equipment etc.) perform method described in each embodiment of the invention.
Embodiments of the invention are described above in conjunction with accompanying drawing, but be the invention is not limited in above-mentioned concrete Embodiment, above-mentioned specific embodiment is only schematic, rather than restricted, one of ordinary skill in the art Under the enlightenment of the present invention, in the case of without departing from present inventive concept and scope of the claimed protection, can also make a lot Form, these are belonged within the protection of the present invention.

Claims (10)

1. a kind of terminal speech gets control device ready, it is characterised in that include:
Acquisition module, for obtaining at least one keyword message;
Receiver module, for receiving voice data;
Locating module, for carrying out keyword recognition to the voice data according to the time shaft of voice data order, presses Determine the location information of identification point according to default locating rule;
Logging modle, for carrying out to the voice data getting mark ready on the corresponding position of the location information of the identification point Note;
Preserving module, for preserving through the voice data for getting mark ready.
2. terminal speech according to claim 1 gets control device ready, it is characterised in that the locating module is used to determine The actual position information of the identification point;Obtain the positioning advance parameters in default locating rule;By the identification point The positional information of the positioning advance parameters before actual position information, is defined as the location information of the identification point.
3. terminal speech according to claim 1 and 2 gets control device ready, it is characterised in that the logging modle is used for Determine the corresponding keyword message of location information of the identification point;According to default keyword message with get type ready Mapping relations, lookup is corresponding with the keyword message to get type ready;According to described ready mark is got to the voice data Note type carries out getting mark process ready.
4. terminal speech according to claim 3 gets control device ready, it is characterised in that also including catalog generation module, For generating the audio frequency number according to the keyword message, and the location information of the corresponding identification point of the keyword message According to nodal directory.
5. terminal speech according to claim 4 gets control device ready, it is characterised in that the catalog generation module is also used In it is determined that the corresponding voice mark of the location information of each identification point;Voice mark correspondence is added into into the voice data section In point catalogue.
6. a kind of terminal speech gets control method ready, it is characterised in that include:
Obtain at least one keyword message;
Receive voice data;
Keyword recognition is carried out to the voice data according to the time shaft order of the voice data, is advised according to default positioning Then determine the location information of identification point;
The voice data is carried out on the corresponding position of location information of the identification point get mark ready;
Preserve through the voice data for getting mark ready.
7. terminal speech according to claim 6 gets control method ready, it is characterised in that described to advise according to default positioning Then determine that the location information of identification point includes:
Determine the actual position information of the identification point;
Obtain the positioning advance parameters in default locating rule;
The positional information of the positioning advance parameters before by the actual position information of the identification point, is defined as the knowledge The location information of other point.
8. the terminal speech according to claim 6 or 7 gets control method ready, it is characterised in that described in the identification point On the corresponding position of positional information the voice data is carried out getting mark ready includes:
Determine the corresponding keyword message of location information of the identification point;
According to default keyword message and the mapping relations for getting type ready, beat corresponding with the keyword message is searched Point type;
The voice data is carried out getting mark process ready according to the type of getting ready.
9. terminal speech according to claim 8 gets control method ready, it is characterised in that also include:According to the key Word information, and the nodal directory of the location information generation voice data of the corresponding identification point of the keyword message.
10. terminal speech according to claim 9 gets control method ready, it is characterised in that also include:Determine each identification point Location information corresponding voice mark;Voice mark correspondence is added in the voice data nodal directory.
CN201610977756.0A 2016-10-28 2016-10-28 Terminal voice dotting control device and method Pending CN106571137A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610977756.0A CN106571137A (en) 2016-10-28 2016-10-28 Terminal voice dotting control device and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610977756.0A CN106571137A (en) 2016-10-28 2016-10-28 Terminal voice dotting control device and method

Publications (1)

Publication Number Publication Date
CN106571137A true CN106571137A (en) 2017-04-19

Family

ID=58540175

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610977756.0A Pending CN106571137A (en) 2016-10-28 2016-10-28 Terminal voice dotting control device and method

Country Status (1)

Country Link
CN (1) CN106571137A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108093124A (en) * 2017-11-15 2018-05-29 维沃移动通信有限公司 A kind of audio localization method, device and mobile terminal
CN108831510A (en) * 2018-06-29 2018-11-16 Oppo(重庆)智能科技有限公司 Method, apparatus, terminal and the storage medium that audio-video document is got ready
CN110719518A (en) * 2018-07-12 2020-01-21 阿里巴巴集团控股有限公司 Multimedia data processing method, device and equipment
CN111178006A (en) * 2019-12-26 2020-05-19 东莞盛翔精密金属有限公司 Machining information marking method, numerically controlled machine tool, and storage medium
CN111479124A (en) * 2020-04-20 2020-07-31 北京捷通华声科技股份有限公司 Real-time playing method and device
CN111935552A (en) * 2020-07-30 2020-11-13 安徽鸿程光电有限公司 Information labeling method, device, equipment and medium
CN112822554A (en) * 2020-12-31 2021-05-18 联想(北京)有限公司 Multimedia processing method and device and electronic equipment
CN113640013A (en) * 2021-08-12 2021-11-12 安徽江淮汽车集团股份有限公司 Road test data processing method for driving assistance

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1404609A (en) * 2000-10-30 2003-03-19 皇家菲利浦电子有限公司 System and method for detecting highlights in a video program using audio properties
CN1842151A (en) * 2005-03-30 2006-10-04 株式会社东芝 Information processing apparatus and method
CN103400592A (en) * 2013-07-30 2013-11-20 北京小米科技有限责任公司 Recording method, playing method, device, terminal and system
CN104184870A (en) * 2014-07-29 2014-12-03 小米科技有限责任公司 Call log marking method and device and electronic equipment
CN104240113A (en) * 2014-09-24 2014-12-24 张慧燕 Coupon distribution device and coupon distribution system
CN104751846A (en) * 2015-03-20 2015-07-01 努比亚技术有限公司 Method and device for converting voice into text
CN104766604A (en) * 2015-04-02 2015-07-08 努比亚技术有限公司 Voice data marking method and device
CN105975569A (en) * 2016-05-03 2016-09-28 深圳市金立通信设备有限公司 Voice processing method and terminal
CN106024009A (en) * 2016-04-29 2016-10-12 北京小米移动软件有限公司 Audio processing method and device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1404609A (en) * 2000-10-30 2003-03-19 皇家菲利浦电子有限公司 System and method for detecting highlights in a video program using audio properties
CN1842151A (en) * 2005-03-30 2006-10-04 株式会社东芝 Information processing apparatus and method
CN103400592A (en) * 2013-07-30 2013-11-20 北京小米科技有限责任公司 Recording method, playing method, device, terminal and system
CN104184870A (en) * 2014-07-29 2014-12-03 小米科技有限责任公司 Call log marking method and device and electronic equipment
CN104240113A (en) * 2014-09-24 2014-12-24 张慧燕 Coupon distribution device and coupon distribution system
CN104751846A (en) * 2015-03-20 2015-07-01 努比亚技术有限公司 Method and device for converting voice into text
CN104766604A (en) * 2015-04-02 2015-07-08 努比亚技术有限公司 Voice data marking method and device
CN106024009A (en) * 2016-04-29 2016-10-12 北京小米移动软件有限公司 Audio processing method and device
CN105975569A (en) * 2016-05-03 2016-09-28 深圳市金立通信设备有限公司 Voice processing method and terminal

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108093124A (en) * 2017-11-15 2018-05-29 维沃移动通信有限公司 A kind of audio localization method, device and mobile terminal
CN108831510A (en) * 2018-06-29 2018-11-16 Oppo(重庆)智能科技有限公司 Method, apparatus, terminal and the storage medium that audio-video document is got ready
CN110719518A (en) * 2018-07-12 2020-01-21 阿里巴巴集团控股有限公司 Multimedia data processing method, device and equipment
CN111178006A (en) * 2019-12-26 2020-05-19 东莞盛翔精密金属有限公司 Machining information marking method, numerically controlled machine tool, and storage medium
CN111479124A (en) * 2020-04-20 2020-07-31 北京捷通华声科技股份有限公司 Real-time playing method and device
CN111935552A (en) * 2020-07-30 2020-11-13 安徽鸿程光电有限公司 Information labeling method, device, equipment and medium
CN112822554A (en) * 2020-12-31 2021-05-18 联想(北京)有限公司 Multimedia processing method and device and electronic equipment
CN113640013A (en) * 2021-08-12 2021-11-12 安徽江淮汽车集团股份有限公司 Road test data processing method for driving assistance

Similar Documents

Publication Publication Date Title
CN106571137A (en) Terminal voice dotting control device and method
CN104378441B (en) schedule creation method and device
CN106024009A (en) Audio processing method and device
CN109919244B (en) Method and apparatus for generating a scene recognition model
CN107992195A (en) A kind of processing method of the content of courses, device, server and storage medium
CN104021350A (en) Privacy-information hiding method and device
CN104035995B (en) Group's label generating method and device
CN110175223A (en) A kind of method and device that problem of implementation generates
CN107358227A (en) A kind of mark recognition method, mobile terminal and computer-readable recording medium
CN110083319B (en) Note display method, device, terminal and storage medium
CN106571136A (en) Voice output device and method
CN106570102A (en) Intelligent chat method, apparatus and terminal
CN104978145A (en) Recording realization method and apparatus and mobile terminal
CN110830362B (en) Content generation method and mobile terminal
CN107408238A (en) From voice data and computer operation context automatic capture information
CN107291343A (en) Recording method, device and the computer-readable recording medium of notes
CN108174236A (en) A kind of media file processing method, server and mobile terminal
CN106653011A (en) Voice control method, voice control device and terminal
US9420204B2 (en) Information processing apparatus, information processing method, and non-transitory computer readable medium
CN108174270B (en) Data processing method, data processing device, storage medium and electronic equipment
CN110895661A (en) Behavior identification method, device and equipment
CN108847066A (en) A kind of content of courses reminding method, device, server and storage medium
CN106572230A (en) Device and method for recording calls
WO2021136334A1 (en) Video generating method and apparatus, electronic device, and computer readable storage medium
CN107240076A (en) Image processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170419

RJ01 Rejection of invention patent application after publication