CN110415703A - Voice memos information processing method and device - Google Patents

Info

Publication number
CN110415703A
Authority
CN
China
Prior art keywords
information
voice memos
voice
label
memos
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910647159.5A
Other languages
Chinese (zh)
Inventor
钱庄
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Mobile Software Co Ltd
Original Assignee
Beijing Xiaomi Mobile Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Xiaomi Mobile Software Co Ltd
Priority to CN201910647159.5A
Publication of CN110415703A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present disclosure relates to a voice memo information processing method and device. The method comprises: obtaining voice memo information, which includes one or more segments of audio information; performing speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information; and determining a label corresponding to the voice memo information according to the voice memo text information. This technical solution can automatically generate a target label corresponding to the voice memo information, so that a user searching for voice memo information can quickly learn the content of a memo from its target label and find the memo they need, which improves the user experience.

Description

Voice memo information processing method and device
Technical field
The present disclosure relates to the field of control technology, and in particular to a voice memo information processing method and device.
Background
As the pace of life and work in modern society quickens, people's daily schedules are often tight, and it is not uncommon to forget things that need to be done. To avoid this, a memo can be recorded to remind people of matters to be handled. When memo information is recorded by voice, the user can simply speak the voice memo information without typing; the electronic device collects and stores it so that the user can view or play it at any time.
Summary of the invention
To overcome the problems in the related art, embodiments of the present disclosure provide a voice memo information processing method and device. The technical solution is as follows:
According to a first aspect of the embodiments of the present disclosure, a voice memo information processing method is provided, comprising:
Obtaining voice memo information, the voice memo information including one or more segments of audio information;
Performing speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
Determining a label corresponding to the voice memo information according to the voice memo text information.
In the technical solution provided by the embodiments of the present disclosure, voice memo information including one or more segments of audio information is obtained, speech recognition is performed on the audio information to obtain the corresponding voice memo text information, and a label corresponding to the voice memo information is determined according to that text information. Because the label is associated with the content of the voice memo text information, a user searching for voice memo information can quickly learn the content of a memo from its label and find the memo they need, which improves the user experience.
In one embodiment, the method further comprises:
In response to the voice memo text information including a date and/or time, outputting corresponding voice memo prompt information according to the date and/or time;
Wherein outputting the corresponding voice memo prompt information comprises: displaying it as text, and/or playing it as audio.
In one embodiment, the method further comprises:
In response to the voice memo text information including a date and/or time, calling a schedule creation interface and creating a schedule according to the date and/or time;
Wherein the content of the created schedule includes the voice memo text information.
In one embodiment, the method further comprises:
Obtaining label adjustment instruction information input by a user, wherein the label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label;
Adjusting the corresponding label according to the label adjustment instruction information;
Wherein the label adjustment instruction information is text information or voice information.
In one embodiment, the method further comprises:
Obtaining label query audio information, and performing speech recognition on the label query audio information to obtain label query text information corresponding to the query audio information;
Determining, from the existing labels, a query label that matches the label query text information;
Determining the voice memo information recorded in correspondence with the query label, and outputting the determined voice memo information;
Wherein outputting the determined voice memo information comprises: displaying it as text, and/or playing it as audio.
According to a second aspect of the embodiments of the present disclosure, a voice memo information processing device is provided, comprising:
A voice memo information obtaining module, configured to obtain voice memo information, the voice memo information including one or more segments of audio information;
A speech recognition module, configured to perform speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
A label determining module, configured to determine a label corresponding to the voice memo information according to the voice memo text information.
In one embodiment, the device further comprises:
A date prompt module, configured to, in response to the voice memo text information including a date and/or time, output corresponding voice memo prompt information according to the date and/or time;
Wherein outputting the corresponding voice memo prompt information comprises: displaying it as text, and/or playing it as audio.
In one embodiment, the device further comprises:
A schedule creation module, configured to, in response to the voice memo text information including a date and/or time, call a schedule creation interface and create a schedule according to the date and/or time;
Wherein the content of the created schedule includes the voice memo text information.
In one embodiment, the device further comprises:
A label adjustment instruction information obtaining module, configured to obtain label adjustment instruction information input by a user, wherein the label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label;
A label adjustment module, configured to adjust the corresponding label according to the label adjustment instruction information;
Wherein the label adjustment instruction information is text information or voice information.
In one embodiment, the device further comprises:
A label query audio information obtaining module, configured to obtain label query audio information and perform speech recognition on the label query audio information to obtain label query text information corresponding to the query audio information;
A query label determining module, configured to determine, from the existing labels, a query label that matches the label query text information;
A queried voice memo information prompt module, configured to determine the voice memo information recorded in correspondence with the query label, and output the determined voice memo information;
Wherein outputting the determined voice memo information comprises: displaying it as text, and/or playing it as audio.
According to a third aspect of the embodiments of the present disclosure, a voice memo information processing device is provided, comprising:
A processor;
A memory for storing instructions executable by the processor;
Wherein the processor is configured to:
Obtain voice memo information, the voice memo information including one or more segments of audio information;
Perform speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
Determine a label corresponding to the voice memo information according to the voice memo text information.
According to a fourth aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, on which computer instructions are stored; when executed by a processor, the instructions implement the steps of the method of any embodiment of the first aspect of the embodiments of the present disclosure.
It should be understood that the above general description and the following detailed description are only exemplary and explanatory, and do not limit the present disclosure.
Brief description of the drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the specification, serve to explain the principles of the present disclosure.
Fig. 1a is a flowchart of a voice memo information processing method according to an exemplary embodiment;
Fig. 1b is a flowchart of a voice memo information processing method according to an exemplary embodiment;
Fig. 1c is a flowchart of a voice memo information processing method according to an exemplary embodiment;
Fig. 1d is a flowchart of a voice memo information processing method according to an exemplary embodiment;
Fig. 1e is a flowchart of a voice memo information processing method according to an exemplary embodiment;
Fig. 2a is a structural diagram of a voice memo information processing device according to an exemplary embodiment;
Fig. 2b is a structural diagram of a voice memo information processing device according to an exemplary embodiment;
Fig. 2c is a structural diagram of a voice memo information processing device according to an exemplary embodiment;
Fig. 2d is a structural diagram of a voice memo information processing device according to an exemplary embodiment;
Fig. 2e is a structural diagram of a voice memo information processing device according to an exemplary embodiment;
Fig. 3 is a block diagram of a device according to an exemplary embodiment;
Fig. 4 is a block diagram of a device according to an exemplary embodiment.
Detailed description
Exemplary embodiments are described in detail here, examples of which are illustrated in the accompanying drawings. When the following description refers to the drawings, the same numbers in different drawings denote the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of devices and methods consistent with some aspects of the present disclosure as detailed in the appended claims.
As the pace of life and work in modern society quickens, people's daily schedules are often tight, and it is not uncommon to forget things that need to be done. To avoid this, a memo can be recorded to remind people of matters to be handled. With the rapid development of science and technology and the continuous improvement of living standards, electronic devices such as smartphones and tablet computers have come into wide use in recent years. With an electronic device, a user can capture spoken voice information and use it as voice memo information without typing; that is, the memo information is recorded by voice, and the electronic device collects and stores the voice memo information so that the user can view or play it at any time.
Although this approach lets the user record memo information conveniently, when the electronic device stores too much voice memo information the user may be unable to tell what each voice memo contains, which makes it hard to find the voice memo information the user needs and harms the user experience.
To solve this problem, in the technical solution provided by the embodiments of the present disclosure, voice memo information including one or more segments of audio information is obtained, speech recognition is performed on the audio information in the voice memo information to obtain the corresponding voice memo text information, and a label corresponding to the voice memo information is determined according to the voice memo text information. Here, the label corresponding to the voice memo information can be understood as a keyword associated with the content of the voice memo text information, or a keyword extracted from the voice memo text information. A user searching for voice memo information can therefore quickly learn the key content of a memo from its label and find the memo they need, which improves the user experience.
Embodiments of the present disclosure provide a voice memo information processing method, which can be applied to electronic devices such as smart mobile communication terminals, tablet computers, computers, game consoles, medical devices, fitness equipment, and personal digital assistants. As shown in Fig. 1a, the method includes the following steps 101 to 103:
In step 101, voice memo information is obtained.
The voice memo information includes one or more segments of audio information.
For example, obtaining voice memo information can be understood as controlling the electronic device to capture the user's speech so as to obtain one or more segments of audio information as the voice memo information; it can also be understood as reading, from the electronic device, voice memo information that includes one or more segments of audio information, or as receiving such voice memo information sent by another device or system.
In step 102, speech recognition is performed on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information.
For example, performing speech recognition (Automatic Speech Recognition, ASR) on the audio information in the voice memo information may consist of encoding the audio information (feature extraction), feeding the encoded information into a trained deep neural network or hidden Markov model to obtain a recognition result, and decoding the recognition result before outputting it.
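Steps 101 and 102 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names VoiceMemo, transcribe_memo and fake_recognize are hypothetical and do not come from the disclosure, and the recognizer is a stand-in function rather than a real ASR model (a real system would run feature extraction, an acoustic model, and decoding at that point).

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class VoiceMemo:
    """A voice memo made of one or more audio segments (raw bytes here)."""
    segments: List[bytes]
    text: str = ""


def transcribe_memo(memo: VoiceMemo, recognize: Callable[[bytes], str]) -> VoiceMemo:
    """Run the recognizer over every segment and join the recognized text."""
    parts = [recognize(segment).strip() for segment in memo.segments]
    memo.text = " ".join(part for part in parts if part)
    return memo


def fake_recognize(audio: bytes) -> str:
    """Stand-in recognizer (assumption): treats the bytes as UTF-8 text."""
    return audio.decode("utf-8")


memo = transcribe_memo(
    VoiceMemo([b"buy bread", b"at the supermarket tomorrow"]),
    fake_recognize,
)
print(memo.text)
```

Passing the recognizer in as a parameter keeps the sketch independent of any particular speech recognition engine.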
In step 103, a label corresponding to the voice memo information is determined according to the voice memo text information.
For example, determining the label according to the voice memo text information can be understood as parsing the voice memo text information to obtain a semantic understanding of it, and taking the one or more keywords corresponding to that result as the label corresponding to the voice memo information.
After the label corresponding to the voice memo text information is determined, the voice memo text information and the label can be stored in association, so that the label identifies the voice memo text information. The determined label may be, for example: work, shopping, study, and so on.
For example, before the label is determined according to the voice memo text information, a label list can be obtained. The label list indicates the stored labels and the correspondence between each stored label and the voice memo text it identifies. Obtaining the label list can be understood as obtaining a label list input by the user through a human-computer interaction device on the electronic device, such as a keyboard, touch screen, or microphone; it can also be understood as reading a label list stored in advance on the electronic device, or receiving a label list sent by another device or system. When the label is determined, the voice memo text information can be searched for the words in the label list; when a word from the label list is found in the voice memo text information, the label that corresponds to that word in the label list can be determined as the label corresponding to the voice memo information. Note that when no word from the label list is found in the voice memo text information, it can be determined that the voice memo information has no corresponding label.
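The label-list lookup described above can be sketched as follows. The structure of the list (a mapping from trigger word to label), the sample entries, and the function name labels_from_list are assumptions made for illustration; the disclosure does not specify a data format.

```python
from typing import Dict, List

# Hypothetical label list: word to look for in the memo text -> label.
LABEL_LIST: Dict[str, str] = {
    "supermarket": "shopping",
    "buy": "shopping",
    "meeting": "work",
    "exam": "study",
}


def labels_from_list(text: str, label_list: Dict[str, str]) -> List[str]:
    """Return the labels whose trigger words occur in the memo text.

    Matching is a plain case-insensitive substring test, so "exam" would
    also match "example"; a real system would tokenize the text first.
    An empty list means the memo has no corresponding label.
    """
    lowered = text.lower()
    found: List[str] = []
    for word, label in label_list.items():
        if word in lowered and label not in found:
            found.append(label)
    return found


print(labels_from_list("Go to the supermarket to buy bread tomorrow", LABEL_LIST))
print(labels_from_list("Call mom", LABEL_LIST))
```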
In the technical solution provided by the embodiments of the present disclosure, voice memo information including one or more segments of audio information is obtained, speech recognition is performed on the audio information to obtain the corresponding voice memo text information, and a label corresponding to the voice memo information is determined according to that text information. Because the label is associated with the content of the voice memo text information, a user searching for voice memo information can quickly learn the content of a memo from its label and find the memo they need, which improves the user experience.
In one embodiment, as shown in Fig. 1b, the voice memo information processing method provided by the embodiments of the present disclosure further includes the following step 1041:
In step 1041, in response to the voice memo text information including a date and/or time, corresponding voice memo prompt information is output according to the date and/or time.
Outputting the corresponding voice memo prompt information comprises: displaying it as text, and/or playing it as audio.
For example, the voice memo text information including a date and/or time can be understood as the voice memo text information including one or more pieces of text that indicate a specific date and/or time. A date can be understood as a specific date. A time can be understood as a moment or a period, for example a specific period each day, a preset interval before and after a given moment, or a moment or period on a default date (such as the current day). A date and time can be understood as a specific period within a specific date, or a preset interval before and after a moment on that date.
In this embodiment, once the date, moment, or period indicated by the date and/or time in the voice memo text information is determined, the corresponding voice memo prompt information can be output when that date, moment, or period arrives, or at a preset time before it arrives. The preset time can be configured by the user or can be a default value, which is not described further here.
Outputting the corresponding voice memo prompt information according to the date and/or time, in response to the voice memo text information including a date and/or time, helps the user avoid missing items related to that date and thereby improves the user experience.
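A minimal sketch of step 1041 is given below, assuming English memo text. The recognized patterns ("today", "tomorrow", and a 24-hour HH:MM time), the 09:00 default used when only a date is present, and the 30-minute lead time are all assumptions chosen for the example; the disclosure leaves date and time parsing and the preset time open.

```python
import re
from datetime import datetime, timedelta
from typing import Optional

# 24-hour clock time such as 9:05 or 18:30.
TIME_RE = re.compile(r"\b([01]?\d|2[0-3]):([0-5]\d)\b")


def find_due(text: str, now: datetime) -> Optional[datetime]:
    """Return the date/time a memo refers to, or None if it names neither."""
    lowered = text.lower()
    day = now.date()  # a time with no date defaults to the current day
    has_date = False
    if "tomorrow" in lowered:
        day = day + timedelta(days=1)
        has_date = True
    elif "today" in lowered:
        has_date = True
    match = TIME_RE.search(lowered)
    if match is None and not has_date:
        return None
    # Assumption: a date with no time is treated as 09:00 on that date.
    hour, minute = (int(match.group(1)), int(match.group(2))) if match else (9, 0)
    return datetime(day.year, day.month, day.day, hour, minute)


def reminder_time(due: datetime, lead: timedelta = timedelta(minutes=30)) -> datetime:
    """Moment at which to output the prompt: a preset lead time before `due`."""
    return due - lead


now = datetime(2019, 5, 14, 20, 0)
due = find_due("Go to the supermarket tomorrow at 10:30", now)
print(due, reminder_time(due))
```

At the computed reminder time the device would then display the memo text or play the memo audio.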
In one embodiment, as shown in Fig. 1c, the voice memo information processing method provided by the embodiments of the present disclosure further includes the following step 1042:
In step 1042, in response to the voice memo text information including a date and/or time, a schedule creation interface is called, and a schedule is created according to the date and/or time.
The content of the created schedule includes the voice memo text information.
For example, creating a schedule according to the date and/or time can be understood as calling the corresponding schedule management application to create a schedule corresponding to the date and/or time. Further, part or all of the voice memo text information can be filled into the schedule corresponding to the date and/or time, so that the user can manage it through the schedule management application.
Calling the schedule creation interface and creating a schedule according to the date and/or time, in response to the voice memo text information including a date and/or time, helps the user avoid missing items related to that date and thereby improves the user experience.
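Step 1042 can be illustrated with an in-memory stand-in for the schedule creation interface. ScheduleEntry, InMemoryCalendar and create_schedule_from_memo are hypothetical names; a real device would call the interface of its own calendar application instead.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional


@dataclass
class ScheduleEntry:
    start: datetime
    content: str  # holds part or all of the voice memo text


@dataclass
class InMemoryCalendar:
    """Stand-in for a schedule management application (assumption)."""
    entries: List[ScheduleEntry] = field(default_factory=list)

    def create(self, start: datetime, content: str) -> ScheduleEntry:
        entry = ScheduleEntry(start, content)
        self.entries.append(entry)
        return entry


def create_schedule_from_memo(
    calendar: InMemoryCalendar, memo_text: str, due: Optional[datetime]
) -> Optional[ScheduleEntry]:
    """Create a schedule only when the memo text carried a date and/or time."""
    if due is None:
        return None
    return calendar.create(due, memo_text)


calendar = InMemoryCalendar()
entry = create_schedule_from_memo(
    calendar, "Dentist tomorrow at 10:30", datetime(2019, 5, 15, 10, 30)
)
print(entry)
```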
In one embodiment, as shown in Fig. 1d, the voice memo information processing method provided by the embodiments of the present disclosure further includes the following steps 105 to 106:
In step 105, label adjustment instruction information input by the user is obtained.
The label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label.
For example, the label adjustment instruction information input by the user can be obtained through a human-computer interaction device on the electronic device, such as a keyboard, touch screen, or microphone; it can also be understood as receiving label adjustment instruction information sent by another device or system.
The label adjustment instruction information input by the user can also be obtained by obtaining label adjustment instruction voice information and performing speech recognition on it.
Obtaining the label adjustment instruction information by speech recognition lets the user modify, add, or delete the labels corresponding to voice memo information simply by speaking the instruction, without typing, which improves the user experience.
In step 106, the corresponding label is adjusted according to the label adjustment instruction information.
The label adjustment instruction information is text information or voice information.
For example, adjusting the label corresponding to the voice memo information according to the label adjustment instruction information can be understood as deleting one or more labels from the labels corresponding to the voice memo information, adding one or more labels to them, or replacing one or more of them with other labels, as the instruction information indicates.
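The three kinds of adjustment (add, delete, replace) can be sketched as a single function. The action names "add", "delete" and "modify" and the function name adjust_labels are assumptions for illustration; the instruction is taken to have already been turned into text, whether it was typed or spoken.

```python
from typing import List, Optional


def adjust_labels(
    labels: List[str], action: str, label: str, new_label: Optional[str] = None
) -> List[str]:
    """Return a new label list with one adjustment applied.

    action is "add", "delete", or "modify" (replace `label` by `new_label`).
    """
    result = list(labels)  # leave the caller's list untouched
    if action == "add":
        if label not in result:
            result.append(label)
    elif action == "delete":
        if label in result:
            result.remove(label)
    elif action == "modify":
        if new_label is None:
            raise ValueError("modify needs a new_label")
        result = [new_label if item == label else item for item in result]
    else:
        raise ValueError(f"unknown action: {action}")
    return result


labels = ["shopping"]
labels = adjust_labels(labels, "add", "weekend")
labels = adjust_labels(labels, "modify", "shopping", "groceries")
print(labels)
```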
Obtaining the label adjustment instruction information input by the user and adjusting the corresponding label accordingly makes it easy for the user to modify the labels corresponding to voice memo information, so that the modified labels describe the voice memo information more accurately and the user can find voice memo information by those labels, which improves the user experience.
In one embodiment, as shown in Fig. 1e, the voice memo information processing method provided by the embodiments of the present disclosure further includes the following steps 107 to 109:
In step 107, label query audio information is obtained, and speech recognition is performed on the label query audio information to obtain label query text information corresponding to the query audio information.
In step 108, a query label that matches the label query text information is determined from the existing labels.
In step 109, the voice memo information recorded in correspondence with the query label is determined, and the determined voice memo information is output.
Outputting the determined voice memo information comprises: displaying it as text, and/or playing it as audio.
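Step 108 can be sketched as follows. The disclosure only requires that the query label "match" the recognized query text; the fallback to approximate matching with Python's difflib (to tolerate small recognition errors) and the 0.8 cutoff are assumptions of this sketch, as is the function name match_query_label.

```python
import difflib
from typing import List, Optional


def match_query_label(
    query_text: str, existing_labels: List[str], cutoff: float = 0.8
) -> Optional[str]:
    """Return the existing label that matches the recognized query text.

    Tries an exact case-insensitive match first, then the closest label
    whose similarity ratio reaches `cutoff`; returns None otherwise.
    """
    query = query_text.strip().lower()
    by_lower = {label.lower(): label for label in existing_labels}
    if query in by_lower:
        return by_lower[query]
    close = difflib.get_close_matches(query, list(by_lower), n=1, cutoff=cutoff)
    return by_lower[close[0]] if close else None


existing = ["work", "shopping", "study"]
print(match_query_label("Shopping", existing))
print(match_query_label("shoping", existing))
print(match_query_label("holiday", existing))
```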
The prompt information for the queried voice memo includes the text information of the voice memo information corresponding to the query label, or is used to prompt playback of the audio information in the voice memo information corresponding to the query label.
Further, when one label corresponds to multiple pieces of voice memo information, the pieces can be output one after another in a preset order (for example, in order of storage time); alternatively, prompt information can be output, a selection instruction can be received from the user, and the voice memo information the user selects can be played. The prompt information that is output includes, but is not limited to, the storage time of each piece of voice memo information, its second-level keyword, and so on. Note that a first-level keyword and a second-level keyword can be extracted when speech recognition is performed on each piece of voice memo information: the first-level keyword can serve as the label, and the second-level keyword can further identify the voice memo information. For example, suppose the voice memo information of May 14 says to go to the supermarket to buy bread tomorrow, and the voice memo information of May 29 says to go shopping in Wangfujing at the weekend. The first-level keyword parsed from both can be "shopping"; the second-level keyword parsed from the first can be "supermarket", and the second-level keyword parsed from the second can be "Wangfujing". The first-level keyword serves as the label of both pieces of voice memo information, and the second-level keywords identify each piece individually so that the two can be told apart. On the same principle, multi-level labels can be parsed, with each level more specific than the level above it, to better manage the storage of multiple pieces of voice memo information.
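The supermarket and Wangfujing example can be written out as a small sketch. StoredMemo, query_by_label and prompt_lines are hypothetical names, and the year 2019 is an arbitrary choice since the text gives only the month and day.

```python
from dataclasses import dataclass
from datetime import date
from typing import List


@dataclass
class StoredMemo:
    stored_on: date
    text: str
    label: str      # first-level keyword
    secondary: str  # second-level keyword


def query_by_label(memos: List[StoredMemo], query_label: str) -> List[StoredMemo]:
    """Memos recorded under the label, ordered by storage time."""
    wanted = query_label.strip().lower()
    hits = [memo for memo in memos if memo.label.lower() == wanted]
    return sorted(hits, key=lambda memo: memo.stored_on)


def prompt_lines(hits: List[StoredMemo]) -> List[str]:
    """Prompt text per memo: storage date plus the second-level keyword."""
    return [f"{memo.stored_on.isoformat()} {memo.secondary}" for memo in hits]


memos = [
    StoredMemo(date(2019, 5, 29), "Go shopping in Wangfujing at the weekend",
               "shopping", "Wangfujing"),
    StoredMemo(date(2019, 5, 14), "Go to the supermarket to buy bread tomorrow",
               "shopping", "supermarket"),
]
for line in prompt_lines(query_by_label(memos, "Shopping")):
    print(line)
```

The user could then pick one of the prompt lines, and the device would display or play the selected memo.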
By obtaining label query audio information, performing speech recognition on it to obtain the corresponding label query text information, determining from the existing labels a query label that matches the label query text information, determining the voice memo information recorded in correspondence with the query label, and outputting the determined voice memo information, the user can conveniently look up the voice memo information corresponding to a label simply by speaking the query, without typing, and can be prompted to play the audio information in that voice memo information, which improves the user experience.
The following are device embodiments of the present disclosure, which can be used to carry out the method embodiments of the present disclosure.
Fig. 2 a is a kind of block diagram of the voice memos information processing unit 20 shown according to an exemplary embodiment, voice Memo information processing unit 20 can may be a part of electronic equipment, voice memos information processing unit for electronic equipment 20 being implemented in combination with as some or all of of electronic equipment by software, hardware or both.As shown in Figure 2 a, should Voice memos information processing unit 20 includes:
A voice memo information obtaining module 201, configured to obtain voice memo information, the voice memo information including one or more segments of audio information.
A speech recognition module 202, configured to perform speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information.
A label determining module 203, configured to determine, according to the voice memo text information, a label corresponding to the voice memo information.
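The three modules above form a simple pipeline, which can be sketched in Python as follows. This is a hedged illustration rather than the apparatus itself: the function process_voice_memo and its callable parameters are hypothetical, and fake_recognize and keyword_label are stand-ins for a real speech recognizer and a real label determination step, neither of which the disclosure specifies.

```python
from typing import Callable, Dict, List


def process_voice_memo(
    segments: List[bytes],
    recognize: Callable[[bytes], str],
    determine_label: Callable[[str], str],
) -> Dict[str, object]:
    """Obtain the memo (201), recognize its audio (202), and determine a label (203)."""
    text = " ".join(recognize(segment) for segment in segments)
    return {"audio": segments, "text": text, "label": determine_label(text)}


def fake_recognize(segment: bytes) -> str:
    # Stand-in recognizer: pretends the audio bytes are already UTF-8 text.
    return segment.decode("utf-8")


def keyword_label(text: str) -> str:
    # Stand-in label rule: a single hard-coded keyword check.
    return "shopping" if "buy" in text.lower() else "other"


result = process_voice_memo(
    [b"Go to the supermarket", b"to buy bread"], fake_recognize, keyword_label
)
```

Passing the recognizer and the label rule in as callables keeps the sketch independent of any particular speech recognition engine.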
In one embodiment, as shown in Fig. 2b, the voice memo information processing apparatus 20 further includes:
A date prompt module 2041, configured to, in response to the voice memo text information including a date and/or time, output corresponding voice memo prompt information according to the date and/or time;
Wherein outputting the corresponding voice memo prompt information includes: displaying it as text, and/or playing it as audio.
In one embodiment, as shown in Fig. 2c, the voice memo information processing apparatus 20 further includes:
A schedule creation module 2042, configured to, in response to the voice memo text information including a date and/or time, call a schedule creation interface and create a schedule according to the date and/or time;
Wherein the content of the created schedule includes the voice memo text information.
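As a rough Python sketch of the schedule creation step, the code below looks for a date (and optional time) in the recognized text and, if one is found, calls a schedule creation function with the memo text as content. Several things here are assumptions: the text is taken to contain dates in the form YYYY-MM-DD with an optional HH:MM, whereas real memo text would need far more flexible parsing, and create_schedule with its in-memory calendar list is a hypothetical stand-in for the device's schedule creation interface.

```python
import re
from datetime import datetime

# Assumed date format for illustration only: "YYYY-MM-DD" with an optional "HH:MM".
DATE_TIME = re.compile(r"(\d{4})-(\d{2})-(\d{2})(?:\s+(\d{1,2}):(\d{2}))?")

calendar = []  # hypothetical stand-in for the device calendar


def extract_datetime(text):
    """Return the first date/time found in the text, or None if there is none."""
    match = DATE_TIME.search(text)
    if match is None:
        return None
    year, month, day = (int(match.group(i)) for i in (1, 2, 3))
    hour = int(match.group(4)) if match.group(4) else 0
    minute = int(match.group(5)) if match.group(5) else 0
    try:
        return datetime(year, month, day, hour, minute)
    except ValueError:  # e.g. month 13 or hour 25
        return None


def create_schedule(when, content):
    # Hypothetical schedule creation interface.
    calendar.append({"when": when, "content": content})


def maybe_create_schedule(memo_text):
    """Create a schedule entry only when the memo text contains a date and/or time."""
    when = extract_datetime(memo_text)
    if when is None:
        return False
    create_schedule(when, memo_text)
    return True
```

The same extracted date and time could drive the prompt output described for the date prompt module, for example by scheduling a text or audio reminder.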
In one embodiment, as shown in Fig. 2d, the voice memo information processing apparatus 20 further includes:
A label adjustment instruction information obtaining module 205, configured to obtain label adjustment instruction information input by the user; wherein the label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label.
A label adjustment module 206, configured to adjust the corresponding label according to the label adjustment instruction information.
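The three kinds of label adjustment (modification, addition, deletion) can be illustrated with the short Python sketch below. The dictionary layout of the instruction and the function name adjust_labels are hypothetical; the disclosure only states that an instruction indicates a modification, addition, or deletion.

```python
def adjust_labels(labels, instruction):
    """Return a new label list after applying one adjustment instruction.

    The instruction is a dict with an "action" of "add", "delete", or "modify",
    a "label", and, for "modify", a "new_label" (hypothetical layout).
    """
    action = instruction["action"]
    result = list(labels)
    if action == "add":
        if instruction["label"] not in result:
            result.append(instruction["label"])
    elif action == "delete":
        result = [label for label in result if label != instruction["label"]]
    elif action == "modify":
        result = [
            instruction["new_label"] if label == instruction["label"] else label
            for label in result
        ]
    else:
        raise ValueError(f"unknown action: {action}")
    return result
```

Returning a new list rather than mutating the input keeps the original labels available, which is convenient if the user cancels the adjustment.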
In one embodiment, as shown in Fig. 2e, the voice memo information processing apparatus 20 further includes:
A label query audio information obtaining module 207, configured to obtain label query audio information and perform speech recognition on the label query audio information to obtain label query text information corresponding to the query audio information.
A query label determining module 208, configured to determine, from the existing labels, a query label that matches the label query text information.
A queried voice memo information prompt module 209, configured to determine the voice memo information recorded in correspondence with the query label, and output the determined voice memo information;
Wherein outputting the determined voice memo information includes: displaying it as text, and/or playing it as audio.
An embodiment of the present disclosure provides a voice memo information processing apparatus. The apparatus can obtain voice memo information including one or more segments of audio information, perform speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information, and determine, according to the voice memo text information, a label corresponding to the voice memo information. In the above steps, the label corresponding to the voice memo information can be understood as being associated with the content of the voice memo text information. When searching for voice memo information, the user can therefore quickly learn the content of the voice memo information from the label and so find the voice memo information needed, which improves the user experience.
Fig. 3 is a block diagram of a voice memo information processing apparatus 30 according to an exemplary embodiment. The voice memo information processing apparatus 30 may be a terminal or a part of a terminal, and includes:
A processor 301;
A memory 302 for storing instructions executable by the processor 301;
Wherein the processor 301 is configured to:
Obtain voice memo information, the voice memo information including one or more segments of audio information;
Perform speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
Determine, according to the voice memo text information, a label corresponding to the voice memo information.
In one embodiment, the processor 301 may be further configured to:
In response to the voice memo text information including a date and/or time, output corresponding voice memo prompt information according to the date and/or time;
Wherein outputting the corresponding voice memo prompt information includes: displaying it as text, and/or playing it as audio.
In one embodiment, the processor 301 may be further configured to:
In response to the voice memo text information including a date and/or time, call a schedule creation interface and create a schedule according to the date and/or time;
Wherein the content of the created schedule includes the voice memo text information.
In one embodiment, the processor 301 may be further configured to:
Obtain label adjustment instruction information input by the user; wherein the label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label;
Adjust the corresponding label according to the label adjustment instruction information;
Wherein the label adjustment instruction information is text information or voice information.
In one embodiment, the processor 301 may be further configured to:
Obtain label query audio information, and perform speech recognition on the label query audio information to obtain label query text information corresponding to the query audio information;
Determine, from the existing labels, a query label whose label text matches the label query text information;
Determine the queried voice memo information recorded in correspondence with the query label, and output prompt information indicating the determined queried voice memo information;
Wherein outputting the determined voice memo information includes: displaying it as text, and/or playing it as audio.
Fig. 4 is a block diagram of an apparatus 400 for processing voice memo information according to an exemplary embodiment. The apparatus 400 is applicable to an electronic device. For example, the apparatus 400 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, or the like.
The apparatus 400 may include one or more of the following components: a processing component 402, a memory 404, a power component 406, a multimedia component 408, an audio component 410, an input/output (I/O) interface 412, a sensor component 414, and a communication component 416.
The processing component 402 generally controls the overall operation of the apparatus 400, such as operations associated with display, telephone calls, data communication, camera operations, and recording operations. The processing component 402 may include one or more processors 420 to execute instructions so as to perform all or part of the steps of the methods described above. In addition, the processing component 402 may include one or more modules that facilitate interaction between the processing component 402 and other components. For example, the processing component 402 may include a multimedia module to facilitate interaction between the multimedia component 408 and the processing component 402.
The memory 404 is configured to store various types of data to support operation of the apparatus 400. Examples of such data include instructions for any application or method operating on the apparatus 400, contact data, phonebook data, messages, pictures, videos, and so on. The memory 404 may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc.
The power component 406 supplies power to the various components of the apparatus 400. The power component 406 may include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for the apparatus 400.
The multimedia component 408 includes a screen that provides an output interface between the apparatus 400 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. The touch sensors may sense not only the boundary of a touch or slide action, but also the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 408 includes a front camera and/or a rear camera. When the apparatus 400 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera may be a fixed optical lens system or have focusing and optical zoom capability.
The audio component 410 is configured to output and/or input audio signals. For example, the audio component 410 includes a microphone (MIC), which is configured to receive external audio signals when the apparatus 400 is in an operating mode, such as a call mode, a recording mode, or a speech recognition mode. The received audio signal may be further stored in the memory 404 or sent via the communication component 416. In some embodiments, the audio component 410 further includes a speaker for outputting audio signals.
The I/O interface 412 provides an interface between the processing component 402 and peripheral interface modules, which may be a keyboard, a click wheel, buttons, and the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
The sensor component 414 includes one or more sensors for providing status assessments of various aspects of the apparatus 400. For example, the sensor component 414 can detect the open/closed state of the apparatus 400 and the relative positioning of components, for example the display and keypad of the apparatus 400. The sensor component 414 can also detect a change in position of the apparatus 400 or of a component of the apparatus 400, the presence or absence of user contact with the apparatus 400, the orientation or acceleration/deceleration of the apparatus 400, and a change in temperature of the apparatus 400. The sensor component 414 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact. The sensor component 414 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 414 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 416 is configured to facilitate wired or wireless communication between the apparatus 400 and other devices. The apparatus 400 can access a wireless network based on a communication standard, such as an intercom private network, WiFi, 2G, 3G, 4G, or 5G, or a combination thereof. In one exemplary embodiment, the communication component 416 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In one exemplary embodiment, the communication component 416 further includes a near-field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the apparatus 400 may be implemented by one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors, or other electronic components, for performing the above method.
In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions is also provided, for example the memory 404 including instructions, where the instructions can be executed by the processor 420 of the apparatus 400 to perform the above method. For example, the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, or the like.
A non-transitory computer-readable storage medium, wherein the instructions in the storage medium, when executed by the processor of the apparatus 400, enable the apparatus 400 to perform the above voice memo information processing method, the method including:
Obtaining voice memo information, the voice memo information including one or more segments of audio information;
Performing speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
Determining, according to the voice memo text information, a label corresponding to the voice memo information.
In one embodiment, the method further includes:
In response to the voice memo text information including a date and/or time, outputting corresponding voice memo prompt information according to the date and/or time;
Wherein outputting the corresponding voice memo prompt information includes: displaying it as text, and/or playing it as audio.
In one embodiment, the method further includes:
In response to the voice memo text information including a date and/or time, calling a schedule creation interface and creating a schedule according to the date and/or time;
Wherein the content of the created schedule includes the voice memo text information.
In one embodiment, the method further includes:
Obtaining label adjustment instruction information input by the user; wherein the label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label;
Adjusting the corresponding label according to the label adjustment instruction information;
Wherein the label adjustment instruction information is text information or voice information.
In one embodiment, the method further includes:
Obtaining label query audio information, and performing speech recognition on the label query audio information to obtain label query text information corresponding to the query audio information;
Determining, from the existing labels, a query label whose label text matches the label query text information;
Determining the queried voice memo information recorded in correspondence with the query label, and outputting prompt information indicating the determined queried voice memo information;
Wherein outputting the determined voice memo information includes: displaying it as text, and/or playing it as audio.
Other embodiments of the present disclosure will readily occur to those skilled in the art upon consideration of the specification and practice of the disclosure herein. This application is intended to cover any variations, uses, or adaptations of the present disclosure that follow its general principles and include common knowledge or customary technical means in the art that are not disclosed herein. The specification and embodiments are to be regarded as exemplary only, with the true scope and spirit of the present disclosure being indicated by the following claims.
It should be understood that the present disclosure is not limited to the precise structures described above and shown in the drawings, and that various modifications and changes may be made without departing from its scope. The scope of the present disclosure is limited only by the appended claims.

Claims (12)

1. A voice memo information processing method, characterized by comprising:
Obtaining voice memo information, the voice memo information including one or more segments of audio information;
Performing speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
Determining, according to the voice memo text information, a label corresponding to the voice memo information.
2. The voice memo information processing method according to claim 1, characterized in that the method further comprises:
In response to the voice memo text information including a date and/or time, outputting corresponding voice memo prompt information according to the date and/or time;
Wherein outputting the corresponding voice memo prompt information includes: displaying it as text, and/or playing it as audio.
3. The voice memo information processing method according to claim 1, characterized in that the method further comprises:
In response to the voice memo text information including a date and/or time, calling a schedule creation interface and creating a schedule according to the date and/or time;
Wherein the content of the created schedule includes the voice memo text information.
4. The voice memo information processing method according to claim 1, characterized in that the method further comprises:
Obtaining label adjustment instruction information input by the user; wherein the label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label;
Adjusting the corresponding label according to the label adjustment instruction information;
Wherein the label adjustment instruction information is text information or voice information.
5. The voice memo information processing method according to any one of claims 1-4, characterized in that the method further comprises:
Obtaining label query audio information, and performing speech recognition on the label query audio information to obtain label query text information corresponding to the query audio information;
Determining, from the existing labels, a query label that matches the label query text information;
Determining the voice memo information recorded in correspondence with the query label, and outputting the determined voice memo information;
Wherein outputting the determined voice memo information includes: displaying it as text, and/or playing it as audio.
6. A voice memo information processing apparatus, characterized by comprising:
A voice memo information obtaining module, configured to obtain voice memo information, the voice memo information including one or more segments of audio information;
A speech recognition module, configured to perform speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
A label determining module, configured to determine, according to the voice memo text information, a label corresponding to the voice memo information.
7. The voice memo information processing apparatus according to claim 6, characterized in that the apparatus further comprises:
A date prompt module, configured to, in response to the voice memo text information including a date and/or time, output corresponding voice memo prompt information according to the date and/or time;
Wherein outputting the corresponding voice memo prompt information includes: displaying it as text, and/or playing it as audio.
8. The voice memo information processing apparatus according to claim 6, characterized in that the apparatus further comprises:
A schedule creation module, configured to, in response to the voice memo text information including a date and/or time, call a schedule creation interface and create a schedule according to the date and/or time;
Wherein the content of the created schedule includes the voice memo text information.
9. The voice memo information processing apparatus according to claim 6, characterized in that the apparatus further comprises:
A label adjustment instruction information obtaining module, configured to obtain label adjustment instruction information input by the user; wherein the label adjustment instruction information includes modification instruction information, addition instruction information, or deletion instruction information for a label;
A label adjustment module, configured to adjust the corresponding label according to the label adjustment instruction information;
Wherein the label adjustment instruction information is text information or voice information.
10. The voice memo information processing apparatus according to any one of claims 6-9, characterized in that the apparatus further comprises:
A label query audio information obtaining module, configured to obtain label query audio information and perform speech recognition on the label query audio information to obtain label query text information corresponding to the query audio information;
A query label determining module, configured to determine, from the existing labels, a query label that matches the label query text information;
A queried voice memo information prompt module, configured to determine the voice memo information recorded in correspondence with the query label, and output the determined voice memo information;
Wherein outputting the determined voice memo information includes: displaying it as text, and/or playing it as audio.
11. A voice memo information processing apparatus, characterized by comprising:
A processor;
A memory for storing instructions executable by the processor;
Wherein the processor is configured to:
Obtain voice memo information, the voice memo information including one or more segments of audio information;
Perform speech recognition on the audio information in the voice memo information to obtain voice memo text information corresponding to the voice memo information;
Determine, according to the voice memo text information, a label corresponding to the voice memo information.
12. A computer-readable storage medium having computer instructions stored thereon, characterized in that the instructions, when executed by a processor, implement the steps of the method according to any one of claims 1-5.
CN201910647159.5A 2019-07-17 2019-07-17 Voice memos information processing method and device Pending CN110415703A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910647159.5A CN110415703A (en) 2019-07-17 2019-07-17 Voice memos information processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910647159.5A CN110415703A (en) 2019-07-17 2019-07-17 Voice memos information processing method and device

Publications (1)

Publication Number Publication Date
CN110415703A true CN110415703A (en) 2019-11-05

Family

ID=68361825

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910647159.5A Pending CN110415703A (en) 2019-07-17 2019-07-17 Voice memos information processing method and device

Country Status (1)

Country Link
CN (1) CN110415703A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110931010A (en) * 2019-12-17 2020-03-27 用友网络科技股份有限公司 Voice control system
CN110992984A (en) * 2019-12-02 2020-04-10 新华智云科技有限公司 Audio processing method and device and storage medium
CN112735402A (en) * 2020-12-14 2021-04-30 厦门盈趣科技股份有限公司 Memorandum device and method

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120203551A1 (en) * 2011-02-04 2012-08-09 International Business Machines Corporation Automated follow up for e-meetings
CN104038630A (en) * 2014-05-28 2014-09-10 小米科技有限责任公司 Speech processing method and device
CN106327151A (en) * 2016-08-15 2017-01-11 捷开通讯(深圳)有限公司 Note recording method and system based on voice recognition
CN107038220A (en) * 2017-03-20 2017-08-11 北京光年无限科技有限公司 Method, intelligent robot and system for generating memorandum
CN108229916A (en) * 2018-01-02 2018-06-29 京东方科技集团股份有限公司 A kind of information prompting method and device
CN108737663A (en) * 2018-06-30 2018-11-02 上海爱优威软件开发有限公司 A kind of call householder method and terminal based on memo information
CN109669710A (en) * 2018-11-30 2019-04-23 维沃移动通信有限公司 Note processing method and terminal

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110992984A (en) * 2019-12-02 2020-04-10 新华智云科技有限公司 Audio processing method and device and storage medium
CN110931010A (en) * 2019-12-17 2020-03-27 用友网络科技股份有限公司 Voice control system
CN112735402A (en) * 2020-12-14 2021-04-30 厦门盈趣科技股份有限公司 Memorandum device and method
CN112735402B (en) * 2020-12-14 2024-01-19 厦门盈趣科技股份有限公司 Memorandum device and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191105

RJ01 Rejection of invention patent application after publication