CN110415703A - Voice memos information processing method and device - Google Patents
Voice memos information processing method and device Download PDFInfo
- Publication number
- CN110415703A CN110415703A CN201910647159.5A CN201910647159A CN110415703A CN 110415703 A CN110415703 A CN 110415703A CN 201910647159 A CN201910647159 A CN 201910647159A CN 110415703 A CN110415703 A CN 110415703A
- Authority
- CN
- China
- Prior art keywords
- information
- voice memos
- voice
- label
- memos
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000010365 information processing Effects 0.000 title claims abstract description 47
- 238000003672 processing method Methods 0.000 title claims abstract description 22
- 238000000034 method Methods 0.000 claims abstract description 23
- 230000004044 response Effects 0.000 claims description 18
- 238000012986 modification Methods 0.000 claims description 10
- 230000004048 modification Effects 0.000 claims description 10
- 238000010586 diagram Methods 0.000 description 15
- 238000004891 communication Methods 0.000 description 10
- 238000012545 processing Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 6
- 238000007726 management method Methods 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 230000005236 sound signal Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000001133 acceleration Effects 0.000 description 2
- 230000000712 assembly Effects 0.000 description 2
- 238000000429 assembly Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 230000033764 rhythmic process Effects 0.000 description 2
- KLDZYURQCUYZBL-UHFFFAOYSA-N 2-[3-[(2-hydroxyphenyl)methylideneamino]propyliminomethyl]phenol Chemical compound OC1=CC=CC=C1C=NCCCN=CC1=CC=CC=C1O KLDZYURQCUYZBL-UHFFFAOYSA-N 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 201000001098 delayed sleep phase syndrome Diseases 0.000 description 1
- 208000033921 delayed sleep phase type circadian rhythm sleep disease Diseases 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 239000012092 media component Substances 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
- H04M1/72433—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The disclosure is directed to voice memos information processing method and devices.This method comprises: obtaining voice memos information, voice memos information includes one or more snippets audio-frequency information;Speech recognition is carried out to the audio-frequency information in voice memos information, to obtain voice memos text information corresponding with voice memos information;Label corresponding with voice memos information is determined according to voice memos text information.The technical solution can automatically generate target labels corresponding with voice memos information, make user when searching voice memos information, the content of the voice memos information is conveniently understood according to the target labels, so that the voice memos information needed for finding oneself, improves user experience.So as to improve user experience.
Description
Technical field
This disclosure relates to control technology field more particularly to voice memos information processing method and device.
Background technique
With the gradually quickening lived in modern society with work rhythm, schedule often arranges people in daily life
It is more nervous, forget that the situation for needing the thing done happens occasionally.It, can be standby by recording in order to avoid there is above-mentioned condition
The mode for forgetting record reminds people's thing to be treated, wherein memo information is recorded in a manner of voice, it can be without typewriting
Under the premise of, by saying voice memos information, electronic equipment is made to acquire the voice memos information and storing, at any time so as to user
Check or play the voice memos information.
Summary of the invention
To overcome the problems in correlation technique, embodiment of the disclosure provides a kind of voice memos information processing method
And device.Technical solution is as follows:
It is according to an embodiment of the present disclosure in a first aspect, providing a kind of voice memos information processing method, comprising:
Voice memos information is obtained, voice memos information includes one or more snippets audio-frequency information;
Speech recognition is carried out to the audio-frequency information in voice memos information, to obtain voice corresponding with voice memos information
Memorandum text information;
Label corresponding with voice memos information is determined according to voice memos text information.
It include one or more snippets audio-frequency information voice memos by obtaining in the technical scheme provided by this disclosed embodiment
Information carries out speech recognition to the audio-frequency information in voice memos information, standby to obtain voice corresponding with voice memos information
Forget text information, and label corresponding with the voice memos information is determined according to voice memos text information.In above-mentioned steps
In, label corresponding with the voice memos information can be understood as it is associated with the content of the voice memos text information, because
This makes user when searching voice memos information, can be conveniently understood in the voice memos information according to the label
Hold, so that the voice memos information needed for finding oneself, improves user experience.
In one embodiment, method further include:
Include date and/or time in response to voice memos text information, corresponding language is exported according to date and/or time
Sound memorandum prompt information;
Wherein, corresponding voice memos prompt information is exported, comprising: show output, and/or in a manner of text with audio
Form play output.
In one embodiment, method further include:
Include date and/or time in response to voice memos text information, schedule called to create interface, according to the date and/
Or the time creates schedule;
Wherein, created calendar content includes voice memos text information.
In one embodiment, method further include:
Obtain the label adjustment instruction information of user's input;Wherein, the label adjustment instruction information includes to label
Modification instruction information, addition instruction information delete instruction information;
Instruction information is adjusted according to label to be adjusted corresponding label;
Wherein, the label adjustment instruction information is text information or voice messaging.
In one embodiment, method further include:
Tag queries audio-frequency information is obtained, and speech recognition is carried out to tag queries audio-frequency information, to obtain and inquire sound
The corresponding tag queries text information of frequency information;
From the matched inquiry tag of label text with tag queries text information determining in existing label;
The determining voice inquirement memo information with inquiry tag corresponding record, and export determining instruction voice inquirement memorandum
Prompt information;
Wherein, determining voice memos information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
Second aspect according to an embodiment of the present disclosure provides a kind of voice memos information processing unit, comprising:
Voice memos data obtaining module, for obtaining voice memos information, the voice memos information include one section or
Multistage audio-frequency information;
Speech recognition module, for in the voice memos information audio-frequency information carry out speech recognition, with obtain with
The corresponding voice memos text information of the voice memos information;
Label determining module, for corresponding with the voice memos information according to voice memos text information determination
Label.
In one embodiment, described device further include:
Date prompt module, for including date and/or time in response to the voice memos text information, according to described
Date and/or time export corresponding voice memos prompt information;
Wherein, corresponding voice memos prompt information is exported, comprising: show output, and/or in a manner of text with audio
Form play output.
In one embodiment, described device further include:
Schedule creation module calls schedule for including date and/or time in response to the voice memos text information
Interface is created, schedule is created according to the date and/or time;
Wherein, created calendar content includes the voice memos text information.
In one embodiment, described device further include:
Label adjustment instruction data obtaining module, for obtaining the label adjustment instruction information of user's input;Wherein, described
Label adjustment instruction information includes that the modification to label indicates information, addition instruction information or deletes instruction information;
Label adjusts module, is adjusted for adjusting instruction information according to the label to corresponding label;
Wherein, the label adjustment instruction information is text information or voice messaging.
In one embodiment, described device further include:
Tag queries audio-frequency information obtains module, for obtaining tag queries audio-frequency information, and to the tag queries sound
Frequency information carries out speech recognition, to obtain tag queries text information corresponding with the inquiry audio-frequency information;
Inquiry tag determining module, for being determined from existing label and the tag queries text information is matched looks into
Ask label;;
Voice inquirement memo information cue module is believed for the determining voice memos with the inquiry tag corresponding record
Breath, and export determining voice memos information;
Wherein, determining voice memos information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
The third aspect according to an embodiment of the present disclosure provides a kind of voice memos information processing unit, comprising:
Processor;
Memory for storage processor executable instruction;
Wherein, the processor is configured to:
Voice memos information is obtained, the voice memos information includes one or more snippets audio-frequency information;
Speech recognition is carried out to the audio-frequency information in the voice memos information, to obtain and the voice memos information pair
The voice memos text information answered;
Label corresponding with the voice memos information is determined according to the voice memos text information.
Fourth aspect according to an embodiment of the present disclosure provides a kind of computer readable storage medium, is stored thereon with meter
Calculation machine instruction, which is characterized in that any one of the first aspect of embodiment of the disclosure is realized when the instruction is executed by processor
The step of method.
It should be understood that above general description and following detailed description be only it is exemplary and explanatory, not
The disclosure can be limited.
Detailed description of the invention
The drawings herein are incorporated into the specification and forms part of this specification, and shows the implementation for meeting the disclosure
Example, and together with specification for explaining the principles of this disclosure.
Fig. 1 a is the flow diagram of voice memos information processing method shown according to an exemplary embodiment;
Fig. 1 b is the flow diagram of voice memos information processing method shown according to an exemplary embodiment;
Fig. 1 c is the flow diagram of voice memos information processing method shown according to an exemplary embodiment;
Fig. 1 d is the flow diagram of voice memos information processing method shown according to an exemplary embodiment;
Fig. 1 e is the flow diagram of voice memos information processing method shown according to an exemplary embodiment;
Fig. 2 a is the structural schematic diagram of voice memos information processing unit shown according to an exemplary embodiment;
Fig. 2 b is the structural schematic diagram of voice memos information processing unit shown according to an exemplary embodiment;
Fig. 2 c is the structural schematic diagram of voice memos information processing unit shown according to an exemplary embodiment;
Fig. 2 d is the structural schematic diagram of voice memos information processing unit shown according to an exemplary embodiment;
Fig. 2 e is the structural schematic diagram of voice memos information processing unit shown according to an exemplary embodiment;
Fig. 3 is a kind of block diagram of device shown according to an exemplary embodiment;
Fig. 4 is a kind of block diagram of device shown according to an exemplary embodiment.
Specific embodiment
Example embodiments are described in detail here, and the example is illustrated in the accompanying drawings.Following description is related to
When attached drawing, unless otherwise indicated, the same numbers in different drawings indicate the same or similar elements.Following exemplary embodiment
Described in embodiment do not represent all implementations consistent with this disclosure.On the contrary, they be only with it is such as appended
The example of the consistent device and method of some aspects be described in detail in claims, the disclosure.
With the gradually quickening lived in modern society with work rhythm, schedule often arranges people in daily life
It is more nervous, forget that the situation for needing the thing done happens occasionally.It, can be standby by recording in order to avoid there is above-mentioned condition
The mode for forgetting record reminds people's thing to be treated.With the high speed development of science and technology and constantly mentioning for people's living standard
Height, electronic equipment such as smart phone, tablet computer etc. starts to be widely used in people's lives in recent years.It is set by electronics
Standby, user can acquire the voice messaging oneself said, and the voice messaging is standby as voice under the premise of without typewriting
Forget information, i.e., record memo information in a manner of voice, electronic equipment is made to acquire the voice memos information and store, so as to user
Check or play at any time the voice memos information.
Although above scheme, which is able to use family, can conveniently record memo information, when the language of electronic equipment storage
When sound memo information is excessive, user possibly can not determine the content in voice memos information, so that user be made not look for easily
Voice memos information needed for oneself, compromises user experience.
It to solve the above-mentioned problems, include one section or more by obtaining in the technical scheme provided by this disclosed embodiment
Section audio information speech memo information carries out speech recognition to the audio-frequency information in voice memos information, standby with voice to obtain
Forget the corresponding voice memos text information of information, and corresponding with the voice memos information according to the determination of voice memos text information
Label.In above-mentioned steps, label corresponding with the voice memos information be can be understood as and the voice memos text envelope
The associated keyword or keyword extracted from the voice memos text information of the content of breath, therefore user is made to search voice
When memo information, the key content of the voice memos information can be conveniently understood according to the label, to find oneself
Required voice memos information, improves user experience.
Embodiment of the disclosure provides a kind of voice memos information processing method, and this method can be applied to electronic equipment
Such as Intelligent mobile communication terminal, tablet computer, computer, game console, Medical Devices, body-building equipment, individual digital help
Reason etc. includes the following steps 101 to step 104 as shown in Figure 1a:
In a step 101, voice memos information is obtained.
Wherein, voice memos information includes one or more snippets audio-frequency information;
Illustratively, voice memos information is obtained, it can be understood as controlling electronic devices acquires user speech, makees to obtain
For one or more snippets audio-frequency information of voice memos information, it is understood that read from electronic equipment including one or more snippets
The voice memos information of audio-frequency information, or receive the voice including one or more snippets audio-frequency information of other device or systems transmission
Memo information.
In a step 102, speech recognition is carried out to the audio-frequency information in voice memos information, is believed with obtaining with voice memos
Cease corresponding voice memos text information.
Illustratively, speech recognition (Automatic Speech is carried out to the audio-frequency information in voice memos information
Recognition, ASR) can be for the audio-frequency information in voice memos information be encoded (feature extraction), and it will be after coding
The trained deep neural network of information input or hidden Markov model to obtain recognition result, recognition result is passed through
It is exported after decoding.
In step 103, label corresponding with voice memos information is determined according to voice memos text information.
Illustratively, label corresponding with voice memos information is determined according to voice memos text information, it can be understood as
Voice memos text information is parsed, obtain carrying out voice memos text information semantic understanding as a result, and by the knot
The corresponding one or more keywords of fruit are determined as label corresponding with voice memos information.
It, can be by the voice memos text information and the mark after determining the corresponding label of voice memos text information
Associated storage is signed, voice memos text information is identified by the label.The label parsed for example can be with are as follows: work,
Shopping, study etc..
Illustratively, it before determining label corresponding with voice memos information according to voice memos text information, can obtain
List of labels is taken, which, which is used to indicate, has stored label and stored pair between the identified voice memos text of label
It should be related to, obtain list of labels, it can be understood as pass through the human-computer interaction device on electronic equipment such as keyboard, touch screen, wheat
Gram wind etc. obtains the list of labels of user's input, it is understood that read the list of labels stored in advance from electronic equipment,
Or receive the list of labels of other device or systems transmission.According to the determination of voice memos text information and voice memos information pair
When the label answered, it can be retrieved in voice memos text information according to the word in the list of labels, when in voice memos
It, can corresponding label be true in list of labels by the word retrieved when retrieving the word in the list of labels in text information
It is set to label corresponding with voice memos information.It should be noted that when not retrieving label in voice memos text information
When any word in list, it can determine that voice memos information does not have any corresponding label.
It include one or more snippets audio-frequency information voice memos by obtaining in the technical scheme provided by this disclosed embodiment
Information carries out speech recognition to the audio-frequency information in voice memos information, standby to obtain voice corresponding with voice memos information
Forget text information, and label corresponding with voice memos information is determined according to voice memos text information.In above-mentioned steps, with
The corresponding label of voice memos information can be understood as associated with the content of the voice memos text information, therefore user be made to exist
When searching voice memos information, the content of the voice memos information can be conveniently understood according to the label, to find
Voice memos information needed for oneself, improves user experience.
In one embodiment, as shown in Figure 1 b, the voice memos information processing method that embodiment of the disclosure provides is also
Include the following steps 1041:
Include date and/or time in response to voice memos text information in step 1041, according to the date and/or when
Between export corresponding voice memos prompt information.
Wherein, wherein export corresponding voice memos prompt information, comprising: output is shown in a manner of text, and/or
Output is played in the form of audio.
Illustratively, voice memos text information includes date and/or time, it can be understood as voice memos text information
Including being used to indicate the date of some determination and/or one or more snippets text information of time, wherein the date be can be understood as
Some date determined;Time can be understood as a moment or period, such as: in the period that some is determined daily
Or before and after the moment in preset time section, or certain moment or period for defaulting the date (such as same day) etc.;Date and time
It can be understood as in determining period in a determining date or before and after the moment in preset time section.
The date of the date for including in determining voice memos text information in the present embodiment and/or time meaning or
Person's moment perhaps can be set the date or moment after the period or the period arrives the moment or pre- before arriving
If the time exports corresponding voice memos prompt information, which can be configured by user when being also possible to default
Between, which is not described herein again.
Include date and/or time in response to voice memos text information, corresponding language is exported according to date and/or time
Sound memorandum prompt information can miss item relevant to the date, to avoid user so as to improve user experience.
In one embodiment, as illustrated in figure 1 c, the voice memos information processing method that embodiment of the disclosure provides is also
Include the following steps 1042:
In step 1042, includes date and/or time in response to voice memos text information, schedule creation is called to connect
Mouthful, schedule is created according to date and/or time.
Wherein, creation calendar content includes voice memos text information.
Illustratively, schedule is created according to date and/or time, it can be understood as call corresponding schedule management application journey
Sequence creates corresponding with the date and/or time schedule, further, can also by the part of voice memos text information or
It all inserts in schedule corresponding with the date and/or time, so that user carries out pipe to it by calendar application
Reason.
Include date and/or time in response to voice memos text information, schedule called to create interface, according to the date and/
Or the time creates schedule, can miss item relevant to the date, to avoid user so as to improve user experience.
In one embodiment, as shown in Figure 1 d, the voice memos information processing method that embodiment of the disclosure provides is also
Include the following steps 105 to step 106:
In step 105, the label adjustment instruction information of user's input is obtained.
Wherein, label adjustment instruction information includes that the modification to label indicates information, addition instruction information or deletes instruction
Information.
Illustratively, the label adjustment instruction information for obtaining user's input, can be to pass through the man-machine friendship on electronic equipment
Mutual device such as keyboard, touch screen, microphone etc. obtains the label adjustment instruction information of user's input, it is understood that receive
The label adjustment instruction information that other device or systems are sent.
The mark label adjustment instruction information that user inputs is obtained, can also indicate voice messaging by obtaining label adjustment,
And instruction voice messaging is adjusted to label and carries out speech recognition, to obtain label adjustment instruction information.
By obtaining label adjustment instruction voice messaging, and instruction voice messaging is adjusted to label and carries out speech recognition, with
Label adjustment instruction information is obtained, family can be used under the premise of without typewriting, adjusts deictic word message by saying label
Breath, easily modifies, increases or deletes to the corresponding label of voice memos information, so as to improve user experience
In step 106, instruction information is adjusted according to label to be adjusted corresponding label.
Wherein, label adjustment instruction information is text information or voice messaging.
Illustratively, instruction information is adjusted according to label and modifies label corresponding with voice memos information, it can be understood as
Instruction information is adjusted according to label and deletes one or more labels from the corresponding label of voice memos information, or according to label tune
Whole instruction information adds one or more labels into the corresponding label of voice memos information, or adjusts instruction information according to label
It is other labels by one or more tag replacements in the corresponding label of voice memos information.
By obtain user input label adjustment instruction information, and according to label adjust instruction information to corresponding label into
Row adjustment, can be convenient user and modifies to the corresponding label of voice memos information, keep modified label more acurrate
Voice memos information is described, be conveniently used for searching voice memos information according to modified label, thus
Improve user experience.
In one embodiment, as shown in fig. le, the voice memos information processing method that embodiment of the disclosure provides is also
Include the following steps 107 to step 109:
In step 107, tag queries audio-frequency information is obtained, and speech recognition is carried out to tag queries audio-frequency information, with
Obtain tag queries text information corresponding with inquiry audio-frequency information.
In step 108, the determining and matched inquiry tag of tag queries text information from existing label.
In step 109, the determining voice memos information with inquiry tag corresponding record, and export determining voice memos
Information.
Wherein, determining voice memos information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
Voice inquirement memorandum prompt information includes the corresponding text information of the corresponding voice inquirement memo information of inquiry tag,
Or voice inquirement memorandum prompt information is used to prompt to play the audio-frequency information in the corresponding voice inquirement memo information of inquiry tag.
Further, the case where corresponding to multiple voice memos information for a label, can be according to preset order (example
Such as, the time sequencing of storage) it is sequentially output multiple voice memos information, prompt information can also be exported, the choosing of user is received
Instruction is selected, user is played and indicates corresponding voice memos information.The prompt information of output includes but is not limited to multiple voice memos
The temporal information of information storage, secondary key information etc..It should be noted that carrying out voice to each voice memos information
Level-one keyword and secondary key can be extracted when identification, level-one keyword can be used as label, and secondary key can
To be further identified to voice memos information.For example, the voice memos information on May 14 includes going to supermarket to buy face tomorrow
Packet, the voice memos information on May 29 includes going at weekend to stroll Wangfujing, and the level-one parsed from two voice memos information is closed
Key word can be shopping, and for first voice memos information, the secondary key parsed can be supermarket, for second
The keyword that voice memos information parses can be Wangfujing, and level-one keyword can be used as two voice memos information
Label, secondary key can be used for identifying two voice memos information respectively, to carry out area to two voice memos information
Point.Based on this principle, multistage label can also be parsed, every level-one label is all more more specific than upper level label, preferably realizes
To the storage management of multiple voice memos information.
Speech recognition is carried out by obtaining tag queries audio-frequency information, and to tag queries audio-frequency information, to obtain and look into
The corresponding tag queries text information of audio-frequency information is ask, is determined from existing label and tag queries text information is matched looks into
Label, the determining voice memos information with inquiry tag corresponding record are ask, and exports determining voice memos information, can be used
Family is under the premise of without typewriting, and by saying tag queries audio-frequency information, the corresponding voice inquirement of convenient inquiry tag is standby
Forget information, and obtains the audio-frequency information for being used to prompt to play in the corresponding voice inquirement memo information of inquiry tag, so as to improve
User experience
Following is embodiment of the present disclosure, can be used for executing embodiments of the present disclosure.
Fig. 2 a is a kind of block diagram of the voice memos information processing unit 20 shown according to an exemplary embodiment, voice
Memo information processing unit 20 can may be a part of electronic equipment, voice memos information processing unit for electronic equipment
20 being implemented in combination with as some or all of of electronic equipment by software, hardware or both.As shown in Figure 2 a, should
Voice memos information processing unit 20 includes:
Voice memos data obtaining module 201, for obtaining voice memos information, voice memos information includes one section or more
Section audio information.
Speech recognition module 202, for carrying out speech recognition to the audio-frequency information in voice memos information, with acquisition and language
The corresponding voice memos text information of sound memo information.
Label determining module 203, for determining mark corresponding with voice memos information according to voice memos text information
Label.
In one embodiment, as shown in Figure 2 b, voice memos information processing unit 20 further include:
Date prompt module 2041, for including date and/or time in response to voice memos text information, according to the date
And/or the time exports corresponding voice memos prompt information;
Wherein, corresponding voice memos prompt information is exported, comprising: show output, and/or in a manner of text with audio
Form play output.
In one embodiment, as shown in Figure 2 c, voice memos information processing unit 20 further include:
Schedule creation module 2042 calls schedule for including date and/or time in response to voice memos text information
Interface is created, schedule is created according to date and/or time;
Wherein, created calendar content includes voice memos text information.
In one embodiment, as shown in Figure 2 d, voice memos information processing unit 20 further include:
Label adjustment instruction data obtaining module 205, for obtaining the label adjustment instruction information of user's input;Wherein,
Label adjustment instruction information includes that the modification to label indicates information, addition instruction information or deletes instruction information.
Label adjusts module 206, is adjusted for adjusting instruction information according to label to corresponding label.
In one embodiment, as shown in Figure 2 e, voice memos information processing unit 20 further include:
Tag queries audio-frequency information obtains module 207, for obtaining tag queries audio-frequency information, and to tag queries audio
Information carries out speech recognition, to obtain tag queries text information corresponding with audio-frequency information is inquired.
Inquiry tag determining module 208 is adjusted corresponding label for adjusting instruction information according to label.
Voice inquirement memo information cue module 209, for determining the voice memos information with inquiry tag corresponding record,
And export determining voice memos information;
Wherein, determining voice memos information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
Embodiment of the disclosure provides a kind of voice memos information processing unit, which can be with
Include one or more snippets audio-frequency information voice memos information by obtaining, voice is carried out to the audio-frequency information in voice memos information
Identification, to obtain corresponding with voice memos information voice memos text information, and determined according to voice memos text information and
The corresponding label of the voice memos information.In above-mentioned steps, label corresponding with the voice memos information is understood that
To be associated with the content of the voice memos text information, therefore make user when searching voice memos information, it can be according to this
Label conveniently understands the content of the voice memos information, so that the voice memos information needed for finding oneself, improves
User experience.
Fig. 3 is a kind of block diagram of voice memos information processing unit 30 shown according to an exemplary embodiment, the voice
Memo information processing unit 30 can be terminal, or a part of terminal, voice memos information processing unit 30 include:
Processor 301;
Memory 302 for 301 executable instruction of storage processor;
Wherein, processor 301 is configured as:
Voice memos information is obtained, voice memos information includes one or more snippets audio-frequency information;
Speech recognition is carried out to the audio-frequency information in voice memos information, to obtain voice corresponding with voice memos information
Memorandum text information;
Label corresponding with voice memos information is determined according to voice memos text information.
In one embodiment, above-mentioned processor 301 can be additionally configured to:
Include date and/or time in response to voice memos text information, corresponding language is exported according to date and/or time
Sound memorandum prompt information;
Wherein, corresponding voice memos prompt information is exported, comprising: show output, and/or in a manner of text with audio
Form play output.
In one embodiment, above-mentioned processor 301 can be additionally configured to:
Include date and/or time in response to voice memos text information, schedule called to create interface, according to the date and/
Or the time creates schedule;
Wherein, created calendar content includes voice memos text information.
In one embodiment, above-mentioned processor 301 can be additionally configured to:
Obtain the label adjustment instruction information of user's input;Wherein, the label adjustment instruction information includes to label
Modification instruction information, addition instruction information delete instruction information;
Instruction information is adjusted according to label to be adjusted corresponding label;
Wherein, the label adjustment instruction information is text information or voice messaging.
In one embodiment, above-mentioned processor 301 can be additionally configured to:
Tag queries audio-frequency information is obtained, and speech recognition is carried out to tag queries audio-frequency information, to obtain and inquire sound
The corresponding tag queries text information of frequency information;
From the matched inquiry tag of label text with tag queries text information determining in existing label;
The determining voice inquirement memo information with inquiry tag corresponding record, and export determining instruction voice inquirement memorandum
Prompt information;
Wherein, determining voice memos information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
Fig. 4 be it is shown according to an exemplary embodiment a kind of for handling the block diagram of the device 400 of voice memos information,
The device 400 is suitable for electronic equipment.For example, device 400 can be mobile phone, computer, digital broadcast terminal, message receipts
Send out equipment, game console, tablet device, Medical Devices, body-building equipment, personal digital assistant etc..
Device 400 may include following one or more components: processing component 402, memory 404, power supply module 406,
Multimedia component 408, audio component 410, the interface 412 of input/output (I/O), sensor module 414 and communication component
416。
The integrated operation of the usual control device 400 of processing component 402, such as with display, telephone call, data communication, phase
Machine operation and record operate associated operation.Processing element 402 may include that one or more processors 420 refer to execute
It enables, to perform all or part of the steps of the methods described above.In addition, processing component 402 may include one or more modules, just
Interaction between processing component 402 and other assemblies.For example, processing component 402 may include multi-media module, it is more to facilitate
Interaction between media component 408 and processing component 402.
Memory 404 is configured not stored various types of data to support the operation in device 400.These data are shown
Example includes the instruction of any application or method for operating on device 400, contact data, and telephone book data disappears
Breath, picture, video etc..Memory 404 can be by any kind of volatibility or non-volatile memory device or their group
It closes and realizes, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM) is erasable to compile
Journey read-only memory (EPROM), programmable read only memory (PROM), read-only memory (ROM), magnetic memory, flash
Device, disk or CD.
Power supply module 406 provides electric power for the various assemblies of device 400.Power supply module 406 may include power management system
System, one or more power supplys and other with for device 400 generate, manage, and distribute the associated component of electric power.
Multimedia component 408 includes the screen of one output interface of offer between device 400 and user.In some realities
It applies in example, screen may include liquid crystal display (LCD) and touch panel (TP).If screen includes touch panel, screen can
To be implemented as touch screen, to receive input signal from the user.Touch panel include one or more touch sensors with
Sense the gesture on touch, slide, and touch panel.The touch sensor can not only sense the side of touch or sliding action
Boundary, but also detect duration and pressure associated with the touch or slide operation.In some embodiments, multimedia group
Part 408 includes a front camera and/or rear camera.When device 400 is in operation mode, such as screening-mode or video
When mode, front camera and/or rear camera can receive external multi-medium data.Each front camera and postposition
Camera can be a fixed optical lens system or have focusing and optical zoom capabilities.
Audio component 410 is configured as output and/or input audio signal.For example, audio component 410 includes a Mike
Wind (MIC), when device 400 is in operation mode, when such as call mode, recording mode, and voice recognition mode, microphone is matched
It is set to reception external audio signal.The received audio signal can be further stored in memory 404 or via communication set
Part 416 is sent.In some embodiments, audio component 410 further includes a loudspeaker, is used for output audio signal.
I/O interface 412 provides interface between processing component 402 and peripheral interface module, and above-mentioned peripheral interface module can
To be keyboard, click wheel, button etc..These buttons may include, but are not limited to: home button, volume button, start button and lock
Determine button.
Sensor module 414 includes one or more sensors, and the state for providing various aspects for device 400 is commented
Estimate.For example, sensor module 414 can detecte the state that opens/closes of device 400, and the relative positioning of component, for example, it is described
Component is the display and keypad of device 400, and sensor module 414 can be with 400 1 components of detection device 400 or device
Position change, the existence or non-existence that user contacts with device 400,400 orientation of device or acceleration/deceleration and device 400
Temperature change.Sensor module 414 may include proximity sensor, be configured to detect without any physical contact
Presence of nearby objects.Sensor module 414 can also include optical sensor, such as CMOS or ccd image sensor, at
As being used in application.In some embodiments, which can also include acceleration transducer, gyro sensors
Device, Magnetic Sensor, pressure sensor or temperature sensor.
Communication component 416 is configured to facilitate the communication of wired or wireless way between device 400 and other equipment.Device
400 can access the wireless network based on communication standard, such as intercom private network, WiFi, 2G, 3G, 4G or 5G or their group
It closes.In one exemplary embodiment, communication component 416 receives the broadcast from external broadcasting management system via broadcast channel
Signal or broadcast related information.In one exemplary embodiment, the communication component 416 further includes near-field communication (NFC) mould
Block, to promote short range communication.For example, radio frequency identification (RFID) technology, Infrared Data Association (IrDA) skill can be based in NFC module
Art, ultra wide band (UWB) technology, bluetooth (BT) technology and other technologies are realized.
In the exemplary embodiment, device 400 can be believed by one or more application specific integrated circuit (ASIC), number
Number processor (DSP), digital signal processing appts (DSPD), programmable logic device (PLD), field programmable gate array
(FPGA), controller, microcontroller, microprocessor or other electronic components are realized, for executing the above method.
In the exemplary embodiment, a kind of non-transitorycomputer readable storage medium including instruction, example are additionally provided
It such as include the memory 404 of instruction, above-metioned instruction can be executed by the processor 420 of device 400 to complete the above method.For example,
The non-transitorycomputer readable storage medium can be ROM, random access memory (RAM), CD-ROM, tape, floppy disk
With optical data storage devices etc..
A kind of non-transitorycomputer readable storage medium, when the instruction in the storage medium is by the processing of device 400
When device executes, so that device 400 is able to carry out above-mentioned voice memos information processing method, which comprises
Voice memos information is obtained, voice memos information includes one or more snippets audio-frequency information;
Speech recognition is carried out to the audio-frequency information in voice memos information, to obtain voice corresponding with voice memos information
Memorandum text information;
Label corresponding with voice memos information is determined according to voice memos text information.
In one embodiment, method further include:
Include date and/or time in response to voice memos text information, corresponding language is exported according to date and/or time
Sound memorandum prompt information;
Wherein, corresponding voice memos prompt information is exported, comprising: show output, and/or in a manner of text with audio
Form play output.
In one embodiment, method further include:
Include date and/or time in response to voice memos text information, schedule called to create interface, according to the date and/
Or the time creates schedule;
Wherein, created calendar content includes voice memos text information.
In one embodiment, method further include:
Obtain the label adjustment instruction information of user's input;Wherein, the label adjustment instruction information includes to label
Modification instruction information, addition instruction information delete instruction information;
Instruction information is adjusted according to label to be adjusted corresponding label;
Wherein, the label adjustment instruction information is text information or voice messaging.
In one embodiment, method further include:
Tag queries audio-frequency information is obtained, and speech recognition is carried out to tag queries audio-frequency information, to obtain and inquire sound
The corresponding tag queries text information of frequency information;
From the matched inquiry tag of label text with tag queries text information determining in existing label;
The determining voice inquirement memo information with inquiry tag corresponding record, and export determining instruction voice inquirement memorandum
Prompt information;
Wherein, determining voice memos information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
Those skilled in the art will readily occur to its of the disclosure after considering specification and practicing disclosure disclosed herein
Its embodiment.This application is intended to cover any variations, uses, or adaptations of the disclosure, these modifications, purposes or
Person's adaptive change follows the general principles of this disclosure and including the undocumented common knowledge in the art of the disclosure
Or conventional techniques.The description and examples are only to be considered as illustrative, and the true scope and spirit of the disclosure are by following
Claim is pointed out.
It should be understood that the present disclosure is not limited to the precise structures that have been described above and shown in the drawings, and
And various modifications and changes may be made without departing from the scope thereof.The scope of the present disclosure is only limited by the accompanying claims.
Claims (12)
1. a kind of voice memos information processing method characterized by comprising
Voice memos information is obtained, the voice memos information includes one or more snippets audio-frequency information;
Speech recognition is carried out to the audio-frequency information in the voice memos information, it is corresponding with the voice memos information to obtain
Voice memos text information;
Label corresponding with the voice memos information is determined according to the voice memos text information.
2. voice memos information processing method according to claim 1, which is characterized in that the method also includes:
Include date and/or time in response to the voice memos text information, is corresponded to according to the date and/or time output
Voice memos prompt information;
Wherein, corresponding voice memos prompt information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
3. voice memos information processing method according to claim 1, which is characterized in that the method also includes:
Include date and/or time in response to the voice memos text information, calls schedule to create interface, according to the date
And/or the time creates schedule;
Wherein, created calendar content includes the voice memos text information.
4. voice memos information processing method according to claim 1, which is characterized in that the method also includes:
Obtain the label adjustment instruction information of user's input;Wherein, the label adjustment instruction information includes the modification to label
It indicates information, addition instruction information or deletes instruction information;
Instruction information is adjusted according to the label to be adjusted corresponding label;
Wherein, the label adjustment instruction information is text information or voice messaging.
5. voice memos information processing method according to claim 1-4, which is characterized in that the method is also wrapped
It includes:
Tag queries audio-frequency information is obtained, and speech recognition is carried out to the tag queries audio-frequency information, is looked into obtaining with described
Ask the corresponding tag queries text information of audio-frequency information;
The determining and matched inquiry tag of tag queries text information from existing label;
The determining voice memos information with the inquiry tag corresponding record, and export determining voice memos information;
Wherein, determining voice memos information is exported, comprising: show output in a manner of text, and/or broadcast in the form of audio
Put output.
6. a kind of voice memos information processing unit characterized by comprising
Voice memos data obtaining module, for obtaining voice memos information, the voice memos information includes one or more snippets
Audio-frequency information;
Speech recognition module, for in the voice memos information audio-frequency information carry out speech recognition, with obtain with it is described
The corresponding voice memos text information of voice memos information;
Label determining module, for determining mark corresponding with the voice memos information according to the voice memos text information
Label.
7. voice memos information processing unit according to claim 6, which is characterized in that described device further include:
Date prompt module, for including date and/or time in response to the voice memos text information, according to the date
And/or the time exports corresponding voice memos prompt information;
Wherein, corresponding voice memos prompt information is exported, comprising: show output, and/or in a manner of text with the shape of audio
Formula plays output.
8. voice memos information processing unit according to claim 6, which is characterized in that described device further include:
Schedule creation module calls schedule creation for including date and/or time in response to the voice memos text information
Interface creates schedule according to the date and/or time;
Wherein, created calendar content includes the voice memos text information.
9. voice memos information processing unit according to claim 6, which is characterized in that described device further include:
Label adjustment instruction data obtaining module, for obtaining the label adjustment instruction information of user's input;Wherein, the label
Adjustment instruction information includes that the modification to label indicates information, addition instruction information or deletes instruction information;
Label adjusts module, is adjusted for adjusting instruction information according to the label to corresponding label;
Wherein, the label adjustment instruction information is text information or voice messaging.
10. according to the described in any item voice memos information processing units of claim 6-9, which is characterized in that described device is also
Include:
Tag queries audio-frequency information obtains module, believes for obtaining tag queries audio-frequency information, and to the tag queries audio
Breath carries out speech recognition, to obtain tag queries text information corresponding with the inquiry audio-frequency information;
Inquiry tag determining module is marked for determining from existing label with the matched inquiry of the tag queries text information
Label;
Voice inquirement memo information cue module, for the determining voice memos information with the inquiry tag corresponding record, and
Export determining voice memos information;
Wherein, determining voice memos information is exported, comprising: show output in a manner of text, and/or broadcast in the form of audio
Put output.
11. a kind of voice memos information processing unit characterized by comprising
Processor;
Memory for storage processor executable instruction;
Wherein, the processor is configured to:
Voice memos information is obtained, the voice memos information includes one or more snippets audio-frequency information;
Speech recognition is carried out to the audio-frequency information in the voice memos information, it is corresponding with the voice memos information to obtain
Voice memos text information;
Label corresponding with the voice memos information is determined according to the voice memos text information.
12. a kind of computer readable storage medium, is stored thereon with computer instruction, which is characterized in that the instruction is by processor
The step of any one of claim 1-5 the method is realized when execution.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910647159.5A CN110415703A (en) | 2019-07-17 | 2019-07-17 | Voice memos information processing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910647159.5A CN110415703A (en) | 2019-07-17 | 2019-07-17 | Voice memos information processing method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110415703A true CN110415703A (en) | 2019-11-05 |
Family
ID=68361825
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910647159.5A Pending CN110415703A (en) | 2019-07-17 | 2019-07-17 | Voice memos information processing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110415703A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110931010A (en) * | 2019-12-17 | 2020-03-27 | 用友网络科技股份有限公司 | Voice control system |
CN110992984A (en) * | 2019-12-02 | 2020-04-10 | 新华智云科技有限公司 | Audio processing method and device and storage medium |
CN112735402A (en) * | 2020-12-14 | 2021-04-30 | 厦门盈趣科技股份有限公司 | Memorandum device and method |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120203551A1 (en) * | 2011-02-04 | 2012-08-09 | International Business Machines Corporation | Automated follow up for e-meetings |
CN104038630A (en) * | 2014-05-28 | 2014-09-10 | 小米科技有限责任公司 | Speech processing method and device |
CN106327151A (en) * | 2016-08-15 | 2017-01-11 | 捷开通讯(深圳)有限公司 | Note recording method and system based on voice recognition |
CN107038220A (en) * | 2017-03-20 | 2017-08-11 | 北京光年无限科技有限公司 | Method, intelligent robot and system for generating memorandum |
CN108229916A (en) * | 2018-01-02 | 2018-06-29 | 京东方科技集团股份有限公司 | A kind of information prompting method and device |
CN108737663A (en) * | 2018-06-30 | 2018-11-02 | 上海爱优威软件开发有限公司 | A kind of call householder method and terminal based on memo information |
CN109669710A (en) * | 2018-11-30 | 2019-04-23 | 维沃移动通信有限公司 | Note processing method and terminal |
-
2019
- 2019-07-17 CN CN201910647159.5A patent/CN110415703A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120203551A1 (en) * | 2011-02-04 | 2012-08-09 | International Business Machines Corporation | Automated follow up for e-meetings |
CN104038630A (en) * | 2014-05-28 | 2014-09-10 | 小米科技有限责任公司 | Speech processing method and device |
CN106327151A (en) * | 2016-08-15 | 2017-01-11 | 捷开通讯(深圳)有限公司 | Note recording method and system based on voice recognition |
CN107038220A (en) * | 2017-03-20 | 2017-08-11 | 北京光年无限科技有限公司 | Method, intelligent robot and system for generating memorandum |
CN108229916A (en) * | 2018-01-02 | 2018-06-29 | 京东方科技集团股份有限公司 | A kind of information prompting method and device |
CN108737663A (en) * | 2018-06-30 | 2018-11-02 | 上海爱优威软件开发有限公司 | A kind of call householder method and terminal based on memo information |
CN109669710A (en) * | 2018-11-30 | 2019-04-23 | 维沃移动通信有限公司 | Note processing method and terminal |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110992984A (en) * | 2019-12-02 | 2020-04-10 | 新华智云科技有限公司 | Audio processing method and device and storage medium |
CN110931010A (en) * | 2019-12-17 | 2020-03-27 | 用友网络科技股份有限公司 | Voice control system |
CN112735402A (en) * | 2020-12-14 | 2021-04-30 | 厦门盈趣科技股份有限公司 | Memorandum device and method |
CN112735402B (en) * | 2020-12-14 | 2024-01-19 | 厦门盈趣科技股份有限公司 | Memorandum device and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8144939B2 (en) | Automatic identifying | |
JP2019117623A (en) | Voice dialogue method, apparatus, device and storage medium | |
CN104394137B (en) | A kind of method and device of prompting voice call | |
CN105138319B (en) | Event-prompting method and device | |
CN101971250A (en) | Mobile electronic device with active speech recognition | |
CN105306752B (en) | The method and device that generation event is reminded | |
CN105224601B (en) | A kind of method and apparatus of extracting time information | |
CN105489220A (en) | Method and device for recognizing speech | |
CN109643548A (en) | System and method for content to be routed to associated output equipment | |
CN110415703A (en) | Voice memos information processing method and device | |
WO2021244057A1 (en) | Interaction method and apparatus, earphone, and earphone accommodation apparatus | |
CN105141758A (en) | Terminal control method and device | |
JP2022501623A (en) | Audio processing method, device and storage medium | |
CN107562952A (en) | The method, apparatus and terminal that music matching plays | |
CN104506703B (en) | Tone information, tone information player method and device | |
CN108710791A (en) | The method and device of voice control | |
CN107423386A (en) | Generate the method and device of electronic card | |
CN108806714A (en) | The method and apparatus for adjusting volume | |
CN106534459A (en) | Voice prompt method and device | |
CN109388699A (en) | Input method, device, equipment and storage medium | |
CN108574777A (en) | Information prompting method and device | |
WO2021244059A1 (en) | Interaction method and device, earphone, and server | |
CN106657543A (en) | Voice information processing method and device | |
CN106534495A (en) | Method of information processing, device and equipment | |
CN109961793A (en) | Handle the method and device of voice messaging |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191105 |
|
RJ01 | Rejection of invention patent application after publication |