CN107886963A - Speech processing method, apparatus, and electronic device - Google Patents

Speech processing method, apparatus, and electronic device

Info

Publication number
CN107886963A
Authority
CN
China
Prior art keywords
voice
speech parameter
pending
parameter value
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711071260.8A
Other languages
Chinese (zh)
Other versions
CN107886963B (en
Inventor
潘虹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gree Electric Appliances Inc of Zhuhai
Original Assignee
Gree Electric Appliances Inc of Zhuhai
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gree Electric Appliances Inc of Zhuhai
Priority to CN201711071260.8A
Publication of CN107886963A
Application granted
Publication of CN107886963B
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants
    • G10L21/007 Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/0202
    • G10L21/013 Adapting to target pitch
    • G10L2021/0135 Voice conversion or morphing

Abstract

The present embodiments relate to the field of communication technology and disclose a speech processing method, apparatus, and electronic device. The speech processing method includes: obtaining a voice to be processed; receiving a voice modification instruction for the voice to be processed; obtaining a preset speech parameter value according to the voice modification instruction; if the preset speech parameter value does not match a speech parameter of the voice to be processed, correcting the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice; and sending the processed voice to the electronic device of the communication counterpart. In this way, the embodiments of the present invention modify the user's voice to generate a beautified voice, effectively meeting the user's demand for voice beautification and greatly improving the listening experience during voice playback.

Description

Speech processing method, apparatus, and electronic device
Technical field
The present invention relates to the field of communication technology, and in particular to a speech processing method, apparatus, and electronic device.
Background
With the development of intelligent terminals and social software, people communicate with one another more and more, most commonly through text and voice. Voice communication takes many forms, such as telephone calls, voice chat, and sending voice messages. In voice communication, a person's voice represents that person's image to some degree. A pleasant voice puts listeners at ease and makes them more willing to keep talking with the speaker. A pleasant voice can therefore work in a person's favor and make that person more popular.
Everyone would like to have a voice as gentle and full as an announcer's. In reality, however, apart from a person's normal speaking tone, the voice may be hoarse just after waking or during a cold, or may lack brightness when the person has to speak quietly on certain occasions. In such cases, the user may become reluctant to communicate by voice.
In the course of implementing the embodiments of the present invention, the inventor found that speech processing in the related art cannot effectively meet the user's need to correct and polish his or her own voice according to personal preference before it is transmitted to the peer user.
Summary of the invention
The embodiments of the present invention provide a speech processing method, apparatus, and electronic device that modify the user's voice to generate a beautified voice, effectively meeting the user's demand for voice beautification and greatly improving the listening experience during voice playback.
To achieve the above object, the embodiments of the present invention disclose the following technical solutions:
In a first aspect, an embodiment of the present invention provides a speech processing method, including: obtaining a voice to be processed; receiving a voice modification instruction for the voice to be processed; obtaining a preset speech parameter value according to the voice modification instruction; if the preset speech parameter value does not match a speech parameter of the voice to be processed, correcting the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice; and sending the processed voice to the electronic device of the communication counterpart.
The method further includes: receiving a sample voice; and extracting speech parameters from the sample voice and storing the extracted speech parameters as the preset speech parameter value.
After the voice to be processed is obtained, the method further includes: preprocessing the voice to be processed, the preprocessing including noise reduction.
After the speech parameter of the voice to be processed has been corrected according to the preset speech parameter value to obtain the processed voice, the method further includes: receiving an adjustment instruction for the processed voice, and adjusting the processed voice according to the adjustment instruction.
Before the processed voice is sent to the electronic device of the communication counterpart, the method further includes: receiving an audition instruction for the processed voice, and playing the processed voice according to the audition instruction.
The speech parameters of the voice to be processed include decibel level, frequency, and/or waveform.
In a second aspect, an embodiment of the present invention provides a speech processing apparatus, including: a voice acquiring unit, configured to obtain a voice to be processed; a modification instruction receiving unit, configured to receive a voice modification instruction for the voice to be processed; a preset parameter acquiring unit, configured to obtain a preset speech parameter value according to the voice modification instruction; a correction unit, configured to, if the preset speech parameter value does not match a speech parameter of the voice to be processed, correct the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice; and a sending unit, configured to send the processed voice to the electronic device of the communication counterpart.
The apparatus further includes: a sample receiving unit, configured to receive a sample voice; and a parameter extraction unit, configured to extract speech parameters from the sample voice and store the extracted speech parameters as the preset speech parameter value.
After the voice acquiring unit, the apparatus further includes: a preprocessing unit, configured to preprocess the voice to be processed, the preprocessing including noise reduction.
After the correction unit, the apparatus further includes: an adjustment unit, configured to receive an adjustment instruction for the processed voice and to adjust the processed voice according to the adjustment instruction.
Before the sending unit, the apparatus further includes: an audition unit, configured to receive an audition instruction for the processed voice and to play the processed voice according to the audition instruction.
The speech parameters of the voice to be processed include decibel level, frequency, and/or waveform.
In a third aspect, an embodiment of the present invention provides an electronic device for speech processing, including:
at least one processor; and
a memory communicatively connected to the at least one processor; wherein
the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor so that the at least one processor can perform the speech processing method described above.
In a fourth aspect, an embodiment of the present invention further provides a non-volatile computer-readable storage medium storing computer-executable instructions, the computer-executable instructions being used to cause an electronic device to perform the speech processing method described above.
In a fifth aspect, an embodiment of the present invention further provides a computer program product. The computer program product includes a computer program stored on a non-transitory computer-readable storage medium, and the computer program includes program instructions that, when executed by a computer, cause the computer to perform the speech processing method described above.
The beneficial effects of the embodiments of the present invention are as follows. Unlike the prior art, the speech processing method provided by the embodiments of the present invention obtains a voice to be processed; receives a voice modification instruction for the voice to be processed; obtains a preset speech parameter value according to the voice modification instruction; if the preset speech parameter value does not match a speech parameter of the voice to be processed, corrects the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice; and sends the processed voice to the electronic device of the communication counterpart. In this way, the user's voice is modified to generate a beautified voice, effectively meeting the user's demand for voice beautification and greatly improving the listening experience during voice playback.
Brief description of the drawings
One or more embodiments are illustrated by the figures in the corresponding drawings. These illustrations do not limit the embodiments. Elements with the same reference numerals in the drawings denote similar elements, and unless otherwise stated, the figures are not drawn to scale.
Fig. 1 is a schematic flowchart of a speech processing method provided by an embodiment of the present invention;
Fig. 2 is a schematic flowchart of a speech processing method provided by another embodiment of the present invention;
Fig. 3 is a schematic flowchart of an application example of a speech processing method provided by an embodiment of the present invention;
Fig. 4 is a schematic diagram of an operation interface for speech processing on a smartphone provided by an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of a speech processing apparatus provided by an embodiment of the present invention;
Fig. 6 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention.
Detailed description
To make the objects, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the scope of protection of the present invention.
In addition, the technical features involved in the embodiments described below may be combined with one another as long as they do not conflict.
The embodiments of the present invention provide a speech processing method, apparatus, and electronic device. The method and apparatus meet the need to modify the user's voice and generate a beautified voice, greatly improving the listening experience during voice playback.
The speech processing method of the embodiments may be performed in any suitable type of user terminal that has a user interaction device and a processor with computing capability, such as a desktop computer, a smartphone, a tablet computer, or another user terminal.
The speech processing apparatus of the embodiments may be provided independently in the above user terminal as a software or hardware functional unit, or may be integrated in a processor as a functional module, to perform the speech processing method of the embodiments.
In the embodiments of the present invention, the electronic device may be a smartphone, a computer, a smart watch, a smart band, a tablet computer, a palmtop computer, or another electronic device with a voice function. The electronic device supports installation of various desktop applications, such as one or more of the following: an instant messaging application, a phone application, a video application, an email application, a digital video recorder application, and so on.
The embodiments of the present invention are further described below with reference to the accompanying drawings.
Embodiment one
Fig. 1 is a schematic flowchart of a speech processing method provided by an embodiment of the present invention. Referring to Fig. 1, the speech processing method is applied to an electronic device and includes:
110. Obtain a voice to be processed;
In this embodiment, the "voice to be processed" may be a voice that the user of the electronic device inputs when sending an instant message, such as WeChat voice input or voice chat; it may be a local audio file on the electronic device, such as a recording or music; or it may be the voice of a telephone call. The voice to be processed may be obtained by reading the instant-message voice input from a register, by reading audio information from storage or memory to obtain a local audio file, or by reading the telephone-call voice from a register.
120. Receive a voice modification instruction for the voice to be processed;
In this embodiment, the voice modification instruction is an instruction that the electronic device issues in response to a related operation and that triggers the next step, "obtain a preset speech parameter value". The electronic device performs the corresponding event according to the voice modification instruction. The related operation may be a soft operation or a hard operation. A soft operation may be a trigger signal that the electronic device outputs according to predefined logic, which causes the electronic device to issue the voice modification instruction. A hard operation is an operation on the external hardware of the electronic device that causes it to issue the voice modification instruction, for example a touch operation performed by the user on the touchscreen of the electronic device or an operation on a button of the electronic device.
When the electronic device receives the voice modification instruction, it carries out the next step according to preset logic.
130. Obtain a preset speech parameter value according to the voice modification instruction;
In this embodiment, the preset speech parameter value serves as the reference range for correcting the voice to be processed. It may be the parameter value of a segment of the user's own ideal voice recorded in advance, the parameter value of a voice customized by the user, the parameter value of a system default voice, and so on. The parameter value includes but is not limited to the decibel level, frequency, and/or waveform of the preset voice.
The voice modification instruction triggers the electronic device to obtain the preset speech parameter value. During acquisition, the electronic device may obtain the preset speech parameter value that corresponds to the types of speech parameter of the voice to be processed. For example, if the speech parameters of the voice to be processed include decibel level and frequency, the decibel level and frequency of the preset voice are obtained as the preset speech parameter value. A designer may configure the corresponding components and methods of the electronic device according to business requirements in order to obtain the preset speech parameter value.
140. If the preset speech parameter value does not match a speech parameter of the voice to be processed, correct the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice;
In this embodiment, the speech parameters of the voice to be processed include but are not limited to decibel level, frequency, and/or waveform. The decibel level reflects the volume of the sound, the frequency reflects its pitch, and the waveform reflects its timbre. Each speech parameter of the voice to be processed is matched against the corresponding preset speech parameter value; that is, the decibel level, frequency, and waveform of the voice to be processed are matched one by one against the decibel level, frequency, and waveform of the preset voice. "Matching" may mean judging whether the speech parameter of the voice to be processed lies within the range of the preset speech parameter value: if it lies within the range, the parameter matches; otherwise it does not match. For example, if the decibel level of the voice to be processed is 20 dB and the decibel range of the preset speech parameter value is 50-60 dB, the speech parameter of the voice to be processed does not match the preset speech parameter value.
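The range test described above can be sketched as follows. This is only an illustration under assumptions: the `Range` type, the function name `mismatched_parameters`, the dictionary layout, and the 600 Hz frequency value are hypothetical and do not come from the patent; only the 20 dB value and the 50-60 dB and 500-700 Hz ranges appear in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    """Closed interval used as a preset speech parameter value (hypothetical type)."""
    low: float
    high: float

    def contains(self, value: float) -> bool:
        return self.low <= value <= self.high


def mismatched_parameters(voice: dict[str, float],
                          preset: dict[str, Range]) -> list[str]:
    """Return the names of parameters in `voice` that lie outside the preset range."""
    return [name for name, value in voice.items()
            if name in preset and not preset[name].contains(value)]


# 20 dB against a 50-60 dB preset, as in the example above; 600 Hz is an assumed value.
preset = {"decibel": Range(50.0, 60.0), "frequency": Range(500.0, 700.0)}
voice = {"decibel": 20.0, "frequency": 600.0}
print(mismatched_parameters(voice, preset))
```

Parameters that have no preset entry are ignored here, which mirrors the statement that the device obtains presets only for the parameter types present in the voice to be processed.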
In this embodiment, correcting the speech parameter of the voice to be processed may mean adjusting it into the range of the preset speech parameter value. For example, if the decibel level of the voice to be processed is 20 dB and the decibel range of the preset speech parameter value is 50-60 dB, the decibel level of the voice to be processed is adjusted to 55 dB. Accordingly, when the preset speech parameter value does not match the speech parameter of the voice to be processed, the correction may be implemented as follows: compare each speech parameter of the voice to be processed with the corresponding preset speech parameter value and judge whether it lies within the preset range; if not, correct that speech parameter to a value within the preset range, thereby obtaining the processed voice. For example, suppose the voice to be processed has a decibel level of 20 dB, a frequency of 100 Hz, and a waveform x(t), and the preset speech parameter value has a decibel range of 50-60 dB, a frequency range of 500-700 Hz, and a waveform y(t). The decibel level of the voice to be processed is corrected to 55 dB and its frequency to 500 Hz; the correlation coefficient Pxy of x(t) and y(t) is calculated, and the waveform x(t) is adjusted until Pxy lies within the range 0.6-1. The correction of the speech parameters is then complete and the processed voice is obtained.
The speech parameters of the voice to be processed may also include speaking rate, phonemes, and so on. For example, the speaking rate of the voice to be processed may be corrected so that the processed voice has a more appropriate and steady pace and sounds more pleasant. As another example, standard phonemes may be preset as the preset speech parameter value, such as Mandarin, Cantonese, or English phonemes, and the phonemes of the voice to be processed are corrected according to the standard phonemes so that the Mandarin, Cantonese, or English of the voice is more standard.
150. Send the processed voice to the electronic device of the communication counterpart.
In this embodiment, the communication counterpart may be the recipient of the user's instant message, the other party to a call, a meeting participant, a lecture audience, and so on. The electronic device of the communication counterpart may be a smartphone, a computer, a smart watch, a smart band, a tablet computer, a palmtop computer, a loudspeaker, and so on. After the user is satisfied with the processed voice, a send instruction from the user is received; the user may input the send instruction by tapping or long-pressing a send button. The processed voice is then sent to the electronic device of the communication counterpart, giving the counterpart a good listening experience.
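The numbers in this example can be made concrete with a small sketch. Several things here are assumptions rather than the patent's method: Pxy is taken to be the Pearson correlation coefficient, the decibel change is applied as an amplitude gain using the 20·log10 convention, and the waveform is "adjusted" by linearly blending x(t) toward y(t). All function names are hypothetical.

```python
import math


def gain_for_level_change(current_db: float, target_db: float) -> float:
    """Amplitude scale factor that shifts a signal's level by (target - current) dB."""
    return 10.0 ** ((target_db - current_db) / 20.0)


def correlation(x: list[float], y: list[float]) -> float:
    """Pearson correlation coefficient of two equal-length sample sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("sequences must have equal length of at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def blend_until_correlated(x: list[float], y: list[float],
                           threshold: float = 0.6, steps: int = 10) -> list[float]:
    """Blend x toward y in equal steps until correlation with y reaches the threshold."""
    for i in range(steps + 1):
        a = i / steps  # blend weight: 0 keeps x unchanged, 1 replaces it with y
        candidate = [(1.0 - a) * p + a * q for p, q in zip(x, y)]
        if correlation(candidate, y) >= threshold:
            return candidate
    return list(y)


# Raising 20 dB to 55 dB corresponds to multiplying the amplitude by about 56.
print(round(gain_for_level_change(20.0, 55.0), 2))

# Two uncorrelated test waveforms standing in for x(t) and y(t).
x = [math.sin(2 * math.pi * k / 16) for k in range(16)]
y = [math.cos(2 * math.pi * k / 16) for k in range(16)]
adjusted = blend_until_correlated(x, y)
print(correlation(adjusted, y) >= 0.6)
```

Linear blending is only one way to raise the correlation; the text leaves the actual adjustment procedure open.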
The speech processing method provided by this embodiment obtains a voice to be processed; receives a voice modification instruction for the voice to be processed; obtains a preset speech parameter value according to the voice modification instruction; if the preset speech parameter value does not match a speech parameter of the voice to be processed, corrects the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice; and sends the processed voice to the electronic device of the communication counterpart. In this way, the user's voice is modified to generate a beautified voice, effectively meeting the user's demand for voice beautification and greatly improving the listening experience during voice playback.
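Steps 110 to 150 can be strung together in a short sketch that works on parameter values only, not on audio. The function names, the use of the instruction string as the preset name, and the choice of the range midpoint as the corrected value are assumptions made for illustration; the text only requires that the corrected value lie within the preset range.

```python
from typing import Callable

Range = tuple[float, float]  # (low, high) bounds of one preset speech parameter


def correct_to_preset(voice: dict[str, float],
                      preset: dict[str, Range]) -> dict[str, float]:
    """Step 140: move every out-of-range parameter to the midpoint of its range."""
    corrected = dict(voice)
    for name, (low, high) in preset.items():
        if name in corrected and not (low <= corrected[name] <= high):
            corrected[name] = (low + high) / 2.0
    return corrected


def process_voice(voice: dict[str, float],
                  instruction: str,
                  presets: dict[str, dict[str, Range]],
                  send: Callable[[dict[str, float]], None]) -> dict[str, float]:
    """Run steps 110-150 on a parameter-level description of the voice."""
    # Step 110: `voice` is the voice to be processed.
    # Step 120: `instruction` is the received voice modification instruction.
    # Step 130: look up the preset named by the instruction (assumed mapping).
    preset = presets[instruction]
    # Step 140: correct only the parameters that do not match the preset.
    processed = correct_to_preset(voice, preset)
    # Step 150: hand the processed voice to the sender callback.
    send(processed)
    return processed


sent: list[dict[str, float]] = []
presets = {"beautify": {"decibel": (50.0, 60.0), "frequency": (500.0, 700.0)}}
result = process_voice({"decibel": 20.0, "frequency": 100.0}, "beautify", presets, sent.append)
print(result)
```

With the midpoint rule the frequency becomes 600 Hz, whereas the worked example in the text picks 500 Hz; both satisfy the in-range requirement.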
Embodiment two
Fig. 2 is a schematic flowchart of a speech processing method provided by another embodiment of the present invention. As shown in Fig. 2, the speech processing method is applied to an electronic device and includes:
210. Receive a sample voice;
In this embodiment, the user can set the preset speech parameter value in advance by recording a sample voice. The sample voice may be a segment recorded when the user feels that his or her voice is in good condition; a segment of another person's voice that the user wishes to imitate, such as the voice of a celebrity or a host; or a segment of a cartoon character's voice, such as the voice of RNB.
The user may record multiple sample voices and set different preset speech parameter values in advance, saving them as different voice packs.
220. Extract speech parameters from the sample voice, and store the extracted speech parameters as the preset speech parameter value;
In this embodiment, speech parameters may be extracted from the sample voice as follows: after receiving the sample voice, the electronic device stores it, then performs signal processing on it and extracts its speech parameters. The speech parameters may include decibel level, frequency, waveform, and so on.
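As one possible reading of "signal processing" in step 220, the sketch below derives a level and a frequency estimate from raw samples. The specific measures are assumptions: the level is an RMS value in decibels relative to full scale (samples in the range -1 to 1), and the frequency is estimated from the zero-crossing rate, which is only meaningful for simple, roughly periodic signals. The function names are hypothetical.

```python
import math


def level_dbfs(samples: list[float]) -> float:
    """RMS level in dB relative to full scale (1.0); -inf for digital silence."""
    if not samples:
        raise ValueError("no samples")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(rms) if rms > 0.0 else float("-inf")


def zero_crossing_frequency(samples: list[float], sample_rate: int) -> float:
    """Estimate frequency in Hz from the number of sign changes per second."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0.0) != (b < 0.0))
    duration = len(samples) / sample_rate
    return crossings / (2.0 * duration)  # two sign changes per cycle


def extract_parameters(samples: list[float], sample_rate: int) -> dict:
    """Collect decibel level, frequency, and waveform of a sample voice."""
    return {
        "decibel": level_dbfs(samples),
        "frequency": zero_crossing_frequency(samples, sample_rate),
        "waveform": list(samples),
    }


# One second of a 100 Hz test tone at half amplitude stands in for a sample voice.
sample_rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 100 * n / sample_rate + 0.5) for n in range(sample_rate)]
params = extract_parameters(tone, sample_rate)
print(round(params["decibel"], 2), round(params["frequency"], 1))
```

Real speech would call for a proper pitch estimator and a calibrated level measurement; the point here is only the shape of the extracted record that gets stored as a preset.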
230. Obtain a voice to be processed;
240. Preprocess the voice to be processed, the preprocessing including noise reduction;
In this embodiment, after the electronic device obtains the voice to be processed, preprocessing is needed because the surroundings may have been noisy when the user input the voice, the input volume may have been too low, or the device itself may have introduced noise into the input voice. The preprocessing may be noise reduction, which means reducing noise such as ambient noise. Noise reduction can be implemented through digital signal processing, such as frequency-domain transforms or wavelet transforms. The preprocessing may also be amplification: when the volume of the voice to be processed is too low, the human-voice portions of the voice to be processed are identified through signal processing and amplified.
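The two preprocessing operations can be illustrated with a deliberately simple stand-in. Instead of the frequency-domain or wavelet methods mentioned above, the sketch uses a time-domain noise gate (samples below a threshold are silenced) followed by peak normalization for amplification. Both functions, their names, and the threshold and target values are assumptions for illustration only.

```python
def noise_gate(samples: list[float], threshold: float) -> list[float]:
    """Silence samples whose magnitude is below the threshold (crude noise reduction)."""
    return [s if abs(s) >= threshold else 0.0 for s in samples]


def normalize_peak(samples: list[float], target_peak: float = 0.9) -> list[float]:
    """Scale the signal so that its largest magnitude equals target_peak."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # nothing to amplify in a silent signal
    scale = target_peak / peak
    return [s * scale for s in samples]


def preprocess(samples: list[float], threshold: float = 0.05) -> list[float]:
    """Gate low-level noise first, then amplify what remains."""
    return normalize_peak(noise_gate(samples, threshold))


quiet_voice = [0.01, -0.5, 0.02, 0.4, -0.03]
print(preprocess(quiet_voice))
```

Gating before amplifying matters: amplifying first would raise the noise floor above the gate threshold.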
250. Receive a voice modification instruction for the voice to be processed;
260. Obtain a preset speech parameter value according to the voice modification instruction;
270. If the preset speech parameter value does not match a speech parameter of the voice to be processed, correct the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice;
280. Receive an adjustment instruction for the processed voice, and adjust the processed voice according to the adjustment instruction;
In this embodiment, when the user is still not satisfied with the processed voice and wishes to fine-tune it, the user may choose to input an adjustment instruction to adjust the processed voice. The adjustment instruction may be a value set by the user and may include a decibel adjustment instruction, a frequency adjustment instruction, and so on.
281. Receive an audition instruction for the processed voice, and play the processed voice according to the audition instruction;
In this embodiment, when the user wishes to listen to the processed voice, the user triggers an audition instruction and the processed voice is played.
Steps 280 and 281 may be performed in a loop, for example: receive an adjustment instruction, receive an audition instruction, receive another adjustment instruction, receive another audition instruction, and so on.
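The alternation of steps 280 and 281 amounts to a small instruction loop. In the sketch below the processed voice is reduced to a single decibel value so that the control flow stays visible; the instruction tuples, the `play` callback, and the function name are hypothetical.

```python
from typing import Callable, Iterable


def review_loop(level_db: float,
                instructions: Iterable[tuple[str, float]],
                play: Callable[[float], None]) -> float:
    """Apply adjustment and audition instructions in the order they arrive."""
    for kind, value in instructions:
        if kind == "adjust":
            level_db += value      # step 280: apply the user-set change in dB
        elif kind == "audition":
            play(level_db)         # step 281: play back the current result
        else:
            raise ValueError(f"unknown instruction: {kind}")
    return level_db


played: list[float] = []
final_level = review_loop(
    55.0,
    [("audition", 0.0), ("adjust", 3.0), ("audition", 0.0)],
    played.append,
)
print(final_level, played)
```

The loop ends when the instruction stream ends, which corresponds to the user being satisfied and moving on to step 290.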
290. Send the processed voice to the electronic device of the communication counterpart.
The speech processing method provided by this embodiment sets the preset speech parameter value in advance; obtains a voice to be processed and preprocesses it; receives a voice modification instruction for the voice to be processed and obtains the preset speech parameter value; if the preset speech parameter value does not match a speech parameter of the voice to be processed, corrects the speech parameter of the voice to be processed to obtain a processed voice; auditions and adjusts the processed voice; and sends the processed voice to the electronic device of the communication counterpart. In this way, the user's voice is modified and adjusted to generate a voice the user is satisfied with, effectively meeting the user's demand for voice beautification or voice conversion, greatly improving the listening experience during voice playback, and adding an element of fun.
Embodiment three
Fig. 3 is a schematic flowchart of an application example of a speech processing method provided by an embodiment of the present invention. The application example illustrates the speech processing method on a smartphone. As shown in Fig. 3, the method includes:
310. Receive a sample voice, extract speech parameters, and store the extracted speech parameters as the preset speech parameter value;
Referring also to Fig. 4, when the user taps a button on the screen, the smartphone receives a sample voice input instruction. The user begins to input a segment of voice in good vocal condition, and the smartphone receives the sample voice according to the sample voice input instruction. After receiving the sample voice, the smartphone stores it, then performs signal processing on it, extracts its speech parameters (including decibel level, frequency, and waveform), and stores the extracted speech parameters as the preset speech parameter value. In this embodiment, the user of the smartphone has set three preset speech parameter values and saved them as three different voice packs: "beautify", "deep", and "cheerful".
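Saving several presets as named voice packs can be pictured as a small keyed store. The class below is a hypothetical sketch: the `VoicePackStore` name, its methods, and every numeric value are invented for illustration; only the three pack names come from the embodiment.

```python
class VoicePackStore:
    """In-memory store mapping a voice pack name to its preset speech parameters."""

    def __init__(self) -> None:
        self._packs: dict[str, dict[str, float]] = {}

    def save(self, name: str, parameters: dict[str, float]) -> None:
        """Store a copy of the extracted parameters under the given pack name."""
        self._packs[name] = dict(parameters)

    def load(self, name: str) -> dict[str, float]:
        """Return a copy of the stored parameters; raises KeyError if unknown."""
        return dict(self._packs[name])

    def names(self) -> list[str]:
        """List the available voice packs in alphabetical order."""
        return sorted(self._packs)


store = VoicePackStore()
# All numeric values below are placeholders, not values from the patent.
store.save("beautify", {"decibel": 55.0, "frequency": 600.0})
store.save("deep", {"decibel": 55.0, "frequency": 150.0})
store.save("cheerful", {"decibel": 60.0, "frequency": 800.0})
print(store.names())
```

A voice modification instruction such as "beautify voice" would then select the pack with `store.load("beautify")`.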
320. On the voice input interface of the smartphone's instant messaging application, receive the user's voice input operation, obtain the voice to be processed according to the user operation, and preprocess the voice to be processed;
When the user opens the instant messaging application, opens the chat window with Contact A, and wishes to send a voice message to Contact A, the user presses the "hold to talk" button on the screen and begins to input a 10-second voice segment. The smartphone obtains this voice as the voice to be processed and then performs noise reduction on it to reduce ambient noise.
330. Receive a voice modification instruction for the voice to be processed, and obtain a preset speech parameter value according to the voice modification instruction;
When the user is not satisfied with the vocal condition of the input voice and wishes to beautify it, the user triggers the "beautify voice" voice modification instruction on the smartphone, and the smartphone obtains the "beautify" preset speech parameters.
340. If the preset speech parameter value does not match a speech parameter of the voice to be processed, correct the speech parameter of the voice to be processed according to the preset speech parameter value to obtain a processed voice;
The smartphone compares the voice to be processed with the "beautify" preset speech parameters to determine whether they match, and corrects the unmatched segments of the voice to be processed so that its speech parameters fall within the range of the "beautify" preset speech parameters, thereby obtaining the processed voice.
350. Receive a trial-listening instruction for the processed voice, and play the processed voice according to the trial-listening instruction;
The user wants to hear the "beautified" voice in advance, so the user triggers a trial-listening instruction on the smartphone, and the smartphone plays the processed voice according to that instruction.
360. Receive an adjustment instruction for the processed voice, and adjust the processed voice according to the adjustment instruction;
After trial listening, the user finds the volume of the processed voice too low and is still dissatisfied with the result. The user then triggers the "volume+" adjustment instruction on the smartphone, and the smartphone amplifies the processed voice according to the "volume+" adjustment instruction to raise the volume.
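The "volume+" adjustment amounts to applying a gain to the processed voice. A minimal sketch follows, assuming samples normalized to [-1.0, 1.0], a hypothetical step of 6 dB per "volume+" press, and hard clipping to avoid overflow; none of these values come from the patent.

```python
def adjust_volume(samples, gain_db=6.0):
    """Amplify (or attenuate, for negative gain_db) and clip to [-1.0, 1.0]."""
    gain = 10.0 ** (gain_db / 20.0)   # convert dB to a linear factor
    return [max(-1.0, min(1.0, s * gain)) for s in samples]
```

Hard clipping distorts loud passages, so a practical implementation might use a limiter instead; the clamp is kept here only to keep the example short.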
370. Send the processed voice to the electronic device of the communication counterpart.
After the user is satisfied with the processed voice, the smartphone receives a send instruction triggered by the user and sends the processed voice to contact A, giving contact A a different listening experience.
In this embodiment, a voice to be processed is obtained; a voice modification instruction for the voice to be processed is received; a preset speech parameter value is obtained according to the voice modification instruction; if the preset speech parameter value does not match the speech parameters of the voice to be processed, the speech parameters of the voice to be processed are corrected according to the preset speech parameter value to obtain the processed voice; and the processed voice is sent to the electronic device of the communication counterpart. In this way, the user's voice is modified to generate a beautified voice, which meets the user's demand for voice beautification and improves the listening experience of voice playback.
Embodiment four
Fig. 5 is a schematic structural diagram of a speech processing apparatus provided by an embodiment of the present invention. As shown in Fig. 5, the speech processing apparatus 500 is applied to an electronic device and includes a voice acquiring unit 510, a modification instruction receiving unit 520, a preset parameter acquiring unit 530, a correcting unit 540, and a sending unit 550. The voice acquiring unit 510 is configured to obtain the voice to be processed. The modification instruction receiving unit 520 is configured to receive a voice modification instruction for the voice to be processed. The preset parameter acquiring unit 530 is configured to obtain a preset speech parameter value according to the voice modification instruction. The correcting unit 540 is configured to, if the preset speech parameter value does not match the speech parameters of the voice to be processed, correct the speech parameters of the voice to be processed according to the preset speech parameter value to obtain the processed voice. The sending unit 550 is configured to send the processed voice to the electronic device of the communication counterpart.
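The division into units can be sketched as a small class whose fields stand for units 510 to 550. This is an illustrative structure only: the class name `SpeechProcessingDevice`, the callable signatures, the single-number preset, and the `measure` helper are all assumptions introduced for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Samples = List[float]

@dataclass
class SpeechProcessingDevice:
    acquire_voice: Callable[[], Samples]            # voice acquiring unit 510
    receive_instruction: Callable[[], str]          # instruction receiving unit 520
    presets: Dict[str, float]                       # looked up by acquiring unit 530
    measure: Callable[[Samples], float]             # hypothetical parameter measure
    correct: Callable[[Samples, float], Samples]    # correcting unit 540
    send: Callable[[Samples], None]                 # sending unit 550

    def run(self) -> Samples:
        voice = self.acquire_voice()
        instruction = self.receive_instruction()
        preset_value = self.presets[instruction]
        # Correct only when the measured parameter does not match the preset.
        if self.measure(voice) != preset_value:
            voice = self.correct(voice, preset_value)
        self.send(voice)
        return voice
```

Passing the units in as callables keeps the flow of the method visible in `run` while leaving each unit's actual signal processing open.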
Optionally, the apparatus 500 further includes: a sample receiving unit 560, configured to receive a sample voice; and a parameter extraction unit 561, configured to extract speech parameters from the sample voice and store the extracted speech parameters as the preset speech parameter value.
Optionally, after the voice acquiring unit 510, the apparatus 500 further includes: a preprocessing unit 570, configured to preprocess the voice to be processed, the preprocessing including noise reduction.
Optionally, after the correcting unit 540, the apparatus 500 further includes: an adjustment unit 570, configured to receive an adjustment instruction for the processed voice and adjust the processed voice according to the adjustment instruction.
Optionally, before the sending unit 550, the apparatus 500 further includes: a trial-listening unit 580, configured to receive a trial-listening instruction for the processed voice and play the processed voice according to the trial-listening instruction.
Optionally, the speech parameters of the voice to be processed include decibel level, frequency, and/or waveform.
Because the apparatus embodiment and the method embodiments are based on the same concept, the content of the apparatus embodiment may refer to the method embodiments provided that the contents do not conflict, and is not repeated here.
The speech processing apparatus 500 provided by this embodiment can modify the user's voice to generate a beautified voice, which meets the user's demand for voice beautification and improves the listening experience of voice playback.
Embodiment five
Fig. 6 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention. As shown in Fig. 6, the electronic device 600 includes:
one or more processors 610 and a memory 620; one processor 610 is taken as an example in Fig. 6.
The processor 610 and the memory 620 may be connected by a bus or in other ways; connection by a bus is taken as an example in Fig. 6.
As a non-volatile computer-readable storage medium, the memory 620 can be used to store non-volatile software programs, non-volatile computer-executable programs, and modules, such as the program instructions/modules corresponding to the instant message reminder method in the embodiments of the present invention (for example, the voice acquiring unit 510, the modification instruction receiving unit 520, the preset parameter acquiring unit 530, the correcting unit 540, and the sending unit 550 shown in Fig. 5). By running the non-volatile software programs, instructions, and modules stored in the memory 620, the processor 610 executes the various functional applications and data processing of the user terminal, that is, implements the speech processing method of the above method embodiments.
The memory 620 may include a program storage area and a data storage area. The program storage area can store an operating system and an application program required by at least one function; the data storage area can store data created through use of the instant message reminder apparatus, and the like. In addition, the memory 620 may include a high-speed random access memory and may also include a non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or another non-volatile solid-state storage device. In some embodiments, the memory 620 optionally includes memories located remotely from the processor 610, and these remote memories can be connected to the instant message reminder apparatus through a network. Examples of such networks include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
The one or more modules are stored in the memory 620 and, when executed by the one or more processors 610, perform the speech processing method in any of the above method embodiments, for example, performing method steps 110 to 150 in Fig. 1 described above and implementing the functions of the units 510-550 in Fig. 5.
An embodiment of the present invention further provides a non-volatile computer storage medium. The computer storage medium stores computer-executable instructions which, when executed by one or more processors, for example one processor 610 in Fig. 6, enable the one or more processors to perform the speech processing method in any of the above method embodiments, for example, performing the steps shown in Fig. 1 to Fig. 3 described above, and also implementing the functions of the modules or units described in Fig. 5.
The apparatus embodiments described above are merely illustrative. The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
Through the description of the above embodiments, a person of ordinary skill in the art can clearly understand that each embodiment can be implemented by software plus a general-purpose hardware platform, and of course also by hardware. A person of ordinary skill in the art can understand that all or part of the processes in the above method embodiments can be completed by a computer program instructing the relevant hardware. The program can be stored in a computer-readable storage medium and, when executed, may include the processes of the above method embodiments. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
Finally, it should be noted that the above embodiments are only intended to illustrate the technical solutions of the present invention, not to limit them. Within the concept of the present invention, the technical features in the above embodiments or in different embodiments may also be combined, the steps may be implemented in any order, and many other variations of the different aspects of the present invention as described above exist, which are not provided in detail for the sake of brevity. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some of their technical features may be equivalently replaced, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (14)

1. A speech processing method, applied to an electronic device, comprising:
obtaining a voice to be processed;
receiving a voice modification instruction for the voice to be processed;
obtaining a preset speech parameter value according to the voice modification instruction;
if the preset speech parameter value does not match the speech parameters of the voice to be processed, correcting the speech parameters of the voice to be processed according to the preset speech parameter value to obtain a processed voice;
sending the processed voice to an electronic device of a communication counterpart.
2. The method according to claim 1, characterized in that the method further comprises:
receiving a sample voice;
extracting speech parameters from the sample voice, and storing the extracted speech parameters as the preset speech parameter value.
3. The method according to claim 2, characterized in that after the obtaining of the voice to be processed, the method further comprises:
preprocessing the voice to be processed, the preprocessing including noise reduction.
4. The method according to claim 2, characterized in that after the step of, if the preset speech parameter value does not match the speech parameters of the voice to be processed, correcting the speech parameters of the voice to be processed according to the preset speech parameter value to obtain the processed voice, the method further comprises:
receiving an adjustment instruction for the processed voice, and adjusting the processed voice according to the adjustment instruction.
5. The method according to claim 4, characterized in that before the sending of the processed voice to the electronic device of the communication counterpart, the method further comprises:
receiving a trial-listening instruction for the processed voice, and playing the processed voice according to the trial-listening instruction.
6. The method according to any one of claims 1-5, characterized in that the speech parameters of the voice to be processed include decibel level, frequency, and/or waveform.
7. A speech processing apparatus, applied to an electronic device, characterized by comprising:
a voice acquiring unit, configured to obtain a voice to be processed;
a modification instruction receiving unit, configured to receive a voice modification instruction for the voice to be processed;
a preset parameter acquiring unit, configured to obtain a preset speech parameter value according to the voice modification instruction;
a correcting unit, configured to, if the preset speech parameter value does not match the speech parameters of the voice to be processed, correct the speech parameters of the voice to be processed according to the preset speech parameter value to obtain a processed voice;
a sending unit, configured to send the processed voice to an electronic device of a communication counterpart.
8. The apparatus according to claim 7, characterized in that the apparatus further comprises:
a sample receiving unit, configured to receive a sample voice;
a parameter extraction unit, configured to extract speech parameters from the sample voice and store the extracted speech parameters as the preset speech parameter value.
9. The apparatus according to claim 8, characterized in that after the voice acquiring unit, the apparatus further comprises:
a preprocessing unit, configured to preprocess the voice to be processed, the preprocessing including noise reduction.
10. The apparatus according to claim 8, characterized in that after the correcting unit, the apparatus further comprises:
an adjustment unit, configured to receive an adjustment instruction for the processed voice and adjust the processed voice according to the adjustment instruction.
11. The apparatus according to claim 10, characterized in that before the sending unit, the apparatus further comprises:
a trial-listening unit, configured to receive a trial-listening instruction for the processed voice and play the processed voice according to the trial-listening instruction.
12. The apparatus according to any one of claims 7-11, characterized in that the speech parameters of the voice to be processed include decibel level, frequency, and/or waveform.
13. An electronic device, characterized by comprising:
at least one processor; and
a memory communicatively connected to the at least one processor; wherein
the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor so that the at least one processor can perform the method according to any one of claims 1-6.
14. A non-volatile computer-readable storage medium, wherein the computer-readable storage medium stores computer-executable instructions, and the computer-executable instructions are used to enable a user terminal to perform the method according to any one of claims 1-6.
CN201711071260.8A 2017-11-03 2017-11-03 A kind of method, apparatus and electronic equipment of speech processes Active CN107886963B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711071260.8A CN107886963B (en) 2017-11-03 2017-11-03 A kind of method, apparatus and electronic equipment of speech processes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711071260.8A CN107886963B (en) 2017-11-03 2017-11-03 A kind of method, apparatus and electronic equipment of speech processes

Publications (2)

Publication Number Publication Date
CN107886963A true CN107886963A (en) 2018-04-06
CN107886963B CN107886963B (en) 2019-10-11

Family

ID=61778818

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711071260.8A Active CN107886963B (en) 2017-11-03 2017-11-03 A kind of method, apparatus and electronic equipment of speech processes

Country Status (1)

Country Link
CN (1) CN107886963B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108965967A (en) * 2018-05-25 2018-12-07 苏州浪潮智能软件有限公司 TV control method and device, computer readable storage medium, terminal
CN110677521A (en) * 2019-10-24 2020-01-10 北京九狐时代智能科技有限公司 Fixed-line equipment and audio signal processing method
WO2020232578A1 (en) * 2019-05-17 2020-11-26 Xu Junli Memory, microphone, audio data processing method and apparatus, and device and system
CN112216275A (en) * 2019-07-10 2021-01-12 阿里巴巴集团控股有限公司 Voice information processing method and device and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103903627A (en) * 2012-12-27 2014-07-02 中兴通讯股份有限公司 Voice-data transmission method and device
CN104144097A (en) * 2013-05-07 2014-11-12 百度在线网络技术(北京)有限公司 Voice message transmission system, sending end, receiving end and voice message transmission method
CN104538011A (en) * 2014-10-30 2015-04-22 华为技术有限公司 Tone adjusting method and device and terminal device
CN105448300A (en) * 2015-11-12 2016-03-30 小米科技有限责任公司 Method and device for calling
CN105989832A (en) * 2015-02-10 2016-10-05 阿尔卡特朗讯 Method of generating personalized voice in computer equipment and apparatus thereof

Also Published As

Publication number Publication date
CN107886963B (en) 2019-10-11

Similar Documents

Publication Publication Date Title
CN107977183A (en) voice interactive method, device and equipment
CN107886963B (en) A kind of method, apparatus and electronic equipment of speech processes
CN108198569B (en) Audio processing method, device and equipment and readable storage medium
US11475897B2 (en) Method and apparatus for response using voice matching user category
CN109326289A (en) Exempt to wake up voice interactive method, device, equipment and storage medium
CN108922525B (en) Voice processing method, device, storage medium and electronic equipment
CN106992008A (en) Processing method and electronic equipment
CN107293300A (en) Audio recognition method and device, computer installation and readable storage medium storing program for executing
JP2015517709A (en) A system for adaptive distribution of context-based media
US20210125610A1 (en) Ai-driven personal assistant with adaptive response generation
CN110047484A (en) A kind of speech recognition exchange method, system, equipment and storage medium
CN106791024A (en) Voice messaging player method, device and terminal
CN109994106A (en) A kind of method of speech processing and equipment
CN107465818A (en) The method and terminal of a kind of virtual incoming call
WO2017172655A1 (en) Analysis of a facial image to extract physical and emotional characteristics of a user
CN107515765A (en) A kind of method for closing of quarter-bell, system and terminal device
CN107632813A (en) A kind of method and device for closing alarm clock function
CN108364346B (en) Method, apparatus and computer readable storage medium for constructing three-dimensional face model
CN109887509A (en) A kind of control method of ordering, electronic equipment and storage medium based on vocal print
CN115312079A (en) Information display method and device, electronic equipment and computer readable medium
CN107656923A (en) Voice translation method and device
WO2019242415A1 (en) Position prompt method, device, storage medium and electronic device
CN116016779A (en) Voice call translation assisting method, system, computer equipment and storage medium
CN109725798A (en) The switching method and relevant apparatus of Autonomous role
CN108833688A (en) Position reminding method, apparatus, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant