CN108520760A - Voice signal processing method and terminal - Google Patents

Voice signal processing method and terminal

Info

Publication number
CN108520760A
Authority
CN
China
Prior art keywords
voice signal
content
voice
signal
sentence
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810259017.7A
Other languages
Chinese (zh)
Other versions
CN108520760B (en)
Inventor
符升升
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vivo Mobile Communication Co Ltd
Original Assignee
Vivo Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vivo Mobile Communication Co Ltd
Priority to CN201810259017.7A
Publication of CN108520760A
Application granted
Publication of CN108520760B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/72 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for transmitting results of analysis
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/52 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail for supporting social networking services
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72406 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by software upgrading or downloading
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Computing Systems (AREA)
  • Environmental & Geological Engineering (AREA)
  • Telephone Function (AREA)

Abstract

An embodiment of the invention discloses a voice signal processing method and a terminal. The method is applied to a terminal and includes: during a voice call, detecting the voice signal received by the terminal; when a first voice signal in the voice signal is detected not to satisfy a parameter condition, determining the content corresponding to the first voice signal according to a second voice signal, where the second voice signal includes at least one of the voice signal of a preset duration received before the first voice signal and the voice signal of a preset duration received after it; and displaying the content corresponding to the first voice signal on a display interface for the user's reference. This reduces how often the two parties to the call interrupt each other and repeat what was said, keeps the call flowing smoothly, and improves the user experience.

Description

Voice signal processing method and terminal
Technical field
Embodiments of the present invention relate to the technical field of information processing, and in particular to a voice signal processing method and a terminal.
Background
Social software provides real-time interaction. After users have added each other as friends, they can interact in real time in several forms, such as text messages, voice, and video.
When a user makes a voice call with a friend through social software, fluctuation in the network the terminal is connected to can distort the voice signal, so that the user cannot make out what the friend said. To find out what was said, the user usually interrupts the call and asks the friend to repeat it. Repeating the speech adds work for the friend, lengthens the call, and degrades the user experience.
Summary of the invention
The present invention provides a voice signal processing method to solve the problem that, when a voice signal is distorted, repeating the speech adds work for the friend, lengthens the voice call, and degrades the user experience.
In a first aspect, a voice signal processing method applied to a terminal is provided, including:
during a voice call, detecting the voice signal received by the terminal;
when a first voice signal in the voice signal is detected not to satisfy a parameter condition, determining the content corresponding to the first voice signal according to a second voice signal, where the second voice signal includes at least one of the voice signal of a preset duration received before the first voice signal and the voice signal of a preset duration received after the first voice signal;
displaying the content corresponding to the first voice signal on a display interface.
In a second aspect, a terminal is provided, including:
a signal detection module, configured to detect the voice signal received by the terminal during a voice call;
a content determination module, configured to, when a first voice signal in the voice signal is detected not to satisfy a parameter condition, determine the content corresponding to the first voice signal according to a second voice signal, where the second voice signal includes at least one of the voice signal of a preset duration received before the first voice signal and the voice signal of a preset duration received after the first voice signal;
a content display module, configured to display the content corresponding to the first voice signal on a display interface.
In this way, in the embodiments of the present invention, the voice signal received by the terminal is detected during a voice call. When a first voice signal in the voice signal is detected not to satisfy the parameter condition, the first voice signal is judged to be distorted. The content corresponding to the first voice signal is then determined according to the second voice signal received before or after it, and that content is displayed on the display interface for the user's reference. This reduces how often the two parties to the call interrupt each other and repeat what was said, keeps the call flowing smoothly, and improves the user experience.
The above is only an overview of the technical solution of the present invention. So that the technical means of the invention can be understood more clearly and implemented according to the specification, and so that the above and other objects, features, and advantages of the invention are easier to understand, specific embodiments of the invention are set out below.
Brief description of the drawings
To explain the technical solutions of the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. The drawings described below are only some embodiments of the invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of the voice signal processing method of one embodiment of the invention;
Fig. 2 is a flowchart of the voice signal processing method of another embodiment of the invention;
Fig. 3 is a flowchart of the voice signal processing method of an example of the invention;
Fig. 4 is a block diagram of the terminal of one embodiment of the invention;
Fig. 5 is a schematic diagram of the hardware structure of the mobile terminal of one embodiment of the invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in those embodiments. The described embodiments are some, not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the invention without creative effort fall within the scope of protection of the invention.
Embodiment one
Fig. 1 is a flowchart of the voice signal processing method of one embodiment of the invention. The voice signal processing method shown in Fig. 1 is applied to a terminal and includes:
Step 101: during a voice call, detect the voice signal received by the terminal.
The terminal can take many forms, such as a fixed terminal or a mobile terminal. A fixed terminal may be, for example, a desktop computer; a mobile terminal may be, for example, a mobile phone, a notebook computer, or a tablet.
Social software provides real-time interaction and is installed on the terminal. When the terminal is connected to a network, such as a mobile network or a wireless network, the user can use the social software to interact with friends in real time through voice calls, text messages, and similar means. During a voice call between the user and a friend through the social software, the user's terminal and the friend's terminal send voice signals to each other, and the user's terminal receives the voice signal sent by the friend's terminal.
During a voice call, factors such as network quality, ambient noise, and the speaker's pronunciation and speech rate can distort the voice signal, so that the user cannot make out what the friend said.
To solve this problem, the present invention detects the voice signal received by the terminal to determine whether it is distorted. When distortion is detected, the voice signal is processed so that the user can still follow what the friend said.
Step 102: when a first voice signal in the voice signal is detected not to satisfy the parameter condition, determine the content corresponding to the first voice signal according to a second voice signal. The second voice signal includes at least one of the voice signal of a preset duration received before the first voice signal and the voice signal of a preset duration received after it.
The first voice signal is one part of the voice signal received by the terminal. The second voice signal is another part of it: the voice signal of a preset duration received before the first voice signal, the voice signal of a preset duration received after it, or a combination of the two.
The preset duration can be defined in several ways. For example, it may be a fixed duration specified in advance, or a non-fixed duration that adapts to the actual call scenario. In general, a non-fixed duration that adapts to the actual call scenario is preferred, because it improves the processing result.
The present invention sets the parameter condition for voice signals in advance, and the terminal checks each received voice signal against it. A voice signal that satisfies the parameter condition is judged not to be distorted; a voice signal that does not satisfy it is judged to be distorted.
After the first voice signal is detected not to satisfy the parameter condition, it is judged to be distorted, meaning the user of the terminal cannot make out what the friend said. The content corresponding to the first voice signal is then determined from the content corresponding to the undistorted second voice signal. Specifically, the content of the first voice signal can be inferred from the semantics of the content corresponding to the second voice signal.
In practice, after the first voice signal is detected not to satisfy the parameter condition, its content can be determined directly from the content corresponding to the second voice signal. Alternatively, speech recognition can first be applied to the first voice signal, and only if recognition fails is the content determined from the content corresponding to the second voice signal.
The parameter condition can take many forms, for example at least one of a frequency condition, a noise proportion condition, and a speech rate condition. Other parameter conditions can also be used, set according to actual needs. A frequency condition can limit the frequency range, the range of frequency variation, and so on. A noise proportion condition can limit the range of the noise proportion. A speech rate condition can limit the speech rate range, the range of speech rate variation, and so on.
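The check against the parameter condition described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the class names, the feature fields, and every threshold value are assumptions chosen for the example, and how the features are measured from audio is left out.

```python
from dataclasses import dataclass


@dataclass
class ParameterConditions:
    # All thresholds are illustrative assumptions, not values from the patent.
    min_freq_hz: float = 80.0
    max_freq_hz: float = 4000.0
    max_noise_ratio: float = 0.4
    min_chars_per_sec: float = 1.0
    max_chars_per_sec: float = 8.0


@dataclass
class SegmentFeatures:
    # Hypothetical features assumed to be measured for one signal segment.
    dominant_freq_hz: float
    noise_ratio: float      # share of the segment judged to be noise, 0..1
    chars_per_sec: float    # estimated speech rate


def satisfies(features: SegmentFeatures, cond: ParameterConditions) -> bool:
    """Return True when the segment meets every configured condition."""
    freq_ok = cond.min_freq_hz <= features.dominant_freq_hz <= cond.max_freq_hz
    noise_ok = features.noise_ratio <= cond.max_noise_ratio
    rate_ok = cond.min_chars_per_sec <= features.chars_per_sec <= cond.max_chars_per_sec
    return freq_ok and noise_ok and rate_ok


def is_distorted(features: SegmentFeatures, cond: ParameterConditions) -> bool:
    """A segment that fails the parameter condition is judged to be distorted."""
    return not satisfies(features, cond)
```

A segment with a high noise proportion or an implausible speech rate would be flagged as the first voice signal, while segments that pass can serve as the second voice signal.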
Step 103: display the content corresponding to the first voice signal on the display interface.
After the content corresponding to the first voice signal is determined, it is displayed on the display interface of the terminal for the user's reference. This reduces unnecessary repetition by both parties during the call, keeps the call flowing smoothly, and improves the user experience of the voice call. The determined content can be shown as text, as a picture, or as a combination of text and picture.
To help the user follow the call as a whole, the content corresponding to the first voice signal and the content corresponding to the second voice signal can be displayed together on the display interface, ordered by the time the signals were received.
The voice signal processing function can be enabled in several ways. For example, it can be enabled when the user starts a voice call with a friend, or after the terminal receives an enabling instruction from the user, such as the user selecting a preset option or button. Other suitable ways of enabling the function are also possible, and the present invention places no limit on them.
According to the embodiment of the present invention, the voice signal received by the terminal is detected during a voice call. When a first voice signal in the voice signal is detected not to satisfy the parameter condition, the first voice signal is judged to be distorted. The content corresponding to the first voice signal is then determined according to the second voice signal received before or after it, and that content is displayed on the display interface for the user's reference. This reduces how often the two parties to the call interrupt each other and repeat what was said, keeps the call flowing smoothly, and improves the user experience.
Embodiment two
Fig. 2 is a flowchart of the voice signal processing method of another embodiment of the invention. The voice signal processing method shown in Fig. 2 is applied to a terminal and includes:
Step 201: during a voice call, detect the voice signal received by the terminal.
During a voice call, the user's terminal receives the voice signal sent by the friend's terminal and checks whether the received voice signal is normal.
Step 202: when a first voice signal in the voice signal is detected not to satisfy the parameter condition, construct a sentence to be corrected from the content corresponding to the second voice signal. The sentence to be corrected has a gap at the sentence position corresponding to the first voice signal.
The second voice signal is the voice signal received before the first voice signal, the voice signal received after it, or a combination of the two. The second voice signal satisfies the parameter condition, so its content can be determined.
In this embodiment, the terminal stores the content of the voice call in a designated storage location during the call. The content can be stored in several ways: for example, storing what the friend, or both parties, said over the entire call, or storing what the friend, or both parties, said within a preset history window such as the past half minute. Other suitable storage schemes are also possible.
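One way to keep only a preset history window of call content is a time-bounded queue. The sketch below assumes that recognized text arrives with a receive timestamp in seconds; the class name `TranscriptBuffer` and its methods are hypothetical and are not taken from the patent.

```python
from collections import deque


class TranscriptBuffer:
    """Keeps recognized call content for the most recent `window_s` seconds."""

    def __init__(self, window_s: float = 30.0):
        self.window_s = window_s
        self._items = deque()  # entries are (receive_time_s, speaker, text)

    def add(self, receive_time_s: float, speaker: str, text: str) -> None:
        self._items.append((receive_time_s, speaker, text))
        # Drop entries that have fallen out of the history window.
        while self._items and receive_time_s - self._items[0][0] > self.window_s:
            self._items.popleft()

    def recent(self):
        """Return the stored entries, oldest first."""
        return list(self._items)
```

Storing the whole call instead would simply mean never evicting entries; the half-minute default mirrors the example window mentioned above.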
After the first voice signal is detected not to satisfy the parameter condition, the content corresponding to the second voice signal is read from the designated storage location of the terminal, and the sentence to be corrected is built from it. The sentence to be corrected has a gap at the sentence position corresponding to the first voice signal.
Because the second voice signal can be received before the first voice signal, after it, or both, the gap can fall at different positions in the sentence to be corrected: in the middle, at the end, or at the beginning.
The sentence to be corrected can be constructed from the content corresponding to the second voice signal in several ways. For example, first, use the relationship between the signal duration of the second voice signal and the length of its content to determine the content length that matches the signal duration of the first voice signal. Second, construct the sentence to be corrected from the content corresponding to the second voice signal and a gap of that length.
For example, after the first voice signal in the received voice signal is detected not to satisfy the parameter condition, that is, after it is judged to be distorted, the signal duration per character of the second voice signal (t1/N1) is determined from the number of characters (N1) in the content corresponding to the undistorted second voice signal and its signal duration (t1). Dividing the signal duration (t2) of the distorted first voice signal by (t1/N1) gives the number of characters in the content corresponding to the first voice signal, (N1·t2/t1), which is taken as the number of characters in the gap. The sentence to be corrected is then constructed from a gap of (N1·t2/t1) characters and the content corresponding to the second voice signal.
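The character-count estimate above can be sketched in a few lines. The function names and the `*` placeholder are illustrative, and rounding to the nearest integer with a minimum of one character is an assumption added here, since the text does not say how a fractional result is handled.

```python
def estimate_gap_chars(n1: int, t1: float, t2: float) -> int:
    """Estimate the characters in the distorted segment as N1 * t2 / t1.

    n1: number of characters in the content of the second voice signal
    t1: signal duration of the second voice signal, in seconds
    t2: signal duration of the distorted first voice signal, in seconds
    """
    if n1 <= 0 or t1 <= 0:
        raise ValueError("the second voice signal must have content and duration")
    # Rounding and the lower bound of 1 are assumptions of this sketch.
    return max(1, round(n1 * t2 / t1))


def build_sentence_to_be_corrected(before: str, after: str, gap_chars: int,
                                   placeholder: str = "*") -> str:
    """Join the content received before and after the gap around placeholders."""
    return before + placeholder * gap_chars + after
```

With six characters recognized over 3 seconds and a distorted segment of 1 second, the estimate is two characters; an empty `before` or `after` places the gap at the beginning or end of the sentence.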
Step 203: search a sentence database for target sentences that match the sentence to be corrected.
The sentence database is set up in advance and records a large number of sentences. After the sentence to be corrected is constructed from the content corresponding to the second voice signal, the sentence database is searched for target sentences that match it.
Step 204: take the content at the gap position in the target sentence as the content corresponding to the first voice signal.
In a target sentence matched from the sentence database, the content at the position of the gap in the sentence to be corrected is the content corresponding to the first voice signal.
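The lookup in steps 203 and 204 can be sketched with exact pattern matching, under the assumption that the sentence database is an in-memory list of strings and that the gap is marked with `*` placeholders, one per character. The patent does not specify the matching technique, so this regular-expression approach and the function name are illustrative only; the strings in the usage note are toy data.

```python
import re


def find_target_sentences(sentence: str, database, placeholder: str = "*"):
    """Return (target_sentence, gap_content) pairs that match the gap pattern."""
    # Split so that each run of placeholders becomes its own list element.
    parts = re.split(f"({re.escape(placeholder)}+)", sentence)
    # A placeholder run of length k becomes a capture group of exactly k characters;
    # every other part must match literally.
    pattern = "".join(
        f"(.{{{len(p)}}})" if p and set(p) == {placeholder} else re.escape(p)
        for p in parts
    )
    regex = re.compile(pattern, re.DOTALL)
    results = []
    for candidate in database:
        match = regex.fullmatch(candidate)
        if match:
            results.append((candidate, "".join(match.groups())))
    return results
```

For instance, searching `["ABxyCD", "ABxCD", "ABzzCD"]` for `"AB**CD"` keeps the first and third entries and extracts `"xy"` and `"zz"` as the candidate gap contents.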
One or more target sentences may be matched from the sentence database. When there are two or more, the gap content of every target sentence can be taken as content corresponding to the first voice signal, and in the subsequent step all of these contents are displayed on the display interface of the terminal for the user to view. Alternatively, the target sentences can first be ranked, and the gap content of the top N ranked target sentences taken as the content corresponding to the first voice signal, where N is a positive integer (N ≥ 1) whose value can be set according to actual needs.
Two or more target sentences can be ranked in several ways, for example according to at least one of the time the first voice signal was received, the location information of the terminal, and the voice effect corresponding to the first voice signal. Other parameters can also be used for ranking; the embodiments of the present invention place no limit on this.
For example, the content corresponding to the second voice signal received before the first voice signal is "you", and the content corresponding to the second voice signal received after the first voice signal is "have had a meal in the morning". From the signal duration of the second voice signal and its number of characters, together with the signal duration of the first voice signal, the content corresponding to the first voice signal is inferred to contain two characters. The sentence to be corrected is therefore "you ** have had a meal in the morning", where each "*" represents one character of the original Chinese text. The sentence to be corrected is then matched against the sentence database, which yields five matching target sentences. Ranking them gives: "you have had a meal this morning", "you have had a meal yesterday morning", "you have had a meal on the weekend morning", "you remember having had a meal in the morning", and "you have not had a meal in the morning". The three highest-ranked sentences ("you have had a meal this morning", "you have had a meal yesterday morning", and "you have had a meal on the weekend morning") are selected as the target sentences.
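Selecting the top N ranked target sentences can be sketched as below. The scoring function is a placeholder: the patent names receiving time, terminal location, and voice effect as possible ranking inputs but gives no formula, so any `score` callable passed in here is an assumption of this sketch.

```python
def top_n_gap_contents(matches, score, n: int = 3):
    """Rank matched target sentences and return the gap contents of the top n.

    matches: list of (target_sentence, gap_content) pairs
    score:   callable mapping a target sentence to a number, higher is better
             (hypothetical; it could combine time, location, and voice effect)
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    ranked = sorted(matches, key=lambda pair: score(pair[0]), reverse=True)
    return [gap for _, gap in ranked[:n]]
```

Given five matches and n = 3, as in the example above, only the gap contents of the three best-scoring sentences would be passed on for display.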
Step 205: display the content corresponding to the first voice signal on the display interface.
After the gap content of the target sentences is taken as the content corresponding to the first voice signal, that content is displayed on the display interface for the user to view. For example, the three target sentences from the example in step 204, "you have had a meal this morning", "you have had a meal yesterday morning", and "you have had a meal on the weekend morning", are displayed on the display interface in ranked order.
In practice, either the gap content of the target sentence alone or the whole target sentence containing the gap content can be displayed on the display interface. In the latter case, the content corresponding to the first voice signal and the content corresponding to the second voice signal are displayed together. Because the target sentence contains more of the conversation, it helps the user follow the conversation as a whole.
To help those skilled in the art understand the invention more clearly, the voice signal processing method of the embodiments of the present invention is described in detail through the following example.
Fig. 3 is a flowchart of the voice signal processing method of an example of the present invention. Referring to Fig. 3, the voice signal processing method includes:
S1, detect that the voice call function of social software is opened.
S2, opening network quality testing.
The quality of the network connected to terminal is detected.
S3, construction buffer pool, voice signal corresponding content record of the voice communication both sides in preset duration n seconds is existed In buffer pool.
S4, judge that the network of terminal connection whether there is exception, if not, S5 is thened follow the steps, if it is, executing step Rapid S6.
S5, judge whether voice communication terminates, if it is, method terminates, if it is not, then executing step S4.
S6, judge whether the current speech signal being newly added in buffer pool is distorted, if conditions are not met, then executing step Rapid S5, if it is satisfied, then executing step S7.
In this example, the current voice signal is the voice signal newly received by the terminal. It comprises two parts: one part meets the parameter conditions, and the other part does not.
In this example, a voice signal being distorted means that the voice signal does not meet the preset parameter conditions. For the content of the parameter conditions, refer to the foregoing description of the embodiments of the present invention.
S7: Perform speech recognition on the undistorted portion of the current voice signal to obtain the content, that is, the text, corresponding to the undistorted portion.
S8: Determine whether the distorted portion of the current voice signal can be recognized by speech recognition. If it cannot, perform steps S9 to S11; if it can, perform step S12.
S9: According to the content corresponding to the undistorted portion, perform natural-language context inference on the content corresponding to the distorted portion.
The content corresponding to the distorted portion may be inferred by looking it up in the phrase database described above, or by any other applicable inference method.
S10: Sort the multiple inferred contents and select the top m contents in the ranking, where m is a positive integer greater than or equal to 1.
S11: Place the top m inferred contents at the designated position to generate the corrected content corresponding to the current voice signal, where the designated position is the sentence position corresponding to the distorted portion within the sentence corresponding to the current voice signal.
After step S11, perform step S13.
S12: Place the content obtained by speech recognition at the designated position to generate the corrected content corresponding to the current voice signal, where the designated position is the sentence position corresponding to the distorted portion within the sentence corresponding to the current voice signal.
After step S12, perform step S13.
S13: Display the corrected content corresponding to the current voice signal on the display interface for reference.
After step S13, perform step S5.
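The branching in steps S8 to S12 can be sketched as follows. This is only an illustrative sketch, not the implementation of the invention: the function name `correct_current_signal`, the two callables, and the representation of the recognized text as a "before" and an "after" string are all assumptions made for the example. The output of step S7 is assumed to be already available as those two strings, and the inference callable is assumed to return candidates that are already sorted.

```python
from typing import Callable, List, Optional


def correct_current_signal(
    clean_text_before: str,
    clean_text_after: str,
    recognize_distorted: Callable[[], Optional[str]],
    infer_candidates: Callable[[str, str], List[str]],
    m: int = 1,
) -> List[str]:
    """Return corrected sentences for the current voice signal (S8 to S12).

    All parameter names are hypothetical. `recognize_distorted` returns the
    recognized text of the distorted portion, or None if recognition fails.
    `infer_candidates` returns inferred contents, best candidate first.
    """
    if m < 1:
        raise ValueError("m must be a positive integer")

    # S8: try ordinary speech recognition on the distorted portion first.
    recognized = recognize_distorted()
    if recognized is not None:
        # S12: the recognized content fills the designated position.
        fillers = [recognized]
    else:
        # S9 and S10: infer from context and keep the top m candidates.
        fillers = infer_candidates(clean_text_before, clean_text_after)[:m]

    # S11 / S12: the designated position is where the distorted portion
    # sat in the sentence, i.e. between the two undistorted parts.
    return [clean_text_before + f + clean_text_after for f in fillers]
```

With m = 2 and a failed recognition, the sketch returns two corrected sentences, one per inferred candidate, which step S13 would then display.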
This example uses voice recording, speech recognition, and natural-language context inference to assist in correcting the content corresponding to voice signals that are briefly distorted by network fluctuation, and displays the corrected content on the display interface for reference. By viewing the content on the display interface, the user can roughly understand the content corresponding to the distorted voice signal. This reduces unnecessary repetition by both parties during the call, ensures that the voice call proceeds smoothly, and improves the user experience of network voice calls.
According to the embodiments of the present invention, during a voice call, the voice signal received by the terminal is detected. When the first voice signal in the voice signal is detected not to meet the parameter conditions, the first voice signal is determined to be distorted. The content corresponding to the first voice signal is then determined according to the second voice signal received before or after the first voice signal, and that content is displayed on the display interface for reference. This reduces the need for the two parties to interrupt each other and repeat what was said, ensures that the voice call proceeds smoothly, and improves the user experience.
Embodiment three
Fig. 4 is a block diagram of a terminal according to an embodiment of the present invention. The terminal shown in Fig. 4 includes:
A signal detection module 301, configured to detect, during a voice call, the voice signal received by the terminal.
A content determination module 302, configured to determine, when it is detected that the first voice signal in the voice signal does not meet the parameter conditions, the content corresponding to the first voice signal according to the second voice signal, where the second voice signal includes at least one of: a voice signal of a preset duration received before the first voice signal, and a voice signal of a preset duration received after the first voice signal.
A content display module 303, configured to display the content corresponding to the first voice signal on the display interface.
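As an illustration of how the second voice signal could be selected, the sketch below picks the undistorted segments that fall within a preset duration before and after a distorted segment. The `Segment` type, its fields, and the function name `context_windows` are assumptions made for this example; the source does not specify how the voice signal is segmented or stored.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Segment:
    start: float  # seconds since the call started (assumed unit)
    end: float
    ok: bool      # True if the segment meets the parameter conditions


def context_windows(
    segments: List[Segment], bad_index: int, preset: float
) -> Tuple[List[Segment], List[Segment]]:
    """Return the undistorted segments overlapping the preset-duration
    windows immediately before and after the distorted segment."""
    bad = segments[bad_index]
    before = [
        s for s in segments[:bad_index]
        if s.ok and s.end > bad.start - preset
    ]
    after = [
        s for s in segments[bad_index + 1:]
        if s.ok and s.start < bad.end + preset
    ]
    return before, after
```

Either returned list may be empty, which matches the "at least one of" wording: the content determination can proceed from the preceding window alone, the following window alone, or both.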
In the embodiments of the present invention, preferably, the content determination module 302 includes:
A sentence construction submodule, configured to construct a sentence to be corrected according to the content corresponding to the second voice signal, where the sentence to be corrected has a vacancy at the sentence position corresponding to the first voice signal;
A target sentence lookup submodule, configured to look up, in a sentence database, a target sentence that matches the sentence to be corrected;
A content obtaining submodule, configured to take the content corresponding to the vacancy in the target sentence as the content corresponding to the first voice signal.
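The three submodules above can be sketched together as a simple lookup. This is a minimal sketch under stated assumptions: the sentence database is modeled as a plain list of strings, matching is done with a regular expression, and the function name `find_gap_contents` is hypothetical. The source does not specify the matching method.

```python
import re
from typing import List


def find_gap_contents(before: str, after: str, sentence_db: List[str]) -> List[str]:
    """Look up target sentences of the form before + <vacancy> + after
    and return the content found at the vacancy in each of them."""
    # The sentence to be corrected: known context around one vacancy.
    pattern = re.compile(re.escape(before) + r"(.+)" + re.escape(after))
    contents = []
    for sentence in sentence_db:
        match = pattern.fullmatch(sentence)
        if match:
            # The vacancy content becomes a candidate for the content
            # corresponding to the first (distorted) voice signal.
            contents.append(match.group(1))
    return contents
```

For example, with the context "I will arrive at " and " tonight", a database entry "I will arrive at eight tonight" yields the candidate "eight".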
In the embodiments of the present invention, preferably, the sentence construction submodule includes:
A content-length determination submodule, configured to determine, according to the correspondence between the signal duration of the second voice signal and its content length, the content length that matches the signal duration of the first voice signal;
A sentence obtaining submodule, configured to construct the sentence to be corrected according to the content corresponding to the second voice signal and a vacancy of the corresponding content length.
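One plausible reading of the duration-to-length correspondence is a proportional estimate based on the speaking rate observed in the second voice signal. The sketch below assumes that reading; the function name, the use of characters as the unit of content length, and the lower bound of one character are all assumptions made for illustration.

```python
def estimate_gap_length(
    context_duration_s: float, context_chars: int, gap_duration_s: float
) -> int:
    """Estimate how many characters fit in the distorted interval,
    using the speaking rate of the undistorted context."""
    if context_duration_s <= 0:
        raise ValueError("context duration must be positive")
    chars_per_second = context_chars / context_duration_s
    # Assume at least one character was spoken in the distorted interval.
    return max(1, round(chars_per_second * gap_duration_s))
```

The estimated length can then be used to size the vacancy, so that only target sentences whose vacancy content has a comparable length are considered.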
In the embodiments of the present invention, preferably, the content determination module 302 further includes:
A sentence sorting submodule, configured to sort at least two target sentences after the target sentences matching the sentence to be corrected are looked up in the sentence database, in the case where at least two target sentences are found in the database;
The content obtaining submodule is specifically configured to take the content corresponding to the vacancy in the top N target sentences in the ranking as the content corresponding to the first voice signal, where N is a positive integer greater than or equal to 1.
In the embodiments of the present invention, preferably, the sentence sorting submodule is specifically configured to sort the at least two target sentences according to at least one of: the receiving time of the first voice signal, the location information of the terminal, and the voice effect corresponding to the first voice signal.
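The source names the three sorting factors but not how they are combined. The sketch below assumes one simple possibility: each target sentence carries a set of descriptive tags, the receiving time, location, and voice effect are likewise expressed as tags, and sentences are ranked by how many tags they share with the context. The tag scheme and the function name `rank_candidates` are hypothetical.

```python
from typing import Dict, List, Set


def rank_candidates(
    candidates: Dict[str, Set[str]], context_tags: Set[str], n: int
) -> List[str]:
    """Rank target sentences by tag overlap with the call context
    and return the top n. Ties keep their original database order."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    ranked = sorted(
        candidates,
        key=lambda sentence: len(candidates[sentence] & context_tags),
        reverse=True,
    )
    return ranked[:n]
```

For a call received in the evening at a restaurant, a sentence tagged with both "evening" and "restaurant" would rank ahead of one tagged only with "restaurant".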
In the embodiments of the present invention, preferably, the content display module 303 is specifically configured to display, on the display interface, the target sentence that includes the content corresponding to the vacancy.
According to the embodiments of the present invention, during a voice call, the voice signal received by the terminal is detected. When the first voice signal in the voice signal is detected not to meet the parameter conditions, the first voice signal is determined to be distorted. The content corresponding to the first voice signal is then determined according to the second voice signal received before or after the first voice signal, and that content is displayed on the display interface for reference. This reduces the need for the two parties to interrupt each other and repeat what was said, ensures that the voice call proceeds smoothly, and improves the user experience.
Fig. 5 is a schematic diagram of the hardware structure of a mobile terminal for implementing the embodiments of the present invention.
The mobile terminal 400 includes, but is not limited to, a radio frequency unit 401, a network module 402, an audio output unit 403, an input unit 404, a sensor 405, a display unit 406, a user input unit 407, an interface unit 408, a memory 409, a processor 410, and a power supply 411. Those skilled in the art will understand that the mobile terminal structure shown in Fig. 5 does not limit the mobile terminal; the mobile terminal may include more or fewer components than shown, combine certain components, or use a different arrangement of components. In the embodiments of the present invention, mobile terminals include, but are not limited to, mobile phones, tablet computers, laptop computers, palmtop computers, in-vehicle terminals, wearable devices, and pedometers.
The radio frequency unit 401 is configured to receive, during a voice call, the voice signal sent by the peer that is in the voice call with the terminal.
The processor 410 is configured to: detect, during a voice call, the voice signal received by the terminal; when it is detected that the first voice signal in the voice signal does not meet the parameter conditions, determine the content corresponding to the first voice signal according to the second voice signal, where the second voice signal includes at least one of a voice signal of a preset duration received before the first voice signal and a voice signal of a preset duration received after the first voice signal; and display the content corresponding to the first voice signal on the display interface.
It should be understood that, in the embodiments of the present invention, the radio frequency unit 401 may be used to receive and send signals while sending and receiving information or during a call. Specifically, it receives downlink data from a base station and passes the data to the processor 410 for processing; it also sends uplink data to the base station. Generally, the radio frequency unit 401 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low-noise amplifier, and a duplexer. In addition, the radio frequency unit 401 may communicate with a network and other devices through a wireless communication system.
The mobile terminal provides the user with wireless broadband Internet access through the network module 402, for example helping the user send and receive e-mail, browse web pages, and access streaming media.
The audio output unit 403 may convert audio data received by the radio frequency unit 401 or the network module 402, or stored in the memory 409, into an audio signal and output it as sound. The audio output unit 403 may also provide audio output related to a specific function performed by the mobile terminal 400 (for example, a call signal reception sound or a message reception sound). The audio output unit 403 includes a speaker, a buzzer, a receiver, and the like.
The input unit 404 is used to receive audio or video signals. The input unit 404 may include a graphics processing unit (GPU) 4041 and a microphone 4042. The graphics processor 4041 processes image data of still pictures or video obtained by an image capture device (such as a camera) in video capture mode or image capture mode. The processed image frames may be displayed on the display unit 406. Image frames processed by the graphics processor 4041 may be stored in the memory 409 (or another storage medium) or sent via the radio frequency unit 401 or the network module 402. The microphone 4042 can receive sound and process it into audio data. In telephone call mode, the processed audio data may be converted into a format that can be sent to a mobile communication base station via the radio frequency unit 401 and then output.
The mobile terminal 400 further includes at least one sensor 405, such as an optical sensor, a motion sensor, and other sensors. Specifically, the optical sensor includes an ambient light sensor and a proximity sensor. The ambient light sensor can adjust the brightness of the display panel 4061 according to the brightness of the ambient light, and the proximity sensor can turn off the display panel 4061 and/or the backlight when the mobile terminal 400 is moved to the ear. As one kind of motion sensor, an accelerometer can detect the magnitude of acceleration in all directions (generally three axes), and can detect the magnitude and direction of gravity when stationary. It can be used to identify the attitude of the mobile terminal (for example, switching between landscape and portrait, related games, and magnetometer attitude calibration) and for vibration-recognition functions (such as a pedometer or tap detection). The sensor 405 may also include a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, an infrared sensor, and the like, which are not described in detail here.
The display unit 406 is used to display information input by the user or information provided to the user. The display unit 406 may include a display panel 4061, which may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED) display, or the like.
The user input unit 407 may be used to receive input numeric or character information and to generate key signal input related to user settings and function control of the mobile terminal. Specifically, the user input unit 407 includes a touch panel 4071 and other input devices 4072. The touch panel 4071, also called a touch screen, collects the user's touch operations on or near it (for example, operations performed by the user on or near the touch panel 4071 with a finger, a stylus, or any other suitable object or accessory). The touch panel 4071 may include two parts: a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal produced by the touch operation, and passes the signal to the touch controller. The touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 410, and receives and executes commands sent by the processor 410. The touch panel 4071 may be implemented in various types, such as resistive, capacitive, infrared, and surface acoustic wave. In addition to the touch panel 4071, the user input unit 407 may include other input devices 4072. Specifically, the other input devices 4072 may include, but are not limited to, a physical keyboard, function keys (such as volume control keys and a power key), a trackball, a mouse, and a joystick, which are not described in detail here.
Further, the touch panel 4071 may be overlaid on the display panel 4061. When the touch panel 4071 detects a touch operation on or near it, it passes the operation to the processor 410 to determine the type of touch event, and the processor 410 then provides the corresponding visual output on the display panel 4061 according to the type of touch event. Although in Fig. 5 the touch panel 4071 and the display panel 4061 are two independent components that implement the input and output functions of the mobile terminal, in some embodiments the touch panel 4071 and the display panel 4061 may be integrated to implement the input and output functions of the mobile terminal; this is not limited here.
The interface unit 408 is the interface through which an external device is connected to the mobile terminal 400. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 408 may be used to receive input (for example, data or power) from an external device and transfer the received input to one or more elements within the mobile terminal 400, or to transfer data between the mobile terminal 400 and an external device.
The memory 409 may be used to store software programs and various data. The memory 409 may mainly include a program storage area and a data storage area. The program storage area may store the operating system and the application programs required by at least one function (such as a sound playback function or an image playback function). The data storage area may store data created through use of the mobile phone (such as audio data and a phone book). In addition, the memory 409 may include high-speed random access memory, and may also include non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or other volatile solid-state storage devices.
The processor 410 is the control center of the mobile terminal. It connects the various parts of the entire mobile terminal through various interfaces and lines, and performs the various functions of the mobile terminal and processes data by running or executing the software programs and/or modules stored in the memory 409 and calling the data stored in the memory 409, thereby monitoring the mobile terminal as a whole. The processor 410 may include one or more processing units. Preferably, the processor 410 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, the user interface, and application programs, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may alternatively not be integrated into the processor 410.
The mobile terminal 400 may also include a power supply 411 (such as a battery) that supplies power to the components. Preferably, the power supply 411 may be logically connected to the processor 410 through a power management system, so that functions such as charging management, discharging management, and power consumption management are implemented through the power management system.
In addition, the mobile terminal 400 includes some function modules that are not shown, which are not described in detail here.
Preferably, an embodiment of the present invention further provides a terminal, including a processor 410, a memory 409, and a computer program stored in the memory 409 and executable on the processor 410. When executed by the processor 410, the computer program implements each process of the above voice signal processing method embodiments and can achieve the same technical effects; to avoid repetition, details are not described here again.
An embodiment of the present invention further provides a computer-readable storage medium on which a computer program is stored. When executed by a processor, the computer program implements each process of the above voice signal processing method embodiments and can achieve the same technical effects; to avoid repetition, details are not described here again. The computer-readable storage medium is, for example, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
It should be noted that, in this document, the terms "include", "comprise", and any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of other identical elements in the process, method, article, or device that includes the element.
From the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as a ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to execute the methods described in the embodiments of the present invention.
The embodiments of the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to the specific embodiments described above, which are merely illustrative rather than restrictive. Under the inspiration of the present invention, those of ordinary skill in the art may devise many other forms without departing from the purpose of the present invention and the scope protected by the claims, all of which fall within the protection of the present invention.

Claims (12)

1. A voice signal processing method, applied to a terminal, characterized by comprising:
detecting, during a voice call, the voice signal received by the terminal;
when it is detected that a first voice signal in the voice signal does not meet parameter conditions, determining content corresponding to the first voice signal according to a second voice signal, wherein the second voice signal includes at least one of a voice signal of a preset duration received before the first voice signal and a voice signal of a preset duration received after the first voice signal;
displaying the content corresponding to the first voice signal on a display interface.
2. The method according to claim 1, characterized in that determining the content corresponding to the first voice signal according to the second voice signal comprises:
constructing a sentence to be corrected according to the content corresponding to the second voice signal, wherein the sentence to be corrected has a vacancy at the sentence position corresponding to the first voice signal;
looking up, in a sentence database, a target sentence that matches the sentence to be corrected;
taking the content corresponding to the vacancy in the target sentence as the content corresponding to the first voice signal.
3. The method according to claim 2, characterized in that constructing the sentence to be corrected according to the content corresponding to the second voice signal comprises:
determining, according to the correspondence between the signal duration of the second voice signal and its content length, the content length that matches the signal duration of the first voice signal;
constructing the sentence to be corrected according to the content corresponding to the second voice signal and a vacancy of the corresponding content length.
4. The method according to claim 2, characterized in that, when at least two target sentences are found in the database, after looking up, in the sentence database, the target sentences that match the sentence to be corrected, the method further comprises:
sorting the at least two target sentences;
wherein taking the content corresponding to the vacancy in the target sentence as the content corresponding to the first voice signal comprises:
taking the content corresponding to the vacancy in the top N target sentences in the ranking as the content corresponding to the first voice signal, wherein N is a positive integer greater than or equal to 1.
5. The method according to claim 4, characterized in that sorting the at least two target sentences comprises:
sorting the at least two target sentences according to at least one of the receiving time of the first voice signal, the location information of the terminal, and the voice effect corresponding to the first voice signal.
6. The method according to claim 2, characterized in that displaying the content corresponding to the first voice signal on the display interface comprises:
displaying, on the display interface, the target sentence that includes the content corresponding to the vacancy.
7. A terminal, characterized by comprising:
a signal detection module, configured to detect, during a voice call, the voice signal received by the terminal;
a content determination module, configured to determine, when it is detected that a first voice signal in the voice signal does not meet parameter conditions, content corresponding to the first voice signal according to a second voice signal, wherein the second voice signal includes at least one of a voice signal of a preset duration received before the first voice signal and a voice signal of a preset duration received after the first voice signal;
a content display module, configured to display the content corresponding to the first voice signal on a display interface.
8. The terminal according to claim 7, characterized in that the content determination module comprises:
a sentence construction submodule, configured to construct a sentence to be corrected according to the content corresponding to the second voice signal, wherein the sentence to be corrected has a vacancy at the sentence position corresponding to the first voice signal;
a target sentence lookup submodule, configured to look up, in a sentence database, a target sentence that matches the sentence to be corrected;
a content obtaining submodule, configured to take the content corresponding to the vacancy in the target sentence as the content corresponding to the first voice signal.
9. The terminal according to claim 8, characterized in that the sentence construction submodule comprises:
a content-length determination submodule, configured to determine, according to the correspondence between the signal duration of the second voice signal and its content length, the content length that matches the signal duration of the first voice signal;
a sentence obtaining submodule, configured to construct the sentence to be corrected according to the content corresponding to the second voice signal and a vacancy of the corresponding content length.
10. The terminal according to claim 8, characterized in that the content determination module further comprises:
a sentence sorting submodule, configured to sort at least two target sentences after the target sentences matching the sentence to be corrected are looked up in the sentence database, in the case where at least two target sentences are found in the database;
wherein the content obtaining submodule is specifically configured to take the content corresponding to the vacancy in the top N target sentences in the ranking as the content corresponding to the first voice signal, wherein N is a positive integer greater than or equal to 1.
11. The terminal according to claim 10, characterized in that:
the sentence sorting submodule is specifically configured to sort the at least two target sentences according to at least one of the receiving time of the first voice signal, the location information of the terminal, and the voice effect corresponding to the first voice signal.
12. The terminal according to claim 8, characterized in that:
the content display module is specifically configured to display, on the display interface, the target sentence that includes the content corresponding to the vacancy.
CN201810259017.7A 2018-03-27 2018-03-27 Voice signal processing method and terminal Active CN108520760B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810259017.7A CN108520760B (en) 2018-03-27 2018-03-27 Voice signal processing method and terminal

Publications (2)

Publication Number Publication Date
CN108520760A true CN108520760A (en) 2018-09-11
CN108520760B CN108520760B (en) 2020-07-24

Family

ID=63434318

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810259017.7A Active CN108520760B (en) 2018-03-27 2018-03-27 Voice signal processing method and terminal

Country Status (1)

Country Link
CN (1) CN108520760B (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant