CN107516534A

CN107516534A - Voice information comparison method and device and terminal equipment

Info

Publication number: CN107516534A
Application number: CN201710769644.0A
Authority: CN
Inventors: 吴小龙
Original assignee: Guangdong Genius Technology Co Ltd
Current assignee: Guangdong Genius Technology Co Ltd
Priority date: 2017-08-31
Filing date: 2017-08-31
Publication date: 2017-12-26
Anticipated expiration: 2037-08-31
Also published as: CN107516534B

Abstract

The invention is suitable for the technical field of voice information, and provides a method, a device and a terminal device for comparing voice information, wherein the method comprises the following steps: acquiring two pieces of voice information to be compared, and generating a corresponding audio waveform diagram for each piece of voice information to be compared; obtaining the similarity of the two pieces of voice information by comparing the audio oscillograms corresponding to the two pieces of voice information; if the similarity of the two voices is greater than a threshold value, outputting a comparison result that the voice contents of the two voice messages are the same; otherwise, outputting a comparison result that the contents of the two pieces of voice information are different. The invention generates the audio waveform diagram by the two pieces of voice information, and determines whether the contents of the two pieces of voice information are consistent or not by comparing the audio waveform diagrams, so that the recognition result is more accurate.

Description

A kind of comparison method of voice messaging, device and terminal device

Technical field

The invention belongs to voice messaging technical field, more particularly to a kind of comparison method of voice messaging, device and terminal Equipment.

Background technology

At present, the data type that text and voice are combined is more and more, and accordingly, voice messaging also produces many data Form.In the application of voice messaging, in order to meet the needs of different, often a kind of voice data file of form is changed For the voice data file of another form, this requires to ensure data during voice data file form is changed The uniformity of content.

But the uniformity of data content is ensure that even in the process that voice data file form is changed, in follow-up language During sound data processing utilizes, it is also possible to the intermediate voice data file of certain form is modified, this is resulted in Same speech data source file under different-format or the voice data file of different phase data content it is inconsistent, still User is being that can not judge these voice data files under using different-format or during the voice data file of different phase Data content it is whether consistent.

The content of the invention

In view of this, the embodiments of the invention provide a kind of comparison method of voice messaging, device and terminal device, with solution Certainly can not the multiple voice messagings of precise alignment it is whether consistent the problem of.

The first aspect of the embodiment of the present invention provides a kind of comparison method of voice messaging, including：

Two voice messagings to be compared are obtained, by audio volume control figure corresponding to every voice messaging generation to be compared；

Audio volume control figure corresponding to two voice messagings is compared, obtains the similarity of two voice messagings；

If the similarity of two voice messagings is more than threshold value, the voice content phase of two voice messagings is exported Same comparison result；

If the similarity of two voice messagings is less than or equal to the threshold value, two voice messagings are exported The comparison result that content differs.

The second aspect of the embodiment of the present invention provides a kind of comparison device of voice messaging, including：

Oscillogram generation module, for obtaining two voice messagings to be compared, every voice messaging to be compared is given birth to Into corresponding audio volume control figure；

Similarity obtains module, for comparing audio volume control figure corresponding to two voice messagings, obtains described two The similarity of voice messaging；

Processing module, if the similarity for two voice messagings is more than threshold value, export two voices letter The voice content identical comparison result of breath；

The processing module, it is defeated if the similarity for being additionally operable to two voice messagings is less than or equal to the threshold value Go out the comparison result that the content of two voice messagings differs.

The third aspect of the embodiment of the present invention provides a kind of terminal device, including memory, processor and is stored in In the memory and the computer program that can run on the processor, described in the computing device during computer program The step of realizing the methods described that first aspect of the embodiment of the present invention provides.

The fourth aspect of the embodiment of the present invention provides a kind of computer-readable recording medium, the computer-readable storage Media storage has computer program, and the computer program realizes the embodiment of the present invention when being executed by one or more processors On the one hand the step of methods described provided.

Existing beneficial effect is the embodiment of the present invention compared with prior art：The embodiment of the present invention is by will be to be compared Two voice messaging generates audio volume control figure, and then the audio volume control figure of two voice messagings is compared, determines two languages Whether the content of message breath is consistent.This method utilizes the oscillogram of audio, confirms two voice messagings by the comparison of oscillogram Voice content it is whether consistent so that than pair result it is more accurate.

Brief description of the drawings

Technical scheme in order to illustrate the embodiments of the present invention more clearly, below will be to embodiment or description of the prior art In the required accompanying drawing used be briefly described, it should be apparent that, drawings in the following description be only the present invention some Embodiment, for those of ordinary skill in the art, without having to pay creative labor, can also be according to these Accompanying drawing obtains other accompanying drawings.

Fig. 1 is a kind of implementation process schematic diagram of the comparison method for voice messaging that one embodiment of the invention provides；

Fig. 2 is a kind of implementation process schematic diagram of the comparison method for voice messaging that one embodiment of the invention provides；

Fig. 3 is the schematic block diagram of the comparison device for the voice messaging that one embodiment of the invention provides；

Fig. 4 is the schematic block diagram for the terminal device that one embodiment of the invention provides.

Embodiment

In describing below, in order to illustrate rather than in order to limit, it is proposed that such as tool of particular system structure, technology etc Body details, thoroughly to understand the embodiment of the present invention.However, it will be clear to one skilled in the art that there is no these specific The present invention can also be realized in the other embodiments of details.In other situations, omit to well-known system, device, electricity Road and the detailed description of method, in case unnecessary details hinders description of the invention.

It should be appreciated that ought be in this specification and in the appended claims in use, special described by the instruction of term " comprising " Sign, entirety, step, operation, the presence of element and/or component, but be not precluded from one or more of the other feature, entirety, step, Operation, element, component and/or its presence or addition for gathering.

It is also understood that the term used in this description of the invention is merely for the sake of the mesh for describing specific embodiment And be not intended to limit the present invention.As used in description of the invention and appended claims, unless on Other situations are hereafter clearly indicated, otherwise " one " of singulative, "one" and "the" are intended to include plural form.

It will be further appreciated that the term "and/or" used in description of the invention and appended claims is Refer to any combinations of one or more of the associated item listed and be possible to combine, and including these combinations.

As used in this specification and in the appended claims, term " if " can be according to context quilt Be construed to " when ... " or " once " or " in response to determining " or " in response to detecting ".Similarly, phrase " if it is determined that " or " if detecting [described condition or event] " can be interpreted to mean according to context " once it is determined that " or " in response to true It is fixed " or " once detecting [described condition or event] " or " in response to detecting [described condition or event] ".

In order to illustrate technical solutions according to the invention, illustrated below by specific embodiment.

Fig. 1 is a kind of implementation process schematic diagram of the comparison method for voice messaging that one embodiment of the invention provides, such as This method may comprise steps of shown in figure：

Step S101, two voice messagings to be compared are obtained, by sound corresponding to every voice messaging generation to be compared Frequency oscillogram.

In embodiments of the present invention, two voice messagings to be compared are obtained first, and one of voice messaging can be made For received pronunciation information, another voice messaging, which can be used as, compares voice messaging, wherein the received pronunciation information is as base Standard, by compare voice messaging be compared with received pronunciation information, obtain comparison voice messaging whether with received pronunciation information Voice content is consistent.Because voice messaging can generally switch to text information, it is possible to be converted to two voice messagings Text information, then compare the text information in two voice messagings, it is possible to which obtaining the voice content of two voice messagings is It is no consistent, but during being converted into text information due to voice messaging, it is pair of element according to voice messaging and word It should be related to, voice messaging is mapped as text information, the pronunciation or linguistic context due to many text informations are different, are actually turning Deviation is likely to occur during change, so by the way that voice messaging to be compared is converted into the method being compared again after word simultaneously Precise alignment can not be accomplished.The embodiment of the present invention passes through first by audio volume control figure, sound corresponding to voice messaging generation to be compared The information such as the loudness, tone color, frequency of audio are contained in frequency oscillogram, can more represent voice messaging.

Specifically, audio volume control figure corresponding to every voice messaging generation to be compared is included：

The voice messaging is decompressed, and the voice messaging is randomly divided into multiple data blocks；

The amplitude of sampled point and the sampled point is obtained according to default sample mode in each data block；

Sampled point is ranked up according to the time audio volume control figure is generated according to the amplitude of each sampled point afterwards.

In embodiments of the present invention, during voice messaging is generated into audio volume control figure, can be obtained according to the time Sampled point, the data for only choosing sampled point generate audio oscillogram, can so reduce amount of calculation, we can first decompress The voice messaging, the voice messaging after decompression being randomly divided into multiple data blocks, the size of each data block is not fixed, Then sampled point is obtained according to default sample mode in each data block, because the data block divided in advance is according to random Mode divides, and is sampled in each data block according to fixed mode, so, this existing randomness has rule again The sampled point that the sample mode of rule property to obtain can more represent voice messaging sample, after obtaining the sampled point of each data block, Also sampled point is ranked up according to the time in voice messaging, so equivalent in voice messaging sample according to both wrapping Contain random, the sample mode for containing regularity again is sampled, and after obtaining sampled point, then obtains number corresponding to sampled point According to, such as amplitude, generate audio volume control figure.

Step S102, audio volume control figure corresponding to two voice messagings is compared, obtain two voice messagings Similarity.

In embodiments of the present invention, the letter such as loudness, tone color, frequency of one section of voice messaging is contained in audio volume control figure Breath, for example, in audio volume control figure, upper and lower amplitude representative loudness, the combination of frequency represent tone color, and period distances represent Frequency.We can obtain the similarity of two voice messagings by the way that the audio volume control figure of two voice messagings is compared. Such as two audio volume control figures can be carried out overlapping Comparison Method, the part generation overlapped in the audio volume control figure of two voice messagings The consistent part of table, the part that can not be overlapped represent inconsistent part, and the hundred of total oscillogram can be accounted for according to the part of coincidence Be divided to the similarity for being used for two audio volume control figures, the similarities of two audio volume control figures namely two voice messagings it is similar Degree.

Step S103, if the similarity of two voice messagings is more than threshold value, export two voice messagings Voice content identical comparison result；If the similarity of two voice messagings is less than or equal to the threshold value, institute is exported State the comparison result that the content of two voice messagings differs.

In embodiments of the present invention, due to before the similarity of two voice messagings to be compared is obtained, it is possible to pass through Format conversion, the generation process such as oscillogram are crossed, even so the information of two voice contents of identical, in the mistake of format conversion Two audio volume control figures that encoding variability between different-format causes to ultimately generate are also possible in journey can't be completely the same, or Person is different due to the method sampled during generation audio volume control figure, it is also possible to which two audio volume control figures for causing to ultimately generate will not It is completely the same.At this moment we just need to set threshold value, when the similarity of two voice messagings is more than threshold value, then show two languages The voice content of message breath is consistent, it is possible to the voice content identical sound result of two voice messagings is exported, otherwise, Export the comparison result that the content of two voice messagings differs.

The mode of the default sampling can be according to fixed step size carry out sampling, can also be according to it is existing its Its sample mode is sampled, and will not be repeated here.

Then the embodiment of the present invention is compared by the way that two voice messagings to be compared are generated into corresponding audio volume control figure respectively The similarity of two voice messagings is obtained to the audio volume control figure of two voice messagings, two voice messagings are judged by similarity Content it is whether consistent.

Fig. 2 is a kind of implementation process schematic diagram of the comparison method for voice messaging that further embodiment of this invention provides, such as This method may comprise steps of shown in figure：

Step S201, two voice messagings to be compared are obtained, two voice messagings to be compared are set respectively For received pronunciation information and compare voice messaging.

Step S202, obtains the data format of the received pronunciation information, and by the data lattice of the comparison voice messaging Formula is converted to the data format of the received pronunciation information.

In embodiments of the present invention, due to needing to generate audio volume control figure, the coded system of the audio file of different-format Difference, may can slightly have gap when generating audio volume control figure, so we are after two voice messagings to be compared are obtained, First identify whether the data format of two voice messagings to be compared is identical, if differing, by two languages to be compared Message breath is converted to identical data format, can be converted into predetermined audio format simultaneously, can also be by one of voice Information is converted into the voice document with another voice messaging identical data format, can specifically obtain the standard speech message The data format of breath, the Data Format Transform by the comparison voice messaging are the data format of the received pronunciation information.Language The data format of message breath i.e. the form of audio, can there is MP3, WAV, AU, SND, RAW, AFC etc..

Step S203, identify the mute part in every voice messaging, and the Jing Yin portion in the voice messaging that will identify that Cutting removes.

In embodiments of the present invention, due to may including the part of non-voice, such as mute part in voice messaging, such as Fruit compares mute part during comparison and obviously increases amount of calculation, and we can be first identified in every voice messaging Mute part, the mute part in the voice messaging that then will identify that are cut off from voice messaging.

Step S204, by audio volume control figure corresponding to every voice messaging generation to be compared.

The step is identical with step S102, specifically can refer to step S102 explanation, will not be repeated here.

Step S205, dot matrix image is generated according to audio volume control figure corresponding to every voice messaging, compared by dot matrix Method compares dot matrix image corresponding to two voice messagings, obtains the similarity of two voice messagings.

In embodiments of the present invention, audio volume control figure can also be generated dot matrix image, dot matrix image is also dot chart, point The least unit of the system of battle formations is pixel, and dot chart is exactly the figure that display effect is realized by the arrangement of pel array.Generate dot matrix image Afterwards, we can compare dot matrix image corresponding to the voice messaging by the method that dot matrix compares, so, equivalent to each Dot matrix image corresponding to voice messaging can be all made up of several pixels, and we realize ratio by comparing pixel one by one To two voice messagings, the ratio acquisition that the number of total pixel can be so accounted for by the number of identical pixel is similar Degree.

Step S206, if the similarity of two voice messagings is more than threshold value, export two voice messagings Voice content identical comparison result；

If the similarity of two voice messagings is less than or equal to the threshold value, two voice messagings are exported The comparison result that content differs, and change the comparison voice messaging and cause the comparison voice messaging and the received pronunciation The data content of information is consistent or according to the data format for comparing voice messaging, by the data of the received pronunciation information Form is converted to the data format for comparing voice messaging to replace the comparison voice messaging.

In embodiments of the present invention, if the similarity of two voice messagings is less than threshold value, then explanation compares voice messaging Voice content relative to received pronunciation information is inconsistent, at this moment needs two voice messagings being revised as in identical voice Hold, can will compare voice messaging and be revised as the voice messaging consistent with the data content of received pronunciation information, can also delete Fall and compare voice messaging, as comparison voice letter after received pronunciation information directly is converted into the data format for comparing voice messaging Breath, such received pronunciation information are exactly consistent with the voice content of comparison voice messaging.

The embodiment of the present invention is changed by entering row format to the voice messaging to be compared of acquisition, excision mute part, generation Oscillogram and then dot chart is generated according to oscillogram, compare dot chart and be obtained with the similarities of two voice messagings, so Than pair mode can more accurately obtain the similarities of two voices to be compared.

It should be understood that the size of the sequence number of each step is not meant to the priority of execution sequence, each process in above-described embodiment Execution sequence should determine that the implementation process without tackling the embodiment of the present invention forms any limit with its function and internal logic It is fixed.

Fig. 3 is the schematic block diagram of the comparison device for the voice messaging that one embodiment of the invention provides, for convenience of description, only The part related to the embodiment of the present invention is shown.

The comparison device of the voice messaging can be built in terminal device (such as mobile phone, computer, tablet personal computer, pen Remember this etc.) in software unit, hardware cell or the unit of soft or hard combination, can also be integrated into as independent suspension member described In terminal device.

The comparison device 3 of the voice messaging includes：

Oscillogram generation module 31, for obtaining two voice messagings to be compared, by every voice messaging to be compared Audio volume control figure corresponding to generation；

Similarity obtains module 32, for comparing audio volume control figure corresponding to two voice messagings, obtains described two The similarity of bar voice messaging；

Processing module 33, if the similarity for two voice messagings is more than threshold value, export two voices The voice content identical comparison result of information；

The processing module 33, if the similarity for being additionally operable to two voice messagings is less than or equal to the threshold value, Export the comparison result that the content of two voice messagings differs.

Optionally, the similarity obtains module 32 and included：

Dot matrix image generation unit 321, dot matrix image is generated for the audio volume control figure according to corresponding to every voice messaging；

Comparing unit 322, the method for being compared by dot matrix compare dot matrix image corresponding to two voice messagings.

Optionally, the oscillogram generation module 31 includes：

Data format acquiring unit 311, for two voice messagings to be compared to be respectively set into received pronunciation Information and comparison voice messaging, and obtain the data format of the received pronunciation information；

Format conversion unit 312, for being the standard speech message by the Data Format Transform of the comparison voice messaging The data format of breath.

Optionally, the oscillogram generation module 31 also includes：

Jing Yin excision unit 313, for identifying the mute part in every voice messaging, and the voice messaging that will identify that In mute part excision.

Optionally, the oscillogram generation module 31 also includes：

Decompression unit 314, multiple data are randomly divided into for decompressing the voice messaging, and by the voice messaging Block；

Sampling unit 315, for obtaining sampled point according to default sample mode in each data block and described adopting The amplitude of sampling point；

Oscillogram generation unit 316, for sampled point to be ranked up afterwards according to the amplitude of each sampled point according to the time Value generation audio volume control figure.

Optionally, the processing module 33 is additionally operable to：If the similarity of two voice messagings is less than or equal to institute Threshold value is stated, then changes the comparison voice messaging and causes the comparison voice messaging and the data content of the received pronunciation information Unanimously；

Or the data format according to the comparison voice messaging, by the Data Format Transform of the received pronunciation information For the comparison voice messaging data format to replace the comparison voice messaging.

It is apparent to those skilled in the art that for convenience of description and succinctly, only with above-mentioned each work( Can module, unit division progress for example, in practical application, can be as needed and by above-mentioned function distribution by different Functional module, unit are completed, will the internal structure of comparison device of the voice messaging be divided into different functional modules, list Member, to complete all or part of function described above.Each functional module, unit in embodiment can be integrated at one Reason module in or modules, unit be individually physically present, can also two or more modules be integrated in one In module, above-mentioned integrated module, unit can both be realized in the form of hardware, can also use the shape of software function module Formula is realized.In addition, each functional module, the specific name of unit are also only to facilitate mutually differentiation, is not limited to this Shen Protection domain please.Module, the specific work process of unit, may be referred to the correspondence in preceding method embodiment in said system Process, it will not be repeated here.

Fig. 4 is the schematic block diagram for the terminal device that one embodiment of the invention provides.As shown in figure 4, the terminal of the embodiment Equipment 4 includes：One or more processors 40, memory 41 and it is stored in the memory 41 and can be in the processor The computer program 42 run on 40.The processor 40 realizes above-mentioned each voice messaging when performing the computer program 42 Comparison method embodiment in step, such as the step S101 to S103 shown in Fig. 1.Or the processor 40 performs institute The function of each module in above-mentioned learning time statistic device embodiment, such as module shown in Fig. 3 are realized when stating computer program 42 31 to 33 function.

Exemplary, the computer program 42 can be divided into one or more module/units, it is one or Multiple module/units are stored in the memory 41, and are performed by the processor 40, to complete the present invention.Described one Individual or multiple module/units can be the series of computation machine programmed instruction section that can complete specific function, and the instruction segment is used for Implementation procedure of the computer program 42 in the terminal device 4 is described.For example, the computer program 42 can be divided It is cut into oscillogram generation module, similarity obtains module, processing module.

The oscillogram generation module, for obtaining two voice messagings to be compared, every voice to be compared is believed Audio volume control figure corresponding to breath generation；

The similarity obtains module, for comparing audio volume control figure corresponding to two voice messagings, described in acquisition The similarity of two voice messagings；

The processing module, if the similarity for two voice messagings is more than threshold value, export two languages The voice content identical comparison result of message breath；

Either unit refers to the description of module or unit in the comparison device of voice messaging to other modules, herein not Repeat again.

The terminal device includes but are not limited to processor 40, memory 41.It will be understood by those skilled in the art that figure 4 be only the example of terminal device 4, does not form the restriction to terminal device 4, can be included than illustrating more or less portions Part, some parts or different parts are either combined, such as the terminal device can also include input equipment, output is set Standby, network access equipment, bus etc..

The processor 40 can be CPU (Central Processing Unit, CPU), can also be Other general processors, digital signal processor (Digital Signal Processor, DSP), application specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field- Programmable Gate Array, FPGA) either other PLDs, discrete gate or transistor logic, Discrete hardware components etc..General processor can be microprocessor or the processor can also be any conventional processor Deng.

The memory 41 can be the internal storage unit of the terminal device 4, such as the hard disk of terminal device 4 or interior Deposit.The memory 41 can also be the External memory equipment of the terminal device 4, such as be equipped with the terminal device 4 Plug-in type hard disk, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, dodge Deposit card (Flash Card) etc..Further, the memory 41 can also both include the storage inside list of the terminal device 4 Member also includes External memory equipment.The memory 41 is used to store needed for the computer program and the terminal device Other programs and data.The memory 41 can be also used for temporarily storing the data that has exported or will export.

In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and is not described in detail or remembers in some embodiment The part of load, it may refer to the associated description of other embodiments.

Those of ordinary skill in the art are it is to be appreciated that the list of each example described with reference to the embodiments described herein Member and algorithm steps, it can be realized with the combination of electronic hardware or computer software and electronic hardware.These functions are actually Performed with hardware or software mode, application-specific and design constraint depending on technical scheme.Professional and technical personnel Described function can be realized using distinct methods to each specific application, but this realization is it is not considered that exceed The scope of the present invention.

In embodiment provided by the present invention, it should be understood that disclosed terminal device, apparatus and method, can be with Realize by another way.For example, device described above/terminal device embodiment is only schematical, for example, institute The division of module or unit is stated, only a kind of division of logic function, there can be other dividing mode when actually realizing, such as Multiple units or component can combine or be desirably integrated into another system, or some features can be ignored, or not perform.Separately A bit, shown or discussed mutual coupling or direct-coupling or communication connection can be by some interfaces, device Or INDIRECT COUPLING or the communication connection of unit, can be electrical, mechanical or other forms.

The unit illustrated as separating component can be or may not be physically separate, show as unit The part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple On NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs 's.

In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, can also That unit is individually physically present, can also two or more units it is integrated in a unit.Above-mentioned integrated list Member can both be realized in the form of hardware, can also be realized in the form of SFU software functional unit.

If the integrated module/unit realized in the form of SFU software functional unit and as independent production marketing or In use, it can be stored in a computer read/write memory medium.Based on such understanding, the present invention realizes above-mentioned implementation All or part of flow in example method, by computer program the hardware of correlation can also be instructed to complete, described meter Calculation machine program can be stored in a computer-readable recording medium, and the computer program can be achieved when being executed by processor The step of stating each embodiment of the method..Wherein, the computer program includes computer program code, the computer program Code can be source code form, object identification code form, executable file or some intermediate forms etc..Computer-readable Jie Matter can include：Can carry any entity or device of the computer program code, recording medium, USB flash disk, mobile hard disk, Magnetic disc, CD, computer storage, read-only storage (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium etc..It is it should be noted that described The content that computer-readable medium includes can carry out appropriate increasing according to legislation in jurisdiction and the requirement of patent practice Subtract, such as in some jurisdictions, according to legislation and patent practice, computer-readable medium do not include be electric carrier signal and Telecommunication signal.

Embodiment described above is merely illustrative of the technical solution of the present invention, rather than its limitations；Although with reference to foregoing reality Example is applied the present invention is described in detail, it will be understood by those within the art that：It still can be to foregoing each Technical scheme described in embodiment is modified, or carries out equivalent substitution to which part technical characteristic；And these are changed Or replace, the essence of appropriate technical solution is departed from the spirit and scope of various embodiments of the present invention technical scheme, all should Within protection scope of the present invention.

Claims

A kind of 1. comparison method of voice messaging, it is characterised in that including：

Two voice messagings to be compared are obtained, by audio volume control figure corresponding to every voice messaging generation to be compared；

Audio volume control figure corresponding to two voice messagings is compared, obtains the similarity of two voice messagings；

If the similarity of two voice messagings is more than threshold value, the voice content identical of two voice messagings is exported Comparison result；

If the similarity of two voice messagings is less than or equal to the threshold value, the content of two voice messagings is exported The comparison result differed.
2. the comparison method of voice messaging according to claim 1, it is characterised in that described to compare two voices letter Audio volume control figure includes corresponding to breath：

Dot matrix image is generated according to audio volume control figure corresponding to every voice messaging；

The method compared by dot matrix compares dot matrix image corresponding to two voice messagings.
3. according to the method for claim 1, it is characterised in that described after two voice messagings to be compared are obtained Method also includes：

Two voice messagings to be compared are respectively set to received pronunciation information and compare voice messaging；

Obtain the data format of the received pronunciation information；

Data Format Transform by the comparison voice messaging is the data format of the received pronunciation information.
4. according to the method for claim 1, it is characterised in that by sound corresponding to every voice messaging generation to be compared Before frequency oscillogram, methods described also includes：

The mute part in every voice messaging is identified, and the mute part excision in the voice messaging that will identify that.
5. according to the method for claim 1, it is characterised in that described by corresponding to every voice messaging generation to be compared Audio volume control figure includes：

The voice messaging is decompressed, and the voice messaging is randomly divided into multiple data blocks；

The amplitude of sampled point and the sampled point is obtained according to default sample mode in each data block；

Sampled point is ranked up according to the time audio volume control figure is generated according to the amplitude of each sampled point afterwards.
6. according to the method described in any one of claim 3 to 5, it is characterised in that also include：

If the similarity of two voice messagings is less than or equal to the threshold value, changes the comparison voice messaging and cause The comparison voice messaging is consistent with the data content of the received pronunciation information；

Or the data format according to the comparison voice messaging, the Data Format Transform by the received pronunciation information is institute The data format for comparing voice messaging is stated to replace the comparison voice messaging.
A kind of 7. comparison device of voice messaging, it is characterised in that including：

Oscillogram generation module, for obtaining two voice messagings to be compared, by every voice messaging generation pair to be compared The audio volume control figure answered；

Similarity obtains module, for comparing audio volume control figure corresponding to two voice messagings, obtains two voices The similarity of information；

Processing module, if the similarity for two voice messagings is more than threshold value, export two voice messagings Voice content identical comparison result；

The processing module, if the similarity for being additionally operable to two voice messagings is less than or equal to the threshold value, export institute State the comparison result that the content of two voice messagings differs.
8. the comparison device of voice messaging according to claim 7, it is characterised in that similarity, which obtains module, to be included：

Dot matrix image generation unit, dot matrix image is generated for the audio volume control figure according to corresponding to every voice messaging；

Comparing unit, the method for being compared by dot matrix compare dot matrix image corresponding to two voice messagings.
9. a kind of terminal device, including memory, processor and it is stored in the memory and can be on the processor The computer program of operation, it is characterised in that realize such as claim 1 to 6 described in the computing device during computer program The step of any one methods described.
10. a kind of computer-readable recording medium, the computer-readable recording medium storage has computer program, and its feature exists In when the computer program is executed by processor the step of realization such as any one of claim 1 to 6 methods described.