CN107516534A - Voice information comparison method and device and terminal equipment - Google Patents
- Publication number
- CN107516534A (CN201710769644.0A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
- Telephonic Communication Services (AREA)
Abstract
The invention belongs to the technical field of voice information and provides a voice information comparison method, a device and a terminal device. The method comprises: acquiring two pieces of voice information to be compared, and generating a corresponding audio waveform diagram for each piece; comparing the audio waveform diagrams corresponding to the two pieces of voice information to obtain their similarity; if the similarity of the two pieces of voice information is greater than a threshold, outputting a comparison result that the voice contents of the two pieces of voice information are the same; otherwise, outputting a comparison result that the contents of the two pieces of voice information are different. By generating audio waveform diagrams from the two pieces of voice information and comparing those diagrams to determine whether their contents are consistent, the invention makes the comparison result more accurate.
Description
Technical Field
The invention belongs to the technical field of voice information, and in particular relates to a voice information comparison method, device and terminal device.
Background
At present, more and more data types combine text and voice, and voice information has accordingly given rise to many data formats. In applications of voice information, a voice data file in one format often has to be converted into a voice data file in another format to meet different needs, which requires that the consistency of the data content be ensured during the conversion.
However, even if the consistency of the data content is ensured during format conversion, an intermediate voice data file in some format may be modified during subsequent processing and use of the voice data. As a result, voice data files derived from the same voice data source file, but in different formats or at different stages, may have inconsistent data content, and a user working with these files cannot judge whether their data content is consistent.
Summary of the Invention
In view of this, embodiments of the present invention provide a voice information comparison method, device and terminal device, to solve the problem that it is not possible to accurately compare whether multiple pieces of voice information are consistent.
A first aspect of the embodiments of the present invention provides a voice information comparison method, including:
acquiring two pieces of voice information to be compared, and generating a corresponding audio waveform diagram for each piece of voice information to be compared;
comparing the audio waveform diagrams corresponding to the two pieces of voice information to obtain the similarity of the two pieces of voice information;
if the similarity of the two pieces of voice information is greater than a threshold, outputting a comparison result that the voice contents of the two pieces of voice information are the same;
if the similarity of the two pieces of voice information is less than or equal to the threshold, outputting a comparison result that the contents of the two pieces of voice information are different.
A second aspect of the embodiments of the present invention provides a voice information comparison device, including:
a waveform diagram generation module, configured to acquire two pieces of voice information to be compared and generate a corresponding audio waveform diagram for each piece of voice information to be compared;
a similarity obtaining module, configured to compare the audio waveform diagrams corresponding to the two pieces of voice information and obtain the similarity of the two pieces of voice information;
a processing module, configured to output a comparison result that the voice contents of the two pieces of voice information are the same if the similarity of the two pieces of voice information is greater than a threshold;
the processing module being further configured to output a comparison result that the contents of the two pieces of voice information are different if the similarity of the two pieces of voice information is less than or equal to the threshold.
A third aspect of the embodiments of the present invention provides a terminal device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the steps of the method provided by the first aspect of the embodiments of the present invention.
A fourth aspect of the embodiments of the present invention provides a computer-readable storage medium storing a computer program, wherein the computer program, when executed by one or more processors, implements the steps of the method provided by the first aspect of the embodiments of the present invention.
Compared with the prior art, the embodiments of the present invention have the following beneficial effect: an audio waveform diagram is generated for each of the two pieces of voice information to be compared, and the two audio waveform diagrams are then compared to determine whether the contents of the two pieces of voice information are consistent. Because the method uses the waveform diagram of the audio and confirms through waveform comparison whether the voice contents of the two pieces of voice information are consistent, the comparison result is more accurate.
Brief Description of the Drawings
To illustrate the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a voice information comparison method provided by an embodiment of the present invention;
Fig. 2 is a schematic flowchart of a voice information comparison method provided by another embodiment of the present invention;
Fig. 3 is a schematic block diagram of a voice information comparison device provided by an embodiment of the present invention;
Fig. 4 is a schematic block diagram of a terminal device provided by an embodiment of the present invention.
Detailed Description
In the following description, specific details such as particular system structures and techniques are set forth for the purpose of illustration rather than limitation, so that the embodiments of the present invention can be thoroughly understood. However, it will be clear to those skilled in the art that the present invention can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits and methods are omitted so that unnecessary detail does not obscure the description of the present invention.
It should be understood that, when used in this specification and the appended claims, the term "comprising" indicates the presence of the described features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or collections thereof.
It should also be understood that the terminology used in this description of the invention is for the purpose of describing particular embodiments only and is not intended to limit the invention. As used in this description and the appended claims, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" used in this description and the appended claims refers to any combination and all possible combinations of one or more of the associated listed items, and includes these combinations.
As used in this specification and the appended claims, the term "if" may be interpreted, depending on the context, as "when", "once", "in response to determining" or "in response to detecting". Similarly, the phrases "if it is determined" or "if [the described condition or event] is detected" may be interpreted, depending on the context, as "once it is determined", "in response to determining", "once [the described condition or event] is detected" or "in response to detecting [the described condition or event]".
To illustrate the technical solutions of the present invention, specific embodiments are described below.
Fig. 1 is a schematic flowchart of a voice information comparison method provided by an embodiment of the present invention. As shown in the figure, the method may include the following steps:
Step S101: acquire two pieces of voice information to be compared, and generate a corresponding audio waveform diagram for each piece of voice information to be compared.
In this embodiment of the present invention, two pieces of voice information to be compared are acquired first. One of them may serve as the standard voice information and the other as the comparison voice information. The standard voice information is the reference: the comparison voice information is compared against it to determine whether the voice content of the comparison voice information is consistent with that of the standard voice information. Because voice information can usually be converted into text, one option would be to convert both pieces of voice information into text and then compare the two texts to learn whether their voice contents are consistent. However, converting voice information into text means mapping voice elements to characters according to a correspondence between them, and because many characters differ in pronunciation or depend on context, deviations are likely to occur during the actual conversion. Converting the voice information to text and then comparing the text therefore cannot achieve an accurate comparison. This embodiment of the present invention instead first generates a corresponding audio waveform diagram for each piece of voice information to be compared. The audio waveform diagram contains information such as the loudness, timbre and frequency of the audio, and is therefore more representative of the voice information.
Specifically, generating a corresponding audio waveform diagram for each piece of voice information to be compared includes:
decompressing the voice information, and randomly dividing the voice information into multiple data blocks;
obtaining sampling points, and the amplitude of each sampling point, in each data block according to a preset sampling mode;
sorting the sampling points by time, and then generating the audio waveform diagram according to the amplitude of each sampling point.
In this embodiment of the present invention, when the audio waveform diagram is generated from the voice information, sampling points can be obtained along the time axis and only the data at the sampling points is used to generate the waveform diagram, which reduces the amount of calculation. The voice information is first decompressed, and the decompressed voice information is randomly divided into multiple data blocks whose sizes are not fixed. Sampling points are then obtained in each data block according to the preset sampling mode. Because the data blocks are divided randomly while sampling within each block follows a fixed pattern, this sampling scheme combines randomness with regularity, so the sampling points obtained are more representative of the voice information. After the sampling points of each data block are obtained, they are sorted according to their time in the voice information. This is equivalent to sampling the voice information in a way that is both random and regular. Once the sampling points are obtained, the data corresponding to each sampling point, such as its amplitude, is obtained and the audio waveform diagram is generated.
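The sampling procedure described above can be sketched as follows. This is only an illustrative sketch, not the patent's implementation: it assumes the voice information has already been decompressed into a list of amplitudes, and the function name `sample_waveform`, the parameters `num_blocks`, `step` and `seed`, and the choice of fixed-step sampling as the "preset sampling mode" are all assumptions made for the example.

```python
import random


def sample_waveform(samples, num_blocks=4, step=3, seed=0):
    """Return time-ordered (index, amplitude) pairs taken from decoded samples.

    samples    : list of decoded amplitudes (assumed already decompressed)
    num_blocks : hypothetical number of randomly sized data blocks
    step       : hypothetical fixed sampling step used inside each block
    seed       : hypothetical seed so the random division is repeatable
    """
    n = len(samples)
    if n == 0:
        return []
    rng = random.Random(seed)
    # Random, distinct cut positions, so the block sizes are not fixed.
    cuts = sorted(rng.sample(range(1, n), min(num_blocks - 1, n - 1)))
    bounds = [0] + cuts + [n]
    points = []
    for start, end in zip(bounds, bounds[1:]):
        # Fixed-step sampling inside each randomly sized block.
        for i in range(start, end, step):
            points.append((i, samples[i]))
    # Sort the sampling points by time (sample index) before drawing.
    points.sort(key=lambda p: p[0])
    return points
```

The returned pairs are what a plotting routine would draw as the waveform. One design note, again an assumption rather than something the text states: reusing the same `seed` for both pieces of voice information keeps the sampled positions aligned, which makes the later comparison meaningful.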
Step S102: compare the audio waveform diagrams corresponding to the two pieces of voice information to obtain the similarity of the two pieces of voice information.
In this embodiment of the present invention, the audio waveform diagram contains information such as the loudness, timbre and frequency of a segment of voice information. For example, in the audio waveform diagram the vertical amplitude represents loudness, the combination of frequencies represents timbre, and the interval between periods represents frequency. The similarity of the two pieces of voice information can be obtained by comparing their audio waveform diagrams. For example, the two audio waveform diagrams can be compared by overlaying them: the parts of the two waveform diagrams that coincide represent consistent content, and the parts that do not coincide represent inconsistent content. The percentage of the total waveform diagram taken up by the coinciding part can be used as the similarity of the two audio waveform diagrams, which is also the similarity of the two pieces of voice information.
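A minimal sketch of the overlay comparison is given below. It assumes each waveform diagram is represented as a list of amplitudes at matching sample positions. The function name `overlap_similarity` and the `tolerance` parameter are hypothetical, and counting the longer waveform as the "total" is one possible reading of the text, not something the patent specifies.

```python
def overlap_similarity(wave_a, wave_b, tolerance=0.0):
    """Fraction of positions at which two amplitude sequences coincide.

    tolerance is a hypothetical allowance for small amplitude differences;
    with the default of 0.0 only exactly equal amplitudes count as coinciding.
    """
    total = max(len(wave_a), len(wave_b))
    if total == 0:
        # Assumption: two empty waveforms are treated as identical.
        return 1.0
    matches = sum(
        1 for a, b in zip(wave_a, wave_b) if abs(a - b) <= tolerance
    )
    # Positions present in only one waveform count as non-coinciding.
    return matches / total
```

For instance, `overlap_similarity([1, 2, 3, 4], [1, 2, 0, 4])` gives 0.75, because three of the four positions coincide.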
Step S103: if the similarity of the two pieces of voice information is greater than a threshold, output a comparison result that the voice contents of the two pieces of voice information are the same; if the similarity of the two pieces of voice information is less than or equal to the threshold, output a comparison result that the contents of the two pieces of voice information are different.
In this embodiment of the present invention, the two pieces of voice information to be compared may go through processes such as format conversion and waveform diagram generation before their similarity is obtained. Therefore, even for two pieces of voice information with identical voice content, the encoding differences between formats may mean that the two audio waveform diagrams finally generated are not completely identical; likewise, differences in the sampling method used when generating the audio waveform diagrams may also mean that the two diagrams are not completely identical. A threshold therefore needs to be set. When the similarity of the two pieces of voice information is greater than the threshold, the voice contents of the two pieces of voice information are considered consistent, and a comparison result that the voice contents are the same can be output; otherwise, a comparison result that the contents of the two pieces of voice information are different is output.
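The threshold decision can be written as a short sketch. The function name `compare_result`, the returned strings and the default threshold of 0.9 are illustrative assumptions; the text does not give a concrete threshold value.

```python
def compare_result(similarity, threshold=0.9):
    """Map a similarity score to a comparison result.

    The default threshold of 0.9 is a placeholder chosen for illustration.
    """
    # Strictly greater than the threshold means "same"; a similarity
    # equal to the threshold falls on the "different" side.
    if similarity > threshold:
        return "same"
    return "different"
```

Note that a similarity exactly equal to the threshold yields "different", matching the "less than or equal to" branch of step S103.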
The preset sampling mode may be sampling at a fixed step size, or any other existing sampling mode, which is not described in detail here.
This embodiment of the present invention generates a corresponding audio waveform diagram for each of the two pieces of voice information to be compared, compares the two audio waveform diagrams to obtain the similarity of the two pieces of voice information, and judges from the similarity whether the contents of the two pieces of voice information are consistent.
Fig. 2 is a schematic flowchart of a voice information comparison method provided by another embodiment of the present invention. As shown in the figure, the method may include the following steps:
Step S201: acquire two pieces of voice information to be compared, and set them as the standard voice information and the comparison voice information respectively.
Step S202: obtain the data format of the standard voice information, and convert the data format of the comparison voice information into the data format of the standard voice information.
In this embodiment of the present invention, audio waveform diagrams need to be generated, and because audio files in different formats are encoded differently, the generated audio waveform diagrams may differ slightly. Therefore, after the two pieces of voice information to be compared are acquired, it is first identified whether their data formats are the same. If they are not, the two pieces of voice information are converted into the same data format. Both may be converted into a predetermined audio format, or one of them may be converted into a voice file with the same data format as the other. Specifically, the data format of the standard voice information may be obtained, and the data format of the comparison voice information converted into the data format of the standard voice information. The data format of voice information is the audio format, for example MP3, WAV, AU, SND, RAW or AFC.
Step S203: identify the silent parts in each piece of voice information, and cut the identified silent parts out of the voice information.
In this embodiment of the present invention, voice information may contain non-speech parts, such as silent parts. Comparing the silent parts during comparison would obviously increase the amount of calculation, so the silent parts in each piece of voice information can be identified first and then cut out of the voice information.
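One simple way to picture the silence removal is an amplitude-based filter, sketched below. The text does not say how silent parts are identified, so the criterion used here (a run of consecutive samples whose absolute amplitude stays at or below a level) is an assumption, as are the function name `remove_silence` and the parameters `silence_level` and `min_run`.

```python
def remove_silence(samples, silence_level=0.01, min_run=3):
    """Drop runs of at least min_run consecutive near-zero samples.

    silence_level : hypothetical amplitude at or below which a sample is quiet
    min_run       : hypothetical minimum run length that counts as silence
    """
    kept = []
    run = []  # current stretch of quiet samples
    for s in samples:
        if abs(s) <= silence_level:
            run.append(s)
            continue
        # A loud sample ends the quiet stretch; keep the stretch only if it
        # was too short to be silence (for example a zero crossing).
        if len(run) < min_run:
            kept.extend(run)
        run = []
        kept.append(s)
    # Handle a quiet stretch at the very end of the signal.
    if len(run) < min_run:
        kept.extend(run)
    return kept
```

Short quiet stretches are kept on purpose, because speech waveforms pass through zero constantly and removing every near-zero sample would distort the signal.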
Step S204: generate a corresponding audio waveform diagram for each piece of voice information to be compared.
This step is the same as the waveform diagram generation in step S101; refer to the description of that step for details, which are not repeated here.
Step S205: generate a dot-matrix image from the audio waveform diagram corresponding to each piece of voice information, and compare the dot-matrix images corresponding to the two pieces of voice information by a dot-matrix comparison method to obtain the similarity of the two pieces of voice information.
In this embodiment of the present invention, a dot-matrix image can also be generated from the audio waveform diagram. A dot-matrix image, also called a bitmap, has the pixel as its smallest unit and achieves its display effect through the arrangement of a pixel array. After the dot-matrix images are generated, the dot-matrix images corresponding to the two pieces of voice information can be compared by a dot-matrix comparison method. Since the dot-matrix image corresponding to each piece of voice information consists of a number of pixels, the two pieces of voice information are compared by comparing the pixels one by one, and the similarity can be obtained as the ratio of the number of identical pixels to the total number of pixels.
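The dot-matrix comparison can be sketched as two small functions: one that rasterizes a waveform into a grid of 0/1 pixels, and one that counts identical pixels. The names `to_dot_matrix` and `pixel_similarity`, the `height` parameter, and the assumption that amplitudes are normalized to the range [-1, 1] are all illustrative; the text does not specify how the image is produced.

```python
def to_dot_matrix(amplitudes, height=8):
    """Rasterize amplitudes in [-1, 1] into a height x len(amplitudes) grid.

    Each column holds exactly one lit pixel (value 1) marking the amplitude.
    """
    grid = [[0] * len(amplitudes) for _ in range(height)]
    for col, a in enumerate(amplitudes):
        a = max(-1.0, min(1.0, a))  # clamp out-of-range amplitudes
        # Amplitude +1 maps to the top row (0), -1 to the bottom row.
        row = round((1.0 - a) / 2.0 * (height - 1))
        grid[row][col] = 1
    return grid


def pixel_similarity(grid_a, grid_b):
    """Ratio of identical pixels to total pixels for two equal-sized grids."""
    same_shape = len(grid_a) == len(grid_b) and all(
        len(ra) == len(rb) for ra, rb in zip(grid_a, grid_b)
    )
    if not same_shape:
        raise ValueError("dot-matrix images must have the same dimensions")
    total = sum(len(row) for row in grid_a)
    if total == 0:
        return 1.0  # assumption: two empty images are treated as identical
    same = sum(
        pa == pb
        for ra, rb in zip(grid_a, grid_b)
        for pa, pb in zip(ra, rb)
    )
    return same / total
```

A caveat on this design: background pixels that are unlit in both images also count as identical, so sparse images score high even when the waveforms differ. A practical threshold would have to account for that, or the count could be restricted to lit pixels.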
Step S206: if the similarity of the two pieces of voice information is greater than the threshold, output a comparison result that the voice contents of the two pieces of voice information are the same;
if the similarity of the two pieces of voice information is less than or equal to the threshold, output a comparison result that the contents of the two pieces of voice information are different, and either modify the comparison voice information so that its data content is consistent with that of the standard voice information, or, according to the data format of the comparison voice information, convert the standard voice information into the data format of the comparison voice information and use it to replace the comparison voice information.
In this embodiment of the present invention, if the similarity of the two pieces of voice information is less than or equal to the threshold, the voice content of the comparison voice information is inconsistent with that of the standard voice information, and the two pieces of voice information need to be made to carry the same voice content. The comparison voice information can be modified into voice information whose data content is consistent with the standard voice information. Alternatively, the comparison voice information can be deleted, and the standard voice information converted directly into the data format of the comparison voice information and used as the new comparison voice information, so that the voice contents of the standard voice information and the comparison voice information are consistent.
This embodiment of the present invention performs format conversion on the acquired voice information to be compared, cuts out the silent parts, generates waveform diagrams, generates dot-matrix images from the waveform diagrams, and compares the dot-matrix images to obtain the similarity of the two pieces of voice information. Comparing in this way obtains the similarity of the two pieces of voice information to be compared more accurately.
It should be understood that the sequence numbers of the steps in the above embodiments do not imply an order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation of the embodiments of the present invention.
Fig. 3 is a schematic block diagram of a voice information comparison device provided by an embodiment of the present invention. For ease of description, only the parts related to this embodiment are shown.
The voice information comparison device may be a software unit, a hardware unit, or a combined software and hardware unit built into a terminal device (such as a mobile phone, computer, tablet or notebook), or it may be integrated into the terminal device as an independent plug-in.
The voice information comparison device 3 includes:
a waveform diagram generation module 31, configured to acquire two pieces of voice information to be compared and generate a corresponding audio waveform diagram for each piece of voice information to be compared;
a similarity obtaining module 32, configured to compare the audio waveform diagrams corresponding to the two pieces of voice information and obtain the similarity of the two pieces of voice information;
a processing module 33, configured to output a comparison result that the voice contents of the two pieces of voice information are the same if the similarity of the two pieces of voice information is greater than a threshold;
the processing module 33 being further configured to output a comparison result that the contents of the two pieces of voice information are different if the similarity of the two pieces of voice information is less than or equal to the threshold.
Optionally, the similarity obtaining module 32 includes:
a dot-matrix image generation unit 321, configured to generate a dot-matrix image from the audio waveform diagram corresponding to each piece of voice information;
a comparing unit 322, configured to compare the dot-matrix images corresponding to the two pieces of voice information by a dot-matrix comparison method.
Optionally, the waveform diagram generation module 31 includes:
a data format acquiring unit 311, configured to set the two pieces of voice information to be compared as the standard voice information and the comparison voice information respectively, and to obtain the data format of the standard voice information;
a format conversion unit 312, configured to convert the data format of the comparison voice information into the data format of the standard voice information.
Optionally, the waveform diagram generation module 31 further includes:
a silence removal unit 313, configured to identify the silent parts in each piece of voice information and cut the identified silent parts out of the voice information.
Optionally, the waveform diagram generation module 31 further includes:
a decompression unit 314, configured to decompress the voice information and randomly divide the voice information into multiple data blocks;
a sampling unit 315, configured to obtain sampling points, and the amplitude of each sampling point, in each data block according to a preset sampling mode;
a waveform diagram generation unit 316, configured to sort the sampling points by time and then generate the audio waveform diagram according to the amplitude of each sampling point.
Optionally, the processing module 33 is further configured to: if the similarity of the two pieces of voice information is less than or equal to the threshold, modify the comparison voice information so that its data content is consistent with that of the standard voice information;
or, according to the data format of the comparison voice information, convert the standard voice information into the data format of the comparison voice information and use it to replace the comparison voice information.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the division into the functional modules and units above is only an example. In practical applications, the functions described above may be assigned to different functional modules and units as needed; that is, the internal structure of the voice information comparison device may be divided into different functional modules and units to complete all or part of the functions described above. The functional modules and units in the embodiments may be integrated into one processing module, each module or unit may exist physically on its own, or two or more modules may be integrated into one module. The integrated modules and units may be implemented in the form of hardware or in the form of software functional modules. In addition, the specific names of the functional modules and units are only for distinguishing them from one another and are not intended to limit the protection scope of this application. For the specific working processes of the modules and units in the above device, refer to the corresponding processes in the foregoing method embodiments, which are not repeated here.
Fig. 4 is a schematic block diagram of a terminal device provided by an embodiment of the present invention. As shown in Fig. 4, the terminal device 4 of this embodiment includes one or more processors 40, a memory 41, and a computer program 42 stored in the memory 41 and executable on the processor 40. When executing the computer program 42, the processor 40 implements the steps in the voice information comparison method embodiments above, for example steps S101 to S103 shown in Fig. 1. Alternatively, when executing the computer program 42, the processor 40 implements the functions of the modules in the voice information comparison device embodiment above, for example the functions of modules 31 to 33 shown in Fig. 3.
By way of example, the computer program 42 may be divided into one or more modules/units, which are stored in the memory 41 and executed by the processor 40 to carry out the present invention. The one or more modules/units may be a series of computer program instruction segments capable of completing specific functions, the instruction segments being used to describe the execution process of the computer program 42 in the terminal device 4. For example, the computer program 42 may be divided into a waveform diagram generation module, a similarity obtaining module and a processing module.
The waveform diagram generation module is configured to acquire two pieces of voice information to be compared and generate a corresponding audio waveform diagram for each piece of voice information to be compared;
the similarity obtaining module is configured to compare the audio waveform diagrams corresponding to the two pieces of voice information and obtain the similarity of the two pieces of voice information;
the processing module is configured to output a comparison result that the voice contents of the two pieces of voice information are the same if the similarity of the two pieces of voice information is greater than a threshold;
the processing module is further configured to output a comparison result that the contents of the two pieces of voice information are different if the similarity of the two pieces of voice information is less than or equal to the threshold.
For the other modules or units, refer to the description of the modules or units in the voice information comparison device; details are not repeated here.
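As an illustration only, the division of the computer program into the three modules described above can be sketched in Python as follows. The function names, the waveform width of 64 columns, the 16-row dot matrix, the peak normalization, and the threshold value of 0.9 are all assumptions made for this sketch and are not specified by the embodiment. The input is assumed to be a sequence of already decoded amplitude samples.

```python
from typing import List, Sequence

Grid = List[List[int]]


def generate_waveform(samples: Sequence[float], width: int = 64) -> List[float]:
    """Waveform generation module (sketch): one peak amplitude per column."""
    if not samples:
        return [0.0] * width
    n = len(samples)
    peaks = []
    for i in range(width):
        start = i * n // width
        end = max(start + 1, (i + 1) * n // width)  # never an empty slice
        peaks.append(max(abs(s) for s in samples[start:end]))
    return peaks


def to_dot_matrix(waveform: Sequence[float], height: int = 16) -> Grid:
    """Rasterize the waveform into a 0/1 dot matrix, row 0 being the top."""
    peak = max(waveform) or 1.0  # assumption: normalize by the loudest column
    grid = []
    for row in range(height):
        level = (height - row) / height
        grid.append([1 if amp / peak >= level else 0 for amp in waveform])
    return grid


def similarity(a: Grid, b: Grid) -> float:
    """Similarity acquisition module (sketch): fraction of matching dots."""
    total = sum(len(row) for row in a)
    same = sum(1 for ra, rb in zip(a, b) for x, y in zip(ra, rb) if x == y)
    return same / total if total else 0.0


def compare(samples_a: Sequence[float], samples_b: Sequence[float],
            threshold: float = 0.9) -> str:
    """Processing module (sketch): apply the threshold decision."""
    grid_a = to_dot_matrix(generate_waveform(samples_a))
    grid_b = to_dot_matrix(generate_waveform(samples_b))
    if similarity(grid_a, grid_b) > threshold:
        return "same content"
    return "different content"
```

For example, `compare(x, x)` returns `"same content"` for any sample sequence `x`, because the two dot matrices are identical and the similarity is 1.0. A real implementation would choose the matrix size and the threshold to suit the recordings being compared.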
The terminal device includes, but is not limited to, the processor 40 and the memory 41. Those skilled in the art will understand that Fig. 4 is only an example of the terminal device 4 and does not limit it; the terminal device may include more or fewer components than illustrated, combine certain components, or use different components. For example, the terminal device may also include an input device, an output device, a network access device, a bus, and so on.
The processor 40 may be a central processing unit (CPU), or it may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and so on. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
The memory 41 may be an internal storage unit of the terminal device 4, such as a hard disk or internal memory of the terminal device 4. The memory 41 may also be an external storage device of the terminal device 4, such as a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card, or a flash card fitted to the terminal device 4. Further, the memory 41 may include both an internal storage unit of the terminal device 4 and an external storage device. The memory 41 is used to store the computer program and the other programs and data required by the terminal device. The memory 41 may also be used to temporarily store data that has been output or is about to be output.
In the above embodiments, each embodiment is described with its own emphasis. For parts that are not detailed or recorded in a given embodiment, refer to the related descriptions of the other embodiments.
Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, or in a combination of computer software and electronic hardware. Whether these functions are performed in hardware or in software depends on the specific application and the design constraints of the technical solution. Skilled practitioners may use different methods to implement the described functions for each specific application, but such implementations should not be considered to go beyond the scope of the present invention.
In the embodiments provided by the present invention, it should be understood that the disclosed terminal device, apparatus, and method may be implemented in other ways. For example, the apparatus/terminal device embodiments described above are merely illustrative. The division into modules or units is only a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated module/unit is implemented in the form of a software functional unit and is sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, all or part of the flow of the above method embodiments may also be accomplished by a computer program instructing the relevant hardware. The computer program may be stored in a computer-readable storage medium, and when executed by a processor it implements the steps of the above method embodiments. The computer program includes computer program code, which may be in source code form, object code form, an executable file, some intermediate form, and so on. The computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disc, a computer memory, a read-only memory (ROM), a random access memory (RAM), an electrical carrier signal, a telecommunication signal, a software distribution medium, and so on. It should be noted that the content covered by the computer-readable medium may be increased or decreased as appropriate according to the requirements of legislation and patent practice in a jurisdiction; for example, in some jurisdictions, according to legislation and patent practice, computer-readable media do not include electrical carrier signals and telecommunication signals.
The embodiments described above are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions described in the foregoing embodiments or make equivalent substitutions for some of the technical features. Such modifications or substitutions do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention, and shall all fall within the protection scope of the present invention.
Claims (10)
- 1. A voice information comparison method, characterized by comprising: obtaining two pieces of voice information to be compared, and generating a corresponding audio waveform diagram for each piece of voice information to be compared; comparing the audio waveform diagrams corresponding to the two pieces of voice information to obtain the similarity of the two pieces of voice information; if the similarity of the two pieces of voice information is greater than a threshold, outputting a comparison result indicating that the voice content of the two pieces of voice information is the same; if the similarity of the two pieces of voice information is less than or equal to the threshold, outputting a comparison result indicating that the content of the two pieces of voice information is different.
- 2. The voice information comparison method according to claim 1, characterized in that comparing the audio waveform diagrams corresponding to the two pieces of voice information comprises: generating a dot matrix image from the audio waveform diagram corresponding to each piece of voice information; and comparing the dot matrix images corresponding to the two pieces of voice information by a dot matrix comparison method.
- 3. The method according to claim 1, characterized in that, after the two pieces of voice information to be compared are obtained, the method further comprises: setting the two pieces of voice information to be compared as standard voice information and comparison voice information, respectively; obtaining the data format of the standard voice information; and converting the data format of the comparison voice information into the data format of the standard voice information.
- 4. The method according to claim 1, characterized in that, before the corresponding audio waveform diagram is generated for each piece of voice information to be compared, the method further comprises: identifying the silent portions in each piece of voice information, and cutting the identified silent portions out of the voice information.
- 5. The method according to claim 1, characterized in that generating a corresponding audio waveform diagram for each piece of voice information to be compared comprises: decompressing the voice information and randomly dividing the voice information into multiple data blocks; obtaining sampling points and the amplitudes of the sampling points in each data block according to a preset sampling mode; and, after sorting the sampling points by time, generating the audio waveform diagram from the amplitude of each sampling point.
- 6. The method according to any one of claims 3 to 5, characterized by further comprising: if the similarity of the two pieces of voice information is less than or equal to the threshold, modifying the comparison voice information so that the data content of the comparison voice information is consistent with that of the standard voice information; or, according to the data format of the comparison voice information, converting the data format of the standard voice information into the data format of the comparison voice information to replace the comparison voice information.
- 7. A voice information comparison device, characterized by comprising: a waveform generation module, configured to obtain two pieces of voice information to be compared and to generate a corresponding audio waveform diagram for each piece of voice information to be compared; a similarity acquisition module, configured to compare the audio waveform diagrams corresponding to the two pieces of voice information and obtain the similarity of the two pieces of voice information; and a processing module, configured to output a comparison result indicating that the voice content of the two pieces of voice information is the same if their similarity is greater than a threshold; the processing module being further configured to output a comparison result indicating that the content of the two pieces of voice information is different if their similarity is less than or equal to the threshold.
- 8. The voice information comparison device according to claim 7, characterized in that the similarity acquisition module comprises: a dot matrix image generation unit, configured to generate a dot matrix image from the audio waveform diagram corresponding to each piece of voice information; and a comparison unit, configured to compare the dot matrix images corresponding to the two pieces of voice information by a dot matrix comparison method.
- 9. A terminal device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that the processor, when executing the computer program, implements the steps of the method according to any one of claims 1 to 6.
- 10. A computer-readable storage medium storing a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the method according to any one of claims 1 to 6.
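The preprocessing recited in claims 4 and 5 (cutting out silent portions, then dividing the audio into data blocks, sampling each block, and sorting the sampling points by time) could look roughly like the following Python sketch. It is not the patented implementation: the frame length, the silence floor of 0.02, the fixed sampling step, the seeded random cut points, and both function names are assumptions introduced for illustration, and decompression is assumed to have already produced a list of amplitude samples.

```python
import random
from typing import List, Sequence, Tuple


def remove_silence(samples: Sequence[float], frame_len: int = 4,
                   floor: float = 0.02) -> List[float]:
    """Drop every frame whose peak amplitude is below `floor` (assumed rule)."""
    kept: List[float] = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        if max(abs(s) for s in frame) >= floor:
            kept.extend(frame)
    return kept


def sample_blocks(samples: Sequence[float], block_count: int = 4,
                  step: int = 2, seed: int = 0) -> List[Tuple[int, float]]:
    """Split at random cut points, sample each block, then sort by time."""
    rng = random.Random(seed)
    n = len(samples)
    k = max(0, min(block_count - 1, n - 1))   # number of cut points
    cuts = sorted(rng.sample(range(1, n), k))
    bounds = [0] + cuts + [n]
    blocks = [(bounds[i], bounds[i + 1]) for i in range(len(bounds) - 1)]
    rng.shuffle(blocks)                        # blocks may arrive in any order
    points: List[Tuple[int, float]] = []
    for lo, hi in blocks:
        # Assumed "preset sampling mode": every `step`-th sample of the block.
        points.extend((t, abs(samples[t])) for t in range(lo, hi, step))
    points.sort(key=lambda p: p[0])            # restore time order
    return points
```

The amplitudes for the audio waveform diagram can then be read off in time order with `[amp for _, amp in sample_blocks(remove_silence(samples))]`. Sorting by sample index is what makes the result independent of the order in which the blocks were processed.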
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710769644.0A CN107516534B (en) | 2017-08-31 | 2017-08-31 | Voice information comparison method and device and terminal equipment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107516534A (en) | 2017-12-26 |
CN107516534B (en) | 2020-11-03 |
Family
ID=60725030
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710769644.0A Active CN107516534B (en) | 2017-08-31 | 2017-08-31 | Voice information comparison method and device and terminal equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107516534B (en) |
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103730032A (en) * | 2012-10-12 | 2014-04-16 | 李志刚 | Method and system for controlling multimedia data |
US20170229133A1 (en) * | 2013-03-15 | 2017-08-10 | Facebook, Inc. | Managing silence in audio signal identification |
CN104157301A (en) * | 2014-07-25 | 2014-11-19 | 广州三星通信技术研究有限公司 | Method, device and terminal deleting voice information blank segment |
CN105810213A (en) * | 2014-12-30 | 2016-07-27 | 浙江大华技术股份有限公司 | Typical abnormal sound detection method and device |
CN105512272A (en) * | 2015-12-04 | 2016-04-20 | 天津凌浩科技有限公司 | System for comparing audio information and audio information comparison method |
CN105389600A (en) * | 2015-12-31 | 2016-03-09 | 田雪松 | Data processing method |
CN106205117A (en) * | 2016-07-20 | 2016-12-07 | 广东小天才科技有限公司 | Potential safety hazard reminding method and device |
CN106328161A (en) * | 2016-08-22 | 2017-01-11 | 维沃移动通信有限公司 | Audio data processing method and mobile terminal |
CN106601216A (en) * | 2016-11-30 | 2017-04-26 | 宇龙计算机通信科技(深圳)有限公司 | Method and system for realizing electronic device control through music |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE112018006847B4 (en) | 2018-02-16 | 2022-12-08 | Mitsubishi Electric Corporation | Display data generating device, display data generating method and program |
CN111712691A (en) * | 2018-02-16 | 2020-09-25 | 三菱电机株式会社 | Display data generation device, display data generation method, and program |
CN110853674A (en) * | 2018-07-24 | 2020-02-28 | 中兴通讯股份有限公司 | Text collation method, apparatus, and computer-readable storage medium |
CN109344388A (en) * | 2018-08-02 | 2019-02-15 | 中央电视台 | Spam comment identification method and device and computer readable storage medium |
CN109344388B (en) * | 2018-08-02 | 2023-06-09 | 中央电视台 | Method and device for identifying spam comments and computer-readable storage medium |
CN112002312A (en) * | 2019-05-08 | 2020-11-27 | 顺丰科技有限公司 | Voice recognition method, device, computer program product and storage medium |
CN110322894B (en) * | 2019-06-27 | 2022-02-11 | 电子科技大学 | Sound-based oscillogram generation and panda detection method |
CN110322894A (en) * | 2019-06-27 | 2019-10-11 | 电子科技大学 | A kind of waveform diagram generation and giant panda detection method based on sound |
CN111275213A (en) * | 2020-01-17 | 2020-06-12 | 安徽华创环保设备科技有限公司 | Mechanical equipment fault monitoring system based on big data |
CN111275213B (en) * | 2020-01-17 | 2020-09-15 | 安徽华创环保设备科技有限公司 | Mechanical equipment fault monitoring system based on big data |
CN111640445A (en) * | 2020-05-13 | 2020-09-08 | 广州国音智能科技有限公司 | Audio difference detection method, device, equipment and readable storage medium |
WO2022199461A1 (en) * | 2021-03-24 | 2022-09-29 | 华为技术有限公司 | Method for testing speech interaction system, audio recognition method, and related devices |
CN115243104A (en) * | 2021-11-30 | 2022-10-25 | 广州汽车集团股份有限公司 | Method and system for automatically adjusting vehicle-mounted multimedia volume |
Also Published As
Publication number | Publication date |
---|---|
CN107516534B (en) | 2020-11-03 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN107516534A (en) | Voice information comparison method and device and terminal equipment | |
US9514741B2 (en) | Data shredding for speech recognition acoustic model training under data retention restrictions | |
CN112015603A (en) | User terminal hardware detection method, device, computer device and storage medium | |
US9514740B2 (en) | Data shredding for speech recognition language model training under data retention restrictions | |
CN110532107B (en) | Interface calling method, device, computer equipment and storage medium | |
CN111931491B (en) | Domain dictionary construction method and device | |
CN107590291A (en) | A kind of searching method of picture, terminal device and storage medium | |
CN107741972A (en) | A kind of searching method of picture, terminal device and storage medium | |
CN110826619A (en) | File classification method and device of electronic files and electronic equipment | |
CN112783825A (en) | Data archiving method, data archiving device, computer device and storage medium | |
CN111898363B (en) | Compression method, device, computer equipment and storage medium for long and difficult text sentence | |
CN114722837A (en) | Multi-turn dialog intention recognition method and device and computer readable storage medium | |
CN106599637B (en) | Method and device for inputting verification code on verification interface | |
CN106782516B (en) | Corpus classification method and apparatus | |
CN112669850A (en) | Voice quality detection method and device, computer equipment and storage medium | |
CN109766089B (en) | Code generation method and device based on dynamic diagram, electronic equipment and storage medium | |
CN113742738A (en) | Model parameter safety protection method, safety protection device and computer device | |
CN111666408A (en) | Method and device for screening and displaying important clauses | |
CN114842982B (en) | Knowledge expression method, device and system for medical information system | |
CN114840634B (en) | Information storage method and device, electronic equipment and computer readable medium | |
CN110751510A (en) | Method and device for determining promotion list | |
CN112908339B (en) | Conference link positioning method and device, positioning equipment and readable storage medium | |
CN111078821B (en) | Dictionary setting method, dictionary setting device, medium and electronic equipment | |
CN110971759A (en) | Processing method and device for unsubscribed short message and server | |
CN115019788A (en) | Voice interaction method, system, terminal equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||