CN107071553A - A kind of method, device and computer-readable recording medium for changing video speech - Google Patents

A kind of method, device and computer-readable recording medium for changing video speech Download PDF

Info

Publication number
CN107071553A
CN107071553A CN201710411693.7A CN201710411693A CN107071553A CN 107071553 A CN107071553 A CN 107071553A CN 201710411693 A CN201710411693 A CN 201710411693A CN 107071553 A CN107071553 A CN 107071553A
Authority
CN
China
Prior art keywords
modified
video
voice messaging
parsing
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710411693.7A
Other languages
Chinese (zh)
Other versions
CN107071553B (en
Inventor
张声联
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Genius Technology Co Ltd
Original Assignee
Guangdong Genius Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Genius Technology Co Ltd filed Critical Guangdong Genius Technology Co Ltd
Priority to CN201710411693.7A priority Critical patent/CN107071553B/en
Publication of CN107071553A publication Critical patent/CN107071553A/en
Application granted granted Critical
Publication of CN107071553B publication Critical patent/CN107071553B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The present invention is suitable for electronic technology field there is provided a kind of method, device and computer-readable recording medium for changing video speech, and methods described includes:The video name inputted according to user obtains video to be modified, and the video to be modified is parsed;Obtain voice messaging to be modified, and the voice messaging to be modified is searched in video to be modified after parsing;If finding the voice messaging to be modified in the video to be modified after the parsing, batch replacement is carried out to the voice messaging to be modified repeatedly occurred in the video to be modified according to target voice information.The present invention in video to be modified by searching voice messaging to be modified, so that when finding multiple voice messagings to be modified, batch replacement is carried out to multiple voice messagings to be modified using correct target voice information, modification efficiency when changing video speech is improved.

Description

A kind of method, device and computer-readable recording medium for changing video speech
Technical field
The invention belongs to electronic technology field, more particularly to a kind of method, device and computer for changing video speech can Read storage medium.
Background technology
With the development of science and technology, video teaching slowly turns into a kind of normality of people's learning life.However, existing regard Fuzzy pictures, voice mistake often occurs in imparting knowledge to students, the problems such as knowledge point is inaccurate in frequency, the problem of for voice mistake, such as Fruit is that some point voice mistake occurs, then can directly change replacement, but if identical voice occurs in large batch of video Problem-Error, then need modification people to modify video area one by one, and this method substantially increases the work of modification people Amount and modification time, and it is low to change efficiency.
The content of the invention
In view of this, the embodiments of the invention provide it is a kind of change the method for video speech, device and with it is computer-readable Storage medium, to solve the problem of efficiency existing when changing video speech mistake in the prior art is low.
The first aspect of the embodiment of the present invention provides a kind of method for changing video speech, and methods described includes:
The video name inputted according to user obtains video to be modified, and the video to be modified is parsed;
Obtain voice messaging to be modified, and the voice letter to be modified is searched in video to be modified after parsing Breath;
If the voice messaging to be modified is found in the video to be modified after the parsing, according to target voice Information carries out batch replacement to the voice messaging to be modified repeatedly occurred in the video to be modified.
The second aspect of the embodiment of the present invention provides a kind of device for changing video speech, and described device includes:
Parsing module, the video name for being inputted according to user obtains video to be modified, and to the video to be modified Parsed;
Treated described in being searched in acquisition module, the voice messaging to be modified for obtaining, and video to be modified after parsing The voice messaging of modification;
First replacement module, if for finding the voice letter to be modified in the video to be modified after the parsing Breath, then carry out batch according to target voice information to the voice messaging to be modified repeatedly occurred in the video to be modified Replace.
The third aspect of the embodiment of the present invention provides a kind of device for changing video speech, including:Memory, processor And it is stored in the computer program that can be run in the memory and on the processor, meter described in the computing device The step of method of above-mentioned modification video speech is realized during calculation machine program.
The fourth aspect of the embodiment of the present invention provides a kind of computer-readable recording medium, the computer-readable storage Media storage has computer program, and the computer program realizes the method for above-mentioned modification video speech when being executed by processor Step.
The beneficial effect that the embodiment of the present invention exists compared with prior art is:The present invention is by video to be modified Search voice messaging to be modified so that when finding multiple voice messagings to be modified, using correct target language message Cease and batch replacement is carried out to multiple voice messagings to be modified, improve modification efficiency when changing video speech.
Brief description of the drawings
Technical scheme in order to illustrate the embodiments of the present invention more clearly, below will be to embodiment or description of the prior art In required for the accompanying drawing that uses be briefly described, it should be apparent that, drawings in the following description are only some of the present invention Embodiment, for those of ordinary skill in the art, without having to pay creative labor, can also be according to these Accompanying drawing obtains other accompanying drawings.
Fig. 1 is a kind of implementation process schematic diagram of method for changing video speech provided in an embodiment of the present invention;
Fig. 2 is a kind of another implementation process schematic diagram of method for changing video speech provided in an embodiment of the present invention;
Fig. 3 is a kind of schematic diagram of device for changing video speech provided in an embodiment of the present invention;
Fig. 4 is a kind of another schematic diagram of device for changing video speech provided in an embodiment of the present invention;
Fig. 5 is a kind of another schematic diagram of device for changing video speech provided in an embodiment of the present invention.
Embodiment
In describing below, in order to illustrate rather than in order to limit, it is proposed that such as tool of particular system structure, technology etc Body details, thoroughly to understand the embodiment of the present invention.However, it will be clear to one skilled in the art that there is no these specific The present invention can also be realized in the other embodiments of details.In other situations, omit to well-known system, device, electricity Road and the detailed description of method, in case unnecessary details hinders description of the invention.
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation is described, it is clear that described embodiment is a part of embodiment of the invention, rather than whole embodiments.Based on this hair Embodiment in bright, the every other implementation that those of ordinary skill in the art are obtained under the premise of creative work is not made Example, belongs to the scope of protection of the invention.
It should be appreciated that ought be in this specification and in the appended claims in use, term " comprising " indicates described spy Levy, entirety, step, operation, the presence of element and/or component, but be not precluded from one or more of the other feature, entirety, step, Operation, element, component and/or its presence or addition for gathering.
It is also understood that the term used in this description of the invention is merely for the sake of the mesh for describing specific embodiment And be not intended to limit the present invention.As used in description of the invention and appended claims, unless on Other situations are hereafter clearly indicated, otherwise " one " of singulative, " one " and "the" are intended to include plural form.
It will be further appreciated that, the term "and/or" used in description of the invention and appended claims is Refer to any combinations of one or more of the associated item listed and be possible to combination, and including these combinations.
As used in this specification and in the appended claims, term " if " can be according to context quilt Be construed to " when ... " or " once " or " in response to determining " or " in response to detecting ".Similarly, phrase " if it is determined that " or " if detecting [described condition or event] " can be interpreted to mean according to context " once it is determined that " or " in response to true It is fixed " or " once detecting [described condition or event] " or " in response to detecting [described condition or event] ".
In order to illustrate technical solutions according to the invention, illustrated below by specific embodiment.
It is a kind of schematic flow diagram of method for changing video speech provided in an embodiment of the present invention referring to Fig. 1.Such as Fig. 1 Shown, the method for the modification video speech may include following steps:
Step S101:The video name inputted according to user obtains video to be modified, and video to be modified is parsed.
Wherein, in embodiments of the present invention, substantial amounts of video has been imported in advance in the device of modification video sound, this is a large amount of Video is stored in the memory cell of modification video sound, and the memory cell can be realized using nonvolatile memory, such as EPROM (Erasable Programmable Read-Only Memory, EPROM), EEPROM (Electrically Erasable Programmable Read-Only Memory, EEPROM) Or FLASH (flash memory).When a certain video frequency output mistake, the video name that the device of video sound is inputted according to user is changed The video that the needs need to change is determined, and the video is parsed.
When it is implemented, the mode of user's input video title is not limited to a certain kind, for example, user can be directly defeated The video name or user for entering video to be modified directly operate a certain button of the device of modification video sound, are regarded by modification The device of frequency sound obtains the video name of user's input according to the button operation of user or user is changing video sound Device enters behind list of videos interface, touch operation is carried out on the display screen of the device of modification video sound, to be arranged in video Video to be modified is chosen in table, so that the device of modification video sound obtains what user inputted according to the touch operation of user Video name.
Step S102:Obtain voice messaging to be modified, and language to be modified is searched in video to be modified after parsing Message ceases.
Wherein, in embodiments of the present invention, when the device of modification video sound obtains video to be modified, and to be modified After video is parsed, voice messaging to be modified can be obtained according to the input of user by changing the device of video sound.
When it is implemented, the sound collection equipments such as microphone are integrated with the device of modification video sound, user's needs pair When error audio-frequency information in a certain video is modified, voice messaging can be directly exported, the device of video sound is changed to this Voice messaging is acquired, and then the voice messaging is carried out by the voice content recognition unit in the device of modification video sound Identification, according to the voice messaging that the video to be modified is searched in the video to be modified of recognition result after parsing.Need explanation Be, the sound collection equipment such as microphone can also as modification video sound device ancillary equipment, when it collects voice During the voice messaging to be modified of input, the voice messaging is sent to the device of modification video sound.
For example, user needs to repair to Fourier is misread as into Euler's theorem voice messaging in Fourier's explanation video Change, then user needs to input the voice letter of its i.e. Euler's theorem of voice messaging for needing to change to the device of modification video sound Breath, and then is acquired and recognized to the voice messaging by modification video audio unit, and Fourier's explanation after parsing is regarded The Euler's theorem voice messaging is searched in frequency.
Step S103:If voice messaging to be modified is found in video to be modified after parsing, according to target language Message breath carries out batch replacement to the voice messaging to be modified repeatedly occurred in video to be modified.
Wherein, in embodiments of the present invention, if to find this in video to be modified to be repaired for the device of modification video sound The voice messaging changed, then obtain target voice information according to the voice messaging to be modified, and the target voice information can be used The voice messaging with voice messaging to be modified with one-to-one relationship of family input, when the device of modification video sound is obtained During to the target voice information, the voice to be modified repeatedly occurred in video to be modified can be believed using the target voice information Breath carries out batch replacement, to realize that the erroneous point to garbled voice carries out lookup automatically, modification and batch processing, so not only may be used At utmost to lift the quality of instructional video, substantial amounts of manpower modification cost can also be saved, modification efficiency is improved.
In embodiments of the present invention, by searching voice messaging to be modified in video to be modified so that searching During to multiple voice messagings to be modified, batch is carried out to multiple voice messagings to be modified using correct target voice information Replace, improve modification efficiency when changing video speech.
It is the schematic flow diagram of the method for another modification video speech provided in an embodiment of the present invention referring to Fig. 2.As schemed Shown in 2, the method for the modification video speech may include following steps:
Step S201:The video name inputted according to user obtains video to be modified, and video to be modified is parsed.
Wherein, in embodiments of the present invention, substantial amounts of video has been imported in advance in the device of modification video sound, this is a large amount of Video is stored in the memory cell of modification video sound, and the memory cell can be realized using nonvolatile memory, such as EPROM (Erasable Programmable Read-Only Memory, EPROM), EEPROM (Electrically Erasable Programmable Read-Only Memory, EEPROM) Or FLASH (flash memory).When a certain video frequency output mistake, the video name that the device of video sound is inputted according to user is changed The video that the needs need to change is determined, and the video is parsed.
When it is implemented, the mode of user's input video title is not limited to a certain kind, for example, user can be directly defeated The video name or user for entering video to be modified directly operate a certain button of the device of modification video sound, are regarded by modification The device of frequency sound obtains the video name of user's input according to the button operation of user or user is changing video sound Device enters behind list of videos interface, touch operation is carried out on the display screen of the device of modification video sound, to be arranged in video Video to be modified is chosen in table, so that the device of modification video sound obtains what user inputted according to the touch operation of user Video name.
Step S202:Obtain voice messaging to be modified, and language to be modified is searched in video to be modified after parsing Message ceases.
Wherein, in embodiments of the present invention, when the device of modification video sound obtains video to be modified, and to be modified After video is parsed, voice messaging to be modified can be obtained according to the input of user by changing the device of video sound.
When it is implemented, the sound collection equipments such as microphone are integrated with the device of modification video sound, user's needs pair When error audio-frequency information in a certain video is modified, voice messaging can be directly exported, the device of video sound is changed to this Voice messaging is acquired, and then the voice messaging is carried out by the voice content recognition unit in the device of modification video sound Identification, according to the voice messaging that the video to be modified is searched in the video to be modified of recognition result after parsing.Need explanation Be, the sound collection equipment such as microphone can also as modification video sound device ancillary equipment, when it collects voice During the voice messaging to be modified of input, the voice messaging is sent to the device of modification video sound.
For example, user needs to repair to Fourier is misread as into Euler's theorem voice messaging in Fourier's explanation video Change, then user needs to input the voice letter of its i.e. Euler's theorem of voice messaging for needing to change to the device of modification video sound Breath, and then is acquired and recognized to the voice messaging by modification video audio unit, and Fourier's explanation after parsing is regarded The Euler's theorem voice messaging is searched in frequency.
Step S203:If voice messaging to be modified is found in video to be modified after parsing, according to target language Message breath carries out batch replacement to the voice messaging to be modified repeatedly occurred in video to be modified.
Wherein, in embodiments of the present invention, if to find this in video to be modified to be repaired for the device of modification video sound The voice messaging changed, then obtain target voice information according to the voice messaging to be modified, and the target voice information can be used The voice messaging with voice messaging to be modified with one-to-one relationship of family input.When the device of modification video sound is obtained During to the target voice information, the voice to be modified repeatedly occurred in video to be modified can be believed using the target voice information Breath carries out batch replacement.
Further, if voice messaging to be modified is found in video to be modified after parsing, according to target voice Information carries out batch replacement to the voice messaging to be modified repeatedly occurred in video to be modified:
Determine the multiple timing nodes repeatedly occurred in the video to be modified of voice messaging to be modified after parsing;
Target voice information is obtained, and multiple voice messaging batches to be modified at multiple timing nodes are replaced with into mesh Mark voice messaging.
Wherein, in embodiments of the present invention, voice content timi requirement device is installed in the device of modification video speech With batch processing device.Voice letter to be modified is found in the video to be modified of the device of video speech after parsing when changing During breath, the voice content timi requirement device in the device of modification video speech is to the voice messaging to be modified after the parsing Video to be modified in the timing node that occurs be determined, timing node herein is multiple, and each timing node occurs The once voice messaging to be modified, and the voice messaging to be modified that each timing node occurs is identical.
The when segmentum intercalaris occurred when the voice messaging to be modified that voice content timi requirement device is determined in video to be modified After point, the device of modification video speech obtains target voice information, and the voice messaging to be modified at each timing node is entered Row is replaced, that is, the device for changing video speech is entered using target voice information to the voice messaging to be modified at multiple timing nodes Batch of going is replaced, so not only can be most to realize that the erroneous point to garbled voice carries out lookup automatically, modification and batch processing Big degree lifts the quality of instructional video, can also save substantial amounts of manpower modification cost, improve modification efficiency.
When it is implemented, voice content timi requirement device and batch processing device can be real using the method for software or hardware It is existing, it is not particularly limited herein.Video modification software is installed in the device for changing video speech, when the dress of modification video speech Put and the timing node that voice messaging to be modified occurs is determined, and obtain after target voice information, modification video speech Voice content recognition unit in device sends modification instruction, and gives video modification software by the modification instruction feedback, in order to The video modification software replaces voice messaging to be modified using target voice information.
Further, the method for the modification video sound is further comprising the steps of:
Step S204:If finding voice messaging to be modified in video to be modified after parsing, obtain with it is to be repaired The voice messaging changed is corresponding and image information to be modified that repeatedly occur in video to be modified, and according to target picture Information is replaced to image information to be modified.
Wherein, in embodiments of the present invention, if to find this in video to be modified to be repaired for the device of modification video sound The voice messaging changed, the then video content inspection unit changed in the device of video sound is carried out to the video to be modified after parsing Content recognition, it is corresponding and repeatedly occurring in the video to be modified to be repaired with the voice messaging for obtaining to be modified with this Change image information.
After the device for changing video sound gets image information to be modified, the device of modification video sound can be further Ground obtains target picture information, and the target picture information can be stored in advance in the device of modification video sound.When modification video , can be using the target picture information to repeatedly occurring in video to be modified when the device of sound gets the target picture information Image information to be modified is replaced.
It should be noted that in embodiments of the present invention, image information to be modified refers to occurring in image content and treat The image information of the corresponding word of the voice messaging of modification, or the shape of the mouth as one speaks of explanation person and voice messaging to be modified in picture Corresponding image information;In addition, the image information to be modified repeatedly occurred in video to be modified can part it is identical, can phase completely Together, also can be entirely different, depending on its specific video content according to video to be modified, it is not particularly limited herein.
Further, if finding voice messaging to be modified in video to be modified after parsing, obtain with it is to be modified Voice messaging is corresponding and image information to be modified that repeatedly occur in video to be modified, and believed according to target picture Breath is replaced specially to image information to be modified:
Obtain the multiple to be modified image informations corresponding with voice messaging to be modified;
Determine the multiple timing nodes occurred in the video to be modified of multiple image informations to be modified after parsing;
Target picture information is obtained, and multiple image informations to be modified at multiple timing nodes are replaced with into target picture Information.
Wherein, in embodiments of the present invention, the multiple to be modified picture letters corresponding with voice messaging to be modified are obtained Breath is referred to described in step S204, and here is omitted.
Further, after the device for changing video speech gets image information to be modified, the dress of video speech is changed Voice content timi requirement device pair image information to be modified corresponding with the voice messaging to be modified in putting is in the parsing The timing node occurred in video to be modified afterwards is determined.Specifically, voice content timi requirement device can be treated to this After the positioning of timing node that the voice messaging of modification occurs in video to be modified, getting after after image information, according to treating The timing node that the voice messaging of modification occurs in video to be modified determines to treat what image information occurred in video to be modified Timing node.
It is worth noting that, timing node herein is multiple, there is the once picture to be modified in each timing node Information, and each timing node occur image information to be modified can it is identical, can part it is identical, can completely not yet Together, it is not particularly limited herein.
The when segmentum intercalaris occurred when the image information to be modified that voice content timi requirement device is determined in video to be modified After point, the device of modification video speech obtains target picture information, and the image information to be modified at each timing node is entered Row is replaced, that is, the device for changing video speech is entered using target picture information to the voice messaging to be modified at multiple timing nodes Row is replaced, wherein, target picture information is corresponding with image information to be modified, and is stored in advance in modification video sound In the memory cell of device, when needing to be replaced image information to be modified, the device of video sound is changed by the target Image information is inserted at the corresponding timing node of image information to be modified, and picture to be modified is deleted or covered, with reality Now to the replacement of image information to be modified.
In embodiments of the present invention, by searching voice messaging to be modified in video to be modified so that searching During to multiple voice messagings to be modified, batch is carried out to multiple voice messagings to be modified using correct target voice information Replace, so not only can at utmost lift the quality of instructional video, substantial amounts of manpower modification cost can also be saved, improved Modification efficiency when changing video speech.
In addition, can also be by searching image information to be modified in video to be modified so that multiple to be repaired finding When changing image information, multiple image informations to be modified are replaced using correct target picture information, teaching is improved with this The quality of video, can also save substantial amounts of manpower modification cost, improve modification efficiency.
It is a kind of schematic block diagram of device 3 for changing video speech provided in an embodiment of the present invention referring to Fig. 3.This hair Each module that the device 3 for the modification video speech that bright embodiment is provided includes is used to perform each step in the corresponding embodiments of Fig. 1 Suddenly, it is specific referring to Fig. 1, and the associated description in the corresponding embodiments of Fig. 1, here is omitted.The embodiment of the present invention is carried The device 3 of the modification video speech of confession includes parsing module 300, the replacement module 302 of acquisition module 301 and first.
The video name that parsing module 300 is used to be inputted according to user obtains video to be modified, and video to be modified is entered Row parsing.
Acquisition module 301 is used to obtain voice messaging to be modified, and is searched in video to be modified after parsing to be repaired The voice messaging changed.
If the first replacement module 302 is used to find voice messaging to be modified in video to be modified after parsing, Batch replacement is carried out to the voice messaging to be modified repeatedly occurred in video to be modified according to target voice information.
In embodiments of the present invention, the device 3 of modification video speech is to be modified by being searched in video to be modified Voice messaging so that when finding multiple voice messagings to be modified, using correct target voice information to multiple to be repaired The voice messaging changed carries out batch replacement, so not only can at utmost lift the quality of instructional video, can also save big The manpower modification cost of amount, improves modification efficiency when changing video speech.
It is a kind of schematic block diagram of device 4 for changing video speech provided in an embodiment of the present invention referring to Fig. 4.This hair The 4 each modules included that fill for the modification video speech that bright embodiment is provided are used to perform each step in the corresponding embodiments of Fig. 2, It is specific referring to Fig. 2, and the associated description in the corresponding embodiments of Fig. 2, here is omitted.It is provided in an embodiment of the present invention Changing the device 4 of video speech includes parsing module 400, acquisition module 401, the first replacement module 402 and the second replacement mould Block 403.
The video name that parsing module 400 is used to be inputted according to user obtains video to be modified, and video to be modified is entered Row parsing.
Acquisition module 401 is used to obtain voice messaging to be modified, and is searched in video to be modified after parsing to be repaired The voice messaging changed.
If the first replacement module 402 is used to find voice messaging to be modified in video to be modified after parsing, Batch replacement is carried out to the voice messaging to be modified repeatedly occurred in video to be modified according to target voice information.
Further, the first replacement module 402 includes the first determining unit and the first replacement unit.
Wherein, the first determining unit is used to determine repeatedly to go out in the video to be modified of voice messaging to be modified after parsing Existing multiple timing nodes.
First replacement unit is used to obtain target voice information, and by multiple voices to be modified at multiple timing nodes Information batch replaces with target voice information.
If the second replacement module 403 is used to find voice messaging to be modified in video to be modified after parsing, Obtain image information to be modified corresponding with voice messaging to be modified and repeatedly occurring in video to be modified, and root Image information to be modified is replaced according to target picture information.
Further, the second replacement module 403 includes acquiring unit, the second determining unit and the second replacement unit.
Wherein, acquiring unit is used to obtain the multiple to be modified image informations corresponding with voice messaging to be modified;
It is many that second determining unit is used to determining to occur in the video to be modified of multiple image informations to be modified after parsing Individual timing node;
Second replacement unit is used to obtain target picture information, and multiple pictures to be modified at multiple timing nodes are believed Breath replaces with target picture information.
In embodiments of the present invention, the device 4 of modification video speech is to be modified by being searched in video to be modified Voice messaging so that when finding multiple voice messagings to be modified, using correct target voice information to multiple to be repaired The voice messaging changed carries out batch replacement, so not only can at utmost lift the quality of instructional video, can also save big The manpower modification cost of amount, improves modification efficiency when changing video speech.
In addition, the device 4 of modification video speech can also be made by searching image information to be modified in video to be modified Obtain when finding multiple image informations to be modified, multiple image informations to be modified are carried out using correct target picture information Replace, the quality of instructional video is improved with this, substantial amounts of manpower modification cost can also be saved, modification efficiency is improved.
It should be understood that the size of the sequence number of each step is not meant to the priority of execution sequence, each process in above-described embodiment Execution sequence should determine that the implementation process without tackling the embodiment of the present invention constitutes any limit with its function and internal logic It is fixed.
Fig. 5 is the schematic diagram of the device 5 for the modification video sound that one embodiment of the invention is provided.As shown in figure 5, the implementation The device 5 of the modification video sound of example includes:Processor 50, memory 51 and it is stored in the memory 51 and can be in institute The computer program 52 run on processor 50 is stated, for example, changes the program of the method for video sound.The processor 50 is performed The step in the embodiment of the method for each above-mentioned modification video sound is realized during the computer program 52, such as shown in Fig. 1 Step 101 is to 103, and step 201 shown in Fig. 2 is to 204.Or, the processor 50 performs the computer program 52 The function of each module/unit in the above-mentioned each device embodiments of Shi Shixian, such as the function of module 300 to 302 shown in Fig. 3, and The function of module 400 to 403 shown in Fig. 4.
Exemplary, the computer program 52 can be divided into one or more module/units, it is one or Multiple module/units are stored in the memory 51, and are performed by the processor 50, to complete the present invention.Described one Individual or multiple module/units can complete the series of computation machine programmed instruction section of specific function, and the instruction segment is used for Implementation procedure of the computer program 52 in the 5 of the modification video sound is described.For example, the computer program 52 can To be divided into synchronization module, summarizing module, acquisition module, return module (module in virtual bench), each specific work(of module Can be as follows:
The video name that parsing module 300 is used to be inputted according to user obtains video to be modified, and video to be modified is entered Row parsing.
Acquisition module 301 is used to obtain voice messaging to be modified, and is searched in video to be modified after parsing to be repaired The voice messaging changed.
If the first replacement module 302 is used to find voice messaging to be modified in video to be modified after parsing, Batch replacement is carried out to the voice messaging to be modified repeatedly occurred in video to be modified according to target voice information.
Or the video name that parsing module 400 is used to be inputted according to user obtains video to be modified, and regarded to be modified Frequency is parsed.
Acquisition module 401 is used to obtain voice messaging to be modified, and is searched in video to be modified after parsing to be repaired The voice messaging changed.
If the first replacement module 402 is used to find voice messaging to be modified in video to be modified after parsing, Batch replacement is carried out to the voice messaging to be modified repeatedly occurred in video to be modified according to target voice information.
Further, the first replacement module 402 includes the first determining unit and the first replacement unit.
Wherein, the first determining unit is used to determine repeatedly to go out in the video to be modified of voice messaging to be modified after parsing Existing multiple timing nodes.
First replacement unit is used to obtain target voice information, and by multiple voices to be modified at multiple timing nodes Information batch replaces with target voice information.
If the second replacement module 403 is used to find voice messaging to be modified in video to be modified after parsing, Obtain image information to be modified corresponding with voice messaging to be modified and repeatedly occurring in video to be modified, and root Image information to be modified is replaced according to target picture information.
Further, the second replacement module 403 includes acquiring unit, the second determining unit and the second replacement unit.
Wherein, acquiring unit is used to obtain the multiple to be modified image informations corresponding with voice messaging to be modified;
It is many that second determining unit is used to determining to occur in the video to be modified of multiple image informations to be modified after parsing Individual timing node;
Second replacement unit is used to obtain target picture information, and multiple pictures to be modified at multiple timing nodes are believed Breath replaces with target picture information.
The device 5 of the modification video sound can be desktop PC, notebook, palm PC and cloud server Deng computing device.The device 5 of the modification video sound may include, but be not limited only to, processor 50, memory 51.This area Technical staff is appreciated that Fig. 5 is only the example for the device 5 for changing video sound, does not constitute to modification video sound The restriction of device 5, can include than illustrating more or less parts, either combine some parts or different parts, example The device 5 of modification video sound can also include input-output equipment, network access equipment, bus etc. as described.
Alleged processor 50 can be CPU (Central Processing Unit, CPU), can also be Other general processors, digital signal processor (Digital Signal Processor, DSP), application specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field- Programmable Gate Array, FPGA) or other PLDs, discrete gate or transistor logic, Discrete hardware components etc..General processor can be microprocessor or the processor can also be any conventional processor Deng.
The memory 51 can be the internal storage unit of the device 5 of the modification video sound, for example, change video The hard disk or internal memory of the device 5 of sound.The memory 51 can also be the external storage of the device 5 of the modification video sound The plug-in type hard disk being equipped with equipment, such as device 5 of described modification video sound, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card) etc..Further, it is described to deposit Reservoir 51 can also both include the internal storage unit of the device 5 of the modification video sound or including External memory equipment.Institute Stating memory 51 is used to store other program sums needed for the device 5 of the computer program and the modification video sound According to.The memory 51 can be also used for temporarily storing the data that has exported or will export.
It is apparent to those skilled in the art that, for convenience of description and succinctly, only with above-mentioned each work( Energy unit, the division progress of module are for example, in practical application, as needed can distribute above-mentioned functions by different Functional unit, module are completed, i.e., the internal structure of described device is divided into different functional unit or module, more than completion The all or part of function of description.Each functional unit, module in embodiment can be integrated in a processing unit, also may be used To be that unit is individually physically present, can also two or more units it is integrated in a unit, it is above-mentioned integrated Unit can both be realized in the form of hardware, it would however also be possible to employ the form of SFU software functional unit is realized.In addition, each function list Member, the specific name of module are also only to facilitate mutually differentiation, is not limited to the protection domain of the application.Said system The specific work process of middle unit, module, may be referred to the corresponding process in preceding method embodiment, will not be repeated here.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, without detailed description or note in some embodiment The part of load, may refer to the associated description of other embodiments.
Those of ordinary skill in the art are it is to be appreciated that the list of each example described with reference to the embodiments described herein Member and algorithm steps, can be realized with the combination of electronic hardware or computer software and electronic hardware.These functions are actually Performed with hardware or software mode, depending on the application-specific and design constraint of technical scheme.Professional and technical personnel Described function can be realized using distinct methods to each specific application, but this realization is it is not considered that exceed The scope of the present invention.
, can be with embodiment provided by the present invention, it should be understood that disclosed device/terminal device and method Realize by another way.For example, device described above/terminal device embodiment is only schematical, for example, institute The division of module or unit is stated, only a kind of division of logic function there can be other dividing mode, for example when actually realizing Multiple units or component can combine or be desirably integrated into another system, or some features can be ignored, or not perform.Separately A bit, shown or discussed coupling or direct-coupling or communication connection each other can be by some interfaces, device Or INDIRECT COUPLING or the communication connection of unit, can be electrical, machinery or other forms.
The unit illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit The part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple On NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs 's.
In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit, can also That unit is individually physically present, can also two or more units it is integrated in a unit.Above-mentioned integrated list Member can both be realized in the form of hardware, it would however also be possible to employ the form of SFU software functional unit is realized.
If the integrated module/unit realized using in the form of SFU software functional unit and as independent production marketing or In use, can be stored in a computer read/write memory medium.Understood based on such, the present invention realizes above-mentioned implementation All or part of flow in example method, can also instruct the hardware of correlation to complete, described meter by computer program Calculation machine program can be stored in a computer-readable recording medium, and the computer program can be achieved when being executed by processor The step of stating each embodiment of the method..Wherein, the computer program includes computer program code, the computer program Code can be source code form, object identification code form, executable file or some intermediate forms etc..Computer-readable Jie Matter can include:Can carry any entity or device of the computer program code, recording medium, USB flash disk, mobile hard disk, Magnetic disc, CD, computer storage, read-only storage (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium etc..It should be noted that described The content that computer-readable medium is included can carry out appropriate increasing according to legislation in jurisdiction and the requirement of patent practice Subtract, such as, in some jurisdictions, according to legislation and patent practice, computer-readable medium does not include electric carrier signal and electricity Believe signal.
Embodiment described above is merely illustrative of the technical solution of the present invention, rather than its limitations;Although with reference to foregoing reality Example is applied the present invention is described in detail, it will be understood by those within the art that:It still can be to foregoing each Technical scheme described in embodiment is modified, or carries out equivalent substitution to which part technical characteristic;And these are changed Or replace, the essence of appropriate technical solution is departed from the spirit and scope of various embodiments of the present invention technical scheme, all should Within protection scope of the present invention.

Claims (10)

1. a kind of method for changing video speech, it is characterised in that methods described includes:
The video name inputted according to user obtains video to be modified, and the video to be modified is parsed;
Obtain voice messaging to be modified, and the voice messaging to be modified is searched in video to be modified after parsing;
If the voice messaging to be modified is found in the video to be modified after the parsing, according to target voice information Batch replacement is carried out to the voice messaging to be modified repeatedly occurred in the video to be modified.
2. the method as described in claim 1, it is characterised in that it is described according to target voice information in the video to be modified The voice messaging to be modified repeatedly occurred carries out batch replacement:
Determine the multiple timing nodes repeatedly occurred in video to be modified of the voice messaging to be modified after the parsing;
Target voice information is obtained, and multiple voice messaging batches to be modified at the multiple timing node are replaced For the target voice information.
3. the method as described in claim 1, it is characterised in that acquisition voice messaging to be modified, and after parsing Searched in video to be modified after the voice messaging to be modified, methods described also includes:
If finding the voice messaging to be modified in the video to be modified after the parsing, obtain with it is described to be modified Voice messaging is corresponding and image information to be modified that repeatedly occur in the video to be modified, and drawn according to target Face information is replaced to the image information to be modified.
4. method according to claim 3, it is characterised in that the acquisition is corresponding with the voice messaging to be modified And the image information to be modified that repeatedly occurs in the video to be modified, and according to target picture information to described to be repaired Change image information to be replaced specially:
Obtain the multiple to be modified image informations corresponding with the voice messaging to be modified;
Determine the multiple timing nodes occurred in video to be modified of the multiple image information to be modified after the parsing;
Target picture information is obtained, and multiple image informations to be modified at the multiple timing node are replaced with described Target picture information.
5. a kind of device for changing video speech, it is characterised in that described device includes:
Parsing module, the video name for being inputted according to user obtains video to be modified, and the video to be modified is carried out Parsing;
Searched in acquisition module, the voice messaging to be modified for obtaining, and video to be modified after parsing described to be modified Voice messaging;
First replacement module, if for finding the voice messaging to be modified in the video to be modified after the parsing, Then batch is carried out to the voice messaging to be modified repeatedly occurred in the video to be modified according to target voice information to replace Change.
6. device as claimed in claim 5, it is characterised in that first replacement module includes:
First determining unit, for determining repeatedly to go out in video to be modified of the voice messaging to be modified after the parsing Existing multiple timing nodes;
First replacement unit, for obtaining target voice information, and will be multiple described to be modified at the multiple timing node Voice messaging batch replace with the target voice information.
7. device as claimed in claim 5, it is characterised in that described device also includes:
Second replacement module, if for finding the voice messaging to be modified in the video to be modified after the parsing, Then obtain picture to be modified corresponding with the voice messaging to be modified and repeatedly occurring in the video to be modified Information, and the image information to be modified is replaced according to target picture information.
8. device as claimed in claim 7, it is characterised in that second replacement module includes:
Acquiring unit, for obtaining the multiple to be modified image informations corresponding with the voice messaging to be modified;
Second determining unit, for determining occur in video to be modified of the multiple image information to be modified after the parsing Multiple timing nodes;
Second replacement unit, for obtaining target picture information, and will be multiple described to be modified at the multiple timing node Image information replaces with the target picture information.
9. a kind of device for changing video speech, including memory, processor and it is stored in the memory and can be in institute State the computer program run on processor, it is characterised in that realized described in the computing device during computer program as weighed The step of profit requires any one of 1 to 4 methods described.
10. a kind of computer-readable recording medium, the computer-readable recording medium storage has computer program, its feature exists In the step of realizing such as any one of Claims 1-4 methods described when the computer program is executed by processor.
CN201710411693.7A 2017-06-05 2017-06-05 Method, device and computer readable storage medium for modifying video and voice Active CN107071553B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710411693.7A CN107071553B (en) 2017-06-05 2017-06-05 Method, device and computer readable storage medium for modifying video and voice

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710411693.7A CN107071553B (en) 2017-06-05 2017-06-05 Method, device and computer readable storage medium for modifying video and voice

Publications (2)

Publication Number Publication Date
CN107071553A true CN107071553A (en) 2017-08-18
CN107071553B CN107071553B (en) 2020-02-07

Family

ID=59616428

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710411693.7A Active CN107071553B (en) 2017-06-05 2017-06-05 Method, device and computer readable storage medium for modifying video and voice

Country Status (1)

Country Link
CN (1) CN107071553B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111885416A (en) * 2020-07-17 2020-11-03 北京来也网络科技有限公司 Audio and video correction method, device, medium and computing equipment
CN111885313A (en) * 2020-07-17 2020-11-03 北京来也网络科技有限公司 Audio and video correction method, device, medium and computing equipment
CN113051985A (en) * 2019-12-26 2021-06-29 深圳云天励飞技术有限公司 Information prompting method and device, electronic equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5909667A (en) * 1997-03-05 1999-06-01 International Business Machines Corporation Method and apparatus for fast voice selection of error words in dictated text
CN1561059A (en) * 2004-02-23 2005-01-05 中兴通讯股份有限公司 Coading method of voice data in communication system
CN102467376A (en) * 2010-11-10 2012-05-23 金蝶软件(中国)有限公司 Modification method and device for character information
CN103207769A (en) * 2012-01-16 2013-07-17 联想(北京)有限公司 Method and user equipment for voice amending
CN106534964A (en) * 2016-11-23 2017-03-22 广东小天才科技有限公司 Speed adjusting method and device
CN106604056A (en) * 2016-11-30 2017-04-26 腾讯科技(深圳)有限公司 Method and device for playing video
CN106792346A (en) * 2016-11-14 2017-05-31 广东小天才科技有限公司 Audio regulation method and device in a kind of instructional video

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5909667A (en) * 1997-03-05 1999-06-01 International Business Machines Corporation Method and apparatus for fast voice selection of error words in dictated text
CN1561059A (en) * 2004-02-23 2005-01-05 中兴通讯股份有限公司 Coading method of voice data in communication system
CN102467376A (en) * 2010-11-10 2012-05-23 金蝶软件(中国)有限公司 Modification method and device for character information
CN103207769A (en) * 2012-01-16 2013-07-17 联想(北京)有限公司 Method and user equipment for voice amending
CN106792346A (en) * 2016-11-14 2017-05-31 广东小天才科技有限公司 Audio regulation method and device in a kind of instructional video
CN106534964A (en) * 2016-11-23 2017-03-22 广东小天才科技有限公司 Speed adjusting method and device
CN106604056A (en) * 2016-11-30 2017-04-26 腾讯科技(深圳)有限公司 Method and device for playing video

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113051985A (en) * 2019-12-26 2021-06-29 深圳云天励飞技术有限公司 Information prompting method and device, electronic equipment and storage medium
CN111885416A (en) * 2020-07-17 2020-11-03 北京来也网络科技有限公司 Audio and video correction method, device, medium and computing equipment
CN111885313A (en) * 2020-07-17 2020-11-03 北京来也网络科技有限公司 Audio and video correction method, device, medium and computing equipment
CN111885416B (en) * 2020-07-17 2022-04-12 北京来也网络科技有限公司 Audio and video correction method, device, medium and computing equipment

Also Published As

Publication number Publication date
CN107071553B (en) 2020-02-07

Similar Documents

Publication Publication Date Title
US11645517B2 (en) Information processing method and terminal, and computer storage medium
CN110309058A (en) Business end test method, device, computer installation and computer storage medium
CN107784063B (en) Algorithm generation method and terminal equipment
CN111159770B (en) Text data desensitization method, device, medium and electronic equipment
CN107590291A (en) A kind of searching method of picture, terminal device and storage medium
CN107491536B (en) Test question checking method, test question checking device and electronic equipment
CN108491388A (en) Data set acquisition methods, sorting technique, device, equipment and storage medium
CN112560453A (en) Voice information verification method and device, electronic equipment and medium
CN109697537A (en) The method and apparatus of data audit
CN107193974A (en) Localized information based on artificial intelligence determines method and apparatus
CN107741972A (en) A kind of searching method of picture, terminal device and storage medium
CN108121699A (en) For the method and apparatus of output information
CN107071553A (en) A kind of method, device and computer-readable recording medium for changing video speech
CN112951233A (en) Voice question and answer method and device, electronic equipment and readable storage medium
CN111651989A (en) Named entity recognition method and device, storage medium and electronic device
CN116402166B (en) Training method and device of prediction model, electronic equipment and storage medium
CN107679222A (en) Image processing method, mobile terminal and computer-readable recording medium
CN106528141A (en) Task sweep-out method and system
CN111859985B (en) AI customer service model test method and device, electronic equipment and storage medium
CN115346095A (en) Visual question answering method, device, equipment and storage medium
CN109120509A (en) A kind of method and device that information is collected
CN107665443A (en) Obtain the method and device of targeted customer
CN113688232A (en) Method and device for classifying bidding texts, storage medium and terminal
CN115270799B (en) Named entity identification method and device
CN110414395B (en) Content identification method, device, server and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant