CN104883607B

CN104883607B - A kind of video interception or the method, apparatus and mobile device of shearing

Info

Publication number: CN104883607B
Application number: CN201510305097.1A
Authority: CN
Inventors: 刘黎
Original assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Current assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date: 2015-06-05
Filing date: 2015-06-05
Publication date: 2017-12-19
Anticipated expiration: 2035-06-05
Also published as: CN104883607A

Abstract

The embodiments of the invention provide a kind of video interception or the method, apparatus and mobile device of shearing.This method includes：Gather the voiceprint in sample sound；Target video is scanned according to voiceprint, and obtains the time point that the sound matched in target video with voiceprint occurs；Sectional drawing or shearing are carried out to target video according to acquired time point.By using above-mentioned technical proposal, user using mobile device when carrying out video interception or shearing manipulation, it may specify sample sound and target video, mobile device will scan target video automatically according to the voiceprint gathered from the sample sound, video image at voiceprint matching or video content are intercepted, algorithm is simple and interception accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, interception position is manually selected without user, can meet user's request.

Description

A kind of video interception or the method, apparatus and mobile device of shearing

Technical field

The present embodiments relate to image processing technique field, more particularly to a kind of video interception or the method for shearing, dress Put and mobile device.

Background technology

People are when watching the video files such as the short-movie of film, TV play or oneself shooting, it will usually run into many senses Character moulding, lines, the scenery included in the scene of interest, such as video.At this moment, people often want by video interception or The mode of video shearing preserves these scenes into picture or short visual form, convenient to check or use in the future.

, may be only interested in being related to the scene of same personage, then will want to save for many users Relate only to the picture of the personage or short-sighted frequency.In order to comprehensively intercept all fields for including the personage in a video file Scape, user is needed constantly to carry out sectional drawing or shearing manipulation in watching process, or all bags are recorded in watching process Time point containing personage's scene, then it is cumbersome, time-consuming, accurate by Video processing software progress sectional drawing or shearing, operating process Spend low.For the user of mobile device accustomed to using viewing video, carried out on the touchscreen using finger substantially Associative operation, above-mentioned sectional drawing or cut mode can be more complicated, and be difficult accurately be truncated to every time anticipation picture or Person envision the period in short-sighted frequency, when intercept it is inaccurate when, need that the operation of video interception or shearing is repeated again, for The use at family is made troubles.

The content of the invention

It is existing to solve the purpose of the present invention is to propose to a kind of video interception or the method, apparatus and mobile device of shearing Video interception or cut mode it is complex for operation step, time-consuming, and the problem of the degree of accuracy is low.

In a first aspect, the embodiments of the invention provide a kind of video interception or the method for shearing, including：

Gather the voiceprint in sample sound；

Target video is scanned according to the voiceprint, and obtains what is matched in the target video with the voiceprint The time point that sound occurs；

Sectional drawing or shearing are carried out to the target video according to acquired time point.

Second aspect, the embodiments of the invention provide a kind of video interception or the device of shearing, including：

Voiceprint acquisition module, for gathering the voiceprint in sample sound；

Videoscanning module, for according to the voiceprint scan target video, and obtain in the target video with The time point that the sound of the voiceprint matching occurs；

Sectional drawing or shear module, for carrying out sectional drawing or shearing to the target video according to acquired time point.

The third aspect, the embodiments of the invention provide a kind of mobile device, the mobile device is included in the embodiment of the present invention Video interception or shearing device.

The video interception or the method, apparatus of shearing and mobile device provided in the embodiment of the present invention, can simplify existing Video interception or shearing manipulation, it is time saving and energy saving, and the degree of accuracy is high.The video interception that is there is provided in the embodiment of the present invention or shearing Method, voiceprint in the sample sound collected scan target video, and obtain in target video with the vocal print At the time point that the sound of information matches occurs, sectional drawing or shearing are carried out to target video according to the time point of acquisition.By adopting With above-mentioned technical proposal, user may specify sample sound and mesh when carrying out video interception or shearing manipulation using mobile device Video is marked, mobile device will scan target video automatically according to the voiceprint gathered from the sample sound, by vocal print Video image or video content at information matches are intercepted, and algorithm is simple and interception accuracy rate is high, and quick interception can be achieved. Whole operation process simple and fast, manually selects interception position without user, meets user's request.

Brief description of the drawings

Fig. 1 is the schematic flow sheet of a kind of video interception that the embodiment of the present invention one provides or the method for shearing；

Fig. 2 is the schematic flow sheet of a kind of video interception that the embodiment of the present invention two provides or the method for shearing；

Fig. 3 is the structured flowchart of the device of a kind of video interception that the embodiment of the present invention three provides or shearing.

Embodiment

Further illustrate technical scheme below in conjunction with the accompanying drawings and by embodiment.It is appreciated that It is that specific embodiment described herein is used only for explaining the present invention, rather than limitation of the invention.Further need exist for illustrating , for the ease of description, part related to the present invention rather than entire infrastructure are illustrate only in accompanying drawing.

Embodiment one

A kind of video interception or the schematic flow sheet of the method for shearing that Fig. 1 provides for the embodiment of the present invention one, this method It can be performed by video interception or the device of shearing, wherein the device can be realized by software and/or hardware, and typically be integrated in shifting In dynamic equipment.As shown in figure 1, this method includes：

Voiceprint in step 101, collection sample sound.

It is exemplary, the mobile device concretely equipment such as mobile phone, tablet personal computer and notebook computer.This area skill Art personnel understand that the device for performing the present embodiment methods described is not limited to be integrated in mobile device, can also be integrated in platform In other electronic equipments such as formula computer, it is more obvious because of caused beneficial effect to be typically integrated in mobile device, So the embodiment of the present invention is only illustrated exemplified by being integrated in mobile device.

The purpose of method that the present embodiment provides is to carry out sectional drawing or shearing to target video, and exemplary, the target regards The video file such as short-movie of frequency concretely film, TV play or user oneself shooting, generally comprise in target video multiple Personage.When moulding of the user to one of personage, lines or the personage occur, corresponding scene etc. is interested, Bian Huixi Hope picture when occurring for the personage or fragment carry out sectional drawing or shearing, the personage can be defined as target person.

Exemplary, the source of the sample sound in this step can have a variety of.Such as：Directly it can be obtained by microphone The sound of target person simultaneously generates sample sound, such a mode be particularly suitable for use in target video for user oneself shoot short-movie feelings Condition；Also sample sound can be obtained by reading the audio or video comprising target person sound, is wrapped when in the audio or video During containing multiple personage's sound, it can be carried out specifying by user and read section, certainly, also can directly found by user in target video Any one place includes the reading section of target person sound, the target person sound read in section specified by reading user To obtain sample sound.After sample sound is obtained, sample sound can be stored, it is convenient directly to transfer use in the future.

In the present embodiment, the biological characteristic for the sound that the vocal print is behaved, the voiceprint may include the frequency of sound The vocal print characteristic such as rate, wavelength, intensity, rhythm and tone, can identify in one section of sound whether include vocal print by voiceprint Affiliated owner's word or other sound sent.In this step, the voiceprint in sample sound is acquired, preferably , the voiceprint collected can also be stored, it is convenient directly to transfer use in the future.

Step 102, target video scanned according to voiceprint, and obtain the sound matched in target video with voiceprint The time point of appearance.

Exemplary, the scanning based on voiceprint can be carried out in the case where target video is in broadcast state, in scanning process It is middle to be matched the sound occurred in target video with voiceprint, when the sound occurred is consistent with voiceprint, Illustrate that the match is successful, the sound is what target person was sent, obtains the time point that the sound occurs in target video；Also can incite somebody to action Audio-frequency information in target video is extracted, and the scanning based on voiceprint is carried out to the audio-frequency information extracted, Audio-frequency information is matched with voiceprint in scanning process, when certain a part of audio-frequency information is consistent with voiceprint, Illustrate that the match is successful, obtain the part audio-frequency information corresponding time point in target video.

Step 103, sectional drawing or shearing carried out to target video according to acquired time point.

Exemplary, video interception function setting option can be added respectively in a mobile device and video shearing function is set Option, it can voluntarily be set by user in above-mentioned two function setting option and be turned on and off corresponding function.Preferably, can also be by It is that video interception or video are sheared that user sets current interception type in video jukebox software, or sets and support simultaneously State two kinds of interception types.

Exemplary, can be directly to the video figure corresponding to the time point when getting a time point in a step 102 As being intercepted, and generate corresponding picture and stored.Preferably, can be ordered automatically according to the time point for corresponding sectional drawing Name, facilitates user to check in the future；When continuously acquiring multiple time points, illustrate that the sound of target person is held within a period of time It is continuous to occur, the video content in the period corresponding to multiple time points can be sheared, wherein, video content includes video Display image and the relevant information such as corresponding audio, can then generate the video file after corresponding shearing and be stored.

Exemplary, when getting a time point in a step 102, the time point can be stored, wait target After videoscanning, time shaft is generated according to the sequencing at acquired time point, further according to time shaft to target video Carry out sectional drawing or shearing.Wherein, the time shaft generated can be stored in mobile device, such as random access memory (Random-Access Memory, RAM), it is called when needing and carrying out sectional drawing or shearing.Specifically, work as target person A certain moment of the sound in target video occur, the moment can correspond to a point on time shaft, such as 1 in target video Divide at 20 seconds and target person sound occur, then the 1 of time shaft point corresponds at 20 seconds a point occurs；When the sound of target person exists Persistently occur in a certain period in target video, the period can correspond to a line segment on time shaft, such as 1 in target video Persistently there is target person sound between points 30 seconds to 1 point and 50 seconds, then correspondingly occur on time shaft starting point for 1 point 30 seconds, terminal For 1 point of line segment of 50 seconds., can be to the video image corresponding to the point on time shaft when video interception function is in opening Intercepted, video image corresponding in line segment can also be intercepted, and generate corresponding picture and stored.Work as video When shearing function is in opening, video content corresponding in line segment can be sheared, can then generate and cut accordingly Video file after cutting is stored.

Further, when carrying out shearing manipulation to target video, predetermined time period can be obtained, according to it is acquired when Between point and predetermined time period determine shearing section, according to shearing section target video is sheared.Exemplary, can basis Preset algorithm draws predetermined time period y, and the sound of matching is got at time point n, shearing section can be defined as [n-y, N+y], the video in the shearing section is sheared.For example, the predetermined time period y got is 15 seconds, the time of acquisition Point n is 1 minute and 20 seconds, then shears section as [1 point and 5 seconds, 1 point and 35 seconds], to 1 point of regarding between 35 seconds 5 seconds to 1 point of target video Frequency content is sheared.

Further, target video can be sheared according to acquired time point, and generates multiple sub-videos, will be more Individual sub- video-splicing turns into synthetic video.Exemplary, can basis when target person sound occurs multiple in target video Acquired time point is repeatedly sheared to target video, generates a corresponding sub-video after shearing every time.To whole After target video is sheared, multiple sub-videos can be subjected to splicing according to the order at time point, form synthetic video, can Avoid user that the operation for opening video is performed a plurality of times when checking sub-video in the future.

Preferably, when carrying out shot operation to target video, if target person sound goes out occurrence in target video The number time that is more or persistently occurring is longer, can obtain sectional drawing frequency, and according to sectional drawing frequency and acquired time point to mesh Mark video and carry out sectional drawing, so as to control sectional drawing quantity.Wherein, sectional drawing frequency can be set or by user according to reality by system default Situation sets itself.For example, acquired sectional drawing frequency be 30 seconds once, assign to 25 points 40 seconds, 30 points the 25 of target video There is target person sound to 32 points, then an image can be intercepted at 30 seconds at 25 points, 30 points 30 seconds, 31 points, 31 points 30 seconds and 32 offices intercept an image respectively, and four pictures are obtained.

In the present embodiment, when the sound time of occurrence of target person compares concentration, in order to save scan matching when Between, before step 102 is performed, the step of section to be scanned for obtaining target video can be increased, and when performing step 102, Section to be scanned is scanned according to voiceprint, and obtains the time that the sound matched in section to be scanned with voiceprint occurs Point.Wherein, the sweep interval can be selected by user.Specifically, can by be passed to the related period or by Section to be scanned is specified in the progress of video, as user can be by pulling video playback progress bar from a time point to other one Individual time point, using the section between two time points as section to be scanned.

The video interception or the method for shearing that the embodiment of the present invention one provides, according to the vocal print in the sample sound collected Information scans target video, and obtains the time point that the sound matched with the voiceprint in target video occurs, according to obtaining The time point taken carries out sectional drawing or shearing to target video.By using above-mentioned technical proposal, user is using mobile device Carry out video interception or during shearing manipulation, may specify sample sound and target video, mobile device will be automatically according to from the sound The voiceprint gathered in sound sample scans target video, and video image at voiceprint matching or video content are carried out Interception, algorithm is simple and interception accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, it is manual without user Interception position is selected, meets user's request.

Embodiment two

A kind of video interception or the schematic flow sheet of the method for shearing that Fig. 2 provides for the embodiment of the present invention two, this implementation Example is optimized based on above-described embodiment, in the present embodiment, fractional scanning is carried out to target video, and according to acquired Time point generate multiple period of the day from 11 p.m. to 1 a.m countershafts, target video is carried out further according to multiple period of the day from 11 p.m. to 1 a.m countershafts to be segmented sectional drawing or shearing.

Accordingly, the method for the present embodiment comprises the following steps：

Voiceprint in step 201, collection sample sound.

Step 202, according to voiceprint to target video carry out fractional scanning, and obtain in target video with voiceprint The time point that the sound of matching occurs.

Exemplary, in order to accelerate sectional drawing or shearing process, the audio-frequency information in target video can be extracted, to institute The audio-frequency information extracted carries out the multi-thread formula fractional scanning based on voiceprint.Time span can be based on by audio-frequency information N section is divided into, while each part is scanned, and obtains the sound matched in audio-frequency information with voiceprint and occurs Time point.For example, the time span of target video totally 30 minutes, can by the audio-frequency information extracted according to 0-10 minutes, It is divided into 3 parts within -20 minutes 10 minutes and -30 minutes 20 minutes, matching is scanned simultaneously to this 3 parts.

Step 203, multiple period of the day from 11 p.m. to 1 a.m countershafts are generated according to acquired time point.

Exemplary, the time point according to acquired in each part of audio-frequency information generates N number of period of the day from 11 p.m. to 1 a.m countershaft.Further , after generating a sub- time shaft, the period of the day from 11 p.m. to 1 a.m countershaft can be stored.

Step 204, according to the multiple period of the day from 11 p.m. to 1 a.m countershafts target video is carried out being segmented sectional drawing or shearing.

Exemplary, the time occurred due to the sound matched with voiceprint included in each part of audio-frequency information The quantity of point is likely to different, and relatively long one may be taken during scan matching comprising the more part of time point quantity A bit, therefore, the generation time of each period of the day from 11 p.m. to 1 a.m countershaft may also can be different.When sub- time of the sub- time shaft prior to other parts When axle is generated, sectional drawing or shearing can be carried out to corresponding video section based on the period of the day from 11 p.m. to 1 a.m countershaft that this is first generated, without Sectional drawing or shearing manipulation are performed again after all being generated etc. all period of the day from 11 p.m. to 1 a.m countershafts.

In the present embodiment, in order to further speed up sectional drawing or shearing process, can also increase before step 202 is performed The step of obtaining the section to be scanned of target video, and when performing step 202, regarding for sweep interval is treated according to voiceprint Frequency carries out fractional scanning.Wherein, the sweep interval can be selected by user.

The preferred embodiment that a kind of method using the embodiment of the present invention two carries out video interception is provided below：

For example, user needs to carry out video interception to a video A in mobile phone, it is therefore an objective to which interception includes personage's B sound Scene.It is sectional drawing that user, which can first specify interception type, and the target video for then confirming to need to carry out sectional drawing is whole duration Video A.User chooses a file (can be audio file or video file) comprising personage's B sound, Huo Zhe in mobile phone A scene comprising personage's B sound is specified to gather the voiceprint in sample sound as sample sound in video A.It will regard Frequency A is divided into N sections and carries out multithreading scan matching, when often generating a period of the day from 11 p.m. to 1 a.m countershaft for including personage's B voiceprints, according to The period of the day from 11 p.m. to 1 a.m countershaft carries out sectional drawing to this section of video.The picture file generated after sectional drawing is named simultaneously with its time point in video A It is stored in mobile phone EMS memory.If the scene that personage B occurs is more, in order to control sectional drawing quantity, sectional drawing frequency, such as 30s can be set Sectional drawing of interior progress.

The preferred embodiment that a kind of method using the embodiment of the present invention two carries out video shearing is provided below：

For example, user needs to shear video A, it is therefore an objective to which shearing includes the scene of personage's C sound.Pass through microphone One section of personage's C word is recorded, generates sample sound, gathers the voiceprint in sample sound.User can first specify interception class Type is sheared for video, and it is 30-60 minutes then to specify section to be scanned.Video A section 30-60 minutes to be scanned are designated as area Between D, by section D be divided into M sections carry out multithreading scan matching, often generate the sub- time for including personage's C voiceprints During axle, corresponding video section is sheared according to the period of the day from 11 p.m. to 1 a.m countershaft.Assuming that respectively 35-37,45-48,54-56 minute this Three periods include personage C vocal print characteristic, it will three sub-videos of generation, these three sub-videos can be spliced, it is raw The synthetic video slightly larger into one, and stored.

The video interception or the method for shearing that the embodiment of the present invention two provides, target video can be carried out according to voiceprint Fractional scanning, matching and the sectional drawing of multi-thread formula or shearing, it is substantially shorter corresponding to whole video interception or shear history Time, user is quickly obtained wanting the picture or video of interception, further lift the usage experience of user.

Embodiment three

Fig. 3 is the structured flowchart of the device of a kind of video interception that the embodiment of the present invention three provides or shearing, and the device can Realized by software and/or hardware, and be typically integrated in mobile device.As shown in figure 3, the device includes：Voiceprint gathers Module 301, for gathering the voiceprint in sample sound；Videoscanning module 302, for being scanned according to the voiceprint Target video, and obtain the time point that the sound matched in the target video with the voiceprint occurs；Sectional drawing or shearing Module 303, for carrying out sectional drawing or shearing to the target video according to acquired time point.

The video interception or the device of shearing that the embodiment of the present invention three provides, are believed by videoscanning module 302 according to vocal print The voiceprint in the sample sound that acquisition module 301 collects is ceased to scan target video, and is obtained in target video with being somebody's turn to do The time point that the sound of voiceprint matching occurs, by sectional drawing or shear module 303 according to the time point of acquisition to target video Carry out sectional drawing or shearing.By using above-mentioned technical proposal, user is carrying out video interception or shearing behaviour using mobile device When making, sample sound and target video are may specify, mobile device will be believed automatically according to the vocal print gathered from the sample sound Cease to scan target video, the video image at voiceprint matching or video content are intercepted, algorithm is simple and intercepts Accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, interception position is manually selected without user, meet to use Family demand.

On the basis of above-described embodiment, the sectional drawing or shear module may include：Time shaft generation unit and first section Figure or cut cells.Wherein, time shaft generation unit, for generating time shaft according to acquired time point；First sectional drawing or Cut cells, for carrying out sectional drawing or shearing to target video according to time shaft.

On the basis of above-described embodiment, the videoscanning module may include fractional scanning unit, for according to vocal print Information carries out fractional scanning to target video；The sectional drawing or shear module may include period of the day from 11 p.m. to 1 a.m countershaft generation unit and the second sectional drawing Or cut cells.Wherein, period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershafts according to acquired time point；Second Sectional drawing or cut cells, for being carried out being segmented sectional drawing or shearing to the target video according to the multiple period of the day from 11 p.m. to 1 a.m countershaft.

On the basis of above-described embodiment, the sectional drawing or shear module may include predetermined time period acquiring unit, cut Cut interval determination unit and the 3rd sectional drawing or cut cells.Wherein, predetermined time period acquiring unit, for obtaining preset time Length；Interval determination unit is sheared, for determining shearing section according to acquired time point and predetermined time period；3rd section Figure or cut cells, for being sheared according to shearing section to target video.

On the basis of above-described embodiment, the sectional drawing or shear module may include that sub-video generation unit and splicing are single Member.Wherein, sub-video generation unit, for being sheared according to acquired time point to target video, and more height are generated Video；Concatenation unit, for the splicing of multiple sub-videos to be turned into synthetic video.

On the basis of above-described embodiment, the device may also include acquisition module, for scanning mesh according to voiceprint Before marking video, the section to be scanned of target video is obtained；The sectional drawing or shear module can be specifically used for：According to voiceprint Section to be scanned is scanned, and obtains the time point that the sound matched in section to be scanned with voiceprint occurs.

On the basis of above-described embodiment, the sectional drawing or shear module may include frequency acquisition unit and the 4th sectional drawing or Cut cells.Wherein, frequency acquisition unit, for obtaining sectional drawing frequency；4th sectional drawing or cut cells, for being cut according to described Figure frequency and acquired time point carry out sectional drawing to the target video.

Example IV

The embodiment of the present invention four provides a kind of mobile device, and the equipment includes the video interception described in the embodiment of the present invention Or the device of shearing, video can be cut by performing the method for video interception described in the embodiment of the present invention or shearing Figure or shearing manipulation.

It is exemplary, the mobile device concretely equipment such as mobile phone, tablet personal computer and notebook computer.Preferably, The voice collection devices such as microphone are set in the mobile device.

When user carries out video interception or shearing manipulation in the mobile device provided using the embodiment of the present invention four, can refer to Determine sample sound and target video, mobile device will scan mesh automatically according to the voiceprint gathered from the sample sound Video is marked, the video image at voiceprint matching or video content are intercepted, algorithm is simple and interception accuracy rate is high, can Realize quick interception.Whole operation process simple and fast, manually selects interception position without user, meets user's request.

Pay attention to, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes, Readjust and substitute without departing from protection scope of the present invention.Therefore, although being carried out by above example to the present invention It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also Other more equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.

Claims

1. a kind of video interception or the method for shearing, it is characterised in that including：

The voiceprint in sample sound is gathered, wherein, the sample sound is to specify mesh in reading section by reading user Mark the acoustic information of personage；

Target video is scanned according to the voiceprint, and obtains the sound matched in the target video with the voiceprint The time point of appearance；

2. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Sectional drawing or shearing, including：

Time shaft is generated according to acquired time point；

Sectional drawing or shearing are carried out to the target video according to the time shaft.

3. according to the method for claim 1, it is characterised in that

Target video is scanned according to the voiceprint, including：

Fractional scanning is carried out to target video according to the voiceprint；

Sectional drawing or shearing are carried out to the target video according to acquired time point, including：

Multiple period of the day from 11 p.m. to 1 a.m countershafts are generated according to acquired time point；

The target video is carried out according to the multiple period of the day from 11 p.m. to 1 a.m countershaft to be segmented sectional drawing or shearing.

4. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Shearing, including：

Obtain predetermined time period；

Shearing section is determined according to acquired time point and the predetermined time period；

The target video is sheared according to the shearing section.

5. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Shearing, including：

The target video is sheared according to acquired time point, and generates multiple sub-videos；

The splicing of the multiple sub-video is turned into synthetic video.

6. according to the method for claim 1, it is characterised in that before target video is scanned according to the voiceprint, Also include：

Obtain the section to be scanned of target video；

Target video is scanned according to the voiceprint, and obtains the sound matched in the target video with the voiceprint The time point of appearance, including：

The section to be scanned is scanned according to the voiceprint, and obtain in the section to be scanned with the voiceprint The time point that the sound matched somebody with somebody occurs.

7. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Sectional drawing, including：

Obtain sectional drawing frequency；

Sectional drawing is carried out to the target video according to the sectional drawing frequency and acquired time point.

8. a kind of video interception or the device of shearing, it is characterised in that including：

Voiceprint acquisition module, for gathering the voiceprint in sample sound, wherein, the voiceprint is to pass through reading User specifies the acoustic information for reading target person in section；

Videoscanning module, for according to the voiceprint scan target video, and obtain in the target video with it is described The time point that the sound of voiceprint matching occurs；

9. device according to claim 8, it is characterised in that the sectional drawing or shear module include：

Time shaft generation unit, for generating time shaft according to acquired time point；

First sectional drawing or cut cells, for carrying out sectional drawing or shearing to the target video according to the time shaft.

10. device according to claim 8, it is characterised in that

The videoscanning module includes：

Fractional scanning unit, for carrying out fractional scanning to target video according to the voiceprint；

The sectional drawing or shear module include：

Period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershafts according to acquired time point；

Second sectional drawing or cut cells, for according to the multiple period of the day from 11 p.m. to 1 a.m countershaft to the target video carry out be segmented sectional drawing or Shearing.

11. device according to claim 8, it is characterised in that the sectional drawing or shear module include：

Predetermined time period acquiring unit, for obtaining predetermined time period；

Interval determination unit is sheared, for determining shearing section according to acquired time point and the predetermined time period；

3rd sectional drawing or cut cells, for being sheared according to the shearing section to the target video.

12. device according to claim 8, it is characterised in that the sectional drawing or shear module include：

Sub-video generation unit, for being sheared according to acquired time point to the target video, and generate more height Video；

Concatenation unit, for the splicing of the multiple sub-video to be turned into synthetic video.

13. device according to claim 8, it is characterised in that also include：

Acquisition module, for before target video is scanned according to the voiceprint, obtaining the section to be scanned of target video；

The sectional drawing or shear module are specifically used for：

14. device according to claim 8, it is characterised in that the sectional drawing or shear module include：

Frequency acquisition unit, for obtaining sectional drawing frequency；

4th sectional drawing or cut cells, for being carried out according to the sectional drawing frequency and acquired time point to the target video Sectional drawing.

15. a kind of mobile device, it is characterised in that including the video interception as any one of claim 8-14 or shearing Device.