CN104883607B - A kind of video interception or the method, apparatus and mobile device of shearing - Google Patents

A kind of video interception or the method, apparatus and mobile device of shearing Download PDF

Info

Publication number
CN104883607B
CN104883607B CN201510305097.1A CN201510305097A CN104883607B CN 104883607 B CN104883607 B CN 104883607B CN 201510305097 A CN201510305097 A CN 201510305097A CN 104883607 B CN104883607 B CN 104883607B
Authority
CN
China
Prior art keywords
target video
video
sectional drawing
voiceprint
shearing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510305097.1A
Other languages
Chinese (zh)
Other versions
CN104883607A (en
Inventor
刘黎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201510305097.1A priority Critical patent/CN104883607B/en
Publication of CN104883607A publication Critical patent/CN104883607A/en
Application granted granted Critical
Publication of CN104883607B publication Critical patent/CN104883607B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44012Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving rendering scenes according to scene graphs, e.g. MPEG-4 scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440245Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display the reformatting operation being performed only on part of the stream, e.g. a region of the image or a time segment

Abstract

The embodiments of the invention provide a kind of video interception or the method, apparatus and mobile device of shearing.This method includes:Gather the voiceprint in sample sound;Target video is scanned according to voiceprint, and obtains the time point that the sound matched in target video with voiceprint occurs;Sectional drawing or shearing are carried out to target video according to acquired time point.By using above-mentioned technical proposal, user using mobile device when carrying out video interception or shearing manipulation, it may specify sample sound and target video, mobile device will scan target video automatically according to the voiceprint gathered from the sample sound, video image at voiceprint matching or video content are intercepted, algorithm is simple and interception accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, interception position is manually selected without user, can meet user's request.

Description

A kind of video interception or the method, apparatus and mobile device of shearing
Technical field
The present embodiments relate to image processing technique field, more particularly to a kind of video interception or the method for shearing, dress Put and mobile device.
Background technology
People are when watching the video files such as the short-movie of film, TV play or oneself shooting, it will usually run into many senses Character moulding, lines, the scenery included in the scene of interest, such as video.At this moment, people often want by video interception or The mode of video shearing preserves these scenes into picture or short visual form, convenient to check or use in the future.
, may be only interested in being related to the scene of same personage, then will want to save for many users Relate only to the picture of the personage or short-sighted frequency.In order to comprehensively intercept all fields for including the personage in a video file Scape, user is needed constantly to carry out sectional drawing or shearing manipulation in watching process, or all bags are recorded in watching process Time point containing personage's scene, then it is cumbersome, time-consuming, accurate by Video processing software progress sectional drawing or shearing, operating process Spend low.For the user of mobile device accustomed to using viewing video, carried out on the touchscreen using finger substantially Associative operation, above-mentioned sectional drawing or cut mode can be more complicated, and be difficult accurately be truncated to every time anticipation picture or Person envision the period in short-sighted frequency, when intercept it is inaccurate when, need that the operation of video interception or shearing is repeated again, for The use at family is made troubles.
The content of the invention
It is existing to solve the purpose of the present invention is to propose to a kind of video interception or the method, apparatus and mobile device of shearing Video interception or cut mode it is complex for operation step, time-consuming, and the problem of the degree of accuracy is low.
In a first aspect, the embodiments of the invention provide a kind of video interception or the method for shearing, including:
Gather the voiceprint in sample sound;
Target video is scanned according to the voiceprint, and obtains what is matched in the target video with the voiceprint The time point that sound occurs;
Sectional drawing or shearing are carried out to the target video according to acquired time point.
Second aspect, the embodiments of the invention provide a kind of video interception or the device of shearing, including:
Voiceprint acquisition module, for gathering the voiceprint in sample sound;
Videoscanning module, for according to the voiceprint scan target video, and obtain in the target video with The time point that the sound of the voiceprint matching occurs;
Sectional drawing or shear module, for carrying out sectional drawing or shearing to the target video according to acquired time point.
The third aspect, the embodiments of the invention provide a kind of mobile device, the mobile device is included in the embodiment of the present invention Video interception or shearing device.
The video interception or the method, apparatus of shearing and mobile device provided in the embodiment of the present invention, can simplify existing Video interception or shearing manipulation, it is time saving and energy saving, and the degree of accuracy is high.The video interception that is there is provided in the embodiment of the present invention or shearing Method, voiceprint in the sample sound collected scan target video, and obtain in target video with the vocal print At the time point that the sound of information matches occurs, sectional drawing or shearing are carried out to target video according to the time point of acquisition.By adopting With above-mentioned technical proposal, user may specify sample sound and mesh when carrying out video interception or shearing manipulation using mobile device Video is marked, mobile device will scan target video automatically according to the voiceprint gathered from the sample sound, by vocal print Video image or video content at information matches are intercepted, and algorithm is simple and interception accuracy rate is high, and quick interception can be achieved. Whole operation process simple and fast, manually selects interception position without user, meets user's request.
Brief description of the drawings
Fig. 1 is the schematic flow sheet of a kind of video interception that the embodiment of the present invention one provides or the method for shearing;
Fig. 2 is the schematic flow sheet of a kind of video interception that the embodiment of the present invention two provides or the method for shearing;
Fig. 3 is the structured flowchart of the device of a kind of video interception that the embodiment of the present invention three provides or shearing.
Embodiment
Further illustrate technical scheme below in conjunction with the accompanying drawings and by embodiment.It is appreciated that It is that specific embodiment described herein is used only for explaining the present invention, rather than limitation of the invention.Further need exist for illustrating , for the ease of description, part related to the present invention rather than entire infrastructure are illustrate only in accompanying drawing.
Embodiment one
A kind of video interception or the schematic flow sheet of the method for shearing that Fig. 1 provides for the embodiment of the present invention one, this method It can be performed by video interception or the device of shearing, wherein the device can be realized by software and/or hardware, and typically be integrated in shifting In dynamic equipment.As shown in figure 1, this method includes:
Voiceprint in step 101, collection sample sound.
It is exemplary, the mobile device concretely equipment such as mobile phone, tablet personal computer and notebook computer.This area skill Art personnel understand that the device for performing the present embodiment methods described is not limited to be integrated in mobile device, can also be integrated in platform In other electronic equipments such as formula computer, it is more obvious because of caused beneficial effect to be typically integrated in mobile device, So the embodiment of the present invention is only illustrated exemplified by being integrated in mobile device.
The purpose of method that the present embodiment provides is to carry out sectional drawing or shearing to target video, and exemplary, the target regards The video file such as short-movie of frequency concretely film, TV play or user oneself shooting, generally comprise in target video multiple Personage.When moulding of the user to one of personage, lines or the personage occur, corresponding scene etc. is interested, Bian Huixi Hope picture when occurring for the personage or fragment carry out sectional drawing or shearing, the personage can be defined as target person.
Exemplary, the source of the sample sound in this step can have a variety of.Such as:Directly it can be obtained by microphone The sound of target person simultaneously generates sample sound, such a mode be particularly suitable for use in target video for user oneself shoot short-movie feelings Condition;Also sample sound can be obtained by reading the audio or video comprising target person sound, is wrapped when in the audio or video During containing multiple personage's sound, it can be carried out specifying by user and read section, certainly, also can directly found by user in target video Any one place includes the reading section of target person sound, the target person sound read in section specified by reading user To obtain sample sound.After sample sound is obtained, sample sound can be stored, it is convenient directly to transfer use in the future.
In the present embodiment, the biological characteristic for the sound that the vocal print is behaved, the voiceprint may include the frequency of sound The vocal print characteristic such as rate, wavelength, intensity, rhythm and tone, can identify in one section of sound whether include vocal print by voiceprint Affiliated owner's word or other sound sent.In this step, the voiceprint in sample sound is acquired, preferably , the voiceprint collected can also be stored, it is convenient directly to transfer use in the future.
Step 102, target video scanned according to voiceprint, and obtain the sound matched in target video with voiceprint The time point of appearance.
Exemplary, the scanning based on voiceprint can be carried out in the case where target video is in broadcast state, in scanning process It is middle to be matched the sound occurred in target video with voiceprint, when the sound occurred is consistent with voiceprint, Illustrate that the match is successful, the sound is what target person was sent, obtains the time point that the sound occurs in target video;Also can incite somebody to action Audio-frequency information in target video is extracted, and the scanning based on voiceprint is carried out to the audio-frequency information extracted, Audio-frequency information is matched with voiceprint in scanning process, when certain a part of audio-frequency information is consistent with voiceprint, Illustrate that the match is successful, obtain the part audio-frequency information corresponding time point in target video.
Step 103, sectional drawing or shearing carried out to target video according to acquired time point.
Exemplary, video interception function setting option can be added respectively in a mobile device and video shearing function is set Option, it can voluntarily be set by user in above-mentioned two function setting option and be turned on and off corresponding function.Preferably, can also be by It is that video interception or video are sheared that user sets current interception type in video jukebox software, or sets and support simultaneously State two kinds of interception types.
Exemplary, can be directly to the video figure corresponding to the time point when getting a time point in a step 102 As being intercepted, and generate corresponding picture and stored.Preferably, can be ordered automatically according to the time point for corresponding sectional drawing Name, facilitates user to check in the future;When continuously acquiring multiple time points, illustrate that the sound of target person is held within a period of time It is continuous to occur, the video content in the period corresponding to multiple time points can be sheared, wherein, video content includes video Display image and the relevant information such as corresponding audio, can then generate the video file after corresponding shearing and be stored.
Exemplary, when getting a time point in a step 102, the time point can be stored, wait target After videoscanning, time shaft is generated according to the sequencing at acquired time point, further according to time shaft to target video Carry out sectional drawing or shearing.Wherein, the time shaft generated can be stored in mobile device, such as random access memory (Random-Access Memory, RAM), it is called when needing and carrying out sectional drawing or shearing.Specifically, work as target person A certain moment of the sound in target video occur, the moment can correspond to a point on time shaft, such as 1 in target video Divide at 20 seconds and target person sound occur, then the 1 of time shaft point corresponds at 20 seconds a point occurs;When the sound of target person exists Persistently occur in a certain period in target video, the period can correspond to a line segment on time shaft, such as 1 in target video Persistently there is target person sound between points 30 seconds to 1 point and 50 seconds, then correspondingly occur on time shaft starting point for 1 point 30 seconds, terminal For 1 point of line segment of 50 seconds., can be to the video image corresponding to the point on time shaft when video interception function is in opening Intercepted, video image corresponding in line segment can also be intercepted, and generate corresponding picture and stored.Work as video When shearing function is in opening, video content corresponding in line segment can be sheared, can then generate and cut accordingly Video file after cutting is stored.
Further, when carrying out shearing manipulation to target video, predetermined time period can be obtained, according to it is acquired when Between point and predetermined time period determine shearing section, according to shearing section target video is sheared.Exemplary, can basis Preset algorithm draws predetermined time period y, and the sound of matching is got at time point n, shearing section can be defined as [n-y, N+y], the video in the shearing section is sheared.For example, the predetermined time period y got is 15 seconds, the time of acquisition Point n is 1 minute and 20 seconds, then shears section as [1 point and 5 seconds, 1 point and 35 seconds], to 1 point of regarding between 35 seconds 5 seconds to 1 point of target video Frequency content is sheared.
Further, target video can be sheared according to acquired time point, and generates multiple sub-videos, will be more Individual sub- video-splicing turns into synthetic video.Exemplary, can basis when target person sound occurs multiple in target video Acquired time point is repeatedly sheared to target video, generates a corresponding sub-video after shearing every time.To whole After target video is sheared, multiple sub-videos can be subjected to splicing according to the order at time point, form synthetic video, can Avoid user that the operation for opening video is performed a plurality of times when checking sub-video in the future.
Preferably, when carrying out shot operation to target video, if target person sound goes out occurrence in target video The number time that is more or persistently occurring is longer, can obtain sectional drawing frequency, and according to sectional drawing frequency and acquired time point to mesh Mark video and carry out sectional drawing, so as to control sectional drawing quantity.Wherein, sectional drawing frequency can be set or by user according to reality by system default Situation sets itself.For example, acquired sectional drawing frequency be 30 seconds once, assign to 25 points 40 seconds, 30 points the 25 of target video There is target person sound to 32 points, then an image can be intercepted at 30 seconds at 25 points, 30 points 30 seconds, 31 points, 31 points 30 seconds and 32 offices intercept an image respectively, and four pictures are obtained.
In the present embodiment, when the sound time of occurrence of target person compares concentration, in order to save scan matching when Between, before step 102 is performed, the step of section to be scanned for obtaining target video can be increased, and when performing step 102, Section to be scanned is scanned according to voiceprint, and obtains the time that the sound matched in section to be scanned with voiceprint occurs Point.Wherein, the sweep interval can be selected by user.Specifically, can by be passed to the related period or by Section to be scanned is specified in the progress of video, as user can be by pulling video playback progress bar from a time point to other one Individual time point, using the section between two time points as section to be scanned.
The video interception or the method for shearing that the embodiment of the present invention one provides, according to the vocal print in the sample sound collected Information scans target video, and obtains the time point that the sound matched with the voiceprint in target video occurs, according to obtaining The time point taken carries out sectional drawing or shearing to target video.By using above-mentioned technical proposal, user is using mobile device Carry out video interception or during shearing manipulation, may specify sample sound and target video, mobile device will be automatically according to from the sound The voiceprint gathered in sound sample scans target video, and video image at voiceprint matching or video content are carried out Interception, algorithm is simple and interception accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, it is manual without user Interception position is selected, meets user's request.
Embodiment two
A kind of video interception or the schematic flow sheet of the method for shearing that Fig. 2 provides for the embodiment of the present invention two, this implementation Example is optimized based on above-described embodiment, in the present embodiment, fractional scanning is carried out to target video, and according to acquired Time point generate multiple period of the day from 11 p.m. to 1 a.m countershafts, target video is carried out further according to multiple period of the day from 11 p.m. to 1 a.m countershafts to be segmented sectional drawing or shearing.
Accordingly, the method for the present embodiment comprises the following steps:
Voiceprint in step 201, collection sample sound.
Step 202, according to voiceprint to target video carry out fractional scanning, and obtain in target video with voiceprint The time point that the sound of matching occurs.
Exemplary, in order to accelerate sectional drawing or shearing process, the audio-frequency information in target video can be extracted, to institute The audio-frequency information extracted carries out the multi-thread formula fractional scanning based on voiceprint.Time span can be based on by audio-frequency information N section is divided into, while each part is scanned, and obtains the sound matched in audio-frequency information with voiceprint and occurs Time point.For example, the time span of target video totally 30 minutes, can by the audio-frequency information extracted according to 0-10 minutes, It is divided into 3 parts within -20 minutes 10 minutes and -30 minutes 20 minutes, matching is scanned simultaneously to this 3 parts.
Step 203, multiple period of the day from 11 p.m. to 1 a.m countershafts are generated according to acquired time point.
Exemplary, the time point according to acquired in each part of audio-frequency information generates N number of period of the day from 11 p.m. to 1 a.m countershaft.Further , after generating a sub- time shaft, the period of the day from 11 p.m. to 1 a.m countershaft can be stored.
Step 204, according to the multiple period of the day from 11 p.m. to 1 a.m countershafts target video is carried out being segmented sectional drawing or shearing.
Exemplary, the time occurred due to the sound matched with voiceprint included in each part of audio-frequency information The quantity of point is likely to different, and relatively long one may be taken during scan matching comprising the more part of time point quantity A bit, therefore, the generation time of each period of the day from 11 p.m. to 1 a.m countershaft may also can be different.When sub- time of the sub- time shaft prior to other parts When axle is generated, sectional drawing or shearing can be carried out to corresponding video section based on the period of the day from 11 p.m. to 1 a.m countershaft that this is first generated, without Sectional drawing or shearing manipulation are performed again after all being generated etc. all period of the day from 11 p.m. to 1 a.m countershafts.
In the present embodiment, in order to further speed up sectional drawing or shearing process, can also increase before step 202 is performed The step of obtaining the section to be scanned of target video, and when performing step 202, regarding for sweep interval is treated according to voiceprint Frequency carries out fractional scanning.Wherein, the sweep interval can be selected by user.
The preferred embodiment that a kind of method using the embodiment of the present invention two carries out video interception is provided below:
For example, user needs to carry out video interception to a video A in mobile phone, it is therefore an objective to which interception includes personage's B sound Scene.It is sectional drawing that user, which can first specify interception type, and the target video for then confirming to need to carry out sectional drawing is whole duration Video A.User chooses a file (can be audio file or video file) comprising personage's B sound, Huo Zhe in mobile phone A scene comprising personage's B sound is specified to gather the voiceprint in sample sound as sample sound in video A.It will regard Frequency A is divided into N sections and carries out multithreading scan matching, when often generating a period of the day from 11 p.m. to 1 a.m countershaft for including personage's B voiceprints, according to The period of the day from 11 p.m. to 1 a.m countershaft carries out sectional drawing to this section of video.The picture file generated after sectional drawing is named simultaneously with its time point in video A It is stored in mobile phone EMS memory.If the scene that personage B occurs is more, in order to control sectional drawing quantity, sectional drawing frequency, such as 30s can be set Sectional drawing of interior progress.
The preferred embodiment that a kind of method using the embodiment of the present invention two carries out video shearing is provided below:
For example, user needs to shear video A, it is therefore an objective to which shearing includes the scene of personage's C sound.Pass through microphone One section of personage's C word is recorded, generates sample sound, gathers the voiceprint in sample sound.User can first specify interception class Type is sheared for video, and it is 30-60 minutes then to specify section to be scanned.Video A section 30-60 minutes to be scanned are designated as area Between D, by section D be divided into M sections carry out multithreading scan matching, often generate the sub- time for including personage's C voiceprints During axle, corresponding video section is sheared according to the period of the day from 11 p.m. to 1 a.m countershaft.Assuming that respectively 35-37,45-48,54-56 minute this Three periods include personage C vocal print characteristic, it will three sub-videos of generation, these three sub-videos can be spliced, it is raw The synthetic video slightly larger into one, and stored.
The video interception or the method for shearing that the embodiment of the present invention two provides, target video can be carried out according to voiceprint Fractional scanning, matching and the sectional drawing of multi-thread formula or shearing, it is substantially shorter corresponding to whole video interception or shear history Time, user is quickly obtained wanting the picture or video of interception, further lift the usage experience of user.
Embodiment three
Fig. 3 is the structured flowchart of the device of a kind of video interception that the embodiment of the present invention three provides or shearing, and the device can Realized by software and/or hardware, and be typically integrated in mobile device.As shown in figure 3, the device includes:Voiceprint gathers Module 301, for gathering the voiceprint in sample sound;Videoscanning module 302, for being scanned according to the voiceprint Target video, and obtain the time point that the sound matched in the target video with the voiceprint occurs;Sectional drawing or shearing Module 303, for carrying out sectional drawing or shearing to the target video according to acquired time point.
The video interception or the device of shearing that the embodiment of the present invention three provides, are believed by videoscanning module 302 according to vocal print The voiceprint in the sample sound that acquisition module 301 collects is ceased to scan target video, and is obtained in target video with being somebody's turn to do The time point that the sound of voiceprint matching occurs, by sectional drawing or shear module 303 according to the time point of acquisition to target video Carry out sectional drawing or shearing.By using above-mentioned technical proposal, user is carrying out video interception or shearing behaviour using mobile device When making, sample sound and target video are may specify, mobile device will be believed automatically according to the vocal print gathered from the sample sound Cease to scan target video, the video image at voiceprint matching or video content are intercepted, algorithm is simple and intercepts Accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, interception position is manually selected without user, meet to use Family demand.
On the basis of above-described embodiment, the sectional drawing or shear module may include:Time shaft generation unit and first section Figure or cut cells.Wherein, time shaft generation unit, for generating time shaft according to acquired time point;First sectional drawing or Cut cells, for carrying out sectional drawing or shearing to target video according to time shaft.
On the basis of above-described embodiment, the videoscanning module may include fractional scanning unit, for according to vocal print Information carries out fractional scanning to target video;The sectional drawing or shear module may include period of the day from 11 p.m. to 1 a.m countershaft generation unit and the second sectional drawing Or cut cells.Wherein, period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershafts according to acquired time point;Second Sectional drawing or cut cells, for being carried out being segmented sectional drawing or shearing to the target video according to the multiple period of the day from 11 p.m. to 1 a.m countershaft.
On the basis of above-described embodiment, the sectional drawing or shear module may include predetermined time period acquiring unit, cut Cut interval determination unit and the 3rd sectional drawing or cut cells.Wherein, predetermined time period acquiring unit, for obtaining preset time Length;Interval determination unit is sheared, for determining shearing section according to acquired time point and predetermined time period;3rd section Figure or cut cells, for being sheared according to shearing section to target video.
On the basis of above-described embodiment, the sectional drawing or shear module may include that sub-video generation unit and splicing are single Member.Wherein, sub-video generation unit, for being sheared according to acquired time point to target video, and more height are generated Video;Concatenation unit, for the splicing of multiple sub-videos to be turned into synthetic video.
On the basis of above-described embodiment, the device may also include acquisition module, for scanning mesh according to voiceprint Before marking video, the section to be scanned of target video is obtained;The sectional drawing or shear module can be specifically used for:According to voiceprint Section to be scanned is scanned, and obtains the time point that the sound matched in section to be scanned with voiceprint occurs.
On the basis of above-described embodiment, the sectional drawing or shear module may include frequency acquisition unit and the 4th sectional drawing or Cut cells.Wherein, frequency acquisition unit, for obtaining sectional drawing frequency;4th sectional drawing or cut cells, for being cut according to described Figure frequency and acquired time point carry out sectional drawing to the target video.
Example IV
The embodiment of the present invention four provides a kind of mobile device, and the equipment includes the video interception described in the embodiment of the present invention Or the device of shearing, video can be cut by performing the method for video interception described in the embodiment of the present invention or shearing Figure or shearing manipulation.
It is exemplary, the mobile device concretely equipment such as mobile phone, tablet personal computer and notebook computer.Preferably, The voice collection devices such as microphone are set in the mobile device.
When user carries out video interception or shearing manipulation in the mobile device provided using the embodiment of the present invention four, can refer to Determine sample sound and target video, mobile device will scan mesh automatically according to the voiceprint gathered from the sample sound Video is marked, the video image at voiceprint matching or video content are intercepted, algorithm is simple and interception accuracy rate is high, can Realize quick interception.Whole operation process simple and fast, manually selects interception position without user, meets user's request.
Pay attention to, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes, Readjust and substitute without departing from protection scope of the present invention.Therefore, although being carried out by above example to the present invention It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also Other more equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.

Claims (15)

1. a kind of video interception or the method for shearing, it is characterised in that including:
The voiceprint in sample sound is gathered, wherein, the sample sound is to specify mesh in reading section by reading user Mark the acoustic information of personage;
Target video is scanned according to the voiceprint, and obtains the sound matched in the target video with the voiceprint The time point of appearance;
Sectional drawing or shearing are carried out to the target video according to acquired time point.
2. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Sectional drawing or shearing, including:
Time shaft is generated according to acquired time point;
Sectional drawing or shearing are carried out to the target video according to the time shaft.
3. according to the method for claim 1, it is characterised in that
Target video is scanned according to the voiceprint, including:
Fractional scanning is carried out to target video according to the voiceprint;
Sectional drawing or shearing are carried out to the target video according to acquired time point, including:
Multiple period of the day from 11 p.m. to 1 a.m countershafts are generated according to acquired time point;
The target video is carried out according to the multiple period of the day from 11 p.m. to 1 a.m countershaft to be segmented sectional drawing or shearing.
4. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Shearing, including:
Obtain predetermined time period;
Shearing section is determined according to acquired time point and the predetermined time period;
The target video is sheared according to the shearing section.
5. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Shearing, including:
The target video is sheared according to acquired time point, and generates multiple sub-videos;
The splicing of the multiple sub-video is turned into synthetic video.
6. according to the method for claim 1, it is characterised in that before target video is scanned according to the voiceprint, Also include:
Obtain the section to be scanned of target video;
Target video is scanned according to the voiceprint, and obtains the sound matched in the target video with the voiceprint The time point of appearance, including:
The section to be scanned is scanned according to the voiceprint, and obtain in the section to be scanned with the voiceprint The time point that the sound matched somebody with somebody occurs.
7. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point Sectional drawing, including:
Obtain sectional drawing frequency;
Sectional drawing is carried out to the target video according to the sectional drawing frequency and acquired time point.
8. a kind of video interception or the device of shearing, it is characterised in that including:
Voiceprint acquisition module, for gathering the voiceprint in sample sound, wherein, the voiceprint is to pass through reading User specifies the acoustic information for reading target person in section;
Videoscanning module, for according to the voiceprint scan target video, and obtain in the target video with it is described The time point that the sound of voiceprint matching occurs;
Sectional drawing or shear module, for carrying out sectional drawing or shearing to the target video according to acquired time point.
9. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Time shaft generation unit, for generating time shaft according to acquired time point;
First sectional drawing or cut cells, for carrying out sectional drawing or shearing to the target video according to the time shaft.
10. device according to claim 8, it is characterised in that
The videoscanning module includes:
Fractional scanning unit, for carrying out fractional scanning to target video according to the voiceprint;
The sectional drawing or shear module include:
Period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershafts according to acquired time point;
Second sectional drawing or cut cells, for according to the multiple period of the day from 11 p.m. to 1 a.m countershaft to the target video carry out be segmented sectional drawing or Shearing.
11. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Predetermined time period acquiring unit, for obtaining predetermined time period;
Interval determination unit is sheared, for determining shearing section according to acquired time point and the predetermined time period;
3rd sectional drawing or cut cells, for being sheared according to the shearing section to the target video.
12. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Sub-video generation unit, for being sheared according to acquired time point to the target video, and generate more height Video;
Concatenation unit, for the splicing of the multiple sub-video to be turned into synthetic video.
13. device according to claim 8, it is characterised in that also include:
Acquisition module, for before target video is scanned according to the voiceprint, obtaining the section to be scanned of target video;
The sectional drawing or shear module are specifically used for:
The section to be scanned is scanned according to the voiceprint, and obtain in the section to be scanned with the voiceprint The time point that the sound matched somebody with somebody occurs.
14. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Frequency acquisition unit, for obtaining sectional drawing frequency;
4th sectional drawing or cut cells, for being carried out according to the sectional drawing frequency and acquired time point to the target video Sectional drawing.
15. a kind of mobile device, it is characterised in that including the video interception as any one of claim 8-14 or shearing Device.
CN201510305097.1A 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing Active CN104883607B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510305097.1A CN104883607B (en) 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510305097.1A CN104883607B (en) 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing

Publications (2)

Publication Number Publication Date
CN104883607A CN104883607A (en) 2015-09-02
CN104883607B true CN104883607B (en) 2017-12-19

Family

ID=53950914

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510305097.1A Active CN104883607B (en) 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing

Country Status (1)

Country Link
CN (1) CN104883607B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105488227B (en) * 2015-12-29 2019-09-20 惠州Tcl移动通信有限公司 A kind of electronic equipment and its method that audio file is handled based on vocal print feature
CN106131627B (en) * 2016-07-07 2019-03-26 腾讯科技(深圳)有限公司 A kind of method for processing video frequency, apparatus and system
CN107688792B (en) * 2017-09-05 2020-06-05 语联网(武汉)信息技术有限公司 Video translation method and system
CN107517406B (en) * 2017-09-05 2020-02-14 语联网(武汉)信息技术有限公司 Video editing and translating method
CN107562737B (en) * 2017-09-05 2020-12-22 语联网(武汉)信息技术有限公司 Video segmentation method and system for translation
CN107820123A (en) * 2017-10-25 2018-03-20 深圳天珑无线科技有限公司 Method, mobile terminal and the storage device of mobile terminal screen printing picture
CN108419040A (en) * 2018-02-28 2018-08-17 上海乐愚智能科技有限公司 A kind of recording of growing up method, apparatus, robot and computer-readable medium
CN110418159A (en) * 2018-10-11 2019-11-05 彩云之端文化传媒(北京)有限公司 A method of television content is intercepted across screen based on Application on Voiceprint Recognition
CN109361956B (en) * 2018-11-22 2021-04-30 广西巴多传媒科技有限公司 Time-based video cropping methods and related products
CN112312039A (en) * 2019-07-15 2021-02-02 北京小米移动软件有限公司 Audio and video information acquisition method, device, equipment and storage medium
CN110572706B (en) * 2019-09-29 2021-05-11 深圳传音控股股份有限公司 Video screenshot method, terminal and computer-readable storage medium
CN112468735B (en) * 2021-01-26 2021-05-11 北京深蓝长盛科技有限公司 Video processing system and video processing method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014161282A1 (en) * 2013-07-15 2014-10-09 中兴通讯股份有限公司 Method and device for adjusting playback progress of video file
CN104185086A (en) * 2014-03-28 2014-12-03 无锡天脉聚源传媒科技有限公司 Method and device for providing video information
CN104540004A (en) * 2015-01-27 2015-04-22 深圳市中兴移动通信有限公司 Video screenshot method and video screenshot device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014161282A1 (en) * 2013-07-15 2014-10-09 中兴通讯股份有限公司 Method and device for adjusting playback progress of video file
CN104185086A (en) * 2014-03-28 2014-12-03 无锡天脉聚源传媒科技有限公司 Method and device for providing video information
CN104540004A (en) * 2015-01-27 2015-04-22 深圳市中兴移动通信有限公司 Video screenshot method and video screenshot device

Also Published As

Publication number Publication date
CN104883607A (en) 2015-09-02

Similar Documents

Publication Publication Date Title
CN104883607B (en) A kind of video interception or the method, apparatus and mobile device of shearing
CN104184923B (en) System and method for retrieving people information in video
US10541003B2 (en) Performance content synchronization based on audio
CN110139062B (en) Video conference record creating method and device and terminal equipment
CN107615766A (en) System and method for creating and distributing content of multimedia
US20130330062A1 (en) Automatic creation of movie with images synchronized to music
CN109117233A (en) Method and apparatus for handling information
CN103718166A (en) Information processing apparatus, information processing method, and computer program product
CN112653902B (en) Speaker recognition method and device and electronic equipment
JP2020126645A (en) Information processing method, terminal device and information processing device
JP2010135925A (en) Comment visualization device, and comment visualization program
KR20070007290A (en) Tutorial generation unit
CN105657497B (en) A kind of video broadcasting method and equipment
EP3962067A1 (en) Method and device for adding lyrics to short video
CN113709527A (en) Method and device for paying attention to anchor in multi-anchor scene
CN113316015A (en) Bullet screen processing method, device and system
CN104954875B (en) A kind of video playing progress monitoring method and device
CN103488529B (en) A kind of method and apparatus for video resource access control
CN113038185B (en) Bullet screen processing method and device
JP5941760B2 (en) Display control device, display control method, display system, program, and recording medium
CN102142271B (en) Handheld multimedia player for synchronously displaying waveform and repeating method
CN105187788B (en) A kind of method and system of analog machine real-time data record and displaying
EP1940056A3 (en) Broadcast receiving apparatus and method thereof
CN104185064B (en) Media file identification method and apparatus
JP2010134681A (en) Lecture material preparation support system, lecture material preparation support method and lecture material preparation support program

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Patentee after: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.

Address before: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Patentee before: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.