CN104883607B - A kind of video interception or the method, apparatus and mobile device of shearing - Google Patents
A kind of video interception or the method, apparatus and mobile device of shearing Download PDFInfo
- Publication number
- CN104883607B CN104883607B CN201510305097.1A CN201510305097A CN104883607B CN 104883607 B CN104883607 B CN 104883607B CN 201510305097 A CN201510305097 A CN 201510305097A CN 104883607 B CN104883607 B CN 104883607B
- Authority
- CN
- China
- Prior art keywords
- target video
- video
- sectional drawing
- voiceprint
- shearing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
- H04N21/44012—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving rendering scenes according to scene graphs, e.g. MPEG-4 scene graphs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440245—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display the reformatting operation being performed only on part of the stream, e.g. a region of the image or a time segment
Abstract
The embodiments of the invention provide a kind of video interception or the method, apparatus and mobile device of shearing.This method includes:Gather the voiceprint in sample sound;Target video is scanned according to voiceprint, and obtains the time point that the sound matched in target video with voiceprint occurs;Sectional drawing or shearing are carried out to target video according to acquired time point.By using above-mentioned technical proposal, user using mobile device when carrying out video interception or shearing manipulation, it may specify sample sound and target video, mobile device will scan target video automatically according to the voiceprint gathered from the sample sound, video image at voiceprint matching or video content are intercepted, algorithm is simple and interception accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, interception position is manually selected without user, can meet user's request.
Description
Technical field
The present embodiments relate to image processing technique field, more particularly to a kind of video interception or the method for shearing, dress
Put and mobile device.
Background technology
People are when watching the video files such as the short-movie of film, TV play or oneself shooting, it will usually run into many senses
Character moulding, lines, the scenery included in the scene of interest, such as video.At this moment, people often want by video interception or
The mode of video shearing preserves these scenes into picture or short visual form, convenient to check or use in the future.
, may be only interested in being related to the scene of same personage, then will want to save for many users
Relate only to the picture of the personage or short-sighted frequency.In order to comprehensively intercept all fields for including the personage in a video file
Scape, user is needed constantly to carry out sectional drawing or shearing manipulation in watching process, or all bags are recorded in watching process
Time point containing personage's scene, then it is cumbersome, time-consuming, accurate by Video processing software progress sectional drawing or shearing, operating process
Spend low.For the user of mobile device accustomed to using viewing video, carried out on the touchscreen using finger substantially
Associative operation, above-mentioned sectional drawing or cut mode can be more complicated, and be difficult accurately be truncated to every time anticipation picture or
Person envision the period in short-sighted frequency, when intercept it is inaccurate when, need that the operation of video interception or shearing is repeated again, for
The use at family is made troubles.
The content of the invention
It is existing to solve the purpose of the present invention is to propose to a kind of video interception or the method, apparatus and mobile device of shearing
Video interception or cut mode it is complex for operation step, time-consuming, and the problem of the degree of accuracy is low.
In a first aspect, the embodiments of the invention provide a kind of video interception or the method for shearing, including:
Gather the voiceprint in sample sound;
Target video is scanned according to the voiceprint, and obtains what is matched in the target video with the voiceprint
The time point that sound occurs;
Sectional drawing or shearing are carried out to the target video according to acquired time point.
Second aspect, the embodiments of the invention provide a kind of video interception or the device of shearing, including:
Voiceprint acquisition module, for gathering the voiceprint in sample sound;
Videoscanning module, for according to the voiceprint scan target video, and obtain in the target video with
The time point that the sound of the voiceprint matching occurs;
Sectional drawing or shear module, for carrying out sectional drawing or shearing to the target video according to acquired time point.
The third aspect, the embodiments of the invention provide a kind of mobile device, the mobile device is included in the embodiment of the present invention
Video interception or shearing device.
The video interception or the method, apparatus of shearing and mobile device provided in the embodiment of the present invention, can simplify existing
Video interception or shearing manipulation, it is time saving and energy saving, and the degree of accuracy is high.The video interception that is there is provided in the embodiment of the present invention or shearing
Method, voiceprint in the sample sound collected scan target video, and obtain in target video with the vocal print
At the time point that the sound of information matches occurs, sectional drawing or shearing are carried out to target video according to the time point of acquisition.By adopting
With above-mentioned technical proposal, user may specify sample sound and mesh when carrying out video interception or shearing manipulation using mobile device
Video is marked, mobile device will scan target video automatically according to the voiceprint gathered from the sample sound, by vocal print
Video image or video content at information matches are intercepted, and algorithm is simple and interception accuracy rate is high, and quick interception can be achieved.
Whole operation process simple and fast, manually selects interception position without user, meets user's request.
Brief description of the drawings
Fig. 1 is the schematic flow sheet of a kind of video interception that the embodiment of the present invention one provides or the method for shearing;
Fig. 2 is the schematic flow sheet of a kind of video interception that the embodiment of the present invention two provides or the method for shearing;
Fig. 3 is the structured flowchart of the device of a kind of video interception that the embodiment of the present invention three provides or shearing.
Embodiment
Further illustrate technical scheme below in conjunction with the accompanying drawings and by embodiment.It is appreciated that
It is that specific embodiment described herein is used only for explaining the present invention, rather than limitation of the invention.Further need exist for illustrating
, for the ease of description, part related to the present invention rather than entire infrastructure are illustrate only in accompanying drawing.
Embodiment one
A kind of video interception or the schematic flow sheet of the method for shearing that Fig. 1 provides for the embodiment of the present invention one, this method
It can be performed by video interception or the device of shearing, wherein the device can be realized by software and/or hardware, and typically be integrated in shifting
In dynamic equipment.As shown in figure 1, this method includes:
Voiceprint in step 101, collection sample sound.
It is exemplary, the mobile device concretely equipment such as mobile phone, tablet personal computer and notebook computer.This area skill
Art personnel understand that the device for performing the present embodiment methods described is not limited to be integrated in mobile device, can also be integrated in platform
In other electronic equipments such as formula computer, it is more obvious because of caused beneficial effect to be typically integrated in mobile device,
So the embodiment of the present invention is only illustrated exemplified by being integrated in mobile device.
The purpose of method that the present embodiment provides is to carry out sectional drawing or shearing to target video, and exemplary, the target regards
The video file such as short-movie of frequency concretely film, TV play or user oneself shooting, generally comprise in target video multiple
Personage.When moulding of the user to one of personage, lines or the personage occur, corresponding scene etc. is interested, Bian Huixi
Hope picture when occurring for the personage or fragment carry out sectional drawing or shearing, the personage can be defined as target person.
Exemplary, the source of the sample sound in this step can have a variety of.Such as:Directly it can be obtained by microphone
The sound of target person simultaneously generates sample sound, such a mode be particularly suitable for use in target video for user oneself shoot short-movie feelings
Condition;Also sample sound can be obtained by reading the audio or video comprising target person sound, is wrapped when in the audio or video
During containing multiple personage's sound, it can be carried out specifying by user and read section, certainly, also can directly found by user in target video
Any one place includes the reading section of target person sound, the target person sound read in section specified by reading user
To obtain sample sound.After sample sound is obtained, sample sound can be stored, it is convenient directly to transfer use in the future.
In the present embodiment, the biological characteristic for the sound that the vocal print is behaved, the voiceprint may include the frequency of sound
The vocal print characteristic such as rate, wavelength, intensity, rhythm and tone, can identify in one section of sound whether include vocal print by voiceprint
Affiliated owner's word or other sound sent.In this step, the voiceprint in sample sound is acquired, preferably
, the voiceprint collected can also be stored, it is convenient directly to transfer use in the future.
Step 102, target video scanned according to voiceprint, and obtain the sound matched in target video with voiceprint
The time point of appearance.
Exemplary, the scanning based on voiceprint can be carried out in the case where target video is in broadcast state, in scanning process
It is middle to be matched the sound occurred in target video with voiceprint, when the sound occurred is consistent with voiceprint,
Illustrate that the match is successful, the sound is what target person was sent, obtains the time point that the sound occurs in target video;Also can incite somebody to action
Audio-frequency information in target video is extracted, and the scanning based on voiceprint is carried out to the audio-frequency information extracted,
Audio-frequency information is matched with voiceprint in scanning process, when certain a part of audio-frequency information is consistent with voiceprint,
Illustrate that the match is successful, obtain the part audio-frequency information corresponding time point in target video.
Step 103, sectional drawing or shearing carried out to target video according to acquired time point.
Exemplary, video interception function setting option can be added respectively in a mobile device and video shearing function is set
Option, it can voluntarily be set by user in above-mentioned two function setting option and be turned on and off corresponding function.Preferably, can also be by
It is that video interception or video are sheared that user sets current interception type in video jukebox software, or sets and support simultaneously
State two kinds of interception types.
Exemplary, can be directly to the video figure corresponding to the time point when getting a time point in a step 102
As being intercepted, and generate corresponding picture and stored.Preferably, can be ordered automatically according to the time point for corresponding sectional drawing
Name, facilitates user to check in the future;When continuously acquiring multiple time points, illustrate that the sound of target person is held within a period of time
It is continuous to occur, the video content in the period corresponding to multiple time points can be sheared, wherein, video content includes video
Display image and the relevant information such as corresponding audio, can then generate the video file after corresponding shearing and be stored.
Exemplary, when getting a time point in a step 102, the time point can be stored, wait target
After videoscanning, time shaft is generated according to the sequencing at acquired time point, further according to time shaft to target video
Carry out sectional drawing or shearing.Wherein, the time shaft generated can be stored in mobile device, such as random access memory
(Random-Access Memory, RAM), it is called when needing and carrying out sectional drawing or shearing.Specifically, work as target person
A certain moment of the sound in target video occur, the moment can correspond to a point on time shaft, such as 1 in target video
Divide at 20 seconds and target person sound occur, then the 1 of time shaft point corresponds at 20 seconds a point occurs;When the sound of target person exists
Persistently occur in a certain period in target video, the period can correspond to a line segment on time shaft, such as 1 in target video
Persistently there is target person sound between points 30 seconds to 1 point and 50 seconds, then correspondingly occur on time shaft starting point for 1 point 30 seconds, terminal
For 1 point of line segment of 50 seconds., can be to the video image corresponding to the point on time shaft when video interception function is in opening
Intercepted, video image corresponding in line segment can also be intercepted, and generate corresponding picture and stored.Work as video
When shearing function is in opening, video content corresponding in line segment can be sheared, can then generate and cut accordingly
Video file after cutting is stored.
Further, when carrying out shearing manipulation to target video, predetermined time period can be obtained, according to it is acquired when
Between point and predetermined time period determine shearing section, according to shearing section target video is sheared.Exemplary, can basis
Preset algorithm draws predetermined time period y, and the sound of matching is got at time point n, shearing section can be defined as [n-y,
N+y], the video in the shearing section is sheared.For example, the predetermined time period y got is 15 seconds, the time of acquisition
Point n is 1 minute and 20 seconds, then shears section as [1 point and 5 seconds, 1 point and 35 seconds], to 1 point of regarding between 35 seconds 5 seconds to 1 point of target video
Frequency content is sheared.
Further, target video can be sheared according to acquired time point, and generates multiple sub-videos, will be more
Individual sub- video-splicing turns into synthetic video.Exemplary, can basis when target person sound occurs multiple in target video
Acquired time point is repeatedly sheared to target video, generates a corresponding sub-video after shearing every time.To whole
After target video is sheared, multiple sub-videos can be subjected to splicing according to the order at time point, form synthetic video, can
Avoid user that the operation for opening video is performed a plurality of times when checking sub-video in the future.
Preferably, when carrying out shot operation to target video, if target person sound goes out occurrence in target video
The number time that is more or persistently occurring is longer, can obtain sectional drawing frequency, and according to sectional drawing frequency and acquired time point to mesh
Mark video and carry out sectional drawing, so as to control sectional drawing quantity.Wherein, sectional drawing frequency can be set or by user according to reality by system default
Situation sets itself.For example, acquired sectional drawing frequency be 30 seconds once, assign to 25 points 40 seconds, 30 points the 25 of target video
There is target person sound to 32 points, then an image can be intercepted at 30 seconds at 25 points, 30 points 30 seconds, 31 points, 31 points
30 seconds and 32 offices intercept an image respectively, and four pictures are obtained.
In the present embodiment, when the sound time of occurrence of target person compares concentration, in order to save scan matching when
Between, before step 102 is performed, the step of section to be scanned for obtaining target video can be increased, and when performing step 102,
Section to be scanned is scanned according to voiceprint, and obtains the time that the sound matched in section to be scanned with voiceprint occurs
Point.Wherein, the sweep interval can be selected by user.Specifically, can by be passed to the related period or by
Section to be scanned is specified in the progress of video, as user can be by pulling video playback progress bar from a time point to other one
Individual time point, using the section between two time points as section to be scanned.
The video interception or the method for shearing that the embodiment of the present invention one provides, according to the vocal print in the sample sound collected
Information scans target video, and obtains the time point that the sound matched with the voiceprint in target video occurs, according to obtaining
The time point taken carries out sectional drawing or shearing to target video.By using above-mentioned technical proposal, user is using mobile device
Carry out video interception or during shearing manipulation, may specify sample sound and target video, mobile device will be automatically according to from the sound
The voiceprint gathered in sound sample scans target video, and video image at voiceprint matching or video content are carried out
Interception, algorithm is simple and interception accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, it is manual without user
Interception position is selected, meets user's request.
Embodiment two
A kind of video interception or the schematic flow sheet of the method for shearing that Fig. 2 provides for the embodiment of the present invention two, this implementation
Example is optimized based on above-described embodiment, in the present embodiment, fractional scanning is carried out to target video, and according to acquired
Time point generate multiple period of the day from 11 p.m. to 1 a.m countershafts, target video is carried out further according to multiple period of the day from 11 p.m. to 1 a.m countershafts to be segmented sectional drawing or shearing.
Accordingly, the method for the present embodiment comprises the following steps:
Voiceprint in step 201, collection sample sound.
Step 202, according to voiceprint to target video carry out fractional scanning, and obtain in target video with voiceprint
The time point that the sound of matching occurs.
Exemplary, in order to accelerate sectional drawing or shearing process, the audio-frequency information in target video can be extracted, to institute
The audio-frequency information extracted carries out the multi-thread formula fractional scanning based on voiceprint.Time span can be based on by audio-frequency information
N section is divided into, while each part is scanned, and obtains the sound matched in audio-frequency information with voiceprint and occurs
Time point.For example, the time span of target video totally 30 minutes, can by the audio-frequency information extracted according to 0-10 minutes,
It is divided into 3 parts within -20 minutes 10 minutes and -30 minutes 20 minutes, matching is scanned simultaneously to this 3 parts.
Step 203, multiple period of the day from 11 p.m. to 1 a.m countershafts are generated according to acquired time point.
Exemplary, the time point according to acquired in each part of audio-frequency information generates N number of period of the day from 11 p.m. to 1 a.m countershaft.Further
, after generating a sub- time shaft, the period of the day from 11 p.m. to 1 a.m countershaft can be stored.
Step 204, according to the multiple period of the day from 11 p.m. to 1 a.m countershafts target video is carried out being segmented sectional drawing or shearing.
Exemplary, the time occurred due to the sound matched with voiceprint included in each part of audio-frequency information
The quantity of point is likely to different, and relatively long one may be taken during scan matching comprising the more part of time point quantity
A bit, therefore, the generation time of each period of the day from 11 p.m. to 1 a.m countershaft may also can be different.When sub- time of the sub- time shaft prior to other parts
When axle is generated, sectional drawing or shearing can be carried out to corresponding video section based on the period of the day from 11 p.m. to 1 a.m countershaft that this is first generated, without
Sectional drawing or shearing manipulation are performed again after all being generated etc. all period of the day from 11 p.m. to 1 a.m countershafts.
In the present embodiment, in order to further speed up sectional drawing or shearing process, can also increase before step 202 is performed
The step of obtaining the section to be scanned of target video, and when performing step 202, regarding for sweep interval is treated according to voiceprint
Frequency carries out fractional scanning.Wherein, the sweep interval can be selected by user.
The preferred embodiment that a kind of method using the embodiment of the present invention two carries out video interception is provided below:
For example, user needs to carry out video interception to a video A in mobile phone, it is therefore an objective to which interception includes personage's B sound
Scene.It is sectional drawing that user, which can first specify interception type, and the target video for then confirming to need to carry out sectional drawing is whole duration
Video A.User chooses a file (can be audio file or video file) comprising personage's B sound, Huo Zhe in mobile phone
A scene comprising personage's B sound is specified to gather the voiceprint in sample sound as sample sound in video A.It will regard
Frequency A is divided into N sections and carries out multithreading scan matching, when often generating a period of the day from 11 p.m. to 1 a.m countershaft for including personage's B voiceprints, according to
The period of the day from 11 p.m. to 1 a.m countershaft carries out sectional drawing to this section of video.The picture file generated after sectional drawing is named simultaneously with its time point in video A
It is stored in mobile phone EMS memory.If the scene that personage B occurs is more, in order to control sectional drawing quantity, sectional drawing frequency, such as 30s can be set
Sectional drawing of interior progress.
The preferred embodiment that a kind of method using the embodiment of the present invention two carries out video shearing is provided below:
For example, user needs to shear video A, it is therefore an objective to which shearing includes the scene of personage's C sound.Pass through microphone
One section of personage's C word is recorded, generates sample sound, gathers the voiceprint in sample sound.User can first specify interception class
Type is sheared for video, and it is 30-60 minutes then to specify section to be scanned.Video A section 30-60 minutes to be scanned are designated as area
Between D, by section D be divided into M sections carry out multithreading scan matching, often generate the sub- time for including personage's C voiceprints
During axle, corresponding video section is sheared according to the period of the day from 11 p.m. to 1 a.m countershaft.Assuming that respectively 35-37,45-48,54-56 minute this
Three periods include personage C vocal print characteristic, it will three sub-videos of generation, these three sub-videos can be spliced, it is raw
The synthetic video slightly larger into one, and stored.
The video interception or the method for shearing that the embodiment of the present invention two provides, target video can be carried out according to voiceprint
Fractional scanning, matching and the sectional drawing of multi-thread formula or shearing, it is substantially shorter corresponding to whole video interception or shear history
Time, user is quickly obtained wanting the picture or video of interception, further lift the usage experience of user.
Embodiment three
Fig. 3 is the structured flowchart of the device of a kind of video interception that the embodiment of the present invention three provides or shearing, and the device can
Realized by software and/or hardware, and be typically integrated in mobile device.As shown in figure 3, the device includes:Voiceprint gathers
Module 301, for gathering the voiceprint in sample sound;Videoscanning module 302, for being scanned according to the voiceprint
Target video, and obtain the time point that the sound matched in the target video with the voiceprint occurs;Sectional drawing or shearing
Module 303, for carrying out sectional drawing or shearing to the target video according to acquired time point.
The video interception or the device of shearing that the embodiment of the present invention three provides, are believed by videoscanning module 302 according to vocal print
The voiceprint in the sample sound that acquisition module 301 collects is ceased to scan target video, and is obtained in target video with being somebody's turn to do
The time point that the sound of voiceprint matching occurs, by sectional drawing or shear module 303 according to the time point of acquisition to target video
Carry out sectional drawing or shearing.By using above-mentioned technical proposal, user is carrying out video interception or shearing behaviour using mobile device
When making, sample sound and target video are may specify, mobile device will be believed automatically according to the vocal print gathered from the sample sound
Cease to scan target video, the video image at voiceprint matching or video content are intercepted, algorithm is simple and intercepts
Accuracy rate is high, and quick interception can be achieved.Whole operation process simple and fast, interception position is manually selected without user, meet to use
Family demand.
On the basis of above-described embodiment, the sectional drawing or shear module may include:Time shaft generation unit and first section
Figure or cut cells.Wherein, time shaft generation unit, for generating time shaft according to acquired time point;First sectional drawing or
Cut cells, for carrying out sectional drawing or shearing to target video according to time shaft.
On the basis of above-described embodiment, the videoscanning module may include fractional scanning unit, for according to vocal print
Information carries out fractional scanning to target video;The sectional drawing or shear module may include period of the day from 11 p.m. to 1 a.m countershaft generation unit and the second sectional drawing
Or cut cells.Wherein, period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershafts according to acquired time point;Second
Sectional drawing or cut cells, for being carried out being segmented sectional drawing or shearing to the target video according to the multiple period of the day from 11 p.m. to 1 a.m countershaft.
On the basis of above-described embodiment, the sectional drawing or shear module may include predetermined time period acquiring unit, cut
Cut interval determination unit and the 3rd sectional drawing or cut cells.Wherein, predetermined time period acquiring unit, for obtaining preset time
Length;Interval determination unit is sheared, for determining shearing section according to acquired time point and predetermined time period;3rd section
Figure or cut cells, for being sheared according to shearing section to target video.
On the basis of above-described embodiment, the sectional drawing or shear module may include that sub-video generation unit and splicing are single
Member.Wherein, sub-video generation unit, for being sheared according to acquired time point to target video, and more height are generated
Video;Concatenation unit, for the splicing of multiple sub-videos to be turned into synthetic video.
On the basis of above-described embodiment, the device may also include acquisition module, for scanning mesh according to voiceprint
Before marking video, the section to be scanned of target video is obtained;The sectional drawing or shear module can be specifically used for:According to voiceprint
Section to be scanned is scanned, and obtains the time point that the sound matched in section to be scanned with voiceprint occurs.
On the basis of above-described embodiment, the sectional drawing or shear module may include frequency acquisition unit and the 4th sectional drawing or
Cut cells.Wherein, frequency acquisition unit, for obtaining sectional drawing frequency;4th sectional drawing or cut cells, for being cut according to described
Figure frequency and acquired time point carry out sectional drawing to the target video.
Example IV
The embodiment of the present invention four provides a kind of mobile device, and the equipment includes the video interception described in the embodiment of the present invention
Or the device of shearing, video can be cut by performing the method for video interception described in the embodiment of the present invention or shearing
Figure or shearing manipulation.
It is exemplary, the mobile device concretely equipment such as mobile phone, tablet personal computer and notebook computer.Preferably,
The voice collection devices such as microphone are set in the mobile device.
When user carries out video interception or shearing manipulation in the mobile device provided using the embodiment of the present invention four, can refer to
Determine sample sound and target video, mobile device will scan mesh automatically according to the voiceprint gathered from the sample sound
Video is marked, the video image at voiceprint matching or video content are intercepted, algorithm is simple and interception accuracy rate is high, can
Realize quick interception.Whole operation process simple and fast, manually selects interception position without user, meets user's request.
Pay attention to, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that
The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes,
Readjust and substitute without departing from protection scope of the present invention.Therefore, although being carried out by above example to the present invention
It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also
Other more equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.
Claims (15)
1. a kind of video interception or the method for shearing, it is characterised in that including:
The voiceprint in sample sound is gathered, wherein, the sample sound is to specify mesh in reading section by reading user
Mark the acoustic information of personage;
Target video is scanned according to the voiceprint, and obtains the sound matched in the target video with the voiceprint
The time point of appearance;
Sectional drawing or shearing are carried out to the target video according to acquired time point.
2. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point
Sectional drawing or shearing, including:
Time shaft is generated according to acquired time point;
Sectional drawing or shearing are carried out to the target video according to the time shaft.
3. according to the method for claim 1, it is characterised in that
Target video is scanned according to the voiceprint, including:
Fractional scanning is carried out to target video according to the voiceprint;
Sectional drawing or shearing are carried out to the target video according to acquired time point, including:
Multiple period of the day from 11 p.m. to 1 a.m countershafts are generated according to acquired time point;
The target video is carried out according to the multiple period of the day from 11 p.m. to 1 a.m countershaft to be segmented sectional drawing or shearing.
4. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point
Shearing, including:
Obtain predetermined time period;
Shearing section is determined according to acquired time point and the predetermined time period;
The target video is sheared according to the shearing section.
5. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point
Shearing, including:
The target video is sheared according to acquired time point, and generates multiple sub-videos;
The splicing of the multiple sub-video is turned into synthetic video.
6. according to the method for claim 1, it is characterised in that before target video is scanned according to the voiceprint,
Also include:
Obtain the section to be scanned of target video;
Target video is scanned according to the voiceprint, and obtains the sound matched in the target video with the voiceprint
The time point of appearance, including:
The section to be scanned is scanned according to the voiceprint, and obtain in the section to be scanned with the voiceprint
The time point that the sound matched somebody with somebody occurs.
7. according to the method for claim 1, it is characterised in that the target video is carried out according to acquired time point
Sectional drawing, including:
Obtain sectional drawing frequency;
Sectional drawing is carried out to the target video according to the sectional drawing frequency and acquired time point.
8. a kind of video interception or the device of shearing, it is characterised in that including:
Voiceprint acquisition module, for gathering the voiceprint in sample sound, wherein, the voiceprint is to pass through reading
User specifies the acoustic information for reading target person in section;
Videoscanning module, for according to the voiceprint scan target video, and obtain in the target video with it is described
The time point that the sound of voiceprint matching occurs;
Sectional drawing or shear module, for carrying out sectional drawing or shearing to the target video according to acquired time point.
9. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Time shaft generation unit, for generating time shaft according to acquired time point;
First sectional drawing or cut cells, for carrying out sectional drawing or shearing to the target video according to the time shaft.
10. device according to claim 8, it is characterised in that
The videoscanning module includes:
Fractional scanning unit, for carrying out fractional scanning to target video according to the voiceprint;
The sectional drawing or shear module include:
Period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershafts according to acquired time point;
Second sectional drawing or cut cells, for according to the multiple period of the day from 11 p.m. to 1 a.m countershaft to the target video carry out be segmented sectional drawing or
Shearing.
11. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Predetermined time period acquiring unit, for obtaining predetermined time period;
Interval determination unit is sheared, for determining shearing section according to acquired time point and the predetermined time period;
3rd sectional drawing or cut cells, for being sheared according to the shearing section to the target video.
12. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Sub-video generation unit, for being sheared according to acquired time point to the target video, and generate more height
Video;
Concatenation unit, for the splicing of the multiple sub-video to be turned into synthetic video.
13. device according to claim 8, it is characterised in that also include:
Acquisition module, for before target video is scanned according to the voiceprint, obtaining the section to be scanned of target video;
The sectional drawing or shear module are specifically used for:
The section to be scanned is scanned according to the voiceprint, and obtain in the section to be scanned with the voiceprint
The time point that the sound matched somebody with somebody occurs.
14. device according to claim 8, it is characterised in that the sectional drawing or shear module include:
Frequency acquisition unit, for obtaining sectional drawing frequency;
4th sectional drawing or cut cells, for being carried out according to the sectional drawing frequency and acquired time point to the target video
Sectional drawing.
15. a kind of mobile device, it is characterised in that including the video interception as any one of claim 8-14 or shearing
Device.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510305097.1A CN104883607B (en) | 2015-06-05 | 2015-06-05 | A kind of video interception or the method, apparatus and mobile device of shearing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510305097.1A CN104883607B (en) | 2015-06-05 | 2015-06-05 | A kind of video interception or the method, apparatus and mobile device of shearing |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104883607A CN104883607A (en) | 2015-09-02 |
CN104883607B true CN104883607B (en) | 2017-12-19 |
Family
ID=53950914
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510305097.1A Active CN104883607B (en) | 2015-06-05 | 2015-06-05 | A kind of video interception or the method, apparatus and mobile device of shearing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104883607B (en) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105488227B (en) * | 2015-12-29 | 2019-09-20 | 惠州Tcl移动通信有限公司 | A kind of electronic equipment and its method that audio file is handled based on vocal print feature |
CN106131627B (en) * | 2016-07-07 | 2019-03-26 | 腾讯科技(深圳)有限公司 | A kind of method for processing video frequency, apparatus and system |
CN107688792B (en) * | 2017-09-05 | 2020-06-05 | 语联网(武汉)信息技术有限公司 | Video translation method and system |
CN107517406B (en) * | 2017-09-05 | 2020-02-14 | 语联网(武汉)信息技术有限公司 | Video editing and translating method |
CN107562737B (en) * | 2017-09-05 | 2020-12-22 | 语联网(武汉)信息技术有限公司 | Video segmentation method and system for translation |
CN107820123A (en) * | 2017-10-25 | 2018-03-20 | 深圳天珑无线科技有限公司 | Method, mobile terminal and the storage device of mobile terminal screen printing picture |
CN108419040A (en) * | 2018-02-28 | 2018-08-17 | 上海乐愚智能科技有限公司 | A kind of recording of growing up method, apparatus, robot and computer-readable medium |
CN110418159A (en) * | 2018-10-11 | 2019-11-05 | 彩云之端文化传媒(北京)有限公司 | A method of television content is intercepted across screen based on Application on Voiceprint Recognition |
CN109361956B (en) * | 2018-11-22 | 2021-04-30 | 广西巴多传媒科技有限公司 | Time-based video cropping methods and related products |
CN112312039A (en) * | 2019-07-15 | 2021-02-02 | 北京小米移动软件有限公司 | Audio and video information acquisition method, device, equipment and storage medium |
CN110572706B (en) * | 2019-09-29 | 2021-05-11 | 深圳传音控股股份有限公司 | Video screenshot method, terminal and computer-readable storage medium |
CN112468735B (en) * | 2021-01-26 | 2021-05-11 | 北京深蓝长盛科技有限公司 | Video processing system and video processing method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014161282A1 (en) * | 2013-07-15 | 2014-10-09 | 中兴通讯股份有限公司 | Method and device for adjusting playback progress of video file |
CN104185086A (en) * | 2014-03-28 | 2014-12-03 | 无锡天脉聚源传媒科技有限公司 | Method and device for providing video information |
CN104540004A (en) * | 2015-01-27 | 2015-04-22 | 深圳市中兴移动通信有限公司 | Video screenshot method and video screenshot device |
-
2015
- 2015-06-05 CN CN201510305097.1A patent/CN104883607B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014161282A1 (en) * | 2013-07-15 | 2014-10-09 | 中兴通讯股份有限公司 | Method and device for adjusting playback progress of video file |
CN104185086A (en) * | 2014-03-28 | 2014-12-03 | 无锡天脉聚源传媒科技有限公司 | Method and device for providing video information |
CN104540004A (en) * | 2015-01-27 | 2015-04-22 | 深圳市中兴移动通信有限公司 | Video screenshot method and video screenshot device |
Also Published As
Publication number | Publication date |
---|---|
CN104883607A (en) | 2015-09-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104883607B (en) | A kind of video interception or the method, apparatus and mobile device of shearing | |
CN104184923B (en) | System and method for retrieving people information in video | |
US10541003B2 (en) | Performance content synchronization based on audio | |
CN110139062B (en) | Video conference record creating method and device and terminal equipment | |
CN107615766A (en) | System and method for creating and distributing content of multimedia | |
US20130330062A1 (en) | Automatic creation of movie with images synchronized to music | |
CN109117233A (en) | Method and apparatus for handling information | |
CN103718166A (en) | Information processing apparatus, information processing method, and computer program product | |
CN112653902B (en) | Speaker recognition method and device and electronic equipment | |
JP2020126645A (en) | Information processing method, terminal device and information processing device | |
JP2010135925A (en) | Comment visualization device, and comment visualization program | |
KR20070007290A (en) | Tutorial generation unit | |
CN105657497B (en) | A kind of video broadcasting method and equipment | |
EP3962067A1 (en) | Method and device for adding lyrics to short video | |
CN113709527A (en) | Method and device for paying attention to anchor in multi-anchor scene | |
CN113316015A (en) | Bullet screen processing method, device and system | |
CN104954875B (en) | A kind of video playing progress monitoring method and device | |
CN103488529B (en) | A kind of method and apparatus for video resource access control | |
CN113038185B (en) | Bullet screen processing method and device | |
JP5941760B2 (en) | Display control device, display control method, display system, program, and recording medium | |
CN102142271B (en) | Handheld multimedia player for synchronously displaying waveform and repeating method | |
CN105187788B (en) | A kind of method and system of analog machine real-time data record and displaying | |
EP1940056A3 (en) | Broadcast receiving apparatus and method thereof | |
CN104185064B (en) | Media file identification method and apparatus | |
JP2010134681A (en) | Lecture material preparation support system, lecture material preparation support method and lecture material preparation support program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18 Patentee after: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd. Address before: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18 Patentee before: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd. |