CN104883607A - Video screenshot or clipping method, video screenshot or clipping device and mobile device - Google Patents

Video screenshot or clipping method, video screenshot or clipping device and mobile device Download PDF

Info

Publication number
CN104883607A
CN104883607A CN201510305097.1A CN201510305097A CN104883607A CN 104883607 A CN104883607 A CN 104883607A CN 201510305097 A CN201510305097 A CN 201510305097A CN 104883607 A CN104883607 A CN 104883607A
Authority
CN
China
Prior art keywords
target video
video
sectional drawing
time point
voiceprint
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510305097.1A
Other languages
Chinese (zh)
Other versions
CN104883607B (en
Inventor
刘黎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201510305097.1A priority Critical patent/CN104883607B/en
Publication of CN104883607A publication Critical patent/CN104883607A/en
Application granted granted Critical
Publication of CN104883607B publication Critical patent/CN104883607B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44012Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving rendering scenes according to scene graphs, e.g. MPEG-4 scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440245Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display the reformatting operation being performed only on part of the stream, e.g. a region of the image or a time segment

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The embodiment of the invention provides a video screenshot or clipping method, a video screenshot or clipping device and a mobile device. The method comprises the steps of acquiring voiceprint information in a voice sample; scanning target video according to the voiceprint information, and acquiring a time point when a voice matched with the voiceprint information appears in the target video; and carrying out screenshot or clipping on the target video according to the acquired time point. Through adopting the technical scheme provided by the embodiment of the invention, the voice sample and the target video can be designated when a user uses the mobile device to carry out a video screenshot or clipping operation, the mobile device can scan the target video automatically according to the voiceprint information acquired from the voice sample, a video image or the video content matched with the voiceprint information is intercepted, the algorithm is simple, the interception accuracy is high, and quick interception can be realized. The whole operating process is simple and convenient, the user does not need to manually select the interception position, and user requirements can be met.

Description

A kind of method of video interception or shearing, device and mobile device
Technical field
The embodiment of the present invention relates to image processing technique field, particularly relates to a kind of method of video interception or shearing, device and mobile device.
Background technology
People watch film, TV play or oneself shooting the video file such as short-movie time, usually can run into many interested scenes, as the character moulding, lines, scenery etc. that comprise in video.At this moment, people often want, by the mode of video interception or vide clip, these scenes are preserved into picture or short visual form, conveniently check in the future or use.
For many users, may be only interested in the scene relating to same personage, so will want to preserve the picture or short-sighted frequency that only relate to this personage.In order to comprehensively intercept all scenes comprising this personage in a video file, user needs constantly to carry out sectional drawing or shearing manipulation in viewing process, or in viewing process, record all time points comprising this personage's scene, carry out sectional drawing or shearing by Video processing software again, operating process is loaded down with trivial details, consuming time, accuracy is low.Especially for the user of mobile device viewing video accustomed to using, basic use finger carries out associative operation on the touchscreen, above-mentioned sectional drawing or cut mode can be more complicated, and be difficult to be truncated to the picture of anticipation or the short-sighted frequency in the anticipation time period all accurately at every turn, when intercepting inaccurate, need again the operation repeatedly carrying out video interception or shearing, for the use of user is made troubles.
Summary of the invention
The object of the invention is to propose a kind of method of video interception or shearing, device and mobile device, to solve existing video interception or cut mode complex operation step, consuming time, and the problem that accuracy is low.
First aspect, embodiments provides a kind of method of video interception or shearing, comprising:
Gather the voiceprint in sample sound;
According to described voiceprint scanning target video, and obtain the time point that the sound that mates with described voiceprint in described target video occurs;
According to obtained time point, sectional drawing or shearing are carried out to described target video.
Second aspect, embodiments provides the device of a kind of video interception or shearing, comprising:
Voiceprint acquisition module, for gathering the voiceprint in sample sound;
Videoscanning module, for according to described voiceprint scanning target video, and obtains the time point that the sound that mates with described voiceprint in described target video occurs;
Sectional drawing or shear module, for carrying out sectional drawing or shearing according to obtained time point to described target video.
The third aspect, embodiments provides a kind of mobile device, and this mobile device comprises the device of video interception in the embodiment of the present invention or shearing.
The video interception provided in the embodiment of the present invention or the method for shearing, device and mobile device, can simplify existing video interception or shearing manipulation, time saving and energy saving, and accuracy is high.The video interception provided in the embodiment of the present invention or the method for shearing, target video is scanned according to the voiceprint in the sample sound collected, and obtaining the time point that the sound that mates with this voiceprint in target video occurs, the time point according to obtaining carries out sectional drawing or shearing to target video.By adopting technique scheme, user is when using mobile device to carry out video interception or shearing manipulation, can specified voice sample and target video, mobile device will scan target video according to the voiceprint gathered from this sample sound automatically, the video image at voiceprint coupling place or video content are intercepted, algorithm is simple and intercepting accuracy rate is high, can realize quick intercepting.Whole operating process simple and fast, manually selects interception position without the need to user, meets consumers' demand.
Accompanying drawing explanation
The schematic flow sheet of the method for a kind of video interception that Fig. 1 provides for the embodiment of the present invention one or shearing;
The schematic flow sheet of the method for a kind of video interception that Fig. 2 provides for the embodiment of the present invention two or shearing;
The structured flowchart of the device of a kind of video interception that Fig. 3 provides for the embodiment of the present invention three or shearing.
Embodiment
Technical scheme of the present invention is further illustrated by embodiment below in conjunction with accompanying drawing.Be understandable that, specific embodiment described herein is only for explaining the present invention, but not limitation of the invention.It also should be noted that, for convenience of description, illustrate only part related to the present invention in accompanying drawing but not entire infrastructure.
Embodiment one
The schematic flow sheet of the method for a kind of video interception that Fig. 1 provides for the embodiment of the present invention one or shearing, the method can be performed by the device of video interception or shearing, and wherein this device by software and/or hardware implementing, and generally can be integrated in mobile device.As shown in Figure 1, the method comprises:
Step 101, the voiceprint gathered in sample sound.
Exemplary, described mobile device specifically can be the equipment such as mobile phone, panel computer and notebook computer.Those skilled in the art are known, the device performing method described in the present embodiment is not limited to and is integrated in mobile device, also accessible site is in other electronic equipments such as desktop computer, generally be integrated in mobile device is because the beneficial effect brought is more obvious, so the embodiment of the present invention is only described to be integrated in mobile device.
The method object that the present embodiment provides carries out sectional drawing or shearing to target video, exemplary, and described target video specifically can be the video files such as the short-movie of film, TV play or user oneself shooting, generally comprises multiple personage in target video.When scene corresponding when user occurs the moulding of one of them personage, lines or this personage etc. is interested, just can wishes that picture when occurring for this personage or fragment carry out sectional drawing or shearing, this personage can be defined as target person.
Exemplary, the source of the sample sound in this step can have multiple.Such as: can directly obtain the sound of target person by microphone and generate sample sound, this kind of mode be particularly useful for the situation that target video is user oneself shooting short-movie; Also the audio or video of target person sound is comprised to obtain sample sound by reading, when comprising multiple personage's sound in this audio or video, can be undertaken between appointment read area by user, certainly, also directly can find any one by user in target video comprises between the read area of target person sound, and the target person sound in the read area of being specified by reading user obtains sample sound.After obtaining sample sound, can store sample sound, conveniently directly transfer use in the future.
In the present embodiment, the biological characteristic of the sound that described vocal print is behaved, described voiceprint can comprise frequency, wavelength, intensity, the vocal print characteristic such as rhythm and tone of sound, identifies in one section of sound other sound whether comprising owner's word belonging to vocal print or send by voiceprint.In this step, the voiceprint in sample sound is gathered, preferably, also the voiceprint collected can be stored, conveniently directly transfer use in the future.
Step 102, according to voiceprint scanning target video, and obtain the time point that the sound that mates with voiceprint in target video occurs.
Exemplary, the scanning based on voiceprint can be carried out under target video is in broadcast state, in scanning process, the sound occurred in target video is mated with voiceprint, when occurred sound is consistent with voiceprint, illustrate that the match is successful, this sound is that target person sends, and obtains the time point that this sound occurs in target video; Also the audio-frequency information in target video can be extracted, scanning based on voiceprint is carried out to extracted audio-frequency information, in scanning process, audio-frequency information is mated with voiceprint, when certain a part of audio-frequency information is consistent with voiceprint, illustrate that the match is successful, obtain the time point that this part audio-frequency information is corresponding in target video.
Step 103, according to obtained time point, sectional drawing or shearing are carried out to target video.
Exemplary, video interception function setting option and vide clip function setting option can be added in a mobile device respectively, can be arranged voluntarily by user in above-mentioned two function setting options and open or close corresponding function.Preferably, also can arrange current intercepting type by user in video jukebox software is video interception or vide clip, or arrange support simultaneously above-mentioned two kinds intercept types.
Exemplary, when getting a time point in a step 102, directly can intercept the video image corresponding to this time point, and generate corresponding picture and store.Preferably, can automatically according to this time point be corresponding sectional drawing name, facilitate user to check in the future; When getting multiple time point continuously, illustrate and to occur at the sound go of a period of time internal object personage, video content in time period corresponding to multiple time point can be sheared, wherein, video content comprises the relevant information such as the display image of video and the audio frequency of correspondence, and the video file that can generate subsequently after corresponding shearing stores.
Exemplary, when getting a time point in a step 102, can store this time point, after waiting for that target video is scanned, according to the sequencing rise time axle of obtained time point, then according to time shaft, sectional drawing or shearing are carried out to target video.Wherein, the time shaft generated can be stored in mobile device, as random access memory (Random-Access Memory, RAM), calls when needs carry out sectional drawing or shear.Concrete, occur when a certain moment of sound in target video of target person, this moment may correspond to a point on time shaft, and as target person sound appears in 1 point of 20 seconds place at target video, then 1 point of 20 seconds place of time shaft is corresponding occurs a point; When continuing to occur in a certain period in target video of the sound of target person, this period may correspond to a line segment on time shaft, as continued to occur target person sound between 30 seconds to 1 point 50 seconds 1 point of target video, then on time shaft corresponding occur starting point be 1 point 30 seconds, terminal is 1 point of line segment of 50 seconds.When video interception function is in opening, the video image corresponding to the point on time shaft can be intercepted, also can video image corresponding in line segment be intercepted, and generate corresponding picture and store.When vide clip function is in opening, can shear video content corresponding in line segment, the video file that can generate subsequently after corresponding shearing stores.
Further, when carrying out shearing manipulation to target video, can predetermined time period be obtained, determining, between shear zone, to shear target video according between shear zone according to obtained time point and predetermined time period.Exemplary, predetermined time period y can be drawn according to preset algorithm, get the sound of coupling at time point n place, can will be defined as between shear zone [n-y, n+y], the video in this shear zone is sheared.Such as, the predetermined time period y got is 15 seconds, the time point n of acquisition be 1 point 20 seconds, be then [1 point and 5 seconds, 1 point and 35 seconds] between shear zone, 1 point of target video video content between 5 seconds to 1 point 35 seconds sheared.
Further, according to obtained time point, target video can be sheared, and generate multiple sub-video, multiple sub-video splicing is become synthetic video.Exemplary, when target person sound occurs repeatedly in target video, according to obtained time point, target video repeatedly can be sheared, the sub-video that after each shearing, generation one is corresponding.After shearing whole target video, according to the order of time point, multiple sub-video can be carried out splicing, form synthetic video, when user can be avoided to check sub-video, multiple exercise opens the operation of video in the future.
Preferably, when carrying out shot operation to target video, if the time of the more or lasting appearance of target person sound occurrence number in target video is longer, sectional drawing frequency can be obtained, and according to sectional drawing frequency and the time point obtained, sectional drawing is carried out to target video, thus control sectional drawing quantity.Wherein, sectional drawing frequency can be arranged by system default or by user according to actual conditions sets itself.Such as, the sectional drawing frequency obtained be 30 seconds once, assign to 25 points at 25 of target video within 40 seconds, 30, to assign to 32 points and all occurred target person sound, then can at 25 points of place's interceptings in 30 seconds image, intercept an image respectively 30 points 30 seconds, 31 points, 31 points 30 seconds and 32 offices, obtain four pictures altogether.
In the present embodiment, when the sound time of occurrence of target person is relatively concentrated, in order to save the time of scan matching, before execution step 102, the step in the interval to be scanned obtaining target video can be increased, and when performing step 102, scan interval to be scanned according to voiceprint, and obtain the time point that the sound that mates with voiceprint in interval to be scanned occurs.Wherein, described sweep interval can be selected by user.Concrete, by importing the relevant time period into or passing through to specify interval to be scanned in the progress of video, if user is by pulling video playback progress bar from a time point to another one time point, using the interval between two time points as interval to be scanned.
The method of the video interception that the embodiment of the present invention one provides or shearing, target video is scanned according to the voiceprint in the sample sound collected, and obtaining the time point that the sound that mates with this voiceprint in target video occurs, the time point according to obtaining carries out sectional drawing or shearing to target video.By adopting technique scheme, user is when using mobile device to carry out video interception or shearing manipulation, can specified voice sample and target video, mobile device will scan target video according to the voiceprint gathered from this sample sound automatically, the video image at voiceprint coupling place or video content are intercepted, algorithm is simple and intercepting accuracy rate is high, can realize quick intercepting.Whole operating process simple and fast, manually selects interception position without the need to user, meets consumers' demand.
Embodiment two
The schematic flow sheet of the method for a kind of video interception that Fig. 2 provides for the embodiment of the present invention two or shearing, the present embodiment is optimized based on above-described embodiment, in the present embodiment, fractional scanning is carried out to target video, and generate multiple period of the day from 11 p.m. to 1 a.m countershaft according to obtained time point, then according to multiple period of the day from 11 p.m. to 1 a.m countershaft, segmentation sectional drawing or shearing are carried out to target video.
Accordingly, the method for the present embodiment comprises the steps:
Step 201, the voiceprint gathered in sample sound.
Step 202, according to voiceprint, fractional scanning is carried out to target video, and obtain the time point that the sound that mates with voiceprint in target video occurs.
Exemplary, in order to accelerate sectional drawing or shearing process, the audio-frequency information in target video can be extracted, the multi-thread formula fractional scanning based on voiceprint is carried out to extracted audio-frequency information.Based on time span, audio-frequency information can be divided into N part, each part be scanned simultaneously, and obtain the time point that the sound that mates with voiceprint in audio-frequency information occurs.Such as, the time span of target video totally 30 minutes, can be divided into 3 parts by the audio-frequency information extracted according to 0-10 minute, 10 minutes-20 minutes and 20 minutes-30 minutes, carries out scan matching to these 3 parts simultaneously.
Step 203, generate multiple period of the day from 11 p.m. to 1 a.m countershaft according to obtained time point.
Exemplary, generate N number of period of the day from 11 p.m. to 1 a.m countershaft according to the time point that each part of audio-frequency information obtains.Further, after generating a sub-time shaft, can store this period of the day from 11 p.m. to 1 a.m countershaft.
Step 204, according to multiple period of the day from 11 p.m. to 1 a.m countershaft, segmentation sectional drawing or shearing are carried out to target video.
Exemplary, probably different from the quantity of the time point that the sound that voiceprint mates occurs due to what comprise in each part of audio-frequency information, comprising a fairly large number of part of time point may be consuming time relatively longer in the process of scan matching, therefore, the rise time of each period of the day from 11 p.m. to 1 a.m countershaft also may be different.When a sub-time shaft is generated prior to the period of the day from 11 p.m. to 1 a.m countershaft of other parts, the period of the day from 11 p.m. to 1 a.m countershaft that just first can generate based on this carries out sectional drawing or shearing to corresponding video section, and performs sectional drawing or shearing manipulation again after all period of the day from 11 p.m. to 1 a.m countershafts need not be waited all to generate.
In the present embodiment, in order to accelerate sectional drawing or shearing process further, also before execution step 202, the step in the interval to be scanned obtaining target video can be increased, and when performing step 202, the video treating sweep interval according to voiceprint carries out fractional scanning.Wherein, described sweep interval can be selected by user.
A kind of method applying the embodiment of the present invention two is provided to carry out the preferred embodiment of video interception below:
Such as, user needs to carry out video interception to the video A of in mobile phone, and object intercepts the scene comprising personage B sound.User first can specify and intercept type is sectional drawing, and then confirming needs the target video carrying out sectional drawing to be the video A of whole duration.User chooses the file (can be audio file or video file) that comprises personage B sound in mobile phone, or in video A, specify a scene comprising personage B sound as sample sound, gathers the voiceprint in sample sound.Video A is divided into N section and carries out multithreading scan matching, when often generation one comprises the period of the day from 11 p.m. to 1 a.m countershaft of personage B voiceprint, according to this period of the day from 11 p.m. to 1 a.m countershaft, sectional drawing is carried out to this section of video.The picture file generated after sectional drawing with its in video A time point name and stored in mobile phone EMS memory.If when the scene that personage B occurs is more, in order to control sectional drawing quantity, sectional drawing frequency can be arranged, as carried out a sectional drawing in 30s.
A kind of method applying the embodiment of the present invention two is provided to carry out the preferred embodiment of vide clip below:
Such as, user needs to shear video A, and object shears the scene comprising personage C sound.By microphone records one section of personage C word, generate sample sound, gather the voiceprint in sample sound.User first can specify and intercept type is vide clip, then specifies interval to be scanned to be 30-60 minute.The to be scanned interval 30-60 minute of video A is designated as interval D, interval D is divided into M section and carries out multithreading scan matching, when often generation one comprises the period of the day from 11 p.m. to 1 a.m countershaft of personage C voiceprint, according to this period of the day from 11 p.m. to 1 a.m countershaft, corresponding video section is sheared.Suppose to comprise the vocal print characteristic of personage C 35-37,45-48,54-56 minute this three time period respectively, three sub-videos will be generated, these three sub-videos can be spliced, generate a slightly large synthetic video, and store.
The method of the video interception that the embodiment of the present invention two provides or shearing, the fractional scanning of multi-thread formula, coupling and sectional drawing or shearing can be carried out to target video according to voiceprint, greatly can shorten the time corresponding to whole video interception or shear history, the picture making user obtain quickly wanting to intercept or video, promote the experience of user further.
Embodiment three
The structured flowchart of the device of a kind of video interception that Fig. 3 provides for the embodiment of the present invention three or shearing, this device by software and/or hardware implementing, and generally can be integrated in mobile device.As shown in Figure 3, this device comprises: voiceprint acquisition module 301, for gathering the voiceprint in sample sound; Videoscanning module 302, for according to described voiceprint scanning target video, and obtains the time point that the sound that mates with described voiceprint in described target video occurs; Sectional drawing or shear module 303, for carrying out sectional drawing or shearing according to obtained time point to described target video.
The device of the video interception that the embodiment of the present invention three provides or shearing, voiceprint in the sample sound collected according to voiceprint acquisition module 301 by videoscanning module 302 scans target video, and obtain the time point that the sound that mates with this voiceprint in target video occurs, according to the time point obtained, sectional drawing or shearing are carried out to target video by sectional drawing or shear module 303.By adopting technique scheme, user is when using mobile device to carry out video interception or shearing manipulation, can specified voice sample and target video, mobile device will scan target video according to the voiceprint gathered from this sample sound automatically, the video image at voiceprint coupling place or video content are intercepted, algorithm is simple and intercepting accuracy rate is high, can realize quick intercepting.Whole operating process simple and fast, manually selects interception position without the need to user, meets consumers' demand.
On the basis of above-described embodiment, described sectional drawing or shear module can comprise: time shaft generation unit and the first sectional drawing or cut cells.Wherein, time shaft generation unit, for according to obtained time point rise time axle; First sectional drawing or cut cells, for carrying out sectional drawing or shearing according to time shaft to target video.
On the basis of above-described embodiment, described videoscanning module can comprise fractional scanning unit, for carrying out fractional scanning according to voiceprint to target video; Described sectional drawing or shear module can comprise period of the day from 11 p.m. to 1 a.m countershaft generation unit and the second sectional drawing or cut cells.Wherein, period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershaft according to obtained time point; Second sectional drawing or cut cells, for carrying out segmentation sectional drawing or shearing according to described multiple period of the day from 11 p.m. to 1 a.m countershaft to described target video.
On the basis of above-described embodiment, described sectional drawing or shear module can comprise predetermined time period acquiring unit, shear interval determination unit and the 3rd sectional drawing or cut cells.Wherein, predetermined time period acquiring unit, for obtaining predetermined time period; Shear interval determination unit, for determining between shear zone according to obtained time point and predetermined time period; 3rd sectional drawing or cut cells, for shearing target video according between shear zone.
On the basis of above-described embodiment, described sectional drawing or shear module can comprise sub-video generation unit and concatenation unit.Wherein, sub-video generation unit, for shearing target video according to obtained time point, and generates multiple sub-video; Concatenation unit, for becoming synthetic video by multiple sub-video splicing.
On the basis of above-described embodiment, this device also can comprise acquisition module, for before scanning target video according to voiceprint, obtains the interval to be scanned of target video; Described sectional drawing or shear module can be specifically for: scan interval to be scanned according to voiceprint, and obtain the time point that the sound that mates with voiceprint in interval to be scanned occurs.
On the basis of above-described embodiment, described sectional drawing or shear module can comprise frequency acquisition unit and the 4th sectional drawing or cut cells.Wherein, frequency acquisition unit, for obtaining sectional drawing frequency; 4th sectional drawing or cut cells, for carrying out sectional drawing according to described sectional drawing frequency and the time point obtained to described target video.
Embodiment four
The embodiment of the present invention four provides a kind of mobile device, this equipment comprises the device of the video interception described in the embodiment of the present invention or shearing, carries out sectional drawing or shearing manipulation by the method performing the video interception described in the embodiment of the present invention or shearing to video.
Exemplary, described mobile device specifically can be the equipment such as mobile phone, panel computer and notebook computer.Preferably, in described mobile device, the voice collection device such as microphone are set.
When user carries out video interception or shearing manipulation at the mobile device using the embodiment of the present invention four to provide, can specified voice sample and target video, mobile device will scan target video according to the voiceprint gathered from this sample sound automatically, the video image at voiceprint coupling place or video content are intercepted, algorithm is simple and intercepting accuracy rate is high, can realize quick intercepting.Whole operating process simple and fast, manually selects interception position without the need to user, meets consumers' demand.
Note, above are only preferred embodiment of the present invention and institute's application technology principle.Skilled person in the art will appreciate that and the invention is not restricted to specific embodiment described here, various obvious change can be carried out for a person skilled in the art, readjust and substitute and can not protection scope of the present invention be departed from.Therefore, although be described in further detail invention has been by above embodiment, the present invention is not limited only to above embodiment, when not departing from the present invention's design, can also comprise other Equivalent embodiments more, and scope of the present invention is determined by appended right.

Claims (15)

1. a method for video interception or shearing, is characterized in that, comprising:
Gather the voiceprint in sample sound;
According to described voiceprint scanning target video, and obtain the time point that the sound that mates with described voiceprint in described target video occurs;
According to obtained time point, sectional drawing or shearing are carried out to described target video.
2. method according to claim 1, is characterized in that, carries out sectional drawing or shearing, comprising according to obtained time point to described target video:
According to obtained time point rise time axle;
According to described time shaft, sectional drawing or shearing are carried out to described target video.
3. method according to claim 1, is characterized in that,
According to described voiceprint scanning target video, comprising:
According to described voiceprint, fractional scanning is carried out to target video;
According to obtained time point, sectional drawing or shearing are carried out to described target video, comprising:
Multiple period of the day from 11 p.m. to 1 a.m countershaft is generated according to obtained time point;
According to described multiple period of the day from 11 p.m. to 1 a.m countershaft, segmentation sectional drawing or shearing are carried out to described target video.
4. method according to claim 1, is characterized in that, shears, comprising according to obtained time point to described target video:
Obtain predetermined time period;
Determine between shear zone according to obtained time point and described predetermined time period;
According between described shear zone, described target video is sheared.
5. method according to claim 1, is characterized in that, shears, comprising according to obtained time point to described target video:
According to obtained time point, described target video is sheared, and generate multiple sub-video;
Described multiple sub-video splicing is become synthetic video.
6. method according to claim 1, is characterized in that, before according to described voiceprint scanning target video, also comprises:
Obtain the interval to be scanned of target video;
According to described voiceprint scanning target video, and obtain the time point that the sound that mates with described voiceprint in described target video occurs, comprising:
According to the described interval to be scanned of described voiceprint scanning, and obtain the time point that the sound that mates with described voiceprint in described interval to be scanned occurs.
7. method according to claim 1, is characterized in that, carries out sectional drawing, comprising according to obtained time point to described target video:
Obtain sectional drawing frequency;
According to described sectional drawing frequency and the time point obtained, sectional drawing is carried out to described target video.
8. a device for video interception or shearing, is characterized in that, comprising:
Voiceprint acquisition module, for gathering the voiceprint in sample sound;
Videoscanning module, for according to described voiceprint scanning target video, and obtains the time point that the sound that mates with described voiceprint in described target video occurs;
Sectional drawing or shear module, for carrying out sectional drawing or shearing according to obtained time point to described target video.
9. device according to claim 8, is characterized in that, described sectional drawing or shear module comprise:
Time shaft generation unit, for according to obtained time point rise time axle;
First sectional drawing or cut cells, for carrying out sectional drawing or shearing according to described time shaft to described target video.
10. device according to claim 8, is characterized in that,
Described videoscanning module comprises:
Fractional scanning unit, for carrying out fractional scanning according to described voiceprint to target video;
Described sectional drawing or shear module comprise:
Period of the day from 11 p.m. to 1 a.m countershaft generation unit, for generating multiple period of the day from 11 p.m. to 1 a.m countershaft according to obtained time point;
Second sectional drawing or cut cells, for carrying out segmentation sectional drawing or shearing according to described multiple period of the day from 11 p.m. to 1 a.m countershaft to described target video.
11. devices according to claim 8, is characterized in that, described sectional drawing or shear module comprise:
Predetermined time period acquiring unit, for obtaining predetermined time period;
Shear interval determination unit, for determining between shear zone according to obtained time point and described predetermined time period;
3rd sectional drawing or cut cells, for shearing described target video according between described shear zone.
12. devices according to claim 8, is characterized in that, described sectional drawing or shear module comprise:
Sub-video generation unit, for shearing described target video according to obtained time point, and generates multiple sub-video;
Concatenation unit, for becoming synthetic video by described multiple sub-video splicing.
13. devices according to claim 8, is characterized in that, also comprise:
Acquisition module, for before scanning target video according to described voiceprint, obtains the interval to be scanned of target video;
Described sectional drawing or shear module specifically for:
According to the described interval to be scanned of described voiceprint scanning, and obtain the time point that the sound that mates with described voiceprint in described interval to be scanned occurs.
14. devices according to claim 8, is characterized in that, described sectional drawing or shear module comprise:
Frequency acquisition unit, for obtaining sectional drawing frequency;
4th sectional drawing or cut cells, for carrying out sectional drawing according to described sectional drawing frequency and the time point obtained to described target video.
15. 1 kinds of mobile devices, is characterized in that, comprise the device of video interception according to any one of claim 8-14 or shearing.
CN201510305097.1A 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing Active CN104883607B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510305097.1A CN104883607B (en) 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510305097.1A CN104883607B (en) 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing

Publications (2)

Publication Number Publication Date
CN104883607A true CN104883607A (en) 2015-09-02
CN104883607B CN104883607B (en) 2017-12-19

Family

ID=53950914

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510305097.1A Active CN104883607B (en) 2015-06-05 2015-06-05 A kind of video interception or the method, apparatus and mobile device of shearing

Country Status (1)

Country Link
CN (1) CN104883607B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105488227A (en) * 2015-12-29 2016-04-13 惠州Tcl移动通信有限公司 Electronic device and method for processing audio file based on voiceprint features through same
CN106131627A (en) * 2016-07-07 2016-11-16 腾讯科技(深圳)有限公司 A kind of method for processing video frequency, Apparatus and system
CN107517406A (en) * 2017-09-05 2017-12-26 语联网(武汉)信息技术有限公司 A kind of video clipping and the method for translation
CN107562737A (en) * 2017-09-05 2018-01-09 语联网(武汉)信息技术有限公司 A kind of methods of video segmentation and its system for being used to translate
CN107688792A (en) * 2017-09-05 2018-02-13 语联网(武汉)信息技术有限公司 A kind of video interpretation method and its system
CN107820123A (en) * 2017-10-25 2018-03-20 深圳天珑无线科技有限公司 Method, mobile terminal and the storage device of mobile terminal screen printing picture
CN108419040A (en) * 2018-02-28 2018-08-17 上海乐愚智能科技有限公司 A kind of recording of growing up method, apparatus, robot and computer-readable medium
CN109361956A (en) * 2018-11-22 2019-02-19 深圳艺达文化传媒有限公司 Time-based video cutting method and Related product
CN110418159A (en) * 2018-10-11 2019-11-05 彩云之端文化传媒(北京)有限公司 A method of television content is intercepted across screen based on Application on Voiceprint Recognition
CN110572706A (en) * 2019-09-29 2019-12-13 深圳传音控股股份有限公司 Video screenshot method, terminal and computer-readable storage medium
CN112312039A (en) * 2019-07-15 2021-02-02 北京小米移动软件有限公司 Audio and video information acquisition method, device, equipment and storage medium
CN112468735A (en) * 2021-01-26 2021-03-09 北京深蓝长盛科技有限公司 Video processing system and video processing method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014161282A1 (en) * 2013-07-15 2014-10-09 中兴通讯股份有限公司 Method and device for adjusting playback progress of video file
CN104185086A (en) * 2014-03-28 2014-12-03 无锡天脉聚源传媒科技有限公司 Method and device for providing video information
CN104540004A (en) * 2015-01-27 2015-04-22 深圳市中兴移动通信有限公司 Video screenshot method and video screenshot device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014161282A1 (en) * 2013-07-15 2014-10-09 中兴通讯股份有限公司 Method and device for adjusting playback progress of video file
CN104185086A (en) * 2014-03-28 2014-12-03 无锡天脉聚源传媒科技有限公司 Method and device for providing video information
CN104540004A (en) * 2015-01-27 2015-04-22 深圳市中兴移动通信有限公司 Video screenshot method and video screenshot device

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105488227A (en) * 2015-12-29 2016-04-13 惠州Tcl移动通信有限公司 Electronic device and method for processing audio file based on voiceprint features through same
CN106131627A (en) * 2016-07-07 2016-11-16 腾讯科技(深圳)有限公司 A kind of method for processing video frequency, Apparatus and system
CN106131627B (en) * 2016-07-07 2019-03-26 腾讯科技(深圳)有限公司 A kind of method for processing video frequency, apparatus and system
CN107688792A (en) * 2017-09-05 2018-02-13 语联网(武汉)信息技术有限公司 A kind of video interpretation method and its system
CN107562737A (en) * 2017-09-05 2018-01-09 语联网(武汉)信息技术有限公司 A kind of methods of video segmentation and its system for being used to translate
CN107517406A (en) * 2017-09-05 2017-12-26 语联网(武汉)信息技术有限公司 A kind of video clipping and the method for translation
CN107517406B (en) * 2017-09-05 2020-02-14 语联网(武汉)信息技术有限公司 Video editing and translating method
CN107688792B (en) * 2017-09-05 2020-06-05 语联网(武汉)信息技术有限公司 Video translation method and system
CN107820123A (en) * 2017-10-25 2018-03-20 深圳天珑无线科技有限公司 Method, mobile terminal and the storage device of mobile terminal screen printing picture
CN108419040A (en) * 2018-02-28 2018-08-17 上海乐愚智能科技有限公司 A kind of recording of growing up method, apparatus, robot and computer-readable medium
CN110418159A (en) * 2018-10-11 2019-11-05 彩云之端文化传媒(北京)有限公司 A method of television content is intercepted across screen based on Application on Voiceprint Recognition
CN109361956A (en) * 2018-11-22 2019-02-19 深圳艺达文化传媒有限公司 Time-based video cutting method and Related product
CN112312039A (en) * 2019-07-15 2021-02-02 北京小米移动软件有限公司 Audio and video information acquisition method, device, equipment and storage medium
CN110572706A (en) * 2019-09-29 2019-12-13 深圳传音控股股份有限公司 Video screenshot method, terminal and computer-readable storage medium
CN112468735A (en) * 2021-01-26 2021-03-09 北京深蓝长盛科技有限公司 Video processing system and video processing method
CN112468735B (en) * 2021-01-26 2021-05-11 北京深蓝长盛科技有限公司 Video processing system and video processing method

Also Published As

Publication number Publication date
CN104883607B (en) 2017-12-19

Similar Documents

Publication Publication Date Title
CN104883607A (en) Video screenshot or clipping method, video screenshot or clipping device and mobile device
US11030987B2 (en) Method for selecting background music and capturing video, device, terminal apparatus, and medium
US11474779B2 (en) Method and apparatus for processing information
CN109429075A (en) A kind of live content processing method, device and system
US20100309284A1 (en) Systems and methods for dynamically displaying participant activity during video conferencing
CN110708607B (en) Live broadcast interaction method and device, electronic equipment and storage medium
WO2014178219A1 (en) Information processing device and information processing method
GB2600309A (en) Video processing method and apparatus, and electronic device and storage medium
CN103207728A (en) Method Of Providing Augmented Reality And Terminal Supporting The Same
CN112653902B (en) Speaker recognition method and device and electronic equipment
US10341727B2 (en) Information processing apparatus, information processing method, and information processing program
CN104080006B (en) A kind of video process apparatus and method
US11615816B2 (en) Method and device for adding lyrics to short video
CN117255211A (en) Live broadcast room display method, server side and live broadcast client side
CN110955471B (en) Notification message display method, notification message display device, terminal and storage medium
CN110784751A (en) Information display method and device
WO2019047850A1 (en) Identifier displaying method and device, request responding method and device
CN110035318A (en) Video broadcasting method, device and multimedia data playing method
WO2023241527A1 (en) Live stream processing method and apparatus, device, and storage medium
CN110868632B (en) Video processing method and device, storage medium and electronic equipment
CN113038185B (en) Bullet screen processing method and device
CN103702218A (en) Video playing method and device
CN111435525A (en) Reading plan determining method, device, equipment, server and storage medium
WO2017067192A1 (en) Program list acquiring method, server, client, and acquiring system
CN112911351B (en) Video tutorial display method, device, system and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Patentee after: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.

Address before: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Patentee before: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.