CN109905615B - Full-automatic cooperation method for audio playing and video recording - Google Patents

Full-automatic cooperation method for audio playing and video recording

Publication number
CN109905615B
Authority
CN
China
Prior art keywords
playing
tone
played
frequency
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910125136.8A
Other languages
Chinese (zh)
Other versions
CN109905615A (en)
Inventor
陆成刚
吴兵
陈刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT
Priority to CN201910125136.8A
Publication of CN109905615A
Application granted
Publication of CN109905615B

Landscapes

  • Television Signal Processing For Recording (AREA)

Abstract

A fully automatic cooperation method for audio playback and video recording comprises the following steps: 1) first, the time interval between played sentences is set to T, and a special marker tone of duration T is played in each interval between sentences; 2) a microphone is started, and signal analysis is performed on the sound it receives; if the special marker tone is identified, video image capture is stopped until the marker tone finishes playing; 3) after the marker tone finishes, the next sentence is played, image capture resumes, and the captured images and the currently played speech segment are stored synchronously in a video file. The method simplifies operation and reduces cost.

Description

Full-automatic cooperation method for audio playing and video recording
Technical Field
The invention belongs to the field of multimedia processing, and relates to a method for cooperation of audio playing and video recording.
Background
Producing a teaching or training demonstration video normally requires filming the entire demonstration, then editing the footage extensively with video editing software in post-production and removing the original sound so that the video can be dubbed again. Alternatively, the video can be shot under voice guidance (no sound is recorded at this stage), with the speech being played and the images being captured stored synchronously in the video file, so that the teaching video is generated directly and the inefficient post-production workflow is avoided to a certain extent. In such a solution, the speech is synthesized by feeding a previously edited demonstration script into a TTS engine. The solution requires audio playback and video image capture to cooperate: generally, after one speech segment has been played, video capture must pause until the next segment starts playing, at which point the person being filmed begins the corresponding action. In this synchronized "playback and capture" mode, the photographer has to provide the synchronization control, that is, during the pause after the current segment finishes, the photographer starts the next segment once the person being filmed is ready. This mode of cooperation is clearly not automatic: it needs a dedicated operator (the photographer), is cumbersome, and is costly to carry out.
Disclosure of Invention
In order to overcome the defects of troublesome operation and higher cost of the conventional audio playing and video recording cooperation method, the invention provides a full-automatic cooperation method for audio playing and video recording, which simplifies the operation and reduces the cost.
The technical scheme adopted by the invention for solving the technical problems is as follows:
A fully automatic cooperation method for audio playback and video recording comprises the following steps:
1) first, the time interval between played sentences is set to T, and a special marker tone of duration T is played in each interval between sentences;
2) a microphone is started, and signal analysis is performed on the sound it receives; if the special marker tone is identified, video image capture is stopped until the marker tone finishes playing;
3) after the marker tone finishes, the next sentence is played, image capture resumes, and the captured images and the currently played speech segment are stored synchronously in a video file.
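As an illustration only, the timing implied by steps 1) to 3) can be laid out as a simple timeline. The sketch below is not taken from the patent: the names Segment, build_timeline, and recorded_duration are hypothetical, and the sentence durations are assumed to be known in advance (for example, from the synthesized speech).

```python
from dataclasses import dataclass


@dataclass
class Segment:
    kind: str       # "sentence" or "marker"
    start: float    # seconds from the start of playback
    end: float
    capture: bool   # True while video frames are being captured


def build_timeline(sentence_durations, gap_seconds):
    """Lay out sentences separated by marker tones of length gap_seconds (T)."""
    timeline = []
    t = 0.0
    for i, duration in enumerate(sentence_durations):
        if i > 0:
            # The marker tone fills the gap; capture is paused for its duration.
            timeline.append(Segment("marker", t, t + gap_seconds, False))
            t += gap_seconds
        # A sentence is playing; capture is active and stored with the audio.
        timeline.append(Segment("sentence", t, t + duration, True))
        t += duration
    return timeline


def recorded_duration(timeline):
    """Total length of the stored video: only the capture-enabled segments."""
    return sum(seg.end - seg.start for seg in timeline if seg.capture)
```

With three sentences and T = 3, the stored video contains only the sentence segments, while the two marker intervals are excluded from capture.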
Further, in step 1), the special marker tone played is a DTMF (dual-tone multi-frequency) tone of a specified frequency.
Still further, in step 1), T random integers between 0 and 9 are first generated, and the DTMF tones corresponding to these T integers are played in sequence, each for 1 second.
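A minimal sketch of this marker generation, under stated assumptions, is given below. The frequency pairs are the standard DTMF assignments for the digits 0 to 9; the 8000 Hz sample rate, the amplitude, and the function names dtmf_tone and marker_sequence are illustrative choices that do not appear in the patent.

```python
import math
import random

# Standard DTMF (low, high) frequency pairs in Hz for the digits 0-9.
DTMF_FREQS = {
    0: (941, 1336), 1: (697, 1209), 2: (697, 1336), 3: (697, 1477),
    4: (770, 1209), 5: (770, 1336), 6: (770, 1477),
    7: (852, 1209), 8: (852, 1336), 9: (852, 1477),
}


def dtmf_tone(digit, seconds=1.0, sample_rate=8000, amplitude=0.5):
    """Return PCM samples (floats in [-amplitude, amplitude]) for one digit."""
    low, high = DTMF_FREQS[digit]
    n = int(seconds * sample_rate)
    return [
        amplitude * 0.5 * (math.sin(2 * math.pi * low * i / sample_rate)
                           + math.sin(2 * math.pi * high * i / sample_rate))
        for i in range(n)
    ]


def marker_sequence(gap_seconds, rng=None, sample_rate=8000):
    """Draw T = gap_seconds random digits and concatenate their 1-second tones."""
    rng = rng or random.Random()
    digits = [rng.randint(0, 9) for _ in range(gap_seconds)]
    samples = []
    for d in digits:
        samples.extend(dtmf_tone(d, 1.0, sample_rate))
    return digits, samples
```

Because each digit lasts exactly one second, the marker occupies T seconds in total, matching the interval parameter of step 1).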
Further, in step 2), the spectral stability of DTMF tones is exploited: when a fixed pair of frequency peaks appears in the frequency domain for a set number of consecutive frames, the current sound is identified as containing the DTMF tone of a particular digit.
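One possible way to realize this check is sketched below. The patent only states that a fixed pair of peaks must persist over a set number of frames; the Goertzel recurrence used here to measure the power at each DTMF frequency is a substituted, commonly used technique, and the frame length, the dominance threshold, and the names goertzel_power, frame_digit, and detect_marker are assumptions made for illustration.

```python
import math

LOW_FREQS = (697, 770, 852, 941)
HIGH_FREQS = (1209, 1336, 1477, 1633)
KEYPAD = {
    (697, 1209): "1", (697, 1336): "2", (697, 1477): "3",
    (770, 1209): "4", (770, 1336): "5", (770, 1477): "6",
    (852, 1209): "7", (852, 1336): "8", (852, 1477): "9",
    (941, 1336): "0",
}


def goertzel_power(frame, freq, sample_rate):
    """Squared spectral magnitude of the frame at one frequency (Goertzel)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / sample_rate)
    s_prev, s_prev2 = 0.0, 0.0
    for x in frame:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev2 ** 2 + s_prev ** 2 - coeff * s_prev * s_prev2


def frame_digit(frame, sample_rate=8000, dominance=0.5):
    """Return the digit whose two tones dominate this frame, else None."""
    # Scale the frame energy so that a pure two-tone frame gives a ratio near 1.
    total = sum(x * x for x in frame) * len(frame) / 2.0
    if total <= 0.0:
        return None
    low = max(LOW_FREQS, key=lambda f: goertzel_power(frame, f, sample_rate))
    high = max(HIGH_FREQS, key=lambda f: goertzel_power(frame, f, sample_rate))
    peak = (goertzel_power(frame, low, sample_rate)
            + goertzel_power(frame, high, sample_rate))
    if peak < dominance * total:
        return None  # the energy is spread out: speech, music, or noise
    return KEYPAD.get((low, high))


def detect_marker(samples, sample_rate=8000, frame_len=200, min_frames=5):
    """Return (frame_index, digit) each time a digit persists for min_frames frames."""
    events = []
    last, run = None, 0
    for idx in range(0, len(samples) - frame_len + 1, frame_len):
        digit = frame_digit(samples[idx:idx + frame_len], sample_rate)
        if digit is None:
            run = 0
        elif digit == last:
            run += 1
        else:
            run = 1
        last = digit
        if run == min_frames:
            events.append((idx // frame_len, digit))
    return events
```

Requiring the same digit over several consecutive frames is what rejects short, accidental matches: a frame of speech may briefly resemble a tone pair, but it rarely holds the same two peaks for long.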
The technical concept of the invention is as follows: the loudspeaker plays the special marker tone during the interval between speech segments; once the microphone picks up the marker tone, the automatic control system recognizes that playback is in an interval between segments and pauses image capture and recording. Meanwhile, the start and end of the marker tone serve as cues that let the person being filmed prepare the transition to the next demonstration action, so that the action stays synchronized with the narration.
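The control behaviour described in this paragraph can be pictured as a small two-state gate. The sketch below is illustrative only: the class name CaptureGate, the per-frame boolean input, and the use of short streaks to debounce the decision are assumptions; the patent does not specify how the end of the marker tone is determined.

```python
class CaptureGate:
    """Toggle video capture from per-frame marker detections (hypothetical helper)."""

    def __init__(self, on_frames=3, off_frames=3):
        self.on_frames = on_frames      # marker frames needed to pause capture
        self.off_frames = off_frames    # marker-free frames needed to resume
        self.capturing = True
        self._streak = 0                # consecutive frames contradicting the state

    def update(self, marker_present):
        """Feed one analysis frame; return True if capture should be active."""
        # A marker while capturing, or silence while paused, argues for a switch.
        disagrees = marker_present == self.capturing
        self._streak = self._streak + 1 if disagrees else 0
        needed = self.on_frames if self.capturing else self.off_frames
        if self._streak >= needed:
            self.capturing = not self.capturing
            self._streak = 0
        return self.capturing
```

In use, each audio frame from the microphone would be classified as marker or non-marker, and the returned flag would decide whether the current video frame is written to the file.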
In the age of smartphones, the native video recording apps of mobile phone systems have largely replaced digital cameras and DV camcorders, and video-based apps keep emerging. They fall mainly into three categories: first, shooting and recording; second, editing; third, special effects. The present scheme is mainly a method for scheduling, controlling, and coordinating the two media streams, audio and image, when recording and (post-production) editing are integrated.
The invention has the following beneficial effects: operation is simplified and cost is reduced; moreover, the fully automatic, self-adaptive cooperation mode even makes unattended, fully automatic shooting and production of teaching videos possible.
Drawings
FIG. 1 is a functional block diagram of a fully automated collaborative method of audio playback and video recording.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to FIG. 1, a fully automatic cooperation method for audio playback and video recording includes the following steps:
1) first, the time interval between played sentences is set to T (in seconds, T being an integer), and a special marker tone of duration T is played in each interval between sentences;
2) a microphone is started, and signal analysis is performed on the sound it receives; if the special marker tone is identified, video image capture is stopped until the marker tone finishes playing;
3) after the marker tone finishes, the next sentence is played, image capture resumes, and the captured images and the currently played speech segment are stored synchronously in a video file.
Further, in step 1), the special marker tone played is a DTMF (dual-tone multi-frequency) tone of a specified frequency.
Still further, in step 1), T random integers between 0 and 9 are first generated, and the DTMF tones corresponding to these T integers are played in sequence, each for 1 second.
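For reference, the one-second tone for a digit d can be written as the sum of two sinusoids. The frequency table below is the standard DTMF assignment for the digit keys; the amplitude A and the sampling rate f_s are illustrative symbols that the patent does not define.

```latex
\[
x_d[n] = A\,\sin\!\left(\frac{2\pi f_{\mathrm{low}}(d)\,n}{f_s}\right)
       + A\,\sin\!\left(\frac{2\pi f_{\mathrm{high}}(d)\,n}{f_s}\right),
\qquad n = 0, 1, \dots, f_s - 1
\]

\[
\begin{array}{c|ccc}
                 & 1209\ \mathrm{Hz} & 1336\ \mathrm{Hz} & 1477\ \mathrm{Hz} \\ \hline
697\ \mathrm{Hz} & 1 & 2 & 3 \\
770\ \mathrm{Hz} & 4 & 5 & 6 \\
852\ \mathrm{Hz} & 7 & 8 & 9 \\
941\ \mathrm{Hz} &   & 0 &
\end{array}
\]
```

Each digit is identified by its row frequency f_low(d) and column frequency f_high(d), so a receiver only needs to find which pair of these frequencies carries the energy.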
Furthermore, in step 2), the spectral stability of DTMF tones is exploited: when a fixed pair of frequency peaks appears in the frequency domain for a set number of consecutive frames, the DTMF tone of a particular digit can be identified in the current sound. The method is robust to interference: DTMF tone recognition is not affected by background music, speech, or noise.

Claims (4)

1. A full-automatic collaboration method for audio playing and video recording is characterized by comprising the following steps:
1) first, setting the time interval parameter between played sentences to T, and playing a special marker tone of duration T in each interval between the played sentences;
2) starting a microphone, and performing signal analysis on the sound received by the microphone; if the special marker tone is identified, stopping the capture of video images until the marker tone finishes playing;
3) after the marker tone finishes playing, playing the next sentence, resuming image capture, and synchronously storing the captured images and the currently played speech segment in a video file.
2. The method as claimed in claim 1, wherein the special marker tone played in step 1) is a DTMF dual tone multi-frequency tone of a specified frequency.
3. The method as claimed in claim 2, wherein in step 1), T random integers between 0 and 9 are first generated, and DTMF dual-tone multi-frequency tones corresponding to the T integers are played in sequence, wherein the sound corresponding to each integer lasts for 1 second.
4. The method as claimed in claim 2 or 3, wherein in step 2), the spectral stability of the DTMF dual-tone multi-frequency tone is used: when a fixed dual-frequency peak appears in the frequency domain for a set number of consecutive frames, the current sound is identified as containing the DTMF dual-tone multi-frequency tone corresponding to a certain digit.
CN201910125136.8A 2019-02-20 2019-02-20 Full-automatic cooperation method for audio playing and video recording Active CN109905615B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910125136.8A CN109905615B (en) 2019-02-20 2019-02-20 Full-automatic cooperation method for audio playing and video recording

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910125136.8A CN109905615B (en) 2019-02-20 2019-02-20 Full-automatic cooperation method for audio playing and video recording

Publications (2)

Publication Number Publication Date
CN109905615A (en) 2019-06-18
CN109905615B (en) 2021-02-26

Family

ID=66945104

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910125136.8A Active CN109905615B (en) 2019-02-20 2019-02-20 Full-automatic cooperation method for audio playing and video recording

Country Status (1)

Country Link
CN (1) CN109905615B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102932623A (en) * 2011-06-13 2013-02-13 沃克斯国际公司 Capture, syncing and playback of audio data and image data
CN204516158U (en) * 2015-03-26 2015-07-29 重庆大学 A kind of graphics diversity line segment and analysis straight-line segment apparatus for demonstrating
CN109005359A (en) * 2018-10-31 2018-12-14 广州酷狗计算机科技有限公司 video recording method, device storage medium
CN109274900A (en) * 2018-09-05 2019-01-25 浙江工业大学 A kind of video dubbing method

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011198348A (en) * 2010-02-24 2011-10-06 Sanyo Electric Co Ltd Sound recording device
CN103208298A (en) * 2012-01-11 2013-07-17 三星电子(中国)研发中心 Video shooting method and system
US9407858B2 (en) * 2014-01-20 2016-08-02 Custom Solutions Group, LLC Voiceover system and method
CN104104910B (en) * 2014-06-26 2018-04-17 北京小鱼在家科技有限公司 It is a kind of to carry out two-way live shared terminal and method with intelligent monitoring
EP3427261A4 (en) * 2016-03-10 2019-10-23 Axon Enterprise, Inc. Audio watermark and synchronization tones for recording devices
CN105959773B (en) * 2016-04-29 2019-06-18 魔方天空科技(北京)有限公司 The treating method and apparatus of multimedia file

Also Published As

Publication number Publication date
CN109905615A (en) 2019-06-18

Similar Documents

Publication Publication Date Title
CN105959773B (en) The treating method and apparatus of multimedia file
KR20150057591A (en) Method and apparatus for controlling playing video
CN109274900A (en) A kind of video dubbing method
CN110691204B (en) Audio and video processing method and device, electronic equipment and storage medium
CN1961350A (en) Method of and system for modifying messages
WO2005099251A1 (en) Video-audio synchronization
CN105828101A (en) Method and device for generation of subtitles files
WO2018045703A1 (en) Voice processing method, apparatus and terminal device
WO2013024704A1 (en) Image-processing device, method, and program
CN103208298A (en) Video shooting method and system
CN111294463A (en) Intelligent response method, system and device
CN110933485A (en) Video subtitle generating method, system, device and storage medium
CN109905615B (en) Full-automatic cooperation method for audio playing and video recording
CN106326804B (en) Recording control method and device
US8615153B2 (en) Multi-media data editing system, method and electronic device using same
JP5727777B2 (en) Conference support apparatus and conference support method
CN114120969A (en) Method and system for testing voice recognition function of intelligent terminal and electronic equipment
WO2016125362A1 (en) Information processing device, information processing system, information processing method, and program
KR20160129787A (en) A Method Generating Transcripts Of Digital Recording File
CN109587543B (en) Audio synchronization method and apparatus and storage medium
CN111105816A (en) Man-machine interactive software screen recording method
CN108269597B (en) Audio workstation management method and system
CN106060394B (en) A kind of photographic method, device and terminal device
KR100936830B1 (en) Relay system of video conference
JP2005352330A (en) Speech division recording device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20190618

Assignee: Hangzhou Ruiboqifan Enterprise Management Co.,Ltd.

Assignor: ZHEJIANG UNIVERSITY OF TECHNOLOGY

Contract record no.: X2022330000903

Denomination of invention: A fully automatic cooperative method for audio playback and video recording

Granted publication date: 20210226

License type: Common License

Record date: 20221228

Application publication date: 20190618

Assignee: Hangzhou Anfeng Jiyue Cultural Creativity Co.,Ltd.

Assignor: ZHEJIANG UNIVERSITY OF TECHNOLOGY

Contract record no.: X2022330000901

Denomination of invention: A fully automatic cooperative method for audio playback and video recording

Granted publication date: 20210226

License type: Common License

Record date: 20221228

Application publication date: 20190618

Assignee: Zhejiang Yu'an Information Technology Co.,Ltd.

Assignor: ZHEJIANG UNIVERSITY OF TECHNOLOGY

Contract record no.: X2022330000897

Denomination of invention: A fully automatic cooperative method for audio playback and video recording

Granted publication date: 20210226

License type: Common License

Record date: 20221228

Application publication date: 20190618

Assignee: Hangzhou Yuxuansheng Lighting Technology Co.,Ltd.

Assignor: ZHEJIANG UNIVERSITY OF TECHNOLOGY

Contract record no.: X2022330000929

Denomination of invention: A fully automatic cooperative method for audio playback and video recording

Granted publication date: 20210226

License type: Common License

Record date: 20221229

EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20190618

Assignee: Taizhou Linhai Xinxing Safety Technology Training Co.,Ltd.

Assignor: ZHEJIANG UNIVERSITY OF TECHNOLOGY

Contract record no.: X2023980047386

Denomination of invention: A fully automated collaborative method for audio playback and video recording

Granted publication date: 20210226

License type: Common License

Record date: 20231117
