WO2017000554A1 - Audio and video file generation method, apparatus and system - Google Patents

Audio and video file generation method, apparatus and system

Info

Publication number
WO2017000554A1
Authority
WO
WIPO (PCT)
Prior art keywords
recording device
photographing
voice recording
voice
clock
Prior art date
Application number
PCT/CN2016/072811
Other languages
French (fr)
Chinese (zh)
Inventor
高翔
谢灿豪
Original Assignee
高翔
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 高翔
Publication of WO2017000554A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/414 Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/4147 PVR [Personal Video Recorder]
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302 Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307 Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4398 Processing of audio elementary streams involving reformatting operations of audio signals
    • H04N5/00 Details of television systems
    • H04N5/76 Television signal recording
    • H04N5/91 Television signal processing therefor
    • H04N5/92 Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback

Definitions

  • Embodiments of the present invention relate to the field of communications technologies, and in particular to a method, an apparatus, and a system for generating an audio and video file.
  • Existing electronic products with shooting functions include cameras, tablet computers, and mobile phones.
  • The photographer holds the electronic product, that is, the photographing device, and uses its video recording function and voice recording function to record the picture and voice of the subject at the same time, that is, to produce an audio and video file.
  • However, when the subject is in motion and far from the photographer, the photographing device can record only the subject's picture and cannot clearly record the subject's voice, which results in poor voice quality in the recorded audio and video file.
  • Embodiments of the present invention provide a method, an apparatus, and a system for generating an audio and video file to improve voice quality in an audio and video file.
  • An aspect of the embodiments of the present invention provides a method for generating an audio and video file, including:
  • the voice recording device receives clock synchronization information sent by the photographing device, where the clock synchronization information includes an initial shooting time of the photographing device;
  • the voice recording device synchronizes its clock timing start point to the initial shooting time according to the clock synchronization information, and records and stores voice data of the subject from the initial shooting time, the voice recording device being fixed to the subject;
  • the voice recording device sends the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file.
  • Another aspect of the embodiments of the present invention provides a method for generating an audio and video file, including:
  • the photographing device sends clock synchronization information to the voice recording device, where the clock synchronization information includes an initial shooting time of the photographing device, so that the voice recording device synchronizes its clock timing start point to the initial shooting time according to the clock synchronization information, and records and stores voice data of the subject from the initial shooting time, the voice recording device being fixed to the subject;
  • the photographing device receives the voice data sent by the voice recording device, and merges the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file.
  • Another aspect of the embodiments of the present invention provides a voice recording device, including:
  • a receiving module configured to receive clock synchronization information sent by the photographing device, where the clock synchronization information includes an initial shooting time of the photographing device;
  • a synchronization module configured to synchronize a clock timing start point of the voice recording device to the initial shooting time according to the clock synchronization information;
  • a recording storage module configured to record and store voice data of the subject from the initial shooting time, the voice recording device being fixed to the subject;
  • a sending module configured to send the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file.
  • Another aspect of the embodiments of the present invention provides a photographing device, including:
  • a sending module configured to send clock synchronization information to the voice recording device, where the clock synchronization information includes an initial shooting time of the photographing device, so that the voice recording device synchronizes its clock timing start point to the initial shooting time according to the clock synchronization information, and records and stores voice data of the subject from the initial shooting time, the voice recording device being fixed to the subject;
  • a receiving module configured to receive the voice data sent by the voice recording device;
  • a merging module configured to merge the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file.
  • Another aspect of an embodiment of the present invention provides an audio and video file generating system including the voice recording device and the photographing device.
  • In the method, apparatus, and system for generating an audio and video file provided by the embodiments of the present invention, the voice recording device receives the initial shooting time of the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file.
  • Because the voice data and the video image data correspond to the same initial shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • FIG. 1 is a flowchart of a method for generating an audio and video file according to an embodiment of the present invention.
  • FIG. 2 is a flowchart of a method for generating an audio and video file according to an embodiment of the present invention.
  • FIG. 3 is a structural diagram of a voice recording device according to an embodiment of the present invention.
  • FIG. 4 is a structural diagram of a voice recording device according to another embodiment of the present invention.
  • FIG. 5 is a structural diagram of a photographing apparatus according to an embodiment of the present invention.
  • FIG. 6 is a structural diagram of a photographing apparatus according to another embodiment of the present invention.
  • FIG. 7 is a structural diagram of an audio and video file generating system according to an embodiment of the present invention.
  • FIG. 8 is a structural diagram of a voice recording device according to another embodiment of the present invention.
  • FIG. 9 is a structural diagram of a photographing apparatus according to another embodiment of the present invention.
  • FIG. 1 is a flowchart of a method for generating an audio and video file according to an embodiment of the present invention.
  • The voice recording device in the embodiment of the present invention is specifically a blue clip structure that includes a clip and a microphone. The clip is used to fix the voice recording device to the clothing of the subject; in the embodiment of the present invention, a clip with a strong grip is preferably attached to the collar of the subject.
  • The microphone is fixedly connected to the clip and includes a storage module, a recording module, an automatic gain control (AGC) limiting module, a clock module (crystal oscillator), a Global Positioning System (GPS) module, a wireless communication module (WiFi, Bluetooth), and the like. The microphone is wirelessly connected to the photographing device through the wireless communication module, and the photographing device is specifically a smartphone, a camera, or the like.
  • To address the problem that, when the subject is in motion and far from the photographer, the photographing device can record only the picture of the subject and cannot clearly record the subject's voice, the embodiment of the present invention provides a method for generating an audio and video file. The specific steps of the method are as follows:
  • Step S101: The voice recording device receives clock synchronization information sent by the photographing device, where the clock synchronization information includes an initial shooting time of the photographing device.
  • A photographing device such as a smartphone sends clock synchronization information to the voice recording device, that is, the microphone, at the moment it starts to capture video, and the clock synchronization information includes the time at which the smartphone starts shooting.
  • Step S102: The voice recording device synchronizes its clock timing start point to the initial shooting time according to the clock synchronization information, and records and stores the voice data of the subject from the initial shooting time. The voice recording device is fixed to the subject.
  • The initial clock of the microphone is synchronized with the GPS clock and keeps time with high precision. After the microphone receives the clock synchronization information sent by the photographing device, it synchronizes its own clock timing start point to the initial shooting time according to the clock synchronization information, records the subject's voice data from the initial shooting time, and stores the recorded voice data in the storage module of the microphone.
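The synchronization described in step S102 can be pictured with a minimal Python sketch. The class `RecorderClock`, its method names, and the use of integer milliseconds are illustrative assumptions rather than part of the described embodiment; the sketch only shows the recorder adopting the received initial shooting time as the zero point of its own time axis.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RecorderClock:
    """Hypothetical recorder-side clock that counts from a shared start point."""

    # Initial shooting time received in the clock synchronization information,
    # expressed in the time base shared with the photographing device.
    start_ms: Optional[int] = None

    def synchronize(self, initial_shooting_time_ms: int) -> None:
        # Adopt the photographing device's initial shooting time as time zero.
        self.start_ms = initial_shooting_time_ms

    def elapsed_ms(self, now_ms: int) -> int:
        # Position of "now" on the time axis that starts at the initial shooting time.
        if self.start_ms is None:
            raise RuntimeError("clock has not been synchronized yet")
        return now_ms - self.start_ms
```

With this sketch, a voice sample captured at shared time 1500 ms after synchronizing to 1000 ms would be stamped 500 ms on the common time axis.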
  • Step S103: The voice recording device sends the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file.
  • The microphone can send the stored voice data to the photographing device in real time, at a fixed time interval, or after the photographing device finishes shooting; this is not limited in the embodiment of the present invention.
  • The voice data is preferably sent to the photographing device by wireless communication, specifically Wireless Fidelity (WiFi), Bluetooth, or the like.
  • After receiving the voice data sent by the microphone, the photographing device merges the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file. Because the voice data and the video image data correspond to the same initial shooting time, they match closely, and the merged audio and video file has both high-quality visual and auditory effects.
  • The merging of the voice data and the video image data into an audio and video file can also be completed on a computer. Specifically, after the voice data and the video image data are recorded, the user can copy the voice data in the microphone and the video image data in the photographing device to the computer separately, and the computer merges them into an audio and video file.
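As a rough illustration of merging on a common time axis, the following Python sketch pairs each video frame with the audio samples that cover the same interval, assuming both streams start at the initial shooting time. The function name `merge_on_time_axis`, the constant frame rate, and the list-based data layout are assumptions made for the example; a real implementation would write a container format rather than return a list.

```python
def merge_on_time_axis(frames, samples, fps, sample_rate):
    """Pair each video frame with the audio samples covering its interval.

    Both `frames` and `samples` are assumed to begin at the initial shooting
    time, so index 0 of each sequence corresponds to time zero.
    """
    merged = []
    for index, frame in enumerate(frames):
        # Sample indices spanning [index / fps, (index + 1) / fps) seconds.
        start = index * sample_rate // fps
        end = (index + 1) * sample_rate // fps
        merged.append((frame, samples[start:end]))
    return merged
```

Because both sequences share the same zero point, no search for a matching offset is needed; the alignment follows directly from the frame rate and the sample rate.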
  • In the embodiment of the present invention, the voice recording device receives the initial shooting time of the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file.
  • Because the voice data and the video image data correspond to the same initial shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • Before the voice recording device receives the clock synchronization information sent by the photographing device, the method further includes: the voice recording device receiving initial clock information sent by the photographing device, where the initial clock information includes a timestamp corresponding to the time at which the photographing device sends the initial clock information; and the voice recording device starting timing according to the timestamp.
  • The voice recording device is paired with the photographing device; that is, before the voice recording device receives the clock synchronization information sent by the photographing device, the two devices need to be paired.
  • The specific pairing process is as follows: the photographing device sends initial clock information to the voice recording device, and the initial clock information includes a timestamp corresponding to the time at which the photographing device sends the initial clock information.
  • The voice recording device starts timing according to the timestamp; that is, the timing start time of the voice recording device is based on the time at which the photographing device sends the initial clock information.
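One way to read the pairing step is that the recorder anchors its local tick counter to the timestamp carried in the initial clock information. The Python sketch below illustrates that reading; the class `PairedClock`, the millisecond units, and the idea of a free-running local tick counter are assumptions for illustration, and transmission delay is ignored.

```python
class PairedClock:
    """Hypothetical recorder clock anchored to the pairing timestamp."""

    def __init__(self, pairing_timestamp_ms, local_tick_at_pairing_ms):
        # Timestamp sent by the photographing device in the initial clock information.
        self.pairing_timestamp_ms = pairing_timestamp_ms
        # Value of the recorder's own tick counter when that timestamp arrived.
        self.local_tick_at_pairing_ms = local_tick_at_pairing_ms

    def shared_time_ms(self, local_tick_ms):
        # Translate a local tick into the photographing device's time base.
        return self.pairing_timestamp_ms + (local_tick_ms - self.local_tick_at_pairing_ms)
```

After pairing, the recorder can express any later local tick in the photographing device's time base, so the initial shooting time received afterwards refers to a clock the recorder is already tracking.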
  • Before the voice recording device receives the clock synchronization information sent by the photographing device, the method further includes: the voice recording device sending a shooting start request to the photographing device, so that the photographing device starts the shooting function. After the voice recording device sends the voice data to the photographing device, the method further includes: the voice recording device sending a shooting end request to the photographing device, so that the photographing device turns off the shooting function.
  • The voice recording device and the photographing device in the embodiment of the present invention can also be applied to underwater shooting scenes.
  • The voice recording device can act as a master device that controls whether the shooting function of the photographing device is turned on or off. Specifically, the voice recording device first sends a shooting start request to the photographing device, so that the photographing device starts the shooting function and sends the initial shooting time to the voice recording device. When the voice recording device has finished recording the voice data, it actively sends a shooting end request to the photographing device, and the photographing device turns off the shooting function according to the shooting end request.
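The master-device interaction described above can be sketched as a small request handler on the photographing-device side. The class `ShootingDevice`, the request strings `"SHOOTING_START"` and `"SHOOTING_END"`, and the dictionary replies are hypothetical names chosen for this example; the source does not define a message format.

```python
class ShootingDevice:
    """Hypothetical photographing-device side of the start/end handshake."""

    def __init__(self):
        self.shooting = False
        self.initial_shooting_time_ms = None

    def handle_request(self, request, now_ms):
        if request == "SHOOTING_START":
            # Start the shooting function and remember when shooting began.
            self.shooting = True
            self.initial_shooting_time_ms = now_ms
            # Reply with clock synchronization information for the recorder.
            return {"type": "CLOCK_SYNC", "initial_shooting_time_ms": now_ms}
        if request == "SHOOTING_END":
            # Turn off the shooting function on request from the recorder.
            self.shooting = False
            return {"type": "SHOOTING_STOPPED"}
        raise ValueError(f"unknown request: {request!r}")
```

In this sketch the reply to the start request doubles as the clock synchronization information, which mirrors the order of events in the text: start request first, initial shooting time second.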
  • The voice recording device receiving the clock synchronization information sent by the photographing device includes: the voice recording device receiving the clock synchronization information sent by the photographing device by means of wireless communication. The voice recording device sending the voice data to the photographing device includes: the voice recording device sending the voice data to the photographing device by means of wireless communication.
  • All interactions between the voice recording device and the photographing device are performed by wireless communication.
  • In the embodiment of the present invention, the voice recording device receives the initial clock information sent by the photographing device before receiving the clock synchronization information, and starts timing according to the timestamp in the initial clock information. This prevents the voice recording device from being out of clock synchronization with the photographing device before it receives the clock synchronization information, which further improves the clock synchronization accuracy.
  • In addition, the voice recording device acts as the master device to turn the shooting function of the photographing device on or off, which extends the functions of the voice recording device.
  • The voice recording device monitors its own switch state, power state, and storage space state, and generates state information that includes at least switch state information, power state information, and storage space state information. The voice recording device sends the state information to the photographing device by means of wireless communication, so that the photographing device controls the voice recording device to turn on or off according to the state information.
  • The voice recording device in the embodiment of the present invention is further provided with a physical switch button, a switch indicator light, a power indicator light, and a capacity indicator.
  • The switch indicator light indicates the switch state of the voice recording device, the power indicator light indicates the power state of the voice recording device, and the capacity indicator indicates the storage state of the voice recording device.
  • The voice recording device monitors its own switch state, power state, and storage space state, and generates the corresponding state information. At the same time, the switch indicator light shows the switch state. If the battery level is lower than the power threshold, the power indicator light displays an alarm; if the storage space is less than the preset threshold, the capacity indicator displays an alarm.
  • The voice recording device sends the monitored state information to the photographing device by means of wireless communication, and the photographing device acts as the master device to turn the voice recording device on or off according to that state information. For example, when the battery level of the voice recording device is lower than the power threshold or the storage space is less than the preset threshold, the photographing device sends a close instruction to the voice recording device through wireless communication, and the voice recording device shuts down according to the close instruction.
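The decision made by the photographing device in this example can be sketched in Python as follows. The `RecorderStatus` fields, the function name `should_send_close`, and the default threshold values (10 percent battery, 50 MB of free storage) are assumptions for illustration; the source only states that a power threshold and a preset storage threshold exist.

```python
from dataclasses import dataclass


@dataclass
class RecorderStatus:
    """Hypothetical state information reported by the voice recording device."""

    switched_on: bool
    battery_percent: float
    free_storage_mb: float


def should_send_close(status, battery_threshold=10.0, storage_threshold_mb=50.0):
    """Return True when the photographing device should send a close instruction."""
    if not status.switched_on:
        # Nothing to close if the recorder is already off.
        return False
    # Close when the battery or the remaining storage falls below its threshold.
    return (status.battery_percent < battery_threshold
            or status.free_storage_mb < storage_threshold_mb)
```

The thresholds are parameters rather than constants so that the sketch does not imply specific values that the source leaves unspecified.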
  • In the embodiment of the present invention, the voice recording device sends the state information to the photographing device, and the photographing device controls the voice recording device to turn on or off according to the state information, which strengthens the photographing device's control over the voice recording device.
  • After the voice data of the subject is recorded and stored from the initial shooting time, the method further includes: the voice recording device performing AGC limiting processing on voice data whose volume is greater than a preset volume.
  • The initial clock of the voice recording device is synchronized with the GPS clock, and the initial clock of the photographing device is synchronized with the GPS clock or the clock of the base station to which the photographing device belongs.
  • The voice recording device further includes an AGC limiter. When the volume of the voice data recorded by the voice recording device is greater than the preset volume, the AGC limiter performs AGC limiting processing on the volume of the voice data.
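The source does not specify how the AGC limiter works internally. As a deliberately simplified stand-in, the Python sketch below scales down any block of samples whose peak exceeds a preset amplitude and leaves quieter blocks unchanged. The function name `limit_block` and the per-block gain approach are assumptions; practical limiters usually smooth the gain over time with attack and release settings, which this sketch omits.

```python
def limit_block(samples, max_amplitude):
    """Scale a block of samples so that its peak does not exceed max_amplitude."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak <= max_amplitude:
        # Volume is within the preset limit, so the block passes through unchanged.
        return list(samples)
    # Apply one gain factor to the whole block to bring the peak down to the limit.
    gain = max_amplitude / peak
    return [s * gain for s in samples]
```

Scaling the whole block by one factor keeps the waveform shape intact, whereas clipping each sample individually would distort it.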
  • The initial clock of the voice recording device uses the GPS clock, and the initial clock of the photographing device uses the GPS clock or the clock of the base station to which the photographing device belongs.
  • The voice recording device in the embodiment of the present invention can also be provided with a 3.5 mm interface that can be adapted to all wired headsets and microphones on the market, so that no additional microphone needs to be carried separately and the device can be used together with existing headphones and microphones.
  • The voice recording device can also be used as a stand-alone recording device, without the photographing device.
  • In the embodiment of the present invention, the voice recording device performs AGC limiting processing on voice data whose volume is greater than the preset volume, which prevents the recording module of the voice recording device from being damaged by excessively loud voice input and further ensures that the merged audio and video file has high voice quality.
  • FIG. 2 is a flowchart of a method for generating an audio and video file according to an embodiment of the present invention.
  • To address the problem that the photographing device can record only the picture of the subject and cannot clearly record the subject's voice, the embodiment of the present invention provides a method for generating an audio and video file. The specific steps of the method are as follows:
  • Step S201: The photographing device sends clock synchronization information to the voice recording device, where the clock synchronization information includes an initial shooting time of the photographing device, so that the voice recording device synchronizes its clock timing start point to the initial shooting time according to the clock synchronization information, and records and stores the voice data of the subject from the initial shooting time. The voice recording device is fixed to the subject.
  • A photographing device such as a smartphone sends clock synchronization information to the voice recording device, that is, the microphone, at the moment it starts to capture video, and the clock synchronization information includes the time at which the smartphone starts shooting.
  • The initial clock of the microphone is synchronized with the GPS clock and keeps time with high precision. After the microphone receives the clock synchronization information sent by the photographing device, it synchronizes its own clock timing start point to the initial shooting time according to the clock synchronization information, records the subject's voice data from the initial shooting time, and stores the recorded voice data in the storage module of the microphone.
  • Step S202: The photographing device receives the voice data sent by the voice recording device, and merges the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file.
  • The microphone can send the stored voice data to the photographing device in real time, at a fixed time interval, or after the photographing device finishes shooting; this is not limited in the embodiment of the present invention.
  • The voice data is preferably sent to the photographing device by wireless communication, specifically Wireless Fidelity (WiFi), Bluetooth, or the like.
  • After receiving the voice data sent by the microphone, the photographing device merges the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file. Because the voice data and the video image data correspond to the same initial shooting time, they match closely, and the merged audio and video file has both high-quality visual and auditory effects.
  • In the embodiment of the present invention, the voice recording device receives the initial shooting time of the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file.
  • Because the voice data and the video image data correspond to the same initial shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • Before the photographing device sends the clock synchronization information to the voice recording device, the method further includes: the photographing device sending initial clock information to the voice recording device, where the initial clock information includes a timestamp corresponding to the time at which the photographing device sends the initial clock information, so that the voice recording device starts timing according to the timestamp.
  • The voice recording device is paired with the photographing device; that is, before the voice recording device receives the clock synchronization information sent by the photographing device, the two devices need to be paired.
  • The specific pairing process is as follows: the photographing device sends the initial clock information to the voice recording device, and the initial clock information includes a timestamp corresponding to the time at which the photographing device sends the initial clock information.
  • The voice recording device starts timing according to the timestamp; that is, the timing start time of the voice recording device is based on the time at which the photographing device sends the initial clock information.
  • The voice data and the video image data have the same time axis. Merging the voice data and the video image data of the subject, captured by the photographing device from the initial shooting time, into an audio and video file includes: merging the voice data and the video image data into an audio and video file according to the same time axis.
  • The method further includes: the photographing device receiving a shooting start request sent by the voice recording device, and starting the shooting function according to the shooting start request.
  • The method further includes: the photographing device receiving a shooting end request sent by the voice recording device, and turning off the shooting function according to the shooting end request.
  • The voice recording device can act as a master device that controls whether the shooting function of the photographing device is turned on or off. Specifically, the voice recording device first sends a shooting start request to the photographing device, so that the photographing device starts the shooting function and sends the initial shooting time to the voice recording device. When the voice recording device has finished recording the voice data, it actively sends a shooting end request to the photographing device, and the photographing device turns off the shooting function according to the shooting end request.
  • The photographing device sending the clock synchronization information to the voice recording device includes: the photographing device sending the clock synchronization information to the voice recording device by means of wireless communication. The photographing device receiving the voice data sent by the voice recording device includes: the photographing device receiving the voice data sent by the voice recording device by means of wireless communication.
  • All interactions between the voice recording device and the photographing device are performed by wireless communication.
  • the state information includes at least: switch state information, power state information, and storage space state information.
  • the voice recording device in the embodiment of the present invention is further provided with a physical switch button, a switch indicator light, a power indicator light, and a capacity indicator.
  • the switch indicator light is used to indicate the switch state of the voice recording device.
  • the power indicator light is used to indicate the power state of the voice recording device.
  • the capacity indicator is used to indicate the storage state of the voice recording device.
  • the voice recording device monitors its own switch state, power state, and storage space state, and generates corresponding state information. At the same time, the switch indicator light shows the switch state. If the battery level falls below the power threshold, the power indicator light displays an alarm. If the free storage space falls below the preset threshold, the capacity indicator displays an alarm.
  • the voice recording device sends the monitored state information to the photographing device by wireless communication, and the photographing device acts as the master device to control the voice recording device to be turned on or off according to that state information. For example, when the battery level of the voice recording device is lower than the power threshold or its free storage space is less than the preset threshold, the photographing device sends a close instruction to the voice recording device by wireless communication, and the voice recording device performs the close operation according to the close instruction.
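  • The shutdown decision described above can be sketched as follows. The names `RecorderStatus` and `should_send_close`, the field units, and the default threshold values are assumptions chosen for illustration; the embodiment only states that a power threshold and a preset storage threshold exist.

```python
from dataclasses import dataclass


@dataclass
class RecorderStatus:
    """Hypothetical state information reported by the voice recording device."""
    switched_on: bool
    battery_percent: float
    free_storage_mb: float


def should_send_close(status, battery_threshold=10.0, storage_threshold_mb=50.0):
    """Return True if the photographing device should send a close instruction."""
    if not status.switched_on:
        return False  # nothing to close
    return (status.battery_percent < battery_threshold
            or status.free_storage_mb < storage_threshold_mb)
```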
  • In this embodiment of the present invention, the voice recording device receives the initial clock information sent by the photographing device before it receives the clock synchronization information, and starts timing according to the time stamp in the initial clock information. This prevents the voice recording device from being out of clock synchronization with the photographing device before the clock synchronization information arrives, and further improves clock synchronization accuracy.
  • the voice recording device acts as the master device to control whether the shooting function of the photographing device is turned on or off, which extends the functionality of the voice recording device.
  • the voice recording device sends its state information to the photographing device, and the photographing device controls the voice recording device to be turned on or off according to the state information, which improves the photographing device's control over the voice recording device.
  • FIG. 3 is a structural diagram of a voice recording device according to an embodiment of the present invention.
  • the voice recording device provided by the embodiment of the present invention can perform the processing flow provided by the embodiment of the audio and video file generating method.
  • the voice recording device 30 includes a receiving module 31, a synchronization module 32, a recording storage module 33, and a sending module 34.
  • the receiving module 31 is configured to receive clock synchronization information sent by the photographing device, where the clock synchronization information includes the initial shooting time of the photographing device.
  • the synchronization module 32 is configured to synchronize the clock timing start point of the voice recording device to the initial shooting time according to the clock synchronization information.
  • the recording storage module 33 is configured to record and store voice data of the subject from the initial shooting time; the voice recording device is fixed to the subject.
  • the sending module 34 is configured to send the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject photographed by the photographing device from the initial shooting time into an audio and video file.
  • the embodiment of the present invention further provides a voice recording device.
  • Because the principle by which the voice recording device solves the problem is similar to that of the foregoing audio and video file generation method, the implementation of the voice recording device can refer to the foregoing method embodiment. Repeated details are not described again here.
  • In this embodiment of the present invention, the voice recording device receives the initial shooting time from the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file. Because the voice data and the video image data correspond to the same initial time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • FIG. 4 is a structural diagram of a voice recording device according to another embodiment of the present invention.
  • the receiving module 31 is further configured to receive initial clock information sent by the photographing device, where the initial clock information includes a timestamp corresponding to the time when the photographing device sends the initial clock information;
  • the voice recording device 30 further includes a timing module 37 configured to start timing according to the time stamp.
  • the sending module 34 is further configured to send a shooting start request to the photographing device after the voice recording device starts timing according to the time stamp, so that the photographing device starts the shooting function; and, after the voice data has been sent to the photographing device, to send a shooting end request to the photographing device, so that the photographing device turns off the shooting function.
  • the receiving module 31 is specifically configured to receive the clock synchronization information sent by the photographing device by means of wireless communication; the sending module 34 is specifically configured to send the voice data to the photographing device by way of wireless communication.
  • the voice recording device 30 further includes a monitoring module 35 configured to monitor the device's own switch state, power state, and storage space state, and to generate state information. The state information includes at least: switch state information, power state information, and storage space state information. The sending module 34 is further configured to send the state information to the photographing device by wireless communication, so that the photographing device controls the voice recording device to be turned on or off according to the state information.
  • the voice recording device 30 further includes an AGC clipping module 36 configured to perform automatic gain control (AGC) limiting on voice data whose volume is greater than a preset volume. The initial clock of the voice recording device is synchronized with the GPS clock, and the initial clock of the photographing device is synchronized with the GPS clock or with the clock of the base station to which the photographing device belongs.
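  • The limiting idea can be sketched as a per-block gain reduction. The function name `limit_block` and the parameter `max_amplitude` (standing in for the preset volume) are hypothetical, and the sketch deliberately omits the attack and release smoothing that a practical AGC circuit or algorithm would normally use.

```python
def limit_block(samples, max_amplitude):
    """Scale a block of audio samples down so its peak does not exceed max_amplitude.

    Blocks that are already within the limit are returned unchanged.
    """
    if max_amplitude <= 0:
        raise ValueError("max_amplitude must be positive")
    peak = max((abs(s) for s in samples), default=0.0)
    if peak <= max_amplitude:
        return list(samples)
    gain = max_amplitude / peak  # a single gain for the whole block preserves the waveform shape
    return [s * gain for s in samples]
```

  • Scaling the whole block, rather than clipping individual samples, avoids the harsh distortion of hard clipping at the cost of briefly lowering quieter parts of the same block.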
  • the voice recording device provided by the embodiment of the present invention may be specifically used to perform the method embodiment provided in FIG. 1 above, and specific functions are not described herein again.
  • In this embodiment of the present invention, the voice recording device receives the initial clock information sent by the photographing device before it receives the clock synchronization information, and starts timing according to the time stamp in the initial clock information. This prevents the voice recording device from being out of clock synchronization with the photographing device before the clock synchronization information arrives, and further improves clock synchronization accuracy.
  • The voice recording device acts as the master device to control whether the shooting function of the photographing device is turned on or off, which extends the functionality of the voice recording device. The voice recording device sends its state information to the photographing device, and the photographing device controls the voice recording device to be turned on or off according to the state information, which improves the photographing device's control over the voice recording device. The voice recording device performs AGC limiting on voice data whose volume exceeds the preset volume, which prevents the recording module of the voice recording device from being damaged by excessively loud voice data and further ensures that the merged audio and video file has high voice quality.
  • FIG. 5 is a structural diagram of a photographing apparatus according to an embodiment of the present invention.
  • the photographing device can perform the processing flow provided by the embodiment of the audio and video file generating method.
  • the photographing device 50 includes a sending module 51, a receiving module 52, and a merging module 53.
  • the sending module 51 is configured to send clock synchronization information to the voice recording device, the clock synchronization information including the initial shooting time of the photographing device, so that the voice recording device synchronizes the clock timing start point of the voice recording device to the initial shooting time according to the clock synchronization information and records and stores voice data of the subject from the initial shooting time; the voice recording device is fixed to the subject.
  • the receiving module 52 is configured to receive the voice data sent by the voice recording device.
  • the merging module 53 is configured to merge the voice data and the video image data of the subject photographed by the photographing device from the initial shooting time into an audio and video file.
  • the embodiment of the present invention further provides a photographing device. Because the principle by which the photographing device solves the problem is similar to that of the foregoing audio and video file generation method, the implementation of the photographing device can refer to the foregoing method embodiment. Repeated details are not described again here.
  • In this embodiment of the present invention, the voice recording device receives the initial shooting time from the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file. Because the voice data and the video image data correspond to the same initial time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • FIG. 6 is a structural diagram of a photographing apparatus according to another embodiment of the present invention.
  • the sending module 51 is further configured to send the initial clock information to the voice recording device, where the initial clock information includes a time stamp corresponding to the time when the photographing device sends the initial clock information, so that the voice recording device starts timing according to the time stamp.
  • the voice data and the video image data have the same time axis; the merging module 53 is specifically configured to combine the voice data and the video image data into an audio and video file according to the same time axis.
  • the receiving module 52 is further configured to receive a shooting start request or a shooting end request sent by the voice recording device.
  • the photographing device 50 further includes a control module 54, which is configured to start the shooting function according to the shooting start request or to turn off the shooting function according to the shooting end request.
  • the sending module 51 is specifically configured to send the clock synchronization information to the voice recording device by means of wireless communication; the receiving module 52 is specifically configured to receive the voice data sent by the voice recording device by way of wireless communication.
  • the receiving module 52 is further configured to receive the state information sent by the voice recording device by wireless communication; the control module 54 is further configured to control the voice recording device to be turned on or off according to the state information, where the state information includes at least: switch state information, power state information, and storage space state information.
  • the photographic device provided by the embodiment of the present invention may be specifically used to perform the method embodiment provided in FIG. 2 above, and specific functions are not described herein again.
  • In this embodiment of the present invention, the voice recording device receives the initial clock information sent by the photographing device before it receives the clock synchronization information, and starts timing according to the time stamp in the initial clock information. This prevents the voice recording device from being out of clock synchronization with the photographing device before the clock synchronization information arrives, and further improves clock synchronization accuracy.
  • The voice recording device acts as the master device to control whether the shooting function of the photographing device is turned on or off, which extends the functionality of the voice recording device. The voice recording device sends its state information to the photographing device, and the photographing device controls the voice recording device to be turned on or off according to the state information, which improves the photographing device's control over the voice recording device.
  • FIG. 7 is a structural diagram of an audio and video file generating system according to an embodiment of the present invention.
  • the audio and video file generating system provided by the embodiment of the present invention can execute the processing flow provided by the embodiment of the audio and video file generating method.
  • the audio and video file generating system 70 includes the voice recording device 30 and the photographing device 50 of the foregoing embodiments.
  • FIG. 8 is a structural diagram of a voice recording device according to another embodiment of the present invention.
  • the voice recording device provided by the embodiment of the present invention can perform the processing flow provided by the embodiment of the audio and video file generating method.
  • the voice recording device 30 includes a bus 142, and a processor 143, a memory 144, an input module 145, and an output module 146 connected to the bus 142.
  • the input module 145 is configured to receive clock synchronization information sent by the photographing device, where the clock synchronization information includes the initial shooting time of the photographing device.
  • the processor 143 is configured to execute control commands stored in the memory 144 to perform the following steps: synchronizing the clock timing start point of the voice recording device to the initial shooting time according to the clock synchronization information; and recording and storing voice data of the subject from the initial shooting time, the voice recording device being fixed to the subject.
  • the output module 146 is configured to send the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject photographed by the photographing device from the initial shooting time into an audio and video file.
  • the input module 145 is further configured to receive the initial clock information sent by the photographing device, where the initial clock information includes a time stamp corresponding to the time when the photographing device sends the initial clock information;
  • the processor 143 is further configured to start timing according to the time stamp.
  • the output module 146 is further configured to send a shooting start request to the photographing device after the voice recording device starts timing according to the time stamp, so that the photographing device starts the shooting function; and, after the voice data has been sent to the photographing device, to send a shooting end request to the photographing device, so that the photographing device turns off the shooting function.
  • the input module 145 is specifically configured to receive the clock synchronization information sent by the photographing device by wireless communication; the output module 146 is specifically configured to send the voice data to the photographing device by wireless communication.
  • the processor 143 is further configured to monitor the device's own switch state, power state, and storage space state, and to generate state information, where the state information includes at least: switch state information, power state information, and storage space state information; the output module 146 is further configured to send the state information to the photographing device by wireless communication, so that the photographing device controls the voice recording device to be turned on or off according to the state information.
  • the processor 143 is further configured to perform AGC limiting on voice data whose volume is greater than the preset volume. The initial clock of the voice recording device is synchronized with the GPS clock, and the initial clock of the photographing device is synchronized with the GPS clock or with the clock of the base station to which the photographing device belongs.
  • In this embodiment of the present invention, the voice recording device receives the initial shooting time from the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file. Because the voice data and the video image data correspond to the same initial time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • FIG. 9 is a structural diagram of a photographing apparatus according to another embodiment of the present invention.
  • the photographing apparatus provided by the embodiment of the present invention can execute the processing flow provided by the embodiment of the audio and video file generating method.
  • the photographing device 50 includes a bus 152, and a processor 153, a memory 154, a sending module 155, and a receiving module 156 connected to the bus 152.
  • the sending module 155 is configured to send clock synchronization information to the voice recording device, where the clock synchronization information includes the initial shooting time of the photographing device, so that the voice recording device synchronizes the clock timing start point of the voice recording device to the initial shooting time according to the clock synchronization information and records and stores voice data of the subject from the initial shooting time; the voice recording device is fixed to the subject.
  • the receiving module 156 is configured to receive the voice data sent by the voice recording device.
  • the processor 153 is configured to execute control commands stored in the memory 154 to perform the following step: merging the voice data and the video image data of the subject photographed by the photographing device from the initial shooting time into an audio and video file.
  • the sending module 155 is further configured to send the initial clock information to the voice recording device, where the initial clock information includes a time stamp corresponding to the time when the photographing device sends the initial clock information, so that the voice recording device starts timing according to the time stamp.
  • the voice data and the video image data have the same time axis; the merging module is specifically configured to merge the voice data and the video image data into an audio and video file according to the same time axis.
  • the receiving module 156 is further configured to receive a shooting start request or a shooting end request sent by the voice recording device; the processor 153 is configured to start the shooting function according to the shooting start request or to turn off the shooting function according to the shooting end request.
  • the sending module 155 is specifically configured to send the clock synchronization information to the voice recording device by wireless communication; the receiving module 156 is specifically configured to receive, by wireless communication, the voice data sent by the voice recording device.
  • the receiving module 156 is further configured to receive the state information sent by the voice recording device by wireless communication; the processor 153 is further configured to control the voice recording device to be turned on or off according to the state information, where the state information includes at least: switch state information, power state information, and storage space state information.
  • In this embodiment of the present invention, the voice recording device receives the initial shooting time from the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file. Because the voice data and the video image data correspond to the same initial time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • Embodiments of the present invention also provide a non-transitory computer-readable storage medium including computer-executable instructions for execution on a voice recording device 30 and a photographing device 50, such that, when executed by the devices, the audio and video file generation method according to the embodiments of the present invention is implemented.
  • In the embodiments of the present invention, the voice recording device receives the initial shooting time from the photographing device, synchronizes its own clock timing start point to the initial shooting time, and records and stores the voice data of the subject from the initial shooting time. Finally, the voice data and the video image data captured by the photographing device from the initial shooting time are merged into an audio and video file. Because the voice data and the video image data correspond to the same initial time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures that the voice data and the video image data match closely and that the merged audio and video file has high voice quality.
  • In addition, the voice recording device receives the initial clock information sent by the photographing device before it receives the clock synchronization information, and starts timing according to the time stamp in the initial clock information. This prevents the voice recording device from being out of clock synchronization with the photographing device before the clock synchronization information arrives.
  • the disclosed apparatus and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division into units is only a division by logical function.
  • In actual implementation there may be other division manners; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the above-described integrated unit implemented in the form of a software functional unit can be stored in a computer readable storage medium.
  • the above software functional unit is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to perform some of the steps of the methods of the various embodiments of the present invention.
  • the foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

Abstract

Provided are an audio and video file generation method, apparatus and system. The method comprises: a voice recording device receiving clock synchronization information sent by a photographing device; the voice recording device synchronizing a clock timing start point of the voice recording device to a photographing start moment according to the clock synchronization information, and recording and storing voice data of a photographed person from the photographing start moment; and the photographing device merging the voice data with video image data into an audio and video file. In the embodiments of the present invention, voice data and video image data correspond to the same photographing start moment, i.e. being on the basis of the same time axis, and even though a photographed person is in a motion state or far away from a photographer, a voice recording device fixed on the photographed person can record and store clear voice data, thereby ensuring that the voice data and the video image data have a very high matching degree, and ensuring that a merged audio and video file has a very high voice quality.

Description

Audio and video file generation method, device and system
Technical field
The embodiments of the present invention relate to the field of communications technologies, and in particular, to a method, a device, and a system for generating an audio and video file.
Background
With the development of electronic products, more and more of them have a shooting function, which makes it convenient for users to shoot while travelling or at any time and place.
Existing electronic products with shooting functions include video cameras, tablet computers, and mobile phones. During shooting, the photographer holds the electronic product, that is, the photographing device, and uses its video recording and voice recording functions to record the picture and the voice of the subject at the same time, producing an audio and video file.
However, when the subject is in motion and far from the photographer, the photographing device can record only the picture of the subject and cannot clearly record the subject's voice, which results in poor voice quality in the recorded audio and video file.
Summary of the invention
Embodiments of the present invention provide a method, a device, and a system for generating an audio and video file, so as to improve voice quality in the audio and video file.
One aspect of the embodiments of the present invention provides a method for generating an audio and video file, including:
a voice recording device receives clock synchronization information sent by a photographing device, where the clock synchronization information includes an initial shooting time of the photographing device;
the voice recording device synchronizes a clock timing start point of the voice recording device to the initial shooting time according to the clock synchronization information, and records and stores voice data of a subject from the initial shooting time, the voice recording device being fixed to the subject;
the voice recording device sends the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject photographed by the photographing device from the initial shooting time into an audio and video file.
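The three steps above can be sketched from the voice recording device's point of view. The class `VoiceRecorder`, its method names, and the representation of audio as `(capture_time, block)` pairs are assumptions made for illustration; the wireless transport and storage format are left out.

```python
class VoiceRecorder:
    """Hypothetical voice recording device following the three method steps."""

    def __init__(self):
        self.start_time = None   # initial shooting time from the clock synchronization information
        self.samples = []        # (offset from initial shooting time in seconds, audio block)

    def on_clock_sync(self, initial_shooting_time):
        """Step 1 and 2a: take the initial shooting time as the clock timing start point."""
        self.start_time = initial_shooting_time
        self.samples.clear()

    def on_audio(self, capture_time, block):
        """Step 2b: record and store audio captured from the initial shooting time onward."""
        if self.start_time is None or capture_time < self.start_time:
            return  # not synchronized yet, or captured before shooting began
        self.samples.append((capture_time - self.start_time, block))

    def export(self):
        """Step 3: voice data to send to the photographing device for merging."""
        return list(self.samples)
```

Storing offsets rather than absolute times means the exported voice data already sits on the same time axis as video that starts at offset zero.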
Another aspect of the embodiments of the present invention provides an audio and video file generation method, including:
a photographing device sends clock synchronization information to a voice recording device, the clock synchronization information including the start shooting time of the photographing device, so that the voice recording device synchronizes the timing start point of its clock to the start shooting time according to the clock synchronization information and records and stores the subject's voice data from the start shooting time, the voice recording device being fixed to the subject;
the photographing device receives the voice data sent by the voice recording device, and merges the voice data with the video image data of the subject that it has captured from the start shooting time into an audio and video file.
Another aspect of the embodiments of the present invention provides a voice recording device, including:
a receiving module, configured to receive clock synchronization information sent by a photographing device, the clock synchronization information including the start shooting time of the photographing device;
a synchronization module, configured to synchronize the timing start point of the clock of the voice recording device to the start shooting time according to the clock synchronization information;
a recording and storage module, configured to record and store the subject's voice data from the start shooting time, the voice recording device being fixed to the subject;
a sending module, configured to send the voice data to the photographing device, so that the photographing device merges the voice data with the video image data of the subject that it has captured from the start shooting time into an audio and video file.
Another aspect of the embodiments of the present invention provides a photographing device, including:
a sending module, configured to send clock synchronization information to a voice recording device, the clock synchronization information including the start shooting time of the photographing device, so that the voice recording device synchronizes the timing start point of its clock to the start shooting time according to the clock synchronization information and records and stores the subject's voice data from the start shooting time, the voice recording device being fixed to the subject;
a receiving module, configured to receive the voice data sent by the voice recording device;
a merging module, configured to merge the voice data with the video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
Another aspect of the embodiments of the present invention provides an audio and video file generation system, including the voice recording device and the photographing device described above.
In the audio and video file generation method, apparatus, and system provided by the embodiments of the present invention, the voice recording device receives the start shooting time of the photographing device, synchronizes the timing start point of its own clock to that time, and records and stores the subject's voice data from the start shooting time; the voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data share the same start shooting time, that is, the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is moving and far from the photographer. This keeps the voice data closely matched to the video image data and gives the merged audio and video file high voice quality.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show some embodiments of the present invention, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a flowchart of an audio and video file generation method according to an embodiment of the present invention;
FIG. 2 is a flowchart of an audio and video file generation method according to an embodiment of the present invention;
FIG. 3 is a structural diagram of a voice recording device according to an embodiment of the present invention;
FIG. 4 is a structural diagram of a voice recording device according to another embodiment of the present invention;
FIG. 5 is a structural diagram of a photographing device according to an embodiment of the present invention;
FIG. 6 is a structural diagram of a photographing device according to another embodiment of the present invention;
FIG. 7 is a structural diagram of an audio and video file generation system according to an embodiment of the present invention;
FIG. 8 is a structural diagram of a voice recording device according to another embodiment of the present invention;
FIG. 9 is a structural diagram of a photographing device according to another embodiment of the present invention.
Detailed description
To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention. It should be understood that the terms "include" and/or "comprise" specify the presence of features, actions, integers, steps, operations, elements, and/or components, but do not exclude the presence or addition of one or more other features, actions, integers, steps, operations, elements, components, and/or combinations thereof.
FIG. 1 is a flowchart of an audio and video file generation method according to an embodiment of the present invention. The voice recording device in this embodiment is specifically a blue clip structure comprising a clip and a microphone. The clip fixes the device to the subject's clothing so that the subject's voice data can be recorded; this embodiment preferably uses a clip with a strong grip, fixed to the subject's collar. The microphone is fixedly connected to the clip and contains a storage module, a recording module, an AGC limiting module, a clock module (crystal oscillator), a Global Positioning System (GPS) module, and a wireless communication module (Wi-Fi, Bluetooth), among others. The microphone connects wirelessly to the photographing device through the wireless communication module; the photographing device is specifically a smartphone, a camera, or the like.
This embodiment addresses the problem that, when the subject is moving and far from the photographer, the photographing device can record only the subject's picture and cannot clearly record the subject's voice. It provides an audio and video file generation method with the following steps:
Step S101: the voice recording device receives clock synchronization information sent by the photographing device, the clock synchronization information including the start shooting time of the photographing device.
At the moment it starts shooting video, the photographing device, for example a smartphone, sends clock synchronization information to the voice recording device, i.e. the microphone. The clock synchronization information contains the smartphone's start shooting time.
Step S102: the voice recording device synchronizes the timing start point of its clock to the start shooting time according to the clock synchronization information, and records and stores the subject's voice data from the start shooting time, the voice recording device being fixed to the subject.
Before receiving the clock synchronization information from the photographing device, the microphone keeps its clock timing start point synchronized with the GPS clock and has high timekeeping accuracy. On receiving the clock synchronization information, the microphone synchronizes its own clock timing start point to the start shooting time and records the subject's voice data from that time onward; the recorded voice data is stored in the microphone's storage module.
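Step S102 can be illustrated with a minimal Python sketch. The class and method names (`VoiceRecorderClock`, `apply_clock_sync`, `record`) and the use of plain floating-point seconds are assumptions made for illustration; the patent does not specify a data format or an implementation.

```python
from dataclasses import dataclass, field


@dataclass
class VoiceRecorderClock:
    """Hypothetical model of the microphone's clock module."""
    origin: float = 0.0   # timing start point, in seconds
    synced: bool = False

    def apply_clock_sync(self, start_shooting_time: float) -> None:
        # Step S102: move the timing start point to the start shooting time.
        self.origin = start_shooting_time
        self.synced = True

    def media_time(self, wall_time: float) -> float:
        # Position on the shared time axis; 0.0 is the start shooting time.
        return wall_time - self.origin


@dataclass
class VoiceRecorder:
    """Hypothetical recorder that stores (media_time, chunk) pairs."""
    clock: VoiceRecorderClock = field(default_factory=VoiceRecorderClock)
    stored: list = field(default_factory=list)

    def record(self, wall_time: float, chunk: bytes) -> None:
        # Only audio captured from the start shooting time onward is stored.
        if self.clock.synced and wall_time >= self.clock.origin:
            self.stored.append((self.clock.media_time(wall_time), chunk))
```

In this sketch, audio that arrives before the clock synchronization information, or that predates the start shooting time, is simply discarded, so every stored chunk carries a timestamp measured from the same origin as the video.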
Step S103: the voice recording device sends the voice data to the photographing device, so that the photographing device merges the voice data with the video image data of the subject that it has captured from the start shooting time into an audio and video file.
The microphone may send its stored voice data to the photographing device in real time, at fixed time intervals, or after the photographing device has finished shooting the video; the embodiments of the present invention impose no limitation on this. The microphone preferably sends the voice data by wireless communication, specifically Wireless Fidelity (WiFi), Bluetooth, or the like. After receiving the voice data from the microphone, the photographing device merges it with the video image data of the subject that it has captured from the start shooting time into an audio and video file. Because the voice data and the video image data correspond to the same start shooting time, they match closely, and the merged audio and video file has both high visual quality and high audio quality.
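The three transfer options described above (real time, fixed interval, after shooting ends) can be sketched as a small decision function. The enum `SendMode`, the function `should_send`, and its parameters are hypothetical names introduced here; the patent leaves the choice of mode open.

```python
from enum import Enum


class SendMode(Enum):
    REAL_TIME = "real_time"            # send each chunk as soon as it is stored
    FIXED_INTERVAL = "fixed_interval"  # send once per configured interval
    AFTER_SHOOTING = "after_shooting"  # send only when shooting has ended


def should_send(mode: SendMode, now: float, last_sent: float,
                interval: float, shooting_finished: bool) -> bool:
    """Decide whether buffered voice data should be sent now."""
    if mode is SendMode.REAL_TIME:
        return True
    if mode is SendMode.FIXED_INTERVAL:
        return now - last_sent >= interval
    return shooting_finished
```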
In addition, the voice data and the video image data can also be merged into an audio and video file on a computer. Specifically, after both the voice data and the video image data have been recorded, the user copies the voice data from the microphone and the video image data from the photographing device to the computer, and the computer merges them into an audio and video file.
In this embodiment, the voice recording device receives the start shooting time of the photographing device, synchronizes the timing start point of its own clock to that time, and records and stores the subject's voice data from the start shooting time; the voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data share the same start shooting time, that is, the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is moving and far from the photographer. This keeps the voice data closely matched to the video image data and gives the merged audio and video file high voice quality.
On the basis of the foregoing embodiment, before the voice recording device receives the clock synchronization information sent by the photographing device, the method further includes: the voice recording device receives initial clock information sent by the photographing device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information; and the voice recording device starts timing according to the timestamp.
In this embodiment, the voice recording device and the photographing device are used as a pair: before the voice recording device receives the clock synchronization information sent by the photographing device, the two devices must be paired. The pairing procedure is as follows. The photographing device sends initial clock information to the voice recording device. The initial clock information includes a timestamp corresponding to the moment at which the photographing device sends the initial clock information. The voice recording device starts timing according to the timestamp; that is, the timing start of the voice recording device is the moment at which the photographing device sent the initial clock information.
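The pairing step can be sketched as follows: the recorder notes the received timestamp together with its own local counter reading, then advances from that timestamp using only its local counter. The class `PairedClock` and its methods are hypothetical, and the sketch ignores wireless transmission delay, which the patent does not discuss.

```python
import time


class PairedClock:
    """Hypothetical recorder-side clock that starts timing from the
    timestamp carried in the initial clock information."""

    def __init__(self, monotonic=time.monotonic):
        self._monotonic = monotonic      # local counter, injectable for testing
        self._base_timestamp = None      # timestamp sent by the photographing device
        self._local_at_receipt = None    # local counter value when it arrived

    def on_initial_clock_info(self, timestamp: float) -> None:
        # Start timing from the photographing device's send-time timestamp.
        self._base_timestamp = timestamp
        self._local_at_receipt = self._monotonic()

    def now(self) -> float:
        if self._base_timestamp is None:
            raise RuntimeError("not paired yet")
        elapsed = self._monotonic() - self._local_at_receipt
        return self._base_timestamp + elapsed
```

Once paired, the recorder can keep time relative to the photographing device even if a later clock synchronization message is lost, which is the benefit the embodiment attributes to this step.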
After the voice recording device starts timing according to the timestamp, the method further includes: the voice recording device sends a shooting start request to the photographing device, so that the photographing device starts its shooting function. After the voice recording device sends the voice data to the photographing device, the method further includes: the voice recording device sends a shooting end request to the photographing device, so that the photographing device turns off its shooting function.
The voice recording device and the photographing device in this embodiment can also be used in underwater shooting scenes. The voice recording device can act as the master device that turns the shooting function of the photographing device on or off. Specifically, the voice recording device first sends a shooting start request to the photographing device so that the photographing device starts its shooting function, and the photographing device sends its start shooting time to the voice recording device. When the voice recording device has finished recording voice data, it actively sends a shooting end request to the photographing device, and the photographing device turns off its shooting function according to that request.
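The master-control exchange described above can be sketched with two in-process objects standing in for the wirelessly connected devices. All class names, method names, and the dictionary message format are assumptions for illustration only.

```python
class PhotographingDevice:
    """Hypothetical camera side; direct method calls stand in for the wireless link."""

    def __init__(self, clock):
        self.clock = clock               # callable returning the current time
        self.shooting = False
        self.start_shooting_time = None

    def on_start_request(self) -> dict:
        # Start shooting and reply with the start shooting time.
        self.shooting = True
        self.start_shooting_time = self.clock()
        return {"type": "clock_sync",
                "start_shooting_time": self.start_shooting_time}

    def on_end_request(self) -> None:
        # Turn off the shooting function.
        self.shooting = False


class VoiceRecordingDevice:
    """Hypothetical recorder acting as the master device."""

    def __init__(self, camera: PhotographingDevice):
        self.camera = camera
        self.timing_start = None

    def begin(self) -> None:
        # Send the shooting start request, then adopt the returned start time.
        reply = self.camera.on_start_request()
        self.timing_start = reply["start_shooting_time"]

    def finish(self) -> None:
        # Recording is done: ask the camera to stop shooting.
        self.camera.on_end_request()
```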
The voice recording device receiving the clock synchronization information sent by the photographing device includes: the voice recording device receives the clock synchronization information by wireless communication. The voice recording device sending the voice data to the photographing device includes: the voice recording device sends the voice data to the photographing device by wireless communication.
In this embodiment, all interaction between the voice recording device and the photographing device transmits data by wireless communication.
In this embodiment, before receiving the clock synchronization information, the voice recording device receives the initial clock information sent by the photographing device and starts timing from the timestamp it contains. This prevents the voice recording device from losing clock synchronization with the photographing device if the clock synchronization information is not received, and further improves clock synchronization accuracy. In addition, using the voice recording device as the master device that turns the shooting function of the photographing device on or off extends the functionality of the voice recording device.
On the basis of the foregoing embodiment, the voice recording device monitors its own switch state, battery state, and storage space state and generates status information, the status information including at least switch state information, battery state information, and storage space state information. The voice recording device sends the status information to the photographing device by wireless communication, so that the photographing device turns the voice recording device on or off according to the status information.
The voice recording device in this embodiment is also provided with a physical switch button, a switch indicator, a battery indicator, and a capacity indicator. The switch indicator shows the switch state of the voice recording device, the battery indicator shows its battery state, and the capacity indicator shows its storage space state. The voice recording device monitors its own switch state, battery state, and storage space state and generates the corresponding status information, while the switch indicator shows the switch state. If the battery level falls below a battery threshold, the battery indicator shows an alarm; when the storage space falls below a preset threshold, the switch indicator shows an alarm.
The voice recording device sends the monitored status information to the photographing device by wireless communication, and the photographing device, acting as the master device, turns the voice recording device on or off according to that status information. For example, when the battery level of the voice recording device is below the battery threshold or its storage space is below the preset threshold, the photographing device sends a shutdown instruction to the voice recording device by wireless communication, and the voice recording device shuts down according to that instruction.
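The shutdown decision on the photographing device side can be sketched as below. The `StatusInfo` fields, the units, and the default threshold values are assumptions; the patent only states that a battery threshold and a preset storage threshold exist.

```python
from dataclasses import dataclass


@dataclass
class StatusInfo:
    """Hypothetical status message sent by the voice recording device."""
    switched_on: bool
    battery_percent: float
    free_storage_mb: float


def should_send_shutdown(status: StatusInfo,
                         battery_threshold: float = 10.0,
                         storage_threshold_mb: float = 50.0) -> bool:
    """Return True when the photographing device should send a shutdown
    instruction. The default thresholds are illustrative, not from the patent."""
    if not status.switched_on:
        return False  # nothing to shut down
    return (status.battery_percent < battery_threshold
            or status.free_storage_mb < storage_threshold_mb)
```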
In this embodiment, the voice recording device sends its status information to the photographing device, and the photographing device turns the voice recording device on or off according to that information, which strengthens the photographing device's control over the voice recording device.
On the basis of the foregoing embodiment, after the subject's voice data is recorded and stored from the start shooting time, the method further includes: the voice recording device performs AGC limiting on voice data whose volume exceeds a preset volume. The initial clock of the voice recording device is kept synchronized with the GPS clock, and the initial clock of the photographing device is kept synchronized with the GPS clock or with the clock of the base station to which the photographing device belongs.
The voice recording device further includes an AGC limiter. When the volume of the voice data recorded by the voice recording device exceeds the preset volume, the AGC limiter performs AGC limiting on the volume of that voice data. In addition, the voice recording device takes the GPS clock as its initial clock, and the photographing device takes the GPS clock or the clock of its base station as its initial clock.
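As a rough illustration of limiting audio that exceeds a preset volume, the sketch below scales down any block of samples whose peak is above a preset level. This is a simplified stand-in, not the patent's AGC limiter: the function name `limit_block`, the normalized sample range, and the per-block gain are all assumptions, and practical AGC designs usually smooth the gain over time.

```python
def limit_block(samples, preset_peak=0.8):
    """Scale a block of normalized samples (-1.0 to 1.0) so that its peak
    does not exceed preset_peak. Blocks already within the limit are
    returned unchanged."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak <= preset_peak:
        return list(samples)
    gain = preset_peak / peak          # gain below 1.0 for loud blocks
    return [s * gain for s in samples]
```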
In addition, the voice recording device in this embodiment may also be provided with an interface, for example a 3.5 mm jack, that is compatible with all wired headsets and microphones on the market, so that a separate microphone need not be carried; the device can simply be integrated with an existing headset or microphone.
Furthermore, the voice recording device can also be used as a stand-alone recording device, without a photographing device.
In this embodiment, the voice recording device performs AGC limiting on voice data whose volume exceeds the preset volume. This prevents the recording module of the voice recording device from being damaged by excessively loud sound and further ensures that the merged audio and video file has high voice quality.
FIG. 2 is a flowchart of an audio and video file generation method according to an embodiment of the present invention. This embodiment addresses the problem that, when the subject is moving and far from the photographer, the photographing device can record only the subject's picture and cannot clearly record the subject's voice. It provides an audio and video file generation method with the following steps:
Step S201: the photographing device sends clock synchronization information to the voice recording device, the clock synchronization information including the start shooting time of the photographing device, so that the voice recording device synchronizes the timing start point of its clock to the start shooting time according to the clock synchronization information and records and stores the subject's voice data from the start shooting time, the voice recording device being fixed to the subject.
At the moment it starts shooting video, the photographing device, for example a smartphone, sends clock synchronization information to the voice recording device, i.e. the microphone. The clock synchronization information contains the smartphone's start shooting time. Before receiving the clock synchronization information, the microphone keeps its clock timing start point synchronized with the GPS clock and has high timekeeping accuracy. On receiving the clock synchronization information, the microphone synchronizes its own clock timing start point to the start shooting time and records the subject's voice data from that time onward; the recorded voice data is stored in the microphone's storage module.
Step S202: the photographing device receives the voice data sent by the voice recording device, and merges the voice data with the video image data of the subject that it has captured from the start shooting time into an audio and video file.
The microphone may send its stored voice data to the photographing device in real time, at fixed time intervals, or after the photographing device has finished shooting the video; the embodiments of the present invention impose no limitation on this. The microphone preferably sends the voice data by wireless communication, specifically Wireless Fidelity (WiFi), Bluetooth, or the like. After receiving the voice data from the microphone, the photographing device merges it with the video image data of the subject that it has captured from the start shooting time into an audio and video file. Because the voice data and the video image data correspond to the same start shooting time, they match closely, and the merged audio and video file has both high visual quality and high audio quality.
In this embodiment, the voice recording device receives the start shooting time of the photographing device, synchronizes the timing start point of its own clock to that time, and records and stores the subject's voice data from the start shooting time; the voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data share the same start shooting time, that is, the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is moving and far from the photographer. This keeps the voice data closely matched to the video image data and gives the merged audio and video file high voice quality.
On the basis of the foregoing embodiment, before the photographing device sends the clock synchronization information to the voice recording device, the method further includes: the photographing device sends initial clock information to the voice recording device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information, so that the voice recording device starts timing according to the timestamp.
In this embodiment, the voice recording device and the photographing device are used as a pair: before the voice recording device receives the clock synchronization information sent by the photographing device, the two devices must be paired. The pairing procedure is as follows. The photographing device sends initial clock information to the voice recording device. The initial clock information includes a timestamp corresponding to the moment at which the photographing device sends the initial clock information. The voice recording device starts timing according to the timestamp; that is, the timing start of the voice recording device is the moment at which the photographing device sent the initial clock information.
The voice data and the video image data have the same time axis. Merging the voice data with the video image data of the subject captured by the photographing device from the start shooting time into an audio and video file includes: merging the voice data and the video image data into an audio and video file according to that same time axis.
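Merging on a shared time axis can be sketched as interleaving two timestamped streams by time. The function name `merge_on_time_axis` and the `(time, payload)` tuple format are assumptions; a real implementation would write the result into a container format with a multiplexer, which the patent does not specify.

```python
import heapq


def merge_on_time_axis(video_frames, audio_chunks):
    """Interleave two streams by timestamp.

    Both inputs are lists of (time, payload) pairs sorted by time, where
    time 0.0 is the start shooting time shared by both devices.
    Returns a list of (time, kind, payload) tuples in time order.
    """
    tagged_video = ((t, "video", p) for t, p in video_frames)
    tagged_audio = ((t, "audio", p) for t, p in audio_chunks)
    return list(heapq.merge(tagged_video, tagged_audio,
                            key=lambda item: item[0]))
```

Because both streams are stamped relative to the same start shooting time, no offset estimation is needed before interleaving; that is the practical benefit of the synchronized timing start point.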
Before the photographing device sends the clock synchronization information to the voice recording device, the method further includes: the photographing device receives a shooting start request sent by the voice recording device and starts its shooting function according to the shooting start request. After the photographing device receives the voice data sent by the voice recording device, the method further includes: the photographing device receives a shooting end request sent by the voice recording device and turns off its shooting function according to the shooting end request.
The voice recording device can act as the master device and turn the shooting function of the photographing device on or off. Specifically, the voice recording device first sends a shooting start request to the photographing device so that the photographing device starts its shooting function, and the photographing device sends its start shooting time to the voice recording device. When the voice recording device finishes recording the voice data, it actively sends a shooting end request to the photographing device, and the photographing device turns off its shooting function according to that request.
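The request sequence described above might be sketched on the photographing device side as follows. This is a simplified illustration, not the embodiment's implementation: the request strings, the reply dictionaries, and the `ACK` reply are hypothetical, and the wireless transport is omitted.

```python
class Camera:
    """Photographing device that obeys start/end requests from the recorder."""

    def __init__(self, clock):
        self.clock = clock          # callable returning the camera's current time
        self.shooting = False
        self.start_time = None

    def handle(self, request: str) -> dict:
        if request == "START_SHOOTING":
            # Start the shooting function and report the start shooting time,
            # which the recorder uses as the starting point of its own clock.
            self.shooting = True
            self.start_time = self.clock()
            return {"type": "CLOCK_SYNC", "start_time": self.start_time}
        if request == "END_SHOOTING":
            # Turn the shooting function off once the recorder has finished.
            self.shooting = False
            return {"type": "ACK"}  # hypothetical acknowledgement
        raise ValueError(f"unknown request: {request}")
```

A recorder acting as the master device would send `START_SHOOTING`, record from the returned `start_time`, transfer its voice data, and then send `END_SHOOTING`.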
The photographing device sending the clock synchronization information to the voice recording device includes: the photographing device sends the clock synchronization information to the voice recording device by wireless communication. The photographing device receiving the voice data sent by the voice recording device includes: the photographing device receives the voice data sent by the voice recording device by wireless communication.
In this embodiment of the present invention, all interactions between the voice recording device and the photographing device transfer data by wireless communication.
The photographing device receives, by wireless communication, status information sent by the voice recording device, and controls the voice recording device to turn on or off according to the status information. The status information includes at least: switch status information, battery status information, and storage space status information.
The voice recording device in this embodiment of the present invention is further provided with a physical switch button, a switch indicator, a battery indicator, and a capacity indicator. The switch indicator shows the on/off state of the voice recording device, the battery indicator shows its battery level, and the capacity indicator shows the state of its storage space. The voice recording device monitors its own switch state, battery state, and storage space state, and generates the corresponding status information, while showing the switch state through the switch indicator. If the battery level falls below the battery threshold, an alarm is shown through the battery indicator; when the storage space falls below a preset threshold, an alarm is shown through the switch indicator.
The voice recording device sends the monitored status information to the photographing device by wireless communication, and the photographing device, acting as the master device, controls the voice recording device to turn on or off according to that status information. For example, when the battery level of the voice recording device is below the battery threshold or its storage space is below the preset threshold, the photographing device sends a shutdown instruction to the voice recording device by wireless communication, so that the voice recording device shuts down according to the instruction.
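The shutdown decision in the example above can be sketched in Python as follows. The field names, units, and default threshold values are assumptions made for illustration only; the embodiment does not give concrete thresholds.

```python
from dataclasses import dataclass


@dataclass
class RecorderStatus:
    """Hypothetical status report sent by the voice recording device."""
    powered_on: bool          # switch status
    battery_percent: float    # battery status
    free_storage_mb: float    # storage space status


def should_send_shutdown(status: RecorderStatus,
                         battery_threshold: float = 10.0,
                         storage_threshold_mb: float = 50.0) -> bool:
    """Decide on the camera side whether to send a shutdown instruction."""
    if not status.powered_on:
        return False  # nothing to shut down
    # Shut down when either the battery or the free storage is below its threshold.
    return (status.battery_percent < battery_threshold
            or status.free_storage_mb < storage_threshold_mb)
```

For instance, with these assumed defaults a report of 5% battery and ample storage would trigger a shutdown instruction, whereas 80% battery and 1 GB of free storage would not.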
In this embodiment of the present invention, before receiving the clock synchronization information sent by the photographing device, the voice recording device receives the initial clock information sent by the photographing device and starts timing according to the timestamp in the initial clock information. This prevents the voice recording device from losing clock synchronization with the photographing device when it fails to receive the clock synchronization information, and further improves clock synchronization accuracy. Using the voice recording device as the master device to turn the shooting function of the photographing device on or off extends the functions of the voice recording device. Having the voice recording device send its status information to the photographing device, which then turns the voice recording device on or off according to that status information, strengthens the photographing device's control over the voice recording device.
FIG. 3 is a structural diagram of a voice recording device according to an embodiment of the present invention. The voice recording device provided by this embodiment can execute the processing flow provided by the embodiment of the audio and video file generation method. As shown in FIG. 3, the voice recording device 30 includes a receiving module 31, a synchronization module 32, a recording and storage module 33, and a sending module 34. The receiving module 31 is configured to receive clock synchronization information sent by the photographing device, the clock synchronization information including the start shooting time of the photographing device. The synchronization module 32 is configured to synchronize the clock starting point of the voice recording device to the start shooting time according to the clock synchronization information. The recording and storage module 33 is configured to record and store voice data of the subject from the start shooting time, the voice recording device being fixed to the subject. The sending module 34 is configured to send the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
Based on the same inventive concept, an embodiment of the present invention further provides a voice recording device. Because the principle by which the voice recording device solves the problem is similar to that of the foregoing audio and video file generation method, the implementation of the voice recording device can refer to the foregoing method embodiment, and repeated details are not described again.
In this embodiment of the present invention, the voice recording device receives the start shooting time of the photographing device, synchronizes the starting point of its own clock to that start shooting time, and records and stores the voice data of the subject from the start shooting time; the voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data correspond to the same start shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures a high degree of matching between the voice data and the video image data, and high voice quality in the merged audio and video file.
FIG. 4 is a structural diagram of a voice recording device according to another embodiment of the present invention. On the basis of the foregoing embodiment, the receiving module 31 is further configured to receive initial clock information sent by the photographing device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information;
The voice recording device 30 further includes a timing module 37, and the timing module 37 is configured to start timing according to the timestamp.
The sending module 34 is further configured to send a shooting start request to the photographing device after the voice recording device starts timing according to the timestamp, so that the photographing device starts its shooting function; and to send a shooting end request to the photographing device after the voice recording device sends the voice data to the photographing device, so that the photographing device turns off its shooting function.
The receiving module 31 is specifically configured to receive, by wireless communication, the clock synchronization information sent by the photographing device; the sending module 34 is specifically configured to send the voice data to the photographing device by wireless communication.
The voice recording device 30 further includes a monitoring module 35. The monitoring module 35 is configured to monitor the device's own switch state, battery state, and storage space state, and to generate status information, the status information including at least: switch status information, battery status information, and storage space status information. The sending module 34 is further configured to send the status information to the photographing device by wireless communication, so that the photographing device controls the voice recording device to turn on or off according to the status information.
The voice recording device 30 further includes an AGC limiting module 36, and the AGC limiting module 36 is configured to perform AGC limiting on voice data whose volume is greater than a preset volume. The initial clock of the voice recording device is kept synchronized with the GPS clock, and the initial clock of the photographing device is kept synchronized with the GPS clock or with the clock of the base station to which the photographing device belongs.
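The embodiment does not describe how the AGC limiting is computed. As a rough illustration only, the Python sketch below scales down a block of samples whose peak exceeds a preset amplitude; the block-wise peak scaling, the function name `limit_block`, and the use of normalized floating-point samples are all assumptions, and a practical AGC would normally smooth the gain over time.

```python
from typing import List, Sequence


def limit_block(samples: Sequence[float], max_amplitude: float) -> List[float]:
    """Scale a block down so that its peak does not exceed max_amplitude.

    Blocks that are already within the limit are returned unchanged.
    """
    peak = max((abs(s) for s in samples), default=0.0)
    if peak <= max_amplitude:
        return list(samples)
    gain = max_amplitude / peak  # gain < 1 applied to the whole block
    return [s * gain for s in samples]
```

Scaling the whole block by one gain keeps the waveform shape, unlike hard clipping, which would flatten the peaks.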
The voice recording device provided by this embodiment of the present invention may be specifically used to execute the method embodiment provided in FIG. 1 above; its specific functions are not described again here.
In this embodiment of the present invention, before receiving the clock synchronization information sent by the photographing device, the voice recording device receives the initial clock information sent by the photographing device and starts timing according to the timestamp in the initial clock information. This prevents the voice recording device from losing clock synchronization with the photographing device when it fails to receive the clock synchronization information, and further improves clock synchronization accuracy. Using the voice recording device as the master device to turn the shooting function of the photographing device on or off extends the functions of the voice recording device. Having the voice recording device send its status information to the photographing device, which then turns the voice recording device on or off according to that status information, strengthens the photographing device's control over the voice recording device. Performing AGC limiting on voice data whose volume is greater than the preset volume prevents the recording module of the voice recording device from being damaged by excessively loud voice data, and further ensures high voice quality in the merged audio and video file.
FIG. 5 is a structural diagram of a photographing device according to an embodiment of the present invention. The photographing device provided by this embodiment can execute the processing flow provided by the embodiment of the audio and video file generation method. As shown in FIG. 5, the photographing device 50 includes a sending module 51, a receiving module 52, and a merging module 53. The sending module 51 is configured to send clock synchronization information to the voice recording device, the clock synchronization information including the start shooting time of the photographing device, so that the voice recording device synchronizes its clock starting point to the start shooting time according to the clock synchronization information and records and stores voice data of the subject from the start shooting time, the voice recording device being fixed to the subject. The receiving module 52 is configured to receive the voice data sent by the voice recording device. The merging module 53 is configured to merge the voice data and the video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
Based on the same inventive concept, an embodiment of the present invention further provides a photographing device. Because the principle by which the photographing device solves the problem is similar to that of the foregoing audio and video file generation method, the implementation of the photographing device can refer to the foregoing method embodiment, and repeated details are not described again.
In this embodiment of the present invention, the voice recording device receives the start shooting time of the photographing device, synchronizes the starting point of its own clock to that start shooting time, and records and stores the voice data of the subject from the start shooting time; the voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data correspond to the same start shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures a high degree of matching between the voice data and the video image data, and high voice quality in the merged audio and video file.
FIG. 6 is a structural diagram of a photographing device according to another embodiment of the present invention. On the basis of the foregoing embodiment, the sending module 51 is further configured to send initial clock information to the voice recording device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information, so that the voice recording device starts timing according to the timestamp.
The voice data and the video image data have the same time axis; the merging module 53 is specifically configured to merge the voice data and the video image data into an audio and video file according to the same time axis.
The receiving module 52 is further configured to receive a shooting start request or a shooting end request sent by the voice recording device. The photographing device 50 further includes a control module 54, and the control module 54 is configured to start the shooting function according to the shooting start request or to turn off the shooting function according to the shooting end request.
The sending module 51 is specifically configured to send the clock synchronization information to the voice recording device by wireless communication; the receiving module 52 is specifically configured to receive, by wireless communication, the voice data sent by the voice recording device.
The receiving module 52 is further configured to receive, by wireless communication, status information sent by the voice recording device; the control module 54 is further configured to control the voice recording device to turn on or off according to the status information. The status information includes at least: switch status information, battery status information, and storage space status information.
The photographing device provided by this embodiment of the present invention may be specifically used to execute the method embodiment provided in FIG. 2 above; its specific functions are not described again here.
In this embodiment of the present invention, before receiving the clock synchronization information sent by the photographing device, the voice recording device receives the initial clock information sent by the photographing device and starts timing according to the timestamp in the initial clock information. This prevents the voice recording device from losing clock synchronization with the photographing device when it fails to receive the clock synchronization information, and further improves clock synchronization accuracy. Using the voice recording device as the master device to turn the shooting function of the photographing device on or off extends the functions of the voice recording device. Having the voice recording device send its status information to the photographing device, which then turns the voice recording device on or off according to that status information, strengthens the photographing device's control over the voice recording device.
FIG. 7 is a structural diagram of an audio and video file generation system according to an embodiment of the present invention. The audio and video file generation system provided by this embodiment can execute the processing flow provided by the embodiment of the audio and video file generation method. As shown in FIG. 7, the audio and video file generation system 70 includes the voice recording device 30 of the foregoing embodiments and the photographing device 50 of the foregoing embodiments.
The audio and video file generation system provided by this embodiment of the present invention can execute the processing flow provided by the embodiment of the audio and video file generation method.
FIG. 8 is a structural diagram of a voice recording device according to another embodiment of the present invention. The voice recording device provided by this embodiment can execute the processing flow provided by the embodiment of the audio and video file generation method. As shown in FIG. 8, the voice recording device 30 includes a bus 142, and a processor 143, a memory 144, an input module 145, and an output module 146 connected to the bus 142. The input module 145 is configured to receive clock synchronization information sent by the photographing device, the clock synchronization information including the start shooting time of the photographing device. The processor 143 is configured to execute control commands stored in the memory 144 to perform the following steps: synchronizing the clock starting point of the voice recording device to the start shooting time according to the clock synchronization information; and recording and storing voice data of the subject from the start shooting time, the voice recording device being fixed to the subject. The output module 146 is configured to send the voice data to the photographing device, so that the photographing device merges the voice data and the video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
In this embodiment of the present invention, optionally, the input module 145 is further configured to receive initial clock information sent by the photographing device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information; the processor 143 is further configured to start timing according to the timestamp.
In this embodiment of the present invention, optionally, the output module 146 is further configured to send a shooting start request to the photographing device after the voice recording device starts timing according to the timestamp, so that the photographing device starts its shooting function; and to send a shooting end request to the photographing device after the voice recording device sends the voice data to the photographing device, so that the photographing device turns off its shooting function.
In this embodiment of the present invention, optionally, the input module 145 is specifically configured to receive, by wireless communication, the clock synchronization information sent by the photographing device; the output module 146 is specifically configured to send the voice data to the photographing device by wireless communication.
In this embodiment of the present invention, optionally, the processor 143 is further configured to monitor the device's own switch state, battery state, and storage space state, and to generate status information, the status information including at least: switch status information, battery status information, and storage space status information. The output module 146 is further configured to send the status information to the photographing device by wireless communication, so that the photographing device controls the voice recording device to turn on or off according to the status information.
In this embodiment of the present invention, optionally, the processor 143 is configured to perform AGC limiting on voice data whose volume is greater than a preset volume. The initial clock of the voice recording device is kept synchronized with the GPS clock, and the initial clock of the photographing device is kept synchronized with the GPS clock or with the clock of the base station to which the photographing device belongs.
In this embodiment of the present invention, the voice recording device receives the start shooting time of the photographing device, synchronizes the starting point of its own clock to that start shooting time, and records and stores the voice data of the subject from the start shooting time; the voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data correspond to the same start shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures a high degree of matching between the voice data and the video image data, and high voice quality in the merged audio and video file.
FIG. 9 is a structural diagram of a photographing device according to another embodiment of the present invention. The photographing device provided by this embodiment can execute the processing flow provided by the embodiment of the audio and video file generation method. As shown in FIG. 9, the photographing device 50 includes a bus 152, and a processor 153, a memory 154, a sending module 155, and a receiving module 156 connected to the bus 152. The sending module 155 is configured to send clock synchronization information to the voice recording device, the clock synchronization information including the start shooting time of the photographing device, so that the voice recording device synchronizes its clock starting point to the start shooting time according to the clock synchronization information and records and stores voice data of the subject from the start shooting time, the voice recording device being fixed to the subject. The receiving module 156 is configured to receive the voice data sent by the voice recording device. The processor 153 is configured to execute control commands stored in the memory 154 to perform the following step: merging the voice data and the video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
In this embodiment of the present invention, optionally, the sending module 155 is further configured to send initial clock information to the voice recording device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information, so that the voice recording device starts timing according to the timestamp.
In this embodiment of the present invention, optionally, the voice data and the video image data have the same time axis; the merging module is specifically configured to merge the voice data and the video image data into an audio and video file according to the same time axis.
In this embodiment of the present invention, optionally, the receiving module 156 is further configured to receive a shooting start request or a shooting end request sent by the voice recording device; the processor 153 is configured to start the shooting function according to the shooting start request or to turn off the shooting function according to the shooting end request.
In this embodiment of the present invention, optionally, the sending module 155 is specifically configured to send the clock synchronization information to the voice recording device by wireless communication; the receiving module 156 is specifically configured to receive, by wireless communication, the voice data sent by the voice recording device.
In this embodiment of the present invention, optionally, the receiving module 156 is further configured to receive, by wireless communication, status information sent by the voice recording device; the processor 153 is further configured to control the voice recording device to turn on or off according to the status information. The status information includes at least: switch status information, battery status information, and storage space status information.
In this embodiment of the present invention, the voice recording device receives the start shooting time of the photographing device, synchronizes the starting point of its own clock to that start shooting time, and records and stores the voice data of the subject from the start shooting time; the voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data correspond to the same start shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is in motion and far from the photographer. This ensures a high degree of matching between the voice data and the video image data, and high voice quality in the merged audio and video file.
An embodiment of the present invention further provides a non-volatile computer-readable storage medium containing computer-executable instructions for the voice recording device 30 and the photographing device 50 that, when executed by those devices, implement the audio and video file generation method according to the embodiments of the present invention.
In summary, in the embodiments of the present invention, the voice recording device receives the start shooting time of the photographing device, synchronizes the starting point of its own clock to the start shooting time, and records and stores the subject's voice data from the start shooting time. The voice data is finally merged with the video image data captured by the photographing device from the start shooting time into an audio and video file. Because the voice data and the video image data correspond to the same start shooting time, that is, they are based on the same time axis, the voice recording device fixed to the subject can record and store clear voice data even when the subject is moving and far from the photographer, which ensures that the voice data closely matches the video image data and that the merged audio and video file has high voice quality. Before receiving the clock synchronization information sent by the photographing device, the voice recording device receives initial clock information sent by the photographing device and starts timing according to the timestamp in the initial clock information. This prevents the voice recording device from losing clock synchronization with the photographing device when it fails to receive the clock synchronization information, and further improves clock synchronization accuracy. The voice recording device can act as the master device and control the turning on or off of the shooting function of the photographing device, which extends the functions of the voice recording device. The voice recording device sends its status information to the photographing device, and the photographing device controls the voice recording device to turn on or off according to that status information, which improves the photographing device's control over the voice recording device. The voice recording device performs automatic gain control (AGC) limiting on voice data whose volume exceeds a preset volume, which prevents the recording module of the voice recording device from being damaged by excessively loud sound and further ensures that the merged audio and video file has high voice quality.
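The fallback based on initial clock information can be illustrated with a small clock model on the voice recording device. The class and method names are hypothetical, and transmission delay is ignored as a simplifying assumption. The sketch only shows how timing might start from the timestamp sent by the photographing device and continue on a local monotonic clock.

```python
import time
from typing import Callable, Optional


class SharedClock:
    """Recorder-side estimate of the photographing device's clock."""

    def __init__(self, monotonic: Callable[[], float] = time.monotonic) -> None:
        self._monotonic = monotonic
        self._remote_timestamp: Optional[float] = None
        self._local_at_receipt: Optional[float] = None

    def on_initial_clock_info(self, remote_timestamp: float) -> None:
        """Start timing from the timestamp carried in the initial clock information."""
        self._remote_timestamp = remote_timestamp
        self._local_at_receipt = self._monotonic()

    def now(self) -> float:
        """Current time on the photographing device's clock, as estimated locally."""
        if self._remote_timestamp is None or self._local_at_receipt is None:
            raise RuntimeError("initial clock information has not been received")
        return self._remote_timestamp + (self._monotonic() - self._local_at_receipt)
```

With such a clock already running, recorded audio can still be stamped on the photographing device's time base even if a later clock synchronization message is lost.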
In the several embodiments provided by the present invention, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, and some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, apparatuses, or units, and may be electrical, mechanical, or of another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
An integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to perform some of the steps of the methods described in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
Those skilled in the art will clearly understand that, for convenience and brevity of description, only the division into the functional modules described above is used as an example. In practical applications, the functions may be assigned to different functional modules as needed; that is, the internal structure of the apparatus may be divided into different functional modules to perform all or part of the functions described above. For the specific working process of the apparatus described above, refer to the corresponding process in the foregoing method embodiments; details are not repeated here.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be replaced with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (25)

  1. An audio and video file generation method, comprising:
    receiving, by a voice recording device, clock synchronization information sent by a photographing device, wherein the clock synchronization information includes a start shooting time of the photographing device;
    synchronizing, by the voice recording device according to the clock synchronization information, a clock timing start point of the voice recording device to the start shooting time, and recording and storing voice data of a subject from the start shooting time, wherein the voice recording device is fixed to the subject; and
    sending, by the voice recording device, the voice data to the photographing device, so that the photographing device merges the voice data and video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
  2. The method according to claim 1, further comprising, before the voice recording device receives the clock synchronization information sent by the photographing device:
    receiving, by the voice recording device, initial clock information sent by the photographing device, wherein the initial clock information includes a timestamp corresponding to the moment at which the photographing device sends the initial clock information; and
    starting timing, by the voice recording device, according to the timestamp.
  3. The method according to claim 2, further comprising, after the voice recording device starts timing according to the timestamp:
    sending, by the voice recording device, a shooting start request to the photographing device, so that the photographing device starts its shooting function;
    and further comprising, after the voice recording device sends the voice data to the photographing device:
    sending, by the voice recording device, a shooting end request to the photographing device, so that the photographing device turns off its shooting function.
  4. The method according to any one of claims 1 to 3, wherein the receiving, by the voice recording device, of the clock synchronization information sent by the photographing device comprises:
    receiving, by the voice recording device by wireless communication, the clock synchronization information sent by the photographing device;
    and wherein the sending, by the voice recording device, of the voice data to the photographing device comprises:
    sending, by the voice recording device, the voice data to the photographing device by wireless communication.
  5. The method according to claim 4, further comprising:
    monitoring, by the voice recording device, its own switch state, power state, and storage space state, and generating status information, wherein the status information includes at least switch status information, power status information, and storage space status information; and
    sending, by the voice recording device, the status information to the photographing device by wireless communication, so that the photographing device controls the voice recording device to turn on or off according to the status information.
  6. The method according to claim 5, further comprising, after the recording and storing of the voice data of the subject from the start shooting time:
    performing, by the voice recording device, AGC limiting on voice data whose volume is greater than a preset volume;
    wherein an initial clock of the voice recording device is kept synchronized with a GPS clock, and an initial clock of the photographing device is kept synchronized with a GPS clock or with the clock of the base station to which the photographing device belongs.
  7. An audio and video file generation method, comprising:
    sending, by a photographing device, clock synchronization information to a voice recording device, wherein the clock synchronization information includes a start shooting time of the photographing device, so that the voice recording device synchronizes its clock timing start point to the start shooting time according to the clock synchronization information and records and stores voice data of a subject from the start shooting time, wherein the voice recording device is fixed to the subject; and
    receiving, by the photographing device, the voice data sent by the voice recording device, and merging the voice data and video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
  8. The method according to claim 7, further comprising, before the photographing device sends the clock synchronization information to the voice recording device:
    sending, by the photographing device, initial clock information to the voice recording device, wherein the initial clock information includes a timestamp corresponding to the moment at which the photographing device sends the initial clock information, so that the voice recording device starts timing according to the timestamp.
  9. The method according to claim 8, wherein the voice data and the video image data have the same time axis;
    and wherein the merging of the voice data and the video image data of the subject captured by the photographing device from the start shooting time into an audio and video file comprises:
    merging the voice data and the video image data into the audio and video file according to the same time axis.
  10. The method according to claim 9, further comprising, before the photographing device sends the clock synchronization information to the voice recording device:
    receiving, by the photographing device, a shooting start request sent by the voice recording device, and starting the shooting function according to the shooting start request;
    and further comprising, after the photographing device receives the voice data sent by the voice recording device:
    receiving, by the photographing device, a shooting end request sent by the voice recording device, and turning off the shooting function according to the shooting end request.
  11. The method according to claim 10, wherein the sending, by the photographing device, of the clock synchronization information to the voice recording device comprises:
    sending, by the photographing device, the clock synchronization information to the voice recording device by wireless communication;
    and wherein the receiving, by the photographing device, of the voice data sent by the voice recording device comprises:
    receiving, by the photographing device by wireless communication, the voice data sent by the voice recording device.
  12. The method according to claim 11, further comprising:
    receiving, by the photographing device by wireless communication, status information sent by the voice recording device, and controlling the voice recording device to turn on or off according to the status information, wherein the status information includes at least switch status information, power status information, and storage space status information.
  13. A voice recording device, comprising:
    a receiving module, configured to receive clock synchronization information sent by a photographing device, wherein the clock synchronization information includes a start shooting time of the photographing device;
    a synchronization module, configured to synchronize a clock timing start point of the voice recording device to the start shooting time according to the clock synchronization information;
    a recording and storage module, configured to record and store voice data of a subject from the start shooting time, wherein the voice recording device is fixed to the subject; and
    a sending module, configured to send the voice data to the photographing device, so that the photographing device merges the voice data and video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
  14. The voice recording device according to claim 13, wherein the receiving module is further configured to receive initial clock information sent by the photographing device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information;
    and wherein the voice recording device further comprises a timing module configured to start timing according to the timestamp.
  15. The voice recording device according to claim 14, wherein the sending module is further configured to: send a shooting start request to the photographing device after the voice recording device starts timing according to the timestamp, so that the photographing device starts its shooting function; and send a shooting end request to the photographing device after the voice recording device sends the voice data to the photographing device, so that the photographing device turns off its shooting function.
  16. The voice recording device according to any one of claims 13 to 15, wherein the receiving module is specifically configured to receive, by wireless communication, the clock synchronization information sent by the photographing device;
    and the sending module is specifically configured to send the voice data to the photographing device by wireless communication.
  17. The voice recording device according to claim 16, further comprising:
    a monitoring module, configured to monitor the switch state, power state, and storage space state of the voice recording device and to generate status information, wherein the status information includes at least switch status information, power status information, and storage space status information;
    wherein the sending module is further configured to send the status information to the photographing device by wireless communication, so that the photographing device controls the voice recording device to turn on or off according to the status information.
  18. The voice recording device according to claim 17, further comprising:
    an AGC limiting module, configured to perform AGC limiting on voice data whose volume is greater than a preset volume;
    wherein an initial clock of the voice recording device is kept synchronized with a GPS clock, and an initial clock of the photographing device is kept synchronized with a GPS clock or with the clock of the base station to which the photographing device belongs.
  19. A photographing device, comprising:
    a sending module, configured to send clock synchronization information to a voice recording device, wherein the clock synchronization information includes a start shooting time of the photographing device, so that the voice recording device synchronizes its clock timing start point to the start shooting time according to the clock synchronization information and records and stores voice data of a subject from the start shooting time, wherein the voice recording device is fixed to the subject;
    a receiving module, configured to receive the voice data sent by the voice recording device; and
    a merging module, configured to merge the voice data and video image data of the subject captured by the photographing device from the start shooting time into an audio and video file.
  20. The photographing device according to claim 19, wherein the sending module is further configured to send initial clock information to the voice recording device, the initial clock information including a timestamp corresponding to the moment at which the photographing device sends the initial clock information, so that the voice recording device starts timing according to the timestamp.
  21. The photographing device according to claim 20, wherein the voice data and the video image data have the same time axis;
    and the merging module is specifically configured to merge the voice data and the video image data into the audio and video file according to the same time axis.
  22. The photographing device according to claim 21, wherein the receiving module is further configured to receive a shooting start request or a shooting end request sent by the voice recording device;
    and wherein the photographing device further comprises a control module configured to start the shooting function according to the shooting start request or to turn off the shooting function according to the shooting end request.
  23. The photographing device according to claim 22, wherein the sending module is specifically configured to send the clock synchronization information to the voice recording device by wireless communication;
    and the receiving module is specifically configured to receive, by wireless communication, the voice data sent by the voice recording device.
  24. The photographing device according to claim 23, wherein the receiving module is further configured to receive, by wireless communication, status information sent by the voice recording device;
    and the control module is further configured to control the voice recording device to turn on or off according to the status information, wherein the status information includes at least switch status information, power status information, and storage space status information.
  25. An audio and video file generation system, comprising the voice recording device according to any one of claims 13 to 18 and the photographing device according to any one of claims 19 to 24.
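The AGC limiting recited in claims 6 and 18 could be approximated as shown below. This is a deliberately simplified sketch: it scales a whole block of samples so that its peak does not exceed a preset level, whereas a practical automatic gain control would normally smooth gain changes over time. The function name, the block-based processing, and the use of peak amplitude as the measure of "volume" are assumptions and are not taken from the claims.

```python
from typing import List


def limit_block(samples: List[float], preset_peak: float) -> List[float]:
    """Scale a block of samples down so its peak does not exceed preset_peak.

    Blocks that are already within the limit are returned unchanged.
    """
    if preset_peak <= 0:
        raise ValueError("preset_peak must be positive")
    peak = max((abs(s) for s in samples), default=0.0)
    if peak <= preset_peak:
        return list(samples)
    gain = preset_peak / peak  # gain < 1 only when the block is too loud
    return [s * gain for s in samples]
```

Applying one gain to the whole block preserves the waveform shape, unlike hard clipping, which would distort loud passages.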
PCT/CN2016/072811 2015-06-29 2016-01-29 Audio and video file generation method, apparatus and system WO2017000554A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510370113.5A CN104967891B (en) 2015-06-29 2015-06-29 Audio-video document generation method and device
CN201510370113.5 2015-06-29

Publications (1)

Publication Number Publication Date
WO2017000554A1 true WO2017000554A1 (en) 2017-01-05

Family

ID=54221814

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/072811 WO2017000554A1 (en) 2015-06-29 2016-01-29 Audio and video file generation method, apparatus and system

Country Status (2)

Country Link
CN (1) CN104967891B (en)
WO (1) WO2017000554A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104967891B (en) * 2015-06-29 2019-06-18 高翔 Audio-video document generation method and device
WO2017128292A1 (en) * 2016-01-29 2017-08-03 高翔 Method, apparatus and system for generating voice video file
CN105611191B (en) * 2016-01-29 2019-01-01 高翔 Voice and video file synthesis method, apparatus and system
CN105578099A (en) * 2016-01-29 2016-05-11 高翔 Method, apparatus and system for generating voice and video file
CN105722178B (en) * 2016-01-29 2018-10-30 高翔 Voice data retransmission method, apparatus and system
CN105959776A (en) * 2016-04-29 2016-09-21 高翔 Audio and video file generating method, device and system
CN105872429A (en) * 2016-04-29 2016-08-17 高翔 Audio and video file generation method and device
CN106657852B (en) * 2016-12-21 2019-04-16 智威富(北京)科技有限公司 Time synchronization method, equipment and system
CN108174138B (en) * 2018-01-02 2021-02-19 上海闻泰电子科技有限公司 Video shooting method, voice acquisition equipment and video shooting system

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5815380A (en) * 1981-07-22 1983-01-28 Hitachi Ltd Video tape recorder
CN101197993A (en) * 2006-12-05 2008-06-11 中兴通讯股份有限公司 Video and audio synchronizing apparatus
CN101197994A (en) * 2006-12-05 2008-06-11 中兴通讯股份有限公司 Video and audio synchronization process
CN101827271A (en) * 2009-03-04 2010-09-08 联芯科技有限公司 Audio and video synchronized method and device as well as data receiving terminal
CN103369365A (en) * 2013-06-28 2013-10-23 东南大学 Audio and video synchronous recording device
CN103501408A (en) * 2013-09-23 2014-01-08 深圳市欧珀通信软件有限公司 Method and system for photographing video clips through mobile terminal
CN103686315A (en) * 2012-09-13 2014-03-26 深圳市快播科技有限公司 Synchronous audio and video playing method and device
CN104967891A (en) * 2015-06-29 2015-10-07 高翔 Method and device for generating audio- video files

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004266458A (en) * 2003-02-28 2004-09-24 Shimadzu Corp Photographing equipment and synchronous photographic timing controller
CN103220425B (en) * 2013-04-10 2016-01-20 广东欧珀移动通信有限公司 A kind of way of recording based on multiple mobile terminal and system
CN103673996A (en) * 2013-12-10 2014-03-26 苏州市峰之火数码科技有限公司 Method for conveniently acquiring good aerial sound
CN103826009B (en) * 2014-02-26 2016-08-24 宇龙计算机通信科技(深圳)有限公司 A kind of based on mobile terminal intelligent synchronization video and audio recording method and system thereof

Also Published As

Publication number Publication date
CN104967891B (en) 2019-06-18
CN104967891A (en) 2015-10-07

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16816919

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16.04.2018)

122 Ep: pct application non-entry in european phase

Ref document number: 16816919

Country of ref document: EP

Kind code of ref document: A1