WO2015184861A1 - Method, apparatus and terminal device for processing audio and image information - Google Patents

Method, apparatus and terminal device for processing audio and image information

Info

Publication number
WO2015184861A1
Authority
WO
WIPO (PCT)
Prior art keywords
time period
image information
time
audio
collected
Prior art date
Application number
PCT/CN2015/072903
Other languages
English (en)
French (fr)
Inventor
刘远旺
叶敏
贺真
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2015184861A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60 Control of cameras or camera modules

Definitions

  • the present invention relates to the field of data processing, and more particularly to a method, apparatus and terminal device for processing audio and image information.
  • the embodiments of the invention provide a method and a device for processing recorded information, which can restore a conference scene more completely while recording the conference scene over a long period of time.
  • a method of processing audio and image information, comprising: receiving a first user instruction, where the first user instruction is used to indicate that audio information is continuously collected from a first moment; continuously collecting, according to the first user instruction, audio information during a first time period between the first moment and a second moment, where the first moment is earlier than the second moment; receiving a second user instruction, where the second user instruction is used to indicate that image information is collected from a third moment; collecting, according to the second user instruction, image information from the third moment while the audio information is continuously collected, and stopping the collection of image information when a second time period expires, where the second time period takes the third moment as its starting time, the length of the second time period is less than the length of the first time period, and the second time period is located within the first time period; establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period; and storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period.
  • the image information collected in the second time period is continuous frame picture information or single frame picture information.
  • establishing the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period includes: determining a third time period according to the third moment, where the third time period includes the second time period and is located within the first time period; adding a first identifier to the audio information continuously collected in the third time period, and adding a second identifier to the image information collected in the second time period, where the first identifier and the second identifier have a correspondence; and establishing, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
  • establishing the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period includes: determining a topic name of the image information according to the audio information continuously collected in the second time period; performing a keyword search on the audio information continuously collected in the first time period according to the topic name of the image information, to determine the audio information that matches the topic name among the audio information continuously collected in the first time period; and establishing, according to the audio information that matches the topic name, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
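  • The keyword-search variant can be illustrated with a minimal sketch. It assumes the collected audio has already been converted into timestamped text segments by some speech-recognition step that the source does not specify; the `Segment` type, the `match_segments` function, and the sample transcript are hypothetical and only illustrate the matching step.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds after the first moment
    end: float
    text: str     # hypothetical transcript of this stretch of audio


def match_segments(segments, topic_name):
    """Return the audio segments whose transcript mentions the topic name."""
    needle = topic_name.casefold()
    return [s for s in segments if needle in s.text.casefold()]


# Hypothetical transcript of the audio collected in the first time period.
segments = [
    Segment(0.0, 30.0, "Welcome, today we review the budget"),
    Segment(30.0, 60.0, "Next slide shows the Roadmap for Q3"),
    Segment(60.0, 90.0, "Any questions about the roadmap?"),
]

# Topic name assumed to have been derived from the audio in the second time period.
matched = match_segments(segments, "roadmap")
```

  • The matched segments are the pieces of audio that would then be linked to the image; how the topic name itself is derived from the audio is left open here, as it is in the text.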
  • storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period includes: storing the audio information continuously collected in the first time period to an audio format storage file; and storing the image information collected in the second time period to an image format storage file, where the audio format storage file and the image format storage file have the correspondence.
  • storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period includes: storing the audio information continuously collected in the first time period, the image information collected in the second time period, and the correspondence to one storage file.
  • the method further includes: receiving a third user instruction, where the third user instruction is used to view the audio information at a fourth moment stored in the storage file, the fourth moment is located within the first time period, and the storage file contains image information that has the correspondence with the audio information at the fourth moment; and presenting, according to the third user instruction, the image information that has the correspondence with the audio information at the fourth moment while presenting the audio information at the fourth moment.
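  • As a rough illustration of this playback step, the sketch below looks up which stored images should be shown while the audio at a given moment is played. The `links` list, its field names, and the file names are hypothetical; the source does not prescribe how the correspondence is represented in the storage file.

```python
def images_at(links, t4):
    """Return the names of images whose linked audio span covers moment t4."""
    return [link["image"] for link in links
            if link["audio_start"] <= t4 <= link["audio_end"]]


# Hypothetical correspondence records read back from a storage file.
links = [
    {"image": "whiteboard.jpg", "audio_start": 595.0, "audio_end": 605.0},
    {"image": "demo.mp4", "audio_start": 1200.0, "audio_end": 1290.0},
]

# The user asks to hear the audio 600 seconds into the recording.
shown = images_at(links, 600.0)
```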
  • the method further includes:
  • a second aspect provides an apparatus for processing audio and image information, comprising: a receiving unit, configured to receive a first user instruction, where the first user instruction is used to indicate that audio information is continuously collected from a first moment; an audio collecting unit, configured to continuously collect, according to the first user instruction received by the receiving unit, audio information during a first time period between the first moment and a second moment, where the first moment is earlier than the second moment; the receiving unit being further configured to receive a second user instruction, where the second user instruction is used to indicate that image information is collected from a third moment; an image collecting unit, configured to collect, according to the second user instruction received by the receiving unit, image information from the third moment while the audio collecting unit continuously collects audio information, and to stop collecting image information when a second time period expires, where the second time period takes the third moment as its starting time, the length of the second time period is less than the length of the first time period, and the second time period is located within the first time period; a correspondence establishing unit, configured to establish a correspondence between the audio information continuously collected by the audio collecting unit in the first time period and the image information collected by the image collecting unit in the second time period; and a storage unit, configured to store, according to the correspondence established by the correspondence establishing unit, the audio information continuously collected by the audio collecting unit in the first time period and the image information collected by the image collecting unit in the second time period.
  • the image information collected by the image acquiring unit in the second time period is continuous frame picture information or single frame picture information.
  • the correspondence establishing unit includes: a first determining unit, configured to determine a third time period according to the third moment, where the third time period includes the second time period and is located within the first time period; an identifier adding unit, configured to add a first identifier to the audio information continuously collected by the audio collecting unit in the third time period determined by the first determining unit, and to add a second identifier to the image information collected by the image collecting unit in the second time period, where the first identifier and the second identifier have a correspondence; and a first establishing unit, configured to establish, according to the first identifier and the second identifier added by the identifier adding unit, the correspondence between the audio information continuously collected by the audio collecting unit in the first time period and the image information collected by the image collecting unit in the second time period.
  • the correspondence establishing unit includes: a second determining unit, configured to determine a topic name of the image information according to the audio information continuously collected by the audio collecting unit in the second time period; a third determining unit, configured to perform, according to the topic name determined by the second determining unit, a keyword search on the audio information continuously collected by the audio collecting unit in the first time period, to determine the audio information that matches the topic name among the audio information continuously collected by the audio collecting unit in the first time period; and a second establishing unit, configured to establish, according to the audio information that is determined by the third determining unit and matches the topic name, the correspondence between the audio information continuously collected by the audio collecting unit in the first time period and the image information collected by the image collecting unit in the second time period.
  • the storage unit includes: a first storage unit, configured to store the audio information continuously collected by the audio collecting unit in the first time period to an audio format storage file; and a second storage unit, configured to store the image information collected by the image collecting unit in the second time period to an image format storage file, where the audio format storage file and the image format storage file have the correspondence established by the correspondence establishing unit.
  • the storage unit is further configured to store, to one storage file, the audio information continuously collected by the audio collecting unit in the first time period, the image information collected by the image collecting unit in the second time period, and the correspondence established by the correspondence establishing unit.
  • the receiving unit is further configured to receive a third user instruction, where the third user instruction is used to view the audio information at a fourth moment stored in the storage file generated by the storage unit, the fourth moment is located within the first time period, and the storage file contains image information that has the correspondence with the audio information at the fourth moment; the apparatus further includes: a first presentation unit, configured to present, according to the third user instruction received by the receiving unit, the image information that has the correspondence with the audio information at the fourth moment while presenting the audio information at the fourth moment.
  • the receiving unit is further configured to receive a fourth user instruction, where the fourth user instruction is used to view the image information at a fifth moment stored in the storage file generated by the storage unit, and the fifth moment is located within the second time period; the apparatus further includes: a second presentation unit, configured to present, according to the fourth user instruction received by the receiving unit, the audio information that is stored in the storage file and has the correspondence with the image information at the fifth moment while presenting the image information at the fifth moment.
  • a third aspect provides a terminal device, including: a receiver, configured to receive a first user instruction, where the first user instruction is used to indicate that audio information is continuously collected from a first moment; a sound recorder, configured to continuously collect, according to the first user instruction received by the receiver, audio information during a first time period between the first moment and a second moment, where the first moment is earlier than the second moment; the receiver being further configured to receive a second user instruction, where the second user instruction is used to indicate that image information is collected from a third moment; a camera, configured to collect, according to the second user instruction received by the receiver, image information from the third moment while the sound recorder continuously collects audio information, and to stop collecting image information when a second time period expires, where the second time period takes the third moment as its starting time, the length of the second time period is less than the length of the first time period, and the second time period is located within the first time period; a processor, configured to establish a correspondence between the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period; and a memory, configured to store, according to the correspondence established by the processor, the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period.
  • the image information collected by the camera in the second time period is continuous frame picture information or single frame picture information.
  • the processor is specifically configured to: determine a third time period according to the third moment, where the third time period includes the second time period and is located within the first time period; add a first identifier to the audio information continuously collected by the sound recorder in the third time period, and add a second identifier to the image information collected by the camera in the second time period, where the first identifier and the second identifier have a correspondence; and establish, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period.
  • the processor is specifically configured to: determine a topic name of the image information according to the audio information continuously collected by the sound recorder in the second time period; perform a keyword search on the audio information continuously collected by the sound recorder in the first time period according to the topic name of the image information, to determine the audio information that matches the topic name among the audio information continuously collected by the sound recorder in the first time period; and establish, according to the audio information that matches the topic name, the correspondence between the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period.
  • the memory is specifically configured to: store the audio information continuously collected by the sound recorder in the first time period to an audio format storage file; and store the image information collected by the camera in the second time period to an image format storage file, where the audio format storage file and the image format storage file have the correspondence established by the processor.
  • the memory is further configured to store, to one storage file, the audio information continuously collected by the sound recorder in the first time period, the image information collected by the camera in the second time period, and the correspondence established by the processor.
  • the receiver is further configured to receive a third user instruction, where the third user instruction is used to view the audio information at a fourth moment stored in the storage file generated by the memory, the fourth moment is located within the first time period, and the storage file contains image information that has the correspondence with the audio information at the fourth moment; the terminal device further includes: a player, configured to present, according to the third user instruction received by the receiver, the image information that has the correspondence with the audio information at the fourth moment while presenting the audio information at the fourth moment.
  • the receiver is further configured to receive a fourth user instruction, where the fourth user instruction is used to view the image information at a fifth moment stored in the storage file generated by the memory, and the fifth moment is located within the second time period; the terminal device further includes: a player, configured to present, according to the fourth user instruction received by the receiver, the audio information that is stored in the storage file and has the correspondence with the image information at the fifth moment while presenting the image information at the fifth moment.
  • the method, apparatus, and terminal device for processing audio and image information according to the embodiments of the present invention continuously collect audio information for a period of time, collect image information at a certain moment or during a short period within that period of time, establish a correspondence between the collected audio information and image information, and store the collected audio information and image information according to the correspondence. Audio information can thus be collected continuously over a long period while image information is collected selectively according to user instructions, which occupies little storage space and restores the conference scene more completely, thereby improving the user experience.
  • FIG. 1 is a schematic flow chart of a method of processing audio and image information according to an embodiment of the present invention.
  • FIG. 2 is a schematic flow chart of another method for processing audio and image information according to an embodiment of the present invention.
  • FIG. 3 is a schematic block diagram of an apparatus for processing audio and image information in accordance with an embodiment of the present invention.
  • the embodiments of the present invention can be applied to recording various conferences or other scenes, and the apparatus for processing audio and image information can be a terminal device having a camera function and a recording function, or any other device that combines recording and photographing/video functions, but the invention is not limited thereto.
  • FIG. 1 shows a schematic flow diagram of a method 100 of processing audio and image information in accordance with an embodiment of the present invention.
  • the method can be performed by any device having a recording function and a photographing/camera function.
  • the device can determine the first time in a variety of ways.
  • the first user instruction may directly indicate a time at which to start collecting audio information, and correspondingly, the device may determine the time indicated in the first user instruction for starting to collect audio information as the first moment.
  • the device may also determine the first moment according to the time at which the first user instruction is received. For example, the device starts to collect audio information immediately after receiving the first user instruction, and correspondingly, the first moment may be the moment when the audio information collecting unit of the device is activated, or the moment when the device detects that the user presses the button or shortcut key corresponding to audio information collection, and so on, which is not limited by the embodiment of the present invention.
  • the audio information is continuously collected in the first time period between the first time and the second time according to the first user instruction, where the first time is earlier than the second time.
  • the device continuously collects audio information during the first time period, and does not collect image information until it detects a user instruction indicating that image information is to be collected.
  • the device can determine the second moment in a plurality of ways, that is, the moment when the audio collection ends.
  • the device may preset a length of time for collecting audio information, and determine the second moment according to the first moment and the preset length of time; or the first user instruction may indicate an end time or a duration of audio information collection, and correspondingly, the device may determine the second moment according to the end time or duration indicated in the first user instruction; or the second moment may be the moment when the device detects a user instruction indicating that audio information collection is to be stopped, for example, the moment when the user again presses the button or shortcut key corresponding to audio information collection, or the moment when the user presses the button or shortcut key corresponding to stopping audio information collection, and so on, which is not limited by the embodiment of the present invention.
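  • The alternatives for determining the second moment can be sketched as a single helper. The text presents them as independent options; the priority order used below (explicit stop command, then end time in the instruction, then duration in the instruction, then preset length) is purely an assumption made for illustration, as are the function and parameter names.

```python
from typing import Optional


def resolve_second_moment(first_moment: float, *,
                          stop_detected_at: Optional[float] = None,
                          end_time: Optional[float] = None,
                          duration: Optional[float] = None,
                          preset_length: Optional[float] = None) -> float:
    """Pick the moment at which audio collection ends (times in seconds)."""
    if stop_detected_at is not None:      # user pressed stop
        second = stop_detected_at
    elif end_time is not None:            # end time given in the instruction
        second = end_time
    elif duration is not None:            # duration given in the instruction
        second = first_moment + duration
    elif preset_length is not None:       # length preset on the device
        second = first_moment + preset_length
    else:
        raise ValueError("no rule available to determine the second moment")
    if second <= first_moment:
        raise ValueError("the first moment must be earlier than the second moment")
    return second


by_duration = resolve_second_moment(10.0, duration=50.0)
by_stop = resolve_second_moment(10.0, preset_length=3600.0, stop_detected_at=100.0)
```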
  • the device can determine the third moment in a variety of ways.
  • the second user instruction may directly indicate the time at which to start collecting image information. Accordingly, the device may determine the time indicated in the second user instruction for starting to collect image information as the third moment.
  • the device may also determine the third moment according to the time at which the second user instruction is received. For example, the device starts to collect image information immediately after receiving the second user instruction, and correspondingly, the third moment may be the moment when the image information collecting unit of the device is activated, or the moment when the device detects that the user presses the button or shortcut key corresponding to image information collection, and so on, which is not limited by the embodiment of the present invention.
  • S140 Collect, according to the second user instruction, image information from the third moment while continuously collecting audio information, and stop collecting image information when the second time period expires, where the second time period takes the third moment as its starting time, the length of the second time period is less than the length of the first time period, and the second time period is located within the first time period.
  • the device begins collecting image information at the start time (that is, the third moment) of the second time period, and stops collecting image information at the termination time of the second time period. If the second time period has expired and the first time period has not, the device collects only audio information, without collecting image information, until it again detects a user instruction indicating that image information is to be collected.
  • the second time period during which the device performs image information collection is shorter than the first time period during which the device performs audio information collection.
  • the length of the second time period may be zero, that is, the second time period is specifically the third moment, for example, the device takes a picture at the third moment; or the length of the second time period may be greater than zero, for example, the device continuously takes a plurality of photos or records a video during the second time period, which is not limited by the embodiment of the present invention.
  • the third time may be equal to the first time, and the ending time of the second time period is earlier than the second time.
  • the device starts collecting audio information and image information at the same moment. After a period of time, the device stops collecting image information and continues to collect audio information.
  • the third moment may also be later than the first moment and earlier than the second moment.
  • in this case, the device may first collect only audio information and, after collecting audio information for a period of time, start collecting image information at the third moment. If the termination time of the second time period is earlier than the second moment, the device stops collecting image information and continues to collect audio information; if the termination time of the second time period is equal to the second moment, the device stops collecting image information and audio information at the same time. Optionally, as another embodiment, the length of the second time period is zero and the third moment is equal to the second moment; in this case, the device collects only audio information during most of the first time period, and collects both image information and audio information at the last moment of the first time period, which is not limited by the embodiment of the present invention.
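  • The timing constraints described above (the first moment precedes the second, the second time period starts at the third moment, lies within the first time period, and is strictly shorter than it, possibly of zero length) can be checked with a short sketch. The function name and argument names are hypothetical.

```python
def check_periods(t1, t2, t3, image_end):
    """Validate the two time periods and return the length of the second one.

    t1, t2: first and second moments (audio collection start and end).
    t3, image_end: start and termination time of image collection.
    """
    if not t1 < t2:
        raise ValueError("the first moment must be earlier than the second moment")
    if image_end < t3:
        raise ValueError("the second time period cannot end before it starts")
    if not (t1 <= t3 and image_end <= t2):
        raise ValueError("the second time period must lie within the first")
    if not (image_end - t3) < (t2 - t1):
        raise ValueError("the second time period must be shorter than the first")
    return image_end - t3


# A 90-second video recorded ten minutes into a one-hour audio recording.
video_length = check_periods(0.0, 3600.0, 600.0, 690.0)
# A single photo taken at the very last moment of the recording (zero length).
photo_length = check_periods(0.0, 3600.0, 3600.0, 3600.0)
```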
  • the device can determine the end time of the second time period in a variety of ways.
  • the termination time of the second time period may be the moment when the device detects a user instruction to stop collecting image information, for example, the moment when the user turns off the image information collecting unit, or the moment when the device detects that the user again presses the button or shortcut key corresponding to image information collection, and so on.
  • the device may take one photo or several consecutive photos starting from the third moment and stop shooting once they have been taken; that is, the termination time of the second time period may be the moment at which the device finishes taking the photo or the consecutive photos, which is not limited by the embodiment of the present invention.
  • the device may also preset a duration for image information collection and start timing from the third moment. If the device has not received a user instruction to stop collecting image information by the time the preset duration expires, the device stops collecting image information, that is, the termination time of the second time period is the termination time of the preset duration. The preset duration may be tens of seconds or several minutes; if the device is provided with a screen sleep period, the preset duration may by default equal the screen sleep period. Alternatively, if the device has not stopped collecting image information before the second moment, the termination time of the second time period may be the moment when the device detects a user instruction to stop collecting audio information; in this case, the termination time of the second time period is the second moment, and the device stops collecting audio information and image information at the same time, which is not limited by the embodiment of the present invention.
  • the device may also preset a critical storage capacity, and stop collecting image information when it detects that the storage space occupied by the collected image information is equal to or close to the critical storage capacity; that is, the termination time of the second time period is the moment when the device detects that the storage space occupied by the collected image information is equal to or close to the critical storage capacity. The preset critical storage capacity may be set according to a user instruction, and may be tens or hundreds of megabytes, or a certain proportion of the currently remaining storage space, which is not limited by the embodiment of the present invention.
  • the device may establish the correspondence between the audio information collected in the first time period and the image information according to the collection times of the image information and the audio information, or according to the content of the collected audio information and image information, which is not limited by the embodiment of the present invention.
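  • The termination conditions for image collection listed above can be combined into one predicate, sketched below. All names are hypothetical, and "equal to or close to the critical storage capacity" is simplified here to a plain greater-than-or-equal comparison, which is an assumption of this sketch.

```python
def should_stop_image_capture(now, t3, *,
                              stop_requested=False,
                              audio_stopped=False,
                              preset_duration=None,
                              used_bytes=0,
                              critical_bytes=None):
    """Return True when image collection should end at time `now` (seconds)."""
    # A user instruction to stop image collection, or to stop audio collection
    # (which ends both kinds of collection at the second moment).
    if stop_requested or audio_stopped:
        return True
    # The preset duration, timed from the third moment, has expired.
    if preset_duration is not None and now - t3 >= preset_duration:
        return True
    # The collected images have reached the critical storage capacity.
    if critical_bytes is not None and used_bytes >= critical_bytes:
        return True
    return False


timed_out = should_stop_image_capture(630.0, 600.0, preset_duration=30.0)
still_running = should_stop_image_capture(
    610.0, 600.0, preset_duration=30.0,
    used_bytes=10_000_000, critical_bytes=50_000_000)
```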
  • S160 Store, according to the correspondence, audio information that is continuously collected in the first time period and image information that is collected in the second time period.
  • the device generates a storage file based on the audio information, the image information, and a correspondence between the two.
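  • One possible shape for such a storage file is sketched below as a JSON manifest that names the audio file, the image files, and the time spans that link them. The manifest layout, field names, and file names are all assumptions for illustration; the source does not specify a file format.

```python
import json


def build_manifest(audio_file, t1, t2, images):
    """Describe one recording: the audio span plus each image and its span."""
    return {
        "audio": {"file": audio_file, "start": t1, "end": t2},
        "images": [
            {"file": name, "start": start, "end": end}
            for name, start, end in images
        ],
    }


manifest = build_manifest(
    "meeting.wav", 0.0, 3600.0,
    [("whiteboard.jpg", 600.0, 600.0), ("demo.mp4", 1200.0, 1290.0)],
)
text = json.dumps(manifest, indent=2)  # what would be written to the storage file
```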
  • the method of processing audio and image information according to the embodiment of the present invention continuously collects audio information for a period of time, collects image information at a certain moment or during a certain short period within that period of time, establishes a correspondence between the collected audio information and image information, and stores the collected audio information and image information according to the correspondence. Audio information can thus be collected continuously over a long period while image information is collected selectively according to user instructions, which occupies less storage space and restores the conference scene more completely, thereby improving the user experience.
  • the image information collected in the second time period may be continuous frame picture information or single frame picture information. If the second user instruction is used to indicate that photos are to be taken, the device may take one or more photos from the third moment; in this case, the images collected by the device are one or more still pictures, that is, the collected image information is single frame picture information. If the second user instruction is used to indicate that video is to be captured, the device may capture video during the second time period; in this case, the images collected by the device are a continuous sequence of still pictures, that is, the collected image information is continuous frame picture information, but the embodiment of the present invention is not limited thereto.
  • the device may further perform noise reduction processing on the collected audio information before storing the collected audio information.
  • since the audio information collected by the device is an analog signal, the device may perform analog-to-digital conversion ("ADC") on the audio information before storing it, to obtain a digital signal, and store the obtained digital signal in S160, but the embodiment of the present invention is not limited thereto.
  • S150 establishing a correspondence between the continuously collected audio information in the first time period and the image information collected in the second time period, including:
  • the third time period may completely coincide with the second time period, or the second time period is a part of the third time period.
  • for example, the third time period may completely coincide with the second time period; if multiple still photos are taken in the second time period, the second time period may be part of the third time period; and if a single still photo is taken, so that the second time period reduces to the third moment, the third time period may be the third moment itself or a period of time that includes the third moment. This is not limited by the embodiment of the present invention.
  • the length of the third time period may be preset. The third time period may take the third moment as its start time and be longer than the second time period; or the third time period and the second time period may have the same termination time while the start time of the third time period is earlier than the third moment; or the third time period may be centered on the third moment, with a termination time no earlier than the termination time of the second time period. This is not limited by the embodiment of the present invention.
  • the device determines that the audio information collected in the third time period corresponds to the image information collected in the second time period, and uses the first identifier and the second identifier to represent this correspondence. The first identifier and the second identifier may be any identifiers that are associated with each other, which is not limited by the embodiment of the present invention.
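The identifier-based implementation of S150 described above can be sketched as follows. This is only an illustration under stated assumptions: the `Period` class, the `third_period` and `tag` helpers, the five-second padding, and the identifier layout are all hypothetical and do not come from the patent text, which leaves the length of the third time period and the form of the identifiers open.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class Period:
    start: float  # seconds since audio collection began
    end: float


def third_period(first: Period, second: Period,
                 before: float = 5.0, after: float = 5.0) -> Period:
    """Widen the second period by (before, after) seconds, clamped to the first period."""
    return Period(max(first.start, second.start - before),
                  min(first.end, second.end + after))


def tag(audio_span: Period, image_name: str, link_id: str) -> Tuple[Dict, Dict]:
    """Attach two associated identifiers: one to the audio span, one to the image."""
    first_identifier = {"id": f"A-{link_id}",
                        "span": (audio_span.start, audio_span.end)}
    second_identifier = {"id": f"I-{link_id}", "image": image_name}
    return first_identifier, second_identifier


first = Period(0.0, 3600.0)        # one hour of continuous audio
second = Period(600.0, 600.0)      # a single photo taken at the third moment
span = third_period(first, second)
audio_tag, image_tag = tag(span, "slide_01.jpg", "0001")
```

Sharing the suffix `0001` between the two identifiers is one simple way to make them "associated"; any other pairing scheme would serve equally well.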
  • S150 establishing a correspondence between the continuously collected audio information in the first time period and the image information collected in the second time period, including:
  • establishing, according to the audio information that matches the subject name, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
  • the device may pop up a dialog box after collecting the image information to prompt the user to input a subject name for the image information, and determine the subject name according to the user's input; or the device may determine the subject name according to the content of the audio information collected in the second time period. This is not limited by the embodiment of the present invention.
  • the device may use a keyword search technology to search for voice information matching the topic name of the image information in the voice information collected in the first time period, and determine the matched voice. There is a correspondence between the information and the image information.
  • for example, when two segments of voice information match the subject name, the device may determine that there is a correspondence between the image information and the two matched segments of voice information.
  • the device may also extend each of the two matched segments of voice information, either by taking the segment as the center and extending a certain length on both sides, or by extending a certain length from the segment in one direction, and determine that there is a correspondence between the image information and the two extended segments of voice information; the embodiment of the invention is not limited thereto.
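The keyword-based implementation of S150 can be illustrated with a short sketch. It assumes that the recorded audio has already been converted into timestamped text segments by some speech-recognition step, which the text above does not specify; the `Segment` layout, the `match_topic` function, and the `margin` parameter are hypothetical names introduced only for this example.

```python
from typing import List, Optional, Tuple

# (start_s, end_s, recognized_text): assumed output of a speech-to-text step
Segment = Tuple[float, float, str]


def match_topic(segments: List[Segment], topic: str,
                margin: float = 0.0,
                total: Optional[float] = None) -> List[Tuple[float, float]]:
    """Return the time spans whose text contains the topic name.

    Each matched span is extended by `margin` seconds on both sides and
    clamped to [0, total] when the total recording length is known.
    """
    spans = []
    for start, end, text in segments:
        if topic.lower() in text.lower():
            lo = max(0.0, start - margin)
            hi = end + margin if total is None else min(total, end + margin)
            spans.append((lo, hi))
    return spans


segments = [
    (0.0, 10.0, "welcome everyone"),
    (30.0, 40.0, "the Budget review starts now"),
    (90.0, 100.0, "back to budget questions"),
]
matched = match_topic(segments, "budget", margin=5.0, total=100.0)
```

Here two segments match, so the image would be associated with both extended spans, mirroring the two-segment case described above.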
  • S160 storing the audio information continuously collected in the first time period and the image information collected in the second time period, including:
  • the audio format storage file and the image format storage file have the corresponding relationship.
  • the audio format storage file is used to store audio information
  • the image format storage file is used to store image information
  • a label may be set in the audio format storage file and the image format storage file to record the correspondence established in S150.
  • the device may also convert the audio information into a text format and store the audio information in a text format, but the embodiment of the present invention is not limited thereto.
  • S160 storing the audio information continuously collected in the first time period and the image information collected in the second time period, including:
  • the audio information continuously collected in the first time period, the image information collected in the second time period, and the corresponding relationship are stored in a storage file.
  • the storage file has a new file format for storing audio information, image information, and a correspondence between the two.
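As a rough illustration of storing the audio information, the image information, and their correspondence in one storage file, the sketch below packs everything into a ZIP container with a JSON manifest. The patent only says that the file has a new format; the ZIP layout, the `manifest.json` name, and the `pack` and `unpack_links` helpers are assumptions made purely for this example.

```python
import io
import json
import zipfile
from typing import Dict, List


def pack(audio: bytes, images: Dict[str, bytes], links: List[dict]) -> bytes:
    """Bundle audio, images, and the correspondence into one in-memory file."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("audio.pcm", audio)
        for name, data in images.items():
            zf.writestr(f"images/{name}", data)
        # The manifest records which audio span corresponds to which image.
        zf.writestr("manifest.json", json.dumps({"links": links}))
    return buf.getvalue()


def unpack_links(blob: bytes) -> List[dict]:
    """Read the correspondence back from a packed storage file."""
    with zipfile.ZipFile(io.BytesIO(blob)) as zf:
        return json.loads(zf.read("manifest.json"))["links"]


links = [{"audio": [595.0, 605.0], "image": "slide_01.jpg"}]
blob = pack(b"\x00\x01\x02", {"slide_01.jpg": b"JPEG-bytes"}, links)
restored = unpack_links(blob)
```

Keeping the correspondence in a separate manifest entry means a player can read the links first and load only the images it needs.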
  • the device may simultaneously display a certain piece of audio information and image information corresponding thereto, but the embodiment of the present invention is not limited thereto.
  • the user can also view the content selectively; for example, the user can view the content related to a time point by dragging the timeline or clicking a specific time point, and can also edit or delete that content. The embodiment of the invention is not limited thereto.
  • the method 100 further includes:
  • while presenting the audio information of the fourth moment, presenting the image information that corresponds to the audio information of the fourth moment.
  • the fourth moment may be located in the third time period or the second time period. If the user clicks a certain moment on the timeline, or the audio playback reaches that moment, the device may display the image information corresponding to the recording at that moment while playing the recording; the embodiment of the present invention is not limited thereto.
  • the method 100 further includes:
  • while presenting the image information of the fifth moment, presenting, from the audio information stored in the storage file, the audio information that corresponds to the image information of the fifth moment.
  • the device may play the audio information corresponding to the picture while the picture is being presented, but the embodiment of the present invention is not limited thereto.
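The two presentation directions described above (from a playback moment to the corresponding images, and from an image to the corresponding audio) amount to lookups in the stored correspondence. A minimal sketch follows, assuming the correspondence is kept as a list of `(audio_start_s, audio_end_s, image_name)` tuples; this layout and both function names are hypothetical.

```python
from typing import List, Optional, Tuple

Link = Tuple[float, float, str]  # (audio_start_s, audio_end_s, image_name)


def images_at(links: List[Link], t: float) -> List[str]:
    """Images to show while the audio at moment t is being played."""
    return [name for start, end, name in links if start <= t <= end]


def audio_for(links: List[Link], image_name: str) -> Optional[Tuple[float, float]]:
    """Audio span to play while the given image is being shown."""
    for start, end, name in links:
        if name == image_name:
            return (start, end)
    return None


links = [(595.0, 605.0, "slide_01.jpg"), (1200.0, 1230.0, "demo.mp4")]
shown = images_at(links, 600.0)
span = audio_for(links, "demo.mp4")
```

A moment that falls outside every linked span simply yields an empty list, in which case only the audio would be played.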
  • Therefore, the method for processing audio and images according to the embodiment of the present invention continuously collects audio information over a period of time, collects image information at a certain moment or during a certain shorter period within that period, establishes a correspondence between the collected audio information and image information, and stores both according to that correspondence. It can thus collect audio continuously over a long period while collecting image information selectively according to user instructions, which occupies less storage space, restores the conference scene more completely, and improves the user experience.
  • the method 200 includes:
  • S210 and S220 can be executed simultaneously. If no user instruction is detected, the audio information collection is continued without image information acquisition, that is, S210 is continued; if a user instruction for instructing to stop collecting audio information is detected, S260 and S280 are performed; if an instruction for instructing acquisition of image information is detected, S230 is performed.
  • the image information may be single frame picture information or continuous frame picture information.
  • S230 and S240 may be executed simultaneously. The preset condition for stopping the collection of image information may include at least one of the following: the duration of the image collection exceeds a preset time threshold, or the storage space occupied by the collected image information exceeds a critical storage capacity. If at least one of these two conditions is satisfied, it is determined that the preset condition for stopping the collection of image information is satisfied; the embodiment of the present invention is not limited thereto.
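The stop condition checked in S240 can be written as a single predicate. In this sketch the default thresholds (30 seconds and 50 MiB) and the function name are illustrative assumptions; the text above only requires that some preset time threshold and critical storage capacity exist.

```python
def should_stop_image_capture(elapsed_s: float, used_bytes: int,
                              max_s: float = 30.0,
                              max_bytes: int = 50 * 1024 * 1024) -> bool:
    """True when at least one of the two preset conditions is satisfied."""
    too_long = elapsed_s > max_s        # duration exceeds the time threshold
    too_big = used_bytes > max_bytes    # storage exceeds the critical capacity
    return too_long or too_big


keep_going = should_stop_image_capture(10.0, 1_000_000)
stop_by_time = should_stop_image_capture(31.0, 1_000_000)
stop_by_size = should_stop_image_capture(10.0, 60 * 1024 * 1024)
```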
  • S250 and S210 may be performed, that is, the collection of image information stops while audio information continues to be collected; alternatively, S260, S270, and S280 are performed, that is, the collection of both the audio information and the image information stops, a correspondence between the collected audio information and image information is established, and a storage file is generated according to the correspondence.
  • the correspondence between the audio information and the image information may be established according to content respectively included in the two.
  • the step may also be performed immediately after the execution of S250. Specifically, if audio information continues to be collected after the collection of image information stops, the correspondence between the previously collected image information and the audio information may be established according to the collection time while audio collection continues; the embodiment of the present invention is not limited thereto.
  • the storage file is used to store the collected audio information, or to store the collected audio information, the image information, and the corresponding relationship between the two, but the embodiment of the present invention is not limited thereto.
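The control flow of method 200 can be approximated by a small event loop. The sketch below is a simplification under several assumptions: audio collection is taken to start at time 0, user instructions arrive as `(time_s, kind)` events with the hypothetical kinds `"capture"` and `"stop_audio"`, and the image timeout is evaluated only when an event arrives, whereas a real device would presumably use a timer.

```python
from typing import Iterable, List, Optional, Tuple


def run_session(events: Iterable[Tuple[float, str]],
                max_image_s: float = 30.0
                ) -> Tuple[Optional[float], List[Tuple[float, float]]]:
    """Return (audio_end_s, image_periods) for a stream of user events."""
    image_periods: List[Tuple[float, float]] = []
    image_start: Optional[float] = None
    audio_end: Optional[float] = None
    for t, kind in events:
        # Close an image capture that has run past the preset duration.
        if image_start is not None and t - image_start >= max_image_s:
            image_periods.append((image_start, image_start + max_image_s))
            image_start = None
        if kind == "capture" and image_start is None:
            image_start = t               # start collecting image information
        elif kind == "stop_audio":
            if image_start is not None:   # stop image collection as well
                image_periods.append((image_start, t))
                image_start = None
            audio_end = t
            break
    return audio_end, image_periods


result = run_session([(10.0, "capture"), (100.0, "capture"), (110.0, "stop_audio")])
```

The returned image periods are exactly the inputs needed to build the correspondence and the storage file in the later steps.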
  • Therefore, the method for processing audio and images according to the embodiment of the present invention continuously collects audio information over a period of time, collects image information at a certain moment or during a certain shorter period within that period, establishes a correspondence between the collected audio information and image information, and stores both according to that correspondence. It can thus collect audio continuously over a long period while collecting image information selectively according to user instructions, which occupies less storage space, restores the conference scene more completely, and improves the user experience.
  • A method of processing audio and image information according to an embodiment of the present invention is described in detail above with reference to FIGS. 1 and 2.
  • An apparatus and a terminal device for processing audio and image information according to embodiments of the present invention are described in detail below with reference to FIGS. 3 and 4.
  • FIG. 3 shows a schematic block diagram of an apparatus 300 for processing audio and image information in accordance with an embodiment of the present invention. As shown in FIG. 3, the apparatus 300 includes:
  • the receiving unit 310 is configured to receive a first user instruction, where the first user instruction is used to indicate that the audio information is continuously collected from the first moment;
  • the audio collection unit 320 is configured to continuously collect audio information in a first time period between the first moment and the second moment according to the first user instruction received by the receiving unit 310, where the first moment is earlier than the second moment;
  • the receiving unit 310 is further configured to receive a second user instruction, where the second user instruction is used to indicate that the image information is collected from the third moment;
  • the image collecting unit 330 is configured to, according to the second user instruction received by the receiving unit 310 and while the audio collecting unit 320 continuously collects audio information, collect image information from the third moment and stop collecting the image information when the second time period expires, wherein the second time period takes the third moment as its start time, the length of the second time period is less than the length of the first time period, and the second time period is located in the first time period;
  • the correspondence relationship establishing unit 340 is configured to establish a correspondence between the audio information continuously collected by the audio collecting unit 320 in the first time period and the image information collected by the image collecting unit 330 in the second time period;
  • the storage unit 350 is configured to store, according to the correspondence established by the correspondence relationship establishing unit 340, the audio information continuously collected by the audio collection unit 320 during the first time period and the image information collected by the image collecting unit 330 during the second time period.
  • Therefore, the apparatus for processing audio and images according to the embodiment of the present invention continuously collects audio information over a period of time, collects image information at a certain moment or during a certain shorter period within that period, establishes a correspondence between the collected audio information and image information, and stores both according to that correspondence. It can thus collect audio continuously over a long period while collecting image information selectively according to user instructions, which occupies less storage space, restores the conference scene more completely, and improves the user experience.
  • the image information collected by the image collection unit 330 in the second time period is continuous frame picture information or single frame picture information.
  • the correspondence establishing unit 340 includes:
  • a first determining unit configured to determine, according to the third time, a third time period, where the third time period includes the second time period and the third time period is located in the first time period;
  • a first establishing unit configured to establish, according to the first identifier and the second identifier added by the identifier adding unit, a correspondence between the audio information continuously collected by the audio collecting unit 320 in the first time period and the image information collected by the image collecting unit 330 in the second time period.
  • a second determining unit configured to determine, according to the audio information continuously collected by the audio collecting unit 320 in the second time period, a subject name of the image information;
  • a third determining unit configured to perform, according to the subject name determined by the second determining unit, a keyword search on the audio information continuously collected by the audio collecting unit 320 in the first time period, so as to determine, within that audio information, the audio information that matches the subject name determined by the second determining unit;
  • a second establishing unit configured to establish, according to the matching audio information determined by the third determining unit, a correspondence between the audio information continuously collected by the audio collecting unit 320 in the first time period and the image information collected by the image collecting unit 330 in the second time period.
  • the storage unit 350 includes:
  • a first storage unit configured to store the audio information continuously collected by the audio collection unit 320 in the first time period to an audio format storage file
  • a second storage unit configured to store the image information collected by the image collection unit 330 in the second time period to an image format storage file
  • the correspondence between the audio format storage file and the image format storage file is established by the correspondence establishing unit 340.
  • the storage unit 350 is further configured to:
  • the receiving unit 310 is further configured to receive a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the storage unit 350, where the fourth moment is located in the first time period and the storage file contains image information corresponding to the audio information of the fourth moment; correspondingly, the device 300 further includes:
  • the first presentation unit is configured to, according to the third user instruction received by the receiving unit 310, present the image information corresponding to the audio information of the fourth moment while presenting the audio information of the fourth moment.
  • the receiving unit 310 is further configured to receive a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the storage unit 350, where the fifth moment is located in the second time period; correspondingly, the device 300 further includes:
  • a second presentation unit configured to, according to the fourth user instruction received by the receiving unit 310, present, while presenting the image information of the fifth moment, the audio information stored in the storage file that corresponds to the image information of the fifth moment.
  • the apparatus 300 for processing audio and image information according to an embodiment of the present invention may correspond to the execution subject of the method of processing audio and image information according to an embodiment of the present invention, and the above-described and other operations and/or functions of the respective modules in the apparatus 300 are respectively intended to implement the corresponding processes of the methods in FIG. 1 and FIG. 2; for brevity, details are not repeated here.
  • Therefore, the apparatus for processing audio and images according to the embodiment of the present invention continuously collects audio information over a period of time, collects image information at a certain moment or during a certain shorter period within that period, establishes a correspondence between the collected audio information and image information, and stores both according to that correspondence. It can thus collect audio continuously over a long period while collecting image information selectively according to user instructions, which occupies less storage space, restores the conference scene more completely, and improves the user experience.
  • the receiver 410 is configured to receive a first user instruction, where the first user instruction is used to indicate that the audio information is continuously collected from the first moment;
  • the recorder 420 is configured to continuously collect audio information in a first time period between the first moment and the second moment according to the first user instruction received by the receiver 410, where the first moment is earlier than the second moment;
  • the receiver 410 is further configured to receive a second user instruction, where the second user instruction is used to indicate that the image information is collected from the third moment;
  • the camera 430 is configured to, according to the second user instruction received by the receiver 410 and while the recorder 420 continuously collects audio information, collect image information from the third moment and stop collecting the image information when the second time period expires, wherein the second time period takes the third moment as its start time, the length of the second time period is less than the length of the first time period, and the second time period is located in the first time period;
  • the processor 440 is configured to establish a correspondence between the audio information continuously collected by the recorder 420 in the first time period and the image information collected by the camera 430 in the second time period;
  • the memory 450 is configured to store, according to the correspondence established by the processor 440, the audio information continuously collected by the recorder 420 during the first time period and the image information collected by the camera 430 during the second time period.
  • Therefore, the terminal device according to the embodiment of the present invention continuously collects audio information over a period of time, collects image information at a certain moment or during a certain shorter period within that period, establishes a correspondence between the collected audio information and image information, and stores both according to that correspondence. It can thus collect audio continuously over a long period while collecting image information selectively according to user instructions, which occupies less storage space, restores the conference scene more completely, and improves the user experience.
  • the image information collected by the camera 430 during the second time period is continuous frame picture information or single frame picture information.
  • the processor 440 is specifically configured to:
  • the processor 440 is specifically configured to:
  • establishing, according to the audio information that matches the subject name, a correspondence between the audio information continuously collected by the recorder 420 during the first time period and the image information collected by the camera 430 during the second time period.
  • the memory 450 is specifically configured to:
  • the correspondence between the audio format storage file and the image format storage file is established by the processor 440.
  • the memory 450 is further configured to:
  • the audio information continuously collected by the recorder 420 during the first time period, the image information collected by the camera 430 during the second time period, and the correspondence established by the processor 440 are stored in a storage file.
  • the receiver 410 is further configured to receive a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the memory 450, where the fourth moment is located in the first time period and the storage file contains image information corresponding to the audio information of the fourth moment; correspondingly, the terminal device 400 further includes:
  • the player is configured to, according to the third user instruction received by the receiver 410, present the image information corresponding to the audio information of the fourth moment while presenting the audio information of the fourth moment.
  • the receiver 410 is further configured to receive a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the memory 450, where the fifth moment is located in the second time period; correspondingly, the terminal device 400 further includes:
  • a player configured to, according to the fourth user instruction received by the receiver 410, present, while presenting the image information of the fifth moment, the audio information stored in the storage file that corresponds to the image information of the fifth moment.
  • the terminal device 400 may correspond to the execution subject of the method of processing audio and image information according to an embodiment of the present invention, and the above-described and other operations and/or functions of the respective modules in the terminal device 400 are respectively intended to implement the corresponding processes of the methods in FIG. 1 and FIG. 2; for brevity, details are not repeated here.
  • Therefore, the terminal device according to the embodiment of the present invention continuously collects audio information over a period of time, collects image information at a certain moment or during a certain shorter period within that period, establishes a correspondence between the collected audio information and image information, and stores both according to that correspondence. It can thus collect audio continuously over a long period while collecting image information selectively according to user instructions, which occupies less storage space, restores the conference scene more completely, and improves the user experience.
  • B corresponding to A means that B is associated with A, and B can be determined according to A.
  • determining B from A does not mean that B is only determined based on A, and that B can also be determined based on A and/or other information.
  • the disclosed systems, devices, and methods can be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the units is only a logical function division.
  • in actual implementation there may be another division manner; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, or an electrical, mechanical or other form of connection.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the embodiments of the present invention.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • the software product includes a number of instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, an optical disk, or any other medium that can store program code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)

Abstract

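A method, apparatus, and terminal device for processing audio and image information. The method includes: receiving a first user instruction, the first user instruction indicating that audio information is to be collected continuously starting from a first moment; continuously collecting audio information, according to the first user instruction, during a first time period between the first moment and a second moment; receiving a second user instruction, the second user instruction indicating that image information is to be collected starting from a third moment; collecting image information, according to the second user instruction, starting from the third moment and stopping when a second time period expires; establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period; and storing the audio information and the image information according to the correspondence. This method can record a conference scene over a relatively long time while restoring the conference scene relatively completely.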

Description

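Method, apparatus, and terminal device for processing audio and image information
This application claims priority to Chinese Patent Application No. 201410243132.7, filed with the Chinese Patent Office on June 3, 2014 and entitled "Method, apparatus, and terminal device for processing audio and image information", which is incorporated herein by reference in its entirety.
Technical Field
The present invention relates to the field of data processing, and in particular to a method, apparatus, and terminal device for processing audio and image information.
Background
As society develops and work grows busier, business people and white-collar workers attend many meetings every day. Sometimes several meetings conflict in time, so a user cannot attend them all and can only entrust someone else to attend a meeting and record it. Typically the delegate records the meeting on video, but a video file occupies a large amount of storage space and recording also consumes a large amount of phone battery power. Because the storage space and battery capacity of current mobile phones are both limited, the delegate can often record video only for a short time. When a meeting lasts a long time, video recording therefore cannot capture the meeting completely, the conference scene cannot be fully restored, and the user experience is poor.
Summary
Embodiments of the present invention provide a data processing method and device for recording information, which can record a conference scene over a relatively long time while restoring the conference scene relatively completely.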
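According to a first aspect, a method of processing audio and image information is provided, including: receiving a first user instruction, where the first user instruction indicates that audio information is to be collected continuously starting from a first moment; continuously collecting audio information, according to the first user instruction, during a first time period between the first moment and a second moment, where the first moment is earlier than the second moment; receiving a second user instruction, where the second user instruction indicates that image information is to be collected starting from a third moment; according to the second user instruction, while audio information is being collected continuously, collecting image information starting from the third moment and stopping the collection of image information when a second time period expires, where the second time period takes the third moment as its start time, is shorter than the first time period, and lies within the first time period; establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period; and storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period.
In a first possible implementation, the image information collected in the second time period is continuous-frame picture information or single-frame picture information.
With reference to the foregoing possible implementation, in a second possible implementation, the establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period includes: determining a third time period according to the third moment, where the third time period includes the second time period and lies within the first time period; adding a first identifier to the audio information continuously collected in the third time period and a second identifier to the image information collected in the second time period, where the first identifier corresponds to the second identifier; and establishing, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
With reference to the foregoing possible implementations, in a third possible implementation, the establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period includes: determining a subject name of the image information according to the audio information continuously collected in the second time period; performing, according to the subject name of the image information, a keyword search on the audio information continuously collected in the first time period, so as to determine the audio information within it that matches the subject name; and establishing, according to the audio information that matches the subject name, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
With reference to the foregoing possible implementations, in a fourth possible implementation, the storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period includes: storing the audio information continuously collected in the first time period in an audio format storage file; and storing the image information collected in the second time period in an image format storage file, where the correspondence exists between the audio format storage file and the image format storage file.
With reference to the foregoing possible implementations, in a fifth possible implementation, the storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period includes: storing the audio information continuously collected in the first time period, the image information collected in the second time period, and the correspondence in one storage file.
With reference to the foregoing possible implementations, in a sixth possible implementation, the method further includes:
receiving a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file, the fourth moment lies within the first time period, and the storage file contains image information corresponding to the audio information of the fourth moment; and, according to the third user instruction, presenting the image information corresponding to the audio information of the fourth moment while presenting the audio information of the fourth moment.
With reference to the foregoing possible implementations, in a seventh possible implementation, the method further includes:
receiving a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file, and the fifth moment lies within the second time period; and, according to the fourth user instruction, while presenting the image information of the fifth moment, presenting the audio information stored in the storage file that corresponds to the image information of the fifth moment.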
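According to a second aspect, an apparatus for processing audio and image information is provided, including: a receiving unit, configured to receive a first user instruction, where the first user instruction indicates that audio information is to be collected continuously starting from a first moment; an audio collection unit, configured to continuously collect audio information, according to the first user instruction received by the receiving unit, during a first time period between the first moment and a second moment, where the first moment is earlier than the second moment; the receiving unit being further configured to receive a second user instruction, where the second user instruction indicates that image information is to be collected starting from a third moment; an image collection unit, configured to, according to the second user instruction received by the receiving unit and while the audio collection unit is continuously collecting audio information, collect image information starting from the third moment and stop collecting image information when a second time period expires, where the second time period takes the third moment as its start time, is shorter than the first time period, and lies within the first time period; a correspondence establishing unit, configured to establish a correspondence between the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period; and a storage unit, configured to store, according to the correspondence established by the correspondence establishing unit, the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period.
In a first possible implementation, the image information collected by the image collection unit in the second time period is continuous-frame picture information or single-frame picture information.
With reference to the foregoing possible implementation, in a second possible implementation, the correspondence establishing unit includes: a first determining unit, configured to determine a third time period according to the third moment, where the third time period includes the second time period and lies within the first time period; an identifier adding unit, configured to add a first identifier to the audio information continuously collected by the audio collection unit in the third time period determined by the first determining unit, and to add a second identifier to the image information collected by the image collection unit in the second time period, where the first identifier corresponds to the second identifier; and a first establishing unit, configured to establish, according to the first identifier and the second identifier added by the identifier adding unit, the correspondence between the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period.
With reference to the foregoing possible implementations, in a third possible implementation, the correspondence establishing unit includes: a second determining unit, configured to determine a subject name of the image information according to the audio information continuously collected by the audio collection unit in the second time period; a third determining unit, configured to perform, according to the subject name determined by the second determining unit, a keyword search on the audio information continuously collected by the audio collection unit in the first time period, so as to determine the audio information within it that matches the subject name determined by the second determining unit; and a second establishing unit, configured to establish, according to the matching audio information determined by the third determining unit, the correspondence between the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period.
With reference to the foregoing possible implementations, in a fourth possible implementation, the storage unit includes: a first storage unit, configured to store the audio information continuously collected by the audio collection unit in the first time period in an audio format storage file; and a second storage unit, configured to store the image information collected by the image collection unit in the second time period in an image format storage file, where the correspondence established by the correspondence establishing unit exists between the audio format storage file and the image format storage file.
With reference to the foregoing possible implementations, in a fifth possible implementation, the storage unit is further configured to store the audio information continuously collected by the audio collection unit in the first time period, the image information collected by the image collection unit in the second time period, and the correspondence established by the correspondence establishing unit in one storage file.
With reference to the foregoing possible implementations, in a sixth possible implementation, the receiving unit is further configured to receive a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the storage unit, the fourth moment lies within the first time period, and the storage file contains image information corresponding to the audio information of the fourth moment; and the apparatus further includes a first presentation unit, configured to, according to the third user instruction received by the receiving unit, present the image information corresponding to the audio information of the fourth moment while presenting the audio information of the fourth moment.
With reference to the foregoing possible implementations, in a seventh possible implementation, the receiving unit is further configured to receive a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the storage unit, and the fifth moment lies within the second time period; and the apparatus further includes a second presentation unit, configured to, according to the fourth user instruction received by the receiving unit and while presenting the image information of the fifth moment, present the audio information stored in the storage file that corresponds to the image information of the fifth moment.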
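According to a third aspect, a terminal device is provided, including: a receiver, configured to receive a first user instruction, where the first user instruction indicates that audio information is to be collected continuously starting from a first moment; a recorder, configured to continuously collect audio information, according to the first user instruction received by the receiver, during a first time period between the first moment and a second moment, where the first moment is earlier than the second moment; the receiver being further configured to receive a second user instruction, where the second user instruction indicates that image information is to be collected starting from a third moment; a camera, configured to, according to the second user instruction received by the receiver and while the recorder is continuously collecting audio information, collect image information starting from the third moment and stop collecting image information when a second time period expires, where the second time period takes the third moment as its start time, is shorter than the first time period, and lies within the first time period; a processor, configured to establish a correspondence between the audio information continuously collected by the recorder in the first time period and the image information collected by the camera in the second time period; and a memory, configured to store, according to the correspondence established by the processor, the audio information continuously collected by the recorder in the first time period and the image information collected by the camera in the second time period.
In a first possible implementation, the image information collected by the camera in the second time period is continuous-frame picture information or single-frame picture information.
With reference to the foregoing possible implementation, in a second possible implementation, the processor is specifically configured to: determine a third time period according to the third moment, where the third time period includes the second time period and lies within the first time period; add a first identifier to the audio information continuously collected by the recorder in the third time period and a second identifier to the image information collected by the camera in the second time period, where the first identifier corresponds to the second identifier; and establish, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected by the recorder in the first time period and the image information collected by the camera in the second time period.
With reference to the foregoing possible implementations, in a third possible implementation, the processor is specifically configured to: determine a subject name of the image information according to the audio information continuously collected by the recorder in the second time period; perform, according to the subject name of the image information, a keyword search on the audio information continuously collected by the recorder in the first time period, so as to determine the audio information within it that matches the subject name; and establish, according to the audio information that matches the subject name, the correspondence between the audio information continuously collected by the recorder in the first time period and the image information collected by the camera in the second time period.
With reference to the foregoing possible implementations, in a fourth possible implementation, the memory is specifically configured to: store the audio information continuously collected by the recorder in the first time period in an audio format storage file; and store the image information collected by the camera in the second time period in an image format storage file, where the correspondence established by the processor exists between the audio format storage file and the image format storage file.
With reference to the foregoing possible implementations, in a fifth possible implementation, the memory is further configured to store the audio information continuously collected by the recorder in the first time period, the image information collected by the camera in the second time period, and the correspondence established by the processor in one storage file.
With reference to the foregoing possible implementations, in a sixth possible implementation, the receiver is further configured to receive a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the memory, the fourth moment lies within the first time period, and the storage file contains image information corresponding to the audio information of the fourth moment; and the terminal device further includes a player, configured to, according to the third user instruction received by the receiver, present the image information corresponding to the audio information of the fourth moment while presenting the audio information of the fourth moment.
With reference to the foregoing possible implementations, in a seventh possible implementation, the receiver is further configured to receive a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the memory, and the fifth moment lies within the second time period; and the terminal device further includes a player, configured to, according to the fourth user instruction received by the receiver and while presenting the image information of the fifth moment, present the audio information stored in the storage file that corresponds to the image information of the fifth moment.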
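Based on the foregoing technical solutions, in the method, apparatus, and terminal device for processing audio and image information in the embodiments of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.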
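Brief Description of Drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the accompanying drawings required by the embodiments are briefly introduced below. Evidently, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1 is a schematic flowchart of a method for processing audio and image information according to an embodiment of the present invention.
FIG. 2 is a schematic flowchart of another method for processing audio and image information according to an embodiment of the present invention.
FIG. 3 is a schematic block diagram of an apparatus for processing audio and image information according to an embodiment of the present invention.
FIG. 4 is a schematic block diagram of a terminal device according to an embodiment of the present invention.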
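Detailed Description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Evidently, the described embodiments are some rather than all of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort shall fall within the protection scope of the present invention.
It should be understood that the embodiments of the present invention can be applied to recording various conferences or other scenes. The apparatus for processing audio and image information may be a terminal device that has a camera function and a sound recording function, or any other apparatus that has both a sound recording function and a photographing/video recording function, but the present invention is not limited thereto.
FIG. 1 is a schematic flowchart of a method 100 for processing audio and image information according to an embodiment of the present invention. The method may be executed by any apparatus that has a sound recording function and a photographing/video recording function.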
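S110: Receive a first user instruction, where the first user instruction instructs continuous collection of audio information from a first moment.
The apparatus may determine the first moment in several ways. Optionally, the first user instruction may directly indicate the moment at which to start collecting audio information; in that case the apparatus may use the moment indicated in the first user instruction as the first moment. Optionally, in another embodiment, if the first user instruction does not directly indicate that moment, the apparatus may determine the first moment from the moment at which it receives the first user instruction. For example, the apparatus starts collecting audio information immediately after receiving the first user instruction; the first moment may then be the moment at which the audio collection unit of the apparatus starts, or the moment at which the apparatus detects that the user presses a button or shortcut key corresponding to audio collection, and so on. This is not limited in this embodiment of the present invention.
S120: According to the first user instruction, continuously collect audio information in a first time period between the first moment and a second moment, where the first moment is earlier than the second moment.
The apparatus collects audio information continuously and without interruption in the first time period, and does not collect image information until it detects a user instruction that instructs collection of image information.
The apparatus may determine the second moment, that is, the moment at which audio collection ends, in several ways. For example, the apparatus may preset a duration for collecting audio information and determine the second moment from the first moment and the preset duration. Alternatively, the first user instruction may indicate the end moment or the duration of audio collection, and the apparatus may determine the second moment from the end moment or duration indicated in the first user instruction. Alternatively, the second moment may be the moment at which the apparatus detects a user instruction that instructs it to stop collecting audio information, for example, the moment at which the user again presses the button or shortcut key corresponding to audio collection, or presses a button or shortcut key corresponding to stopping audio collection, and so on. This is not limited in this embodiment of the present invention.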
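The alternatives for fixing the second moment can be sketched as a small resolver. This is a minimal illustration rather than part of the disclosure: the name `resolve_second_moment`, its parameters, and the rule that the earliest applicable end moment wins are all assumptions made for the example, since the text lists the alternatives without ranking them.

```python
from typing import Optional


def resolve_second_moment(
    first_moment: float,
    instructed_end: Optional[float] = None,       # end moment carried by the first user instruction
    instructed_duration: Optional[float] = None,  # duration carried by the first user instruction
    preset_duration: Optional[float] = None,      # duration preset on the apparatus
    stop_detected_at: Optional[float] = None,     # moment a stop instruction was detected
) -> Optional[float]:
    """Return the second moment in seconds, or None while capture is open-ended.

    Assumption of this sketch: when several sources apply, the earliest wins.
    """
    candidates = []
    if instructed_end is not None:
        candidates.append(instructed_end)
    if instructed_duration is not None:
        candidates.append(first_moment + instructed_duration)
    if preset_duration is not None:
        candidates.append(first_moment + preset_duration)
    if stop_detected_at is not None:
        candidates.append(stop_detected_at)
    return min(candidates) if candidates else None
```

With only a preset duration, the second moment is simply the first moment plus that duration; a stop instruction detected earlier would end the first time period sooner.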
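S130: Receive a second user instruction, where the second user instruction instructs collection of image information from a third moment.
The apparatus may determine the third moment in several ways. Optionally, the second user instruction may directly indicate the moment at which to start collecting image information; in that case the apparatus may use the moment indicated in the second user instruction as the third moment. Optionally, in another embodiment, if the second user instruction does not directly indicate that moment, the apparatus may determine the third moment from the moment at which it receives the second user instruction. For example, the apparatus starts collecting image information immediately after receiving the second user instruction; the third moment may then be the moment at which the image collection unit of the apparatus starts, or the moment at which the apparatus detects that the user presses a button or shortcut key corresponding to image collection, and so on. This is not limited in this embodiment of the present invention.
S140: According to the second user instruction, while continuously collecting audio information, collect image information from the third moment and stop collecting image information when a second time period expires, where the second time period starts at the third moment, is shorter than the first time period, and lies within the first time period.
The apparatus starts collecting image information at the start of the second time period (that is, at the third moment) and stops collecting image information at the end of the second time period. If the second time period has expired but the first time period has not, the apparatus collects only audio information and no image information until it again detects a user instruction that instructs collection of image information.
The second time period, in which the apparatus collects image information, is shorter than the first time period, in which it collects audio information. Optionally, the length of the second time period may be zero, that is, the second time period is just the third moment; for example, the apparatus takes one photo at the third moment. Alternatively, the length of the second time period may be greater than zero; for example, the apparatus takes several photos in succession or records a video in the second time period. This is not limited in this embodiment of the present invention.
Optionally, the third moment may equal the first moment, with the second time period ending earlier than the second moment. In this case the apparatus starts collecting audio information and image information at the same moment, stops collecting image information after a while, and continues to collect audio information. Optionally, in another embodiment, the third moment may be later than the first moment and earlier than the second moment. In this case the apparatus first collects audio information only and, after a while, collects audio information and image information simultaneously. If the second time period ends earlier than the second moment, the apparatus continues to collect audio information after it stops collecting image information; if the second time period ends at the second moment, the apparatus stops collecting image information and audio information at the same time. Optionally, in yet another embodiment, the length of the second time period is zero and the third moment equals the second moment. In this case the apparatus collects only audio information for almost all of the first time period and collects both image information and audio information at its final moment. This is not limited in this embodiment of the present invention.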
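The timing constraints stated for S120 and S140 can be checked mechanically. The following is a minimal sketch under stated assumptions: the `Period` type and the function `is_valid_capture` are hypothetical names introduced here, and moments are modelled as seconds on a common clock.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Period:
    start: float  # start moment, in seconds
    end: float    # end moment, in seconds

    @property
    def length(self) -> float:
        return self.end - self.start


def is_valid_capture(first: Period, second: Period) -> bool:
    """Check the constraints on the audio period (first) and image period (second)."""
    return (
        first.start < first.end            # first moment is earlier than second moment
        and second.start <= second.end     # zero length is allowed (a single photo)
        and second.length < first.length   # image period is shorter than audio period
        and first.start <= second.start    # image period lies within
        and second.end <= first.end        # the audio period
    )
```

A single photo taken at the very end of a one-hour recording, `Period(3600.0, 3600.0)` inside `Period(0.0, 3600.0)`, passes the check, which matches the last case described above.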
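The apparatus may determine the end of the second time period in several ways. Optionally, the end of the second time period may be the moment at which the apparatus detects a user instruction that instructs it to stop collecting image information; for example, the moment at which the user turns off the image collection unit, or the moment at which the apparatus detects that the user again presses the button or shortcut key corresponding to image collection, and so on. Optionally, in another embodiment, if the second user instruction instructs the apparatus to take one photo or several photos in succession, the apparatus may start taking the photo or photos at the third moment and stop once they have been taken; that is, the end of the second time period may be the moment at which the apparatus finishes taking the photo or the successive photos. This is not limited in this embodiment of the present invention.
Optionally, in another embodiment, the apparatus may preset a duration for image collection and start timing from the third moment. If the apparatus has not received a user instruction to stop collecting image information when the preset duration expires, it stops collecting image information; that is, the second time period ends when the preset duration ends. The preset duration may be tens of seconds or several minutes; if the apparatus has a screen sleep time set, the preset duration may default to the length of the screen sleep time. Alternatively, if the apparatus does not detect a user instruction to stop collecting image information before the second moment, the second time period may end at the moment at which the apparatus detects a user instruction to stop collecting audio information. In this case the second time period ends at the second moment, and the apparatus stops collecting audio information and image information at the same time. This is not limited in this embodiment of the present invention.
Optionally, in another embodiment, the apparatus may preset a critical storage capacity and stop collecting image information when it detects that the storage space occupied by the collected image information equals or approaches the critical storage capacity; that is, the second time period ends at the moment of that detection. The preset critical storage capacity may be set according to a user instruction, and may be tens or hundreds of megabytes or a certain proportion of the currently remaining storage space. This is not limited in this embodiment of the present invention.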
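The alternatives for ending the second time period (an explicit stop instruction, a preset duration, or a critical storage capacity) can be combined into one predicate. This is a sketch only: the function name, the parameter names, and the default limits of 60 seconds and 200 MiB are illustrative assumptions, not values fixed by the text.

```python
from typing import Optional


def should_stop_image_capture(
    elapsed_s: float,                              # time since the third moment
    used_bytes: int,                               # storage taken by collected image information
    stop_requested: bool,                          # user instructed to stop image collection
    max_duration_s: Optional[float] = 60.0,        # assumed preset duration
    max_bytes: Optional[int] = 200 * 1024 * 1024,  # assumed critical storage capacity
) -> bool:
    """Return True as soon as any configured stop condition holds."""
    if stop_requested:
        return True
    if max_duration_s is not None and elapsed_s >= max_duration_s:
        return True
    if max_bytes is not None and used_bytes >= max_bytes:
        return True
    return False
```

Passing `None` for a limit disables that condition, which corresponds to embodiments that rely on the user instruction alone.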
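S150: Establish a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
The apparatus may establish the correspondence between the audio information and the image information collected in the first time period according to the collection times of the image information and the audio information, or according to the content of the collected audio information and image information. This is not limited in this embodiment of the present invention.
S160: According to the correspondence, store the audio information continuously collected in the first time period and the image information collected in the second time period.
The apparatus generates a storage file according to the audio information, the image information, and the correspondence between them.
Therefore, in the method for processing audio and image information according to this embodiment of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.
Optionally, the image information collected in the second time period may be continuous-frame picture information or single-frame picture information. If the second user instruction instructs photographing, the apparatus may take one or more photos starting from the third moment; the collected images are then one or more still pictures, that is, the collected image information is single-frame picture information. If the second user instruction instructs video recording, the apparatus may record a video in the second time period; the collected images are then a continuous sequence of still pictures, that is, the collected image information is continuous-frame picture information. This embodiment of the present invention is not limited thereto.
Optionally, in another embodiment, to further improve user experience, the apparatus may perform noise reduction on the collected audio information before storing it. In addition, because the audio information collected by the apparatus is an analog signal, the apparatus may also perform analog-to-digital conversion (ADC) on the audio information before storing it, to obtain a digital signal, and store the obtained digital signal in S160. This embodiment of the present invention is not limited thereto.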
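Analog-to-digital conversion is normally performed by hardware, but its final quantization step can be illustrated in software. The sketch below assumes, purely for illustration, that the sampled analog amplitudes are already normalized to the range [-1.0, 1.0] and that the target is 16-bit signed PCM; neither choice comes from the text, and `quantize_pcm16` is a hypothetical name.

```python
from typing import Iterable, List


def quantize_pcm16(samples: Iterable[float]) -> List[int]:
    """Map normalized amplitudes in [-1.0, 1.0] to 16-bit signed integers."""
    out = []
    for x in samples:
        x = max(-1.0, min(1.0, x))         # clip out-of-range amplitudes
        out.append(int(round(x * 32767)))  # scale to the symmetric 16-bit range
    return out
```

Out-of-range input is clipped rather than wrapped, so an overloaded sample saturates at full scale instead of flipping sign.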
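In an optional embodiment, S150, establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period, includes:
determining a third time period according to the third moment, where the third time period includes the second time period and lies within the first time period;
adding a first identifier to the audio information continuously collected in the third time period, and adding a second identifier to the image information collected in the second time period, where the first identifier corresponds to the second identifier; and
establishing, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
Specifically, the third time period may coincide exactly with the second time period, or the second time period may be a part of the third time period. If the apparatus records a video in the second time period, the third time period may coincide exactly with the second time period. If several still photos are taken in the second time period, the second time period may be a part of the third time period. If one still photo is taken in the second time period, that is, the second time period is the third moment, the third time period may be the third moment itself or a period of time that includes the third moment. This is not limited in this embodiment of the present invention.
The length of the third time period may be preset. The third time period may start at the third moment and be longer than the second time period; or it may end at the same moment as the second time period but start earlier than the third moment; or it may be centered on the third moment and end at or after the end of the second time period. This is not limited in this embodiment of the present invention.
The apparatus determines that the audio information collected in the third time period corresponds to the image information collected in the second time period, and uses the first identifier and the second identifier to represent this correspondence. The first identifier and the second identifier may be any identifiers that are associated with each other. This is not limited in this embodiment of the present invention.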
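The time-based way of establishing the correspondence can be sketched as follows: widen the second time period by some padding, clamp the result to the first time period to obtain the third time period, and tag the audio span and the image with a pair of associated identifiers. The padding of 5 seconds on each side, the `A-`/`I-` identifier scheme, and all function names are assumptions made for this illustration.

```python
from typing import Dict, Tuple

Span = Tuple[float, float]  # (start moment, end moment), in seconds


def third_period(first: Span, second: Span,
                 pad_before: float = 5.0, pad_after: float = 5.0) -> Span:
    """Widen the image period and clamp it to the audio period."""
    start = max(first[0], second[0] - pad_before)
    end = min(first[1], second[1] + pad_after)
    return (start, end)


def make_identifiers(link_id: int, audio_span: Span,
                     image_name: str) -> Tuple[Dict, Dict]:
    """Build a first identifier (audio) and a second identifier (image).

    The shared link_id is what associates the two identifiers.
    """
    first_identifier = {"id": f"A-{link_id}", "audio_span": audio_span}
    second_identifier = {"id": f"I-{link_id}", "image": image_name}
    return first_identifier, second_identifier
```

For a photo taken 600 seconds into a one-hour recording, `third_period((0.0, 3600.0), (600.0, 600.0))` yields the audio span from 595 to 605 seconds, which contains the second time period and stays inside the first.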
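Optionally, in another embodiment, S150, establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period, includes:
determining a topic name of the image information according to the audio information continuously collected in the second time period;
performing, according to the topic name of the image information, a keyword search on the audio information continuously collected in the first time period, to determine the audio information that matches the topic name; and
establishing, according to the audio information that matches the topic name, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
Specifically, after collecting the image information, the apparatus may pop up a dialog box that prompts the user to enter a topic name for the image information, and determine the topic name from the user input; alternatively, the apparatus may determine the topic name from the content of the audio information collected in the second time period. This is not limited in this embodiment of the present invention. After determining the topic name, the apparatus may use a keyword search technique to search the voice information collected in the first time period for voice information that matches the topic name, and determine that the matching voice information corresponds to the image information. For example, if the topic name of the image information is "statistical function" and the apparatus finds two places in the collected voice information that match "statistical function", the apparatus may determine that the image information corresponds to both matching voice segments. Optionally, the apparatus may also take each matching voice segment as a center and extend it by a certain length on both sides, or take each matching voice segment as a start and extend it backward by a certain length, and determine that the image information corresponds to the two extended voice segments. This embodiment of the present invention is not limited thereto.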
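The content-based variant can be sketched once the audio has been turned into timestamped text. The example below assumes that a speech recognizer, which is outside the scope of this sketch, has already produced `(start, end, text)` segments; it then performs a plain substring search for the topic name and widens each hit on both sides. The `Segment` layout, the function name `match_topic`, and the 10-second padding are illustrative assumptions.

```python
from typing import List, Tuple

Segment = Tuple[float, float, str]  # (start_s, end_s, recognized text)
Span = Tuple[float, float]


def match_topic(segments: List[Segment], topic: str,
                first_period: Span, pad: float = 10.0) -> List[Span]:
    """Return audio spans whose text contains the topic name.

    Each matching span is extended by `pad` seconds on both sides and
    clamped to the first time period.
    """
    spans = []
    for start, end, text in segments:
        if topic in text:
            spans.append((max(first_period[0], start - pad),
                          min(first_period[1], end + pad)))
    return spans
```

With the topic name "statistical function" and two matching segments, the function returns two extended spans, and the image information would be linked to both.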
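Optionally, in another embodiment, S160, storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period, includes:
storing the audio information continuously collected in the first time period into an audio-format storage file; and
storing the image information collected in the second time period into an image-format storage file,
where the audio-format storage file and the image-format storage file have the correspondence.
Specifically, the audio-format storage file is used to store audio information, the image-format storage file is used to store image information, and tags may be set in the audio-format storage file and the image-format storage file to record the correspondence established in S150. Optionally, the apparatus may also convert the audio information into text and store the audio information in text form. This embodiment of the present invention is not limited thereto.
Optionally, in another embodiment, S160, storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period, includes:
storing the audio information continuously collected in the first time period, the image information collected in the second time period, and the correspondence into one storage file.
Specifically, the storage file has a new file format that stores audio information, image information, and the correspondence between them. When the user views the content of the storage file, the apparatus may present a segment of audio information together with the image information that corresponds to it. This embodiment of the present invention is not limited thereto. Optionally, the user may also view the content selectively; for example, the user may drag the timeline or click a specific time point to view the content related to that time point, and may edit or delete that content. This embodiment of the present invention is not limited thereto.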
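The text does not specify the layout of the single storage file, so the following is only one possible sketch: a ZIP archive built with Python's standard `zipfile` module that holds the audio data, the images, and a JSON list describing the correspondence. The entry names `audio.pcm`, `images/...`, and `links.json` are hypothetical choices made for this example.

```python
import io
import json
import zipfile
from typing import Dict, List


def pack_record(audio: bytes, images: Dict[str, bytes], links: List[dict]) -> bytes:
    """Pack audio, images, and their correspondence into one in-memory archive."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("audio.pcm", audio)
        for name, data in images.items():
            zf.writestr(f"images/{name}", data)
        zf.writestr("links.json", json.dumps(links))
    return buf.getvalue()


def unpack_links(blob: bytes) -> List[dict]:
    """Read the correspondence back from a packed record."""
    with zipfile.ZipFile(io.BytesIO(blob)) as zf:
        return json.loads(zf.read("links.json"))
```

Keeping the correspondence as a separate entry means a viewer can read it first and then fetch only the images linked to the part of the timeline the user selects.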
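Optionally, in another embodiment, the method 100 further includes:
receiving a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file, the fourth moment lies within the first time period, and the storage file contains image information that has a correspondence with the audio information of the fourth moment; and
presenting, according to the third user instruction, the image information that has a correspondence with the audio information of the fourth moment while presenting the audio information of the fourth moment.
Specifically, the fourth moment may lie within the third time period or the second time period. If the user clicks a moment on the timeline, or the audio information plays continuously up to that moment, the apparatus may present the image information that corresponds to the recording of that moment while playing the recording of that moment. This embodiment of the present invention is not limited thereto.
Optionally, in another embodiment, the method 100 further includes:
receiving a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file, and the fifth moment lies within the second time period; and
presenting, according to the fourth user instruction, the audio information that is stored in the storage file and has a correspondence with the image information of the fifth moment while presenting the image information of the fifth moment.
Specifically, if the user clicks a picture, the apparatus may play the audio information that corresponds to the picture while presenting the picture. This embodiment of the present invention is not limited thereto.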
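Both viewing directions reduce to lookups over the stored correspondence. The sketch below assumes, for illustration only, that the correspondence is a list of dictionaries with an `image` name and an `audio_span` of `[start, end]` seconds; this record layout and the function names are not taken from the text.

```python
from typing import List, Tuple


def images_at(links: List[dict], moment: float) -> List[str]:
    """Audio to image: images whose linked audio span contains the moment."""
    return [link["image"] for link in links
            if link["audio_span"][0] <= moment <= link["audio_span"][1]]


def audio_spans_for(links: List[dict], image: str) -> List[Tuple[float, float]]:
    """Image to audio: all audio spans linked to the given image."""
    return [tuple(link["audio_span"]) for link in links
            if link["image"] == image]
```

A player could call `images_at` whenever playback reaches a new position or the user clicks the timeline, and call `audio_spans_for` when the user clicks a picture. One picture may map to several spans, as in the keyword-search embodiment.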
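Therefore, in the method for processing audio and image information according to this embodiment of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.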
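FIG. 2 is a schematic flowchart of a method 200 for processing audio and image information according to another embodiment of the present invention. As shown in FIG. 2, the method 200 includes:
S210: Collect audio information.
Before a user instruction that instructs collection of image information is detected, only audio information is collected and no image information is collected.
S220: Detect a user instruction.
S210 and S220 may be performed simultaneously. If no user instruction is detected, audio collection continues without image collection, that is, S210 continues to be performed. If a user instruction to stop collecting audio information is detected, S260 and S280 are performed. If a user instruction to collect image information is detected, S230 is performed.
S230: Collect image information and audio information simultaneously.
The image information may be single-frame picture information or continuous-frame picture information.
S240: Detect a user instruction, or determine whether a preset condition for stopping image collection is met.
S230 and S240 may be performed simultaneously. The preset condition for stopping image collection may include at least one of the following: whether the duration of image collection exceeds a preset time threshold, and whether the storage space occupied by the collected image information exceeds the critical storage capacity. If at least one of these two conditions is met, the preset condition for stopping image collection is determined to be met; this embodiment of the present invention is not limited thereto. If the preset condition is met, or a user instruction to stop collecting image information is detected, S250 and S210 may be performed, that is, image collection stops while audio collection continues. If a user instruction to stop collecting audio information is detected, S250, S260, S270, and S280 are performed, that is, collection of audio information and image information stops, the correspondence between the collected audio information and image information is established, and a storage file is generated according to the correspondence.
S250: Stop collecting image information.
S260: Stop collecting audio information.
S270: Establish the correspondence between the collected audio information and image information.
The correspondence between the audio information and the image information may be established according to the content of each. Optionally, this step may also be performed immediately after S250; specifically, if audio collection continues after image collection stops, the correspondence between the previously collected image information and audio information may be established according to the collection times while audio collection continues. This embodiment of the present invention is not limited thereto.
S280: Generate a storage file.
Specifically, the storage file is used to store the collected audio information, or to store the collected audio information, the image information, and the correspondence between them. This embodiment of the present invention is not limited thereto.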
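The control flow of steps S210 to S280 can be read as a small state machine. The sketch below is one interpretation of the branches described for S220 and S240; the `State` and `Event` enumerations and the `step` function are hypothetical names, and the returned step labels simply echo which steps the text says are performed on each branch.

```python
from enum import Enum, auto
from typing import List, Tuple


class State(Enum):
    AUDIO_ONLY = auto()       # S210: only audio is being collected
    AUDIO_AND_IMAGE = auto()  # S230: audio and image are collected together
    DONE = auto()             # collection finished, storage file generated


class Event(Enum):
    NONE = auto()           # nothing detected
    START_IMAGE = auto()    # user instruction to collect image information
    STOP_IMAGE = auto()     # user instruction to stop collecting image information
    LIMIT_REACHED = auto()  # preset time threshold or storage capacity exceeded
    STOP_AUDIO = auto()     # user instruction to stop collecting audio information


def step(state: State, event: Event) -> Tuple[State, List[str]]:
    """Return the next state and the step labels performed for this event."""
    if state is State.AUDIO_ONLY:
        if event is Event.START_IMAGE:
            return State.AUDIO_AND_IMAGE, ["S230"]
        if event is Event.STOP_AUDIO:
            return State.DONE, ["S260", "S280"]
        return State.AUDIO_ONLY, ["S210"]
    if state is State.AUDIO_AND_IMAGE:
        if event in (Event.STOP_IMAGE, Event.LIMIT_REACHED):
            return State.AUDIO_ONLY, ["S250", "S210"]
        if event is Event.STOP_AUDIO:
            return State.DONE, ["S250", "S260", "S270", "S280"]
        return State.AUDIO_AND_IMAGE, ["S230"]
    return State.DONE, []
```

Because image collection always returns to the audio-only state rather than ending the session, the user can trigger image collection several times within one recording.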
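It should be understood that, in the embodiments of the present invention, the sequence numbers of the foregoing processes do not imply an order of execution. The order in which the processes are executed should be determined by their functions and internal logic, and the sequence numbers should not limit the implementation of the embodiments of the present invention in any way.
Therefore, in the method for processing audio and image information according to this embodiment of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.
The method for processing audio and image information according to the embodiments of the present invention has been described in detail above with reference to FIG. 1 and FIG. 2. The apparatus for processing audio and image information and the terminal device according to the embodiments of the present invention are described in detail below with reference to FIG. 3 and FIG. 4.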
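FIG. 3 is a schematic block diagram of an apparatus 300 for processing audio and image information according to an embodiment of the present invention. As shown in FIG. 3, the apparatus 300 includes:
a receiving unit 310, configured to receive a first user instruction, where the first user instruction instructs continuous collection of audio information from a first moment;
an audio collection unit 320, configured to continuously collect audio information, according to the first user instruction received by the receiving unit 310, in a first time period between the first moment and a second moment, where the first moment is earlier than the second moment;
the receiving unit 310 being further configured to receive a second user instruction, where the second user instruction instructs collection of image information from a third moment;
an image collection unit 330, configured to, according to the second user instruction received by the receiving unit 310, collect image information from the third moment while the audio collection unit 320 continuously collects audio information, and stop collecting image information when a second time period expires, where the second time period starts at the third moment, is shorter than the first time period, and lies within the first time period;
a correspondence establishing unit 340, configured to establish a correspondence between the audio information continuously collected by the audio collection unit 320 in the first time period and the image information collected by the image collection unit 330 in the second time period; and
a storage unit 350, configured to store, according to the correspondence established by the correspondence establishing unit 340, the audio information continuously collected by the audio collection unit 320 in the first time period and the image information collected by the image collection unit 330 in the second time period.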
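Therefore, in the apparatus for processing audio and image information according to this embodiment of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.
Optionally, the image information collected by the image collection unit 330 in the second time period is continuous-frame picture information or single-frame picture information.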
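Optionally, in another embodiment, the correspondence establishing unit 340 includes:
a first determining unit, configured to determine a third time period according to the third moment, where the third time period includes the second time period and lies within the first time period;
an identifier adding unit, configured to add a first identifier to the audio information continuously collected by the audio collection unit 320 in the third time period determined by the first determining unit, and add a second identifier to the image information collected by the image collection unit 330 in the second time period, where the first identifier corresponds to the second identifier; and
a first establishing unit, configured to establish, according to the first identifier and the second identifier added by the identifier adding unit, the correspondence between the audio information continuously collected by the audio collection unit 320 in the first time period and the image information collected by the image collection unit 330 in the second time period.
Optionally, in another embodiment, the correspondence establishing unit 340 includes:
a second determining unit, configured to determine a topic name of the image information according to the audio information continuously collected by the audio collection unit 320 in the second time period;
a third determining unit, configured to perform, according to the topic name determined by the second determining unit, a keyword search on the audio information continuously collected by the audio collection unit 320 in the first time period, to determine the audio information that matches the topic name determined by the second determining unit; and
a second establishing unit, configured to establish, according to the audio information that is determined by the third determining unit and matches the topic name, the correspondence between the audio information continuously collected by the audio collection unit 320 in the first time period and the image information collected by the image collection unit 330 in the second time period.
Optionally, in another embodiment, the storage unit 350 includes:
a first storage unit, configured to store the audio information continuously collected by the audio collection unit 320 in the first time period into an audio-format storage file; and
a second storage unit, configured to store the image information collected by the image collection unit 330 in the second time period into an image-format storage file,
where the audio-format storage file and the image-format storage file have the correspondence established by the correspondence establishing unit 340.
Optionally, in another embodiment, the storage unit 350 is further configured to:
store the audio information continuously collected by the audio collection unit 320 in the first time period, the image information collected by the image collection unit 330 in the second time period, and the correspondence established by the correspondence establishing unit 340 into one storage file.
Optionally, in another embodiment, the receiving unit 310 is further configured to receive a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the storage unit 350, the fourth moment lies within the first time period, and the storage file contains image information that has a correspondence with the audio information of the fourth moment. Correspondingly, the apparatus 300 further includes:
a first presentation unit, configured to present, according to the third user instruction received by the receiving unit 310, the image information that has a correspondence with the audio information of the fourth moment while presenting the audio information of the fourth moment.
Optionally, in another embodiment, the receiving unit 310 is further configured to receive a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the storage unit 350, and the fifth moment lies within the second time period. Correspondingly, the apparatus 300 further includes:
a second presentation unit, configured to present, according to the fourth user instruction received by the receiving unit 310, the audio information that is stored in the storage file and has a correspondence with the image information of the fifth moment while presenting the image information of the fifth moment.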
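The apparatus 300 for processing audio and image information according to this embodiment of the present invention may correspond to the executing entity of the method for processing audio and image information according to the embodiments of the present invention, and the foregoing and other operations and/or functions of the modules in the apparatus 300 are respectively intended to implement the corresponding procedures of the methods in FIG. 1 and FIG. 2. For brevity, details are not repeated here.
Therefore, in the apparatus for processing audio and image information according to this embodiment of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.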
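FIG. 4 shows a terminal device 400 according to an embodiment of the present invention. As shown in FIG. 4, the terminal device 400 includes:
a receiver 410, configured to receive a first user instruction, where the first user instruction instructs continuous collection of audio information from a first moment;
a sound recorder 420, configured to continuously collect audio information, according to the first user instruction received by the receiver 410, in a first time period between the first moment and a second moment, where the first moment is earlier than the second moment;
the receiver 410 being further configured to receive a second user instruction, where the second user instruction instructs collection of image information from a third moment;
a camera 430, configured to, according to the second user instruction received by the receiver 410, collect image information from the third moment while the sound recorder 420 continuously collects audio information, and stop collecting image information when a second time period expires, where the second time period starts at the third moment, is shorter than the first time period, and lies within the first time period;
a processor 440, configured to establish a correspondence between the audio information continuously collected by the sound recorder 420 in the first time period and the image information collected by the camera 430 in the second time period; and
a memory 450, configured to store, according to the correspondence established by the processor 440, the audio information continuously collected by the sound recorder 420 in the first time period and the image information collected by the camera 430 in the second time period.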
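Therefore, in the terminal device according to this embodiment of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.
Optionally, the image information collected by the camera 430 in the second time period is continuous-frame picture information or single-frame picture information.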
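Optionally, in another embodiment, the processor 440 is specifically configured to:
determine a third time period according to the third moment, where the third time period includes the second time period and lies within the first time period;
add a first identifier to the audio information continuously collected by the sound recorder 420 in the third time period, and add a second identifier to the image information collected by the camera 430 in the second time period, where the first identifier corresponds to the second identifier; and
establish, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected by the sound recorder 420 in the first time period and the image information collected by the camera 430 in the second time period.
Optionally, in another embodiment, the processor 440 is specifically configured to:
determine a topic name of the image information according to the audio information continuously collected by the sound recorder 420 in the second time period;
perform, according to the topic name of the image information, a keyword search on the audio information continuously collected by the sound recorder 420 in the first time period, to determine the audio information that matches the topic name; and
establish, according to the audio information that matches the topic name, the correspondence between the audio information continuously collected by the sound recorder 420 in the first time period and the image information collected by the camera 430 in the second time period.
Optionally, in another embodiment, the memory 450 is specifically configured to:
store the audio information continuously collected by the sound recorder 420 in the first time period into an audio-format storage file; and
store the image information collected by the camera 430 in the second time period into an image-format storage file,
where the audio-format storage file and the image-format storage file have the correspondence established by the processor 440.
Optionally, in another embodiment, the memory 450 is further configured to:
store the audio information continuously collected by the sound recorder 420 in the first time period, the image information collected by the camera 430 in the second time period, and the correspondence established by the processor 440 into one storage file.
Optionally, in another embodiment, the receiver 410 is further configured to receive a third user instruction, where the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the memory 450, the fourth moment lies within the first time period, and the storage file contains image information that has a correspondence with the audio information of the fourth moment. Correspondingly, the terminal device 400 further includes:
a player, configured to present, according to the third user instruction received by the receiver 410, the image information that has a correspondence with the audio information of the fourth moment while presenting the audio information of the fourth moment.
Optionally, in another embodiment, the receiver 410 is further configured to receive a fourth user instruction, where the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the memory 450, and the fifth moment lies within the second time period. Correspondingly, the terminal device 400 further includes:
a player, configured to present, according to the fourth user instruction received by the receiver 410, the audio information that is stored in the storage file and has a correspondence with the image information of the fifth moment while presenting the image information of the fifth moment.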
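The terminal device 400 according to this embodiment of the present invention may correspond to the executing entity of the method for processing audio and image information according to the embodiments of the present invention, and the foregoing and other operations and/or functions of the modules in the terminal device 400 are respectively intended to implement the corresponding procedures of the methods in FIG. 1 and FIG. 2. For brevity, details are not repeated here.
Therefore, in the terminal device according to this embodiment of the present invention, audio information is collected continuously over a period of time, image information is collected at a moment or during a shorter time period within that period, a correspondence between the collected audio information and image information is established, and the collected audio information and image information are stored according to the correspondence. Image information can thus be collected selectively according to user instructions while audio information is collected continuously over a long period, so that less storage space is occupied, the conference scene is restored relatively completely, and user experience is improved.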
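In addition, the term "and/or" in this document merely describes an association between associated objects and indicates that three relationships may exist. For example, A and/or B may indicate three cases: only A exists, both A and B exist, and only B exists. The character "/" in this document generally indicates an "or" relationship between the associated objects.
It should be understood that, in the embodiments of the present invention, "B corresponding to A" indicates that B is associated with A and that B can be determined according to A. It should also be understood, however, that determining B according to A does not mean that B is determined according to A only; B may also be determined according to A and/or other information.
A person of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described with reference to the embodiments disclosed herein can be implemented by electronic hardware, computer software, or a combination of the two. To clearly illustrate the interchangeability of hardware and software, the composition and steps of each example have been described above generally in terms of function. Whether these functions are performed by hardware or software depends on the particular application and design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each particular application, but such implementation should not be considered to go beyond the scope of the present invention.
A person skilled in the art will clearly understand that, for convenience and brevity of description, for the specific working processes of the systems, apparatuses, and units described above, reference may be made to the corresponding processes in the foregoing method embodiments; details are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into units is merely a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and components displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions in the embodiments of the present invention.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist alone physically, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the present invention essentially, or the part contributing to the prior art, or all or part of the technical solutions, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present invention. The foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The foregoing descriptions are merely specific implementations of the present invention, but the protection scope of the present invention is not limited thereto. Any equivalent modification or replacement readily conceived by a person skilled in the art within the technical scope disclosed in the present invention shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.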

Claims (24)

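  1. A method for processing audio and image information, comprising:
    receiving a first user instruction, wherein the first user instruction instructs continuous collection of audio information from a first moment;
    continuously collecting audio information, according to the first user instruction, in a first time period between the first moment and a second moment, wherein the first moment is earlier than the second moment;
    receiving a second user instruction, wherein the second user instruction instructs collection of image information from a third moment;
    according to the second user instruction, while continuously collecting audio information, collecting image information from the third moment and stopping collecting image information when a second time period expires, wherein the second time period starts at the third moment, the length of the second time period is less than the length of the first time period, and the second time period lies within the first time period;
    establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period; and
    storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period.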
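  2. The method according to claim 1, wherein the image information collected in the second time period is continuous-frame picture information or single-frame picture information.
  3. The method according to claim 1 or 2, wherein the establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period comprises:
    determining a third time period according to the third moment, wherein the third time period includes the second time period and lies within the first time period;
    adding a first identifier to the audio information continuously collected in the third time period, and adding a second identifier to the image information collected in the second time period, wherein the first identifier corresponds to the second identifier; and
    establishing, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.
  4. The method according to claim 1 or 2, wherein the establishing a correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period comprises:
    determining a topic name of the image information according to the audio information continuously collected in the second time period;
    performing, according to the topic name of the image information, a keyword search on the audio information continuously collected in the first time period, to determine the audio information that matches the topic name; and
    establishing, according to the audio information that matches the topic name, the correspondence between the audio information continuously collected in the first time period and the image information collected in the second time period.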
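  5. The method according to any one of claims 1 to 4, wherein the storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period comprises:
    storing the audio information continuously collected in the first time period into an audio-format storage file; and
    storing the image information collected in the second time period into an image-format storage file,
    wherein the audio-format storage file and the image-format storage file have the correspondence.
  6. The method according to any one of claims 1 to 4, wherein the storing, according to the correspondence, the audio information continuously collected in the first time period and the image information collected in the second time period comprises:
    storing the audio information continuously collected in the first time period, the image information collected in the second time period, and the correspondence into one storage file.
  7. The method according to any one of claims 1 to 6, wherein the method further comprises:
    receiving a third user instruction, wherein the third user instruction is used to view audio information of a fourth moment stored in the storage file, the fourth moment lies within the first time period, and the storage file contains image information that has a correspondence with the audio information of the fourth moment; and
    presenting, according to the third user instruction, the image information that has a correspondence with the audio information of the fourth moment while presenting the audio information of the fourth moment.
  8. The method according to any one of claims 1 to 7, wherein the method further comprises:
    receiving a fourth user instruction, wherein the fourth user instruction is used to view image information of a fifth moment stored in the storage file, and the fifth moment lies within the second time period; and
    presenting, according to the fourth user instruction, the audio information that is stored in the storage file and has a correspondence with the image information of the fifth moment while presenting the image information of the fifth moment.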
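  9. An apparatus for processing audio and image information, comprising:
    a receiving unit, configured to receive a first user instruction, wherein the first user instruction instructs continuous collection of audio information from a first moment;
    an audio collection unit, configured to continuously collect audio information, according to the first user instruction received by the receiving unit, in a first time period between the first moment and a second moment, wherein the first moment is earlier than the second moment;
    the receiving unit being further configured to receive a second user instruction, wherein the second user instruction instructs collection of image information from a third moment;
    an image collection unit, configured to, according to the second user instruction received by the receiving unit, collect image information from the third moment while the audio collection unit continuously collects audio information, and stop collecting image information when a second time period expires, wherein the second time period starts at the third moment, the length of the second time period is less than the length of the first time period, and the second time period lies within the first time period;
    a correspondence establishing unit, configured to establish a correspondence between the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period; and
    a storage unit, configured to store, according to the correspondence established by the correspondence establishing unit, the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period.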
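  10. The apparatus according to claim 9, wherein the image information collected by the image collection unit in the second time period is continuous-frame picture information or single-frame picture information.
  11. The apparatus according to claim 9 or 10, wherein the correspondence establishing unit comprises:
    a first determining unit, configured to determine a third time period according to the third moment, wherein the third time period includes the second time period and lies within the first time period;
    an identifier adding unit, configured to add a first identifier to the audio information continuously collected by the audio collection unit in the third time period determined by the first determining unit, and add a second identifier to the image information collected by the image collection unit in the second time period, wherein the first identifier corresponds to the second identifier; and
    a first establishing unit, configured to establish, according to the first identifier and the second identifier added by the identifier adding unit, the correspondence between the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period.
  12. The apparatus according to claim 9 or 10, wherein the correspondence establishing unit comprises:
    a second determining unit, configured to determine a topic name of the image information according to the audio information continuously collected by the audio collection unit in the second time period;
    a third determining unit, configured to perform, according to the topic name determined by the second determining unit, a keyword search on the audio information continuously collected by the audio collection unit in the first time period, to determine the audio information that matches the topic name determined by the second determining unit; and
    a second establishing unit, configured to establish, according to the audio information that is determined by the third determining unit and matches the topic name, the correspondence between the audio information continuously collected by the audio collection unit in the first time period and the image information collected by the image collection unit in the second time period.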
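  13. The apparatus according to any one of claims 9 to 12, wherein the storage unit comprises:
    a first storage unit, configured to store the audio information continuously collected by the audio collection unit in the first time period into an audio-format storage file; and
    a second storage unit, configured to store the image information collected by the image collection unit in the second time period into an image-format storage file,
    wherein the audio-format storage file and the image-format storage file have the correspondence established by the correspondence establishing unit.
  14. The apparatus according to any one of claims 9 to 12, wherein the storage unit is further configured to:
    store the audio information continuously collected by the audio collection unit in the first time period, the image information collected by the image collection unit in the second time period, and the correspondence established by the correspondence establishing unit into one storage file.
  15. The apparatus according to any one of claims 9 to 14, wherein the receiving unit is further configured to receive a third user instruction, the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the storage unit, the fourth moment lies within the first time period, and the storage file contains image information that has a correspondence with the audio information of the fourth moment; and
    the apparatus further comprises:
    a first presentation unit, configured to present, according to the third user instruction received by the receiving unit, the image information that has a correspondence with the audio information of the fourth moment while presenting the audio information of the fourth moment.
  16. The apparatus according to any one of claims 9 to 15, wherein the receiving unit is further configured to receive a fourth user instruction, the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the storage unit, and the fifth moment lies within the second time period; and
    the apparatus further comprises:
    a second presentation unit, configured to present, according to the fourth user instruction received by the receiving unit, the audio information that is stored in the storage file and has a correspondence with the image information of the fifth moment while presenting the image information of the fifth moment.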
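  17. A terminal device, comprising:
    a receiver, configured to receive a first user instruction, wherein the first user instruction instructs continuous collection of audio information from a first moment;
    a sound recorder, configured to continuously collect audio information, according to the first user instruction received by the receiver, in a first time period between the first moment and a second moment, wherein the first moment is earlier than the second moment;
    the receiver being further configured to receive a second user instruction, wherein the second user instruction instructs collection of image information from a third moment;
    a camera, configured to, according to the second user instruction received by the receiver, collect image information from the third moment while the sound recorder continuously collects audio information, and stop collecting image information when a second time period expires, wherein the second time period starts at the third moment, the length of the second time period is less than the length of the first time period, and the second time period lies within the first time period;
    a processor, configured to establish a correspondence between the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period; and
    a memory, configured to store, according to the correspondence established by the processor, the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period.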
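  18. The terminal device according to claim 17, wherein the image information collected by the camera in the second time period is continuous-frame picture information or single-frame picture information.
  19. The terminal device according to claim 17 or 18, wherein the processor is specifically configured to:
    determine a third time period according to the third moment, wherein the third time period includes the second time period and lies within the first time period;
    add a first identifier to the audio information continuously collected by the sound recorder in the third time period, and add a second identifier to the image information collected by the camera in the second time period, wherein the first identifier corresponds to the second identifier; and
    establish, according to the first identifier and the second identifier, the correspondence between the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period.
  20. The terminal device according to claim 17 or 18, wherein the processor is specifically configured to:
    determine a topic name of the image information according to the audio information continuously collected by the sound recorder in the second time period;
    perform, according to the topic name of the image information, a keyword search on the audio information continuously collected by the sound recorder in the first time period, to determine the audio information that matches the topic name; and
    establish, according to the audio information that matches the topic name, the correspondence between the audio information continuously collected by the sound recorder in the first time period and the image information collected by the camera in the second time period.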
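  21. The terminal device according to any one of claims 17 to 20, wherein the memory is specifically configured to:
    store the audio information continuously collected by the sound recorder in the first time period into an audio-format storage file; and
    store the image information collected by the camera in the second time period into an image-format storage file,
    wherein the audio-format storage file and the image-format storage file have the correspondence established by the processor.
  22. The terminal device according to any one of claims 17 to 20, wherein the memory is further configured to:
    store the audio information continuously collected by the sound recorder in the first time period, the image information collected by the camera in the second time period, and the correspondence established by the processor into one storage file.
  23. The terminal device according to any one of claims 17 to 22, wherein the receiver is further configured to receive a third user instruction, the third user instruction is used to view audio information of a fourth moment stored in the storage file generated by the memory, the fourth moment lies within the first time period, and the storage file contains image information that has a correspondence with the audio information of the fourth moment; and
    the terminal device further comprises:
    a player, configured to present, according to the third user instruction received by the receiver, the image information that has a correspondence with the audio information of the fourth moment while presenting the audio information of the fourth moment.
  24. The terminal device according to any one of claims 17 to 23, wherein the receiver is further configured to receive a fourth user instruction, the fourth user instruction is used to view image information of a fifth moment stored in the storage file generated by the memory, and the fifth moment lies within the second time period; and
    the terminal device further comprises:
    a player, configured to present, according to the fourth user instruction received by the receiver, the audio information that is stored in the storage file and has a correspondence with the image information of the fifth moment while presenting the image information of the fifth moment.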
PCT/CN2015/072903, priority date 2014-06-03, filing date 2015-02-12: Method, apparatus, and terminal device for processing audio and image information, WO2015184861A1 (zh)

Applications Claiming Priority (2)

Application Number: CN201410243132.7; Priority Date: 2014-06-03
Application Number: CN201410243132.7A (CN104023176B, zh); Priority Date: 2014-06-03; Filing Date: 2014-06-03; Title: Method, apparatus, and terminal device for processing audio and image information

Publications (1)

Publication Number: WO2015184861A1 (zh); Publication Date: 2015-12-10


Also Published As

Publication number Publication date
CN104023176A (zh) 2014-09-03
CN104023176B (zh) 2017-07-14

Similar Documents

Publication Publication Date Title
WO2015184861A1 (zh) 处理音频和图像信息的方法、装置和终端设备
WO2017092360A1 (zh) 多媒体播放时的交互方法及装置
WO2015196584A1 (zh) 一种智能录制系统
WO2014154003A1 (zh) 一种自拍图像的展现方法及装置
CN104317932A (zh) 照片分享方法及装置
JP6474393B2 (ja) 顔アルバムに基づく音楽再生方法、装置および端末デバイス
WO2018095252A1 (zh) 视频录制方法及装置
CN103428555A (zh) 一种多媒体文件的合成方法、系统及应用方法
WO2013191899A1 (en) Enhancing captured data
WO2017063133A1 (zh) 一种拍摄方法和移动设备
US11588938B2 (en) Systems and methods for curation and delivery of content for use in electronic calls
JP2009033351A (ja) 記録再生装置及び方法
CN106875968B (zh) 信息采集的方法、客户端及系统
CN112487958A (zh) 手势控制方法及系统
WO2014110055A1 (en) Mixed media communication
JP6214762B2 (ja) 画像検索システム、検索画面表示方法
CN108881766B (zh) 视频处理方法、装置、终端和存储介质
US20160261828A1 (en) Method, Device, and System for Multipoint Video Communication
JP2016063477A (ja) 会議システム、情報処理方法、及びプログラム
JP2014204411A (ja) 会議記録システム、会議記録装置、会議記録再生方法およびコンピュータプログラム
JP6457943B2 (ja) 特定の画像のグループ化システム及び方法
WO2022057773A1 (zh) 图像存储的方法、装置、计算机设备和存储介质
CN115550678A (zh) 直播视频处理方法、装置及存储介质
US20150054909A1 (en) Data processing method and device
CN114257770A (zh) 对课堂教学录制电脑画面的方法、电子设备和存储介质
