KR20160119218A - Sound image playing method and device - Google Patents

Sound image playing method and device Download PDF

Info

Publication number
KR20160119218A
KR20160119218A KR1020167024888A KR20167024888A KR20160119218A KR 20160119218 A KR20160119218 A KR 20160119218A KR 1020167024888 A KR1020167024888 A KR 1020167024888A KR 20167024888 A KR20167024888 A KR 20167024888A KR 20160119218 A KR20160119218 A KR 20160119218A
Authority
KR
South Korea
Prior art keywords
sound
image
channel information
sound channel
sound image
Prior art date
Application number
KR1020167024888A
Other languages
Korean (ko)
Inventor
신신 리
수 첸
Original Assignee
후아웨이 테크놀러지 컴퍼니 리미티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 후아웨이 테크놀러지 컴퍼니 리미티드 filed Critical 후아웨이 테크놀러지 컴퍼니 리미티드
Publication of KR20160119218A publication Critical patent/KR20160119218A/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/04Synchronising
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/14Picture signal circuitry for video frequency region
    • H04N5/144Movement detection
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/60Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N9/00Details of colour television systems
    • H04N9/79Processing of colour television signals in connection with recording
    • H04N9/80Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
    • H04N9/802Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback involving processing of the sound signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Stereophonic System (AREA)

Abstract

The present invention relates to the field of multimedia. A sound image reproduction method and device capable of reproducing the original three-dimensional effect of an arbitrary number of sound images corresponding to an image are disclosed. A particular solution is to obtain image location information, the image location information corresponding to an image of one of the at least one image and used to indicate the spatial location of the corresponding image in the first frame picture; Acquiring a set of sound channel information in accordance with image position information, the set of sound channel information including at least one sound channel information, each sound channel information in at least one sound channel information including at least one of at least one sound channel A sound channel information set corresponding to image location information; And reproducing a sound image according to the sound channel information set, wherein the sound image corresponds to the image. Embodiments of the present invention are used to reproduce a sound image.

Description

TECHNICAL FIELD [0001] The present invention relates to a sound image playback method,

Priority is claimed on Chinese patent application No. 201410438159.1, filed on August 29, 2014, entitled " SOUND IMAGE PLAY METHOD AND APPARATUS, " which is hereby incorporated by reference in its entirety.

Technical field

BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to the field of multimedia, and more particularly, to a method and apparatus for reproducing a sound image.

As people's living standards continue to improve, there is an increasing demand for playback of audio and video files, and thus different types of sound image playback devices are emerging. One of the main functions of the sound image reproducing apparatus is to reproduce sound images in audio and video files. By way of example, by using a sound image reproduction device such as a television set as an example, two loudspeakers are arranged below the screens for most common television sets, in order to reproduce sound images in audio and video files; Loudspeakers are placed on both sides of the screens for some typical television sets. In the case of a television set in which two loudspeakers are placed under the screen, when the screen grows larger, the audience clearly feels that the sound is coming from the central portion below the screen, which weakens the original stereo effect of the sound image corresponding to the image . However, in the case of a television set in which the loudspeakers are installed on both sides and below the screen, the stereo position is one-dimensional, only the left and right sounds can be effectively distinguished, and the ability to distinguish between the upper and lower sounds is weak. These deficiencies are more evident for the increasingly popular large-screen television sets.

For a defect that a conventional sound image reproducing apparatus easily weakens the original stereo effect of a sound image corresponding to an image, several technical solutions have been made, one of which is sliding type loudspeakers using a guide level, And controls the movement of the loudspeakers according to the position of the main sound source in the picture of the display. It is implemented that the positions of the loudspeakers reproducing the sound image correspond exactly to the position of the main sound source in the picture of the display so that the original stereo effect of the sound image corresponding to the image is reliably reproduced. However, moving the loudspeakers according to the image positions using the guide rails complicates the structure of the sound image reproducing apparatus, has a great demand for component flexibility and material durability, increases the cost, and lowers the feasibility.

Another technical solution is to control the sound generation of the loudspeakers in the top, bottom, left and right of the display plane according to the sound image position information of the main sound source parsed from the audio information, thereby reproducing the original stereo effect of the sound image corresponding to the image. However, in the case of a technique of carrying sound image position information by audio information, there is no general standard, and furthermore, not all audio information carries sound image position information, It does not apply. In addition, in this solution, only one single sound image can be reproduced, multiple sound images can not be reproduced at the same time, and therefore this solution can be applied to an application capable of reproducing the original stereo effect of a sound image corresponding to the image The number of scenarios is more limited.

Conventional technical solutions require the use of complex mechanical structures and technical solutions to reproduce the original stereo effect of the sound image corresponding to the image or require audio information to carry the sound image position information, It is possible to reproduce only the stereo effect of, and it is not advantageous for improving the technique.

SUMMARY OF THE INVENTION

Embodiments of the present invention are capable of reproducing the original stereo effects of any number of sound images corresponding to an image without requiring complex mechanical structures and technical solutions and without requiring audio information to carry sound image position information A method and apparatus for reproducing a sound image are provided, which is advantageous for enhancing technology.

To achieve the above objects, embodiments of the present invention utilize the following technical solutions.

According to a first aspect,

Wherein the image position information corresponds to an image of one of the at least one image and the image position information includes a spatial position within the first frame picture of the image corresponding to the image position information Used to direct -;

Acquiring a set of sound channel information according to the image position information, wherein the set of sound channel information includes at least one sound channel information, and each sound channel information in the at least one sound channel information includes at least one sound channel The sound channel information set corresponding to one of the sound channel information, the sound channel information set corresponding to the image position information; And

Reproducing a sound image in accordance with the sound channel information set, the sound image corresponding to the image,

A sound image reproducing method is provided.

In a first possible implementation, in relation to the first aspect, prior to said step of acquiring image position information,

Acquiring first frame picture data of the first frame picture

Further comprising:

The step of acquiring the image position information may be concretely

Identifying the image position information from the first frame picture according to the first frame picture data

.

In a second possible implementation, with respect to the first aspect or the first possible implementation, prior to said step of reproducing a sound image according to said sound channel information set,

Acquiring sound image data of the sound image

Further comprising:

Wherein the step of reproducing a sound image in accordance with the set of sound channel information comprises:

Reproducing the sound image according to the sound image data and according to the sound channel information set

.

In a third possible implementation, with respect to the first and second possible implementations, prior to said step of acquiring sound image data of said sound image,

Obtaining first frame audio data of a first frame audio, wherein the first frame audio corresponds to the first frame picture;

Further comprising:

Wherein the step of acquiring sound image data of the sound image comprises:

Identifying the sound image data of the sound image from the first frame audio data

.

In a fourth possible implementation, with respect to the first aspect and the second or third possible implementation, the first frame picture comprises at least two images, and the at least two images comprise a first image and a second image, 2 image, the first image corresponding to a first sound image, the second image corresponding to a second sound image,

Wherein the step of reproducing a sound image in accordance with the set of sound channel information comprises:

Reproducing the first sound image according to a first set of sound channel information; And

Reproducing the second sound image according to a second sound channel information set

.

With respect to the first and fourth possible implementations, in a fifth possible implementation, the first image corresponds to first image position information, the second image corresponds to second image position information, Wherein the first image position information corresponds to the first set of sound channel information and the second image position information corresponds to the second set of sound channel information,

Wherein the step of reproducing a sound image in accordance with the set of sound channel information comprises:

Obtaining a set of consonant sound channel information according to the first set of sound channel information and the second set of sound channel information, wherein the sound channel information in the set of consonant sound channel information includes a set of the first sound channel information, Contained in both sets of channel information; And

Reproducing the first sound image and the second sound image according to a predetermined rule and in accordance with the set of matching sound channel information

.

Regarding the first or fifth possible implementation, in a sixth possible implementation, the first sound image and the second sound image are reproduced in accordance with a predetermined rule and according to the set of matching sound channel information Prior to this step,

Obtaining first sound image data and second sound image data, the first sound image data corresponding to the first sound image and the second sound image data corresponding to the second sound image; And

Mixing the first sound image data and the second sound image data to obtain corresponding sound image data

Further comprising:

Wherein the step of reproducing the first sound image and the second sound image according to a preset rule and according to the set of matching sound channel information comprises:

Reproducing the first sound image and the second sound image in accordance with the matching sound image data and according to the matching sound channel information set

.

In a seventh possible implementation, with respect to any of the first and fourth to sixth possible implementations, before said step of reproducing said first sound image according to said first set of sound channel information, The method

Obtaining a first distinct sound channel information set in accordance with the first sound channel information set and the second sound channel information set, wherein the sound channel information in the first distinct sound channel information set is stored in the first sound channel information set But not included in the second sound channel information set -

Further comprising:

Wherein the step of reproducing the first sound image in accordance with the first set of sound channel information comprises:

Reproducing the first sound image according to the first distinct sound channel information set

.

In a eighth possible implementation, with regard to either the first aspect or the first to seventh possible implementations, the method is applied to a sound image reproducing apparatus, wherein the sound image reproducing apparatus comprises at least one loudspeaker Each loudspeaker of said at least one loudspeaker corresponding to a sound channel of one of said at least one sound channel;

Wherein the step of reproducing a sound image in accordance with the set of sound channel information comprises:

Driving the at least one loudspeaker to reproduce the sound image according to the set of sound channel information

.

According to a second aspect,

An acquisition unit configured to acquire image position information, the image position information corresponding to an image of one of the at least one image, the image position information corresponding to the image position information within the first frame picture Used to indicate spatial location -;

A channel unit configured to acquire a set of sound channel information according to the image position information acquired by the acquisition unit, the set of sound channel information including at least one sound channel information, each of the at least one sound channel information Wherein the sound channel information of the at least one sound channel corresponds to one of the at least one sound channel and the sound channel information set corresponds to the image position information; And

A playback unit configured to play a sound image in accordance with the set of sound channel information acquired by the channel unit, the sound image corresponding to the image,

A sound image reproducing apparatus is provided.

With respect to the second aspect, in a first possible implementation, the acquisition unit is further configured to acquire first frame picture data of the first frame picture,

The acquisition unit is configured to acquire the image position information. Specifically,

And the acquisition unit is configured to identify the image position information from the first frame picture in accordance with the first frame picture data acquired by the acquisition unit.

In a second possible implementation, with respect to the second aspect or the first possible implementation, the acquisition unit is further configured to acquire sound image data of the sound image,

The reproduction unit is configured to reproduce a sound image according to the sound channel information set acquired by the channel unit,

And the playback unit is configured to play the sound image according to the sound image data acquired by the acquisition unit and in accordance with the sound channel information set.

With respect to the second aspect and the second possible implementation, in a third possible implementation, the acquisition unit is further configured to acquire first frame audio data of the first frame audio, Corresponding to the first frame picture,

The acquisition unit is further configured to acquire sound image data of the sound image,

And the acquisition unit is configured to identify the sound image data of the sound image from the first frame audio data acquired by the acquisition unit.

With respect to the second aspect and the second or third possible implementation, in a fourth possible implementation, the first frame picture comprises at least two images, and the at least two images comprise a first image and a second image, 2 image, the first image corresponding to a first sound image, the second image corresponding to a second sound image,

Wherein the reproducing unit is configured to reproduce a sound image according to the sound channel information set acquired by the acquiring unit,

The reproducing unit being specifically configured to reproduce the first sound image in accordance with a first sound channel information set obtained by the obtaining unit; And

Characterized in that said reproducing unit is also specifically configured to reproduce said second sound image in accordance with a second sound channel information set acquired by said acquisition unit

.

With respect to the second and fourth possible implementations, in a fifth possible implementation, the first image corresponds to first image position information, the second image corresponds to second image position information, Wherein the first image position information corresponds to the first set of sound channel information and the second image position information corresponds to the second set of sound channel information,

The reproducing unit

A corresponding sound channel information set in accordance with the first sound channel information set and the second sound channel information set acquired by the channel unit, the sound channel information in the set of matching sound channel information The first sound channel information set and the second sound channel information set; And

Configured to reproduce the first sound image and the second sound image according to a predetermined rule and in accordance with the set of the coinciding sound channel information obtained by the coincidence channel subunit,

.

With respect to the second or fifth possible implementation, in a sixth possible implementation,

An acquisition subunit configured to acquire first sound image data and second sound image data, the first sound image data corresponding to the first sound image, and the second sound image data corresponding to the second sound image -; And

A mixing sub-unit configured to mix the first sound image data acquired by the acquisition sub-unit and the second sound image data to obtain the matching sound image data;

Further comprising:

Wherein the coincident playback sub-unit is arranged to generate the first sound image and the second sound image in accordance with the matching sound image data acquired by the mixing subunit and according to the matching sound channel information set acquired by the matching channel subunit. As shown in Fig.

With respect to any of the second and fourth to sixth possible implementations, in a seventh possible implementation,

A distinct channel sub-unit configured to obtain a first set of distinct sound channel information according to the first set of sound channel information and the second set of sound channel information, the at least one first sound channel information being associated with the first distinct sound channel Information set, wherein the at least one second sound channel information does not include any first distinct sound channel information in the first set of distinct sound channel information; And

And to reproduce the first sound image in accordance with the first distinct sound channel information set acquired by the distinct channel subunit.

.

In a eighth possible implementation, with respect to either the second aspect or the first through seventh possible implementations, the sound image reproducing apparatus further comprises at least one loudspeaker, wherein the at least one loudspeaker Each loudspeaker corresponding to one of the at least one sound channel;

The reproduction unit is configured to reproduce a sound image according to the sound channel information set acquired by the channel unit,

The playback unit being configured to drive the at least one loudspeaker to reproduce the sound image in accordance with the set of sound channel information acquired by the channel unit

.

According to the sound image reproduction method and apparatus provided in the embodiments of the present invention, image position information can be acquired, a sound channel information set can be acquired according to a predetermined rule and according to image position information, The sound image information can be reproduced according to the information set, the image position information is used to indicate a spatial position in the first frame picture of the image corresponding to the image position information, and the sound channel information set includes at least one sound channel information The sound channel information corresponding to one sound channel, and the sound image corresponding to the image. Such a solution is simple and does not require a complicated mechanical structure and technical solution, and a set of sound channel information can be acquired in such a manner as to acquire image position information, so that the sound image can be reproduced in a general sound channel manner, Thus, the original stereo effects of any number of sound images corresponding to the images can be reproduced without requiring audio information to carry sound image position information. Such a solution can be used to reproduce any audio and video file, and thus the present invention is advantageous for enhancing the technology.

BRIEF DESCRIPTION OF THE DRAWINGS In order to more clearly describe the technical solutions in embodiments of the present invention or in the prior art, the accompanying drawings, which are needed to illustrate the embodiments or the prior art, are briefly introduced below. Obviously, the appended drawings in the following description merely illustrate some embodiments of the invention, and one of ordinary skill in the art can derive other drawings from these attached drawings without undue effort.
1 is a schematic flowchart of a sound image reproducing method according to an embodiment of the present invention.
2 is a schematic flowchart of a sound image reproducing method according to another embodiment of the present invention.
3 is a schematic explanatory diagram of a sound image reproducing method according to another embodiment of the present invention.
4 is a schematic structural view of a sound image reproducing apparatus according to an embodiment of the present invention.
5 is a schematic structural view of another sound image reproducing apparatus according to an embodiment of the present invention.
6 is a schematic structural view of another sound image reproducing apparatus according to an embodiment of the present invention.
7 is a schematic structural view of another sound image reproducing apparatus according to an embodiment of the present invention.
8 is a schematic structural view of another sound image reproducing apparatus according to an embodiment of the present invention.
9 is a schematic structural view of a sound image reproducing apparatus according to another embodiment of the present invention.

In the following, technical solutions in embodiments of the present invention are clearly and completely described with reference to the accompanying drawings in embodiments of the present invention. Obviously, the described embodiments are not all embodiments of the invention, but only some of them. All other embodiments, which are obtained by one of ordinary skill in the art based on the embodiments of the present invention without undue effort, should fall within the scope of protection of the present invention.

To clearly illustrate the technical solutions in the embodiments of the present invention, in embodiments of the present invention, the same items or similar items whose functions and roles are basically the same are referred to as "first" Are distinguished using words. It will be appreciated by those skilled in the art that such terms as "first" and "second"

The specific meanings of images, sound images, audio and pictures used in embodiments of the present invention may be as follows: 1. An image is an image of an object, such as an image of a person, an image of an animal, ; 2. The sound image is a sound that includes a stereo effect, and the effect reflected by such sound may be regarded as a "sound picture "; 3. Audio is a specialized name for sounds, similar to video in the multimedia field, and carries sound data on a frame-by-frame basis; 4. A picture is a color representation with a manually set fixed boundary in the present invention, and may be a frame of a video picture in a video file.

One embodiment of the present invention provides a sound image reproduction method which can be used in the field of multimedia and specifically can be used for sound image reproduction. Referring to Figure 1, the method may include the following steps.

101: Obtain image position information.

The image position information may correspond to an image of one of the at least one image and the image position information may be used to indicate a spatial position within the first frame picture of the image corresponding to the image position information.

Specifically, the image position information may be obtained through identification from a picture to be processed, or may be obtained from stored image position information, and the acquired image position information may belong to a plurality of images.

102: Acquires a sound channel information set according to preset rules and according to image position information.

Optionally, the method may further comprise the steps of:

103: Plays the sound image according to the sound channel information set.

The set of sound channel information may include at least one sound channel information, wherein each sound channel information in at least one sound channel information corresponds to one sound channel in at least one sound channel, And the sound image corresponds to the image.

Specifically, when this embodiment of the invention is applied to a device, a device to which the method provided in this embodiment is applied can reproduce a corresponding sound image according to a set of sound channel information, and control the reproduction of at least one sound image A set of sound channel information may be transmitted to a peripheral device that specifically reproduces the sound image, in order to acquire and transmit at least one set of sound channel information for the sound image information set.

The advantage of this is that there is no need for audio information to carry sound image location information. As can be seen from the above, there is no general standard for audio information to carry sound image location information. In addition, the stereo effect of the sound image can be reproduced in conjunction with the currently very mature sound channel technique, in accordance with the acquired sound channel information, without requiring a complicated structure and technical solution.

According to the sound image reproduction method provided in this embodiment of the present invention, the image position information can be acquired, and a sound channel information set is acquired according to a predetermined rule and according to the image position information, A sound image may be reproduced, a sound image may be reproduced according to a set of sound channel information, and the image position information may be used to indicate a spatial position in the first frame picture of the image corresponding to the image position information , The sound channel information set may include at least one sound channel information, the sound channel information corresponds to one sound channel, and the sound image corresponds to the image. Such a solution is simple and does not require a complicated mechanical structure and technical solution, and a set of sound channel information can be acquired in such a manner as to acquire image position information, so that the sound image can be reproduced in a general sound channel manner, Thus, the original stereo effects of any number of sound images corresponding to the images can be reproduced without requiring audio information to carry sound image position information. Such a solution can be used to reproduce any audio and video file, and thus the present invention is advantageous for enhancing the technology.

Based on the sound image reproduction method provided in the previous embodiment of the present invention, this embodiment of the present invention provides a sound image reproduction method that can be used in the field of multimedia and specifically can be used for sound image reproduction. Referring to Figure 2, the method may include the following steps.

201: First frame picture data of the first frame picture is acquired.

The first frame picture may be any frame of the video picture in the audio and video file to be processed.

202: Identifies the image position information from the first frame picture according to the first frame picture data.

Specifically, the method may be as follows: acquiring at least one image feature information, wherein each image feature information in at least one image feature information corresponds to an image in one of the at least one image, The at least one image may further comprise a second image, and the image position information may be acquired according to the first frame picture data and the at least one image feature information.

This step is one of the specific implementations of "acquiring image location information ".

The image position information corresponds to an image of one of the at least one image and the image position information can be used to indicate a spatial position within the first frame picture of the image corresponding to the image position information, The first image may include at least two images including the first image and the second image, wherein the first image corresponds to the first image position information, and the second image corresponds to the second image position information.

Specifically, referring to FIG. 3, there are, for example, a display screen (which is a shaded portion), images in the screen (cats and upper right mice) and loudspeakers surrounding the screen. The process of implementing step 202 may be done in the following manner.

By way of example, assume that the lower left image in the drawing is the first image and the upper right image is the second image.

Image position information of at least one image is identified using an image pattern recognition technique. Currently, there are many types of image pattern recognition technology in the industry, and general ones include color visual characteristics and color similarity measurement, image detection techniques based on impulse noise detection, and images based on BP (Back Propagation) Fuzzy classification technology. These image pattern recognition techniques may all be used to identify at least one image with at least one image feature information to obtain at least one image position information.

By using the image pattern recognition technique, the positions of a plurality of image blocks in the current picture can be automatically identified in real time for simplified processing, in which case, each image position information in at least one image position information is, for example, (X0, Y0) denote the upper left coordinates and (X1, Y1) denote the lower right coordinates. The coordinate values corresponding to X0, Y0, X1 and Y1 may be pixel coordinate values in the first frame picture or may be set flexibly, for example the coordinate values may be set according to corresponding loudspeakers, The coordinate value corresponds to the pixel coordinate value range.

As shown in the figure, the first image position information (X0, Y0, X1, Y1) of the first image and the second image position information (X0, Y0, X1, Y1) of the second image are displayed.

Obviously, the spatial location of the image within the first frame picture may be represented by using the image location information in other ways.

Optionally, after the image position information has been identified, in order to improve the processing performance, only the change of the position movement is used, so that the characteristics of the same image block in a plurality of consecutive frames of pictures are slightly changed, Information can be quickly identified using motion image detection techniques. There are also many types of mature implementation solutions for motion image detection techniques, the most common ones being motion image detection based on motion image detection and background modeling techniques based on frame difference methods.

The advantage of this is that image position information corresponding to each identified image can be obtained, which is advantageous for subsequent reproduction of the stereo effect of the sound image corresponding to the image.

After the image position information is acquired at this stage:

203: Acquires a sound channel information set according to the image position information.

The set of sound channel information may include at least one sound channel information, wherein each sound channel information in at least one sound channel information corresponds to one sound channel in at least one sound channel, And the sound image corresponds to the image.

When this embodiment of the present invention is applied to an apparatus, an apparatus to which the method provided in this embodiment is applied may reproduce a corresponding sound image according to a set of sound channel information, To acquire and transmit a set of sound channel information, a set of sound channel information may be transmitted to a peripheral device that specifically reproduces the sound image.

The advantage of this is that the stereo effect of the sound image can be reproduced in conjunction with the currently very mature sound channel technology, in accordance with the acquired sound channel information, without requiring a complicated structure and technical solution.

The first image corresponding to the first sound image, the second image corresponding to the second sound image, the first image corresponding to the first image position information, the second image corresponding to the second image position information, The first image position information corresponds to a first set of sound channel information and the second image position information corresponds to a second set of sound channel information.

For a specific implementation, FIG. 3 may be referred to.

By way of example, the space that the first sound image should correspond to can be obtained according to the first image position information (X0, Y0, X1, Y1) of the first image obtained from the first frame picture, The sound channel corresponding to the loudspeaker unit may be calculated to control the loudspeaker to produce sound.

In this case, the coordinates corresponding to the loudspeakers (0-N) above and below the screen can be used as horizontal coordinates for the reference, and the coordinates corresponding to the left and right loudspeakers 0-M are used as the vertical coordinates for the reference And the space (X0, Y0, X1, Y1) indicated by the first image position information is shown in Fig. 3; Thus, in order to reproduce the stereo effect of the first sound image, the loudspeakers which are located to the left and right of the screen and correspond to the positions (X0-X1) may need to produce sound, Loudspeakers may also need to produce sound.

Thus, in this case, a first set of sound channel information is generated according to the first image position information, the first set of sound channel information includes at least one first sound channel information, and the set of at least one first sound channel information Each of the first sound channel information corresponds to one sound channel individually, and those sound channels corresponding to the first sound channel information correspond to the loudspeakers which must generate a sound.

The above description is only a solution for calculating a set of sound channel information, and in particular the corresponding computation relations between the image position information and the sound channel, sound channel information and sound channel information set are advantageous for achieving a stereo meeting environmental requirements Adjusted according to the actual case, the stereo effect of the sound image can be reproduced.

204: First frame audio data of the first frame audio is acquired.

The first frame audio corresponds to the first frame picture.

205: Identifies the sound image data of the sound image from the first frame audio data.

Specifically, the method may comprise: obtaining at least one sound image feature information, wherein each sound image feature information in at least one sound image feature information corresponds to a sound image in one of the at least one sound image; , The first frame audio data and the at least one sound image feature information, and each of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the at least one of the plurality Corresponding to the sound image feature information.

Specifically, a particular type of sound generating sound image can be identified through sound image feature identification: for example, the sound image is identified using mathematical text recognition technology. A matching relationship between the sound image and the image can then be obtained by matching the identified sound image type with a particular picture type of the corresponding corresponding image identified using the image feature, or the matching relationship between the two may be pre-set And each image feature information in at least one image feature information is set to correspond one-to-one to each image feature information in at least one sound image feature information.

Steps 204 and 205 may be considered as a specific implementation of step A01 below.

A01: Sound image data of a sound image is obtained.

Wherein each sound image data in at least one sound image data corresponds to one sound image in at least one sound image.

Specifically, when the sound image data is not distinguished in advance from the audio information, steps 204 and 205 may be performed, or if at least one sound image data has been previously distinguished, step A01 may be performed immediately.

Here, there is a sequence from step 201 to step 203, and there is a sequence for step 204 and step 205, but there is no sequence between two step groups, step 201 to step 203 and step 204 and step 205 It should be noted.

206: Sound The sound image is reproduced according to the image data and according to the sound channel information set.

On the other hand, when the method provided in this embodiment of the present invention is applied to a device or apparatus, the device and apparatus to which the method is applied can acquire, store, parse and decode the sound image data to reproduce the sound image , In which case the previous steps are performed.

On the other hand, the specific sound image data corresponding to each sound image in at least one sound image can be stored, parsed and reproduced using a peripheral device, and reproducing the sound image according to the sound channel information set It is only necessary to control the peripheral device to reproduce the sound image corresponding to the image in accordance with the at least one sound channel information.

In this case, as an option, step B01 may be performed immediately, without performing the previous steps 204 to 206.

B01: Sound image is reproduced according to the sound channel information set.

Specifically, in this embodiment of the present invention, certain implementations of the previous phase of "playing a sound image according to a set of sound channel information " may include the following various ways, implementations may exist independently, It may coexist.

The first implementation is as follows.

The at least one image may comprise a first image, the first image position information may comprise first image position information, the at least one sound image may comprise a first sound image, The set of sound channel information may comprise a first set of sound channel information, the first set of sound channel information may comprise at least one first sound channel information, the first image comprises first image position information, A sound image and a first set of sound channel information.

In this case, reproducing the sound image according to the sound channel information set may specifically include the following step C01.

C01: The first sound image is reproduced in accordance with the first sound channel information set.

Specifically, referring to the previous steps in this embodiment of the present invention, it will be appreciated that this step specifically includes playing the first sound image in accordance with the first set of sound channel information and in accordance with the first sound image data Lt; / RTI >

The first sound image data is contained in at least one sound image data, and the first sound image data corresponds to the first sound image.

The second embodiment can coexist with the first embodiment.

The at least one image may further comprise a second image, the first image position information may further comprise second image position information, and the at least one sound image may further comprise a second sound image, The at least one sound channel information set may further comprise a second set of sound channel information, the second set of sound channel information may include at least one second sound channel information, Information, a second sound image, and a second sound channel information set.

In this case, the step of reproducing the sound image according to the sound channel information set may further include the following step C02.

C02: The second sound image is reproduced in accordance with the second sound channel information set.

Specifically, referring to the previous steps in this embodiment of the present invention, it will be appreciated that this step specifically includes playing the second sound image in accordance with the second set of sound channel information and in accordance with the second sound image data Lt; / RTI >

The second sound image data is contained in at least one sound image data, and the second sound image data corresponds to the second sound image.

As can be seen from the above, both the first embodiment and the second embodiment in this embodiment of the present invention can be applied to the reproduction of a single sound image, and at the time of combining, Simultaneous playback can be implemented. This embodiment of the present invention is only one example of this method, and actually the first and second are not fixed. Through the combination of the first and second implementations in this embodiment of the present invention, the method can be enabled to implement simultaneous playback of any number of sound images.

Third Implementation: This implementation is formed based on the combination of the previous first and second implementations in this embodiment.

In this case, the step of reproducing the sound image according to the sound channel information set may further include the following steps C031 and C032.

C031: acquires the matching sound channel information set according to the first sound channel information set and the second sound channel information set.

The sound channel information in the set of matching sound channel information is included in both the first sound channel information set and the second sound channel information set.

C032: reproduces the first sound image and the second sound image according to a predetermined rule and according to the matching sound channel information set.

Specifically, referring to the previous steps in this embodiment of the present invention, it will be understood that this step may be performed in accordance with a predetermined set of rules, in accordance with a set of matching sound channel information, And reproducing the first sound image and the second sound image according to the image data.

In particular, the third implementation may be applied when the first set of sound channel information and the second set of sound channel information include at least one identical sound channel information.

In the case of the third embodiment, also before step C032, the method comprises the following steps:

Acquiring first sound image data and second sound image data, wherein the first sound image data corresponds to a first sound image and the second sound image data corresponds to a second sound image; And mixing the first sound image data and the second sound image data to obtain corresponding sound image data. In this case, the implementation of step C032 may specifically include reproducing the first sound image and the second sound image according to the matching sound image data and according to the matching sound channel information set.

In this case, optionally, the implementation of step C032 reproduces half of the first sound image and half of the second sound image in the sound channel corresponding to the set of matching sound channel information; Any sound channel corresponding to each of the matching sound channel information in the matching sound channel information set may further comprise not reproducing the first sound image and the second sound image.

Here, for a sound image having no corresponding image as an example, when the image position information is not detected, the sound image may be generated as a background sound, or the image position information corresponding to the sound image may be generated as the final sound And can be acquired according to the position of generation.

Prior to reproducing the first sound image according to the first set of sound channel information, in the case of combined implementations of the various prior implementations and implementations, the following steps are performed: the first set of sound channel information and the second set of sound channel information And the sound channel information in the first distinct sound channel information set is included in the first sound channel information set, but in the second sound channel information set, Not included; In this case, the step of reproducing the first sound image according to the first set of sound channel information may specifically include the step of reproducing the first sound image according to the first set of distinct sound channel information.

3, a circle in the figure indicates a loudspeaker, the method can be applied to a sound image reproducing apparatus, and the sound image reproducing apparatus can include at least one loudspeaker, and the at least one loudspeaker Each loudspeaker corresponding to one of the at least one sound channel; In this case, the step of reproducing the sound image in accordance with the set of sound channel information may include, in accordance with the set of sound channel information, driving at least one loudspeaker to reproduce the sound image.

Obviously, the method can also be applied to a sound image reproducing apparatus that is coupled to a loudspeaker with another structure. This method can be combined with existing sound channel techniques to implement playback of a sound image and thus has a comprehensive applicability.

Specifically, the audio data input from the playback source may be sent to the corresponding power amplifier using the I2S (Inter-IC Sound) bus to drive the loudspeaker to produce the sound. The loudspeaker array formed by the at least one loudspeaker may use a common directional loudspeaker to generate sound towards the direct front of the screen and thus improve the listening position accuracy / capability of the audience. Conventional loudspeakers can also be used. A digital power amplifier configured to receive multiple 2S signals can drive the loudspeaker.

In a practical application, the sound image reproduction device may be a television set, a large screen, or the like, or may be another audio and video sound image reproduction device, and thus may be combined with the sound image reproduction method provided in this embodiment of the present invention, The loudspeaker array including at least one loudspeaker can effectively reproduce the original stereo effect of the sound image.

According to the sound image reproducing method provided in this embodiment of the present invention, the image position information can be acquired from the first frame picture according to at least one image feature information, and according to a predetermined rule and according to the image position information, A set of channel information may be obtained so that data for reproducing the stereo effect of the sound image is identified from any audio and video file without requiring the audio information to carry the sound image position information, At least one sound image data is obtained from the first frame audio corresponding to the first frame picture in accordance with the at least one sound image feature information and is reproduced according to the sound channel information set And at least one Depending on the sound image data can be reproduced sound image. Thus, such a solution is simple and does not require a complicated mechanical structure and technical solution. In this solution, the sound image can be reproduced in a general sound channel manner, and the present invention is advantageous for the technical advancement.

Referring to FIG. 4, an embodiment of the present invention provides a sound image reproducing apparatus, which can be applied to a multimedia field, and in particular can be used in combination with the sound image reproducing method provided in the previous embodiment of the present invention, The following contents, namely

An acquisition unit (401) configured to acquire image position information, wherein the image position information corresponds to an image of one of the at least one image, the image position information includes a space Used to indicate location -; And

A channel unit (402) configured to obtain a set of sound channel information according to image position information acquired by the acquisition unit (401), the set of sound channel information including at least one sound channel information, Each sound channel information corresponding to one of the at least one sound channel, the sound channel information set corresponding to the image position information,

.

Optionally, referring to Figure 5, the sound image reproduction apparatus

A playback unit (403) configured to play a sound image in accordance with a set of sound channel information acquired by the channel unit (402), the sound image corresponding to an image -

.

Optionally, acquisition unit 401 is further configured to acquire first frame picture data of a first frame picture;

The acquisition unit 401 is configured to acquire the image position information, specifically,

And the acquisition unit 401 is configured to identify the image position information from the first frame picture in accordance with the first frame picture data acquired by the acquisition unit 401. [

Optionally, acquisition unit 401 is further configured to acquire sound image data of the sound image,

The reproduction unit 403 is configured to reproduce a sound image according to a set of sound channel information acquired by the channel unit 402,

And the playback unit 403 is configured to play the sound image according to the sound image data acquired by the acquisition unit 401 and according to the sound channel information set.

Further, optionally, the acquisition unit 401 is further configured to acquire first frame audio data of the first frame audio, wherein the first frame audio corresponds to the first frame picture,

The acquisition unit 401 is further configured to acquire the sound image data of the sound image,

And the acquisition unit (401) is configured to identify the sound image data of the sound image from the first frame audio data acquired by the acquisition unit (401).

Also optionally, the first frame picture comprises at least two images, at least two images comprise a first image and a second image, the first image corresponds to a first sound image, and the second image Corresponds to a second sound image,

The reproduction unit 403 is configured to reproduce the sound image according to the sound channel information set acquired by the acquisition unit 401,

The reproducing unit 403 is specifically configured to reproduce the first sound image in accordance with the first sound channel information set obtained by the obtaining unit 401; And

Reproducing unit 403 is also specifically configured to reproduce the second sound image in accordance with the second sound channel information set acquired by the acquisition unit 401

.

Also optionally, the first image corresponds to the first image position information, the second image corresponds to the second image position information, the first image position information corresponds to the first set of sound channel information, The image location information corresponds to the second set of sound channel information.

Referring to Fig. 6, referring to Fig. 5, the reproduction unit 403

- a coincidence channel subunit (4031) configured to acquire a coincident sound channel information set according to a first set of sound channel information and a second set of sound channel information acquired by the channel unit (402) Information is contained in both the first sound channel information set and the second sound channel information set; And

A matching reproduction subunit 4032 configured to reproduce a first sound image and a second sound image according to a predetermined rule and in accordance with the matching sound channel information set obtained by the matching channel subunit 4031,

.

6, referring to Fig. 7, the playback unit 403 may include

(4033) configured to acquire first sound image data and second sound image data, the first sound image data corresponding to a first sound image and the second sound image data corresponding to a second sound image, ; And

A mixing subunit 4034 configured to mix the first sound image data and the second sound image data acquired by the acquisition subunit 4033 to obtain the matching sound image data,

Further comprising:

The coincidence playback subunit 4032 is adapted to receive the first sound image and the second sound image according to the coinciding sound image data acquired by the mixing subunit 4034 and according to the coincident sound channel information set obtained by the coincidence channel subunit 4031. [ And is specifically configured to reproduce a sound image.

Optionally, referring to Figure 5, based on Figure 5, the playback unit 403

At least one first sound channel information configured to obtain a first distinct sound channel information set in accordance with a first sound channel information set and a second sound channel information set, Wherein the at least one second sound channel information does not include any first distinct sound channel information in the first set of distinct sound channel information; And

A distinct reproduction subunit 4036 configured to reproduce a first sound image in accordance with a first distinct sound channel information set obtained by the distinct channel subunit 4035,

.

Optionally, the sound image reproduction apparatus further comprises at least one loudspeaker, wherein each loudspeaker in the at least one loudspeaker corresponds to one of the at least one sound channel;

The reproduction unit 403 is configured to reproduce a sound image according to a set of sound channel information acquired by the channel unit 402,

Playback unit 403 is configured to drive at least one loudspeaker to reproduce a sound image according to a set of sound channel information acquired by channel unit 402

.

According to the sound image reproducing apparatus provided in this embodiment of the present invention, image position information can be acquired, and a sound channel information set is acquired according to a predetermined rule and according to image position information, The sound image information may be reproduced and the image position information may be used to indicate a spatial position within the first frame picture of the image corresponding to the image position information and the sound channel information set includes at least one sound channel information The sound channel information corresponds to one sound channel, and the sound image corresponds to the image. Such a solution is simple and does not require a complicated mechanical structure and technical solution, and a set of sound channel information can be acquired in such a manner as to acquire image position information, so that the sound image can be reproduced in a general sound channel manner, Thus, the original stereo effects of any number of sound images corresponding to the images can be reproduced without requiring audio information to carry sound image position information. Such a solution can be used to reproduce any audio and video file, and thus the present invention is advantageous for enhancing the technology.

An embodiment of the present invention provides a sound image reproducing apparatus, which can be applied to the multimedia field, and in particular can be used in combination with the sound image reproducing method provided in the previous embodiment of the present invention. 9, the sound image reproducing apparatus may be a microcomputer, for example, a general purpose computer, a personal computer, and a portable device such as a mobile phone terminal or a tablet computer, At least one data interface 9011, a processor 9012 and a memory 9013 may comprise at least one data interface 9011, a processor 9012, a memory 9013 and a bus 9014, (9014).

Bus 9014 may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus 9014 may be classified as an address bus, a data bus, a control bus, and the like, and is indicated using only one bold line in FIG. 9 for ease of illustration, Does not indicate that it is present,

The memory 9013 can be configured to store executable program code, and the program code can include computer instructions, and the memory 9013 can include a high-speed RAM memory and can be stored in a non-volatile memory ), For example at least one magnetic disk memory.

The processor 9012 may be a central processing unit (CPU) or an application specific integrated circuit (ASIC), or may be implemented as one or more integrated circuits implementing the embodiments of the present invention Lt; / RTI >

The data interface 9011 is configured to obtain image position information, wherein the image position information corresponds to an image of one of the at least one image, and the image position information corresponds to an image corresponding to the image position information within the first frame picture It is used to indicate the spatial location.

The processor 9012 is configured to obtain a set of sound channel information according to image position information obtained by the data interface 9011, the set of sound channel information including at least one sound channel information, Each of the sound channel information corresponds to one of the at least one sound channel, and the sound channel information set corresponds to the image position information.

Optionally, the processor 9012 is further configured to play a sound image in accordance with a set of sound channel information obtained by the processor 9012, the sound image corresponding to the image.

Optionally, data interface 9011 is further configured to obtain first frame picture data of a first frame picture,

The data interface 9011 is configured to acquire the image position information,

Data interface 9011 is configured to identify the image position information from the first frame picture in accordance with the first frame picture data obtained by the data interface 9011. [

Optionally, the data interface 9011 is further configured to obtain sound image data of the sound image,

The processor 9012 is configured to reproduce a sound image according to a set of sound channel information acquired by the processor 9012,

The processor 9012 is configured to play the sound image according to the sound image data acquired by the data interface 9011 and according to the sound channel information set.

In addition, optionally, the data interface 9011 is further configured to obtain first frame audio data of the first frame audio, wherein the first frame audio corresponds to the first frame picture,

The data interface 9011 is further configured to acquire sound image data of the sound image,

The data interface 9011 is configured to identify the sound image data of the sound image from the first frame audio data acquired by the data interface 9011. [

Also optionally, the first frame picture comprises at least two images, at least two images comprise a first image and a second image, the first image corresponds to a first sound image, and the second image Corresponds to a second sound image,

The processor 9012 is configured to reproduce a sound image according to the sound channel information set acquired by the data interface 9011,

The processor 9012 is specifically configured to reproduce a first sound image in accordance with a first set of sound channel information obtained by the data interface 9011; And

The processor 9012 is also specifically configured to reproduce the second sound image in accordance with the second set of sound channel information acquired by the data interface 9011

.

Also optionally, the first image corresponds to the first image position information, the second image corresponds to the second image position information, the first image position information corresponds to the first set of sound channel information, The image position information corresponds to a second set of sound channel information,

The processor 9012 is further configured to obtain a set of matching sound channel information according to a first set of sound channel information and a second set of sound channel information obtained by the processor 9012 and the sound channel information in the set of matched sound channel information The first sound channel information set and the second sound channel information set,

The processor 9012 is further configured to play the first sound image and the second sound image according to a predetermined rule and according to a set of matching sound channel information obtained by the processor 9012. [

Further, optionally, the processor 9012 is further configured to obtain first sound image data and second sound image data, wherein the first sound image data corresponds to a first sound image, 2 < / RTI > sound image,

The processor 9012 is further configured to mix the first sound image data and the second sound image data acquired by the processor 9012 to obtain the corresponding sound image data,

The processor 9012 is specifically adapted to reproduce the first sound image and the second sound image in accordance with the matching sound image data acquired by the processor 9012 and in accordance with the matching sound channel information set acquired by the processor 9012 Lt; / RTI >

Optionally, the processor 9012 is further configured to obtain a first set of distinct sound channel information according to a first set of sound channel information and a second set of sound channel information, wherein the at least one first sound channel information comprises a first distinct Wherein the at least one second sound channel information comprises no sound channel information set in the first set of distinct sound channel information,

Processor 9012 is further configured to reproduce a first sound image in accordance with a first set of distinct sound channel information obtained by processor 9012. [

Optionally, the sound image reproduction apparatus further comprises at least one loudspeaker, wherein each loudspeaker in the at least one loudspeaker corresponds to one of the at least one sound channel;

The processor 9012 is configured to reproduce a sound image according to a set of sound channel information acquired by the processor 9012,

Processor 9012 is configured to drive at least one loudspeaker to reproduce a sound image in accordance with the set of sound channel information acquired by processor 9012. [

According to the sound image reproducing apparatus provided in this embodiment of the present invention, image position information can be acquired, and a sound channel information set is acquired according to a predetermined rule and according to image position information, The sound image information may be reproduced and the image position information may be used to indicate a spatial position within the first frame picture of the image corresponding to the image position information and the sound channel information set includes at least one sound channel information The sound channel information corresponds to one sound channel, and the sound image corresponds to the image. Such a solution is simple and does not require a complicated mechanical structure and technical solution, and a set of sound channel information can be acquired in such a manner as to acquire image position information, so that the sound image can be reproduced in a general sound channel manner, Thus, the original stereo effects of any number of sound images corresponding to the images can be reproduced without requiring audio information to carry sound image position information. Such a solution can be used to reproduce any audio and video file, and thus the present invention is advantageous for enhancing the technology.

With the foregoing description of the embodiments, it will be apparent to those skilled in the art that the present invention may be implemented in hardware, firmware, or a combination thereof. When the present invention is implemented in software, the above-described functions may be stored in a computer-readable medium or transmitted as one or more instructions or code in a computer-readable medium. Computer readable media can include computer storage media and communication media, which may include any medium that enables a computer program to be transferred from one place to another. The storage medium may be any available media that can access the computer. Examples of computer-readable media include, but are not limited to, random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM) A compact disk read-only memory (CD-ROM), or other optical disk storage, disk storage, or other disk storage, or to store or transport expected program code in the form of a command or data structure And any other medium which can be accessed by a computer. Also, any connection may be properly defined as a computer readable medium. For example, if the software is a web site, a server, or a network using wireless technologies such as coaxial cable, fiber optic / cable, twisted pair, DSL (digital subscriber line), or infrared, wireless and microwave When transmitted from other remote sources, wireless technologies such as coaxial cable, fiber optic / cable, twisted pair, DSL, or infrared, radio, and microwave are included in a fixation of a medium to which they belong. For example, the disc and disc used in the present invention may be a CD (Compact Disc), a laser disc, an optical disc, a DVD (Digital Versatile Disc), a floppy disc, Ray disc in which a disc typically copies data by magnetic means and a disc optically copies the data by laser means. The combinations described above should also be included within the scope of protection of computer readable media.

The foregoing description is only specific embodiments of the present invention, but is not intended to limit the scope of protection of the present invention. Any alteration or substitution that may readily be devised by those skilled in the art within the scope of the invention disclosed herein may be within the scope of protection of the present invention. Accordingly, the scope of protection of the present invention will be dependent on the scope of protection of the claims.

Claims (18)

Wherein the image position information corresponds to an image of one of the at least one image and the image position information includes a spatial position within the first frame picture of the image corresponding to the image position information Used to direct -;
Acquiring a set of sound channel information according to the image position information, wherein the set of sound channel information includes at least one sound channel information, and each sound channel information in the at least one sound channel information includes at least one sound channel The sound channel information set corresponding to one of the sound channel information, the sound channel information set corresponding to the image position information; And
Reproducing a sound image in accordance with the sound channel information set, the sound image corresponding to the image,
/ RTI >
The method according to claim 1,
Before the step of acquiring the image position information,
Acquiring first frame picture data of the first frame picture
Further comprising:
The step of acquiring the image position information may be concretely
Identifying the image position information from the first frame picture according to the first frame picture data
/ RTI >
3. The method according to claim 1 or 2,
Before playing the sound image according to the set of sound channel information,
Acquiring sound image data of the sound image
Further comprising:
The step of reproducing the sound image according to the set of sound channel information may be specifically
Reproducing the sound image according to the sound image data and according to the sound channel information set
/ RTI >
The method of claim 3,
Before the step of acquiring sound image data of the sound image,
Obtaining first frame audio data of a first frame audio, wherein the first frame audio corresponds to the first frame picture;
Further comprising:
The step of acquiring the sound image data of the sound image may be concretely
Identifying the sound image data of the sound image from the first frame audio data
/ RTI >
The method according to claim 3 or 4,
Wherein the first frame picture comprises at least two images, the at least two images comprise a first image and a second image, the first image corresponding to a first sound image, 2 < / RTI > sound image,
The step of reproducing the sound image according to the set of sound channel information may be specifically
Reproducing the first sound image according to the first sound channel information set; And
Reproducing the second sound image according to the second sound channel information set
/ RTI >
6. The method of claim 5,
Wherein the first image corresponds to first image position information, the second image corresponds to second image position information, the first image position information corresponds to the first set of sound channel information, The position information corresponding to the second set of sound channel information,
The step of reproducing the sound image according to the set of sound channel information may be specifically
Obtaining a set of consonant sound channel information according to the first set of sound channel information and the second set of sound channel information, wherein the sound channel information in the set of consonant sound channel information includes a set of the first sound channel information, Contained in both sets of channel information; And
Reproducing the first sound image and the second sound image according to a predetermined rule and in accordance with the set of matching sound channel information
/ RTI >
The method according to claim 6,
Prior to playing the first sound image and the second sound image according to a preset rule and according to the set of matching sound channel information,
Obtaining first sound image data and second sound image data, the first sound image data corresponding to the first sound image and the second sound image data corresponding to the second sound image; And
Mixing the first sound image data and the second sound image data to obtain corresponding sound image data
Further comprising:
The step of reproducing the first sound image and the second sound image according to a predetermined rule and in accordance with the set of matching sound channel information may be concretely
Reproducing the first sound image and the second sound image in accordance with the matching sound image data and according to the matching sound channel information set
/ RTI >
8. The method according to any one of claims 5 to 7,
Before the step of reproducing the first sound image according to the first set of sound channel information,
Obtaining a first differentiating sound channel information set according to the first sound channel information set and the second sound channel information set, wherein the sound channel information in the first distinct sound channel information set includes a first sound channel information set, Information set, but not in the second sound channel information set,
Further comprising:
Wherein the step of reproducing the first sound image according to the first set of sound channel information includes:
Reproducing the first sound image according to the first distinct sound channel information set
/ RTI >
9. The method according to any one of claims 1 to 8,
The method is applied to a sound image reproducing apparatus, wherein the sound image reproducing apparatus includes at least one loudspeaker, each loudspeaker of the at least one loudspeaker corresponding to a sound channel of one of the at least one sound channel;
The step of reproducing the sound image according to the set of sound channel information may be specifically
Driving the at least one loudspeaker to reproduce the sound image according to the set of sound channel information
/ RTI >
An acquisition unit configured to acquire image position information, the image position information corresponding to an image of one of the at least one image, the image position information corresponding to the image position information within the first frame picture Used to indicate spatial location -;
A channel unit configured to acquire a set of sound channel information according to the image position information acquired by the acquisition unit, the set of sound channel information including at least one sound channel information, each of the at least one sound channel information Wherein the sound channel information of the at least one sound channel corresponds to one of the at least one sound channel and the sound channel information set corresponds to the image position information; And
A playback unit configured to play a sound image in accordance with the set of sound channel information acquired by the channel unit, the sound image corresponding to the image,
And a sound image reproducing apparatus.
11. The method of claim 10,
Wherein the acquisition unit is further configured to acquire first frame picture data of the first frame picture,
The acquisition unit is configured to acquire the image position information. Specifically,
The acquisition unit is configured to identify the image position information from the first frame picture in accordance with the first frame picture data acquired by the acquisition
And a sound image reproducing apparatus.
The method according to claim 10 or 11,
Wherein the acquisition unit is further configured to acquire sound image data of the sound image,
The reproduction unit is configured to reproduce a sound image according to the sound channel information set acquired by the channel unit,
The reproducing unit being adapted to reproduce the sound image according to the sound image data acquired by the acquisition unit and in accordance with the sound channel information set
And a sound image reproducing apparatus.
13. The method of claim 12,
Wherein the acquisition unit is further configured to acquire first frame audio data of a first frame audio, the first frame audio corresponds to the first frame audio,
The acquisition unit is further configured to acquire sound image data of the sound image,
The acquisition unit is configured to identify the sound image data of the sound image from the first frame audio data acquired by the acquisition unit
And a sound image reproducing apparatus.
The method according to claim 12 or 13,
Wherein the first frame picture comprises at least two images, the at least two images comprise a first image and a second image, the first image corresponding to a first sound image, 2 < / RTI > sound image,
Wherein the reproducing unit is configured to reproduce a sound image according to the sound channel information set acquired by the acquiring unit,
The reproducing unit being specifically configured to reproduce the first sound image in accordance with the first sound channel information set acquired by the acquisition unit; And
Characterized in that said reproducing unit is also specifically configured to reproduce said second sound image in accordance with said second sound channel information set acquired by said acquisition unit
And a sound image reproducing apparatus.
15. The method of claim 14,
Wherein the first image corresponds to first image position information, the second image corresponds to second image position information, the first image position information corresponds to the first set of sound channel information, The position information corresponding to the second set of sound channel information,
The reproducing unit
A corresponding sound channel information set in accordance with the first sound channel information set and the second sound channel information set acquired by the channel unit, the sound channel information in the set of matching sound channel information The first sound channel information set and the second sound channel information set; And
Configured to reproduce the first sound image and the second sound image according to a predetermined rule and in accordance with the set of the coinciding sound channel information obtained by the coincidence channel subunit,
And a sound image reproducing apparatus.
16. The method of claim 15,
The reproducing unit
An acquisition subunit configured to acquire first sound image data and second sound image data, the first sound image data corresponding to the first sound image, and the second sound image data corresponding to the second sound image -; And
A mixing sub-unit configured to mix the first sound image data acquired by the acquisition sub-unit and the second sound image data to obtain the matching sound image data;
Further comprising:
Wherein the coincident playback sub-unit is arranged to generate the first sound image and the second sound image in accordance with the matching sound image data acquired by the mixing subunit and according to the matching sound channel information set acquired by the matching channel subunit. The sound image reproduction apparatus comprising:
17. The method according to any one of claims 14 to 16,
The reproducing unit
A distinct channel sub-unit configured to obtain a first set of distinct sound channel information according to the first set of sound channel information and the second set of sound channel information, the at least one first sound channel information being associated with the first distinct sound channel Information set, wherein the at least one second sound channel information does not include any first distinct sound channel information in the first set of distinct sound channel information; And
And to reproduce the first sound image in accordance with the first distinct sound channel information set acquired by the distinct channel subunit.
And a sound image reproducing apparatus.
18. The method according to any one of claims 10 to 17,
Wherein the sound image reproducing apparatus further comprises at least one loudspeaker, wherein each loudspeaker of the at least one loudspeaker corresponds to a sound channel of one of the at least one sound channel;
The reproduction unit is configured to reproduce a sound image according to the sound channel information set acquired by the channel unit,
The playback unit being configured to drive the at least one loudspeaker to reproduce the sound image in accordance with the set of sound channel information acquired by the channel unit
And a sound image reproducing apparatus.
KR1020167024888A 2014-08-29 2015-08-18 Sound image playing method and device KR20160119218A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201410438159.1A CN104270552A (en) 2014-08-29 2014-08-29 Sound image playing method and device
CN201410438159.1 2014-08-29
PCT/CN2015/087394 WO2016029806A1 (en) 2014-08-29 2015-08-18 Sound image playing method and device

Publications (1)

Publication Number Publication Date
KR20160119218A true KR20160119218A (en) 2016-10-12

Family

ID=52162038

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020167024888A KR20160119218A (en) 2014-08-29 2015-08-18 Sound image playing method and device

Country Status (4)

Country Link
US (1) US20160065791A1 (en)
KR (1) KR20160119218A (en)
CN (2) CN104270552A (en)
WO (1) WO2016029806A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20210158706A (en) * 2020-06-24 2021-12-31 현대자동차주식회사 Vehicle and control method for the same
US11553275B2 (en) 2018-12-28 2023-01-10 Samsung Display Co., Ltd. Method of providing sound that matches displayed image and display device using the method

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104270552A (en) * 2014-08-29 2015-01-07 华为技术有限公司 Sound image playing method and device
US11132545B2 (en) * 2016-07-30 2021-09-28 Huawei Technologies Co., Ltd. Image recognition method and terminal
CN109194999B (en) * 2018-09-07 2021-07-09 深圳创维-Rgb电子有限公司 Method, device, equipment and medium for realizing parity of sound and image
CN110554647A (en) * 2019-09-10 2019-12-10 广州安衡电子科技有限公司 processing method and system for synchronizing moving image and sound image
US11234090B2 (en) * 2020-01-06 2022-01-25 Facebook Technologies, Llc Using audio visual correspondence for sound source identification
US11087777B1 (en) 2020-02-11 2021-08-10 Facebook Technologies, Llc Audio visual correspondence based signal augmentation
CN113724628A (en) * 2020-05-25 2021-11-30 苏州佳世达电通有限公司 Audio-visual system
CN111741412B (en) * 2020-06-29 2022-07-26 京东方科技集团股份有限公司 Display device, sound emission control method, and sound emission control device
TWI787799B (en) * 2021-04-28 2022-12-21 宏正自動科技股份有限公司 Method and device for video and audio processing

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6829018B2 (en) * 2001-09-17 2004-12-07 Koninklijke Philips Electronics N.V. Three-dimensional sound creation assisted by visual information
JP4521671B2 (en) * 2002-11-20 2010-08-11 小野里 春彦 Video / audio playback method for outputting the sound from the display area of the sound source video
JP2007266967A (en) * 2006-03-28 2007-10-11 Yamaha Corp Sound image localizer and multichannel audio reproduction device
JP2007274061A (en) * 2006-03-30 2007-10-18 Yamaha Corp Sound image localizer and av system
JP4713396B2 (en) * 2006-05-09 2011-06-29 シャープ株式会社 Video / audio reproduction device and sound image moving method thereof
JP4946305B2 (en) * 2006-09-22 2012-06-06 ソニー株式会社 Sound reproduction system, sound reproduction apparatus, and sound reproduction method
JP5000989B2 (en) * 2006-11-22 2012-08-15 シャープ株式会社 Information processing apparatus, information processing method, and program
JP2010206265A (en) * 2009-02-27 2010-09-16 Toshiba Corp Device and method for controlling sound, data structure of stream, and stream generator
JP5197525B2 (en) * 2009-08-04 2013-05-15 シャープ株式会社 Stereoscopic image / stereoscopic sound recording / reproducing apparatus, system and method
JP5919201B2 (en) * 2010-03-23 2016-05-18 ドルビー ラボラトリーズ ライセンシング コーポレイション Technology to perceive sound localization
CN102209225B (en) * 2010-03-30 2013-04-17 华为终端有限公司 Method and device for realizing video communication
CN102421054A (en) * 2010-09-27 2012-04-18 夏普株式会社 Spatial audio frequency configuration method and device of multichannel display
US8854447B2 (en) * 2012-12-21 2014-10-07 United Video Properties, Inc. Systems and methods for automatically adjusting audio based on gaze point
CN104270552A (en) * 2014-08-29 2015-01-07 华为技术有限公司 Sound image playing method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11553275B2 (en) 2018-12-28 2023-01-10 Samsung Display Co., Ltd. Method of providing sound that matches displayed image and display device using the method
KR20210158706A (en) * 2020-06-24 2021-12-31 현대자동차주식회사 Vehicle and control method for the same

Also Published As

Publication number Publication date
CN104270552A (en) 2015-01-07
US20160065791A1 (en) 2016-03-03
CN106576132A (en) 2017-04-19
WO2016029806A1 (en) 2016-03-03

Similar Documents

Publication Publication Date Title
KR20160119218A (en) Sound image playing method and device
US9319821B2 (en) Method, an apparatus and a computer program for modification of a composite audio signal
US11055057B2 (en) Apparatus and associated methods in the field of virtual reality
CN105981368B (en) Picture composition and position guidance in an imaging device
US8644467B2 (en) Video conferencing system, method, and computer program storage device
US10798518B2 (en) Apparatus and associated methods
US20160198097A1 (en) System and method for inserting objects into an image or sequence of images
EP3236345A1 (en) An apparatus and associated methods
US20140233917A1 (en) Video analysis assisted generation of multi-channel audio data
CN105430512A (en) Method and device for displaying information on video image
CN103729120A (en) Method for generating thumbnail image and electronic device thereof
US9584761B2 (en) Videoconference terminal, secondary-stream data accessing method, and computer storage medium
CN109819316B (en) Method and device for processing face sticker in video, storage medium and electronic equipment
KR20210110852A (en) Image deformation control method, device and hardware device
CN112653902A (en) Speaker recognition method and device and electronic equipment
US20170094439A1 (en) Information processing method and electronic device
KR20220148915A (en) Audio processing methods, apparatus, readable media and electronic devices
US11342001B2 (en) Audio and video processing
CN108965746A (en) Image synthesizing method and system
CN108960130B (en) Intelligent video file processing method and device
CN114531564A (en) Processing method and electronic equipment
CN103959805A (en) Method and device for displaying image
US11503226B2 (en) Multi-camera device
EP3321795A1 (en) An apparatus and associated methods
EP3073747A1 (en) Method and device for adapting an audio level of a video

Legal Events

Date Code Title Description
A201 Request for examination