TWI636453B - Multimedia data processing device and method - Google Patents

Multimedia data processing device and method

Publication number
TWI636453B
Authority
TW
Taiwan
Prior art keywords
sound source
image
positional relationship
relative positional
multimedia data
Prior art date
Application number
TW106142618A
Other languages
Chinese (zh)
Other versions
TW201926317A (en)
Inventor
何其勳
郭俊彥
王蕙雯
李學文
辛怡德
Original Assignee
鴻海精密工業股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 鴻海精密工業股份有限公司
Priority to TW106142618A
Application granted
Publication of TWI636453B
Publication of TW201926317A

Landscapes

  • User Interface Of Digital Computer (AREA)
  • Stereophonic System (AREA)
  • Studio Devices (AREA)

Abstract

A multimedia data processing device and method. The device comprises an acquisition unit, an image capture device, an audio capture device, and a processing unit. The acquisition unit obtains the relative positional relationship between a sound source and the image capture device. The image capture device captures image data within a predetermined range according to that relative positional relationship, and the audio capture device captures the audio data emitted by the sound source within the predetermined range according to the same relationship. The processing unit associates the captured image data with the captured audio data. The invention also provides a multimedia data processing method. Obtaining the relative positional relationship between the sound source and the image capture device allows the sound source to be localized, so the resulting audio data carries a sense of direction, which greatly improves the user experience.

Description

Multimedia data processing device and method

The present invention relates to a multimedia data processing device and method.

When a user shoots video, sound and images are usually captured separately. Recording a video requires both image data and sound data, but existing approaches capture all sound without regard to direction. On playback, the audio therefore has no spatial quality, and the user experience suffers.

In view of the above, there is a need for a multimedia data processing device and method.

A multimedia data processing method comprises the steps of:

obtaining the relative positional relationship between a sound source and an image capture device;

capturing, according to the relative positional relationship, image data within a predetermined range and audio data emitted by the sound source; and

associating the captured image data within the predetermined range with the audio data.

Preferably, the step of obtaining the relative positional relationship between the sound source and the image capture device comprises:

receiving a positioning signal output from the location of the sound source; and

determining the relative positional relationship between the sound source and the image capture device from the received positioning signal.

Preferably, the step of obtaining the relative positional relationship between the sound source and the image capture device comprises:

measuring the angular momentum of the sound source's movement with a gyroscope; and

converting the angular momentum into orientation information for the sound source, so as to determine the relative positional relationship between the sound source and the image capture device.

Preferably, the step of obtaining the relative positional relationship between the sound source and the image capture device comprises:

measuring the displacement and/or acceleration of the image capture device with a linear accelerometer; and

converting the displacement and/or acceleration into orientation information for the image capture device, so as to determine the relative positional relationship between the sound source and the image capture device.

Preferably, the step of obtaining the relative positional relationship between the sound source and the image capture device comprises:

measuring the angular momentum and the displacement and/or acceleration of the image capture device with a gyroscope and a linear accelerometer, and converting these measurements into first orientation information;

measuring the angular momentum and the displacement and/or acceleration of the sound source's movement with a gyroscope and a linear accelerometer, and converting these measurements into second orientation information; and

determining the relative positional relationship between the sound source and the image capture device from the first orientation information and the second orientation information.

Preferably, the method further comprises the steps of:

determining the viewing angle at which a user views the image; and

playing the audio data corresponding to that viewing angle.

Preferably, the method further comprises the steps of:

determining the viewing angle and distance at which the user views the image; and

performing volume weighting and direction adjustment according to the viewing angle and distance.

A multimedia data processing device comprises:

an acquisition unit for obtaining the relative positional relationship between a sound source and an image capture device;

the image capture device, for capturing image data within a predetermined range according to the relative positional relationship;

an audio capture device for capturing, according to the relative positional relationship, audio data emitted by the sound source within the predetermined range; and

a processing unit for associating the captured image data within the predetermined range with the audio data.

Preferably, the multimedia data processing device further includes a storage unit that stores the associated image data and audio data.

Preferably, a playback device in the multimedia data processing device plays the associated multimedia data, detects the direction and viewing angle at which the user views the image, and performs volume weighting and direction adjustment according to the user's viewing angle and distance.

The multimedia data processing device and method described above localize the sound source by obtaining the relative positional relationship between the sound source and the image capture device. The resulting audio data carries a sense of direction, which can greatly improve the user experience.

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention.

To make the objects, technical solutions, and advantages of the present invention clearer, the multimedia data processing device and method are described in further detail below with reference to the drawings and embodiments.

As shown in FIG. 1 and FIG. 2, a preferred embodiment of the present invention provides a multimedia data processing device 100.

The multimedia data processing device 100 determines orientation information for multiple sound sources 10, 12, 14. In a specific implementation, the sound sources 10, 12, 14 may be the voices of three different people (such as actors). This embodiment uses three sound sources as an example; there may be more or fewer than three, as long as there is at least one.

The multimedia data processing device 100 includes an acquisition unit 20, an image capture device 30, an audio capture device 40, a processing unit 50, and a storage unit 60.

The acquisition unit 20 obtains the relative positional relationship between the sound sources 10, 12, 14 and the image capture device 30. The relative positional relationship includes the direction and the distance between each sound source 10, 12, 14 and the image capture device 30.
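As an illustration of a relative positional relationship made up of a direction and a distance, the following minimal sketch computes both from planar coordinates. The coordinate values, the function name `relative_position`, and the choice of a 2D plane with azimuth measured counter-clockwise from the +x axis are assumptions made for this example; the patent does not specify a coordinate convention.

```python
import math

def relative_position(source_xy, camera_xy):
    """Return (azimuth_deg, distance) of a sound source as seen from the camera.

    Azimuth is measured counter-clockwise from the +x axis, in [0, 360).
    """
    dx = source_xy[0] - camera_xy[0]
    dy = source_xy[1] - camera_xy[1]
    distance = math.hypot(dx, dy)
    azimuth_deg = math.degrees(math.atan2(dy, dx)) % 360.0
    return azimuth_deg, distance

# Hypothetical positions (in metres) for the camera and sound sources 10, 12, 14.
camera = (0.0, 0.0)
sources = {10: (3.0, 4.0), 12: (-2.0, 0.0), 14: (0.0, -5.0)}
relations = {sid: relative_position(pos, camera) for sid, pos in sources.items()}
```

Keeping one (azimuth, distance) pair per sound source makes it straightforward to attach the pair to that source's audio later on.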

The image capture device 30 captures image data within a predetermined range according to the relative positional relationship. In this embodiment, the image capture device 30 includes multiple camera lenses, each used to capture image data within a predetermined range.

The audio capture device 40 captures, according to the relative positional relationship, the audio data emitted by the sound sources 10, 12, 14 within the predetermined range.

The processing unit 50 associates the captured image data within the predetermined range with the audio data.

The storage unit 60 stores the associated multimedia data.

In a preferred embodiment, when neither the sound sources 10, 12, 14 nor the image capture device 30 moves, the acquisition unit 20 has a positioning device placed at each sound source 10, 12, 14 output a positioning signal to the image capture device 30, so that the relative positional relationship between the sound sources 10, 12, 14 and the image capture device 30 can be determined from the received positioning signals. In this embodiment, the positioning device is one of an ultrasonic device, a Global Positioning System (GPS) device, and a Wireless Fidelity (WiFi) device.
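The text does not say how a positioning signal is turned into a distance. For the ultrasonic case, one common approach is time of flight, sketched below under the assumptions that the emitter and receiver share a clock and that sound travels at roughly 343 m/s in air. The function name and parameters are hypothetical.

```python
SPEED_OF_SOUND_M_S = 343.0  # approximate speed of sound in air at about 20 degrees C

def distance_from_time_of_flight(emit_time_s, receive_time_s,
                                 speed_m_s=SPEED_OF_SOUND_M_S):
    """Estimate the one-way distance travelled by an ultrasonic pulse.

    Assumes both timestamps come from synchronised clocks (an assumption).
    """
    flight_s = receive_time_s - emit_time_s
    if flight_s < 0:
        raise ValueError("receive time must not precede emit time")
    return speed_m_s * flight_s

# A pulse that takes 10 ms to arrive has travelled about 3.43 m.
example_distance_m = distance_from_time_of_flight(0.0, 0.010)
```

A GPS or WiFi positioning device would instead report coordinates or signal measurements, which would need a different conversion.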

In another preferred embodiment, when the sound sources 10, 12, 14 move and the image capture device 30 does not, the acquisition unit 20 uses a positioning device to measure the angular momentum of the movement of the sound sources 10, 12, 14 and converts the angular momentum into orientation information for the sound sources 10, 12, 14, so as to determine the relative positional relationship between the sound sources and the image capture device 30. In this embodiment, the positioning device may be a gyroscope.
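A minimal sketch of one way gyroscope readings could be converted into orientation information: integrate the readings over time to track a heading. It assumes the gyroscope delivers angular-rate samples about the vertical axis, in degrees per second, at a fixed interval. That is an assumption of this example; the text only states that angular momentum is measured and converted.

```python
def integrate_heading(initial_heading_deg, angular_rates_deg_s, dt_s):
    """Accumulate angular-rate samples into a heading in [0, 360).

    initial_heading_deg: heading before the first sample
    angular_rates_deg_s: rate samples about the vertical axis (assumed input)
    dt_s: time between samples, in seconds
    """
    heading = initial_heading_deg
    for rate in angular_rates_deg_s:
        heading = (heading + rate * dt_s) % 360.0
    return heading

# Four samples of 10 deg/s, 0.5 s apart, turn a 350 degree heading to 10 degrees.
new_heading = integrate_heading(350.0, [10.0] * 4, 0.5)
```

Integration of this kind accumulates sensor error over time, so a practical system would likely correct the heading against another reference now and then.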

In another preferred embodiment, when the image capture device 30 moves and the sound sources 10, 12, 14 do not, the acquisition unit 20 uses a positioning device to measure the displacement and/or acceleration of the image capture device 30 and converts the displacement and/or acceleration into orientation information for the image capture device 30, so as to determine the relative positional relationship between the sound sources 10, 12, 14 and the image capture device 30. In this embodiment, the positioning device may be a linear accelerometer.
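When only acceleration is measured, displacement can be estimated by integrating twice. The sketch below does this along a single axis with a simple Euler scheme. The fixed sample interval, the single axis, and the function name are assumptions for illustration and are not taken from the patent.

```python
def integrate_displacement(accelerations_m_s2, dt_s, initial_velocity_m_s=0.0):
    """Integrate acceleration samples twice to estimate displacement.

    Returns (displacement_m, final_velocity_m_s) along one axis.
    """
    velocity = initial_velocity_m_s
    displacement = 0.0
    for accel in accelerations_m_s2:
        velocity += accel * dt_s          # first integration: velocity
        displacement += velocity * dt_s   # second integration: displacement
    return displacement, velocity

# Constant 1 m/s^2 for four samples, 0.5 s apart.
moved_m, speed_m_s = integrate_displacement([1.0] * 4, 0.5)
```

Double integration amplifies sensor noise quickly, which is one reason the displacement estimate is usually combined with other positioning information.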

In another preferred embodiment, when both the image capture device 30 and the sound sources 10, 12, 14 move, the acquisition unit 20 uses a positioning device to measure the angular momentum and the displacement and/or acceleration of the image capture device 30 and converts these measurements into first orientation information. Likewise, the acquisition unit 20 uses a positioning device to measure the angular momentum and the displacement and/or acceleration of the movement of the sound sources 10, 12, 14 and converts these measurements into second orientation information for each source. The processing unit 50 determines the relative positional relationship between the sound sources 10, 12, 14 and the image capture device 30 from the first orientation information and the second orientation information.
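One way to combine the two pieces of orientation information is to express the sound source's position in the camera's own frame. The sketch below assumes the first orientation information reduces to a camera position and heading, and the second to a source position, all in a shared 2D world frame. These representations and names are assumptions made for this example.

```python
import math

def source_in_camera_frame(camera_xy, camera_heading_deg, source_xy):
    """Return (relative_bearing_deg, distance) of the source from the camera.

    Bearing is in [-180, 180): 0 is straight ahead of the camera, positive is
    counter-clockwise (to the camera's left), negative is to its right.
    """
    dx = source_xy[0] - camera_xy[0]
    dy = source_xy[1] - camera_xy[1]
    distance = math.hypot(dx, dy)
    world_bearing_deg = math.degrees(math.atan2(dy, dx))
    relative = (world_bearing_deg - camera_heading_deg + 180.0) % 360.0 - 180.0
    return relative, distance

# Camera at (1, 1) facing +y (heading 90 degrees).
ahead = source_in_camera_frame((1.0, 1.0), 90.0, (1.0, 4.0))     # directly ahead
to_right = source_in_camera_frame((1.0, 1.0), 90.0, (4.0, 1.0))  # to the right
```

Recomputing this pair whenever either pose changes keeps the relationship current while both the camera and the sources move.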

In a preferred embodiment, the multimedia data stored in the storage unit 60 can be transferred to a playback device (not shown) for use. Transfer methods include, but are not limited to, copying via a storage medium and wireless network transmission.

When a user uses the playback device, the playback device detects the direction in which the user is looking, determines the viewing angle at which the user views the image, and plays the audio data corresponding to that viewing angle based on the associated multimedia data it has obtained.

In a preferred embodiment, the playback device also performs volume weighting and direction adjustment according to the viewing angle and distance at which the user views the image.

Specifically, when a user wearing a playback device (such as a VR head-mounted display) faces a particular viewing angle in the image, the sound associated with that viewing angle comes from in front of the user, while the sounds associated with the other viewing angles in the image come from the user's left rear and right rear. The volume of the sound arriving from each direction is adjusted by weighting.
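Volume weighting and direction adjustment can be illustrated with a simple stereo model: attenuate each source by distance, then pan it left or right according to its angle relative to the user's viewing direction. The inverse-distance law, the constant-power pan, the clockwise angle convention, and all names below are assumptions for this sketch. The patent does not prescribe a weighting formula.

```python
import math

def spatial_gains(view_deg, source_deg, distance_m, ref_distance_m=1.0):
    """Return (left_gain, right_gain) for one sound source.

    Angles increase clockwise when seen from above, so a positive relative
    angle means the source is to the user's right.
    """
    relative = math.radians((source_deg - view_deg + 180.0) % 360.0 - 180.0)
    # Volume weighting: inverse-distance attenuation, capped at 1.0.
    distance_gain = ref_distance_m / max(distance_m, ref_distance_m)
    # Direction adjustment: constant-power pan, -1 is full left, +1 is full right.
    pan = math.sin(relative)
    theta = (pan + 1.0) * math.pi / 4.0
    return distance_gain * math.cos(theta), distance_gain * math.sin(theta)

front_left, front_right = spatial_gains(0.0, 0.0, 1.0)  # straight ahead, 1 m
side_left, side_right = spatial_gains(0.0, 90.0, 2.0)   # to the right, 2 m
```

A two-channel pan cannot tell front from rear, so sources behind the user would need a richer rendering model (for example head-related filtering) to be heard as coming from the left rear or right rear.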

Referring to FIG. 3, the multimedia data processing method includes the following steps:

Step S100: obtain the relative positional relationship between the sound source and the image capture device. This can be done in the following ways:

When the sound source moves and the image capture device does not, a positioning device measures the angular momentum of the sound source's movement, and the angular momentum is converted into orientation information for the sound source, so as to determine the relative positional relationship between the sound source and the image capture device. In a specific embodiment, the positioning device may use a gyroscope to measure the angular momentum.

When neither the sound source nor the image capture device moves, a positioning device placed at each sound source outputs a positioning signal to the image capture device, so that the relative positional relationship between the sound source and the image capture device can be determined from the received positioning signal. In a specific embodiment, the positioning device may locate the sound source using one of an ultrasonic device, a Global Positioning System (GPS) device, and a Wireless Fidelity (WiFi) device.

When the image capture device moves and the sound source does not, a positioning device measures the displacement and/or acceleration of the image capture device, and the displacement and/or acceleration is converted into orientation information for the image capture device, so as to determine the relative positional relationship between the sound source and the image capture device. In a specific embodiment, the positioning device may use a linear accelerometer to measure the displacement and acceleration of the image capture device.

When both the image capture device and the sound source move, a positioning device measures the angular momentum and the displacement and/or acceleration of the image capture device, and these measurements are converted into first orientation information. Likewise, a positioning device measures the angular momentum and the displacement and/or acceleration of the sound source's movement, and these measurements are converted into second orientation information. The processing unit determines the relative positional relationship between the sound source and the image capture device from the first orientation information and the second orientation information.

Step S102: capture, according to the relative positional relationship, the image data within a predetermined range and the audio data emitted by the sound source.

Step S104: associate the captured image data within the predetermined range with the audio data. Specifically, the determined relative positional relationship is combined with the corresponding audio data and image data to generate the associated multimedia data.
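The association in step S104 can be pictured as a record that stores each sound source's relative position alongside its image and audio data. The sketch below is only one possible layout. The class `AssociatedClip`, its field names, and the per-source dictionaries are hypothetical and are not defined in the patent.

```python
from dataclasses import dataclass

@dataclass
class AssociatedClip:
    """One sound source's data bundled with its relative position."""
    source_id: int
    azimuth_deg: float
    distance_m: float
    image_frames: list
    audio_samples: list

def associate(relations, images, audio):
    """Combine per-source positions, image data and audio data into records.

    relations: source id -> (azimuth_deg, distance_m)
    images, audio: source id -> captured data for that source's range
    """
    return [
        AssociatedClip(sid, azimuth, distance, images[sid], audio[sid])
        for sid, (azimuth, distance) in sorted(relations.items())
    ]

clips = associate(
    {10: (0.0, 2.0), 12: (90.0, 3.0)},
    {10: ["frame-a"], 12: ["frame-b"]},
    {10: [0.1], 12: [0.2]},
)
```

Storing the position with the media is what later lets a player choose and weight audio by viewing angle.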

Referring to FIG. 4, the multimedia data playback method includes the following steps:

Step S200: detect the direction in which the user is looking, determine the viewing angle at which the user views the image, and play the audio data corresponding to that viewing angle based on the associated multimedia data that has been obtained.

Specifically, when the user views a first viewing angle of the image with the playback device, the associated multimedia data is used to determine and play the audio data that corresponds to the first viewing angle. When the user views a second viewing angle of the image, the associated multimedia data is used to determine and play the audio data that corresponds to the second viewing angle. In the same way, as the user views different viewing angles of the image, the audio data corresponding to each viewing angle is played based on the associated multimedia data.
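Choosing the audio for a viewing angle can be sketched as a nearest-angle lookup over the stored associations. The example assumes each audio track is tagged with the azimuth of its sound source. The function names and the sample azimuths are hypothetical.

```python
def angular_gap(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)

def audio_for_view(view_deg, track_azimuths):
    """Return the id of the track whose azimuth is closest to the view angle.

    track_azimuths: source id -> azimuth in degrees (assumed association data)
    """
    return min(track_azimuths,
               key=lambda sid: angular_gap(view_deg, track_azimuths[sid]))

# Hypothetical azimuths for sound sources 10, 12 and 14.
tracks = {10: 0.0, 12: 120.0, 14: 240.0}
first_view = audio_for_view(350.0, tracks)   # closest to source 10
second_view = audio_for_view(100.0, tracks)  # closest to source 12
```

The wrap-around handling in `angular_gap` matters here: a view of 350 degrees is only 10 degrees away from a source at 0 degrees.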

Step S202: perform volume weighting and direction adjustment on the audio data according to the viewing angle and distance at which the user views the image.

Specifically, when the user uses the playback device and faces a particular viewing angle in the image, the sound associated with that viewing angle comes from in front of the user, while the sounds associated with the other viewing angles in the image come from the user's left rear and right rear. The volume of the sound arriving from each direction is adjusted by weighting.

The multimedia data processing device and method described above localize the sound source by obtaining the relative positional relationship between the sound source and the image capture device. The resulting audio data carries a sense of direction, which can greatly improve the user experience.

Finally, it should be noted that the above embodiments only illustrate the technical solutions of the present invention and do not limit them. Those of ordinary skill in the art should understand that the technical solutions may be modified or replaced with equivalents without departing from their spirit and scope. All other embodiments obtained by those of ordinary skill in the art on the basis of the embodiments herein, without creative effort, fall within the scope of protection of the present invention.

In summary, the present invention meets the requirements for an invention patent, and a patent application is hereby filed in accordance with the law. The foregoing, however, describes only preferred embodiments of the present invention and does not limit the scope of the claims of this application. Equivalent modifications or variations made in the spirit of the present invention by persons familiar with the art of this application shall be covered by the claims below.

100‧‧‧Multimedia data processing device
10, 12, 14‧‧‧Sound source
20‧‧‧Acquisition unit
30‧‧‧Image capture device
40‧‧‧Audio capture device
50‧‧‧Processing unit
60‧‧‧Storage unit

FIG. 1 is a block diagram of a preferred embodiment of the multimedia data processing device.

FIG. 2 is a schematic diagram of a preferred embodiment of the multimedia data processing device.

FIG. 3 is a flowchart of a preferred embodiment of the multimedia data processing method.

FIG. 4 is a flowchart of a preferred embodiment of the multimedia data playback method.

Claims (10)

1. A multimedia data processing method, comprising the steps of: obtaining the relative positional relationship between a sound source and an image capture device; capturing, according to the relative positional relationship, image data within a predetermined range and audio data emitted by the sound source; and associating the captured image data within the predetermined range with the audio data.

2. The multimedia data processing method of claim 1, wherein the step of obtaining the relative positional relationship between the sound source and the image capture device comprises: receiving a positioning signal output from the location of the sound source; and determining the relative positional relationship between the sound source and the image capture device from the received positioning signal.

3. The multimedia data processing method of claim 1, wherein the step of obtaining the relative positional relationship between the sound source and the image capture device comprises: measuring the angular momentum of the sound source's movement with a gyroscope; and converting the angular momentum into orientation information for the sound source, so as to determine the relative positional relationship between the sound source and the image capture device.

4. The multimedia data processing method of claim 1, wherein the step of obtaining the relative positional relationship between the sound source and the image capture device comprises: measuring the displacement and/or acceleration of the image capture device with a linear accelerometer; and converting the displacement and/or acceleration into orientation information for the image capture device, so as to determine the relative positional relationship between the sound source and the image capture device.

5. The multimedia data processing method of claim 1, wherein the step of obtaining the relative positional relationship between the sound source and the image capture device comprises: measuring the angular momentum and the displacement and/or acceleration of the image capture device with a gyroscope and a linear accelerometer, and converting these measurements into first orientation information; measuring the angular momentum and the displacement and/or acceleration of the sound source's movement with a gyroscope and a linear accelerometer, and converting these measurements into second orientation information; and determining the relative positional relationship between the sound source and the image capture device from the first orientation information and the second orientation information.

6. The multimedia data processing method of claim 1, further comprising the steps of: determining the viewing angle at which a user views the image; and playing the audio data corresponding to that viewing angle.

7. The multimedia data processing method of claim 6, further comprising the steps of: determining the viewing angle and distance at which the user views the image; and performing volume weighting and direction adjustment according to the viewing angle and distance.

8. A multimedia data processing device, comprising: an acquisition unit for obtaining the relative positional relationship between a sound source and an image capture device; the image capture device, for capturing image data within a predetermined range according to the relative positional relationship; an audio capture device for capturing, according to the relative positional relationship, audio data emitted by the sound source within the predetermined range; and a processing unit for associating the captured image data within the predetermined range with the audio data.

9. The multimedia data processing device of claim 8, further comprising a storage unit that stores the associated image data and audio data.

10. The multimedia data processing device of claim 8, wherein a playback device in the multimedia data processing device plays the associated multimedia data, detects the direction and viewing angle at which the user views the image, and performs volume weighting and direction adjustment according to the user's viewing angle and distance.
TW106142618A 2017-12-05 2017-12-05 Multimedia data processing device and method TWI636453B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW106142618A TWI636453B (en) 2017-12-05 2017-12-05 Multimedia data processing device and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW106142618A TWI636453B (en) 2017-12-05 2017-12-05 Multimedia data processing device and method

Publications (2)

Publication Number Publication Date
TWI636453B (en) 2018-09-21
TW201926317A (en) 2019-07-01

Family

ID=64452889

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106142618A TWI636453B (en) 2017-12-05 2017-12-05 Multimedia data processing device and method

Country Status (1)

Country Link
TW (1) TWI636453B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN205902062U (en) * 2016-08-12 2017-01-18 森声数字科技(深圳)有限公司 Fixing device and audio collecting device
TW201735667A (en) * 2016-03-29 2017-10-01 Marvel Digital Ltd Method, equipment and apparatus for acquiring spatial audio direction vector
TW201734948A (en) * 2016-03-03 2017-10-01 森翠根科技有限公司 A method, system and device for generating associated audio and visual signals in a wide angle image system
TWM549870U (en) * 2017-07-04 2017-10-01 華碩電腦股份有限公司 Virtual reality system and position detection module thereof
