TWI422227B

TWI422227B - System and method for multimedia meeting

Info

Publication number: TWI422227B
Application number: TW100114455A
Authority: TW
Inventors: Shih Jie Chen
Original assignee: Inventec Corp
Priority date: 2011-04-26
Filing date: 2011-04-26
Publication date: 2014-01-01
Also published as: TW201244480A

Description

Multimedia conference system and its service method

The present invention relates to a multimedia system, and more particularly to a multimedia conference system.

In a large conference, audience members at the back of the room are often far from the speaker and cannot easily see the speaker's expressions or hear the speech. Large conferences therefore usually rely on multimedia devices such as a camera, a microphone, a large screen, and loudspeakers. The speaker talks into the microphone and the loudspeakers amplify the sound. A camera films the speaker, and the image is projected onto the large screen so that people at the back can watch.

However, when the speaker moves around the podium, someone usually has to adjust the camera's viewing angle to keep the speaker in frame. In addition, when an audience member asks a question, the rest of the audience can usually hear the voice but cannot see the person, which is inconvenient.

The invention provides a multimedia conference system and a service method thereof that make it convenient to film the presenter or another speaker in a conference.

The invention provides a multimedia conference system including a first microphone, a first image capturing module, a control module, and a display device. The first microphone is adapted to emit a first sound signal. The first microphone has a first position detecting unit that detects the position of the first microphone and is adapted to emit a first position signal. The control module is adapted to receive the first position signal and to drive the first image capturing module to face the position of the first microphone, so that the first image capturing module captures a first digital image. The display device is coupled to the first image capturing module to display the first digital image.

In an embodiment of the invention, the first image capturing module includes a face detecting unit that detects a face contour in the first digital image, and the control module drives the first image capturing module to adjust its image capturing range so that the first digital image includes the face contour and the first microphone.

In an embodiment of the invention, the multimedia conference system further includes a second microphone and a second image capturing module. The second microphone is adapted to emit a second sound signal. The second microphone has a second position detecting unit that detects the position of the second microphone and is adapted to emit a second position signal. The control module is adapted to receive the second position signal and to drive the second image capturing module to face the position of the second microphone, so that the second image capturing module captures a second digital image. The control module determines whether the first microphone and the second microphone have sound input to the control module. When the control module determines that the first microphone has sound input, the display device displays the first digital image as the main picture. When the control module determines that the first microphone has no sound input and the second microphone has sound input, the display device displays the second digital image as the main picture.

In an embodiment of the invention, the multimedia conference system further includes a voice recognition module coupled to the control module and the display device. The voice recognition module recognizes the first sound signal as text and outputs the text to the display device.

In an embodiment of the invention, the multimedia conference system further includes a first wireless transmitter and a first wireless receiver. The first wireless transmitter is disposed on the first microphone and coupled to the first position detecting unit to transmit the first position signal and the first sound signal. The first wireless receiver is coupled to the control module, and the control module receives, through the first wireless receiver, the first position signal and the first sound signal transmitted by the first wireless transmitter.

The invention further provides a multimedia conference service method applicable to a multimedia conference system, including the following steps. First, a first position detecting unit detects the position of a first microphone and emits a first position signal. Next, according to the first position signal, a control module drives a first image capturing module to face the position of the first microphone. Then, the first image capturing module captures a first digital image. Thereafter, a display device displays the first digital image.

In an embodiment of the invention, the multimedia conference service method further includes the following steps. First, a face detecting unit detects a face contour in the first digital image. Next, the control module receives the face contour from the face detecting unit and drives the first image capturing module to adjust its image capturing range so that the first digital image includes the face contour and the first microphone.

In an embodiment of the invention, the multimedia conference service method further includes the following steps. First, a second position detecting unit detects the position of a second microphone and emits a second position signal. Next, according to the second position signal, the control module drives a second image capturing module to face the position of the second microphone. Then, the control module determines which of the first microphone and the second microphone has sound input. When the control module determines that the first microphone has sound input, the display device displays the first digital image as the main picture. When the control module determines that the first microphone has no sound input and the second microphone has sound input, the display device displays the second digital image as the main picture.

In an embodiment of the invention, the multimedia conference service method further includes the following steps. First, the control module determines whether the first microphone is turned on. When the control module determines that the first microphone is not turned on, the first position detecting unit detects the position of the first microphone. When the control module determines that the first microphone is turned on, the display device displays the first digital image.

In an embodiment of the invention, the multimedia conference service method further includes recognizing, by a voice recognition module, a first sound signal of the first microphone as text and outputting the text to the display device.

Based on the above, the invention obtains the position of the first microphone through the first position detecting unit, and the control module points the first image capturing module toward that position. The first image capturing module can therefore track and film the user holding the first microphone in real time as the user moves, which is convenient. In addition, the invention can use the first image capturing module and the second image capturing module to track the users holding the first microphone and the second microphone respectively, and present them in a picture-in-picture display. The interaction between the users holding the first and second microphones can thus be captured effectively.

To make the above features and advantages of the invention easier to understand, embodiments are described in detail below with reference to the accompanying drawings.

FIG. 1A is a block diagram of a multimedia conference system according to an embodiment of the invention. Referring to FIG. 1A, the multimedia conference system 100 includes a first microphone 110, a first image capturing module 130, a control module 140, and a display device 150. The first microphone 110 receives sound. The first microphone 110 has a first position detecting unit 120 that detects the position of the first microphone 110 and is adapted to emit a first position signal. In this embodiment, the first position detecting unit 120 is, for example, a gravity sensor (g-sensor) that senses the movement of the first microphone 110.

The first image capturing module 130 captures a first digital image. In this embodiment, the first digital image may be a moving video or a still photograph. The control module 140 is coupled to the first position detecting unit 120 and the first image capturing module 130. The control module 140 is adapted to receive the first position signal from the first position detecting unit 120, and the control module 140 can also change the orientation of the first image capturing module 130. The display device 150 is coupled to the first image capturing module 130 to display the first digital image. Furthermore, the first image capturing module 130, the control module 140, and the display device 150 may be integrated into a notebook computer (not shown), and the display device 150 may be a monitor, but the invention is not limited thereto.

FIG. 1B is a flowchart of a multimedia conference service method according to an embodiment of the invention. For convenience, the method of FIG. 1B is described below with reference to the multimedia conference system 100 of FIG. 1A. In one usage scenario, the user carries a notebook computer that integrates the first image capturing module 130, the control module 140, and the display device 150 to a conference venue, and connects the notebook computer to the first microphone 110 and to the venue's large screen (not shown) or projector (not shown).

Referring to FIG. 1A and FIG. 1B, step S110 is performed first: the first position detecting unit 120 detects the position of the first microphone 110 and emits a first position signal. At this point, a presenter can pick up the first microphone 110 to give a speech. Next, in step S120, the control module 140 drives the first image capturing module 130 to face the position of the first microphone 110 according to the first position signal. In other words, the first image capturing module 130 is driven by the control module 140 to face the presenter. Then, in step S130, the first image capturing module 130 captures a first digital image. After that, in step S140, the display device 150 displays the first digital image. The presenter then appears in the first digital image shown by the projector or on the large screen.

It is worth noting that throughout the presenter's speech, the control module 140 keeps driving the first image capturing module 130 toward the first microphone 110 according to the position information returned by the first position detecting unit 120. Therefore, wherever the presenter moves on the stage (not shown), the first image capturing module 130 can capture the presenter in real time.
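The tracking behaviour of steps S110 to S140 can be sketched as follows. The patent does not say how the position signal is encoded or how the camera is driven, so this sketch assumes that each position signal is a Cartesian coordinate in a shared room frame and that the camera accepts pan and tilt angles; `Position`, `aim_angles`, and `track` are hypothetical names introduced only for illustration.

```python
import math
from typing import Iterable, Iterator, NamedTuple, Tuple


class Position(NamedTuple):
    """Cartesian coordinates in metres (assumed shared room frame)."""
    x: float
    y: float
    z: float


def aim_angles(camera: Position, microphone: Position) -> Tuple[float, float]:
    """Return (pan, tilt) in degrees that point the camera at the microphone.

    Pan is measured in the horizontal x-y plane from the +x axis;
    tilt is the elevation above that plane.
    """
    dx = microphone.x - camera.x
    dy = microphone.y - camera.y
    dz = microphone.z - camera.z
    pan = math.degrees(math.atan2(dy, dx))
    tilt = math.degrees(math.atan2(dz, math.hypot(dx, dy)))
    return pan, tilt


def track(camera: Position,
          position_signals: Iterable[Position]) -> Iterator[Tuple[float, float]]:
    """Yield one (pan, tilt) command for every received position signal."""
    for microphone in position_signals:
        yield aim_angles(camera, microphone)
```

Recomputing the angles for every incoming position signal is what keeps the camera on the presenter while the microphone moves; a real system would additionally need to calibrate the camera's own position and orientation.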

FIG. 2A is a block diagram of a multimedia conference system according to another embodiment of the invention. Referring to FIG. 1A and FIG. 2A, the multimedia conference system 200 is conceptually similar to the multimedia conference system 100, and similar components are not described again. The multimedia conference system 200 includes a first microphone 210, a first image capturing module D1, a first wireless transmitter T1, a first wireless receiver R1, a second microphone 240, a second image capturing module D2, a second wireless transmitter T2, a second wireless receiver R2, a control module C, a voice recognition module 260, a face detecting unit 270, a display device 280, a three-dimensional rotating shaft A, and a driving motor M. In this embodiment, the first image capturing module D1, the first wireless receiver R1, the second position detecting unit 250, the second image capturing module D2, the second wireless receiver R2, the control module C, the voice recognition module 260, the face detecting unit 270, and the display device 280 can all be integrated into a notebook computer (not shown), but the invention is not limited thereto.

The second microphone 240 is adapted to emit a second sound signal and has a second position detecting unit 250 that detects the position of the second microphone 240 and is adapted to emit a second position signal. The second image capturing module D2 is coupled to the control module C and captures a second digital image. The control module C receives the position of the second microphone 240 from the second position detecting unit 250 and drives the second image capturing module D2 to face the position of the second microphone 240, so that the second image capturing module D2 captures the second digital image.

In this embodiment, the driving motor M can rotate the first image capturing module D1 and the second image capturing module D2 through 360 degrees by means of the three-dimensional rotating shaft A. The control module C is coupled to the first microphone 210 and the second microphone 240 to determine whether the first microphone 210 and the second microphone 240 have sound input. The display device 280 is coupled to the second image capturing module D2 and the control module C.

Furthermore, the first wireless transmitter T1 and the second wireless transmitter T2 are disposed on the first microphone 210 and the second microphone 240 respectively, and are coupled to the first position detecting unit 220 and the second position detecting unit 250 respectively, to transmit the position signals of the first microphone 210 and the second microphone 240. The first wireless receiver R1 and the second wireless receiver R2 are coupled to the control module C. The control module C receives, through the first wireless receiver R1, the position signal of the first microphone 210 transmitted by the first wireless transmitter T1, and receives, through the second wireless receiver R2, the position signal of the second microphone 240 transmitted by the second wireless transmitter T2.
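As a rough illustration of what a transmitter and receiver pair might exchange, the sketch below packs a microphone's position together with a block of audio samples into one frame and unpacks it again. The patent does not define any wire format, so the frame layout, the field sizes, and the names `HEADER`, `encode_frame`, and `decode_frame` are all assumptions made for this example.

```python
import struct
from typing import List, Sequence, Tuple

# Assumed frame layout (little-endian, no padding):
#   microphone id : 1 unsigned byte
#   x, y, z       : three 32-bit floats
#   sample count  : 1 unsigned 16-bit integer
# followed by that many signed 16-bit PCM samples.
HEADER = struct.Struct("<BfffH")


def encode_frame(mic_id: int,
                 position: Tuple[float, float, float],
                 samples: Sequence[int]) -> bytes:
    """Build one frame carrying a position signal and a sound signal."""
    header = HEADER.pack(mic_id, position[0], position[1], position[2], len(samples))
    return header + struct.pack(f"<{len(samples)}h", *samples)


def decode_frame(frame: bytes) -> Tuple[int, Tuple[float, float, float], List[int]]:
    """Recover the microphone id, position, and samples from a frame."""
    mic_id, x, y, z, count = HEADER.unpack_from(frame)
    samples = struct.unpack_from(f"<{count}h", frame, HEADER.size)
    return mic_id, (x, y, z), list(samples)
```

Carrying a microphone identifier in each frame is one way the control module could tell the two microphones apart even if both receivers fed a single input; the patent itself uses a separate receiver per microphone.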

In addition, the voice recognition module 260 is coupled to the first microphone 210, the second microphone 240, and the display device 280, to recognize the sound from the first microphone 210 and the second microphone 240 as text and output the text to the display device 280. The face detecting unit 270 is coupled to the first image capturing module D1, the second image capturing module D2, and the control module C, to detect a face contour in the first digital image and the second digital image. The control module C receives the face contour from the face detecting unit 270 and drives the first image capturing module D1 and the second image capturing module D2 to face the face contour.

FIG. 2B is a flowchart of a multimedia conference service method according to another embodiment of the invention. For convenience, the method of FIG. 2B is described below with reference to the multimedia conference system 200 of FIG. 2A. Referring to FIG. 2A and FIG. 2B, step S202 is performed first: the first position detecting unit 220 detects the position of the first microphone 210 and emits a first position signal. Next, in step S204, the control module C determines whether the first microphone 210 is turned on. If so, step S206 is performed, in which the control module C determines whether the second microphone 240 is turned on. When the control module C determines that the first microphone 210 is turned on and the second microphone 240 is not, step S208 is performed: according to the first position signal, the control module C drives the first image capturing module D1 to face the position of the first microphone 210. Then, in step S210, the first image capturing module D1 captures a first digital image. For example, at the start of a large conference, the presenter can place the notebook computer on a table and let the first image capturing module D1 follow the position of the presenter's first microphone 210 to film the presenter.

Then, in step S212, the face detecting unit 270 detects a face contour in the first digital image. Next, in step S214, the control module C receives the face contour from the face detecting unit 270 and drives the first image capturing module D1 to adjust its image capturing range so that the first digital image includes the face contour and the first microphone 210. After that, in step S216, the display device 280 displays the first digital image. Then, in step S218, the voice recognition module 260 recognizes the sound from the first microphone 210 as text and outputs the text to the display device 280. In other words, while the first image capturing module D1 follows the position of the presenter's first microphone 210, the face detecting unit 270 helps capture the presenter's image clearly, and the image is sent through the display device 280 to the projector. In addition, the sound picked up by the first microphone 210 can be converted into text in real time by the voice recognition module 260 and projected on the large screen.
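One simple way to picture the range adjustment of step S214 is as a bounding-box computation: take the detected face contour and the microphone's location in the frame, and choose the smallest region that contains both. The sketch below assumes both are already available as pixel rectangles, which the patent does not specify; `Box`, `capture_range`, and the `margin` parameter are hypothetical.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned rectangle in pixel coordinates."""
    left: int
    top: int
    right: int
    bottom: int


def capture_range(face: Box, microphone: Box, frame: Box, margin: int = 20) -> Box:
    """Return the smallest box, padded by `margin`, that contains both the
    face and the microphone, clipped to the bounds of the full frame."""
    left = min(face.left, microphone.left) - margin
    top = min(face.top, microphone.top) - margin
    right = max(face.right, microphone.right) + margin
    bottom = max(face.bottom, microphone.bottom) + margin
    return Box(
        max(left, frame.left),
        max(top, frame.top),
        min(right, frame.right),
        min(bottom, frame.bottom),
    )
```

The resulting box could then be used either to crop the digital image or to derive zoom and re-centering commands for the camera; which of the two the patent intends is not stated.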

By contrast, when the control module C determines that the first microphone 210 is turned on and the second microphone 240 is also turned on, step S220 is performed: according to the second position signal, the control module C drives the second image capturing module D2 to face the position of the second microphone 240. Next, in step S222, the second image capturing module D2 captures a second digital image. In other words, when someone raises a hand to speak, the second image capturing module D2 is activated and wirelessly detects the position of the speaker's second microphone 240 along the X, Y, and Z axes in order to find the speaker. At this point, the face detecting unit 270 can also be used to capture the speaker's image, which is sent through the display device 280 to the projector. In addition, the sound picked up by the second microphone 240 is converted into text in real time by the voice recognition module 260 and projected on the large screen.
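The branching of FIG. 2B described so far can be condensed into a small dispatch function. This is only a reading of the text, not the flowchart itself: the step labels come from the description, while the function name `next_steps` and the handling of the case where the first microphone is off (returning to position detection, following the summary of the method) are assumptions.

```python
from typing import List


def next_steps(first_on: bool, second_on: bool) -> List[str]:
    """Return the steps that follow the checks in S204 and S206."""
    if not first_on:
        # Assumed from the summary: keep detecting the microphone position.
        return ["S202"]
    if not second_on:
        # Only the presenter's microphone is on: aim D1, capture, detect the
        # face, adjust the capture range, display, and recognize speech.
        return ["S208", "S210", "S212", "S214", "S216", "S218"]
    # Both microphones are on: aim D2, capture the second image, then
    # compare sound input to choose the main picture.
    return ["S220", "S222", "S224"]
```

For example, `next_steps(True, False)` yields the single-camera sequence beginning at S208, and `next_steps(True, True)` yields the sequence beginning at S220.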

Next, in step S224, the control module C determines which of the first microphone 210 and the second microphone 240 has sound input. When the control module C determines that the first microphone 210 has sound input, step S226 is performed and the display device 280 displays the first digital image as the main picture. By contrast, when the control module C determines that the first microphone 210 has no sound input and the second microphone 240 has sound input, step S228 is performed and the display device 280 displays the second digital image as the main picture. For example, when someone asks a question, that is, when the speaker's second microphone 240 is activated, the projected image switches to a picture-in-picture display. When the presenter's first microphone 210 has sound input, the presenter's image is set as the main picture and the questioner's image as the sub-picture. Otherwise, the main picture and the sub-picture are swapped.
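The selection rule of steps S224 to S228 is a two-level priority check, sketched below. The text does not say what happens when neither microphone has sound input, so keeping the current layout in that case is an assumption; `Source` and `select_main_picture` are hypothetical names.

```python
from enum import Enum
from typing import Optional


class Source(Enum):
    FIRST = "first digital image"
    SECOND = "second digital image"


def select_main_picture(first_has_sound: bool,
                        second_has_sound: bool,
                        current: Optional[Source] = None) -> Optional[Source]:
    """Choose which image the display shows as the main picture."""
    if first_has_sound:
        # S226: the first microphone takes priority whenever it has sound.
        return Source.FIRST
    if second_has_sound:
        # S228: first microphone silent, second microphone active.
        return Source.SECOND
    # Assumption: with no sound on either microphone, keep the layout as is.
    return current
```

Note that the first microphone wins when both have sound input, which matches the wording that the second digital image becomes the main picture only when the first microphone has no sound input.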

In summary, the invention obtains the position of the first microphone through the first position detecting unit, and the control module points the first image capturing module toward that position. The first image capturing module can therefore track and film the presenter holding the first microphone in real time as the presenter moves, which is convenient. In addition, the invention can use the first image capturing module and the second image capturing module to track the presenter holding the first microphone and the questioner holding the second microphone respectively, and present the two in a picture-in-picture display. The interaction between the presenter and the questioner can thus be captured effectively. Moreover, because the face detecting unit detects the presenter and the speaker in the first and second digital images, the audience can clearly see their expressions. Finally, both the presenter's speech and the speaker's questions can be converted into text in real time by the voice recognition module and projected on the large screen, so that everyone can better follow the exchange between the presenter and the questioner.

Although the invention has been disclosed above by way of embodiments, they are not intended to limit the invention. Anyone of ordinary skill in the art may make minor changes and refinements without departing from the spirit and scope of the invention. The scope of protection of the invention is therefore defined by the appended claims.

100: multimedia conference system

110, 210: first microphone

120, 220: first position detecting unit

130, D1: first image capturing module

140, C: control module

150, 280: display device

200: multimedia conference system

240: second microphone

250: second position detecting unit

260: voice recognition module

270: face detecting unit

A: three-dimensional rotating shaft

D2: second image capturing module

M: driving motor

T1: first wireless transmitter

T2: second wireless transmitter

R1: first wireless receiver

R2: second wireless receiver

S110~S140, S202~S228: steps

FIG. 1A is a block diagram of a multimedia conference system according to an embodiment of the invention.

FIG. 1B is a flowchart of a multimedia conference service method according to an embodiment of the invention.

FIG. 2A is a block diagram of a multimedia conference system according to another embodiment of the invention.

FIG. 2B is a flowchart of a multimedia conference service method according to another embodiment of the invention.

Claims

A multimedia conference system, comprising: a first microphone adapted to emit a first sound signal, the first microphone having a first position detecting unit configured to detect a position of the first microphone and adapted to emit a first position signal; a first image capturing module; a control module adapted to receive the first position signal and to drive the first image capturing module to face the position of the first microphone, so that the first image capturing module captures a first digital image; a display device coupled to the first image capturing module to display the first digital image; and a voice recognition module coupled to the control module and the display device to recognize the first sound signal as text and output the text to the display device.

The multimedia conference system of claim 1, wherein the first image capturing module comprises a face detecting unit for detecting a face contour in the first digital image, and the control module drives the first image capturing module to adjust its image capturing range so that the first digital image includes the face contour and the first microphone.

The multimedia conference system of claim 1, further comprising: a second microphone adapted to emit a second sound signal, the second microphone having a second position detecting unit configured to detect a position of the second microphone and adapted to emit a second position signal; and a second image capturing module, wherein the control module is adapted to receive the second position signal and to drive the second image capturing module to face the position of the second microphone, so that the second image capturing module captures a second digital image, and wherein the control module is adapted to determine whether the first microphone and the second microphone have sound input to the control module; when the control module determines that the first microphone has sound input, the display device displays the first digital image as a main picture, and when the control module determines that the first microphone has no sound input and the second microphone has sound input, the display device displays the second digital image as the main picture.

The multimedia conference system of claim 1, further comprising: a first wireless transmitter disposed on the first microphone and coupled to the first position detecting unit to transmit the first position signal and the first sound signal; and a first wireless receiver coupled to the control module, wherein the control module receives, through the first wireless receiver, the first position signal and the first sound signal transmitted by the first wireless transmitter.

A multimedia conference service method, applicable to a multimedia conference system, comprising: detecting a position of a first microphone by a first position detecting unit and emitting a first position signal; driving, by a control module and according to the first position signal, a first image capturing module to face the position of the first microphone; capturing a first digital image by the first image capturing module; displaying the first digital image by a display device; and recognizing, by a voice recognition module, a first sound signal of the first microphone as text and outputting the text to the display device.

The multimedia conference service method of claim 5, further comprising: detecting, by a face detecting unit, a face contour in the first digital image; and receiving, by the control module, the face contour from the face detecting unit to drive the first image capturing module to adjust its image capturing range, so that the first digital image includes the face contour and the first microphone.

The multimedia conference service method of claim 5, further comprising: detecting a position of a second microphone by a second position detecting unit and emitting a second position signal; driving, by the control module and according to the second position signal, a second image capturing module to face the position of the second microphone; determining, by the control module, which of the first microphone and the second microphone has sound input; when the control module determines that the first microphone has sound input, displaying, by the display device, the first digital image as a main picture; and when the control module determines that the first microphone has no sound input and the second microphone has sound input, displaying, by the display device, the second digital image as the main picture.

The multimedia conference service method of claim 5, further comprising: determining, by the control module, whether the first microphone is turned on; when the control module determines that the first microphone is not turned on, detecting the position of the first microphone by the first position detecting unit; and when the control module determines that the first microphone is turned on, displaying the first digital image by the display device.