TW202213992A - Live broadcasting method for real time three-dimensional image display - Google Patents

Live broadcasting method for real time three-dimensional image display Download PDF

Info

Publication number
TW202213992A
TW202213992A TW109131913A TW109131913A TW202213992A TW 202213992 A TW202213992 A TW 202213992A TW 109131913 A TW109131913 A TW 109131913A TW 109131913 A TW109131913 A TW 109131913A TW 202213992 A TW202213992 A TW 202213992A
Authority
TW
Taiwan
Prior art keywords
image
live broadcast
user terminal
dimensional
live
Prior art date
Application number
TW109131913A
Other languages
Chinese (zh)
Other versions
TWI836141B (en
Inventor
施清德
Original Assignee
大陸商深圳市博浩光電科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 大陸商深圳市博浩光電科技有限公司 filed Critical 大陸商深圳市博浩光電科技有限公司
Priority to TW109131913A priority Critical patent/TWI836141B/en
Priority claimed from TW109131913A external-priority patent/TWI836141B/en
Publication of TW202213992A publication Critical patent/TW202213992A/en
Application granted granted Critical
Publication of TWI836141B publication Critical patent/TWI836141B/en

Links

Images

Abstract

A live broadcast for real time three-dimensional (3D) image display is disclosed in the present disclosure and includes: recording an object to acquire 3D images from the object; performing an image processing of the 3D images by a 3D live broadcast algorithm module; generating a video deployment setup according to at least one video choice of a user terminal vie an AI learning module; and optimizing the 3D image according to the video deployment setup and a usage circumstance of a live broadcast system and displaying the corresponding 3D image to the user terminal in cooperated with the video deployment setup after optimizing the 3D image.

Description

即時三維影像顯示的直播方法Live broadcast method of real-time 3D image display

本發明涉及一種直播方法,特別是涉及一種即時三維影像顯示功能的直播方法。The invention relates to a live broadcast method, in particular to a live broadcast method with a real-time three-dimensional image display function.

自從即時通訊軟體發明以來,經過許多年的發展及技術進步,已經從一開始的純文字雙方溝通、聊天,到現在已經進入到視訊加文字的溝通聊天方式,漸漸成為現代人們生活的一部分,除了傳統的即時通訊功能以外,經過一段時間的發展,已經產生了許多的應用,如直播購物、網路即時會議、直播娛樂等等。即時通訊的聊天方式在表情、自拍、濾鏡的表達方式進化到“將一切都圖片化、視頻化”的社交趨勢後,直播這個自我表達的新領域便順勢誕生了。Since the invention of instant messaging software, after many years of development and technological progress, communication and chat between the two parties have gone from pure text at the beginning to video plus text communication and chat, which has gradually become a part of modern people's lives. In addition to the traditional instant messaging function, after a period of development, many applications have emerged, such as live shopping, online instant meetings, live entertainment and so on. After the chat method of instant messaging evolved from the expression methods of emoticons, selfies, and filters to the social trend of "making everything into pictures and videos", live broadcasting, a new field of self-expression, was born.

網路直播的購物方式特點在於現場直播,並經過後台很短的時間差距即刻將產品介紹內容廣播出去到網路上,所以用戶終端或是觀看者可即時的看到產品介紹內容,並可以與直播主進行即時的互動。如圖1所示,其顯示通過直播主12透過行動裝置的軟體(APP)在網路上向用戶終端11直播販售貨品的示意圖,這種交易與交流的方式,不同於傳統的電視購物及網電購物,電視購物採用預錄製再擇時播放的方式,所以購物並不是即時的;而傳統網店則是採用網上置放產品介紹圖片的掛網,再配合網店經營者的即時服務完成產品交易。The shopping method of online live broadcast is characterized by live broadcast, and the product introduction content is broadcast to the Internet immediately after a short time gap in the background, so user terminals or viewers can see the product introduction content in real time, and can interact with the live broadcast. The master interacts instantly. As shown in FIG. 1, it shows a schematic diagram of live selling goods to the user terminal 11 on the Internet through the software (APP) of the mobile device through the live broadcast host 12. This transaction and communication method is different from traditional TV shopping and online shopping. E-shopping and TV-shopping are pre-recorded and then played at a specific time, so shopping is not instant; while traditional online stores use a hanging network to place product introduction pictures on the Internet, and then cooperate with the online store operator's real-time service to complete product transactions.

直播的可視性及即時的互動方式,漸漸受到現代人們追捧,因此,當這種方式應用在網路的購物方式上,可以大量的節省人們在外採購物品所耗費的時間,並且在購物的過程中,充滿了娛樂的樂趣,也提高了購物的即時性。但是這種網路即時觀看即時購物的方式,雖然有主持人或是專業人員在直播現場介紹產品,購物者在遠端的室內環境或是移動的情境中觀看顯示螢幕上的2D產品顯像而決定是否購物,由於不是在產品現場觀看,透過用戶終端的手機或是智能型電視機所看到的2D產品介紹影像,會產生與實際的物品有落差,等到消費終端收到實物產品時候,發現與心理的期待有不小的差異。The visibility and instant interaction of live broadcasts are gradually being sought after by modern people. Therefore, when this method is applied to online shopping, it can save a lot of time for people to purchase items outside, and in the process of shopping , full of entertainment fun, but also improve the immediacy of shopping. However, in this way of online real-time shopping, although there are hosts or professionals introducing products on the live broadcast, shoppers watch 2D product images on the display screen in a remote indoor environment or a mobile situation. Decide whether to shop or not. Since the 2D product introduction video viewed through the user terminal's mobile phone or smart TV is not viewed at the product site, there will be a gap with the actual item. When the consumer terminal receives the physical product, it is found that It is quite different from the psychological expectation.

直播聊天、娛樂、購物的方式之所以會受到人們的喜歡及流行,除了即時、生動畫面的可看性,直播主的氣氛帶動之外,觀賞者更可以與直播主進行互動或是留言,顯示螢幕上可以直接顯示出觀賞者及直播主的文字互動結果,這種互動式的行為模式,可以很大的縮短雙方的溝通成本,在直播購物的應用例子中,就可以縮短購物者及銷售方的距離,並且這種方式也適合於手持行動裝置,例如手機、平板電腦等等。所以直播可應用的平台範圍相較於過去的各種即時通訊軟體、電視購物、平面式的網路購物更廣泛也更直接,也打破了過去的平台之間的界線。The reason why live chat, entertainment, and shopping are popular and popular among people is that in addition to the real-time, vivid picture visibility and the atmosphere of the live broadcaster, viewers can interact with the live broadcaster or leave a message, showing The screen can directly display the text interaction results of the viewer and the live broadcaster. This interactive behavior mode can greatly reduce the communication cost of both parties. In the application example of live shopping, it can shorten the time between shoppers and sellers. distance, and this method is also suitable for handheld mobile devices, such as mobile phones, tablet computers, etc. Therefore, the range of applicable platforms for live broadcasting is wider and more direct than various instant messaging software, TV shopping, and flat-screen online shopping in the past, and it also breaks the boundaries between the past platforms.

但是目前的這種直播系統10,如圖1所示,仍然是依靠在傳統的二維(2D)視頻及影像顯示技術基礎上,因此如觀眾或購物的用戶終端11對於主播12的認識、直播購物依然存在與真實的世界有很大的理解差異。舉例來說,直播方或是銷售方在進行化妝品的說明場景,如圖2所示,由於傳統2D視頻上所廣播的影像20不具備深度的立體訊息,銷售方在直播現場已經塗抹了化妝品的顏色及形狀,在遠端的觀看者卻因為影像形成的限制,很難感受到直播現場光亮立體的色彩及光澤感,這對於產品的展示效果大大的打了折扣。這是因為這類的產品在展示時,沒有透過具備深度訊息的三維立體光影、及三維影像,無法表現出產品的完整特色。類似這樣的應用例子,在目前的二維影像顯示上,已經產生了很多限制。However, the current live broadcast system 10, as shown in FIG. 1, still relies on the traditional two-dimensional (2D) video and image display technology. There is still a big difference in understanding between shopping and the real world. For example, in the scene where the live broadcaster or the seller is explaining cosmetics, as shown in Figure 2, since the image 20 broadcasted on the traditional 2D video does not have deep stereoscopic information, the seller has already applied cosmetics on the live broadcast scene. In terms of color and shape, it is difficult for viewers at the far end to feel the bright and three-dimensional color and gloss of the live broadcast due to the limitation of image formation, which greatly reduces the display effect of the product. This is because when such products are displayed, the complete features of the product cannot be expressed without the use of three-dimensional light and shadow with depth information and three-dimensional images. Similar to this application example, there have been many limitations in the current two-dimensional image display.

故,如何通過設計的改良,來解決直播系統的平面顯示限制,並可以提高交流以及溝通的效率,已成為該項事業所欲解決的重要課題之一。Therefore, how to solve the flat display limitation of the live broadcast system and improve the efficiency of communication and communication through the improvement of the design has become one of the important issues to be solved by the project.

本發明所要解決的技術問題在於,針對現有技術的不足提供一種可以顯示三維影像的直播方法,且這種直播方法不會造成影像品質的低落或使用者觀看時的延遲問題產生。The technical problem to be solved by the present invention is to provide a live broadcast method capable of displaying three-dimensional images in view of the deficiencies of the prior art, and the live broadcast method will not cause the degradation of the image quality or the delay problem when users watch.

為了解決上述的技術問題,本發明所採用的其中一技術方案是提供一種即時三維影像顯示功能的直播方法,其包括:對一實體進行取像,並獲取實體的一三維影像;通過一三維直播演算模組,進行三維影像的影像處理;根據一用戶終端的至少一視頻選擇條件,通過一人工智慧學習模組,產生一視頻調配組合;以及 根據視頻調配組合,對三維影像進行優化;以及將優化後的三維影像,並配合用戶終端的視頻調配組合,將對應的三維影像顯示在所述用戶終端中。In order to solve the above-mentioned technical problems, one of the technical solutions adopted by the present invention is to provide a live broadcast method with a real-time 3D image display function, which includes: capturing an image of an entity, and obtaining a 3D image of the entity; an arithmetic module for image processing of three-dimensional images; according to at least one video selection condition of a user terminal, through an artificial intelligence learning module, a video allocation combination is generated; and according to the video allocation combination, the three-dimensional image is optimized; and The optimized three-dimensional image is displayed in the user terminal in combination with the video allocation and combination of the user terminal.

本發明的其中一有益效果在於,本發明所提供的直播系統與直播方法,為減少資料的即時流量,採用影像的二維與三維並存,混合編輯的方法,這樣觀看者可以把焦點放置在需要關注的產品或人物身上,以觀賞到最佳及最需要的三維影像,並且這種二維與三維影像並存的方式可以在使用者介面上進行選擇設置二維影像觀賞,或是三維影像觀賞,亦即表示使用者可以隨時變更需要觀看的三維影像位置,可以設置在區域性,或是全圖三維顯示。二維影像可以選擇三維左右式影像中的左影像或是右影像作為二維影像的顯示圖。另外,除了用戶終端裝置的立體顯示方式外,主播端或是雲端後台的控制端,也可以放置立體顯示裝置,以提供主播端的立體顯示預覽,及與用戶終端的互動立體顯示。另外,更可以讓用戶終端或直播主端通過三維直播影像的橫向縱向控制裝置可以容易選擇橫放或縱放的方式來進行三維直播顯示。One of the beneficial effects of the present invention is that, in the live broadcasting system and the live broadcasting method provided by the present invention, in order to reduce the real-time flow of data, two-dimensional and three-dimensional images are coexisted, and the method of mixed editing is adopted, so that the viewer can focus on the needs. On the product or person concerned, to watch the best and most needed 3D images, and this coexistence of 2D and 3D images can be selected on the user interface to set 2D image viewing or 3D image viewing, That is to say, the user can change the position of the 3D image to be viewed at any time, which can be set in a regional or a full-image 3D display. For the 2D image, the left image or the right image in the 3D left-right image can be selected as the display image of the 2D image. In addition, in addition to the stereoscopic display mode of the user terminal device, a stereoscopic display device can also be placed on the host or the control end of the cloud backend to provide a stereoscopic display preview of the host and interactive stereoscopic display with the user terminal. In addition, the user terminal or the live broadcast host terminal can easily select the horizontal or vertical mode to display the 3D live broadcast through the horizontal and vertical control device of the 3D live broadcast image.

為使能更進一步瞭解本發明的特徵及技術內容,請參閱以下有關本發明的詳細說明與圖式,然而所提供的圖式僅用於提供參考與說明,並非用來對本發明加以限制。For a further understanding of the features and technical content of the present invention, please refer to the following detailed descriptions and drawings of the present invention. However, the drawings provided are only for reference and description, and are not intended to limit the present invention.

以下是通過特定的具體實施例來說明本發明所公開有關“即時三維影像顯示的直播方法”的實施方式,本領域技術人員可由本說明書所公開的內容瞭解本發明的優點與效果。本發明可通過其他不同的具體實施例加以施行或應用,本說明書中的各項細節也可基於不同觀點與應用,在不背離本發明的構思下進行各種修改與變更。另外,本發明的附圖僅為簡單示意說明,並非依實際尺寸的描繪,事先聲明。以下的實施方式將進一步詳細說明本發明的相關技術內容,但所公開的內容並非用以限制本發明的保護範圍。另外,本文中所使用的術語“或”,應視實際情況可能包括相關聯的列出項目中的任一個或者多個的組合。The following is a specific embodiment to illustrate the implementation of the "live streaming method for real-time 3D image display" disclosed in the present invention, and those skilled in the art can understand the advantages and effects of the present invention from the content disclosed in this specification. The present invention can be implemented or applied through other different specific embodiments, and various details in this specification can also be modified and changed based on different viewpoints and applications without departing from the concept of the present invention. In addition, the drawings of the present invention are merely schematic illustrations, and are not drawn according to the actual size, and are stated in advance. The following embodiments will further describe the related technical contents of the present invention in detail, but the disclosed contents are not intended to limit the protection scope of the present invention. In addition, the term "or", as used herein, should include any one or a combination of more of the associated listed items, as the case may be.

[本發明直播方法實施例][Embodiment of the live broadcast method of the present invention]

圖3顯示本發明之即時三維影像顯示的直播方法的步驟流程圖,圖4A為本發明的三維影像的影像調整的示意圖,圖4B為本發明的人工智慧學習模組學習用戶終端的使用習慣的示意圖,圖5為本發明的即時三維影像顯示功能的直播系統的系統方塊圖。3 shows a flow chart of the steps of the live broadcast method for real-time 3D image display of the present invention, FIG. 4A is a schematic diagram of image adjustment of a 3D image according to the present invention, and FIG. A schematic diagram, FIG. 5 is a system block diagram of a live broadcast system with a real-time 3D image display function of the present invention.

如圖3所示,並參考圖5的元件標示,本發明實施例提供一種即時三維影像顯示的直播方法包括下列步驟。在步驟S301中,對一實體進行取像,並獲取實體的一三維影像,應用一影像擷取模組51對直播主進行攝影,影像擷取模組51可以是三維影像機或二維影像機等,任何可以攝影的電子裝置都可以本發明的影像擷取模組51。As shown in FIG. 3 , and with reference to the component labels in FIG. 5 , an embodiment of the present invention provides a live broadcast method for real-time 3D image display, including the following steps. In step S301, an entity is imaged, and a three-dimensional image of the entity is acquired, and an image capture module 51 is used to photograph the live broadcaster. The image capture module 51 may be a three-dimensional video camera or a two-dimensional video camera etc., any electronic device capable of taking pictures can use the image capturing module 51 of the present invention.

在本發明的較佳實施例中,影像擷取模組51為三維攝影機,要解決三維影像顯示訊息的不足,在本發明的較佳實施例中利用了三維影像的拍攝及編制,在此所述的三維影像拍攝,是採用即時的內嵌式三維的雙鏡頭攝影機擷取三維影像。或者,在不同實施例中,可以應用二維攝影機,先取得直播主的二維影像,再透過二維轉三維的影像轉換技術,將二維影像轉換為三維影像,如何擷取三維影像或如何將二維影像轉換為三維影像的技術為本領域具有通常知識者所熟知,在此不再贅述。為減少資料的即時流量,採用影像的二維與三維並存,混合編輯的方法,這樣用戶終端在觀看時可以把焦點放置需要關注的產品或人物身上,以觀賞到最佳及最需要的三維影像,並且這種二維與三維影像並存的方式,可以在使用者介面上進行選擇設置二維影像觀賞,或是三維影像觀賞,亦即表示用戶終端可以隨時變更需要觀看的三維影像位置,可以設置在區域性,或是全圖三維顯示。另外,在此需要說明的是,三維影像所需傳輸的資料量大於二維影像所需傳輸的資料量,本發明通過在影像中同時包括三維的圖像與二維的圖像的方式,可以在影像撥放時,傳輸相較於全圖三維顯示時較少的資料量,因此,可以維持直播時撥放的順暢度以及降低資料傳輸延遲(lag)的產生。In the preferred embodiment of the present invention, the image capturing module 51 is a three-dimensional camera. To solve the shortage of the information displayed by the three-dimensional image, the shooting and compilation of the three-dimensional image are used in the preferred embodiment of the present invention. The above-mentioned 3D image shooting uses a real-time embedded 3D dual-lens camera to capture 3D images. Alternatively, in different embodiments, a 2D camera may be used to first obtain the 2D image of the live broadcast host, and then convert the 2D image into a 3D image through a 2D to 3D image conversion technology. How to capture the 3D image or how to The technology of converting a 2D image into a 3D image is well known to those skilled in the art, and will not be repeated here. In order to reduce the real-time flow of data, the two-dimensional and three-dimensional images coexist, and the method of mixed editing is adopted, so that the user terminal can focus on the products or characters that need attention when watching, so as to watch the best and most needed 3D images. , and this coexistence of 2D and 3D images allows you to select and set 2D image viewing or 3D image viewing on the user interface, which means that the user terminal can change the position of the 3D image to be viewed at any time, and can set In regional, or full-scale 3D display. In addition, it should be noted here that the amount of data that needs to be transmitted for 3D images is greater than the amount of data that needs to be transmitted for 2D images. During video playback, a smaller amount of data is transmitted than when full-image 3D display is performed. Therefore, the smoothness of playback during live broadcast can be maintained and the generation of data transmission delay (lag) can be reduced.

在步驟S302中,通過一三維直播演算模組52進行三維影像的影像處理。當本發明的影像擷取模組51接收到三維影像40後,會進行影像處理,因為用戶終端所使用的行動裝置並非相同,且用戶終端所在的位置或環境也不相同,不同的行動裝置會有不同的資料傳輸速度,且所在位置不同,其資料傳輸的速度也不同,因此在三維影像進行優化的步驟中,對三維影像進行切割調整,並根據視頻調配組合,通過人工智慧學習模組54,對三維影像進行優化。In step S302 , the image processing of the three-dimensional image is performed by a three-dimensional live broadcast calculation module 52 . When the image capture module 51 of the present invention receives the three-dimensional image 40, it will perform image processing, because the mobile devices used by the user terminals are not the same, and the locations or environments of the user terminals are also different, different mobile devices will There are different data transmission speeds and different locations, so the data transmission speeds are also different. Therefore, in the step of optimizing the 3D image, the 3D image is cut and adjusted, and the combination is arranged and combined according to the video, through the artificial intelligence learning module 54 , to optimize the 3D image.

本發明的即時三維影像顯示功能的直播方法會根據不同的資料傳輸速度調整影像。在現有的立體視覺技術上,都是將一整幅影像進行處理,因此往往產生至少二倍於二維影像的資料量,這樣的資料傳輸量對於即時傳輸的直播系統產生了相當大的障礙,最常碰到的情況便是影像掉禎(frame)、馬賽克或是延遲嚴重,降低了使用的樂趣及耐心。因此,本發明對於影像進行了自動偵測,並做前景、後景以及影像分割做適配效果外,如圖4A所示,更進一步將三維影像40進行影像容錯及校正,並偵測與分離三維影像40中的多個物體。若物體為人,對物體進行影像柔和濾光、背景虛化、濾鏡效果或美顏美肌等影像處理;若物體為物,對物體進行背景剝離、邊緣強化、增強對比或影像放大等影像處理;若物體同為人與物,對影像進行顏色、美顏、圖形邊緣最適化的計算。背景剝離是將三維影像40中的背景從影像中分離出來,因此可以在後續的步驟中根據用戶終端的使用習慣替換不同的背景。最後,輸出最佳化、經過影像處理後的三維影像40。The live broadcast method of the real-time three-dimensional image display function of the present invention adjusts images according to different data transmission speeds. In the existing stereo vision technology, a whole image is processed, so the amount of data at least twice that of a two-dimensional image is often generated. The most common situation is that the image is dropped (frame), mosaic or lag is serious, which reduces the fun and patience of use. Therefore, the present invention automatically detects the image, and performs foreground, background, and image segmentation for adaptation effects. As shown in FIG. 4A , the three-dimensional image 40 is further subjected to image error tolerance and correction, and detection and separation are performed. A plurality of objects in the 3D image 40 . If the object is a person, perform image processing such as image softening filter, background blur, filter effect, or beautifying skin on the object; if the object is an object, perform image processing such as background peeling, edge enhancement, contrast enhancement or image enlargement on the object ; If the object is both a person and an object, the calculation of color, beauty, and graphic edge optimization is performed on the image. Background peeling is to separate the background in the three-dimensional image 40 from the image, so different backgrounds can be replaced in subsequent steps according to the usage habits of the user terminal. Finally, the optimized and image-processed three-dimensional image 40 is output.

另外,在本發明的直播方法更包括對至少一用戶終端進行取像以獲取用戶終端的三維影像,進而執行多人互動的一立體顯示模式。本發明的直播方法除了可以三維顯示外,更可以使用直播主廣播的一對多的顯示模式,除了一般的一對一模式,還可以多人同時同步互動的立體顯示模式,因此本發明的直播方法除了可以提供直觀性,更可以提高娛樂的樂趣,對於避免購物的錯誤以外,可以增進人類生活的愉悅感。In addition, the live broadcast method of the present invention further includes capturing images of at least one user terminal to obtain a three-dimensional image of the user terminal, and then performing a multi-person interactive stereoscopic display mode. In addition to the three-dimensional display, the live broadcast method of the present invention can also use the one-to-many display mode of the live main broadcast. In addition to the general one-to-one mode, a three-dimensional display mode in which multiple people interact simultaneously and simultaneously can be used. Therefore, the live broadcast of the present invention The method can not only provide intuition, but also can improve the fun of entertainment, in addition to avoiding shopping mistakes, it can improve the pleasure of human life.

透過本發明的三維影像的影像處理,除了可以給用戶終端一眼即可見到目標顯示區域,同時對於網路頻寬的資料裝載量可以有效的減少,本發明的直播方法對於即時的直播系統更是可以提高用戶終端的視訊體驗感,才不致產生視頻卡頓現象。在本發明中,三維影像可通過一三維直播演算模組52來達到影像處理的目的。本發明三維影像的影像處理,對網路頻寬的資料裝載量可以有效減少的原因在於,局部的三維影像資料相較於傳統的全圖三維資料量小很多,加上資料壓縮的技術,僅會比一般全圖二維影像的資料多些位元組(bytes)~幾千位元組的資料。Through the image processing of the three-dimensional image of the present invention, the user terminal can see the target display area at a glance, and at the same time, the data load of the network bandwidth can be effectively reduced, and the live broadcast method of the present invention is more suitable for the real-time live broadcast system. The video experience of the user terminal can be improved, so that the phenomenon of video freezing will not occur. In the present invention, the 3D image can be processed by a 3D live broadcast calculation module 52 . The reason why the image processing of the 3D image of the present invention can effectively reduce the data load of the network bandwidth is that the local 3D image data is much smaller than the traditional full-image 3D data, and with the data compression technology, only It will have more bytes to several thousand bytes than the data of general full-image 2D images.

在步驟S303中,根據一用戶終端的至少一視頻選擇條件,通過一人工智慧學習模組54,產生一視頻調配組合。用戶終端的至少一視頻選擇條件為用戶終端對一直播系統的使用慣性,每個用戶終端的使用習慣都不相同,本發明的直播方法通過一人工智慧學習模組54,根據每個用戶終端在使用本發明的直播軟體時的使用慣性,顯示用戶終端最喜歡的撥放模式。隨著用戶終端的習慣,亦被人工智慧學習模組54的學習模式所記錄,下次用戶終端在開啟相同或是類似畫面,可以自動的顯示用戶終端習慣的區域或是以全圖的立體畫面顯示。In step S303, according to at least one video selection condition of a user terminal, an artificial intelligence learning module 54 is used to generate a video allocation combination. The at least one video selection condition of the user terminal is the use inertia of the user terminal to a live broadcast system, and the use habits of each user terminal are different. The live broadcast method of the present invention uses an artificial intelligence learning module 54. The usage inertia when using the live broadcast software of the present invention displays the most preferred playback mode of the user terminal. Along with the habit of the user terminal, it is also recorded by the learning mode of the artificial intelligence learning module 54. The next time the user terminal opens the same or similar screen, it can automatically display the area that the user terminal is used to or a stereoscopic image with a full image. show.

進一步來說,每個用戶終端都會有個人的使用習慣,因此,當用戶終端觀看直播時,本發明的人工智慧學習模組54會記錄每個用戶終端在觀看直播時的使用習慣。舉例來說,如圖4B所示,當用戶終端在觀看直播時,人工智慧學習模組54會記錄用戶終端的使用模式,例如,該用戶終端喜歡發送哪種禮物,用戶終端會針對那個特定的人、物或人與物進行二維影像與三維影像之間的切換。本發明的直播方法會記錄每個用戶終端在觀看直播時的使用習慣,並在人工智慧學習模組54中根據所記錄的每個用戶終端的觀看直播的使用習慣,當該用戶終端在下一次觀看直播時,人工智慧學習模組54會根據所記錄的使用習慣,顯示特定的直播畫面給該用戶終端,讓該用戶終端有較佳的直播觀看的使用經驗。Further, each user terminal will have personal usage habits. Therefore, when the user terminal watches the live broadcast, the artificial intelligence learning module 54 of the present invention will record the usage habits of each user terminal when watching the live broadcast. For example, as shown in FIG. 4B , when the user terminal is watching the live broadcast, the artificial intelligence learning module 54 will record the usage pattern of the user terminal, for example, what kind of gift the user terminal likes to send, the user terminal will target that specific gift. Switch between two-dimensional and three-dimensional images of people, objects, or people and objects. The live broadcast method of the present invention records the usage habits of each user terminal when watching the live broadcast, and in the artificial intelligence learning module 54, according to the recorded usage habits of each user terminal for watching the live broadcast, when the user terminal watches the next time During live broadcast, the artificial intelligence learning module 54 will display a specific live broadcast screen to the user terminal according to the recorded usage habits, so that the user terminal can have a better experience in using live broadcast viewing.

另外,在步驟S304中,根據視頻調配組合與直播系統的使用環境,對三維影像進行優化。對三維影像進行優化可以是根據直播系統的使用環境,例如觀眾數量與網路速度等,對三維影像進行優化。另外,在本發明的直播方法中,在拍攝的人物或是產品時,除了即時拍攝,也可以透過預先錄製的後台背景即時加入,增加虛擬現實的使用場景,方便用戶終端藉由使用場景變更,來提高環境使用愉悅體感。在本發明的直播方法中,除了針對每個用戶終端提供不同的視頻調配組合,產生優化後的三維影像外,還可以根據不同的用戶終端選擇不同的背景圖案。而且,本發明的直播方法為避免錯誤或是不恰當的影像或是視頻流放到終端,因此不論用戶終端是與直播主進行聊天或交流,甚至是欣賞直播主的節目表演,或是直播主銷售產品,在影像以相機即時攝入人體或物體以後,除了即時的影像三維編輯壓縮外,都會在直播雲端的後台上,經過人工智慧的自動檢查,或是人工檢查,檢查完成後,才會根據用戶終端的設定或是需求,與後台系統的正確對應後,發送出相對應的碼流(Data Streaming)到用戶終端的手機或是終端交互裝置上,用戶便可以看到該終端所設置及需求相對應的顯示結果。In addition, in step S304, the three-dimensional image is optimized according to the video deployment combination and the use environment of the live broadcast system. The optimization of the three-dimensional image may be based on the use environment of the live broadcast system, such as the number of viewers and the speed of the network, to optimize the three-dimensional image. In addition, in the live broadcast method of the present invention, in addition to real-time shooting of people or products, the pre-recorded background background can also be added in real time to increase the use scene of virtual reality, which is convenient for the user terminal to change the use scene. To improve the environment to use pleasant somatosensory. In the live broadcast method of the present invention, in addition to providing different video allocation combinations for each user terminal to generate an optimized three-dimensional image, different background patterns can also be selected according to different user terminals. Moreover, the live broadcast method of the present invention is to avoid wrong or inappropriate video or video streaming to the terminal, so no matter whether the user terminal is chatting or communicating with the live broadcaster, or even enjoying the live broadcaster's program performance, or the live broadcaster selling Products, after the image is captured by the camera into the human body or object, in addition to the real-time 3D editing and compression of the image, it will be automatically checked by artificial intelligence in the background of the live broadcast cloud, or manually checked. After the settings or requirements of the user terminal are correctly corresponding to the background system, the corresponding data stream (Data Streaming) is sent to the mobile phone or terminal interaction device of the user terminal, and the user can see the settings and requirements of the terminal. The corresponding display results.

在步驟S305中,優化後的三維影像,並配合用戶終端的至少一視頻選擇條件,將對應的三維影像顯示在用戶終端中。本發明的直播方法除了根據用戶終端的設置,在後台端提供用戶終端所需要的前述前景、後景的切割及顯示設置以外,亦會根據用戶終端的設置及需求,將相對應的三維影像發送到用戶終端的裝置上。舉例來說,用戶終端進行了禮物的購買支付,直播系統便會根據用戶終端購買的對應禮物,將該禮物的立體顯示影像或視頻,發送到直播主的裝置上,這樣直播主便可以在其裝置的顯示螢幕上,觀看到對應的立體禮物的視頻或影像。In step S305, the optimized three-dimensional image is displayed in the user terminal in accordance with at least one video selection condition of the user terminal. The live broadcast method of the present invention not only provides the aforementioned foreground and background cutting and display settings required by the user terminal in the background according to the settings of the user terminal, but also sends the corresponding three-dimensional images according to the settings and requirements of the user terminal. to the device of the user terminal. For example, if the user terminal pays for the purchase of a gift, the live broadcast system will send the stereoscopic display image or video of the gift to the device of the live broadcaster according to the corresponding gift purchased by the user terminal, so that the live broadcaster can display the gift on his device. On the display screen of the device, watch the video or image of the corresponding three-dimensional gift.

通過本發明即時三維影像顯示功能的直播方法,除了可以根據不同的資料傳輸速度可以調整影像的輸出品質,還可以根據不同的用戶終端提供不同的體驗感受,讓每個用戶終端都獲得較佳的觀看經驗或購物經驗。Through the live broadcast method of the instant three-dimensional image display function of the present invention, in addition to adjusting the output quality of the image according to different data transmission speeds, it can also provide different experience according to different user terminals, so that each user terminal can obtain better quality A viewing experience or a shopping experience.

[本發明直播系統實施例][Embodiment of the live broadcast system of the present invention]

圖5顯示本發明即時三維影像顯示的直播系統的系統方塊圖。如圖5所示,本發明之即時三維影像顯示的直播系統50包括一影像擷取模組51、一三維直播演算模組52、一直播系統伺服器53、一人工智慧學習模組54、一三維解碼器55與一三維顯示器56。FIG. 5 shows a system block diagram of the live broadcast system for real-time 3D image display according to the present invention. As shown in FIG. 5, the live broadcast system 50 for real-time 3D image display of the present invention includes an image capture module 51, a 3D live broadcast calculation module 52, a live broadcast system server 53, an artificial intelligence learning module 54, a 3D decoder 55 and a 3D display 56 .

影像擷取模組51可以是三維影像機或二維影像機等,任何可以攝影的電子裝置都可以本發明的影像擷取模組51。在本發明的較佳實施例中,影像擷取模組51為三維攝影機,要解決三維影像顯示訊息的不足,在本發明的較佳實施例中利用了三維影像的拍攝及編制,在此所述的三維影像拍攝,是採用即時的內嵌式三維雙鏡頭攝影機擷取三維影像,這種內嵌於直播主所使用裝置內或是用戶終端的裝置(例如智能手機、平板電腦等)內的攝影機,由於是內嵌式,所以相機是內置在電子裝置上,兩個三維攝影機透過介面,如移動工業處理器介面(Mobile Industry Process Interface,MIPI)、串列介面等,與手持裝置端直接連接,再透過本發明的三維直播演算模組52及直播系統伺服器53連接,即時發送三維影像及視頻至用戶終端。對於個人電腦端或手持裝置,假如沒有裝設內嵌三維攝影機,可以採用外接式的單眼攝影機,或是雙眼立體相機,透過通用序列匯流排(USB)介面,或是無線(WIFI)方式,與主機連接,當連接到本發明的直播系統後,便根據二維轉三維或是立體相機的處理方式或流程進行資料處理。The image capture module 51 can be a 3D video camera or a 2D video camera, and any electronic device capable of taking pictures can use the image capture module 51 of the present invention. In the preferred embodiment of the present invention, the image capturing module 51 is a three-dimensional camera. To solve the shortage of the information displayed by the three-dimensional image, the shooting and compilation of the three-dimensional image are used in the preferred embodiment of the present invention. The above-mentioned 3D image shooting uses a real-time embedded 3D dual-lens camera to capture 3D images. The camera, because it is an embedded type, is built into the electronic device, and the two 3D cameras are directly connected to the handheld device through interfaces, such as Mobile Industry Process Interface (MIPI), serial interface, etc. , and then through the connection between the 3D live broadcast calculation module 52 and the live broadcast system server 53 of the present invention, the 3D images and videos are sent to the user terminal in real time. For personal computers or handheld devices, if there is no built-in 3D camera, an external monocular camera or binocular stereo camera can be used, through the universal serial bus (USB) interface, or wireless (WIFI) method, When connected to the host, after being connected to the live broadcast system of the present invention, data processing is performed according to the processing method or flow of the 2D to 3D or stereo camera.

或者,在不同實施例中,可以應用二維攝影機,先取得直播主的二維影像,再透過二維轉三維的影像轉換技術,將二維影像轉換為三維影像,如何擷取三維影像或如何將二維影像轉換為三維影像的技術為本領域具有通常知識者所熟知,在此不再贅述。影像擷取模組51可以安裝在直播主以及用戶終端的裝置上,可以同時取得直播主以及用戶終端的三維影像,以便於進行後續的直播主與至少一用戶終端的一對一模式或一對多的立體顯示模式。Alternatively, in different embodiments, a 2D camera may be used to first obtain the 2D image of the live broadcast host, and then convert the 2D image into a 3D image through a 2D to 3D image conversion technology. How to capture the 3D image or how to The technology of converting a 2D image into a 3D image is well known to those skilled in the art, and will not be repeated here. The image capture module 51 can be installed on the devices of the live broadcaster and the user terminal, and can simultaneously obtain the three-dimensional images of the live broadcaster and the user terminal, so as to facilitate the subsequent one-to-one mode or paired mode between the live broadcaster and at least one user terminal. Multiple stereoscopic display modes.

三維直播演算模組52連接影像擷取模組51,其用於接收三維影像,並對三維影像進行優化,三維直播演算模組52可以是設置直播主或用戶終端的直播軟體中,或者三維直播演算模組52也可以安裝在直播系統伺服器53中,在此並不侷限。進一步來說,如圖6所示,並參考圖5,三維直播演算模組52包括一影像校正單元521、一影像分離單元522、一影像合成單元523、一影像修正單元524、一影像調配單元525、一影像管理單元526以及一視訊編碼單元527。影像校正單元521連接影像擷取模組51,接收三維影像,用於將所接收的三維影像做影像的校正,在擷取三維影像或影像在進行二維轉三維的過程中,都會產生些許的影像誤差或影音不同步的問題,通過影像校正單元521修正三維影像在擷取時或轉換時所產生的錯誤。影像分離單元522連接影像校正單元521,針對校正後的三維影像中的前景、後景、人或物進行分割,因此可以對於三維影像中的任何人或物進行特定的影像處理,舉例來說,將三維影像中的後景切割,因此可以在後續的影像處理中,在背景中加入特定的背景圖案,或者,在特定的直播動作中,將特定的物或影像進行三維顯示。The 3D live broadcast calculation module 52 is connected to the image capture module 51, which is used to receive and optimize the 3D image. The calculation module 52 can also be installed in the live system server 53, which is not limited herein. Further, as shown in FIG. 6 , and referring to FIG. 5 , the 3D live broadcast calculation module 52 includes an image correction unit 521 , an image separation unit 522 , an image synthesis unit 523 , an image correction unit 524 , and an image allocation unit 525 , an image management unit 526 and a video encoding unit 527 . The image correction unit 521 is connected to the image capture module 51 to receive a 3D image, and is used to perform image correction on the received 3D image. During the process of capturing the 3D image or converting the image from 2D to 3D, some errors will be generated. For the problem of image errors or out-of-sync video and audio, the image correction unit 521 corrects the errors generated when the 3D images are captured or converted. The image separation unit 522 is connected to the image correction unit 521 to segment the foreground, background, people or objects in the corrected 3D image, so that specific image processing can be performed on any person or object in the 3D image, for example, The background in the three-dimensional image is cut, so that a specific background pattern can be added to the background in the subsequent image processing, or, in a specific live action, a specific object or image can be displayed three-dimensionally.

影像合成單元523連接影像分離單元522,分離後的三維影像,可以分別對於特定的圖案、人或物進行影像處理,處理後的特定圖案、人或物可以通過影像合成單元523進行結合,影像合成單元523並非只是將原本的三維影像還原,而是可以將個別立體化的圖案、人或物與其他的二維影像結合,更可以通過影像合成單元523在背景中加入特定的背景圖案。影像修正單元524連接影像調配單元525,影像修正單元524會根據不同圖案的取像選擇調整影像的視角,或者,影像修正單元524可以將三維影像中的禮物影像進行影像特效渲染效果,影像修正單元524也包括一般的影像邊緣強化、影像轉向、影像背景虛化、濾鏡效果或影像區域放大等功能。The image synthesizing unit 523 is connected to the image separating unit 522, and the separated three-dimensional images can be image-processed for specific patterns, people, or objects, respectively, and the processed specific patterns, people, or objects can be combined by the image synthesizing unit 523. The unit 523 does not just restore the original three-dimensional image, but can combine individual three-dimensional patterns, people or objects with other two-dimensional images, and can also add a specific background pattern to the background through the image synthesis unit 523 . The image modification unit 524 is connected to the image allocation unit 525, and the image modification unit 524 selects and adjusts the viewing angle of the image according to the image capture of different patterns. The 524 also includes general image edge enhancement, image turning, image bokeh, filter effects, or image area magnification.

影像調配單元525連接影像修正單元524,影像調配單元525會根據不同的網速或移動裝置的性能,調整三維影像的影像輸出。舉例來說,當網速較快,三維影像可以完整的輸出,當網速較慢時,可以局部顯示三維影像,其餘的影像以二維顯示。影像管理單元526連接影像調配單元525,將經過影像處理後的三維影像整合並輸出,視訊編碼單元527連接影像管理單元526,通過視訊編碼單元527將三維影像轉換為資料訊號,進而可將具有三維影像的資料訊號以有線或無線的方式傳送至直播系統伺服器53。The image adjustment unit 525 is connected to the image correction unit 524, and the image adjustment unit 525 adjusts the image output of the 3D image according to different network speeds or performance of the mobile device. For example, when the network speed is fast, the 3D image can be completely output, and when the network speed is slow, the 3D image can be partially displayed, and the rest of the image can be displayed in 2D. The image management unit 526 is connected to the image allocation unit 525 to integrate and output the three-dimensional images after image processing. The video encoding unit 527 is connected to the image management unit 526, and the three-dimensional images are converted into data signals by the video encoding unit 527. The data signal of the image is transmitted to the live broadcast system server 53 in a wired or wireless manner.

直播系統伺服器53無線連接三維直播演算模組52,其也可稱之為直播系統雲端,直播系統伺服器53用於接收具有三維影像的資料訊號,換句話說,每個直播主所直播的內容都會傳送至直播系統伺服器53,然後再透過直播系統伺服器53推播至用戶終端的裝置上。人工智慧學習模組54連接直播系統伺服器53,在本發明的較佳實施例中,人工智慧學習模組54可以設置在直播系統伺服器53,或者在不同實施例中,人工智慧學習模組54可以設置在不同的伺服器或電腦主機上,然後再以無線或有線連接的方式連接直播系統伺服器53,在此並不侷限。每個用戶終端的視屏選擇條件或稱使用習慣都會被人工智慧學習模組54的學習模式所記錄,並輸出一視頻調配組合。當下次用戶終端開啟相同或是類似畫面,人工智慧學習模組54可根據視頻選擇條件自動輸出視頻調配組合,換句話說,顯示用戶終端所習慣的顯示區域或是以全圖的立體畫面顯示。The live broadcast system server 53 is wirelessly connected to the 3D live broadcast calculation module 52, which can also be called the live broadcast system cloud. The live broadcast system server 53 is used to receive data signals with three-dimensional images. The content will be sent to the live broadcast system server 53 , and then pushed to the device of the user terminal through the live broadcast system server 53 . The artificial intelligence learning module 54 is connected to the live broadcast system server 53. In a preferred embodiment of the present invention, the artificial intelligence learning module 54 can be set on the live broadcast system server 53, or in different embodiments, the artificial intelligence learning module 54 can be set on different servers or computer hosts, and then connected to the live system server 53 in a wireless or wired manner, which is not limited here. The video screen selection conditions or usage habits of each user terminal will be recorded by the learning mode of the artificial intelligence learning module 54, and a video allocation combination will be output. When the user terminal opens the same or similar screen next time, the artificial intelligence learning module 54 can automatically output the video allocation combination according to the video selection conditions.

因此,不論是用戶終端與直播主進行聊天、交流,或是欣賞直播主的節目表演,或是觀看直播主銷售產品,在影像以相機即時攝入人體或物體以後,除了即時的影像三維編輯、壓縮以外,都會在直播系統伺服器53的後台上,經過人工智慧學習模組54的自動檢查,檢查完成後,才會根據用戶終端的設定或是需求,與後台系統的正確對應後,發送出相對應的碼流(Data Streaming)到用戶終端的手機或是終端交互裝置上,用戶終端便可以看到該終端所設置及需求相對應的顯示結果。Therefore, whether the user terminal chats or communicates with the live broadcaster, enjoys the live broadcaster's program performance, or watches the live broadcaster's sales products, after the image is captured by the camera in real time, the human body or object, in addition to the real-time image 3D editing, Except for compression, all of them will be automatically checked by the artificial intelligence learning module 54 on the background of the live broadcast system server 53. After the check is completed, it will be sent out according to the settings or requirements of the user terminal and correctly corresponding to the background system. The corresponding code stream (Data Streaming) is sent to the mobile phone of the user terminal or the terminal interaction device, and the user terminal can see the display results corresponding to the settings and requirements of the terminal.

三維解碼器55連接直播系統伺服器53,或者三維解碼器55也可以安裝在直播系統伺服器53上,且連接人工智慧學習模組54,透過三維解碼器55可將三維影像進行編碼以及解碼,在本發明的直播系統50中,可以應用三維解碼器55讓直播主可以跟多個用戶終端進行互動的立體顯示模式。直播系統50便會根據用戶終端購買的對應禮物,如圖7A所示,在直播主71的行動裝置72上,將禮物73的立體顯示影像或視頻顯示出來,這樣直播主71便可以在其行動裝置72的顯示螢幕74上,觀看到對應的立體禮物73的視頻或影像。三維顯示器56設置在直播主以及用戶終端的裝置上,直播主以及用戶終端的裝置以無線傳輸的方式接收從直播系統伺服器53所推播的三維影像,並透過三維顯示器56顯示在直播主以及用戶終端的裝置上。透過三維解碼器55讓本發明的直播系統50具有三維立體雙向編解碼技術,如圖7B所示,在行動裝置72上,除了直播主71可以發送立體視頻、圖像,給用戶終端75接收觀看立體視頻、圖像以外,用戶終端75也可以透過雙向立體顯像技術,對直播主71發送立體圖像或是視頻,使直播主71也可以即時收到用戶終端75的立體圖像訊息,可以快速的判斷出用戶終端75的需求,達成即時互動的目的。The three-dimensional decoder 55 is connected to the live broadcast system server 53, or the three-dimensional decoder 55 can also be installed on the live broadcast system server 53, and is connected to the artificial intelligence learning module 54, and the three-dimensional image can be encoded and decoded through the three-dimensional decoder 55, In the live broadcast system 50 of the present invention, the three-dimensional decoder 55 can be applied in a stereoscopic display mode in which the live broadcaster can interact with multiple user terminals. The live broadcast system 50 will display the stereoscopic display image or video of the gift 73 on the mobile device 72 of the live broadcast host 71 according to the corresponding gift purchased by the user terminal, as shown in FIG. 7A, so that the live broadcast host 71 can move On the display screen 74 of the device 72 , the video or image of the corresponding three-dimensional gift 73 is viewed. The three-dimensional display 56 is arranged on the devices of the live broadcast host and the user terminal, and the devices of the live broadcast host and the user terminal receive the three-dimensional image pushed from the live broadcast system server 53 by means of wireless transmission, and display it on the live broadcast host and the live broadcast host through the three-dimensional display 56. on the device of the user terminal. Through the 3D decoder 55, the live broadcast system 50 of the present invention has a 3D stereo bidirectional encoding and decoding technology. As shown in FIG. 7B, on the mobile device 72, in addition to the live broadcast host 71, the stereo video and images can be sent to the user terminal 75 for receiving and viewing. In addition to stereoscopic video and images, the user terminal 75 can also send stereoscopic images or videos to the live broadcast host 71 through the two-way stereoscopic display technology, so that the live broadcast host 71 can also receive the stereoscopic image information of the user terminal 75 in real time. Quickly determine the needs of the user terminal 75 to achieve the purpose of real-time interaction.

另外,請參閱圖6,在用戶終端上,同樣包括在直播主端的三維影像的顯示功能,在用戶終端的裝置上,同樣包括本發明之直播系統50的功能,因為在觀看直播時,用戶終端也可以直播自己的影像給直播主看,或者,用戶終端也可以在接收到直播主端的視頻時,自動或手動切換二維與三維之間的轉換。因此,當用戶終端接收到二維或三維影像時,若接收到二維影像可轉換為三維影像,用戶終端的三維影像同樣會通過影像校正單元521、影像分離單元522、影像合成單元523、影像修正單元524、影像調配單元525以及影像管理單元526等元件,將三維影像進行影像處理,讓在用戶終端的三維影像同樣可以做全景三維影像的顯示,或者可以針對特定的人、物或人與物做三維影像的顯示,二維或三維影像的顯示在客戶終端上可以主動或手動的方式進行切換。如何通過三維直播演算模組52進行影像處理已於前面章節介紹過,因此,相關的影像處理細節在此不再贅述。In addition, please refer to FIG. 6 , the user terminal also includes the display function of the 3D image at the main end of the live broadcast, and the device of the user terminal also includes the function of the live broadcast system 50 of the present invention, because when watching the live broadcast, the user terminal It is also possible to live broadcast its own image to the live broadcast host, or the user terminal can automatically or manually switch the conversion between two-dimensional and three-dimensional when receiving the video from the live broadcast host. Therefore, when the user terminal receives a two-dimensional or three-dimensional image, if the received two-dimensional image can be converted into a three-dimensional image, the three-dimensional image of the user terminal will also pass through the image correction unit 521, the image separation unit 522, the image synthesis unit 523, the image The correction unit 524, the image allocation unit 525, and the image management unit 526 and other components perform image processing on the three-dimensional image, so that the three-dimensional image on the user terminal can also be displayed as a panoramic three-dimensional image, or can be used for specific people, objects or people and people. The display of 2D or 3D images can be switched actively or manually on the client terminal. How to perform image processing by the 3D live broadcast calculation module 52 has been introduced in the previous chapters, therefore, the details of the relevant image processing will not be repeated here.

舉例來說,當用戶終端的資料傳輸速度不快,三維直播演算模組52會自動只顯示部分的三維影像在用戶終端的三維顯示器56上,而不會顯示全景的三維影像,或者,當網速過低時,三維直播演算模組52甚至會自動將三維影像切換至二維影像。另外,用戶終端的三維影像除了可以自動切換外,三維影像也可以手動切換,用戶終端可以手動指定特定的人、物或人與物做三維影像的顯示。換句話說,本發明的三維影像並不局限於全圖的三維影像,本發明的三維影像也可以是局部的三維影像,所謂局部的三維影像就是在影像中,特定的物件(人、物或人與物)是以三維顯示的方式呈現,其餘的圖像則是以二維顯示的方式呈現。通過這樣的影像呈現方式,可以降低在直播時資料的傳輸量,並可以降低直播顯示延遲的產生機率。For example, when the data transmission speed of the user terminal is not fast, the 3D live broadcast calculation module 52 will automatically display only part of the 3D image on the 3D display 56 of the user terminal, but will not display the panoramic 3D image. When it is too low, the 3D live broadcast calculation module 52 will even automatically switch the 3D image to the 2D image. In addition, the 3D image of the user terminal can be switched automatically, and the 3D image can also be manually switched, and the user terminal can manually designate a specific person, object, or person and object to display the 3D image. In other words, the 3D image of the present invention is not limited to the 3D image of the whole image, and the 3D image of the present invention can also be a partial 3D image. People and objects) are presented in a three-dimensional display, and the rest of the images are presented in a two-dimensional display. Through such an image presentation method, the amount of data transmission during live broadcast can be reduced, and the probability of generating delay in live broadcast display can be reduced.

[本發明三維直播影像的橫向或者縱向顯示實施例][The embodiment of the horizontal or vertical display of the three-dimensional live image of the present invention]

另外,在本發明的較佳實施例中,更可以在直播主以及用戶終端的裝置上設置一三維直播影像的橫向縱向控制裝置80。因應用戶終端使用的裝置可能有習慣性的使用縱向顯示觀賞或是橫向直播顯示觀賞,本發明也提供三維直播影像的橫向縱向控制裝置80,對於移動裝置的顯示方向設定,這個顯示方向的設定可以在用戶終端的APP軟體上自動偵測或是人工設定,也適用在直播主的顯示終端及雲端的監測顯示上。因此,本發明的三維直播影像的橫向縱向控制裝置80包括一方向偵測器81與一控制介面82。In addition, in a preferred embodiment of the present invention, a horizontal and vertical control device 80 for a three-dimensional live video image can be set on the devices of the live broadcast host and the user terminal. Since the device used by the user terminal may habitually use vertical display for viewing or horizontal live display viewing, the present invention also provides a horizontal and vertical control device 80 for 3D live video. For the display orientation setting of the mobile device, the display orientation setting can be It is automatically detected or manually set on the APP software of the user terminal, and is also applicable to the display terminal of the live broadcaster and the monitoring display of the cloud. Therefore, the horizontal and vertical control device 80 for 3D live video of the present invention includes a direction detector 81 and a control interface 82 .

在自動偵測直播顯示方向上,方向偵測器81具備可以感測行動裝置或手持裝置內部的陀螺儀或是方向感測器的訊號,在直播主或用戶終端的裝置便會根據終端裝置回饋的方向訊號,對於直播顯示的方向進行調變。這個方向的調變會配合三維直播影像或視頻經過轉向後,進行了三維直播影像的顯示轉換,這個顯示轉換包括螢幕顯示的長、寬比或交織的配比(Interlace)的調整,以及軟體按鍵與功能顯示位置的調整,諸如此類跟3D立體直播顯示相關的轉向設置。前述的設定也包括了對於三維顯示器56的直播顯示功能設定。控制介面82設置在直播主與用戶終端的行動裝置或手持裝置上,進一步來說,控制介面82可以是控制器如一按鍵等,或者控制介面82也可以是一軟體使用介面,其包括控制器的功能。直播主或用戶終端可以透過方向偵測器81自動翻轉螢幕直播顯示,或者也可以透過控制介面82自行翻轉螢幕直播顯示,達到縱向直播顯示觀賞或是橫向直播顯示觀賞。In the automatic detection of the display direction of the live broadcast, the direction detector 81 is equipped with a signal that can sense the gyroscope or the direction sensor inside the mobile device or the handheld device, and the device of the live broadcast host or the user terminal will give feedback according to the terminal device. The direction signal of , modulates the direction of the live broadcast display. The adjustment of this direction will be matched with the 3D live video or video after turning, and the display conversion of the 3D live video will be carried out. Adjustment of the function display position, and other steering settings related to the 3D stereoscopic live broadcast display. The aforementioned setting also includes the setting of the live display function for the three-dimensional display 56 . The control interface 82 is arranged on the mobile devices or handheld devices of the live broadcaster and the user terminal. Further, the control interface 82 can be a controller such as a button, or the control interface 82 can also be a software user interface, which includes the controller's Features. The live broadcaster or the user terminal can automatically flip the screen live broadcast display through the direction detector 81, or can also flip the screen live broadcast display through the control interface 82 to achieve vertical live display viewing or horizontal live viewing.

本發明的三維直播影像的橫向縱向控制裝置80具備有橫向與縱向兩方向皆可以顯示三維直播影像的功能,且具備自動或是人工轉向偵測及顯示調適。對於只有支持單方向的立體顯示終端,本發明在三維直播影像的橫向縱向控制裝置80更包括一直播影像調整器83,直播影像調整器83連接方向偵測器81,直播影像調整器83的判斷是否調整三維直播顯示的步驟可以如圖9所示,在步驟S901中,方向偵測器81偵測到行動裝置的擺設方向改變,或者,在步驟S902中,使用者通過控制介面82進行直播影像旋轉,在步驟S903中,直播影像調整器83判斷行動裝置是否橫向或縱向支持三維直播顯示,若否,在步驟S904中,直播影像調整器83可以提醒用戶終端,告知不支持轉向後的立體顯示,並透過直播影像調整器83改以二維的平面直播顯示方式在用戶終端的顯示器上,直到用戶終端再次轉向到可以顯示立體直播顯示,用戶終端便可以看到立體直播顯示。反之,在步驟S905中,直播影像調整器83調整轉向後的最佳化的三維直播影像。The horizontal and vertical control device 80 for 3D live video of the present invention has the function of displaying 3D live video in both the horizontal and vertical directions, and has automatic or manual steering detection and display adjustment. For a stereoscopic display terminal that supports only one direction, the horizontal and vertical control device 80 of the 3D live video of the present invention further includes a live video adjuster 83, the live video adjuster 83 is connected to the direction detector 81, and the live video adjuster 83 judges The step of whether to adjust the 3D live display can be shown in FIG. 9 . In step S901 , the direction detector 81 detects that the orientation of the mobile device is changed, or in step S902 , the user performs live video through the control interface 82 Rotate, in step S903, the live image adjuster 83 determines whether the mobile device supports 3D live display horizontally or vertically, if not, in step S904, the live image adjuster 83 can remind the user terminal that the stereoscopic display after turning is not supported , and through the live image adjuster 83, the two-dimensional flat live broadcast display is displayed on the display of the user terminal until the user terminal is turned to display the stereoscopic live display again, and the user terminal can see the stereoscopic live display. On the contrary, in step S905, the live video adjuster 83 adjusts the optimized three-dimensional live video after the turn.

[實施例的有益效果][Advantageous effects of the embodiment]

本發明的其中一有益效果在於,本發明所提供的直播系統與直播方法,為減少資料的即時流量,採用影像的二維與三維並存,混合編輯的方法,這樣觀看者可以把焦點放置需要關注的產品或人物身上,以觀賞到最佳及最需要的三維影像,並且這種二維與三維影像並存的方式,可以在使用者介面上進行選擇設置二維影像觀賞,或是三維影像觀賞,亦即表示使用者可以隨時變更需要觀看的三維影像位置,可以設置在區域性,或是全圖三維顯示。二維影像可以選擇三維左右式影像中的左影像或是右影像作為二維影像的顯示圖。另外,除了用戶終端裝置的立體顯示方式外,主播端或是雲端後台的控制端,也可以放置立體顯示裝置,以提供主播端的立體顯示預覽,及與用戶終端的互動立體顯示。另外,更可以讓用戶終端或直播主端通過三維直播影像的橫向縱向控制裝置可以容易選擇橫放或縱放的方式來進行三維直播顯示。One of the beneficial effects of the present invention is that, in the live broadcast system and the live broadcast method provided by the present invention, in order to reduce the real-time flow of data, the two-dimensional and three-dimensional images are coexisted, and the mixed editing method is adopted, so that the viewer can focus on the need to pay attention. In order to watch the best and most needed 3D images on the products or characters, and this coexistence of 2D and 3D images, you can choose to set 2D image viewing or 3D image viewing on the user interface, That is to say, the user can change the position of the 3D image to be viewed at any time, which can be set in a regional or a full-image 3D display. For the 2D image, the left image or the right image in the 3D left-right image can be selected as the display image of the 2D image. In addition, in addition to the stereoscopic display mode of the user terminal device, a stereoscopic display device can also be placed on the host or the control end of the cloud backend to provide a stereoscopic display preview of the host and interactive stereoscopic display with the user terminal. In addition, the user terminal or the live broadcast host terminal can easily select the horizontal or vertical mode to display the 3D live broadcast through the horizontal and vertical control device of the 3D live broadcast image.

以上所公開的內容僅為本發明的優選可行實施例,並非因此侷限本發明的申請專利範圍,所以凡是運用本發明說明書及圖式內容所做的等效技術變化,均包含於本發明的申請專利範圍內。The contents disclosed above are only preferred feasible embodiments of the present invention, and are not intended to limit the scope of the present invention. Therefore, any equivalent technical changes made by using the contents of the description and drawings of the present invention are included in the application of the present invention. within the scope of the patent.

10:直播系統 11:用戶終端 12:直播主 APP:軟體 20:影像 S301-S305:步驟 40:三維影像 50:直播系統 51:影像擷取模組 52:三維直播演算模組 521:影像校正單元 522:影像分離單元 523:影像合成單元 524:影像修正單元 525:影像調配單元 526:影像管理單元 527:視訊編碼單元 53:直播系統伺服器 54:人工智慧學習模組 55:三維解碼器 56:三維顯示器 71:直播主 72:行動裝置 73:禮物 74:顯示螢幕 75:用戶終端 80:三維直播影像的橫向縱向控制裝置 81:方向偵測器 82:控制介面 83:直播影像調整器 S901-S905:步驟 10: Live broadcast system 11: User terminal 12: Live Master APP: Software 20: Video S301-S305: Steps 40: 3D Image 50: Live system 51: Image capture module 52: 3D Live Calculation Module 521: Image Correction Unit 522: Image Separation Unit 523: Image synthesis unit 524: Image Correction Unit 525: Video Alignment Unit 526: Image Management Unit 527: Video coding unit 53: Live system server 54: Artificial Intelligence Learning Module 55: 3D decoder 56: 3D Display 71: Live Master 72: Mobile Devices 73: Gift 74: Display screen 75: User terminal 80: Horizontal and vertical control device for 3D live video 81: Orientation detector 82: Control interface 83: Live Image Adjuster S901-S905: Steps

圖1為現有直播系統的示意圖。FIG. 1 is a schematic diagram of an existing live broadcast system.

圖2為現有直播軟體執行的示意圖。FIG. 2 is a schematic diagram of the execution of the existing live broadcast software.

圖3為本發明的即時三維影像顯示功能的直播方法的步驟流程圖。FIG. 3 is a flow chart of the steps of the live broadcast method of the real-time 3D image display function of the present invention.

圖4A為本發明的三維影像的影像調整的示意圖。FIG. 4A is a schematic diagram of image adjustment of a 3D image according to the present invention.

圖4B為本發明的人工智慧學習模組學習用戶終端的使用習慣的示意圖。FIG. 4B is a schematic diagram of the artificial intelligence learning module of the present invention learning the usage habits of the user terminal.

圖5為本發明的即時三維影像顯示功能的直播系統的系統方塊圖。FIG. 5 is a system block diagram of the live broadcast system with the real-time 3D image display function of the present invention.

圖6為本發明的三維直播演算模組的影像處理的示意圖。FIG. 6 is a schematic diagram of image processing of the 3D live broadcast calculation module of the present invention.

圖7A為應用本發明的直播系統顯示三維影像的示意圖。FIG. 7A is a schematic diagram of displaying a 3D image by the live broadcasting system applying the present invention.

圖7B為應用本發明的直播系統使直播主與用戶終端互動的示意圖。FIG. 7B is a schematic diagram of the interaction between the live broadcaster and the user terminal by applying the live broadcast system of the present invention.

圖8為本發明三維直播影像控制裝置的方塊圖。FIG. 8 is a block diagram of a 3D live video control device according to the present invention.

圖9為本發明三維直播影像控制裝置的影像二維與三維直播轉換的判斷步驟流程圖。FIG. 9 is a flow chart of the judging steps of the 2D and 3D live video conversion of the 3D live video control device of the present invention.

S301-S305:步驟 S301-S305: Steps

Claims (10)

一種即時三維影像顯示的直播方法,其包括: 對一實體進行取像,並獲取所述實體的一三維影像; 通過一三維直播演算模組,進行所述三維影像的影像處理; 根據一用戶終端的至少一視頻選擇條件,通過一人工智慧學習模組,產生一視頻調配組合; 根據所述視頻調配組合與所述用戶終端的一使用環境,對所述三維影像進行優化;以及 優化後的所述三維影像,配合所述用戶終端的至少一所述視頻選擇條件,將對應的所述三維影像顯示在所述用戶終端中; 其中,所述三維影像可以是全圖三維影像或局部三維影像。 A live broadcast method for real-time three-dimensional image display, comprising: Image an entity, and acquire a three-dimensional image of the entity; Perform image processing of the three-dimensional image through a three-dimensional live broadcast calculation module; According to at least one video selection condition of a user terminal, a video allocation combination is generated through an artificial intelligence learning module; Optimizing the 3D image according to the video deployment combination and a usage environment of the user terminal; and The optimized three-dimensional image is displayed in the user terminal in accordance with at least one of the video selection conditions of the user terminal; Wherein, the 3D image may be a full-image 3D image or a partial 3D image. 如請求項1所述的即時三維影像顯示的直播方法,其中,在對一實體進行取像的步驟中,先透過至少一鏡頭取得二維影像,再將所述二維影像透過一三維影像模擬模組轉換為所述三維影像。The live broadcast method for real-time 3D image display according to claim 1, wherein in the step of capturing an image of an entity, a 2D image is obtained through at least one lens, and then the 2D image is simulated through a 3D image. The module is converted into the three-dimensional image. 如請求項1所述的即時三維影像顯示的直播方法,其中,在對一實體進行取像的步驟中,是直接透過多個鏡頭取得所述三維影像。The live broadcast method for real-time 3D image display according to claim 1, wherein in the step of capturing an image of an entity, the 3D image is directly obtained through a plurality of lenses. 如請求項1所述的即時三維影像顯示的直播方法,其中,在進行所述三維影像的影像處理的步驟中,是將所述三維影像進行影像容錯及校正,並偵測與分離所述三維影像中的多個物體。The live broadcast method for real-time 3D image display according to claim 1, wherein in the step of performing image processing on the 3D image, image error tolerance and correction are performed on the 3D image, and the 3D image is detected and separated. Multiple objects in the image. 如請求項4所述的即時三維影像顯示的直播方法,其中,若所述物體為人,對所述物體進行影像柔和濾光、背景虛化、濾鏡效果或美顏美肌。The live broadcast method for real-time three-dimensional image display according to claim 4, wherein, if the object is a human, image soft filtering, background blur, filter effect or beautifying the skin is performed on the object. 如請求項4所述的即時三維影像顯示的直播方法,其中,若所述物體為物,對所述物體進行背景剝離、邊緣強化、增強對比或影像放大。The live broadcast method for real-time three-dimensional image display according to claim 4, wherein, if the object is an object, background peeling, edge enhancement, contrast enhancement or image enlargement is performed on the object. 如請求項1所述的即時三維影像顯示的直播方法,其中,所述用戶終端的至少一所述視頻選擇條件為所述用戶終端對一直播系統的使用慣性。The live broadcast method for real-time 3D image display according to claim 1, wherein at least one of the video selection conditions of the user terminal is the use inertia of the user terminal to a live broadcast system. 如請求項1所述的即時三維影像顯示的直播方法,其中,在對所述三維影像進行優化的步驟中,是根據所述直播系統的觀眾數量與網路速度,對所述三維影像進行優化。The live broadcast method for real-time three-dimensional image display according to claim 1, wherein, in the step of optimizing the three-dimensional image, the three-dimensional image is optimized according to the number of viewers and network speed of the live broadcast system . 如請求項1所述的即時三維影像顯示的直播方法,更包括:對至少一所述用戶終端進行取像以獲取所述用戶終端的所述三維影像,進而執行多人互動的一立體顯示模式。The live broadcast method for real-time 3D image display according to claim 1, further comprising: capturing at least one of the user terminals to obtain the 3D image of the user terminal, and then executing a multi-person interactive stereoscopic display mode . 如請求項1所述的即時三維影像顯示的直播方法,其中,在對所述三維影像進行優化的步驟中,是對所述三維影像進行切割調整,並根據所述視頻調配組合,通過所述人工智慧學習模組,對所述三維影像進行優化。The live broadcast method for real-time 3D image display according to claim 1, wherein, in the step of optimizing the 3D image, cutting and adjusting the 3D image is performed, and according to the video allocation and combination, through the The artificial intelligence learning module optimizes the three-dimensional image.
TW109131913A 2020-09-16 Live broadcasting method for real time three-dimensional image display TWI836141B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW109131913A TWI836141B (en) 2020-09-16 Live broadcasting method for real time three-dimensional image display

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW109131913A TWI836141B (en) 2020-09-16 Live broadcasting method for real time three-dimensional image display

Publications (2)

Publication Number Publication Date
TW202213992A true TW202213992A (en) 2022-04-01
TWI836141B TWI836141B (en) 2024-03-21

Family

ID=

Similar Documents

Publication Publication Date Title
CN106789991B (en) Multi-person interactive network live broadcast method and system based on virtual scene
US10666921B2 (en) Generating content for a virtual reality system
US11076142B2 (en) Real-time aliasing rendering method for 3D VR video and virtual three-dimensional scene
CN106792246B (en) Method and system for interaction of fusion type virtual scene
JP7368886B2 (en) Information processing system, information processing method, and information processing program
US10750154B2 (en) Immersive stereoscopic video acquisition, encoding and virtual reality playback methods and apparatus
CN113347405B (en) Scaling related method and apparatus
CN106165415B (en) Stereoscopic viewing
KR102407283B1 (en) Methods and apparatus for delivering content and/or playing back content
CN106303289B (en) Method, device and system for fusion display of real object and virtual scene
CN106730815B (en) Somatosensory interaction method and system easy to realize
CN106101741A (en) Internet video live broadcasting platform is watched the method and system of panoramic video
CN108989784A (en) Image display method, device, equipment and the storage medium of virtual reality device
JP6934052B2 (en) Display control device, display control method and program
TWI774063B (en) Horizontal/vertical direction control device for three-dimensional broadcasting image
KR20190031220A (en) System and method for providing virtual reality content
TWI836141B (en) Live broadcasting method for real time three-dimensional image display
TW202213992A (en) Live broadcasting method for real time three-dimensional image display
TW202213990A (en) Live broadcasting system for real time three-dimensional image display
CN115174954A (en) Video live broadcast method and device, electronic equipment and storage medium
CN113891101A (en) Live broadcast method for real-time three-dimensional image display
CN114286077A (en) Virtual reality equipment and VR scene image display method
CN113891100A (en) Live broadcast system for real-time three-dimensional image display
CN113891099A (en) Transverse and longitudinal control device for three-dimensional live broadcast image
CN114915798A (en) Real-time video generation method, multi-camera live broadcast method and device