WO2022119119A1 - Procédé et système de transmission/réception de contenu multimédia multicanal - Google Patents

Procédé et système de transmission/réception de contenu multimédia multicanal Download PDF

Info

Publication number
WO2022119119A1
WO2022119119A1 PCT/KR2021/014985 KR2021014985W WO2022119119A1 WO 2022119119 A1 WO2022119119 A1 WO 2022119119A1 KR 2021014985 W KR2021014985 W KR 2021014985W WO 2022119119 A1 WO2022119119 A1 WO 2022119119A1
Authority
WO
WIPO (PCT)
Prior art keywords
channel
image
tag information
information
tag
Prior art date
Application number
PCT/KR2021/014985
Other languages
English (en)
Korean (ko)
Inventor
신재섭
류성걸
손세훈
박광우
Original Assignee
주식회사 픽스트리
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 픽스트리 filed Critical 주식회사 픽스트리
Publication of WO2022119119A1 publication Critical patent/WO2022119119A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/21805Source of audio or video content, e.g. local disk arrays enabling multiple viewpoints, e.g. using a plurality of cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8547Content authoring involving timestamps for synchronizing content

Definitions

  • One embodiment of the present invention relates to a method and system for transmitting and receiving multi-channel media.
  • an image obtained by photographing a product from various angles using a plurality of cameras is received as a multi-channel input and outputted in real time as a preset multi-view template on a single screen, while sub-channels in the output service channel stream
  • An object of the present invention is to provide a multi-channel media transmission/reception method and system for providing a multi-view online broadcast that is streamed so as to assign a timeline tag according to a timeline tag on a stream and a separate channel tag.
  • channel tag information is matched for each image included in each sub-channel stream including a plurality of images input using an authoring tool, and each sub-channel stream included in each sub-channel stream is matched.
  • an authoring device that matches timeline tag information on a timeline of a multi-view service channel stream that merges images of and outputs the multi-view screen image configuration information; Merge multiplexing for generating a service channel stream by merging corresponding sub-channel streams according to a transmission template based on the multi-view screen image configuration information, and inserting the channel tag information and the timeline tag information into the service channel stream and output devices; and extracting and decoding the channel tag information, the timeline tag information, and the sub-channel stream from the received service channel stream, and the decoded sub-channel image, the channel tag information, and the timeline tag information according to an expression template. It provides a multi-channel media transmission and reception system comprising a; receiving device for outputting.
  • channel tag information is matched for each image included in each sub-channel stream including a plurality of images input using an authoring tool in an authoring device, and the channel tag information is added to each sub-channel stream.
  • the merge multiplexing and output device generates a service channel stream in which the corresponding sub-channel streams are merged according to a transmission template based on the multi-view screen image configuration information, and adds the channel tag information and the timeline tag information to the service channel stream.
  • an image obtained by photographing a product from various angles using a plurality of cameras is input through multi-channels, configured as a preset multi-view template on a single screen, and outputted in real time.
  • multi-view online broadcasting to be streamed so that a channel tag for each sub-channel of the output service and a timeline tag are assigned along a timeline on the service channel stream.
  • FIG. 1 is a diagram schematically illustrating a multi-channel media transmission/reception system according to the present embodiment.
  • FIG. 2 is a diagram illustrating a configuration of a receiver screen of a multi-view service channel stream according to the present embodiment.
  • FIG. 3 is a diagram schematically illustrating a multi-channel media transceiver according to the present embodiment.
  • FIG. 4 is a view showing a transmission template and an expression template according to the present embodiment.
  • FIG 5 is an exemplary view of a GUI composed of a work screen monitor and an output screen monitor of the authoring apparatus according to the present embodiment.
  • FIG. 1 is a diagram schematically illustrating a multi-channel media transmission/reception system according to the present embodiment.
  • the multi-channel media transmission/reception system includes an authoring device 110 , a merge multiplexing and outputting device 120 , and a receiving device 130 .
  • Components included in the multi-channel media transmission/reception system are not necessarily limited thereto.
  • the authoring device 110 authorizes the content using an authoring application and transmits it to the merge multiplexing and output device 120 .
  • the authoring device 110 includes a plurality of camera inputs including a main camera and a secondary camera.
  • the authoring device 110 acquires a plurality of images obtained by photographing a product or event at various angles by using a plurality of cameras.
  • the authoring device 110 obtains a plurality of images from a plurality of stream inputs.
  • the authoring device 110 acquires a plurality of images from a plurality of file inputs stored in the storage device.
  • the authoring device 110 generates a plurality of channel tag information (host image tag information, product internal image tag information, product side image tag information, and advertisement image tag information) in advance using an authoring tool.
  • the authoring device 110 includes a plurality of channel tag information (host image tag information, product internal image tag information, product side image tag information, advertisement image tag information) generated in advance while shooting a product or event using an authoring tool.
  • a channel tag selected in relation to each video channel (sub-channel) is reflected to each sub-channel.
  • the authoring device 110 generates a plurality of timeline tag information (product color tag information, price tag information, weight tag information, design tag information, and function tag information) in advance using an authoring tool.
  • the authoring device 110 relates to the photographed content among a plurality of tag information (product color tag information, price tag information, weight tag information, design tag information, function tag information) generated in advance while photographing a product using an authoring tool to reflect any one selected tag (eg, price tag information) on the timeline.
  • the authoring device 110 when the authoring device 110 describes the price while photographing the product using the authoring tool, if price tag information is selected from among a plurality of tag information generated in advance, the corresponding time (eg, 10 :00) applies the price tag information on the timeline. Then, when the authoring device 110 describes the design while photographing the product using the authoring tool, when design tag information is selected from among a plurality of previously generated tag information, the corresponding time (eg, 20:00) for describing the design ) to apply the design tag information on the timeline.
  • the corresponding time eg, 10 :00
  • the authoring device 110 uses an authoring tool to acquire an image using a plurality of cameras and files, then designates a transmission template, merges each sub-channel stream into a transmission template form, and generates and transmits one service channel stream make it possible
  • the authoring device 110 may configure various service channel stream formats by using the transmission template designation function of the authoring tool.
  • the authoring device 110 uses an authoring tool to obtain an image using a plurality of cameras and files, and then designates an expression template so that each image is output on a preset multi-view screen configuration when played back in the receiver.
  • the authoring device 110 can configure various playback screens by using the expression template designation function of the authoring tool. For example, the main screen (front screen) is allocated to the widest area among the multi-view screen configurations and the remaining areas are Allocate sub screens (right screen, left screen, top screen).
  • the authoring device 110 may allocate pre-stored captured images and advertisement images to some of the multi-view screen configurations. For example, the authoring device 110 may output a clip image inside a product among clip images pre-photographed in any one area of a multi-view screen configuration divided into six using an authoring tool.
  • the authoring device 110 analyzes the text input from the receiving device 130 in the chat window area.
  • the authoring device 110 recognizes the context by analyzing the text input to the chat window area using AI.
  • the authoring device 110 extracts requests (eg, advertisements, product internal images) from the recognized context using AI.
  • the authoring device 110 outputs an image corresponding to a request (eg, advertisement, product internal image) among the pre-captured images to any one area of the multi-view screen configuration.
  • the authoring device 110 includes an authoring tool.
  • the authoring tool includes tags, chatting, chatbot-type AI, pre-recorded clip images, and upload UI.
  • the authoring device 110 includes a plurality of images or streams in which a specific object or event is photographed in real time using an authoring tool, and each sub-channel of a service channel stream composed of a pre-captured file clip image. Match the channel tag and output it as a multi-view screen configuration.
  • the authoring device 110 generates a plurality of channel tag information (host image tag information, product internal image tag information, product side image tag information, and advertisement image tag information) in advance using an authoring tool.
  • the authoring device 110 includes a plurality of channel tag information (host image) generated in advance in each sub-channel of the multi-view composed of a plurality of images or streams photographed using an authoring tool and a pre-captured file clip (Clip) image.
  • the tag information, product internal video tag information, product side video tag information, advertisement video tag information the selected channel tag related to the corresponding sub-channel is matched to each sub-channel and output in a multi-view screen configuration.
  • the authoring device 110 includes a timeline tag on a timeline of a service channel stream composed of a plurality of images or streams in which a specific object or event is captured in real time using an authoring tool and a pre-captured file clip image. ) information is matched and output in a multi-view screen configuration.
  • the authoring device 110 generates a plurality of timeline tag information including at least one of product color tag information, price tag information, weight tag information, design tag information, and function tag information in advance by using an authoring tool.
  • the authoring device 110 reflects any one tag selected in relation to shooting content among a plurality of timeline tag information on a corresponding timeline.
  • the authoring device 110 designates a transmission template using an authoring tool, and based on this, merges the sub-channel streams into a transmission template form to generate and transmit one service channel stream.
  • the authoring device 110 may configure various service channel stream formats by using the transmission template designation function of the authoring tool.
  • Authoring device 110 uses an authoring tool to designate an expression template, and based on this, for example, assigns any one of a plurality of images captured in real time to the widest area among the multi-view screen configurations as the main screen, and the rest The remaining images or pre-captured clip images among a plurality of images captured in the area are allocated to the auxiliary screen.
  • the authoring device 110 analyzes the text input from the receiving device 130 in the chat window area using AI to recognize the context, and extracts the request from the recognized context.
  • the authoring device 110 outputs an image corresponding to a request among pre-captured clip images as a specific screen, for example, an auxiliary screen, among multi-view screen configurations.
  • the authoring device 110 synchronizes a plurality of images captured in real time and outputs them on a multi-view screen configuration.
  • the authoring device 110 outputs an image having a high priority among a plurality of images captured in real time and a previously captured clip image on the main screen.
  • the authoring device 110 recognizes an object or a region of interest (ROI) in a plurality of images captured in real time, and then applies motion tracking to the position, size, and angle of the image so that the object or ROI comes to the center of the screen. to adjust
  • ROI region of interest
  • the authoring device 110 lists the icons corresponding to the tag information after generating a plurality of tag information (channel tags, timeline tags) in advance.
  • the authoring device 110 inserts selected tag information among a plurality of tag information into a timeline or a subchannel in a drag and drop method, and accesses the information corresponding to the tag information displayed on the subchannel or the timeline. to do it
  • the authoring device 110 converts the screen to a video on demand (VOD) image matched with the corresponding tag and outputs the image.
  • VOD video on demand
  • the authoring device 110 outputs Internet price information to a separate Internet information window, and outputs sales volume, accumulated revenue, and other business status information to the sales information window.
  • the authoring device 110 uses a price adjustment function on the Internet information window to allow the seller to compare the lowest price to adjust the price, and to output a time sale event.
  • the authoring device 110 allows a specific customer to negotiate a price according to a purchase quantity condition by using a price negotiation function on the Internet information window.
  • the multi-channel media transceiver apparatus of the present invention is preferably applied to an online shop, but is not necessarily limited thereto.
  • the multi-channel media transceiver sets each of the plurality of online shops as one service channel.
  • the merge multiplexing and output device 120 encodes or transcodes some or all of the input video channels during the configuration of the multi-view screen, and assigns each channel (sub-channel) stream to a transmission template designated by the authoring device. to output as one service channel stream, and the receiver encodes or transcodes the sub-channel images, respectively, so that the multi-view image is output with an expression template designated by the authoring device, and merges them into a service channel stream and outputs them.
  • Encoding or transcoding of some or all of the input video channels during the multi-view screen configuration may be performed in an authoring device, merge multiplexing and output device, or a separate encoder or transcoder. have.
  • the encoded or transcoded stream When encoding or transcoding of some or all of the input video channels during multi-view screen configuration is performed outside the merge multiplexing and output device, the encoded or transcoded stream must be input to the merge multiplexing and output device.
  • the merge multiplexing and output device 120 multiplexes and streams the video stream received from the authoring device 110 or a separate encoder/transcoder, or encoded/transcoded by itself.
  • the merge multiplexing and output device 120 uses a network digital video recorder (nDVR) function to stream the video so that the video is reproduced at the corresponding stream location when the receiving device 130 moves the playback position to an arbitrary previous position.
  • nDVR network digital video recorder
  • the merge multiplexing and outputting device 120 may provide a multiview encoding or transcoding function.
  • the merge multiplexing and outputting device 120 provides a multiple video channel (sub-channel) stream merging module.
  • the merge multiplexing and output device 120 performs SR (Super Resolution) and background removal based on AI.
  • the merge multiplexing and outputting device 120 includes a VOD generating module.
  • the merge multiplexing and outputting device 120 includes a meta data multiplexing module such as tag information and template information.
  • the merge multiplexing and outputting device 120 receives the transmission template information from the authoring device to configure the subchannel stream merge layout, or directly generates the transmission template and configures the stream merge layout through user control.
  • the merge multiplexing and outputting device 120 provides product information.
  • the merge multiplexing and output device 120 provides product specification information, product price information, and buyer opinion information.
  • the merge multiplexing and output device 120 provides web crawling.
  • the merge multiplexing and output device 120 provides price comparison, product review comparison, and discount information.
  • Merge multiplexing and output device 120 provides user management.
  • the merge multiplexing and output device 120 provides credit card/pay information.
  • the merge multiplexing and output device 120 provides an AI-based chatbot and interworking function.
  • the merge multiplexing and output device 120 provides a tag and interworking function.
  • the merge multiplexing and output device 120 configures a merge stream by matching the images received from the plurality of authoring devices 110 to the transmission template 410, multiplexes the transmission template and the presentation template information, and outputs the multiplexed information, and the reception device 130 ), the sub-channel stream is classified and extracted based on the transmission template, and reproduced and output based on the expression template.
  • the receiving device 130 moves the image to an image position corresponding to the corresponding tag and reproduces the image.
  • the receiving device 130 provides a look back function for the VOD image while watching the live image.
  • the receiving device 130 may receive desired information by expressing the VOD tag.
  • the receiving device 130 expresses the tagging information in a GUI.
  • the receiving device 130 After receiving the multi-view image using the transmission template 410 , the receiving device 130 according to the present embodiment recognizes sub-channel information, extracts and decodes streams for each sub-channel, and decodes the sub-channel image using the presentation template 420 . to compose a multi-view image and reproduce and output it.
  • the receiving device 130 measures at least one dimension of the user's body, waist circumference, finger circumference, and wrist circumference using an AR (Augmented Reality) function, and then converts a specific object into a real-time point cloud in three dimensions. Render.
  • the receiving device 130 adjusts a specific 3D rendered object to fit its dimensions, and then automatically matches and outputs the corresponding part.
  • the receiving device 130 automatically matches and outputs the corresponding decoder and the renderer according to the stream attribute of the corresponding channel when decoding the subchannel.
  • the receiving device 130 automatically receives an answer to the query by chatting with the chatbot, and when performing voice recognition chatting with the chatbot, the chatbot recognizes a voice and outputs it to the chatting window.
  • the receiving device 130 When chatting with the seller, when a plurality of customers ask a question at the same time, the receiving device 130 outputs an answer as text only to the receiver when a voice answer is given using information that can distinguish at least one of the questioner's ID, name, and phone number do.
  • the receiving device 130 checks information matching the channel tag, timeline tag, etc. of the received service channel stream based on user additional information such as user chatting information, voice recognition information, and conversation information with a chatbot, and the corresponding sub-channel Emphasize or enlarge the video, play the main window, or play the video automatically from the corresponding point in the timeline.
  • the receiving device 130 When the receiving device 130 outputs a multi-view image as an expression template, it operates a decoder and a renderer according to a VR360 image stream and a PCC (Point Cloud Coding) stream included in the multi-view image to decode the image according to the stream characteristics and let it play
  • a VR360 image stream and a PCC (Point Cloud Coding) stream included in the multi-view image to decode the image according to the stream characteristics and let it play
  • FIG. 2 is a diagram illustrating a configuration of a multi-view screen according to the present embodiment.
  • the authoring device 110 synchronizes images obtained from a plurality of cameras and outputs them on a multi-view screen configuration.
  • the authoring device 110 outputs an image (front image) having a high rank to the main screen area by user control or by assigning a rank during the configuration of the multi-view screen.
  • the authoring device 110 may apply motion tracking to adjust the position, size, and angle of the image so that the object or ROI is centered on the screen.
  • the authoring device 110 configures a multi-view screen if the VOD image corresponding to the tag is not included as a sub-channel of the current multi-view screen configuration. If it is inserted into the screen or already included in a sub-channel of the multi-view configuration, it provides a function to move to a location on the screen that can be emphasized more or to generate other highlighting effects.
  • the tag (channel tag) is predefined as product price, product specification, competitive product comparison, reference video (including VOD data video, 360 VR video, PCC (Point Cloud Coding) video, AR (Augmented Reality) video), etc. It can be used or defined and used during real-time broadcasting.
  • the authoring device 110 creates a channel tag and a timeline tag by adding it in advance or during broadcasting, and lists the icon corresponding to the tag.
  • the authoring device 110 selects one of the tags created during live video production and inserts the tag in a timeline or sub-channel being broadcast in a drag and drop method
  • the receiving device 130 reproduces the live video
  • the corresponding channel tag information is displayed for each sub-channel or the timeline tag is displayed on the timeline so that information corresponding to the tag information can be accessed even during real-time broadcasting.
  • the authoring device 110 provides a tag (channel tag, timeline tag) insertion function when producing live and other images for transmission.
  • the authoring device 110 confirms the location of the tag or displays the tag with a GUI, and when a specific tag is selected by a user's request or other event, the video input (VoD, stream, camera input, etc. of the sub-channels corresponding to the corresponding channel tag)
  • a sub-channel with information matching the selected tag information) is automatically or manually added to the multi-view screen composition as a sub-channel.
  • the authoring device 110 automatically or manually assigns the channel tag for the sub-channel video when the corresponding video input is included in the multi-view video composition for the service channel stream, or when the first video input for the sub-channel is registered. Allows designation of channel tag information.
  • the authoring device 110 may automatically or manually change and designate a channel tag for a corresponding image input according to a change in content of an image input for a sub-channel during broadcasting.
  • the authoring device 110 may automatically or manually change and designate a channel tag for a corresponding sub-channel according to a change in the content of the sub-channel during broadcasting.
  • the authoring device 110 allows a channel tag, a tag selected from a timeline tag list to be inserted into a timeline or a sub-channel by GUI.
  • the authoring device 110 determines a tag insertion position using a GUI.
  • the authoring device 110 registers a pre-made clip image, and lists (Listing).
  • the authoring device 110 may pre-produce and register a clip image for the inside, outside, and color of the product.
  • the authoring device 110 provides a clip production guide to pre-produce a clip image, the generated clip image is listed.
  • the authoring device 110 provides a GUI for selecting one of pre-produced clip images from the clip list and assigning it to any one of the multi-view screen configurations for output.
  • Authoring device 110 uses AI-based chat.
  • the authoring device 110 detects the chatting content input in the chatting window, checks the content matching the corresponding content, tag information, meta information, and the like, and automatically inserts the required clip into the multi-view channel.
  • the authoring device 110 recognizes the chat content input in the chat window, checks the content matching the content, tag information, meta information, etc., and highlights it to the producer (separate display, audio expression, etc.) and expresses it.
  • the authoring device 110 outputs a chat window as a GUI.
  • the authoring device 110 allows the action of automatically inserting the chat window request clip into the multi-view channel in the GUI to be expressed.
  • the authoring device 110 provides a multi-view channel service.
  • the authoring device 110 obtains an image input from a multi-camera input, a stream input, or a file clip as a multi-view image composition sub-channel.
  • the authoring device 110 inserts an image selected from among image input manually or automatically based on AI chatting content, etc., into one of the multi-view screen configurations as a sub-channel.
  • the authoring device 110 provides a multi-view service channel stream channel operation GUI composed of sub-channels through multi-camera input, multi-stream input, and multi-file clip input.
  • the authoring device 110 defines a transmission template and an expression template to be provided by using the GUI.
  • Authoring device 110 provides an operation GUI for the channel addition order.
  • the authoring device 110 provides an Internet information window and a sales information window.
  • the authoring device 110 outputs Internet price information to a separate Internet information window.
  • the authoring device 110 outputs sales volume, accumulated revenue, and other business status information to the sales information window.
  • Authoring device 110 provides a price adjustment function.
  • the authoring device 110 allows the seller to compare the lowest price and adjust the price by using the price adjustment function, and is applicable to a time sale event.
  • Authoring device 110 provides a price negotiation function.
  • the authoring device 110 uses a price negotiation function to negotiate a price with a specific customer according to a purchase quantity condition.
  • the authoring device 110 displays information (price comparison information) using an Internet information window as a GUI.
  • the authoring device 110 updates the real-time price adjustment details through the GUI.
  • the authoring device 110 provides image processing based on AI.
  • the authoring device 110 performs AI processing (color processing, SR, three-dimensionalization) on product images and images.
  • AI processing color processing, SR, three-dimensionalization
  • the authoring device 110 controls setup on/off with a GUI.
  • the authoring device 110 provides a multi-shop (shopping mall) channel operation function.
  • the authoring device 110 provides the image of each shop in the form of a browsing image service, such as a kind of moving picture EPG (Electric Program Guide) through a multi-view channel.
  • the authoring device 110 configures a multi-shop image input as a multi-view and provides it as a service channel stream channel, so that a multi-shop image composed of a multi-view channel is browsed from a receiver to a GUI to select a shop of interest to watch.
  • the authoring device 110 enables the designation of a representative sub-channel among the multi-view constituent images (sub-channels), so that when the merge multiplexing and output device 120 merges the multi-view service channel stream for the multi-shop channel, the representative part of the corresponding shop By applying the channel as a multi-view configuration image for multi-shop, the service channel stream channel can be output.
  • the authoring device 110 provides a video chat function.
  • the authoring device 110 provides a function that enables a customized proposal to the customer when the customer is capable of video communication.
  • the authoring device 110 provides a function of displaying a picture of a customer when sending a picture of the customer and displaying the clothes.
  • the authoring device 110 provides a function of maintaining the size of the object on the camera capture screen.
  • the authoring device 110 controls the PTZ (Pan, Tilt, Zoom) or other functions of the camera in the production tool to keep the size or direction of the object on the screen constant.
  • PTZ Pan, Tilt, Zoom
  • FIG. 1 is a diagram schematically illustrating a multi-channel media transmission/reception system according to the present embodiment.
  • the multi-channel media transmission/reception system includes an authoring device 110 , a merge multiplexing and outputting device 120 , and a receiving device 130 .
  • Components included in the multi-channel media transmission/reception system are not necessarily limited thereto.
  • the authoring device 110 authorizes the content using an authoring application and transmits it to the merge multiplexing and output device 120 .
  • the authoring device 110 includes a plurality of camera inputs including a main camera and a secondary camera.
  • the authoring device 110 acquires a plurality of images obtained by photographing a product or event at various angles by using a plurality of cameras.
  • the authoring device 110 obtains a plurality of images from a plurality of stream inputs.
  • the authoring device 110 acquires a plurality of images from a plurality of file inputs stored in the storage device.
  • the authoring device 110 generates a plurality of channel tag information (host image tag information, product internal image tag information, product side image tag information, and advertisement image tag information) in advance using an authoring tool.
  • the authoring device 110 includes a plurality of channel tag information (host image tag information, product internal image tag information, product side image tag information, advertisement image tag information) generated in advance while shooting a product or event using an authoring tool.
  • a channel tag selected in relation to each video channel (sub-channel) is reflected to each sub-channel.
  • the authoring device 110 generates a plurality of timeline tag information (product color tag information, price tag information, weight tag information, design tag information, and function tag information) in advance using an authoring tool.
  • the authoring device 110 relates to the photographed content among a plurality of tag information (product color tag information, price tag information, weight tag information, design tag information, function tag information) generated in advance while photographing a product using an authoring tool to reflect any one selected tag (eg, price tag information) on the timeline.
  • the authoring device 110 when the authoring device 110 describes the price while photographing the product using the authoring tool, if price tag information is selected from among a plurality of tag information generated in advance, the corresponding time (eg, 10 :00) applies the price tag information on the timeline. Then, when the authoring device 110 describes the design while photographing the product using the authoring tool, when design tag information is selected from among a plurality of previously generated tag information, the corresponding time (eg, 20:00) for describing the design ) to apply the design tag information on the timeline.
  • the corresponding time eg, 10 :00
  • the authoring device 110 uses an authoring tool to acquire an image using a plurality of cameras and files, then designates a transmission template, merges each sub-channel stream into a transmission template form, and generates and transmits one service channel stream make it possible
  • the authoring device 110 may configure various service channel stream formats by using the transmission template designation function of the authoring tool.
  • the authoring device 110 uses an authoring tool to obtain an image using a plurality of cameras and files, and then designates an expression template so that each image is output on a preset multi-view screen configuration when played back in the receiver.
  • the authoring device 110 can configure various reproduction screens by using the expression template designation function of the authoring tool. For example, the main screen (front screen) is allocated to the widest area among the multi-view screen configurations and the remaining areas are Allocate sub screens (right screen, left screen, top screen).
  • the authoring device 110 may allocate pre-stored captured images and advertisement images to some of the multi-view screen configurations. For example, the authoring device 110 may output a clip image inside a product among clip images pre-photographed in any one area of a multi-view screen configuration divided into six using an authoring tool.
  • the authoring device 110 analyzes the text input from the receiving device 130 in the chat window area.
  • the authoring device 110 recognizes the context by analyzing the text input to the chat window area using AI.
  • the authoring device 110 extracts requests (eg, advertisements, product internal images) from the recognized context using AI.
  • the authoring device 110 outputs an image corresponding to a request (eg, advertisement, product internal image) among the pre-captured images to any one area of the multi-view screen configuration.
  • the authoring device 110 includes an authoring tool.
  • the authoring tool includes tags, chatting, chatbot-type AI, pre-recorded clip images, and upload UI.
  • the authoring device 110 includes a plurality of images or streams in which a specific object or event is photographed in real time using an authoring tool, and each sub-channel of a service channel stream composed of a pre-captured file clip image. Match the channel tag and output it as a multi-view screen configuration.
  • the authoring device 110 generates a plurality of channel tag information (host image tag information, product internal image tag information, product side image tag information, and advertisement image tag information) in advance using an authoring tool.
  • the authoring device 110 includes a plurality of channel tag information (host image) generated in advance in each sub-channel of the multi-view composed of a plurality of images or streams photographed using an authoring tool and a pre-captured file clip (Clip) image.
  • the tag information, product internal video tag information, product side video tag information, advertisement video tag information the selected channel tag related to the corresponding sub-channel is matched to each sub-channel and output in a multi-view screen configuration.
  • the authoring device 110 includes a timeline tag on a timeline of a service channel stream composed of a plurality of images or streams in which a specific object or event is captured in real time using an authoring tool and a pre-captured file clip image. ) information is matched and output in a multi-view screen configuration.
  • the authoring device 110 generates a plurality of timeline tag information including at least one of product color tag information, price tag information, weight tag information, design tag information, and function tag information in advance by using an authoring tool.
  • the authoring device 110 reflects any one tag selected in relation to shooting content among a plurality of timeline tag information on a corresponding timeline.
  • the authoring device 110 designates a transmission template using an authoring tool, and based on this, merges the sub-channel streams into a transmission template form to generate and transmit one service channel stream.
  • the authoring device 110 may configure various service channel stream formats by using the transmission template designation function of the authoring tool.
  • the authoring device 110 designates an expression template using an authoring tool, and assigns any one of a plurality of images captured in real time to the widest area among the multi-view screen configurations as the main screen based on this, for example, and the rest The remaining images or pre-captured clip images among a plurality of images captured in the area are allocated to the auxiliary screen.
  • the authoring device 110 analyzes the text input from the receiving device 130 in the chat window area using AI to recognize the context, and extracts the request from the recognized context.
  • the authoring device 110 outputs an image corresponding to a request among pre-captured clip images as a specific screen, for example, an auxiliary screen, among multi-view screen configurations.
  • the authoring device 110 synchronizes a plurality of images captured in real time and outputs them on a multi-view screen configuration.
  • the authoring device 110 outputs an image having a high priority among a plurality of images captured in real time and a previously captured clip image on the main screen.
  • the authoring device 110 recognizes an object or a region of interest (ROI) in a plurality of images captured in real time, and then applies motion tracking to the position, size, and angle of the image so that the object or ROI comes to the center of the screen. to adjust
  • ROI region of interest
  • the authoring device 110 lists the icons corresponding to the tag information after generating a plurality of tag information (channel tags, timeline tags) in advance.
  • the authoring device 110 inserts selected tag information among a plurality of tag information into a timeline or a subchannel in a drag and drop method, and accesses the information corresponding to the tag information displayed on the subchannel or the timeline. to do it
  • the authoring device 110 converts the screen to a video on demand (VOD) image matched with the corresponding tag and outputs the image.
  • VOD video on demand
  • the authoring device 110 outputs Internet price information to a separate Internet information window, and outputs sales volume, accumulated revenue, and other business status information to the sales information window.
  • the authoring device 110 uses a price adjustment function on the Internet information window to allow the seller to compare the lowest price to adjust the price, and to output a time sale event.
  • the authoring device 110 allows a specific customer to negotiate a price according to a purchase quantity condition by using a price negotiation function on the Internet information window.
  • the authoring device 110 synchronizes the real-time captured image and outputs it in a multi-view format, and generates synchronization information between sub-channels that require synchronization among real-time captured images, stream input images, and pre-captured clip images to create a multi-view image Make it output as configuration information.
  • the multi-channel media transceiver apparatus of the present invention is preferably applied to an online shop, but is not necessarily limited thereto.
  • the multi-channel media transceiver sets each of the plurality of online shops as one service channel.
  • the merge multiplexing and output device 120 encodes or transcodes some or all of the input video channels during the configuration of the multi-view screen, and assigns each channel (sub-channel) stream to a transmission template designated by the authoring device. to output as one service channel stream, and the receiver encodes or transcodes the sub-channel images, respectively, so that the multi-view image is output with an expression template designated by the authoring device, and merges them into a service channel stream and outputs them.
  • Encoding or transcoding of some or all of the input video channels during the multi-view screen configuration may be performed in an authoring device, merge multiplexing and output device, or a separate encoder or transcoder. have.
  • the encoded or transcoded stream When encoding or transcoding of some or all of the input video channels during multi-view screen configuration is performed outside the merge multiplexing and output device, the encoded or transcoded stream must be input to the merge multiplexing and output device.
  • the merge multiplexing and output device 120 multiplexes and streams the video stream received from the authoring device 110 or a separate encoder/transcoder, or encoded/transcoded by itself.
  • the merge multiplexing and output device 120 uses a network digital video recorder (nDVR) function to stream the video so that the video is reproduced at the corresponding stream location when the receiving device 130 moves the playback position to an arbitrary previous position.
  • nDVR network digital video recorder
  • the merge multiplexing and outputting device 120 may provide a multiview encoding or transcoding function.
  • the merge multiplexing and outputting device 120 provides a multiple video channel (sub-channel) stream merging module.
  • the merge multiplexing and output device 120 performs SR (Super Resolution) and background removal based on AI.
  • the merge multiplexing and outputting device 120 includes a VOD generating module.
  • the merge multiplexing and outputting device 120 includes a meta data multiplexing module such as tag information and template information.
  • the merge multiplexing and outputting device 120 receives the transmission template information from the authoring device to configure the subchannel stream merge layout, or directly generates the transmission template and configures the stream merge layout through user control.
  • the merge multiplexing and outputting device 120 provides product information.
  • the merge multiplexing and output device 120 provides product specification information, product price information, and buyer opinion information.
  • the merge multiplexing and output device 120 provides web crawling.
  • the merge multiplexing and output device 120 provides price comparison, product review comparison, and discount information.
  • Merge multiplexing and output device 120 provides user management.
  • the merge multiplexing and output device 120 provides credit card/pay information.
  • the merge multiplexing and output device 120 provides an AI-based chatbot and interworking function.
  • the merge multiplexing and output device 120 provides a tag and interworking function.
  • the merge multiplexing and output device 120 configures a merge stream by matching the images received from the plurality of authoring devices 110 to the transmission template 410, multiplexes the transmission template and the presentation template information, and outputs the multiplexed information, and the reception device 130 ), the sub-channel stream is classified and extracted based on the transmission template, and reproduced and output based on the expression template.
  • the receiving device 130 moves the image to an image position corresponding to the corresponding tag and reproduces the image.
  • the receiving device 130 provides a look back function for the VOD image while watching the live image.
  • the receiving device 130 may receive desired information by expressing the VOD tag.
  • the receiving device 130 expresses the tagging information in a GUI.
  • the reception device 130 After receiving the multi-view image using the transmission template 410 , the reception device 130 according to the present embodiment recognizes sub-channel information, extracts and decodes streams for each sub-channel, and decodes the sub-channel image using the presentation template 420 . to compose a multi-view image and reproduce and output it.
  • the receiving device 130 measures at least one dimension of the user's body, waist circumference, finger circumference, and wrist circumference using an AR (Augmented Reality) function, and then converts a specific object into a real-time point cloud in three dimensions. Render.
  • the receiving device 130 adjusts a specific 3D rendered object to fit its dimensions, and then automatically matches and outputs the corresponding part.
  • the receiving device 130 automatically matches and outputs the corresponding decoder and the renderer according to the stream attribute of the corresponding channel when decoding the subchannel.
  • the receiving device 130 automatically receives an answer to the query by chatting with the chatbot, and when performing voice recognition chatting with the chatbot, the chatbot recognizes a voice and outputs it to the chatting window.
  • the receiving device 130 When chatting with the seller, when a plurality of customers ask a question at the same time, the receiving device 130 outputs an answer as text only to the receiver when a voice answer is given using information that can distinguish at least one of the questioner's ID, name, and phone number do.
  • the receiving device 130 checks information matching the channel tag, timeline tag, etc. of the received service channel stream based on user additional information such as user chatting information, voice recognition information, and conversation information with a chatbot, and the corresponding sub-channel Emphasize or enlarge the video, play the main window, or play the video automatically from the corresponding point in the timeline.
  • the receiving device 130 When the receiving device 130 outputs a multi-view image as an expression template, it operates a decoder and a renderer according to a VR360 image stream and a PCC (Point Cloud Coding) stream included in the multi-view image to decode the image according to the stream characteristics and let it play
  • a VR360 image stream and a PCC (Point Cloud Coding) stream included in the multi-view image to decode the image according to the stream characteristics and let it play
  • FIG. 2 is a diagram showing the configuration of a multi-view screen according to the present embodiment.
  • the authoring device 110 synchronizes images obtained from a plurality of cameras and outputs them on a multi-view screen configuration.
  • the authoring device 110 outputs an image (front image) having a high rank to the main screen area by user control or by assigning a rank during the configuration of the multi-view screen.
  • the authoring device 110 may apply motion tracking to adjust the position, size, and angle of the image so that the object or ROI is centered on the screen.
  • the authoring device 110 configures a multi-view screen if the VOD image corresponding to the tag is not included as a sub-channel of the current multi-view screen configuration. If it is inserted into the screen or already included in a sub-channel of the multi-view configuration, it provides a function to move to a location on the screen that can be emphasized more or to generate other highlighting effects.
  • the tag (channel tag) is predefined as product price, product specification, competitive product comparison, reference video (including VOD data video, 360 VR video, PCC (Point Cloud Coding) video, AR (Augmented Reality) video), etc. It can be used or defined and used during real-time broadcasting.
  • the authoring device 110 creates a channel tag and a timeline tag by adding it in advance or during broadcasting, and lists an icon corresponding to the tag.
  • the authoring device 110 selects one of the tags created during live video production and inserts the tag into the broadcast timeline or sub-channel in a drag and drop method, the receiving device 130 plays the live video , the corresponding channel tag information is displayed for each sub-channel, or the timeline tag is displayed on the timeline so that information corresponding to the tag information can be accessed even during real-time broadcasting.
  • the authoring device 110 provides a tag (channel tag, timeline tag) insertion function when producing live and other images for transmission.
  • the authoring device 110 confirms the location of the tag or displays the tag with a GUI, and when a specific tag is selected by a user's request or other event, the video input (VoD, stream, camera input, etc. of the sub-channels corresponding to the corresponding channel tag)
  • a sub-channel with information matching the selected tag information) is automatically or manually added to the multi-view screen composition as a sub-channel.
  • the authoring device 110 automatically or manually assigns the channel tag for the sub-channel video when the corresponding video input is included in the multi-view video composition for the service channel stream, or when the first video input for the sub-channel is registered. Allows designation of channel tag information.
  • the authoring device 110 may automatically or manually change and designate a channel tag for a corresponding image input according to a change in content of an image input for a sub-channel during broadcasting.
  • the authoring device 110 may automatically or manually change and designate a channel tag for a corresponding sub-channel according to a change in the content of the sub-channel during broadcasting.
  • the authoring device 110 allows a channel tag, a tag selected from a timeline tag list to be inserted into a timeline or a sub-channel by GUI.
  • the authoring device 110 determines a tag insertion position using a GUI.
  • the authoring device 110 registers a pre-made clip image, and lists (Listing).
  • the authoring device 110 may pre-produce and register a clip image for the inside, outside, and color of the product.
  • the authoring device 110 provides a clip production guide to pre-produce a clip image, the generated clip image is listed.
  • the authoring device 110 provides a GUI for selecting one of pre-produced clip images from the clip list and assigning it to any one of the multi-view screen configurations for output.
  • Authoring device 110 uses AI-based chat.
  • the authoring device 110 detects the chatting content input in the chatting window, checks the content matching the corresponding content, tag information, meta information, and the like, and automatically inserts the required clip into the multi-view channel.
  • the authoring device 110 recognizes the chat content input in the chat window, checks the content matching the content, tag information, meta information, etc., and highlights it to the producer (separate display, audio expression, etc.) and expresses it.
  • the authoring device 110 outputs a chat window as a GUI.
  • the authoring device 110 allows the action of automatically inserting the chat window request clip into the multi-view channel in the GUI to be expressed.
  • the authoring device 110 provides a multi-view channel service.
  • the authoring device 110 obtains an image input from a multi-camera input, a stream input, or a file clip as a multi-view image composition sub-channel.
  • the authoring device 110 inserts an image selected from among image input manually or automatically based on AI chatting content, etc., into one of the multi-view screen configurations as a sub-channel.
  • the authoring device 110 provides a multi-view service channel stream channel operation GUI composed of sub-channels through multi-camera input, multi-stream input, and multi-file clip input.
  • the authoring device 110 defines a transmission template and an expression template to be provided by using the GUI.
  • Authoring device 110 provides an operation GUI for the channel addition order.
  • the authoring device 110 provides an Internet information window and a sales information window.
  • the authoring device 110 outputs Internet price information to a separate Internet information window.
  • the authoring device 110 outputs sales volume, accumulated revenue, and other business status information to the sales information window.
  • Authoring device 110 provides a price adjustment function.
  • the authoring device 110 allows the seller to compare the lowest price and adjust the price by using the price adjustment function, and is applicable to a time sale event.
  • Authoring device 110 provides a price negotiation function.
  • the authoring device 110 uses a price negotiation function to negotiate a price with a specific customer according to a purchase quantity condition.
  • the authoring device 110 displays information (price comparison information) using an Internet information window as a GUI.
  • the authoring device 110 updates the real-time price adjustment details through the GUI.
  • the authoring device 110 provides image processing based on AI.
  • the authoring device 110 performs AI processing (color processing, SR, three-dimensionalization) on product images and images.
  • AI processing color processing, SR, three-dimensionalization
  • the authoring device 110 controls setup on/off with a GUI.
  • the authoring device 110 provides a multi-shop (shopping mall) channel operation function.
  • the authoring device 110 provides the image of each shop in the form of a browsing image service, such as a kind of moving picture EPG (Electric Program Guide) through a multi-view channel.
  • the authoring device 110 configures a multi-shop image input as a multi-view and provides it as a service channel stream channel, so that a multi-shop image composed of a multi-view channel is browsed from a receiver to a GUI to select a shop of interest to watch.
  • the authoring device 110 enables the designation of a representative sub-channel among the multi-view constituent images (sub-channels), so that when the merge multiplexing and output device 120 merges the multi-view service channel stream for the multi-shop channel, the representative part of the corresponding shop By applying the channel as a multi-view configuration image for multi-shop, the service channel stream channel can be output.
  • the authoring device 110 provides a video chat function.
  • the authoring device 110 provides a function that enables a customized proposal to the customer when the customer is capable of video communication.
  • the authoring device 110 provides a function of displaying a picture of a customer when sending a picture of the customer and displaying the clothes.
  • the authoring device 110 provides a function of maintaining the size of the object on the camera capture screen.
  • the authoring device 110 controls the PTZ (Pan, Tilt, Zoom) or other functions of the camera in the production tool to keep the size or direction of the object on the screen constant.
  • the authoring device 110 is controlled using an artificial intelligence engine to constantly maintain the position or direction of the object.
  • the authoring device 110 turns on/off the function by GUI.
  • the receiving device 130 outputs an image in a multi-view.
  • the receiving device 130 outputs the multi-view screen in the screen configuration selected by the seller based on the expression template included in the service channel stream.
  • Receiving device 130 provides a function that only the receiver can view the answer in text when a plurality of customers ask a question at the same time, and when a voice answer is given using information that can distinguish at least one of the questioner's ID, name, and phone number do. For example, if 'A' is 'Is there any yellow?' When asked, if the seller says, 'Sir A, there is a yellow color,' the voice recognition module recognizes it as an answer to A and takes a picture of the answer in the chat window of terminal 'A'.
  • the reception device 130 supports a screen view selected by the viewer from among the multi-view screens.
  • the reception device 130 includes a transmission template for configuring and outputting a multi-view screen, an expression template information extraction module, a channel tag, a timeline tag information extraction module, a sub-channel stream extraction function from a merged stream, and a plurality of reproduction modules.
  • the receiving device 130 provides an automatic chatting function with the chatbot.
  • the receiving device 130 outputs an image linked with the chatbot, tag, and sub-channel image.
  • the receiving device 130 has a function of automatically answering the chatbot by recognizing the content of the chat.
  • the receiving device 130 performs voice recognition chatting with the chatbot.
  • the receiving device 130 recognizes the user's voice and outputs it to the chatting window.
  • the receiving device 130 enables direct chatting with the seller.
  • the receiving device 130 outputs an answer as text only to the receiver when a voice answer is given using information that can distinguish at least one of the questioner's ID, name, and phone number do.
  • the receiving device 130 outputs real-time product reviews, real-time price comparison information, and real-time card discount information through an Internet information channel.
  • the receiving device 130 outputs an information display window through the GUI, and provides a price negotiation function through the GUI.
  • the receiving device 130 When selecting a sub-channel on a multi-view, such as PCC (Point Cloud Coding) / VR (Virtual Reality), the receiving device 130 causes a player to operate according to the sub-channel property.
  • the receiving device 130 outputs the product (eg, jewelry) being output on the multi-view screen in a format that can be worn over one's face or body using Point Cloud Coding (PCC), Augmented Reality (AR), or the like.
  • PCC Point Cloud Coding
  • AR Augmented Reality
  • the receiving device 130 processes the image in conjunction with the merge multiplexing and outputting device 120 .
  • the reception device 130 measures the dimensions in the terminal using the AR function and then applies them to makeup, wearing, and furniture simulation.
  • the receiving device 130 measures at least one of the body, waist circumference, finger circumference, and wrist circumference using the AR function, and then automatically matches and outputs clothes or accessories to the corresponding part.
  • the receiving device 130 renders a real-time point cloud of a portion whose dimensions are measured using an AR function in three dimensions.
  • the receiving device 130 provides a video call function with the seller.
  • the receiving device 130 may receive body dimensions and render them in 3D.
  • FIG. 3 is a diagram schematically illustrating a multi-channel media transceiver according to the present embodiment.
  • the merge multiplexing and outputting device 120 receives various media including moving pictures through a multi-channel sub-channel input for multi-view configuration using a network from a plurality of camera inputs, IP streaming inputs, and media file inputs.
  • the merge multiplexing and output device 120 receives a plurality of camera inputs, IP streaming inputs, and media file inputs and encodes/transcodes it by itself, or encodes each subchannel from an authoring device or an external encoder or transcoder. It receives a stream and uses it as a sub-channel input for multi-view configuration.
  • the merge multiplexing and outputting apparatus 120 outputs, in real time, a service channel stream in which a sub-channel stream is merged based on a transmission template using preset multi-view configuration information on a plurality of images input through multi-channels on one screen or a file. output as The merge multiplexing and outputting device 120 merges a plurality of video subchannel streams composed of a transmission template into a service channel stream and outputs them in real time or as a file, assigns a channel tag to each subchannel, and time on the service channel stream Give the line a timeline tag so that it can be streamed.
  • the multi-channel media transceiver includes an input unit 310 , an authoring unit 320 , a stream multiplexing unit 330 , a transmission streaming unit 340 , and a receiving unit 350 .
  • Components included in the multi-channel media transceiver are not necessarily limited thereto.
  • Each component included in the multi-channel media transceiver may be connected to a communication path that connects a software module or a hardware module inside the device, so that they can operate organically with each other. These components communicate using one or more communication buses or signal lines.
  • Each component of the multi-channel media transceiver device shown in FIG. 3 means a unit that processes at least one function or operation, and may be implemented as a software module, a hardware module, or a combination of software and hardware.
  • the input unit 310 includes a camera input unit, a streaming input unit, and a file input unit.
  • the input unit 310 receives images from a plurality of cameras as camera inputs by using the camera input unit.
  • the input unit 310 processes multiple inputs received from the camera input unit.
  • the input unit 310 receives an image stream from a network using a streaming input unit.
  • the input unit 310 processes the video stream input from the streaming input unit.
  • the input unit 310 receives a file from the authoring device by using the file input unit.
  • the input unit 310 processes a file input from the file input unit.
  • the authoring unit 320 configures the camera input, video stream, and file input from the input unit 310 to be transmitted and reproduced as a multi-view template to designate a transmission template and an expression template, and each input image (sub-channel) A channel tag is assigned to , and a timeline tag is assigned according to the timeline on the service channel stream.
  • the authoring unit 320 generates multi-view channel configuration information in which the real-time camera input input from the camera input unit, the video stream input from the streaming input unit, and files of various formats input from the file input unit are configured as multi-view channels.
  • the authoring unit 320 generates the necessary synchronization information between subchannels when composing a real-time camera input input from the camera input unit, a video stream inputted from the streaming input unit, and files of various formats input from the file input unit into a multi-view channel. Included in the multi-view channel configuration information.
  • the authoring unit 320 sets the transmission template 410 to specify the sub-channel stream merge format of the service channel stream generated by the stream multiplexer 330 based on the multi-view channel configuration information, and then transmits template setting information. to create The authoring unit 320 sets the presentation template 420 for the format to be reproduced and displayed in the receiver 350 receiving the service channel stream generated and output based on the multi-view channel configuration information, and then the presentation template setting information to create
  • the authoring unit 320 inserts a channel tag for each subchannel and then generates channel tag insertion information.
  • the authoring unit 320 inserts a timeline tag for the transmission service channel stream and then generates timeline tag insertion information.
  • the authoring unit 320 transmits multi-view channel configuration information, transmission template setting information, expression template setting information, channel tag insertion information, timeline tag insertion information, and synchronization information to the stream multiplexing unit 330 .
  • the authoring unit 320 receives an input through a multi-channel.
  • the authoring unit 320 sets some of the input multi-channels as sub-channels and then merges the specific sub-channels to create one service channel.
  • the authoring unit 320 may insert a channel tag for each sub-channel.
  • the authoring unit 320 inserts a typeline tag for the service channel.
  • the authoring unit 320 When inserting a channel tag and a timeline tag as additional functions, the authoring unit 320 provides an automatic tag insertion function rather than the author's manual operation through screen analysis, voice/audio/command analysis, and chat history analysis as additional functions.
  • the stream multiplexer 330 receives an image stream from the input unit 310 .
  • the stream multiplexer 330 receives multi-view channel configuration information, transmission template setting information, expression template setting information, channel tag insertion information, timeline tag insertion information, and synchronization information from the authoring unit 320 .
  • Encoding/transcoding of the input image input to the input unit 310 may be performed by the input unit 310 or the authoring unit 320 or the stream multiplexing unit 330 or other external encoder/transcoder.
  • the stream multiplexer 330 generates a transmission stream based on multi-view channel configuration information, transmission template setting information, presentation template setting information, channel tag insertion information, timeline tag insertion information, and synchronization information.
  • the stream multiplexer 330 merges streams based on multi-view channel configuration information and transmission template configuration information.
  • the stream multiplexer 330 inserts tag information based on channel tag insertion information and timeline tag insertion information.
  • the stream multiplexer 330 inserts the transmission template information and the presentation template information into the service channel stream based on the transmission template information and the presentation template information.
  • the transmission streaming unit 340 receives the transmission stream from the stream multiplexer 330 .
  • the transmission streaming unit 340 provides a streaming service of the transmission stream as a multi-view service channel stream.
  • the transmission streaming unit 340 provides an nDVR (network digital video recorder) function so that, when the viewer moves the playback position to a previous arbitrary position, the video is streamed so that the image can be played from the corresponding stream position.
  • nDVR network digital video recorder
  • the receiving unit 350 receives the multi-view service channel stream from the transmission streaming unit 340 .
  • the receiver 350 extracts and decodes a multi-view component sub-channel from the multi-view service channel stream by using the multi-view channel configuration information and transmission template configuration information.
  • the receiving unit 350 configures the decoded multi-view constituent sub-channels into a multi-view by using the presentation template setting information and displays it on the screen.
  • the receiver 350 extracts the channel tag insertion information and the timeline tag insertion information and displays it on the display screen together with the video information.
  • the receiver 350 receives a multi-view service channel stream channel.
  • the receiver 350 interprets the transmission template information from the merge stream composed of subchannels in the service channel.
  • the receiver 350 extracts a channel tag for each sub-channel.
  • the receiver 350 extracts each sub-channel based on the transmission template and the channel tag to match the channel tag.
  • the receiving unit 350 interprets the expression template setting information.
  • the receiver 350 reproduces the subchannel by displaying the subchannel designated at each position in accordance with the presentation template format based on the presentation template setting information.
  • the receiver 350 extracts the channel tag information and displays it on the reproduced sub-channel.
  • the receiving unit 350 extracts the timeline tag insertion information and displays the corresponding tag at the corresponding time.
  • the receiver 350 refers to screen analysis, voice/audio/command analysis, chat history analysis information, receiver side additional information, extracted channel tag, and timeline tag, and emphasizes the corresponding sub-channel if necessary (moves to the main playback area) , highlighting, etc.), or by moving to the corresponding time position and playing it.
  • the receiver 350 implements a decoder and a renderer for each sub-channel stream format to perform decoding according to the stream characteristics.
  • the receiver 350 is mounted with a decoder and a renderer according to a VR360 video stream and a PCC (Point Cloud Coding) stream in addition to the general video, and decodes the video according to the stream characteristics.
  • a VR360 video stream and a PCC Point Cloud Coding
  • FIG. 4 is a diagram illustrating an example of a transmission template and an expression template according to the present embodiment.
  • the authoring device 110 sets the transmission template 410 in the form of 1 ⁇ N, M ⁇ 1, P ⁇ Q, etc. as shown in FIG. 4 .
  • the authoring device 110 sets the expression template 420 in various formats, as shown in FIG. 4 .
  • Authoring device 110 applies the registration / deletion function for the transmission template 410 and the expression template (420).
  • FIG. 5 is a diagram illustrating an example of a GUI comprising a work screen monitor and an output screen monitor of the authoring apparatus according to the present embodiment.
  • the channel tag means tag information for each subchannel to be merged.
  • the timeline tag means tag information displayed on the service channel stream channel timeline.
  • the authoring device 110 includes a stream input content input setting/registration/deletion processing unit, a live input content input setting/registration/deletion processing unit, a file (VOD) content input setting/registration/deletion processing unit, a screen output unit, a timeline tag unit, It includes a channel tag section, an output template configuration section, and a transmission template configuration section.
  • VOD file
  • the multi-content input processing unit includes a live (camera, streaming) input unit and a file input unit.
  • the live input content processing unit manages live content input setting/registration/deletion, live content list management, and content display.
  • the file (VOD) content processing unit manages file content input setting/registration/deletion, file content list management, and content display.
  • the screen output unit manages the output screen display and the work screen display.
  • the timeline tag unit registers/deletes timeline tags, and manages the timeline tag list and display.
  • the channel tag unit registers/deletes channel tags, and manages the channel tag list and expression.
  • the output template configuration unit displays an output template list and registers/deletes templates.
  • the transmission template configuration unit displays the transmission template list and registers/deletes templates.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Television Signal Processing For Recording (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Sont divulgués ici un procédé et un système de transmission/réception de contenu multimédia multicanal. Dans le présent mode de réalisation, l'invention concerne un procédé et un système de transmission/réception de contenu multimédia multicanal dans lesquels des images d'un produit capturées à divers angles au moyen d'une pluralité de caméras sont reçues par l'intermédiaire de multiples canaux et fournies en sortie en temps réel dans un modèle multi-vue prédéfini sur un écran, et une étiquette de canal est attribuée à un sous-canal sur le flux de canal de service en cours de production, et une étiquette de ligne de temps est attribuée selon une ligne de temps, de telle sorte que la diffusion en continu puisse être effectuée et qu'un service de diffusion en ligne multi-vue puisse être fourni.
PCT/KR2021/014985 2020-12-02 2021-10-25 Procédé et système de transmission/réception de contenu multimédia multicanal WO2022119119A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020200167045A KR102417084B1 (ko) 2020-12-02 2020-12-02 멀티채널 미디어 송수신 방법 및 시스템
KR10-2020-0167045 2020-12-02

Publications (1)

Publication Number Publication Date
WO2022119119A1 true WO2022119119A1 (fr) 2022-06-09

Family

ID=81853164

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2021/014985 WO2022119119A1 (fr) 2020-12-02 2021-10-25 Procédé et système de transmission/réception de contenu multimédia multicanal

Country Status (2)

Country Link
KR (1) KR102417084B1 (fr)
WO (1) WO2022119119A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115379250A (zh) * 2022-07-22 2022-11-22 广州博冠信息科技有限公司 视频处理方法、装置、计算机设备及存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060032872A (ko) * 2004-10-13 2006-04-18 박우현 디지털 방송 환경 하에 있어서 연동형 데이터방송을 인터넷 영역으로 확장하여 양방향 광고를 실현하는 방법
KR20090115344A (ko) * 2008-05-02 2009-11-05 강민수 사용자 선택 방송국 카메라 촬상 영상 제공 시스템 및 그방법
KR20130052226A (ko) * 2011-11-11 2013-05-22 권순길 홈쇼핑 방송 서비스 장치 및 방법
KR101845157B1 (ko) * 2016-07-12 2018-04-03 케이티하이텔 주식회사 가상현실을 이용하여 디지털데이터방송 홈쇼핑의 상품상세정보를 제공하는 방법
KR20200031216A (ko) * 2018-09-14 2020-03-24 나모웹비즈주식회사 증강 현실, 가상 현실 또는 혼합 현실 컨텐츠를 제공하는 디스플레이 장치, 방법 및 서버

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060032872A (ko) * 2004-10-13 2006-04-18 박우현 디지털 방송 환경 하에 있어서 연동형 데이터방송을 인터넷 영역으로 확장하여 양방향 광고를 실현하는 방법
KR20090115344A (ko) * 2008-05-02 2009-11-05 강민수 사용자 선택 방송국 카메라 촬상 영상 제공 시스템 및 그방법
KR20130052226A (ko) * 2011-11-11 2013-05-22 권순길 홈쇼핑 방송 서비스 장치 및 방법
KR101845157B1 (ko) * 2016-07-12 2018-04-03 케이티하이텔 주식회사 가상현실을 이용하여 디지털데이터방송 홈쇼핑의 상품상세정보를 제공하는 방법
KR20200031216A (ko) * 2018-09-14 2020-03-24 나모웹비즈주식회사 증강 현실, 가상 현실 또는 혼합 현실 컨텐츠를 제공하는 디스플레이 장치, 방법 및 서버

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115379250A (zh) * 2022-07-22 2022-11-22 广州博冠信息科技有限公司 视频处理方法、装置、计算机设备及存储介质
CN115379250B (zh) * 2022-07-22 2024-06-11 广州博冠信息科技有限公司 视频处理方法、装置、计算机设备及存储介质

Also Published As

Publication number Publication date
KR20220077781A (ko) 2022-06-09
KR102417084B1 (ko) 2022-07-06

Similar Documents

Publication Publication Date Title
WO2018004239A1 (fr) Procédé de décodage d'image, procédé de codage d'image, dispositif de décodage d'image et dispositif de codage d'image
WO2019194434A1 (fr) Procédé et dispositif d'émission-réception de métadonnées pour une pluralité de points de vue
WO2019147064A1 (fr) Procédé de transmission et de réception de données audio et appareil associé
WO2018080180A1 (fr) Système et procédé pour fournir un service de réalité augmentée associé à une diffusion
WO2017007206A1 (fr) Appareil et procédé de fabrication d'une vidéo relationnelle avec le spectateur
WO2013157890A1 (fr) Procédé et appareil de traitement de données pour prise en charge de réalité augmentée
WO2019027090A1 (fr) Terminal mobile et procédé de commande associé
WO2010008139A2 (fr) Système et procédé d'application multimédia utilisant des métadonnées pour un dispositif sensoriel
WO2015122627A1 (fr) Procédé d'édition d'image et appareil correspondant
WO2023017911A1 (fr) Dispositif, système et procédé de capture d'image en production virtuelle
WO2022019719A1 (fr) Production et distribution de contenu multimédia immersif à partir de flux capturés par l'intermédiaire de dispositifs mobiles distribués
WO2016089093A1 (fr) Procédé et dispositif de transmission et de réception d'un signal de radiodiffusion
WO2019203627A1 (fr) Procédé permettant d'émettre et de recevoir des données audio liées à un effet de transition et dispositif associé
WO2019231178A1 (fr) Procédé et dispositif d'émission-réception de métadonnées sur une pluralité de points de vue
WO2018021707A1 (fr) Système de publicité vidéo vr et système de production de publicité vr
WO2011155766A2 (fr) Procédé de traitement d'image et dispositif d'affichage d'image conforme à ce procédé
WO2022119119A1 (fr) Procédé et système de transmission/réception de contenu multimédia multicanal
WO2019013415A1 (fr) Procédé et système de diffusion en continu en direct d'un contenu vr
WO2015142137A1 (fr) Appareil électronique, procédé de traitement d'image, et support d'enregistrement lisible par ordinateur
WO2012157887A2 (fr) Appareil et procédé permettant de délivrer un contenu 3d
WO2020116740A1 (fr) Système d'édition de diffusion en temps réel et procédé d'édition
WO2016080585A1 (fr) Système et procédé de fourniture d'interfaces utilisateur en nuage
WO2011083951A2 (fr) Système de commande de lecture et procédé de commande de lecture utilisant un équipement de terminal
WO2016080586A1 (fr) Système et procédé de fourniture d'interfaces utilisateur en nuage
WO2017065334A1 (fr) Procédé et dispositif pour générer un contenu

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21900807

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21900807

Country of ref document: EP

Kind code of ref document: A1