JP7426021B2

JP7426021B2 - Systems, methods, and computer-readable media for video processing

Info

Publication number: JP7426021B2
Application number: JP2022522347A
Authority: JP
Inventors: ユアンウー，シャオ; チウ，ポ－シェン; チャン，ユーチュアン; チェン，ミン－チュ
Original assignee: 17Live Japan Inc
Current assignee: 17Live Japan Inc
Priority date: 2021-09-30
Filing date: 2021-09-30
Publication date: 2024-02-01
Anticipated expiration: 2041-09-30
Also published as: WO2023055365A1; JP2023543369A

Description

本発明は、ライブビデオストリーミングまたはビデオ電話会議における画像処理または映像処理に関する。 The present invention relates to image or video processing in live video streaming or video teleconferencing.

ユーザ同士のオンライン通信に参加することを可能にするさまざまな技術が知られている。そのアプリケーションには、ライブストリーミング、ライブ電話会議などが含まれる。これらアプリケーションの普及に伴い、コミュニケーション中の交流体験の向上に対するユーザからの要望が高まっている。 Various techniques are known that allow users to participate in online communications with each other. Its applications include live streaming, live conference calls, and more. With the spread of these applications, there is an increasing demand from users for improved interaction experiences during communication.

本発明の一実施態様に係る方法は、映像処理方法である。当該方法が、第１ユーザのライブ映像をユーザ端末の第１領域に表示する工程と、第２ユーザの映像を当該ユーザ端末の第２領域に表示する工程を含み、当該第１ユーザの当該ライブ映像の一部が、当該ユーザ端末の当該第２領域まで延伸される。 A method according to one embodiment of the present invention is a video processing method. The method includes the steps of displaying a first user's live video in a first area of a user terminal, and displaying a second user's live video in a second area of the user terminal, A portion of the video is stretched to the second area of the user terminal.

本発明の一実施態様に係るシステムは、映像処理のためのシステムであり、１以上のプロセッサを含み、当該１以上のプロセッサが機械可読命令を実行して、第１ユーザのライブ映像をユーザ端末上の第１領域に表示する工程と、第２ユーザの映像を当該ユーザ端末の第２領域に表示する工程と、を実行する。当該第１ユーザの当該ライブ映像の一部が、当該ユーザ端末の当該第２領域まで延伸される。 A system according to an embodiment of the present invention is a system for video processing, and includes one or more processors, and the one or more processors execute machine-readable instructions to convert live video of a first user to a user terminal. A step of displaying the video of the second user in the second region of the user terminal is executed. A portion of the live video of the first user is extended to the second area of the user terminal.

本発明の一実施態様に係るコンピュータ可読媒体は、非一時的なコンピュータ可読媒体であり、映像処理のためのプログラムを含み、当該プログラムが１以上のコンピュータに、第１ユーザのライブ映像をユーザ端末上の第１領域に表示する工程と、第２ユーザの映像を当該ユーザ端末の第２領域に表示する工程と、を実行させる。当該第１ユーザの当該ライブ映像の一部が、当該ユーザ端末の当該第２領域まで延伸される。 A computer-readable medium according to an embodiment of the present invention is a non-transitory computer-readable medium, and includes a program for video processing, and the program transmits live video of a first user to a user terminal. A step of displaying the second user's video in the second region of the user terminal is executed. A portion of the live video of the first user is extended to the second area of the user terminal.

グループ通話の一例を示す概略図である。FIG. 2 is a schematic diagram showing an example of a group call. 本発明の一部の実施態様に基づくグループ通話の一例を示す概略図である。1 is a schematic diagram illustrating an example of a group call in accordance with some embodiments of the present invention; FIG. 本発明の一部の実施態様に基づくグループ通話の一例を示す概略図である。1 is a schematic diagram illustrating an example of a group call in accordance with some embodiments of the present invention; FIG. 本発明の一部の実施態様に基づくグループ通話の一例を示す概略図である。1 is a schematic diagram illustrating an example of a group call in accordance with some embodiments of the present invention; FIG. 本発明の一部の実施態様に基づくグループ通話の一例を示す概略図である。1 is a schematic diagram illustrating an example of a group call in accordance with some embodiments of the present invention; FIG. 本発明の一部の実施態様に基づくグループ通話の一例を示す概略図である。1 is a schematic diagram illustrating an example of a group call in accordance with some embodiments of the present invention; FIG. 本発明の一部の実施態様に基づく通信システムの構成を示す概略図である。1 is a schematic diagram illustrating a configuration of a communication system according to some embodiments of the present invention. FIG. 本発明の一部の実施態様に基づく通信システムの例示的な機能構成図である。1 is an exemplary functional block diagram of a communication system according to some embodiments of the present invention. FIG. 本発明の一部の実施態様に基づく通信システムの動作を示す例示的なシーケンス図である。1 is an exemplary sequence diagram illustrating the operation of a communication system according to some embodiments of the present invention. FIG.

ライブストリーミングサービス、アプリケーション（ＡＰＰ）またはプラットフォームの一部では、複数のユーザ（配信者、視聴者、放送者、アンカーなど）がグループ通話モードや電話会議モードで参加することが可能であり、そのうち、複数のユーザの映像がユーザ端末の画面上に同時に表示され、そのグループ通話またはグループ通話への参加を表示する。当該ユーザ端末は、スマートフォンやタブレット、パソコン、ノートパソコンとすることができ、それを使用して当該ユーザの１人が当該グループ通話に参加する。 Some live streaming services, applications (APPs) or platforms allow multiple users (streamers, viewers, broadcasters, anchors, etc.) to participate in group call mode or conference call mode, of which: Images of multiple users are displayed simultaneously on the screen of the user terminal to indicate their group call or participation in the group call. The user terminal can be a smartphone, tablet, personal computer, or laptop, and is used by one of the users to participate in the group call.

図１にグループ通話の一例を示す概略図を示す。Ｓ１は当該グループ通話を表示するユーザ端末の画面である。ＲＡは、当該画面Ｓ１内の１領域であり、ユーザＡのライブ映像を表示する。ＲＢは、当該画面Ｓ１内の１領域であり、ユーザＢのライブ映像を表示する。当該ユーザＡのライブ映像は、ユーザＡの近傍に配置されたカメラなどの映像撮影装置によって撮影され、提供されてもよい。当該ユーザＢのライブ映像は、ユーザＢの近傍に配置されたカメラなどの映像撮影装置によって撮影され、提供されてもよい。 FIG. 1 shows a schematic diagram illustrating an example of a group call. S1 is a screen of the user terminal that displays the group call. RA is one area within the screen S1, and displays user A's live video. RB is one area within the screen S1, and displays user B's live video. The live video of the user A may be captured and provided by a video capture device such as a camera placed near the user A. The live video of the user B may be captured and provided by a video capturing device such as a camera placed near the user B.

従来、当該ユーザＡの映像は領域ＲＡ内にのみ表示でき、領域ＲＢ内に表示することはできない。同様に、当該ユーザＢの映像は領域ＲＢ内にのみ表示でき、領域ＲＡ内に表示することはできない。そのため、コミュニケーション中に不便を感じたり、アプリケーションに支障をきたしたりする場合がある。例えば、ユーザＢが当該グループ通話でユーザＡに新たに開発した製品を提示する例示的なシナリオにおいて、ユーザＡは詳細な議論のために製品の一部または部分を正確に指し示すことができない。そのため、グループ通話や電話会議では、より多くの交流を行うことが望まれる。 Conventionally, the video of the user A can only be displayed within the area RA, and cannot be displayed within the area RB. Similarly, the video of the user B can only be displayed within the area RB, and cannot be displayed within the area RA. This may cause inconvenience during communication or may cause problems with the application. For example, in an exemplary scenario in which User B presents a newly developed product to User A in the group call, User A is unable to pinpoint a part or portion of the product for detailed discussion. Therefore, it is desirable to have more interaction during group calls and conference calls.

図２は本発明の一部の実施態様に基づくグループ通話の一例を示す概略図である。図２に示すように、ユーザＡの部分Ａ１が、ユーザＢが表示されている領域ＲＢまで延伸されている、または再現／複製されている。本実施態様において、当該部分Ａ１は、領域ＲＡにおけるユーザＡの手であり、当該部分Ａ１１は、領域ＲＢに表示される当該部分Ａ１の延伸、再現、または複製されたバージョンである。当該部分Ａ１１は、領域ＲＢ内のオブジェクトＢ１を指し示す、またはその方向に向けられている。一部の実施態様において、領域ＲＢ内に表示されているユーザＢの映像はライブ映像である。一部の実施態様において、領域ＲＢ内に表示されているユーザＢの映像は再生映像である。 FIG. 2 is a schematic diagram illustrating an example of a group call in accordance with some embodiments of the present invention. As shown in FIG. 2, user A's portion A1 has been extended or reproduced/duplicated to area RB where user B is displayed. In this embodiment, the portion A1 is the user A's hand in the area RA, and the portion A11 is a stretched, reproduced, or duplicated version of the portion A1 displayed in the area RB. The portion A11 points to or is oriented towards the object B1 within the region RB. In some implementations, the video of user B displayed within region RB is live video. In some embodiments, the video of user B displayed within region RB is a played video.

一部の実施態様において、当該部分Ａ１１は当該部分Ａ１の動きまたは軌跡を追従する。一部の実施態様において、当該部分Ａ１１は当該部分Ａ１と同時に移動する。当該ユーザＡは、当該部分Ａ１となっている手を動かすだけで、当該ユーザＡが議論したい領域ＲＢ内の位置を指し示すように、当該部分Ａ１１を制御したり、動かしたりすることができる。一部の実施態様において、当該部分Ａ１１は、グラフィカルオブジェクトまたはアニメーションオブジェクトとして表現または表示されてもよい。 In some embodiments, the portion A11 follows the movement or trajectory of the portion A1. In some embodiments, the portion A11 moves simultaneously with the portion A1. The user A can control or move the part A11 to point to a position within the area RB that the user A wants to discuss by simply moving the hand that is the part A1. In some implementations, the portion A11 may be represented or displayed as a graphical or animated object.

図２に示すように、領域ＲＡ内には境界Ａ３がある。当該境界Ａ３は、領域ＲＡ内で領域Ａ３１と領域Ａ３２を定義する。本実施態様において、当該領域Ａ３１は当該領域Ａ３２を囲んでいる。当該領域Ａ３１は、インタラクティブ領域と呼ぶ、または定義することができる。領域ＲＢ内に延伸される、または再現される当該部分Ａ１は、当該インタラクティブ領域Ａ３１内にある。当該部分Ａ１は、領域ＲＡ内のユーザＢに向けて延伸される。一部の実施態様において、当該インタラクティブ領域Ａ３１内の部分のみを領域ＲＢ内に延伸または表示することができる。一部の実施態様において、ユーザＡがユーザＡの１部分を領域ＲＢに延伸することによりユーザＢと交流したい場合、ユーザＡは当該部分を当該インタラクティブ領域Ａ３１へ移動するだけで、当該部分が領域ＲＢ内に表示される。本実施態様において、当該領域ＲＡと当該領域ＲＢは相互に分離されている。一部の実施態様において、当該領域ＲＡと当該領域ＲＢは、当該画面Ｓ１上で少なくとも一部が重なり合ってもよい。 As shown in FIG. 2, there is a boundary A3 within the area RA. The boundary A3 defines an area A31 and an area A32 within the area RA. In this embodiment, the area A31 surrounds the area A32. The area A31 can be called or defined as an interactive area. The portion A1 that is extended or reproduced within the area RB is within the interactive area A31. The portion A1 is extended toward the user B within the area RA. In some implementations, only the portion within the interactive area A31 may be extended or displayed within the area RB. In some implementations, if user A wants to interact with user B by extending a part of user A into region RB, user A can simply move the part to the interactive area A31 and Displayed in RB. In this embodiment, the area RA and the area RB are separated from each other. In some embodiments, the area RA and the area RB may at least partially overlap on the screen S1.

図２に示すように、領域ＲＢ内には境界Ｂ３がある。当該境界Ｂ３は、領域ＲＢ内で領域Ｂ３１と領域Ｂ３２を定義する。本実施態様において、当該領域Ｂ３１は当該領域Ｂ３２を囲んでいる。当該領域Ｂ３１は、インタラクティブ領域と呼ぶ、または定義することができる。一部の実施態様において、当該インタラクティブ領域Ｂ３１内の部分を領域ＲＡ内に延伸または表示することができる。一部の実施態様において、ユーザＢがユーザＢの１部分を領域ＲＡに延伸することによりユーザＡと交流したい場合、ユーザＢは当該部分を当該インタラクティブ領域Ｂ３１へ移動するだけで、当該部分が領域ＲＡ内に表示される。一部の実施態様において、当該境界Ａ３及び／または当該境界Ｂ３は、当該領域ＲＡ及び／または当該領域ＲＢ上に表示されなくてもよい。 As shown in FIG. 2, there is a boundary B3 within the region RB. The boundary B3 defines a region B31 and a region B32 within the region RB. In this embodiment, the area B31 surrounds the area B32. The area B31 can be called or defined as an interactive area. In some implementations, a portion within the interactive area B31 may be extended or displayed within the area RA. In some implementations, if user B wants to interact with user A by extending a part of user B to area RA, user B can simply move the part to the interactive area B31 and Displayed in RA. In some embodiments, the boundary A3 and/or the boundary B3 may not be displayed on the area RA and/or the area RB.

図２において、ユーザＡとユーザＢ、または領域ＲＡと領域ＲＢは、当該ユーザ端末の当該画面Ｓ１上で横方向に並んでおり、当該ユーザＡのライブ映像の当該部分Ａ１が当該領域ＲＡにおいてユーザＢに向かって延伸されている。 In FIG. 2, user A and user B, or area RA and area RB, are arranged horizontally on the screen S1 of the user terminal, and the part A1 of the live video of the user A is displayed by the user in the area RA. It is extended towards B.

図３に本発明の一部の実施態様に基づくグループ通話の別の一例を示す概略図を示す。当該グループ通話には、少なくとも４人のユーザ、すなわちユーザＡ、ユーザＢ、ユーザＣ、ユーザＤが参加している。図３において、ユーザＡとユーザＢは当該ユーザ端末の当該画面Ｓ１上で縦方向に並んでいる。図３に示すように、ユーザＡの部分Ａ２が、ユーザＢが表示されている領域ＲＢまで延伸されている、または再現／複製されている。本実施態様において、当該部分Ａ２はユーザＡの手と、その手に保持されたオブジェクトを含み、当該部分Ａ２１は領域ＲＢ内に表示される当該部分Ａ２の延伸、再現、または複製されたバージョンである。当該部分Ａ２１は、領域ＲＢ内のユーザＢに近づく、またはその方向に向けられている。当該部分Ａ２１がユーザＢに接触すると、領域ＲＢに特殊効果ＳＰ１が表示される。当該特殊効果ＳＰ１は、グラフィックオブジェクトまたはアニメーションオブジェクトを含んでもよい。一部の実施態様において、当該特殊効果ＳＰ１は効果音を含んでもよい。 FIG. 3 shows a schematic diagram illustrating another example of group calling according to some embodiments of the present invention. At least four users, ie, user A, user B, user C, and user D, are participating in the group call. In FIG. 3, user A and user B are lined up in the vertical direction on the screen S1 of the user terminal. As shown in FIG. 3, user A's portion A2 has been extended or reproduced/duplicated to area RB where user B is displayed. In this embodiment, the portion A2 includes the user A's hand and the object held in the hand, and the portion A21 is a stretched, reproduced, or duplicated version of the portion A2 displayed within the region RB. be. The portion A21 approaches or is directed toward the user B within the region RB. When the portion A21 comes into contact with the user B, a special effect SP1 is displayed in the area RB. The special effect SP1 may include a graphic object or an animation object. In some embodiments, the special effect SP1 may include sound effects.

一部の実施態様において、当該部分Ａ２１は当該部分Ａ２の動きまたは軌跡を追従する。一部の実施態様において、当該部分Ａ２１は当該部分Ａ２と同時に移動する。当該ユーザＡは、当該ユーザＡの手（オブジェクトを持っていてもよい）を動かすだけで、当該ユーザＡが交流したい領域ＲＢ内の１つの位置を指し示す、または触れるように、当該部分Ａ２１を制御したり、動かしたりすることができる。一部の実施態様において、当該部分Ａ２１は、グラフィカルオブジェクトまたはアニメーションオブジェクトとして表現または表示されてもよい。 In some embodiments, the portion A21 follows the movement or trajectory of the portion A2. In some embodiments, the portion A21 moves simultaneously with the portion A2. The user A controls the part A21 to point to or touch a position in the area RB with which the user A wants to interact by simply moving the user A's hand (which may be holding an object). You can do or move it. In some implementations, the portion A21 may be represented or displayed as a graphical or animated object.

図３に示すように、領域ＲＡ内には境界Ａ３がある。当該境界Ａ３は、領域ＲＡ内で領域Ａ３１と領域Ａ３２を定義する。本実施態様において、当該領域Ａ３１は当該領域Ａ３２を囲んでいる。当該領域Ａ３１は、インタラクティブ領域と呼ぶ、または定義することができる。領域ＲＢ内に延伸される、または再現される当該部分Ａ２は、当該インタラクティブ領域Ａ３１内にある。当該部分Ａ２は、領域ＲＡ内のユーザＢに向けて延伸される。一部の実施態様において、当該インタラクティブ領域Ａ３１内の部分のみを領域ＲＢ内に延伸または表示することができる。一部の実施態様において、ユーザＡがユーザＡの１部分を領域ＲＢに延伸することによりユーザＢと交流したい場合、ユーザＡは当該部分を当該インタラクティブ領域Ａ３１へ移動するだけで、当該部分が領域ＲＢ内に表示される。 As shown in FIG. 3, there is a boundary A3 within the area RA. The boundary A3 defines an area A31 and an area A32 within the area RA. In this embodiment, the area A31 surrounds the area A32. The area A31 can be called or defined as an interactive area. The portion A2 that is extended or reproduced within the area RB is within the interactive area A31. The portion A2 is extended toward the user B within the area RA. In some implementations, only the portion within the interactive area A31 may be extended or displayed within the area RB. In some implementations, if user A wants to interact with user B by extending a part of user A into region RB, user A can simply move the part to the interactive area A31 and Displayed in RB.

図４に本発明の一部の実施態様に基づくグループ通話の別の一例を示す概略図を示す。当該グループ通話には、少なくとも４人のユーザ、すなわちユーザＡ、ユーザＢ、ユーザＣ、ユーザＤが参加している。図４において、ユーザＡとユーザＤは当該ユーザ端末の当該画面Ｓ１上で対角方向に並んでいる。図４に示すように、ユーザＡの部分Ａ１が、ユーザＤが表示されている領域ＲＤまで延伸されている、または再現／複製されている。本実施態様において、当該部分Ａ１はユーザＡの手であり、当該部分Ａ１１は、領域ＲＤに表示される当該部分Ａ１の延伸、再現、または複製されたバージョンである。当該部分Ａ１１は、領域ＲＤ内のユーザＤを指し示す、またはその方向に向けられている。 FIG. 4 shows a schematic diagram illustrating another example of group calling according to some embodiments of the present invention. At least four users, ie, user A, user B, user C, and user D, are participating in the group call. In FIG. 4, users A and D are lined up diagonally on the screen S1 of the user terminal. As shown in FIG. 4, user A's portion A1 has been extended or reproduced/duplicated to region RD where user D is displayed. In this embodiment, the portion A1 is user A's hand, and the portion A11 is a stretched, reproduced, or duplicated version of the portion A1 displayed in region RD. The portion A11 points to or is directed toward the user D within the region RD.

一部の実施態様において、当該部分Ａ１１は当該部分Ａ１の動きまたは軌跡を追従する。一部の実施態様において、当該部分Ａ１１は当該部分Ａ１と同時に移動する。当該ユーザＡは、当該部分Ａ１となっている手を動かすだけで、当該ユーザＡが交流したい領域ＲＤ内の位置を指し示すように、当該部分Ａ１１を制御したり、動かしたりすることができる。一部の実施態様において、当該部分Ａ１１は、グラフィカルオブジェクトまたはアニメーションオブジェクトとして表現または表示されてもよい。 In some embodiments, the portion A11 follows the movement or trajectory of the portion A1. In some embodiments, the portion A11 moves simultaneously with the portion A1. The user A can control or move the part A11 so as to point to a position in the area RD with which the user A wants to interact by simply moving the hand that is the part A1. In some implementations, the portion A11 may be represented or displayed as a graphical or animated object.

図４に示すように、領域ＲＡ内には境界Ａ３がある。当該境界Ａ３は、領域ＲＡ内で領域Ａ３１と領域Ａ３２を定義する。当該領域Ａ３１は当該領域Ａ３２を囲んでいる。当該領域Ａ３１は、インタラクティブ領域と呼ぶ、または定義することができる。本実施態様において、当該インタラクティブ領域Ａ３１はサブ領域Ａ３１１を含む。領域ＲＤ内に延伸される、または再現される当該部分Ａ１は、当該サブ領域Ａ３１１内にある。当該サブ領域Ａ３１１は、ユーザＡとユーザＤの間にある。当該サブ領域Ａ３１１は、領域ＲＡのうち、ユーザＡから見て領域ＲＤの方を向いた位置にある。 As shown in FIG. 4, there is a boundary A3 within the area RA. The boundary A3 defines an area A31 and an area A32 within the area RA. The area A31 surrounds the area A32. The area A31 can be called or defined as an interactive area. In this embodiment, the interactive area A31 includes a sub-area A311. The portion A1 that is stretched or reproduced within the region RD is within the sub-region A311. The sub-area A311 is located between users A and D. The sub-area A311 is located in the area RA at a position facing the area RD when viewed from the user A.

図２、図３、図４の例に示すように、領域ＲＡにおいてユーザＡの部分が延伸される方向によって、ユーザＡの延伸、複製、再現されたバージョンの当該部分が表示される領域が決定されてもよい。したがって、ユーザＡは、ユーザＡの当該部分を対応する方向へ移動または伸長させるだけで、どの領域（および対応するユーザ）と交流するかを決定することができる。例えば、ユーザＡは横方向に部分を伸長させ、表示領域が当該画面Ｓ１上でユーザＡに対して横方向に並んだ、または配置されているユーザと交流することができる。別の例で、ユーザＡは縦方向に部分を伸長させ、表示領域が当該画面Ｓ１上でユーザＡに対して縦方向に並んだ、または配置されているユーザと交流することができる。さらに別の例で、ユーザＡは対角方向に部分を伸長させ、表示領域が当該画面Ｓ１上でユーザＡに対して対角方向に並んだ、または配置されているユーザと交流することができる。 As shown in the examples of FIGS. 2, 3, and 4, the direction in which the portion of user A is stretched in area RA determines the area in which the stretched, duplicated, and reproduced version of user A is displayed. may be done. Therefore, user A can decide which area (and corresponding user) to interact with simply by moving or stretching that part of user A in the corresponding direction. For example, user A can extend the portion laterally and interact with users whose display areas are lined up or placed laterally with respect to user A on the screen S1. In another example, user A can extend a portion vertically and interact with a user whose display area is aligned or arranged vertically with respect to user A on the screen S1. In yet another example, user A can extend the portion diagonally and interact with users whose display areas are lined up or arranged diagonally with respect to user A on the screen S1. .

一部の実施態様において、ユーザは別のユーザとより便利に交流できるように、当該インタラクティブ領域の形状を調整することができる。図５に本発明の一部の実施態様に基づくグループ通話の別の一例を示す概略図を示す。当該グループ通話には、少なくとも４人のユーザ、すなわちユーザＡ、ユーザＢ、ユーザＣ、ユーザＤが参加している。図５に示すように、当該境界Ａ３は、前の例示的実施態様で説明したように、ユーザＡが他のユーザと交流するために利用する領域である当該インタラクティブ領域Ａ３１を定義する。当該インタラクティブ領域Ａ３１はサブ領域Ａ３１１を含む。当該境界Ａ３は、少なくとも境界線ＢＲ１と境界線ＢＲ２を含む。一部の実施態様において、ユーザＡが別のユーザとより便利に交流したい場合、ユーザＡは当該境界線ＢＲ１の位置及び／または当該境界線ＢＲ２の位置を調整し、当該インタラクティブ領域Ａ３１の形状と当該当該サブ領域Ａ３１１の形状を調整することができる。当該境界線ＢＲ１は、当該領域ＲＡに対するユーザＣまたは当該領域ＲＣの方向に対応し、かつ当該領域ＲＡと当該領域ＲＣの間にある。当該境界線ＢＲ２は、当該領域ＲＡに対するユーザＢまたは当該領域ＲＢの方向に対応し、かつ当該領域ＲＡと当該領域ＲＢの間にある。 In some implementations, a user can adjust the shape of the interactive area to more conveniently interact with another user. FIG. 5 shows a schematic diagram illustrating another example of group calling according to some embodiments of the present invention. At least four users, ie, user A, user B, user C, and user D, are participating in the group call. As shown in FIG. 5, the boundary A3 defines the interactive area A31, which is the area used by user A to interact with other users, as described in the previous exemplary embodiment. The interactive area A31 includes a sub-area A311. The boundary A3 includes at least a boundary line BR1 and a boundary line BR2. In some implementations, if user A wants to interact more conveniently with another user, user A adjusts the position of said border line BR1 and/or the position of said border line BR2, and adjusts the shape of said interactive area A31. The shape of the sub-region A311 can be adjusted. The boundary line BR1 corresponds to the direction of the user C or the region RC with respect to the region RA, and is between the region RA and the region RC. The boundary line BR2 corresponds to the direction of the user B or the area RB with respect to the area RA, and is between the area RA and the area RB.

例えば、ユーザＡは、ユーザＡとユーザＣの間にある当該インタラクティブ領域Ａ３１のサブ領域Ａ３１２がより幅広になり、ユーザＡにより近くなるように、当該境界線ＢＲ１をユーザＡに近づけてドラッグまたは移動することができる。これにより、ユーザＡはユーザＡの部分を使用してユーザＣとより交流しやすくなる。ユーザＡは、ユーザＡの当該部分を比較的短い距離だけ伸長し、当該境界線ＢＲ１を越えて当該インタラクティブ領域Ａ３１の当該サブ領域Ａ３１２に到達すればよく、そうするとユーザＣが表示される領域ＲＣにおいて当該部分が延伸、複製または再現される。 For example, user A drags or moves the boundary line BR1 closer to user A so that sub-area A312 of interactive area A31 between users A and C becomes wider and closer to user A. can do. This makes it easier for user A to interact with user C using the user A part. User A only needs to extend the corresponding part of user A by a relatively short distance, cross the boundary line BR1, and reach the corresponding sub-area A312 of the interactive area A31, and then user C will be able to reach the corresponding sub-area A312 in the displayed area RC. The part is stretched, copied or reproduced.

別の一例で、ユーザＡは、ユーザＡとユーザＢの間にある当該インタラクティブ領域Ａ３１のサブ領域Ａ３１３がより幅広になり、ユーザＡにより近くなるように、当該境界線ＢＲ２をユーザＡに近づけてドラッグまたは移動することができる。これにより、ユーザＡはユーザＡの部分を使用してユーザＢとより交流しやすくなる。ユーザＡは、ユーザＡの当該部分を比較的短い距離だけ伸長し、当該境界線ＢＲ２を越えて当該インタラクティブ領域Ａ３１の当該サブ領域Ａ３１２に到達すればよく、そうするとユーザＢが表示される領域ＲＢにおいて当該部分が延伸、複製または再現される。 In another example, user A moves the boundary line BR2 closer to user A so that the sub-area A313 of the interactive area A31 between users A and B becomes wider and closer to user A. Can be dragged or moved. This makes it easier for user A to interact with user B using the user A part. User A only needs to extend the corresponding part of user A by a relatively short distance, cross the boundary line BR2 and reach the corresponding sub-area A312 of the interactive area A31, and then user B will be able to reach the corresponding sub-area A312 of the interactive area A31. The part is stretched, copied or reproduced.

さらに別の一例で、ユーザＡは、ユーザＡとユーザＤの間にある当該インタラクティブ領域Ａ３１のサブ領域Ａ３１１がより幅広になり、ユーザＡにより近くなるように、当該境界線ＢＲ１及び／または境界線ＢＲ２をユーザＡに近づけてドラッグまたは移動することができる。これにより、ユーザＡはユーザＡの部分を使用してユーザＤとより交流しやすくなる。ユーザＡは、ユーザＡの当該部分を比較的短い距離だけ対角方向に伸長し、当該インタラクティブ領域Ａ３１の当該サブ領域Ａ３１１に到達すればよく、そうするとユーザＤが表示される領域ＲＤにおいて当該部分が延伸、複製または再現される。 In yet another example, the user A may select the border line BR1 and/or the border line so that the sub-area A311 of the interactive area A31 between the user A and the user D is wider and closer to the user A. BR2 can be dragged or moved closer to user A. This makes it easier for user A to interact with user D using the user A part. User A only needs to extend the corresponding part of user A in the diagonal direction by a relatively short distance and reach the corresponding sub-area A311 of the corresponding interactive area A31, and then the corresponding part will be displayed in the area RD where user D is displayed. To be stretched, copied or reproduced.

図６に本発明の一部の実施態様に基づくグループ通話の別の一例を示す概略図を示す。一部の実施態様において、当該インタラクティブ領域の外側のみが抽出され、当該画面Ｓ１上に表示される。より具体的に、ユーザＡに対しては、当該境界Ａ３で囲まれた領域のみが当該画面Ｓ１に表示される。ユーザＢ、ユーザＣ、ユーザＤに対しては、当該境界Ｂ３、Ｃ３、Ｄ３で囲まれた領域のみが当該画面Ｓ１に表示される。それにより、交流の臨場感を向上させることができる。例えば、ユーザＡが部分を別のユーザの表示領域に伸長したとき、当該部分はユーザＡの表示領域に表示されない。 FIG. 6 shows a schematic diagram illustrating another example of group calling according to some embodiments of the present invention. In some implementations, only the outside of the interactive area is extracted and displayed on the screen S1. More specifically, for user A, only the area surrounded by the boundary A3 is displayed on the screen S1. For users B, C, and D, only the area surrounded by the boundaries B3, C3, and D3 is displayed on the screen S1. Thereby, the sense of realism of the interaction can be improved. For example, when user A stretches a portion into another user's display area, the portion is not displayed in user A's display area.

図７に本発明の一部の実施態様に基づく通信システム１の構成を示す概略図を示す。通信システム１は、コンテンツを介したインタラクションを伴うライブストリーミングサービスを提供することができる。ここで言う「コンテンツ」とは、コンピュータ装置で再生可能なデジタルコンテンツを指す。当該通信システム１は、ユーザがオンラインで他のユーザとのリアルタイムの交流に参加することを可能にする。通信システム１は、複数のユーザ端末１０と、バックエンドサーバ３０と、ストリーミングサーバ４０とを含む。ユーザ端末１０、バックエンドサーバ３０、及びストリーミングサーバ４０は、ネットワーク９０（例えばインターネットとしてもよい）を介して接続される。バックエンドサーバ３０は、ユーザ端末および／またはストリーミングサーバ４０との間のインタラクションを同期させるサーバとすることができる。一部の実施態様において、バックエンドサーバ３０は、アプリケーション（ＡＰＰ）プロバイダのオリジンサーバとしてもよい。ストリーミングサーバ４０は、ストリーミングデータまたはビデオデータを取り扱う、または提供するためのサーバである。一部の実施態様において、バックエンドサーバ３０とストリーミングサーバ４０は、独立したサーバとしてもよい。一部の実施態様において、バックエンドサーバ３０とストリーミングサーバ４０は、１つのサーバに統合してもよい。一部の実施態様において、ユーザ端末１０は、ライブストリーミングのためのクライアント装置である。一部の実施態様において、ユーザ端末１０は、視聴者、ストリーマー、アンカー、ポッドキャスター、オーディエンス、リスナーなどと呼ばれることがある。ユーザ端末１０、バックエンドサーバ３０、及びストリーミングサーバ４０はそれぞれ情報処理装置の一例である。一部の実施態様において、ストリーミングは、ライブストリーミングまたはビデオ再生とすることができる。一部の実施態様において、ストリーミングは、オーディオストリーミングおよび／またはビデオストリーミングとすることができる。一部の実施態様において、ストリーミングは、オンラインショッピング、トークショー、タレントショー、娯楽イベント、スポーツイベント、音楽ビデオ、映画、コメディ、コンサート、グループ通話、電話会議などのコンテンツを含むことができる。 FIG. 7 shows a schematic diagram showing the configuration of a communication system 1 based on some embodiments of the present invention. The communication system 1 can provide live streaming services with interaction through content. "Content" here refers to digital content that can be played back on a computer device. The communication system 1 allows users to participate in real-time interactions with other users online. The communication system 1 includes a plurality of user terminals 10, a backend server 30, and a streaming server 40. The user terminal 10, the backend server 30, and the streaming server 40 are connected via a network 90 (for example, the Internet may be used). Backend server 30 may be a server that synchronizes interactions between user terminals and/or streaming server 40 . In some implementations, backend server 30 may be an application (APP) provider's origin server. Streaming server 40 is a server for handling or providing streaming data or video data. In some implementations, backend server 30 and streaming server 40 may be independent servers. In some implementations, backend server 30 and streaming server 40 may be integrated into one server. In some implementations, user terminal 10 is a client device for live streaming. In some implementations, user terminal 10 may be referred to as a viewer, streamer, anchor, podcaster, audience, listener, etc. The user terminal 10, the backend server 30, and the streaming server 40 are each examples of information processing devices. In some implementations, streaming can be live streaming or video playback. In some implementations, streaming can be audio streaming and/or video streaming. In some implementations, streaming can include content such as online shopping, talk shows, talent shows, entertainment events, sporting events, music videos, movies, comedy, concerts, group calls, conference calls, and the like.

図８に本発明の一部の実施態様に基づく通信システムの例示的な機能構成図を示す。図８において、ネットワーク９０は省略されている。 FIG. 8 shows an exemplary functional block diagram of a communication system according to some embodiments of the present invention. In FIG. 8, the network 90 is omitted.

当該バックエンドサーバ３０は、メッセージユニット３２を含む。当該メッセージユニット３２は、ユーザ端末からデータまたは情報を受け取り、それらデータを処理及び／または保存し、当該データをユーザ端末に送信するように構成される。一部の実施態様において、当該メッセージユニット３２は当該バックエンドサーバ３０とは別個のユニットとしてもよい。 The backend server 30 includes a message unit 32 . The message unit 32 is configured to receive data or information from a user terminal, process and/or store the data, and send the data to the user terminal. In some implementations, the message unit 32 may be a separate unit from the backend server 30.

当該ストリーミングサーバ４０は、データ受信装置４００と、データ送信装置４０２を含む。当該データ受信装置４００は、さまざまなユーザ端末からストリーミングデータやビデオデータなどのデータまたは情報を受信するように構成される。当該データ送信装置４０２は、さまざまなユーザ端末にストリーミングデータやビデオデータなどのデータまたは情報を送信するように構成される。 The streaming server 40 includes a data receiving device 400 and a data transmitting device 402. The data receiving device 400 is configured to receive data or information such as streaming data or video data from various user terminals. The data transmission device 402 is configured to transmit data or information, such as streaming data or video data, to various user terminals.

当該ユーザ端末１０Ａは、ユーザＡにより操作されるユーザ端末であってもよい。当該ユーザ端末１０Ａは、カメラ７００と、レンダラー７０２と、ディスプレイ７０４と、エンコーダー７０６と、デコーダー７０８と、結果送信装置７１０と、マッティングユニット７１２と、オブジェクト認識ユニット７１４を含む。 The user terminal 10A may be a user terminal operated by user A. The user terminal 10A includes a camera 700, a renderer 702, a display 704, an encoder 706, a decoder 708, a result transmitting device 710, a matting unit 712, and an object recognition unit 714.

当該カメラ７００は、任意の種類の映像撮影装置である、またはこれを含むことができる。当該カメラ７００は、例えばユーザＡの、ビデオデータを撮影するように構成される。 The camera 700 may be or include any type of video capture device. The camera 700 is configured to capture video data of user A, for example.

当該レンダラー７０２は、当該カメラ７００からのビデオデータ（ユーザＡのビデオデータ）を受信、当該デコーダー７０８からのビデオデータ（ユーザＢからのビデオデータを含んでもよい）を受信して、当該ディスプレイ７０４に表示するレンダリングされた映像（ユーザＡとユーザＢが表示されているグループ通話を表示する映像など）を生成するように構成される。 The renderer 702 receives video data from the camera 700 (user A's video data), receives video data from the decoder 708 (which may include video data from user B), and displays the video data on the display 704. It is configured to generate rendered video for display (such as video displaying a group call in which User A and User B are displayed).

当該ディスプレイ７０４は、当該レンダラー７０２からのレンダリングされた映像を表示するように構成される。一部の実施態様において、当該ディスプレイ７０４はユーザ端末１０Ａ上の画面であってもよい。 The display 704 is configured to display rendered video from the renderer 702. In some implementations, the display 704 may be a screen on the user terminal 10A.

当該エンコーダー７０６は当該カメラ７００からのビデオデータをエンコードし、エンコードした当該ビデオデータを当該ストリーミングサーバ４０の当該データ受信装置４００に送信するように構成される。エンコードしたデータはストリーミングデータとして送信されてもよい。 The encoder 706 is configured to encode video data from the camera 700 and send the encoded video data to the data receiving device 400 of the streaming server 40 . The encoded data may be transmitted as streaming data.

当該デコーダー７０８は、当該ストリーミングサーバ４０の当該データ送信装置４０２からビデオデータまたはストリーミングデータ（ユーザＢからのビデオデータを含んでもよい）を受信し、それらをデコードしてデコードしたビデオデータとし、当該デコードしたビデオデータを当該レンダラー７０２に送信してレンダリングさせるように構成される。 The decoder 708 receives video data or streaming data (which may include video data from user B) from the data transmission device 402 of the streaming server 40, decodes the video data, converts it into decoded video data, and converts it into decoded video data. The rendered video data is sent to the renderer 702 for rendering.

当該マッティングユニット７１２は、当該カメラ７００からの当該ビデオデータ（ユーザＡのビデオデータ）にマッティング処理（画像マッティングまたは映像マッティング）を実行するように構成される。当該マッティング処理は、輪郭認識処理、画像比較処理、移動体検出処理、及び／または切り出し処理が含まれてもよい。当該マッティング処理は、定色マッティング（ｃｏｎｓｔａｎｔ－ｃｏｌｏｒｍａｔｔｉｎｇ）、差分マッティング（ｄｉｆｆｅｒｅｎｃｅｍａｔｔｉｎｇ）、自然画像マッティング（ｎａｔｕｒａｌｉｍａｇｅｍａｔｔｉｎｇ）を含む手法で実行されてもよい。また、当該マッティング処理に関わるアルゴリズムは、ベイジアンマッティング（Ｂａｙｅｓｉａｎｍａｔｔｉｎｇ）、ポアソンマッティング（Ｐｏｉｓｓｏｎｍａｔｔｉｎｇ）、またはロバストマッティング（Ｒｏｂｕｓｔｍａｔｔｉｎｇ）を含んでもよい。一部の実施態様において、画像比較プロセスは、初期またはデフォルトの背景画像と現在のまたはライブ画像とを定期的に比較し、インタラクティブ領域におけるユーザＡの部分を検出する。 The matting unit 712 is configured to perform a matting process (image matting or video matting) on the video data (user A's video data) from the camera 700. The matting process may include a contour recognition process, an image comparison process, a moving object detection process, and/or a cutting process. The matting process may be performed using techniques including constant-color matting, difference matting, and natural image matting. Further, the algorithm related to the matting process may include Bayesian matting, Poisson matting, or Robust matting. In some implementations, the image comparison process periodically compares the initial or default background image and the current or live image to detect user A's portion in the interactive area.

例えば、当該マッティングユニット７１２は、カメラ７００からユーザＡのビデオデータを受け取る。当該ビデオデータは、図２、図３、図４、図５で例に挙げて上述したようにインタラクティブ領域を含むことができる。一部の実施態様において、当該マッティングユニット７１２は、ビデオデータ中のユーザＡの輪郭を検出または抽出するためのマッティング処理を実行する。一部の実施態様において、当該マッティングユニット７１２は、当該インタラクティブ領域内のユーザＡの部分（ユーザＡの手や、オブジェクトを持っているユーザＡの手など）を検出または抽出するマッティング処理を実行する。一部の実施態様において、当該マッティングユニット７１２は、ユーザＡのビデオデータから当該インタラクティブ領域外側の領域または部分を削除する切り出し処理を実行する。一部の実施態様において、当該マッティングユニット７１２は、ユーザＡの当該部分が検出された当該インタラクティブ領域内の位置を検出、認識、または判断する。一部の実施態様において、切り出し処理の前に輪郭認識処理または画像比較処理を実行し、当該インタラクティブ領域におけるユーザＡの当該部分の検出精度を向上させてもよい。 For example, the matting unit 712 receives user A's video data from the camera 700. The video data may include an interactive area as described above by way of example in FIGS. 2, 3, 4, and 5. In some implementations, the matting unit 712 performs a matting process to detect or extract the contour of user A in the video data. In some implementations, the matting unit 712 performs a matting process that detects or extracts a portion of user A within the interactive area (such as user A's hand or user A's hand holding an object). Execute. In some implementations, the matting unit 712 performs a cropping process that removes areas or portions outside the interactive area from the user A's video data. In some implementations, the matting unit 712 detects, recognizes, or determines the location within the interactive area where the portion of user A is detected. In some embodiments, a contour recognition process or an image comparison process may be performed before the cutting process to improve the accuracy of detecting the part of the user A in the interactive area.

一部の実施態様において、当該インタラクティブ領域、及び対応する境界または境界線は、当該ユーザ端末１０Ａのプロセッサ（図示しない）または当該グループ通話を有効にしているアプリケーションにより定義されてもよい。一部の実施態様において、当該インタラクティブ領域と、対応する境界または境界線は、ユーザＡにより当該ユーザ端末１０ＡのＵＩ（ユーザインターフェイス）ユニット（図示しない）で決定されてもよい。一部の実施態様において、当該マッティングユニット７１２は、領域ＲＡ内の境界線を越えたユーザＡの部分を検出することにより、当該インタラクティブ領域内のユーザＡの当該部分（またはユーザＡのライブ映像の当該部分）を検出または判断する。当該領域ＲＡ内の当該境界線は、例えば、図５における当該境界線ＢＲ１または当該境界線ＢＲ２であってもよい。 In some implementations, the interactive area and corresponding boundaries or boundaries may be defined by a processor (not shown) of the user terminal 10A or an application enabling the group call. In some implementations, the interactive area and the corresponding boundaries or boundaries may be determined by the user A in a UI (user interface) unit (not shown) of the user terminal 10A. In some implementations, the matting unit 712 detects the portion of user A within the interactive area (or the live video of user A) by detecting the portion of user A beyond the border within the area RA. detect or determine the relevant part of the The boundary line within the area RA may be the boundary line BR1 or the boundary line BR2 in FIG. 5, for example.

当該オブジェクト認識ユニット７１４は、当該マッティングユニット７１２からの出力データに対してオブジェクト認識処理を実行するように構成される。当該出力データは、ユーザＡの検出された部分または抽出された部分（ユーザＡの手や、オブジェクトを持っているユーザＡの手など）を含むことができる。当該オブジェクト認識ユニット７１４は、当該ユーザＡの検出された部分が任意の所定のパターン、オブジェクト及び／またはジェスチャを含むか否かを判定するオブジェクト認識処理を実行する。一部の実施態様において、当該オブジェクト認識処理は、テンプレートマッチング、パターンマッチング、輪郭マッチング、ジェスチャ認識、皮膚認識、外形マッチング、色または形状マッチング、および特徴に基づくマッチングなどの技術を含むことができる。一部の実施態様において、当該オブジェクト認識ユニット７１４は、当該ユーザＡの検出された部分（またはその一部）と所定のパターンのセットとのマッチング相関を計算し、当該ユーザＡの検出された部分内で任意のパターンが一致する、または認識されるか否かを判断する。一部の実施態様において、当該オブジェクト認識ユニット７１４は、ユーザＡの当該部分が検出された当該インタラクティブ領域内の位置を検出、認識、または判断する。一部の実施態様において、当該オブジェクト認識処理は、切り出し処理が行われていない当該マッティングユニット７１２からの画像または映像に対して行われてもよく、それにより、当該オブジェクト認識処理の精度を向上させることができる。一部の実施態様において、当該オブジェクト認識ユニット７１４は当該インタラクティブ領域内でユーザＡの当該部分の画像または映像を認識及び抽出し、当該結果送信装置７１０に抽出された当該画像または映像を送信する。 The object recognition unit 714 is configured to perform object recognition processing on the output data from the matting unit 712. The output data may include a detected part or an extracted part of user A (user A's hand, user A's hand holding an object, etc.). The object recognition unit 714 performs an object recognition process to determine whether the detected portion of the user A includes any predetermined patterns, objects, and/or gestures. In some implementations, the object recognition process can include techniques such as template matching, pattern matching, contour matching, gesture recognition, skin recognition, shape matching, color or shape matching, and feature-based matching. In some implementations, the object recognition unit 714 calculates matching correlations between the detected portions of the user A (or portions thereof) and a predetermined set of patterns, and Determine whether any pattern within is matched or recognized. In some implementations, the object recognition unit 714 detects, recognizes, or determines the location within the interactive area where the portion of user A is detected. In some embodiments, the object recognition process may be performed on images or videos from the matting unit 712 that have not been subjected to the cropping process, thereby improving the accuracy of the object recognition process. can be done. In some embodiments, the object recognition unit 714 recognizes and extracts the image or video of the part of the user A within the interactive area, and transmits the extracted image or video to the result transmitting device 710.

当該結果送信装置７１０は、当該オブジェクト認識ユニット７１４の出力結果（当該マッティングユニット７１２の出力を含んでもよい）を当該バックエンドサーバ３０の当該メッセージユニット３２に送信するように構成される。一部の実施態様において、当該結果送信装置７１０は、当該メッセージユニット３２を介して送信する代わりに、当該結果受信装置８１０に当該出力を直接送信してもよい。 The result sending device 710 is configured to send the output results of the object recognition unit 714 (which may include the output of the matting unit 712) to the message unit 32 of the backend server 30. In some implementations, the results sending device 710 may send the output directly to the results receiving device 810 instead of sending it via the message unit 32.

当該ユーザ端末１０Ｂは、ユーザＢにより操作されるユーザ端末であってもよい。当該ユーザ端末１０Ｂは、カメラ８００と、レンダラー８０２と、ディスプレイ８０４と、エンコーダー８０６と、デコーダー８０８と、結果受信装置８１０と、画像処理装置８１２を含む。 The user terminal 10B may be a user terminal operated by user B. The user terminal 10B includes a camera 800, a renderer 802, a display 804, an encoder 806, a decoder 808, a result receiving device 810, and an image processing device 812.

当該カメラ８００は、任意の種類の映像撮影装置である、またはこれを含むことができる。当該カメラ８００は、例えばユーザＢの、ビデオデータを撮影するように構成される。当該カメラ８００は、撮影したビデオデータを当該エンコーダー８０６、当該レンダラー８０２、及び／または当該画像処理装置８１２に送信する。 The camera 800 may be or include any type of video capture device. The camera 800 is configured to capture video data of user B, for example. The camera 800 sends captured video data to the encoder 806, the renderer 802, and/or the image processing device 812.

当該レンダラー８０２は、当該カメラ８００からのビデオデータ（例：ユーザＢのビデオデータ）を受信、当該デコーダー８０８からのビデオデータ（ユーザＡなど、別のユーザからのビデオデータを含んでもよい）を受信、当該画像処理装置８１２の出力データを受信して、当該ディスプレイ８０４に表示するレンダリングされた映像（ユーザＡとユーザＢが表示されているグループ通話を表示する映像など）を生成するように構成される。 The renderer 802 receives video data from the camera 800 (e.g., user B's video data), and receives video data from the decoder 808 (which may include video data from another user, such as user A). , configured to receive output data of the image processing device 812 and generate rendered video for display on the display 804 (such as video displaying a group call in which User A and User B are displayed). Ru.

当該ディスプレイ８０４は、当該レンダラー８０２からのレンダリングされた映像を表示するように構成される。一部の実施態様において、当該ディスプレイ８０４は当該ユーザ端末１０Ｂ上の画面であってもよい。 The display 804 is configured to display rendered video from the renderer 802. In some implementations, the display 804 may be a screen on the user terminal 10B.

当該エンコーダー８０６は、当該カメラ８００からの当該ビデオデータ、及び／または当該画像処理装置８１２からのビデオデータを含む、データをエンコードするように構成される。当該エンコーダー８０６は、エンコードした当該ビデオデータを当該ストリーミングサーバ４０の当該データ受信装置４００に送信する。エンコードしたデータはストリーミングデータとして送信されてもよい。 The encoder 806 is configured to encode data, including the video data from the camera 800 and/or the video data from the image processing device 812. The encoder 806 transmits the encoded video data to the data receiving device 400 of the streaming server 40 . The encoded data may be transmitted as streaming data.

当該デコーダー８０８は、当該ストリーミングサーバ４０の当該データ送信装置４０２からビデオデータまたはストリーミングデータ（ユーザＡからのビデオデータを含んでもよい）を受信し、それらをデコードしてデコードしたビデオデータとし、当該デコードしたビデオデータを当該レンダラー８０２に送信してレンダリングさせるように構成される。 The decoder 808 receives video data or streaming data (which may include video data from user A) from the data transmitting device 402 of the streaming server 40, decodes the video data, converts the data into decoded video data, and converts the data into decoded video data. The rendered video data is sent to the renderer 802 for rendering.

当該結果受信装置８１０は、当該バックエンドサーバ３０の当該メッセージユニット３２から出力データを受信し、当該データを当該画像処理装置８１２に送信するように構成される。当該メッセージユニット３２からの当該出力データは、当該マッティングユニット７１２と当該オブジェクト認識ユニット７１４からのデータまたは情報を含む。一部の実施態様において、当該メッセージユニット３２からの当該出力データは、当該オブジェクト認識ユニット７１４により実行された当該オブジェクト認識処理の結果を含む。例えば、当該メッセージユニット３２からの当該出力データは、一致した、または認識されたパターン、オブジェクトまたはジェスチャについての情報を含んでもよい。一部の実施態様において、当該メッセージユニット３２からの当該出力データは、（当該ユーザ端末１０Ａ上の）当該インタラクティブ領域内における位置に関する情報を含み、そのうち、ユーザＡの当該部分が、例えば、当該ユーザ端末１０Ａの当該マッティングユニット７１２、または当該オブジェクト認識ユニット７１４により、検出される。一部の実施態様において、当該メッセージユニット３２からの当該出力データは、当該インタラクティブ領域内で検出／認識されたユーザＡの部分の映像または画像を含む。 The result receiving device 810 is configured to receive output data from the message unit 32 of the backend server 30 and send the data to the image processing device 812 . The output data from the message unit 32 includes data or information from the matting unit 712 and the object recognition unit 714. In some implementations, the output data from the message unit 32 includes the results of the object recognition processing performed by the object recognition unit 714. For example, the output data from the message unit 32 may include information about matched or recognized patterns, objects or gestures. In some implementations, the output data from the message unit 32 includes information regarding the position within the interactive area (on the user terminal 10A), of which the part of user A is e.g. It is detected by the matting unit 712 or object recognition unit 714 of the terminal 10A. In some implementations, the output data from the message unit 32 includes a video or image of the portion of user A detected/recognized within the interactive area.

当該画像処理装置８１２は、当該カメラ８００からのビデオデータ、及び／または当該結果受信装置８１０からのデータまたは情報を受信するように構成される。一部の実施態様において、当該画像処理装置８１２は、当該結果受信装置８１０から受信したデータまたは情報に基づいて、当該カメラ８００から受信した当該ビデオデータに対して画像処理または映像処理を実行する。例えば、当該オブジェクト認識ユニット７１４により実行された当該オブジェクト認識処理が（当該ユーザ端末１０Ａの画面上の当該インタラクティブ領域にある）ユーザＡの当該部分に所定のパターンを正常に認識したことを当該結果受信装置８１０から受信した当該データが示す場合、当該画像処理装置８１２は、当該カメラ８００から受信した当該ビデオデータ上に、当該所定のパターンに対応する特殊効果を含める、レンダリングする、重ねることができる。重ねられた当該映像がその後当該レンダラー８０２に送信され、その後当該ユーザ端末８０４上に表示されてもよい。一部の実施態様において、当該特殊効果データは、当該ユーザ端末１０Ｂ（図示しない）上のストレージに保存されてもよい。 The image processing device 812 is configured to receive video data from the camera 800 and/or data or information from the results receiving device 810. In some implementations, the image processing device 812 performs image processing or video processing on the video data received from the camera 800 based on data or information received from the results receiving device 810. For example, receiving a result indicating that the object recognition process executed by the object recognition unit 714 has successfully recognized a predetermined pattern on the part of the user A (in the interactive area on the screen of the user terminal 10A). If the data received from device 810 indicates, the image processing device 812 may include, render, or overlay special effects corresponding to the predetermined pattern on the video data received from the camera 800. The superimposed video may then be sent to the renderer 802 and then displayed on the user terminal 804. In some implementations, the special effects data may be stored in storage on the user terminal 10B (not shown).

一部の実施態様において、当該メッセージユニット３２は、当該マッティングユニット７１２からのデータ及び／または当該オブジェクト認識ユニット７１４からのデータに基づいて、当該メッセージユニット３２の出力データの宛先を決定する。一部の実施態様において、当該メッセージユニット３２は、当該インタラクティブ領域内で検出されたユーザＡ当該部分の位置に基づいて、ユーザＡの当該部分を延伸、複製または再現する領域を決定する。 In some implementations, the message unit 32 determines the destination of its output data based on data from the matting unit 712 and/or data from the object recognition unit 714. In some implementations, the message unit 32 determines an area in which to stretch, duplicate, or reproduce the portion of user A based on the detected position of the portion of user A within the interactive area.

例えば、図５に示すように、当該マッティングユニット７１２（または当該オブジェクト認識ユニット７１４）によりユーザＡの当該部分が検出された当該インタラクティブ領域Ａ３１の位置が、当該サブ領域Ａ３１２内にある場合、当該メッセージユニット３２は、当該メッセージユニット３２の当該出力データを送信する宛先にユーザＣの当該ユーザ端末を決定することができる。その後ユーザＡの当該部分が領域ＲＣ内に延伸または複製／再現／表示され、これはユーザＣの当該ユーザ端末の画像処理装置により実行されてもよい。 For example, as shown in FIG. 5, if the position of the interactive area A31 where the relevant part of user A is detected by the relevant matting unit 712 (or the relevant object recognition unit 714) is within the relevant sub-region A312, the relevant The message unit 32 can determine the user terminal of the user C as the destination to which the output data of the message unit 32 is to be sent. The relevant part of user A is then stretched or copied/reproduced/displayed within the region RC, which may be performed by the image processing device of the relevant user terminal of user C.

別の一例において、当該マッティングユニット７１２によりユーザＡの当該部分検出された当該インタラクティブ領域Ａ３１の位置が、当該サブ領域Ａ３１１内にある場合、当該メッセージユニット３２は、当該メッセージユニット３２の当該出力データを送信する宛先にユーザＤの当該ユーザ端末を決定することができる。その後ユーザＡの当該部分が領域ＲＤ内に延伸または複製／再現／表示され、これはユーザＤの当該ユーザ端末における画像処理装置及び／またはレンダラーとの連携により実行されてもよい。 In another example, when the position of the interactive area A31 where the part of user A is detected by the matting unit 712 is within the sub-area A311, the message unit 32 outputs the output data of the message unit 32. The user terminal of user D can be determined as the destination for transmitting the message. Thereafter, the portion of user A is stretched or duplicated/reproduced/displayed within region RD, and this may be performed in cooperation with an image processing device and/or a renderer in user D's user terminal.

さらに別の一例において、当該マッティングユニット７１２によりユーザＡの当該部分が検出された当該インタラクティブ領域Ａ３１の位置が、当該サブ領域Ａ３１３内にある場合、当該メッセージユニット３２は、当該メッセージユニット３２の当該出力データを送信する宛先にユーザＢの当該ユーザ端末を決定することができる。その後ユーザＡの当該部分が領域ＲＢ内に延伸または複製／再現／表示され、これはユーザＢの当該ユーザ端末における画像処理装置及び／またはレンダラーとの連携により実行されてもよい。 In yet another example, if the position of the interactive area A31 in which the part of user A is detected by the matting unit 712 is within the sub-area A313, the message unit 32 The user terminal of user B can be determined as the destination for transmitting the output data. The portion of user A is then stretched or duplicated/reproduced/displayed within the region RB, and this may be performed in cooperation with an image processing device and/or a renderer in the user terminal of user B.

一部の実施態様において、当該メッセージユニット３２の当該出力データは、領域ＲＡの当該インタラクティブ領域で検出されたユーザＡの当該部分の画像または映像を含んでもよい。当該画像処理装置８１２は、その後、当該カメラ８００から受信したユーザＢの当該映像上にユーザＡの当該部分を重ねる、複製する、または再現することができる。この方法において、当該インタラクティブ領域内のユーザＡの当該部分は、グラフィカルオブジェクトまたはアニメーションオブジェクトとして表現として表現されることなく、当該領域Ｂへ延伸されてもよい。 In some embodiments, the output data of the message unit 32 may include an image or video of the part of user A detected in the interactive area of area RA. The image processing device 812 may then overlay, duplicate, or reproduce the portion of user A on the video of user B received from the camera 800. In this way, the portion of user A within the interactive area may be extended to area B without being represented as a graphical or animated object.

一部の実施態様において、当該画像処理装置８１２は、当該デコーダー８０８を通じてユーザＡの画像またはビデオデータを受信した後、当該メッセージユニット３２からの情報（当該インタラクティブ領域内で検出されたユーザＡの当該部分に関する範囲、外形、または輪郭情報を含んでもよい）を利用して、当該カメラ８００から受信したユーザＢの映像上に、当該インタラクティブ領域内のユーザＡの当該部分を重ねる、複製する、または再現することができる。この方法において、当該インタラクティブ領域内のユーザＡの当該部分は、グラフィカルオブジェクトまたはアニメーションオブジェクトとして表現として表現されることなく、当該領域Ｂへ延伸されてもよい。 In some implementations, after receiving the image or video data of user A through the decoder 808, the image processing device 812 receives information from the message unit 32 (such as superimposing, duplicating, or reproducing the portion of user A in the interactive area on the video of user B received from the camera 800; can do. In this way, the portion of user A within the interactive area may be extended to area B without being represented as a graphical or animated object.

一部の実施態様において、当該マッティングユニット７１２及び／または当該オブジェクト認識ユニット７１４は、当該ユーザ端末１０Ａ内に実装されなくてもよい。例えば、当該マッティングユニット７１２と当該オブジェクト認識ユニット７１４は、当該バックエンドサーバ３０または当該ストリーミングサーバ４０内に実装されてもよい。 In some implementations, the matting unit 712 and/or the object recognition unit 714 may not be implemented within the user terminal 10A. For example, the matting unit 712 and the object recognition unit 714 may be implemented within the backend server 30 or the streaming server 40.

図９に本発明の一部の実施態様に基づく通信システムの動作を示す例示的なシーケンス図を示す。一部の実施態様において、図９に、ユーザ（例えば、ユーザＡ）の部分が、どのように別のユーザ（例えば、ユーザＢ）が表示されている領域に延伸されるかを示す。 FIG. 9 shows an exemplary sequence diagram illustrating the operation of a communication system according to some embodiments of the present invention. In some implementations, FIG. 9 shows how a portion of a user (eg, user A) is extended into an area where another user (eg, user B) is displayed.

工程Ｓ２００において、当該ユーザ端末１０Ａの当該カメラ７００は、ユーザＡの当該ビデオデータを当該ユーザ端末１０Ａの当該マッティングユニット７１２に送信する。 In step S200, the camera 700 of the user terminal 10A transmits the video data of user A to the matting unit 712 of the user terminal 10A.

工程Ｓ２０２において、当該マッティングユニット７１２は、当該ユーザ端末１０Ａの画面上の当該インタラクティブ領域内でユーザＡの部分を検出する。この検出は、マッティングプロセス及び／またはクロッピングプロセスを含んでもよい。一部の実施態様において、当該マッティングユニット７１２は、ユーザＡの当該部分が検出された当該インタラクティブ領域内の位置を判断する。 In step S202, the matting unit 712 detects the part of the user A within the interactive area on the screen of the user terminal 10A. This detection may include a matting process and/or a cropping process. In some implementations, the matting unit 712 determines the location within the interactive area where the portion of user A is detected.

工程Ｓ２０４において、当該ユーザ端末１０Ａの当該オブジェクト認識ユニット７１４は、当該マッティングユニット７１２から出力データを受信し、当該マッティングユニット７１２の当該出力に対してオブジェクト認識処理を実行して、当該インタラクティブ領域内で検出されたユーザＡの当該部分に任意の所定のパターン、ジェスチャまたはオブジェクトが認識されるか否かを判断する。一部の実施態様において、当該オブジェクト認識処理は、マッチング処理、ジェスチャ認識処理及び／または皮膚認識処理を含んでもよい。 In step S204, the object recognition unit 714 of the user terminal 10A receives the output data from the matting unit 712, performs object recognition processing on the output of the matting unit 712, and converts the interactive area It is determined whether any predetermined pattern, gesture, or object is recognized in the part of user A detected within. In some implementations, the object recognition process may include a matching process, a gesture recognition process, and/or a skin recognition process.

工程Ｓ２０６において、当該オブジェクト認識ユニット７１４が所定のパターン、ジェスチャまたはオブジェクトを認識すると、当該オブジェクト認識ユニット７１４は当該所定のパターン、ジェスチャまたはオブジェクトについて位置や大きさなどの関連情報を収集し、データの送信先となる宛先を決定する。 In step S206, when the object recognition unit 714 recognizes a predetermined pattern, gesture, or object, the object recognition unit 714 collects related information such as the position and size of the predetermined pattern, gesture, or object, and collects related information such as the position and size of the predetermined pattern, gesture, or object. Decide on the destination to send to.

工程Ｓ２０８において、当該オブジェクト認識ユニット７１４の当該出力は、当該ユーザ端末１０Ａの当該結果送信装置７１０を介して当該バックエンドサーバ３０の当該メッセージユニット３２に送信される。 In step S208, the output of the object recognition unit 714 is sent to the message unit 32 of the backend server 30 via the result transmission device 710 of the user terminal 10A.

工程Ｓ２１０において、当該メッセージユニット３２は、当該ユーザ端末１０Ａからのデータに含まれる、当該インタラクティブ領域内のユーザＡの当該部分の位置に関する情報に基づいて、当該ユーザ端末１０Ａからのデータを送信する宛先を決定する。当該情報は、例えば工程Ｓ２０６で決定されてもよい。 In step S210, the message unit 32 determines the destination to which the data from the user terminal 10A is sent based on the information regarding the position of the part of the user A in the interactive area, which is included in the data from the user terminal 10A. Determine. The information may be determined in step S206, for example.

工程Ｓ２１１において、当該メッセージユニット３２は、当該ユーザ端末１０Ａからの当該データを当該ユーザ端末１０Ｂの当該結果受信装置８１０に送信する（当該メッセージユニット３２が宛先をユーザＢまたは領域ＲＢと決定した例示的なシナリオの場合）。 In step S211, the message unit 32 transmits the data from the user terminal 10A to the result receiving device 810 of the user terminal 10B (in the exemplary case where the message unit 32 determines the destination as user B or region RB). scenario).

工程Ｓ２１２において、当該結果受信装置８１０は受信した当該データを当該ユーザ端末１０Ｂの当該画像処理装置８１２に送信する。 In step S212, the result receiving device 810 transmits the received data to the image processing device 812 of the user terminal 10B.

工程Ｓ２１４において、当該画像処理装置８１２は、ユーザＡの検出された当該部分（または領域ＲＡの当該インタラクティブ領域内にある、ユーザＡの検出された当該部分の一部）を、ユーザＢの当該ビデオデータ上に重畳する、または重ね合わせる。一部の実施態様において、ユーザＡの検出された当該部分の当該画像またはビデオデータが、当該ストリーミングサーバ４０を介して当該ユーザ端末１０Ｂに送信される。一部の実施態様において、ユーザＡの検出された当該部分の当該画像またはビデオデータが、当該メッセージユニット３２を介して当該ユーザ端末１０Ｂに送信される。ユーザＢの当該画像またはビデオデータは、当該ユーザ端末１０Ｂの当該カメラ８００から当該画像処理装置８１２に送信される。 In step S214, the image processing device 812 converts the detected portion of user A (or a portion of the detected portion of user A within the interactive area of area RA) into the video of user B. Overlay or superimpose on data. In some implementations, the image or video data of the detected portion of user A is sent to the user terminal 10B via the streaming server 40. In some implementations, the image or video data of the detected portion of user A is sent to the user terminal 10B via the message unit 32. The image or video data of user B is transmitted from the camera 800 of the user terminal 10B to the image processing device 812.

工程Ｓ２１６において、当該画像処理装置８１２は処理された当該画像またはビデオデータをレンダリングのため当該ユーザ端末１０Ｂの当該レンダラー８０２に送信する。例えば、処理された当該画像またはビデオデータは、当該ユーザ端末１０Ｂの当該デコーダー８０８からのビデオデータ及び／または当該カメラ８００からのビデオデータと共にレンダリングされてもよい。 In step S216, the image processing device 812 sends the processed image or video data to the renderer 802 of the user terminal 10B for rendering. For example, the processed image or video data may be rendered with video data from the decoder 808 of the user terminal 10B and/or video data from the camera 800.

工程Ｓ２１８において、レンダリングされた当該ビデオデータが、当該ユーザ端末１０Ｂの画面上に表示するため、当該ユーザ端末１０Ｂの当該ディスプレイ８０４に送信される。 In step S218, the rendered video data is sent to the display 804 of the user terminal 10B for display on the screen of the user terminal 10B.

工程Ｓ２２０において、当該画像処理装置８１２は、処理された当該画像またはビデオデータをエンコーディングプロセスのため当該ユーザ端末１０Ｂの当該エンコーダー８０６に送信する。 In step S220, the image processing device 812 sends the processed image or video data to the encoder 806 of the user terminal 10B for an encoding process.

工程Ｓ２２２において、エンコーディングされた当該ビデオデータが、当該ストリーミングサーバ４０に送信される。 In step S222, the encoded video data is sent to the streaming server 40.

工程Ｓ２２４において、当該ストリーミングサーバ４０はエンコードされた当該ビデオデータを（当該ユーザ端末１０Ｂから）デコーディングプロセスのため当該ユーザ端末１０Ａの当該デコーダー７０８に送信する。 In step S224, the streaming server 40 sends the encoded video data (from the user terminal 10B) to the decoder 708 of the user terminal 10A for a decoding process.

工程Ｓ２２６において、デコーディングされた当該ビデオデータが、レンダリングプロセスのため当該ユーザ端末１０Ａの当該レンダラー７０２に送信される。 In step S226, the decoded video data is sent to the renderer 702 of the user terminal 10A for a rendering process.

工程Ｓ２２８において、レンダリングされた当該ビデオデータが、当該ユーザ端末１０Ａの画面上に表示するため当該ディスプレイ８０４に送信される。 In step S228, the rendered video data is sent to the display 804 for display on the screen of the user terminal 10A.

上述の例示的な工程またはステップは、連続的にまたは周期的に実行されてもよい。例えば、当該マッティングユニット７１２は、当該インタラクティブ領域内のユーザＡの部分を連続的にまたは周期的に検出する。当該オブジェクト認識ユニット７１４は、当該インタラクティブ領域内のユーザＡの当該部分に対して連続的にまたは周期的に認識処理を実行する。当該メッセージユニット３２は、当該ユーザ端末１０Ａから受信した当該データを送信する宛先を連続的にまたは周期的に判断する。当該ユーザ端末１０Ｂの当該画像処理装置８１２は、当該メッセージユニット３２から受信した情報に基づいて連続的にまたは周期的に重畳または重ね合わせのプロセスを実行し、領域ＲＢ内の延伸または再現／複製されたユーザＡの当該部分が、当該領域ＲＡ内のユーザＡの当該部分と同期して移動するように確約する。一部の実施態様において、当該ユーザ端末１０ＢはＣＰUやGＰU等の処理ユニットを備え、当該領域ＲＢ内のユーザＡの延伸または再現された当該部分が、ユーザＢの画像または映像に触れるか否かを判断する。この判断の結果は、当該画像処理装置８１２が当該領域ＲＢ内に特殊効果を含めるか否かを決定するために使用することができる。 The example processes or steps described above may be performed continuously or periodically. For example, the matting unit 712 continuously or periodically detects user A's part within the interactive area. The object recognition unit 714 continuously or periodically performs recognition processing on the part of the user A within the interactive area. The message unit 32 continuously or periodically determines the destination to which the data received from the user terminal 10A is to be sent. The image processing device 812 of the user terminal 10B continuously or periodically performs the superimposition or superimposition process based on the information received from the message unit 32, and performs the process of superimposing or superimposing the stretched or reproduced/duplicated image in the area RB. ensure that that part of user A moved in synchronization with the part of user A in the area RA. In some embodiments, the user terminal 10B includes a processing unit such as a CPU or a GPU, and determines whether the stretched or reproduced portion of the user A in the region RB touches the image or video of the user B. to judge. The result of this determination can be used by the image processing device 812 to determine whether or not to include special effects in the region RB.

本発明は電話会議やグループ通話をより便利に、楽しく、またはインタラクティブにするものである。本発明は、ユーザが別のユーザの表示領域内にあるオブジェクトについて話したいとき、誤解を防止することができる。本発明は、グループ通話チャットルーム（ライブストリーミングの形式であってもよい）へのユーザの参加意欲を高めることができる。本発明は、より多くのストリーマーや視聴者を引き付け、ライブストリーミングによるグループ通話に参加してもらうことができる。 The present invention makes conference calls and group calls more convenient, fun, and interactive. The present invention can prevent misunderstandings when a user wants to talk about an object that is in another user's display area. The present invention can increase user motivation to participate in a group call chat room (which may be in the form of live streaming). The present invention can attract more streamers and viewers to participate in live streaming group calls.

本発明で説明した処理及び手順は、明示的に説明したものに加えて、ソフトウェア、ハードウェア、またはそれらの任意の組み合わせにより実現することができる。例えば、本明細書で説明した処理および手順は、その処理および手順に対応するロジックを集積回路、揮発性メモリ、不揮発性メモリ、非一過性のコンピュータ可読媒体、磁気ディスクなどの媒体に実装することにより実現することができる。さらに、本明細書に記載された処理および手順は、その処理および手順に対応するコンピュータプログラムとして実現することができ、各種のコンピュータにより実行することができる。 The processes and procedures described in this invention, in addition to those explicitly described, may be implemented by software, hardware, or any combination thereof. For example, the processes and procedures described herein implement logic corresponding to the processes and procedures in a medium such as an integrated circuit, volatile memory, non-volatile memory, non-transitory computer-readable medium, magnetic disk, etc. This can be achieved by Further, the processes and procedures described in this specification can be realized as a computer program corresponding to the processes and procedures, and can be executed by various computers.

上記実施態様で説明したシステムまたは方法は、固体記憶装置、光ディスク記憶装置、磁気ディスク記憶装置などの非一時的なコンピュータ可読媒体に格納されたプログラムに統合されてもよい。あるいは、プログラムは、インターネットを介してサーバからダウンロードされ、プロセッサにより実行されるものとしてもよい。 The systems or methods described in the above embodiments may be integrated into programs stored on non-transitory computer-readable media, such as solid state storage, optical disk storage, magnetic disk storage, and the like. Alternatively, the program may be downloaded from a server via the Internet and executed by the processor.

以上、本発明の技術的内容及び特徴を説明したが、本発明の属する技術分野において通常の知識を有する者であれば、本発明の教示及び開示から逸脱することなく、なお多くの変形及び修正を行うことができる。したがって、本発明の範囲は、既に開示された実施態様に限定されず、本発明から逸脱しない別の変形や修正を含み、特許請求の範囲に含まれる範囲である。 Although the technical contents and features of the present invention have been described above, those with ordinary knowledge in the technical field to which the present invention pertains will appreciate that many variations and modifications can be made without departing from the teachings and disclosure of the present invention. It can be performed. Therefore, the scope of the invention is not limited to the embodiments already disclosed, but includes other variations and modifications that do not depart from the invention and are included in the scope of the claims.

Ｓ１画面
ＲＡ領域
ＲＢ領域
ＲＣ領域
ＲＤ領域
Ａ１部分
Ａ１１部分
Ａ２部分
Ａ２１部分
Ａ３境界
Ａ３１インタラクティブ領域
Ａ３１１サブ領域
Ａ３１２サブ領域
Ａ３１３サブ領域
Ａ３２領域
Ｂ１オブジェクト
Ｂ３境界
Ｂ３１インタラクティブ領域
Ｂ３２領域
ＢＲ１境界線
ＢＲ２境界線
ＳＰ１特殊効果
１システム
１０ユーザ端末
３０バックエンドサーバ
３２メッセージユニット
４０ストリーミングサーバ
４００データ受信装置
４０２データ送信装置
９０ネットワーク
７００カメラ
７０２レンダラー
７０４ディスプレイ
７０６エンコーダー
７０８デコーダー
７１０結果送信装置
７１２マッティングユニット
７１４オブジェクト認識ユニット
８００カメラ
８０２レンダラー
８０４ディスプレイ
８０６エンコーダー
８０８デコーダー
８１０結果受信装置
８１２画像処理装置 S1 Screen RA Area RB Area RC Area RD Area A1 Part A11 Part A2 Part A21 Part A3 Boundary A31 Interactive area A311 Sub area A312 Sub area A313 Sub area A32 Area B1 Object B3 Boundary B31 Interactive area B32 Area BR1 Boundary line BR2 Boundary line SP1 Special effects 1 System 10 User terminal 30 Back-end server 32 Message unit 40 Streaming server 400 Data receiving device 402 Data transmitting device 90 Network 700 Camera 702 Renderer 704 Display 706 Encoder 708 Decoder 710 Result transmitting device 712 Matting unit 714 Object recognition unit 800 Camera 802 Renderer 804 Display 806 Encoder 808 Decoder 810 Result receiving device 812 Image processing device

Claims

An image processing method,
Displaying live video of the first user in a first area on the user terminal;
Displaying a second user's video in a second area on the user terminal;
a portion of the live video of the first user is extended to the second area on the user terminal ;
further comprising determining the portion of the live video of the first user by detecting a portion of the first user that exceeds a boundary within the first area on the user terminal; An image processing method characterized by:

further defining an interactive area within the first area;
detecting a portion of the first user within the interactive area;
displaying the portion of the first user within the second area;
The video processing method according to claim 1, characterized in that it includes:

3. The video processing method of claim 2, wherein the step of detecting the part of the first user within the interactive area includes a matting process.

3. The video processing method according to claim 2, wherein the step of detecting the part of the first user within the interactive area includes an object recognition process.

5. The object recognition process further comprises the step of displaying a special effect on the user terminal when a predetermined pattern is recognized in the part of the first user. Video processing method.

5. The video processing method according to claim 4, wherein the object recognition process includes a gesture recognition process or a skin recognition process.

3. The video processing method according to claim 2, wherein the step of detecting the part of the first user within the interactive area includes image comparison processing or moving object detection processing.

The first user and the second user are arranged horizontally on the user terminal, and the part of the live video of the first user is directed towards the second user within the first area. The video processing method according to claim 1, wherein the video processing method is stretched.

The first user and the second user are vertically aligned on the user terminal, and the part of the live video of the first user is directed toward the second user within the first area. The video processing method according to claim 1, wherein the video processing method is stretched.

The first user and the second user are diagonally arranged on the user terminal, and the part of the live video of the first user is directed toward the second user within the first area. 2. The video processing method according to claim 1, wherein the video processing method is performed by stretching the video.

An image processing method,
Displaying live video of the first user in a first area on the user terminal;
Displaying a second user's video in a second area on the user terminal;
a portion of the live video of the first user is extended to the second area on the user terminal;
further defining an interactive area within the first area;
detecting a portion of the first user within the interactive area;
displaying the portion of the first user within the second area;
including;
further detecting a position of the portion of the first user within the interactive area;
determining a position of the second area based on the position of the portion of the first user within the interactive area;
A video processing method characterized by comprising:

The video processing method according to claim 1 , wherein the position of the boundary line is determined by the first user.

The video processing method according to claim 1 , wherein the boundary line corresponds to a direction of the second area with respect to the first area and is between the first area and the second area. .

The video processing method according to claim 1, wherein the video of the second user is a live video.

The video processing method according to claim 1, wherein the part of the live video of the first user extended to the second area is represented as a graphical object.

moreover. 5. The method of claim 1, further comprising the step of displaying a special effect in the second area when the part of the live video of the first user is stretched to touch the second area of the second user. 1. The video processing method according to 1.

A video processing system comprising one or more processors, the one or more processors executing machine-readable instructions,
Displaying live video of the first user in a first area on the user terminal;
Displaying a second user's video in a second area on the user terminal;
a portion of the live video of the first user is extended to the second area on the user terminal ;
Further, determining the portion of the live video of the first user by detecting a portion of the first user that exceeds a boundary within the first area on the user terminal. An image processing system characterized by the following .

The one or more processors execute the machine-readable instructions, and further:
defining an interactive area within the first area;
detecting a portion of the first user within the interactive area;
displaying the portion of the first user within the second area;
The video processing system according to claim 17 , characterized in that the video processing system executes the following.

A non-transitory computer-readable medium containing a program for image processing, the program being stored in one or more computers,
Displaying live video of the first user in a first area on the user terminal;
Displaying a second user's video in a second area on the user terminal;
a portion of the live video of the first user is extended to the second area on the user terminal;
Further, determining the portion of the live video of the first user by detecting a portion of the first user that exceeds a boundary line within the first area on the user terminal. , a non-transitory computer-readable medium.