WO2023232103A1 - Film-watching interaction method and apparatus, and computer-readable storage medium
- Publication number
- WO2023232103A1 (PCT/CN2023/097743)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- space angle
- target
- line segment
- line
- joint point
Classifications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/4788—Supplemental services communicating with other users, e.g. chatting
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
- H04N21/439—Processing of audio elementary streams
- H04N21/44008—Processing of video elementary streams involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
- G06V10/34—Smoothing or thinning of the pattern; Morphological operations; Skeletonisation
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/462—Salient features, e.g. scale invariant feature transforms [SIFT]
- G06V40/174—Facial expression recognition
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G10L17/00—Speaker identification or verification techniques
Definitions
- the present disclosure relates to the field of information technology, and in particular, to a movie viewing interaction method, device and computer-readable storage medium.
- users can usually only watch the currently playing video resource or post barrages for interaction during the movie viewing process.
- the current movie-viewing design is mainly aimed at the audience as a whole, so the available interaction is monotonous and offers limited fun.
- Embodiments of the present disclosure provide a movie-viewing interaction method, device, and computer-readable storage medium to solve the problem that related movie-viewing interaction methods are monotonous and offer limited fun.
- embodiments of the present disclosure provide a movie-viewing interaction method, including: acquiring a user image while the user watches a target video; identifying the posture of a target object in the user image; determining a response posture of a target character in the target video based on the posture of the target object; adjusting the response posture of a target part of the target character based on the posture of the target part of the target object; and, based on the adjusted response posture of the target part of the target character, displaying the response posture of the target character in the video frame of the target video.
- embodiments of the present disclosure also provide an interactive device for watching movies, including:
- the first acquisition module is used to acquire the user's image while the user is watching the target video
- a recognition module used to recognize the posture of the target object in the user image
- a first determination module configured to determine the response posture of the target character in the target video based on the posture of the target object
- a first adjustment module configured to adjust the response posture of the target part of the target character based on the posture of the target part of the target object
- the first display module is configured to display the response posture of the target character in the video frame of the target video based on the adjusted response posture of the target part of the target character.
- embodiments of the present disclosure also provide a computer-readable storage medium.
- a computer program is stored on the computer-readable storage medium.
- when the computer program is executed by a processor, the steps of the above movie-viewing interaction method are implemented.
- in the embodiments of the present disclosure, the user image is acquired while the user watches the target video; the posture of the target object in the user image is identified; the response posture of the target character in the target video is determined based on the posture of the target object; the response posture of the target part of the target character is adjusted based on the posture of the target part of the target object; and the response posture of the target character is displayed in the video frame of the target video based on the adjusted response posture of the target part. This enriches movie-viewing interaction and makes it more engaging.
- Figure 1 is a flow chart of a movie viewing interaction method provided by an embodiment of the present disclosure
- Figure 2 is a schematic diagram of user communication in a viewing room provided by an embodiment of the present disclosure
- Figure 3 is a schematic diagram of salient target detection provided by an embodiment of the present disclosure.
- Figure 4 is a schematic diagram of bone detection provided by an embodiment of the present disclosure.
- Figure 5 is a schematic diagram of continuous action detection provided by an embodiment of the present disclosure.
- Figure 6 is a schematic diagram of bone key point matching provided by an embodiment of the present disclosure.
- Figure 7 is a schematic diagram of three-dimensional modeling of gestures provided by an embodiment of the present disclosure.
- Figure 8 is a schematic diagram of the minimum circumscribed sphere projection provided by an embodiment of the present disclosure.
- Figure 9 is a schematic diagram of hand spatial coordinates provided by an embodiment of the present disclosure.
- Figure 10 is a schematic diagram of the spatial angle provided by an embodiment of the present disclosure.
- Figure 11 is a schematic diagram of the gesture interaction effect provided by an embodiment of the present disclosure.
- Figure 12 is a schematic diagram of an auxiliary spectrum diagram provided by an embodiment of the present disclosure.
- Figure 13 is a structural diagram of a movie viewing interactive device provided by an embodiment of the present disclosure.
- Figure 1 is a flow chart of a movie viewing interaction method provided by an embodiment of the present disclosure. As shown in Figure 1, it includes the following steps:
- Step 101 Obtain the user's image while the user is watching the target video.
- the above target video can be any video currently watched by the user.
- the user can be supported to interact with the body or expression of a character in the video; that is, the user can perform certain actions or expressions, and the character in the video responds accordingly. For example, when a movie-watching user makes a hug gesture, the character in the video responds with a hug; when the movie-watching user raises a right hand to shake hands, the character in the video raises its right hand in response; and so on.
- the user's image can be obtained while the user is watching the target video.
- the user's image can be collected and uploaded in real time through the camera.
- optionally, before step 101, the method further includes: receiving, while a first user watches the target video, an input from the first user on a first barrage, where the input is used to initiate an invitation to a second user (the publisher of the first barrage) to create a viewing room; and, when the second user accepts the invitation, creating a viewing room for the first user and the second user and playing the target video in the viewing room.
- accordingly, the step 101 includes:
- acquiring the user image while the user watches the target video in the viewing room.
- users can be supported to create a viewing room while watching videos, and members of the room can interact within it.
- after logging into the video application, the user can set whether to enable the viewing room function. If it is enabled, the user has permission to create a viewing room; otherwise, like an ordinary viewer, the user cannot create one. The user can also set whether to accept viewing room invitations from others: if enabled, invitations from other users pushed by the system are allowed through; otherwise, they are blocked.
- when user A clicks on any video to watch, barrages float across the screen. User A can then click on an interesting barrage to initiate an invitation to create a viewing room.
- the invitation message is sent directly through the video application to the account of the user who posted that barrage. If the invited user B is not online during this period, the system can directly send a prompt to user A; if user B is online, user B can choose to accept or reject user A's invitation. If user B accepts, the viewing room is successfully created; at this point user A and user B are in the viewing room, and other users who have not received an invitation cannot enter.
- the user who creates the viewing room is the administrator.
- Invitations sent by other users must be approved by the administrator.
- the invitation information sent is directly pushed by the video platform to the invited users.
- users come to the viewing room to watch videos, achieving synchronized viewing within the room.
- room members can initiate interactions, and all messages are visible only to room members. When the viewing room is disbanded, all messages are cleared, improving the privacy of movie-viewing topics.
- the schematic diagram of user communication in the viewing room is shown in Figure 2: the client user submits a message to the server, and the server completes the message interaction and displays it on the video playback interface.
- members of the viewing room can interact physically while watching the video, and barrage-based scene rendering can bring the viewing scene into the interaction.
- in this way, users can create a viewing room for group viewing and interact privately within it, which improves both the fun of movie-watching interaction and the privacy of group viewing.
- Step 102 Identify the posture of the target object in the user image.
- based on the acquired user image, the target object, usually the movie-watching user, can be identified, and the posture of the target object can be determined through image recognition and posture detection; that is, the body movements or expressions of the target object can be identified.
- optionally, the step 102 includes: identifying the target object in the user image through a target detection algorithm; and determining the posture of the target object based on a skeletal model of the target object.
- the target object in the user image can be first identified through a target detection algorithm.
- the target outline can then be drawn through a saliency detection algorithm, which returns a target position result {x, y, w, h}, where x and y are the horizontal and vertical coordinates of the center point of the target rectangle, w is the width of the rectangle, and h is its height.
- in the example of Figure 3, the image background contains multiple people and buildings; the recognized salient target is shown within the rectangular frame in (b) of Figure 3. A minimal sketch of this step follows.
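- the following is one way such a step could be implemented, assuming the fine-grained static saliency detector from opencv-contrib (the source names no specific algorithm or library); it binarizes the saliency map, keeps the largest salient region, and returns the center-based {x, y, w, h} result described above:

```python
# A minimal sketch of the salient-target step; the saliency detector choice
# is an assumption, not the patent's stated method.
import cv2
import numpy as np

def detect_salient_target(image: np.ndarray) -> dict:
    saliency = cv2.saliency.StaticSaliencyFineGrained_create()
    ok, saliency_map = saliency.computeSaliency(image)
    if not ok:
        raise RuntimeError("saliency computation failed")
    # Binarize the saliency map (values in [0, 1]) with Otsu's threshold.
    mask = (saliency_map * 255).astype(np.uint8)
    _, mask = cv2.threshold(mask, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        raise RuntimeError("no salient target found")
    # Bounding rectangle of the largest salient region.
    left, top, w, h = cv2.boundingRect(max(contours, key=cv2.contourArea))
    # x and y are the center point of the rectangle, per the result format.
    return {"x": left + w / 2, "y": top + h / 2, "w": w, "h": h}
```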
- a skeletal model can then be constructed for the outline of the target within the rectangular frame.
- a skeletal model of the target object can be constructed based on key points of key parts of the target object within the rectangular frame.
- the detection results on the original image are shown in (a) of Figure 4, and the skeleton model abstracted from the original image is shown in (b) of Figure 4.
- the model contains a total of 15 key points. They correspond to 15 key parts of the human body, such as point p1 for the head, point p2 for the right hand, and point p15 for the left foot.
- the posture of the target object can be determined based on the skeletal model of the target object. For example, according to the skeletal model shown in (b) of FIG. 4 , it can be determined that the character posture is a standing posture with limbs extended.
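- as an illustration of reading a posture off such a 15-point model, the sketch below checks the "standing with limbs extended" example by testing whether each arm is nearly straight. The source names only p1 (head), p2 (right hand) and p15 (left foot); the remaining keypoint assignments are assumptions made for the example:

```python
import numpy as np

KEYPOINT_NAMES = [
    "head", "right_hand", "right_elbow", "right_shoulder", "neck",
    "left_shoulder", "left_elbow", "left_hand", "torso", "right_hip",
    "right_knee", "right_foot", "left_hip", "left_knee", "left_foot",
]  # hypothetical ordering for p1..p15

def joint_angle(a: np.ndarray, b: np.ndarray, c: np.ndarray) -> float:
    """Angle in degrees at vertex b between segments b-a and b-c."""
    u, v = a - b, c - b
    cos = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return float(np.degrees(np.arccos(np.clip(cos, -1.0, 1.0))))

def limbs_extended(skeleton: np.ndarray, tol_deg: float = 25.0) -> bool:
    """Rough check: both elbows nearly straight (arms extended)."""
    right = joint_angle(skeleton[3], skeleton[2], skeleton[1])  # shoulder-elbow-hand
    left = joint_angle(skeleton[5], skeleton[6], skeleton[7])
    return right > 180 - tol_deg and left > 180 - tol_deg
```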
- optionally, the specific action of the target object can be determined by combining shot transitions and behavior continuity across frames.
- the skeleton model of the continuity detection image result of a certain user's video clip is shown in Figure 5.
- the posture result returned by the algorithm is dancing, and the result is encapsulated as {result: 'dance'}.
- the posture of the target object in the user's image can be accurately recognized.
- Step 103 Based on the posture of the target object, determine the response posture of the target character in the target video.
- the response posture of the target character in the target video can be determined according to the posture of the target object. Specifically, a saliency detection algorithm similar to the one above can be used to locate the target character, which is usually the character occupying the largest area of the video frame, and the response posture of that character can then be determined.
- the response gesture of the target character can be a gesture that is basically the same as the gesture of the target object.
- for example, if it is detected that the user makes a handshake gesture, the character in the video should also make a handshake gesture in response to the user's physical interaction; and if it is detected that the user makes a hug gesture, the character in the video also needs to make a hug gesture in response to the user's hug.
- optionally, the method further includes: acquiring the skeletal model of the target character in the target video; and performing skeleton key point matching on the skeletal model of the target object and the skeletal model of the target character to adjust the response posture of the target character.
- the response posture of the target character can be further adjusted to make the interaction between the movie-watching user and the character in the video more detailed.
- the skeletal model of the target character in the target video can be obtained.
- the skeletal model of the target character in the target video can be invoked to make it perform the appropriate body movement, and skeleton key point matching then ensures the rationality of the interaction posture: the key points of the target object's skeletal model and the target character's skeletal model are matched so that corresponding skeletal joint points lie in similar spatial positions, ensuring that the target character strikes essentially the same pose as the target object.
- for example, if the user extends a right hand to shake hands, the character in the video also needs to extend its right hand, not its left. As shown in Figure 6, p1 in the skeletal model of the movie-watching user on the left corresponds to B1 in the skeletal model of the video character on the right, p2 corresponds to B2, and so on; a minimal sketch of this correspondence follows.
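- a minimal sketch of such a fixed p_i-to-B_i correspondence, assuming 2D joint positions in image coordinates and a horizontal mirror so that the user's right hand drives the character's right hand on the facing screen (the mirroring is an assumption; the source only requires corresponding joints to land in similar spatial positions):

```python
P_TO_B = {f"p{i}": f"B{i}" for i in range(1, 16)}  # p1 -> B1, p2 -> B2, ...

def retarget(user_joints: dict, frame_width: float) -> dict:
    """Map user joint positions onto the corresponding character joints."""
    character_joints = {}
    for p_name, (x, y) in user_joints.items():
        b_name = P_TO_B[p_name]
        character_joints[b_name] = (frame_width - x, y)  # horizontal mirror
    return character_joints
```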
- Step 104 Adjust the response posture of the target part of the target character based on the posture of the target part of the target object.
- the postures of the movie-watching user and the character in the video can be further refined to determine corresponding interactive postures.
- specifically, the response posture of the target part of the target character can be adjusted based on the posture of the target part of the target object, so that the target character makes a gesture that echoes the target object. Displaying the target character's response posture in the video frame based on the adjusted target part then highlights the response of that part.
- Step 105 Based on the adjusted response posture of the target part of the target character, display the response posture of the target character in the video frame of the target video.
- the response gesture of the target character can be displayed in the video frame of the target video; that is, a display effect of the target character's response gesture is generated in the video frame, realizing the interaction between the movie-watching user and the character in the video.
- the response gesture of the target part of the target character can be highlighted to generate a vivid interactive effect for watching movies.
- optionally, the step 105 includes: displaying, in the video frame, the interaction effect of the target part of the target object and the target part of the target character, based on the posture of the target part of the target object and the adjusted response posture of the target part of the target character.
- corresponding interactive postures can also be determined for detailed parts of the movie-watching user and the characters in the video, such as gestures and facial expressions.
- specifically, the response posture of the target part of the target character is adjusted so that the target character makes a gesture echoing the target object, which is displayed in the video frame.
- the interaction effect between the target part of the target object and the target part of the target character is displayed as the response gesture, forming an interactive gesture of the two target parts.
- for example, when the user raises a right hand, the character in the video can also raise its right hand, and the user's raised hand touching the character's raised hand is displayed in the video frame, forming the interactive effect of the user and the character touching palms across the screen;
- similarly, when the user makes a funny face, the character in the video can make a funny face back, creating the effect of the user and the character pulling faces at each other.
- movie-watching users can interact with specific parts of the characters in the video in more detail, thereby improving the interactive experience.
- optionally, the step 104 includes: determining a first minimum circumscribed sphere surrounding the three-dimensional model of the target part of the target object; determining a second minimum circumscribed sphere surrounding the three-dimensional model of the target part of the target character; projecting the first minimum circumscribed sphere into the video frame of the target video according to the center position of the first minimum circumscribed sphere; adjusting the position of the second minimum circumscribed sphere according to the position of the first minimum circumscribed sphere projected in the video frame; and adjusting the response posture of the target part of the target character according to the adjusted position of the second minimum circumscribed sphere.
- the minimum circumscribed sphere of the three-dimensional model of the target part of the target object and the minimum circumscribed sphere of the three-dimensional model of the target part of the target character can be calculated first, that is, the minimum circumscribed sphere that can completely surround the target part.
- the target part such as a gesture can be three-dimensionally modeled, and then the minimum circumscribed sphere of the three-dimensional model surrounding the target part is calculated.
- the minimum circumscribed sphere of the character's target part model in the video is V1
- the minimum circumscribed sphere of the user's target part model is V2
- the centers of the two spheres are M1 and M2 respectively.
- the minimum circumscribed sphere V2 can be projected into the video frame of the target video according to its center coordinate M2; that is, the three-dimensional model of the target part of the target object is displayed in the video frame. The coordinate position of the minimum circumscribed sphere V1 can then be adjusted according to the projected position of V2 so that the positions of V1 and V2 are spatially aligned, and the response posture of the target part of the target character can be adjusted adaptively according to the adjusted position of V1.
- in this way, the posture of the target part of the target object is mapped into the video frame for display, and by adjusting the response posture of the target part of the target character, the two form an interactive posture in the video frame; a sketch of the sphere computation and alignment follows.
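- a sketch of the minimum circumscribed sphere and the alignment step, using Ritter's bounding-sphere approximation (the source names no algorithm; an exact smallest enclosing sphere, e.g. via Welzl's algorithm, could be substituted):

```python
import numpy as np

def bounding_sphere(points: np.ndarray):
    """Approximate smallest sphere enclosing `points` (an N x 3 array)."""
    points = points.astype(float)
    p0 = points[0]
    p1 = points[np.argmax(np.linalg.norm(points - p0, axis=1))]
    p2 = points[np.argmax(np.linalg.norm(points - p1, axis=1))]
    center = (p1 + p2) / 2
    radius = np.linalg.norm(p2 - p1) / 2
    for p in points:  # grow the sphere until every point is covered
        d = np.linalg.norm(p - center)
        if d > radius:
            radius = (radius + d) / 2
            center = center + (1 - radius / d) * (p - center)
    return center, radius

def align_hand(character_hand: np.ndarray, m2_projected: np.ndarray) -> np.ndarray:
    """Translate the character's part model so that the center M1 of its sphere
    V1 coincides with the projected center M2' of the user's sphere V2."""
    m1, _ = bounding_sphere(character_hand)
    return character_hand.astype(float) + (m2_projected - m1)
```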
- optionally, adjusting the position of the second minimum circumscribed sphere according to the position of the first minimum circumscribed sphere projected in the video frame includes: adjusting the center position of the second minimum circumscribed sphere according to the projected center position of the first minimum circumscribed sphere; and determining the positions of the points in the target part of the target character based on the angle relationships between points in the three-dimensional model of that target part.
- the response posture of the target part of the target character is then adjusted according to the adjusted center position of the second minimum circumscribed sphere and the positions of the points in the target part of the target character.
- for example, the center coordinate position of the minimum circumscribed sphere V1 can be adjusted according to the projected center coordinate position M2' of the minimum circumscribed sphere V2, such as by aligning the coordinate positions of the two sphere centers.
- the response posture of the target part of the target character can be adjusted accurately and carefully.
- optionally, the target part is the hand, and the step 104 includes:
- establishing a three-dimensional coordinate system with the first palm root node of the target object's hand as the origin, where the target object's hand includes thumb joint points, index finger joint points, middle finger joint points, ring finger joint points and little finger joint points;
- calculating a first space angle, a second space angle, a third space angle and a fourth space angle, where the first space angle takes the first palm root node as its vertex and the first and second line segments as its sides; the second space angle takes the first palm root node as its vertex and the second and third line segments as its sides; the third space angle takes the first palm root node as its vertex and the third and fourth line segments as its sides; and the fourth space angle takes the first palm root node as its vertex and the fourth and fifth line segments as its sides. The first line segment is the line segment between the first palm root node and the adjacent thumb joint point; the second line segment is between the first palm root node and the adjacent index finger joint point; the third line segment is between the first palm root node and the adjacent middle finger joint point; the fourth line segment is between the first palm root node and the adjacent ring finger joint point; and the fifth line segment is between the first palm root node and the adjacent little finger joint point;
- calculating a first, second, third, fourth and fifth line-plane space angle, where the first line-plane space angle is the space angle between the line segment on which the thumb joint points of the target object's hand lie and the first plane; the second is between the line segment on which the index finger joint points lie and the first plane; the third is between the line segment on which the middle finger joint points lie and the first plane; the fourth is between the line segment on which the ring finger joint points lie and the first plane; and the fifth is between the line segment on which the little finger joint points lie and the first plane. The first plane is the plane formed by the z-axis and the y-axis of the three-dimensional coordinate system;
- adjusting the positions of the thumb joint points, index finger joint points, middle finger joint points, ring finger joint points and little finger joint points of the target character's hand according to the first to fourth space angles and the first to fifth line-plane space angles.
- the target part may be a hand
- the hand movements of the target character may be carefully adjusted based on the hand movements of the target object.
- specifically, a three-dimensional coordinate system can be established with the palm root node q1 of the target object's hand as the origin, and the set of points {q1, q2, q3...q20} simulates 20 main joint points of the user's hand.
- q1 corresponds to the palm root node; q6, q7 and q20 correspond to the thumb joint points; q5, q8, q18 and q19 correspond to the index finger joint points; q4, q9, q16 and q17 correspond to the middle finger joint points; q3, q10, q14 and q15 correspond to the ring finger joint points; and q2, q11, q12 and q13 correspond to the little finger joint points.
- α1 is the space angle with q1 as the vertex and line segments q1q6 and q1q5 as sides; α2 is the space angle with q1 as the vertex and line segments q1q5 and q1q4 as sides; α3 is the space angle with q1 as the vertex and line segments q1q4 and q1q3 as sides; and α4 is the space angle with q1 as the vertex and line segments q1q3 and q1q2 as sides. A sketch of this computation follows.
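- each space angle has the palm root node q1 as its vertex and two palm-to-knuckle segments as its sides, so it can be computed from the dot product of the two edge vectors:

```python
import numpy as np

def space_angle(vertex: np.ndarray, a: np.ndarray, b: np.ndarray) -> float:
    """Angle in radians at `vertex` between segments vertex-a and vertex-b."""
    u, v = a - vertex, b - vertex
    cos = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))

# With q[0] = q1 (palm root) and q[1]..q[5] = q2..q6 as in the layout above:
# alpha1 = space_angle(q[0], q[5], q[4])   # sides q1q6 and q1q5
# alpha2 = space_angle(q[0], q[4], q[3])   # sides q1q5 and q1q4
# alpha3 = space_angle(q[0], q[3], q[2])   # sides q1q4 and q1q3
# alpha4 = space_angle(q[0], q[2], q[1])   # sides q1q3 and q1q2
```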
- based on these angles, the positions of the joint points of the target character's hand closest to the palm root node, namely thumb joint point Q6, index finger joint point Q5, middle finger joint point Q4, ring finger joint point Q3 and little finger joint point Q2, can be adjusted so that they correspond to the positions of the joint points of the target object's hand.
- the positions of the remaining joint points of the target character's hand can also be adjusted. Specifically, the angles between the line segments q6q7, q5q8, q4q9, q3q10, q2q11 and the plane zq1y can be calculated, recorded as β1, β2, β3, β4 and β5 respectively.
- based on these line-plane angles, the positions of the remaining thumb, index finger, middle finger, ring finger and little finger joint points of the target character's hand, namely {Q7, Q8, Q9...Q20}, can be determined, so that the joint point positions of the target character's hand correspond to those of the target object's hand and the target character's gesture remains essentially symmetrical to the target object's gesture. A sketch of the line-plane angle follows.
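- in the local hand frame, the plane z-q1-y is the plane x = 0, so the angle between a finger segment and that plane is arcsin(|v_x| / |v|), where v is the segment's direction vector (the angle depends only on the plane's orientation, not its position):

```python
import numpy as np

def line_plane_angle(p_from: np.ndarray, p_to: np.ndarray) -> float:
    """Angle in radians between segment p_from-p_to and the plane x = 0."""
    v = p_to - p_from
    return float(np.arcsin(abs(v[0]) / np.linalg.norm(v)))

# With q[4] = q5, q[5] = q6, q[6] = q7, q[7] = q8, ... as in the layout above:
# beta1 = line_plane_angle(q[5], q[6])   # segment q6q7
# beta2 = line_plane_angle(q[4], q[7])   # segment q5q8, and so on
```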
- the adjusted hand interaction effect is shown in Figure 11.
- User gestures can form an interactive experience of touching across the screen with the character gestures in the video.
- optionally, adjusting the positions of the thumb joint points, index finger joint points, middle finger joint points, ring finger joint points and little finger joint points of the target character's hand according to the first to fourth space angles and the first to fifth line-plane space angles includes:
- adjusting the positions of the first thumb joint point, first index finger joint point, first middle finger joint point, first ring finger joint point and first little finger joint point of the target character's hand by making the fifth space angle equal to the first space angle, the sixth space angle equal to the second space angle, the seventh space angle equal to the third space angle, and the eighth space angle equal to the fourth space angle;
- adjusting the positions of the other thumb joint points except the first thumb joint point, the other index finger joint points except the first index finger joint point, the other middle finger joint points except the first middle finger joint point, the other ring finger joint points except the first ring finger joint point, and the other little finger joint points except the first little finger joint point, by making the sixth line-plane space angle equal to the first line-plane space angle, the seventh equal to the second, the eighth equal to the third, the ninth equal to the fourth, and the tenth equal to the fifth;
- the fifth space angle is the space angle with the second palm root node of the target character's hand as the vertex and the sixth and seventh line segments as sides; the sixth space angle has the second palm root node as the vertex and the seventh and eighth line segments as sides; the seventh space angle has the second palm root node as the vertex and the eighth and ninth line segments as sides; and the eighth space angle has the second palm root node as the vertex and the ninth and tenth line segments as sides. The sixth line segment is the line segment between the second palm root node and the adjacent first thumb joint point; the seventh line segment is between the second palm root node and the adjacent first index finger joint point; the eighth line segment is between the second palm root node and the adjacent first middle finger joint point; the ninth line segment is between the second palm root node and the adjacent first ring finger joint point; and the tenth line segment is between the second palm root node and the adjacent first little finger joint point;
- the sixth line-plane space angle is the space angle between the line segment on which the thumb joint points of the target character's hand lie and the second plane; the seventh is between the line segment on which the index finger joint points lie and the second plane; the eighth is between the line segment on which the middle finger joint points lie and the second plane; the ninth is between the line segment on which the ring finger joint points lie and the second plane; and the tenth is between the line segment on which the little finger joint points lie and the second plane. The second plane is the plane formed by the z-axis and the y-axis of the three-dimensional coordinate system established with the second palm root node as the origin.
- specifically, the target character's hand model in the video is represented by the set of points {Q1, Q2, Q3...Q20}, where Q1 corresponds to the point q1 in the target object's hand model, Q2 corresponds to q2, Q3 corresponds to q3, and so on; each space angle of the character's hand corresponds to the matching space angle of the user's hand.
- taking point Q6 as an example, a set G of candidate coordinate points satisfying the corresponding angle constraint can be obtained, and the spatial distance between the current point Q6 and each point in the set G can be calculated in turn, recorded as L1.
- with the original Q6 at (xq, yq, zq) and a candidate point K in G at (xk, yk, zk), the spatial distance between point Q6 and point K is calculated as L1 = √((xq − xk)² + (yq − yk)² + (zq − zk)²); the candidate with the smallest distance is taken as the adjusted point, recorded as Q6′.
- likewise, the angle between Q6′Q7 and the plane zQ1y can be made equal to β1, yielding a set of vector solutions for Q6′Q7. Combining each vector with the coordinates of Q6′ gives the coordinate values of all possible Q7 points; this set is marked R. The spatial distance between the original Q7 coordinates and each value in R is calculated, and the coordinate point with the smallest spatial distance is the adjusted coordinate point of Q7, recorded as Q7′.
- the previous node of Q7 is Q6, the previous node of Q8 is Q5, and so on; for each joint, the line-plane space angle constrains the candidate set, and the minimum spatial distance determines the final adjusted coordinate value. All remaining coordinate points are adjusted in turn; a sketch of this rule follows.
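- a sketch of this adjustment rule: given the adjusted parent joint Q6′ and the required line-plane angle β1, enumerate candidate Q7 positions whose segment Q6′Q7 keeps the original length and makes angle β1 with the plane z-Q1-y (x = 0), then keep the candidate closest to the original Q7. Uniform azimuth sampling is an assumption; the source only speaks of "a set of vector solutions":

```python
import numpy as np

def adjust_child_joint(q6_new: np.ndarray, q6_old: np.ndarray,
                       q7_old: np.ndarray, beta: float,
                       n_samples: int = 360) -> np.ndarray:
    length = np.linalg.norm(q7_old - q6_old)  # preserve the segment length
    best, best_dist = None, np.inf
    for phi in np.linspace(0.0, 2.0 * np.pi, n_samples, endpoint=False):
        for sx in (1.0, -1.0):  # try both signs of the x component
            # Unit direction with |v_x| = sin(beta): angle beta to plane x = 0.
            v = np.array([sx * np.sin(beta),
                          np.cos(beta) * np.cos(phi),
                          np.cos(beta) * np.sin(phi)])
            candidate = q6_new + length * v
            dist = np.linalg.norm(candidate - q7_old)  # distance to original Q7
            if dist < best_dist:
                best, best_dist = candidate, dist
    return best  # the adjusted point Q7'
```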
- in this way, the positions of the joints of the character's hand in the video can be adjusted sequentially, ultimately forming an interactive experience in which the user's gestures and the character's gestures touch across the screen, increasing the fun of the interaction.
- optionally, the method also includes:
- when the user chooses to enter the interpretation mode, displaying a video dubbing interface and displaying an auxiliary spectrogram in the video dubbing interface, where the auxiliary spectrogram is a spectrogram generated based on the original sound in the target video;
- acquiring the dubbing audio input by the user according to the auxiliary spectrogram; and integrating the dubbing audio into the target video for playback.
- when users watch a video in a created viewing room, room members can initiate character dubbing and complete the characters' lines in the video, bringing themselves into the roles of the plot.
- voiceprint matching rules can be introduced, which can be used to assist the dubbing of character lines, so that users can have voiceprint reference when performing voice interpretation.
- the audio in the video can be automatically extracted and a voiceprint schematic diagram, also known as an auxiliary spectrogram, is generated, as shown by the solid line in Figure 12.
- when dubbing starts, the playback screen is switched away from the movie and the video dubbing interface is entered.
- the native auxiliary voiceprint, that is, the auxiliary spectrogram, pops up in the interface.
- users can refer to the auxiliary dubbing voiceprint to adjust their intonation and speaking speed to better complete the voice interpretation.
- in the multi-person interpretation mode, the voices of multiple people are obtained and integrated into a complete video.
- in the single-person interpretation mode, the user's own voice is synthesized with the original voices of the other characters (or with the voices of other users selected by the user) to obtain a video dubbed by the user. In this way, direct interaction between users in the viewing room, and between users and the video, is achieved, making watching movies more interesting.
- optionally, the method further includes: adjusting the dubbing audio so that the adjusted dubbing audio matches the auxiliary spectrogram;
- correspondingly, the step of integrating the dubbing audio into the target video for playback includes:
- integrating the adjusted dubbing audio into the target video for playback.
- voiceprint matching rules can also be introduced to modify the user's voice.
- that is, the system can improve the user's voiceprint through automatic voiceprint matching, making the user's voice fit the character better.
- the system can match the recorded voiceprint with the original voiceprint.
- in Figure 12, the original voiceprint is shown as a solid line, alongside the voiceprint (that is, the spectrum) entered by the user.
- the user's voice can be matched to the vocal-range values of the original video's voiceprint by calculating the variance within the vocal range, improving the fit of the user's voice; a sketch of one such matching rule follows.
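- a hedged sketch of the variance-based matching: shift and scale the user's voiceprint contour so that its mean and variance match the original's within the vocal range (the exact matching rule is not spelled out in the source):

```python
import numpy as np

def fit_voiceprint(user: np.ndarray, original: np.ndarray) -> np.ndarray:
    """Return the user's voiceprint re-scaled to the original's statistics."""
    u_mean, o_mean = user.mean(), original.mean()
    u_std, o_std = user.std(), original.std()
    if u_std == 0:  # flat input: fall back to the original's mean level
        return np.full_like(user, o_mean, dtype=float)
    return (user - u_mean) * (o_std / u_std) + o_mean
```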
- optionally, the method further includes: calculating a score for the dubbing audio based on the difference between the dubbing audio and each frequency point in the auxiliary spectrogram;
- correspondingly, the step of integrating the dubbing audio into the target video for playback includes:
- integrating the dubbing audio into the target video for playback, and displaying the score.
- voiceprint matching rules can also be introduced into a user scoring mechanism to score all users' dubbing of the same character, with the top-scoring users included in a leaderboard collection.
- the difference between each voiceprint point in the user's dubbing audio and the original voiceprint can be calculated.
- the original voiceprint has n voiceprint points.
- when the difference between the user's voiceprint and the original voiceprint is 0% of the original voiceprint value, one unit point is obtained;
- when the difference exceeds 50% of the original voiceprint value, 0 unit points are obtained; and when the difference is between 0 and 50% of the original voiceprint value, the unit point decreases by 0.2 for every 10 percentage points of increase.
- the unit points of the voiceprint points form a set λ, the voiceprint set entered by the user is X, and the original voiceprint set is N. If the user's voiceprint value at a point is xi and the corresponding original voiceprint value is ni, the deviation ratio is |xi − ni| / ni, and the user's total score R is obtained by accumulating the unit points λi over all n voiceprint points: R = Σ λi. A sketch of this scoring rule follows.
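- a sketch of the scoring rule: 1 unit point for a 0% deviation, 0 for a deviation of 50% or more, and 0.2 fewer unit points for each further 10 percentage points in between; summing the unit points to obtain R is an assumption consistent with the rules described, since the original formula is not reproduced in this text:

```python
import math

def dubbing_score(user_points, original_points):
    total = 0.0
    for x_i, n_i in zip(user_points, original_points):
        ratio = abs(x_i - n_i) / abs(n_i)  # deviation relative to the original
        if ratio >= 0.5:
            unit = 0.0
        else:
            # 0% -> 1.0; (0, 10%] -> 0.8; (10%, 20%] -> 0.6; ...
            unit = 1.0 - 0.2 * math.ceil(ratio / 0.1)
        total += max(unit, 0.0)
    return total
```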
- the score can be displayed in the user's dubbing video.
- the dubbing score of each user can be displayed in the synthesized dubbing video and sorted by score.
- optionally, the method also includes barrage-based scene rendering: the video scene can be rendered based on the barrages in the viewing room; the corresponding emotional words are extracted through semantic analysis of the barrages, and special effect widgets, sound effects and the like are applied according to the different emotional words.
- a specific implementation is as follows: users in the viewing room post barrages while watching the video; the number M of barrages within a certain period, for example 10 minutes, is counted; and the emotional words present in each barrage within this period, such as horror, joy or sadness, are extracted.
- the extraction method can be keyword matching and semantic understanding from natural language processing algorithms; the number N of emotional words is counted, and scene rendering is triggered when N/M exceeds a certain threshold.
- Emotional words can be used to automatically add relevant special effects widgets to the video played in the viewing room.
- Emotional words can also be used as search matching words to obtain and play corresponding sound effects. For example, for happy emotional words, the special effect of falling stars can be added to the video frame, and cheerful sound effects can be called up for playback; a minimal sketch of this trigger follows.
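- a minimal sketch of the barrage-driven rendering trigger; the emotion lexicon, the effects mapped to it, and the threshold are illustrative values, since the source only requires N/M to exceed a configurable threshold within a time window:

```python
EMOTION_EFFECTS = {
    "horror": "thunder_sfx",
    "joy": "falling_stars",
    "sadness": "rain_overlay",
}

def check_scene_render(barrages: list, threshold: float = 0.3):
    m = len(barrages)  # number of barrages M in the window
    if m == 0:
        return None
    # Collect every emotional-word hit across the window's barrages (N).
    hits = [w for text in barrages for w in EMOTION_EFFECTS if w in text]
    if len(hits) / m > threshold:  # N / M above threshold: trigger rendering
        dominant = max(set(hits), key=hits.count)
        return EMOTION_EFFECTS[dominant]  # effect / sound effect to apply
    return None
```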
- the plot can be made more flexible and the interactive interest further increased.
- This disclosure proposes a barrage-based interaction method for viewing rooms. Users can build viewing rooms with groups that share the same interests; for example, a group of fans of the same celebrity can watch a movie simultaneously, with better viewing privacy inside the room. Various interactions can also be carried out in the viewing room, such as physical interaction, scene immersion and barrage rendering, which greatly increases the fun of video watching and turns the audience from viewers into participants in the video's interpretation.
- the movie-viewing interaction method of the embodiments of the present disclosure acquires the user image while the user watches the target video; identifies the posture of the target object in the user image; determines the response posture of the target character in the target video based on the posture of the target object; adjusts the response posture of the target part of the target character based on the posture of the target part of the target object; and, based on the adjusted response posture of the target part of the target character, displays the response posture of the target character in the video frame of the target video.
- FIG. 13 is a structural diagram of an interactive movie-viewing device provided by an embodiment of the present disclosure. Since the problem-solving principle of the interactive viewing device is similar to that of the interaction method in the embodiments of the present disclosure, the implementation of the device can refer to the implementation of the method, and repeated details are not described again.
- the movie viewing interactive device 1300 includes:
- the first acquisition module 1301 is used to acquire user images while the user is watching the target video;
- the recognition module 1302 is used to recognize the posture of the target object in the user image;
- the first determination module 1303 is configured to determine the response posture of the target character in the target video based on the posture of the target object;
- the first adjustment module 1304 is configured to adjust the response posture of the target part of the target character based on the posture of the target part of the target object;
- the first display module 1305 is configured to display the response gesture of the target character in the video screen of the target video based on the adjusted response gesture of the target part of the target character.
- the identification module 1302 includes:
- An identification unit configured to identify the target object in the user image through a target detection algorithm
- a first determination unit configured to determine the posture of the target object based on the skeletal model of the target object.
- the interactive viewing device 1300 also includes:
- the second acquisition module is used to acquire the skeleton model of the target character in the target video
- a matching module configured to match skeletal key points between the skeletal model of the target object and the skeletal model of the target character to adjust the response posture of the target character.
- the first display module 1305 is further configured to display, in the video frame, the interaction effect of the target part of the target object and the target part of the target character, based on the posture of the target part of the target object and the adjusted response posture of the target part of the target character.
- the first adjustment module 1304 includes:
- a second determination unit configured to determine a first minimum circumscribed sphere surrounding the three-dimensional model of the target part of the target object;
- a third determination unit configured to determine a second minimum circumscribed sphere surrounding the three-dimensional model of the target part of the target character;
- a projection unit configured to project the first minimum circumscribed sphere into the video frame of the target video according to the center position of the first minimum circumscribed sphere;
- a first adjustment unit configured to adjust the position of the second minimum circumscribed sphere according to the position of the first minimum circumscribed sphere projected in the video frame;
- a second adjustment unit configured to adjust the response posture of the target part of the target character according to the adjusted position of the second minimum circumscribed sphere.
- optionally, the first adjustment unit includes:
- a first adjustment subunit configured to adjust the center position of the second minimum circumscribed sphere according to the center position of the first minimum circumscribed sphere projected in the video frame;
- a determining subunit configured to determine the positions of the points in the target part of the target character based on the angle relationships between points in the three-dimensional model of that target part;
- correspondingly, the second adjustment unit is configured to adjust the response posture of the target part of the target character according to the adjusted center position of the second minimum circumscribed sphere and the positions of the points in the target part of the target character.
- the target part is the hand;
- the first adjustment module 1304 includes:
- an establishment unit configured to establish a three-dimensional coordinate system with the first palm root node of the target object's hand as the origin, where the target object's hand includes thumb joint points, index finger joint points, middle finger joint points, ring finger joint points and little finger joint points;
- a first calculation unit configured to calculate the first space angle, the second space angle, the third space angle and the fourth space angle, where the first space angle takes the first palm root node as its vertex and the first and second line segments as its sides; the second space angle takes the first palm root node as its vertex and the second and third line segments as its sides; the third space angle takes the first palm root node as its vertex and the third and fourth line segments as its sides; and the fourth space angle takes the first palm root node as its vertex and the fourth and fifth line segments as its sides. The first line segment is between the first palm root node and the adjacent thumb joint point, the second between it and the adjacent index finger joint point, the third between it and the adjacent middle finger joint point, the fourth between it and the adjacent ring finger joint point, and the fifth between it and the adjacent little finger joint point;
- a second calculation unit configured to calculate the first, second, third, fourth and fifth line-plane space angles, where the first line-plane space angle is the space angle between the line segment on which the thumb joint points of the target object's hand lie and the first plane, the second between the line segment on which the index finger joint points lie and the first plane, the third between the line segment on which the middle finger joint points lie and the first plane, the fourth between the line segment on which the ring finger joint points lie and the first plane, and the fifth between the line segment on which the little finger joint points lie and the first plane; the first plane is the plane formed by the z-axis and the y-axis of the three-dimensional coordinate system;
- a third adjustment unit configured to adjust the positions of the thumb joint points, index finger joint points, middle finger joint points, ring finger joint points and little finger joint points of the target character's hand according to the first to fourth space angles and the first to fifth line-plane space angles.
- optionally, the third adjustment unit includes:
- a second adjustment subunit configured to adjust the positions of the first thumb joint point, first index finger joint point, first middle finger joint point, first ring finger joint point and first little finger joint point of the target character's hand by making the fifth space angle equal to the first space angle, the sixth space angle equal to the second space angle, the seventh space angle equal to the third space angle, and the eighth space angle equal to the fourth space angle;
- a third adjustment subunit configured to adjust the positions of the other thumb joint points except the first thumb joint point, the other index finger joint points except the first index finger joint point, the other middle finger joint points except the first middle finger joint point, the other ring finger joint points except the first ring finger joint point, and the other little finger joint points except the first little finger joint point, by making the sixth line-plane space angle equal to the first line-plane space angle, the seventh equal to the second, the eighth equal to the third, the ninth equal to the fourth, and the tenth equal to the fifth;
- the fifth space angle has the second palm root node of the target character's hand as the vertex and the sixth and seventh line segments as sides; the sixth space angle has the second palm root node as the vertex and the seventh and eighth line segments as sides; the seventh space angle has the second palm root node as the vertex and the eighth and ninth line segments as sides; and the eighth space angle has the second palm root node as the vertex and the ninth and tenth line segments as sides. The sixth line segment is between the second palm root node and the adjacent first thumb joint point, the seventh between it and the adjacent first index finger joint point, the eighth between it and the adjacent first middle finger joint point, the ninth between it and the adjacent first ring finger joint point, and the tenth between it and the adjacent first little finger joint point;
- the sixth line-plane space angle is the space angle between the line segment on which the thumb joint points of the target character's hand lie and the second plane, the seventh between the line segment on which the index finger joint points lie and the second plane, the eighth between the line segment on which the middle finger joint points lie and the second plane, the ninth between the line segment on which the ring finger joint points lie and the second plane, and the tenth between the line segment on which the little finger joint points lie and the second plane; the second plane is the plane formed by the z-axis and the y-axis of the three-dimensional coordinate system established with the second palm root node as the origin.
- the movie viewing interactive device 1300 also includes:
- a receiving module configured to receive an input from a first user regarding a first barrage while watching the target video, wherein the input is used to initiate an invitation to a second user to create a viewing room, and the second user is the publisher of the first barrage;
- a creation module configured to, when the second user accepts the invitation to create the viewing room, create a viewing room for the first user and the second user, and play the target video in the viewing room;
- the first acquisition module 1301 is configured to acquire the user image while the user watches the target video in the viewing room.
- the interactive viewing device 1300 also includes:
- the second display module is used to display a video dubbing interface when the user chooses to enter the interpretation mode, and to display an auxiliary spectrogram in the video dubbing interface, wherein the auxiliary spectrogram is generated from the original sound of the target video;
- the third acquisition module is used to acquire the dubbing audio input by the user according to the auxiliary spectrogram
- a playback module is used to integrate the dubbing audio into the target video for playback.
- the interactive viewing device 1300 also includes:
- the second adjustment module is used to adjust the dubbing audio so that the adjusted dubbing audio matches the auxiliary spectrogram
- the playback module is used to integrate the adjusted dubbing audio into the target video for playback.
- the interactive viewing device 1300 also includes:
- a calculation module configured to calculate the score of the dubbing audio based on the difference between the dubbing audio and each frequency point in the auxiliary spectrogram
- the playback module is used to integrate the dubbing audio into the target video for playback, and display the score.
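As a rough illustration of how such a score could be computed (the disclosure does not fix a formula; the normalization and the 0 to 100 mapping below are assumptions made for the sketch), a per-frequency-point comparison might look like:

```python
import numpy as np

def dubbing_score(user_spec: np.ndarray, ref_spec: np.ndarray) -> float:
    """Hypothetical scoring: compare the user's dubbing spectrogram with the
    auxiliary (original-sound) spectrogram frequency point by frequency point.
    Both inputs are magnitude spectrograms of the same shape (freq x time)."""
    u = user_spec / (np.max(user_spec) + 1e-9)    # normalize out loudness
    r = ref_spec / (np.max(ref_spec) + 1e-9)
    diff = np.mean(np.abs(u - r))                 # mean per-point difference
    return float(max(0.0, 100.0 * (1.0 - diff)))  # smaller diff, higher score
```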
- the interactive viewing device 1300 also includes:
- the fourth acquisition module is used to obtain the barrages and the number of barrages posted by the user in the first period when watching the target video in the viewing room;
- the second determination module is used to determine the emotional words corresponding to each barrage released within the first period, and count the number of each emotional word;
- a processing module configured to, when the ratio of the number of a first emotional word to the number of barrages is greater than a preset threshold, add a special-effect pendant related to the first emotional word in the viewing room, or play a sound effect related to the first emotional word, where the first emotional word is any one of the emotional words.
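For illustration only, the threshold test described above could be sketched as follows; the mapping of each barrage to an emotional word is assumed to be done upstream, for example by a sentiment model:

```python
from collections import Counter

def effects_to_trigger(barrage_emotions: list[str], threshold: float) -> list[str]:
    """barrage_emotions holds one emotional word per barrage posted in the
    first period. Returns every emotional word whose share of all barrages
    exceeds the preset threshold, for which the viewing room would add a
    special-effect pendant or play a related sound effect."""
    total = len(barrage_emotions)
    if total == 0:
        return []
    counts = Counter(barrage_emotions)
    return [word for word, n in counts.items() if n / total > threshold]

# Example: effects_to_trigger(["happy", "happy", "sad", "happy"], 0.5) -> ["happy"]
```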
- the movie viewing interactive device 1300 provided by the embodiment of the present disclosure can execute the above method embodiments, and its implementation principles and technical effects are similar, and will not be described again in this embodiment.
- the movie-viewing interactive device 1300 in the embodiment of the present disclosure acquires the user image during the process of the user watching the target video; identifies the posture of the target object in the user image; determines the response posture of the target character in the target video based on the posture of the target object; adjusts the response posture of the target part of the target character based on the posture of the target part of the target object; and displays the response posture of the target character in the video frame of the target video based on the adjusted response posture of the target part of the target character.
- the computer-readable storage medium of the embodiment of the present disclosure is used to store a computer program, and the computer program can be executed by a processor to implement each step in the method embodiment shown in Figure 1.
- the disclosed methods and devices can be implemented in other ways.
- the device embodiments described above are only illustrative.
- the division of the units is only a logical function division. In actual implementation, there may be other division methods.
- multiple units or components may be combined or integrated into another system, or some features may be ignored or not implemented.
- the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical, mechanical or other forms.
- each functional unit in the various embodiments of the present disclosure may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
- the above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
- the above-mentioned integrated unit implemented in the form of a software functional unit can be stored in a computer-readable storage medium.
- the above-mentioned software functional unit is stored in a storage medium and includes a number of instructions to cause a computer device (which may be a personal computer, a server, a network device, etc.) to execute some of the steps of the methods described in the various embodiments of the present disclosure.
- the aforementioned storage media include: USB flash drives, removable hard disks, read-only memory (ROM), random access memory (RAM), magnetic disks, optical discs, and other media that can store program code.
Abstract
The present disclosure relates to the field of information technology. Provided are a film-watching interaction method and apparatus, and a computer-readable storage medium. The method comprises: during the process of a user watching a target video, acquiring a user image (101); identifying the posture of a target object in the user image (102); on the basis of the posture of the target object, determining a response posture of a target character in the target video (103); on the basis of the posture of a target part of the target object, adjusting a response posture of a target part of the target character (104); and on the basis of the adjusted response posture of the target part of the target character, displaying the response posture of the target character in a video picture of the target video (105).
Description
Cross-reference to related applications
This application claims priority to Chinese Patent Application No. 202210624394.2, filed in China on June 2, 2022, the entire content of which is incorporated herein by reference.
The present disclosure relates to the field of information technology, and in particular to a movie-viewing interaction method, device, and computer-readable storage medium.
In the related art, users can usually only watch the currently playing video resource or post barrages to interact during movie viewing; moreover, current viewing experiences are designed mainly for the audience as a whole, so the interaction methods are rather limited and not very engaging.
Summary of the invention
Embodiments of the present disclosure provide a movie-viewing interaction method, device, and computer-readable storage medium, to solve the problem that related movie-viewing interaction methods are rather limited and not very engaging.
In a first aspect, embodiments of the present disclosure provide a movie-viewing interaction method, including:
while the user is watching a target video, acquiring a user image;
identifying the posture of a target object in the user image;
determining a response posture of a target character in the target video based on the posture of the target object;
adjusting a response posture of a target part of the target character based on the posture of a target part of the target object;
displaying the response posture of the target character in a video frame of the target video based on the adjusted response posture of the target part of the target character.
In a second aspect, embodiments of the present disclosure further provide a movie-viewing interaction device, including:
a first acquisition module, configured to acquire a user image while the user is watching a target video;
a recognition module, configured to identify the posture of a target object in the user image;
a first determination module, configured to determine a response posture of a target character in the target video based on the posture of the target object;
a first adjustment module, configured to adjust a response posture of a target part of the target character based on the posture of a target part of the target object;
a first display module, configured to display the response posture of the target character in a video frame of the target video based on the adjusted response posture of the target part of the target character.
In a third aspect, embodiments of the present disclosure further provide a computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the movie-viewing interaction method described above.
In the embodiments of the present disclosure, a user image is acquired while the user watches a target video; the posture of a target object in the user image is identified; a response posture of a target character in the target video is determined based on the posture of the target object; the response posture of a target part of the target character is adjusted based on the posture of a target part of the target object; and, based on the adjusted response posture of the target part of the target character, the response posture of the target character is displayed in the video frame of the target video. In this way, the user can interact with the characters in the video, so the interaction is not limited to barrages, which makes it more engaging.
In order to explain the technical solutions of the embodiments of the present disclosure more clearly, the drawings needed in the description of the embodiments are briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present disclosure; for those of ordinary skill in the art, other drawings can be obtained from these drawings without creative effort.
Figure 1 is a flow chart of a movie-viewing interaction method provided by an embodiment of the present disclosure;
Figure 2 is a schematic diagram of user communication in a viewing room provided by an embodiment of the present disclosure;
Figure 3 is a schematic diagram of salient target detection provided by an embodiment of the present disclosure;
Figure 4 is a schematic diagram of skeleton detection provided by an embodiment of the present disclosure;
Figure 5 is a schematic diagram of continuous action detection provided by an embodiment of the present disclosure;
Figure 6 is a schematic diagram of skeleton key point matching provided by an embodiment of the present disclosure;
Figure 7 is a schematic diagram of three-dimensional gesture modeling provided by an embodiment of the present disclosure;
Figure 8 is a schematic diagram of the minimum circumscribed sphere projection provided by an embodiment of the present disclosure;
Figure 9 is a schematic diagram of hand spatial coordinates provided by an embodiment of the present disclosure;
Figure 10 is a schematic diagram of spatial angles provided by an embodiment of the present disclosure;
Figure 11 is a schematic diagram of the gesture interaction effect provided by an embodiment of the present disclosure;
Figure 12 is a schematic diagram of the auxiliary spectrogram provided by an embodiment of the present disclosure;
Figure 13 is a structural diagram of a movie-viewing interaction device provided by an embodiment of the present disclosure.
The technical solutions in the embodiments of the present disclosure will be described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present disclosure. Based on the embodiments of the present disclosure, all other embodiments obtained by those of ordinary skill in the art without creative effort fall within the scope of protection of the present disclosure.
Referring to Figure 1, Figure 1 is a flow chart of a movie-viewing interaction method provided by an embodiment of the present disclosure. As shown in Figure 1, it includes the following steps:
Step 101: Acquire the user image while the user is watching the target video.
The target video can be any video the user is currently watching.
Embodiments of the present disclosure can support body or facial-expression interaction between the viewing user and a character in the video: the user can strike a pose or make an expression, and the character in the video responds accordingly. For example, when the user makes a hugging gesture, the character in the video responds with a hug; when the user raises the right hand for a handshake, the character raises a hand in response.
In this step, the user image can be acquired while the user watches the target video; for example, when the user enters physical interaction, user images can be captured and uploaded in real time through a camera.
Optionally, before step 101, the method further includes:
receiving an input from a first user regarding a first barrage while watching the target video, where the input is used to initiate an invitation to a second user to create a viewing room, and the second user is the publisher of the first barrage;
when the second user accepts the invitation, creating a viewing room for the first user and the second user, and playing the target video in the viewing room.
Step 101 then includes:
acquiring the user image while the user watches the target video in the viewing room.
That is, in one implementation, users can create a viewing room while watching a video, and members of the viewing room can interact within it.
Specifically, after logging into the video application, the user can set whether to enable the viewing room function. If enabled, the user has permission to create viewing rooms; otherwise, like ordinary viewers, the user cannot create one. The user can also set whether to accept viewing room invitations from other users: if enabled, invitations pushed by the system are allowed; otherwise they are blocked.
When user A clicks on any video and barrages float across the screen, user A can click an interesting barrage to initiate an invitation to create a viewing room. The invitation is sent directly through the video application to the other party's account, i.e. the account of the user who posted that barrage. If the invited user B is offline at the time, the system can directly notify user A; if user B is online, user B can accept or reject the invitation. If user B accepts, the viewing room is created successfully; at this point the room contains user A and user B, and users who have not been invited cannot enter it.
The user who creates the viewing room is its administrator; invitations sent by other users must be approved by the administrator, and invitation messages are pushed by the video platform directly to the invited users. Users come to the viewing room to watch the video synchronously. Room members can initiate interactions, and all messages are visible only to room members; when the room is disbanded, all messages are cleared, improving the privacy of viewing discussions. Figure 2 shows user communication in a viewing room: client users submit messages to the server, and the server completes the message exchange so that the messages are shown on the video playback interface.
While watching the video, members of the viewing room can engage in physical interaction, scene-immersion interaction, and barrage-based video rendering.
In this way, users can watch as a group by creating a viewing room and interact privately within it, which improves both the fun of viewing interaction and the privacy of group viewing.
Step 102: Identify the posture of the target object in the user image.
In this step, the target object in the user image, usually the viewing user, can be identified based on the acquired image, and the posture of the target object, i.e. its body movements or expressions, can be determined through image recognition and posture detection.
Optionally, step 102 includes:
identifying the target object in the user image through a target detection algorithm;
constructing a skeletal model of the target object;
determining the posture of the target object based on its skeletal model.
In one implementation, to identify the posture of the target object in the user image, the target object can first be identified through a target detection algorithm. For example, a saliency detection algorithm can outline the target and return a target position result {x, y, w, h}, where x and y are the horizontal and vertical coordinates of the center of the target rectangle, w is the width of the rectangle, and h is its height.
As shown in panel (a) of Figure 3, the image background contains multiple people and buildings; the detected salient target is shown in panel (b) of Figure 3, inside the rectangular frame.
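As an illustration of this step only (the disclosure does not prescribe a particular detector, so the spectral-residual saliency model from opencv-contrib and the Otsu thresholding below are assumptions), the {x, y, w, h} result could be produced like this:

```python
import cv2
import numpy as np

def salient_target_box(image: np.ndarray):
    """Return {x, y, w, h} for the most salient object, where (x, y) is the
    center of the bounding rectangle and w, h are its width and height."""
    saliency = cv2.saliency.StaticSaliencySpectralResidual_create()
    ok, sal_map = saliency.computeSaliency(image)
    if not ok:
        return None
    gray = (sal_map * 255).astype("uint8")
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)
    contours, _ = cv2.findContours(binary, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return None
    x0, y0, w, h = cv2.boundingRect(max(contours, key=cv2.contourArea))
    return {"x": x0 + w / 2, "y": y0 + h / 2, "w": w, "h": h}
```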
Next, a skeletal model can be constructed for the outline of the target person inside the rectangular frame, based on the salient target detection result. Specifically, the skeletal model of the target object can be built from the key points of the key parts of the target object within the rectangle.
For example, the detection result on the original image is shown in panel (a) of Figure 4, and the skeletal model abstracted from it is shown in panel (b). The model contains 15 key points, corresponding to 15 key parts of the human body: point p1 corresponds to the head, point p2 to the right hand, and point p15 to the left foot.
In this way, the posture of the target object can be determined from its skeletal model; for instance, from the skeletal model in panel (b) of Figure 4, it can be determined that the person is standing with limbs extended.
Furthermore, since a single image can hardly reveal the target object's main action, the specific action can be determined by combining shot transitions and continuity across frames. For example, if the skeletal models detected over a continuous user video clip are as shown in Figure 5, the posture result returned by the algorithm is dancing, encapsulated as {result: 'dance'}.
In this way, through this implementation, the posture of the target object in the user image can be accurately recognized.
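A trained sequence model would normally make this classification; purely as a toy sketch (the thresholds, keypoint index mapping, and the 'dance' heuristic below are invented for illustration), continuity over a clip of 15-keypoint frames could be checked like this:

```python
import numpy as np

HEAD, RIGHT_HAND = 0, 1  # assumed indices into the 15-keypoint model (p1, p2)

def classify_clip(frames: list[np.ndarray]) -> dict:
    """frames: per-frame arrays of shape (15, 2) holding (x, y) keypoints.
    Large frame-to-frame motion with the hand above the head is read as
    'dance'; anything else as 'stand'."""
    if len(frames) < 2:
        return {"result": "stand"}
    motion = np.mean([np.linalg.norm(b - a) for a, b in zip(frames, frames[1:])])
    # Image y grows downward, so hand above head means a smaller y value.
    hand_high = np.mean([f[RIGHT_HAND, 1] < f[HEAD, 1] for f in frames])
    return {"result": "dance" if motion > 5.0 and hand_high > 0.5 else "stand"}
```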
Step 103: Determine the response posture of the target character in the target video based on the posture of the target object.
After the posture of the target object is determined, the response posture of the target character in the target video can be determined from it. Specifically, a similar method such as a saliency detection algorithm can be used to locate the target character in the target video, usually the character occupying the largest area of the video frame, and the response posture of the target character is then determined. The response posture can be essentially the same posture as that of the target object: for example, if the detected user posture is a handshake, the character in the video should also make a handshake gesture in response to the user's physical interaction; if the user is detected making a hugging gesture, the character should also make a hugging gesture in response.
Optionally, after step 103, the method further includes:
obtaining a skeletal model of the target character in the target video;
matching skeleton key points between the skeletal model of the target object and the skeletal model of the target character, so as to adjust the response posture of the target character.
That is, in one implementation, the response posture of the target character can be further refined so that the interaction between the viewing user and the character is more detailed.
Specifically, the skeletal model of the target character in the target video can be obtained, for example by driving it into a suitable body pose. The plausibility of the interactive posture is then ensured through skeleton key point matching: the key points of the target object's skeletal model are matched to those of the target character's skeletal model so that corresponding joints occupy similar spatial positions, ensuring the target character strikes essentially the same pose as the target object. For example, if the user extends the right hand for a handshake, the character must also extend the right hand, not the left. As shown in Figure 6, in skeleton key point matching, p1 in the skeletal model of the viewing user on the left corresponds to B1 in the skeletal model of the video character on the right, p2 corresponds to B2, and so on.
In this way, skeleton key point matching makes the interaction between the user and the characters in the video one step more detailed.
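A minimal sketch of this one-to-one matching (assuming dictionary representations and the p-to-B index correspondence of Figure 6; a real system would also respect the character rig's joint constraints):

```python
import numpy as np

def match_skeletons(user_pts: dict[str, np.ndarray],
                    char_pts: dict[str, np.ndarray]) -> dict[str, np.ndarray]:
    """Move each character joint Bi to the position of the matching user
    keypoint pi (p1 -> B1, p2 -> B2, ...), so that e.g. a raised right hand on
    the user side produces a raised right hand on the character side."""
    adjusted = dict(char_pts)
    for i in range(1, 16):  # the 15 keypoints of the skeletal model
        if f"p{i}" in user_pts:
            adjusted[f"B{i}"] = user_pts[f"p{i}"]
    return adjusted
```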
Step 104: Adjust the response posture of the target part of the target character based on the posture of the target part of the target object.
In the embodiments of the present disclosure, posture adjustment can also be performed for fine-grained parts of the viewing user and the video character, such as gestures and facial expressions, to determine the corresponding interactive posture.
In this step, the response posture of the target part of the target character can be adjusted based on the posture of the target part of the target object, so that the target character makes a gesture that echoes the target object. Based on the adjusted posture of the target part, the response posture of the target character can then be displayed in the video frame, with the response posture of the target character's target part highlighted.
Step 105: Display the response posture of the target character in the video frame of the target video based on the adjusted response posture of the target part of the target character.
In this step, the response posture of the target character can be shown in the video frame, i.e. a display effect of the character making the response posture is generated in the frame, realizing interaction between the viewing user and the character. When displaying the response posture, the response posture of the target part can be highlighted to produce a vivid interactive viewing effect.
Optionally, step 105 includes:
displaying, in the video frame, the interaction effect between the target part of the target object and the target part of the target character, based on the posture of the target part of the target object and the adjusted response posture of the target part of the target character.
In one implementation, corresponding interactive postures can be determined for fine-grained parts of the viewing user and the video character, such as gestures and facial expressions.
That is, the response posture of the target part of the target character is adjusted based on the posture of the target part of the target object so that the character makes an echoing gesture, and the interaction effect between the two target parts is shown in the video frame: the posture of the target object's target part is projected into the frame, the response posture of the character's target part is displayed, and together they form an interactive posture. For example, when the user raises the right hand, the character can also raise a hand, and the frame shows the two raised hands touching, creating a through-the-screen touch effect; when the user makes a funny face, the character can make one too, creating a face-pulling interaction between user and character.
In this way, the viewing user can have more detailed, part-specific interaction with the video character, improving the interactive experience.
Optionally, step 104 includes:
determining a first minimum circumscribed sphere enclosing the three-dimensional model of the target part of the target object;
determining a second minimum circumscribed sphere enclosing the three-dimensional model of the target part of the target character;
projecting the first minimum circumscribed sphere into the video frame of the target video according to the position of its center;
adjusting the position of the second minimum circumscribed sphere according to the projected position of the first minimum circumscribed sphere in the video frame;
adjusting the response posture of the target part of the target character according to the adjusted position of the second minimum circumscribed sphere.
In a specific implementation, the minimum circumscribed sphere of the three-dimensional model of the target part of the target object and that of the target character can be computed first, i.e. the smallest sphere that completely encloses each three-dimensional model. As shown in Figure 7, the target part, such as a gesture, can be modeled in three dimensions, and the minimum circumscribed sphere of the model enclosing the target part is then computed. As shown in Figure 8, suppose the minimum circumscribed sphere of the character's target part model in the video is V1 and that of the user's target part model is V2, with centers M1 and M2 respectively.
Next, the sphere V2 can be projected into the video frame of the target video according to its center coordinate M2, i.e. the three-dimensional model of the user's target part is projected into the frame for display. The coordinate position of V1 can then be adjusted according to the projected position of V2 in the frame, so that the position of V1 is spatially aligned with that of V2, and the response posture of the character's target part is adapted according to the adjusted position of V1.
Through this adjustment, the posture of the target part of the target object is mapped into the video frame, and by adjusting the response posture of the character's target part, the two form an interactive posture in the frame.
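One way to obtain such a "minimum circumscribed ball" is Ritter's classic approximation, sketched below; this gives an approximate, not exact, minimal sphere, and an exact solver such as Welzl's algorithm could be substituted:

```python
import numpy as np

def bounding_sphere(points: np.ndarray):
    """Approximate minimal enclosing sphere of a 3D point cloud (N x 3),
    e.g. of the hand model, giving a candidate for V1 or V2 with center M."""
    p = points[0]
    q = points[np.argmax(np.linalg.norm(points - p, axis=1))]
    r = points[np.argmax(np.linalg.norm(points - q, axis=1))]
    center, radius = (q + r) / 2.0, np.linalg.norm(q - r) / 2.0
    for pt in points:  # grow the sphere until every point is covered
        d = np.linalg.norm(pt - center)
        if d > radius:
            radius = (radius + d) / 2.0
            center = center + (d - radius) / d * (pt - center)
    return center, radius
```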
Further, adjusting the position of the second minimum circumscribed sphere according to the projected position of the first minimum circumscribed sphere in the video frame includes:
adjusting the center position of the second minimum circumscribed sphere according to the projected center position of the first minimum circumscribed sphere in the video frame;
determining the position of each point of the target part of the target character according to the angle relationship between any two points in the three-dimensional model of the target part of the target character.
Adjusting the response posture of the target part of the target character according to the adjusted position of the second minimum circumscribed sphere includes:
adjusting the response posture of the target part of the target character according to the adjusted center position of the second minimum circumscribed sphere and the positions of the points of the target part of the target character.
More specifically, the center coordinate position of the sphere V1 can be adjusted according to the projected center coordinate M2' of the sphere V2 in the video frame, for example by aligning the two center coordinates.
For example, let W1 = {(x1, y1, z1), (x2, y2, z2), ..., (xn, yn, zn)} be the set of all points of the three-dimensional model of the target part of the target character. For any point A, the vector from A to the origin of the three-dimensional coordinate system and the vector from any point Q to the origin form an angle θ; the set of such angle relationships between any two points of the model is S = {θ1, θ2, θ3, ..., θm}. Suppose W1 is the set of model points of the character's target part and W2 is the set of model points of the user's target part. According to the angle relationships S between any two points of the character's target part model, the coordinates of the other points of the character's target part can be computed, yielding a result set W3; the response posture of the character's target part can then be adjusted according to the newly computed point coordinates W3.
In this way, the response posture of the target part of the target character can be adjusted accurately and in detail.
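Each element of the relation set S is simply the angle between two origin-anchored vectors; a minimal sketch:

```python
import numpy as np

def point_pair_angle(a: np.ndarray, q: np.ndarray) -> float:
    """Angle theta between the vectors from the coordinate origin to model
    points A and Q, i.e. one entry of S = {theta_1, ..., theta_m}."""
    cos_t = np.dot(a, q) / (np.linalg.norm(a) * np.linalg.norm(q) + 1e-12)
    return float(np.arccos(np.clip(cos_t, -1.0, 1.0)))
```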
Optionally, the target part is the hand;
step 104 then includes:
establishing a three-dimensional coordinate system with the first palm root node of the target object's hand as the origin, where the target object's hand includes thumb joint points, index finger joint points, middle finger joint points, ring finger joint points and little finger joint points;
calculating a first space angle, a second space angle, a third space angle and a fourth space angle, each with the first palm root node as the vertex: the first space angle has the first and second line segments as sides, the second has the second and third line segments, the third has the third and fourth line segments, and the fourth has the fourth and fifth line segments; the first line segment connects the first palm root node to the adjacent thumb joint point, the second to the adjacent index finger joint point, the third to the adjacent middle finger joint point, the fourth to the adjacent ring finger joint point, and the fifth to the adjacent little finger joint point;
calculating a first, a second, a third, a fourth and a fifth line-plane space angle: the first line-plane space angle is the space angle between the line segment where the thumb joint point of the target object's hand lies and a first plane, and the second, third, fourth and fifth line-plane space angles are defined likewise for the index finger, middle finger, ring finger and little finger joint points; the first plane is the plane formed by the z-axis and the y-axis of the three-dimensional coordinate system;
adjusting the positions of the thumb, index finger, middle finger, ring finger and little finger joint points of the target character's hand according to the first through fourth space angles and the first through fifth line-plane space angles.
In one implementation, the target part can be the hand, and the hand movements of the target character can be finely adjusted based on the hand movements of the target object.
As shown in Figure 9, a three-dimensional coordinate system can be established with the palm root node q1 of the target object's hand as the origin; the point set {q1, q2, q3, ..., q20} models 20 main joint points of the user's hand. Here, q1 corresponds to the palm root node; q6, q7 and q20 correspond to thumb joint points; q5, q8, q18 and q19 to index finger joint points; q4, q9, q16 and q17 to middle finger joint points; q3, q10, q14 and q15 to ring finger joint points; and q2, q11, q12 and q13 to little finger joint points.
As shown in Figure 10, ω1 is the space angle with q1 as the vertex and line segments q1q6 and q1q5 as sides; ω2 has q1 as the vertex and line segments q1q5 and q1q4 as sides; ω3 has q1 as the vertex and line segments q1q4 and q1q3 as sides; ω4 has q1 as the vertex and line segments q1q3 and q1q2 as sides.
The space angles ω1, ω2, ω3 and ω4 can then be calculated. Taking ω1 as an example, denote line segment q1q6 by the vector $\vec{a}$ and line segment q1q5 by the vector $\vec{b}$; ω1 is then computed as:

$$\omega_1 = \arccos\frac{\vec{a} \cdot \vec{b}}{|\vec{a}|\,|\vec{b}|}$$

The other angles ω2, ω3 and ω4 are calculated in the same way.
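The same arccos formula, written as a small helper with q1 as the vertex:

```python
import numpy as np

def vertex_angle(vertex: np.ndarray, p: np.ndarray, q: np.ndarray) -> float:
    """Space angle at `vertex` between segments vertex->p and vertex->q;
    vertex_angle(q1, q6, q5) gives omega_1."""
    a, b = p - vertex, q - vertex
    cos_w = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12)
    return float(np.arccos(np.clip(cos_w, -1.0, 1.0)))
```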
In this way, based on the computed space angles ω1, ω2, ω3 and ω4, the positions of the joint points of the target character's hand closest to its palm root node, namely the thumb joint point Q6, index finger joint point Q5, middle finger joint point Q4, ring finger joint point Q3 and little finger joint point Q2, can be adjusted so that the joint point positions of the character's hand correspond to those of the target object's hand.
Next, the positions of the remaining joint points of the target character's hand can also be adjusted. Specifically, the angles between the line segments q6q7, q5q8, q4q9, q3q10 and q2q11 and the plane z-q1-y can be calculated, denoted γ1, γ2, γ3, γ4 and γ5 respectively. Based on these line-plane space angles, the positions of the remaining thumb, index finger, middle finger, ring finger and little finger joint points of the character's hand, i.e. {Q7, Q8, Q9, ..., Q20}, can be adjusted so that the joint point positions of the character's hand correspond to those of the target object's hand, keeping the character's gesture essentially symmetrical to the user's gesture. The adjusted hand interaction effect can be as shown in Figure 11: the user's gesture and the character's gesture form a through-the-screen touch experience.
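Because the plane z-q1-y is the yz-plane of the hand coordinate system, its normal is the x-axis, which makes each γ easy to compute; a short sketch:

```python
import numpy as np

def line_plane_angle(seg_start: np.ndarray, seg_end: np.ndarray) -> float:
    """Angle gamma between a finger segment (e.g. q6 -> q7) and the plane
    spanned by the z- and y-axes; sin(gamma) = |v_x| / |v| since the plane's
    normal is the x-axis of the hand coordinate system."""
    v = seg_end - seg_start
    return float(np.arcsin(abs(v[0]) / (np.linalg.norm(v) + 1e-12)))
```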
Further, adjusting the positions of the thumb, index finger, middle finger, ring finger and little finger joint points of the target character's hand according to the first through fourth space angles and the first through fifth line-plane space angles includes:
adjusting the positions of the first thumb joint point, first index finger joint point, first middle finger joint point, first ring finger joint point and first little finger joint point of the target character's hand by making a fifth space angle equal to the first space angle, a sixth space angle equal to the second space angle, a seventh space angle equal to the third space angle, and an eighth space angle equal to the fourth space angle;
adjusting the positions of the remaining thumb, index finger, middle finger, ring finger and little finger joint points (those other than the first joint point of each finger) by making a sixth line-plane space angle equal to the first line-plane space angle, a seventh equal to the second, an eighth equal to the third, a ninth equal to the fourth, and a tenth equal to the fifth.
Here, the fifth space angle is the space angle with the second palm root node of the target character's hand as the vertex and the sixth and seventh line segments as sides; the sixth space angle has the seventh and eighth line segments as sides, the seventh has the eighth and ninth, and the eighth has the ninth and tenth, all with the second palm root node as the vertex. The sixth line segment connects the second palm root node to the adjacent first thumb joint point, the seventh to the adjacent first index finger joint point, the eighth to the adjacent first middle finger joint point, the ninth to the adjacent first ring finger joint point, and the tenth to the adjacent first little finger joint point.
The sixth line-plane space angle is the space angle between the line segment where the thumb joint point of the target character's hand lies and a second plane; the seventh, eighth, ninth and tenth line-plane space angles are defined likewise for the index finger, middle finger, ring finger and little finger joint points. The second plane is the plane formed by the z-axis and the y-axis of a three-dimensional coordinate system established with the second palm root node as the origin.
That is, for the character's hand model point set {Q1, Q2, Q3, ..., Q20} in the video, the corresponding space angles can be denoted ω1', ω2', ω3' and ω4', where Q1 corresponds to point q1 in the target object's hand model, Q2 to q2, Q3 to q3, and so on; ω1' corresponds to ω1, ω2' to ω2, ω3' to ω3, and ω4' to ω4.
In this way, the coordinates of points Q2, Q3, Q4, Q5 and Q6 of the character's hand can be adjusted first. The current coordinate values of Q5 and Q6, i.e. the coordinates before adjustment, are known. Denote line segment Q1Q6 by the vector $\vec{u}$ and line segment Q1Q5 by the vector $\vec{v}$. Fixing the coordinates of Q6 and using the angle formula with the value of ω1, a set D of n vector solutions can be obtained; fixing the coordinates of Q5, a set E of m vector solutions can be obtained in the same way. The solution vectors in sets D and E are vectors starting from the origin; by subtracting the origin coordinates from the vector coordinates in set D, the set of all possible Q5 coordinate points can be computed, denoted M. Similarly, the set of all possible Q6 coordinate points can be computed and denoted G.
Then the spatial distance between the current point Q6 and every point in the set G can be calculated in turn, denoted L1. Suppose the original Q6 = (xq, yq, zq) and K = (x', y', z') is any point in G; the spatial distance between point Q6 and point K is computed as:

$$L_1 = \sqrt{(x_q - x')^2 + (y_q - y')^2 + (z_q - z')^2}$$
Similarly, the spatial distance value L2 between Q5 and every coordinate point in the set M is calculated in turn. The two coordinate points that minimize L1 + L2 while keeping the space angle equal to ω1 are then selected and denoted Q6' and Q5', where Q6' is the adjusted coordinate of Q6 and Q5' is the adjusted coordinate of Q5.
With the coordinates of Q5' known, the corresponding angle constraint (the space angle at Q1 between Q1Q5' and Q1Q4 set equal to ω2) is applied to obtain a set of candidate coordinate points satisfying the condition, denoted J. The spatial distance between the original point Q4 and every coordinate point in J is then calculated, and the coordinate with the smallest spatial distance is taken as the adjusted Q4 coordinate, denoted Q4'. The adjusted Q3 and Q2 values can be obtained successively in the same way.
Next, the positions of the remaining points {Q7, Q8, Q9, ..., Q20} can be adjusted. Specifically, the angle between Q6'Q7 and the plane z-Q1-y can be set equal to γ1, and a set of vector solutions for Q6'Q7 is obtained. Subtracting the coordinates of Q6' from the coordinates in the vectors gives the set of all possible Q7 points, denoted R. The spatial distance between the original Q7 and every value in R is calculated, and the point with the smallest spatial distance is the adjusted coordinate of Q7, denoted Q7'. In the same way, using the known coordinates of the previous node (for example, the previous node of Q7 is Q6 and the previous node of Q8 is Q5) together with the corresponding line-plane space angle, the final adjusted coordinate is determined by minimizing the spatial distance, and the values of all remaining coordinate points are adjusted in turn.
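The selection of each adjusted point from its candidate set reduces to a nearest-point search against the pre-adjustment coordinate; sketched as:

```python
import numpy as np

def pick_adjusted(original: np.ndarray, candidates: np.ndarray) -> np.ndarray:
    """From the candidate set produced by an angle constraint (e.g. set R for
    Q7), keep the candidate closest to the pre-adjustment coordinate; this is
    how Q7' and, in turn, Q8'...Q20' are chosen above."""
    dists = np.linalg.norm(candidates - original, axis=1)
    return candidates[int(np.argmin(dists))]
```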
The adjusted hand-interaction effect is shown in Figure 11: the user's gesture and the character's gesture in the video form a touch-across-the-screen interactive experience.

In this way, this implementation adjusts the positions of the joints of the character's hand in the video one by one, ultimately producing an interactive experience in which the user's gesture and the character's gesture touch across the screen, making the interaction more engaging.
Optionally, the method further includes:

when the user chooses to enter the performance mode, displaying a video dubbing interface and displaying an auxiliary spectrogram in the video dubbing interface, where the auxiliary spectrogram is generated from the original soundtrack of the target video;

acquiring the dubbing audio recorded by the user with reference to the auxiliary spectrogram;

integrating the dubbing audio into the target video for playback.
In one implementation, when a user watches a video in a created viewing room, the members of the room can initiate character dubbing: the members perform the character lines in the video, bringing themselves into the plot roles. Voiceprint matching rules can also be introduced to assist with dubbing the character lines, giving users a voiceprint reference during their voice performance.
After the viewing room is successfully created, the audio in the video can be automatically extracted and a voiceprint schematic, that is, the auxiliary spectrogram, can be generated, as shown by the solid line in Figure 12.
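The disclosure does not specify how the auxiliary spectrogram is produced. As one illustrative sketch, not the disclosed implementation, the extracted soundtrack could be rendered into a magnitude spectrogram with librosa; the file name and display choices are assumptions:

```python
import numpy as np
import librosa
import librosa.display
import matplotlib.pyplot as plt

# Load the soundtrack extracted from the video (path is illustrative).
y, sr = librosa.load("original_soundtrack.wav", sr=None)

# Short-time Fourier transform -> magnitude in dB, a standard spectrogram.
spec_db = librosa.amplitude_to_db(np.abs(librosa.stft(y)), ref=np.max)

# Render the auxiliary spectrogram; the solid line of Figure 12 would be
# a contour derived from a display like this.
librosa.display.specshow(spec_db, sr=sr, x_axis="time", y_axis="hz")
plt.title("Auxiliary spectrogram")
plt.show()
```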
The user can activate scene immersion and select a favorite character and video clip. At this point the user may choose a multi-person cooperative performance or a solo performance. In multi-person cooperation, each person selects a character and completes that character's voice part. In a solo performance, for the characters other than the one the user selected, either the original video sound or dubbing uploaded by other users can be chosen.

The video picture then switches out of viewing mode into the video dubbing interface, and the original auxiliary voiceprint, that is, the auxiliary spectrogram, pops up in the interface. During dubbing, the user can refer to the auxiliary voiceprint to adjust intonation and speaking speed and thus deliver a better voice performance.

In multi-person cooperation mode, the voices of the participants are collected separately and integrated into a complete video; in solo mode, the user's voice is synthesized with the original voices of the other characters or with other users' voices, yielding a video dubbed by the user. This achieves direct interaction among the users in the viewing room and between users and the video, making viewing more enjoyable.
Further, after the acquiring the dubbing audio recorded by the user with reference to the auxiliary spectrogram, the method further includes:

adjusting the dubbing audio so that the adjusted dubbing audio matches the auxiliary spectrogram;

and the integrating the dubbing audio into the target video for playback includes:

integrating the adjusted dubbing audio into the target video for playback.
In this implementation, voiceprint matching rules can also be introduced to modify the user's voice: for a user voice already recorded into the system, the voiceprint is improved through automatic voiceprint matching so that the user's voice better fits the character.
Specifically, after the user finishes dubbing, the system can match the recorded voiceprint against the original voiceprint. As shown in Figure 12, the voiceprint (spectrum) recorded by the user is shown as a dotted line and the original voiceprint as a solid line. Some deviation between the user's voiceprint and the original video's voiceprint is inevitable; in this implementation, the variance within the vocal range can be computed to match the vocal-range values of the original voiceprint, improving how closely the user's voice fits the character.
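The disclosure describes this step only as matching the original voiceprint's vocal-range values by computing the in-range variance. One simple reading, sketched here purely as an assumption, is a mean/variance normalization of the user's voiceprint values toward the original's:

```python
import numpy as np

def match_voice_range(user_points, original_points):
    """Rescale the user's voiceprint so its mean and variance match the
    original's; one possible reading of 'matching via in-range variance'."""
    user = np.asarray(user_points, dtype=float)
    orig = np.asarray(original_points, dtype=float)
    u_std = user.std() or 1.0                      # avoid division by zero
    return (user - user.mean()) / u_std * orig.std() + orig.mean()
```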
Further, after the acquiring the dubbing audio recorded by the user with reference to the auxiliary spectrogram, the method further includes:

calculating a score for the dubbing audio according to the difference between the dubbing audio and each frequency point in the auxiliary spectrogram;

and the integrating the dubbing audio into the target video for playback includes:

integrating the dubbing audio into the target video for playback, and displaying the score.
In one implementation, voiceprint matching rules can also be introduced into a user scoring mechanism: all users' dubbings of the same character are scored, and the top-scoring ones are included in a leaderboard collection.
Specifically, after the user completes the performance, the difference between each voiceprint point in the user's dubbing audio and the original voiceprint can be calculated. Assume the original voiceprint has n voiceprint points. When the difference between the user's voiceprint and the original is 0% of the original value, one unit point is awarded; when the difference exceeds 50% of the original value, 0 unit points are awarded; and when the difference is between 0% and 50% of the original value, the unit score drops by 0.2 for every 10 percentage points of deviation. The unit scores of the voiceprint points form the set λ, the voiceprint set recorded by the user is X, and the original voiceprint set is N. If a user voiceprint value is x_i and the corresponding original voiceprint value is n_i, the user's total score R is calculated as:

$$d_i = \frac{|x_i - n_i|}{n_i}\times 100\%, \qquad \lambda_i = \begin{cases}1 - 0.02\,d_i, & 0 \le d_i \le 50 \\ 0, & d_i > 50\end{cases}, \qquad R = \sum_{i=1}^{n}\lambda_i$$
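The per-point rule maps directly to code. A minimal sketch, taking the total score R as the sum of the unit scores as in the reconstruction above (the input sequences are assumed to be parallel and the original values nonzero):

```python
def unit_score(x_i, n_i):
    """Unit score for one voiceprint point: 1 at 0% deviation, minus 0.2
    for every 10 percentage points, and 0 at 50% deviation or more."""
    deviation = abs(x_i - n_i) / n_i * 100.0       # percent of original value
    return max(0.0, 1.0 - 0.02 * deviation)

def total_score(user_points, original_points):
    """Total score R: the sum of unit scores over all n voiceprint points."""
    return sum(unit_score(x, n) for x, n in zip(user_points, original_points))
```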
In this way, after the user's dubbing audio has been scored, the score can be displayed in the user's dubbed video; in multi-person cooperation mode, each user's dubbing score can be displayed in the synthesized dubbed video, sorted from highest to lowest.

Thus, this scene-immersion mode effectively promotes interaction between the video and the users in the viewing room, as well as among the users themselves.
Optionally, the method further includes:

acquiring the barrages, and the number of barrages, posted by users within a first period while watching the target video in the viewing room;

determining the emotional words corresponding to the barrages posted within the first period, and counting the number of each emotional word;

when the ratio of the count of a first emotional word to the number of barrages is greater than a preset threshold, adding a special-effect widget related to the first emotional word in the viewing room, or playing a sound effect related to the first emotional word, where the first emotional word is any of the emotional words.
In one implementation, the video scene can also be rendered based on the barrages in the viewing room: the corresponding emotional words are extracted through semantic analysis of the barrages, and special-effect widgets, sound effects and the like are applied according to the different emotional words.
A specific implementation is as follows. Users in the viewing room can post barrages while watching the video. The number of barrages M within a certain period is counted, for example the number of barrages within 10 minutes. From each barrage in that period, known emotional words such as horror, joy or sadness are extracted; the extraction can use keyword matching or semantic understanding based on natural-language-processing algorithms. The number of emotional words N is then counted, and scene rendering is triggered when N/M exceeds a certain threshold. Since different viewers perceive the same plot differently and therefore express different emotions, a similarity analysis is first performed on the emotional words to compute one or a few representative emotional words, after which the count of each emotional word is taken. Based on the computed emotional words, related special-effect widgets are automatically added to the video picture played in the viewing room, and the emotional words can also be used as search terms to fetch and play matching sound effects. For example, for a joyful emotional word, a falling-stars effect can be added to the video picture and a cheerful sound effect can be played.
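A minimal sketch of this trigger, using the keyword-matching variant; the keyword table and the threshold value are illustrative assumptions:

```python
from collections import Counter

EMOTION_KEYWORDS = {"horror", "joy", "sadness"}     # illustrative keyword table

def scene_rendering_trigger(barrages, threshold=0.3):
    """Count emotional words over the period's M barrages; when N/M exceeds
    the threshold, return the representative (most frequent) emotion."""
    m = len(barrages)
    if m == 0:
        return None
    counts = Counter()
    for text in barrages:
        for word in EMOTION_KEYWORDS:               # keyword-matching variant
            if word in text.lower():
                counts[word] += 1
    n = sum(counts.values())                        # total emotional words N
    if n / m <= threshold:
        return None
    return counts.most_common(1)[0][0]              # e.g. "joy" -> star effect
```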
In this way, the scene rendering in this implementation makes the plot more lively and further increases the fun of the interaction.

Existing mass-viewing technology lacks personalized customization for particular user groups. The present disclosure proposes a barrage-based viewing-room interaction method. Users can build a viewing room with like-minded users, for example a group of followers of a particular celebrity watching simultaneously, with better viewing privacy inside the room. Various interactions can also take place in the viewing room, such as physical interaction, scene immersion and barrage-driven rendering, which greatly increase the fun of video watching and turn the audience from viewers into participants in the video performance.
In the film-watching interaction method of the embodiments of the present disclosure, a user image is acquired while the user watches a target video; the posture of a target object in the user image is identified; a response posture of a target character in the target video is determined based on the posture of the target object; the response posture of a target part of the target character is adjusted based on the posture of the corresponding target part of the target object; and the response posture of the target character is displayed in the video picture of the target video based on the adjusted response posture of the target part. In this way, the user can interact with the characters in the video, so the interaction is not limited to barrages, making it more engaging.
An embodiment of the present disclosure further provides a film-watching interaction apparatus. Refer to Figure 13, which is a structural diagram of the film-watching interaction apparatus provided by an embodiment of the present disclosure. Since the apparatus solves the problem on a principle similar to that of the film-watching interaction method in the embodiments of the present disclosure, its implementation may refer to the implementation of the method, and repeated details are not restated.

As shown in Figure 13, the film-watching interaction apparatus 1300 includes:
a first acquisition module 1301, configured to acquire a user image while the user watches a target video;

a recognition module 1302, configured to recognize the posture of a target object in the user image;

a first determination module 1303, configured to determine, based on the posture of the target object, a response posture of a target character in the target video;

a first adjustment module 1304, configured to adjust the response posture of a target part of the target character based on the posture of the corresponding target part of the target object;

a first display module 1305, configured to display the response posture of the target character in the video picture of the target video based on the adjusted response posture of the target part of the target character.
Optionally, the recognition module 1302 includes:

an identification unit, configured to identify the target object in the user image through a target detection algorithm;

a construction unit, configured to construct a skeletal model of the target object;

a first determination unit, configured to determine the posture of the target object based on the skeletal model of the target object.
Optionally, the film-watching interaction apparatus 1300 further includes:

a second acquisition module, configured to acquire a skeletal model of the target character in the target video;

a matching module, configured to perform skeletal key-point matching between the skeletal model of the target object and the skeletal model of the target character, so as to adjust the response posture of the target character.
Optionally, the first display module 1305 is further configured to display, in the video picture, the interaction effect between the target part of the target object and the target part of the target character, based on the posture of the target part of the target object and the adjusted response posture of the target part of the target character.
Optionally, the first adjustment module 1304 includes:

a second determination unit, configured to determine a first minimum circumscribed sphere enclosing a three-dimensional model of the target part of the target object;

a third determination unit, configured to determine a second minimum circumscribed sphere enclosing a three-dimensional model of the target part of the target character;

a projection unit, configured to project the first minimum circumscribed sphere into the video picture of the target video according to the sphere-center position of the first minimum circumscribed sphere;

a first adjustment unit, configured to adjust the position of the second minimum circumscribed sphere according to the projected position of the first minimum circumscribed sphere in the video picture;

a second adjustment unit, configured to adjust the response posture of the target part of the target character according to the adjusted position of the second minimum circumscribed sphere.
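The disclosure does not say how the minimum circumscribed sphere is computed. A common inexpensive approximation is Ritter's bounding sphere; the sketch below is offered under that assumption:

```python
import numpy as np

def ritter_bounding_sphere(points):
    """Approximate minimum enclosing sphere of a 3-D point cloud
    (Ritter's algorithm); returns (center, radius)."""
    pts = np.asarray(points, dtype=float)
    x = pts[0]
    y = pts[np.argmax(np.linalg.norm(pts - x, axis=1))]   # farthest from x
    z = pts[np.argmax(np.linalg.norm(pts - y, axis=1))]   # farthest from y
    center, radius = (y + z) / 2.0, np.linalg.norm(z - y) / 2.0
    for p in pts:                                         # grow to cover outliers
        d = np.linalg.norm(p - center)
        if d > radius:
            radius = (radius + d) / 2.0
            center += (d - radius) / d * (p - center)
    return center, radius
```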
Optionally, the first adjustment unit includes:

a first adjustment subunit, configured to adjust the sphere-center position of the second minimum circumscribed sphere according to the projected sphere-center position of the first minimum circumscribed sphere in the video picture;

a determination subunit, configured to determine the position of each point of the target part of the target character according to the angle relationship between any two points in the three-dimensional model of the target part of the target character;

the second adjustment unit is configured to adjust the response posture of the target part of the target character according to the adjusted sphere-center position of the second minimum circumscribed sphere and the positions of the points of the target part of the target character.
Optionally, the target part is a hand; the first adjustment module 1304 includes:

an establishing unit, configured to establish a three-dimensional coordinate system with a first palm root node of the hand of the target object as the origin, where the hand of the target object includes thumb joint points, index-finger joint points, middle-finger joint points, ring-finger joint points and little-finger joint points;
a first calculation unit, configured to calculate a first space angle, a second space angle, a third space angle and a fourth space angle, where the first space angle is the space angle with the first palm root node as the vertex and a first line segment and a second line segment as sides, the second space angle is the space angle with the first palm root node as the vertex and the second line segment and a third line segment as sides, the third space angle is the space angle with the first palm root node as the vertex and the third line segment and a fourth line segment as sides, and the fourth space angle is the space angle with the first palm root node as the vertex and the fourth line segment and a fifth line segment as sides; the first line segment is the line segment from the first palm root node to the adjacent thumb joint point, the second line segment is the line segment from the first palm root node to the adjacent index-finger joint point, the third line segment is the line segment from the first palm root node to the adjacent middle-finger joint point, the fourth line segment is the line segment from the first palm root node to the adjacent ring-finger joint point, and the fifth line segment is the line segment from the first palm root node to the adjacent little-finger joint point;
a second calculation unit, configured to calculate a first line-plane space angle, a second line-plane space angle, a third line-plane space angle, a fourth line-plane space angle and a fifth line-plane space angle, where the first line-plane space angle is the space angle between the line segment on which the thumb joint points of the hand of the target object lie and a first plane, the second line-plane space angle is the space angle between the line segment on which the index-finger joint points lie and the first plane, the third line-plane space angle is the space angle between the line segment on which the middle-finger joint points lie and the first plane, the fourth line-plane space angle is the space angle between the line segment on which the ring-finger joint points lie and the first plane, and the fifth line-plane space angle is the space angle between the line segment on which the little-finger joint points lie and the first plane; the first plane is the plane formed by the z-axis and the y-axis of the three-dimensional coordinate system;
a third adjustment unit, configured to adjust the positions of the thumb joint points, index-finger joint points, middle-finger joint points, ring-finger joint points and little-finger joint points of the hand of the target character according to the first, second, third and fourth space angles and the first, second, third, fourth and fifth line-plane space angles.
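Both kinds of angle used by these units reduce to elementary vector arithmetic. A small sketch with illustrative helper names:

```python
import numpy as np

def space_angle(vertex, p_a, p_b):
    """Space angle at `vertex` between segments vertex->p_a and vertex->p_b."""
    u = np.asarray(p_a, float) - np.asarray(vertex, float)
    v = np.asarray(p_b, float) - np.asarray(vertex, float)
    cos_t = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return float(np.arccos(np.clip(cos_t, -1.0, 1.0)))

def line_plane_angle(p_start, p_end, plane_normal):
    """Angle between segment p_start->p_end and a plane given by its normal;
    for the z-y plane of the text, the normal is the x-axis (1, 0, 0)."""
    d = np.asarray(p_end, float) - np.asarray(p_start, float)
    n = np.asarray(plane_normal, float)
    sin_t = abs(np.dot(d, n)) / (np.linalg.norm(d) * np.linalg.norm(n))
    return float(np.arcsin(np.clip(sin_t, 0.0, 1.0)))
```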
Optionally, the third adjustment unit includes:

a second adjustment subunit, configured to set a fifth space angle equal to the first space angle, a sixth space angle equal to the second space angle, a seventh space angle equal to the third space angle and an eighth space angle equal to the fourth space angle, and to adjust the positions of a first thumb joint point, a first index-finger joint point, a first middle-finger joint point, a first ring-finger joint point and a first little-finger joint point of the hand of the target character;

a third adjustment subunit, configured to set a sixth line-plane space angle equal to the first line-plane space angle, a seventh line-plane space angle equal to the second line-plane space angle, an eighth line-plane space angle equal to the third line-plane space angle, a ninth line-plane space angle equal to the fourth line-plane space angle and a tenth line-plane space angle equal to the fifth line-plane space angle, and to adjust the positions of the thumb joint points other than the first thumb joint point, the index-finger joint points other than the first index-finger joint point, the middle-finger joint points other than the first middle-finger joint point, the ring-finger joint points other than the first ring-finger joint point and the little-finger joint points other than the first little-finger joint point of the hand of the target character;

where the fifth space angle is the space angle with a second palm root node of the hand of the target character as the vertex and a sixth line segment and a seventh line segment as sides, the sixth space angle is the space angle with the second palm root node as the vertex and the seventh line segment and an eighth line segment as sides, the seventh space angle is the space angle with the second palm root node as the vertex and the eighth line segment and a ninth line segment as sides, and the eighth space angle is the space angle with the second palm root node as the vertex and the ninth line segment and a tenth line segment as sides; the sixth line segment is the line segment from the second palm root node to the adjacent first thumb joint point, the seventh line segment is the line segment from the second palm root node to the adjacent first index-finger joint point, the eighth line segment is the line segment from the second palm root node to the adjacent first middle-finger joint point, the ninth line segment is the line segment from the second palm root node to the adjacent first ring-finger joint point, and the tenth line segment is the line segment from the second palm root node to the adjacent first little-finger joint point;

the sixth line-plane space angle is the space angle between the line segment on which the thumb joint points of the hand of the target character lie and a second plane, the seventh line-plane space angle is the space angle between the line segment on which the index-finger joint points lie and the second plane, the eighth line-plane space angle is the space angle between the line segment on which the middle-finger joint points lie and the second plane, the ninth line-plane space angle is the space angle between the line segment on which the ring-finger joint points lie and the second plane, and the tenth line-plane space angle is the space angle between the line segment on which the little-finger joint points lie and the second plane; the second plane is the plane formed by the z-axis and the y-axis of a three-dimensional coordinate system established with the second palm root node as the origin.
Optionally, the film-watching interaction apparatus 1300 further includes:

a receiving module, configured to receive an input from a first user regarding a first barrage while the first user watches the target video, where the input is used to initiate a viewing-room creation invitation to a second user, the second user being the user who posted the first barrage;

a creation module, configured to create a viewing room for the first user and the second user when the second user accepts the viewing-room creation invitation, and to play the target video in the viewing room;
the first acquisition module 1301 is configured to acquire the user image while the user watches the target video in the viewing room.
Optionally, the film-watching interaction apparatus 1300 further includes:

a second display module, configured to display a video dubbing interface when the user chooses to enter the performance mode, and to display an auxiliary spectrogram in the video dubbing interface, where the auxiliary spectrogram is generated from the original soundtrack of the target video;

a third acquisition module, configured to acquire the dubbing audio recorded by the user with reference to the auxiliary spectrogram;

a playback module, configured to integrate the dubbing audio into the target video for playback.
Optionally, the film-watching interaction apparatus 1300 further includes:

a second adjustment module, configured to adjust the dubbing audio so that the adjusted dubbing audio matches the auxiliary spectrogram;

the playback module is configured to integrate the adjusted dubbing audio into the target video for playback.
Optionally, the film-watching interaction apparatus 1300 further includes:

a calculation module, configured to calculate a score for the dubbing audio according to the difference between the dubbing audio and each frequency point in the auxiliary spectrogram;

the playback module is configured to integrate the dubbing audio into the target video for playback and to display the score.
Optionally, the film-watching interaction apparatus 1300 further includes:

a fourth acquisition module, configured to acquire the barrages, and the number of barrages, posted by users within a first period while watching the target video in the viewing room;

a second determination module, configured to determine the emotional words corresponding to the barrages posted within the first period and to count the number of each emotional word;

a processing module, configured to, when the ratio of the count of a first emotional word to the number of barrages is greater than a preset threshold, add a special-effect widget related to the first emotional word in the viewing room, or play a sound effect related to the first emotional word, where the first emotional word is any of the emotional words.
The film-watching interaction apparatus 1300 provided by the embodiments of the present disclosure can carry out the above method embodiments; its implementation principle and technical effects are similar and are not restated here.
The film-watching interaction apparatus 1300 of the embodiments of the present disclosure acquires a user image while the user watches a target video; identifies the posture of a target object in the user image; determines, based on the posture of the target object, a response posture of a target character in the target video; adjusts the response posture of a target part of the target character based on the posture of the corresponding target part of the target object; and displays the response posture of the target character in the video picture of the target video based on the adjusted response posture. In this way, the user can interact with the characters in the video, so the interaction is not limited to barrages, making it more engaging.
In addition, the computer-readable storage medium of the embodiments of the present disclosure stores a computer program, and the computer program can be executed by a processor to implement each step of the method embodiment shown in Figure 1.
In the several embodiments provided in the present disclosure, it should be understood that the disclosed methods and apparatuses may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative; the division of units is only a logical functional division, and there may be other divisions in actual implementation, for example multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, apparatuses or units, and may be electrical, mechanical or in other forms.

In addition, the functional units in the embodiments of the present disclosure may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.

The integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device or the like) to perform some of the steps of the methods described in the embodiments of the present disclosure. The aforementioned storage media include various media capable of storing program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disc.
The above are preferred embodiments of the present disclosure. It should be noted that those of ordinary skill in the art can make several improvements and refinements without departing from the principles described in the present disclosure, and these improvements and refinements shall also fall within the protection scope of the present disclosure.
Claims (10)
- A film-watching interaction method, comprising: acquiring a user image while a user watches a target video; identifying a posture of a target object in the user image; determining, based on the posture of the target object, a response posture of a target character in the target video; adjusting a response posture of a target part of the target character based on a posture of a target part of the target object; and displaying the response posture of the target character in a video picture of the target video based on the adjusted response posture of the target part of the target character.
- The method according to claim 1, wherein the identifying the posture of the target object in the user image comprises: identifying the target object in the user image through a target detection algorithm; constructing a skeletal model of the target object; and determining the posture of the target object based on the skeletal model of the target object; and wherein, after the determining the response posture of the target character in the target video based on the posture of the target object, the method further comprises: acquiring a skeletal model of the target character in the target video; and performing skeletal key-point matching between the skeletal model of the target object and the skeletal model of the target character to adjust the response posture of the target character.
- The method according to claim 1, wherein the adjusting the response posture of the target part of the target character based on the posture of the target part of the target object comprises: determining a first minimum circumscribed sphere enclosing a three-dimensional model of the target part of the target object; determining a second minimum circumscribed sphere enclosing a three-dimensional model of the target part of the target character; projecting the first minimum circumscribed sphere into the video picture of the target video according to a sphere-center position of the first minimum circumscribed sphere; adjusting a position of the second minimum circumscribed sphere according to a projected position of the first minimum circumscribed sphere in the video picture; and adjusting the response posture of the target part of the target character according to the adjusted position of the second minimum circumscribed sphere.
- The method according to claim 3, wherein the adjusting the position of the second minimum circumscribed sphere according to the projected position of the first minimum circumscribed sphere in the video picture comprises: adjusting a sphere-center position of the second minimum circumscribed sphere according to a projected sphere-center position of the first minimum circumscribed sphere in the video picture; and determining positions of points of the target part of the target character according to an angle relationship between any two points in the three-dimensional model of the target part of the target character; and wherein the adjusting the response posture of the target part of the target character according to the adjusted position of the second minimum circumscribed sphere comprises: adjusting the response posture of the target part of the target character according to the adjusted sphere-center position of the second minimum circumscribed sphere and the positions of the points of the target part of the target character.
- The method according to claim 1, wherein the target part is a hand, and the adjusting the response posture of the target part of the target character based on the posture of the target part of the target object comprises: establishing a three-dimensional coordinate system with a first palm root node of the hand of the target object as an origin, wherein the hand of the target object includes thumb joint points, index-finger joint points, middle-finger joint points, ring-finger joint points and little-finger joint points; calculating a first space angle, a second space angle, a third space angle and a fourth space angle, wherein the first space angle is a space angle with the first palm root node as a vertex and a first line segment and a second line segment as sides, the second space angle is a space angle with the first palm root node as a vertex and the second line segment and a third line segment as sides, the third space angle is a space angle with the first palm root node as a vertex and the third line segment and a fourth line segment as sides, and the fourth space angle is a space angle with the first palm root node as a vertex and the fourth line segment and a fifth line segment as sides; the first line segment is a line segment from the first palm root node to an adjacent thumb joint point, the second line segment is a line segment from the first palm root node to an adjacent index-finger joint point, the third line segment is a line segment from the first palm root node to an adjacent middle-finger joint point, the fourth line segment is a line segment from the first palm root node to an adjacent ring-finger joint point, and the fifth line segment is a line segment from the first palm root node to an adjacent little-finger joint point; calculating a first line-plane space angle, a second line-plane space angle, a third line-plane space angle, a fourth line-plane space angle and a fifth line-plane space angle, wherein the first line-plane space angle is a space angle between a line segment on which the thumb joint points of the hand of the target object lie and a first plane, the second line-plane space angle is a space angle between a line segment on which the index-finger joint points lie and the first plane, the third line-plane space angle is a space angle between a line segment on which the middle-finger joint points lie and the first plane, the fourth line-plane space angle is a space angle between a line segment on which the ring-finger joint points lie and the first plane, and the fifth line-plane space angle is a space angle between a line segment on which the little-finger joint points lie and the first plane; the first plane is a plane formed by the z-axis and the y-axis of the three-dimensional coordinate system; and adjusting positions of the thumb joint points, index-finger joint points, middle-finger joint points, ring-finger joint points and little-finger joint points of the hand of the target character according to the first, second, third and fourth space angles and the first, second, third, fourth and fifth line-plane space angles.
- The method according to claim 5, wherein the adjusting the positions of the thumb joint points, index-finger joint points, middle-finger joint points, ring-finger joint points and little-finger joint points of the hand of the target character according to the first, second, third and fourth space angles and the first to fifth line-plane space angles comprises: setting a fifth space angle equal to the first space angle, a sixth space angle equal to the second space angle, a seventh space angle equal to the third space angle and an eighth space angle equal to the fourth space angle, and adjusting positions of a first thumb joint point, a first index-finger joint point, a first middle-finger joint point, a first ring-finger joint point and a first little-finger joint point of the hand of the target character; and setting a sixth line-plane space angle equal to the first line-plane space angle, a seventh line-plane space angle equal to the second line-plane space angle, an eighth line-plane space angle equal to the third line-plane space angle, a ninth line-plane space angle equal to the fourth line-plane space angle and a tenth line-plane space angle equal to the fifth line-plane space angle, and adjusting positions of the thumb joint points other than the first thumb joint point, the index-finger joint points other than the first index-finger joint point, the middle-finger joint points other than the first middle-finger joint point, the ring-finger joint points other than the first ring-finger joint point and the little-finger joint points other than the first little-finger joint point of the hand of the target character; wherein the fifth space angle is a space angle with a second palm root node of the hand of the target character as a vertex and a sixth line segment and a seventh line segment as sides, the sixth space angle is a space angle with the second palm root node as a vertex and the seventh line segment and an eighth line segment as sides, the seventh space angle is a space angle with the second palm root node as a vertex and the eighth line segment and a ninth line segment as sides, and the eighth space angle is a space angle with the second palm root node as a vertex and the ninth line segment and a tenth line segment as sides; the sixth line segment is a line segment from the second palm root node to the adjacent first thumb joint point, the seventh line segment is a line segment from the second palm root node to the adjacent first index-finger joint point, the eighth line segment is a line segment from the second palm root node to the adjacent first middle-finger joint point, the ninth line segment is a line segment from the second palm root node to the adjacent first ring-finger joint point, and the tenth line segment is a line segment from the second palm root node to the adjacent first little-finger joint point; the sixth line-plane space angle is a space angle between a line segment on which the thumb joint points of the hand of the target character lie and a second plane, the seventh line-plane space angle is a space angle between a line segment on which the index-finger joint points lie and the second plane, the eighth line-plane space angle is a space angle between a line segment on which the middle-finger joint points lie and the second plane, the ninth line-plane space angle is a space angle between a line segment on which the ring-finger joint points lie and the second plane, and the tenth line-plane space angle is a space angle between a line segment on which the little-finger joint points lie and the second plane; the second plane is a plane formed by the z-axis and the y-axis of a three-dimensional coordinate system established with the second palm root node as an origin.
- The method according to claim 1, wherein before the acquiring the user image while the user watches the target video, the method further comprises: receiving an input from a first user regarding a first barrage while watching the target video, wherein the input is used to initiate a viewing-room creation invitation to a second user, the second user being the user who posted the first barrage; and when the second user accepts the viewing-room creation invitation, creating a viewing room for the first user and the second user and playing the target video in the viewing room; and the acquiring the user image while the user watches the target video comprises: acquiring the user image while the user watches the target video in the viewing room.
- The method according to claim 7, further comprising: when the user chooses to enter a performance mode, displaying a video dubbing interface and displaying an auxiliary spectrogram in the video dubbing interface, wherein the auxiliary spectrogram is generated from the original soundtrack of the target video; acquiring dubbing audio recorded by the user with reference to the auxiliary spectrogram; and integrating the dubbing audio into the target video for playback; and/or, acquiring barrages, and the number of barrages, posted by users within a first period while watching the target video in the viewing room; determining emotional words corresponding to the barrages posted within the first period, and counting the number of each emotional word; and when a ratio of the count of a first emotional word to the number of barrages is greater than a preset threshold, adding a special-effect widget related to the first emotional word in the viewing room, or playing a sound effect related to the first emotional word, wherein the first emotional word is any of the emotional words.
- A film-watching interaction apparatus, comprising: a first acquisition module, configured to acquire a user image while a user watches a target video; a recognition module, configured to recognize a posture of a target object in the user image; a first determination module, configured to determine, based on the posture of the target object, a response posture of a target character in the target video; a first adjustment module, configured to adjust a response posture of a target part of the target character based on a posture of a target part of the target object; and a first display module, configured to display the response posture of the target character in a video picture of the target video based on the adjusted response posture of the target part of the target character.
- A computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor, implements the steps of the film-watching interaction method according to any one of claims 1 to 8.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210624394.2A CN114900738B (en) | 2022-06-02 | 2022-06-02 | Video watching interaction method and device and computer readable storage medium |
CN202210624394.2 | 2022-06-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023232103A1 true WO2023232103A1 (en) | 2023-12-07 |
Family
ID=82726145
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2023/097743 WO2023232103A1 (en) | 2022-06-02 | 2023-06-01 | Film-watching interaction method and apparatus, and computer-readable storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114900738B (en) |
WO (1) | WO2023232103A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114900738B (en) * | 2022-06-02 | 2024-07-16 | 咪咕文化科技有限公司 | Video watching interaction method and device and computer readable storage medium |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130201276A1 (en) * | 2012-02-06 | 2013-08-08 | Microsoft Corporation | Integrated interactive space |
US20190095393A1 (en) * | 2017-03-31 | 2019-03-28 | Nanning Fugui Precision Industrial Co., Ltd. | Sharing method and device for video and audio data presented in interacting fashion |
CN109543644A (en) * | 2018-06-28 | 2019-03-29 | 济南大学 | A kind of recognition methods of multi-modal gesture |
CN110288703A (en) * | 2019-06-28 | 2019-09-27 | 斑马网络技术有限公司 | Image processing method, device, equipment and storage medium |
CN110308792A (en) * | 2019-07-01 | 2019-10-08 | 北京百度网讯科技有限公司 | Control method, device, equipment and the readable storage medium storing program for executing of virtual role |
CN110675474A (en) * | 2019-08-16 | 2020-01-10 | 咪咕动漫有限公司 | Virtual character model learning method, electronic device and readable storage medium |
CN111541951A (en) * | 2020-05-08 | 2020-08-14 | 腾讯科技(深圳)有限公司 | Video-based interactive processing method and device, terminal and readable storage medium |
CN114900738A (en) * | 2022-06-02 | 2022-08-12 | 咪咕文化科技有限公司 | Film viewing interaction method and device and computer readable storage medium |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108667798A (en) * | 2018-03-27 | 2018-10-16 | 上海临奇智能科技有限公司 | A kind of method and system of virtual viewing |
CN112637692B (en) * | 2019-10-09 | 2023-04-07 | 阿里巴巴集团控股有限公司 | Interaction method, device and equipment |
KR102387730B1 (en) * | 2020-11-10 | 2022-04-18 | 주식회사 조이펀 | User Information Based on Interface automatic control system Using Projection Mapping |
CN112827162A (en) * | 2020-12-10 | 2021-05-25 | 桑亚杰 | Three-dimensional demonstration animation interactive experience method and device |
CN113705520A (en) * | 2021-09-03 | 2021-11-26 | 广州虎牙科技有限公司 | Motion capture method and device and server |
2022
- 2022-06-02: CN application CN202210624394.2A (CN114900738B, status: active)
2023
- 2023-06-01: WO application PCT/CN2023/097743 (WO2023232103A1, status: unknown)
Also Published As
Publication number | Publication date |
---|---|
CN114900738B (en) | 2024-07-16 |
CN114900738A (en) | 2022-08-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022095467A1 (en) | Display method and apparatus in augmented reality scene, device, medium and program | |
US20200294297A1 (en) | Telepresence of Users in Interactive Virtual Spaces | |
US11736756B2 (en) | Producing realistic body movement using body images | |
CN107911724B (en) | Live broadcast interaction method, device and system | |
US9996979B2 (en) | Augmented reality technology-based handheld viewing device and method thereof | |
TWI752502B (en) | Method for realizing lens splitting effect, electronic equipment and computer readable storage medium thereof | |
US20090202114A1 (en) | Live-Action Image Capture | |
JP2019536131A (en) | Controls and interfaces for user interaction in virtual space | |
US20110154266A1 (en) | Camera navigation for presentations | |
US20140068526A1 (en) | Method and apparatus for user interaction | |
TWI255141B (en) | Method and system for real-time interactive video | |
US20180197345A1 (en) | Augmented reality technology-based handheld viewing device and method thereof | |
US20240257484A1 (en) | 3d upper garment tracking | |
WO2023232103A1 (en) | Film-watching interaction method and apparatus, and computer-readable storage medium | |
WO2023201937A1 (en) | Human-machine interaction method and apparatus based on story scene, device, and medium | |
Nguyen et al. | Real-time 3D human capture system for mixed-reality art and entertainment | |
US20240155074A1 (en) | Movement Tracking for Video Communications in a Virtual Environment | |
CN108815845A (en) | The information processing method and device of human-computer interaction, computer equipment and readable medium | |
WO2024031882A1 (en) | Video processing method and apparatus, and computer readable storage medium | |
WO2023076648A1 (en) | Extraction of user representation from video stream to a virtual environment | |
Pettersson et al. | A perceptual evaluation of social interaction with emotes and real-time facial motion capture | |
US20240048780A1 (en) | Live broadcast method, device, storage medium, electronic equipment and product | |
US20240013488A1 (en) | Groups and Social In Artificial Reality | |
US20230334790A1 (en) | Interactive reality computing experience using optical lenticular multi-perspective simulation | |
Ballin et al. | Personal virtual humans—inhabiting the TalkZone and beyond |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 23815280; Country of ref document: EP; Kind code of ref document: A1 |