WO2021031731A1 - 远程协助方法、装置和系统 - Google Patents
远程协助方法、装置和系统 Download PDFInfo
- Publication number
- WO2021031731A1 WO2021031731A1 PCT/CN2020/100731 CN2020100731W WO2021031731A1 WO 2021031731 A1 WO2021031731 A1 WO 2021031731A1 CN 2020100731 W CN2020100731 W CN 2020100731W WO 2021031731 A1 WO2021031731 A1 WO 2021031731A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- terminal device
- assistance
- video
- target object
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
- H04N21/4312—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
- H04N21/4312—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
- H04N21/4316—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for displaying supplemental content in a region of the screen, e.g. an advertisement in a separate window
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4788—Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
Definitions
- This application relates to the field of augmented reality technology, and in particular to a remote assistance method, device and system.
- the embodiments of the present application provide a remote assistance method, device, and system.
- a remote assistance method including:
- Determining assistance information of the target object where the assistance information is used to assist a user who uses the terminal device to operate the target object;
- the assistance information is superimposed and displayed in the video information, so that the terminal device synchronously displays the superimposed video information.
- a remote assistance device including:
- the video call module is configured to establish a video call connection with the terminal device based on the video call request of the terminal device;
- a display module configured to display video information from the terminal device, the video information being obtained by the terminal device through video collection of a target object to be assisted;
- a determining module configured to determine assistance information of the target object, where the assistance information is used to assist a user who uses the terminal device to operate the target object;
- the processing module is configured to superimpose and display the assistance information in the video information based on the augmented reality technology, so that the terminal device synchronously displays the superimposed video information.
- a remote assistance system including a terminal device and a server, in which:
- the terminal device is configured to send a video call request to the server
- the server is configured to establish a video call connection with the terminal device based on the video call request;
- the terminal device and the server are further configured to display video information, the video information being obtained by the terminal device through video collection of the target object to be assisted;
- the server is further configured to determine assistance information of the target object; based on augmented reality technology, superimpose and display the assistance information in the video information;
- the terminal device is also configured to synchronously display the superimposed video information.
- an electronic device which includes:
- a memory arranged to store computer-executable instructions, which when executed, cause the processor to perform the following operations:
- Determining assistance information of the target object where the assistance information is used to assist a user who uses the terminal device to operate the target object;
- the assistance information is superimposed and displayed in the video information, so that the terminal device synchronously displays the superimposed video information.
- a computer-readable storage medium stores one or more programs, and when the one or more programs are executed by an electronic device that includes multiple application programs, the The electronic device performs the following methods:
- Determining assistance information of the target object where the assistance information is used to assist a user who uses the terminal device to operate the target object;
- the assistance information is superimposed and displayed in the video information, so that the terminal device synchronously displays the superimposed video information.
- the terminal device when the user encounters a problem in the process of using the target object, the terminal device can be used to send a video call request to the server corresponding to the assisting party, and the server establishes a video call connection with the terminal device based on the video call request During the video call, the terminal device can collect the video of the target object, and the server and the terminal device can display the video information of the target object simultaneously. After that, the server can determine the assistance information for the target object, and based on the augmented reality technology, the target The assistance information of the object is superimposed and displayed in the video information. At this time, since the terminal device and the server are in a video call state, the terminal device can synchronously display the superimposed video information. In this way, users can not only clearly describe the problems they encounter through video calls, but also intuitively obtain solutions through terminal devices, thereby effectively solving the problems encountered by users in the process of using the target object, and improving users Experience.
- the terminal device in the embodiment of the present application does not need to have the function of augmented reality.
- the terminal device in the embodiment of the present application does not need to have the function of augmented reality.
- FIG. 1 is a schematic flowchart of a remote assistance method according to an embodiment of the present application
- FIG. 2 is a schematic flowchart of a remote assistance method according to an embodiment of the present application
- FIG. 3 is a schematic structural diagram of a remote assistance electronic device according to an embodiment of the present application.
- FIG. 4 is a schematic structural diagram of a remote assistance device according to an embodiment of the present application.
- Fig. 5 is a schematic structural diagram of a remote assistance system according to an embodiment of the present application.
- the user in order to improve the user experience, the user may be allowed to solve the operational problems encountered by the user through remote assistance.
- users can be allowed to seek remote assistance in the following three ways:
- the first type the user can send a video of the object to be assisted to the assisting party, and the assisting party provides assistance information to the user based on the received video.
- the second type If the object to be assisted supports remote assistance, the user can send a remote assistance request to the assisting party through the object to be assisted. After the assisting party agrees to the user's remote assistance request, the object to be assisted can be operated to solve the user The operational problems encountered.
- a user can send a remote assistance request to the assisting party through the software with remote assistance function installed in the personal computer. After receiving and agreeing to the remote assistance request, the assisting party can directly respond The personal computer performs remote operation to solve the operation problems encountered by the user.
- the third type the user establishes a video call connection with the assisting party through the terminal device with augmented reality function.
- the assisting party provides assistance information to the user after determining the operation problem encountered by the user, and the terminal device receives the assistance information from the assisting party Later, based on augmented reality technology, the assistance information can be superimposed and displayed on the video screen of the terminal device. Based on the video screen displayed in the terminal device, the user can solve the operational problems encountered by the user.
- the assisting party usually cannot return the video containing the solution to the user.
- the assisting party can only inform the user of the solution by voice.
- the voice method is usually not easy for the user to understand , Resulting in failure to effectively solve the problems encountered by users;
- terminal devices with augmented reality functions are not yet popular, so the limitations are greater; in addition, when using augmented reality
- a server usually can only provide assistance information for one terminal device, and cannot provide assistance information for multiple terminal devices at the same time. Therefore, the practicability is poor.
- embodiments of the present application propose a remote assistance method, device, and system.
- the method includes: establishing a video call connection with the terminal device based on a video call request from the terminal device; and displaying information from the terminal device Video information, the video information is obtained by video capture of the target object to be assisted by the terminal device; determining the assistance information of the target object, the assistance information is used to assist the user using the terminal device to operate the target object Based on augmented reality technology, the assistance information is superimposed and displayed in the video information, so that the terminal device synchronously displays the superimposed video information.
- the terminal device in the embodiment of the present application does not need to have the function of augmented reality.
- the terminal device in the embodiment of the present application does not need to have the function of augmented reality.
- the terminal device may have the functions of video shooting and video and voice calls, but does not have the augmented reality function.
- the target object to be assisted may be an electronic product or other objects that can solve operation problems through remote assistance. .
- FIG. 1 is a schematic flowchart of a remote assistance method according to an embodiment of the present application.
- the execution subject of this embodiment may be an electronic device (such as a server corresponding to the assisting party), and the assisting party may be the provider of the target object to be assisted.
- the method is as follows:
- S102 Based on the video call request of the terminal device, establish a video call connection with the terminal device.
- the user can use the terminal device to make a video call to the assisting party (such as the product provider of the target object to be assisted).
- the terminal device can send a video call request to the server corresponding to the assisting party, and the server is receiving After the video call request is received, a video call connection can be established with the terminal device based on the video call request.
- the server is establishing a video call connection with the terminal device. Therefore, the terminal device and the server can simultaneously display the same video information.
- S104 Display video information from the terminal device, where the video information is obtained by video collection of the target object to be assisted by the terminal device.
- the server may prompt the terminal device to perform video capture on the target object to be assisted.
- the server may prompt the terminal device by sending prompt information to the terminal device, where the prompt information may be used to prompt the image acquisition device of the terminal device to be aimed at the target object to be assisted.
- the terminal device After receiving the prompt information from the server, the terminal device can display the prompt information on the screen of the terminal device.
- the prompt information can be in text form or voice form, which is not specifically limited here.
- the user can use the terminal device to perform video capture on the target object to be assisted.
- the terminal device collects the video of the target object to be assisted
- the video information of the target object can be obtained and displayed on the screen of the terminal device.
- the server will also synchronously display the video information collected by the terminal device.
- the user can also actively use the terminal device to assist the target object for video capture.
- the terminal device and the server can display simultaneously Video information.
- S106 Determine assistance information of the target object, where the assistance information is used to assist a user who uses the terminal device to operate the target object.
- the server may determine the assistance information of the target object after displaying the video information of the target object to be assisted.
- the assistance information can be used to assist the user in operating the target object to solve the operation problem encountered by the user.
- the assistance information can be understood as guiding operation information, which can be a picture or text.
- the server determines the assistance information, it may be determined based on the user's confirmation information on the assistance mode.
- the confirmation information may include at least one of manual assistance and intelligent assistance.
- the confirmation information may specifically be before the server determines the assistance information. It is sent to the server by the user through the terminal device.
- the server may send text prompt information for selecting the assistance mode to the terminal device, and the terminal device may display the prompt information on the screen after receiving the prompt information.
- the user can select the desired assistance method according to his actual needs.
- the user selects the assistance mode it can be regarded as the terminal device sending confirmation information on the assistance mode to the server.
- the terminal device can display two buttons on the screen.
- the first button can be marked with intelligent assistance, and the second button can be marked with manual assistance.
- the user can choose to press Next one of the buttons, after the user presses one of the buttons, it can be regarded that the terminal device has received the user's confirmation information of the assistance mode.
- the terminal device can send the confirmation information to the server.
- the shape style of the button and the display form of the text on the button can be various, and there is no specific limitation here.
- the server may also send voice prompt information for selecting the assistance mode to the terminal device, so that the user may send the determining information to the server by voice.
- the server may send a voice prompt to the terminal device, and the content of the voice prompt may include: press “1" for intelligent assistance mode, and press “2" for manual assistance mode.
- the terminal device receives and plays the voice prompt, the user can select the desired assistance method by clicking the number on the dial according to his needs.
- the terminal device sends the confirmation information of the assistance method to the server based on the user's selection of the assistance method.
- the user may also use the terminal device to send confirmation information to the server in other manners, which will not be illustrated one by one here.
- the server sends prompt information (text prompt information or voice prompt information) for selecting the assistance mode to the terminal device, it can be after the connection with the video call is established and before the video information of the target object is displayed, or it can be After displaying the video information of the target object, before determining the assistance information, there is no specific limitation here.
- the server After the server receives the user's confirmation information on the assistance mode, when determining the assistance information based on the confirmation information, it can judge the assistance mode included in the confirmation information. If the confirmation information includes the manual assistance mode, it can determine that the user selected Manual assistance mode. At this time, the server can send the video information to the assisting staff through the transfer method. After the assisting staff view the video information, they can manually analyze what problems the user encounters in the process of using the target object based on the video information. Furthermore, the assistance information is determined based on the analyzed problem, and the assistance information is input into the server, so that the server can receive the assistance information from the assistance personnel.
- the server can determine the assistance information of the target object based on the video information.
- the specific implementation method is as follows:
- the video information can be identified, and the identification of the target object to be identified and the problems encountered by the user when using the target object can be determined.
- the recognition methods can include voice recognition and image recognition. Among them, since the user can describe the problem encountered by voice, the server can perform voice recognition on the video information. Identify the problem encountered by the user. Since the video information is obtained by video collection of the target object, the identification of the target object can be determined by image recognition of the video information.
- the identification of the target object and the problems encountered by the user are matched with a predetermined virtual model.
- the virtual model may be determined in advance by the server, and the virtual model may include the identification of multiple objects and the mapping relationship between multiple problems that occur when the user uses multiple objects.
- multiple objects may be multiple electronic products commonly used by users, and multiple problems may be issues that users may encounter when using multiple electronic products.
- the server After the server determines the identity of the target object and the problems encountered by the user when using the target object, it can match the identity of the target object and the problems encountered by the user with the object identifiers and problems in the virtual model to determine whether the virtual model includes the target The identification of the object and the problem encountered by the user when using the target object. If so, it can be explained that the method based on the intelligent assistance method can solve the problem encountered by the user. At this time, the corresponding assistance can be further determined based on the problem encountered by the user information.
- the virtual model may also include multiple problems that occur when multiple objects are used, and a mapping relationship between multiple assistance information used to solve the multiple problems, where one question can correspond to one assistance information, Different questions can correspond to different assistance information.
- the assistance information corresponding to the problem can be found in the virtual model, and the found assistance information can be determined as the assistance information of the target object.
- the virtual model does not include the identification of the target object and the problems encountered by the user when using the target object, it can be explained that the method based on the intelligent assistance method cannot solve the problem encountered by the user.
- the video information is sent to the assisting staff by means of transfer, and the assisting staff determines the target object assisting information.
- the assisting staff determines the target object assisting information.
- the server when determining the assistance information of the target object, it can determine the assistance information by default in the intelligent assistance mode, without the user sending confirmation information on the assistance mode, that is, the server is displaying the target
- the video information can be directly identified, and then the assistance information can be determined.
- the specific implementation method please refer to the above-mentioned determining the relevant content of the assistance information based on the intelligent assistance method, and the description will not be repeated here.
- the server may also store the assistance information, the target object identifier, and the problems encountered by the user in the virtual model correspondingly, so that when other users encounter the same problem
- the server can determine the assistance information through the intelligent assistance method based on the assistance information stored in the virtual model.
- S108 Based on the augmented reality technology, superimpose and display the assistance information in the video information, so that the terminal device synchronously displays the superimposed video information.
- the server may use the enhanced display technology to generate an augmented reality image according to the assistance information, and superimpose the augmented reality image on the video information, so as to generate a corresponding virtual guidance operation in the video information of the real target object.
- the server may also generate corresponding voice prompts based on the assistance information, so as to simultaneously provide assistance prompts to the user.
- the server when the server superimposes and displays the assistance information in the video information, it can be realized by using the related augmented reality technology to superimpose and display, which will not be described in detail here.
- the terminal device can synchronously display the superimposed video information. In this way, the user can intuitively obtain the solution through the terminal device, thereby effectively solving the problems encountered by the user in the process of using the target object, and improving the user experience.
- three modules may be provided on the server side, namely an augmented reality interactive processing module, an assisted information storage and input module, and an augmented reality information display module.
- the assistance personnel can input the assistance information into the assistance information storage and input module, and the assistance information storage and input module can store the assistance information.
- the augmented reality interactive processing module can obtain the assistance information from the assistance information storage and input module, and perform the augmented reality interactive processing between the assistance information and the video information, that is, generate augmented reality based on the assistance information Image, and superimpose the augmented reality image into the video information.
- the augmented reality display module can display the superimposed video information in the server.
- the terminal device can synchronously display the superimposed video picture. In this way, based on the interaction between the above three modules, remote assistance to users is realized.
- the user can use a terminal device to make a video call to the seller of the washing machine to seek remote assistance.
- the server corresponding to the seller of the washing machine receives the user’s After the video call, a video call connection with the terminal device used by the user is established. After the video call connection is established, the user can use the terminal device to actively collect the video of the washing machine to obtain video information. At this time, the terminal device and the server simultaneously display the video information of the washing machine.
- the server can send prompt information for selecting the assistance mode to the terminal device used by the user.
- the user can select the assistance mode according to the prompt information.
- the manual assistance mode is selected as an example.
- the server can transfer the video call to the assisting staff.
- the user can talk to the assisting staff based on the video and voice, describing the operational problems encountered.
- the assisting staff can use the video information and the user’s voice description. It is determined that the problem encountered by the user is that the washing machine cannot be turned on.
- the assistant can touch the power button of the washing machine in the video screen, and the assistant information storage and input module in the server can be regarded as receiving the assistant information input by the assistant, and is Information is stored.
- the augmented reality interactive processing module can perform augmented reality interactive processing on the assistance information and the video information of the washing machine, that is, generate augmented reality images according to the assistance information, and superimpose and display the augmented reality images in the video information, where the superimposed video information It can be: mark a flashing virtual frame around the power button of the washing machine.
- the augmented reality display module can display the superimposed video information in the server. Since the terminal device and the server are in a video call state, the terminal device can synchronously display the superimposed video information.
- the assistant can also prompt the user how to operate by voice based on the video information. At this time, the user can click the power button of the washing machine according to the video information displayed in the terminal device and the voice prompt of the assistant. Solve the problem that users cannot turn on the washing machine.
- FIG. 2 is a schematic flowchart of a remote assistance method according to an embodiment of the present invention, which may specifically include the following steps:
- S201 The terminal device sends a video call request to the server.
- the terminal device In the process of using electronic products, when users encounter operational problems, they can seek remote assistance through the terminal device. Specifically, the user can use the terminal device to send a video call request to the server by making a phone call.
- S202 The server establishes a video call connection with the terminal device based on the video call request.
- the server may establish a video call connection with the terminal device based on the video call request. After the server establishes a video call connection with the terminal device, it can display the same video screen synchronously with the terminal device.
- S203 The terminal device performs video capture on the target object to be assisted.
- the server may prompt the terminal device to perform video capture on the target object to be assisted, or the user may actively use the terminal device to perform video capture on the target object to be assisted, which is not specifically limited here.
- the terminal device can obtain the video information of the target object after video collection of the target object, wherein, during the video collection process of the terminal device, the collected video information can be displayed on the screen of the terminal device in real time.
- S204 The terminal device and the server synchronously display the video information.
- the server can synchronously display the video information collected by the terminal device.
- S205 The terminal device sends confirmation information for the assistance mode to the server.
- the server may prompt the user to select an assistance mode, where the assistance mode includes at least one of a manual assistance mode and an intelligent assistance mode.
- the user can select the assistance mode according to his own needs, and the terminal device sends the user's confirmation information of the assistance mode to the server.
- the server can receive the user's confirmation information of the assistance mode.
- S205 may also be executed after S201 and before S202, or it may be executed after S202 and before S203, which is not specifically limited here.
- the server can determine which assistance mode is included in the confirmation information. If the confirmation information includes the smart assistance mode, the server can determine the assistance to the target object based on the video information. information.
- determining assistance information based on video information it can be implemented based on a pre-determined virtual model, where the virtual model can include the identification of multiple objects, multiple problems that occur when multiple objects are used, and solutions for solving multiple problems.
- the mapping relationship between multiple assistance messages For a specific implementation manner, reference may be made to the related content recorded in the embodiment shown in FIG. 1, and the description is not repeated here.
- S208 may be executed.
- the server can send the video information to the assisting staff, the assisting staff can determine the assisting information based on the video information, and input the assisting information into the server, and the server can receive assistance from the assisting staff information.
- the server may execute S208 after receiving the assistance information from the assistance personnel.
- an augmented reality image can be generated according to the assistance information, and the augmented reality image can be superimposed and displayed in the video information.
- S209 The terminal device displays the video screen after the assistance information and the video information are superimposed.
- the terminal device can display the video screen after the assistance information and the video information are superimposed.
- the terminal device when the user encounters a problem in the process of using the target object, the terminal device can be used to send a video call request to the server corresponding to the assisting party, and the server establishes a video call connection with the terminal device based on the video call request During the video call, the terminal device can collect the video of the target object, and the server and the terminal device can display the video information of the target object simultaneously. After that, the server can determine the assistance information for the target object, and based on the augmented reality technology, the target The assistance information of the object is superimposed and displayed in the video information. At this time, since the terminal device and the server are in a video call state, the terminal device can simultaneously display the superimposed video information. In this way, users can not only clearly describe the problems they encounter through video calls, but also intuitively obtain solutions through terminal devices, thereby effectively solving the problems encountered by users in the process of using the target object, and improving users Experience.
- the terminal device in the embodiment of the present application does not need to have the function of augmented reality.
- the terminal device in the embodiment of the present application does not need to have the function of augmented reality.
- Fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
- the electronic device includes a processor, and optionally an internal bus, a network interface, and a memory.
- the memory may include memory, such as high-speed random access memory (Random-Access Memory, RAM), and may also include non-volatile memory (non-volatile memory), such as at least one disk storage.
- RAM random access memory
- non-volatile memory such as at least one disk storage.
- the electronic device may also include hardware required for other services.
- the processor, network interface, and memory can be connected to each other through an internal bus, which can be an industry standard architecture (ISA) bus, a peripheral component interconnect standard (Peripheral Component Interconnect, PCI) bus, or an extended industry standard Structure (Extended Industry Standard Architecture, EISA) bus, etc.
- ISA industry standard architecture
- PCI peripheral component interconnect standard
- EISA Extended Industry Standard Architecture
- the bus can be divided into address bus, data bus, control bus, etc. For ease of presentation, only one bidirectional arrow is used to indicate in FIG. 3, but it does not mean that there is only one bus or one type of bus.
- the memory is configured to store programs.
- the program may include program code, and the program code includes computer operation instructions.
- the memory may include memory and non-volatile memory, and provide instructions and data to the processor.
- the processor reads the corresponding computer program from the non-volatile memory to the memory and then runs, forming a remote assistance device on the logical level.
- the processor executes the program stored in the memory, and is specifically used to perform the following operations:
- Determining assistance information of the target object where the assistance information is used to assist a user who uses the terminal device to operate the target object;
- the assistance information is superimposed and displayed in the video information, so that the terminal device synchronously displays the superimposed video information.
- the foregoing method executed by the remote assistance device disclosed in the embodiment shown in FIG. 3 of the present application may be applied to or implemented by a processor.
- the processor may be an integrated circuit chip with signal processing capabilities.
- the steps of the above method can be completed by hardware integrated logic circuits in the processor or instructions in the form of software.
- the above-mentioned processor may be a general-purpose processor, including a central processing unit (CPU), a network processor (Network Processor, NP), etc.; it may also be a digital signal processor (DSP), a dedicated integrated Circuits (Application Specific Integrated Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components.
- CPU central processing unit
- NP Network Processor
- DSP digital signal processor
- ASIC Application Specific Integrated Circuit
- FPGA Field-Programmable Gate Array
- the methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed.
- the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
- the steps of the method disclosed in the embodiments of the present application may be directly embodied as being executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in the decoding processor.
- the software module can be located in a mature storage medium in the field, such as random access memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers.
- the storage medium is located in the memory, and the processor reads the information in the memory and completes the steps of the above method in combination with its hardware.
- the electronic device can also execute the methods in FIG. 1 and FIG. 2 and realize the functions of the remote assistance device in the embodiments shown in FIG. 1 and FIG. 2, which will not be repeated in the embodiments of the present application.
- the electronic equipment of this application does not exclude other implementations, such as logic devices or a combination of software and hardware, etc. That is to say, the execution body of the following processing flow is not limited to each logic unit. It can also be a hardware or logic device.
- the embodiment of the present application also proposes a computer-readable storage medium that stores one or more programs, and the one or more programs include instructions.
- the instructions When the instructions are included in a portable electronic device that includes multiple application programs When executed, the portable electronic device can be made to execute the method of the embodiment shown in FIG. 1 and FIG. 2, and is specifically used to execute the following operations:
- Determining assistance information of the target object where the assistance information is used to assist a user who uses the terminal device to operate the target object;
- the assistance information is superimposed and displayed in the video information, so that the terminal device synchronously displays the superimposed video information.
- Fig. 4 is a schematic structural diagram of a remote assistance device according to an embodiment of the present application.
- the remote assistance device may include: a video call module 41, a display module 42, a determination module 43, and a processing module 44, wherein:
- the video call module 41 is configured to establish a video call connection with the terminal device based on the video call request of the terminal device;
- the display module 42 is configured to display video information from the terminal device, the video information being obtained by the terminal device through video collection of the target object to be assisted;
- the determining module 43 is configured to determine assistance information of the target object, where the assistance information is used to assist a user who uses the terminal device to operate the target object;
- the processing module 44 is configured to superimpose and display the assistance information in the video information based on the augmented reality technology, so that the terminal device synchronously displays the superimposed video information.
- the remote assistance device further includes: a receiving module 45, wherein:
- the receiving module 45 is configured to receive confirmation information of the user on the assistance mode before the determining module 43 determines the assistance information of the target object, and the confirmation information includes the manual assistance mode and the intelligent assistance mode. At least one
- the determining module 43 determines the assistance information for the target object, including:
- the confirmation information includes a manual assistance method, receiving the assistance information from an assisting person, the assistance information being determined by the assisting person based on the video information;
- the assistance information of the target object is determined based on the video information.
- the determining module 43 determines the assistance information of the target object, including:
- a predetermined virtual model includes the identification of the target object and the question, and the virtual model includes the identification of a plurality of objects and the mapping relationship between the plurality of problems that occur when the plurality of objects are used;
- the virtual model includes a plurality of problems when using a plurality of objects and a mapping relationship between a plurality of assistance information used to solve the plurality of problems;
- the determining module 43 determines the assistance information of the target object, including:
- the found assistance information is determined as the assistance information of the target object.
- the determining module 43 sends the video information to the assistance staff when the identification of the target object and the question are not included in the virtual model;
- the processing module 44 based on augmented reality technology, superimposes and displays the assistance information in the video information, including:
- the augmented reality image is superimposed and displayed in the video information.
- the remote assistance device provided by the embodiment of the present application can also execute the methods in FIG. 1 and FIG. 2 and realize the functions of the embodiments of the remote assistance device shown in FIG. 1 and FIG. 2, which will not be repeated here.
- Fig. 5 is a schematic structural diagram of a remote assistance system according to an embodiment of the present application. Please refer to Figure 5.
- the remote assistance system shown in FIG. 5 includes a terminal device 51 and a server 52, wherein:
- the terminal device 51 sends a video call request to the server 52;
- the server 52 establishes a video call connection with the terminal device 51 based on the video call request;
- the terminal device 51 and the server 52 display video information, and the video information is obtained by the terminal device 51 performing video capture on the target object to be assisted;
- the server 52 determines the assistance information of the target object; based on augmented reality technology, superimposes and displays the assistance information in the video information;
- the terminal device 51 synchronously displays the superimposed video information.
- the terminal device 51 provided by the embodiment of the present invention can implement the various processes implemented by the terminal device in the method embodiments of FIGS. 1 to 2, and the server 52 can implement the various processes implemented by the server in the method embodiments of FIGS. 1 to 2, in order to avoid Repeat, I won’t repeat it here.
- a typical implementation device is a computer.
- the computer may be, for example, a personal computer, a laptop computer, a cell phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or Any combination of these devices.
- Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology.
- the information can be computer-readable instructions, data structures, program modules, or other data.
- Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- General Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Claims (10)
- 一种远程协助方法,所述方法应用于服务器,包括:基于终端设备的视频通话请求,与所述终端设备建立视频通话连接;显示来自所述终端设备的视频信息,所述视频信息由所述终端设备对待协助的目标对象进行视频采集得到;确定所述目标对象的协助信息,所述协助信息用于协助使用所述终端设备的用户操作所述目标对象;基于增强现实技术,将所述协助信息叠加显示在所述视频信息中,以便所述终端设备同步显示叠加后的所述视频信息。
- 如权利要求1所述的方法,其中,在确定所述目标对象的协助信息之前,所述方法还包括:接收所述用户对协助方式的确认信息,所述确认信息中包括人工协助方式以及智能协助方式中的至少一种;所述确定对所述目标对象的协助信息,包括:若所述确认信息中包括人工协助方式,则接收来自协助人员的所述协助信息,所述协助信息由所述协助人员基于所述视频信息确定得到;若所述确认信息中包括智能协助方式,则基于所述视频信息,确定所述目标对象的协助信息。
- 如权利要求2所述的方法,其中,基于所述视频信息,确定所述目标对象的协助信息,包括:对所述视频信息进行识别,确定所述目标对象的标识以及所述用户在使用所述目标对象时遇到的问题;判断预先确定的虚拟模型中是否包括所述目标对象的标识以及所述问题,所述虚拟模型中包括多个对象的标识以及使用所述多个对象时出现的多个问题之间的映射关系;若是,则基于所述问题,确定所述目标对象的协助信息。
- 如权利要求3所述的方法,其中,所述虚拟模型中还包括使用所述多个对象时出现的多个问题以及用于解决所述多个问题的多个协助信息之间的映射关系;所述基于所述问题,确定所述目标对象的协助信息,包括:基于所述问题,在所述虚拟模型中查找与所述问题对应的协助信息;将查找到的所述协助信息确定为所述目标对象的协助信息。
- 如权利要求3所述的方法,其中,所述方法还包括:若所述虚拟模型中不包括所述目标对象的标识以及所述问题,则将所述视频信息发送给所述协助人员;接收来自所述协助人员的协助信息。
- 如权利要求1所述的方法,其中,基于增强现实技术,将所述协助信息叠加显示在所述视频信息中,包括:根据所述协助信息生成增强现实图像;将所述增强现实图像叠加显示在所述视频信息中。
- 一种远程协助系统,所述系统包括终端设备和服务器,其中:所述终端设备,配置为向所述服务器发送视频通话请求;所述服务器,配置为基于所述视频通话请求,与所述终端设备建立视频通话连接;所述终端设备和所述服务器,还配置为显示视频信息,所述视频信息由所述终端设备对待协助的目标对象进行视频采集得到;所述服务器,还配置为确定所述目标对象的协助信息;基于增强现实技术,将所述协助信息叠加显示在所述视频信息中;所述终端设备,还配置为同步显示叠加后的所述视频信息。
- 一种远程协助装置,包括:视频通话模块,配置为基于终端设备的视频通话请求,与所述终端设备建立视频通话连接;显示模块,配置为显示来自所述终端设备的视频信息,所述视频信息由所述终端设备对待协助的目标对象进行视频采集得到;确定模块,配置为确定所述目标对象的协助信息,所述协助信息用于协助使用所述终端设备的用户操作所述目标对象;处理模块,配置为基于增强现实技术,将所述协助信息叠加显示在所述视频信息中,以便所述终端设备同步显示叠加后的所述视频信息。
- 一种电子设备,包括:处理器;以及被安排成存储计算机可执行指令的存储器,该可执行指令在被执行时使该处理器执行以下操作:基于终端设备的视频通话请求,与所述终端设备建立视频通话连接;显示来自所述终端设备的视频信息,所述视频信息由所述终端设备对待协助的目标对象进行视频采集得到;确定所述目标对象的协助信息,所述协助信息用于协助使用所述终端设备的用户操作所述目标对象;基于增强现实技术,将所述协助信息叠加显示在所述视频信息中,以便所述终端设备同步显示叠加后的所述视频信息。
- 一种计算机可读存储介质,所述计算机可读存储介质存储一个或多个程序,所述一个或多个程序当被包括多个应用程序的电子设备执行时,使得所述电子设备执行以下方法:基于终端设备的视频通话请求,与所述终端设备建立视频通话连接;显示来自所述终端设备的视频信息,所述视频信息由所述终端设备对待协助的目标对象进行视频采集得到;确定所述目标对象的协助信息,所述协助信息用于协助使用所述终端设备的用户操作所述目标对象;基于增强现实技术,将所述协助信息叠加显示在所述视频信息中,以便所述终端设备同步显示叠加后的所述视频信息。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910767469.0A CN112399125B (zh) | 2019-08-19 | 2019-08-19 | 一种远程协助方法、装置和系统 |
CN201910767469.0 | 2019-08-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021031731A1 true WO2021031731A1 (zh) | 2021-02-25 |
Family
ID=74603636
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/100731 WO2021031731A1 (zh) | 2019-08-19 | 2020-07-07 | 远程协助方法、装置和系统 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112399125B (zh) |
WO (1) | WO2021031731A1 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113392394A (zh) * | 2021-07-01 | 2021-09-14 | 西安交通大学 | 一种智能设备协助使用方法、系统、终端及存储介质 |
CN113885700A (zh) * | 2021-09-03 | 2022-01-04 | 广东虚拟现实科技有限公司 | 远程协助方法及装置 |
CN114040377A (zh) * | 2021-11-15 | 2022-02-11 | 青岛海尔科技有限公司 | 操作任务的执行方法和装置、存储介质及电子装置 |
CN114070834A (zh) * | 2021-10-26 | 2022-02-18 | 深圳市商汤科技有限公司 | 一种远程协助方法、装置及其相关设备和存储介质 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113595867A (zh) * | 2021-06-22 | 2021-11-02 | 青岛海尔科技有限公司 | 基于远程交互的设备操作方法及装置 |
CN114201645A (zh) * | 2021-12-01 | 2022-03-18 | 北京百度网讯科技有限公司 | 对象标注方法、装置、电子设备以及存储介质 |
CN114422545B (zh) * | 2021-12-22 | 2024-05-03 | 中国建设银行股份有限公司 | 一种远程协助处理方法和装置 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2012128796A (ja) * | 2010-12-17 | 2012-07-05 | Ntt Docomo Inc | サーバ、リモートアシストシステム及び方法 |
WO2014076236A1 (en) * | 2012-11-15 | 2014-05-22 | Steen Svendstorp Iversen | Method of providing a digitally represented visual instruction from a specialist to a user in need of said visual instruction, and a system therefor |
CN107395671A (zh) * | 2017-06-12 | 2017-11-24 | 深圳增强现实技术有限公司 | 远程协助方法、系统及增强现实终端 |
CN108063825A (zh) * | 2017-12-26 | 2018-05-22 | 三星电子(中国)研发中心 | 一种远程协助方法 |
CN108632379A (zh) * | 2018-05-11 | 2018-10-09 | 厦门柏讯信息科技有限公司 | 一种基于全景vr直播的汽车远程诊断与维修系统 |
WO2018222756A1 (en) * | 2017-05-30 | 2018-12-06 | Ptc Inc. | Object initiated communication |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017198234A1 (zh) * | 2016-05-20 | 2017-11-23 | 苏州宝时得电动工具有限公司 | 一种基于智能移动终端的应用程序的家庭工作任务的指导方法和装置 |
CN106339094B (zh) * | 2016-09-05 | 2019-02-26 | 山东万腾电子科技有限公司 | 基于增强现实技术的交互式远程专家协作检修系统及方法 |
CN206712945U (zh) * | 2017-04-26 | 2017-12-05 | 联想新视界(天津)科技有限公司 | 视频通讯系统 |
US20180324229A1 (en) * | 2017-05-05 | 2018-11-08 | Tsunami VR, Inc. | Systems and methods for providing expert assistance from a remote expert to a user operating an augmented reality device |
EP3438859A1 (en) * | 2017-08-01 | 2019-02-06 | Predict Srl | Method for providing remote assistance services using mixed and/or augmented reality visors and system for implementing it |
CN109427096A (zh) * | 2017-08-29 | 2019-03-05 | 深圳市掌网科技股份有限公司 | 一种基于增强现实的自动导览方法和系统 |
CN107547554A (zh) * | 2017-09-08 | 2018-01-05 | 北京枭龙科技有限公司 | 一种基于增强现实的智能设备远程协助系统 |
CN107730008A (zh) * | 2017-09-14 | 2018-02-23 | 珠海格力电器股份有限公司 | 设备售后指导方法、装置、计算机可读存储介质及终端 |
CN107645651A (zh) * | 2017-10-12 | 2018-01-30 | 北京临近空间飞艇技术开发有限公司 | 一种增强现实的远程指导方法和系统 |
CN108269307B (zh) * | 2018-01-15 | 2023-04-07 | 歌尔科技有限公司 | 一种增强现实交互方法及设备 |
-
2019
- 2019-08-19 CN CN201910767469.0A patent/CN112399125B/zh active Active
-
2020
- 2020-07-07 WO PCT/CN2020/100731 patent/WO2021031731A1/zh active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2012128796A (ja) * | 2010-12-17 | 2012-07-05 | Ntt Docomo Inc | サーバ、リモートアシストシステム及び方法 |
WO2014076236A1 (en) * | 2012-11-15 | 2014-05-22 | Steen Svendstorp Iversen | Method of providing a digitally represented visual instruction from a specialist to a user in need of said visual instruction, and a system therefor |
WO2018222756A1 (en) * | 2017-05-30 | 2018-12-06 | Ptc Inc. | Object initiated communication |
CN107395671A (zh) * | 2017-06-12 | 2017-11-24 | 深圳增强现实技术有限公司 | 远程协助方法、系统及增强现实终端 |
CN108063825A (zh) * | 2017-12-26 | 2018-05-22 | 三星电子(中国)研发中心 | 一种远程协助方法 |
CN108632379A (zh) * | 2018-05-11 | 2018-10-09 | 厦门柏讯信息科技有限公司 | 一种基于全景vr直播的汽车远程诊断与维修系统 |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113392394A (zh) * | 2021-07-01 | 2021-09-14 | 西安交通大学 | 一种智能设备协助使用方法、系统、终端及存储介质 |
CN113885700A (zh) * | 2021-09-03 | 2022-01-04 | 广东虚拟现实科技有限公司 | 远程协助方法及装置 |
CN114070834A (zh) * | 2021-10-26 | 2022-02-18 | 深圳市商汤科技有限公司 | 一种远程协助方法、装置及其相关设备和存储介质 |
CN114040377A (zh) * | 2021-11-15 | 2022-02-11 | 青岛海尔科技有限公司 | 操作任务的执行方法和装置、存储介质及电子装置 |
CN114040377B (zh) * | 2021-11-15 | 2024-02-23 | 青岛海尔科技有限公司 | 操作任务的执行方法和装置、存储介质及电子装置 |
Also Published As
Publication number | Publication date |
---|---|
CN112399125B (zh) | 2022-06-10 |
CN112399125A (zh) | 2021-02-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021031731A1 (zh) | 远程协助方法、装置和系统 | |
WO2022083383A1 (zh) | 图像处理方法、装置、电子设备及计算机可读存储介质 | |
US11164571B2 (en) | Content recognizing method and apparatus, device, and computer storage medium | |
US10346514B2 (en) | Method of displaying widget for extended service, and device for performing the method | |
CN103079092B (zh) | 在视频中获取人物信息的方法和装置 | |
TW201821947A (zh) | 基於擴增實境的虛擬對象分配方法及裝置 | |
WO2017181598A1 (zh) | 视频播放方法及装置 | |
CN107256509B (zh) | 比价方法及装置、终端、服务器及存储介质 | |
WO2017156983A1 (zh) | 一种列表的调用方法及装置 | |
US20150085146A1 (en) | Method and system for storing contact information in an image using a mobile device | |
CN114078118A (zh) | 缺陷检测方法及装置、电子设备和存储介质 | |
US20130332834A1 (en) | Annotation and/or recommendation of video content method and apparatus | |
WO2015043547A1 (en) | A method, device and system for message response cross-reference to related applications | |
US20220375460A1 (en) | Method and apparatus for generating interaction record, and device and medium | |
US11556605B2 (en) | Search method, device and storage medium | |
CN112306607A (zh) | 截图方法和装置、电子设备和可读存储介质 | |
US20220391058A1 (en) | Interaction information processing method and apparatus, electronic device and storage medium | |
TW202117557A (zh) | 內容展示方法、裝置及電子設備 | |
CN113347306B (zh) | 业务名称显示方法、装置、电子设备及存储介质 | |
CN114066856A (zh) | 模型训练方法及装置、电子设备和存储介质 | |
CN112764838A (zh) | 目标内容的显示方法和装置及电子设备 | |
WO2018068517A1 (zh) | 滚动球控制截屏的方法及相关的智能设备 | |
US20130340018A1 (en) | Personalized video content consumption using shared video device and personal device | |
CN113835594A (zh) | 交互方法和装置、电子设备以及可读存储介质 | |
TWI759004B (zh) | 一種目標對象顯示方法、電子設備和電腦可讀儲存介質 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20853889 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20853889 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20853889 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 07-10-2022) |